CuGenDBv2

CmoCh14G005120 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh14G005120
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Filamentous hemagglutinin
Genome location: Cmo_Chr14:2530231..2535578
RNA-Seq Expression: CmoCh14G005120
Synteny: CmoCh14G005120
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
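
The genome location above is a `seqid:start..end` string. As a minimal sketch, it can be split into its parts and turned into a span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers, and the code assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'Cmo_Chr14:2530231..2535578'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("Cmo_Chr14:2530231..2535578")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 5,348 bp on Cmo_Chr14.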


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6580880.1 hypothetical protein SDJN03_20882, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.9e-242 | % identity: 89.74
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT

Query:  VFGKVK---------------------------------------------QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPG
        VFGK +                                             QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPG
Subjt:  VFGKVK---------------------------------------------QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPG

Query:  FRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLV
        FRYGYKGLS KVRKRSHLGSI SPSSPP SPYLRVGLPAPVTVSISASSPL GVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLV
Subjt:  FRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLV

XP_022934949.1 uncharacterized protein LOC111441963 [Cucurbita moschata] | e-value: 7.6e-247 | % identity: 87.52
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT

Query:  VFGKVK-----------------------------------------------------------------QDATYSPSPGTEEHKYAPKNGISSAPEAG
        VFGKVK                                                                 QDATYSPSPGTEEHKYAPKNGISSAPEAG
Subjt:  VFGKVK-----------------------------------------------------------------QDATYSPSPGTEEHKYAPKNGISSAPEAG

Query:  SSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFS
        SSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFS
Subjt:  SSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFS

Query:  VGVRVHTIRWTLGLFLVVWHV
        VGVRVHTIRWTLGLFLVVWHV
Subjt:  VGVRVHTIRWTLGLFLVVWHV

XP_022983746.1 uncharacterized protein LOC111482272 isoform X1 [Cucurbita maxima] | e-value: 2.9e-238 | % identity: 81.31
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT

Query:  VFGKVKQ---------------------------------------------------------------------------------------------
        VFGKVKQ                                                                                             
Subjt:  VFGKVKQ---------------------------------------------------------------------------------------------

Query:  --DATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSP
           A YSPSPGTEEHK+APKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLS KVRKRSHLGSI SPSSPPSSPYLRVGLPAPVTVSISASSP
Subjt:  --DATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSP

Query:  LQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLVVWHV
        L GVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTL LFLVVWHV
Subjt:  LQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLVVWHV

XP_022983747.1 uncharacterized protein LOC111482272 isoform X2 [Cucurbita maxima] | e-value: 1.6e-241 | % identity: 84.88
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT

Query:  VFGKVK-------------------------------------------------------------------------QDATYSPSPGTEEHKYAPKNG
        VFGKVK                                                                         QDA YSPSPGTEEHK+APKNG
Subjt:  VFGKVK-------------------------------------------------------------------------QDATYSPSPGTEEHKYAPKNG

Query:  ISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSV
        ISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLS KVRKRSHLGSI SPSSPPSSPYLRVGLPAPVTVSISASSPL GVALSNVQPPEKGDRSAPSV
Subjt:  ISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSV

Query:  LPPQFSFSVGVRVHTIRWTLGLFLVVWHV
        LPPQFSFSVGVRVHTIRWTL LFLVVWHV
Subjt:  LPPQFSFSVGVRVHTIRWTLGLFLVVWHV

XP_023528289.1 uncharacterized protein LOC111791252 [Cucurbita pepo subsp. pepo] | e-value: 6.9e-240 | % identity: 85.09
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSA+FWLPPFLSYGDWPDQAAD TYRDHEIVACFRARKPVPFLKN
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSI+QIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT

Query:  VFGKVK-------------------------------------------------------------------QDATYSPSPGTEEHKYAPKNGISSAPE
        VFGKVK                                                                   QDA YSPSP TEEHK+APKNGISSAPE
Subjt:  VFGKVK-------------------------------------------------------------------QDATYSPSPGTEEHKYAPKNGISSAPE

Query:  AGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFS
        AGSSPVESPASKKRNY+ATPPGFRYGYKGLS KVRKRSHLGSI SPSSPPSSPYLRVGLPAPVTVSISASSPL GVALSNVQPPEKGDRSAPSVLPPQFS
Subjt:  AGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFS

Query:  FSVGVRVHTIRWTLGLFLVVWHV
        FSVGVRVHTIRWTL LFLVVWHV
Subjt:  FSVGVRVHTIRWTLGLFLVVWHV

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LHD1 Uncharacterized protein | e-value: 8.6e-204 | % identity: 75.34
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVG SSSELSD  V++RCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYG+WPD+  DS YRDH+IVA F A KPVPFL+ 
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIP+P VKVA+LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLN SLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTN--SSNQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGLRLS YENLYVSLSNERGST+ APT+VQSSVLMAIGTN  SS QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTN--SSNQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVKQ----------------------------------------------DATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEAT
        NTVFGKVKQ                                              DA YSPSPGTEEHK+APKNG+SSAPEAGSSP+E P S+KRNYEAT
Subjt:  NTVFGKVKQ----------------------------------------------DATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEAT

Query:  PPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKG-------DRSAPSVLPPQFSFSVGVRVHTIRW
        PP FRYGYK    K+RK  +LG I SPSS PSSPYLRVG PAPV+ SISASSPL GV LSNVQPP  G       +RS+PSVLPPQFS + GVRV+TI+W
Subjt:  PPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKG-------DRSAPSVLPPQFSFSVGVRVHTIRW

Query:  TLGLFLVVWHV
        TL LFL++WHV
Subjt:  TLGLFLVVWHV

A0A1S3B8E9 uncharacterized protein LOC103487165 | e-value: 4.7e-202 | % identity: 71.14
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVGVSSSELSD  V++RCGGGGC  IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYG+WPD+  DS YRDH+IVA F A KPVPFL+N
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIP+P VKVA+LSLQSL G NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLN SLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTN--SSNQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGLRLS YENLYVSLSNERGSTM APT+VQSSVLMAIGTN  SS QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTN--SSNQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVK-------------------------------------------------------------------------------QDATYSPSPGTEE
        NTVFGKVK                                                                               Q A YSPSPGTEE
Subjt:  NTVFGKVK-------------------------------------------------------------------------------QDATYSPSPGTEE

Query:  HKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEK
        HK+APKNG+SSAPEAGSSP+E P S+KRNYEATPP FRYGYK  S K+RK+ HLG I SPSS P SPYLRVGLPAPV+ SISASSPL GV LSNVQPP  
Subjt:  HKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEK

Query:  G-------DRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLVVWHV
        G       +RS+PSVLPPQFS +  VRV+TI+WTL LFL+VWHV
Subjt:  G-------DRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLVVWHV

A0A6J1F409 uncharacterized protein LOC111441963 | e-value: 3.7e-247 | % identity: 87.52
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT

Query:  VFGKVK-----------------------------------------------------------------QDATYSPSPGTEEHKYAPKNGISSAPEAG
        VFGKVK                                                                 QDATYSPSPGTEEHKYAPKNGISSAPEAG
Subjt:  VFGKVK-----------------------------------------------------------------QDATYSPSPGTEEHKYAPKNGISSAPEAG

Query:  SSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFS
        SSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFS
Subjt:  SSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFS

Query:  VGVRVHTIRWTLGLFLVVWHV
        VGVRVHTIRWTLGLFLVVWHV
Subjt:  VGVRVHTIRWTLGLFLVVWHV

A0A6J1J074 uncharacterized protein LOC111482272 isoform X2 | e-value: 7.9e-242 | % identity: 84.88
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT

Query:  VFGKVK-------------------------------------------------------------------------QDATYSPSPGTEEHKYAPKNG
        VFGKVK                                                                         QDA YSPSPGTEEHK+APKNG
Subjt:  VFGKVK-------------------------------------------------------------------------QDATYSPSPGTEEHKYAPKNG

Query:  ISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSV
        ISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLS KVRKRSHLGSI SPSSPPSSPYLRVGLPAPVTVSISASSPL GVALSNVQPPEKGDRSAPSV
Subjt:  ISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSV

Query:  LPPQFSFSVGVRVHTIRWTLGLFLVVWHV
        LPPQFSFSVGVRVHTIRWTL LFLVVWHV
Subjt:  LPPQFSFSVGVRVHTIRWTLGLFLVVWHV

A0A6J1J390 uncharacterized protein LOC111482272 isoform X1 | e-value: 1.4e-238 | % identity: 81.31
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT

Query:  VFGKVKQ---------------------------------------------------------------------------------------------
        VFGKVKQ                                                                                             
Subjt:  VFGKVKQ---------------------------------------------------------------------------------------------

Query:  --DATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSP
           A YSPSPGTEEHK+APKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLS KVRKRSHLGSI SPSSPPSSPYLRVGLPAPVTVSISASSP
Subjt:  --DATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSP

Query:  LQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLVVWHV
        L GVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTL LFLVVWHV
Subjt:  LQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLVVWHV

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2) | e-value: 2.8e-37 | % identity: 36.19
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGC-FAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYR-DHEIVACFRARKPVPFL
        M K  +E  L +   + +L +     R  G  C  A  RL+ +RC+  L+LS A+ LSAIFWL P  S  ++    AD T + +  + A FR +KPV  +
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGC-FAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYR-DHEIVACFRARKPVPFL

Query:  KNHIFELEDNIFGEIPVP-FVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIP
          H  ++E +I   I +    KV VLSL   G SN TD+ F+V P     +I   S SL++ +F  L      L+L  S FG  + F+VLKFPGGIT+ P
Subjt:  KNHIFELEDNIFGEIPVP-FVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIP

Query:  PQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGL
         + A +   A + F+ T+  SI  +Q   D L       L L  YE+++  L+N++GST+  P   Q  V   +     +QRL    Q I  S + NLGL
Subjt:  PQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGL

Query:  NNTVFGKVKQDATYS
        +  VFG+VK D T+S
Subjt:  NNTVFGKVKQDATYS

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein | e-value: 6.9e-81 | % identity: 41.12
Query:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN
        MGK+E++  L V    +        +RC  G C  I   +  +C+F LLLS A+FLSA+F L PF    D  D   D  +R H IVA F   +   FL  
Subjt:  MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKN

Query:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS
        +  +L+++IF E+    +KV +L+++     N+T ++F +DPD  Y +I P S S IKE FE+++IN   L+L  SLFG T LFEVLKFPGGIT+IPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
        AF LQ  +I FNFTLNYSI+QIQ+NF+ L SQL++GL L+ YENLYVSLSN  GST+  PT V SSVL+ +GT++S+ RLKQL  TIT S S NLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT

Query:  VFGKVKQ------------DATYSPSPGT---------------------EEHKYAPKNGISSAPEAGSSPVESPA---SKKRNYEATP---PGFRYGYK
        +FGKVKQ             +T SPSP                         H +   + +S       SPV SPA   S+KR   A P   PG R  +K
Subjt:  VFGKVKQ------------DATYSPSPGT---------------------EEHKYAPKNGISSAPEAGSSPVESPA---SKKRNYEATP---PGFRYGYK

Query:  GLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVS----ISASSPLQGVALSN-VQPP--EKGDRSAPSVL--PPQFSFSVGVRVHTIRWTLGLFLV
               KR    S  +P+    +P+ ++  PAP++ +    +  S+PL  V  ++  QPP  E  +  A  V    PQ S S    +  + W + L L+
Subjt:  GLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVS----ISASSPLQGVALSN-VQPP--EKGDRSAPSVL--PPQFSFSVGVRVHTIRWTLGLFLV

Query:  V
        V
Subjt:  V

AT3G56590.1 hydroxyproline-rich glycoprotein family protein | e-value: 4.5e-88 | % identity: 46.17
Query:  MGKSE-EEQPLPVGVSSSELSDWTVQSRCGGGG-------CFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRAR
        MGK+  EEQ LPV       SD    +R  GGG       C  I    ++RCV  L  SAAVFLSA+FWLPPFL + D  D   D  ++DH IVA F   
Subjt:  MGKSE-EEQPLPVGVSSSELSDWTVQSRCGGGG-------CFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRAR

Query:  KPVPFLKNHIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGG
        KP+ F+++++ +LE++I  EI  P  KV VL+L+ LG  N T +IF++DP+ + SKIP   +SLIK  FETLV      RL  SLFG    FEVLKFPGG
Subjt:  KPVPFLKNHIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGG

Query:  ITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHS
        IT+IPPQ  F LQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G+ L+ YENLY++LSN RGST+  PTIV SSVL+  G++S   RLKQLAQTIT+SHS
Subjt:  ITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHS

Query:  GNLGLNNTVFGKVKQ-------------DATYSPSPGTEEHKY--------------APKNGISSAPEAGSSPVESPASKKRNYEATP--PGFRYGYKGL
         NLGLN+TVFGKVKQ              +T SPSP  E H+Y              AP+  + S P  G +P  +P          P  P  +   KG 
Subjt:  GNLGLNNTVFGKVKQ-------------DATYSPSPGTEEHKY--------------APKNGISSAPEAGSSPVESPASKKRNYEATP--PGFRYGYKGL

Query:  SAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVT-VSISASSPLQGVALSNVQPPEK
        SA     +H  +  +P+   S P+     PAP    +I  SSPL  V  +++ PP K
Subjt:  SAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVT-VSISASSPLQGVALSNVQPPEK

AT3G56590.2 hydroxyproline-rich glycoprotein family protein | e-value: 4.1e-89 | % identity: 45.02
Query:  MGKSE-EEQPLPVGVSSSELSDWTVQSRCGGGG-------CFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRAR
        MGK+  EEQ LPV       SD    +R  GGG       C  I    ++RCV  L  SAAVFLSA+FWLPPFL + D  D   D  ++DH IVA F   
Subjt:  MGKSE-EEQPLPVGVSSSELSDWTVQSRCGGGG-------CFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRAR

Query:  KPVPFLKNHIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGG
        KP+ F+++++ +LE++I  EI  P  KV VL+L+ LG  N T +IF++DP+ + SKIP   +SLIK  FETLV      RL  SLFG    FEVLKFPGG
Subjt:  KPVPFLKNHIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGG

Query:  ITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHS
        IT+IPPQ  F LQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G+ L+ YENLY++LSN RGST+  PTIV SSVL+  G++S   RLKQLAQTIT+SHS
Subjt:  ITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHS

Query:  GNLGLNNTVFGKVKQ-------------DATYSPSPGTEEHKY--------------APKNGISSAPEAGSSPVESPASKKRNYEATP--PGFRYGYKGL
         NLGLN+TVFGKVKQ              +T SPSP  E H+Y              AP+  + S P  G +P  +P          P  P  +   KG 
Subjt:  GNLGLNNTVFGKVKQ-------------DATYSPSPGTEEHKY--------------APKNGISSAPEAGSSPVESPASKKRNYEATP--PGFRYGYKGL

Query:  SAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVT-VSISASSPLQGVALSNVQPPEKGD-------RSAPSVLPPQFSFSVG
        SA     +H  +  +P+   S P+     PAP    +I  SSPL  V  +++ PP K           +PS  P   S S+G
Subjt:  SAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVT-VSISASSPLQGVALSNVQPPEKGD-------RSAPSVLPPQFSFSVG
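
The alignments above use a BLAST-style layout: a query row, a midline, and a subject row. In that layout the midline repeats the residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The sketch below counts identities from a query row and its midline. The function names are hypothetical. It assumes the % identity column is identities divided by alignment columns, which this page does not confirm, so a value computed on a single row is not expected to reproduce the per-hit figures.

```python
def count_identities(query_row: str, midline_row: str) -> int:
    """Count columns where the midline repeats the query residue."""
    if len(query_row) != len(midline_row):
        raise ValueError("query row and midline must have the same length")
    return sum(
        1
        for q, m in zip(query_row, midline_row)
        if q != "-" and m == q  # gaps never count as identities
    )


def percent_identity(query_row: str, midline_row: str) -> float:
    """Identities as a percentage of all alignment columns, gaps included."""
    if not query_row:
        raise ValueError("empty alignment row")
    return 100.0 * count_identities(query_row, midline_row) / len(query_row)


# Final row of the XP_022983746.1 alignment, copied without the 'Query:' label.
query_row = "LQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLVVWHV"
midline_row = "L GVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTL LFLVVWHV"

print(count_identities(query_row, midline_row), "identities in", len(query_row), "columns")
print(round(percent_identity(query_row, midline_row), 2), "% identity for this row")
```

To get a whole-alignment figure, concatenate all query rows and all midline rows of one hit before calling `percent_identity`.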


Sequences
CDS sequence
ATGGGAAAGAGCGAGGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCCGAGCTTTCTGATTGGACTGTGCAGAGTAGATGTGGCGGCGGTGGGTGCTTTGCGATTCG
TAGACTGATTGCTGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCGCCGTTCCTTTCATACGGAGATTGGCCGGATC
AGGCGGCTGATTCTACTTATAGAGATCATGAAATCGTAGCGTGTTTTCGTGCTCGGAAGCCAGTTCCTTTTCTGAAAAACCATATTTTTGAGCTTGAAGACAACATTTTT
GGAGAAATTCCTGTTCCTTTTGTCAAGGTGGCCGTCCTCTCGCTACAATCATTAGGCGGATCGAACGTAACAGATATCATTTTCTCCGTAGATCCTGATGCCAAGTATTC
AAAAATTCCACCAACTTCTCAAAGTTTAATCAAGGAAACGTTTGAAACATTGGTTATAAACGACCCTCCTCTCAGATTGAACGCATCGTTATTCGGCAATACTTCGTTGT
TCGAGGTGTTGAAATTTCCTGGTGGGATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAATTTTACGTTGAATTATTCTATCTAT
CAAATTCAAGTGAATTTCGACGATCTTACCAGCCAGCTGAGGTCGGGATTACGTCTATCTCGTTATGAGAATTTATATGTTAGCCTATCGAACGAACGAGGTTCGACAAT
GCAGGCTCCTACTATTGTTCAGTCGTCTGTTCTGATGGCTATTGGGACGAATTCATCGAATCAAAGACTCAAACAGTTGGCTCAAACCATCACGAATTCTCATTCGGGAA
ATCTTGGCCTGAACAACACTGTTTTTGGCAAGGTCAAGCAGGACGCTACATATTCACCGAGTCCTGGAACAGAGGAGCACAAATATGCACCGAAGAATGGGATCTCATCA
GCTCCTGAAGCTGGTTCATCCCCAGTGGAAAGTCCAGCTTCAAAGAAACGAAACTATGAAGCAACTCCGCCTGGTTTTCGATACGGATATAAAGGGTTGTCAGCAAAAGT
CAGAAAACGATCTCATTTAGGCTCTATTCTGTCTCCAAGCAGTCCTCCATCGTCGCCATACTTACGAGTAGGCCTACCAGCACCGGTCACGGTTTCTATATCTGCTTCAA
GTCCACTGCAAGGGGTAGCTCTATCTAATGTACAGCCTCCAGAAAAAGGCGACAGAAGTGCCCCTTCAGTCTTGCCACCACAATTTTCTTTTTCTGTAGGCGTTCGTGTT
CATACAATTCGATGGACACTCGGGCTGTTTCTTGTTGTATGGCATGTATAA
mRNA sequence
CAGAAGTGGAAGTGATGAAATGGGCATAACCCATCTTCCACGGCTCTTCATTTAAGTGCGCAATTAGTGCCATCAATTCCCTTTTTCAATTTCCTTGTTCCCCCATTTCT
CTGTTTTTGGCGCTTCTTCTCCATTTCTTACCTTTTCCCCTTCTACTGTCTTGTCATCTTCTTCCATTTTCCCTTTCGATTTATAAATTTTGGTGAAGAAAACGGATAGA
TGGATGGTTGATGGATGAGTAATAAACCCCCCTCTCTATATCCACATTTACTTCGAAGACTGTGGCTGTCATTTGTAGTTGTTGCGCCAGTTAAATCAGACCGAATCGCC
CCCAATTTCTCTCTCTCTCTCTCTCTCTCTCTCTCGAACCCTAGATTTTCTTTTTGAATTTGGGTTCTGGGTTTGCGGTGGAGAAGCTCCACGAGGTGGGATTTTGAGCT
TTTGTGGTTCTGGGATCGATTTGCTTTGAGTTTGAGGTGTAATGGGGGGAGATAATTTGGACCCGATTGAGGGAGGTGGCAATGGCGGTTGTTAACCCACTTCTCATGCA
TTGCCTCCATGGGAAAGAGCGAGGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCCGAGCTTTCTGATTGGACTGTGCAGAGTAGATGTGGCGGCGGTGGGTGCTTT
GCGATTCGTAGACTGATTGCTGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCGCCGTTCCTTTCATACGGAGATTG
GCCGGATCAGGCGGCTGATTCTACTTATAGAGATCATGAAATCGTAGCGTGTTTTCGTGCTCGGAAGCCAGTTCCTTTTCTGAAAAACCATATTTTTGAGCTTGAAGACA
ACATTTTTGGAGAAATTCCTGTTCCTTTTGTCAAGGTGGCCGTCCTCTCGCTACAATCATTAGGCGGATCGAACGTAACAGATATCATTTTCTCCGTAGATCCTGATGCC
AAGTATTCAAAAATTCCACCAACTTCTCAAAGTTTAATCAAGGAAACGTTTGAAACATTGGTTATAAACGACCCTCCTCTCAGATTGAACGCATCGTTATTCGGCAATAC
TTCGTTGTTCGAGGTGTTGAAATTTCCTGGTGGGATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAATTTTACGTTGAATTATT
CTATCTATCAAATTCAAGTGAATTTCGACGATCTTACCAGCCAGCTGAGGTCGGGATTACGTCTATCTCGTTATGAGAATTTATATGTTAGCCTATCGAACGAACGAGGT
TCGACAATGCAGGCTCCTACTATTGTTCAGTCGTCTGTTCTGATGGCTATTGGGACGAATTCATCGAATCAAAGACTCAAACAGTTGGCTCAAACCATCACGAATTCTCA
TTCGGGAAATCTTGGCCTGAACAACACTGTTTTTGGCAAGGTCAAGCAGGACGCTACATATTCACCGAGTCCTGGAACAGAGGAGCACAAATATGCACCGAAGAATGGGA
TCTCATCAGCTCCTGAAGCTGGTTCATCCCCAGTGGAAAGTCCAGCTTCAAAGAAACGAAACTATGAAGCAACTCCGCCTGGTTTTCGATACGGATATAAAGGGTTGTCA
GCAAAAGTCAGAAAACGATCTCATTTAGGCTCTATTCTGTCTCCAAGCAGTCCTCCATCGTCGCCATACTTACGAGTAGGCCTACCAGCACCGGTCACGGTTTCTATATC
TGCTTCAAGTCCACTGCAAGGGGTAGCTCTATCTAATGTACAGCCTCCAGAAAAAGGCGACAGAAGTGCCCCTTCAGTCTTGCCACCACAATTTTCTTTTTCTGTAGGCG
TTCGTGTTCATACAATTCGATGGACACTCGGGCTGTTTCTTGTTGTATGGCATGTATAACCAAGGAGATAGAACCTACATGCGTACTTCTGGGTAACAACAGGACTCGTA
ATCGATATCAGAGTTGTGATAGCGATAGCGAGACCGACGCAAAGGCGTGCTCCTGCTAGAGTTGATATTAATGTAAATATGAGATGAAGCAAGTTATTAGGAGATGCATT
TTTCCAGGTCAAAGTCACAGAGGTGGCAGGCCTTGTGTTGTTTTTCTTCTGCAGAAAATGTAAAGTAGAGAAGAAATCAGCAAATGGATCTTGTTCTCAACTTCTTAATC
AACCAATCACAATTTTTCACCCCCGTTTTTTTAATTTTAATATCCCCGTCTCATATTGTCGGTCTCTTTGATCCTTTGGCAACGTTCATCTGTGTCAACGTTCATCCTGT
Protein sequence
MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIF
GEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIY
QIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNTVFGKVKQDATYSPSPGTEEHKYAPKNGISS
APEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRV
HTIRWTLGLFLVVWHV
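
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is one way to do that. The `translate` function is a hypothetical helper, the choice of the standard code (NCBI translation table 1) is an assumption for this nuclear gene, and only the first and last codons of the CDS are used as sample input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons and last four codons of the CDS shown above.
cds_start = "ATGGGAAAGAGCGAGGAAGAACAGCCGCTG"
cds_end = "TGGCATGTATAA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS, pasted with its line breaks, to `translate` should give a string that can be compared directly with the protein sequence listed above.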