; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019588 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019588
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionglutelin type-B 2-like
Genome locationchr5:43581924..43583325
RNA-Seq ExpressionLag0019588
SyntenyLag0019588
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99759.1 glutelin type-A 2-like [Cucumis melo var. makuwa]1.2e-15982.2Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN
        ME M+PKPFFEG+GGSY KWLPSDYPLLAQTNVA GRLLLRPRGFAVPHYAD SK GYVLQG++GV GFVFPNK NEVV+KLKKGDLIPV +G+TSWWFN
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG
        DGDSDLEIIFLGETK+AH+PGDITYF+LSG   LLQGF+PEY+ K+YSL +EET   LKSQ N+LIFTVQ +QSLPKPH+HSKLVYNIDAA PD RAK+G
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG

Query:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT
         AAVT VTES FPFIGQ+GLT++LEKLDANAIRSPVY+AEPSDQLIY+ KGSGKIQ+VG SS  DA+VK+GQLILVP+YFAVGK+AGEEGLECIS+I AT
Subjt:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT

Query:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFR
        HP+VEELAGK SVLEALS EVFQVSFNVTAEFEKLFR
Subjt:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFR

XP_004151504.1 legumin J [Cucumis sativus]8.1e-16182.89Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN
        ME M+PKPFFEG+GGSYHKWLPSDYPLLAQTNVA GRLLLRPRGFAVPHY+D SK GYVLQG++GV GFVFP K NEVV+KLKKGDLIPV  GVTSWWFN
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG
        DGDSDLEIIFLGETK AH+PGDITYF+LSG   LLQGF+PEY+ K+ SL +EET T LKSQPN+LIFTVQ +QSLPKPH++SKLVYNIDAAAPD RAK+G
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG

Query:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT
         AAVT VTES FPFIGQ+GLT +LEKLDANAIRSPVY+AEPSDQLIY+ KGSGKIQ+VG SS  DA+VK GQLILVP+YFAVGKIAGEEGLECIS+I AT
Subjt:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT

Query:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK
        HP+VEELAGK SVLEALS EVFQVSFNVTAEFEKLFRSK
Subjt:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK

XP_008456076.1 PREDICTED: glutelin type-A 2-like [Cucumis melo]1.1e-16082.3Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN
        ME M+PKPFFEG+GGSY KWLPSDYPLLAQTNVA GRLLLRPRGFAVPHYAD SK GYVLQG++GV GFVFPNK NEVV+KLKKGDLIPV +G+TSWWFN
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG
        DGDSDLEIIFLGETK+AH+PGDITYF+LSG   LLQGF+PEY+ K+YSL +EET   LKSQ N+LIFTVQ +QSLPKPH+HSKLVYNIDAA PD RAK+G
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG

Query:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT
         AAVT VTES FPFIGQ+GLT++LEKLDANAIRSPVY+AEPSDQLIY+ KGSGKIQ+VG SS  DA+VK+GQLILVP+YFAVGK+AGEEGLECIS+I AT
Subjt:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT

Query:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK
        HP+VEELAGK SVLEALS EVFQVSFNVTAEFEKLFRSK
Subjt:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK

XP_022922755.1 legumin J-like [Cucurbita moschata]2.4e-15782.84Show/hide
Query:  EPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFND
        +PM+PKPF E + GSYHKWLPS+YPLLAQ  VAAGRLLLRPRGF VPHYAD SKVGYVLQG+NGVAG VFP+KS+EVVV LKKGDLIPV NGV+SWWFND
Subjt:  EPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFND

Query:  GDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLGT
        GDSDLEIIFLGE+K+AH+PGDI+YF+LSG  SLL GFSPEY+GKTYSL  EETT  LKSQ N LIF++QQTQSLPKP ++SK VYNIDAAAPD R K G 
Subjt:  GDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLGT

Query:  AAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATH
         AVTTVTESKFPFIGQSGLT+ILEKL+ANA+RSPVYVAEP DQLIY+AKG GKIQIVG SS IDAEVKMGQLILVPK+FAVGKIAGE+GLECISIITATH
Subjt:  AAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATH

Query:  PLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK
        P+VEELAGK SVLEALSPE+FQVSFNVTAEFEKL RSK
Subjt:  PLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK

XP_022985328.1 12S seed storage protein CRD-like [Cucurbita maxima]4.2e-15782.84Show/hide
Query:  EPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFND
        +PM+PKPF E + GSYHKWLPS+YPLLA   VAAGRLLLRPRGF VPHYAD SKVGYVLQG+NGVAG VFP+KS+EVVV LKKGDLIPV NGV+SWWFND
Subjt:  EPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFND

Query:  GDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLGT
        GDSDLEIIFLGE+K+AH+PGDI+YF+LSG  SLL GFSPEY+G+TYSL  EETT  LKSQ N LIF++QQTQSLPKP ++SK VYNIDAAAPD R K G 
Subjt:  GDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLGT

Query:  AAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATH
         AVTTVTESKFPFIGQSGLT+ILEKLDANA+RSPVYVAEP DQLIY+AKG GKIQIVG SS IDAEVKMGQLILVPK+FAVGKIAGE+GLECISIITATH
Subjt:  AAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATH

Query:  PLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK
        P+VEELAGK SVLEALSPEVFQVSFNVTAEFEKL RSK
Subjt:  PLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK

TrEMBL top hitse value%identityAlignment
A0A0A0L6K0 Uncharacterized protein3.9e-16182.89Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN
        ME M+PKPFFEG+GGSYHKWLPSDYPLLAQTNVA GRLLLRPRGFAVPHY+D SK GYVLQG++GV GFVFP K NEVV+KLKKGDLIPV  GVTSWWFN
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG
        DGDSDLEIIFLGETK AH+PGDITYF+LSG   LLQGF+PEY+ K+ SL +EET T LKSQPN+LIFTVQ +QSLPKPH++SKLVYNIDAAAPD RAK+G
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG

Query:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT
         AAVT VTES FPFIGQ+GLT +LEKLDANAIRSPVY+AEPSDQLIY+ KGSGKIQ+VG SS  DA+VK GQLILVP+YFAVGKIAGEEGLECIS+I AT
Subjt:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT

Query:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK
        HP+VEELAGK SVLEALS EVFQVSFNVTAEFEKLFRSK
Subjt:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK

A0A1S3C2D5 glutelin type-A 2-like5.1e-16182.3Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN
        ME M+PKPFFEG+GGSY KWLPSDYPLLAQTNVA GRLLLRPRGFAVPHYAD SK GYVLQG++GV GFVFPNK NEVV+KLKKGDLIPV +G+TSWWFN
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG
        DGDSDLEIIFLGETK+AH+PGDITYF+LSG   LLQGF+PEY+ K+YSL +EET   LKSQ N+LIFTVQ +QSLPKPH+HSKLVYNIDAA PD RAK+G
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG

Query:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT
         AAVT VTES FPFIGQ+GLT++LEKLDANAIRSPVY+AEPSDQLIY+ KGSGKIQ+VG SS  DA+VK+GQLILVP+YFAVGK+AGEEGLECIS+I AT
Subjt:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT

Query:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK
        HP+VEELAGK SVLEALS EVFQVSFNVTAEFEKLFRSK
Subjt:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK

A0A5A7T7U8 Glutelin type-A 2-like5.1e-16182.3Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN
        ME M+PKPFFEG+GGSY KWLPSDYPLLAQTNVA GRLLLRPRGFAVPHYAD SK GYVLQG++GV GFVFPNK NEVV+KLKKGDLIPV +G+TSWWFN
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG
        DGDSDLEIIFLGETK+AH+PGDITYF+LSG   LLQGF+PEY+ K+YSL +EET   LKSQ N+LIFTVQ +QSLPKPH+HSKLVYNIDAA PD RAK+G
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG

Query:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT
         AAVT VTES FPFIGQ+GLT++LEKLDANAIRSPVY+AEPSDQLIY+ KGSGKIQ+VG SS  DA+VK+GQLILVP+YFAVGK+AGEEGLECIS+I AT
Subjt:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT

Query:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK
        HP+VEELAGK SVLEALS EVFQVSFNVTAEFEKLFRSK
Subjt:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK

A0A5D3BLA4 Glutelin type-A 2-like5.7e-16082.2Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN
        ME M+PKPFFEG+GGSY KWLPSDYPLLAQTNVA GRLLLRPRGFAVPHYAD SK GYVLQG++GV GFVFPNK NEVV+KLKKGDLIPV +G+TSWWFN
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG
        DGDSDLEIIFLGETK+AH+PGDITYF+LSG   LLQGF+PEY+ K+YSL +EET   LKSQ N+LIFTVQ +QSLPKPH+HSKLVYNIDAA PD RAK+G
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLG

Query:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT
         AAVT VTES FPFIGQ+GLT++LEKLDANAIRSPVY+AEPSDQLIY+ KGSGKIQ+VG SS  DA+VK+GQLILVP+YFAVGK+AGEEGLECIS+I AT
Subjt:  TAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITAT

Query:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFR
        HP+VEELAGK SVLEALS EVFQVSFNVTAEFEKLFR
Subjt:  HPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFR

A0A6J1E9P2 legumin J-like1.2e-15782.84Show/hide
Query:  EPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFND
        +PM+PKPF E + GSYHKWLPS+YPLLAQ  VAAGRLLLRPRGF VPHYAD SKVGYVLQG+NGVAG VFP+KS+EVVV LKKGDLIPV NGV+SWWFND
Subjt:  EPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFND

Query:  GDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLGT
        GDSDLEIIFLGE+K+AH+PGDI+YF+LSG  SLL GFSPEY+GKTYSL  EETT  LKSQ N LIF++QQTQSLPKP ++SK VYNIDAAAPD R K G 
Subjt:  GDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLGT

Query:  AAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATH
         AVTTVTESKFPFIGQSGLT+ILEKL+ANA+RSPVYVAEP DQLIY+AKG GKIQIVG SS IDAEVKMGQLILVPK+FAVGKIAGE+GLECISIITATH
Subjt:  AAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATH

Query:  PLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK
        P+VEELAGK SVLEALSPE+FQVSFNVTAEFEKL RSK
Subjt:  PLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK

SwissProt top hitse value%identityAlignment
P07728 Glutelin type-A 13.9e-1721.13Show/hide
Query:  TNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP----------------------------NKSNEVVVKLKKGDLIPVQNGVTSWWFNDG
        T V+  R ++ PRG  +PHY + + + Y++QG  G+ G  FP                               ++ + + ++GD+I +  GV  W +NDG
Subjt:  TNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP----------------------------NKSNEVVVKLKKGDLIPVQNGVTSWWFNDG

Query:  DSDLEIIFLGETKSAHIPGDITY--FLLSG---------------AGSLLQGFSPEYIGKTYSLKEEETTTL-LKSQPNILIFTVQQTQSLPKPH-----
        +  +  I++ +  +     D     FLL+G               + ++  GFS E + +   +  +    L  ++     I  V+   SL +P+     
Subjt:  DSDLEIIFLGETKSAHIPGDITY--FLLSG---------------AGSLLQGFSPEYIGKTYSLKEEETTTL-LKSQPNILIFTVQQTQSLPKPH-----

Query:  ---------------QHSKLVY-------------------NIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQL
                       Q+ +  Y                   NID              VT +    FP +    ++++   L  NA+ SP +    +  +
Subjt:  ---------------QHSKLVY-------------------NIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQL

Query:  IYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAE
        +YI +G  ++Q+V  +     + E++ GQL+++P+++AV K A  EG   I+  T  + +V  +AGK S+  AL  +V   ++ ++ E
Subjt:  IYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAE

P07730 Glutelin type-A 21.8e-1720.62Show/hide
Query:  TNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP----------------------------NKSNEVVVKLKKGDLIPVQNGVTSWWFNDG
        T V+  R ++ PRG  +PHY + + + Y++QG  G+ G  FP                               ++ + + ++GD+I +  GV  W +NDG
Subjt:  TNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP----------------------------NKSNEVVVKLKKGDLIPVQNGVTSWWFNDG

Query:  DSDLEIIFLGETKSAHIPGDITY--FLLSG---------------AGSLLQGFSPEYIGKTYSLKEEETTTLLKS-----------------QPNILIFT
        +  +  I++ +  +     D     FLL+G               + ++  GFS E + + + +  +    L                    QP   +  
Subjt:  DSDLEIIFLGETKSAHIPGDITY--FLLSG---------------AGSLLQGFSPEYIGKTYSLKEEETTTLLKS-----------------QPNILIFT

Query:  VQQTQSLPKPHQHS-----------------------KLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQL
         +Q Q   + H                          ++  NID              VT +    FP +    ++++   L  NA+ SP +    +  +
Subjt:  VQQTQSLPKPHQHS-----------------------KLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQL

Query:  IYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAE
        +YI +G  ++Q+V  +     + E++ GQL++VP+++ V K A  EG   I+  T  + +V  +AGK S+  AL  +V   ++ ++ E
Subjt:  IYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAE

P11828 Glycinin G34.2e-1921.9Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVF------------------PNKSNEVVVKL
        +  + P    E +GG    W P++ P      VA  R  L       P Y ++ +  Y+ QG +G+ G +F                  P   ++ +   
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVF------------------PNKSNEVVVKL

Query:  KKGDLIPVQNGVTSWWFNDGDSDLEIIFLGETKSAHIPGD--ITYFLLSG--------------------------------AGSLLQGFSPEYIGKTYS
        ++GDLI V  G   W +N+ D+ +  + L +T S     D     F L+G                                 GS+L GF+PE++   + 
Subjt:  KKGDLIPVQNGVTSWWFNDGDSDLEIIFLGETKSAHIPGD--ITYFLLSG--------------------------------AGSLLQGFSPEYIGKTYS

Query:  L------------KEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSK--------------------------LVYNIDAAAPDTRAKLGTAAVTTVTES
        +            +EEE   ++  +  + + +    +   +P +  K                          L +NI   +          ++TT T  
Subjt:  L------------KEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSK--------------------------LVYNIDAAAPDTRAKLGTAAVTTVTES

Query:  KFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELA
         FP +    L++    L  NA+  P Y    ++ +IY   G   +Q+V  +     D E++ GQ+++VP+ FAV   +  +  E +S  T   P +  LA
Subjt:  KFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELA

Query:  GKKSVLEALSPEVFQVSFNV
        G  S+L AL  EV Q +FN+
Subjt:  GKKSVLEALSPEVFQVSFNV

P15456 12S seed storage protein CRB8.8e-1720.75Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP------------------------NKSN
        +  + P    + +GG    W     P L  +  A  R ++ P+G  +P + ++ K+ +V+ G  G+ G V P                           +
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP------------------------NKSN

Query:  EVVVKLKKGDLIPVQNGVTSWWFNDGDSDLEIIFLGETKS--AHIPGDITYFLLSG----------------AGSLLQGFSPEYIGKTYSLKEEETTTLL
        + V  L+ GD I   +GV  W++N+G+  L ++   +  S    +  ++  FL++G                  ++  GF+PE + + + +  E    L 
Subjt:  EVVVKLKKGDLIPVQNGVTSWWFNDGDSDLEIIFLGETKS--AHIPGDITYFLLSG----------------AGSLLQGFSPEYIGKTYSLKEEETTTLL

Query:  KSQPN------------ILIFTVQQTQSLPKPHQHS----------KLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPV
          Q N            ++   +++ +   +PH+ +          +   N+D  +     K     ++T+     P +    L+++   +  NA+  P 
Subjt:  KSQPN------------ILIFTVQQTQSLPKPHQHS----------KLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPV

Query:  YVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEK
        +    ++  +Y+  G   IQ+V  +     D E+  GQL++VP+ F+V K A  E  E I   T  +  V  LAG+ SV+  L  EV    + ++ E  K
Subjt:  YVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEK

Q9ZWA9 12S seed storage protein CRD1.0e-1724.05Show/hide
Query:  PLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFV---FPNKSNEV----------------------VVKLKKGDLIPVQNGVTSWWFN
        P L    V   R+ L+P    +P +     + YV+QG+ GV G +    P    EV                      +   ++GD+     GV+ WW+N
Subjt:  PLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFV---FPNKSNEV----------------------VVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQ----------------GFSPEYIGKTYSLKEEETTTLLKSQPN---------ILIFTV------
         GDSD  I+ + +  +     D    +   AGS  Q                GF P  I + + +  E    L   + N          L F +      
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQ----------------GFSPEYIGKTYSLKEEETTTLLKSQPN---------ILIFTV------

Query:  QQ---TQSLPKPHQHSKLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGIS--SNI
        QQ      + + +  +K+  NID              ++T+     P +    L ++   L +  +  P + A  +  ++Y+  G  KIQ+V  +  S  
Subjt:  QQ---TQSLPKPHQHSKLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGIS--SNI

Query:  DAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEK
        + +V  GQ+I++P+ FAV K AGE G E IS  T  +  +  L+G+ S L A+  +V + S+ V  E  K
Subjt:  DAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEK

Arabidopsis top hitse value%identityAlignment
AT1G03880.1 cruciferin 26.2e-1820.75Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP------------------------NKSN
        +  + P    + +GG    W     P L  +  A  R ++ P+G  +P + ++ K+ +V+ G  G+ G V P                           +
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP------------------------NKSN

Query:  EVVVKLKKGDLIPVQNGVTSWWFNDGDSDLEIIFLGETKS--AHIPGDITYFLLSG----------------AGSLLQGFSPEYIGKTYSLKEEETTTLL
        + V  L+ GD I   +GV  W++N+G+  L ++   +  S    +  ++  FL++G                  ++  GF+PE + + + +  E    L 
Subjt:  EVVVKLKKGDLIPVQNGVTSWWFNDGDSDLEIIFLGETKS--AHIPGDITYFLLSG----------------AGSLLQGFSPEYIGKTYSLKEEETTTLL

Query:  KSQPN------------ILIFTVQQTQSLPKPHQHS----------KLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPV
          Q N            ++   +++ +   +PH+ +          +   N+D  +     K     ++T+     P +    L+++   +  NA+  P 
Subjt:  KSQPN------------ILIFTVQQTQSLPKPHQHS----------KLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPV

Query:  YVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEK
        +    ++  +Y+  G   IQ+V  +     D E+  GQL++VP+ F+V K A  E  E I   T  +  V  LAG+ SV+  L  EV    + ++ E  K
Subjt:  YVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEK

AT1G03890.1 RmlC-like cupins superfamily protein7.4e-1924.05Show/hide
Query:  PLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFV---FPNKSNEV----------------------VVKLKKGDLIPVQNGVTSWWFN
        P L    V   R+ L+P    +P +     + YV+QG+ GV G +    P    EV                      +   ++GD+     GV+ WW+N
Subjt:  PLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFV---FPNKSNEV----------------------VVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQ----------------GFSPEYIGKTYSLKEEETTTLLKSQPN---------ILIFTV------
         GDSD  I+ + +  +     D    +   AGS  Q                GF P  I + + +  E    L   + N          L F +      
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQ----------------GFSPEYIGKTYSLKEEETTTLLKSQPN---------ILIFTV------

Query:  QQ---TQSLPKPHQHSKLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGIS--SNI
        QQ      + + +  +K+  NID              ++T+     P +    L ++   L +  +  P + A  +  ++Y+  G  KIQ+V  +  S  
Subjt:  QQ---TQSLPKPHQHSKLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGIS--SNI

Query:  DAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEK
        + +V  GQ+I++P+ FAV K AGE G E IS  T  +  +  L+G+ S L A+  +V + S+ V  E  K
Subjt:  DAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEK

AT1G07750.1 RmlC-like cupins superfamily protein1.1e-6740.35Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN
        + P  PK  + GDGGSY  W P + P+L Q N+ A +L L   GFAVP Y+DSSKV YVLQG +G AG V P K  E V+ +K+GD I +  GV +WWFN
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSK--LVYNIDAAAPDTRAK
        + D +L I+FLGET   H  G  T F L+G   +  GFS E++G+ + L E     L+ SQ    I  +     +P+P + ++   V N   A  D   K
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSK--LVYNIDAAAPDTRAK

Query:  LGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISI
         G   V   T++  P +G+ G  + L ++DA+++ SP +  + + Q+ YI  GSG++Q+VG      ++  +K G L +VP++F V KIA  +G+   SI
Subjt:  LGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISI

Query:  ITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRS
        +T   P+   LAG  SV ++LSPEV Q +F V  E EK FRS
Subjt:  ITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRS

AT2G28680.1 RmlC-like cupins superfamily protein3.3e-6740.52Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN
        + P  PK  + GDGGSY  W P + P+L   N+ A +L L   G A+P Y+DS KV YVLQG  G AG V P K  E V+ +KKGD I +  GV +WWFN
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFN

Query:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSK--LVYNIDAAAPDTRAK
        + D++L ++FLGET   H  G  T F L+G+  +  GFS E++G+ + L E     L+ SQ    I  V  +  +P+P +  +   V N   A  D   K
Subjt:  DGDSDLEIIFLGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSK--LVYNIDAAAPDTRAK

Query:  LGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISI
         G   V   T++  P +G+ G  + L ++D +++ SP +  + + Q+ YI  GSG++QIVG      ++  VK G L +VP++F V KIA  +GL   SI
Subjt:  LGTAAVTTVTESKFPFIGQSGLTSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISI

Query:  ITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK
        +T   P+   LAG+ SV +ALSPEV Q +F V  E EK FRSK
Subjt:  ITATHPLVEELAGKKSVLEALSPEVFQVSFNVTAEFEKLFRSK

AT5G44120.3 RmlC-like cupins superfamily protein2.5e-1421.04Show/hide
Query:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP-------------------------NKS
        +  + P    + + G    W     P L  + V+  R ++  +G  +P + +++K+ +V +G  G+ G V P                            
Subjt:  MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFP-------------------------NKS

Query:  NEVVVKLKKGDLIPVQNGVTSWWFNDGDSDLEIIFLGETKS--AHIPGDITYFLLSGAG----------------SLLQGFSPEYIGKTYSLKEEETTTL
        ++ V  ++ GD I    GV  W++NDG   L I+ + +  S    +  +   F L+G                  ++  GF PE I +   +  +    L
Subjt:  NEVVVKLKKGDLIPVQNGVTSWWFNDGDSDLEIIFLGETKS--AHIPGDITYFLLSGAG----------------SLLQGFSPEYIGKTYSLKEEETTTL

Query:  LKSQPNI-LIFTVQQTQSLPKP-------------------HQH--------SKLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDA
             N   I  VQ    + +P                   H +        ++   N+D  +     K     ++T+     P +    L+++   +  
Subjt:  LKSQPNI-LIFTVQQTQSLPKP-------------------HQH--------SKLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGLTSILEKLDA

Query:  NAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFN
        NA+  P + A  ++ ++Y+  G  +IQIV  + N   D +V  GQLI VP+ F+V K A     + +   T  +  +  LAG+ SVL  L  EV    F 
Subjt:  NAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSN--IDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFN

Query:  VTAE
        ++ E
Subjt:  VTAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCAATGAGTCCCAAGCCCTTCTTTGAGGGAGATGGTGGATCCTATCACAAATGGCTGCCTTCTGACTATCCCTTGCTTGCTCAGACCAACGTCGCCGCCGGCCG
CCTTCTCCTCCGCCCCCGTGGCTTTGCTGTTCCCCATTATGCTGATTCGTCTAAAGTGGGCTACGTTCTTCAAGGTGACAATGGAGTTGCAGGATTCGTGTTTCCAAACA
AGTCGAACGAAGTAGTGGTGAAACTAAAGAAAGGAGATTTGATTCCCGTGCAGAACGGAGTCACGTCGTGGTGGTTCAACGACGGAGATTCCGATCTGGAAATCATCTTT
TTAGGTGAAACCAAAAGCGCCCACATCCCCGGAGACATCACTTACTTCTTGCTCTCCGGGGCCGGCAGTCTCCTGCAAGGCTTCTCGCCGGAGTACATCGGAAAAACCTA
CTCTCTAAAAGAAGAAGAAACAACCACACTTCTCAAAAGCCAACCCAACATTCTCATCTTCACCGTTCAACAAACTCAATCCCTCCCCAAACCCCACCAACACAGCAAAC
TAGTTTACAACATCGACGCCGCCGCACCGGACACCAGAGCCAAACTCGGCACCGCCGCCGTGACGACGGTGACGGAATCCAAATTTCCGTTCATTGGCCAATCTGGGTTG
ACGTCAATTCTCGAAAAGCTCGACGCCAACGCCATCCGATCACCGGTGTACGTCGCCGAGCCGTCCGATCAACTGATCTACATAGCTAAAGGATCCGGGAAGATTCAGAT
CGTCGGAATTTCGAGTAACATTGATGCAGAGGTGAAAATGGGTCAGCTGATTTTGGTCCCCAAATACTTCGCCGTCGGAAAGATCGCCGGAGAAGAAGGCTTGGAGTGCA
TCTCAATTATCACAGCTACACATCCTCTGGTGGAAGAATTGGCCGGAAAGAAGTCGGTTTTGGAGGCATTGTCGCCGGAAGTTTTTCAAGTTTCGTTCAACGTGACGGCG
GAGTTCGAGAAGCTTTTCAGGTCAAAGGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCAATGAGTCCCAAGCCCTTCTTTGAGGGAGATGGTGGATCCTATCACAAATGGCTGCCTTCTGACTATCCCTTGCTTGCTCAGACCAACGTCGCCGCCGGCCG
CCTTCTCCTCCGCCCCCGTGGCTTTGCTGTTCCCCATTATGCTGATTCGTCTAAAGTGGGCTACGTTCTTCAAGGTGACAATGGAGTTGCAGGATTCGTGTTTCCAAACA
AGTCGAACGAAGTAGTGGTGAAACTAAAGAAAGGAGATTTGATTCCCGTGCAGAACGGAGTCACGTCGTGGTGGTTCAACGACGGAGATTCCGATCTGGAAATCATCTTT
TTAGGTGAAACCAAAAGCGCCCACATCCCCGGAGACATCACTTACTTCTTGCTCTCCGGGGCCGGCAGTCTCCTGCAAGGCTTCTCGCCGGAGTACATCGGAAAAACCTA
CTCTCTAAAAGAAGAAGAAACAACCACACTTCTCAAAAGCCAACCCAACATTCTCATCTTCACCGTTCAACAAACTCAATCCCTCCCCAAACCCCACCAACACAGCAAAC
TAGTTTACAACATCGACGCCGCCGCACCGGACACCAGAGCCAAACTCGGCACCGCCGCCGTGACGACGGTGACGGAATCCAAATTTCCGTTCATTGGCCAATCTGGGTTG
ACGTCAATTCTCGAAAAGCTCGACGCCAACGCCATCCGATCACCGGTGTACGTCGCCGAGCCGTCCGATCAACTGATCTACATAGCTAAAGGATCCGGGAAGATTCAGAT
CGTCGGAATTTCGAGTAACATTGATGCAGAGGTGAAAATGGGTCAGCTGATTTTGGTCCCCAAATACTTCGCCGTCGGAAAGATCGCCGGAGAAGAAGGCTTGGAGTGCA
TCTCAATTATCACAGCTACACATCCTCTGGTGGAAGAATTGGCCGGAAAGAAGTCGGTTTTGGAGGCATTGTCGCCGGAAGTTTTTCAAGTTTCGTTCAACGTGACGGCG
GAGTTCGAGAAGCTTTTCAGGTCAAAGGCGTAA
Protein sequenceShow/hide protein sequence
MEPMSPKPFFEGDGGSYHKWLPSDYPLLAQTNVAAGRLLLRPRGFAVPHYADSSKVGYVLQGDNGVAGFVFPNKSNEVVVKLKKGDLIPVQNGVTSWWFNDGDSDLEIIF
LGETKSAHIPGDITYFLLSGAGSLLQGFSPEYIGKTYSLKEEETTTLLKSQPNILIFTVQQTQSLPKPHQHSKLVYNIDAAAPDTRAKLGTAAVTTVTESKFPFIGQSGL
TSILEKLDANAIRSPVYVAEPSDQLIYIAKGSGKIQIVGISSNIDAEVKMGQLILVPKYFAVGKIAGEEGLECISIITATHPLVEELAGKKSVLEALSPEVFQVSFNVTA
EFEKLFRSKA