; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001707 (gene) of Snake gourd v1 genome

Gene IDTan0001707
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionglutelin type-A 2-like
Genome locationLG01:7228944..7230809
RNA-Seq ExpressionTan0001707
SyntenyTan0001707
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576976.1 12S seed storage protein CRD, partial [Cucurbita argyrosperma subsp. sororia]1.1e-15180.34Show/hide
Query:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND
        +PMNPKPFTE +AGSYHKWLPSEYPLLA+ KVAAGRLLLRPRGFVVPHYADCSKVGYVLQGE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFN+
Subjt:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND

Query:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG
        GDSD EIIFLGE+K AHV GDISYF+LSGPL  L GFSP+                   QSNALIF++ Q QSLPKP K SK VYNIDAAAPD   K  G
Subjt:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG

Query:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT
         GAVTTVTESKFP IGQSGLTAILEKLDANAVRSPVYVAEP DQLIYVAKG GKIQIVG SSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Subjt:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT

Query:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD
        HP+VEELAGKTSV EALSPEI QVSFNVTAEFEKLLRSKIT  SPVI  SD
Subjt:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD

XP_008456076.1 PREDICTED: glutelin type-A 2-like [Cucumis melo]1.2e-14074.78Show/hide
Query:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN
        ME MNPKPF EG+ GSY KWLPS+YPLLAQT VA GRLLLRPRGF VPHYADCSK GYVLQGEDGV G VFPNK +EVV+KLKKGDLIPVP G+TSWWFN
Subjt:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN

Query:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS
        DGDSD EIIFLGETK AHV GDI+YFILSGP G LQGF+P+                   QSN LIF +   QSLPKP KHSKLVYNIDAA PD   K  
Subjt:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS

Query:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA
        G  AVT VTES FP IGQ+GLTA+LEKLDANA+RSPVY+AEPSDQLIYV KG GKIQ+VGFSSK DA+VK+GQLILVP+YF  GK+AGEEGLEC S+I A
Subjt:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA

Query:  THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI
        THP+VEELAGKTSV EALS E+ QVSFNVTAEFEKL RSK+
Subjt:  THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI

XP_022922755.1 legumin J-like [Cucurbita moschata]2.3e-15280.63Show/hide
Query:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND
        +PMNPKPFTE +AGSYHKWLPSEYPLLAQ KVAAGRLLLRPRGFVVPHYADCSKVGYVLQGE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFND
Subjt:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND

Query:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG
        GDSD EIIFLGE+K AHV GDISYF+LSGPL  L GFSP+                   QSNALIF++ Q QSLPKP K+SK VYNIDAAAPD   K  G
Subjt:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG

Query:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT
         GAVTTVTESKFP IGQSGLTAILEKL+ANAVRSPVYVAEP DQLIYVAKG GKIQIVG SSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Subjt:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT

Query:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD
        HP+VEELAGKTSV EALSPEI QVSFNVTAEFEKLLRSKIT  SPVI  SD
Subjt:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD

XP_022985328.1 12S seed storage protein CRD-like [Cucurbita maxima]6.7e-15280.34Show/hide
Query:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND
        +PMNPKPFTE +AGSYHKWLPSEYPLLA  KVAAGRLLLRPRGFVVPHYADCSKVGYVLQGE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFND
Subjt:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND

Query:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG
        GDSD EIIFLGE+K AHV GDISYF+LSG L  L GFSP+                   QSNALIF++ Q QSLPKP K+SK VYNIDAAAPD   K  G
Subjt:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG

Query:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT
         GAVTTVTESKFP IGQSGLTAILEKLDANAVRSPVYVAEP DQLIYVAKG GKIQIVGFSSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Subjt:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT

Query:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD
        HP+VEELAGKTSV EALSPE+ QVSFNVTAEFEKLLRSKIT  SPVI  SD
Subjt:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD

XP_023552908.1 12S seed storage globulin 1-like [Cucurbita pepo subsp. pepo]1.4e-14979.49Show/hide
Query:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND
        +PMNPKPFTE +AGSYHKWLPSEYPLLA+ KVAAGRLLLRPRGFVVPHYADCSKVGYVLQGE+GV GLVFP+KSDEVVV LKKGDLIPVP GV+SWWFND
Subjt:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND

Query:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG
        GDSD EIIFLGE+K AHV GDISYF+LSGPL  L GFSP+                   QSNALI ++ Q QSLPKP K SK VYNIDAAAPD   K S 
Subjt:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG

Query:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT
         GAVTTVTESKFP IGQSGLTAILEKLDANAVRSPVYVAEP DQLIYVAKG GKIQIVG SSK+DAEVKMGQLILVPK+F  GK AGE+GLEC SIITAT
Subjt:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT

Query:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD
        HP+VEELAGKTSV EALSPE+ QVSFNVTAEFEKLLRSKIT  SPVI  SD
Subjt:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD

TrEMBL top hitse value%identityAlignment
A0A0A0L6K0 Uncharacterized protein6.4e-14074.49Show/hide
Query:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN
        ME MNPKPF EG+ GSYHKWLPS+YPLLAQT VA GRLLLRPRGF VPHY+DCSK GYVLQGEDGV G VFP K +EVV+KLKKGDLIPVP GVTSWWFN
Subjt:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN

Query:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS
        DGDSD EIIFLGETK AHV GDI+YFILSGP G LQGF+P+                   Q N LIF +   QSLPKP K+SKLVYNIDAAAPD   K  
Subjt:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS

Query:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA
        G  AVT VTES FP IGQ+GLT +LEKLDANA+RSPVY+AEPSDQLIYV KG GKIQ+VGFSSK DA+VK GQLILVP+YF  GKIAGEEGLEC S+I A
Subjt:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA

Query:  THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI
        THP+VEELAGKTSV EALS E+ QVSFNVTAEFEKL RSK+
Subjt:  THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI

A0A1S3C2D5 glutelin type-A 2-like5.8e-14174.78Show/hide
Query:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN
        ME MNPKPF EG+ GSY KWLPS+YPLLAQT VA GRLLLRPRGF VPHYADCSK GYVLQGEDGV G VFPNK +EVV+KLKKGDLIPVP G+TSWWFN
Subjt:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN

Query:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS
        DGDSD EIIFLGETK AHV GDI+YFILSGP G LQGF+P+                   QSN LIF +   QSLPKP KHSKLVYNIDAA PD   K  
Subjt:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS

Query:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA
        G  AVT VTES FP IGQ+GLTA+LEKLDANA+RSPVY+AEPSDQLIYV KG GKIQ+VGFSSK DA+VK+GQLILVP+YF  GK+AGEEGLEC S+I A
Subjt:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA

Query:  THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI
        THP+VEELAGKTSV EALS E+ QVSFNVTAEFEKL RSK+
Subjt:  THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI

A0A5A7T7U8 Glutelin type-A 2-like5.8e-14174.78Show/hide
Query:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN
        ME MNPKPF EG+ GSY KWLPS+YPLLAQT VA GRLLLRPRGF VPHYADCSK GYVLQGEDGV G VFPNK +EVV+KLKKGDLIPVP G+TSWWFN
Subjt:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN

Query:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS
        DGDSD EIIFLGETK AHV GDI+YFILSGP G LQGF+P+                   QSN LIF +   QSLPKP KHSKLVYNIDAA PD   K  
Subjt:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS

Query:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA
        G  AVT VTES FP IGQ+GLTA+LEKLDANA+RSPVY+AEPSDQLIYV KG GKIQ+VGFSSK DA+VK+GQLILVP+YF  GK+AGEEGLEC S+I A
Subjt:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITA

Query:  THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI
        THP+VEELAGKTSV EALS E+ QVSFNVTAEFEKL RSK+
Subjt:  THPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKI

A0A6J1E9P2 legumin J-like1.1e-15280.63Show/hide
Query:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND
        +PMNPKPFTE +AGSYHKWLPSEYPLLAQ KVAAGRLLLRPRGFVVPHYADCSKVGYVLQGE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFND
Subjt:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND

Query:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG
        GDSD EIIFLGE+K AHV GDISYF+LSGPL  L GFSP+                   QSNALIF++ Q QSLPKP K+SK VYNIDAAAPD   K  G
Subjt:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG

Query:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT
         GAVTTVTESKFP IGQSGLTAILEKL+ANAVRSPVYVAEP DQLIYVAKG GKIQIVG SSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Subjt:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT

Query:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD
        HP+VEELAGKTSV EALSPEI QVSFNVTAEFEKLLRSKIT  SPVI  SD
Subjt:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD

A0A6J1JDB2 12S seed storage protein CRD-like3.3e-15280.34Show/hide
Query:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND
        +PMNPKPFTE +AGSYHKWLPSEYPLLA  KVAAGRLLLRPRGFVVPHYADCSKVGYVLQGE+GVAGLVFP+KSDEVVV LKKGDLIPVP GV+SWWFND
Subjt:  EPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFND

Query:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG
        GDSD EIIFLGE+K AHV GDISYF+LSG L  L GFSP+                   QSNALIF++ Q QSLPKP K+SK VYNIDAAAPD   K  G
Subjt:  GDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSG

Query:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT
         GAVTTVTESKFP IGQSGLTAILEKLDANAVRSPVYVAEP DQLIYVAKG GKIQIVGFSSK+DAEVKMGQLILVPK+F  GKIAGE+GLEC SIITAT
Subjt:  GGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITAT

Query:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD
        HP+VEELAGKTSV EALSPE+ QVSFNVTAEFEKLLRSKIT  SPVI  SD
Subjt:  HPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD

SwissProt top hitse value%identityAlignment
A0A222NNM9 Cocosin 18.9e-1421.69Show/hide
Query:  VAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP-----------------------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEII
        V+  R ++ PRG ++P  ++  ++ Y++QG  G+ GLV P                           + V + ++GD++ VP G   W +N+G++    I
Subjt:  VAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP-----------------------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEII

Query:  FLGETKTAHVAGDISY--FILSG---------------PLGFLQGFSPDQSNALI--------------------------FALAQPQSLPKPQKHS---
         + +T       D S+  F+L+G                   L+GFS +   A                              + +P  + + ++     
Subjt:  FLGETKTAHVAGDISY--FILSG---------------PLGFLQGFSPDQSNALI--------------------------FALAQPQSLPKPQKHS---

Query:  -----------KLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKV--DAEV
                   K+  NI          P  GG +TT+   K P +    ++A    L  NA+ SP +    +  ++Y   G G++++     +   D E+
Subjt:  -----------KLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSKV--DAEV

Query:  KMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPV
        + GQL++VP+ F   + AG EG +  SI T+   +V  + GKTS    +  E+L  S+ ++   ++  R K+T+   V
Subjt:  KMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPV

P07728 Glutelin type-A 15.6e-1620.51Show/hide
Query:  TKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP----------------------------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDG
        T V+  R ++ PRG ++PHY + + + Y++QG  G+ G  FP                                + + + ++GD+I +P GV  W +NDG
Subjt:  TKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP----------------------------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDG

Query:  DSDFEIIFLG--------------------------------ETKTAHVAGDISYFILSGPLGFLQGFS------PDQSNALI-----FALAQPQSLPKP
        +     I++                                 E ++ ++    S  +LS  LG     +       DQ   ++      +L QP +  + 
Subjt:  DSDFEIIFLG--------------------------------ETKTAHVAGDISYFILSGPLGFLQGFS------PDQSNALI-----FALAQPQSLPKP

Query:  QKHS---------------------------------KLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ
        Q+                                   ++  NID      T  P   G VT +    FP +    ++A+   L  NA+ SP +    +  
Subjt:  QKHS---------------------------------KLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ

Query:  LIYVAKGFGKIQIVGFSSKV--DAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLR
        ++Y+ +G  ++Q+V  + K   + E++ GQL+++P+++   K A  EG    +  T  + +V  +AGK+S+F AL  ++L  ++ ++ E  + L+
Subjt:  LIYVAKGFGKIQIVGFSSKV--DAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLR

P07730 Glutelin type-A 21.9e-1621.27Show/hide
Query:  TKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP----------------------------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDG
        T V+  R ++ PRG ++PHY + + + Y++QG  G+ G  FP                                + + + ++GD+I +P GV  W +NDG
Subjt:  TKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP----------------------------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDG

Query:  DSDFEIIFLGETKTAHVAGDISY--FILSG---------------PLGFLQGFSP---------------------DQSNALI-----FALAQPQSLPKP
        +     I++ +        D     F+L+G                     GFS                      DQ   ++      +L QP +  + 
Subjt:  DSDFEIIFLGETKTAHVAGDISY--FILSG---------------PLGFLQGFSP---------------------DQSNALI-----FALAQPQSLPKP

Query:  QKHSKLV---------------------------------YNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ
        Q+  ++                                   NID      T  P   G VT +    FP +    ++A+   L  NA+ SP +    +  
Subjt:  QKHSKLV---------------------------------YNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ

Query:  LIYVAKGFGKIQIVGFSSKV--DAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLR
        ++Y+ +G  ++Q+V  + K   + E++ GQL++VP+++V  K A  EG    +  T  + +V  +AGK+S+F AL  ++L  ++ ++ E  + L+
Subjt:  LIYVAKGFGKIQIVGFSSKV--DAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLR

Q09151 Glutelin type-A 33.5e-1822.55Show/hide
Query:  TKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP----------------------------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDG
        T V   R ++ PRG ++PHY++ + + YV+QG  G+ G  FP                                + + + ++GD++ +P GV  W +NDG
Subjt:  TKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFP----------------------------NKSDEVVVKLKKGDLIPVPEGVTSWWFNDG

Query:  DSDFEIIFL--------------------GETK-------------TAHVAGDISYFILSGPLGFLQGFS------PDQSNALI-----FALAQP-QSLP
        D+    I++                    G  K             + +V G  S  +LS  LG   G +       DQ   ++      +L QP  SL 
Subjt:  DSDFEIIFL--------------------GETK-------------TAHVAGDISYFILSGPLGFLQGFS------PDQSNALI-----FALAQP-QSLP

Query:  KPQKHS-------------------------------KLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ
        + Q+                                 ++  NID      T  P   G +T +   KFP +    ++A+   L  NA+ SP +    +  
Subjt:  KPQKHS-------------------------------KLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ

Query:  LIYVAKGFGKIQIVGFSSKV--DAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITK
        ++Y+ +G  ++Q+V  + K   D E++ GQL+++P++ V  K A  EG    ++ T    +V  +AGK S+F AL  +++  ++ ++ E  + L+     
Subjt:  LIYVAKGFGKIQIVGFSSKV--DAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITK

Query:  TSPVIPPS
           V  PS
Subjt:  TSPVIPPS

Q9ZWA9 12S seed storage protein CRD2.0e-1323.53Show/hide
Query:  PKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLV---FPNKSDEV----------------------VV
        P   T+ +AG    W     P L    V   R+ L+P    +P +     + YV+QGE GV G +    P    EV                      + 
Subjt:  PKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLV---FPNKSDEV----------------------VV

Query:  KLKKGDLIPVPEGVTSWWFNDGDSDFEI-IFLGETKTAHVAGDI-SYFILSG--------PLGF------LQGFSPD-----------------------
          ++GD+     GV+ WW+N GDSD  I I L  T   +    +   F L+G        PL +        GF P+                       
Subjt:  KLKKGDLIPVPEGVTSWWFNDGDSDFEI-IFLGETKTAHVAGDI-SYFILSG--------PLGF------LQGFSPD-----------------------

Query:  -----QSNALIFALAQPQ---------SLPKPQKHSKLVYNIDAAAPDTTPK-PSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ
              +  L F +  P+          + +    +K+  NID   P+ +    +  G ++T+     P +    L A+   L +  +  P + A  +  
Subjt:  -----QSNALIFALAQPQ---------SLPKPQKHSKLVYNIDAAAPDTTPK-PSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ

Query:  LIYVAKGFGKIQIV--GFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITK
        ++YV  G  KIQ+V     S  + +V  GQ+I++P+ F   K AGE G E  S  T  +  +  L+G+TS   A+  ++++ S+ V  E  K ++    +
Subjt:  LIYVAKGFGKIQIV--GFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITK

Query:  TSPVIPPS
        T   + PS
Subjt:  TSPVIPPS

Arabidopsis top hitse value%identityAlignment
AT1G03880.1 cruciferin 22.9e-1221.65Show/hide
Query:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSD------------------------
        +  + P    + + G    W     P L  +  A  R ++ P+G  +P + +  K+ +V+ G  G+ G V P  ++                        
Subjt:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSD------------------------

Query:  EVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGE--TKTAHVAGDISYFILSG--PLG--FLQGFSPDQSNALI--FA---LAQ-------------
        + V  L+ GD I  P GV  W++N+G+    ++   +  +    +  ++  F+++G  P G  +LQG    + N +   FA   LAQ             
Subjt:  EVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFLGE--TKTAHVAGDISYFILSG--PLG--FLQGFSPDQSNALI--FA---LAQ-------------

Query:  --------------PQSLPKP---------QKHS------------KLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSP
                      P  + +P         Q H             +   N+D  +     KPS  G ++T+     P +    L+A+   +  NA+  P
Subjt:  --------------PQSLPKP---------QKHS------------KLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSP

Query:  VYVAEPSDQLIYVAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFE
         +    ++  +YV  G   IQ+V  + +   D E+  GQL++VP+ F   K A  E  E     T  +  V  LAG+TSV   L  E++   + ++ E  
Subjt:  VYVAEPSDQLIYVAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFE

Query:  KLLRSKITKTS
        K ++    +T+
Subjt:  KLLRSKITKTS

AT1G03890.1 RmlC-like cupins superfamily protein1.4e-1423.53Show/hide
Query:  PKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLV---FPNKSDEV----------------------VV
        P   T+ +AG    W     P L    V   R+ L+P    +P +     + YV+QGE GV G +    P    EV                      + 
Subjt:  PKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLV---FPNKSDEV----------------------VV

Query:  KLKKGDLIPVPEGVTSWWFNDGDSDFEI-IFLGETKTAHVAGDI-SYFILSG--------PLGF------LQGFSPD-----------------------
          ++GD+     GV+ WW+N GDSD  I I L  T   +    +   F L+G        PL +        GF P+                       
Subjt:  KLKKGDLIPVPEGVTSWWFNDGDSDFEI-IFLGETKTAHVAGDI-SYFILSG--------PLGF------LQGFSPD-----------------------

Query:  -----QSNALIFALAQPQ---------SLPKPQKHSKLVYNIDAAAPDTTPK-PSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ
              +  L F +  P+          + +    +K+  NID   P+ +    +  G ++T+     P +    L A+   L +  +  P + A  +  
Subjt:  -----QSNALIFALAQPQ---------SLPKPQKHSKLVYNIDAAAPDTTPK-PSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQ

Query:  LIYVAKGFGKIQIV--GFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITK
        ++YV  G  KIQ+V     S  + +V  GQ+I++P+ F   K AGE G E  S  T  +  +  L+G+TS   A+  ++++ S+ V  E  K ++    +
Subjt:  LIYVAKGFGKIQIV--GFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITK

Query:  TSPVIPPS
        T   + PS
Subjt:  TSPVIPPS

AT1G07750.1 RmlC-like cupins superfamily protein1.6e-6337.01Show/hide
Query:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN
        + P  PK    GD GSY  W P E P+L Q  + A +L L   GF VP Y+D SKV YVLQG  G AG+V P K +E V+ +K+GD I +P GV +WWFN
Subjt:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN

Query:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS
        + D +  I+FLGET   H AG  + F L+G  G   GFS +                   Q+   I  L     +P+P++ ++  + ++           
Subjt:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS

Query:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLECFSII
         GG V  +     P +G+ G  A L ++DA+++ SP +  + + Q+ Y+  G G++Q+VG   K  ++  +K G L +VP++FV  KIA  +G+  FSI+
Subjt:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLECFSII

Query:  TATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD
        T   P+   LAG TSV+++LSPE+LQ +F V  E EK  RS  T ++   PPS+
Subjt:  TATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD

AT2G28680.1 RmlC-like cupins superfamily protein1.5e-6136.72Show/hide
Query:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN
        + P  PK    GD GSY  W P E P+L    + A +L L   G  +P Y+D  KV YVLQG  G AG+V P K +E V+ +KKGD I +P GV +WWFN
Subjt:  MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFN

Query:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS
        + D++  ++FLGET   H AG  + F L+G  G   GFS +                   Q+   I  +     +P+P+K  +  + ++           
Subjt:  DGDSDFEIIFLGETKTAHVAGDISYFILSGPLGFLQGFSPD-------------------QSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPS

Query:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLECFSII
         GG V  +     P +G+ G  A L ++D +++ SP +  + + Q+ Y+  G G++QIVG   K  ++  VK G L +VP++FV  KIA  +GL  FSI+
Subjt:  GGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSK--VDAEVKMGQLILVPKYFVAGKIAGEEGLECFSII

Query:  TATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD
        T   P+   LAG+TSV++ALSPE+LQ +F V  E EK  RSK T  +    PS+
Subjt:  TATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIPPSD

AT4G28520.3 cruciferin 31.6e-1022.94Show/hide
Query:  QGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFL-----------GETKTAHVAGDISYFILSGPLGFLQGFSPDQSNALIFA
        QG+ G  G        + V  +++GD+     G   W +N G+    II L              +  H+AG       +   G   G    Q    +++
Subjt:  QGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIFL-----------GETKTAHVAGDISYFILSGPLGFLQGFSPDQSNALIFA

Query:  LAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSK--VD
            Q + +  K       ID   P         G VT+V     P +    L+A    L  NA+  P Y    +++++Y   G G+IQ+V  + +  +D
Subjt:  LAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYVAEPSDQLIYVAKGFGKIQIVGFSSK--VD

Query:  AEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTS
         +V+ GQL+++P+ F     +     E  S  T  + ++  LAG+TS+  AL  E++   F ++ E  + ++    +T+
Subjt:  AEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCGATGAATCCCAAGCCCTTCACTGAGGGAGATGCTGGATCGTATCACAAATGGCTTCCTTCTGAATATCCCTTACTTGCTCAGACCAAGGTCGCCGCCGGCCG
CCTTCTCCTCCGCCCTCGCGGCTTCGTCGTTCCCCACTACGCCGATTGCTCTAAAGTGGGCTATGTTCTTCAAGGTGAAGATGGGGTTGCAGGATTGGTGTTTCCAAACA
AGTCCGATGAAGTGGTAGTGAAACTTAAGAAAGGAGATCTGATTCCGGTGCCGGAAGGAGTCACGTCGTGGTGGTTCAACGACGGAGACTCCGATTTCGAGATTATCTTT
TTGGGTGAAACCAAAACCGCTCATGTCGCCGGTGACATCTCTTACTTCATTCTCTCCGGCCCTCTTGGCTTCCTGCAAGGCTTCTCGCCGGACCAATCCAACGCCCTTAT
CTTCGCCCTTGCACAACCCCAATCCCTCCCCAAACCCCAAAAACACAGCAAACTAGTTTACAACATTGACGCCGCCGCGCCGGACACCACACCCAAGCCTAGCGGCGGCG
GCGCCGTCACGACGGTGACGGAATCCAAATTTCCCTCCATTGGCCAATCTGGGTTGACGGCAATTCTTGAAAAGCTTGACGCCAACGCCGTTCGATCGCCGGTGTACGTT
GCTGAGCCGTCCGATCAACTGATCTATGTGGCTAAAGGATTCGGGAAGATTCAGATTGTTGGATTTTCGAGTAAAGTTGATGCAGAGGTGAAAATGGGTCAGCTTATTTT
AGTCCCCAAATACTTCGTCGCCGGAAAAATCGCCGGAGAAGAAGGCTTGGAGTGCTTCTCCATTATCACAGCTACACATCCTCTGGTGGAAGAATTGGCCGGAAAGACGT
CGGTTTTCGAGGCATTGTCGCCGGAGATTCTTCAAGTTTCGTTCAACGTCACGGCGGAGTTCGAAAAGCTTCTTAGATCGAAGATCACAAAAACTTCACCAGTGATTCCA
CCTTCAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACCGATGAATCCCAAGCCCTTCACTGAGGGAGATGCTGGATCGTATCACAAATGGCTTCCTTCTGAATATCCCTTACTTGCTCAGACCAAGGTCGCCGCCGGCCG
CCTTCTCCTCCGCCCTCGCGGCTTCGTCGTTCCCCACTACGCCGATTGCTCTAAAGTGGGCTATGTTCTTCAAGGTGAAGATGGGGTTGCAGGATTGGTGTTTCCAAACA
AGTCCGATGAAGTGGTAGTGAAACTTAAGAAAGGAGATCTGATTCCGGTGCCGGAAGGAGTCACGTCGTGGTGGTTCAACGACGGAGACTCCGATTTCGAGATTATCTTT
TTGGGTGAAACCAAAACCGCTCATGTCGCCGGTGACATCTCTTACTTCATTCTCTCCGGCCCTCTTGGCTTCCTGCAAGGCTTCTCGCCGGACCAATCCAACGCCCTTAT
CTTCGCCCTTGCACAACCCCAATCCCTCCCCAAACCCCAAAAACACAGCAAACTAGTTTACAACATTGACGCCGCCGCGCCGGACACCACACCCAAGCCTAGCGGCGGCG
GCGCCGTCACGACGGTGACGGAATCCAAATTTCCCTCCATTGGCCAATCTGGGTTGACGGCAATTCTTGAAAAGCTTGACGCCAACGCCGTTCGATCGCCGGTGTACGTT
GCTGAGCCGTCCGATCAACTGATCTATGTGGCTAAAGGATTCGGGAAGATTCAGATTGTTGGATTTTCGAGTAAAGTTGATGCAGAGGTGAAAATGGGTCAGCTTATTTT
AGTCCCCAAATACTTCGTCGCCGGAAAAATCGCCGGAGAAGAAGGCTTGGAGTGCTTCTCCATTATCACAGCTACACATCCTCTGGTGGAAGAATTGGCCGGAAAGACGT
CGGTTTTCGAGGCATTGTCGCCGGAGATTCTTCAAGTTTCGTTCAACGTCACGGCGGAGTTCGAAAAGCTTCTTAGATCGAAGATCACAAAAACTTCACCAGTGATTCCA
CCTTCAGATTGA
Protein sequenceShow/hide protein sequence
MEPMNPKPFTEGDAGSYHKWLPSEYPLLAQTKVAAGRLLLRPRGFVVPHYADCSKVGYVLQGEDGVAGLVFPNKSDEVVVKLKKGDLIPVPEGVTSWWFNDGDSDFEIIF
LGETKTAHVAGDISYFILSGPLGFLQGFSPDQSNALIFALAQPQSLPKPQKHSKLVYNIDAAAPDTTPKPSGGGAVTTVTESKFPSIGQSGLTAILEKLDANAVRSPVYV
AEPSDQLIYVAKGFGKIQIVGFSSKVDAEVKMGQLILVPKYFVAGKIAGEEGLECFSIITATHPLVEELAGKTSVFEALSPEILQVSFNVTAEFEKLLRSKITKTSPVIP
PSD