; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013563 (gene) of Chayote v1 genome

Gene IDSed0013563
OrganismSechium edule (Chayote v1)
Descriptionglutelin type-A 2-like
Genome locationLG13:3613560..3617219
RNA-Seq ExpressionSed0013563
SyntenySed0013563
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576976.1 12S seed storage protein CRD, partial [Cucurbita argyrosperma subsp. sororia]3.1e-14175Show/hide
Query:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND
        +PMNPKPF E +AG YHKWLPS+YPLLARN VAAGRLLLRPRGFVVPHY+DCSKVGYVLQGENGVAGLV P KS E VV LKKGDLIPVPNG +SWWFN+
Subjt:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND

Query:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------
        GD+D EI++LGE+K AHV GDISYF+LSGPLSLL GFSPE+VGKTYSLN ++TTQ L +QSN LIF I+++QSLPKP K SK VY I A           
Subjt:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------

Query:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN
         AV TVTE++FPFIG++GL+ +LEKLD  AVRSPV+VAEP DQ+IYV KG GKIQIVG S K+DA+V+MGQ+ILVPK+F  GK+AGEDGLECISIITA++
Subjt:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN

Query:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        PVVEELAGKTSVL ALSPE+ QVSFNVTAEFEKLLR
Subjt:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

XP_008456076.1 PREDICTED: glutelin type-A 2-like [Cucumis melo]2.3e-13370.33Show/hide
Query:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN
        ME MNPKPFFEG+ G Y KWLPSDYPLLA+ NVA GRLLLRPRGF VPHY+DCSK GYVLQGE+GV G V P K  E V+KLKKGDLIPVP+G TSWWFN
Subjt:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN

Query:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEI------------
        DGD+D EI++LGETK AHV GDI+YFILSGP  LLQGF+PE+V K+YSL++++T + L +QSN LIF ++ SQSLPKP K SKLVY I            
Subjt:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEI------------

Query:  SAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITAS
        +AAV  VTE+ FPFIG+TGL+ VLEKLD  A+RSPV++AEP+DQ+IYV KGSGKIQ+VGFS K DADV++GQ+ILVP+YF  GK+AGE+GLECIS+I A+
Subjt:  SAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITAS

Query:  NPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        +P+VEELAGKTSVL ALS EV QVSFNVTAEFEKL R
Subjt:  NPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

XP_022922755.1 legumin J-like [Cucurbita moschata]6.8e-14174.7Show/hide
Query:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND
        +PMNPKPF E +AG YHKWLPS+YPLLA+N VAAGRLLLRPRGFVVPHY+DCSKVGYVLQGENGVAGLV P KS E VV LKKGDLIPVPNG +SWWFND
Subjt:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND

Query:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------
        GD+D EI++LGE+K AHV GDISYF+LSGPLSLL GFSPE+VGKTYSLN ++TTQ L +QSN LIF I+++QSLPKP K SK VY I A           
Subjt:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------

Query:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN
         AV TVTE++FPFIG++GL+ +LEKL+  AVRSPV+VAEP DQ+IYV KG GKIQIVG S K+DA+V+MGQ+ILVPK+F  GK+AGEDGLECISIITA++
Subjt:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN

Query:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        PVVEELAGKTSVL ALSPE+ QVSFNVTAEFEKLLR
Subjt:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

XP_022985328.1 12S seed storage protein CRD-like [Cucurbita maxima]1.2e-14075Show/hide
Query:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND
        +PMNPKPF E +AG YHKWLPS+YPLLA N VAAGRLLLRPRGFVVPHY+DCSKVGYVLQGENGVAGLV P KS E VV LKKGDLIPVPNG +SWWFND
Subjt:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND

Query:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------
        GD+D EI++LGE+K AHV GDISYF+LSG LSLL GFSPE+VG+TYSLN ++TTQ L +QSN LIF I+++QSLPKP K SK VY I A           
Subjt:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------

Query:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN
         AV TVTE++FPFIG++GL+ +LEKLD  AVRSPV+VAEP DQ+IYV KG GKIQIVGFS K+DA+V+MGQ+ILVPK+F  GK+AGEDGLECISIITA++
Subjt:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN

Query:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        PVVEELAGKTSVL ALSPEV QVSFNVTAEFEKLLR
Subjt:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

XP_023552908.1 12S seed storage globulin 1-like [Cucurbita pepo subsp. pepo]7.5e-14075Show/hide
Query:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND
        +PMNPKPF E +AG YHKWLPS+YPLLARN VAAGRLLLRPRGFVVPHY+DCSKVGYVLQGENGV GLV P KS E VV LKKGDLIPVPNG +SWWFND
Subjt:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND

Query:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------
        GD+D EI++LGE+K AHV GDISYF+LSGPLSLL GFSPE+VGKTYSLN ++TTQ L +QSN LI  I+++QSLPKP K SK VY I A           
Subjt:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------

Query:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN
         AV TVTE++FPFIG++GL+ +LEKLD  AVRSPV+VAEP DQ+IYV KG GKIQIVG S K+DA+V+MGQ+ILVPK+F  GK AGEDGLECISIITA++
Subjt:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN

Query:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        PVVEELAGKTSVL ALSPEV QVSFNVTAEFEKLLR
Subjt:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

TrEMBL top hitse value%identityAlignment
A0A1S3C2D5 glutelin type-A 2-like1.1e-13370.33Show/hide
Query:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN
        ME MNPKPFFEG+ G Y KWLPSDYPLLA+ NVA GRLLLRPRGF VPHY+DCSK GYVLQGE+GV G V P K  E V+KLKKGDLIPVP+G TSWWFN
Subjt:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN

Query:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEI------------
        DGD+D EI++LGETK AHV GDI+YFILSGP  LLQGF+PE+V K+YSL++++T + L +QSN LIF ++ SQSLPKP K SKLVY I            
Subjt:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEI------------

Query:  SAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITAS
        +AAV  VTE+ FPFIG+TGL+ VLEKLD  A+RSPV++AEP+DQ+IYV KGSGKIQ+VGFS K DADV++GQ+ILVP+YF  GK+AGE+GLECIS+I A+
Subjt:  SAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITAS

Query:  NPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        +P+VEELAGKTSVL ALS EV QVSFNVTAEFEKL R
Subjt:  NPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

A0A5A7T7U8 Glutelin type-A 2-like1.1e-13370.33Show/hide
Query:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN
        ME MNPKPFFEG+ G Y KWLPSDYPLLA+ NVA GRLLLRPRGF VPHY+DCSK GYVLQGE+GV G V P K  E V+KLKKGDLIPVP+G TSWWFN
Subjt:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN

Query:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEI------------
        DGD+D EI++LGETK AHV GDI+YFILSGP  LLQGF+PE+V K+YSL++++T + L +QSN LIF ++ SQSLPKP K SKLVY I            
Subjt:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEI------------

Query:  SAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITAS
        +AAV  VTE+ FPFIG+TGL+ VLEKLD  A+RSPV++AEP+DQ+IYV KGSGKIQ+VGFS K DADV++GQ+ILVP+YF  GK+AGE+GLECIS+I A+
Subjt:  SAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITAS

Query:  NPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        +P+VEELAGKTSVL ALS EV QVSFNVTAEFEKL R
Subjt:  NPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

A0A5D3BLA4 Glutelin type-A 2-like1.1e-13370.33Show/hide
Query:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN
        ME MNPKPFFEG+ G Y KWLPSDYPLLA+ NVA GRLLLRPRGF VPHY+DCSK GYVLQGE+GV G V P K  E V+KLKKGDLIPVP+G TSWWFN
Subjt:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN

Query:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEI------------
        DGD+D EI++LGETK AHV GDI+YFILSGP  LLQGF+PE+V K+YSL++++T + L +QSN LIF ++ SQSLPKP K SKLVY I            
Subjt:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEI------------

Query:  SAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITAS
        +AAV  VTE+ FPFIG+TGL+ VLEKLD  A+RSPV++AEP+DQ+IYV KGSGKIQ+VGFS K DADV++GQ+ILVP+YF  GK+AGE+GLECIS+I A+
Subjt:  SAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITAS

Query:  NPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        +P+VEELAGKTSVL ALS EV QVSFNVTAEFEKL R
Subjt:  NPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

A0A6J1E9P2 legumin J-like3.3e-14174.7Show/hide
Query:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND
        +PMNPKPF E +AG YHKWLPS+YPLLA+N VAAGRLLLRPRGFVVPHY+DCSKVGYVLQGENGVAGLV P KS E VV LKKGDLIPVPNG +SWWFND
Subjt:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND

Query:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------
        GD+D EI++LGE+K AHV GDISYF+LSGPLSLL GFSPE+VGKTYSLN ++TTQ L +QSN LIF I+++QSLPKP K SK VY I A           
Subjt:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------

Query:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN
         AV TVTE++FPFIG++GL+ +LEKL+  AVRSPV+VAEP DQ+IYV KG GKIQIVG S K+DA+V+MGQ+ILVPK+F  GK+AGEDGLECISIITA++
Subjt:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN

Query:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        PVVEELAGKTSVL ALSPE+ QVSFNVTAEFEKLLR
Subjt:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

A0A6J1JDB2 12S seed storage protein CRD-like5.6e-14175Show/hide
Query:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND
        +PMNPKPF E +AG YHKWLPS+YPLLA N VAAGRLLLRPRGFVVPHY+DCSKVGYVLQGENGVAGLV P KS E VV LKKGDLIPVPNG +SWWFND
Subjt:  EPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFND

Query:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------
        GD+D EI++LGE+K AHV GDISYF+LSG LSLL GFSPE+VG+TYSLN ++TTQ L +QSN LIF I+++QSLPKP K SK VY I A           
Subjt:  GDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISA-----------

Query:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN
         AV TVTE++FPFIG++GL+ +LEKLD  AVRSPV+VAEP DQ+IYV KG GKIQIVGFS K+DA+V+MGQ+ILVPK+F  GK+AGEDGLECISIITA++
Subjt:  -AVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN

Query:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        PVVEELAGKTSVL ALSPEV QVSFNVTAEFEKLLR
Subjt:  PVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

SwissProt top hitse value%identityAlignment
P04405 Glycinin G23.4e-1822.56Show/hide
Query:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPG---------------------KSGEEV
        +  + P    E + GF   W P++ P      VA  R  L       P Y++  +  Y+ QG NG+ G++ PG                        ++V
Subjt:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPG---------------------KSGEEV

Query:  VKLKKGDLIPVPNGATSWWFNDGDADFEIVYLGETKTA-----------HVAGDISYFIL---------------------SGPLSLLQGFSPEFVGKTY
         + ++GDLI VP G   W +N+ D     V + +T +            ++AG+     L                     +   ++L GF+PEF+ + +
Subjt:  VKLKKGDLIPVPNGATSWWFNDGDADFEIVYLGETKTA-----------HVAGDISYFIL---------------------SGPLSLLQGFSPEFVGKTY

Query:  SLNEQQTTQLL---NNQSNGLIFEIK-----KSQSLPKP----------------------QKQSKL-------------------------VYEISA-A
         +N Q    L      + +G I  +K      + ++ KP                      Q+QSK                          +Y   A +
Subjt:  SLNEQQTTQLL---NNQSNGLIFEIK-----KSQSLPKP----------------------QKQSKL-------------------------VYEISA-A

Query:  VATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN
        + T T  +FP +    LS     L   A+  P +    A+ +IY + G   +Q+V  +G+   D +++ G V++VP+ F     +  D  E +S  T   
Subjt:  VATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASN

Query:  PVVEELAGKTSVLNALSPEVLQVSFNVTAE
        P +  LAG  S+LNAL  EV+Q +FN+ ++
Subjt:  PVVEELAGKTSVLNALSPEVLQVSFNVTAE

P07728 Glutelin type-A 12.0e-1821.94Show/hide
Query:  VAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPG----------KSG------------------EEVVKLKKGDLIPVPNGATSWWFNDGDA
        V+  R ++ PRG ++PHY++ + + Y++QG  G+ G   PG          +SG                  +++ + ++GD+I +P G   W +NDG+ 
Subjt:  VAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPG----------KSG------------------EEVVKLKKGDLIPVPNGATSWWFNDGDA

Query:  DFEIVYLGETKTAHVAGDISY--FILSG---------------PLSLLQGFSPEFVGKTYSLNEQQTTQL-LNNQSNGLIFEIKKSQSLPKP--------
            +Y+ +        D     F+L+G                 ++  GFS E + +   ++ Q   QL   N   G I  ++   SL +P        
Subjt:  DFEIVYLGETKTAHVAGDISY--FILSG---------------PLSLLQGFSPEFVGKTYSLNEQQTTQL-LNNQSNGLIFEIKKSQSLPKP--------

Query:  --QKQSKLVYE-----------------------------------------ISAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIY
          Q QS+  Y+                                          +  V  +    FP +    +S V   L   A+ SP F    A  V+Y
Subjt:  --QKQSKLVYE-----------------------------------------ISAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIY

Query:  VVKGSGKIQIVGFSGKV--DADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        + +G  ++Q+V  +GK   + ++  GQ++++P+++   K A  +G   I+  T  N +V  +AGK+S+  AL  +VL  ++ ++ E  + L+
Subjt:  VVKGSGKIQIVGFSGKV--DADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

P07730 Glutelin type-A 24.8e-2022.19Show/hide
Query:  VAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPG----------KSG------------------EEVVKLKKGDLIPVPNGATSWWFNDGDA
        V+  R ++ PRG ++PHY++ + + Y++QG  G+ G   PG          +SG                  +++ + ++GD+I +P G   W +NDG+ 
Subjt:  VAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPG----------KSG------------------EEVVKLKKGDLIPVPNGATSWWFNDGDA

Query:  DFEIVYLGETKTAHVAGDISY--FILSG---------------PLSLLQGFSPEFVGKTYSLNEQQTTQL-LNNQSNGLIFEIKKSQSLPKP--------
            +Y+ +        D     F+L+G                 ++  GFS E + + + ++ Q   QL   N   G I  +++  SL +P        
Subjt:  DFEIVYLGETKTAHVAGDISY--FILSG---------------PLSLLQGFSPEFVGKTYSLNEQQTTQL-LNNQSNGLIFEIKKSQSLPKP--------

Query:  --QKQSKLVYE-----------------------------------------ISAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIY
          Q QS+  Y+                                          +  V  +    FP +    +S V   L   A+ SP F    A  ++Y
Subjt:  --QKQSKLVYE-----------------------------------------ISAAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIY

Query:  VVKGSGKIQIVGFSGKV--DADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        + +G  ++Q+V  +GK   + ++  GQ+++VP+++V  K A  +G   I+  T  N +V  +AGK+S+  AL  +VL  ++ ++ E  + L+
Subjt:  VVKGSGKIQIVGFSGKV--DADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

P11828 Glycinin G32.6e-1823.1Show/hide
Query:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPG------------------KSGEEVVKL
        +  + P    E + GF   W P++ P      VA  R  L       P Y++  +  Y+ QG +G+ G++ PG                     +++   
Subjt:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPG------------------KSGEEVVKL

Query:  KKGDLIPVPNGATSWWFNDGDADFEIVYLGET-----------KTAHVAGDISY-FILSGPL----------------------SLLQGFSPEFVGKTYS
        ++GDLI VP G   W +N+ D     V L +T           +  ++AG+    F+   P                       S+L GF+PEF+   + 
Subjt:  KKGDLIPVPNGATSWWFNDGDADFEIVYLGET-----------KTAHVAGDISY-FILSGPL----------------------SLLQGFSPEFVGKTYS

Query:  LNEQQTTQLL---NNQSNGLIFEIKKSQSLPKP------------------------QKQS-----------KLVYEI------------SAAVATVTEA
        ++ Q   +L      +  G I  +K   S+  P                        Q QS           +L + I            + ++ T T  
Subjt:  LNEQQTTQLL---NNQSNGLIFEIKKSQSLPKP------------------------QKQS-----------KLVYEI------------SAAVATVTEA

Query:  EFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELA
        +FP +    LS     L   A+  P +    A+ +IY + G   +Q+V  +G+   D +++ GQV++VP+ F     +  D  E +S  T   P +  LA
Subjt:  EFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELA

Query:  GKTSVLNALSPEVLQVSFNV
        G  S+LNAL  EV+Q +FN+
Subjt:  GKTSVLNALSPEVLQVSFNV

Q8GZP6 11S globulin seed storage protein Ana o 2.0101 (Fragment)2.0e-1823.82Show/hide
Query:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIP---------------GKSG------EEV
        ++ + P    E +AG    W P ++       VA  R  ++P G ++P YS+  ++ YV+QGE G+ G+  P               G+SG      +++
Subjt:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIP---------------GKSG------EEV

Query:  VKLKKGDLIPVPNGATSWWFNDGDADFEIVYLGETKTA-----------HVAGDISYFI------LSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQS
         + ++GD+I +P G   W +N+G++    V L +   +           H+AG+            S   +L  GF  E + + + ++E+   QL +  +
Subjt:  VKLKKGDLIPVPNGATSWWFNDGDADFEIVYLGETKTA-----------HVAGDISYFI------LSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQS

Query:  NGLIFEIK----------KSQSLPKPQKQSKLVYE----------ISAAVATVTEAE-----------FPFIGKTG---------LSFVLEKLDGGAVRS
         G I ++K          +SQS    + + +   E          I   + T+   E            P +G+           L ++   ++ G +  
Subjt:  NGLIFEIK----------KSQSLPKPQKQSKLVYE----------ISAAVATVTEAE-----------FPFIGKTG---------LSFVLEKLDGGAVRS

Query:  PVFVAE----PADQVIYVVKGSGKIQIV-GFSGKV-DADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLNALSPEVLQVSFNV
           V       +  +IY  KG G++Q+V  F  +V D +V  GQ+++VP+ F   K A E+  E IS  T    +   LAG+TSVL  +  EVL  +F +
Subjt:  PVFVAE----PADQVIYVVKGSGKIQIV-GFSGKV-DADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLNALSPEVLQVSFNV

Query:  TAE
        + E
Subjt:  TAE

Arabidopsis top hitse value%identityAlignment
AT1G03890.1 RmlC-like cupins superfamily protein1.6e-1824.7Show/hide
Query:  SDCSKVGYVLQGENGVAGLVIPGKSGEE----VVKLKKGDLIPVPNGATSWWFNDGDADFEIV-YLGETKTAHVAGDI-SYFILSG--------PL----
        S C +    ++G +G  G   PG+  E+    +   ++GD+     G + WW+N GD+D  IV  L  T   +    +   F L+G        PL    
Subjt:  SDCSKVGYVLQGENGVAGLVIPGKSGEE----VVKLKKGDLIPVPNGATSWWFNDGDADFEIV-YLGETKTAHVAGDI-SYFILSG--------PL----

Query:  --SLLQGFSPEFVGKTYSLNEQQTTQLLNNQSN------------------------GLIFEIKKS-------QSLPKPQKQSKLVYEISAAVATVTEAE
          +   GF P  + + + +N +   QL N + N                        G+   I+++       +++  P++        +  ++T+    
Subjt:  --SLLQGFSPEFVGKTYSLNEQQTTQLLNNQSN------------------------GLIFEIKKS-------QSLPKPQKQSKLVYEISAAVATVTEAE

Query:  FPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAG
         P +    L+ +   L  G +  P + A  A  V+YV  G  KIQ+V  +G+   +  V  GQ+I++P+ F   K AGE G E IS  T  N  +  L+G
Subjt:  FPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAG

Query:  KTSVLNALSPEVLQVSFNVTAEFEKLLR
        +TS L A+  +V++ S+ V  E  K ++
Subjt:  KTSVLNALSPEVLQVSFNVTAEFEKLLR

AT1G07750.1 RmlC-like cupins superfamily protein5.3e-6737.65Show/hide
Query:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN
        + P  PK  + GD G Y  W P + P+L + N+ A +L L   GF VP YSD SKV YVLQG +G AG+V+P K  E+V+ +K+GD I +P G  +WWFN
Subjt:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN

Query:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEIS-----------
        + D +  I++LGET   H AG  + F L+G   +  GFS EFVG+ + L+E    +L+ +Q+   I ++     +P+P+++++  + ++           
Subjt:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEIS-----------

Query:  --AAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISII
            V  +     P +G+ G    L ++D  ++ SP F  + A QV Y+V GSG++Q+VG  GK  ++  ++ G + +VP++FV  K+A  DG+   SI+
Subjt:  --AAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISII

Query:  TASNPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        T  +P+   LAG TSV  +LSPEVLQ +F V  E EK  R
Subjt:  TASNPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

AT2G28680.1 RmlC-like cupins superfamily protein4.1e-6738.82Show/hide
Query:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN
        + P  PK  + GD G Y  W P + P+L   N+ A +L L   G  +P YSD  KV YVLQG  G AG+V+P K  E+V+ +KKGD I +P G  +WWFN
Subjt:  MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFN

Query:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEIS-----------
        + D +  +++LGET   H AG  + F L+G   +  GFS EFVG+ + L+E    +L+ +Q+   I ++  S  +P+P+K  +  + ++           
Subjt:  DGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEIS-----------

Query:  --AAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISII
            V  +     P +G+ G    L ++DG ++ SP F  + A QV Y+V GSG++QIVG  GK  ++  V+ G + +VP++FV  K+A  DGL   SI+
Subjt:  --AAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISII

Query:  TASNPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR
        T  +P+   LAG+TSV  ALSPEVLQ +F V  E EK  R
Subjt:  TASNPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR

AT4G28520.1 cruciferin 31.1e-1123.1Show/hide
Query:  QGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFNDGDADFEIVYL-----------GETKTAHVAGDISYFILSG------PLSLLQGFSPEFV
        QG+ G  G        ++V  +++GD+     G+  W +N G+    I+ L              +  H+AG+       G        +L  GF  + +
Subjt:  QGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFNDGDADFEIVYL-----------GETKTAHVAGDISYFILSG------PLSLLQGFSPEFV

Query:  GKTYSLNEQQTTQLLNNQ-SNGLIFEIKKSQSLPKPQKQSKL------------------------------------VYEIS-AAVATVTEAEFPFIGK
         +   ++ Q   QL N Q S G I  +K    + +P  +                                       VY+ S   V +V     P +  
Subjt:  GKTYSLNEQQTTQLLNNQ-SNGLIFEIKKSQSLPKPQKQSKL------------------------------------VYEIS-AAVATVTEAEFPFIGK

Query:  TGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLN
          LS     L G A+  P +    A++++Y   G G+IQ+V  +G+  +D  V+ GQ++++P+ F     +  +  E IS  T  N ++  LAG+TS+L 
Subjt:  TGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLN

Query:  ALSPEVLQVSFNVTAE
        AL  EV+   F ++ E
Subjt:  ALSPEVLQVSFNVTAE

AT4G28520.3 cruciferin 32.1e-1024.81Show/hide
Query:  QGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFNDGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNN
        QG+ G  G        ++V  +++GD+     G+  W +N G+    I+ L +        D +  +     +  QG      G   S  +Q+   L + 
Subjt:  QGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFNDGDADFEIVYLGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNN

Query:  QSNGLIFEIKKSQSLPKPQKQSKLVYEIS-AAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVE
            +I +  K             VY+ S   V +V     P +    LS     L G A+  P +    A++++Y   G G+IQ+V  +G+  +D  V+
Subjt:  QSNGLIFEIKKSQSLPKPQKQSKLVYEIS-AAVATVTEAEFPFIGKTGLSFVLEKLDGGAVRSPVFVAEPADQVIYVVKGSGKIQIVGFSGK--VDADVE

Query:  MGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLNALSPEVLQVSFNVTAE
         GQ++++P+ F     +  +  E IS  T  N ++  LAG+TS+L AL  EV+   F ++ E
Subjt:  MGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLNALSPEVLQVSFNVTAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCAATGAATCCCAAGCCCTTCTTTGAGGGAGATGCTGGATTTTATCACAAATGGCTGCCTTCTGACTATCCCTTGCTCGCCCGGAACAATGTCGCCGCTGGCCG
CCTTCTCCTCCGCCCCCGTGGCTTCGTCGTTCCTCACTATTCTGATTGCTCTAAAGTTGGCTATGTTCTTCAAGGGGAAAACGGAGTCGCCGGATTAGTGATTCCGGGCA
AGTCCGGCGAAGAGGTAGTGAAATTAAAGAAAGGGGATTTAATTCCGGTGCCAAACGGGGCCACATCGTGGTGGTTCAACGACGGCGACGCCGATTTTGAGATCGTCTAT
TTGGGCGAAACCAAAACTGCCCACGTCGCCGGCGATATTTCTTACTTCATCCTCTCCGGCCCCCTCAGCCTCCTCCAAGGATTCTCGCCGGAGTTCGTCGGAAAAACTTA
TTCCCTAAACGAACAACAAACAACCCAACTTCTCAATAACCAATCCAACGGTTTAATCTTCGAAATTAAAAAATCCCAATCCCTCCCCAAGCCCCAAAAACAGAGCAAAT
TAGTTTATGAAATTAGCGCCGCCGTGGCGACGGTGACGGAGGCTGAGTTTCCGTTTATTGGGAAAACTGGGCTGAGTTTTGTTCTGGAAAAGCTTGATGGCGGCGCCGTC
CGGTCGCCGGTGTTCGTGGCCGAGCCGGCGGATCAAGTGATTTATGTGGTGAAGGGAAGTGGTAAAATTCAGATTGTTGGGTTTTCGGGTAAGGTTGATGCGGATGTGGA
AATGGGTCAGGTGATTTTGGTTCCTAAATATTTTGTCGCCGGAAAAGTCGCCGGAGAAGATGGCTTGGAGTGCATCTCCATTATCACAGCTTCAAATCCTGTAGTGGAAG
AACTGGCCGGAAAGACGTCGGTTTTGAATGCATTGTCGCCGGAAGTTTTACAAGTTTCGTTCAACGTCACGGCGGAGTTCGAGAAACTTCTTAGATGA
mRNA sequenceShow/hide mRNA sequence
TTTTTGCAATTCTCTTAAAAGATTCATTACAAATTTGAATGACAACAAAAATTATAAAATCATGTAATCATCATGTAATCATTGTTTCTATTGTTTTTTTTTTCTTTTTC
AAAAAGGCCACATGATTGATAAAATTTGCCTTAAATACCTCAAGTTGTGAACCACACAACCATACCAAAGAAGAGCCCAAGAACTTTGTACCATAATTTGTTCAACCATT
TGAATGGAGCCAATGAATCCCAAGCCCTTCTTTGAGGGAGATGCTGGATTTTATCACAAATGGCTGCCTTCTGACTATCCCTTGCTCGCCCGGAACAATGTCGCCGCTGG
CCGCCTTCTCCTCCGCCCCCGTGGCTTCGTCGTTCCTCACTATTCTGATTGCTCTAAAGTTGGCTATGTTCTTCAAGGGGAAAACGGAGTCGCCGGATTAGTGATTCCGG
GCAAGTCCGGCGAAGAGGTAGTGAAATTAAAGAAAGGGGATTTAATTCCGGTGCCAAACGGGGCCACATCGTGGTGGTTCAACGACGGCGACGCCGATTTTGAGATCGTC
TATTTGGGCGAAACCAAAACTGCCCACGTCGCCGGCGATATTTCTTACTTCATCCTCTCCGGCCCCCTCAGCCTCCTCCAAGGATTCTCGCCGGAGTTCGTCGGAAAAAC
TTATTCCCTAAACGAACAACAAACAACCCAACTTCTCAATAACCAATCCAACGGTTTAATCTTCGAAATTAAAAAATCCCAATCCCTCCCCAAGCCCCAAAAACAGAGCA
AATTAGTTTATGAAATTAGCGCCGCCGTGGCGACGGTGACGGAGGCTGAGTTTCCGTTTATTGGGAAAACTGGGCTGAGTTTTGTTCTGGAAAAGCTTGATGGCGGCGCC
GTCCGGTCGCCGGTGTTCGTGGCCGAGCCGGCGGATCAAGTGATTTATGTGGTGAAGGGAAGTGGTAAAATTCAGATTGTTGGGTTTTCGGGTAAGGTTGATGCGGATGT
GGAAATGGGTCAGGTGATTTTGGTTCCTAAATATTTTGTCGCCGGAAAAGTCGCCGGAGAAGATGGCTTGGAGTGCATCTCCATTATCACAGCTTCAAATCCTGTAGTGG
AAGAACTGGCCGGAAAGACGTCGGTTTTGAATGCATTGTCGCCGGAAGTTTTACAAGTTTCGTTCAACGTCACGGCGGAGTTCGAGAAACTTCTTAGATGAAGATCACAA
TTTATAATTATTTTTATTTCATGTTTTGGGTATTTTTGGTATTTTAATTATGCAACATAATCCTTAGTTCTATGCCAATAATAAAAAAAAATTAATTAATTATAAGGATT
GTGTTTTCTAGTTTGAAGCTTTGATGGGTACTTTTTAGTCATTTTGTAACTTGTTTGTCCTCTTTTCCTTTTGAATGAAAAAGATGTGTGTGTGTTTTTTTTCTTTTATG
ATGGTTTGTTGTTATTATTATTTAGGGAAAGATGCTGGTTGAATCTGTATTTAAACTATTTGGTAATGAATAATTTAAGTATGTTGTCATTTGTTGATGAGTTGTATTAT
ATAATATATACATTATTG
Protein sequenceShow/hide protein sequence
MEPMNPKPFFEGDAGFYHKWLPSDYPLLARNNVAAGRLLLRPRGFVVPHYSDCSKVGYVLQGENGVAGLVIPGKSGEEVVKLKKGDLIPVPNGATSWWFNDGDADFEIVY
LGETKTAHVAGDISYFILSGPLSLLQGFSPEFVGKTYSLNEQQTTQLLNNQSNGLIFEIKKSQSLPKPQKQSKLVYEISAAVATVTEAEFPFIGKTGLSFVLEKLDGGAV
RSPVFVAEPADQVIYVVKGSGKIQIVGFSGKVDADVEMGQVILVPKYFVAGKVAGEDGLECISIITASNPVVEELAGKTSVLNALSPEVLQVSFNVTAEFEKLLR