; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C06G113120 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C06G113120
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Descriptionglutelin type-D 1-like
Genome locationCla97Chr06:4148901..4156846
RNA-Seq ExpressionCla97C06G113120
SyntenyCla97C06G113120
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006044 - 11-S seed storage protein, plant
IPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold
IPR021641 - Protein of unknown function DUF3245


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592225.1 Glutelin type-D 1, partial [Cucurbita argyrosperma subsp. sororia]5.5e-17691.28Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIYGSDGGSY+SWSPKELPMLREGNIGA+KLA+EKNGFALP YSDSAKVAYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLK+G KMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIV+GSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSII+TPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS
        LAGSIGVWKSLS EVIQA+FNVDA+LVK FSSKR SDAIFFPPS
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS

XP_004150394.1 glutelin type-D 1 [Cucumis sativus]2.9e-17791.59Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIYGSDGGSY++WSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKVAYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSIISTPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
        LAGSIGVWK+LS EVI+A+FNV+A+LVK FSSKR+SDAIFFPPSN
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN

XP_008461502.1 PREDICTED: glutelin type-B 5-like [Cucumis melo]6.5e-17791.59Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIYG DGGSY+SWSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKVAYVLQG+G+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSIISTPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
        LAGSIGVWK+LS EVIQA+FNV+A+LVK FSSKR+SDAIFFPPSN
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN

XP_022976927.1 glutelin type-D 1-like [Cucurbita maxima]3.6e-17590.99Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIYG DGGSY+SWSPKELPMLREGNIGA+KLA+EKNGFALP YSDSAKVAYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVK+QTGT IVKLK+G KMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIV+GSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSIISTPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS
        LAGSIGVWKSLS EVIQA+FNVDA+LVK FSSKR SDAIFFPPS
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS

XP_023535755.1 glutelin type-D 1-like [Cucurbita pepo subsp. pepo]2.5e-17691.57Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIYGSDGGSY+SWSPKELPMLREGNIGA+KLA+EKNGFALP YSDSAKVAYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLK+G KMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIV+GSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSIISTPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS
        LAGSIGVWKSLS EVIQA+FNVDA+LVK FSSKR SDAIFFPPS
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS

TrEMBL top hitse value%identityAlignment
A0A0A0K666 Uncharacterized protein1.4e-17791.59Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIYGSDGGSY++WSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKVAYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSIISTPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
        LAGSIGVWK+LS EVI+A+FNV+A+LVK FSSKR+SDAIFFPPSN
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN

A0A1S3CG59 glutelin type-B 5-like3.1e-17791.59Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIYG DGGSY+SWSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKVAYVLQG+G+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSIISTPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
        LAGSIGVWK+LS EVIQA+FNV+A+LVK FSSKR+SDAIFFPPSN
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN

A0A5A7UAB0 Glutelin type-B 5-like3.1e-17791.59Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIYG DGGSY+SWSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKVAYVLQG+G+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSIISTPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
        LAGSIGVWK+LS EVIQA+FNV+A+LVK FSSKR+SDAIFFPPSN
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN

A0A6J1EX25 glutelin type-D 1-like2.9e-17590.99Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIY SDGGSY+SWSPKELPMLREGNIGA+KLA+EKNGFALP YSDSAKVAYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLK+G KMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIV+GSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSII+TPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS
        LAGSIGVWKSLS EVIQA+FNVDA+LVK FSSKR SDAIFFPPS
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS

A0A6J1IH21 glutelin type-D 1-like1.7e-17590.99Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        KIYG DGGSY+SWSPKELPMLREGNIGA+KLA+EKNGFALP YSDSAKVAYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG TSK+HKSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVK+QTGT IVKLK+G KMPEPK EHRNGM LNCEEA LDVDVKNGGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIV+GSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFSIISTPNPVFTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS
        LAGSIGVWKSLS EVIQA+FNVDA+LVK FSSKR SDAIFFPPS
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS

SwissProt top hitse value%identityAlignment
P07728 Glutelin type-A 14.1e-2524.07Show/hide
Query:  IGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAG-------------------IILSESE----------EKVFGIKKGDAIALPFGIVTWWFNKEATD
        +   +  IE  G  LP Y++ A + Y++QG GI G                     L+ES+          +K+   ++GD IALP G+  W +N     
Subjt:  IGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAG-------------------IILSESE----------EKVFGIKKGDAIALPFGIVTWWFNKEATD

Query:  LVVLFLG--STSKSHKSGEFTDFFLTG---------------ANGIFTGFSMEFVKRALDMDEVSVKSL-VKNQTGTRIVKLKEGTKMPEP---KMEHRN
        +V +++   +   +       DF L G               +  IF+GFS E +  AL +     + L  +N     IV+++ G  + +P     E   
Subjt:  LVVLFLG--STSKSHKSGEFTDFFLTG---------------ANGIFTGFSMEFVKRALDMDEVSVKSL-VKNQTGTRIVKLKEGTKMPEP---KMEHRN

Query:  GMVLN-----------------CEEAL----------LDVDVKN--------GGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYI
        G V +                 C   L           ++D  N         GRV  LNT+N P++  V + A  V L  +A+ SP ++  +A  V YI
Subjt:  GMVLN-----------------CEEAL----------LDVDVKN--------GGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYI

Query:  VKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIF
         +G  RV+VV  +GK V    ++ G L I+P+ + V K    EG  + +  + PN + +H+AG   ++++L ++V+  ++ +     ++    R  +   
Subjt:  VKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIF

Query:  FPP
        F P
Subjt:  FPP

P07730 Glutelin type-A 24.6e-2422.58Show/hide
Query:  IGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAG-------------------IILSESE----------EKVFGIKKGDAIALPFGIVTWWFNKEATD
        +   +  IE  G  LP Y++ A + Y++QG GI G                     L+ES+          +K+   ++GD IALP G+  W +N     
Subjt:  IGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAG-------------------IILSESE----------EKVFGIKKGDAIALPFGIVTWWFNKEATD

Query:  LVVLFLG--STSKSHKSGEFTDFFLTG---------------ANGIFTGFSMEFVKRALDM-DEVSVKSLVKNQTGTRIVKLKEGTKMPEP---------
        +V +++   +   +       DF L G               +  IF+GFS E +  A  + ++V+ +   +N     IV+++ G  + +P         
Subjt:  LVVLFLG--STSKSHKSGEFTDFFLTG---------------ANGIFTGFSMEFVKRALDM-DEVSVKSLVKNQTGTRIVKLKEGTKMPEP---------

Query:  ---------------KMEHRNGMVLNCEEALLDVDVK--------------NGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYI
                       + ++ +G     +E    + V+                GRV  LN++N P++  V + A  V L  +A+ SP ++  +A  + YI
Subjt:  ---------------KMEHRNGMVLNCEEALLDVDVK--------------NGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYI

Query:  VKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIF
         +G  +V+VV  +GK V    ++ G L IVP+ +VV K    EG  + +  + PN + +H+AG   ++++L ++V+  ++ +     ++    R  +   
Subjt:  VKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIF

Query:  FPP
        F P
Subjt:  FPP

Q09151 Glutelin type-A 34.1e-2523.12Show/hide
Query:  IEKNGFALPCYSDSAKVAYVLQGNGIAGII-----------------------------LSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLG
        IE  G  LP YS+ A + YV+QG GI G                                 +  +K+   ++GD +ALP G+  W +N     +V +++ 
Subjt:  IEKNGFALPCYSDSAKVAYVLQGNGIAGII-----------------------------LSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLG

Query:  S--TSKSHKSGEFTDFFLTGAN----------------GIFTGFSMEFVKRALDMDEVSVKSL-VKNQTGTRIVKLKEGTKMPEP----------KMEHR
            S +       DFFL G N                 +F GFS+E +  AL +     + L  +N     IV+++ G  + +P          +++ R
Subjt:  S--TSKSHKSGEFTDFFLTGAN----------------GIFTGFSMEFVKRALDMDEVSVKSL-VKNQTGTRIVKLKEGTKMPEP----------KMEHR

Query:  NGMVLNCEEALLDVDVKNG---------------------------GRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRV
        +      ++  L     NG                           GR+  LN +  P++  V + A  V L  +A+ SP ++  +A  V YI +G  RV
Subjt:  NGMVLNCEEALLDVDVKNG---------------------------GRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRV

Query:  EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
        +VV  +GK V +  ++ G L I+P+  VV K    EG  + ++ + P+ + +H+AG   ++++L  +V+  ++ +     ++    R  +   F PS+
Subjt:  EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN

Q6K508 Glutelin type-D 15.4e-2523.7Show/hide
Query:  IEKNGFALPCYSDSAKVAYVLQGNGIAGII-------------------------LSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFL-----
        IE  G  +P YS++  +AY++QG G  G+                            +  +K+   ++GD +ALP  +  W++N   T  VV+++     
Subjt:  IEKNGFALPCYSDSAKVAYVLQGNGIAGII-------------------------LSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFL-----

Query:  ---------------GSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMD-EVSVKSLVKNQTGTRIVKLKEGTKMPEPKM-----EHR-------
                       G+  +  +  E + F  +G N IF+GF+ E +  AL ++ E S +   +N     I+++K G ++ +P +     EHR       
Subjt:  ---------------GSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMD-EVSVKSLVKNQTGTRIVKLKEGTKMPEPKM-----EHR-------

Query:  -----NGMVLNCEEALLDVDVKN----------GGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLE
             NG+  N       V+++N           GR+ +LN +  P++  +G+GA  V L  +A+ SP ++  +A  V YI++GS RV+V    G+ V  
Subjt:  -----NGMVLNCEEALLDVDVKN----------GGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLE

Query:  TRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPP
          +  G L I+P+   V K  +  G ++ +I +  +P  + +AG   + ++L  +VI  ++ +  +  ++  + R  +   F P
Subjt:  TRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPP

Q9XHP0 11S globulin seed storage protein 21.2e-2422.6Show/hide
Query:  SDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIIL---------------------SESE---------EKVFGIK
        S+GG+   W  ++    +   I A +  I  NG +LP Y  S ++ Y+ +G G+  I++                     SE +         +KV  ++
Subjt:  SDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIIL---------------------SESE---------EKVFGIK

Query:  KGDAIALPFGIVTWWFNKEATDLVVLFLGSTS--KSHKSGEFTDFFLTGA---------------NGIFTGFSMEFVKRALDMDEVSVKSL-VKNQTGTR
        +GD +A+P G   W +N  + DLV + +   +   +    +F  F+L G                + IF  F  E +  A ++ + +++ +  + +    
Subjt:  KGDAIALPFGIVTWWFNKEATDLVVLFLGSTS--KSHKSGEFTDFFLTGA---------------NGIFTGFSMEFVKRALDMDEVSVKSL-VKNQTGTR

Query:  IVKLKEGTKMPEP-----KMEHRNGMVLN-CEEAL--------------LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQ
        IV  +E      P     + EHR   + N  EE                 D+  +  GRV V++   LP++  + L A+   L  +A+ SP +S  +   
Subjt:  IVKLKEGTKMPEP-----KMEHRNGMVLN-CEEAL--------------LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQ

Query:  VTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTS
        + Y+ +G  +V+VV  +G+ ++  RV  G +F+VP+++  +    + G EW +  +T +P+ + LAG   V +++  +VI  S+ +  N  +     R S
Subjt:  VTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTS

Query:  DAIFFPP
         +    P
Subjt:  DAIFFPP

Arabidopsis top hitse value%identityAlignment
AT1G03880.1 cruciferin 27.5e-2223.83Show/hide
Query:  AKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIIL-------------------------SESEEKVFGIK
        ++I  S+GG    W     P LR       +  IE  G  LP + ++ K+ +V+ G G+ G ++                          +  +KV  ++
Subjt:  AKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIIL-------------------------SESEEKVFGIK

Query:  KGDAIALPFGIVTWWFNKEATDLVVLFLG--STSKSHKSGEFTDFFLTG----------------ANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTR
         GD IA P G+  W++N     L+++     +++++        F + G                 N IF GF+ E + +A  ++ V     ++NQ   R
Subjt:  KGDAIALPFGIVTWWFNKEATDLVVLFLG--STSKSHKSGEFTDFFLTG----------------ANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTR

Query:  --IVKLK-------------EGTKMPEPKMEHRNGM-----VLNCEEAL-----LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSC
          IVK+              EG + P    E  NG+      + C E L      DV   + G +  LN+ NLP++  + L A    +  +AM  P ++ 
Subjt:  --IVKLK-------------EGTKMPEPKMEHRNGM-----VLNCEEAL-----LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSC

Query:  DSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVK--K
         +A    Y+  G   +++V  +G++V +  + +G L +VP+ F V K    E  EW    +  N     LAG   V + L  EVI   + +     K  K
Subjt:  DSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVK--K

Query:  FSSKRTS
        FS+  T+
Subjt:  FSSKRTS

AT1G03890.1 RmlC-like cupins superfamily protein4.7e-2422.75Show/hide
Query:  DGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILS--------------------------ESEEKVFGIKKGDAI
        + G    W     P LR   +  +++ ++ N   LP +     +AYV+QG G+ G I S                          +  +K+   ++GD  
Subjt:  DGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILS--------------------------ESEEKVFGIKKGDAI

Query:  ALPFGIVTWWFNKEATD-LVVLFLGSTSKSHKSGEFTDFF-LTGA--------------NGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTR--IVKLK
        A   G+  WW+N+  +D ++V+ L  T++ ++  +    F L G+              N  F+GF    +  A  ++  + K L +NQ   R  I++  
Subjt:  ALPFGIVTWWFNKEATD-LVVLFLGSTSKSHKSGEFTDFF-LTGA--------------NGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTR--IVKLK

Query:  EGTK--MPEPKMEHRNGMVLNCEEALLDVDV--------------KNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSG
              +P P+   ++G+    EE      +                 GR+  LN+ NLP++  V L A    L    M  P ++  +A  V Y+  G  
Subjt:  EGTK--MPEPKMEHRNGMVLNCEEALLDVDV--------------KNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSG

Query:  RVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
        +++VV  +G+ V   +V  G + ++P+ F VSK     G EW S  +  N     L+G     +++  +VI+AS+ V+    K+    +    +   PS+
Subjt:  RVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN

AT1G07750.1 RmlC-like cupins superfamily protein9.7e-14772.46Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        K+YG DGGSY +W P+ELPML++GNIGA+KLA+EKNGFA+P YSDS+KVAYVLQG+G AGI+L E EEKV  IK+GD+IALPFG+VTWWFN E  +LV+L
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG T K HK+G+FT+F+LTG NGIFTGFS EFV RA D+DE +VK LV +QTG  IVKL  G KMP+PK E+R G VLNC EA LDVD+K+GGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVG GADLVR+D  +MCSPGFSCDSALQVTYIV GSGRV+VVG DGK+VLET +KAG+LFIVPRFFVVSKI D++GM WFSI++TP+P+FTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
        LAG+  VWKSLS EV+QA+F V   + K F S RTS AIFFPPSN
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN

AT2G28680.1 RmlC-like cupins superfamily protein8.8e-14873.91Show/hide
Query:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL
        K+YG DGGSY +W P+ELPMLR+GNIGASKLA+EK G ALP YSDS KVAYVLQG G AGI+L E EEKV  IKKGD+IALPFG+VTWWFN E T+LVVL
Subjt:  KIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVL

Query:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN
        FLG T K HK+G+FTDF+LTG+NGIFTGFS EFV RA D+DE +VK LV +QTG  IVK+    KMPEPK   R G VLNC EA LDVD+K+GGRVVVLN
Subjt:  FLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLN

Query:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH
        TKNLPLVGEVG GADLVR+DG +MCSPGFSCDSALQVTYIV GSGRV++VG DGK+VLET VKAG LFIVPRFFVVSKI DS+G+ WFSI++TP+P+FTH
Subjt:  TKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH

Query:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
        LAG   VWK+LS EV+QA+F VD  + K F SKRTSDAIFF PSN
Subjt:  LAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN

AT3G61370.1 Protein of unknown function (DUF3245)2.7e-1142.24Show/hide
Query:  ERPQKSRPPEIIKLNDAAKLAAQLMSKMTNSEWKETAVDNESTAIEPEIRPPRLGLGAKATRHFRNGPSNDPLDRKLYAKLDAGKRQAARKMSEDTPELN
        E  +K   P+I K + A KLA + ++ MT    + T  D   TA   E RP RLGLGA+ +R  +  PS+DPLD+KL AK  A KR+ AR ++E      
Subjt:  ERPQKSRPPEIIKLNDAAKLAAQLMSKMTNSEWKETAVDNESTAIEPEIRPPRLGLGAKATRHFRNGPSNDPLDRKLYAKLDAGKRQAARKMSEDTPELN

Query:  NKDDGDDDDDEDESRT
        N  D  +DDDE ES++
Subjt:  NKDDGDDDDDEDESRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGCAACATGGAGAGGCCGCAGAAATCTAGACCTCCCGAAATTATTAAGTTGAATGATGCGGCGAAATTGGCAGCGCAATTGATGAGCAAGATGACTAATTCCGA
GTGGAAAGAAACTGCAGTTGATAATGAATCAACTGCAATTGAACCCGAGATTCGACCTCCTAGACTTGGACTTGGAGCCAAAGCAACACGCCATTTCAGAAATGGACCAT
CTAATGACCCACTTGATAGAAAACTATATGCCAAATTAGATGCTGGCAAAAGACAAGCTGCTAGAAAGATGTCGGAGGATACTCCCGAACTTAACAATAAAGATGATGGT
GATGATGATGATGATGAAGACGAGAGTAGAACTGCCAAGATTTATGGTAGCGATGGAGGTTCCTATCATTCATGGTCTCCCAAGGAGCTTCCCATGCTCCGTGAAGGAAA
CATTGGCGCTTCCAAGCTTGCTATTGAGAAGAATGGCTTCGCTCTCCCTTGTTATTCTGATTCTGCCAAGGTTGCTTATGTTCTTCAAGGCAATGGAATAGCTGGAATCA
TTTTGTCGGAATCAGAGGAGAAGGTTTTTGGGATAAAAAAAGGAGATGCAATTGCTCTTCCATTCGGCATAGTGACGTGGTGGTTCAATAAAGAAGCTACTGATTTGGTA
GTTCTATTCTTGGGCAGCACATCAAAGTCTCACAAATCAGGTGAGTTCACTGACTTCTTCCTCACTGGCGCCAACGGAATCTTTACTGGATTCTCAATGGAGTTTGTCAA
GCGAGCCTTGGATATGGATGAGGTATCGGTGAAATCTTTAGTGAAAAATCAAACTGGAACCAGAATCGTGAAGTTGAAGGAAGGAACGAAGATGCCAGAGCCAAAAATGG
AGCACCGAAACGGAATGGTGCTGAATTGTGAGGAGGCACTGCTAGATGTGGACGTGAAGAACGGAGGACGAGTTGTGGTTCTAAATACGAAGAATCTACCCCTGGTAGGG
GAGGTAGGATTGGGAGCAGATTTGGTTCGATTGGATGGAAGTGCGATGTGCTCGCCTGGATTCTCGTGTGATTCGGCGTTGCAGGTGACATATATCGTGAAAGGAAGCGG
GAGAGTCGAGGTTGTAGGAGTAGACGGGAAGAAGGTTCTGGAGACAAGAGTGAAAGCTGGAAATTTGTTCATAGTACCAAGGTTTTTTGTGGTATCAAAGATCGGAGATT
CTGAAGGAATGGAGTGGTTCTCCATCATTAGTACTCCCAATCCCGTTTTCACTCACTTGGCCGGCAGTATCGGGGTCTGGAAGTCTCTTTCATCGGAAGTTATTCAAGCA
TCCTTTAATGTAGATGCTAATTTGGTGAAAAAATTTTCTTCCAAGAGAACTTCTGATGCCATCTTCTTCCCTCCTTCTAATTAG
mRNA sequenceShow/hide mRNA sequence
GAATAGGCATCTCAAAACCAATCATCTGTACAAATATTCCTTATTTCTCCATCTCCTCACACACTCGCCTGATACCTAATCCTCCGGCGGCGGCAGCTCACCAGCCCCAC
TCCTCTCCGGCAAGTACGTACTTCCCTTGTCGAACCAAATCCTGGGACCGCGATAGACCCGCCGGGAAGGCAGTTGACACCCACCAACACGCCTCAGAAACCCTCTTGCT
CCCGTCGCAACATACCAGCGGTCAAATTTATCAAGTCCTTAGGGACTACTACATCCATTCAGTTTCGGCTCAATTAGTTCTGAAGACCCGTATTTTTGTTTATAAATTTC
GGATCATATCGCAGTCTTGAAGTTGCAGTAGCAAGCGCAATGAGCAGCAACATGGAGAGGCCGCAGAAATCTAGACCTCCCGAAATTATTAAGTTGAATGATGCGGCGAA
ATTGGCAGCGCAATTGATGAGCAAGATGACTAATTCCGAGTGGAAAGAAACTGCAGTTGATAATGAATCAACTGCAATTGAACCCGAGATTCGACCTCCTAGACTTGGAC
TTGGAGCCAAAGCAACACGCCATTTCAGAAATGGACCATCTAATGACCCACTTGATAGAAAACTATATGCCAAATTAGATGCTGGCAAAAGACAAGCTGCTAGAAAGATG
TCGGAGGATACTCCCGAACTTAACAATAAAGATGATGGTGATGATGATGATGATGAAGACGAGAGTAGAACTGCCAAGATTTATGGTAGCGATGGAGGTTCCTATCATTC
ATGGTCTCCCAAGGAGCTTCCCATGCTCCGTGAAGGAAACATTGGCGCTTCCAAGCTTGCTATTGAGAAGAATGGCTTCGCTCTCCCTTGTTATTCTGATTCTGCCAAGG
TTGCTTATGTTCTTCAAGGCAATGGAATAGCTGGAATCATTTTGTCGGAATCAGAGGAGAAGGTTTTTGGGATAAAAAAAGGAGATGCAATTGCTCTTCCATTCGGCATA
GTGACGTGGTGGTTCAATAAAGAAGCTACTGATTTGGTAGTTCTATTCTTGGGCAGCACATCAAAGTCTCACAAATCAGGTGAGTTCACTGACTTCTTCCTCACTGGCGC
CAACGGAATCTTTACTGGATTCTCAATGGAGTTTGTCAAGCGAGCCTTGGATATGGATGAGGTATCGGTGAAATCTTTAGTGAAAAATCAAACTGGAACCAGAATCGTGA
AGTTGAAGGAAGGAACGAAGATGCCAGAGCCAAAAATGGAGCACCGAAACGGAATGGTGCTGAATTGTGAGGAGGCACTGCTAGATGTGGACGTGAAGAACGGAGGACGA
GTTGTGGTTCTAAATACGAAGAATCTACCCCTGGTAGGGGAGGTAGGATTGGGAGCAGATTTGGTTCGATTGGATGGAAGTGCGATGTGCTCGCCTGGATTCTCGTGTGA
TTCGGCGTTGCAGGTGACATATATCGTGAAAGGAAGCGGGAGAGTCGAGGTTGTAGGAGTAGACGGGAAGAAGGTTCTGGAGACAAGAGTGAAAGCTGGAAATTTGTTCA
TAGTACCAAGGTTTTTTGTGGTATCAAAGATCGGAGATTCTGAAGGAATGGAGTGGTTCTCCATCATTAGTACTCCCAATCCCGTTTTCACTCACTTGGCCGGCAGTATC
GGGGTCTGGAAGTCTCTTTCATCGGAAGTTATTCAAGCATCCTTTAATGTAGATGCTAATTTGGTGAAAAAATTTTCTTCCAAGAGAACTTCTGATGCCATCTTCTTCCC
TCCTTCTAATTAG
Protein sequenceShow/hide protein sequence
MSSNMERPQKSRPPEIIKLNDAAKLAAQLMSKMTNSEWKETAVDNESTAIEPEIRPPRLGLGAKATRHFRNGPSNDPLDRKLYAKLDAGKRQAARKMSEDTPELNNKDDG
DDDDDEDESRTAKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLV
VLFLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVG
EVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQA
SFNVDANLVKKFSSKRTSDAIFFPPSN