; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0435 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0435
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionglutelin type-A 2-like
Genome locationMC10:3501118..3505852
RNA-Seq ExpressionMC10g0435
SyntenyMC10g0435
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039049.1 glutelin type-B 5 [Cucumis melo var. makuwa]2.83e-16867.79Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        ME DLK M+   F   +GG+++ W  S   ++ +  V AGRL+L PRGFA+PH +DSSKVGYV QG+G+ G++FP KS E V++L+KGD+IPVPEGVTSW
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA
        WFN  + DSDFE+LL+GD+ +ALIPGD+TYVVFAG LG+LQ F   Y+   Y L EEE   LL+SQ NGLIFKL+ DQTLPEPD  S L FNIYDA PD+
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA

Query:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF
         ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PVYVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Subjt:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF

Query:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
        +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS+
Subjt:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

XP_008456077.1 PREDICTED: glutelin type-B 5 [Cucumis melo]8.53e-17068.07Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        ME DLK M+   F   +GG+++ W  S  P++ +  V AGRL+L PRGFA+PH +DSSKVGYV QG+G+ G++FP KS E V++L+KGD+IPVPEGVTSW
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA
        WFN  + DSDFE+LL+GD+ +ALIPGD+TYVVFAG LG+LQ F   Y+   Y L EEE   LL+SQ NGLIFKL+ DQTLPEPD  S L FNIYDA PD+
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA

Query:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF
         ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PVYVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Subjt:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF

Query:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
        +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS+
Subjt:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

XP_011651276.2 legumin J [Cucumis sativus]2.83e-16866.67Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        ME +LK M+   F   +GG+++ W  S FP++S+  V AGRL+L PRGFA+PH +DSSKVGYV QG+G+ G++FP KS E  ++L+KGD+IPVPEGVTSW
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA
        WFN  + DSDFE+LL+GD+ +ALIPGD+TYVVFAG LG+LQ F   Y+   Y L E+E   LL+SQ NGLIFKL+ DQTLPEPD  S L FNIY   PDA
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA

Query:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF
         ++ GGSVT +TE+KFPFIGKSGLTAVLEKLEANAVR+PVYVADPSVQLIYV+ GSGR+++     +  +D+EVKAGQLVLVPKYFAVGK+AG+EG+ECF
Subjt:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF

Query:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
        +IITTTHPL+EELGGK SIFG+ S QVF+ SFN+TA FEKL RSKITK+SPLVPPS+
Subjt:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

XP_022985328.1 12S seed storage protein CRD-like [Cucurbita maxima]1.15e-14760.29Show/hide
Query:  MNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQG-NGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSWWFNPEN
        MN + F+E + G+Y+ W  S++P+L+   VAAGRL+LRPRGF +PHYAD SKVGYV QG NG+ GL+FP KS E V+ L+KGDLIPVP GV+SWWFN  +
Subjt:  MNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQG-NGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSWWFNPEN

Query:  DDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDE-SSGLFNIYDAVPDARLEAG-
         DSD EI+ +G+S +A +PGD++Y V +G L +L  F P+YV  +YSLN EE+   L+SQSN LIF ++  Q+LP+P + S  ++NI  A PD R++ G 
Subjt:  DDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDE-SSGLFNIYDAVPDARLEAG-

Query:  GSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGKMDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTH
        G+VT VTE KFPFIG+SGLTA+LEKL+ANAVR+PVYVA+P  QLIYV++G G+I++ GF  K+D+EVK GQL+LVPK+FAVGK+AG++G+EC SIIT TH
Subjt:  GSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGKMDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTH

Query:  PLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
        P++EEL GK S+  +LS +VFQVSFNVTAEFEKLLRSKIT ASP++  S+
Subjt:  PLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

XP_038879635.1 legumin type B-like [Benincasa hispida]7.09e-17169.38Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        M+ +LK M+   F + +GG+++ W  S FP++++  V AGRL+L PRGFA+PH +DSSKVGYV QG+G+ G+LFP KS E V++L+KGDLIPVPEGVTSW
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA
        WFN  + DSDFE+LL+GD+ +ALIPGD+TYVVFAG LGILQ F   Y+   Y LNEEE   LL+SQ+NGLIFKL+ DQTLPEP+  S L FNIY A+PDA
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA

Query:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIR-VGGFLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF
         ++ GGSVT VT++KFPFIGKSGLTAVLEKLEANAVR+PVYVADPSVQLIY++ GSGR++ V  FL K +D+EVKAGQLVLVPKYFAVGKVAG+EG+ECF
Subjt:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIR-VGGFLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF

Query:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPS
        +IITTTHPL+EELGG  SIFG+ S QVFQ SFNVTA FEKLLRSKITK S LVPPS
Subjt:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPS

TrEMBL top hitse value%identityAlignment
A0A0A0LC21 Uncharacterized protein1.37e-16866.67Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        ME +LK M+   F   +GG+++ W  S FP++S+  V AGRL+L PRGFA+PH +DSSKVGYV QG+G+ G++FP KS E  ++L+KGD+IPVPEGVTSW
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA
        WFN  + DSDFE+LL+GD+ +ALIPGD+TYVVFAG LG+LQ F   Y+   Y L E+E   LL+SQ NGLIFKL+ DQTLPEPD  S L FNIY   PDA
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA

Query:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF
         ++ GGSVT +TE+KFPFIGKSGLTAVLEKLEANAVR+PVYVADPSVQLIYV+ GSGR+++     +  +D+EVKAGQLVLVPKYFAVGK+AG+EG+ECF
Subjt:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF

Query:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
        +IITTTHPL+EELGGK SIFG+ S QVF+ SFN+TA FEKL RSKITK+SPLVPPS+
Subjt:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

A0A1S3C332 glutelin type-B 54.13e-17068.07Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        ME DLK M+   F   +GG+++ W  S  P++ +  V AGRL+L PRGFA+PH +DSSKVGYV QG+G+ G++FP KS E V++L+KGD+IPVPEGVTSW
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA
        WFN  + DSDFE+LL+GD+ +ALIPGD+TYVVFAG LG+LQ F   Y+   Y L EEE   LL+SQ NGLIFKL+ DQTLPEPD  S L FNIYDA PD+
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA

Query:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF
         ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PVYVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Subjt:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF

Query:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
        +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS+
Subjt:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

A0A5A7TCP0 Glutelin type-B 51.37e-16867.79Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        ME DLK M+   F   +GG+++ W  S   ++ +  V AGRL+L PRGFA+PH +DSSKVGYV QG+G+ G++FP KS E V++L+KGD+IPVPEGVTSW
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA
        WFN  + DSDFE+LL+GD+ +ALIPGD+TYVVFAG LG+LQ F   Y+   Y L EEE   LL+SQ NGLIFKL+ DQTLPEPD  S L FNIYDA PD+
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA

Query:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF
         ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PVYVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Subjt:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF

Query:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
        +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS+
Subjt:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

A0A5D3BKT3 Glutelin type-B 54.13e-17068.07Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        ME DLK M+   F   +GG+++ W  S  P++ +  V AGRL+L PRGFA+PH +DSSKVGYV QG+G+ G++FP KS E V++L+KGD+IPVPEGVTSW
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA
        WFN  + DSDFE+LL+GD+ +ALIPGD+TYVVFAG LG+LQ F   Y+   Y L EEE   LL+SQ NGLIFKL+ DQTLPEPD  S L FNIYDA PD+
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGL-FNIYDAVPDA

Query:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF
         ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PVYVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Subjt:  RLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG-FLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF

Query:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
        +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS+
Subjt:  SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

A0A6J1JDB2 12S seed storage protein CRD-like5.59e-14860.29Show/hide
Query:  MNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQG-NGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSWWFNPEN
        MN + F+E + G+Y+ W  S++P+L+   VAAGRL+LRPRGF +PHYAD SKVGYV QG NG+ GL+FP KS E V+ L+KGDLIPVP GV+SWWFN  +
Subjt:  MNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQG-NGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSWWFNPEN

Query:  DDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDE-SSGLFNIYDAVPDARLEAG-
         DSD EI+ +G+S +A +PGD++Y V +G L +L  F P+YV  +YSLN EE+   L+SQSN LIF ++  Q+LP+P + S  ++NI  A PD R++ G 
Subjt:  DDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDE-SSGLFNIYDAVPDARLEAG-

Query:  GSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGKMDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTH
        G+VT VTE KFPFIG+SGLTA+LEKL+ANAVR+PVYVA+P  QLIYV++G G+I++ GF  K+D+EVK GQL+LVPK+FAVGK+AG++G+EC SIIT TH
Subjt:  GSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGKMDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTH

Query:  PLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
        P++EEL GK S+  +LS +VFQVSFNVTAEFEKLLRSKIT ASP++  S+
Subjt:  PLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

SwissProt top hitse value%identityAlignment
A0A222NNM9 Cocosin 12.1e-2124.43Show/hide
Query:  SESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEF-----------------------VMKLEKGDL
        SE+    Y++  + QF     A V+  R V+ PRG  LP  +++ ++ Y+ QG GI GL+ PG    F                       V + ++GD+
Subjt:  SESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEF-----------------------VMKLEKGDL

Query:  IPVPEGVTSWWFNPENDDSDFEILLIGDSSHALIPGDVTYVVF--AGRL---------------GILQSFPPQYVAGSYSLNEEESAAL-LRSQSNGLIF
        + VP G   W +N  N ++    + + D+S+     D ++  F  AGR                 IL+ F  + +A ++ +N E +  L  R  + G I 
Subjt:  IPVPEGVTSWWFNPENDDSDFEILLIGDSSHALIPGDVTYVVF--AGRL---------------GILQSFPPQYVAGSYSLNEEESAAL-LRSQSNGLIF

Query:  K-------LRPDQTLPEPDESSGLFNIYDAV---------------PDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIY
        +       LRP     E  E     N ++                  D     GG +T +  +K P +    ++A    L  NA+ +P +  + +  ++Y
Subjt:  K-------LRPDQTLPEPDESSGLFNIYDAV---------------PDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIY

Query:  VSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITK
         + G GR+ V    G+   D E++ GQL++VP+ FA+ + AG EG +  SI T+   ++  + GK S    +  +V   S+ ++   ++  R K+T+
Subjt:  VSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITK

O23880 13S globulin seed storage protein 23.4e-1924.05Show/hide
Query:  AGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPG------KSSEF----------------------------VMKLEKGDLIPVPEGVTSWWFNP
        A R+V++P G  LP Y+++  + +V QG G+ G++ PG        SEF                            + ++ +GD+IP P GV  W  N 
Subjt:  AGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPG------KSSEF----------------------------VMKLEKGDLIPVPEGVTSWWFNP

Query:  ENDDSDFEILLIGDSSHALIPGDVTYVVFAGR-------------------------LGILQSFPPQYVAGSYSLNEEESAALLRSQSN--GLI-----F
         NDD     LL  +S H  +  +V     AG+                           IL  F  + +   +   + E+ + LR +++  G I      
Subjt:  ENDDSDFEILLIGDSSHALIPGDVTYVVFAGR-------------------------LGILQSFPPQYVAGSYSLNEEESAALLRSQSN--GLI-----F

Query:  KLRPDQTLPEPDE---------------SSGL----------------FNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYV
        KLR  Q   E  E               S+G+                 N Y   P A     G +  V  +  P +    L+A    L  NA+  P + 
Subjt:  KLRPDQTLPEPDE---------------SSGL----------------FNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYV

Query:  ADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAE
         + +   +YV+RG GR++V G  GK   D +V+ GQ+++VP+ FAV   AG EG+E   +  + + +   +GG+ S+  ++  +V   S++++ +
Subjt:  ADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAE

P04405 Glycinin G27.5e-1923.66Show/hide
Query:  LKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEF---------------------VM
        L A+      ES+GG    W  +  P    A VA  R  L       P Y +  +  Y+ QGNGI G++FPG  S +                     V 
Subjt:  LKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEF---------------------VM

Query:  KLEKGDLIPVPEGVTSWWFNPEN------------------DDSDFEILLIGDSSHALI-------------PGDVTYVVFAGRLGILQSFPPQYVAGSY
        +  +GDLI VP GV  W +N E+                  D       L G+     +              G        G   IL  F P+++  ++
Subjt:  KLEKGDLIPVPEGVTSWWFNPEN------------------DDSDFEILLIGDSSHALI-------------PGDVTYVVFAGRLGILQSFPPQYVAGSY

Query:  SL-----------NEEESAALLRSQSNGLIFKL----RPDQTLPEPDES--------------------SGL----------FNI-YDAVPDARLEAGGS
         +           NEEE +  + +   GL        +P Q   + DE                     +G+           NI  ++ PD      GS
Subjt:  SL-----------NEEESAALLRSQSNGLIFKL----RPDQTLPEPDES--------------------SGL----------FNI-YDAVPDARLEAGGS

Query:  VTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTH
        +T  T   FP +    L+A    L  NA+  P Y  + +  +IY   G   ++V    G+   D E++ G +++VP+ FAV   +  +  E  S  T   
Subjt:  VTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTH

Query:  PLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPP
        P I  L G +S+  +L  +V Q +FN+ ++  + +++     S LVPP
Subjt:  PLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPP

P11828 Glycinin G32.6e-1924.09Show/hide
Query:  LKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEF------------------VMKLE
        L A+      ES+GG    W  +  P    A VA  R  L       P Y ++ +  Y+ QG+GI G++FPG  S F                  +    
Subjt:  LKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEF------------------VMKLE

Query:  KGDLIPVPEGVTSWWFNPEN---------DDSDFEILLIGDSSHALIPGD-----VTYVVFAGRLG------------------ILQSFPPQYVAGSYSL
        +GDLI VP G   W +N E+         D + F+  L        + G+     + Y     + G                  IL  F P+++  ++ +
Subjt:  KGDLIPVPEGVTSWWFNPEN---------DDSDFEILLIGDSSHALIPGD-----VTYVVFAGRLG------------------ILQSFPPQYVAGSYSL

Query:  -----------NEEESAALLRSQSNGLIFKLRP---DQTLPEPDE------------SSGLFNIYDAV--------------PDARLEAGGSVTAVTEDK
                   NEEE    + +   GL     P    Q  PE +E            S     I + +              PD      GS+T  T   
Subjt:  -----------NEEESAALLRSQSNGLIFKLRP---DQTLPEPDE------------SSGLFNIYDAV--------------PDARLEAGGSVTAVTEDK

Query:  FPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGG
        FP +    L+A    L  NA+  P Y  + +  +IY   G   ++V    G+   D E++ GQ+++VP+ FAV   +  +  E  S  T   P I  L G
Subjt:  FPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGG

Query:  KDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPP
         +S+  +L  +V Q +FN+  +  + +++     S LVPP
Subjt:  KDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPP

Q02897 Glutelin type-B 22.2e-1822.4Show/hide
Query:  RLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPG------------------KSSEF------VMKLEKGDLIPVPEGVTSWWFNPENDDSDFEILL-
        R V++P+G  +P Y+++  + Y+ QG G  GL FPG                  +S +F      + +  +GD++ +P GV  W++N + D S   I + 
Subjt:  RLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPG------------------KSSEF------VMKLEKGDLIPVPEGVTSWWFNPENDDSDFEILL-

Query:  -IGDSSHALIPGDVTYVVFAGR----------------LGILQSFPPQYVAGSYSLNEEESAALLRSQS----------NGLIFKLRPDQT---------
         I +S++ L P    +++                      I   F  + ++ +  +N   +A  L+SQ+          NGL   L+P  T         
Subjt:  -IGDSSHALIPGDVTYVVFAGR----------------LGILQSFPPQYVAGSYSLNEEESAALLRSQS----------NGLIFKLRPDQT---------

Query:  -----LPEPDESSGLFNIYD---------------AVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIR
               E  ++S  +N  +               +  D+     G +++V   KFP +    ++A    L  NA+ +P +  + +  L+Y+ +G  R++
Subjt:  -----LPEPDESSGLFNIYD---------------AVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIR

Query:  VGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRS
        V    GK   D  ++ GQL+++P+++AV K A  EG +  +I T  +  +  L GK+S+F +L   V   ++ ++ E  + +++
Subjt:  VGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRS

Arabidopsis top hitse value%identityAlignment
AT1G03890.1 RmlC-like cupins superfamily protein1.1e-1721.91Show/hide
Query:  PVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFV----------------------MKLE---KGDLIPVPEGVTSWWFNP
        P L  A V   R+ L+P    LP +     + YV QG G+ G +  G    F                        KLE   +GD+     GV+ WW+N 
Subjt:  PVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFV----------------------MKLE---KGDLIPVPEGVTSWWFNP

Query:  ENDDSDFEILLIGDSSHA-----------LIPGDVTY-----VVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSN---------GLIFKLRPDQT
           DSD  I+++ D ++             + G  T      + +         F P  +A ++ +N E +  L   + N          L F + P + 
Subjt:  ENDDSDFEILLIGDSSHA-----------LIPGDVTY-----VVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSN---------GLIFKLRPDQT

Query:  LPEPDESSGL------FNIYDAVPDARLE-----AGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--
          +   ++G+        I++ + D           G ++ +     P +    L A+   L +  +  P + A+    ++YV+ G  +I+V    G+  
Subjt:  LPEPDESSGL------FNIYDAVPDARLE-----AGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--

Query:  MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
         + +V  GQ++++P+ FAV K AG+ G E  S  T  +  I  L G+ S   ++   V + S+ V  E  K ++    +    + PS+
Subjt:  MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

AT1G07750.1 RmlC-like cupins superfamily protein1.0e-6335.93Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        ME DL     +     DGG+Y  W   + P+L + ++ A +L L   GFA+P Y+DSSKV YV QG+G  G++ P K  E V+ +++GD I +P GV +W
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSG---LFNIYDAVP
        WFN  N+D +  IL +G++      G  T     G  GI   F  ++V  ++ L+E     L+ SQ+   I KL     +P+P E +    + N  +A  
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSG---LFNIYDAVP

Query:  DARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGME
        D  ++ GG V  +     P +G+ G  A L +++A+++ +P +  D ++Q+ Y+  GSGR++V G  GK  +++ +KAG L +VP++F V K+A  +GM 
Subjt:  DARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGME

Query:  CFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
         FSI+TT  P+   L G  S++ SLS +V Q +F V  E EK  RS  T ++   PPSN
Subjt:  CFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

AT2G28680.1 RmlC-like cupins superfamily protein9.6e-6234.82Show/hide
Query:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW
        ME DL     +     DGG+Y+ W   + P+L   ++ A +L L   G ALP Y+DS KV YV QG G  G++ P K  E V+ ++KGD I +P GV +W
Subjt:  MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSW

Query:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEP---DESSGLFNIYDAVP
        WFN  N+D++  +L +G++      G  T     G  GI   F  ++V  ++ L+E     L+ SQ+   I K+     +PEP   D    + N  +A  
Subjt:  WFNPENDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEP---DESSGLFNIYDAVP

Query:  DARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGME
        D  ++ GG V  +     P +G+ G  A L +++ +++ +P +  D ++Q+ Y+  GSGR+++ G  GK  +++ VKAG L +VP++F V K+A  +G+ 
Subjt:  DARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGME

Query:  CFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN
         FSI+TT  P+   L G+ S++ +LS +V Q +F V  E EK  RSK T  +    PSN
Subjt:  CFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN

AT4G28520.1 cruciferin 31.4e-0927.59Show/hide
Query:  QSNGL---IFKLRPDQTLPEPDESSGLFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG
        Q NGL   I  +R  + + +P           A  D    + G VT+V     P +    L+A    L+ NA+  P Y  + + +++Y + G GRI+V  
Subjt:  QSNGL---IFKLRPDQTLPEPDESSGLFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGG

Query:  FLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAE
          G+  +D +V+ GQLV++P+ FA    +     E  S  T  + +I  L G+ S+  +L  +V    F ++ E
Subjt:  FLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAE

AT4G28520.3 cruciferin 32.5e-0929.55Show/hide
Query:  GSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITT
        G VT+V     P +    L+A    L+ NA+  P Y  + + +++Y + G GRI+V    G+  +D +V+ GQLV++P+ FA    +     E  S  T 
Subjt:  GSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITT

Query:  THPLIEELGGKDSIFGSLSAQVFQVSFNVTAE
         + +I  L G+ S+  +L  +V    F ++ E
Subjt:  THPLIEELGGKDSIFGSLSAQVFQVSFNVTAE

AT4G28520.3 cruciferin 33.1e-0433.82Show/hide
Query:  ESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFV
        +S+ G    W  +  P L    V+  R V+   G  LP +  S K+ YV QG GI G + PG +  F+
Subjt:  ESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTCGACTTGAAGGCAATGAATGACCAGACCTTCTCGGAGAGCGACGGCGGAGCCTACTACAACTGGCCGTCTTCTCAGTTTCCGGTGCTGTCGAAAGCCGATGT
CGCCGCTGGCAGACTCGTCCTCCGCCCCCGCGGCTTCGCCCTCCCCCATTACGCCGATTCTTCTAAAGTTGGTTACGTTACACAAGGCAATGGAATTGGCGGACTTTTAT
TTCCGGGCAAGTCCAGCGAATTTGTGATGAAACTAGAGAAAGGAGACCTAATTCCGGTACCGGAAGGCGTCACCTCCTGGTGGTTCAACCCCGAAAATGACGACTCCGAT
TTCGAGATCCTCCTCATCGGCGACTCCTCACACGCCCTCATCCCCGGCGACGTCACATACGTCGTATTCGCGGGACGTCTCGGAATCCTCCAGAGTTTCCCGCCGCAGTA
CGTCGCCGGATCTTACTCCCTAAACGAAGAAGAATCCGCCGCTCTTCTCAGAAGCCAATCCAACGGCCTGATCTTCAAGCTCCGGCCGGACCAAACCCTACCCGAACCGG
ACGAAAGCAGCGGTCTGTTTAACATATACGACGCCGTTCCGGACGCCCGACTGGAGGCCGGCGGGTCAGTGACGGCCGTGACAGAGGATAAATTTCCGTTCATTGGGAAG
TCTGGGCTGACGGCGGTGCTCGAGAAGCTTGAGGCTAACGCCGTTAGGGCACCGGTCTACGTGGCGGACCCGTCGGTGCAACTTATTTATGTGAGTCGCGGGTCGGGTCG
GATTCGGGTGGGCGGGTTTTTGGGGAAAATGGATTCGGAGGTGAAAGCGGGGCAGCTGGTTTTGGTTCCAAAGTACTTCGCCGTCGGGAAGGTCGCCGGCGATGAGGGAA
TGGAGTGCTTCTCCATTATCACAACTACACACCCTCTAATAGAAGAATTGGGAGGGAAAGATTCGATTTTTGGGAGTTTATCAGCACAAGTTTTTCAAGTTTCATTCAAT
GTCACAGCTGAGTTTGAGAAGCTTCTCAGGTCAAAGATAACAAAAGCCTCACCCCTGGTTCCTCCCTCAAAT
mRNA sequenceShow/hide mRNA sequence
TCGAAGATAGCGTTGAGTTTATCGGCCTCTCTTTTCTTTGCCATTGTTTTTTCCCTTTCATCTCTCATATTTAAGCTCTTCAATCATCAATGGAGTTCGACTTGAAGGCA
ATGAATGACCAGACCTTCTCGGAGAGCGACGGCGGAGCCTACTACAACTGGCCGTCTTCTCAGTTTCCGGTGCTGTCGAAAGCCGATGTCGCCGCTGGCAGACTCGTCCT
CCGCCCCCGCGGCTTCGCCCTCCCCCATTACGCCGATTCTTCTAAAGTTGGTTACGTTACACAAGGCAATGGAATTGGCGGACTTTTATTTCCGGGCAAGTCCAGCGAAT
TTGTGATGAAACTAGAGAAAGGAGACCTAATTCCGGTACCGGAAGGCGTCACCTCCTGGTGGTTCAACCCCGAAAATGACGACTCCGATTTCGAGATCCTCCTCATCGGC
GACTCCTCACACGCCCTCATCCCCGGCGACGTCACATACGTCGTATTCGCGGGACGTCTCGGAATCCTCCAGAGTTTCCCGCCGCAGTACGTCGCCGGATCTTACTCCCT
AAACGAAGAAGAATCCGCCGCTCTTCTCAGAAGCCAATCCAACGGCCTGATCTTCAAGCTCCGGCCGGACCAAACCCTACCCGAACCGGACGAAAGCAGCGGTCTGTTTA
ACATATACGACGCCGTTCCGGACGCCCGACTGGAGGCCGGCGGGTCAGTGACGGCCGTGACAGAGGATAAATTTCCGTTCATTGGGAAGTCTGGGCTGACGGCGGTGCTC
GAGAAGCTTGAGGCTAACGCCGTTAGGGCACCGGTCTACGTGGCGGACCCGTCGGTGCAACTTATTTATGTGAGTCGCGGGTCGGGTCGGATTCGGGTGGGCGGGTTTTT
GGGGAAAATGGATTCGGAGGTGAAAGCGGGGCAGCTGGTTTTGGTTCCAAAGTACTTCGCCGTCGGGAAGGTCGCCGGCGATGAGGGAATGGAGTGCTTCTCCATTATCA
CAACTACACACCCTCTAATAGAAGAATTGGGAGGGAAAGATTCGATTTTTGGGAGTTTATCAGCACAAGTTTTTCAAGTTTCATTCAATGTCACAGCTGAGTTTGAGAAG
CTTCTCAGGTCAAAGATAACAAAAGCCTCACCCCTGGTTCCTCCCTCAAAT
Protein sequenceShow/hide protein sequence
MEFDLKAMNDQTFSESDGGAYYNWPSSQFPVLSKADVAAGRLVLRPRGFALPHYADSSKVGYVTQGNGIGGLLFPGKSSEFVMKLEKGDLIPVPEGVTSWWFNPENDDSD
FEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPQYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGLFNIYDAVPDARLEAGGSVTAVTEDKFPFIGK
SGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGKMDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFN
VTAEFEKLLRSKITKASPLVPPSN