; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019589 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019589
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionglutelin type-A 2-like
Genome locationchr5:43590830..43595067
RNA-Seq ExpressionLag0019589
SyntenyLag0019589
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039049.1 glutelin type-B 5 [Cucumis melo var. makuwa]9.9e-16280.85Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MEL+L+PM+P  FF GE GSFHKWFPS+  +I QTKVGAGRLLL PRGFAVPHNSDSSKVGYVLQGSGVAG++FP KS+EA+VRLKKGD+IPVPEGVTSW
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV
        WFND  SDF+VLLVGDTRNALIPGDITYVVFAGPLG+LQGFS DY++KVY+L EEE+++LLKSQ NGLIFKL   QTLPEP  HSDLVFNIY+A PD+ V
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV

Query:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI
        +GGG+VT +TE++FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIY+A GSGR+QI   F+ K IDAEVKAGQL+LVPKYFAVGK+AGEEGLECFTI
Subjt:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI

Query:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        IT T PLLEEL GK+SIFG  SP+VFQASFNVTA FEKLL SKITK+SPLVPPSD
Subjt:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

KAG7014994.1 12S seed storage protein CRD, partial [Cucurbita argyrosperma subsp. argyrosperma]1.9e-15283.85Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MELNLEPM+PKAFF GE GSFHKW PS+FPMIA TKVGAGRLLLRPRGFA+PHNSDSSKVGYVLQG G+AG+LFPG SDEA+VRLKKGDLIPVPEGVTSW
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV
        WFND  SDF+VLLVGDTRNALIPGDITYVVF GPLG+LQGFSPDYVQKVYNLN EE D LLKSQTNGLIFKL   Q +PEP  H DLVFNIY+ V  +  
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV

Query:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGKIDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIIT
        EG GSVT VTEKEFPFIGKSGLTAVLEKLEAN  RSPVYVADPSVQL+YIA GSGRVQIVGFLGKID  VKAGQLVLVPKYFA GK+AGEEGLECFTIIT
Subjt:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGKIDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIIT

Query:  ATQPLLEELAGKTSIFGTLSPE
        +T P LEEL GKTSI GT SP+
Subjt:  ATQPLLEELAGKTSIFGTLSPE

XP_008456077.1 PREDICTED: glutelin type-B 5 [Cucumis melo]6.9e-16381.13Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MEL+L+PM+P  FF GE GSFHKWFPS+ P+I QTKVGAGRLLL PRGFAVPHNSDSSKVGYVLQGSGVAG++FP KS+EA+VRLKKGD+IPVPEGVTSW
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV
        WFND  SDF+VLLVGDTRNALIPGDITYVVFAGPLG+LQGFS DY++KVY+L EEE+++LLKSQ NGLIFKL   QTLPEP  HSDLVFNIY+A PD+ V
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV

Query:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI
        +GGG+VT +TE++FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIY+A GSGR+QI   F+ K IDAEVKAGQL+LVPKYFAVGK+AGEEGLECFTI
Subjt:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI

Query:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        IT T PLLEEL GK+SIFG  SP+VFQASFNVTA FEKLL SKITK+SPLVPPSD
Subjt:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

XP_011651276.2 legumin J [Cucumis sativus]3.6e-16481.41Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MELNL+PM+P  FF GE GSFHKWFPS+FP+I+QTKVGAGRLLL PRGFAVPHNSDSSKVGYVLQGSGVAG++FP KS+EA VRLKKGD+IPVPEGVTSW
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV
        WFND  SDF+VLLVGDTRNALIPGDITYVVFAGPLG+LQGFS DY++KVY+L E+E+++LLKSQ NGLIFKL   QTLPEP  HSDLVFNIY+  PDA V
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV

Query:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI
        +GGGSVT +TE++FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIY+A GSGRVQI     +  IDAEVKAGQLVLVPKYFAVGK+AGEEGLECFTI
Subjt:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI

Query:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        IT T PLLEEL GKTSIFG  SP+VF+ASFN+TA FEKL RSKITK+SPLVPPSD
Subjt:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

XP_038879635.1 legumin type B-like [Benincasa hispida]9.9e-17086.16Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        M+LNL+PM+P  FF+GE GSFHKWFPS+FP+IAQTKVGAGRLLL PRGFAVPHNSDSSKVGYVLQGSGVAG+LFP KS+EA+VRLKKGDLIPVPEGVTSW
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV
        WFND  SDF+VLLVGDTRNALIPGDITYVVFAGPLGILQGFS DY+QKVY+LNEEE+DILLKSQTNGLIFKL   QTLPEP  HS LVFNIY+A+PDA V
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV

Query:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI
        +GGGSVT VT+++FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIA GSGRVQIV  FL K IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI
Subjt:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI

Query:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPS
        IT T PLLEEL G TSIFGT SP+VFQASFNVTA+FEKLLRSKITK S LVPPS
Subjt:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPS

TrEMBL top hitse value%identityAlignment
A0A0A0LC21 Uncharacterized protein1.8e-16481.41Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MELNL+PM+P  FF GE GSFHKWFPS+FP+I+QTKVGAGRLLL PRGFAVPHNSDSSKVGYVLQGSGVAG++FP KS+EA VRLKKGD+IPVPEGVTSW
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV
        WFND  SDF+VLLVGDTRNALIPGDITYVVFAGPLG+LQGFS DY++KVY+L E+E+++LLKSQ NGLIFKL   QTLPEP  HSDLVFNIY+  PDA V
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV

Query:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI
        +GGGSVT +TE++FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIY+A GSGRVQI     +  IDAEVKAGQLVLVPKYFAVGK+AGEEGLECFTI
Subjt:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI

Query:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        IT T PLLEEL GKTSIFG  SP+VF+ASFN+TA FEKL RSKITK+SPLVPPSD
Subjt:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

A0A1S3C332 glutelin type-B 53.3e-16381.13Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MEL+L+PM+P  FF GE GSFHKWFPS+ P+I QTKVGAGRLLL PRGFAVPHNSDSSKVGYVLQGSGVAG++FP KS+EA+VRLKKGD+IPVPEGVTSW
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV
        WFND  SDF+VLLVGDTRNALIPGDITYVVFAGPLG+LQGFS DY++KVY+L EEE+++LLKSQ NGLIFKL   QTLPEP  HSDLVFNIY+A PD+ V
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV

Query:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI
        +GGG+VT +TE++FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIY+A GSGR+QI   F+ K IDAEVKAGQL+LVPKYFAVGK+AGEEGLECFTI
Subjt:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI

Query:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        IT T PLLEEL GK+SIFG  SP+VFQASFNVTA FEKLL SKITK+SPLVPPSD
Subjt:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

A0A5A7TCP0 Glutelin type-B 54.8e-16280.85Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MEL+L+PM+P  FF GE GSFHKWFPS+  +I QTKVGAGRLLL PRGFAVPHNSDSSKVGYVLQGSGVAG++FP KS+EA+VRLKKGD+IPVPEGVTSW
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV
        WFND  SDF+VLLVGDTRNALIPGDITYVVFAGPLG+LQGFS DY++KVY+L EEE+++LLKSQ NGLIFKL   QTLPEP  HSDLVFNIY+A PD+ V
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV

Query:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI
        +GGG+VT +TE++FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIY+A GSGR+QI   F+ K IDAEVKAGQL+LVPKYFAVGK+AGEEGLECFTI
Subjt:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI

Query:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        IT T PLLEEL GK+SIFG  SP+VFQASFNVTA FEKLL SKITK+SPLVPPSD
Subjt:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

A0A5D3BKT3 Glutelin type-B 53.3e-16381.13Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MEL+L+PM+P  FF GE GSFHKWFPS+ P+I QTKVGAGRLLL PRGFAVPHNSDSSKVGYVLQGSGVAG++FP KS+EA+VRLKKGD+IPVPEGVTSW
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV
        WFND  SDF+VLLVGDTRNALIPGDITYVVFAGPLG+LQGFS DY++KVY+L EEE+++LLKSQ NGLIFKL   QTLPEP  HSDLVFNIY+A PD+ V
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGV

Query:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI
        +GGG+VT +TE++FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIY+A GSGR+QI   F+ K IDAEVKAGQL+LVPKYFAVGK+AGEEGLECFTI
Subjt:  EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIV-GFLGK-IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTI

Query:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        IT T PLLEEL GK+SIFG  SP+VFQASFNVTA FEKLL SKITK+SPLVPPSD
Subjt:  ITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

A0A6J1E9P2 legumin J-like5.5e-13467.14Show/hide
Query:  EPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQG-SGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSWWFND
        +PMNPK F E E GS+HKW PSE+P++AQ KV AGRLLLRPRGF VPH +D SKVGYVLQG +GVAGL+FP KSDE +V LKKGDLIPVP GV+SWWFND
Subjt:  EPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQG-SGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSWWFND

Query:  DHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGVEGG-
          SD +++ +G+++NA +PGDI+Y V +GPL +L GFSP+YV K Y+LN EE    LKSQ+N LIF +   Q+LP+P ++S  V+NI  A PD  V+GG 
Subjt:  DHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGVEGG-

Query:  GSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGKIDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQ
        G+VTTVTE +FPFIG+SGLTA+LEKL ANAVRSPVYVA+P  QLIY+A G G++QIVG   KIDAEVK GQL+LVPK+FAVGK+AGE+GLEC +IITAT 
Subjt:  GSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGKIDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQ

Query:  PLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        P++EELAGKTS+   LSPE+FQ SFNVTA+FEKLLRSKIT ASP++  SD
Subjt:  PLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

SwissProt top hitse value%identityAlignment
A0A222NNM9 Cocosin 12.1e-2124.18Show/hide
Query:  RLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPG-----------------------KSDEAIVRLKKGDLIPVPEGVTSWWFNDDHSDFQVLLVGDT
        R ++ PRG  +P  S++ ++ Y++QG G+ GL+ PG                          + + + ++GD++ VP G   W +N+  +    + V DT
Subjt:  RLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPG-----------------------KSDEAIVRLKKGDLIPVPEGVTSWWFNDDHSDFQVLLVGDT

Query:  RNALIPGDITYVVF--AG---------------PLGILQGFSPDYVQKVYNLNEE-EKDILLKSQTNGLIFKLPPGQTLPEPG----------------E
         N     D ++  F  AG                  IL+GFS + +   + +N E  + +  +  T G I +   G  +  P                 E
Subjt:  RNALIPGDITYVVF--AG---------------PLGILQGFSPDYVQKVYNLNEE-EKDILLKSQTNGLIFKLPPGQTLPEPG----------------E

Query:  HSDLVFNIYNAVPD---AGV--EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLV
         +     I   + D   A V    GG +TT+  ++ P +    ++A    L  NA+ SP +  + +  ++Y  GG GRV++    G+   D E++ GQL+
Subjt:  HSDLVFNIYNAVPD---AGV--EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLV

Query:  LVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITK
        +VP+ FA+ + AG EG +  +I T+ + ++  + GKTS    +  EV   S+ ++   ++  R K+T+
Subjt:  LVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITK

P07728 Glutelin type-A 12.3e-2022.98Show/hide
Query:  TKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPG----------------------------KSDEAIVRLKKGDLIPVPEGVTSWWFND--
        T V   R ++ PRG  +PH ++ + + Y++QG G+ G  FPG                               + I R ++GD+I +P GV  W +ND  
Subjt:  TKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPG----------------------------KSDEAIVRLKKGDLIPVPEGVTSWWFND--

Query:  ------------------DHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEE-EKDILLKSQTNGLIFKLPPGQTLPEP----
                          D      LL G+ RN   P      V      I  GFS + + +   ++ +  + +  ++   G I ++  G +L +P    
Subjt:  ------------------DHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEE-EKDILLKSQTNGLIFKLPPGQTLPEP----

Query:  ----------------GEHSDLVF-------------------NIYNA-VPDAGVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSV
                        G++    +                   NI N    D      G VT +  + FP +    ++AV   L  NA+ SP +  + + 
Subjt:  ----------------GEHSDLVF-------------------NIYNA-VPDAGVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSV

Query:  QLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLR
         ++YI  G  RVQ+V   GK   + E++ GQL+++P+++AV K A  EG       T    ++  +AGK+SIF  L  +V   ++ ++ +  + L+
Subjt:  QLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLR

P07730 Glutelin type-A 21.4e-2023.74Show/hide
Query:  TKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPG----------------------------KSDEAIVRLKKGDLIPVPEGVTSWWFND--
        T V   R ++ PRG  +PH ++ + + Y++QG G+ G  FPG                               + I R ++GD+I +P GV  W +ND  
Subjt:  TKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPG----------------------------KSDEAIVRLKKGDLIPVPEGVTSWWFND--

Query:  ------------------DHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNL-NEEEKDILLKSQTNGLIFKLPPGQTLPEP----
                          D      LL G+ RN   P      V      I  GFS + + + + + N+  + +  ++   G I ++  G +L +P    
Subjt:  ------------------DHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNL-NEEEKDILLKSQTNGLIFKLPPGQTLPEP----

Query:  ----------GEH------------------SDLVF-------NIYNA-VPDAGVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSV
                   EH                   D  F       NI N    D      G VT +  + FP +    ++AV   L  NA+ SP +  + + 
Subjt:  ----------GEH------------------SDLVF-------NIYNA-VPDAGVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSV

Query:  QLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLR
         ++YI  G  +VQ+V   GK   + E++ GQL++VP+++ V K A  EG       T    ++  +AGK+SIF  L  +V   ++ ++ +  + L+
Subjt:  QLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLR

P11828 Glycinin G33.2e-2225.34Show/hide
Query:  LEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPG------------------KSDEAIVRLK
        L  + P    E E G    W P+  P      V   R  L       P  +++ +  Y+ QGSG+ G++FPG                     + I   +
Subjt:  LEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPG------------------KSDEAIVRLK

Query:  KGDLIPVPEGVTSWWFND-----------DHSDFQVLLVGDTRNALIPGD-----ITYVVFAGPLG------------------ILQGFSPDY-------
        +GDLI VP G   W +N+           D + FQ  L    R   + G+     + Y       G                  IL GF+P++       
Subjt:  KGDLIPVPEGVTSWWFND-----------DHSDFQVLLVGDTRNALIPGD-----ITYVVFAGPLG------------------ILQGFSPDY-------

Query:  ----VQKVYNLNEEEKDILLKSQTNGLIFKLPP---GQTLPEPGEHSD------------------------LVFNI-YNAVPDAGVEGGGSVTTVTEKE
            V+K+   NEEE+   + +   GL    PP    Q  PE  E  D                        L  NI   + PD      GS+TT T  +
Subjt:  ----VQKVYNLNEEEKDILLKSQTNGLIFKLPP---GQTLPEPGEHSD------------------------LVFNI-YNAVPDAGVEGGGSVTTVTEKE

Query:  FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAG
        FP +    L+A    L  NA+  P Y  + +  +IY   G   VQ+V   G+   D E++ GQ+++VP+ FAV   +  +  E  +  T  +P +  LAG
Subjt:  FPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAG

Query:  KTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
          S+   L  EV Q +FN+  +  + +++     S LVPP +
Subjt:  KTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

Q8GZP6 11S globulin seed storage protein Ana o 2.0101 (Fragment)8.0e-2124.19Show/hide
Query:  LEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSD---------------------EAIV
        L+ + P    E E G+   W P+         V   R  ++P G  +P  S++ ++ YV+QG G+ G+ +PG  +                     + I 
Subjt:  LEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSD---------------------EAIV

Query:  RLKKGDLIPVPEGVTSWWFNDDHSDFQVLLVGDTRNALIPGDIT---YVVFAGPLGILQ--------------GFSPDYVQKVYNLNEEEKDILLKSQTN
        R ++GD+I +P GV  W +N+ +S    + + D  N+    D T   + +   P  + Q              GF  + + + + ++E     L      
Subjt:  RLKKGDLIPVPEGVTSWWFNDDHSDFQVLLVGDTRNALIPGDIT---YVVFAGPLGILQ--------------GFSPDYVQKVYNLNEEEKDILLKSQTN

Query:  GLIFK--------LPPGQTLPEPGEHSD-------------------------LVFNIYN-AVPDAGVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEAN
        G I K        + P ++  E G  S+                         L  NI + A  D      G +TT+     P +    L+     L  N
Subjt:  GLIFK--------LPPGQTLPEPGEHSD-------------------------LVFNIYN-AVPDAGVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEAN

Query:  AVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNV
        A+  P +  + S  +IY   G G+VQ+V   G    D EV+ GQ+++VP+ FAV K A EE  E  +  T  + +   LAG+TS+ G +  EV   +F +
Subjt:  AVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNV

Query:  T
        +
Subjt:  T

Arabidopsis top hitse value%identityAlignment
AT1G03880.1 cruciferin 28.2e-1323.97Show/hide
Query:  EAIVRLKKGDLIPVPEGVTSWWFNDDHSDFQVLLVGD--TRNALIPGDITYVVFAG--PLG--------------ILQGFSPDYVQKVYNLNEEEKDILL
        + +  L+ GD I  P GV  W++N+ +    ++   D  +    +  ++   + AG  P G              I  GF+P+ + + + +N E    L 
Subjt:  EAIVRLKKGDLIPVPEGVTSWWFNDDHSDFQVLLVGD--TRNALIPGDITYVVFAG--PLG--------------ILQGFSPDYVQKVYNLNEEEKDILL

Query:  KSQTN-GLIFKL--PPGQTLP--EPGEHSDLVFNIYNAV----------------PDAGV--EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPV
          Q N G I K+  P G   P    GE       I N +                 DA V     G ++T+     P +    L+A+   +  NA+  P 
Subjt:  KSQTN-GLIFKL--PPGQTLP--EPGEHSDLVFNIYNAV----------------PDAGV--EGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPV

Query:  YVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEK
        +  + +  L Y+  G   +Q+V   G+   D E+ +GQL++VP+ F+V K A  E  E     T     +  LAG+TS+   L  EV    + ++ +  K
Subjt:  YVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEK

Query:  -----LLRSKITKASPL
              + + +T +SP+
Subjt:  -----LLRSKITKASPL

AT1G03890.1 RmlC-like cupins superfamily protein9.0e-2022.58Show/hide
Query:  PMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSD-------------------------EAIVRLKKGDLIPVPEGVTSWWFND
        P +    V   R+ L+P    +P       + YV+QG GV G +  G  +                         + +   ++GD+     GV+ WW+N 
Subjt:  PMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSD-------------------------EAIVRLKKGDLIPVPEGVTSWWFND

Query:  DHSDFQVLLVGDTRNALIPGDITYVVF--AG--------PL------GILQGFSPDYVQKVYNLNEEEKDILLKSQTN---------GLIFKLPPGQTLP
          SD  +++V D  N     D    +F  AG        PL          GF P+ + + + +N E    L   + N          L F +PP +   
Subjt:  DHSDFQVLLVGDTRNALIPGDITYVVF--AG--------PL------GILQGFSPDYVQKVYNLNEEEKDILLKSQTN---------GLIFKLPPGQTLP

Query:  EPG-----EHSDLVFNIYNAVPDAGVEG-----GGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--ID
        + G     E +     I+  + D           G ++T+     P +    L A+   L +  +  P + A+    ++Y+ GG  ++Q+V   G+   +
Subjt:  EPG-----EHSDLVFNIYNAVPDAGVEG-----GGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--ID

Query:  AEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLR
         +V  GQ++++P+ FAV K AGE G E  +  T     +  L+G+TS    +  +V +AS+ V  +  K ++
Subjt:  AEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLR

AT1G07750.1 RmlC-like cupins superfamily protein2.5e-7841.18Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MEL+L P  PK  + G+ GS+  W P E PM+ Q  +GA +L L   GFAVP  SDSSKV YVLQGSG AG++ P K +E ++ +K+GD I +P GV +W
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGE--HSDLVFNIYNAVPDA
        WFN++  +  +L +G+T      G  T     G  GI  GFS ++V + ++L+E     L+ SQT   I KL  G  +P+P E   +  V N   A  D 
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGE--HSDLVFNIYNAVPDA

Query:  GVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECF
         ++ GG V  +  K  P +G+ G  A L +++A+++ SP +  D ++Q+ YI GGSGRVQ+VG  GK  ++  +KAG L +VP++F V K+A  +G+  F
Subjt:  GVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECF

Query:  TIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        +I+T   P+   LAG TS++ +LSPEV QA+F V  + EK  RS  T ++   PPS+
Subjt:  TIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

AT2G28680.1 RmlC-like cupins superfamily protein1.2e-7540.62Show/hide
Query:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW
        MEL+L P  PK  + G+ GS+  W P E PM+    +GA +L L   G A+P  SDS KV YVLQG+G AG++ P K +E ++ +KKGD I +P GV +W
Subjt:  MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSW

Query:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEP--GEHSDLVFNIYNAVPDA
        WFN++ ++  VL +G+T      G  T     G  GI  GFS ++V + ++L+E     L+ SQT   I K+     +PEP  G+    V N   A  D 
Subjt:  WFNDDHSDFQVLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEP--GEHSDLVFNIYNAVPDA

Query:  GVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECF
         ++ GG V  +  K  P +G+ G  A L +++ +++ SP +  D ++Q+ YI GGSGRVQIVG  GK  ++  VKAG L +VP++F V K+A  +GL  F
Subjt:  GVEGGGSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECF

Query:  TIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD
        +I+T   P+   LAG+TS++  LSPEV QA+F V  + EK  RSK T  +    PS+
Subjt:  TIITATQPLLEELAGKTSIFGTLSPEVFQASFNVTAKFEKLLRSKITKASPLVPPSD

AT4G28520.1 cruciferin 36.5e-1030.77Show/hide
Query:  GSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITA
        G VT+V     P +    L+A    L+ NA+  P Y  + + +++Y  GG GR+Q+V   G+  +D +V+ GQLV++P+ FA    +     E  +  T 
Subjt:  GSVTTVTEKEFPFIGKSGLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGK--IDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITA

Query:  TQPLLEELAGKTSIFGTLSPEVFQASFNVT
           ++  LAG+TS+   L  EV    F ++
Subjt:  TQPLLEELAGKTSIFGTLSPEVFQASFNVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTGAATTTGGAGCCAATGAATCCAAAGGCCTTCTTTGAGGGAGAGGATGGATCGTTCCATAAATGGTTCCCTTCGGAGTTTCCGATGATCGCTCAGACCAAAGT
CGGCGCCGGCAGACTCCTCCTCCGCCCCCGCGGCTTCGCCGTTCCCCATAACTCCGACTCCTCAAAAGTTGGCTATGTTCTTCAAGGTAGCGGAGTGGCCGGACTTCTAT
TTCCGGGCAAGTCCGACGAAGCAATAGTGAGACTTAAGAAAGGAGACCTAATTCCGGTGCCGGAGGGCGTCACCTCCTGGTGGTTCAACGACGACCACTCCGATTTCCAA
GTCCTGCTCGTCGGCGACACTCGAAACGCCCTCATCCCCGGCGACATCACCTACGTCGTCTTCGCCGGACCTCTCGGAATCCTGCAGGGCTTCTCGCCGGACTACGTCCA
AAAAGTCTACAATTTAAACGAGGAAGAAAAGGACATACTCCTCAAAAGCCAAACCAACGGCCTAATCTTCAAGCTCCCACCGGGCCAGACCCTGCCGGAGCCCGGCGAGC
ACAGCGACCTCGTTTTCAACATTTACAATGCCGTTCCGGACGCCGGAGTGGAGGGCGGCGGGTCGGTGACGACGGTGACTGAAAAGGAGTTTCCGTTCATCGGGAAATCT
GGGTTGACGGCGGTTCTCGAGAAGCTTGAGGCAAACGCCGTCCGGTCGCCGGTGTACGTCGCCGACCCTTCGGTGCAGCTGATATATATCGCCGGCGGGTCGGGTCGGGT
TCAGATTGTTGGGTTTTTGGGGAAGATTGATGCGGAGGTGAAAGCCGGTCAGTTGGTTTTGGTTCCGAAGTACTTCGCCGTCGGAAAAGTCGCCGGAGAAGAAGGCTTGG
AGTGCTTCACTATTATCACTGCCACACAGCCTTTACTAGAGGAGTTGGCAGGAAAAACATCAATCTTTGGGACATTATCACCAGAAGTTTTTCAAGCTTCATTCAATGTC
ACAGCTAAGTTTGAGAAACTTCTCAGATCAAAGATAACAAAGGCTTCACCCTTGGTTCCTCCTTCAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTGAATTTGGAGCCAATGAATCCAAAGGCCTTCTTTGAGGGAGAGGATGGATCGTTCCATAAATGGTTCCCTTCGGAGTTTCCGATGATCGCTCAGACCAAAGT
CGGCGCCGGCAGACTCCTCCTCCGCCCCCGCGGCTTCGCCGTTCCCCATAACTCCGACTCCTCAAAAGTTGGCTATGTTCTTCAAGGTAGCGGAGTGGCCGGACTTCTAT
TTCCGGGCAAGTCCGACGAAGCAATAGTGAGACTTAAGAAAGGAGACCTAATTCCGGTGCCGGAGGGCGTCACCTCCTGGTGGTTCAACGACGACCACTCCGATTTCCAA
GTCCTGCTCGTCGGCGACACTCGAAACGCCCTCATCCCCGGCGACATCACCTACGTCGTCTTCGCCGGACCTCTCGGAATCCTGCAGGGCTTCTCGCCGGACTACGTCCA
AAAAGTCTACAATTTAAACGAGGAAGAAAAGGACATACTCCTCAAAAGCCAAACCAACGGCCTAATCTTCAAGCTCCCACCGGGCCAGACCCTGCCGGAGCCCGGCGAGC
ACAGCGACCTCGTTTTCAACATTTACAATGCCGTTCCGGACGCCGGAGTGGAGGGCGGCGGGTCGGTGACGACGGTGACTGAAAAGGAGTTTCCGTTCATCGGGAAATCT
GGGTTGACGGCGGTTCTCGAGAAGCTTGAGGCAAACGCCGTCCGGTCGCCGGTGTACGTCGCCGACCCTTCGGTGCAGCTGATATATATCGCCGGCGGGTCGGGTCGGGT
TCAGATTGTTGGGTTTTTGGGGAAGATTGATGCGGAGGTGAAAGCCGGTCAGTTGGTTTTGGTTCCGAAGTACTTCGCCGTCGGAAAAGTCGCCGGAGAAGAAGGCTTGG
AGTGCTTCACTATTATCACTGCCACACAGCCTTTACTAGAGGAGTTGGCAGGAAAAACATCAATCTTTGGGACATTATCACCAGAAGTTTTTCAAGCTTCATTCAATGTC
ACAGCTAAGTTTGAGAAACTTCTCAGATCAAAGATAACAAAGGCTTCACCCTTGGTTCCTCCTTCAGATTGA
Protein sequenceShow/hide protein sequence
MELNLEPMNPKAFFEGEDGSFHKWFPSEFPMIAQTKVGAGRLLLRPRGFAVPHNSDSSKVGYVLQGSGVAGLLFPGKSDEAIVRLKKGDLIPVPEGVTSWWFNDDHSDFQ
VLLVGDTRNALIPGDITYVVFAGPLGILQGFSPDYVQKVYNLNEEEKDILLKSQTNGLIFKLPPGQTLPEPGEHSDLVFNIYNAVPDAGVEGGGSVTTVTEKEFPFIGKS
GLTAVLEKLEANAVRSPVYVADPSVQLIYIAGGSGRVQIVGFLGKIDAEVKAGQLVLVPKYFAVGKVAGEEGLECFTIITATQPLLEELAGKTSIFGTLSPEVFQASFNV
TAKFEKLLRSKITKASPLVPPSD