; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002723 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002723
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionglutelin type-A 2-like
Genome locationscaffold6:5306911..5311705
RNA-Seq ExpressionSpg002723
SyntenySpg002723
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006044 - 11-S seed storage protein, plant
IPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99759.1 glutelin type-A 2-like [Cucumis melo var. makuwa]1.1e-10754.44Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV+Q           G +GV G VFPN+  E V+KL KGD IP
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP

Query:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL
        +P+G+ SWW+NDGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ETNK   SQ+ +LI  +QP+ SLPKP   H+K+V+N+
Subjt:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL

Query:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG
          A P++       ++T + ES FPFIGQTGL+A+LEKL  N +RSP+Y  + S Q+IYV  GSG++++VGF++ K  A+VK GQL++VP+YF VGK AG
Subjt:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG

Query:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMF
        EEG+EC S++  T P++EELAGK SV EALS EV QVSFNV+A+FEK+F
Subjt:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMF

XP_004151504.1 legumin J [Cucumis sativus]2.5e-10453.12Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP
        ++ M  + F E +GG Y+ W  S +P+L+Q NVA GRL+LRPRGFA+PHY+D  K GYV+Q           G +GV G VFP +  E V+KL KGD IP
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP

Query:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL
        +P GV SWW+NDGDSDLEI FLGET  +H+ G+ITYF+L+G +G+L  F+PE++ K+ +LN++ETN    SQ  +LI  +QP+ SLPKP   ++K+V+N+
Subjt:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL

Query:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG
          A P++       ++T + ES FPFIGQTGL+ +LEKL  N +RSP+Y  + S Q+IYV  GSG++++VGF++ K  A+VK GQL++VP+YF VGK AG
Subjt:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG

Query:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK
        EEG+EC S++  T P++EELAGK SV EALS EV QVSFNV+A+FEK+F SK
Subjt:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK

XP_008456076.1 PREDICTED: glutelin type-A 2-like [Cucumis melo]2.2e-10854.55Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV+Q           G +GV G VFPN+  E V+KL KGD IP
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP

Query:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL
        +P+G+ SWW+NDGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ETNK   SQ+ +LI  +QP+ SLPKP   H+K+V+N+
Subjt:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL

Query:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG
          A P++       ++T + ES FPFIGQTGL+A+LEKL  N +RSP+Y  + S Q+IYV  GSG++++VGF++ K  A+VK GQL++VP+YF VGK AG
Subjt:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG

Query:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK
        EEG+EC S++  T P++EELAGK SV EALS EV QVSFNV+A+FEK+F SK
Subjt:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK

XP_030937791.1 glutelin type-A 3-like [Quercus lobata]1.6e-10355.31Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG
        MEFDL P   Q   E +GG YY W SS+FP+L +  V AG+LVL+PRGFALPHYADS KVGYV+            G +G+VG+VFP  SKE VLKL KG
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG

Query:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHH-NK
        D IP+  G VSWW+NDGDS+L I FLGETS S+I GE TYF L+GA GI+  FSPEF+S+AY +NK E NKL  SQTG+LIIKLQ   ++PKP  ++ NK
Subjt:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHH-NK

Query:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV
        +++N+    P+  + + G  +T+L E+KFPF+ Q  LS  L +L  N + SP YT DSSVQ+IYVV G G+++IVG N  + +  EVK G LLVVP++FV
Subjt:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV

Query:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK
        V K AG +G+ECFSI T+ KP  E LA KASVW+ALS  V++ S NVS + E++F SK
Subjt:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK

XP_030955195.1 glutelin type-A 3-like [Quercus lobata]6.2e-10355.03Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG
        MEFDL P   Q   E +GG YY W SS+FP+L +  V AG+LVL+PRGFALPHYA S KVGYV+            G +G+VG+VFP  SKE VLKL KG
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG

Query:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHH-NK
        D IP+  G VSWW+NDGDS+L I FLGETS S+I GE TYF L+GA GI+  FSPEF+S+AY +NK E NKL  SQTG+LIIKLQ   ++PKP  ++ NK
Subjt:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHH-NK

Query:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV
        +++N+    P+  + + G  +T+L E+KFPF+ Q  LS  L +L  N + SP YT DSSVQ+IYVV G G+++IVG N  + +  EVK G LLVVP++FV
Subjt:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV

Query:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK
        V K AG +G+ECFSI T+ KP  E LA KASVW+ALS  V++ S NVS + E++F SK
Subjt:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK

TrEMBL top hitse value%identityAlignment
A0A0A0L6K0 Uncharacterized protein1.2e-10453.12Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP
        ++ M  + F E +GG Y+ W  S +P+L+Q NVA GRL+LRPRGFA+PHY+D  K GYV+Q           G +GV G VFP +  E V+KL KGD IP
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP

Query:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL
        +P GV SWW+NDGDSDLEI FLGET  +H+ G+ITYF+L+G +G+L  F+PE++ K+ +LN++ETN    SQ  +LI  +QP+ SLPKP   ++K+V+N+
Subjt:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL

Query:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG
          A P++       ++T + ES FPFIGQTGL+ +LEKL  N +RSP+Y  + S Q+IYV  GSG++++VGF++ K  A+VK GQL++VP+YF VGK AG
Subjt:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG

Query:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK
        EEG+EC S++  T P++EELAGK SV EALS EV QVSFNV+A+FEK+F SK
Subjt:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK

A0A1S3C2D5 glutelin type-A 2-like1.1e-10854.55Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV+Q           G +GV G VFPN+  E V+KL KGD IP
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP

Query:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL
        +P+G+ SWW+NDGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ETNK   SQ+ +LI  +QP+ SLPKP   H+K+V+N+
Subjt:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL

Query:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG
          A P++       ++T + ES FPFIGQTGL+A+LEKL  N +RSP+Y  + S Q+IYV  GSG++++VGF++ K  A+VK GQL++VP+YF VGK AG
Subjt:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG

Query:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK
        EEG+EC S++  T P++EELAGK SV EALS EV QVSFNV+A+FEK+F SK
Subjt:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK

A0A2N9F3X5 Uncharacterized protein2.3e-10355.03Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG
        MEFDL P   Q   E DGG YY W SS+FP+L +  V AG+L LRP GFALPHYADS K+GYV+Q           G +G+VG+V PN  +E VLKL KG
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG

Query:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKP-RGHHNK
        D IP+  G VSWW+NDGDS+L++ FLGETS S+I GE TYF L+G QGI++ FS EF+S+AYN+NK + NKL  SQTG L+ KLQ   +LPKP + + NK
Subjt:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKP-RGHHNK

Query:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV
        +V+N+  + P+      G+ +TSL E+KFPF+GQ  LSA L K+    + SP+YT DSSVQ+IYVV G+GR+EIVG N  + +  EVK G LLVVP++F 
Subjt:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV

Query:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK
          K AG +G+E FSI+TT +P++E+LA KASVWEALS  V++ S NVSA+F ++F SK
Subjt:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK

A0A5A7T7U8 Glutelin type-A 2-like1.1e-10854.55Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV+Q           G +GV G VFPN+  E V+KL KGD IP
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP

Query:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL
        +P+G+ SWW+NDGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ETNK   SQ+ +LI  +QP+ SLPKP   H+K+V+N+
Subjt:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL

Query:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG
          A P++       ++T + ES FPFIGQTGL+A+LEKL  N +RSP+Y  + S Q+IYV  GSG++++VGF++ K  A+VK GQL++VP+YF VGK AG
Subjt:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG

Query:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK
        EEG+EC S++  T P++EELAGK SV EALS EV QVSFNV+A+FEK+F SK
Subjt:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK

A0A5D3BLA4 Glutelin type-A 2-like5.3e-10854.44Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV+Q           G +GV G VFPN+  E V+KL KGD IP
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIP

Query:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL
        +P+G+ SWW+NDGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ETNK   SQ+ +LI  +QP+ SLPKP   H+K+V+N+
Subjt:  IPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNL

Query:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG
          A P++       ++T + ES FPFIGQTGL+A+LEKL  N +RSP+Y  + S Q+IYV  GSG++++VGF++ K  A+VK GQL++VP+YF VGK AG
Subjt:  ANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAG

Query:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMF
        EEG+EC S++  T P++EELAGK SV EALS EV QVSFNV+A+FEK+F
Subjt:  EEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMF

SwissProt top hitse value%identityAlignment
A0A222NNM9 Cocosin 13.0e-2324.69Show/hide
Query:  TDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFP---------------------
        T  + SE+    Y+   + QF     A V+  R V+ PRG  LP  +++P++ Y++Q              G+VGLV P                     
Subjt:  TDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFP---------------------

Query:  --NESKEQVLKLSKGDAIPIPTGVVSWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAGAQ---------------GILSTFSPEFISKAYNLNKQE
           +  ++V +  +GD + +P G   W YN+G++ +    + +TSN  + +      FLLAG Q                IL  FS E ++ A+ +N + 
Subjt:  --NESKEQVLKLSKGDAIPIPTGVVSWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAGAQ---------------GILSTFSPEFISKAYNLNKQE

Query:  TNKL-THSQTGILIIKLQPNNSLPKPRGHHN-------------------KMVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGV
          KL     T   I++ +    + +P G                      K+  N+ +          G  IT+L   K P +    +SA    L  N +
Subjt:  TNKL-THSQTGILIIKLQPNNSLPKPRGHHN-------------------KMVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGV

Query:  RSPIYTVDSSVQVIYVVSGSGRVEIVGFNA-VKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSA
         SP + +++   ++Y   G GRVE+           E++ GQLL+VP+ F + + AG EG +  SI T+ + ++  + GK S    +  EVL  S+ +S 
Subjt:  RSPIYTVDSSVQVIYVVSGSGRVEIVGFNA-VKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSA

Query:  D
        D
Subjt:  D

P07730 Glutelin type-A 23.0e-2322.86Show/hide
Query:  VAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFP----------------------------NESKEQVLKLSKGDAIPIPTG
        V+  R V+ PRG  LPHY +   + Y+IQ              G+ G  FP                             +  +++ +  +GD I +P G
Subjt:  VAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFP----------------------------NESKEQVLKLSKGDAIPIPTG

Query:  VVSWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAG---------------AQGILSTFSPEFISKAYNLNKQETNKL--THSQTGILIIKLQPNNS
        V  W YNDG+  +   ++ + +N  + +      FLLAG               +Q I S FS E +S+A+ ++ Q   +L   + Q G  I++++   S
Subjt:  VVSWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAG---------------AQGILSTFSPEFISKAYNLNKQETNKL--THSQTGILIIKLQPNNS

Query:  LPKP-----RGHHNKMVFNLANAEPNHHKAEFGVS---------------------------------ITSLEESKFPFIGQTGLSAMLEKLCPNGVRSP
        L +P          +M       E  + ++++G                                   +T+L    FP +    +SA+   L  N + SP
Subjt:  LPKP-----RGHHNKMVFNLANAEPNHHKAEFGVS---------------------------------ITSLEESKFPFIGQTGLSAMLEKLCPNGVRSP

Query:  IYTVDSSVQVIYVVSGSGRVEIVGFNA-VKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSAD
         + +++   ++Y+  G  +V++V  N       E++ GQLL+VP+++VV K+A  EG    +  T    ++  +AGK+S++ AL  +VL  ++ +S +
Subjt:  IYTVDSSVQVIYVVSGSGRVEIVGFNA-VKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSAD

P14323 Glutelin type-B 11.1e-2222.49Show/hide
Query:  RLVLRPRGFALPHYADSPKVGYVIQENDS------------KRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIPIPTGVVSWWYNDGDSDLEITFLGE
        R V++P+G  +P Y + P V Y+IQ   S            +++ +     G        +  +++ +  +GD + +P GV  W+YNDGD+ +   ++ +
Subjt:  RLVLRPRGFALPHYADSPKVGYVIQENDS------------KRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIPIPTGVVSWWYNDGDSDLEITFLGE

Query:  TSN--SHIHGEITYFLLAG-------------------AQGILSTFSPEFISKAYNLN----------KQETNKLTHSQTGILIIK--LQPNNSLPKPRG
         +N  + +      FLLAG                    Q I S F  E +S+A  +N            +  ++ H + G+ ++K  L       + + 
Subjt:  TSN--SHIHGEITYFLLAG-------------------AQGILSTFSPEFISKAYNLN----------KQETNKLTHSQTGILIIK--LQPNNSLPKPRG

Query:  HHNKMVF------------------------NLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRV
         + ++ +                        N+ N             ITS+   KFP +    +SA    L  N + SP + V++   ++Y++ G  RV
Subjt:  HHNKMVF------------------------NLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRV

Query:  EIV-GFNAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSAD
        ++V  F        ++ GQLL++P+++ V K+A  EG +  +I T     +  LAGK SV+ AL  +V+  ++ +S +
Subjt:  EIV-GFNAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSAD

Q02897 Glutelin type-B 21.7e-2321.93Show/hide
Query:  RLVLRPRGFALPHYADSPKVGYVIQENDS------------KRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIPIPTGVVSWWYNDGDSDLEITFLGE
        R V++P+G  +P Y+++P + Y+IQ   S            +++ +     G        +  +++ +  +GD + +P GV  W+YNDGD+ +   ++ +
Subjt:  RLVLRPRGFALPHYADSPKVGYVIQENDS------------KRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIPIPTGVVSWWYNDGDSDLEITFLGE

Query:  TSNS--HIHGEITYFLLAG-----------------AQGILSTFSPEFISKAYNLN----------KQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNK
         +NS   +      FLLAG                 +Q I + F  E +S+A  +N            +  ++ H + G+ ++K        + +  + +
Subjt:  TSNS--HIHGEITYFLLAG-----------------AQGILSTFSPEFISKAYNLN----------KQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNK

Query:  MVF------------------------NLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIV-
        + +                        N+ N             I+S+   KFP +    +SA    L  N + SP + V++   ++Y++ G  RV++V 
Subjt:  MVF------------------------NLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIV-

Query:  GFNAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSAD
         F        ++ GQLL++P+++ V K+A  EG +  +I T     +  LAGK SV+ AL  +V+  ++ +S +
Subjt:  GFNAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSAD

Q6K508 Glutelin type-D 15.1e-2324.04Show/hide
Query:  QIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFP-----------------------
        ++ SE+    Y+   + QF     A V   R V+ P+G  +P Y+++P + Y+IQ              G VGL FP                       
Subjt:  QIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFP-----------------------

Query:  -NESKEQVLKLSKGDAIPIPTGVVSWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAG-----------------AQGILSTFSPEFISKAYNLNKQ
          +  +++ +  +GD + +P  V  W+YN GD+   + ++ +  +  + +      FLLAG                  Q I S F+ E +S+A  +N +
Subjt:  -NESKEQVLKLSKGDAIPIPTGVVSWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAG-----------------AQGILSTFSPEFISKAYNLNKQ

Query:  ETNKL--THSQTGILIIKLQPNNSLPKP------------------RGHHNKMVFNLANAE-----PNHHKAEF----GVSITSLEESKFPFIGQTGLSA
         + +L   + Q G  II+++    L KP                   G +N +  N    +      N  +A++       IT L   KFP +   G+ A
Subjt:  ETNKL--THSQTGILIIKLQPNNSLPKP------------------RGHHNKMVFNLANAE-----PNHHKAEF----GVSITSLEESKFPFIGQTGLSA

Query:  MLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVG------FNAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWE
            L  N + SP + +++   V+Y++ GS RV++        FN V     +  GQLL++P+   V K+A   G +  +I T + P +  +AGK S+  
Subjt:  MLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVG------FNAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWE

Query:  ALSEEVLQVSFNVSAD
        AL  +V+  ++ +S D
Subjt:  ALSEEVLQVSFNVSAD

Arabidopsis top hitse value%identityAlignment
AT1G03880.1 cruciferin 27.8e-1923.41Show/hide
Query:  ESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQEN---------------DSKRKHEHAGHNGVVGLVFPNESKEQVLKLSK
        +S+GG    W     P L  +  A  R V+ P+G  LP + ++ K+ +V+                  +S    E  G     G     +  ++V  L  
Subjt:  ESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQEN---------------DSKRKHEHAGHNGVVGLVFPNESKEQVLKLSK

Query:  GDAIPIPTGVVSWWYNDGDSDLEITFLGE--TSNSHIHGEITYFLLAG----------------AQGILSTFSPEFISKAYNLNKQETNKLTHSQTGI-L
        GD I  P+GV  W+YN+G+  L +    +  ++ + +   +  FL+AG                   I + F+PE +++A+ +N +   +L + Q     
Subjt:  GDAIPIPTGVVSWWYNDGDSDLEITFLGE--TSNSHIHGEITYFLLAG----------------AQGILSTFSPEFISKAYNLNKQETNKLTHSQTGI-L

Query:  IIKLQPNNSLPKP---RGHHNKMVFNLAN------------------AEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSV
        I+K+     + +P   RG   +    +AN                  ++ + +K   G  I++L     P +    LSA+   +  N +  P + V+++ 
Subjt:  IIKLQPNNSLPKP---RGHHNKMVFNLAN------------------AEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSV

Query:  QVIYVVSGSGRVEIVGFNAVKM-AAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEK
          +YV +G   +++V  N  ++   E+ +GQLLVVP+ F V K A  E  E     T     +  LAG+ SV   L  EV+   + +S +  K
Subjt:  QVIYVVSGSGRVEIVGFNAVKM-AAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEK

AT1G03890.1 RmlC-like cupins superfamily protein9.8e-2225.87Show/hide
Query:  PVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQ-------------------ENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIPIPTGVV
        P L  A V   R+ L+P    LP +   P + YV+Q                   E  S R     G  G  G  F  +  +++    +GD      GV 
Subjt:  PVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQ-------------------ENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIPIPTGVV

Query:  SWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAGAQ--------------GILSTFSPEFISKAYNLNKQETNKLTHSQ--TGILIIKLQP-NNSLP
         WWYN GDSD  I  + + +N  + +      F LAG++                 S F P  I++A+ +N +   +L + +   G +I    P +  +P
Subjt:  SWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAGAQ--------------GILSTFSPEFISKAYNLNKQETNKLTHSQ--TGILIIKLQP-NNSLP

Query:  KPR-----GHHN---------KMVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFN
         PR     G  N         K+  N+ + E + H +     I++L     P +    L+A+   L   G+  P +T ++   V+YV  G  ++++V  N
Subjt:  KPR-----GHHN---------KMVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFN

Query:  AVKMAAE-VKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEK
           +  E V  GQ++V+P+ F V K AGE G E  S  T     I  L+G+ S   A+  +V++ S+ V+ +  K
Subjt:  AVKMAAE-VKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEK

AT1G07750.1 RmlC-like cupins superfamily protein3.3e-7040.06Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG
        ME DL P   +     DGG Y  W   + P+L Q N+ A +L L   GFA+P Y+DS KV YV+Q             +G  G+V P E +E+V+ + +G
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG

Query:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNK-
        D+I +P GVV+WW+N+ D +L I FLGET   H  G+ T F L G  GI + FS EF+ +A++L++    KL  SQTG  I+KL     +P+P+  +   
Subjt:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNK-

Query:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV
         V N   A P     + G  +  L     P +G+ G  A L ++  + + SP ++ DS++QV Y+V GSGRV++VG +  + +   +KAG L +VP++FV
Subjt:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV

Query:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFIS
        V K A  +GM  FSIVTT  P+   LAG  SVW++LS EVLQ +F V+ + EK F S
Subjt:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFIS

AT2G28680.1 RmlC-like cupins superfamily protein7.9e-7240.78Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG
        ME DL P   +     DGG Y+ W   + P+L   N+ A +L L   G ALP Y+DSPKV YV+Q              G  G+V P E +E+V+ + KG
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKG

Query:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKP-RGHHNK
        D+I +P GVV+WW+N+ D++L + FLGET   H  G+ T F L G+ GI + FS EF+ +A++L++    KL  SQTG  I+K+  +  +P+P +G    
Subjt:  DAIPIPTGVVSWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKP-RGHHNK

Query:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV
         V N   A P     + G  +  L     P +G+ G  A L ++  + + SP ++ DS++QV Y+V GSGRV+IVG +  + +   VKAG L +VP++FV
Subjt:  MVFNLANAEPNHHKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVK-MAAEVKAGQLLVVPKYFV

Query:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK
        V K A  +G+  FSIVTT  P+   LAG+ SVW+ALS EVLQ +F V  + EK F SK
Subjt:  VGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSADFEKMFISK

AT5G44120.3 RmlC-like cupins superfamily protein6.4e-1321.99Show/hide
Query:  PVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFP-------------------------NESKEQVLKLSKGDAIP
        P L  + V+  R ++  +G  LP + ++ K+ +V            A   G++G V P                          +  ++V  +  GD I 
Subjt:  PVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFP-------------------------NESKEQVLKLSKGDAIP

Query:  IPTGVVSWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAG----------------AQGILSTFSPEFISKAYNLNKQETNKLTHSQTGI-LIIKLQ
           GV  W+YNDG   L I  + + ++  + +      F LAG                 + I + F PE I++A  ++ Q   +L +       I+++Q
Subjt:  IPTGVVSWWYNDGDSDLEITFLGETSN--SHIHGEITYFLLAG----------------AQGILSTFSPEFISKAYNLNKQETNKLTHSQTGI-LIIKLQ

Query:  PNNSLPKP--RG---------------HHNKMVFNLANA-------EPNH---HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSS
            + +P  RG               H N +   + +A       +P+    +K + G  I++L     P +    LSA+   +  N +  P +  +++
Subjt:  PNNSLPKP--RG---------------HHNKMVFNLANA-------EPNH---HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSS

Query:  VQVIYVVSGSGRVEIVGFNAVKM-AAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSAD
          ++YV  G  +++IV  N  ++   +V  GQL+ VP+ F V K A     +     T     I  LAG+ SV   L  EV+   F +S +
Subjt:  VQVIYVVSGSGRVEIVGFNAVKM-AAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSAD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTCGACTTGAAGCCAATGACTGACCAAATCTTCTCGGAGAGCGACGGAGGAGTCTACTACCATTGGCCTTCTTCTCAGTTTCCAGTGCTTTCCCAAGCCAATGT
CGCCGCCGGGAGACTCGTCCTCCGTCCCCGTGGCTTCGCTCTTCCTCATTACGCGGATTCTCCGAAAGTAGGCTATGTTATACAAGAAAATGATTCCAAACGTAAGCACG
AACATGCAGGACACAACGGAGTCGTAGGACTAGTGTTTCCAAATGAGTCGAAAGAGCAAGTTCTGAAGCTCAGTAAAGGAGATGCGATTCCGATTCCCACCGGAGTCGTC
TCCTGGTGGTACAACGACGGAGATTCCGACTTAGAAATCACATTCCTCGGCGAGACTTCAAATTCCCACATCCATGGCGAAATCACCTACTTCCTTCTCGCCGGAGCCCA
AGGAATCCTGTCAACTTTCTCGCCGGAATTCATCTCCAAGGCCTATAACCTAAACAAACAAGAAACAAACAAGCTCACCCACAGCCAAACAGGAATTTTGATCATCAAGC
TGCAGCCAAACAACTCCTTACCAAAGCCACGAGGCCATCACAACAAGATGGTGTTCAACTTGGCAAATGCAGAGCCCAACCATCATAAGGCTGAATTTGGTGTGTCGATT
ACAAGTTTGGAAGAATCCAAGTTTCCATTCATTGGACAAACTGGGCTGAGTGCAATGCTCGAGAAACTCTGTCCCAATGGAGTTCGTTCTCCCATTTACACTGTTGATTC
GTCAGTTCAAGTGATCTACGTCGTAAGCGGGTCGGGTCGGGTCGAGATTGTCGGGTTCAACGCCGTGAAGATGGCTGCTGAAGTGAAGGCTGGTCAGTTGCTGGTGGTTC
CCAAGTACTTTGTGGTTGGCAAAGAAGCTGGTGAAGAGGGAATGGAGTGCTTCTCCATTGTCACCACCACCAAGCCTGTGATAGAGGAGTTGGCTGGGAAGGCATCAGTG
TGGGAGGCATTATCTGAGGAAGTTTTACAAGTGTCTTTTAATGTATCTGCTGATTTTGAAAAGATGTTCATTTCAAAAGCAGCTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTCGACTTGAAGCCAATGACTGACCAAATCTTCTCGGAGAGCGACGGAGGAGTCTACTACCATTGGCCTTCTTCTCAGTTTCCAGTGCTTTCCCAAGCCAATGT
CGCCGCCGGGAGACTCGTCCTCCGTCCCCGTGGCTTCGCTCTTCCTCATTACGCGGATTCTCCGAAAGTAGGCTATGTTATACAAGAAAATGATTCCAAACGTAAGCACG
AACATGCAGGACACAACGGAGTCGTAGGACTAGTGTTTCCAAATGAGTCGAAAGAGCAAGTTCTGAAGCTCAGTAAAGGAGATGCGATTCCGATTCCCACCGGAGTCGTC
TCCTGGTGGTACAACGACGGAGATTCCGACTTAGAAATCACATTCCTCGGCGAGACTTCAAATTCCCACATCCATGGCGAAATCACCTACTTCCTTCTCGCCGGAGCCCA
AGGAATCCTGTCAACTTTCTCGCCGGAATTCATCTCCAAGGCCTATAACCTAAACAAACAAGAAACAAACAAGCTCACCCACAGCCAAACAGGAATTTTGATCATCAAGC
TGCAGCCAAACAACTCCTTACCAAAGCCACGAGGCCATCACAACAAGATGGTGTTCAACTTGGCAAATGCAGAGCCCAACCATCATAAGGCTGAATTTGGTGTGTCGATT
ACAAGTTTGGAAGAATCCAAGTTTCCATTCATTGGACAAACTGGGCTGAGTGCAATGCTCGAGAAACTCTGTCCCAATGGAGTTCGTTCTCCCATTTACACTGTTGATTC
GTCAGTTCAAGTGATCTACGTCGTAAGCGGGTCGGGTCGGGTCGAGATTGTCGGGTTCAACGCCGTGAAGATGGCTGCTGAAGTGAAGGCTGGTCAGTTGCTGGTGGTTC
CCAAGTACTTTGTGGTTGGCAAAGAAGCTGGTGAAGAGGGAATGGAGTGCTTCTCCATTGTCACCACCACCAAGCCTGTGATAGAGGAGTTGGCTGGGAAGGCATCAGTG
TGGGAGGCATTATCTGAGGAAGTTTTACAAGTGTCTTTTAATGTATCTGCTGATTTTGAAAAGATGTTCATTTCAAAAGCAGCTAATTGA
Protein sequenceShow/hide protein sequence
MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVIQENDSKRKHEHAGHNGVVGLVFPNESKEQVLKLSKGDAIPIPTGVV
SWWYNDGDSDLEITFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETNKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFNLANAEPNHHKAEFGVSI
TSLEESKFPFIGQTGLSAMLEKLCPNGVRSPIYTVDSSVQVIYVVSGSGRVEIVGFNAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASV
WEALSEEVLQVSFNVSADFEKMFISKAAN