; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019590 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019590
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionglutelin type-A 2-like
Genome locationchr5:43600534..43605662
RNA-Seq ExpressionLag0019590
SyntenyLag0019590
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006044 - 11-S seed storage protein, plant
IPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99759.1 glutelin type-A 2-like [Cucumis melo var. makuwa]1.5e-10956.21Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV QG +GV G VFPN+  E V+KL KGD IP+P+G+ SWW+N
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN

Query:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF
        DGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ET+K   SQ+ +LI  +QP+ SLPKP   H+K+V+ +  A P+++A+ 
Subjt:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF

Query:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT
        G  ++T + ES FPFIGQTGL+A+LEKL  + +RSP+Y A+ S Q+IYV  GSG++++VGF + K  A+VK GQL++VP+YF VGK AGEEG+EC S++ 
Subjt:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT

Query:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLF
         T P++EELAGK SV EALS EV QVSFNV+ +FEKLF
Subjt:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLF

XP_004151504.1 legumin J [Cucumis sativus]2.6e-10654.84Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN
        ++ M  + F E +GG Y+ W  S +P+L+Q NVA GRL+LRPRGFA+PHY+D  K GYV QG +GV G VFP +  E V+KL KGD IP+P GV SWW+N
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN

Query:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF
        DGDSDLEI FLGET  +H+ G+ITYF+L+G +G+L  F+PE++ K+ +LN++ET+    SQ  +LI  +QP+ SLPKP   ++K+V+ +  A P+++A+ 
Subjt:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF

Query:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT
        G  ++T + ES FPFIGQTGL+ +LEKL  + +RSP+Y A+ S Q+IYV  GSG++++VGF + K  A+VK GQL++VP+YF VGK AGEEG+EC S++ 
Subjt:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT

Query:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK
         T P++EELAGK SV EALS EV QVSFNV+ +FEKLF SK
Subjt:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK

XP_008456076.1 PREDICTED: glutelin type-A 2-like [Cucumis melo]3.0e-11056.3Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV QG +GV G VFPN+  E V+KL KGD IP+P+G+ SWW+N
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN

Query:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF
        DGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ET+K   SQ+ +LI  +QP+ SLPKP   H+K+V+ +  A P+++A+ 
Subjt:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF

Query:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT
        G  ++T + ES FPFIGQTGL+A+LEKL  + +RSP+Y A+ S Q+IYV  GSG++++VGF + K  A+VK GQL++VP+YF VGK AGEEG+EC S++ 
Subjt:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT

Query:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK
         T P++EELAGK SV EALS EV QVSFNV+ +FEKLF SK
Subjt:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK

XP_030937791.1 glutelin type-A 3-like [Quercus lobata]7.6e-10656.65Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS
        MEFDL P   Q   E +GG YY W SS+FP+L +  V AG+LVL+PRGFALPHYADS KVGYV  G +G+VG+VFP  SKE VLKL KGD IP+  G VS
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS

Query:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHH-NKMVFKLANAEPN
        WW+NDGDS+L I FLGETS S+I GE TYF L+GA GI+  FSPEF+S+AY +NK E +KL  SQTG+LIIKLQ   ++PKP  ++ NK+++ +    P+
Subjt:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHH-NKMVFKLANAEPN

Query:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC
         + + G  +T+L E+KFPF+ Q  LS  L +L  + + SP YT DSSVQ+IYVV G G+++IVG +  + +  EVK G LLVVP++FVV K AG +G+EC
Subjt:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC

Query:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK
        FSI T+ KP  E LA KASVW+ALS  V++ S NVST+ E+LF SK
Subjt:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK

XP_030955195.1 glutelin type-A 3-like [Quercus lobata]2.9e-10556.36Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS
        MEFDL P   Q   E +GG YY W SS+FP+L +  V AG+LVL+PRGFALPHYA S KVGYV  G +G+VG+VFP  SKE VLKL KGD IP+  G VS
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS

Query:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHH-NKMVFKLANAEPN
        WW+NDGDS+L I FLGETS S+I GE TYF L+GA GI+  FSPEF+S+AY +NK E +KL  SQTG+LIIKLQ   ++PKP  ++ NK+++ +    P+
Subjt:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHH-NKMVFKLANAEPN

Query:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC
         + + G  +T+L E+KFPF+ Q  LS  L +L  + + SP YT DSSVQ+IYVV G G+++IVG +  + +  EVK G LLVVP++FVV K AG +G+EC
Subjt:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC

Query:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK
        FSI T+ KP  E LA KASVW+ALS  V++ S NVST+ E+LF SK
Subjt:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK

TrEMBL top hitse value%identityAlignment
A0A0A0L6K0 Uncharacterized protein1.3e-10654.84Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN
        ++ M  + F E +GG Y+ W  S +P+L+Q NVA GRL+LRPRGFA+PHY+D  K GYV QG +GV G VFP +  E V+KL KGD IP+P GV SWW+N
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN

Query:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF
        DGDSDLEI FLGET  +H+ G+ITYF+L+G +G+L  F+PE++ K+ +LN++ET+    SQ  +LI  +QP+ SLPKP   ++K+V+ +  A P+++A+ 
Subjt:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF

Query:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT
        G  ++T + ES FPFIGQTGL+ +LEKL  + +RSP+Y A+ S Q+IYV  GSG++++VGF + K  A+VK GQL++VP+YF VGK AGEEG+EC S++ 
Subjt:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT

Query:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK
         T P++EELAGK SV EALS EV QVSFNV+ +FEKLF SK
Subjt:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK

A0A1S3C2D5 glutelin type-A 2-like1.4e-11056.3Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV QG +GV G VFPN+  E V+KL KGD IP+P+G+ SWW+N
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN

Query:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF
        DGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ET+K   SQ+ +LI  +QP+ SLPKP   H+K+V+ +  A P+++A+ 
Subjt:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF

Query:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT
        G  ++T + ES FPFIGQTGL+A+LEKL  + +RSP+Y A+ S Q+IYV  GSG++++VGF + K  A+VK GQL++VP+YF VGK AGEEG+EC S++ 
Subjt:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT

Query:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK
         T P++EELAGK SV EALS EV QVSFNV+ +FEKLF SK
Subjt:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK

A0A5A7T7U8 Glutelin type-A 2-like1.4e-11056.3Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV QG +GV G VFPN+  E V+KL KGD IP+P+G+ SWW+N
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN

Query:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF
        DGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ET+K   SQ+ +LI  +QP+ SLPKP   H+K+V+ +  A P+++A+ 
Subjt:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF

Query:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT
        G  ++T + ES FPFIGQTGL+A+LEKL  + +RSP+Y A+ S Q+IYV  GSG++++VGF + K  A+VK GQL++VP+YF VGK AGEEG+EC S++ 
Subjt:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT

Query:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK
         T P++EELAGK SV EALS EV QVSFNV+ +FEKLF SK
Subjt:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK

A0A5D3BLA4 Glutelin type-A 2-like7.2e-11056.21Show/hide
Query:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN
        ++ M  + F E +GG Y  W  S +P+L+Q NVA GRL+LRPRGFA+PHYAD  K GYV QG +GV G VFPN+  E V+KL KGD IP+P+G+ SWW+N
Subjt:  LKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYN

Query:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF
        DGDSDLEI FLGET N+H+ G+ITYF+L+G +G+L  F+PE++ K+Y+L+++ET+K   SQ+ +LI  +QP+ SLPKP   H+K+V+ +  A P+++A+ 
Subjt:  DGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEF

Query:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT
        G  ++T + ES FPFIGQTGL+A+LEKL  + +RSP+Y A+ S Q+IYV  GSG++++VGF + K  A+VK GQL++VP+YF VGK AGEEG+EC S++ 
Subjt:  G-VSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVT

Query:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLF
         T P++EELAGK SV EALS EV QVSFNV+ +FEKLF
Subjt:  TTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLF

A0A7N2L1W4 Uncharacterized protein1.4e-10556.36Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS
        MEFDL P   Q   E +GG YY W SS+FP+L +  V AG+LVL+PRGFALPHYA S KVGYV  G +G+VG+VFP  SKE VLKL KGD IP+  G VS
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS

Query:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHH-NKMVFKLANAEPN
        WW+NDGDS+L I FLGETS S+I GE TYF L+GA GI+  FSPEF+S+AY +NK E +KL  SQTG+LIIKLQ   ++PKP  ++ NK+++ +    P+
Subjt:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHH-NKMVFKLANAEPN

Query:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC
         + + G  +T+L E+KFPF+ Q  LS  L +L  + + SP YT DSSVQ+IYVV G G+++IVG +  + +  EVK G LLVVP++FVV K AG +G+EC
Subjt:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC

Query:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK
        FSI T+ KP  E LA KASVW+ALS  V++ S NVST+ E+LF SK
Subjt:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK

SwissProt top hitse value%identityAlignment
A0A222NNM9 Cocosin 14.0e-2525.9Show/hide
Query:  TDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP--------------NESKEH---------VLK
        T  + SE+    Y+   + QF     A V+  R V+ PRG  LP  +++P++ Y+ QG  G+VGLV P               E + H         V +
Subjt:  TDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP--------------NESKEH---------VLK

Query:  LSKGDAIPIPTGVVSWWYNDGDSDLEIAFLGETSN--SHIHGEITYFLLAGAQ---------------GILSTFSPEFISKAYNLNKQETDKL-------
          +GD + +P G   W YN+G++ +    + +TSN  + +      FLLAG Q                IL  FS E ++ A+ +N +   KL       
Subjt:  LSKGDAIPIPTGVVSWWYNDGDSDLEIAFLGETSN--SHIHGEITYFLLAGAQ---------------GILSTFSPEFISKAYNLNKQETDKL-------

Query:  ---THSQTGILIIKLQPNNSLPKPRGH--------HNKMVFKLANAEPNHKAEF---GVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSV
             ++ G+ +++        +  G         +  M  K    +P     F   G  IT+L   K P +    +SA    L  + + SP +  ++  
Subjt:  ---THSQTGILIIKLQPNNSLPKPRGH--------HNKMVFKLANAEPNHKAEF---GVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSV

Query:  QVIYVVNGSGRVEIV-GFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD
         ++Y   G GRVE+           E++ GQLL+VP+ F + + AG EG +  SI T+ + ++  + GK S    +  EVL  S+ +S D
Subjt:  QVIYVVNGSGRVEIV-GFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD

P07730 Glutelin type-A 27.6e-2424.35Show/hide
Query:  VAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP--------------------NESKEHVLK--------LSKGDAIPIPTGVVSWWYNDGDS
        V+  R V+ PRG  LPHY +   + Y+ QG  G+ G  FP                    ++S+ H  K          +GD I +P GV  W YNDG+ 
Subjt:  VAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP--------------------NESKEHVLK--------LSKGDAIPIPTGVVSWWYNDGDS

Query:  DLEIAFLGETSN--SHIHGEITYFLLAG---------------AQGILSTFSPEFISKAYNLNKQETDKL--THSQTGILI------IKLQPNNSLP---
         +   ++ + +N  + +      FLLAG               +Q I S FS E +S+A+ ++ Q   +L   + Q G ++        LQP  SL    
Subjt:  DLEIAFLGETSN--SHIHGEITYFLLAG---------------AQGILSTFSPEFISKAYNLNKQETDKL--THSQTGILI------IKLQPNNSLP---

Query:  ----KPRGHHNKMVFKLAN----------------------AEPNHKAEFGV---SITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIY
            + R H+ +  ++ +                         PN    +      +T+L    FP +    +SA+   L  + + SP +  ++   ++Y
Subjt:  ----KPRGHHNKMVFKLAN----------------------AEPNHKAEFGV---SITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIY

Query:  VVNGSGRVEIVGFDA-VKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD
        +  G  +V++V  +       E++ GQLL+VP+++VV K+A  EG    +  T    ++  +AGK+S++ AL  +VL  ++ +S +
Subjt:  VVNGSGRVEIVGFDA-VKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD

P14323 Glutelin type-B 12.2e-2322.96Show/hide
Query:  RLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP------------------------NESKEHVLKLSKGDAIPIPTGVVSWWYNDGDSDLEIAFLG
        R V++P+G  +P Y + P V Y+ QG  G +GL FP                         +  + + +  +GD + +P GV  W+YNDGD+ +   ++ 
Subjt:  RLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP------------------------NESKEHVLKLSKGDAIPIPTGVVSWWYNDGDSDLEIAFLG

Query:  ETSN--SHIHGEITYFLLAG-------------------AQGILSTFSPEFISKAYNLN----------KQETDKLTHSQTGILIIK--LQPNNSLPKPR
        + +N  + +      FLLAG                    Q I S F  E +S+A  +N            +  ++ H + G+ ++K  L       + +
Subjt:  ETSN--SHIHGEITYFLLAG-------------------AQGILSTFSPEFISKAYNLN----------KQETDKLTHSQTGILIIK--LQPNNSLPKPR

Query:  GHHNKMVFK---------------------LANAEPNHKAEF----GVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGR
          + ++ +                        N E   +A+        ITS+   KFP +    +SA    L  + + SP +  ++   ++Y++ G  R
Subjt:  GHHNKMVFK---------------------LANAEPNHKAEF----GVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGR

Query:  VEIV-GFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD
        V++V  F        ++ GQLL++P+++ V K+A  EG +  +I T     +  LAGK SV+ AL  +V+  ++ +S +
Subjt:  VEIV-GFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD

Q02897 Glutelin type-B 22.6e-2422.67Show/hide
Query:  RLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP------------------------NESKEHVLKLSKGDAIPIPTGVVSWWYNDGDSDLEIAFLG
        R V++P+G  +P Y+++P + Y+ QG  G +GL FP                         +  + + +  +GD + +P GV  W+YNDGD+ +   ++ 
Subjt:  RLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP------------------------NESKEHVLKLSKGDAIPIPTGVVSWWYNDGDSDLEIAFLG

Query:  ETSNS--HIHGEITYFLLAG-----------------AQGILSTFSPEFISKAYNLN----------KQETDKLTHSQTGILIIKLQPNNSLPKPRGHHN
        + +NS   +      FLLAG                 +Q I + F  E +S+A  +N            +  ++ H + G+ ++K        + +  + 
Subjt:  ETSNS--HIHGEITYFLLAG-----------------AQGILSTFSPEFISKAYNLN----------KQETDKLTHSQTGILIIKLQPNNSLPKPRGHHN

Query:  KMVF--------KLANAEPNH---KAEFGVS--------------ITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIV
        ++ +        +    E N    KA   +               I+S+   KFP +    +SA    L  + + SP +  ++   ++Y++ G  RV++V
Subjt:  KMVF--------KLANAEPNH---KAEFGVS--------------ITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIV

Query:  -GFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD
          F        ++ GQLL++P+++ V K+A  EG +  +I T     +  LAGK SV+ AL  +V+  ++ +S +
Subjt:  -GFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD

Q6K508 Glutelin type-D 11.7e-2324.25Show/hide
Query:  QIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP------------------------NESKEHVLKL
        ++ SE+    Y+   + QF     A V   R V+ P+G  +P Y+++P + Y+ QG  G VGL FP                         +  + + + 
Subjt:  QIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP------------------------NESKEHVLKL

Query:  SKGDAIPIPTGVVSWWYNDGDSDLEIAFLGETSN--SHIHGEITYFLLAG-----------------AQGILSTFSPEFISKAYNLNKQETDKL--THSQ
         +GD + +P  V  W+YN GD+   + ++ +  +  + +      FLLAG                  Q I S F+ E +S+A  +N + + +L   + Q
Subjt:  SKGDAIPIPTGVVSWWYNDGDSDLEIAFLGETSN--SHIHGEITYFLLAG-----------------AQGILSTFSPEFISKAYNLNKQETDKL--THSQ

Query:  TGILIIKLQPNNSLPKP------------------RGHHNKM------VFKLANAEPNHKAEF----GVSITSLEESKFPFIGQTGLSAMLEKLCPSGVR
         G  II+++    L KP                   G +N +      +    N E   +A++       IT L   KFP +   G+ A    L  + + 
Subjt:  TGILIIKLQPNNSLPKP------------------RGHHNKM------VFKLANAEPNHKAEF----GVSITSLEESKFPFIGQTGLSAMLEKLCPSGVR

Query:  SPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEV-KAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD
        SP +  ++   V+Y++ GS RV++       +   V   GQLL++P+   V K+A   G +  +I T + P +  +AGK S+  AL  +V+  ++ +S D
Subjt:  SPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEV-KAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD

Arabidopsis top hitse value%identityAlignment
AT1G03880.1 cruciferin 24.0e-2023.53Show/hide
Query:  ESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP------------------------NESKEHVLKLSKGD
        +S+GG    W     P L  +  A  R V+ P+G  LP + ++ K+ +V  G  G++G V P                         +  + V  L  GD
Subjt:  ESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP------------------------NESKEHVLKLSKGD

Query:  AIPIPTGVVSWWYNDGDSDLEIAFLGE--TSNSHIHGEITYFLLAG----------------AQGILSTFSPEFISKAYNLNKQETDKLTHSQTGI-LII
         I  P+GV  W+YN+G+  L +    +  ++ + +   +  FL+AG                   I + F+PE +++A+ +N +   +L + Q     I+
Subjt:  AIPIPTGVVSWWYNDGDSDLEIAFLGE--TSNSHIHGEITYFLLAG----------------AQGILSTFSPEFISKAYNLNKQETDKLTHSQTGI-LII

Query:  KLQPNNSLPKP---RGHHNKMVFKLANA---------------EPN----HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQV
        K+     + +P   RG   +   ++AN                +P+    +K   G  I++L     P +    LSA+   +  + +  P +  +++   
Subjt:  KLQPNNSLPKP---RGHHNKMVFKLANA---------------EPN----HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQV

Query:  IYVVNGSGRVEIVGFDAVKM-AAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEK
        +YV NG   +++V  +  ++   E+ +GQLLVVP+ F V K A  E  E     T     +  LAG+ SV   L  EV+   + +S +  K
Subjt:  IYVVNGSGRVEIVGFDAVKM-AAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEK

AT1G03890.1 RmlC-like cupins superfamily protein3.6e-2125.14Show/hide
Query:  PVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP-------------------------NESKEHVLKLSKGDAIPIPTGVVSWWYN
        P L  A V   R+ L+P    LP +   P + YV QG  GV+G +                            +  + +    +GD      GV  WWYN
Subjt:  PVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP-------------------------NESKEHVLKLSKGDAIPIPTGVVSWWYN

Query:  DGDSDLEIAFLGETSN--SHIHGEITYFLLAGAQ--------------GILSTFSPEFISKAYNLNKQETDKLTHSQ--TGILIIKLQP-NNSLPKPR--
         GDSD  I  + + +N  + +      F LAG++                 S F P  I++A+ +N +   +L + +   G +I    P +  +P PR  
Subjt:  DGDSDLEIAFLGETSN--SHIHGEITYFLLAGAQ--------------GILSTFSPEFISKAYNLNKQETDKLTHSQ--TGILIIKLQP-NNSLPKPR--

Query:  ---GHHNKMVFKLANA-------EPNHKAEFGV---SITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMA
           G  N +      A       +P     F      I++L     P +    L+A+   L   G+  P +TA++   V+YV  G  ++++V  +   + 
Subjt:  ---GHHNKMVFKLANA-------EPNHKAEFGV---SITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMA

Query:  AE-VKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEK
         E V  GQ++V+P+ F V K AGE G E  S  T     I  L+G+ S   A+  +V++ S+ V+ +  K
Subjt:  AE-VKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEK

AT1G07750.1 RmlC-like cupins superfamily protein9.0e-7341.45Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS
        ME DL P   +     DGG Y  W   + P+L Q N+ A +L L   GFA+P Y+DS KV YV QG +G  G+V P E +E V+ + +GD+I +P GVV+
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS

Query:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNK-MVFKLANAEPN
        WW+N+ D +L I FLGET   H  G+ T F L G  GI + FS EF+ +A++L++    KL  SQTG  I+KL     +P+P+  +    V     A  +
Subjt:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNK-MVFKLANAEPN

Query:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC
           + G  +  L     P +G+ G  A L ++    + SP ++ DS++QV Y+V GSGRV++VG D  + +   +KAG L +VP++FVV K A  +GM  
Subjt:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC

Query:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFIS
        FSIVTT  P+   LAG  SVW++LS EVLQ +F V+ + EK F S
Subjt:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFIS

AT2G28680.1 RmlC-like cupins superfamily protein1.6e-7442.2Show/hide
Query:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS
        ME DL P   +     DGG Y+ W   + P+L   N+ A +L L   G ALP Y+DSPKV YV QG  G  G+V P E +E V+ + KGD+I +P GVV+
Subjt:  MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVS

Query:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKP-RGHHNKMVFKLANAEPN
        WW+N+ D++L + FLGET   H  G+ T F L G+ GI + FS EF+ +A++L++    KL  SQTG  I+K+  +  +P+P +G     V     A  +
Subjt:  WWYNDGDSDLEIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKP-RGHHNKMVFKLANAEPN

Query:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC
           + G  +  L     P +G+ G  A L ++    + SP ++ DS++QV Y+V GSGRV+IVG D  + +   VKAG L +VP++FVV K A  +G+  
Subjt:  HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVK-MAAEVKAGQLLVVPKYFVVGKEAGEEGMEC

Query:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK
        FSIVTT  P+   LAG+ SVW+ALS EVLQ +F V  + EK F SK
Subjt:  FSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTDFEKLFISK

AT5G44120.3 RmlC-like cupins superfamily protein1.1e-1422.37Show/hide
Query:  PVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP-------------------------NESKEHVLKLSKGDAIPIPTGVVSWWYN
        P L  + V+  R ++  +G  LP + ++ K+ +V +G  G++G V P                          +  + V  +  GD I    GV  W+YN
Subjt:  PVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFP-------------------------NESKEHVLKLSKGDAIPIPTGVVSWWYN

Query:  DGDSDLEIAFLGETSN--SHIHGEITYFLLAG----------------AQGILSTFSPEFISKAYNLNKQETDKLTHSQTGI-LIIKLQPNNSLPKP--R
        DG   L I  + + ++  + +      F LAG                 + I + F PE I++A  ++ Q   +L +       I+++Q    + +P  R
Subjt:  DGDSDLEIAFLGETSN--SHIHGEITYFLLAG----------------AQGILSTFSPEFISKAYNLNKQETDKLTHSQTGI-LIIKLQPNNSLPKP--R

Query:  G---------------HHNKMVFKLANA-------EPN----HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSG
        G               H N +   + +A       +P+    +K + G  I++L     P +    LSA+   +  + +  P + A+++  ++YV +G  
Subjt:  G---------------HHNKMVFKLANA-------EPN----HKAEFGVSITSLEESKFPFIGQTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSG

Query:  RVEIVGFDAVKM-AAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD
        +++IV  +  ++   +V  GQL+ VP+ F V K A     +     T     I  LAG+ SV   L  EV+   F +S +
Subjt:  RVEIVGFDAVKM-AAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVSFNVSTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTCGACTTGAAGCCAATGACTGACCAAATCTTCTCGGAGAGCGACGGAGGAGTCTACTACCATTGGCCTTCTTCTCAGTTTCCAGTGCTTTCCCAAGCCAATGT
CGCCGCCGGGAGACTCGTCCTCCGTCCCCGTGGCTTCGCTCTCCCCCATTACGCCGATTCTCCGAAAGTTGGCTATGTTACACAAGGACACAACGGAGTCGTAGGACTAG
TGTTTCCAAACGAGTCGAAAGAGCATGTTTTGAAGCTCAGTAAAGGAGACGCGATTCCGATTCCCACCGGAGTCGTCTCCTGGTGGTACAACGACGGAGATTCTGACTTG
GAAATCGCATTCCTCGGCGAGACTTCAAATTCCCACATCCATGGAGAAATCACCTACTTCCTTCTCGCCGGAGCCCAAGGAATCCTGTCAACTTTCTCGCCGGAATTCAT
CTCCAAGGCCTATAACCTAAACAAACAAGAAACAGACAAGCTCACCCACAGCCAAACAGGAATTTTGATCATCAAGCTGCAGCCAAACAACTCCTTGCCAAAGCCACGAG
GCCATCACAACAAGATGGTGTTCAAGTTGGCAAATGCAGAGCCCAACCATAAGGCTGAATTTGGTGTGTCGATTACAAGTTTGGAAGAATCCAAGTTTCCATTCATTGGA
CAAACTGGGCTGAGTGCAATGCTCGAGAAACTCTGTCCCAGTGGAGTTCGTTCTCCCATTTACACTGCTGATTCGTCAGTTCAAGTGATCTATGTCGTCAACGGGTCGGG
TCGGGTCGAGATTGTCGGGTTCGACGCCGTGAAGATGGCGGCTGAAGTGAAGGCTGGGCAGTTGCTGGTGGTTCCCAAGTACTTTGTGGTTGGCAAAGAAGCTGGTGAAG
AGGGAATGGAGTGCTTCTCCATTGTCACCACCACTAAGCCTGTGATAGAGGAGTTGGCTGGGAAGGCATCAGTGTGGGAGGCATTATCTGAGGAAGTTTTACAAGTGTCT
TTTAATGTATCTACTGATTTTGAAAAGCTGTTTATTTCAAAACCAGCTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTCGACTTGAAGCCAATGACTGACCAAATCTTCTCGGAGAGCGACGGAGGAGTCTACTACCATTGGCCTTCTTCTCAGTTTCCAGTGCTTTCCCAAGCCAATGT
CGCCGCCGGGAGACTCGTCCTCCGTCCCCGTGGCTTCGCTCTCCCCCATTACGCCGATTCTCCGAAAGTTGGCTATGTTACACAAGGACACAACGGAGTCGTAGGACTAG
TGTTTCCAAACGAGTCGAAAGAGCATGTTTTGAAGCTCAGTAAAGGAGACGCGATTCCGATTCCCACCGGAGTCGTCTCCTGGTGGTACAACGACGGAGATTCTGACTTG
GAAATCGCATTCCTCGGCGAGACTTCAAATTCCCACATCCATGGAGAAATCACCTACTTCCTTCTCGCCGGAGCCCAAGGAATCCTGTCAACTTTCTCGCCGGAATTCAT
CTCCAAGGCCTATAACCTAAACAAACAAGAAACAGACAAGCTCACCCACAGCCAAACAGGAATTTTGATCATCAAGCTGCAGCCAAACAACTCCTTGCCAAAGCCACGAG
GCCATCACAACAAGATGGTGTTCAAGTTGGCAAATGCAGAGCCCAACCATAAGGCTGAATTTGGTGTGTCGATTACAAGTTTGGAAGAATCCAAGTTTCCATTCATTGGA
CAAACTGGGCTGAGTGCAATGCTCGAGAAACTCTGTCCCAGTGGAGTTCGTTCTCCCATTTACACTGCTGATTCGTCAGTTCAAGTGATCTATGTCGTCAACGGGTCGGG
TCGGGTCGAGATTGTCGGGTTCGACGCCGTGAAGATGGCGGCTGAAGTGAAGGCTGGGCAGTTGCTGGTGGTTCCCAAGTACTTTGTGGTTGGCAAAGAAGCTGGTGAAG
AGGGAATGGAGTGCTTCTCCATTGTCACCACCACTAAGCCTGTGATAGAGGAGTTGGCTGGGAAGGCATCAGTGTGGGAGGCATTATCTGAGGAAGTTTTACAAGTGTCT
TTTAATGTATCTACTGATTTTGAAAAGCTGTTTATTTCAAAACCAGCTAATTGA
Protein sequenceShow/hide protein sequence
MEFDLKPMTDQIFSESDGGVYYHWPSSQFPVLSQANVAAGRLVLRPRGFALPHYADSPKVGYVTQGHNGVVGLVFPNESKEHVLKLSKGDAIPIPTGVVSWWYNDGDSDL
EIAFLGETSNSHIHGEITYFLLAGAQGILSTFSPEFISKAYNLNKQETDKLTHSQTGILIIKLQPNNSLPKPRGHHNKMVFKLANAEPNHKAEFGVSITSLEESKFPFIG
QTGLSAMLEKLCPSGVRSPIYTADSSVQVIYVVNGSGRVEIVGFDAVKMAAEVKAGQLLVVPKYFVVGKEAGEEGMECFSIVTTTKPVIEELAGKASVWEALSEEVLQVS
FNVSTDFEKLFISKPAN