; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020692 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020692
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionglutelin type-A 2-like
Genome locationtig00153554:261803..265327
RNA-Seq ExpressionSgr020692
SyntenySgr020692
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006044 - 11-S seed storage protein, plant
IPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8009057.1 hypothetical protein FH972_005514 [Carpinus fangiana]1.1e-10959.65Show/hide
Query:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS
        MEFDL P    T  + DGG YY+WSSS+FP L +  V AG+LVLHP GFALPHYADS K+GYV QG  GVVG+VFPN SEE VLKL KGD IPVP G VS
Subjt:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS

Query:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPR-RLSGKMVFNLANAEAD
        WWFNGGDS+L I+FLGETS S+I GE TYFLL+GAQGI++ FSPE++S AYN +K+ ANKLA SQ G+LI+KLQ G  +P P    +GK+V+N+  A  D
Subjt:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPR-RLSGKMVFNLANAEAD

Query:  REATFGATVKTVKESKFPFIGRTG-LSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGME
             G  V  + + KF F+G+   LSASL KL    +RSP+Y ADS+VQ++YVA GSGRV+IVG  G +   +EVK GHL+VVP+YFV+ + AG DG+E
Subjt:  REATFGATVKTVKESKFPFIGRTG-LSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGME

Query:  CFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK
         FSI+T+  PV+EELAGK SVWEA S  V++ S +VS +F K+F SK
Subjt:  CFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK

TYJ99759.1 glutelin type-A 2-like [Cucumis melo var. makuwa]3.5e-10355.03Show/hide
Query:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN
        ++ M  + F E +GG+Y  W  S +P+L+  NVA GRL+L P GFA+PHYAD  K GYV QG  GV G VFPN+  E V+KL KGD IPVP+G+ SWWFN
Subjt:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN

Query:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF
         GDSDLEIIFLGET ++H+ G+ITYF+L+G +G+L  F+PEY+ K+Y+L++EE NK   SQ  +LI  +QP + LP P + S K+V+N+  A  D  A  
Subjt:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF

Query:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT
        GA   T V ES FPFIG+TGL+A LEKL    +RSPVY A+ S Q++YV  GSG++++VGF   K  A+VK G L++VP+YF +GK AGE+G+EC S++ 
Subjt:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT

Query:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMF
         T P++EELAGK SV EALS EV QVSF+V+A+FEK+F
Subjt:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMF

XP_008456076.1 PREDICTED: glutelin type-A 2-like [Cucumis melo]7.1e-10455.13Show/hide
Query:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN
        ++ M  + F E +GG+Y  W  S +P+L+  NVA GRL+L P GFA+PHYAD  K GYV QG  GV G VFPN+  E V+KL KGD IPVP+G+ SWWFN
Subjt:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN

Query:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF
         GDSDLEIIFLGET ++H+ G+ITYF+L+G +G+L  F+PEY+ K+Y+L++EE NK   SQ  +LI  +QP + LP P + S K+V+N+  A  D  A  
Subjt:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF

Query:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT
        GA   T V ES FPFIG+TGL+A LEKL    +RSPVY A+ S Q++YV  GSG++++VGF   K  A+VK G L++VP+YF +GK AGE+G+EC S++ 
Subjt:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT

Query:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK
         T P++EELAGK SV EALS EV QVSF+V+A+FEK+F SK
Subjt:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK

XP_023903934.1 glutelin type-A 3-like isoform X2 [Quercus suber]1.9e-10455.78Show/hide
Query:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS
        MEFDL P   +   E +GGAYY+WSSS+FP+L +  V AG+LVLHP GFALPHYADS KVGYV  GG G+VG+VFPN SEE VLKL KGD IPV  G VS
Subjt:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS

Query:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVP-RRLSGKMVFNLANAEAD
        WWFN GDS+L I+FLGETS S+I GE TYF L+GAQGI+  FSPE++S+AY +NK+EANKLA SQ G+LI+KLQ G+ +P P    + K+V+N+     D
Subjt:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVP-RRLSGKMVFNLANAEAD

Query:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGMEC
             G  V T+ ++KFPF+ +  LS  L +L    + SP YT DSSVQ++YV  G G+++IVG  G +    EVK GHL+VVP++FV+ K AG   +EC
Subjt:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGMEC

Query:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK
        FSI T+ +P  E+LA KASVW+ALS  V++ S +VS + E++F SK
Subjt:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK

XP_030937791.1 glutelin type-A 3-like [Quercus lobata]1.6e-10355.2Show/hide
Query:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS
        MEFDL P   +   E +GGAYY+WSSS+FP+L +  V AG+LVL P GFALPHYADS KVGYV  GG G+VG+VFP  S+E VLKL KGD IPV  G VS
Subjt:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS

Query:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVP-RRLSGKMVFNLANAEAD
        WWFN GDS+L I+FLGETS S+I GE TYF L+GA GI+  FSPE++S+AY +NK+EANKLA SQ G+LI+KLQ G+ +P P    + K+++N+     D
Subjt:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVP-RRLSGKMVFNLANAEAD

Query:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGMEC
             G  V T+ E+KFPF+ +  LS  L +L    + SP YT DSSVQ++YV  G G+++IVG  G +    EVK GHL+VVP++FV+ K AG DG+EC
Subjt:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGMEC

Query:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK
        FSI T+ +P  E LA KASVW+ALS  V++ S +VS + E++F SK
Subjt:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK

TrEMBL top hitse value%identityAlignment
A0A1S3C2D5 glutelin type-A 2-like3.5e-10455.13Show/hide
Query:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN
        ++ M  + F E +GG+Y  W  S +P+L+  NVA GRL+L P GFA+PHYAD  K GYV QG  GV G VFPN+  E V+KL KGD IPVP+G+ SWWFN
Subjt:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN

Query:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF
         GDSDLEIIFLGET ++H+ G+ITYF+L+G +G+L  F+PEY+ K+Y+L++EE NK   SQ  +LI  +QP + LP P + S K+V+N+  A  D  A  
Subjt:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF

Query:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT
        GA   T V ES FPFIG+TGL+A LEKL    +RSPVY A+ S Q++YV  GSG++++VGF   K  A+VK G L++VP+YF +GK AGE+G+EC S++ 
Subjt:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT

Query:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK
         T P++EELAGK SV EALS EV QVSF+V+A+FEK+F SK
Subjt:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK

A0A2N9F3X5 Uncharacterized protein2.4e-10556.07Show/hide
Query:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS
        MEFDL P   +   E DGGAYY+WSSS+FP+L +  V AG+L L P GFALPHYADS K+GYV QG  G+VG+V PN  EE VLKL KGD IPV  G VS
Subjt:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS

Query:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVP-RRLSGKMVFNLANAEAD
        WWFN GDS+L+++FLGETS S+I GE TYF L+G QGI++ FS E++S+AYN+NK++ANKLA SQ G L+ KLQ G+ LP P +  + K+V+N+  +  D
Subjt:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVP-RRLSGKMVFNLANAEAD

Query:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGMEC
         +   G  V ++ E+KFPF+G+  LSA+L K+    + SPVYT DSSVQ++YV  G+GR+EIVG  G +    EVK GHL+VVP++F   K AG DG+E 
Subjt:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGMEC

Query:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK
        FSI+TT +P++E+LA KASVWEALS  V++ S +VSA+F ++F SK
Subjt:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK

A0A5A7T7U8 Glutelin type-A 2-like3.5e-10455.13Show/hide
Query:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN
        ++ M  + F E +GG+Y  W  S +P+L+  NVA GRL+L P GFA+PHYAD  K GYV QG  GV G VFPN+  E V+KL KGD IPVP+G+ SWWFN
Subjt:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN

Query:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF
         GDSDLEIIFLGET ++H+ G+ITYF+L+G +G+L  F+PEY+ K+Y+L++EE NK   SQ  +LI  +QP + LP P + S K+V+N+  A  D  A  
Subjt:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF

Query:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT
        GA   T V ES FPFIG+TGL+A LEKL    +RSPVY A+ S Q++YV  GSG++++VGF   K  A+VK G L++VP+YF +GK AGE+G+EC S++ 
Subjt:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT

Query:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK
         T P++EELAGK SV EALS EV QVSF+V+A+FEK+F SK
Subjt:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK

A0A5D3BLA4 Glutelin type-A 2-like1.7e-10355.03Show/hide
Query:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN
        ++ M  + F E +GG+Y  W  S +P+L+  NVA GRL+L P GFA+PHYAD  K GYV QG  GV G VFPN+  E V+KL KGD IPVP+G+ SWWFN
Subjt:  LKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFN

Query:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF
         GDSDLEIIFLGET ++H+ G+ITYF+L+G +G+L  F+PEY+ K+Y+L++EE NK   SQ  +LI  +QP + LP P + S K+V+N+  A  D  A  
Subjt:  GGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATF

Query:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT
        GA   T V ES FPFIG+TGL+A LEKL    +RSPVY A+ S Q++YV  GSG++++VGF   K  A+VK G L++VP+YF +GK AGE+G+EC S++ 
Subjt:  GATVKT-VKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVT

Query:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMF
         T P++EELAGK SV EALS EV QVSF+V+A+FEK+F
Subjt:  TTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMF

A0A5N6QPH9 Uncharacterized protein5.5e-11059.65Show/hide
Query:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS
        MEFDL P    T  + DGG YY+WSSS+FP L +  V AG+LVLHP GFALPHYADS K+GYV QG  GVVG+VFPN SEE VLKL KGD IPVP G VS
Subjt:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS

Query:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPR-RLSGKMVFNLANAEAD
        WWFNGGDS+L I+FLGETS S+I GE TYFLL+GAQGI++ FSPE++S AYN +K+ ANKLA SQ G+LI+KLQ G  +P P    +GK+V+N+  A  D
Subjt:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPR-RLSGKMVFNLANAEAD

Query:  REATFGATVKTVKESKFPFIGRTG-LSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGME
             G  V  + + KF F+G+   LSASL KL    +RSP+Y ADS+VQ++YVA GSGRV+IVG  G +   +EVK GHL+VVP+YFV+ + AG DG+E
Subjt:  REATFGATVKTVKESKFPFIGRTG-LSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGME

Query:  CFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK
         FSI+T+  PV+EELAGK SVWEA S  V++ S +VS +F K+F SK
Subjt:  CFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK

SwissProt top hitse value%identityAlignment
A0A222NNM9 Cocosin 19.9e-2425.71Show/hide
Query:  SESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP-----------------------NESEEQVLKLNKGD
        SE+    Y++  + QF       V+  R V+ P G  LP  +++P++ Y+ Q G G+VGLV P                        +  ++V +  +GD
Subjt:  SESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP-----------------------NESEEQVLKLNKGD

Query:  AIPVPAGVVSWWFNGGDSDLEIIFLGETS--SSHIHGEITYFLLAGAQ---------------GILSTFSPEYISKAYNLNKEEANKLAC----------
         + VP G   W +N G++ +  I + +TS  ++ +      FLLAG Q                IL  FS E ++ A+ +N E A KL C          
Subjt:  AIPVPAGVVSWWFNGGDSDLEIIFLGETS--SSHIHGEITYFLLAGAQ---------------GILSTFSPEYISKAYNLNKEEANKLAC----------

Query:  SQKGILIVK--------LQPGRPLP--VPRRLSGKMVFNLAN-AEADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYV
        ++ G+ +++         + GR +        S K+  N+ +   AD     G  + T+   K P +    +SA    L    + SP +  ++   +MY 
Subjt:  SQKGILIVK--------LQPGRPLP--VPRRLSGKMVFNLAN-AEADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYV

Query:  ASGSGRVEIVGFGGLKT-AAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSAD
          G GRVE+    G      E++ G L++VP+ F + + AG +G +  SI T+ R ++  + GK S    +  EVL  S+ +S D
Subjt:  ASGSGRVEIVGFGGLKT-AAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSAD

P07730 Glutelin type-A 22.3e-2023.2Show/hide
Query:  VAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP----------------------------NESEEQVLKLNKGDAIPVPAGVVSWWFNGGDS
        V+  R V+ P G  LPHY +   + Y+ Q G G+ G  FP                             +  +++ +  +GD I +PAGV  W +N G+ 
Subjt:  VAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP----------------------------NESEEQVLKLNKGDAIPVPAGVVSWWFNGGDS

Query:  DLEIIFLGE--TSSSHIHGEITYFLLAG---------------AQGILSTFSPEYISKAYNLNKEEANKLAC--SQKGILIVKLQPGRPLPVP-----RR
         +  I++ +    ++ +      FLLAG               +Q I S FS E +S+A+ ++ + A +L C   Q+G  IV+++ G  L  P      +
Subjt:  DLEIIFLGE--TSSSHIHGEITYFLLAG---------------AQGILSTFSPEYISKAYNLNKEEANKLAC--SQKGILIVKLQPGRPLPVP-----RR

Query:  LSGKMVF---------------------------------NLANA-EADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVM
          G+M                                   N+ N   AD        V  +    FP +    +SA    L    + SP +  ++   ++
Subjt:  LSGKMVF---------------------------------NLANA-EADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVM

Query:  YVASGSGRVEIVGFGGLKTA--AEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSAD
        Y+  G  +V++V   G KT    E++ G L++VP+++V+ K+A  +G    +  T    ++  +AGK+S++ AL  +VL  ++ +S +
Subjt:  YVASGSGRVEIVGFGGLKTA--AEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSAD

Q02897 Glutelin type-B 22.1e-2122.67Show/hide
Query:  RLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP------------------------NESEEQVLKLNKGDAIPVPAGVVSWWFNGGDSDLEIIFLG
        R V+ P G  +P Y+++P + Y+ Q G G +GL FP                         +  +++ +  +GD + +PAGV  W++N GD+ +  I++ 
Subjt:  RLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP------------------------NESEEQVLKLNKGDAIPVPAGVVSWWFNGGDSDLEIIFLG

Query:  E--TSSSHIHGEITYFLLAG-----------------AQGILSTFSPEYISKAYNLNKEEANKLACS----------QKGILIVK---------------
        +   S++ +      FLLAG                 +Q I + F  E +S+A  +N   A +L             + G+ ++K               
Subjt:  E--TSSSHIHGEITYFLLAG-----------------AQGILSTFSPEYISKAYNLNKEEANKLACS----------QKGILIVK---------------

Query:  -LQPGRPLPVPRRLSG--------KMVFNLAN-AEADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIV
         +Q         R +G        K   N+ N + AD        + +V   KFP +    +SA+   L    + SP +  ++   ++Y+  G  RV++V
Subjt:  -LQPGRPLPVPRRLSG--------KMVFNLAN-AEADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIV

Query:  -GFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSAD
          FG       ++ G L+++P+++ + K+A  +G +  +I T     +  LAGK SV+ AL  +V+  ++ +S +
Subjt:  -GFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSAD

Q6K508 Glutelin type-D 14.2e-2224.24Show/hide
Query:  SESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP------------------------NESEEQVLKLNKG
        SE+    Y++  + QF       V   R V+ P G  +P Y+++P + Y+ Q G G VGL FP                         +  +++ +  +G
Subjt:  SESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP------------------------NESEEQVLKLNKG

Query:  DAIPVPAGVVSWWFNGGDSDLEIIFLGETSS--SHIHGEITYFLLAG-----------------AQGILSTFSPEYISKAYNLNKEEANKLAC--SQKG-
        D + +PA V  W++NGGD+   ++++ +  S  + +      FLLAG                  Q I S F+ E +S+A  +N E + +L     Q+G 
Subjt:  DAIPVPAGVVSWWFNGGDSDLEIIFLGETSS--SHIHGEITYFLLAG-----------------AQGILSTFSPEYISKAYNLNKEEANKLAC--SQKG-

Query:  ILIVK--LQPGRPLPVPRR-----------------------LSGKMVFNLAN-AEADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVY
        I+ VK  LQ  +P    R+                        + K   N+ N + AD        +  +   KFP +   G+ A+   L    + SP +
Subjt:  ILIVK--LQPGRPLPVPRR-----------------------LSGKMVFNLAN-AEADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVY

Query:  TADSSVQVMYVASGSGRVEIVGFGGLKTAAEV-KAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSAD
          ++   V+Y+  GS RV++    G      V   G L+++P+   + K+A  +G +  +I T + P +  +AGK S+  AL  +V+  ++ +S D
Subjt:  TADSSVQVMYVASGSGRVEIVGFGGLKTAAEV-KAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSAD

Q9ZWA9 12S seed storage protein CRD1.6e-2125.07Show/hide
Query:  PVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP-------------------------NESEEQVLKLNKGDAIPVPAGVVSWWFN
        P L    V   R+ L P+   LP +   P + YV Q G GV+G +                            +  +++    +GD     AGV  WW+N
Subjt:  PVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP-------------------------NESEEQVLKLNKGDAIPVPAGVVSWWFN

Query:  GGDSDLEIIFLGETSS--SHIHGEITYFLLAGAQ--------------GILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPL----PVPRR
         GDSD  I+ + + ++  + +      F LAG++                 S F P  I++A+ +N E A +L  +QK      ++   PL    P PR 
Subjt:  GGDSDLEIIFLGETSS--SHIHGEITYFLLAGAQ--------------GILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPL----PVPRR

Query:  --------------LSGKMVFNLANAE-ADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKT
                       + K+  N+ + E +D  +T    + T+     P +    L+A    L  GG+  P +TA++   V+YV  G  ++++V   G   
Subjt:  --------------LSGKMVFNLANAE-ADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKT

Query:  AAE-VKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEK
          E V  G ++V+P+ F + K AGE G E  S  T     I  L+G+ S   A+  +V++ S+ V+ +  K
Subjt:  AAE-VKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEK

Arabidopsis top hitse value%identityAlignment
AT1G03880.1 cruciferin 26.8e-2023.53Show/hide
Query:  ESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESE------------------------EQVLKLNKGD
        +S+GG    W     P L     A  R V+ P G  LP + ++ K+ +V   G G++G V P  +E                        ++V  L  GD
Subjt:  ESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESE------------------------EQVLKLNKGD

Query:  AIPVPAGVVSWWFNGGDSDLEIIFLGETSS--SHIHGEITYFLLAG----------------AQGILSTFSPEYISKAYNLNKEEANKLACSQ--KGILI
         I  P+GV  W++N G+  L ++   + +S  + +   +  FL+AG                   I + F+PE +++A+ +N E A +L   Q  +G ++
Subjt:  AIPVPAGVVSWWFNGGDSDLEIIFLGETSS--SHIHGEITYFLLAG----------------AQGILSTFSPEYISKAYNLNKEEANKLACSQ--KGILI

Query:  VKLQPG---RPLPVPRRLSGKMVFNLANAEADREATFGAT------------------VKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQV
            P    RP P+ R   G+    +AN   +   T   T                  + T+     P +    LSA    +R   +  P +  +++   
Subjt:  VKLQPG---RPLPVPRRLSGKMVFNLANAEADREATFGAT------------------VKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQV

Query:  MYVASGSGRVEIVGFGGLKT-AAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEK
        +YV +G   +++V   G +    E+ +G L+VVP+ F + K A  +  E     T     +  LAG+ SV   L  EV+   + +S +  K
Subjt:  MYVASGSGRVEIVGFGGLKT-AAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEK

AT1G03890.1 RmlC-like cupins superfamily protein1.1e-2225.07Show/hide
Query:  PVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP-------------------------NESEEQVLKLNKGDAIPVPAGVVSWWFN
        P L    V   R+ L P+   LP +   P + YV Q G GV+G +                            +  +++    +GD     AGV  WW+N
Subjt:  PVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFP-------------------------NESEEQVLKLNKGDAIPVPAGVVSWWFN

Query:  GGDSDLEIIFLGETSS--SHIHGEITYFLLAGAQ--------------GILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPL----PVPRR
         GDSD  I+ + + ++  + +      F LAG++                 S F P  I++A+ +N E A +L  +QK      ++   PL    P PR 
Subjt:  GGDSDLEIIFLGETSS--SHIHGEITYFLLAGAQ--------------GILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPL----PVPRR

Query:  --------------LSGKMVFNLANAE-ADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKT
                       + K+  N+ + E +D  +T    + T+     P +    L+A    L  GG+  P +TA++   V+YV  G  ++++V   G   
Subjt:  --------------LSGKMVFNLANAE-ADREATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKT

Query:  AAE-VKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEK
          E V  G ++V+P+ F + K AGE G E  S  T     I  L+G+ S   A+  +V++ S+ V+ +  K
Subjt:  AAE-VKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEK

AT1G07750.1 RmlC-like cupins superfamily protein8.2e-7442.03Show/hide
Query:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS
        ME DL P   +     DGG+Y  W   + P+L   N+ A +L L  +GFA+P Y+DS KV YV QG  G  G+V P E EE+V+ + +GD+I +P GVV+
Subjt:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS

Query:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLS-GKMVFNLANAEAD
        WWFN  D +L I+FLGET   H  G+ T F L G  GI + FS E++ +A++L++    KL  SQ G  IVKL  G  +P P+  +    V N   A  D
Subjt:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLS-GKMVFNLANAEAD

Query:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKT-AAEVKAGHLVVVPKYFVIGKEAGEDGMEC
         +   G  V  +     P +G  G  A L ++    + SP ++ DS++QV Y+  GSGRV++VG  G +     +KAG L +VP++FV+ K A  DGM  
Subjt:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKT-AAEVKAGHLVVVPKYFVIGKEAGEDGMEC

Query:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFIS
        FSIVTT  P+   LAG  SVW++LS EVLQ +F V+ + EK F S
Subjt:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFIS

AT2G28680.1 RmlC-like cupins superfamily protein3.0e-7643.06Show/hide
Query:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS
        ME DL P   +     DGG+Y+ W   + P+L D N+ A +L L   G ALP Y+DSPKV YV QG  G  G+V P E EE+V+ + KGD+I +P GVV+
Subjt:  MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVS

Query:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGK-MVFNLANAEAD
        WWFN  D++L ++FLGET   H  G+ T F L G+ GI + FS E++ +A++L++    KL  SQ G  IVK+     +P P++   K  V N   A  D
Subjt:  WWFNGGDSDLEIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGK-MVFNLANAEAD

Query:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKT-AAEVKAGHLVVVPKYFVIGKEAGEDGMEC
         +   G  V  +     P +G  G  A L ++    + SP ++ DS++QV Y+  GSGRV+IVG  G +     VKAG L +VP++FV+ K A  DG+  
Subjt:  REATFGATVKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKT-AAEVKAGHLVVVPKYFVIGKEAGEDGMEC

Query:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK
        FSIVTT  P+   LAG+ SVW+ALS EVLQ +F V  + EK F SK
Subjt:  FSIVTTTRPVIEELAGKASVWEALSEEVLQVSFDVSADFEKMFISK

AT4G28520.1 cruciferin 31.7e-0727.69Show/hide
Query:  VKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTR
        V +V     P +    LSA+   L+   +  P Y  +++ +++Y   G GR+++V   G      +V+ G LVV+P+ F    ++  +  E  S  T   
Subjt:  VKTVKESKFPFIGRTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTA-AEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTR

Query:  PVIEELAGKASVWEALSEEVLQVSFDVSAD
         +I  LAG+ S+  AL  EV+   F +S +
Subjt:  PVIEELAGKASVWEALSEEVLQVSFDVSAD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTCGACTTGAAGCCCATGACTGACAGAACCTTCTCGGAGAGCGACGGCGGAGCCTACTACAACTGGTCGTCTTCTCAGTTTCCGGTGCTTTCCGACATCAATGT
CGCCGCCGGCAGACTCGTCCTCCACCCCAGCGGCTTCGCTCTCCCCCATTACGCCGATTCTCCTAAAGTTGGTTACGTCACACAAGGAGGCTATGGAGTCGTCGGATTGG
TGTTCCCAAACGAGTCGGAAGAGCAAGTCTTGAAGCTCAACAAAGGAGACGCAATACCAGTCCCCGCCGGAGTCGTCTCCTGGTGGTTCAACGGCGGAGATTCCGATCTC
GAAATCATATTCCTCGGCGAGACTTCAAGCTCCCACATCCATGGCGAAATCACCTACTTCCTTCTCGCCGGAGCCCAAGGAATCCTGTCGACTTTCTCGCCGGAATACAT
CTCCAAGGCCTATAACCTAAACAAAGAAGAAGCGAACAAGCTCGCCTGCAGCCAAAAAGGGATTTTGATCGTCAAGCTACAGCCAGGGAGACCCCTGCCGGTGCCCCGGC
GGCTGTCCGGCAAGATGGTGTTCAACCTGGCGAATGCAGAGGCCGATCGGGAGGCCACGTTTGGTGCGACGGTCAAAACTGTGAAGGAATCGAAGTTCCCGTTCATCGGA
CGGACGGGGCTAAGTGCGAGCCTTGAGAAGCTCCGTCCCGGCGGCGTTCGTTCTCCGGTTTACACCGCCGATTCGTCGGTTCAAGTGATGTACGTCGCGAGTGGGTCGGG
TCGGGTCGAGATAGTCGGGTTCGGCGGCCTGAAGACGGCGGCTGAAGTGAAGGCGGGTCACTTGGTGGTGGTTCCGAAGTACTTTGTGATTGGGAAAGAAGCTGGTGAAG
ATGGAATGGAGTGCTTCTCCATTGTCACCACCACTAGGCCTGTGATAGAGGAGTTGGCTGGAAAGGCATCAGTATGGGAGGCCTTGTCTGAGGAAGTATTACAAGTCTCT
TTTGACGTATCTGCTGATTTTGAAAAGATGTTCATTTCAAAAGCAGCTAATTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTCGACTTGAAGCCCATGACTGACAGAACCTTCTCGGAGAGCGACGGCGGAGCCTACTACAACTGGTCGTCTTCTCAGTTTCCGGTGCTTTCCGACATCAATGT
CGCCGCCGGCAGACTCGTCCTCCACCCCAGCGGCTTCGCTCTCCCCCATTACGCCGATTCTCCTAAAGTTGGTTACGTCACACAAGGAGGCTATGGAGTCGTCGGATTGG
TGTTCCCAAACGAGTCGGAAGAGCAAGTCTTGAAGCTCAACAAAGGAGACGCAATACCAGTCCCCGCCGGAGTCGTCTCCTGGTGGTTCAACGGCGGAGATTCCGATCTC
GAAATCATATTCCTCGGCGAGACTTCAAGCTCCCACATCCATGGCGAAATCACCTACTTCCTTCTCGCCGGAGCCCAAGGAATCCTGTCGACTTTCTCGCCGGAATACAT
CTCCAAGGCCTATAACCTAAACAAAGAAGAAGCGAACAAGCTCGCCTGCAGCCAAAAAGGGATTTTGATCGTCAAGCTACAGCCAGGGAGACCCCTGCCGGTGCCCCGGC
GGCTGTCCGGCAAGATGGTGTTCAACCTGGCGAATGCAGAGGCCGATCGGGAGGCCACGTTTGGTGCGACGGTCAAAACTGTGAAGGAATCGAAGTTCCCGTTCATCGGA
CGGACGGGGCTAAGTGCGAGCCTTGAGAAGCTCCGTCCCGGCGGCGTTCGTTCTCCGGTTTACACCGCCGATTCGTCGGTTCAAGTGATGTACGTCGCGAGTGGGTCGGG
TCGGGTCGAGATAGTCGGGTTCGGCGGCCTGAAGACGGCGGCTGAAGTGAAGGCGGGTCACTTGGTGGTGGTTCCGAAGTACTTTGTGATTGGGAAAGAAGCTGGTGAAG
ATGGAATGGAGTGCTTCTCCATTGTCACCACCACTAGGCCTGTGATAGAGGAGTTGGCTGGAAAGGCATCAGTATGGGAGGCCTTGTCTGAGGAAGTATTACAAGTCTCT
TTTGACGTATCTGCTGATTTTGAAAAGATGTTCATTTCAAAAGCAGCTAATTATTGA
Protein sequenceShow/hide protein sequence
MEFDLKPMTDRTFSESDGGAYYNWSSSQFPVLSDINVAAGRLVLHPSGFALPHYADSPKVGYVTQGGYGVVGLVFPNESEEQVLKLNKGDAIPVPAGVVSWWFNGGDSDL
EIIFLGETSSSHIHGEITYFLLAGAQGILSTFSPEYISKAYNLNKEEANKLACSQKGILIVKLQPGRPLPVPRRLSGKMVFNLANAEADREATFGATVKTVKESKFPFIG
RTGLSASLEKLRPGGVRSPVYTADSSVQVMYVASGSGRVEIVGFGGLKTAAEVKAGHLVVVPKYFVIGKEAGEDGMECFSIVTTTRPVIEELAGKASVWEALSEEVLQVS
FDVSADFEKMFISKAANY