CuGenDBv2

Lag0030882 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030882
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: major latex protein 149-like
Genome location: chr11:2424006..2425214
RNA-Seq Expression: Lag0030882
Synteny: Lag0030882
Gene Ontology terms: GO:0006952 - defense response (biological process)
InterPro domains: IPR000916 - Bet v I/Major latex protein
                  IPR023393 - START-like domain superfamily
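
The genome location string above can be split into its parts programmatically. The following is a minimal sketch; the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive (GFF-style),
    # so both endpoints count toward the length.
    return end - start + 1


chrom, start, end = parse_location("chr11:2424006..2425214")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1209 bp on chr11.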


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581204.1 hypothetical protein SDJN03_21206, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.0e-46; % identity: 60.67)
Query:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN
        MAQIAK+S++VQL+SSG +FYE +KN MD + +MFP+ +K ++++EG+G  HGS++  KY+   P E KER A+DDANKSI+ ECLEGDLFRDFE  K+ 
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN

Query:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
        ++VV+N SNG S NW +E+VKANEDVAPP+NYL +  K SKGID YLC N
Subjt:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

KAG6581206.1 Major latex protein 146, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.3e-46; % identity: 62)
Query:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN
        MAQIAK+S++VQL+SS  +FYEF+KN MD + +MFPE +K +++VEG+G+ HGSV+  KY+   P EVKER ++DDAN+SI+ ECLEGDL RDFE +K+ 
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN

Query:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
        ++VV+N SNGSS NW +EFVKANEDVA P+NYL    K SKGID YLC N
Subjt:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

XP_022935319.1 kirola-like [Cucurbita moschata] (e value: 3.0e-46; % identity: 60)
Query:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN
        MAQIAK+S++VQL+SSG +FYE +KN MD + +MFP+ +K ++++EG+G  HGS++  KY+   P + KER A+DDANKSI+ ECLEGDLFRDFE  K+ 
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN

Query:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
        ++VV+N SNG S NW +E+VKANEDVAPP+NYL +  K SKGID YLC N
Subjt:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

XP_023528799.1 kirola-like [Cucurbita pepo subsp. pepo] (e value: 1.7e-46; % identity: 61.33)
Query:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN
        MAQIAK+S++VQL+SSG +FYE +KN MD + +MFPE +K ++++EG+G  HGS++  KYD     E KER A+DDANKSI+ ECLEGDLFRDFE  K+ 
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN

Query:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
        ++VV+N SNG S NW +E+VKANEDVAPP+NYL +  K SKGID YLC N
Subjt:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

XP_038893753.1 kirola-like [Benincasa hispida] (e value: 4.1e-48; % identity: 67.55)
Query:  MAQIAKISEQVQLKSSGRRFYEFIK-NMDCL-PRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKV
        MAQIAKIS+QVQLKSSG +FYEF K NMD +  +MFPE ++  ++VEG+GF+HGSV+  KY+ G P EVKER A+DDANKSI+ ECLEGDLFRDFE +K+
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIK-NMDCL-PRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKV

Query:  NLQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
         +QV +N SNGSSVNW VEFVKANEDVA P++YL    K SKG+D YLCNN
Subjt:  NLQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CQL5 MLP-like protein 28 (e value: 2.1e-45; % identity: 61.44)
Query:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYD--FGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELK
        MAQIAKI+E+VQLKSSG +F+EF KN  D  PRMFP N K Y+ VEG+ F+HGSV +WKYD  FG  +EVK +  +D+ANK+I  ECLEGDLF+DF+  K
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYD--FGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELK

Query:  VNLQVVDNDSNG-SSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
        V ++V D  S+G SSVNW +EFVK+NE+VAPP +YL  G K  K +DAYLCNN
Subjt:  VNLQVVDNDSNG-SSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

A0A5A7UCZ9 MLP-like protein 28 (e value: 2.1e-45; % identity: 61.44)
Query:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYD--FGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELK
        MAQIAKI+E+VQLKSSG +F+EF KN  D  PRMFP N K Y+ VEG+ F+HGSV +WKYD  FG  +EVK +  +D+ANK+I  ECLEGDLF+DF+  K
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYD--FGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELK

Query:  VNLQVVDNDSNG-SSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
        V ++V D  S+G SSVNW +EFVK+NE+VAPP +YL  G K  K +DAYLCNN
Subjt:  VNLQVVDNDSNG-SSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

A0A6J1CNR9 uncharacterized protein LOC111013301 (e value: 6.0e-45; % identity: 60.39)
Query:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGI--PLEVKERRAVDDANKSISMECLEGDLFRDFEELK
        MAQIAKISEQVQLKS G +FYEF+KN MD  PRMFP N + Y+  EG+ FTHGS+  WKYD G+   +EVK R  VD+ NK+I  ECLEGDLF+DFE  +
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGI--PLEVKERRAVDDANKSISMECLEGDLFRDFEELK

Query:  VNLQVVDNDSNG--SSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
        V ++V D + N   SSV W +EFVKANE+V PP++YL +GVK  K +DA LCNN
Subjt:  VNLQVVDNDSNG--SSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

A0A6J1F4J9 kirola-like isoform X1 (e value: 6.0e-45; % identity: 59.33)
Query:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN
        MAQIAK+S++V+L+SS  +FYEF+KN MD + +MFPE +K +++VEG+G+ HGSV+  KY+     EVKER ++DDANKS++ EC+EGDL RDFE +K+ 
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN

Query:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
        ++VV+N SNGSS NW +EFVKANEDVA P+NYL    K S+GID YLC N
Subjt:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

A0A6J1F532 kirola-like (e value: 1.4e-46; % identity: 60)
Query:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN
        MAQIAK+S++VQL+SSG +FYE +KN MD + +MFP+ +K ++++EG+G  HGS++  KY+   P + KER A+DDANKSI+ ECLEGDLFRDFE  K+ 
Subjt:  MAQIAKISEQVQLKSSGRRFYEFIKN-MDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVN

Query:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN
        ++VV+N SNG S NW +E+VKANEDVAPP+NYL +  K SKGID YLC N
Subjt:  LQVVDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNN

SwissProt top hits (e value, % identity, alignment)
P19825 Major latex protein 15 (e value: 5.7e-16; % identity: 29.45)
Query:  IAKISEQVQLKSSGRRFYEFIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDF-GIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQV
        + K+  + ++  +  ++Y+  K+ + LP   P  +   + VEGHG T G V  W Y   G PL VKE+   +D  ++I+   +EG +  D+++    L V
Subjt:  IAKISEQVQLKSSGRRFYEFIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDF-GIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQV

Query:  VDN-DSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLC
            +  GS V W V++ K NED   P++YL    +  + ++++LC
Subjt:  VDN-DSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLC

Q06394 Major latex protein 146 (e value: 2.2e-15; % identity: 29.93)
Query:  IAKISEQVQLKSSGRRFYEFIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDF-GIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQV
        + K+  + ++  +  ++Y+  K+ + LP + P  +   + VEGHG T G V  W Y   G PL  KE+   +D  ++I    + GDL  D+++    L V
Subjt:  IAKISEQVQLKSSGRRFYEFIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDF-GIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQV

Query:  VDNDSNGSS--VNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLC
        V+  SNG    V W +++ K NED   P+ YL    + ++ + ++LC
Subjt:  VDNDSNGSS--VNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLC

Q06395 Major latex protein 149 (e value: 4.8e-15; % identity: 30.14)
Query:  IAKISEQVQLKSSGRRFYEFIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKY-DFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQV
        + K+  + ++  +  ++Y+  K+ + LP   P      + VEGHG T G V  W Y   G  L  KE+   +D  ++I     EGDL  D+++    L V
Subjt:  IAKISEQVQLKSSGRRFYEFIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKY-DFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQV

Query:  VDNDS-NGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLC
           D+ +GS V + +++ K NED   P +YL +  +A++ ++ YLC
Subjt:  VDNDS-NGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLC

Q41020 Major latex protein 22 (e value: 9.7e-16; % identity: 31.03)
Query:  IAKISEQVQLKSSGRRFYEFIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDF-GIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQV
        + K+  ++++  +   +Y+  K+ + LP   P  ++G + VEG   T G +  W Y   G PL  KER   +D  ++I    +EG L  D+++    L  
Subjt:  IAKISEQVQLKSSGRRFYEFIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDF-GIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQV

Query:  VDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLC
           D +GS V W VE+ K NED   P +YL    K  + ++ YLC
Subjt:  VDNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLC

Q9SSK7 MLP-like protein 34 (e value: 5.3e-14; % identity: 31.25)
Query:  ISEQVQLKSSGRRFYE-FIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERR--AVDDANKSISMECLEGDLFRDFEELKVNLQVV
        +  +V++K+S  +F+  F      + +  P N +  ++ EG   T GS+V W Y      +V + R  AVD     I+   +EGDL ++++   + +QV 
Subjt:  ISEQVQLKSSGRRFYE-FIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERR--AVDDANKSISMECLEGDLFRDFEELKVNLQVV

Query:  -DNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYL
          +  +GS V+W  E+ K NE+VA P   L   V+ SK ID +L
Subjt:  -DNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYL

Arabidopsis top hits (e value, % identity, alignment)
AT1G24020.1 MLP-like protein 423 (e value: 1.7e-12; % identity: 28.77)
Query:  QVQLKSSGRRFYEFI-KNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPL---EVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQVVDN
        +V++KS   +F+  +   ++  P+ FP ++K  +++ G G   GS+ L  Y  G PL     +   AVD  NKS+S   + G++   ++  K  + V+  
Subjt:  QVQLKSSGRRFYEFI-KNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPL---EVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQVVDN

Query:  DSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNNST
        D  GS + W  EF K   ++  P+      VK  K ID YL   ++
Subjt:  DSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNNST

AT1G24020.2 MLP-like protein 423 (e value: 1.7e-12; % identity: 28.77)
Query:  QVQLKSSGRRFYEFI-KNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPL---EVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQVVDN
        +V++KS   +F+  +   ++  P+ FP ++K  +++ G G   GS+ L  Y  G PL     +   AVD  NKS+S   + G++   ++  K  + V+  
Subjt:  QVQLKSSGRRFYEFI-KNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPL---EVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQVVDN

Query:  DSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNNST
        D  GS + W  EF K   ++  P+      VK  K ID YL   ++
Subjt:  DSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNNST

AT1G70840.1 MLP-like protein 31 (e value: 7.1e-14; % identity: 29.93)
Query:  KISEQVQLKSSGRRFYE-FIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERR--AVDDANKSISMECLEGDLFRDFEELKVNLQV
        K+   +++K+S  +F+  F      + +  P   +G E+ EG     GS+V W Y      +V + R  AV+     I+   +EGDL ++++   + +QV
Subjt:  KISEQVQLKSSGRRFYE-FIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERR--AVDDANKSISMECLEGDLFRDFEELKVNLQV

Query:  VDNDSN-GSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCN
               GS V+W VE+ K ++ VA P  +L   V+ SK ID +L N
Subjt:  VDNDSN-GSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCN

AT1G70850.1 MLP-like protein 34 (e value: 3.8e-15; % identity: 31.25)
Query:  ISEQVQLKSSGRRFYE-FIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERR--AVDDANKSISMECLEGDLFRDFEELKVNLQVV
        +  +V++K+S  +F+  F      + +  P N +  ++ EG   T GS+V W Y      +V + R  AVD     I+   +EGDL ++++   + +QV 
Subjt:  ISEQVQLKSSGRRFYE-FIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERR--AVDDANKSISMECLEGDLFRDFEELKVNLQVV

Query:  -DNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYL
          +  +GS V+W  E+ K NE+VA P   L   V+ SK ID +L
Subjt:  -DNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYL

AT1G70850.3 MLP-like protein 34 (e value: 3.8e-15; % identity: 31.25)
Query:  ISEQVQLKSSGRRFYE-FIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERR--AVDDANKSISMECLEGDLFRDFEELKVNLQVV
        +  +V++K+S  +F+  F      + +  P N +  ++ EG   T GS+V W Y      +V + R  AVD     I+   +EGDL ++++   + +QV 
Subjt:  ISEQVQLKSSGRRFYE-FIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERR--AVDDANKSISMECLEGDLFRDFEELKVNLQVV

Query:  -DNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYL
          +  +GS V+W  E+ K NE+VA P   L   V+ SK ID +L
Subjt:  -DNDSNGSSVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYL


Sequences
CDS sequence
ATGGCTCAGATTGCTAAGATCTCAGAGCAGGTGCAGCTGAAGTCTTCTGGTCGAAGGTTTTATGAGTTTATTAAGAACATGGACTGTCTTCCTCGAATGTTTCCTGAAAA
CTTTAAGGGATACGAAATTGTGGAAGGACATGGTTTCACTCATGGCAGCGTCGTCCTTTGGAAATATGACTTTGGTATCCCATTAGAAGTAAAGGAGAGGCGAGCTGTGG
ATGATGCCAACAAATCAATAAGTATGGAGTGTCTTGAAGGAGATCTGTTCAGAGATTTTGAAGAACTCAAAGTGAATCTTCAAGTTGTTGACAATGATAGCAATGGCAGC
TCAGTTAATTGGTTTGTAGAATTTGTAAAGGCAAATGAAGATGTGGCTCCACCCTATAATTATCTCCACATGGGAGTTAAAGCAAGCAAAGGCATTGATGCTTACCTTTG
CAACAACTCAACTAATTAA
mRNA sequence
ATGGCTCAGATTGCTAAGATCTCAGAGCAGGTGCAGCTGAAGTCTTCTGGTCGAAGGTTTTATGAGTTTATTAAGAACATGGACTGTCTTCCTCGAATGTTTCCTGAAAA
CTTTAAGGGATACGAAATTGTGGAAGGACATGGTTTCACTCATGGCAGCGTCGTCCTTTGGAAATATGACTTTGGTATCCCATTAGAAGTAAAGGAGAGGCGAGCTGTGG
ATGATGCCAACAAATCAATAAGTATGGAGTGTCTTGAAGGAGATCTGTTCAGAGATTTTGAAGAACTCAAAGTGAATCTTCAAGTTGTTGACAATGATAGCAATGGCAGC
TCAGTTAATTGGTTTGTAGAATTTGTAAAGGCAAATGAAGATGTGGCTCCACCCTATAATTATCTCCACATGGGAGTTAAAGCAAGCAAAGGCATTGATGCTTACCTTTG
CAACAACTCAACTAATTAA
Protein sequence
MAQIAKISEQVQLKSSGRRFYEFIKNMDCLPRMFPENFKGYEIVEGHGFTHGSVVLWKYDFGIPLEVKERRAVDDANKSISMECLEGDLFRDFEELKVNLQVVDNDSNGS
SVNWFVEFVKANEDVAPPYNYLHMGVKASKGIDAYLCNNSTN
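
The protein sequence can be checked against the CDS by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling. It is run here on the first 30 nucleotides of the CDS only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        # Codons with characters other than A, C, G, T raise KeyError.
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGCTCAGATTGCTAAGATCTCAGAGCAG"  # first 30 nt of the CDS
print(translate(cds_start))
```

The output, MAQIAKISEQ, matches the first ten residues of the protein sequence. Passing the full 459-nt CDS (line breaks are ignored) should give the 152-residue protein, with the final TAA read as the stop codon.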