; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G015600 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G015600
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionBet_v_1 domain-containing protein
Genome locationchr11:23914932..23916527
RNA-Seq ExpressionLsi11G015600
SyntenyLsi11G015600
Gene Ontology termsGO:0006952 - defense response (biological process)
GO:0009738 - abscisic acid-activated signaling pathway (biological process)
GO:0043086 - negative regulation of catalytic activity (biological process)
GO:0080163 - regulation of protein serine/threonine phosphatase activity (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0004864 - protein phosphatase inhibitor activity (molecular function)
GO:0010427 - abscisic acid binding (molecular function)
GO:0038023 - signaling receptor activity (molecular function)
InterPro domainsIPR000916 - Bet v I/Major latex protein
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031572.1 MLP-like protein 423 isoform X1 [Cucumis melo var. makuwa]3.0e-7594.59Show/hide
Query:  MKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFTFTP
        MKGEVLLNLPA+KAWQMYRDNDVVSKINPE+LSRAEYV+GDG PGTLRLFKLGPAV SYVEESVEKIEKVE GRSVSYDVVGGELRKMYNPYKVTFTFTP
Subjt:  MKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFTFTP

Query:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        VEGKEKEMCIAQWKAEYEPLTP IPPPDKARDAALQFLQ FDKFQLSY
Subjt:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

XP_004136826.1 major strawberry allergen Fra a 1.08 [Cucumis sativus]5.5e-7794.7Show/hide
Query:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
        M+SMKGEVLLNLPA+KAWQMYRDNDVVSKINPE+LSRAEYV+GDGGPGTLRLFKLGPAV SYVEESVEKIEKVETGRSVSYDVVGGELRKMY+PYKVTFT
Subjt:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT

Query:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        FTPVEGKEKEMC AQWKAEYEPLTPAIPPPDKARDAALQFLQ FDKFQLSY
Subjt:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

XP_008455315.1 PREDICTED: uncharacterized protein LOC103495511 [Cucumis melo]5.5e-7794.7Show/hide
Query:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
        M+SMKGEVLLNLPA+KAWQMYRDNDVVSKINPE+LSRAEYV+GDG PGTLRLFKLGPAV SYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
Subjt:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT

Query:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        FTPVEGKEKEMCIAQWKAEYEPLTP IPPPDKARDAALQFLQ FDKFQLSY
Subjt:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

XP_022147870.1 uncharacterized protein LOC111016704 [Momordica charantia]5.9e-7996.03Show/hide
Query:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
        MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPE+LSRAEYV GDGGPGTLRLFKLGPAVR YVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
Subjt:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT

Query:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        FTPVEGKEKEMC+AQWKAEYEPLTP IPPP+KARDAALQFLQCFDKFQLSY
Subjt:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

XP_038887757.1 major strawberry allergen Fra a 1.08-like [Benincasa hispida]5.9e-7996.03Show/hide
Query:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
        MRSMKGEVLL LPAEKAW+MYRDNDVVSKINPE+LSRAEYV+GDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
Subjt:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT

Query:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        FTPVEGKEKEMCIAQW+AEYEP+TPAIPPPDKARDAALQFLQCFDKFQLSY
Subjt:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

TrEMBL top hitse value%identityAlignment
A0A0A0K2C4 Bet_v_1 domain-containing protein2.7e-7794.7Show/hide
Query:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
        M+SMKGEVLLNLPA+KAWQMYRDNDVVSKINPE+LSRAEYV+GDGGPGTLRLFKLGPAV SYVEESVEKIEKVETGRSVSYDVVGGELRKMY+PYKVTFT
Subjt:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT

Query:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        FTPVEGKEKEMC AQWKAEYEPLTPAIPPPDKARDAALQFLQ FDKFQLSY
Subjt:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

A0A1S3C1C9 uncharacterized protein LOC1034955112.7e-7794.7Show/hide
Query:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
        M+SMKGEVLLNLPA+KAWQMYRDNDVVSKINPE+LSRAEYV+GDG PGTLRLFKLGPAV SYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
Subjt:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT

Query:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        FTPVEGKEKEMCIAQWKAEYEPLTP IPPPDKARDAALQFLQ FDKFQLSY
Subjt:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

A0A5A7SKH6 MLP-like protein 423 isoform X11.5e-7594.59Show/hide
Query:  MKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFTFTP
        MKGEVLLNLPA+KAWQMYRDNDVVSKINPE+LSRAEYV+GDG PGTLRLFKLGPAV SYVEESVEKIEKVE GRSVSYDVVGGELRKMYNPYKVTFTFTP
Subjt:  MKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFTFTP

Query:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        VEGKEKEMCIAQWKAEYEPLTP IPPPDKARDAALQFLQ FDKFQLSY
Subjt:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

A0A6J1D1C2 uncharacterized protein LOC1110167042.8e-7996.03Show/hide
Query:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
        MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPE+LSRAEYV GDGGPGTLRLFKLGPAVR YVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
Subjt:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT

Query:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        FTPVEGKEKEMC+AQWKAEYEPLTP IPPP+KARDAALQFLQCFDKFQLSY
Subjt:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

A0A6J1HZI6 uncharacterized protein LOC1114683377.3e-7590.73Show/hide
Query:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
        MRSMKGEV LNLPAEKAWQMYRDN+V+SKINPE+LSRAEYV+GDGGPGTLRLFKLGPA+RSYV ESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
Subjt:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT

Query:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY
        FTPVEGKE EMC AQW+AEYEPL+P IPPPDKARDAAL+FLQCFDKF LSY
Subjt:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY

SwissProt top hitse value%identityAlignment
A0A1S3THR8 Phytohormone-binding protein CSBP5.3e-0624.65Show/hide
Query:  LNLPAEKAWQMYRDN--DVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVR-SYVEESVEKIEKVETGRSVSYDVV-GGELRKMYNPYKVTFTFTPVE
        L++  E  W +   +   VV K+ P ++   + +EGDGG GT+ +F   P V  SY  E + + +  E+   +   V+ GG L +  + YK TF  + +E
Subjt:  LNLPAEKAWQMYRDN--DVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVR-SYVEESVEKIEKVETGRSVSYDVV-GGELRKMYNPYKVTFTFTPVE

Query:  GKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKF
         ++K +   +   +++        P K   + L +L+  +++
Subjt:  GKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKF

C0HKF5 Allergen Pet c 19.0e-0629.92Show/hide
Query:  AEKAWQMYRDNDVV-SKINPEMLSRA-EYVEGDGGPGTLRLFKLGPAVR-SYVEESVEKIEKVETGRSVSYDVVGGE-LRKMYNPYKVTFTFTPVEGKEK
        AEK +    D D V  K+ P+++ ++ E +EGDG  GT++L  LG A   + +++ V+ I+K   G + +Y  +GG+ L ++       F   P +G   
Subjt:  AEKAWQMYRDNDVV-SKINPEMLSRA-EYVEGDGGPGTLRLFKLGPAVR-SYVEESVEKIEKVETGRSVSYDVVGGE-LRKMYNPYKVTFTFTPVEGKEK

Query:  EMCIAQWKAEYEPLTPAIPPPDKARDA
          CI +    Y     A+ P DK ++A
Subjt:  EMCIAQWKAEYEPLTPAIPPPDKARDA

P27538 Pathogenesis-related protein 26.9e-0626.09Show/hide
Query:  EVLLNLPAE---KAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVR-SYVEESVEKIEKVETGRSVSYDVVGGE-LRKMYNPYKVTFTF
        EV  ++PA+   K + +  DN ++ K+ P+ +   E + GDGG GT++   LG   + + V++ +++I+        SY ++ G+ L  +       FT 
Subjt:  EVLLNLPAE---KAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVR-SYVEESVEKIEKVETGRSVSYDVVGGE-LRKMYNPYKVTFTF

Query:  TPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQ
         P +G     CI +    Y P+  A+ P +  ++A  Q
Subjt:  TPVEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQ

P92918 Major allergen Api g 1, isoallergen 21.1e-0631.5Show/hide
Query:  AEKAWQMY-RDNDVV-SKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVR-SYVEESVEKIEKVETGRSVSYDVVGGE-LRKMYNPYKVTFTFTPVEGKEK
        AEK +Q +  D D V  K+ P+++   E +EGDGG GT++L  LG A   + +++ V+ I+K   G + +Y  +GG+ L  +       F   P +G   
Subjt:  AEKAWQMY-RDNDVV-SKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVR-SYVEESVEKIEKVETGRSVSYDVVGGE-LRKMYNPYKVTFTFTPVEGKEK

Query:  EMCIAQWKAEYEPLTPAIPPPDKARDA
          CI +    Y     A+ P DK ++A
Subjt:  EMCIAQWKAEYEPLTPAIPPPDKARDA

Q93VR4 MLP-like protein 4239.3e-1130.82Show/hide
Query:  EVLLNLPAEKAWQMYRDN-DVVSKINPEMLSRAEYVEGDG-GPGTLRLFKLGPAVRSYVEESVEKIEKVE-TGRSVSYDVVGGELRKMYNPYKVTFTFTP
        EV +  PAEK W    D  ++  K  P      + + GDG  PG++RL   G      V+ S E+IE V+   +S+SY ++GGE+ + Y  +K T T  P
Subjt:  EVLLNLPAEKAWQMYRDN-DVVSKINPEMLSRAEYVEGDG-GPGTLRLFKLGPAVRSYVEESVEKIEKVE-TGRSVSYDVVGGELRKMYNPYKVTFTFTP

Query:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQL
         +G      + +W  E+E     I  P   +D A++  +  D++ L
Subjt:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQL

Arabidopsis top hitse value%identityAlignment
AT1G23120.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein6.2e-1028.03Show/hide
Query:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT
        +++++ E+ +N+ AE+ ++ ++  +     N    + A YV  D       +      V   +E+  EKI+  E  +SVS+  + G++ K Y  YK+T  
Subjt:  MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFT

Query:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDK
          P   K+ ++CIA+W  EYE L   +PPP +
Subjt:  FTPVEGKEKEMCIAQWKAEYEPLTPAIPPPDK

AT1G24020.1 MLP-like protein 4236.6e-1230.82Show/hide
Query:  EVLLNLPAEKAWQMYRDN-DVVSKINPEMLSRAEYVEGDG-GPGTLRLFKLGPAVRSYVEESVEKIEKVE-TGRSVSYDVVGGELRKMYNPYKVTFTFTP
        EV +  PAEK W    D  ++  K  P      + + GDG  PG++RL   G      V+ S E+IE V+   +S+SY ++GGE+ + Y  +K T T  P
Subjt:  EVLLNLPAEKAWQMYRDN-DVVSKINPEMLSRAEYVEGDG-GPGTLRLFKLGPAVRSYVEESVEKIEKVE-TGRSVSYDVVGGELRKMYNPYKVTFTFTP

Query:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQL
         +G      + +W  E+E     I  P   +D A++  +  D++ L
Subjt:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQL

AT1G24020.2 MLP-like protein 4236.6e-1230.82Show/hide
Query:  EVLLNLPAEKAWQMYRDN-DVVSKINPEMLSRAEYVEGDG-GPGTLRLFKLGPAVRSYVEESVEKIEKVE-TGRSVSYDVVGGELRKMYNPYKVTFTFTP
        EV +  PAEK W    D  ++  K  P      + + GDG  PG++RL   G      V+ S E+IE V+   +S+SY ++GGE+ + Y  +K T T  P
Subjt:  EVLLNLPAEKAWQMYRDN-DVVSKINPEMLSRAEYVEGDG-GPGTLRLFKLGPAVRSYVEESVEKIEKVE-TGRSVSYDVVGGELRKMYNPYKVTFTFTP

Query:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQL
         +G      + +W  E+E     I  P   +D A++  +  D++ L
Subjt:  VEGKEKEMCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAGCATGAAAGGAGAGGTGTTACTGAACCTCCCCGCCGAGAAAGCTTGGCAGATGTACAGAGACAACGATGTGGTCAGTAAAATCAATCCTGAGATGCTTTCTCG
AGCTGAATATGTTGAAGGAGATGGTGGCCCTGGAACTCTCAGGCTTTTCAAGCTTGGCCCTGCTGTTAGAAGCTACGTGGAGGAATCGGTGGAGAAGATAGAGAAGGTAG
AGACAGGTCGATCAGTGAGCTACGATGTGGTGGGAGGAGAGCTAAGGAAGATGTACAATCCATACAAAGTGACATTCACATTCACTCCAGTTGAAGGAAAAGAGAAGGAA
ATGTGCATTGCTCAATGGAAAGCTGAGTATGAGCCACTGACTCCGGCCATCCCTCCACCGGACAAAGCCAGGGATGCTGCTTTGCAATTTCTCCAATGCTTTGACAAGTT
TCAGCTCAGCTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAGCATGAAAGGAGAGGTGTTACTGAACCTCCCCGCCGAGAAAGCTTGGCAGATGTACAGAGACAACGATGTGGTCAGTAAAATCAATCCTGAGATGCTTTCTCG
AGCTGAATATGTTGAAGGAGATGGTGGCCCTGGAACTCTCAGGCTTTTCAAGCTTGGCCCTGCTGTTAGAAGCTACGTGGAGGAATCGGTGGAGAAGATAGAGAAGGTAG
AGACAGGTCGATCAGTGAGCTACGATGTGGTGGGAGGAGAGCTAAGGAAGATGTACAATCCATACAAAGTGACATTCACATTCACTCCAGTTGAAGGAAAAGAGAAGGAA
ATGTGCATTGCTCAATGGAAAGCTGAGTATGAGCCACTGACTCCGGCCATCCCTCCACCGGACAAAGCCAGGGATGCTGCTTTGCAATTTCTCCAATGCTTTGACAAGTT
TCAGCTCAGCTACTAAGAAGTAAGACTTCCTTCAATATGTCAGTGTGTGAAAATAACTATGGAAAGGGTAAAACAGTCTATATGCCCTCTCTCTCTCTTTCTCACTGAAA
AAACCAGTTTAATCATGTTTCTCTCTCTTCTGTGATGTTTCTCTCTCTCTCTTTCTCTCTCTCTCCTGTGAAAAAACCAGTTGGTCATCTCTCTCTCTCTTGTGAAAAAC
TAGTTGGGTCATGTCAGTGTTATGTGAGAGAGACTTGGTAGTTGTGGATTTTAGGTGCCATTTGTGAAGGGTAATGTATGTGGTGTTATTGTGAGCAAATCCCAATGGAG
AAAGTCAAAGAAGAAAGAATATCGTAAATTTATGTGGGGGACTTTTCTGCAGGAAGAAGTGTTATATTTTGCATACGTATTTGAAAGAGAGAATATCA
Protein sequenceShow/hide protein sequence
MRSMKGEVLLNLPAEKAWQMYRDNDVVSKINPEMLSRAEYVEGDGGPGTLRLFKLGPAVRSYVEESVEKIEKVETGRSVSYDVVGGELRKMYNPYKVTFTFTPVEGKEKE
MCIAQWKAEYEPLTPAIPPPDKARDAALQFLQCFDKFQLSY