CuGenDBv2

MS007689 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS007689
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: cellulose synthase-like protein E1
Genome location: scaffold13:521376..521813
RNA-Seq Expression: MS007689
Synteny: MS007689
Gene Ontology terms:
GO:0009833 - plant-type primary cell wall biogenesis (biological process)
GO:0030244 - cellulose biosynthetic process (biological process)
GO:0097502 - mannosylation (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016760 - cellulose synthase (UDP-forming) activity (molecular function)
GO:0051753 - mannan synthase activity (molecular function)
InterPro domains:
IPR005150 - Cellulose synthase
IPR029044 - Nucleotide-diphospho-sugar transferases


Homology
GenBank top hits (e value, % identity, alignment)
KAA8527734.1 hypothetical protein F0562_035397 [Nyssa sinensis] (e value: 6.8e-56; % identity: 68.49)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        W L+T AE++F+F+W L Q+FRW PV RSV PEN+PG  E PGVDVF+CTADP KEPTVEVMNTVLS++ALDYPPEKLAVYLSDDGG P+TLYA+K+AG 
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FA++W+PFC++YGI +RCPE YFS+  D ER+ R D+F A+E++IK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

XP_023000525.1 cellulose synthase-like protein E1 [Cucurbita maxima] (e value: 2.6e-63; % identity: 80.82)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        WAL T+AEL+F F W+LTQSFRW PVSRSVSPEN+PG EE PGVDVF+ TADPAKEPTVE MNTVLSSLAL+YP EK+ VYLSDDGGSPVTLYAVK+AG 
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FAK WVPFCKEYGINSRCPE+YFS+LA++ERI+RD+KF A+EKEIK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

XP_028055208.1 cellulose synthase-like protein G1 isoform X2 [Camellia sinensis] (e value: 9.8e-55; % identity: 67.81)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        W L+T AEL+F F+W L+Q+FRW P++R+  PE L G  E PGVDVF+CTADP KEPTVEVMNTVLS++ALDYPPEKLAVYLSDDGGSP+TLYA+K A +
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FA+ W+PFC++YGI SRCP+ YFSA  D+ER+ R D+F AEE+ IK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

XP_038876181.1 cellulose synthase-like protein G1 isoform X1 [Benincasa hispida] (e value: 6.7e-64; % identity: 82.88)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        W+L+T+AELLFAFVWLL QSFRW PVSRSVSPENLPG+ E PGVDVF+CTADPAKEPTVEVMNTVLS LALDYP EKLAVYLSDDGGSP T + VK+AG 
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FAK WVPFC EYGINS CPEVYFS+LAD++RIFRD KFVA+EKEIK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

XP_038876182.1 cellulose synthase-like protein G1 isoform X2 [Benincasa hispida] (e value: 6.7e-64; % identity: 82.88)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        W+L+T+AELLFAFVWLL QSFRW PVSRSVSPENLPG+ E PGVDVF+CTADPAKEPTVEVMNTVLS LALDYP EKLAVYLSDDGGSP T + VK+AG 
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FAK WVPFC EYGINS CPEVYFS+LAD++RIFRD KFVA+EKEIK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

TrEMBL top hits (e value, % identity, alignment)
A0A4S4EVA9 Uncharacterized protein (e value: 4.7e-55; % identity: 67.81)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        W L+T AEL+F F+W L+Q+FRW P++R+  PE L G  E PGVDVF+CTADP KEPTVEVMNTVLS++ALDYPPEKLAVYLSDDGGSP+TLYA+K A +
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FA+ W+PFC++YGI SRCP+ YFSA  D+ER+ R D+F AEE+ IK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

A0A5C7IUP2 Uncharacterized protein (e value: 3.1e-54; % identity: 68.49)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        W+LVTV+EL+ AF+W L+Q FRWLPVSRSV PE  P     PG+DVFVCTAD  KEPTVEVMNTV+S+LALDYPPEKL+VYLSDDGGS +TLYA+K+A +
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FAK+W+PFC++YGI +RCPE YFS LA+DER+   D+F  EE+ IK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

A0A5J5ABU8 Uncharacterized protein (e value: 3.3e-56; % identity: 68.49)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        W L+T AE++F+F+W L Q+FRW PV RSV PEN+PG  E PGVDVF+CTADP KEPTVEVMNTVLS++ALDYPPEKLAVYLSDDGG P+TLYA+K+AG 
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FA++W+PFC++YGI +RCPE YFS+  D ER+ R D+F A+E++IK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

A0A6J1KIL2 cellulose synthase-like protein E1 (e value: 1.2e-63; % identity: 80.82)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        WAL T+AEL+F F W+LTQSFRW PVSRSVSPEN+PG EE PGVDVF+ TADPAKEPTVE MNTVLSSLAL+YP EK+ VYLSDDGGSPVTLYAVK+AG 
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FAK WVPFCKEYGINSRCPE+YFS+LA++ERI+RD+KF A+EKEIK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

A0A7J7IB63 Uncharacterized protein (e value: 4.0e-54; % identity: 67.12)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        W L+T AEL+F F+W L+Q+FRW P++R+  PE L G  E PGVDVF+CTADP KEPTVEVMNTVLS++ALDYPPEKLAVYLSDDGGS +TLYA+K A +
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
        FA+ W+PFC++YGI SRCP+ YFSA  D+ER+ R D+F AEE+ IK
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

SwissProt top hits (e value, % identity, alignment)
Q0DXZ1 Cellulose synthase-like protein E2 (e value: 1.2e-31; % identity: 50)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLP---GKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQ
        W  +  AEL F F W+LT S RW PV R    + L     ++E P VD+FVCTADP  EP + V++TVLS +A DY PEKL +YLSDD GS +T Y + +
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLP---GKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQ

Query:  AGDFAKAWVPFCKEYGINSRCPEVYFSALA
        A +FAK W+PFCK+Y +  R P  YF+ +A
Subjt:  AGDFAKAWVPFCKEYGINSRCPEVYFSALA

Q0WVN5 Cellulose synthase-like protein G3 (e value: 1.6e-31; % identity: 48.8)
Query:  ALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK-EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        +L+ +++++ AF+W  T S R+ PV R+  PE    + E+FP +DVF+CTADP KEP + V+NT LS +A +YP +K++VY+SDDGGS +TL+A+ +A  
Subjt:  ALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK-EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSA
        F+K W+PFCK+  +  R PEVYFS+
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSA

Q651X6 Cellulose synthase-like protein E6 (e value: 5.4e-32; % identity: 53.97)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPG--KEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQA
        W  +  AEL FA  W++TQS RW PV R      L    KE  PGVDVFVCTADP  EP   V++T+LS +A +YP EK++VYLSDDGGS +T YA+ +A
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPG--KEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQA

Query:  GDFAKAWVPFCKEYGINSRCPEVYFS
          FAK W+PFC+ Y I  R P  YFS
Subjt:  GDFAKAWVPFCKEYGINSRCPEVYFS

Q7EZW6 Cellulose synthase-like protein D3 (e value: 6.0e-31; % identity: 44.94)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSV------------SPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGS
        W +  V EL FAF WLL    +  PV+RS             SP N  G+ + PG+DVFV TADP KEP +    T+LS LA+DYP EKLA Y+SDDGG+
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSV------------SPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGS

Query:  PVTLYAVKQAGDFAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
         +T  A+ +A  FA  WVPFCK++ I  R P+ YFS   D  +  R + FV + + +K
Subjt:  PVTLYAVKQAGDFAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

Q8VZK9 Cellulose synthase-like protein E1 (e value: 1.1e-32; % identity: 51.54)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK--EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQA
        W ++ + E+ F   W++TQS RW PV R    + L  +   + P +DVFVCTADP  EP + V+NTVLS  ALDYPPEKLAVYLSDDGGS +T YA+ +A
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK--EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQA

Query:  GDFAKAWVPFCKEYGINSRCPEVYFSALAD
         +FAK WVPFCK++ +    P  Y S+ A+
Subjt:  GDFAKAWVPFCKEYGINSRCPEVYFSALAD

Arabidopsis top hits (e value, % identity, alignment)
AT1G32180.1 cellulose synthase-like D6 (e value: 5.6e-32; % identity: 44.94)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSV------------SPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGS
        W L  + EL FAF WLL Q  +  PV+ +             +P+N  GK + PG+DVFV TAD  KEP +   NT+LS L++DYP EKL+VY+SDDGGS
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSV------------SPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGS

Query:  PVTLYAVKQAGDFAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
         VT  A+ +A  FAK WVPFC+++ I  R PE YF    D  +      FV E + +K
Subjt:  PVTLYAVKQAGDFAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK

AT1G55850.1 cellulose synthase like E1 (e value: 7.8e-34; % identity: 51.54)
Query:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK--EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQA
        W ++ + E+ F   W++TQS RW PV R    + L  +   + P +DVFVCTADP  EP + V+NTVLS  ALDYPPEKLAVYLSDDGGS +T YA+ +A
Subjt:  WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK--EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQA

Query:  GDFAKAWVPFCKEYGINSRCPEVYFSALAD
         +FAK WVPFCK++ +    P  Y S+ A+
Subjt:  GDFAKAWVPFCKEYGINSRCPEVYFSALAD

AT4G23990.1 cellulose synthase like G3 (e value: 1.1e-32; % identity: 48.8)
Query:  ALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK-EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD
        +L+ +++++ AF+W  T S R+ PV R+  PE    + E+FP +DVF+CTADP KEP + V+NT LS +A +YP +K++VY+SDDGGS +TL+A+ +A  
Subjt:  ALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK-EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGD

Query:  FAKAWVPFCKEYGINSRCPEVYFSA
        F+K W+PFCK+  +  R PEVYFS+
Subjt:  FAKAWVPFCKEYGINSRCPEVYFSA

AT4G24000.1 cellulose synthase like G2 (e value: 5.6e-32; % identity: 49.19)
Query:  LVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK-EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGDF
        L+ +++++ AF+W  T S R  P+ R+  PE    K E+FP +DVF+CTADP KEP + V+NT LS +A +YP  K++VY+SDDGGS +TL+A+ +A  F
Subjt:  LVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK-EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGDF

Query:  AKAWVPFCKEYGINSRCPEVYFSA
        +K W+PFCK   +  R PEVYFS+
Subjt:  AKAWVPFCKEYGINSRCPEVYFSA

AT4G24010.1 cellulose synthase like G1 (e value: 5.6e-32; % identity: 49.19)
Query:  LVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK-EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGDF
        L+ +++++ AF+W  T S R  PV R+  PE    K E+FP +DVF+CTADP KEP + V+NT LS +A +YP +K++VY+SDDGGS +T +A+ +A  F
Subjt:  LVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGK-EEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGDF

Query:  AKAWVPFCKEYGINSRCPEVYFSA
        +K W+PFCK+  +  R PEVYFS+
Subjt:  AKAWVPFCKEYGINSRCPEVYFSA
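
The page does not say how the % identity figures are computed. The sketch below assumes the usual BLAST-style midline convention: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `identity_from_midline` is hypothetical, and the example input is the second midline block of the first GenBank hit (KAA8527734.1).

```python
def identity_from_midline(midline: str, aligned_length: int) -> tuple[int, float]:
    """Return (identical positions, percent identity) for one alignment block.

    Assumes BLAST-style midlines: letters are identities, '+' and spaces are not.
    The aligned length is passed separately because trailing spaces on the
    midline are easily lost when a page is copied as text.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, 100.0 * identical / aligned_length


# Second alignment block of the KAA8527734.1 hit, copied from the record above.
query = "FAKAWVPFCKEYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK"
midline = "FA++W+PFC++YGI +RCPE YFS+  D ER+ R D+F A+E++IK"

identical, percent = identity_from_midline(midline, len(query))
print(identical, round(percent, 2))
```

Under this assumption, summing identical positions over both blocks of that hit gives 100 of 146 aligned residues, which is consistent with the reported 68.49.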


Sequences
CDS sequence
TGGGCACTCGTCACCGTCGCCGAACTTCTCTTCGCCTTCGTCTGGCTCCTAACGCAGTCGTTCCGATGGCTGCCGGTGTCCCGCTCCGTCTCGCCGGAGAATCTTCCGGG
CAAAGAGGAGTTTCCCGGCGTCGACGTGTTCGTCTGCACGGCGGATCCGGCGAAAGAGCCGACGGTGGAGGTGATGAACACGGTTCTGTCGTCTCTGGCTCTGGACTACC
CGCCGGAGAAGCTCGCCGTGTATCTCTCCGACGACGGCGGGTCTCCGGTTACTCTGTACGCCGTGAAACAAGCCGGTGATTTCGCGAAGGCGTGGGTCCCTTTCTGTAAA
GAATACGGAATTAACTCGAGATGTCCTGAAGTTTACTTCTCGGCGCTTGCCGACGACGAGCGGATCTTTCGAGACGACAAATTCGTAGCAGAGGAGAAAGAAATTAAG
mRNA sequence
TGGGCACTCGTCACCGTCGCCGAACTTCTCTTCGCCTTCGTCTGGCTCCTAACGCAGTCGTTCCGATGGCTGCCGGTGTCCCGCTCCGTCTCGCCGGAGAATCTTCCGGG
CAAAGAGGAGTTTCCCGGCGTCGACGTGTTCGTCTGCACGGCGGATCCGGCGAAAGAGCCGACGGTGGAGGTGATGAACACGGTTCTGTCGTCTCTGGCTCTGGACTACC
CGCCGGAGAAGCTCGCCGTGTATCTCTCCGACGACGGCGGGTCTCCGGTTACTCTGTACGCCGTGAAACAAGCCGGTGATTTCGCGAAGGCGTGGGTCCCTTTCTGTAAA
GAATACGGAATTAACTCGAGATGTCCTGAAGTTTACTTCTCGGCGCTTGCCGACGACGAGCGGATCTTTCGAGACGACAAATTCGTAGCAGAGGAGAAAGAAATTAAG
Protein sequence
WALVTVAELLFAFVWLLTQSFRWLPVSRSVSPENLPGKEEFPGVDVFVCTADPAKEPTVEVMNTVLSSLALDYPPEKLAVYLSDDGGSPVTLYAVKQAGDFAKAWVPFCK
EYGINSRCPEVYFSALADDERIFRDDKFVAEEKEIK
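
The listed location, CDS, and protein can be cross-checked against each other. The following is a minimal sketch, assuming the genome location uses 1-based inclusive coordinates and that the CDS is translated with the standard genetic code in frame 1; neither is stated on the page. The helper names `span_length` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(location: str) -> int:
    """Length in bp of a 'scaffold:start..end' location (assumed 1-based, inclusive)."""
    _, coords = location.rsplit(":", 1)
    start, end = (int(value) for value in coords.split(".."))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


cds_prefix = "TGGGCACTCGTCACCGTCGCCGAACTTCTC"  # first 30 nt of the CDS above
print(span_length("scaffold13:521376..521813"))  # 438
print(translate(cds_prefix))                     # WALVTVAELL
```

A span of 438 bp corresponds to 146 codons, which matches the 146 residues of the protein sequence, and the translated prefix matches its first ten residues.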