; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G002900 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G002900
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description10 kDa chaperonin
Genome locationCG_Chr05:2778952..2782376
RNA-Seq ExpressionClCG05G002900
SyntenyClCG05G002900
Gene Ontology termsGO:0051085 - chaperone cofactor-dependent protein refolding (biological process)
GO:0005759 - mitochondrial matrix (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051082 - unfolded protein binding (molecular function)
GO:0051087 - chaperone binding (molecular function)
InterPro domainsIPR011032 - GroES-like superfamily
IPR020818 - GroES chaperonin family
IPR037124 - GroES chaperonin superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004133964.1 10 kDa chaperonin 1, chloroplastic [Cucumis sativus]9.7e-6090.07Show/hide
Query:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG
        MAMASTF TVPKPFIN PN+S SVS RRLI GGLRSS L VSAISKK EPAKVVPQADRVLVRLEELPEKS GGVLLPKSAVKFERYLVG ILSVGT+VG
Subjt:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG

Query:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
         NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLA+VE
Subjt:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

XP_008438275.1 PREDICTED: 10 kDa chaperonin isoform X1 [Cucumis melo]7.9e-6292.91Show/hide
Query:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG
        MAMASTF+TVPKPFIN PN+S SVSARRLIIGGLRSSTL VSAIS+K EPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVG +VG
Subjt:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG

Query:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
         NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
Subjt:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

XP_022936135.1 10 kDa chaperonin 1, chloroplastic-like [Cucurbita moschata]4.2e-5582.14Show/hide
Query:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE
        MAMASTFVTVPKPF +  NSSSVS+R+ I GG R S+L VSA+SKK EP KVVPQADRVL+RLEELPEKSAGGVLLPKSAVKFER+LVGEILS+G++VG 
Subjt:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE

Query:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
        +D+APGKKV+LSDINAYEVDLGTDAKHCFCKA DLLAVVE
Subjt:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

XP_022974766.1 10 kDa chaperonin 1, chloroplastic-like isoform X1 [Cucurbita maxima]2.9e-5683.57Show/hide
Query:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE
        MAMASTFVTVPKPF +  NSSSVS+RRLI GG R S+L VSA+SKK EP KVVPQADRVL+RLEELPEKSAGGVLLPKSAVKFER+LVGEILS+G++VG 
Subjt:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE

Query:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
        +D+APGKKV+LSDINAYEVDLGTDAKHCFCKA DLLAVVE
Subjt:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

XP_038880425.1 10 kDa chaperonin 1, chloroplastic-like [Benincasa hispida]6.5e-6495.71Show/hide
Query:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE
        MAMASTFVTVPK  INRPNS SVSARRLIIGGLRSSTL VSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG 
Subjt:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE

Query:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
        NDIAPGKKVLLSD+NAYEVDLGTDAKHCFCKAGDLLAVVE
Subjt:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

TrEMBL top hitse value%identityAlignment
A0A0A0L6L9 Uncharacterized protein4.7e-6090.07Show/hide
Query:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG
        MAMASTF TVPKPFIN PN+S SVS RRLI GGLRSS L VSAISKK EPAKVVPQADRVLVRLEELPEKS GGVLLPKSAVKFERYLVG ILSVGT+VG
Subjt:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG

Query:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
         NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLA+VE
Subjt:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

A0A1S3AWL8 10 kDa chaperonin isoform X13.8e-6292.91Show/hide
Query:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG
        MAMASTF+TVPKPFIN PN+S SVSARRLIIGGLRSSTL VSAIS+K EPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVG +VG
Subjt:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG

Query:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
         NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
Subjt:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

A0A5D3CZW2 10 kDa chaperonin isoform X13.8e-6292.91Show/hide
Query:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG
        MAMASTF+TVPKPFIN PN+S SVSARRLIIGGLRSSTL VSAIS+K EPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVG +VG
Subjt:  MAMASTFVTVPKPFINRPNSS-SVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG

Query:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
         NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
Subjt:  ENDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

A0A6J1FCS3 10 kDa chaperonin 1, chloroplastic-like2.0e-5582.14Show/hide
Query:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE
        MAMASTFVTVPKPF +  NSSSVS+R+ I GG R S+L VSA+SKK EP KVVPQADRVL+RLEELPEKSAGGVLLPKSAVKFER+LVGEILS+G++VG 
Subjt:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE

Query:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
        +D+APGKKV+LSDINAYEVDLGTDAKHCFCKA DLLAVVE
Subjt:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

A0A6J1ICA1 10 kDa chaperonin 1, chloroplastic-like isoform X11.4e-5683.57Show/hide
Query:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE
        MAMASTFVTVPKPF +  NSSSVS+RRLI GG R S+L VSA+SKK EP KVVPQADRVL+RLEELPEKSAGGVLLPKSAVKFER+LVGEILS+G++VG 
Subjt:  MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE

Query:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
        +D+APGKKV+LSDINAYEVDLGTDAKHCFCKA DLLAVVE
Subjt:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

SwissProt top hitse value%identityAlignment
A2C4I3 10 kDa chaperonin1.7e-0637Show/hide
Query:  LEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGEND-------IAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVV
        L  + V P  DRV V++ E  EK+AGG+LLP +A   E+  VGE+  VG      D       ++ G KVL S     ++ LG+D ++      D+LAVV
Subjt:  LEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGEND-------IAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVV

A5GNB0 10 kDa chaperonin7.5e-0737Show/hide
Query:  LEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGEND-------IAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVV
        L  + V P  DRV V++ E  EK+AGG+LLP +A   E+  VGE++ VG      D       +  G KVL S     ++ LG+D ++      D+LAVV
Subjt:  LEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGEND-------IAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVV

O80504 10 kDa chaperonin 2, chloroplastic1.8e-4063.57Show/hide
Query:  MASTFV-TVPKPFINRP-NSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE
        MASTFV ++P PF   P  +++ S     + G R   L + AIS K EP KVVPQADRVLVRLE+LP KS+GGVLLPK+AVKFERYL GEI+SVG++VG+
Subjt:  MASTFV-TVPKPFINRP-NSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE

Query:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
          + PGK+VL SD++AYEVDLGTDA+HCFCK  DLLA+VE
Subjt:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

Q0I7U2 10 kDa chaperonin5.8e-0737Show/hide
Query:  LEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGEND-------IAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVV
        L  + V P  DRV V++ E  EK+AGG+LLP +A   E+  VGE++ VG     +D       +  G KVL S     ++ LG+D ++      D+LAVV
Subjt:  LEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGEND-------IAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVV

Q9M1C2 10 kDa chaperonin 1, chloroplastic3.8e-4364.79Show/hide
Query:  MASTFVTVPKPFINRP---NSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG
        MAS+F+TVPKPF++ P   N+ ++  + L+  G+R ++  ++A+S K EPAKVVPQADRVLVRLE LPEKS+GGVLLPKSAVKFERYL GE++SVG++VG
Subjt:  MASTFVTVPKPFINRP---NSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG

Query:  ENDIAPGKKVLLSDINAYEVDLGT-DAKHCFCKAGDLLAVVE
        E  + PGKKVL SD++AYEVD GT DAKHCFCK  DLLA+V+
Subjt:  ENDIAPGKKVLLSDINAYEVDLGT-DAKHCFCKAGDLLAVVE

Arabidopsis top hitse value%identityAlignment
AT2G44650.1 chloroplast chaperonin 101.3e-4163.57Show/hide
Query:  MASTFV-TVPKPFINRP-NSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE
        MASTFV ++P PF   P  +++ S     + G R   L + AIS K EP KVVPQADRVLVRLE+LP KS+GGVLLPK+AVKFERYL GEI+SVG++VG+
Subjt:  MASTFV-TVPKPFINRP-NSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGE

Query:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE
          + PGK+VL SD++AYEVDLGTDA+HCFCK  DLLA+VE
Subjt:  NDIAPGKKVLLSDINAYEVDLGTDAKHCFCKAGDLLAVVE

AT3G60210.1 GroES-like family protein2.7e-4464.79Show/hide
Query:  MASTFVTVPKPFINRP---NSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG
        MAS+F+TVPKPF++ P   N+ ++  + L+  G+R ++  ++A+S K EPAKVVPQADRVLVRLE LPEKS+GGVLLPKSAVKFERYL GE++SVG++VG
Subjt:  MASTFVTVPKPFINRP---NSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVG

Query:  ENDIAPGKKVLLSDINAYEVDLGT-DAKHCFCKAGDLLAVVE
        E  + PGKKVL SD++AYEVD GT DAKHCFCK  DLLA+V+
Subjt:  ENDIAPGKKVLLSDINAYEVDLGT-DAKHCFCKAGDLLAVVE

AT5G20720.1 chaperonin 204.5e-0732.56Show/hide
Query:  SSSVSARRLIIGGLRSS---TLTVSAISKKL-EPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVK----FERYLVGEILSVGTDVGENDIAPGKKVLL
        +SSV    L  G LR S    L V A S    +   + P  DRVLV+++E  EK+ GG+LLP +A       E   VGE  ++G +  +  +  G +++ 
Subjt:  SSSVSARRLIIGGLRSS---TLTVSAISKKL-EPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVK----FERYLVGEILSVGTDVGENDIAPGKKVLL

Query:  SDINAYEVDLGTDAKHCFCKAGDLLAVVE
        S     EV+   D KH   K  D++ ++E
Subjt:  SDINAYEVDLGTDAKHCFCKAGDLLAVVE

AT5G20720.2 chaperonin 204.5e-0732.56Show/hide
Query:  SSSVSARRLIIGGLRSS---TLTVSAISKKL-EPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVK----FERYLVGEILSVGTDVGENDIAPGKKVLL
        +SSV    L  G LR S    L V A S    +   + P  DRVLV+++E  EK+ GG+LLP +A       E   VGE  ++G +  +  +  G +++ 
Subjt:  SSSVSARRLIIGGLRSS---TLTVSAISKKL-EPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVK----FERYLVGEILSVGTDVGENDIAPGKKVLL

Query:  SDINAYEVDLGTDAKHCFCKAGDLLAVVE
        S     EV+   D KH   K  D++ ++E
Subjt:  SDINAYEVDLGTDAKHCFCKAGDLLAVVE

AT5G20720.3 chaperonin 204.5e-0732.56Show/hide
Query:  SSSVSARRLIIGGLRSS---TLTVSAISKKL-EPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVK----FERYLVGEILSVGTDVGENDIAPGKKVLL
        +SSV    L  G LR S    L V A S    +   + P  DRVLV+++E  EK+ GG+LLP +A       E   VGE  ++G +  +  +  G +++ 
Subjt:  SSSVSARRLIIGGLRSS---TLTVSAISKKL-EPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVK----FERYLVGEILSVGTDVGENDIAPGKKVLL

Query:  SDINAYEVDLGTDAKHCFCKAGDLLAVVE
        S     EV+   D KH   K  D++ ++E
Subjt:  SDINAYEVDLGTDAKHCFCKAGDLLAVVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATGGCGTCTACGTTCGTCACAGTGCCAAAGCCCTTCATCAATAGGCCTAATTCATCTTCCGTATCTGCACGGCGACTAATAATCGGAGGATTGCGAAGTAGCAC
TTTGACAGTCAGCGCAATTTCTAAGAAATTGGAGCCTGCGAAGGTGGTTCCTCAAGCTGATAGAGTTCTCGTCCGTCTTGAGGAGCTTCCTGAGAAATCAGCTGGTGGAG
TTTTGCTGCCTAAATCAGCTGTCAAATTTGAGCGGTATCTTGTAGGAGAGATTCTATCCGTCGGAACAGACGTTGGAGAAAATGATATTGCACCTGGAAAGAAGGTTCTT
TTATCCGACATAAATGCTTATGAGGTGGATTTGGGCACAGATGCTAAGCACTGCTTCTGTAAAGCTGGTGATTTGTTAGCTGTGGTTGAGTAG
mRNA sequenceShow/hide mRNA sequence
CCACTTAAGTTTTTGGTTCTCACTTCTTATCTTACTTGTAGTTCGTTAGAATTAAGAACGCTCTAGAATTTCTCTTCCCTCGCTAAAACGTTTCTCCCGCCCACCTCATT
CGATTCCCACAGAGTTTTACTTCTTCTTCGACTAGATTCATGGCGATGGCGTCTACGTTCGTCACAGTGCCAAAGCCCTTCATCAATAGGCCTAATTCATCTTCCGTATC
TGCACGGCGACTAATAATCGGAGGATTGCGAAGTAGCACTTTGACAGTCAGCGCAATTTCTAAGAAATTGGAGCCTGCGAAGGTGGTTCCTCAAGCTGATAGAGTTCTCG
TCCGTCTTGAGGAGCTTCCTGAGAAATCAGCTGGTGGAGTTTTGCTGCCTAAATCAGCTGTCAAATTTGAGCGGTATCTTGTAGGAGAGATTCTATCCGTCGGAACAGAC
GTTGGAGAAAATGATATTGCACCTGGAAAGAAGGTTCTTTTATCCGACATAAATGCTTATGAGGTGGATTTGGGCACAGATGCTAAGCACTGCTTCTGTAAAGCTGGTGA
TTTGTTAGCTGTGGTTGAGTAGAGAAGCTTCCTCCAACACTACAATGCACATCCACATCAGTTCATTTCACTCAGATTAGTTCTCTTTCCTGATTATCATTTGCTTGTTA
TCTCGTTTTATTTTCGTTTTTATTGATCTGTGTGACCTGTATAAAGATGTTAGTTATTCTCACTAGGAAGAATACCTTCGATTATTTTTAAAAAATTAGTCAACCTTAAA
AGAATAACAAGAATGATATAGTTATGCAAATAATTTTTTGCTTTTTCCACTCCTTGAAAATATAGAACCTTTTTTATGAGTATCTGTCG
Protein sequenceShow/hide protein sequence
MAMASTFVTVPKPFINRPNSSSVSARRLIIGGLRSSTLTVSAISKKLEPAKVVPQADRVLVRLEELPEKSAGGVLLPKSAVKFERYLVGEILSVGTDVGENDIAPGKKVL
LSDINAYEVDLGTDAKHCFCKAGDLLAVVE