CuGenDBv2

Cucsat.G6421 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G6421
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog.
Genome location: ctg1449:350930..353673
RNA-Seq Expression: Cucsat.G6421
Synteny: Cucsat.G6421
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_008464851.1 PREDICTED: uncharacterized protein LOC103502625 isoform X1 [Cucumis melo] (e value: 5.89e-160; % identity: 93.25)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_008464853.1 PREDICTED: uncharacterized protein LOC103502625 isoform X2 [Cucumis melo] (e value: 4.46e-157; % identity: 92.46)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIE+ LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_011657346.1 uncharacterized protein LOC101208160 isoform X1 [Cucumis sativus] (e value: 3.09e-171; % identity: 98.41)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
        EVDVKPGKKRASGGSTSCAP LFSSF A SPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_011657348.1 uncharacterized protein LOC101208160 isoform X2 [Cucumis sativus] (e value: 2.35e-168; % identity: 97.61)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
        EVDVKPGKKRASGGSTSCAP LFSSF A SPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KGVIE+ LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_011657349.1 uncharacterized protein LOC101208160 isoform X3 [Cucumis sativus] (e value: 1.36e-144; % identity: 87.25)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
        EVDVKPGKKRASG                               EKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CME9 uncharacterized protein LOC103502625 isoform X1 (e value: 2.85e-160; % identity: 93.25)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X2 (e value: 2.16e-157; % identity: 92.46)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIE+ LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

A0A5A7TAR2 Uncharacterized protein (e value: 3.30e-142; % identity: 89.63)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLS---VEEGKE
        QKGVIE+ LHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLS---VEEGKE

A0A6J1G2G3 uncharacterized protein LOC111450215 (e value: 1.08e-143; % identity: 86.51)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGM SCETHLSMY+SKQSPIA KKVALRDVQNDNRSV+YNYPETSC+LGGKL+NGSK+SGSKRSNPTCSPSSAIHQSFKGIGVNE  +YANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRA GGSTSCAP  FSS LAASP AFSP RSS PIFTEKPGNFLAVAGSNLLGI PGLEIL S DSNGITDEQR+ERLF+LQ LLKH D++D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKG IE+ LHGLPPSELS  AINLEK+SMHLSVEEGKEIQRMKALNIL NLQ
Subjt:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

A0A6J1KE02 uncharacterized protein LOC111494010 (e value: 4.38e-143; % identity: 85.71)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LN+GGM SCETHLSMY+SKQSPIA KKVALRDVQNDNRSV+YNYPETSC+LGGKL+NGSK+SGSKRSNPTCSP+SAIHQSFKGIGVNE  +YANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRA GGSTSCAPT FSS LAASP AFSP RSS PIFTEKPGNFLAV GSNLLGI PGLEIL S DSNGITDEQR+ERLF+LQ LLKH D++D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKG IE+ LHGLPPSELS  AINLEK+SMHLSVEEGKEIQRMKALNIL NLQ
Subjt:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G45250.1 Integral membrane protein hemolysin-III homolog (e value: 5.6e-08; % identity: 44.62)
Query:  ERLFHLQKLLKHFDKTDQKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL
        ER  HLQ LL   +++D+   ++ +L  L  +ELS  A++LEKRS+  S+EE +E+QR+ ALN+L
Subjt:  ERLFHLQKLLKHFDKTDQKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) (e value: 9.6e-08; % identity: 44.62)
Query:  ERLFHLQKLLKHFDKTDQKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL
        ER  HLQ LL   +++D+   ++ +L  L  +ELS  A++LEKRS+  S+EE +E+QR+ ALN+L
Subjt:  ERLFHLQKLLKHFDKTDQKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL


Sequences
CDS sequence
ATGATTGATAGCGAGTTGAATAGTGGTGGAATGAGCAGCTGCGAAACTCATTTGTCTATGTATCAGAGCAAACAGTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGA
TGTGCAAAATGATAATAGGAGCGTCATATATAACTATCCTGAAACTTCCTGTGCTTTGGGTGGAAAACTTATGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACC
CTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGAATTGGGGTAAATGAGCCCAATGTTTATGCCAACGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGA
GCATCGGGAGGTAGCACATCTTGTGCACCTACATTATTTTCTTCTTTTCTTGCAGCCTCTCCAACGGCATTTTCACCTAGGTCTTCATTTCCCATTTTCACAGAAAAGCC
TGGTAATTTTCTGGCTGTTGCTGGATCCAATCTTCTGGGAATCTCTCCTGGTTTGGAGATTCTTCGATCTGATGATTCAAATGGGATTACTGATGAGCAGAGATCAGAGC
GTTTATTCCATCTACAGAAGCTCCTAAAACATTTTGACAAGACGGATCAAAAAGGGGTCATTGAGATAGTGCTCCATGGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCC
ATTAATCTGGAAAAGAGATCCATGCACCTGTCAGTAGAGGAAGGGAAAGAGATCCAACGGATGAAGGCTTTAAATATTCTGAGCAACCTTCAGTGA
mRNA sequence
ATGATTGATAGCGAGTTGAATAGTGGTGGAATGAGCAGCTGCGAAACTCATTTGTCTATGTATCAGAGCAAACAGTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGA
TGTGCAAAATGATAATAGGAGCGTCATATATAACTATCCTGAAACTTCCTGTGCTTTGGGTGGAAAACTTATGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACC
CTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGAATTGGGGTAAATGAGCCCAATGTTTATGCCAACGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGA
GCATCGGGAGGTAGCACATCTTGTGCACCTACATTATTTTCTTCTTTTCTTGCAGCCTCTCCAACGGCATTTTCACCTAGGTCTTCATTTCCCATTTTCACAGAAAAGCC
TGGTAATTTTCTGGCTGTTGCTGGATCCAATCTTCTGGGAATCTCTCCTGGTTTGGAGATTCTTCGATCTGATGATTCAAATGGGATTACTGATGAGCAGAGATCAGAGC
GTTTATTCCATCTACAGAAGCTCCTAAAACATTTTGACAAGACGGATCAAAAAGGGGTCATTGAGATAGTGCTCCATGGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCC
ATTAATCTGGAAAAGAGATCCATGCACCTGTCAGTAGAGGAAGGGAAAGAGATCCAACGGATGAAGGCTTTAAATATTCTGAGCAACCTTCAGTGA
Protein sequence
MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEPNVYANGEVDVKPGKKR
ASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQKGVIEIVLHGLPPSELSNFA
INLEKRSMHLSVEEGKEIQRMKALNILSNLQ
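
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch for checking that relationship, the snippet below translates a nucleotide string with the standard genetic code (NCBI translation table 1). The function name `translate` and the choice of the standard table are assumptions made for illustration; they are not taken from CuGenDBv2.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a CDS string into a protein string (hypothetical helper).

    Whitespace and line breaks are ignored, translation stops at the first
    stop codon, and a trailing partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the CDS listed above.
print(translate("ATGATTGATAGCGAGTTGAATAGTGGTGGAATGAGC"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result against the listed protein sequence is one way to confirm that the two records agree.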