CuGenDBv2

CsGy6G018130 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy6G018130
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Integral membrane protein hemolysin-III homolog
Genome location: Gy14Chr6:18925858..18928848
RNA-Seq Expression: CsGy6G018130
Synteny: CsGy6G018130
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment):
XP_008464851.1 PREDICTED: uncharacterized protein LOC103502625 isoform X1 [Cucumis melo] (e-value: 1.33e-156; identity: 92.06%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRS+PTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEM-LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIE+ LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIEM-LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_008464853.1 PREDICTED: uncharacterized protein LOC103502625 isoform X2 [Cucumis melo] (e-value: 4.66e-159; identity: 92.83%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRS+PTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_011657346.1 uncharacterized protein LOC101208160 isoform X1 [Cucumis sativus] (e-value: 6.98e-168; identity: 97.21%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRS+PTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
        EVDVKPGKKRASGGSTSCAP LFSSF A SPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ

Query:  KGVIEM-LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KGVIE+ LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  KGVIEM-LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_011657348.1 uncharacterized protein LOC101208160 isoform X2 [Cucumis sativus] (e-value: 2.45e-170; identity: 98%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRS+PTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
        EVDVKPGKKRASGGSTSCAP LFSSF A SPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ

Query:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_038884507.1 uncharacterized protein LOC120075309 [Benincasa hispida] (e-value: 3.42e-145; identity: 86.45%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGM SCETHLSMYQ+KQSPIAQKKVALRDVQNDNRS++YNYPETSC+LGGKL+NGSKLSGSKRS+PT SPSSAIHQSFKGIGVNE  +YA+G
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSS LAASP A SP RSS PIFTEKPGNFLAVAGS+LLGI PG EILRS DSNGITDEQR+ERLF+LQK LKH D++D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        +KGVIE LHGLPPSELSNFAINLEKRSM+LSVEEGKEIQRMKALNIL NLQ
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

TrEMBL top hits (e-value, % identity, alignment):
A0A1S3CME9 uncharacterized protein LOC103502625 isoform X1 (e-value: 6.42e-157; identity: 92.06%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRS+PTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEM-LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIE+ LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIEM-LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X2 (e-value: 2.26e-159; identity: 92.83%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRS+PTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

A0A5A7TAR2 Uncharacterized protein (e-value: 4.92e-144; identity: 90%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSC+LGGKLMNGSKLSGSKRS+PTCSPSSAIHQSFKGIGVNE NVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAP  FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLS---VEEGKE
        QKGVIEMLHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLS---VEEGKE

A0A6J1G2G3 uncharacterized protein LOC111450215 (e-value: 4.58e-145; identity: 86.45%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LNSGGM SCETHLSMY+SKQSPIA KKVALRDVQNDNRSV+YNYPETSC+LGGKL+NGSK+SGSKRS+PTCSPSSAIHQSFKGIGVNE  +YANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRA GGSTSCAP  FSS LAASP AFSP RSS PIFTEKPGNFLAVAGSNLLGI PGLEIL S DSNGITDEQR+ERLF+LQ LLKH D++D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKG IE+LHGLPPSELS  AINLEK+SMHLSVEEGKEIQRMKALNIL NLQ
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

A0A6J1KE02 uncharacterized protein LOC111494010 (e-value: 1.86e-144; identity: 85.66%)
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG
        MIDS+LN+GGM SCETHLSMY+SKQSPIA KKVALRDVQNDNRSV+YNYPETSC+LGGKL+NGSK+SGSKRS+PTCSP+SAIHQSFKGIGVNE  +YANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANG

Query:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRA GGSTSCAPT FSS LAASP AFSP RSS PIFTEKPGNFLAV GSNLLGI PGLEIL S DSNGITDEQR+ERLF+LQ LLKH D++D
Subjt:  EVDVKPGKKRASGGSTSCAPTLFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKG IE+LHGLPPSELS  AINLEK+SMHLSVEEGKEIQRMKALNIL NLQ
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

SwissProt top hits (e-value, % identity, alignment):
No hits found

Arabidopsis top hits (e-value, % identity, alignment):
AT2G45250.1 Integral membrane protein hemolysin-III homolog (e-value: 6.0e-10; identity: 46.88%)
Query:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL
        ER  HLQ LL   +++D+   ++ML  L  +ELS  A++LEKRS+  S+EE +E+QR+ ALN+L
Subjt:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL

AT2G45250.2 Integral membrane protein hemolysin-III homolog (e-value: 2.0e-05; identity: 45.1%)
Query:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEE
        ER  HLQ LL   +++D+   ++ML  L  +ELS  A++LEKRS+  S+EE
Subjt:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEE

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) (e-value: 1.0e-09; identity: 46.88%)
Query:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL
        ER  HLQ LL   +++D+   ++ML  L  +ELS  A++LEKRS+  S+EE +E+QR+ ALN+L
Subjt:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL


Sequences
CDS sequence
ATGATTGATAGCGAGTTGAATAGTGGTGGAATGAGCAGCTGCGAAACTCATTTGTCTATGTATCAGAGCAAACAGTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGA
TGTGCAAAATGATAATAGGAGCGTCATATATAACTATCCTGAAACTTCCTGTGCTTTGGGTGGAAAACTTATGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCGACC
CTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGAATTGGGGTAAATGAGCCCAATGTTTATGCCAACGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGA
GCATCGGGAGGTAGCACATCTTGTGCACCTACATTATTTTCTTCTTTTCTTGCAGCCTCTCCAACGGCATTTTCACCTAGGTCTTCATTTCCCATTTTCACAGAAAAGCC
TGGTAATTTTCTGGCTGTTGCTGGATCCAATCTTCTGGGAATCTCTCCTGGTTTGGAGATTCTTCGATCTGATGATTCAAATGGGATTACTGATGAGCAGAGATCAGAGC
GTTTATTCCATCTACAGAAGCTCCTAAAACATTTTGACAAGACGGATCAAAAAGGGGTCATTGAGATGCTCCATGGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCCATT
AATCTGGAAAAGAGATCCATGCACCTGTCAGTAGAGGAAGGGAAAGAGATCCAACGGATGAAGGCTTTAAATATTCTGAGCAACCTTCAGTGA
mRNA sequence
CCCATCTTCTTCGAGCCATACTTTTCCCTTTTGAACTCGTCTCTCCCTCTCCTTCTCCCTCTCCCTCTCCCCCTCCCTCTCCCCGCTTCCACAACAACAAGCTTCATACT
CAGATGGAGTTTTCTAGGAGATAATTTGAAGTTATTATTTGGTTGAAGCTCTTCAAGTTTTCTAATGATTGATAGCGAGTTGAATAGTGGTGGAATGAGCAGCTGCGAAA
CTCATTTGTCTATGTATCAGAGCAAACAGTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGATGTGCAAAATGATAATAGGAGCGTCATATATAACTATCCTGAAACT
TCCTGTGCTTTGGGTGGAAAACTTATGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCGACCCTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGAAT
TGGGGTAAATGAGCCCAATGTTTATGCCAACGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGAGCATCGGGAGGTAGCACATCTTGTGCACCTACATTATTTTCTTCTT
TTCTTGCAGCCTCTCCAACGGCATTTTCACCTAGGTCTTCATTTCCCATTTTCACAGAAAAGCCTGGTAATTTTCTGGCTGTTGCTGGATCCAATCTTCTGGGAATCTCT
CCTGGTTTGGAGATTCTTCGATCTGATGATTCAAATGGGATTACTGATGAGCAGAGATCAGAGCGTTTATTCCATCTACAGAAGCTCCTAAAACATTTTGACAAGACGGA
TCAAAAAGGGGTCATTGAGATGCTCCATGGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCCATTAATCTGGAAAAGAGATCCATGCACCTGTCAGTAGAGGAAGGGAAAG
AGATCCAACGGATGAAGGCTTTAAATATTCTGAGCAACCTTCAGTGAAATCTGAAAGAAGTTTTGTTACACTGATGAATAACGATTTTGGAGACGAACGAGACGAAAGGG
CCATTTGGCATTTCTGCGATAATTGCACTAGTGGACAAACTAGTCGATTGGTTGATGAAACAGTCTGCCGTAATTAAGCAAACCAATATTCCTTGAGAGCTGGCAAGGTA
GTCTTTCTATGGAGCTTTAGGTTATGGAGTTTGATTTGGGCAACATGAGGTTAAAAAATGTTGCTCCTGTCAACATGAAACTCCTCCACCAGGAGTTTCATTTGAATCAG
TGGAAGGGGTTTTGAAAGTTTTTTTTTTTTTTTTTTAAAAAAAGAAGAAAATAGTTTAGTTACATGTTGAGTTGATTACACTTTGGAATCTAACATTCAAATCAGTCCAA
TTACCATGTGATTGTTGTCTTAATCATTAGCTTCATTTTGGAAGTTTTTAACCCATGATTTCCCTCCCTTTTTTCTTTTTTCTTTCGGCTATGATTAAACTATGTATAAC
ATGCGATTGGGGATTCAAATTTGAATCTTTTGGGATTTTGATCCAACTTATAAGTGGTGTTTGTTGTACCTTTGTCA
Protein sequence
MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSDPTCSPSSAIHQSFKGIGVNEPNVYANGEVDVKPGKKR
ASGGSTSCAPTLFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAI
NLEKRSMHLSVEEGKEIQRMKALNILSNLQ