CuGenDBv2

CSPI06G18670 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G18670
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Integral membrane protein hemolysin-III homolog
Genome location: Chr6:16958819..16961735
RNA-Seq Expression: CSPI06G18670
Synteny: CSPI06G18670
Gene Ontology terms: NA
InterPro domains: NA
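The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the following splits that string and derives the locus span. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr6:16958819..16961735")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr6 16958819 16961735 2917
```

Under that assumption the locus covers 2917 bp, which is longer than the mRNA listed further down and would be consistent with the gene containing introns.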


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0038455.1 uncharacterized protein E6C27_scaffold119G00220 [Cucumis melo var. makuwa] | e value: 8.6e-135 | % identity: 93.33
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAPA FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGDERDERAI
        QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGD+R   AI
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGDERDERAI

XP_008464851.1 PREDICTED: uncharacterized protein LOC103502625 isoform X1 [Cucumis melo] | e value: 1.3e-111 | % identity: 90.04
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAPA FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIE-MLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE
        QKGVIE +LHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  QKGVIE-MLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE

XP_008464853.1 PREDICTED: uncharacterized protein LOC103502625 isoform X2 [Cucumis melo] | e value: 1.9e-113 | % identity: 90.83
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAPA FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE
        QKGVIEMLHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE

XP_011657346.1 uncharacterized protein LOC101208160 isoform X1 [Cucumis sativus] | e value: 3.5e-120 | % identity: 95.42
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
        EVDVKPGKKRASGGSTSCAPALFSSF A SPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ

Query:  KGVIE-MLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE
        KGVIE +LHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  KGVIE-MLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE

XP_011657348.1 uncharacterized protein LOC101208160 isoform X2 [Cucumis sativus] | e value: 4.9e-122 | % identity: 96.23
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
        EVDVKPGKKRASGGSTSCAPALFSSF A SPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ

Query:  KGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE
        KGVIEMLHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  KGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KCZ3 Uncharacterized protein | e value: 2.8e-139 | % identity: 85.86
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
        EVDVKPGKKRASG                               EKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQ

Query:  KGVIEMLHGLPPSELSNFAINLEKRSMHL------------ERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGDERDERAIWHFCDNCTSGQISRLVDET
        KGVIEMLHGLPPSELSNFAINLEKRSMHL            ERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGDERDERAIWHFCDNCTSGQISRLVDET
Subjt:  KGVIEMLHGLPPSELSNFAINLEKRSMHL------------ERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGDERDERAIWHFCDNCTSGQISRLVDET

Query:  VCRN
        VCRN
Subjt:  VCRN

A0A1S3CME9 uncharacterized protein LOC103502625 isoform X1 | e value: 6.5e-112 | % identity: 90.04
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAPA FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIE-MLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE
        QKGVIE +LHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  QKGVIE-MLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE

A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X2 | e value: 9.0e-114 | % identity: 90.83
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAPA FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE
        QKGVIEMLHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE

A0A5A7TAR2 Uncharacterized protein | e value: 4.2e-135 | % identity: 93.33
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRASGGSTSCAPA FSSFLA SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGDERDERAI
        QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGD+R   AI
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGDERDERAI

A0A6J1G2G3 uncharacterized protein LOC111450215 | e value: 2.5e-103 | % identity: 84.58
Query:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDS+LNSGGM SCETHLSMY+SKQSPIA KKVALRDVQNDNRSV+YNYPETSC+LGGKL+NGSK+SGSKRSNPTCSPSSAIHQSFKGIGVNEH +YANG
Subjt:  MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD
        EVDVKPGKKRA GGSTSCAPA FSS LAASP AFSP RSS PIFTEKPGNFLAVAGSNLLGI PGLEIL S DSNGITDEQR+ERLF+LQ LLKH D++D
Subjt:  EVDVKPGKKRASGGSTSCAPALFSSFLAASPTAFSP-RSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE
        QKG IE+LHGLPPSELS  AINLEK+SMHL     +EG E
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFE

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G45250.1 Integral membrane protein hemolysin-III homolog | e value: 7.5e-04 | % identity: 38.89
Query:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDE
        ER  HLQ LL   +++D+   ++ML  L  +ELS  A++LEKRS+    +   E
Subjt:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDE

AT2G45250.2 Integral membrane protein hemolysin-III homolog | e value: 1.2e-04 | % identity: 33.8
Query:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFEYSEQPSVKSERSFV
        ER  HLQ LL   +++D+   ++ML  L  +ELS  A++LEKRS+    +     ++     S ++ERS +
Subjt:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDEGFEYSEQPSVKSERSFV

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) | e value: 7.5e-04 | % identity: 38.89
Query:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDE
        ER  HLQ LL   +++D+   ++ML  L  +ELS  A++LEKRS+    +   E
Subjt:  ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDE
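
The %identity values in the hit tables can be checked against the alignment midlines. The sketch below assumes the usual BLAST-style convention: a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The helper name `percent_identity` is hypothetical. The input is the query and midline of the AT2G45250.1 hit listed above.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Query and midline copied from the AT2G45250.1 alignment.
query = "ERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAINLEKRSMHLERDPTDE"
midline = "ER  HLQ LL   +++D+   ++ML  L  +ELS  A++LEKRS+    +   E"

# The midline must span the same number of columns as the query line.
assert len(query) == len(midline)

print(round(percent_identity(midline), 2))  # 38.89
```

For this hit, 21 of the 54 alignment columns are identical, which reproduces the 38.89 shown in the table. The midline is used rather than the Subjt line because it records identities directly.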


Sequences
CDS sequence
ATGATTGATAGCGAGTTGAATAGTGGTGGAATGAGCAGCTGCGAAACTCATTTGTCTATGTATCAGAGCAAACAGTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGA
TGTGCAAAATGATAATAGGAGCGTCATATATAACTATCCTGAAACTTCCTGTGCTTTGGGTGGAAAACTTATGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACC
CTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGAATTGGGGTAAATGAGCACAATGTTTATGCCAACGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGA
GCATCGGGAGGTAGCACATCTTGTGCACCTGCATTATTTTCTTCTTTTCTTGCAGCCTCTCCAACGGCATTTTCACCTAGGTCTTCATTTCCCATTTTCACAGAAAAGCC
TGGTAATTTTCTGGCTGTTGCTGGATCCAATCTTCTGGGAATCTCTCCTGGTTTGGAGATTCTTCGATCTGATGATTCAAATGGGATTACTGATGAGCAGAGATCAGAGC
GTTTATTCCATCTACAGAAGCTCCTAAAACATTTTGACAAGACGGATCAAAAAGGGGTCATTGAGATGCTCCATGGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCCATT
AATCTGGAAAAGAGATCCATGCACCTGGAAAGAGATCCAACGGATGAAGGCTTTGAATATTCTGAGCAACCTTCAGTGAAATCTGAAAGAAGTTTTGTTACACTGATGAA
TAACGATTTTGGAGACGAACGAGACGAAAGGGCCATTTGGCATTTCTGCGATAATTGCACGAGTGGACAAATTAGTCGATTGGTTGATGAAACAGTCTGCCGTAATTGA
mRNA sequence
AAAAAACAAAACCCATCTTCTTCGAGCCATACTTTTCCCTTTTGAACTCGTCTCTCCCTCTCCTTCTCCCTCTCCCTCTCCCCCTCCCTCTCCCCGCTTCCACAACAACA
AGCTTCATACTCAGATGGAGTTTTCTAGGAGATAATTTGAAGTTATTATTTGGTTGAAGCTCTTCAAGTTTTCTAATGATTGATAGCGAGTTGAATAGTGGTGGAATGAG
CAGCTGCGAAACTCATTTGTCTATGTATCAGAGCAAACAGTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGATGTGCAAAATGATAATAGGAGCGTCATATATAACT
ATCCTGAAACTTCCTGTGCTTTGGGTGGAAAACTTATGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACCCTACATGCTCACCGAGCTCTGCAATCCATCAATCC
TTCAAAGGAATTGGGGTAAATGAGCACAATGTTTATGCCAACGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGAGCATCGGGAGGTAGCACATCTTGTGCACCTGCATT
ATTTTCTTCTTTTCTTGCAGCCTCTCCAACGGCATTTTCACCTAGGTCTTCATTTCCCATTTTCACAGAAAAGCCTGGTAATTTTCTGGCTGTTGCTGGATCCAATCTTC
TGGGAATCTCTCCTGGTTTGGAGATTCTTCGATCTGATGATTCAAATGGGATTACTGATGAGCAGAGATCAGAGCGTTTATTCCATCTACAGAAGCTCCTAAAACATTTT
GACAAGACGGATCAAAAAGGGGTCATTGAGATGCTCCATGGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCCATTAATCTGGAAAAGAGATCCATGCACCTGGAAAGAGA
TCCAACGGATGAAGGCTTTGAATATTCTGAGCAACCTTCAGTGAAATCTGAAAGAAGTTTTGTTACACTGATGAATAACGATTTTGGAGACGAACGAGACGAAAGGGCCA
TTTGGCATTTCTGCGATAATTGCACGAGTGGACAAATTAGTCGATTGGTTGATGAAACAGTCTGCCGTAATTGA
Protein sequence
MIDSELNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCALGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANGEVDVKPGKKR
ASGGSTSCAPALFSSFLAASPTAFSPRSSFPIFTEKPGNFLAVAGSNLLGISPGLEILRSDDSNGITDEQRSERLFHLQKLLKHFDKTDQKGVIEMLHGLPPSELSNFAI
NLEKRSMHLERDPTDEGFEYSEQPSVKSERSFVTLMNNDFGDERDERAIWHFCDNCTSGQISRLVDETVCRN
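
The CDS and protein listings above can be cross-checked by translating the CDS. The following is a minimal sketch assuming the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 36 and last 15 nucleotides of the CDS are used to keep the example short; a full check would pass the whole CDS string.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGATTGATAGCGAGTTGAATAGTGGTGGAATGAGC"  # first 36 nt of the CDS
cds_end = "GTCTGCCGTAATTGA"  # last 15 nt of the CDS, including the stop codon

print(translate(cds_start))  # MIDSELNSGGMS
print(translate(cds_end))    # VCRN
```

Both fragments match the start (`MIDSELNSGGMS`) and end (`VCRN`) of the protein sequence listed above.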