CuGenDBv2

Cmc03g0076441 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0076441
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Integral membrane protein hemolysin-III homolog
Genome location: CMiso1.1chr03:23664515..23667603
RNA-Seq Expression: Cmc03g0076441
Synteny: Cmc03g0076441
Gene Ontology terms: NA
InterPro domains: NA
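
The genome location listed above can be split into a sequence name and a coordinate pair. The following is a minimal Python sketch, assuming the `seqid:start..end` layout seen in this record and 1-based inclusive coordinates (an assumption; the page does not state its coordinate convention). The names `Location` and `parse_location` are hypothetical helpers introduced for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome interval (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based inclusive coordinates, so both ends are counted.
        return self.end - self.start + 1


# Pattern for strings such as "CMiso1.1chr03:23664515..23667603".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(match["seqid"], start, end)


loc = parse_location("CMiso1.1chr03:23664515..23667603")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene model spans 3,089 bp on CMiso1.1chr03.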


Homology
GenBank top hits | e value | % identity
KAA0038455.1 uncharacterized protein E6C27_scaffold119G00220 [Cucumis melo var. makuwa] | 9.9e-124 | 96.65
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEMLHGLPPSELSNFAINLEKRSMHLS---VEEGKE
        KGVIEMLHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  KGVIEMLHGLPPSELSNFAINLEKRSMHLS---VEEGKE

XP_008464851.1 PREDICTED: uncharacterized protein LOC103502625 isoform X1 [Cucumis melo] | 6.9e-133 | 98.41
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIE-MLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KGVIE +LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  KGVIE-MLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_008464853.1 PREDICTED: uncharacterized protein LOC103502625 isoform X2 [Cucumis melo] | 9.6e-135 | 99.2
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_011657346.1 uncharacterized protein LOC101208160 isoform X1 [Cucumis sativus] | 4.2e-122 | 93.25
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPA-FSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSD
        EVDVKPGKKRASGGSTSCAPA FSSF A SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPA-FSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSD

Query:  QKGVIE-MLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIE +LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIE-MLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

XP_011657348.1 uncharacterized protein LOC101208160 isoform X2 [Cucumis sativus] | 5.8e-124 | 94.02
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDS+LNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPA-FSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSD
        EVDVKPGKKRASGGSTSCAPA FSSF A SP AFSP RSSFPIFTEKPGNFLAV GSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPA-FSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSD

Query:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  QKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

TrEMBL top hits | e value | % identity
A0A1S3CME9 uncharacterized protein LOC103502625 isoform X1 | 3.3e-133 | 98.41
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIE-MLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KGVIE +LHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  KGVIE-MLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X2 | 4.6e-135 | 99.2
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
Subjt:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

A0A5A7TAR2 Uncharacterized protein | 4.8e-124 | 96.65
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEMLHGLPPSELSNFAINLEKRSMHLS---VEEGKE
        KGVIEMLHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  KGVIEMLHGLPPSELSNFAINLEKRSMHLS---VEEGKE

A0A6J1G2G3 uncharacterized protein LOC111450215 | 2.0e-117 | 88.8
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSKLNSGGM SCET+L +Y+SKQSPIA KKVALRDVQNDNRSV+YNYPETSCSLGGKL+NGSK+SGSKRSNPTCSPSSAIHQSFKGIGVNEH +YANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRA GGSTSCAPAFSS LA SPMAFSPVRSS PIFTEKPGNFLAV GSNLLGI PGLEIL S DSNGITDEQRTERLFNLQ LLKH D+SDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KG IE+LHGLPPSELS  AINLEK+SMHLSVEEGKEIQRMKALNIL NLQ
Subjt:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

A0A6J1KE02 uncharacterized protein LOC111494010 | 2.0e-117 | 88
Query:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG
        MIDSKLN+GGM SCET+L +Y+SKQSPIA KKVALRDVQNDNRSV+YNYPETSCSLGGKL+NGSK+SGSKRSNPTCSP+SAIHQSFKGIGVNEH +YANG
Subjt:  MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRA GGSTSCAP FSS LA SP+AFSPVRSS PIFTEKPGNFLAVTGSNLLGIPPGLEIL S DSNGITDEQRTERLFNLQ LLKH D+SDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ
        KG IE+LHGLPPSELS  AINLEK+SMHLSVEEGKEIQRMKALNIL NLQ
Subjt:  KGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNILSNLQ

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT2G45250.1 Integral membrane protein hemolysin-III homolog | 3.0e-09 | 46.88
Query:  ERLFNLQKLLKHFDKSDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL
        ER  +LQ LL   ++SD+   ++ML  L  +ELS  A++LEKRS+  S+EE +E+QR+ ALN+L
Subjt:  ERLFNLQKLLKHFDKSDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL

AT2G45250.2 Integral membrane protein hemolysin-III homolog | 5.8e-05 | 45.1
Query:  ERLFNLQKLLKHFDKSDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEE
        ER  +LQ LL   ++SD+   ++ML  L  +ELS  A++LEKRS+  S+EE
Subjt:  ERLFNLQKLLKHFDKSDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEE

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) | 5.1e-09 | 46.88
Query:  ERLFNLQKLLKHFDKSDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL
        ER  +LQ LL   ++SD+   ++ML  L  +ELS  A++LEKRS+  S+EE +E+QR+ ALN+L
Subjt:  ERLFNLQKLLKHFDKSDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL
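
The % identity values reported for the hits can be cross-checked against the alignment midlines. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is computed over the number of aligned columns; both are assumptions, since the page does not document how the figure is derived. The function name `midline_identity` is hypothetical.

```python
from typing import Tuple


def midline_identity(query: str, midline: str) -> Tuple[int, float]:
    """Return (identical columns, percent identity) for one alignment.

    Assumes a BLAST-style midline: letters are identities, '+' marks
    conservative substitutions, and spaces mark mismatches or gaps.
    The query row (including any '-' gap characters) sets the number
    of aligned columns.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    return identities, 100.0 * identities / columns


# Query row and midline copied from the AT2G45250.1 alignment.
query = "ERLFNLQKLLKHFDKSDQKGVIEMLHGLPPSELSNFAINLEKRSMHLSVEEGKEIQRMKALNIL"
midline = "ER  +LQ LL   ++SD+   ++ML  L  +ELS  A++LEKRS+  S+EE +E+QR+ ALN+L"

matches, percent = midline_identity(query, midline)
print(matches, len(query), percent)
```

For this alignment the midline carries 30 identities over 64 columns (30/64 = 46.875%), which agrees with the 46.88 listed for AT2G45250.1 and AT4G38280.1.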


Sequences
CDS sequence
ATGATTGATAGCAAGTTGAATAGTGGTGGAATGAGCAGCTGCGAAACTAATTTGTGTGTGTATCAGAGTAAACAGTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGA
TGTGCAAAATGATAATAGGAGCGTCATATATAACTATCCTGAAACTTCCTGTTCTTTGGGCGGAAAACTTATGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACC
CTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGAATTGGGGTAAATGAGCACAACGTTTATGCCAACGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGA
GCATCGGGAGGTAGCACATCTTGTGCACCTGCATTTTCTTCTTTTCTTGCAAACTCTCCTATGGCATTTTCACCTGTTAGGTCTTCATTTCCCATTTTCACAGAAAAGCC
TGGTAATTTTCTGGCTGTCACTGGATCCAATCTTCTGGGAATCCCTCCTGGTTTGGAGATTCTTCGATCTGACGATTCAAATGGGATTACTGATGAGCAGAGAACAGAGC
GTCTATTCAATCTACAGAAGCTCCTAAAACATTTTGACAAGTCGGATCAAAAAGGGGTCATTGAGATGCTCCATGGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCCATT
AATCTGGAAAAGAGATCCATGCACCTGTCCGTAGAGGAAGGGAAAGAGATCCAACGGATGAAGGCTTTGAATATTCTGAGCAACCTTCAGTGA
mRNA sequence
CCCTCGGGCCGCTACTTTTAAATTAAATTAATTTTCAAAAAATTACAAAATTGAAAAAAAAAAAAAAACAAAAAAACAAAACCCATCTTCTTCGAGCCAAACTTTTCCCT
TTTGAACTCGTCTCTCCCTCTCCTTCTCCCTCTCCCTCTCCCCCTCCCTCTCGCCGGTTCCAGAACAACAAGCTTCACACTCAGATGGACTTTTCTAGGAGATAATTTGA
AATTGTTATTTGGTTGAAGCTCTTCAAGTTTTCTAAATGATTGATAGCAAGTTGAATAGTGGTGGAATGAGCAGCTGCGAAACTAATTTGTGTGTGTATCAGAGTAAACA
GTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGATGTGCAAAATGATAATAGGAGCGTCATATATAACTATCCTGAAACTTCCTGTTCTTTGGGCGGAAAACTTATGA
ATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACCCTACATGCTCACCGAGCTCTGCAATCCATCAATCCTTCAAAGGAATTGGGGTAAATGAGCACAACGTTTATGCC
AACGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGAGCATCGGGAGGTAGCACATCTTGTGCACCTGCATTTTCTTCTTTTCTTGCAAACTCTCCTATGGCATTTTCACC
TGTTAGGTCTTCATTTCCCATTTTCACAGAAAAGCCTGGTAATTTTCTGGCTGTCACTGGATCCAATCTTCTGGGAATCCCTCCTGGTTTGGAGATTCTTCGATCTGACG
ATTCAAATGGGATTACTGATGAGCAGAGAACAGAGCGTCTATTCAATCTACAGAAGCTCCTAAAACATTTTGACAAGTCGGATCAAAAAGGGGTCATTGAGATGCTCCAT
GGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCCATTAATCTGGAAAAGAGATCCATGCACCTGTCCGTAGAGGAAGGGAAAGAGATCCAACGGATGAAGGCTTTGAATAT
TCTGAGCAACCTTCAGTGAAATCTGAAAGAAGTTTTGTTACACTAATGAATAACGATTTTGGAGACCAGCGAGGGCCATTTGCCATTTCCGCCATAGTTGCACGAGTGGA
CAAACTAAGTGAGCAAACCAGTATTCCTTGAGAGCTGGCAAGGTAGTCTTTCTATGGAGCTTTAGGTTATGGAGTTTAATTTGGGCAACATGAAGTTAGAAAATGTTGCT
CCTATCAACATGAAACTCTTCCAGGAGTTTCATTTGAATCAGTTGGGAAAAAAACATAAGAATCCAATTAACTTTGAAGTTCGAACAATTTGGAAGGGCTTTAGAAAGTC
TTTTTTTTTTTTTTTTTTTTTTTTTTAAAAGAAGAAAATAGTTTAGTTACTTGTTGAGTTGATTACACTTTGGGATCTAACATTCAAATCTGTTCAATTACCATGTAATT
GTTGTCTAGATCATTAACTTTCTTTTGGAAGTTTTTAACCGATATTTTCTCCCTTTTTCTTTCAGTTATGATTAAATTATGTTTAACCTGCGATTGGGGATTCAAATTTG
AATCTTTTGCGATTTTGATCCGACTTATAAGTGGAGTTTAATAACTAATATTGAAGTTTGTTCCTTT
Protein sequence
MIDSKLNSGGMSSCETNLCVYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNVYANGEVDVKPGKKR
ASGGSTSCAPAFSSFLANSPMAFSPVRSSFPIFTEKPGNFLAVTGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQKGVIEMLHGLPPSELSNFAI
NLEKRSMHLSVEEGKEIQRMKALNILSNLQ
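
The protein sequence should correspond to a codon-by-codon translation of the CDS. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1), which is an assumption about how this record was produced. The `translate` function is a hypothetical helper, and only the first ten and last five codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (assumed to apply to this nuclear gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons containing unexpected characters (e.g. N) become 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 30 and last 15 nucleotides of the CDS sequence listed above.
cds_start = "ATGATTGATAGCAAGTTGAATAGTGGTGGA"
cds_end = "AGCAACCTTCAGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues of the protein (MIDSKLNSGG) and to its last four residues followed by the stop codon (SNLQ*). Passing the full CDS to the same function gives a complete comparison against the protein sequence.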