; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0020134 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0020134
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionBEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog .
Genome locationchr03:6366193..6369142
RNA-Seq ExpressionPI0020134
SyntenyPI0020134
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464851.1 PREDICTED: uncharacterized protein LOC103502625 isoform X1 [Cucumis melo]1.7e-13196.81Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDSKLNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHN+YANG
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAV GSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ
        KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEI RMKALNILSNLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ

XP_008464853.1 PREDICTED: uncharacterized protein LOC103502625 isoform X2 [Cucumis melo]2.7e-12996.02Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDSKLNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHN+YANG
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAV GSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ
        KGVIE +LHGLPPSELSNFAINLEKRSMHLSVEEGKEI RMKALNILSNLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ

XP_011657346.1 uncharacterized protein LOC101208160 isoform X1 [Cucumis sativus]4.8e-12694.84Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDS+LNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHN+YANG
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPA-FSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSD
        EVDVKPGKKRASGGSTSCAPA FSSF A SP AFSP RSSFPIFTEKPGNFLAVAGSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPA-FSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSD

Query:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ
        QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEI RMKALNILSNLQ
Subjt:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ

XP_011657348.1 uncharacterized protein LOC101208160 isoform X2 [Cucumis sativus]7.6e-12494.05Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDS+LNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSC+LGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHN+YANG
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPA-FSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSD
        EVDVKPGKKRASGGSTSCAPA FSSF A SP AFSP RSSFPIFTEKPGNFLAVAGSNLLGI PGLEILRSDDSNGITDEQR+ERLF+LQKLLKHFDK+D
Subjt:  EVDVKPGKKRASGGSTSCAPA-FSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSD

Query:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ
        QKGVIE +LHGLPPSELSNFAINLEKRSMHLSVEEGKEI RMKALNILSNLQ
Subjt:  QKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ

XP_038884507.1 uncharacterized protein LOC120075309 [Benincasa hispida]1.9e-11990.84Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDSKLNSGGM SCETHLSMYQ+KQSPIAQKKVALRDVQNDNRS++YNYPETSCSLGGKL+NGSKLSGSKRSNPT SPSSAIHQSFKGIGVNEH IYA+G
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSS LAASPMA SPVRSS PIFTEKPGNFLAVAGS+LLGIPPG EILRS DSNGITDEQRTERLFNLQK LKH D+SD+
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ
        KGVIE  LHGLPPSELSNFAINLEKRSM+LSVEEGKEI RMKALNIL NLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ

TrEMBL top hitse value%identityAlignment
A0A1S3CME9 uncharacterized protein LOC103502625 isoform X18.2e-13296.81Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDSKLNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHN+YANG
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAV GSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ
        KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEI RMKALNILSNLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ

A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X21.3e-12996.02Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDSKLNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRS+IYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHN+YANG
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAV GSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ
        KGVIE +LHGLPPSELSNFAINLEKRSMHLSVEEGKEI RMKALNILSNLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ

A0A5A7TAR2 Uncharacterized protein3.6e-11993.75Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDSKLNSGGMSSCET+L +YQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHN+YANG
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRASGGSTSCAPAFSSFLA SPMAFSPVRSSFPIFTEKPGNFLAV GSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLS---VEEGKE
        KGVIE +LHGLPPSELSNFAINLEKRSMHL     +EG E
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLS---VEEGKE

A0A6J1G2G3 uncharacterized protein LOC1114502152.3e-11890.44Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDSKLNSGGM SCETHLSMY+SKQSPIA KKVALRDVQNDNRSV+YNYPETSCSLGGKL+NGSK+SGSKRSNPTCSPSSAIHQSFKGIGVNEH IYANG
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRA GGSTSCAPAFSS LAASPMAFSPVRSS PIFTEKPGNFLAVAGSNLLGI PGLEIL S DSNGITDEQRTERLFNLQ LLKH D+SDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ
        KG IE+ LHGLPPSELS  AINLEK+SMHLSVEEGKEI RMKALNIL NLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ

A0A6J1KE02 uncharacterized protein LOC1114940102.6e-11788.84Show/hide
Query:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG
        MIDSKLN+GGM SCETHLSMY+SKQSPIA KKVALRDVQNDNRSV+YNYPETSCSLGGKL+NGSK+SGSKRSNPTCSP+SAIHQSFKGIGVNEH IYANG
Subjt:  MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANG

Query:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ
        EVDVKPGKKRA GGSTSCAP FSS LAASP+AFSPVRSS PIFTEKPGNFLAV GSNLLGIPPGLEIL S DSNGITDEQRTERLFNLQ LLKH D+SDQ
Subjt:  EVDVKPGKKRASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQ

Query:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ
        KG IE+ LHGLPPSELS  AINLEK+SMHLSVEEGKEI RMKALNIL NLQ
Subjt:  KGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNILSNLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G45250.1 Integral membrane protein hemolysin-III homolog6.2e-0743.08Show/hide
Query:  ERLFNLQKLLKHFDKSDQKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNIL
        ER  +LQ LL   ++SD+   ++ +L  L  +ELS  A++LEKRS+  S+EE +E+ R+ ALN+L
Subjt:  ERLFNLQKLLKHFDKSDQKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNIL

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1)1.1e-0643.08Show/hide
Query:  ERLFNLQKLLKHFDKSDQKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNIL
        ER  +LQ LL   ++SD+   ++ +L  L  +ELS  A++LEKRS+  S+EE +E+ R+ ALN+L
Subjt:  ERLFNLQKLLKHFDKSDQKGVIEIVLHGLPPSELSNFAINLEKRSMHLSVEEGKEIHRMKALNIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGATAGCAAGTTGAATAGTGGTGGAATGAGCAGCTGCGAAACTCATTTGTCTATGTATCAGAGCAAACAGTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGA
TGTGCAAAATGATAATAGGAGCGTCATATATAACTATCCTGAAACTTCCTGTTCTTTGGGCGGAAAACTTATGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACC
CTACATGCTCACCAAGCTCTGCAATCCATCAATCCTTCAAAGGAATTGGGGTAAATGAGCACAACATTTATGCCAATGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGA
GCATCAGGAGGTAGCACATCTTGTGCACCTGCATTTTCTTCTTTTCTTGCAGCCTCTCCAATGGCATTTTCACCTGTTAGGTCTTCATTTCCCATTTTCACAGAAAAGCC
TGGTAATTTTCTGGCTGTTGCTGGATCCAATCTTCTGGGAATCCCTCCTGGTTTGGAGATTCTTCGATCTGATGATTCAAACGGGATTACTGATGAGCAGAGAACAGAGC
GTTTATTCAATCTACAGAAGCTCCTAAAACATTTTGACAAGTCGGATCAAAAAGGGGTCATTGAGATAGTGCTCCATGGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCC
ATTAATCTGGAAAAGAGATCCATGCATCTGTCAGTGGAGGAAGGGAAAGAGATCCATCGGATGAAGGCTCTGAATATTCTGAGCAACCTTCAGTGA
mRNA sequenceShow/hide mRNA sequence
TAATTTTCAAAAAATTACAAAATTGAAAAAAACAAAACCCATCTTCTTCGAGCCAAACTTTTCCCTTTTGAACTCGTCTCTCCTTCTCCCTCTCCCTCTCCCTCTCGCCG
GTTCCACAAAAACAAGCTTCATTCTCAGATGGACTTTTCTAGGAGATAATTTGAAATTGTTATTTGGTTGAAGCTCTTCAAGTTTTCTAAATGATTGATAGCAAGTTGAA
TAGTGGTGGAATGAGCAGCTGCGAAACTCATTTGTCTATGTATCAGAGCAAACAGTCACCTATTGCACAGAAAAAGGTTGCTTTAAGGGATGTGCAAAATGATAATAGGA
GCGTCATATATAACTATCCTGAAACTTCCTGTTCTTTGGGCGGAAAACTTATGAATGGGAGTAAGCTTTCAGGAAGTAAGAGATCCAACCCTACATGCTCACCAAGCTCT
GCAATCCATCAATCCTTCAAAGGAATTGGGGTAAATGAGCACAACATTTATGCCAATGGAGAAGTTGATGTGAAGCCTGGCAAAAAAAGAGCATCAGGAGGTAGCACATC
TTGTGCACCTGCATTTTCTTCTTTTCTTGCAGCCTCTCCAATGGCATTTTCACCTGTTAGGTCTTCATTTCCCATTTTCACAGAAAAGCCTGGTAATTTTCTGGCTGTTG
CTGGATCCAATCTTCTGGGAATCCCTCCTGGTTTGGAGATTCTTCGATCTGATGATTCAAACGGGATTACTGATGAGCAGAGAACAGAGCGTTTATTCAATCTACAGAAG
CTCCTAAAACATTTTGACAAGTCGGATCAAAAAGGGGTCATTGAGATAGTGCTCCATGGTTTGCCTCCCTCTGAGCTCAGCAATTTTGCCATTAATCTGGAAAAGAGATC
CATGCATCTGTCAGTGGAGGAAGGGAAAGAGATCCATCGGATGAAGGCTCTGAATATTCTGAGCAACCTTCAGTGAAATCTGAAAGAAGTTTTGTTACACTAATGAATAA
CGATTTTGGAGACGAACGAGATGAGAGGGCCATTTCCCATTTCTGCCATAATTGCACGAGTGGACAAACTAGTCGATTGGTTGATGAAACAGTCTGCCGTGATTGAGCAA
ACCAGTATTCCTTGAGAGCTGGCAAGGTAGTCTTTCTATGGAGCTTTAGGTTATGGAGTTTGATTTGGGCAACATGAAGTTAAAAAGTGTTGCTGCTCCTGTCAACATGA
AACTCCTCCGGGAGTTTCATTTGGATCAGTGGGGAAAAAAAACCTAAGAATCCAATTAACTTTGAAGTTCGAACAATTTGGAAGGGCTTTTGAAAGTTTCTTTTTTCCTT
TTTAGAAAAGAAGAAAATAGTTTGTTACGTGTTGAGTTGATTACACTTTGGAATCTAACATTCAAATCAGTCCAATTACCATGTAATTGTTGTCTTAATCATTAACGTCC
TTTTGGAAGTTTTTAACCGATATTTCCCCCCGCC
Protein sequenceShow/hide protein sequence
MIDSKLNSGGMSSCETHLSMYQSKQSPIAQKKVALRDVQNDNRSVIYNYPETSCSLGGKLMNGSKLSGSKRSNPTCSPSSAIHQSFKGIGVNEHNIYANGEVDVKPGKKR
ASGGSTSCAPAFSSFLAASPMAFSPVRSSFPIFTEKPGNFLAVAGSNLLGIPPGLEILRSDDSNGITDEQRTERLFNLQKLLKHFDKSDQKGVIEIVLHGLPPSELSNFA
INLEKRSMHLSVEEGKEIHRMKALNILSNLQ