CuGenDBv2

MELO3C032140 (gene) of Melon (DHL92) v4 genome

Gene ID: MELO3C032140
Organism: Cucumis melo DHL92 (Melon (DHL92) v4)
Description: CACTA en-spm transposon protein
Genome location: chr06:22022239..22024274
RNA-Seq Expression: MELO3C032140
Synteny: MELO3C032140
Gene Ontology terms: NA
InterPro domains: NA
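The genome location above uses a `chrom:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the field can be split and the gene span computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr06:22022239..22024274")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr06 22022239 22024274 2036
```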


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033296.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa] | e value: 9.6e-114 | % identity: 80.78
Query:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR
        GSSSQQ T TP+RRAQSR LELE HV INGRI MTIA GAEKPISPHA                     WADVGREYIEVVK DLQRLFVLDFNDQ MNR
Subjt:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR

Query:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAED
        FVE QMLT FKEFRADCH++FKKYSDPEEARANPPNALEQS TNK ARQKQPYNHSS  KSFLQRQYELAER+G+        RET VRAGTFVSQ AED
Subjt:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAED

Query:  AHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER
        AHNQ+LELQSQ TPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQST+KEIELQAKLHEALER
Subjt:  AHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER

KAA0041316.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa] | e value: 2.5e-114 | % identity: 80.14
Query:  MGSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMN
        +GSSSQQ T TP+RRAQSR LELERHV INGRIPMTIA GAEKPISPHA                     W DVGREYIEVVK DLQRLFVLDFNDQ MN
Subjt:  MGSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMN

Query:  RFVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAE
        RFVE +MLT FKEFRADCH++FKKYSDPEEARANPPNALEQS TNKAARQKQPYNHSS  KSFLQRQYEL ER+G+        +ETHVRAGTFVSQ AE
Subjt:  RFVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAE

Query:  DAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER
        DAHNQMLELQSQ TPEGSQPLSEDEICDQVLGRRP YSKG GWGPKPKARRTASASSSSTSCSQST+KEIELQAKLHEALER
Subjt:  DAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER

KAA0042203.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa] | e value: 2.8e-113 | % identity: 86.92
Query:  SSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHAWA-DVGREYIEVVKSDLQRLFVLDFNDQEMNRFVERQMLTIFKEFRADCHKYF
        SSSQQ T TP+RRAQSR LELERHV INGRIPMTIA GAEKPISPHA       EYIEVVK DLQRLFVLDFNDQ MNRFVE QMLT FKEF+ADCH++F
Subjt:  SSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHAWA-DVGREYIEVVKSDLQRLFVLDFNDQEMNRFVERQMLTIFKEFRADCHKYF

Query:  KKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLS
        KKYSDPEEARANPPNALEQS TNKAARQKQPYNHSS  KSFLQRQYELAERKGE        RETHVRAGTFVSQ AEDAHNQMLELQSQSTPEGSQPLS
Subjt:  KKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLS

Query:  EDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER
        EDEICDQVLGRRP YSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKL EALER
Subjt:  EDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER

TYK19836.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa] | e value: 8.1e-113 | % identity: 76.74
Query:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR
        GSSSQQ T TP+RRAQSR LELERHV INGRIPMTIA GAEKPISPHA                     W DVGREYIEVVK DLQR FVLDFNDQ MNR
Subjt:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR

Query:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNAL--------------------EQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE-----
        FVE QMLT FKEFRADCHK+FKKYSDPEEARANPPNAL                    EQS TNKAARQKQPYNHSS  KSFLQRQYELAER+G+     
Subjt:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNAL--------------------EQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE-----

Query:  ---RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE
           RETHVRAGTFVSQ AEDAHNQMLELQSQ TPEGSQPLSEDEICDQVLGRRP YSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE
Subjt:  ---RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE

Query:  R
        R
Subjt:  R

TYK21492.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa] | e value: 1.1e-112 | % identity: 77.08
Query:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR
        GSSSQQ T TP+RRAQSR LELERHV INGRIPMTIA GAEKPISPHA                     W DVGREYIEVVK DLQRLFVLDFNDQ MNR
Subjt:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR

Query:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNAL--------------------EQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE-----
        FVE QMLT FKEFRADCH++FKKYSDPEEARANPPNAL                    EQS TNKAARQKQPYNHSS  KSFLQRQYELAERKGE     
Subjt:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNAL--------------------EQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE-----

Query:  ---RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE
           RETHVRAGTFVSQ AEDAHNQMLELQSQ TPEGSQPLSEDEICDQVLGRRP YSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKL EALE
Subjt:  ---RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE

Query:  R
        R
Subjt:  R

TrEMBL top hits (e value, % identity, alignment)
A0A5A7STH7 CACTA en-spm transposon protein | e value: 4.7e-114 | % identity: 80.78
Query:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR
        GSSSQQ T TP+RRAQSR LELE HV INGRI MTIA GAEKPISPHA                     WADVGREYIEVVK DLQRLFVLDFNDQ MNR
Subjt:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR

Query:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAED
        FVE QMLT FKEFRADCH++FKKYSDPEEARANPPNALEQS TNK ARQKQPYNHSS  KSFLQRQYELAER+G+        RET VRAGTFVSQ AED
Subjt:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAED

Query:  AHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER
        AHNQ+LELQSQ TPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQST+KEIELQAKLHEALER
Subjt:  AHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER

A0A5A7TG53 CACTA en-spm transposon protein | e value: 1.4e-113 | % identity: 86.92
Query:  SSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHAWA-DVGREYIEVVKSDLQRLFVLDFNDQEMNRFVERQMLTIFKEFRADCHKYF
        SSSQQ T TP+RRAQSR LELERHV INGRIPMTIA GAEKPISPHA       EYIEVVK DLQRLFVLDFNDQ MNRFVE QMLT FKEF+ADCH++F
Subjt:  SSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHAWA-DVGREYIEVVKSDLQRLFVLDFNDQEMNRFVERQMLTIFKEFRADCHKYF

Query:  KKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLS
        KKYSDPEEARANPPNALEQS TNKAARQKQPYNHSS  KSFLQRQYELAERKGE        RETHVRAGTFVSQ AEDAHNQMLELQSQSTPEGSQPLS
Subjt:  KKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLS

Query:  EDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER
        EDEICDQVLGRRP YSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKL EALER
Subjt:  EDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER

A0A5A7TGY7 CACTA en-spm transposon protein | e value: 1.2e-114 | % identity: 80.14
Query:  MGSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMN
        +GSSSQQ T TP+RRAQSR LELERHV INGRIPMTIA GAEKPISPHA                     W DVGREYIEVVK DLQRLFVLDFNDQ MN
Subjt:  MGSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMN

Query:  RFVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAE
        RFVE +MLT FKEFRADCH++FKKYSDPEEARANPPNALEQS TNKAARQKQPYNHSS  KSFLQRQYEL ER+G+        +ETHVRAGTFVSQ AE
Subjt:  RFVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE--------RETHVRAGTFVSQVAE

Query:  DAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER
        DAHNQMLELQSQ TPEGSQPLSEDEICDQVLGRRP YSKG GWGPKPKARRTASASSSSTSCSQST+KEIELQAKLHEALER
Subjt:  DAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALER

A0A5D3CIP7 CACTA en-spm transposon protein | e value: 5.1e-113 | % identity: 77.08
Query:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR
        GSSSQQ T TP+RRAQSR LELERHV INGRIPMTIA GAEKPISPHA                     W DVGREYIEVVK DLQRLFVLDFNDQ MNR
Subjt:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR

Query:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNAL--------------------EQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE-----
        FVE QMLT FKEFRADCH++FKKYSDPEEARANPPNAL                    EQS TNKAARQKQPYNHSS  KSFLQRQYELAERKGE     
Subjt:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNAL--------------------EQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE-----

Query:  ---RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE
           RETHVRAGTFVSQ AEDAHNQMLELQSQ TPEGSQPLSEDEICDQVLGRRP YSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKL EALE
Subjt:  ---RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE

Query:  R
        R
Subjt:  R

A0A5D3D8C2 CACTA en-spm transposon protein | e value: 3.9e-113 | % identity: 76.74
Query:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR
        GSSSQQ T TP+RRAQSR LELERHV INGRIPMTIA GAEKPISPHA                     W DVGREYIEVVK DLQR FVLDFNDQ MNR
Subjt:  GSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHA---------------------WADVGREYIEVVKSDLQRLFVLDFNDQEMNR

Query:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNAL--------------------EQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE-----
        FVE QMLT FKEFRADCHK+FKKYSDPEEARANPPNAL                    EQS TNKAARQKQPYNHSS  KSFLQRQYELAER+G+     
Subjt:  FVERQMLTIFKEFRADCHKYFKKYSDPEEARANPPNAL--------------------EQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGE-----

Query:  ---RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE
           RETHVRAGTFVSQ AEDAHNQMLELQSQ TPEGSQPLSEDEICDQVLGRRP YSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE
Subjt:  ---RETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKARRTASASSSSTSCSQSTEKEIELQAKLHEALE

Query:  R
        R
Subjt:  R

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGATCTTCTTCTCAACAAGAGACTTCGACTCCTAAGAGACGTGCGCAGTCTCGATTCTTGGAGTTAGAGCGCCACGTTATAATAAATGGGCGCATTCCGATGACGAT
CGCCCTTGGAGCGGAGAAGCCTATTTCTCCACACGCCTGGGCGGACGTTGGGAGAGAATACATTGAGGTCGTCAAGAGCGACCTCCAGCGATTGTTTGTGCTTGATTTCA
ATGATCAAGAAATGAACAGGTTTGTTGAGCGTCAGATGCTCACGATCTTTAAAGAGTTTCGGGCAGACTGTCATAAATATTTCAAAAAGTACAGCGACCCGGAGGAGGCT
CGTGCCAACCCACCAAACGCATTGGAGCAATCATGGACGAACAAGGCTGCTAGACAGAAGCAGCCTTACAATCATAGTAGCTGGTTCAAGTCATTTCTACAACGACAATA
TGAGCTCGCTGAGAGAAAAGGGGAGCGGGAAACACACGTTCGAGCTGGGACATTCGTGTCGCAGGTCGCCGAGGATGCGCATAATCAAATGCTAGAACTTCAATCCCAGT
CTACCCCAGAGGGTAGTCAGCCACTCTCTGAGGATGAGATATGCGATCAGGTGTTGGGTAGACGACCAGACTACTCAAAAGGCCTTGGTTGGGGACCCAAGCCGAAGGCC
CGCAGAACGGCGAGTGCAAGCAGTTCGTCGACATCTTGTTCGCAGTCCACAGAAAAAGAGATTGAATTACAAGCTAAACTTCATGAAGCTTTGGAACGGACCGGAGGTGT
AGAATGA
mRNA sequence
CACATATTTATATCTAGTTTATATATTTTGATTCTAGCTATGTGTTCGTATTTGCTTATTGAATGTTTGTTTTCTATGTCCATAGCCATTATGTCATATCGATAGTCAAA
TTTTATGGAGAAGGACGATATGTTCCTCTAGTTTGAGGACGATTTAGATAACATCGCGGGAGGGTCGTCATATGTGGGCGACAATATGGGATCTTCTTCTCAACAAGAGA
CTTCGACTCCTAAGAGACGTGCGCAGTCTCGATTCTTGGAGTTAGAGCGCCACGTTATAATAAATGGGCGCATTCCGATGACGATCGCCCTTGGAGCGGAGAAGCCTATT
TCTCCACACGCCTGGGCGGACGTTGGGAGAGAATACATTGAGGTCGTCAAGAGCGACCTCCAGCGATTGTTTGTGCTTGATTTCAATGATCAAGAAATGAACAGGTTTGT
TGAGCGTCAGATGCTCACGATCTTTAAAGAGTTTCGGGCAGACTGTCATAAATATTTCAAAAAGTACAGCGACCCGGAGGAGGCTCGTGCCAACCCACCAAACGCATTGG
AGCAATCATGGACGAACAAGGCTGCTAGACAGAAGCAGCCTTACAATCATAGTAGCTGGTTCAAGTCATTTCTACAACGACAATATGAGCTCGCTGAGAGAAAAGGGGAG
CGGGAAACACACGTTCGAGCTGGGACATTCGTGTCGCAGGTCGCCGAGGATGCGCATAATCAAATGCTAGAACTTCAATCCCAGTCTACCCCAGAGGGTAGTCAGCCACT
CTCTGAGGATGAGATATGCGATCAGGTGTTGGGTAGACGACCAGACTACTCAAAAGGCCTTGGTTGGGGACCCAAGCCGAAGGCCCGCAGAACGGCGAGTGCAAGCAGTT
CGTCGACATCTTGTTCGCAGTCCACAGAAAAAGAGATTGAATTACAAGCTAAACTTCATGAAGCTTTGGAACGGACCGGAGGTGTAGAATGACGCGCATACGCACCTCGT
TGGGAGACTTTTTCTTATGTTTATGTTTTTTCGATTTTGAGAACTATATTTATTGTAGGGAACTTATTCGTATTATTAATTTTAATTCTATTTTTTAAATTTTTGTTTCT
ATATTTTTTAATTTAATCCAAAACGTCGGGAAATTTTTTTGAGCAATCCAAACATCATTGTGAATGTTTGAGCAAAATATTATAAGGAAAAAAGTAGAAATAAAATATTA
AAAAAAATATATAAAAAGAAATTTCGGGTGTAAAGAAACGTCGGGAAAAAACGTCGGAAAAGAGGATTTCCCGACGCTGAAAGGTGCGCCGGC
Protein sequence
MGSSSQQETSTPKRRAQSRFLELERHVIINGRIPMTIALGAEKPISPHAWADVGREYIEVVKSDLQRLFVLDFNDQEMNRFVERQMLTIFKEFRADCHKYFKKYSDPEEA
RANPPNALEQSWTNKAARQKQPYNHSSWFKSFLQRQYELAERKGERETHVRAGTFVSQVAEDAHNQMLELQSQSTPEGSQPLSEDEICDQVLGRRPDYSKGLGWGPKPKA
RRTASASSSSTSCSQSTEKEIELQAKLHEALERTGGVE