; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0064121 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0064121
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:4674999..4675673
RNA-Seq ExpressionCmc03g0064121
SyntenyCmc03g0064121
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045284.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-10588.26Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLR+RDSDIPK AFR RYG+Y FI+MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
        APAVFMDLMN VFKDFLDSF+IVFIDDILIYSKTEA+HEEHL Q LETLR+N+LYAKFSKCEFWLKKV+FLGHVVSSEG SVDPAKIEAVTNWPR   +A
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

Query:  RFVVSWAWQVTIG
        RFVVSW WQVT G
Subjt:  RFVVSWAWQVTIG

KAA0059723.1 pol protein [Cucumis melo var. makuwa]1.3e-10190.5Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLRIRD DIPK AFRLRYGHY F++MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
         PAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEA+HEEHL Q LETLRAN+LYAKFSKCEFWL+KV+FLGHVVSSEG SVDPAKIEAVTNWPRPST++
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

KAA0063098.1 pol protein [Cucumis melo var. makuwa]1.6e-10191Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLRIRD DIPK AFR RYGHY F++MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
        APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEA+HEEHL Q LETLRAN+LYAKFSKCEFWL+KV+FLGHVVSSEG SVDPAKIEAVTNWPRPST++
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

KAA0063793.1 pol protein [Cucumis melo var. makuwa]2.1e-10190.5Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLRIRD DIPK AFR RYGHY F++MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
        APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEA+HEEHL Q LETLRAN+LYAKFSKCEFWL+KV+FLGHVVSSEG SVDPAKIEAVTNWPRPST++
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

TYK01306.1 pol protein [Cucumis melo var. makuwa]1.1e-10292Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLRIRDSDIPK AFRLRYGHY FI+MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
        APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEA+HEEHL Q LETLRAN+LYAKFSKCEFWL+KV+FLGHVVSSEG SVDPAKIEAVTNWPRPST++
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

TrEMBL top hitse value%identityAlignment
A0A5A7TQ85 Reverse transcriptase6.9e-10688.26Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLR+RDSDIPK AFR RYG+Y FI+MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
        APAVFMDLMN VFKDFLDSF+IVFIDDILIYSKTEA+HEEHL Q LETLR+N+LYAKFSKCEFWLKKV+FLGHVVSSEG SVDPAKIEAVTNWPR   +A
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

Query:  RFVVSWAWQVTIG
        RFVVSW WQVT G
Subjt:  RFVVSWAWQVTIG

A0A5A7UUX8 Reverse transcriptase6.1e-10290.5Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLRIRD DIPK AFRLRYGHY F++MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
         PAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEA+HEEHL Q LETLRAN+LYAKFSKCEFWL+KV+FLGHVVSSEG SVDPAKIEAVTNWPRPST++
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

A0A5A7V646 Reverse transcriptase7.9e-10291Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLRIRD DIPK AFR RYGHY F++MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
        APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEA+HEEHL Q LETLRAN+LYAKFSKCEFWL+KV+FLGHVVSSEG SVDPAKIEAVTNWPRPST++
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

A0A5A7V6R2 Reverse transcriptase1.0e-10190.5Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLRIRD DIPK AFR RYGHY F++MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
        APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEA+HEEHL Q LETLRAN+LYAKFSKCEFWL+KV+FLGHVVSSEG SVDPAKIEAVTNWPRPST++
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

A0A5D3BSV9 Reverse transcriptase5.5e-10392Show/hide
Query:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN
        +RPSVSPWGA VLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLR GYHQLRIRDSDIPK AFRLRYGHY FI+MSFGLTN
Subjt:  LRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTN

Query:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA
        APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEA+HEEHL Q LETLRAN+LYAKFSKCEFWL+KV+FLGHVVSSEG SVDPAKIEAVTNWPRPST++
Subjt:  APAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIA

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein9.2e-3937.56Show/hide
Query:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT
        I+R S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+  YH +R+R  D  K+AFR   G + +++M +G++
Subjt:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT

Query:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP
         APA F   +N +  +  +S V+ ++DDILI+SK+E++H +H++  L+ L+   L    +KCEF   +V F+G+ +S +G +     I+ V  W +P
Subjt:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP

P0CT35 Transposon Tf2-2 polyprotein9.2e-3937.56Show/hide
Query:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT
        I+R S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+  YH +R+R  D  K+AFR   G + +++M +G++
Subjt:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT

Query:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP
         APA F   +N +  +  +S V+ ++DDILI+SK+E++H +H++  L+ L+   L    +KCEF   +V F+G+ +S +G +     I+ V  W +P
Subjt:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP

P0CT36 Transposon Tf2-3 polyprotein9.2e-3937.56Show/hide
Query:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT
        I+R S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+  YH +R+R  D  K+AFR   G + +++M +G++
Subjt:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT

Query:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP
         APA F   +N +  +  +S V+ ++DDILI+SK+E++H +H++  L+ L+   L    +KCEF   +V F+G+ +S +G +     I+ V  W +P
Subjt:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP

P0CT37 Transposon Tf2-4 polyprotein9.2e-3937.56Show/hide
Query:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT
        I+R S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+  YH +R+R  D  K+AFR   G + +++M +G++
Subjt:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT

Query:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP
         APA F   +N +  +  +S V+ ++DDILI+SK+E++H +H++  L+ L+   L    +KCEF   +V F+G+ +S +G +     I+ V  W +P
Subjt:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP

P0CT41 Transposon Tf2-12 polyprotein9.2e-3937.56Show/hide
Query:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT
        I+R S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+  YH +R+R  D  K+AFR   G + +++M +G++
Subjt:  ILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLT

Query:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP
         APA F   +N +  +  +S V+ ++DDILI+SK+E++H +H++  L+ L+   L    +KCEF   +V F+G+ +S +G +     I+ V  W +P
Subjt:  NAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRP

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein8.6e-0841.38Show/hide
Query:  HLRQALETLRANRLYAKFSKCEFWLKKVSFLG--HVVSSEGGSVDPAKIEAVTNWPRP
        HL   L+    ++ YA   KC F   ++++LG  H++S EG S DPAK+EA+  WP P
Subjt:  HLRQALETLRANRLYAKFSKCEFWLKKVSFLG--HVVSSEGGSVDPAKIEAVTNWPRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATATTTTACGACCCAGTGTGTCACCTTGGGGAGCCTCAGTGTTGTTTGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACCGAGAGCTGAACAAGGT
GACAATTAAGAACCGCTACCCCTTGCCCAGGATTGATGACTTGTTCGATCAGTTGCAGGGAGCCACTGTCTTTTCTAAGATCGACCTGCGACCAGGCTACCACCAGTTGA
GGATCAGGGATAGTGACATTCCTAAGATGGCTTTTCGTTTGAGATACGGGCATTACGGGTTCATTATGATGTCTTTTGGTTTGACTAATGCCCCTGCGGTATTCATGGAC
TTGATGAACAGGGTGTTTAAGGACTTCCTAGACTCGTTCGTCATAGTTTTCATTGACGACATTTTGATTTACTCCAAGACTGAGGCTAAGCATGAGGAGCATTTACGCCA
AGCTTTGGAGACTCTTCGAGCCAATAGACTGTATGCCAAGTTCTCCAAGTGCGAATTTTGGCTGAAGAAGGTATCTTTCCTTGGACATGTGGTGTCCAGCGAGGGAGGTT
CTGTGGATCCAGCAAAGATCGAAGCGGTGACCAACTGGCCTCGACCGTCTACTATTGCGAGATTCGTAGTTTCCTGGGCTTGGCAGGTTACTATAGGAGGTTCGTGGAAG
ACTTCTCACATATAG
mRNA sequenceShow/hide mRNA sequence
ATGTATATTTTACGACCCAGTGTGTCACCTTGGGGAGCCTCAGTGTTGTTTGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACCGAGAGCTGAACAAGGT
GACAATTAAGAACCGCTACCCCTTGCCCAGGATTGATGACTTGTTCGATCAGTTGCAGGGAGCCACTGTCTTTTCTAAGATCGACCTGCGACCAGGCTACCACCAGTTGA
GGATCAGGGATAGTGACATTCCTAAGATGGCTTTTCGTTTGAGATACGGGCATTACGGGTTCATTATGATGTCTTTTGGTTTGACTAATGCCCCTGCGGTATTCATGGAC
TTGATGAACAGGGTGTTTAAGGACTTCCTAGACTCGTTCGTCATAGTTTTCATTGACGACATTTTGATTTACTCCAAGACTGAGGCTAAGCATGAGGAGCATTTACGCCA
AGCTTTGGAGACTCTTCGAGCCAATAGACTGTATGCCAAGTTCTCCAAGTGCGAATTTTGGCTGAAGAAGGTATCTTTCCTTGGACATGTGGTGTCCAGCGAGGGAGGTT
CTGTGGATCCAGCAAAGATCGAAGCGGTGACCAACTGGCCTCGACCGTCTACTATTGCGAGATTCGTAGTTTCCTGGGCTTGGCAGGTTACTATAGGAGGTTCGTGGAAG
ACTTCTCACATATAG
Protein sequenceShow/hide protein sequence
MYILRPSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPRIDDLFDQLQGATVFSKIDLRPGYHQLRIRDSDIPKMAFRLRYGHYGFIMMSFGLTNAPAVFMD
LMNRVFKDFLDSFVIVFIDDILIYSKTEAKHEEHLRQALETLRANRLYAKFSKCEFWLKKVSFLGHVVSSEGGSVDPAKIEAVTNWPRPSTIARFVVSWAWQVTIGGSWK
TSHI