CuGenDBv2

Cmc03g0067331 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0067331
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr03:10060616..10061089
RNA-Seq Expression: Cmc03g0067331
Synteny: Cmc03g0067331
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0042119.1 pol protein [Cucumis melo var. makuwa]; e value: 2.4e-75; % identity: 91.72
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGLTNAPAVF+DLMN+VFREFL+TFVIVFIDDILIYSKTE EHEEHLRMVL+TLR NKLYAKFSKC FWLKQVSFLGH+VSK G S+DPAKIEAVTSW
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
         RPSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

KAA0046185.1 pol protein [Cucumis melo var. makuwa]; e value: 2.4e-75; % identity: 91.72
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGLTNAPAVF+DLMN+VFREFL+TFVIVFIDDILIYSKTE EHEEHLRMVL+TLR NKLYAKFSKC FWLKQVSFLGHMVSK G S+DPAKIEAVT W
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
         RPSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

KAA0056200.1 pol protein [Cucumis melo var. makuwa]; e value: 3.1e-75; % identity: 90.45
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGLTNAPAVF+DLMN+VFREFL+TFVIVFIDDILIYSK E EHEEHLRMVL+TLR NKLYAKFSKC FWLKQVSFLGH+VSK G S+DPAKIEAVT W
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
         RPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAP+VWSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

KAA0065613.1 pol protein [Cucumis melo var. makuwa]; e value: 3.1e-75; % identity: 91.08
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGL NAPAVF+DLMNKVFREFL TF+IVFIDDILIYSKTE EHEEHLRMVLETLRANKLYAKFSKC FWLKQVSFLGH+VSK G S+DP KIEAVTSW
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
        P+PSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPF WSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

TYK07181.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 1.8e-75; % identity: 91.08
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGL NAPAVF+DLMNKVFREFL+TF+IVFIDDILIYSKTE EHEEHLRMVLETLRANKLYAKFSKC FWLKQVSFLGH+VSK G S+DP KIEAVTSW
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
        P+PSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPF WSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TLA3 Pol protein; e value: 1.2e-75; % identity: 91.72
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGLTNAPAVF+DLMN+VFREFL+TFVIVFIDDILIYSKTE EHEEHLRMVL+TLR NKLYAKFSKC FWLKQVSFLGH+VSK G S+DPAKIEAVTSW
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
         RPSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

A0A5A7TXM6 Reverse transcriptase; e value: 1.2e-75; % identity: 91.72
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGLTNAPAVF+DLMN+VFREFL+TFVIVFIDDILIYSKTE EHEEHLRMVL+TLR NKLYAKFSKC FWLKQVSFLGHMVSK G S+DPAKIEAVT W
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
         RPSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

A0A5A7UM36 Reverse transcriptase; e value: 1.5e-75; % identity: 90.45
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGLTNAPAVF+DLMN+VFREFL+TFVIVFIDDILIYSK E EHEEHLRMVL+TLR NKLYAKFSKC FWLKQVSFLGH+VSK G S+DPAKIEAVT W
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
         RPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAP+VWSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

A0A5A7VBQ2 Pol protein; e value: 1.5e-75; % identity: 91.08
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGL NAPAVF+DLMNKVFREFL TF+IVFIDDILIYSKTE EHEEHLRMVLETLRANKLYAKFSKC FWLKQVSFLGH+VSK G S+DP KIEAVTSW
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
        P+PSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPF WSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

A0A5D3C5J7 Ty3-gypsy retrotransposon protein; e value: 8.9e-76; % identity: 91.08
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        MSFGL NAPAVF+DLMNKVFREFL+TF+IVFIDDILIYSKTE EHEEHLRMVLETLRANKLYAKFSKC FWLKQVSFLGH+VSK G S+DP KIEAVTSW
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
        P+PSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPF WSKACEDSFQNLK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6; e value: 2.1e-26; % identity: 39.24
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        M FGL NAPA F   MN + R  L    +V++DDI+++S +  EH + L +V E L    L  +  KC F  ++ +FLGH+++  G   +P KIEA+  +
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDS-FQNLK
        P P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K      +    DS F+ LK
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDS-FQNLK

P0CT41 Transposon Tf2-12 polyprotein; e value: 1.1e-22; % identity: 34.39
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        M +G++ APA F   +N +  E  E+ V+ ++DDILI+SK+E EH +H++ VL+ L+   L    +KC F   QV F+G+ +S+ G +     I+ V  W
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
         +P    E+R FLG   Y R+F+   S +  PL  L +K   + W+     + +N+K
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK

P10401 Retrovirus-related Pol polyprotein from transposon gypsy; e value: 7.6e-24; % identity: 34.52
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        + FGL NA ++F   ++ V RE +     V++DD++I+S+ E +H  H+  VL+ L    +     K  F+ + V +LG +VSK G   DP K++A+  +
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTR-----------KGAPFVWSKACEDSFQNLK
        P P  V +VRSFLGLA YYR F+++F+ IA P+T + +           K  P  +++   ++FQ L+
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTR-----------KGAPFVWSKACEDSFQNLK

P20825 Retrovirus-related Pol polyprotein from transposon 297; e value: 3.7e-26; % identity: 41.01
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        M FGL NAPA F   MN + R  L    +V++DDI+I+S +  EH   +++V   L    L  +  KC F  K+ +FLGH+V+  G   +P K++A+ S+
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRK
        P P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRK

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus; e value: 2.6e-24; % identity: 39.13
Query:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW
        + FGL NAPA+F  +++ + RE +     V+IDDI+++S+    H ++LR+VL +L    L     K  F   QV FLG++V+  G   DP K+ A++  
Subjt:  MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTR
        P P++V E++ FLG+  YYR+F+++++ +A PLT LTR
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTR

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein; e value: 1.1e-22; % identity: 44.64
Query:  HLRMVLETLRANKLYAKFSKCVFWLKQVSFLG--HMVSKVGASMDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVW
        HL MVL+    ++ YA   KC F   Q+++LG  H++S  G S DPAK+EA+  WP P   +E+R FLGL GYYRRFV+N+  I  PLT+L +K +   W
Subjt:  HLRMVLETLRANKLYAKFSKCVFWLKQVSFLG--HMVSKVGASMDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVW

Query:  SKACEDSFQNLK
        ++    +F+ LK
Subjt:  SKACEDSFQNLK
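
The % identity reported for each hit can be recomputed from the alignment midline, the unlabeled middle row of each Query/Subjt block. The Python sketch below assumes the usual BLAST-style convention applies to this page: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `percent_identity` is hypothetical and is not part of CuGenDB.

```python
def percent_identity(midline_rows, aligned_length):
    """Return percent identity from BLAST-style midline rows.

    midline_rows: the middle row of each alignment block, as strings.
    aligned_length: total number of alignment columns, gaps included.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Only letters count as identities; '+' and ' ' do not.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return round(100.0 * identical / aligned_length, 2)


# Toy midline (not taken from the page): 8 columns, 6 identical residues.
print(percent_identity(["MSF+LT A"], 8))
```

Applied to the first GenBank hit (KAA0042119.1), the two midline rows hold 144 identical residues over 157 aligned columns, which gives 91.72 and agrees with the listed value.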


Sequences
CDS sequence
ATGTCCTTTGGGTTGACGAATGCTCCAGCAGTGTTCATAGATTTGATGAACAAAGTGTTTAGGGAGTTCCTAGAAACTTTTGTGATTGTGTTCATTGACGATATTTTGAT
ATATTCCAAGACAGAGGTCGAGCATGAGGAGCATTTACGTATGGTTCTAGAAACCCTTCGAGCTAATAAACTGTATGCAAAGTTCTCAAAATGTGTGTTTTGGTTGAAGC
AGGTATCCTTTCTAGGCCATATGGTTTCTAAAGTTGGTGCTTCTATGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCCGGTTATTATCGACGGTTTGTGGAGAACTTTTCTTGTATAGCTACTCCTCTTACTCAATTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAA
GGCCTGTGAGGACAGTTTTCAGAACCTTAAATAG
mRNA sequence
ATGTCCTTTGGGTTGACGAATGCTCCAGCAGTGTTCATAGATTTGATGAACAAAGTGTTTAGGGAGTTCCTAGAAACTTTTGTGATTGTGTTCATTGACGATATTTTGAT
ATATTCCAAGACAGAGGTCGAGCATGAGGAGCATTTACGTATGGTTCTAGAAACCCTTCGAGCTAATAAACTGTATGCAAAGTTCTCAAAATGTGTGTTTTGGTTGAAGC
AGGTATCCTTTCTAGGCCATATGGTTTCTAAAGTTGGTGCTTCTATGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCCGGTTATTATCGACGGTTTGTGGAGAACTTTTCTTGTATAGCTACTCCTCTTACTCAATTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAA
GGCCTGTGAGGACAGTTTTCAGAACCTTAAATAG
Protein sequence
MSFGLTNAPAVFIDLMNKVFREFLETFVIVFIDDILIYSKTEVEHEEHLRMVLETLRANKLYAKFSKCVFWLKQVSFLGHMVSKVGASMDPAKIEAVTSWPRPSTVSEVR
SFLGLAGYYRRFVENFSCIATPLTQLTRKGAPFVWSKACEDSFQNLK
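
The protein sequence should correspond to a translation of the CDS. The Python sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1) applies. The `translate` helper is hypothetical, and for brevity only the first 30 and last 15 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # a stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above (both in frame).
cds_start = "ATGTCCTTTGGGTTGACGAATGCTCCAGCA"
cds_end = "CAGAACCTTAAATAG"
print(translate(cds_start))  # MSFGLTNAPA
print(translate(cds_end))    # QNLK
```

Both fragments match the start (MSFGLTNAPA) and end (QNLK) of the listed protein. The full CDS is 474 nt, which equals the span of the genome location (10060616..10061089) and encodes the 157 residues plus a stop codon.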