CuGenDBv2

Cmc01g0020021 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0020021
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: CMiso1.1chr01:18046277..18046786
RNA-Seq Expression: Cmc01g0020021
Synteny: Cmc01g0020021
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0071897 - DNA biosynthetic process (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032842.1 hypothetical protein E6C27_scaffold1987G00140 [Cucumis melo var. makuwa] (e value: 1.5e-83, % identity: 95.73)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSML+VNFDMKDLGEADVIL IKITR ENGISL QSHYIEKILKKYNYFDSKPA
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYA DCTR DIAY VGLLCRFTSRPSLEHWNAIE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

KAA0058878.1 putative polyprotein [Cucumis melo var. makuwa] (e value: 1.6e-77, % identity: 89.02)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        MSKGFKVNESDKC+YYK +GRLCIIICLYVDDML F SNLHVINDVKSMLSVNFDMKDLGEAD+IL IKITR +NGISL QS+YIEKILKKYNYFDSKPA
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLFKNTGD+VNQSEY SIIGSLRY  DCTR DIAY VGLLCRFTSRP LEHWNAIE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

KAA0060459.1 hypothetical protein E6C27_scaffold22G002870 [Cucumis melo var. makuwa] (e value: 3.9e-79, % identity: 92.07)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        MSKGFKVNESDKCIYYK EGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVIL IKITR ENGI L QSHYIEK LKKYNYFDSKP 
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLFKNTGDSVNQ+EYASIIGSLRYA D TR  IAY VGLLCRFTSRPSLEHWNAIE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

TYK00058.1 hypothetical protein E5676_scaffold596G00040 [Cucumis melo var. makuwa] (e value: 1.7e-79, % identity: 90.85)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        MSKGFKVNESD+CIYYK EGR+CIIICLYVDDMLIFGSNLH+INDVKSMLS NFDMKDLGEADVIL IKI R ENGISL QSHYIEKILKKYNYF+SKPA
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYA DCTR DIAY +GLLCRFTSR SLEHWNAIE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

TYK06419.1 hypothetical protein E5676_scaffold163G001210 [Cucumis melo var. makuwa] (e value: 7.3e-78, % identity: 89.63)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        +SKGFKVNESDKCIYYK EGRLCIIICLYVDD+LIFGSNL+VINDVKSMLS NFDMK+L EADVIL IKITR +NGISL QSHYIEKILKKYNYFDSKP 
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLF+NTGDSVNQSEYASIIGSLRYA DCTR DIAY VGLLCRFTSRPSLEHWN IE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SNZ1 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 7.3e-84, % identity: 95.73)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSML+VNFDMKDLGEADVIL IKITR ENGISL QSHYIEKILKKYNYFDSKPA
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYA DCTR DIAY VGLLCRFTSRPSLEHWNAIE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

A0A5D3BLQ7 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 8.4e-80, % identity: 90.85)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        MSKGFKVNESD+CIYYK EGR+CIIICLYVDDMLIFGSNLH+INDVKSMLS NFDMKDLGEADVIL IKI R ENGISL QSHYIEKILKKYNYF+SKPA
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYA DCTR DIAY +GLLCRFTSR SLEHWNAIE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

A0A5D3C7T6 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 3.5e-78, % identity: 89.63)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        +SKGFKVNESDKCIYYK EGRLCIIICLYVDD+LIFGSNL+VINDVKSMLS NFDMK+L EADVIL IKITR +NGISL QSHYIEKILKKYNYFDSKP 
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLF+NTGDSVNQSEYASIIGSLRYA DCTR DIAY VGLLCRFTSRPSLEHWN IE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

A0A5D3D4T4 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 1.9e-79, % identity: 92.07)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        MSKGFKVNESDKCIYYK EGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVIL IKITR ENGI L QSHYIEK LKKYNYFDSKP 
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLFKNTGDSVNQ+EYASIIGSLRYA D TR  IAY VGLLCRFTSRPSLEHWNAIE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

A0A5D3DJH9 Putative polyprotein (e value: 7.8e-78, % identity: 89.02)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        MSKGFKVNESDKC+YYK +GRLCIIICLYVDDML F SNLHVINDVKSMLSVNFDMKDLGEAD+IL IKITR +NGISL QS+YIEKILKKYNYFDSKPA
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        CTPYDSSVKLFKNTGD+VNQSEY SIIGSLRY  DCTR DIAY VGLLCRFTSRP LEHWNAIE
Subjt:  CTPYDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 4.8e-16, % identity: 34.16)
Query:  VNES-DKCIYYKIEGRL--CIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTP
        VN S D+CIY   +G +   I + LYVDD++I   ++  +N+ K  L   F M DL E    + I+I   E+ I L QS Y++KIL K+N  +     TP
Subjt:  VNES-DKCIYYKIEGRL--CIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTP

Query:  YDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
          S +       D    +   S+IG L Y   CTR D+   V +L R++S+ + E W  ++
Subjt:  YDSSVKLFKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.9e-26, % identity: 41.82)
Query:  SDKCIYYK-IEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLY--QSHYIEKILKKYNYFDSKPACTPYDS
        SD C+Y+K       II+ LYVDDMLI G +  +I  +K  LS +FDMKDLG A  IL +KI R      L+  Q  YIE++L+++N  ++KP  TP   
Subjt:  SDKCIYYK-IEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLY--QSHYIEKILKKYNYFDSKPACTPYDS

Query:  SVKLFKNTGDSVNQSE-------YASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
         +KL K    +  + +       Y+S +GSL YA  CTR DIA+ VG++ RF   P  EHW A++
Subjt:  SVKLFKNTGDSVNQSE-------YASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

P92519 Uncharacterized mitochondrial protein AtMg00810 (e value: 6.3e-16, % identity: 35.25)
Query:  ICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTPYDSSVKLFKNTGDSVNQSEYAS
        + LYVDD+L+ GS+  ++N +   LS  F MKDLG     L I+I    +G+ L Q+ Y E+IL      D KP  TP    +    +T    + S++ S
Subjt:  ICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTPYDSSVKLFKNTGDSVNQSEYAS

Query:  IIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        I+G+L+Y T  TR DI+Y V ++C+    P+L  ++ ++
Subjt:  IIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.4e-17, % identity: 33.95)
Query:  GFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTP
        GF  + SD  ++    G+  + + +YVDD+LI G++  ++++    LS  F +KD  E    L I+  R+  G+ L Q  YI  +L + N   +KP  TP
Subjt:  GFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTP

Query:  YDSSVKLFKNTGDSV-NQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
           S KL   +G  + + +EY  I+GSL+Y    TR DI+Y V  L +F   P+ EH  A++
Subjt:  YDSSVKLFKNTGDSV-NQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 6.7e-18, % identity: 32.12)
Query:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA
        ++ GF  + SD  ++    GR  I + +YVDD+LI G++  ++      LS  F +K+  +    L I+  R+  G+ L Q  Y   +L + N   +KP 
Subjt:  MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPA

Query:  CTPYDSSVKLFKNTGDSV-NQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
         TP  +S KL  ++G  + + +EY  I+GSL+Y    TR D++Y V  L ++   P+ +HWNA++
Subjt:  CTPYDSSVKLFKNTGDSV-NQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 6.2e-19, % identity: 33.54)
Query:  GFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTP
        GF  + SD   + KI   L + + +YVDD++I  +N   ++++KS L   F ++DLG     L ++I R   GI++ Q  Y   +L +      KP+  P
Subjt:  GFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTP

Query:  YDSSVKLFKNT-GDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAI
         D SV    ++ GD V+   Y  +IG L Y    TRLDI++ V  L +F+  P L H  A+
Subjt:  YDSSVKLFKNT-GDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAI

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 4.5e-17, % identity: 35.25)
Query:  ICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTPYDSSVKLFKNTGDSVNQSEYAS
        + LYVDD+L+ GS+  ++N +   LS  F MKDLG     L I+I    +G+ L Q+ Y E+IL      D KP  TP    +    +T    + S++ S
Subjt:  ICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTPYDSSVKLFKNTGDSVNQSEYAS

Query:  IIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE
        I+G+L+Y T  TR DI+Y V ++C+    P+L  ++ ++
Subjt:  IIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIE


Sequences
CDS sequence
ATGTCAAAAGGATTCAAAGTAAATGAAAGTGACAAATGTATCTACTATAAGATTGAAGGTAGGCTATGTATTATCATATGCCTATACGTAGATGACATGTTAATCTTTGG
ATCAAACTTGCACGTCATAAATGATGTAAAATCTATGTTGAGTGTAAATTTTGACATGAAAGACCTAGGTGAAGCTGATGTAATCTTAGACATCAAAATTACAAGAATTG
AAAATGGAATTTCTTTATATCAATCTCATTACATAGAAAAGATTCTAAAGAAGTACAACTACTTCGATAGTAAACCAGCTTGTACACCTTATGACTCTAGTGTAAAACTA
TTTAAGAACACTGGTGACAGTGTTAACCAATCCGAGTACGCTAGTATCATAGGTAGTTTAAGGTATGCTACTGATTGCACTAGACTAGACATAGCTTACGTCGTAGGATT
ATTATGTAGGTTTACCAGCAGACCCAGTCTAGAACATTGGAATGCGATAGAGAATAATGAGATACCTTAA
mRNA sequence
ATGTCAAAAGGATTCAAAGTAAATGAAAGTGACAAATGTATCTACTATAAGATTGAAGGTAGGCTATGTATTATCATATGCCTATACGTAGATGACATGTTAATCTTTGG
ATCAAACTTGCACGTCATAAATGATGTAAAATCTATGTTGAGTGTAAATTTTGACATGAAAGACCTAGGTGAAGCTGATGTAATCTTAGACATCAAAATTACAAGAATTG
AAAATGGAATTTCTTTATATCAATCTCATTACATAGAAAAGATTCTAAAGAAGTACAACTACTTCGATAGTAAACCAGCTTGTACACCTTATGACTCTAGTGTAAAACTA
TTTAAGAACACTGGTGACAGTGTTAACCAATCCGAGTACGCTAGTATCATAGGTAGTTTAAGGTATGCTACTGATTGCACTAGACTAGACATAGCTTACGTCGTAGGATT
ATTATGTAGGTTTACCAGCAGACCCAGTCTAGAACATTGGAATGCGATAGAGAATAATGAGATACCTTAA
Protein sequence
MSKGFKVNESDKCIYYKIEGRLCIIICLYVDDMLIFGSNLHVINDVKSMLSVNFDMKDLGEADVILDIKITRIENGISLYQSHYIEKILKKYNYFDSKPACTPYDSSVKL
FKNTGDSVNQSEYASIIGSLRYATDCTRLDIAYVVGLLCRFTSRPSLEHWNAIENNEIP
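
Consistency check (illustrative sketch, not part of the CuGenDB record): the genome location, CDS, and protein listed above can be cross-checked with a few lines of Python. The helper names `translate` and `span_length` are hypothetical, the standard genetic code (NCBI translation table 1) is assumed, and the coordinates are assumed to be 1-based and inclusive.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; a single trailing stop codon is dropped."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in pasted sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


def span_length(location: str) -> int:
    """Length in bp of a 'seqid:start..end' location (assumed 1-based, inclusive)."""
    coords = location.rsplit(":", 1)[1]
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


# First and last 21 nt of the CDS shown above.
print(translate("ATGTCAAAAGGATTCAAAGTA"))
print(translate("GAGAATAATGAGATACCTTAA"))
print(span_length("CMiso1.1chr01:18046277..18046786"))
```

The computed span is 510 bp, the same as the length of the CDS listed above, and 510 / 3 = 170 codons gives 169 residues plus the stop codon, which agrees with the protein sequence shown.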