; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0190401 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0190401
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr07:9481663..9482313
RNA-Seq ExpressionCmc07g0190401
SyntenyCmc07g0190401
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8473177.1 hypothetical protein CXB51_035100 [Gossypium anomalum]7.9e-4551.45Show/hide
Query:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP
        MIEKQT K+IKYL+T+N LEF  + FN  Y   GI RH  VR+TPQQN V+ER+NR IME+VRC+LSNA L + FWA+    A + + +SP +++   TP
Subjt:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP

Query:  KEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRCL
        +E WS +  N +DLK+FGC  Y H N GKL+ R+ KC+F+G+   VKG+K+W P ++K +ISRD+ F E   L
Subjt:  KEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRCL

KAG8487564.1 hypothetical protein CXB51_018489 [Gossypium anomalum]7.2e-4652.6Show/hide
Query:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP
        MIEKQT K+IKYL+T+N LEF  + FN FY   GI RH  VR+TPQQN V+ER+NR IME+VRC+LSNA L + FWA+    A + + +SP +++   TP
Subjt:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP

Query:  KEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRCL
        +E WS +  N +DLK+FGC  Y H N GKL+ R+ KC+F+G+   VKG+K+W P ++K +ISRDV F E   L
Subjt:  KEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRCL

KAG8499522.1 hypothetical protein CXB51_006046 [Gossypium anomalum]7.9e-4552.02Show/hide
Query:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP
        MIEKQT K+IKYL+T+N LEF  + FN      GI RH  VR+TPQQN V+ER+NR IME+VRC+LSNA L + FWA+    A + + +SP +++   TP
Subjt:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP

Query:  KEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRCL
        +E WS +  N +DLK+FGC  Y H N GKL+ R+ KC+F+G+   VKG+K+W P ++KF+ISRDV F E   L
Subjt:  KEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRCL

XP_038885954.1 uncharacterized protein LOC120076257 [Benincasa hispida]1.5e-4866.42Show/hide
Query:  ENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCVGYVHQNQGKLK
        ENGITRHK VRYTPQQN V ERLNR IMERVR  LS+ IL E +W +   Y +YTL + P+ SL FLT +EKW+ H PNL++LKVFGCVG+VHQN+GKLK
Subjt:  ENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCVGYVHQNQGKLK

Query:  ARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFK
        A+  KCMFVGFT+ VKG+KMWHP ++KFI+SRD+ F+
Subjt:  ARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFK

XP_038904504.1 uncharacterized protein LOC120090876 [Benincasa hispida]5.3e-4961.18Show/hide
Query:  SLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVF
        SL    E FN F  E  ITRH+ +R+T Q+N V+ERLNR IMERVRCLLS+A L EK+WA+   Y ++TL + PH SL  LT +EKW+KH PNL +L+VF
Subjt:  SLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVF

Query:  GCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKE
        GCVGY+HQ+QGKLK+R  KCMFVGF++ VKGFKMW+P +K+FI+S+DV F+E
Subjt:  GCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKE

TrEMBL top hitse value%identityAlignment
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class2.2e-4050.59Show/hide
Query:  IEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPK
        +E QT +++KYL+T+N LEF    FN F    GITRH  V YTPQQN ++ER NR IMER RCLL+NA L  KFW +    A Y + +SP  +L   TP+
Subjt:  IEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPK

Query:  EKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMW--HPTDKKFIISRDVHFKE
        E W+   P+L  L+VFGC  Y H   GKL  RA KCMF+G+ + VKG+K+W       K IISRDV F E
Subjt:  EKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMW--HPTDKKFIISRDVHFKE

A0A5D3BMV3 Copia-like retrotransposable element8.6e-4573.95Show/hide
Query:  MERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKK
        M+RVRCLLS+AILE+K W +VV Y +YTL +SPH SL FLTPKE+W+KH PNLNDLKVFGCV YVHQ+QGK KA+ATKCMFVGF K VK FK+WH  DKK
Subjt:  MERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKK

Query:  FIISRDVHFKEIRCLCKGK
        FIISRDVHFKE   + KGK
Subjt:  FIISRDVHFKEIRCLCKGK

A0A5D3DNU1 Putative gag-pol polyprotein2.2e-4050.59Show/hide
Query:  IEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPK
        +E QT +++KYL+T+N LEF    FN F    GITRH  V YTPQQN ++ER NR IMER RCLL+NA L  KFW +    A Y + +SP  +L   TP+
Subjt:  IEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPK

Query:  EKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMW--HPTDKKFIISRDVHFKE
        E W+   P+L  L+VFGC  Y H   GKL  RA KCMF+G+ + VKG+K+W       K IISRDV F E
Subjt:  EKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMW--HPTDKKFIISRDVHFKE

A0A6V7P941 Integrase catalytic domain-containing protein6.6e-4552.33Show/hide
Query:  IEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPK
        IEKQT K+IK+L+T+N LEF    FN F  + GI RH+ V YTPQQN V+ER+NR ++E+ RCLLSNA   E+FWA+    A Y + +SP  S+   TPK
Subjt:  IEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPK

Query:  EKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRCL
        E WS H P+ +DL++FGC  Y H +Q KLK R+ +C F+G+    KGF++W P DKK IISRDV F E   L
Subjt:  EKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRCL

W9RE40 Retrovirus-related Pol polyprotein from transposon TNT 1-947.5e-4150.29Show/hide
Query:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP
        MIE QT K+IK L+T+N LEF G+ FN F  +  + RHK VR TPQQN ++ER+NR ++ERVRC+L  A L ++FW +VV  A Y + + P  ++ F TP
Subjt:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP

Query:  KEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMW--HPTDKKFIISRDVHFKE
        +E W+ +  N  +LKVFGC  Y HQ +GKL  RA KCMFVG+ + VKG+K+W  +    K  ISRDV F+E
Subjt:  KEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMW--HPTDKKFIISRDVHFKE

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.0e-2437.13Show/hide
Query:  EKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISL--GFLTP
        E   + ++ YL  +N  E+       F ++ GI+ H  V +TPQ N VSER+ R I E+ R ++S A L++ FW + V+ A Y + + P  +L     TP
Subjt:  EKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISL--GFLTP

Query:  KEKWSKHRPNLNDLKVFGCVGYVH--QNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDV
         E W   +P L  L+VFG   YVH    QGK   ++ K +FVG+     GFK+W   ++KFI++RDV
Subjt:  KEKWSKHRPNLNDLKVFGCVGYVH--QNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-2836.05Show/hide
Query:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP
        ++E++T +++K L+++N  E+    F  +   +GI   K V  TPQ N V+ER+NR I+E+VR +L  A L + FW + V  A Y + +SP + L F  P
Subjt:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP

Query:  KEKWSKHRPNLNDLKVFGCVGYVH---QNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKE
        +  W+    + + LKVFGC  + H   + + KL  ++  C+F+G+     G+++W P  KK I SRDV F+E
Subjt:  KEKWSKHRPNLNDLKVFGCVGYVH---QNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKE

P92512 Uncharacterized mitochondrial protein AtMg007108.6e-1038.55Show/hide
Query:  LNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATK
        +NR I+E+VR +L    L + F A     A++ + K P  ++ F  P E W +  P  + L+ FGCV Y+H ++GKLK RA K
Subjt:  LNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-1330.92Show/hide
Query:  GEFFNV--FYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCV
        GEF  +  ++ ++GI+   +  +TP+ N +SER +R I+E    LLS+A + + +W      A+Y + + P   L   +P +K     PN + L+VFGC 
Subjt:  GEFFNV--FYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCV

Query:  GYV---HQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKE
         Y      NQ KL  ++ +C+F+G++     +   H    +  ISR V F E
Subjt:  GYV---HQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-1229.14Show/hide
Query:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP
        ++E +    I  L ++N  EF       +  ++GI+   +  +TP+ N +SER +R I+E    LLS+A + + +W      A+Y + + P   L   +P
Subjt:  MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTP

Query:  KEKWSKHRPNLNDLKVFGCVGYV---HQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRC
         +K     PN   LKVFGC  Y      N+ KL+ ++ +C F+G++     +   H    +   SR V F E RC
Subjt:  KEKWSKHRPNLNDLKVFGCVGYV---HQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRC

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.1e-1138.55Show/hide
Query:  LNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATK
        +NR I+E+VR +L    L + F A     A++ + K P  ++ F  P E W +  P  + L+ FGCV Y+H ++GKLK RA K
Subjt:  LNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPNLNDLKVFGCVGYVHQNQGKLKARATK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGAAAAACAAACAGATAAAGAGATTAAGTATCTTAAAACTAACAATAGTTTGGAGTTCCGTGGAGAGTTTTTCAATGTTTTTTATATGGAAAATGGTATCACAAG
ACATAAGGCTGTGAGATACACACCTCAACAAAATAGGGTGTCAGAAAGACTCAATAGAATAATCATGGAAAGAGTTAGATGCTTATTATCAAATGCTATCCTAGAAGAAA
AGTTTTGGGCTAAAGTTGTTGTCTATGCTATGTACACACTAAAGAAAAGCCCTCATATTTCTTTAGGATTCTTAACACCTAAGGAGAAATGGTCCAAACATCGTCCAAAT
CTAAATGATCTCAAGGTGTTTGGATGTGTAGGGTATGTTCATCAAAACCAAGGGAAACTAAAGGCAAGAGCTACTAAATGCATGTTTGTTGGCTTCACAAAATGGGTCAA
GGGTTTCAAGATGTGGCATCCCACTGACAAGAAGTTTATAATCAGTAGGGATGTTCATTTCAAAGAAATAAGATGTTTATGCAAGGGAAAGATAATCTTGATGAAAATCC
TAAAGCCACAAAAACCTATATTACTCAGTTTGAGGTGGAGAATACTAGAAATGATGCTCAATCTACTAAGCAAACTATTGCTACTAATAAAAAACAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATAGAAAAACAAACAGATAAAGAGATTAAGTATCTTAAAACTAACAATAGTTTGGAGTTCCGTGGAGAGTTTTTCAATGTTTTTTATATGGAAAATGGTATCACAAG
ACATAAGGCTGTGAGATACACACCTCAACAAAATAGGGTGTCAGAAAGACTCAATAGAATAATCATGGAAAGAGTTAGATGCTTATTATCAAATGCTATCCTAGAAGAAA
AGTTTTGGGCTAAAGTTGTTGTCTATGCTATGTACACACTAAAGAAAAGCCCTCATATTTCTTTAGGATTCTTAACACCTAAGGAGAAATGGTCCAAACATCGTCCAAAT
CTAAATGATCTCAAGGTGTTTGGATGTGTAGGGTATGTTCATCAAAACCAAGGGAAACTAAAGGCAAGAGCTACTAAATGCATGTTTGTTGGCTTCACAAAATGGGTCAA
GGGTTTCAAGATGTGGCATCCCACTGACAAGAAGTTTATAATCAGTAGGGATGTTCATTTCAAAGAAATAAGATGTTTATGCAAGGGAAAGATAATCTTGATGAAAATCC
TAAAGCCACAAAAACCTATATTACTCAGTTTGAGGTGGAGAATACTAGAAATGATGCTCAATCTACTAAGCAAACTATTGCTACTAATAAAAAACAAGTAG
Protein sequenceShow/hide protein sequence
MIEKQTDKEIKYLKTNNSLEFRGEFFNVFYMENGITRHKAVRYTPQQNRVSERLNRIIMERVRCLLSNAILEEKFWAKVVVYAMYTLKKSPHISLGFLTPKEKWSKHRPN
LNDLKVFGCVGYVHQNQGKLKARATKCMFVGFTKWVKGFKMWHPTDKKFIISRDVHFKEIRCLCKGKIILMKILKPQKPILLSLRWRILEMMLNLLSKLLLLIKNK