CuGenDBv2

Cmc12g0318241 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc12g0318241
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CMiso1.1chr12:3299338..3299919
RNA-Seq Expression: Cmc12g0318241
Synteny: Cmc12g0318241
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
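
As a quick sanity check on the genome location listed above, the length of the interval can be computed from the coordinate string. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr12:3299338..3299919")
length = span_length(start, end)
print(seqid, length, length % 3)
```

Under that assumption the interval spans 582 bp, a multiple of three, which would correspond to 193 codons plus a stop codon for an intronless gene.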


Homology
GenBank top hits (columns: e-value, % identity, alignment)
AAU90333.1 Putative gag and pol polyprotein, identical [Solanum demissum] (e-value: 3.0e-51; identity: 55.79%)
Query:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF
        CS+AKITK  H  V R T  LEL+H+D+CE  G LTR   R  +TFIDD S +T++YL+KNKSDA+E FK ++ ++ENQF ++IKR+RSDRG EY+S  F
Subjt:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF

Query:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW
        N F  S GIIHETT PYS   NG AERKNRTL EL  A+L+E  A  + WGE I T  YVLNR+P   SK + +E+ K   P+L YL  W
Subjt:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW

ABI34306.1 Polyprotein, putative [Solanum demissum] (e-value: 1.9e-53; identity: 56.84%)
Query:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF
        CS+AKITK  H  V R T+ LEL+H+D+CE  G LTR   RY +TFIDD S +T++YL+KNKSDA+E FK ++ ++ENQF ++IKR+RSDRG EY+S  F
Subjt:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF

Query:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW
        N F  S GIIHETT PYS   NG AERKNRTL EL  A+L+E  A  + WGE I T  YVLNR+P   SK +P+E+ K   P+L YL  W
Subjt:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW

AOZ57169.1 integrase, partial [Coccinia grandis] (e-value: 1.2e-49; identity: 75.38%)
Query:  ITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAFNEFYS
        ITKTSHKSV RV+EPL+LIHS+LCEFDG LTRNSKRY +TFIDDCSDYTFIYLLKNKSDA++ FK FV ++ENQFN+++KRL  DRGT YDS +FN FY+
Subjt:  ITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAFNEFYS

Query:  SKGIIHETTVPYSLEMNGKAERKNRTLTEL
        S GIIH  T PYS EMNGKAERKNRT   L
Subjt:  SKGIIHETTVPYSLEMNGKAERKNRTLTEL

KAA0034938.1 putative Polyprotein [Cucumis melo var. makuwa] (e-value: 4.1e-93; identity: 91.58%)
Query:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF
        CSQAKITKTSHK VTRVT+PLELIHSDLCEFDG+LTRNSKRYV+TFIDDCSDYTFIYLLKNKSDAYEMFKVFVT+IENQFNKRIKRLRSDRGTEYDSVAF
Subjt:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF

Query:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW
        NEFY+SKGIIHETT PYS EMNGK ERKNRTLTEL VAILLE  AAPS WGEIIKTVNYVLNRIPKSNSKTSPYEVLKHK PNL+YL TW
Subjt:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW

WP_192824023.1 DDE-type integrase/transposase/recombinase, partial [Escherichia coli] (e-value: 1.2e-47; identity: 49.74%)
Query:  MCSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVA
        +C +AK+TKTS KSV R TEPLELIHSD+C+F    TR   +Y +TFIDD + Y+++YLLKNK +A + F ++  ++ENQ N+RIK +RSDRG EY++  
Subjt:  MCSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVA

Query:  FNEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW
          +F +S GI+HE T PYS + NG AERKNRTL +++ A+L+  G   + WGE I T NY+LN++P+     +PYE+ K + P+  YL  W
Subjt:  FNEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A5D3DCJ1 Putative Polyprotein (e-value: 2.0e-93; identity: 91.58%)
Query:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF
        CSQAKITKTSHK VTRVT+PLELIHSDLCEFDG+LTRNSKRYV+TFIDDCSDYTFIYLLKNKSDAYEMFKVFVT+IENQFNKRIKRLRSDRGTEYDSVAF
Subjt:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF

Query:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW
        NEFY+SKGIIHETT PYS EMNGK ERKNRTLTEL VAILLE  AAPS WGEIIKTVNYVLNRIPKSNSKTSPYEVLKHK PNL+YL TW
Subjt:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW

A0A7N2L531 Uncharacterized protein (e-value: 2.5e-59; identity: 63.16%)
Query:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF
        CSQAKITK  HK+V R TE LELIHSDLCEF+G LTR   RY++TFIDD S YT IYLLKNKSDA+E F+ F+ ++ENQF ++IKR+RSDRG EY+S AF
Subjt:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF

Query:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW
        N F  S GIIHETT PYS   NG AERKNRTL EL  A+L+E GA    WGE I T  +VLNR+P   S T+P+E+ K   PNL YL  W
Subjt:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW

A0A7N2R9F3 Uncharacterized protein (e-value: 2.5e-59; identity: 63.16%)
Query:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF
        CSQAKITK  HKSV R TE LELIHSDLCEF+G LTR   RY++TFIDD S YT IYLLKNKSDA+E F+ F+ ++ENQF ++IKR+RSDRG EY+S AF
Subjt:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF

Query:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW
        N F  S GIIHETT PYS   NG  ERKNRTL EL  A+L+E GA    WGE I T  +VLNR+P   S T+P+E+ K   PNL YL  W
Subjt:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW

Q0KIN7 Polyprotein, putative (e-value: 9.0e-54; identity: 56.84%)
Query:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF
        CS+AKITK  H  V R T+ LEL+H+D+CE  G LTR   RY +TFIDD S +T++YL+KNKSDA+E FK ++ ++ENQF ++IKR+RSDRG EY+S  F
Subjt:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF

Query:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW
        N F  S GIIHETT PYS   NG AERKNRTL EL  A+L+E  A  + WGE I T  YVLNR+P   SK +P+E+ K   P+L YL  W
Subjt:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW

Q60D13 Putative gag and pol polyprotein, identical (e-value: 1.4e-51; identity: 55.79%)
Query:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF
        CS+AKITK  H  V R T  LEL+H+D+CE  G LTR   R  +TFIDD S +T++YL+KNKSDA+E FK ++ ++ENQF ++IKR+RSDRG EY+S  F
Subjt:  CSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAF

Query:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW
        N F  S GIIHETT PYS   NG AERKNRTL EL  A+L+E  A  + WGE I T  YVLNR+P   SK + +E+ K   P+L YL  W
Subjt:  NEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITW

SwissProt top hits (columns: e-value, % identity, alignment)
P04146 Copia protein (e-value: 6.3e-28; identity: 36.7%)
Query:  QAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAFNE
        QA++     K  T +  PL ++HSD+C     +T + K Y + F+D  + Y   YL+K KSD + MF+ FV K E  FN ++  L  D G EY S    +
Subjt:  QAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAFNE

Query:  FYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKS---NSKTSPYEVLKHKTPNLTYL
        F   KGI +  TVP++ ++NG +ER  RT+TE    ++       S WGE + T  Y++NRIP     +S  +PYE+  +K P L +L
Subjt:  FYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKS---NSKTSPYEVLKHKTPNLTYL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 7.7e-26; identity: 41.38%)
Query:  LELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTVPYSLE
        L+L++SD+C      +    +Y +TFIDD S   ++Y+LK K   +++F+ F   +E +  +++KRLRSD G EY S  F E+ SS GI HE TVP + +
Subjt:  LELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTVPYSLE

Query:  MNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIP
         NG AER NRT+ E V ++L       S WGE ++T  Y++NR P
Subjt:  MNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIP

Q12337 Transposon Ty2-GR1 Gag-Pol polyprotein (e-value: 5.3e-11; identity: 25.14%)
Query:  CSQAKITKTSHKSVTRVT-----EPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLL--KNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGT
        C   K TK  H   +R+      EP + +H+D+      L +++  Y ++F D+ + + ++Y L  + +     +F   +  I+NQFN R+  ++ DRG+
Subjt:  CSQAKITKTSHKSVTRVT-----EPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLL--KNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGT

Query:  EYDSVAFNEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTS
        EY +   ++F++++GI    T       +G AER NRTL      +L   G     W   ++    + N +     + S
Subjt:  EYDSVAFNEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 4.8e-20; identity: 34.04%)
Query:  CSQAKITKTS----HKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYD
        CS   I K++     +S    T PLE I+SD+      L+ ++ RY + F+D  + YT++Y LK KS   E F  F   +EN+F  RI    SD G E+ 
Subjt:  CSQAKITKTS----HKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYD

Query:  SVAFNEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSK-TSPYEVLKHKTPN
         VA  E++S  GI H T+ P++ E NG +ERK+R + E  + +L       + W        Y++NR+P    +  SP++ L   +PN
Subjt:  SVAFNEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSK-TSPYEVLKHKTPN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 8.2e-20; identity: 33.86%)
Query:  CSQAKITKTSHK-----SVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEY
        CS   I K SHK     S    ++PLE I+SD+      L+ ++ RY + F+D  + YT++Y LK KS   + F +F + +EN+F  RI  L SD G E+
Subjt:  CSQAKITKTSHK-----SVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEY

Query:  DSVAFNEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSK-TSPYEVLKHKTPN
          V   ++ S  GI H T+ P++ E NG +ERK+R + E+ + +L       + W        Y++NR+P    +  SP++ L  + PN
Subjt:  DSVAFNEFYSSKGIIHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSK-TSPYEVLKHKTPN

Arabidopsis top hits (columns: e-value, % identity, alignment)
No hits found
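
The % identity values in the hit lists above can in principle be recomputed from the middle line of each alignment. The sketch below assumes the usual BLAST-style convention, in which the middle line shows a letter at identical positions, `+` at conservative substitutions, and a space at mismatches or gaps. `percent_identity` is a hypothetical helper, and the strings in the example are toy data, not taken from this record.

```python
def percent_identity(midline, aligned_length=None):
    """Percent identity from a BLAST-style alignment middle line.

    Letters mark identical residues; '+' and spaces do not count.
    Pass aligned_length explicitly if trailing spaces of the middle
    line may have been stripped during extraction.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy example: 6 aligned columns, 4 of them identical.
toy_query = "ACDEFG"
toy_midline = "AC+ FG"
print(round(percent_identity(toy_midline, len(toy_query)), 2))
```

For a multi-block alignment, the middle lines of all blocks would be concatenated first and the total aligned length passed as `aligned_length`.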

Sequences
CDS sequence
ATGTGTAGTCAAGCTAAGATAACTAAAACCTCGCATAAGTCTGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTCTGACTTATGTGAATTTGATGGCTCTTTAAC
TAGAAACAGTAAAAGGTATGTACTTACCTTTATAGATGACTGCTCTGACTACACTTTTATTTATCTGCTTAAAAATAAAAGTGATGCATATGAAATGTTCAAAGTCTTTG
TAACTAAAATAGAGAACCAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAACTGAATATGATTCAGTTGCTTTCAATGAGTTTTATAGCTCAAAAGGAATA
ATACATGAAACTACTGTGCCTTATTCTCTTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTACTTGAGTTAGGAGCAGC
ACCATCTTGTTGGGGTGAAATAATTAAGACTGTTAATTATGTTCTTAATAGGATTCCTAAATCTAATAGTAAAACTTCACCATACGAAGTCCTTAAACACAAAACACCAA
ACTTGACTTATCTTATAACTTGGGCTGTCTAG
mRNA sequence
ATGTGTAGTCAAGCTAAGATAACTAAAACCTCGCATAAGTCTGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTCTGACTTATGTGAATTTGATGGCTCTTTAAC
TAGAAACAGTAAAAGGTATGTACTTACCTTTATAGATGACTGCTCTGACTACACTTTTATTTATCTGCTTAAAAATAAAAGTGATGCATATGAAATGTTCAAAGTCTTTG
TAACTAAAATAGAGAACCAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAACTGAATATGATTCAGTTGCTTTCAATGAGTTTTATAGCTCAAAAGGAATA
ATACATGAAACTACTGTGCCTTATTCTCTTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTACTTGAGTTAGGAGCAGC
ACCATCTTGTTGGGGTGAAATAATTAAGACTGTTAATTATGTTCTTAATAGGATTCCTAAATCTAATAGTAAAACTTCACCATACGAAGTCCTTAAACACAAAACACCAA
ACTTGACTTATCTTATAACTTGGGCTGTCTAG
Protein sequence
MCSQAKITKTSHKSVTRVTEPLELIHSDLCEFDGSLTRNSKRYVLTFIDDCSDYTFIYLLKNKSDAYEMFKVFVTKIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGI
IHETTVPYSLEMNGKAERKNRTLTELVVAILLELGAAPSCWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTPNLTYLITWAV
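
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and reading frame 1. `translate` is a hypothetical helper written for illustration, and only the first 30 nucleotides of the CDS are used here.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
first_codons = "ATGTGTAGTCAAGCTAAGATAACTAAAACC"
print(translate(first_codons))  # MCSQAKITKT
```

The result matches the first ten residues of the protein sequence; passing the full CDS string to the same function would allow the whole protein to be compared.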