; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0021071 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0021071
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCMiso1.1chr01:19608252..19608884
RNA-Seq ExpressionCmc01g0021071
SyntenyCmc01g0021071
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.3e-4749.72Show/hide
Query:  GTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDD
        GT    S     K  WH+RLGHP++KVL+ V+++C   V  + +  F ++CQ GK H LPF  S SHA  P  LV++D+WGP P+ ++ GF+YY+ FVDD
Subjt:  GTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDD

Query:  YNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        ++R++WIYPLKQKS  V+AF  F    +NQFNK IK    D G EY  + ++    GI  R+SCPYTS QNGR ERKHR
Subjt:  YNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]4.4e-4853.89Show/hide
Query:  KTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRYSWIYPLKQ
        K  WH++LGHP+SKVLN V+K CN       + EF ++CQ GKAHNLPF  S S A  P +LV+SD+WGP P+ S  GF+YY++F+DD++R++WIYPLKQ
Subjt:  KTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRYSWIYPLKQ

Query:  KSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        KS   +AF  F   V+NQFNK IK    D G E+  + ++    GI  R SCPYTS QNGR ERKHR
Subjt:  KSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

MCH94186.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Trifolium medium]1.2e-4850.84Show/hide
Query:  GTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDD
        GT    S     K  WH+RLGHP++KVL+ V+K+CN  V  + +  F ++CQ GK H LPF  S SHA  P  LV++D+WGP P+ ++ GF+YY+ FVDD
Subjt:  GTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDD

Query:  YNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        ++R++WIYPLKQKS  V+AF  F    +NQFNK IK    D G EY  + ++    GI  R+SCPYTS QNGR ERKHR
Subjt:  YNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

PNY02796.1 copia protein (gag-int-pol protein), partial [Trifolium pratense]2.6e-4849.71Show/hide
Query:  NSCVAMP-KTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRY
        ++C  M  K  WH++LGHP++KVL+ V+K+CN     +   +F ++CQ GK H LPF  S SHA  P +L+++D+WGP P+ S  GF+YY+ F+DD++R+
Subjt:  NSCVAMP-KTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRY

Query:  SWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        +WIYPLKQKS  + AF  F   V+NQFNK IK    D G EY  + ++    GI  R+SCPYTS QNGR ERKHR
Subjt:  SWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

TYJ96768.1 putative mitochondrial protein [Cucumis melo var. makuwa]2.6e-5394.29Show/hide
Query:  SGGTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFV
        S GTNLANSCVA+PKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEF DSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGP PVCSTD FRYYIMFV
Subjt:  SGGTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFV

Query:  DDYNR
        DDY+R
Subjt:  DDYNR

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-4853.89Show/hide
Query:  KTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRYSWIYPLKQ
        K  WH++LGHP+SKVLN V+K CN       + EF ++CQ GKAHNLPF  S S A  P +LV+SD+WGP P+ S  GF+YY++F+DD++R++WIYPLKQ
Subjt:  KTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRYSWIYPLKQ

Query:  KSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        KS   +AF  F   V+NQFNK IK    D G E+  + ++    GI  R SCPYTS QNGR ERKHR
Subjt:  KSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

A0A2K3NIC3 Copia protein (Gag-int-pol protein) (Fragment)1.2e-4849.71Show/hide
Query:  NSCVAMP-KTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRY
        ++C  M  K  WH++LGHP++KVL+ V+K+CN     +   +F ++CQ GK H LPF  S SHA  P +L+++D+WGP P+ S  GF+YY+ F+DD++R+
Subjt:  NSCVAMP-KTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRY

Query:  SWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        +WIYPLKQKS  + AF  F   V+NQFNK IK    D G EY  + ++    GI  R+SCPYTS QNGR ERKHR
Subjt:  SWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

A0A2Z6MBG6 Integrase catalytic domain-containing protein6.2e-4849.72Show/hide
Query:  GTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDD
        GT    S     K  WH+RLGHP++KVL+ V+++C   V  + +  F ++CQ GK H LPF  S SHA  P  LV++D+WGP P+ ++ GF+YY+ FVDD
Subjt:  GTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDD

Query:  YNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        ++R++WIYPLKQKS  V+AF  F    +NQFNK IK    D G EY  + ++    GI  R+SCPYTS QNGR ERKHR
Subjt:  YNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

A0A392N2Z1 Retrovirus-related pol polyprotein from transposon tnt 1-94 (Fragment)5.6e-4950.84Show/hide
Query:  GTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDD
        GT    S     K  WH+RLGHP++KVL+ V+K+CN  V  + +  F ++CQ GK H LPF  S SHA  P  LV++D+WGP P+ ++ GF+YY+ FVDD
Subjt:  GTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDD

Query:  YNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        ++R++WIYPLKQKS  V+AF  F    +NQFNK IK    D G EY  + ++    GI  R+SCPYTS QNGR ERKHR
Subjt:  YNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

A0A5D3BEZ6 Putative mitochondrial protein1.3e-5394.29Show/hide
Query:  SGGTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFV
        S GTNLANSCVA+PKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEF DSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGP PVCSTD FRYYIMFV
Subjt:  SGGTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFV

Query:  DDYNR
        DDY+R
Subjt:  DDYNR

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.9e-1833.91Show/hide
Query:  VWHKRLGHPS-SKVLNSVIKN--CNFPVKDNFDV--EFFDSCQLGKAHNLPFP--KSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRYSWI
        +WH+R GH S  K+L    KN   +  + +N ++  E  + C  GK   LPF   K ++H   P  +V+SD+ GP+   + D   Y+++FVD +  Y   
Subjt:  VWHKRLGHPS-SKVLNSVIKN--CNFPVKDNFDV--EFFDSCQLGKAHNLPFP--KSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRYSWI

Query:  YPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEY--HKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        Y +K KS     F  FV   +  FN  +     DNG EY  +++ Q C   GIS  L+ P+T   NG  ER  R
Subjt:  YPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEY--HKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.6e-2432.8Show/hide
Query:  SSAFVLSGGTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFR
        ++A +  G  N A   +++   +WHKR+GH S K L  + K           V+  D C  GK H + F  S        +LVYSD+ GP+ + S  G +
Subjt:  SSAFVLSGGTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFR

Query:  YYIMFVDDYNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEY--HKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        Y++ F+DD +R  W+Y LK K    + F  F   V+ +  + +K   SDNG EY   +  + CS+ GI    + P T   NG  ER +R
Subjt:  YYIMFVDDYNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEY--HKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

Q12491 Transposon Ty2-B Gag-Pol polyprotein9.6e-1425Show/hide
Query:  NLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDS-------CQLGKAHNLPFPKSQ----SHATAPFNLVYSDLWGPVPVCSTDGF
        N + S    P  + H+ LGH + + +   +K          D+E+ ++       C +GK+      K        +  PF  +++D++GPV        
Subjt:  NLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDS-------CQLGKAHNLPFPKSQ----SHATAPFNLVYSDLWGPVPVCSTDGF

Query:  RYYIMFVDDYNRYSWIYPL--KQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEY--HKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
         Y+I F D+  R+ W+YPL  +++   +  F   + ++KNQFN  +     D G EY    +H+  +N GI++  +    S  +G  ER +R
Subjt:  RYYIMFVDDYNRYSWIYPL--KQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEY--HKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.0e-3343.37Show/hide
Query:  WHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFF--DSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRYSWIYPLKQK
        WH RLGHP+  +LNSVI N +  V  N   +F     C + K++ +PF +S  ++T P   +YSD+W   P+ S D +RYY++FVD + RY+W+YPLKQK
Subjt:  WHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFF--DSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNRYSWIYPLKQK

Query:  SVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        S   E F  F   ++N+F   I  F SDNG E+  + +  S  GIS   S P+T   NG  ERKHR
Subjt:  SVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-3341.48Show/hide
Query:  ANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPV-KDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNR
        A+ C     + WH RLGHPS  +LNSVI N + PV   +  +     C + K+H +PF  S   ++ P   +YSD+W   P+ S D +RYY++FVD + R
Subjt:  ANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPV-KDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIMFVDDYNR

Query:  YSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR
        Y+W+YPLKQKS   + F  F   V+N+F   I    SDNG E+  +    S  GIS   S P+T   NG  ERKHR
Subjt:  YSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAAAAATAAAGGTTCATCTGCATTTGTGTTGTCAGGAGGAACAAATCTTGCCAATTCTTGTGTAGCGATGCCTAAAACTGTTTGGCACAAACGGTTAGGACATCC
TTCATCAAAAGTTTTGAACTCAGTTATAAAGAATTGTAACTTTCCTGTTAAAGACAATTTTGATGTTGAATTTTTTGATTCCTGTCAGTTGGGAAAAGCTCATAATCTTC
CCTTTCCTAAATCTCAATCGCATGCTACTGCACCTTTTAATCTTGTTTATTCTGATTTATGGGGGCCAGTCCCAGTATGCTCTACTGATGGTTTTCGTTATTATATAATG
TTTGTTGATGACTACAACAGGTATTCGTGGATTTATCCTCTTAAACAAAAGAGTGTTGCTGTTGAAGCTTTTAACCATTTTGTTATATATGTCAAAAATCAATTTAACAA
ATCTATTAAGGAGTTCCCATCAGACAACGGAGATGAGTATCATAAGATACATCAAATATGCTCAAATATGGGAATCAGTAGTAGATTATCTTGTCCTTATACTTCGGGTC
AAAATGGAAGGGTAGAAAGAAAACATAGACAAACTTGGTTGAAACTGGACTTACACTGCTTGCTAAAGCAAATATTACATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACAAAAATAAAGGTTCATCTGCATTTGTGTTGTCAGGAGGAACAAATCTTGCCAATTCTTGTGTAGCGATGCCTAAAACTGTTTGGCACAAACGGTTAGGACATCC
TTCATCAAAAGTTTTGAACTCAGTTATAAAGAATTGTAACTTTCCTGTTAAAGACAATTTTGATGTTGAATTTTTTGATTCCTGTCAGTTGGGAAAAGCTCATAATCTTC
CCTTTCCTAAATCTCAATCGCATGCTACTGCACCTTTTAATCTTGTTTATTCTGATTTATGGGGGCCAGTCCCAGTATGCTCTACTGATGGTTTTCGTTATTATATAATG
TTTGTTGATGACTACAACAGGTATTCGTGGATTTATCCTCTTAAACAAAAGAGTGTTGCTGTTGAAGCTTTTAACCATTTTGTTATATATGTCAAAAATCAATTTAACAA
ATCTATTAAGGAGTTCCCATCAGACAACGGAGATGAGTATCATAAGATACATCAAATATGCTCAAATATGGGAATCAGTAGTAGATTATCTTGTCCTTATACTTCGGGTC
AAAATGGAAGGGTAGAAAGAAAACATAGACAAACTTGGTTGAAACTGGACTTACACTGCTTGCTAAAGCAAATATTACATTAA
Protein sequenceShow/hide protein sequence
MHKNKGSSAFVLSGGTNLANSCVAMPKTVWHKRLGHPSSKVLNSVIKNCNFPVKDNFDVEFFDSCQLGKAHNLPFPKSQSHATAPFNLVYSDLWGPVPVCSTDGFRYYIM
FVDDYNRYSWIYPLKQKSVAVEAFNHFVIYVKNQFNKSIKEFPSDNGDEYHKIHQICSNMGISSRLSCPYTSGQNGRVERKHRQTWLKLDLHCLLKQILH