CuGenDBv2

Cmc08g0229971 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0229971
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr08:25404411..25404938
RNA-Seq Expression: Cmc08g0229971
Synteny: Cmc08g0229971
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
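
The genome location string can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this `name:start..end` notation, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split 'name:start..end' into (name, start, end, length).

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record), so length = end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    name = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return name, start, end, end - start + 1

name, start, end, length = parse_location("CMiso1.1chr08:25404411..25404938")
print(name, start, end, length)
```

Under that assumption the span is 528 bp, which equals 176 codons, that is, the 175 residues of the protein sequence listed for this gene plus one stop codon.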


Homology
GenBank top hits
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] | e value: 3.9e-74 | % identity: 80.81
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MD +FQDYLIE GIQSQL APSTPQQN VSER N+TLLD+V SMMS+AQLPDSFWGYALE  I ILNNVPSKSV +TPYELWK RK SL +F  WGCPAH
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD
        VL QNPKKLE RSKLCLFVGYPKES+GGLFY PQENKVFVSTNATFLEEDH RNHQ RSK+VL+E+ KN  D
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 4.0e-71 | % identity: 78.18
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MDLRFQDY+IEHGIQSQL AP TPQQN VSER N+TLLD+V SMMS+AQLP SFWGYA+E  + ILNNVPSKSVS+TP+ELW+ RK SL HF  WGCPAH
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEE
        VL  NPKKLE RS+LC FVGYPKE++GGLF+DPQEN+VFVSTNATFLEEDH+RNH+ RSKLVL E
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEE

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 4.0e-71 | % identity: 78.18
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MDLRFQDY+IEHGIQSQL AP TPQQN VSER N+TLLD+V SMMS+AQLP SFWGYA+E  + ILNNVPSKSVS+TP+ELW+ RK SL HF  WGCPAH
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEE
        VL  NPKKLE RS+LC FVGYPKE++GGLF+DPQEN+VFVSTNATFLEEDH+RNH+ RSKLVL E
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEE

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 6.8e-79 | % identity: 85.47
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MDLRFQDYLIEHGIQSQ  APS PQQN V +R N+ LLD+V SMMSF QLPDSFW YALE TI ILNNVPSKSVS+TPYELWK RKGSL HF  WGCPAH
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD
        V  QNPKKLE RSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKN  D
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD

TYK30724.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 8.9e-79 | % identity: 85.47
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MDLRFQDYLIEH IQSQL AP+TPQQN VSER N+ LLD+V SM SFAQLPDSFWGYALE TI ILNNVPSKSVS+TPYELWK RKGSL HF  WGCP+H
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD
        VL QNP KLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVS NATFLEE HIRNHQTR+KLVLEEISKN  D
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD

TrEMBL top hits
A0A5A7TZD0 Gag/pol protein | e value: 1.9e-71 | % identity: 78.18
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MDLRFQDY+IEHGIQSQL AP TPQQN VSER N+TLLD+V SMMS+AQLP SFWGYA+E  + ILNNVPSKSVS+TP+ELW+ RK SL HF  WGCPAH
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEE
        VL  NPKKLE RS+LC FVGYPKE++GGLF+DPQEN+VFVSTNATFLEEDH+RNH+ RSKLVL E
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEE

A0A5D3BUN8 Gag/pol protein | e value: 1.9e-71 | % identity: 78.18
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MDLRFQDY+IEHGIQSQL AP TPQQN VSER N+TLLD+V SMMS+AQLP SFWGYA+E  + ILNNVPSKSVS+TP+ELW+ RK SL HF  WGCPAH
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEE
        VL  NPKKLE RS+LC FVGYPKE++GGLF+DPQEN+VFVSTNATFLEEDH+RNH+ RSKLVL E
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEE

A0A5D3BX45 Gag/pol protein | e value: 3.3e-79 | % identity: 85.47
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MDLRFQDYLIEHGIQSQ  APS PQQN V +R N+ LLD+V SMMSF QLPDSFW YALE TI ILNNVPSKSVS+TPYELWK RKGSL HF  WGCPAH
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD
        V  QNPKKLE RSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKN  D
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD

A0A5D3E496 Gag/pol protein | e value: 4.3e-79 | % identity: 85.47
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MDLRFQDYLIEH IQSQL AP+TPQQN VSER N+ LLD+V SM SFAQLPDSFWGYALE TI ILNNVPSKSVS+TPYELWK RKGSL HF  WGCP+H
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD
        VL QNP KLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVS NATFLEE HIRNHQTR+KLVLEEISKN  D
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD

E2GK51 Gag/pol protein (Fragment) | e value: 1.9e-74 | % identity: 80.81
Query:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH
        MD +FQDYLIE GIQSQL APSTPQQN VSER N+TLLD+V SMMS+AQLPDSFWGYALE  I ILNNVPSKSV +TPYELWK RK SL +F  WGCPAH
Subjt:  MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAH

Query:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD
        VL QNPKKLE RSKLCLFVGYPKES+GGLFY PQENKVFVSTNATFLEEDH RNHQ RSK+VL+E+ KN  D
Subjt:  VLAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRD

SwissProt top hits
P04146 Copia protein | e value: 1.6e-17 | % identity: 31.14
Query:  QDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSV---SQTPYELWKRRKGSLGHFMTWGCPAHVL
        + + ++ GI   L  P TPQ N VSER  +T+ +   +M+S A+L  SFWG A+     ++N +PS+++   S+TPYE+W  +K  L H   +G   +V 
Subjt:  QDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSV---SQTPYELWKRRKGSLGHFMTWGCPAHVL

Query:  AQNPK-KLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISK
         +N + K + +S   +FVGY  E  G   +D    K  V+ +    E + + +   + + V  + SK
Subjt:  AQNPK-KLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.4e-23 | % identity: 35.12
Query:  FQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVS-QTPYELWKRRKGSLGHFMTWGCP--AHV
        F++Y   HGI+ +   P TPQ N V+ER N+T+++ V SM+  A+LP SFWG A++    ++N  PS  ++ + P  +W  ++ S  H   +GC   AHV
Subjt:  FQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVS-QTPYELWKRRKGSLGHFMTWGCP--AHV

Query:  LAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKN
          +   KL+ +S  C+F+GY  E  G   +DP + KV  S +  F  E  +R     S+ V   I  N
Subjt:  LAQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.0e-13 | % identity: 24.14
Query:  DYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVS-QTPYELWKRRKGSLGHFMTWGCPAH--VLA
        +Y  +HGI      P TP+ N +SER ++ +++   +++S A +P ++W YA  + + ++N +P+  +  ++P++       +      +GC  +  +  
Subjt:  DYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVS-QTPYELWKRRKGSLGHFMTWGCPAH--VLA

Query:  QNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLE
         N  KL+ +S+ C+F+GY       L    Q +++++S +  F E
Subjt:  QNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 3.6e-14 | % identity: 26.71
Query:  QDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVS-QTPYELWKRRKGSLGHFMTWGCPAH--VL
        +DYL +HGI      P TP+ N +SER ++ ++++  +++S A +P ++W YA  + + ++N +P+  +  Q+P++    +  +      +GC  +  + 
Subjt:  QDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVS-QTPYELWKRRKGSLGHFMTWGCPAH--VL

Query:  AQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLE
          N  KLE +SK C F+GY       L       +++ S +  F E
Subjt:  AQNPKKLEHRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLE

Arabidopsis top hits
No hits found
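
The % identity values reported for the hits above are consistent with the usual BLAST reading: the number of identical positions divided by the alignment length. For the first GenBank hit, for example, the two alignment blocks cover 100 + 72 = 172 positions, and 139 / 172 gives 80.81. The sketch below counts identities from the middle line of each alignment block, where a letter marks an identical residue and `+` or a space marks a non-identical position. The function name `percent_identity` and the short midline in the usage lines are hypothetical illustrations, not data from this record.

```python
def percent_identity(midlines, aligned_length):
    """Percent identity from BLAST-style alignment midlines.

    midlines: the middle line of each alignment block; letters mark
              identical residues, '+' and spaces mark the rest.
    aligned_length: total number of aligned positions (columns).
    """
    identities = sum(ch.isalpha() for line in midlines for ch in line)
    return round(100.0 * identities / aligned_length, 2)

# Hypothetical 10-column alignment split over two blocks: 6 identities.
print(percent_identity(["MD +F", "AK  L"], 10))

# Arithmetic check for the first GenBank hit: 139 identities over 172 columns.
print(round(100.0 * 139 / 172, 2))
```

Counting only letters keeps the result independent of how whitespace in the midline was preserved when the page was copied.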

Sequences
CDS sequence
ATGGACTTGCGATTCCAAGACTATTTGATAGAGCATGGAATCCAATCACAACTTTTTGCACCTAGTACGCCTCAACAGAACGATGTATCAGAAAGAGGAAACCAAACTTT
GTTAGACGTGGTTTGCTCTATGATGAGTTTTGCTCAGTTGCCAGATTCTTTTTGGGGATATGCTTTAGAAATAACTATCTGTATTTTGAACAATGTTCCCTCTAAAAGTG
TTTCTCAAACACCTTATGAGCTCTGGAAAAGACGTAAAGGTAGTTTAGGTCACTTTATGACTTGGGGATGTCCAGCACACGTGTTGGCACAAAACCCTAAAAAATTGGAA
CATCGTTCAAAATTATGCCTATTTGTAGGATATCCAAAAGAATCAAAAGGTGGTTTATTTTATGATCCTCAAGAAAATAAAGTATTTGTATCGACAAATGCTACGTTCTT
AGAGGAAGACCACATAAGAAATCATCAAACTCGCAGTAAACTAGTATTAGAAGAAATTTCCAAGAATGTTAGAGATATCATCTACTAA
mRNA sequence
ATGGACTTGCGATTCCAAGACTATTTGATAGAGCATGGAATCCAATCACAACTTTTTGCACCTAGTACGCCTCAACAGAACGATGTATCAGAAAGAGGAAACCAAACTTT
GTTAGACGTGGTTTGCTCTATGATGAGTTTTGCTCAGTTGCCAGATTCTTTTTGGGGATATGCTTTAGAAATAACTATCTGTATTTTGAACAATGTTCCCTCTAAAAGTG
TTTCTCAAACACCTTATGAGCTCTGGAAAAGACGTAAAGGTAGTTTAGGTCACTTTATGACTTGGGGATGTCCAGCACACGTGTTGGCACAAAACCCTAAAAAATTGGAA
CATCGTTCAAAATTATGCCTATTTGTAGGATATCCAAAAGAATCAAAAGGTGGTTTATTTTATGATCCTCAAGAAAATAAAGTATTTGTATCGACAAATGCTACGTTCTT
AGAGGAAGACCACATAAGAAATCATCAAACTCGCAGTAAACTAGTATTAGAAGAAATTTCCAAGAATGTTAGAGATATCATCTACTAA
Protein sequence
MDLRFQDYLIEHGIQSQLFAPSTPQQNDVSERGNQTLLDVVCSMMSFAQLPDSFWGYALEITICILNNVPSKSVSQTPYELWKRRKGSLGHFMTWGCPAHVLAQNPKKLE
HRSKLCLFVGYPKESKGGLFYDPQENKVFVSTNATFLEEDHIRNHQTRSKLVLEEISKNVRDIIY
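
The relationship between the CDS and the protein sequence can be checked by translating codons with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies to this gene; the helper name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and last 12 nt of the CDS sequence listed in this record.
print(translate("ATGGACTTGCGATTCCAAGACTATTTGATA"))
print(translate("ATCATCTACTAA"))
```

The two fragments translate to the start (`MDLRFQDYLI`) and the end (`IIY`, followed by a stop) of the protein sequence listed above. Passing the full 528-nt CDS to the same function is expected to give the whole protein followed by `*`.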