; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0103111 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0103111
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:20132968..20133384
RNA-Seq ExpressionCmc04g0103111
SyntenyCmc04g0103111
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032016.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]1.1e-5884.21Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQTHPSQELGEPRRSGRVV+QPD YLGLSEA IIIPDDGIEDPLTYKQAM DVDCDQ IKAMD EMESMYSNS+WTLVDQP++V+PIGCKW+YKRKR+
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLSYCH
        QASKVQTF ARL+AKGYTQ  GIDY++T  +CH
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLSYCH

KAA0037569.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-5987.69Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQTHPSQELGEPRRSGRVV+QPD YLGLSEA IIIPDDGIEDPLT+KQAM DVDCD+ IKAMDLEMESMYSNS+WTLVDQPNDVKPIGCKWIYKRKRD
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS
        Q  KVQTFKA+LVAKGYTQK G+D E+T S
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS

KAA0050328.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-5887.69Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQTHPSQELGEP RSGRVV+QPD YLGLSEA IIIPDDGIEDPLT KQAM DVDC+Q IKAMDLEMES+YSNS+WTLVDQPNDV+PIGCKWIYKRKRD
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS
        QA KV TFKARLVAKGYTQK GIDYE+T S
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS

KAA0059016.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-5886.92Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQTHPSQELGEPRRSGRVV+QP+ Y GLSEA IIIPDD I+DPLTYKQAM DVD DQLIKAMDLEMESMYSNS+WTLVDQP++V+PIGCKWIYKRKRD
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS
        QA KVQTFKARLVAKGYTQK GIDYE+T S
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS

TYK05032.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-5882.48Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQ H SQELGEPR SGRVV+QPD YLGLSEA I+IPD+GIEDPLTYKQAM DVDCDQ +KAMDLE+ESMY NS+WTLVDQ ND K IGCKWIYKRKRD
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLSYCHDKVD
        QA KVQTFKARLVAKGYTQK G+DYE+TLS CH +VD
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLSYCHDKVD

TrEMBL top hitse value%identityAlignment
A0A5A7T8E8 Gag/pol protein1.0e-5987.69Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQTHPSQELGEPRRSGRVV+QPD YLGLSEA IIIPDDGIEDPLT+KQAM DVDCD+ IKAMDLEMESMYSNS+WTLVDQPNDVKPIGCKWIYKRKRD
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS
        Q  KVQTFKA+LVAKGYTQK G+D E+T S
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS

A0A5A7U4Y4 Gag/pol protein6.7e-5987.69Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQTHPSQELGEP RSGRVV+QPD YLGLSEA IIIPDDGIEDPLT KQAM DVDC+Q IKAMDLEMES+YSNS+WTLVDQPNDV+PIGCKWIYKRKRD
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS
        QA KV TFKARLVAKGYTQK GIDYE+T S
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS

A0A5A7UXV1 Gag/pol protein6.7e-5986.92Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQTHPSQELGEPRRSGRVV+QP+ Y GLSEA IIIPDD I+DPLTYKQAM DVD DQLIKAMDLEMESMYSNS+WTLVDQP++V+PIGCKWIYKRKRD
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS
        QA KVQTFKARLVAKGYTQK GIDYE+T S
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLS

A0A5D3C1H5 Gag/pol protein5.1e-5982.48Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQ H SQELGEPR SGRVV+QPD YLGLSEA I+IPD+GIEDPLTYKQAM DVDCDQ +KAMDLE+ESMY NS+WTLVDQ ND K IGCKWIYKRKRD
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLSYCHDKVD
        QA KVQTFKARLVAKGYTQK G+DYE+TLS CH +VD
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLSYCHDKVD

A0A5D3CYG9 Retrovirus-related pol polyprotein from transposon tnt 1-945.1e-5984.21Show/hide
Query:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD
        NIGQTHPSQELGEPRRSGRVV+QPD YLGLSEA IIIPDDGIEDPLTYKQAM DVDCDQ IKAMD EMESMYSNS+WTLVDQP++V+PIGCKW+YKRKR+
Subjt:  NIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRD

Query:  QASKVQTFKARLVAKGYTQKGGIDYEKTLSYCH
        QASKVQTF ARL+AKGYTQ  GIDY++T  +CH
Subjt:  QASKVQTFKARLVAKGYTQKGGIDYEKTLSYCH

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.8e-0830.61Show/hide
Query:  AHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS
        AH I  D     P ++ +  Y  D     +A++ E+ +   N+ WT+  +P +   +  +W++  K ++      +KARLVA+G+TQK  IDYE+T +
Subjt:  AHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-1639.06Show/hide
Query:  HPSQ--ELGEP-RRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQA
        HP+Q  E  +P RRS R   +   Y   S  +++I DD   +P + K+ +   + +QL+KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D  
Subjt:  HPSQ--ELGEP-RRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQA

Query:  SKVQTFKARLVAKGYTQKGGIDYEKTLS
         K+  +KARLV KG+ QK GID+++  S
Subjt:  SKVQTFKARLVAKGYTQKGGIDYEKTLS

P92520 Uncharacterized mitochondrial protein AtMg008208.8e-0839.13Show/hide
Query:  KAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS
        +AM  E++++  N  W LV  P +   +GCKW++K K      +   KARLVAKG+ Q+ GI + +T S
Subjt:  KAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.5e-1132.26Show/hide
Query:  SQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLV-DQPNDVKPIGCKWIYKRKRDQASKVQ
        +  +G   ++G +   P + L +S A          +P T  QA+ D   ++   AM  E+ +   N  W LV   P+ V  +GC+WI+ +K +    + 
Subjt:  SQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLV-DQPNDVKPIGCKWIYKRKRDQASKVQ

Query:  TFKARLVAKGYTQKGGIDYEKTLS
         +KARLVAKGY Q+ G+DY +T S
Subjt:  TFKARLVAKGYTQKGGIDYEKTLS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-1040.91Show/hide
Query:  DPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLV-DQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS
        +P T  QAM D   D+  +AM  E+ +   N  W LV   P  V  +GC+WI+ +K +    +  +KARLVAKGY Q+ G+DY +T S
Subjt:  DPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLV-DQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.4e-1443.82Show/hide
Query:  EDPLTYKQAM-YDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS
        ++P TY +A  + V C     AMD E+ +M +   W +   P + KPIGCKW+YK K +    ++ +KARLVAKGYTQ+ GID+ +T S
Subjt:  EDPLTYKQAM-YDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.2e-0939.13Show/hide
Query:  KAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS
        +AM  E++++  N  W LV  P +   +GCKW++K K      +   KARLVAKG+ Q+ GI + +T S
Subjt:  KAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQASKVQTFKARLVAKGYTQKGGIDYEKTLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATATTGGTCAAACACATCCTTCTCAAGAGTTGGGAGAGCCTCGTCGTAGTGGGAGGGTTGTACAACAGCCTGATCACTATTTGGGCTTAAGTGAAGCTCATATCAT
CATACCTGATGATGGGATAGAGGATCCTTTGACCTATAAACAGGCGATGTATGATGTGGACTGTGACCAATTGATCAAAGCCATGGACCTCGAAATGGAATCTATGTATT
CCAATTCTATCTGGACTCTAGTAGATCAACCAAATGATGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTAGTAAAGTACAGACTTTCAAA
GCTCGACTAGTGGCAAAAGGTTATACACAAAAGGGGGGAATAGATTATGAAAAAACTCTCTCCTATTGCCATGATAAAGTCGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATATTGGTCAAACACATCCTTCTCAAGAGTTGGGAGAGCCTCGTCGTAGTGGGAGGGTTGTACAACAGCCTGATCACTATTTGGGCTTAAGTGAAGCTCATATCAT
CATACCTGATGATGGGATAGAGGATCCTTTGACCTATAAACAGGCGATGTATGATGTGGACTGTGACCAATTGATCAAAGCCATGGACCTCGAAATGGAATCTATGTATT
CCAATTCTATCTGGACTCTAGTAGATCAACCAAATGATGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTAGTAAAGTACAGACTTTCAAA
GCTCGACTAGTGGCAAAAGGTTATACACAAAAGGGGGGAATAGATTATGAAAAAACTCTCTCCTATTGCCATGATAAAGTCGATTAG
Protein sequenceShow/hide protein sequence
MNIGQTHPSQELGEPRRSGRVVQQPDHYLGLSEAHIIIPDDGIEDPLTYKQAMYDVDCDQLIKAMDLEMESMYSNSIWTLVDQPNDVKPIGCKWIYKRKRDQASKVQTFK
ARLVAKGYTQKGGIDYEKTLSYCHDKVD