; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0094851 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0094851
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr04:7868027..7868443
RNA-Seq ExpressionCmc04g0094851
SyntenyCmc04g0094851
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037650.1 gag-pol polyprotein [Cucumis melo var. makuwa]9.2e-6389.13Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEELLQFK NNVWT+VPKPDGAN+I TKWIFKNKTDES S+IRNK  L+AQGYAQVEGVDFDETFA VARLEAIRLLLSISC RKFKLF++DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEVYVAQPK FVD EFPQYVYKLNKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

KAA0042206.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.0e-6188.41Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEELLQFK NN+WTLVPKPD ANII TKWIFKNKTDESESVIRN+  LVAQGYAQV+GVDF++TFAPVARLEAIRLLLSISC RKFKLFQ+DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEVYVAQ K+FVD EFPQYVYK NKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

KAA0051798.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.1e-6187.68Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEE LQFK NNVWTLVPKPDGANII TKWIFKNKTDES S+IRNK  LVAQGY QVEGVD DETFAPVARLEAIRLLLSISC +KFKLFQ+DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEV VA+PK F+D EFPQYVYKLNKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

TYJ98295.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.1e-6692.03Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEELLQFKHNNVWTLVPKPDGANII TKWIFKNKTDES SV+RNK CLVAQGYAQVEGVDFDETFAPVARLEAIRLLL ISC RKFKLFQ+DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEVYVAQPK F+D EFPQYVYK+NKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.1e-6187.68Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEE LQFK NNVWTLVPKPDGANII TKWIFKNKTDES S+IRNK  LVAQGY QVEGVD DETFAPVARLEAIRLLLSISC +KFKLFQ+DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEV VA+PK F+D EFPQYVYKLNKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

TrEMBL top hitse value%identityAlignment
A0A5A7T2Q0 Gag-pol polyprotein4.5e-6389.13Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEELLQFK NNVWT+VPKPDGAN+I TKWIFKNKTDES S+IRNK  L+AQGYAQVEGVDFDETFA VARLEAIRLLLSISC RKFKLF++DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEVYVAQPK FVD EFPQYVYKLNKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

A0A5A7U931 Gag-pol polyprotein2.4e-6187.68Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEE LQFK NNVWTLVPKPDGANII TKWIFKNKTDES S+IRNK  LVAQGY QVEGVD DETFAPVARLEAIRLLLSISC +KFKLFQ+DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEV VA+PK F+D EFPQYVYKLNKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

A0A5D3BIP9 Gag-pol polyprotein1.5e-6692.03Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEELLQFKHNNVWTLVPKPDGANII TKWIFKNKTDES SV+RNK CLVAQGYAQVEGVDFDETFAPVARLEAIRLLL ISC RKFKLFQ+DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEVYVAQPK F+D EFPQYVYK+NKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

A0A5D3DCZ8 Gag-pol polyprotein2.4e-6187.68Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEE LQFK NNVWTLVPKPDGANII TKWIFKNKTDES S+IRNK  LVAQGY QVEGVD DETFAPVARLEAIRLLLSISC +KFKLFQ+DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEV VA+PK F+D EFPQYVYKLNKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

A0A5D3DSN1 Gag-pol polyprotein1.4e-6188.41Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEELLQFK NN+WTLVPKPD ANII TKWIFKNKTDESESVIRN+  LVAQGYAQV+GVDF++TFAPVARLEAIRLLLSISC RKFKLFQ+DVKSAFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        GYLNEEVYVAQ K+FVD EFPQYVYK NKALYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.2e-2645.52Show/hide
Query:  ELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLNGYL
        EL   K NN WT+  +P+  NI++++W+F  K +E  + IR K  LVA+G+ Q   +D++ETFAPVAR+ + R +LS+      K+ Q+DVK+AFLNG L
Subjt:  ELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLNGYL

Query:  NEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQA
         EE+Y+  P+          V KLNKA+YGLKQA
Subjt:  NEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.5e-2744.2Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        MQEE+   + N  + LV  P G   ++ KW+FK K D    ++R K  LV +G+ Q +G+DFDE F+PV ++ +IR +LS++     ++ Q+DVK+AFL+
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP
        G L EE+Y+ QP+ F        V KLNK+LYGLKQAP
Subjt:  GYLNEEVYVAQPKKFVDYEFPQYVYKLNKALYGLKQAP

P92520 Uncharacterized mitochondrial protein AtMg008209.1e-1343.9Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSIS
        MQEEL     N  W LVP P   NI+  KW+FK K     ++ R K  LVA+G+ Q EG+ F ET++PV R   IR +L+++
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-2646.51Show/hide
Query:  NNVWTLVPKPDG-ANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLNGYLNEEVYV
        N+ W LVP P     I+  +WIF  K +   S+ R K  LVA+GY Q  G+D+ ETF+PV +  +IR++L ++  R + + Q+DV +AFL G L ++VY+
Subjt:  NNVWTLVPKPDG-ANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLNGYLNEEVYV

Query:  AQPKKFVDYEFPQYVYKLNKALYGLKQAP
        +QP  F+D + P YV KL KALYGLKQAP
Subjt:  AQPKKFVDYEFPQYVYKLNKALYGLKQAP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-2646.51Show/hide
Query:  NNVWTLV-PKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLNGYLNEEVYV
        N+ W LV P P    I+  +WIF  K +   S+ R K  LVA+GY Q  G+D+ ETF+PV +  +IR++L ++  R + + Q+DV +AFL G L +EVY+
Subjt:  NNVWTLV-PKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLNGYLNEEVYV

Query:  AQPKKFVDYEFPQYVYKLNKALYGLKQAP
        +QP  FVD + P YV +L KA+YGLKQAP
Subjt:  AQPKKFVDYEFPQYVYKLNKALYGLKQAP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.1e-2439.72Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN
        M +E+   +  + W +   P     I  KW++K K +   ++ R K  LVA+GY Q EG+DF ETF+PV +L +++L+L+IS +  F L Q+D+ +AFLN
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLN

Query:  GYLNEEVYVAQPKKFV----DYEFPQYVYKLNKALYGLKQA
        G L+EE+Y+  P  +     D   P  V  L K++YGLKQA
Subjt:  GYLNEEVYVAQPKKFV----DYEFPQYVYKLNKALYGLKQA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.4e-1443.9Show/hide
Query:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSIS
        MQEEL     N  W LVP P   NI+  KW+FK K     ++ R K  LVA+G+ Q EG+ F ET++PV R   IR +L+++
Subjt:  MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGAAGAGTTACTACAGTTCAAGCATAACAACGTTTGGACTTTGGTTCCTAAACCTGATGGGGCAAACATCATAGAAACTAAGTGGATCTTTAAAAATAAAACTGA
TGAATCTGAGAGTGTAATAAGGAACAAGACCTGTTTGGTGGCTCAAGGTTATGCACAGGTAGAAGGTGTTGATTTTGATGAAACGTTTGCACCTGTGGCTAGACTTGAAG
CTATTCGCCTCTTGCTCAGTATATCATGTTTACGAAAATTTAAATTGTTTCAAATTGACGTTAAAAGTGCCTTCCTGAATGGATACTTAAATGAGGAAGTCTATGTAGCA
CAACCTAAAAAGTTTGTTGATTATGAATTTCCTCAGTATGTCTACAAGCTTAATAAAGCTCTATATGGGTTAAAGCAAGCTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGAAGAGTTACTACAGTTCAAGCATAACAACGTTTGGACTTTGGTTCCTAAACCTGATGGGGCAAACATCATAGAAACTAAGTGGATCTTTAAAAATAAAACTGA
TGAATCTGAGAGTGTAATAAGGAACAAGACCTGTTTGGTGGCTCAAGGTTATGCACAGGTAGAAGGTGTTGATTTTGATGAAACGTTTGCACCTGTGGCTAGACTTGAAG
CTATTCGCCTCTTGCTCAGTATATCATGTTTACGAAAATTTAAATTGTTTCAAATTGACGTTAAAAGTGCCTTCCTGAATGGATACTTAAATGAGGAAGTCTATGTAGCA
CAACCTAAAAAGTTTGTTGATTATGAATTTCCTCAGTATGTCTACAAGCTTAATAAAGCTCTATATGGGTTAAAGCAAGCTCCTTAG
Protein sequenceShow/hide protein sequence
MQEELLQFKHNNVWTLVPKPDGANIIETKWIFKNKTDESESVIRNKTCLVAQGYAQVEGVDFDETFAPVARLEAIRLLLSISCLRKFKLFQIDVKSAFLNGYLNEEVYVA
QPKKFVDYEFPQYVYKLNKALYGLKQAP