CuGenDBv2

Cmc09g0251341 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0251341
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr09:17405248..17405766
RNA-Seq Expression: Cmc09g0251341
Synteny: Cmc09g0251341
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology
GenBank top hits    e value    % identity    Alignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]    8.2e-77    84.88
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        M R SGRVVSQPN YLGLTET VVIP+ GVED LSY Q   DVDK+QWVK MDLEME MYFNSVWELVDLPEG KP+GCKWIYKR RDSA KVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQ+EGVDYEETFSP+AMLKSIRILLSIA FYDYEIW+MDVK AFLNGNLEESIFMSQPEG ITQ +E
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]    8.2e-77    84.88
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        M R SGRVVSQPN YLGLTET VVIP+ GVED LSY Q   DVDK+QWVK MDLEME MYFNSVWELVDLPEG KP+GCKWIYKR RDSA KVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQ+EGVDYEETFSP+AMLKSIRILLSIA FYDYEIW+MDVK AFLNGNLEESIFMSQPEG ITQ +E
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

KAA0067545.1 gag/pol protein [Cucumis melo var. makuwa]    3.0e-95    100
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]    6.9e-76    84.3
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        M R SGRVVSQPN YLGLTET VVIP+ GVED LSY Q   DVDK+QWVK MDLEME MYFNSVWELVDLPEG KP+GCKWIYKR RDSA KVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQ+EGVDYEETFSP+AMLKSIRILLSIA FYDYEIW+MDVK AFLNGNLEESIFMSQPE  ITQ +E
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]    8.2e-77    84.88
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        M R SGRVVSQPN YLGLTET VVIP+ GVED LSY Q   DVDK+QWVK MDLEME MYFNSVWELVDLPEG KP+GCKWIYKR RDSA KVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQ+EGVDYEETFSP+AMLKSIRILLSIA FYDYEIW+MDVK AFLNGNLEESIFMSQPEG ITQ +E
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

TrEMBL top hits    e value    % identity    Alignment
A0A5A7TZD0 Gag/pol protein    4.0e-77    84.88
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        M R SGRVVSQPN YLGLTET VVIP+ GVED LSY Q   DVDK+QWVK MDLEME MYFNSVWELVDLPEG KP+GCKWIYKR RDSA KVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQ+EGVDYEETFSP+AMLKSIRILLSIA FYDYEIW+MDVK AFLNGNLEESIFMSQPEG ITQ +E
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

A0A5A7UYE8 Gag/pol protein    4.0e-77    84.88
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        M R SGRVVSQPN YLGLTET VVIP+ GVED LSY Q   DVDK+QWVK MDLEME MYFNSVWELVDLPEG KP+GCKWIYKR RDSA KVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQ+EGVDYEETFSP+AMLKSIRILLSIA FYDYEIW+MDVK AFLNGNLEESIFMSQPEG ITQ +E
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

A0A5D3BDY8 Gag/pol protein    1.4e-95    100
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

A0A5D3BUN8 Gag/pol protein    3.4e-76    84.3
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        M R SGRVVSQPN YLGLTET VVIP+ GVED LSY Q   DVDK+QWVK MDLEME MYFNSVWELVDLPEG KP+GCKWIYKR RDSA KVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQ+EGVDYEETFSP+AMLKSIRILLSIA FYDYEIW+MDVK AFLNGNLEESIFMSQPE  ITQ +E
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

A0A5D3CYF4 Gag/pol protein    4.0e-77    84.88
Query:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL
        M R SGRVVSQPN YLGLTET VVIP+ GVED LSY Q   DVDK+QWVK MDLEME MYFNSVWELVDLPEG KP+GCKWIYKR RDSA KVQTFKARL
Subjt:  MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARL

Query:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
        VAKGYTQ+EGVDYEETFSP+AMLKSIRILLSIA FYDYEIW+MDVK AFLNGNLEESIFMSQPEG ITQ +E
Subjt:  VAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE

SwissProt top hits    e value    % identity    Alignment
P04146 Copia protein    1.9e-23    39.02
Query:  DKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKM
        DK+ W + ++ E+     N+ W +   PE +  +  +W++    +       +KARLVA+G+TQ+  +DYEETF+P+A + S R +LS+ I Y+ ++ +M
Subjt:  DKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKM

Query:  DVKIAFLNGNLEESIFMSQPEGI
        DVK AFLNG L+E I+M  P+GI
Subjt:  DVKIAFLNGNLEESIFMSQPEGI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    4.1e-31    49.18
Query:  DKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKM
        +KNQ +K M  EME +  N  ++LV+LP+G++PL CKW++K  +D   K+  +KARLV KG+ Q++G+D++E FSP+  + SIR +LS+A   D E+ ++
Subjt:  DKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKM

Query:  DVKIAFLNGNLEESIFMSQPEG
        DVK AFL+G+LEE I+M QPEG
Subjt:  DVKIAFLNGNLEESIFMSQPEG

P92520 Uncharacterized mitochondrial protein AtMg00820    2.3e-13    39.53
Query:  WVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIA
        W + M  E++ +  N  W LV  P  +  LGCKW++K    S   +   KARLVAKG+ Q+EG+ + ET+SP+    +IR +L++A
Subjt:  WVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    6.0e-22    41.8
Query:  QWVKVMDLEMEFMYFNSVWELVDLPEGEKPL-GCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDV
        +W   M  E+     N  W+LV  P     + GC+WI+ +  +S   +  +KARLVAKGY Q+ G+DY ETFSP+    SIRI+L +A+   + I ++DV
Subjt:  QWVKVMDLEMEFMYFNSVWELVDLPEGEKPL-GCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDV

Query:  KIAFLNGNLEESIFMSQPEGII
          AFL G L + ++MSQP G I
Subjt:  KIAFLNGNLEESIFMSQPEGII

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.0e-21    40.65
Query:  NQWVKVMDLEMEFMYFNSVWELVDLPEGEKPL-GCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMD
        ++W + M  E+     N  W+LV  P     + GC+WI+ +  +S   +  +KARLVAKGY Q+ G+DY ETFSP+    SIRI+L +A+   + I ++D
Subjt:  NQWVKVMDLEMEFMYFNSVWELVDLPEGEKPL-GCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMD

Query:  VKIAFLNGNLEESIFMSQPEGII
        V  AFL G L + ++MSQP G +
Subjt:  VKIAFLNGNLEESIFMSQPEGII

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    3.6e-30    46.61
Query:  WVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKI
        W   MD E+  M     WE+  LP  +KP+GCKW+YK   +S   ++ +KARLVAKGYTQQEG+D+ ETFSP+  L S++++L+I+  Y++ + ++D+  
Subjt:  WVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKI

Query:  AFLNGNLEESIFMSQPEG
        AFLNG+L+E I+M  P G
Subjt:  AFLNGNLEESIFMSQPEG

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    1.6e-14    39.53
Query:  WVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIA
        W + M  E++ +  N  W LV  P  +  LGCKW++K    S   +   KARLVAKG+ Q+EG+ + ET+SP+    +IR +L++A
Subjt:  WVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGYTQQEGVDYEETFSPIAMLKSIRILLSIA


Sequences
CDS sequence
ATGTCTCGATGCAGTGGGAGGGTTGTATCACAACCTAACCATTACTTAGGTTTAACTGAAACTCATGTTGTCATACCAAATGTTGGTGTTGAGGATCTATTGTCC
TATACACAGACAACGGGTGATGTAGACAAGAACCAATGGGTCAAAGTCATGGACCTTGAAATGGAGTTTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTA
CCTGAAGGGGAAAAGCCTTTAGGGTGTAAATGGATCTATAAGAGAAATAGAGATTCAGCTTGGAAGGTACAGACCTTCAAAGCTAGACTTGTAGCAAAAGGGTAT
ACCCAACAGGAAGGGGTTGACTATGAGGAAACCTTTTCCCCTATTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATAGCAATATTTTATGATTATGAAATA
TGGAAAATGGATGTGAAGATTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGATCATAACCCAATGTAAGGAGTAA
mRNA sequence
ATGTCTCGATGCAGTGGGAGGGTTGTATCACAACCTAACCATTACTTAGGTTTAACTGAAACTCATGTTGTCATACCAAATGTTGGTGTTGAGGATCTATTGTCC
TATACACAGACAACGGGTGATGTAGACAAGAACCAATGGGTCAAAGTCATGGACCTTGAAATGGAGTTTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTA
CCTGAAGGGGAAAAGCCTTTAGGGTGTAAATGGATCTATAAGAGAAATAGAGATTCAGCTTGGAAGGTACAGACCTTCAAAGCTAGACTTGTAGCAAAAGGGTAT
ACCCAACAGGAAGGGGTTGACTATGAGGAAACCTTTTCCCCTATTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATAGCAATATTTTATGATTATGAAATA
TGGAAAATGGATGTGAAGATTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGATCATAACCCAATGTAAGGAGTAA
Protein sequence
MSRCSGRVVSQPNHYLGLTETHVVIPNVGVEDLLSYTQTTGDVDKNQWVKVMDLEMEFMYFNSVWELVDLPEGEKPLGCKWIYKRNRDSAWKVQTFKARLVAKGY
TQQEGVDYEETFSPIAMLKSIRILLSIAIFYDYEIWKMDVKIAFLNGNLEESIFMSQPEGIITQCKE
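
The CDS, protein sequence, and genome location in this record can be cross-checked against each other. The Python sketch below is one way to do that: it translates short fragments copied from the start and end of the CDS using the standard genetic code (NCBI translation table 1) and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive and that the standard code applies to this gene; the helper names `translate` and `span_length` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length of a 'seqid:start..end' location (assumed 1-based, inclusive)."""
    _, coords = location.rsplit(":", 1)
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


if __name__ == "__main__":
    print(translate("ATGTCTCGATGCAGTGGGAGG"))  # first 7 codons -> MSRCSGR
    print(translate("CAATGTAAGGAGTAA"))        # last 5 codons  -> QCKE
    print(span_length("CMiso1.1chr09:17405248..17405766"))  # -> 519
```

Under these assumptions the span is 519 bp, which is 3 × 173: enough for 172 codons plus a stop codon, and so consistent with a single-exon gene whose CDS and mRNA sequences are identical.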