CuGenDBv2

Cmc04g0094981 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0094981
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr04:8113642..8114097
RNA-Seq Expression: Cmc04g0094981
Synteny: Cmc04g0094981
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology
GenBank top hits | e-value | % identity | alignment
KAA0033543.1 gag-pol polyprotein [Cucumis melo var. makuwa] | e-value: 4.2e-69 | identity: 89.4%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDY KMVADLCY STIEPST DSA K+EYWLNAMQE+LLQFRRNNVWTLVSKPEGVNVIG KWVFKNKTDE  CVTKNKARLVAQGY QV+G
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I F+ETFAPVA+LEAIRLLLGISCIQK KLYQMDVKSAFLNGYLNE VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa] | e-value: 3.5e-71 | identity: 90.73%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDYMKMVADLCYIST+EPST DSA +DEYWLNAMQEELLQFR+NNVWTLVSKPEGVNVIG KWVFKNKTDE GCVTKNKA+LVAQGYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I FDETFA VA+LEAIRLLLGISCIQK KLYQMDVKSAFL+GYLNE VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

KAA0066255.1 gag-pol polyprotein [Cucumis melo var. makuwa] | e-value: 3.2e-69 | identity: 89.4%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDYM MVADLCYIST EPST D A +DEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIG KWVFKNKTDE GCV K KARLVAQGYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I FDETFAPVA++EAIRLLLGISCIQK KLYQMDVKSAFLNGYLN  VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

TYJ98791.1 gag-pol polyprotein [Cucumis melo var. makuwa] | e-value: 3.2e-69 | identity: 89.4%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDYMKMVADLCYIST EPST D + +DEY LNAMQEELLQF+RNNVWTLV KPEGVNVIG KWVFKNKTDE GCVTKNKARLVAQGYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I FDETF+PVA+LEAIRLLLGISCIQK KLYQMDVKSAFLNGYLNE VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

TYK13607.1 gag-pol polyprotein [Cucumis melo var. makuwa] | e-value: 1.6e-68 | identity: 90.07%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDYMKMV DLCY STIEPST DSA KDEYWLNAMQEELLQFR+NNVWTLVSKPE VNVIG K VFKNKT EVGCVTKNKARLVAQGYTQV+G
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I FDETFAPVA LEAIRLLLGISCIQK KLYQMDVKS FLNGYLNE VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

TrEMBL top hits | e-value | % identity | alignment
A0A5D3B9N4 Gag-pol polyprotein | e-value: 2.0e-69 | identity: 89.4%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDY KMVADLCY STIEPST DSA K+EYWLNAMQE+LLQFRRNNVWTLVSKPEGVNVIG KWVFKNKTDE  CVTKNKARLVAQGY QV+G
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I F+ETFAPVA+LEAIRLLLGISCIQK KLYQMDVKSAFLNGYLNE VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

A0A5D3BEV2 Gag-pol polyprotein | e-value: 1.6e-69 | identity: 89.4%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDYM MVADLCYIST EPST D A +DEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIG KWVFKNKTDE GCV K KARLVAQGYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I FDETFAPVA++EAIRLLLGISCIQK KLYQMDVKSAFLNGYLN  VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

A0A5D3BJA9 Gag-pol polyprotein | e-value: 1.6e-69 | identity: 89.4%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDYMKMVADLCYIST EPST D + +DEY LNAMQEELLQF+RNNVWTLV KPEGVNVIG KWVFKNKTDE GCVTKNKARLVAQGYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I FDETF+PVA+LEAIRLLLGISCIQK KLYQMDVKSAFLNGYLNE VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

A0A5D3CNZ6 Gag-pol polyprotein | e-value: 7.8e-69 | identity: 90.07%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDYMKMV DLCY STIEPST DSA KDEYWLNAMQEELLQFR+NNVWTLVSKPE VNVIG K VFKNKT EVGCVTKNKARLVAQGYTQV+G
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I FDETFAPVA LEAIRLLLGISCIQK KLYQMDVKS FLNGYLNE VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

A0A5D3CXU0 Gag-pol polyprotein | e-value: 1.7e-71 | identity: 90.73%
Query:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG
        MQTRRKEKIDYMKMVADLCYIST+EPST DSA +DEYWLNAMQEELLQFR+NNVWTLVSKPEGVNVIG KWVFKNKTDE GCVTKNKA+LVAQGYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEG

Query:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
        I FDETFA VA+LEAIRLLLGISCIQK KLYQMDVKSAFL+GYLNE VYVA
Subjt:  IYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA

SwissProt top hits | e-value | % identity | alignment
P04146 Copia protein | e-value: 1.1e-22 | identity: 45.13%
Query:  WLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKS
        W  A+  EL   + NN WT+  +PE  N++  +WVF  K +E+G   + KARLVA+G+TQ   I ++ETFAPVA++ + R +L +     LK++QMDVK+
Subjt:  WLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIRLLLGISCIQKLKLYQMDVKS

Query:  AFLNGYLNEVVYV
        AFLNG L E +Y+
Subjt:  AFLNGYLNEVVYV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 2.4e-19 | identity: 40.83%
Query:  SAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIRLLLGISCIQKLKL
        S P+    + AMQEE+   ++N  + LV  P+G   +  KWVFK K D    + + KARLV +G+ Q +GI FDE F+PV ++ +IR +L ++    L++
Subjt:  SAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIRLLLGISCIQKLKL

Query:  YQMDVKSAFLNGYLNEVVYV
         Q+DVK+AFL+G L E +Y+
Subjt:  YQMDVKSAFLNGYLNEVVYV

P92520 Uncharacterized mitochondrial protein AtMg00820 | e-value: 2.9e-20 | identity: 43.28%
Query:  MQTRRKEKIDYMKMVADLCYISTI--EPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQV
        M TR K  I+ +     L   +TI  EP +   A KD  W  AMQEEL    RN  W LV  P   N++G KWVFK K    G + + KARLVA+G+ Q 
Subjt:  MQTRRKEKIDYMKMVADLCYISTI--EPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQV

Query:  EGIYFDETFAPVAQLEAIRLLLGISCIQKLKLYQ
        EGIYF ET++PV +   IR +L ++  Q+L++ Q
Subjt:  EGIYFDETFAPVAQLEAIRLLLGISCIQKLKLYQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 9.9e-21 | identity: 40.62%
Query:  EPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEG-VNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIRLLLGIS
        EP T   A KDE W NAM  E+     N+ W LV  P   V ++G +W+F  K +  G + + KARLVA+GY Q  G+ + ETF+PV +  +IR++LG++
Subjt:  EPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEG-VNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIRLLLGIS

Query:  CIQKLKLYQMDVKSAFLNGYLNEVVYVA
          +   + Q+DV +AFL G L + VY++
Subjt:  CIQKLKLYQMDVKSAFLNGYLNEVVYVA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 6.4e-20 | identity: 39.06%
Query:  EPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLV-SKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIRLLLGIS
        EP T   A KD+ W  AM  E+     N+ W LV   P  V ++G +W+F  K +  G + + KARLVA+GY Q  G+ + ETF+PV +  +IR++LG++
Subjt:  EPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLV-SKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIRLLLGIS

Query:  CIQKLKLYQMDVKSAFLNGYLNEVVYVA
          +   + Q+DV +AFL G L + VY++
Subjt:  CIQKLKLYQMDVKSAFLNGYLNEVVYVA

Arabidopsis top hits | e-value | % identity | alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e-value: 4.7e-26 | identity: 42.11%
Query:  LCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIR
        +C     EPST + A +   W  AM +E+      + W + + P     IG KWV+K K +  G + + KARLVA+GYTQ EGI F ETF+PV +L +++
Subjt:  LCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPVAQLEAIR

Query:  LLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYV
        L+L IS I    L+Q+D+ +AFLNG L+E +Y+
Subjt:  LLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e-value: 2.0e-21 | identity: 43.28%
Query:  MQTRRKEKIDYMKMVADLCYISTI--EPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQV
        M TR K  I+ +     L   +TI  EP +   A KD  W  AMQEEL    RN  W LV  P   N++G KWVFK K    G + + KARLVA+G+ Q 
Subjt:  MQTRRKEKIDYMKMVADLCYISTI--EPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQV

Query:  EGIYFDETFAPVAQLEAIRLLLGISCIQKLKLYQ
        EGIYF ET++PV +   IR +L ++  Q+L++ Q
Subjt:  EGIYFDETFAPVAQLEAIRLLLGISCIQKLKLYQ


Sequences
CDS sequence
ATGCAGACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGCTGATTTATGTTATATTTCCACCATTGAACCTTCTACTACTGACTCTGCTCCCAAAGATGAGTA
TTGGCTAAATGCTATGCAAGAGGAGCTACTCCAATTTAGACGAAACAATGTCTGGACATTAGTTTCAAAGCCAGAAGGTGTAAACGTTATTGGCATTAAATGGGTGTTTA
AAAATAAGACTGATGAAGTTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTATTTACTTTGATGAAACGTTTGCTCCTGTT
GCTCAACTTGAAGCCATTCGACTATTACTTGGTATATCATGCATACAGAAACTTAAATTGTATCAGATGGATGTAAAGAGTGCCTTCTTGAATGGTTATTTGAATGAGGT
GGTTTATGTTGCTTAA
mRNA sequence
ATGCAGACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGCTGATTTATGTTATATTTCCACCATTGAACCTTCTACTACTGACTCTGCTCCCAAAGATGAGTA
TTGGCTAAATGCTATGCAAGAGGAGCTACTCCAATTTAGACGAAACAATGTCTGGACATTAGTTTCAAAGCCAGAAGGTGTAAACGTTATTGGCATTAAATGGGTGTTTA
AAAATAAGACTGATGAAGTTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTATTTACTTTGATGAAACGTTTGCTCCTGTT
GCTCAACTTGAAGCCATTCGACTATTACTTGGTATATCATGCATACAGAAACTTAAATTGTATCAGATGGATGTAAAGAGTGCCTTCTTGAATGGTTATTTGAATGAGGT
GGTTTATGTTGCTTAA
Protein sequence
MQTRRKEKIDYMKMVADLCYISTIEPSTTDSAPKDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGIKWVFKNKTDEVGCVTKNKARLVAQGYTQVEGIYFDETFAPV
AQLEAIRLLLGISCIQKLKLYQMDVKSAFLNGYLNEVVYVA
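
As a consistency check on this record, the genome location CMiso1.1chr04:8113642..8114097 can be compared with the lengths of the CDS and protein sequences listed above. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive (an assumption, not stated in the record), and the helper names `parse_location` and `span_length` are hypothetical, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(location: str) -> int:
    """Length of the span in bases, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    location = "CMiso1.1chr04:8113642..8114097"
    length = span_length(location)
    codons = length // 3
    # One codon is the stop codon, so the protein has one residue fewer.
    print(length, codons, codons - 1)
```

Under that assumption the span is 456 bases, which equals the length of the CDS shown above (456 nt, ending in the stop codon TAA) and corresponds to 152 codons, that is, the 151-residue protein plus the stop. A genomic span equal to the CDS length is consistent with a gene model without introns, although the record itself does not say so.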