CuGenDBv2

Cmc03g0068571 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0068571
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr03:12460128..12460664
RNA-Seq Expression: Cmc03g0068571
Synteny: Cmc03g0068571
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
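
The genome location above can be turned into a span length with a few lines of Python. This is only a sketch: the helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr03:12460128..12460664")
length = span_length(start, end)
print(seqid, length, length // 3)  # CMiso1.1chr03 537 179
```

Under that assumption the span is 537 nucleotides, or 179 codons, which is consistent with a 178-residue protein followed by a stop codon.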


Homology
GenBank top hits | e value | % identity
KAA0033149.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 8.1e-96 | 100
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

KAA0035460.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 3.4e-78 | 83.15
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANV YTS++E  TV+ A+TDEHWILA+QEELLQFERNQVW LVPKPPH NIIGT WIFKNK DEQGR+IRNK RLVAQ YSQI+GLDFGETFA   RL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        EAIR +LS+ CFR FKLFQMDVKS FLNGYLSEEVYVAQPKGFVD VHHD VYKLRKALYGLKQA RAWYERLSTYL+
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 1.4e-79 | 86.52
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANVCYTS LE  TVS A++DEHWIL +QEELLQFERNQVWELVPKPP+ANIIGT WIFKNK DE+GRVIRNKARLVAQ YSQIEGLDFGETFA V RL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        EAIR LLSYACF RFKLFQMDVKS FLNGYL EEVYVAQPKGFVD VH D VYKLRKALY LKQA RAWYERLSTYLL
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

KAA0059939.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 7.4e-73 | 84.52
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANVCYTSSLE   VS  ++DEHWIL MQEELLQFERNQ+WELVPKPP+ANIIGT WIFKNK DE+GRVIRNKARLVAQRY QIEGLDFGETF  V RL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRA
        E IR LLSYA FRRFKLFQMDVKS FLNGYLSEEVYVAQPKGFVD VH D VYKL+KALYGLKQA RA
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRA

TYK31065.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 3.4e-78 | 83.15
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANV YTS++E  TV+ A+TDEHWILA+QEELLQFERNQVW LVPKPPH NIIGT WIFKNK DEQGR+IRNK RLVAQ YSQI+GLDFGETFA   RL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        EAIR +LS+ CFR FKLFQMDVKS FLNGYLSEEVYVAQPKGFVD VHHD VYKLRKALYGLKQA RAWYERLSTYL+
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

TrEMBL top hits | e value | % identity
A0A5A7SWP6 Gag-pol polyprotein | 1.7e-78 | 83.15
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANV YTS++E  TV+ A+TDEHWILA+QEELLQFERNQVW LVPKPPH NIIGT WIFKNK DEQGR+IRNK RLVAQ YSQI+GLDFGETFA   RL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        EAIR +LS+ CFR FKLFQMDVKS FLNGYLSEEVYVAQPKGFVD VHHD VYKLRKALYGLKQA RAWYERLSTYL+
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

A0A5A7V046 Gag-pol polyprotein | 6.8e-80 | 86.52
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANVCYTS LE  TVS A++DEHWIL +QEELLQFERNQVWELVPKPP+ANIIGT WIFKNK DE+GRVIRNKARLVAQ YSQIEGLDFGETFA V RL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        EAIR LLSYACF RFKLFQMDVKS FLNGYL EEVYVAQPKGFVD VH D VYKLRKALY LKQA RAWYERLSTYLL
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

A0A5A7V293 Gag-pol polyprotein | 3.6e-73 | 84.52
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANVCYTSSLE   VS  ++DEHWIL MQEELLQFERNQ+WELVPKPP+ANIIGT WIFKNK DE+GRVIRNKARLVAQRY QIEGLDFGETF  V RL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRA
        E IR LLSYA FRRFKLFQMDVKS FLNGYLSEEVYVAQPKGFVD VH D VYKL+KALYGLKQA RA
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRA

A0A5D3CZY5 Gag-pol polyprotein | 3.9e-96 | 100
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

A0A5D3E6X3 Gag-pol polyprotein | 1.7e-78 | 83.15
Query:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL
        MVANV YTS++E  TV+ A+TDEHWILA+QEELLQFERNQVW LVPKPPH NIIGT WIFKNK DEQGR+IRNK RLVAQ YSQI+GLDFGETFA   RL
Subjt:  MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRL

Query:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        EAIR +LS+ CFR FKLFQMDVKS FLNGYLSEEVYVAQPKGFVD VHHD VYKLRKALYGLKQA RAWYERLSTYL+
Subjt:  EAIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

SwissProt top hits | e value | % identity
P04146 Copia protein | 2.2e-27 | 43.54
Query:  WILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIRPLLSYACFRRFKLFQMDVKS
        W  A+  EL   + N  W +  +P + NI+ + W+F  K +E G  IR KARLVA+ ++Q   +D+ ETFA V R+ + R +LS       K+ QMDVK+
Subjt:  WILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIRPLLSYACFRRFKLFQMDVKS

Query:  TFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYE
         FLNG L EE+Y+  P+G     + D V KL KA+YGLKQA R W+E
Subjt:  TFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.1e-26 | 40.67
Query:  AMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIRPLLSYACFRRFKLFQMDVKSTFL
        AMQEE+   ++N  ++LV  P     +   W+FK K D   +++R KARLV + + Q +G+DF E F+ V ++ +IR +LS A     ++ Q+DVK+ FL
Subjt:  AMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIRPLLSYACFRRFKLFQMDVKSTFL

Query:  NGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYL
        +G L EE+Y+ QP+GF        V KL K+LYGLKQA R WY +  +++
Subjt:  NGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYL

P92520 Uncharacterized mitochondrial protein AtMg00820 | 6.6e-16 | 44.66
Query:  TSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIRPLL
        T   E  +V  A+ D  W  AMQEEL    RN+ W LVP P + NI+G  W+FK K    G + R KARLVA+ + Q EG+ F ET++ V R   IR +L
Subjt:  TSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIRPLL

Query:  SYA
        + A
Subjt:  SYA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.9e-34 | 46.29
Query:  VCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPP-HANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAI
        V   +  E  T   A+ DE W  AM  E+     N  W+LVP PP H  I+G  WIF  K +  G + R KARLVA+ Y+Q  GLD+ ETF+ V +  +I
Subjt:  VCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPP-HANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAI

Query:  RPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        R +L  A  R + + Q+DV + FL G L+++VY++QP GF+D    + V KLRKALYGLKQA RAWY  L  YLL
Subjt:  RPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.4e-34 | 46.89
Query:  YTSSL----ESITVSVAVTDEHWILAMQEELLQFERNQVWELV-PKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLE
        Y +SL    E  T   A+ D+ W  AM  E+     N  W+LV P PP   I+G  WIF  K +  G + R KARLVA+ Y+Q  GLD+ ETF+ V +  
Subjt:  YTSSL----ESITVSVAVTDEHWILAMQEELLQFERNQVWELV-PKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLE

Query:  AIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
        +IR +L  A  R + + Q+DV + FL G L++EVY++QP GFVD    D V +LRKA+YGLKQA RAWY  L TYLL
Subjt:  AIRPLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

Arabidopsis top hits | e value | % identity
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 2.4e-29 | 37.64
Query:  VCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIR
        VC   + E  T + A     W  AM +E+   E    WE+   PP+   IG  W++K K +  G + R KARLVA+ Y+Q EG+DF ETF+ V +L +++
Subjt:  VCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIR

Query:  PLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFV----DLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
         +L+ +    F L Q+D+ + FLNG L EE+Y+  P G+     D +  + V  L+K++YGLKQA R W+ + S  L+
Subjt:  PLLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFV----DLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 4.7e-17 | 44.66
Query:  TSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIRPLL
        T   E  +V  A+ D  W  AMQEEL    RN+ W LVP P + NI+G  W+FK K    G + R KARLVA+ + Q EG+ F ET++ V R   IR +L
Subjt:  TSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIRPLL

Query:  SYA
        + A
Subjt:  SYA


Sequences
CDS sequence
ATGGTTGCCAATGTATGCTACACGTCTTCCCTAGAATCTATCACTGTCTCAGTAGCGGTTACCGATGAACACTGGATTTTGGCTATGCAGGAAGAACTACTACAG
TTTGAAAGAAACCAAGTATGGGAACTAGTGCCAAAGCCACCTCATGCTAACATAATTGGTACCAACTGGATCTTTAAGAACAAAGCAGATGAACAAGGAAGAGTT
ATCCGTAATAAAGCAAGATTGGTTGCTCAAAGGTATTCTCAAATAGAAGGTCTGGATTTTGGAGAAACCTTTGCCTCGGTACCCAGACTAGAAGCCATCCGACCA
CTACTAAGCTATGCATGTTTCCGGAGGTTCAAACTATTTCAAATGGATGTAAAGAGTACGTTTCTAAATGGGTACTTATCCGAGGAAGTGTATGTGGCCCAACCA
AAAGGATTTGTTGATCTGGTGCATCATGATCAAGTTTACAAGCTTCGAAAGGCACTCTATGGACTCAAACAAGCTTTTAGAGCATGGTATGAGAGACTCTCCACT
TATCTGTTATAA
mRNA sequence
ATGGTTGCCAATGTATGCTACACGTCTTCCCTAGAATCTATCACTGTCTCAGTAGCGGTTACCGATGAACACTGGATTTTGGCTATGCAGGAAGAACTACTACAG
TTTGAAAGAAACCAAGTATGGGAACTAGTGCCAAAGCCACCTCATGCTAACATAATTGGTACCAACTGGATCTTTAAGAACAAAGCAGATGAACAAGGAAGAGTT
ATCCGTAATAAAGCAAGATTGGTTGCTCAAAGGTATTCTCAAATAGAAGGTCTGGATTTTGGAGAAACCTTTGCCTCGGTACCCAGACTAGAAGCCATCCGACCA
CTACTAAGCTATGCATGTTTCCGGAGGTTCAAACTATTTCAAATGGATGTAAAGAGTACGTTTCTAAATGGGTACTTATCCGAGGAAGTGTATGTGGCCCAACCA
AAAGGATTTGTTGATCTGGTGCATCATGATCAAGTTTACAAGCTTCGAAAGGCACTCTATGGACTCAAACAAGCTTTTAGAGCATGGTATGAGAGACTCTCCACT
TATCTGTTATAA
Protein sequence
MVANVCYTSSLESITVSVAVTDEHWILAMQEELLQFERNQVWELVPKPPHANIIGTNWIFKNKADEQGRVIRNKARLVAQRYSQIEGLDFGETFASVPRLEAIRP
LLSYACFRRFKLFQMDVKSTFLNGYLSEEVYVAQPKGFVDLVHHDQVYKLRKALYGLKQAFRAWYERLSTYLL
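
The relationship between the CDS and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative, and the two inputs are short fragments copied from the start and the end of the CDS above.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGTTGCCAATGTATGC"))  # MVANVC (first six codons)
print(translate("TATCTGTTATAA"))        # YLL (last three codons, then stop)
```

Both fragments agree with the first and last residues of the protein sequence shown above. Joining the CDS lines into one string before calling `translate` would test the full sequence in the same way.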