; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0168281 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0168281
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr06:21995766..21996239
RNA-Seq ExpressionCmc06g0168281
SyntenyCmc06g0168281
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]2.3e-5786.82Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        M+KSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYMVQPEGFI + QEQKVCKLQKSIY LKQASRSWNIRFD A+KSY FEQNVDEPCVYK+I+NS
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWL
         VAFL+LYVDDILLIGNDV +L D+KKWL
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWL

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]7.5e-6193.02Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIY LKQASRSWNIRFD  +KSY FEQNVDEPCVYKRIINS
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWL
        TVAFLVLYVDDILLIGN+V HL DIK+WL
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWL

TYJ99632.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-5777.18Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        M+KSIRILLSI TFYDYEIWQMDVKTAFLN NLEESI+M QPEGFI +GQEQKVCKL +SIY LKQASRSWNIRFD A+KSY F+QNVDEPCVYK+I   
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWLAA-IPNERFGKCTICSWCPN
         VAFLVLY++ ILLIGNDVG+L D+K WLAA   NERF +  ICSW PN
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWLAA-IPNERFGKCTICSWCPN

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]3.2e-5991.54Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        MIKSIRILLSIATFYDYEIWQMDVKT FLN NLEESIYMVQPE FIQKGQEQK+CKLQKSIY LKQASRS NIRFD A+KSY  EQNVDEPCVYKRI+NS
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWLA
        TVAFLVLYVDDILLIGNDVGHLADIKKWLA
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWLA

TYK09500.1 gag/pol protein [Cucumis melo var. makuwa]9.2e-5986.15Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        M+KSIRILLSIATFYDYEIWQ+DVKTAFLNGNLEESIYM QPEGFI+KGQEQK+CK QKSIY LKQASRSWNIRFD A+KSY FEQNVD+PCVYK+++NS
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWLA
         VAFLVLYVDDILL+GNDVG+L DIKKWLA
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWLA

TrEMBL top hitse value%identityAlignment
A0A5A7TTA2 Gag/pol protein3.6e-6193.02Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIY LKQASRSWNIRFD  +KSY FEQNVDEPCVYKRIINS
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWL
        TVAFLVLYVDDILLIGN+V HL DIK+WL
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWL

A0A5D3BIR3 Gag/pol protein4.9e-5877.18Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        M+KSIRILLSI TFYDYEIWQMDVKTAFLN NLEESI+M QPEGFI +GQEQKVCKL +SIY LKQASRSWNIRFD A+KSY F+QNVDEPCVYK+I   
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWLAA-IPNERFGKCTICSWCPN
         VAFLVLY++ ILLIGNDVG+L D+K WLAA   NERF +  ICSW PN
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWLAA-IPNERFGKCTICSWCPN

A0A5D3BX45 Gag/pol protein1.5e-5991.54Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        MIKSIRILLSIATFYDYEIWQMDVKT FLN NLEESIYMVQPE FIQKGQEQK+CKLQKSIY LKQASRS NIRFD A+KSY  EQNVDEPCVYKRI+NS
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWLA
        TVAFLVLYVDDILLIGNDVGHLADIKKWLA
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWLA

A0A5D3CDT9 Gag/pol protein4.4e-5986.15Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        M+KSIRILLSIATFYDYEIWQ+DVKTAFLNGNLEESIYM QPEGFI+KGQEQK+CK QKSIY LKQASRSWNIRFD A+KSY FEQNVD+PCVYK+++NS
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWLA
         VAFLVLYVDDILL+GNDVG+L DIKKWLA
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWLA

E2GK51 Gag/pol protein (Fragment)1.1e-5786.82Show/hide
Query:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS
        M+KSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYMVQPEGFI + QEQKVCKLQKSIY LKQASRSWNIRFD A+KSY FEQNVDEPCVYK+I+NS
Subjt:  MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINS

Query:  TVAFLVLYVDDILLIGNDVGHLADIKKWL
         VAFL+LYVDDILLIGNDV +L D+KKWL
Subjt:  TVAFLVLYVDDILLIGNDVGHLADIKKWL

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.3e-1941.22Show/hide
Query:  IKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVY---KRII
        I S R +LS+   Y+ ++ QMDVKTAFLNG L+E IYM  P+G         VCKL K+IY LKQA+R W   F+ A+K   F  +  + C+Y   K  I
Subjt:  IKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVY---KRII

Query:  NSTVAFLVLYVDDILLIGNDVGHLADIKKWL
        N  + +++LYVDD+++   D+  + + K++L
Subjt:  NSTVAFLVLYVDDILLIGNDVGHLADIKKWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-2851.61Show/hide
Query:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVY-KRIINSTV
        SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF   G++  VCKL KS+Y LKQA R W ++FD  +KS ++ +   +PCVY KR   +  
Subjt:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVY-KRIINSTV

Query:  AFLVLYVDDILLIGNDVGHLADIK
          L+LYVDD+L++G D G +A +K
Subjt:  AFLVLYVDDILLIGNDVGHLADIK

P25600 Putative transposon Ty5-1 protein YCL074W1.1e-0936.56Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINSTVAFLVLYVDDILL
        MDV TAFLN  ++E IY+ QP GF+ +     V +L   +Y LKQA   WN   +  +K   F ++  E  +Y R  +    ++ +YVDD+L+
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINSTVAFLVLYVDDILL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-1840Show/hide
Query:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINSTVA
        SIRI+L +A    + I Q+DV  AFL G L + +YM QP GFI K +   VCKL+K++Y LKQA R+W +     + +  F  +V +  ++      ++ 
Subjt:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINSTVA

Query:  FLVLYVDDILLIGND
        ++++YVDDIL+ GND
Subjt:  FLVLYVDDILLIGND

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.8e-1838.26Show/hide
Query:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINSTVA
        SIRI+L +A    + I Q+DV  AFL G L + +YM QP GF+ K +   VC+L+K+IY LKQA R+W +     + +  F  ++ +  ++      ++ 
Subjt:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINSTVA

Query:  FLVLYVDDILLIGND
        ++++YVDDIL+ GND
Subjt:  FLVLYVDDILLIGND

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.4e-1933.33Show/hide
Query:  IKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQE----QKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRI
        + S++++L+I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  +      VC L+KSIY LKQASR W ++F + +  + F Q+  +   + +I
Subjt:  IKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQE----QKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRI

Query:  INSTVAFLVLYVDDILLIGNDVGHLADIKKWL
          +    +++YVDDI++  N+   + ++K  L
Subjt:  INSTVAFLVLYVDDILLIGNDVGHLADIKKWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACCGCCTTTTTGAATGGAAATCTTGAAGAGAGTAT
CTATATGGTCCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAAAAATCCATTTATGAATTAAAACAAGCATCTAGATCCTGGAATATAA
GATTTGATATTGCGGTCAAATCTTATAGCTTTGAACAAAATGTTGATGAACCTTGTGTTTACAAAAGGATCATCAATTCTACTGTAGCATTCTTAGTTCTGTATGTAGAT
GACATTCTACTCATTGGGAATGATGTAGGTCATCTAGCTGATATTAAGAAATGGCTAGCTGCAATTCCAAATGAAAGATTTGGAAAATGCACAATATGTTCTTGGTGTCC
AAATAGTTCGGAACCGAAAAAAACAAAACACTAG
mRNA sequenceShow/hide mRNA sequence
ATGATAAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACCGCCTTTTTGAATGGAAATCTTGAAGAGAGTAT
CTATATGGTCCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAAAAATCCATTTATGAATTAAAACAAGCATCTAGATCCTGGAATATAA
GATTTGATATTGCGGTCAAATCTTATAGCTTTGAACAAAATGTTGATGAACCTTGTGTTTACAAAAGGATCATCAATTCTACTGTAGCATTCTTAGTTCTGTATGTAGAT
GACATTCTACTCATTGGGAATGATGTAGGTCATCTAGCTGATATTAAGAAATGGCTAGCTGCAATTCCAAATGAAAGATTTGGAAAATGCACAATATGTTCTTGGTGTCC
AAATAGTTCGGAACCGAAAAAAACAAAACACTAG
Protein sequenceShow/hide protein sequence
MIKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQKSIYELKQASRSWNIRFDIAVKSYSFEQNVDEPCVYKRIINSTVAFLVLYVD
DILLIGNDVGHLADIKKWLAAIPNERFGKCTICSWCPNSSEPKKTKH