; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0050121 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0050121
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr02:16100349..16100807
RNA-Seq ExpressionCmc02g0050121
SyntenyCmc02g0050121
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.2e-6684.87Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        MLK IRILLSIATFY+YEIWQMDVKTAFLN NLEESIYMVQPEGFI + QEQK+CKLQKSIYGLKQASRSWNIRFDT IKSYGFEQNVDEPCVYK+I+ S
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
         VAF +LYVDDILLI ND+ +LTD+KKWL TQFQMKDL  AQY+LGIQIVRN
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-6989.47Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        M+K IRILLSIATFYDYEIWQMDVKTAFLN NLEESIYMVQPEGFIQKGQEQK+CKLQKSIYGLKQASRSWNIRFDT IKSYGFEQNVDEPCVYKRII S
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
        TVAF VLYVDDILLI N++ HLTDIK+WL TQFQMKDL +AQYVLGIQIV+N
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-6582.89Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        MLK IRILLSIATFYDYEIWQMDVKTAFLN+NLEESI+M QPEGFI +GQEQK+CKL +SIYGLKQASRSWNIRFDT IKSYGF+QNVDEPCVYK+I K 
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
         VAF VLYVDDILLI ND+G+LTD+K WLA QFQMKDL  AQYVLGIQI+R+
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-6888.82Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        M+K IRILLSIATFYDYEIWQMDVKT FLN NLEESIYMVQPE FIQKGQEQKICKLQKSIYGLKQASRS NIRFDT IKSYG EQNVDEPCVYKRI+ S
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
        TVAF VLYVDDILLI ND+GHL DIKKWLA QFQMKDL NAQYVLG+QIVRN
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

TYK09500.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-6886.18Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        MLK IRILLSIATFYDYEIWQ+DVKTAFLN NLEESIYM QPEGFI+KGQEQKICK QKSIYGLKQASRSWNIRFDT IKSYGFEQNVD+PCVYK+++ S
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
         VAF VLYVDDILL+ ND+G+LTDIKKWLATQFQMKD  +AQYVLGIQIVRN
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

TrEMBL top hitse value%identityAlignment
A0A5A7TTA2 Gag/pol protein7.1e-7089.47Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        M+K IRILLSIATFYDYEIWQMDVKTAFLN NLEESIYMVQPEGFIQKGQEQK+CKLQKSIYGLKQASRSWNIRFDT IKSYGFEQNVDEPCVYKRII S
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
        TVAF VLYVDDILLI N++ HLTDIK+WL TQFQMKDL +AQYVLGIQIV+N
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

A0A5A7USZ2 Gag/pol protein8.1e-6682.89Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        MLK IRILLSIATFYDYEIWQMDVKTAFLN+NLEESI+M QPEGFI +GQEQK+CKL +SIYGLKQASRSWNIRFDT IKSYGF+QNVDEPCVYK+I K 
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
         VAF VLYVDDILLI ND+G+LTD+K WLA QFQMKDL  AQYVLGIQI+R+
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

A0A5D3BX45 Gag/pol protein1.3e-6888.82Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        M+K IRILLSIATFYDYEIWQMDVKT FLN NLEESIYMVQPE FIQKGQEQKICKLQKSIYGLKQASRS NIRFDT IKSYG EQNVDEPCVYKRI+ S
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
        TVAF VLYVDDILLI ND+GHL DIKKWLA QFQMKDL NAQYVLG+QIVRN
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

A0A5D3CDT9 Gag/pol protein1.3e-6886.18Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        MLK IRILLSIATFYDYEIWQ+DVKTAFLN NLEESIYM QPEGFI+KGQEQKICK QKSIYGLKQASRSWNIRFDT IKSYGFEQNVD+PCVYK+++ S
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
         VAF VLYVDDILL+ ND+G+LTDIKKWLATQFQMKD  +AQYVLGIQIVRN
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

E2GK51 Gag/pol protein (Fragment)5.6e-6784.87Show/hide
Query:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS
        MLK IRILLSIATFY+YEIWQMDVKTAFLN NLEESIYMVQPEGFI + QEQK+CKLQKSIYGLKQASRSWNIRFDT IKSYGFEQNVDEPCVYK+I+ S
Subjt:  MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKS

Query:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
         VAF +LYVDDILLI ND+ +LTD+KKWL TQFQMKDL  AQY+LGIQIVRN
Subjt:  TVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-2138.1Show/hide
Query:  RILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVY---KRIIKSTV
        R +LS+   Y+ ++ QMDVKTAFLN  L+E IYM  P+G         +CKL K+IYGLKQA+R W   F+  +K   F  +  + C+Y   K  I   +
Subjt:  RILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVY---KRIIKSTV

Query:  AFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQI
         + +LYVDD+++   D+  + + K++L  +F+M DL   ++ +GI+I
Subjt:  AFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.7e-3348.65Show/hide
Query:  IRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVY-KRIIKSTVA
        IR +LS+A   D E+ Q+DVKTAFL+ +LEE IYM QPEGF   G++  +CKL KS+YGLKQA R W ++FD+ +KS  + +   +PCVY KR  ++   
Subjt:  IRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVY-KRIIKSTVA

Query:  FSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVR
          +LYVDD+L++  D G +  +K  L+  F MKDL  AQ +LG++IVR
Subjt:  FSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVR

P25600 Putative transposon Ty5-1 protein YCL074W1.8e-1434.38Show/hide
Query:  MDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKSTVAFSVLYVDDILLIRNDIGH
        MDV TAFLN  ++E IY+ QP GF+ +     + +L   +YGLKQA   WN   +  +K  GF ++  E  +Y R       +  +YVDD+L+       
Subjt:  MDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKSTVAFSVLYVDDILLIRNDIGH

Query:  LTDIKKWLATQFQMKDLRNAQYVLGIQI
           +K+ L   + MKDL      LG+ I
Subjt:  LTDIKKWLATQFQMKDLRNAQYVLGIQI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-2136.73Show/hide
Query:  IRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKSTVAF
        IRI+L +A    + I Q+DV  AFL   L + +YM QP GFI K +   +CKL+K++YGLKQA R+W +     + + GF  +V +  ++      ++ +
Subjt:  IRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKSTVAF

Query:  SVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVR
         ++YVDDIL+  ND   L +    L+ +F +KD     Y LGI+  R
Subjt:  SVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-2135.37Show/hide
Query:  IRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKSTVAF
        IRI+L +A    + I Q+DV  AFL   L + +YM QP GF+ K +   +C+L+K+IYGLKQA R+W +   T + + GF  ++ +  ++      ++ +
Subjt:  IRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKSTVAF

Query:  SVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVR
         ++YVDDIL+  ND   L      L+ +F +K+  +  Y LGI+  R
Subjt:  SVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.9e-2433.55Show/hide
Query:  LKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQE----QKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRI
        L  ++++L+I+  Y++ + Q+D+  AFLN +L+E IYM  P G+  +  +      +C L+KSIYGLKQASR W ++F   +  +GF Q+  +   + +I
Subjt:  LKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQE----QKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRI

Query:  IKSTVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN
          +     ++YVDDI++  N+   + ++K  L + F+++DL   +Y LG++I R+
Subjt:  IKSTVAFSVLYVDDILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAAGTTGATTAGAATACTCTTATCCATTGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCCTTTTTGAATGAAAATCTTGAGGAGAGTAT
CTATATGGTTCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGATTTGTAAGCTTCAAAAATCCATATATGGATTAAAACAAGCATCTAGATCCTGGAATATAA
GGTTTGATACTCTAATCAAATCTTATGGTTTTGAACAAAATGTTGATGAACCTTGTGTTTACAAAAGGATTATCAAATCCACTGTAGCATTCTCAGTTCTGTATGTAGAT
GACATTCTACTCATTAGGAATGATATAGGTCATCTAACTGATATTAAGAAATGGCTAGCTACGCAATTCCAAATGAAAGATTTGAGAAATGCTCAATACGTTCTTGGTAT
CCAAATAGTTCGGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAAGTTGATTAGAATACTCTTATCCATTGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCCTTTTTGAATGAAAATCTTGAGGAGAGTAT
CTATATGGTTCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGATTTGTAAGCTTCAAAAATCCATATATGGATTAAAACAAGCATCTAGATCCTGGAATATAA
GGTTTGATACTCTAATCAAATCTTATGGTTTTGAACAAAATGTTGATGAACCTTGTGTTTACAAAAGGATTATCAAATCCACTGTAGCATTCTCAGTTCTGTATGTAGAT
GACATTCTACTCATTAGGAATGATATAGGTCATCTAACTGATATTAAGAAATGGCTAGCTACGCAATTCCAAATGAAAGATTTGAGAAATGCTCAATACGTTCTTGGTAT
CCAAATAGTTCGGAACTGA
Protein sequenceShow/hide protein sequence
MLKLIRILLSIATFYDYEIWQMDVKTAFLNENLEESIYMVQPEGFIQKGQEQKICKLQKSIYGLKQASRSWNIRFDTLIKSYGFEQNVDEPCVYKRIIKSTVAFSVLYVD
DILLIRNDIGHLTDIKKWLATQFQMKDLRNAQYVLGIQIVRN