CuGenDBv2

Cmc11g0302301 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc11g0302301
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr11:23150606..23151076
RNA-Seq Expression: Cmc11g0302301
Synteny: Cmc11g0302301
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
                  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits    e value    % identity    Alignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]    1.1e-72    90.38
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY FDQNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GEAQYVL IQIIRDRKNK LA SQATYIDK+LVRY M NSKK LLPFRHGVHLSK+
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa]    1.4e-72    90.38
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY F+QNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GEAQYVL IQIIRDRKNK LA SQATYIDKMLVRY M NSKK LLPFRHGVHLSK+
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]    3.2e-72    89.74
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY FDQNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GE QYVL IQIIRDRKNK LA SQATYIDK+LVRY M NSKK LLPFRHGVHLSK+
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]    1.1e-72    90.38
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY FDQNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GEAQYVL IQIIRDRKNK LA SQATYIDK+LVRY M NSKK LLPFRHGVHLSK+
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]    2.5e-72    89.74
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY FDQNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GEAQYVL IQIIRDRKNK LA SQATYIDKMLVRYLM NSKK LLPF+HG HLS++
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

TrEMBL top hits    e value    % identity    Alignment
A0A5A7T2V9 Gag/pol protein    1.6e-72    89.74
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY FDQNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GE QYVL IQIIRDRKNK LA SQATYIDK+LVRY M NSKK LLPFRHGVHLSK+
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

A0A5A7TZD0 Gag/pol protein    5.4e-73    90.38
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY FDQNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GEAQYVL IQIIRDRKNK LA SQATYIDK+LVRY M NSKK LLPFRHGVHLSK+
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

A0A5A7UYE8 Gag/pol protein    5.4e-73    90.38
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY FDQNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GEAQYVL IQIIRDRKNK LA SQATYIDK+LVRY M NSKK LLPFRHGVHLSK+
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

A0A5D3CYF4 Gag/pol protein    1.2e-72    89.74
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY FDQNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GEAQYVL IQIIRDRKNK LA SQATYIDKMLVRYLM NSKK LLPF+HG HLS++
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

A0A5D3CZY3 Gag/pol protein    7.0e-73    90.38
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQPEGFITQGQEQKVCKLNRSIYGLKQASR WNIRFD AIKSY F+QNVDEPCVYKKINKGK  FLVLYVDDILLIGNDVGYLT VK WLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        GEAQYVL IQIIRDRKNK LA SQATYIDKMLVRY M NSKK LLPFRHGVHLSK+
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

SwissProt top hits    e value    % identity    Alignment
P04146 Copia protein    3.8e-15    33.33
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVY--KKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMK
        M  P+G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N  +  +++LYVDD+++   D+  +   K +L  +F+M 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVY--KKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMK

Query:  DLGEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHN
        DL E ++ + I+I  + +   +  SQ+ Y+ K+L ++ M N
Subjt:  DLGEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    1.3e-31    43.95
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVY-KKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKD
        M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS ++ +   +PCVY K+ ++     L+LYVDD+L++G D G +  +K  L+  F MKD
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVY-KKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKD

Query:  LGEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
        LG AQ +L ++I+R+R ++ L  SQ  YI+++L R+ M N+K    P    + LSKK
Subjt:  LGEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK

P25600 Putative transposon Ty5-1 protein YCL074W    1.8e-09    30
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        + QP GF+ +     V +L   +YGLKQA   WN   +N +K   F ++  E  +Y +       ++ +YVDD+L+          VK  L   + MKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDK
        G+    L +  I    N  +  S   YI K
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    3.8e-15    35.48
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKA-TFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKD
        MSQP GFI + +   VCKL +++YGLKQA R W +   N + +  F  +V +  ++  + +GK+  ++++YVDDIL+ GND   L      L+ +F +KD
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKA-TFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKD

Query:  LGEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLS
          E  Y L I+    R    L  SQ  YI  +L R  M  +K    P      LS
Subjt:  LGEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.0e-12    30.82
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL
        MSQP GF+ + +   VC+L ++IYGLKQA R W +     + +  F  ++ +  ++         ++++YVDDIL+ GND   L      L+ +F +K+ 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDL

Query:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLP
         +  Y L I+    R  + L  SQ  Y   +L R  M  +K    P
Subjt:  GEAQYVLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLP

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    1.4e-12    34.19
Query:  MSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQ
        M  P G+   QG       VC L +SIYGLKQASR W ++F   +  + F Q+  +   + KI       +++YVDDI++  N+   +  +K+ L + F+
Subjt:  MSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQ

Query:  MKDLGEAQYVLDIQIIR
        ++DLG  +Y L ++I R
Subjt:  MKDLGEAQYVLDIQIIR

ATMG00810.1 DNA/RNA polymerases superfamily protein    6.2e-05    41.79
Query:  FLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDLGEAQYVLDIQIIRDRKNKMLARSQATYIDKML
        +L+LYVDDILL G+    L ++   L++ F MKDLG   Y L IQI        L  SQ  Y +++L
Subjt:  FLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDLGEAQYVLDIQIIRDRKNKMLARSQATYIDKML
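
The % identity values in the hit tables can be related to the alignments through the middle line of each block, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The following is a minimal Python sketch of that calculation, not the database's own code; the function names `count_identities` and `percent_identity` are hypothetical, and the aligned length is passed in explicitly because leading spaces of a middle line may not survive copying.

```python
def count_identities(midline_segments):
    """Count identical columns across the middle lines of one alignment.

    Only letters count as identities; '+' (conservative substitution)
    and spaces (mismatches or gaps) are ignored.
    """
    return sum(1 for segment in midline_segments for ch in segment if ch.isalpha())


def percent_identity(identical, aligned_length):
    """Return percent identity rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * identical / aligned_length, 2)


if __name__ == "__main__":
    # Toy middle lines (illustrative only, not taken from the record).
    toy = ["AC+ G", "K R"]
    print(count_identities(toy))
    # A 156-column alignment with 141 identical columns.
    print(percent_identity(141, 156))
```

Under these assumptions, a reported value of 90.38 over a 156-column alignment corresponds to 141 identical columns, and 89.74 to 140.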


Sequences
CDS sequence
ATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAGGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGATTGAAACAAGCATCTAGATTTTGGAACATT
AGGTTTGATAATGCAATCAAATCTTACAGTTTTGACCAAAACGTTGATGAACCTTGTGTATACAAGAAAATCAACAAAGGTAAAGCAACTTTCTTAGTACTTTAT
GTGGACGATATCCTCCTAATTGGGAATGATGTGGGATACCTAACTATCGTTAAAACTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTGGGAGAGGCACAATAT
GTTCTTGACATCCAAATCATAAGGGATCGTAAGAACAAAATGCTAGCACGGTCTCAAGCAACCTATATCGACAAAATGTTGGTTCGATATTTGATGCATAACTCT
AAGAAGAGTTTATTACCTTTCAGGCATGGGGTTCACTTGTCTAAGAAATAG
mRNA sequence
ATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAGGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGATTGAAACAAGCATCTAGATTTTGGAACATT
AGGTTTGATAATGCAATCAAATCTTACAGTTTTGACCAAAACGTTGATGAACCTTGTGTATACAAGAAAATCAACAAAGGTAAAGCAACTTTCTTAGTACTTTAT
GTGGACGATATCCTCCTAATTGGGAATGATGTGGGATACCTAACTATCGTTAAAACTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTGGGAGAGGCACAATAT
GTTCTTGACATCCAAATCATAAGGGATCGTAAGAACAAAATGCTAGCACGGTCTCAAGCAACCTATATCGACAAAATGTTGGTTCGATATTTGATGCATAACTCT
AAGAAGAGTTTATTACCTTTCAGGCATGGGGTTCACTTGTCTAAGAAATAG
Protein sequence
MSQPEGFITQGQEQKVCKLNRSIYGLKQASRFWNIRFDNAIKSYSFDQNVDEPCVYKKINKGKATFLVLYVDDILLIGNDVGYLTIVKTWLAAQFQMKDLGEAQY
VLDIQIIRDRKNKMLARSQATYIDKMLVRYLMHNSKKSLLPFRHGVHLSKK
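
The protein sequence above should follow from the CDS by translation with the standard genetic code. The sketch below is one way to check that in Python; the helper name `translate` is hypothetical, and the choice of the standard (nuclear) code is an assumption. It translates codon by codon and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    across several lines can be passed in directly. Unknown codons
    (for example ones containing N) are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 nucleotides of the CDS sequence in this record.
    print(translate("ATGTCTCAGCCCGAGGGG"))
    # Last 9 nucleotides of the CDS, ending in the TAG stop codon.
    print(translate("AAGAAATAG"))
```

Passing the full CDS from this record to `translate` and comparing the result with the protein sequence is a quick consistency check. The CDS is 471 nucleotides long, which matches the span of the genome location (23150606..23151076) and encodes 156 residues plus a stop codon.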