; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0249501 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0249501
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr09:15390899..15391491
RNA-Seq ExpressionCmc09g0249501
SyntenyCmc09g0249501
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032034.1 F5J5.1 [Cucumis melo var. makuwa]4.4e-7881.5Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC
        +GICARYQADPRI+HLEA+KRILKYVHGT+DFGM+YSY+TTPTLVGYCDA+WA S D+CK+    +A+Y+A GSGCTQLIWMKN+L EYGF+QDTI LYC
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC

Query:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT
        DNMSAI ISKNPVQHS+TKHI+IR HFIRELVEDK I+LDHI +NLQLADIFTKPLDA+SFEYLRAGLGVCHT
Subjt:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT

KAA0042297.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.2e-8186.71Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC
        +GICARYQ DPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPT+VGYCDA+WAGSTDD K+    EA+YIAAGSGCTQLIWMKNMLLEYGF+QDT+ LYC
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC

Query:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT
        DNMSAI ISKN VQHS+TKHIDIRHHFIRELVEDKVIK DHI SNLQL DIFTKPLDASSFEYL  GLGVC T
Subjt:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT

KAA0055610.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.2e-8078.68Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGC
        +GICA YQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTP LVGYCDA+WAGS DD KSTSG                         EA+YIAAGSGC
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGC

Query:  TQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT
        TQLIWMKNML EYGF+QDT+ LYCDN SAI ISKNPVQHS+TKHIDIRHHFIRELVEDKVIKLDHI SNLQL DIFTKPLDASSFEYLRAGLGVC T
Subjt:  TQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.4e-8496.27Show/hide
Query:  ADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGI
        ADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCK+    EAKY+AAGSGCTQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGI
Subjt:  ADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGI

Query:  SKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLG
        SKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLG
Subjt:  SKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLG

TYJ97126.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.6e-8085.55Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC
        +GI ARYQA+PRITHLEAVKRILKYVHGTSDFGMMYSYDTTPT+VGYCDA+WAGS DD K+    EA+Y+AAGSGCTQLIWM+NMLLEYGF+QDT+ LYC
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC

Query:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT
        DNMSAI ISKNPVQHS+TKHIDIRHHF+RELVEDKVIK DHI SNLQLADIFTKPLDASSFEYL AGLGVC T
Subjt:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT

TrEMBL top hitse value%identityAlignment
A0A5A7SLH7 F5J5.12.1e-7881.5Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC
        +GICARYQADPRI+HLEA+KRILKYVHGT+DFGM+YSY+TTPTLVGYCDA+WA S D+CK+    +A+Y+A GSGCTQLIWMKN+L EYGF+QDTI LYC
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC

Query:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT
        DNMSAI ISKNPVQHS+TKHI+IR HFIRELVEDK I+LDHI +NLQLADIFTKPLDA+SFEYLRAGLGVCHT
Subjt:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT

A0A5A7TKS7 Gag-pol polyprotein3.5e-8186.71Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC
        +GICARYQ DPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPT+VGYCDA+WAGSTDD K+    EA+YIAAGSGCTQLIWMKNMLLEYGF+QDT+ LYC
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC

Query:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT
        DNMSAI ISKN VQHS+TKHIDIRHHFIRELVEDKVIK DHI SNLQL DIFTKPLDASSFEYL  GLGVC T
Subjt:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT

A0A5D3BDQ5 Gag-pol polyprotein1.7e-8085.55Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC
        +GI ARYQA+PRITHLEAVKRILKYVHGTSDFGMMYSYDTTPT+VGYCDA+WAGS DD K+    EA+Y+AAGSGCTQLIWM+NMLLEYGF+QDT+ LYC
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYC

Query:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT
        DNMSAI ISKNPVQHS+TKHIDIRHHF+RELVEDKVIK DHI SNLQLADIFTKPLDASSFEYL AGLGVC T
Subjt:  DNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT

A0A5D3CXU0 Gag-pol polyprotein1.2e-8496.27Show/hide
Query:  ADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGI
        ADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCK+    EAKY+AAGSGCTQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGI
Subjt:  ADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGI

Query:  SKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLG
        SKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLG
Subjt:  SKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLG

A0A5D3DLL1 Gag-pol polyprotein5.9e-8178.68Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGC
        +GICA YQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTP LVGYCDA+WAGS DD KSTSG                         EA+YIAAGSGC
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGC

Query:  TQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT
        TQLIWMKNML EYGF+QDT+ LYCDN SAI ISKNPVQHS+TKHIDIRHHFIRELVEDKVIKLDHI SNLQL DIFTKPLDASSFEYLRAGLGVC T
Subjt:  TQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.4e-2534.34Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTT--PTLVGYCDAEWAGSTDDCKSTSG-------------------------AEAKYIAAG
        + I +RY +       + +KR+L+Y+ GT D  +++  +      ++GY D++WAGS  D KST+G                          EA+Y+A  
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTT--PTLVGYCDAEWAGSTDDCKSTSG-------------------------AEAKYIAAG

Query:  SGCTQLIWMKNMLLEYGFN-QDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGV
            + +W+K +L       ++ I +Y DN   I I+ NP  H + KHIDI++HF RE V++ VI L++I +  QLADIFTKPL A+ F  LR  LG+
Subjt:  SGCTQLIWMKNMLLEYGFN-QDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGV

P0CV72 Secreted RxLR effector protein 1612.0e-0940.62Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG
        +G+ +++ +DP  TH +A+KR+L+Y+  T  +G+ ++   T  LVGY DA+WAG  +  +STSG
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-2836.08Show/hide
Query:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGC
        +G+ +R+  +P   H EAVK IL+Y+ GT+   + +   + P L GY DA+ AG  D+ KS++G                         EA+YIAA    
Subjt:  MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGC

Query:  TQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGV
         ++IW+K  L E G +Q   ++YCD+ SAI +SKN + H++TKHID+R+H+IRE+V+D+ +K+  I +N   AD+ TK +  + FE  +  +G+
Subjt:  TQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-2434.05Show/hide
Query:  PRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGCTQLIWMKNML
        P   HL+A+KRIL+Y+ GT + G+      T +L  Y DA+WAG  DD  ST+G                         EA+Y +  +  +++ W+ ++L
Subjt:  PRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGCTQLIWMKNML

Query:  LEYGFN-QDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGV
         E G       ++YCDN+ A  +  NPV HS+ KHI I +HFIR  V+   +++ H+ ++ QLAD  TKPL  ++F+   + +GV
Subjt:  LEYGFN-QDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.9e-2533.51Show/hide
Query:  ARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGCTQLI
        ++Y   P   H  A+KR+L+Y+ GT D G+      T +L  Y DA+WAG TDD  ST+G                         EA+Y +  +  ++L 
Subjt:  ARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSG------------------------AEAKYIAAGSGCTQLI

Query:  WMKNMLLEYGFN-QDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGV
        W+ ++L E G       ++YCDN+ A  +  NPV HS+ KHI + +HFIR  V+   +++ H+ ++ QLAD  TKPL   +F+     +GV
Subjt:  WMKNMLLEYGFN-QDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.0e-1230.26Show/hide
Query:  ARYQADPRITHLEAVKRILKYVHGTSDFGMMYS------------------YDTTPTLVGYCD------AEWAGSTDDCKSTSGAEAKYIAAGSGCTQLI
        +++   PR+ H +AV +IL Y+ GT   G+ YS                   DT  +  GYC         W        S S AEA+Y A      +++
Subjt:  ARYQADPRITHLEAVKRILKYVHGTSDFGMMYS------------------YDTTPTLVGYCD------AEWAGSTDDCKSTSGAEAKYIAAGSGCTQLI

Query:  WMKNML--LEYGFNQDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRE
        W+      L+   ++ T +L+CDN +AI I+ N V H +TKHI+   H +RE
Subjt:  WMKNML--LEYGFNQDTIMLYCDNMSAIGISKNPVQHSQTKHIDIRHHFIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein1.8e-0529.11Show/hide
Query:  ICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIW
        +C R   +P +   + +KR+L+YV GT   G+    ++   +  +CD++WAG T   +ST+G          GC  + W
Subjt:  ICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATATGTGCTCGTTATCAGGCGGATCCCCGCATCACTCACCTAGAAGCTGTTAAACGAATTCTTAAATACGTTCATGGGACCAGTGACTTTGGAATGATGTATTC
CTATGATACCACTCCCACTTTAGTTGGATATTGTGATGCTGAATGGGCAGGTTCAACTGATGATTGTAAAAGTACGTCTGGAGCTGAAGCTAAATATATAGCTGCTGGTA
GTGGTTGTACACAATTGATTTGGATGAAAAATATGCTGCTTGAATATGGCTTTAATCAGGACACTATAATGTTGTATTGTGACAATATGAGTGCAATTGGTATATCTAAG
AATCCCGTTCAACATAGTCAAACTAAGCATATTGATATAAGACACCACTTTATTCGAGAACTAGTTGAAGACAAAGTAATCAAGCTTGATCATATTTGTTCAAATTTACA
ATTAGCTGATATCTTCACTAAACCCCTGGATGCCAGCTCATTCGAATACTTACGTGCTGGTTTAGGTGTGTGTCACACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAATATGTGCTCGTTATCAGGCGGATCCCCGCATCACTCACCTAGAAGCTGTTAAACGAATTCTTAAATACGTTCATGGGACCAGTGACTTTGGAATGATGTATTC
CTATGATACCACTCCCACTTTAGTTGGATATTGTGATGCTGAATGGGCAGGTTCAACTGATGATTGTAAAAGTACGTCTGGAGCTGAAGCTAAATATATAGCTGCTGGTA
GTGGTTGTACACAATTGATTTGGATGAAAAATATGCTGCTTGAATATGGCTTTAATCAGGACACTATAATGTTGTATTGTGACAATATGAGTGCAATTGGTATATCTAAG
AATCCCGTTCAACATAGTCAAACTAAGCATATTGATATAAGACACCACTTTATTCGAGAACTAGTTGAAGACAAAGTAATCAAGCTTGATCATATTTGTTCAAATTTACA
ATTAGCTGATATCTTCACTAAACCCCTGGATGCCAGCTCATTCGAATACTTACGTGCTGGTTTAGGTGTGTGTCACACTTAA
Protein sequenceShow/hide protein sequence
MGICARYQADPRITHLEAVKRILKYVHGTSDFGMMYSYDTTPTLVGYCDAEWAGSTDDCKSTSGAEAKYIAAGSGCTQLIWMKNMLLEYGFNQDTIMLYCDNMSAIGISK
NPVQHSQTKHIDIRHHFIRELVEDKVIKLDHICSNLQLADIFTKPLDASSFEYLRAGLGVCHT