CuGenDBv2

Cmc08g0226771 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0226771
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-Pol
Genome location: CMiso1.1chr08:19156340..19156678
RNA-Seq Expression: Cmc08g0226771
Synteny: Cmc08g0226771
Gene Ontology terms:
GO:0045489 - pectin biosynthetic process (biological process)
GO:0006281 - DNA repair (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0006629 - lipid metabolic process (biological process)
GO:0016567 - protein ubiquitination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0015979 - photosynthesis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0009654 - photosystem II oxygen evolving complex (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0047262 - polygalacturonate 4-alpha-galacturonosyltransferase activity (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
GO:0020037 - heme binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0016298 - lipase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0004842 - ubiquitin-protein transferase activity (molecular function)
GO:0004497 - monooxygenase activity (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
GEY37770.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Tanacetum cinerariifolium] (e value: 5.9e-44; identity: 77.68%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MSTI+ EY+AA QASKEA+WLKMLLEELGHE +KI+LFCDNQ+ALYLARNP FH+KTKHIRVQYHFI EKVEEG +DM KIHT +N++DYLTKA+N DKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQSSNSLAET
        IWC+SS  LA+T
Subjt:  IWCQSSNSLAET

KAA0060043.1 Gag-Pol [Cucumis melo var. makuwa] (e value: 1.4e-56; identity: 100%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQSSNSLAET
        IWCQSSNSLAET
Subjt:  IWCQSSNSLAET

KAA0067221.1 Gag-Pol [Cucumis melo var. makuwa] (e value: 2.7e-44; identity: 83.81%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MS  + EY+AATQASKE  WLKM+LE+L HEHKKISLFC NQ+ALYLARNPAFHAKTKHIRVQYHF+HEKVEEGIIDMHK+HTKENL DYLTKAVN DKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQS
        IWC+S
Subjt:  IWCQS

TYK06685.1 Gag-Pol [Cucumis melo var. makuwa] (e value: 1.9e-50; identity: 88.39%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MST EVEY+AATQ SKE +WLKMLLEEL HEHKKISLFCDNQ+ALYLARNPAFHAKTKHIRVQYHF+ EKVEEGIIDMHKIHTKENLADYLTK VNTDKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQSSNSLAET
        IWC+SSN L ET
Subjt:  IWCQSSNSLAET

TYK21896.1 Gag-Pol [Cucumis melo var. makuwa] (e value: 1.2e-55; identity: 99.11%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFI EKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQSSNSLAET
        IWCQSSNSLAET
Subjt:  IWCQSSNSLAET

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UVP5 Gag-Pol (e value: 6.6e-57; identity: 100%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQSSNSLAET
        IWCQSSNSLAET
Subjt:  IWCQSSNSLAET

A0A5A7VNN0 Gag-Pol (e value: 1.3e-44; identity: 83.81%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MS  + EY+AATQASKE  WLKM+LE+L HEHKKISLFC NQ+ALYLARNPAFHAKTKHIRVQYHF+HEKVEEGIIDMHK+HTKENL DYLTKAVN DKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQS
        IWC+S
Subjt:  IWCQS

A0A5D3C5M6 Gag-Pol (e value: 9.2e-51; identity: 88.39%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MST EVEY+AATQ SKE +WLKMLLEEL HEHKKISLFCDNQ+ALYLARNPAFHAKTKHIRVQYHF+ EKVEEGIIDMHKIHTKENLADYLTK VNTDKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQSSNSLAET
        IWC+SSN L ET
Subjt:  IWCQSSNSLAET

A0A5D3DE76 Gag-Pol (e value: 5.6e-56; identity: 99.11%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFI EKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQSSNSLAET
        IWCQSSNSLAET
Subjt:  IWCQSSNSLAET

A0A699HME6 Gag-Pol polyprotein (e value: 7.1e-43; identity: 74.11%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        MST+E EY+ A QASKEA+WLKMLLEE+GH+ +KI+LFCDNQ+ALYLARNPAFH+KTKHIRVQYHFI +KV+EGI+DMHKI+T +N+ADYL KA+N DKF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQSSNSLAET
        IWC+S+  LA+T
Subjt:  IWCQSSNSLAET

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 6.7e-14; identity: 37.62%)
Query:  STIEVEYIAATQASKEAIWLKMLLEELGHE-HKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        S+ E EY+A  +A +EA+WLK LL  +  +    I ++ DNQ  + +A NP+ H + KHI ++YHF  E+V+  +I +  I T+  LAD  TK +   +F
Subjt:  STIEVEYIAATQASKEAIWLKMLLEELGHE-HKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  I
        +
Subjt:  I

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.3e-22; identity: 50%)
Query:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        +ST E EYIAAT+  KE IWLK  L+ELG   K+  ++CD+Q+A+ L++N  +HA+TKHI V+YH+I E V++  + + KI T EN AD LTK V  +KF
Subjt:  MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Query:  IWCQ
          C+
Subjt:  IWCQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.5e-13; identity: 39%)
Query:  STIEVEYIAATQASKEAIWLKMLLEELG-HEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF
        S+ E EY +    S E  W+  LL ELG    +   ++CDN  A YL  NP FH++ KHI + YHFI  +V+ G + +  + T + LAD LTK ++   F
Subjt:  STIEVEYIAATQASKEAIWLKMLLEELG-HEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.5e-13; identity: 39.6%)
Query:  STIEVEYIAATQASKEAIWLKMLLEELGHE--HKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDK
        S+ E EY +    S E  W+  LL ELG +  H  + ++CDN  A YL  NP FH++ KHI + YHFI  +V+ G + +  + T + LAD LTK ++   
Subjt:  STIEVEYIAATQASKEAIWLKMLLEELGHE--HKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDK

Query:  F
        F
Subjt:  F

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 2.8e-07; identity: 40%)
Query:  STIEVEYIAATQASKEAIWLKMLLEELGHE-HKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEK
        S+ E EY A + A+ E +WL     EL     K   LFCDN  A+++A N  FH +TKHI    H + E+
Subjt:  STIEVEYIAATQASKEAIWLKMLLEELGHE-HKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEK
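
The % identity values listed for the hits can be related to the alignment midlines, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal Python sketch, assuming identity is the number of identical columns divided by the total number of alignment columns. The function name `percent_identity` is hypothetical and not part of CuGenDB.

```python
def percent_identity(midline_blocks, query_blocks):
    """Percent identity from BLAST-style alignment blocks.

    midline_blocks: midline strings (letter = identical, '+' = similar,
                    space = mismatch or gap), one per alignment block.
    query_blocks:   the matching Query strings; their total length is
                    taken as the number of alignment columns.
    """
    # Only letters in the midline count as identical positions.
    identical = sum(ch.isalpha() for block in midline_blocks for ch in block)
    # Use the Query lines for the column count, because trailing spaces
    # in a midline may have been stripped from the page.
    columns = sum(len(block) for block in query_blocks)
    if columns == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical / columns


# Last alignment block of the GEY37770.1 hit (12 columns only, so the
# value differs from the whole-alignment figure listed for that hit).
query = ["IWCQSSNSLAET"]
midline = ["IWC+SS  LA+T"]
print(round(percent_identity(midline, query), 2))
```

Passing every block of a hit (both the long and the short Query/midline pair) gives the figure for the whole alignment rather than for a single block.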


Sequences
CDS sequence
ATGTCAACAATAGAAGTAGAATACATTGCTGCTACACAAGCTAGTAAAGAAGCAATATGGTTAAAAATGCTATTGGAAGAGCTCGGGCATGAACATAAGAAAATCTCTTT
GTTTTGTGACAATCAGAATGCCTTGTATCTTGCAAGAAATCCAGCTTTCCATGCCAAGACCAAACATATTCGAGTGCAGTACCACTTTATTCATGAGAAAGTAGAGGAAG
GAATAATAGATATGCATAAAATTCATACAAAAGAAAACCTAGCAGATTACTTGACCAAGGCAGTCAACACTGACAAGTTCATTTGGTGTCAATCCTCAAATAGCCTAGCA
GAAACATAA
mRNA sequence
ATGTCAACAATAGAAGTAGAATACATTGCTGCTACACAAGCTAGTAAAGAAGCAATATGGTTAAAAATGCTATTGGAAGAGCTCGGGCATGAACATAAGAAAATCTCTTT
GTTTTGTGACAATCAGAATGCCTTGTATCTTGCAAGAAATCCAGCTTTCCATGCCAAGACCAAACATATTCGAGTGCAGTACCACTTTATTCATGAGAAAGTAGAGGAAG
GAATAATAGATATGCATAAAATTCATACAAAAGAAAACCTAGCAGATTACTTGACCAAGGCAGTCAACACTGACAAGTTCATTTGGTGTCAATCCTCAAATAGCCTAGCA
GAAACATAA
Protein sequence
MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKFIWCQSSNSLA
ET
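
The genome location, CDS and protein sequence of this record can be checked against each other. The sketch below is an illustration in Python, assuming the location uses 1-based inclusive coordinates and that the CDS is translated with the standard genetic code; the helper names `translate` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing anything other than A/C/G/T raise KeyError.
    """
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


def span_length(location):
    """Length in bp of a 'seqid:start..end' location.

    Assumes 1-based, inclusive coordinates.
    """
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1


# Values copied from this record.
LOCATION = "CMiso1.1chr08:19156340..19156678"
CDS = (
    "ATGTCAACAATAGAAGTAGAATACATTGCTGCTACACAAGCTAGTAAAGAAGCAATATGGTTAAAAATGCTATTGGAAGAGCTCGGGCATGAACATAAGAAAATCTCTTT"
    "GTTTTGTGACAATCAGAATGCCTTGTATCTTGCAAGAAATCCAGCTTTCCATGCCAAGACCAAACATATTCGAGTGCAGTACCACTTTATTCATGAGAAAGTAGAGGAAG"
    "GAATAATAGATATGCATAAAATTCATACAAAAGAAAACCTAGCAGATTACTTGACCAAGGCAGTCAACACTGACAAGTTCATTTGGTGTCAATCCTCAAATAGCCTAGCA"
    "GAAACATAA"
)
PROTEIN = (
    "MSTIEVEYIAATQASKEAIWLKMLLEELGHEHKKISLFCDNQNALYLARNPAFHAKTKHIRVQYHFIHEKVEEGIIDMHKIHTKENLADYLTKAVNTDKFIWCQSSNSLA"
    "ET"
)

# Compare the CDS length with the genomic span, then the translation
# with the listed protein sequence.
print(len(CDS), span_length(LOCATION))
print(translate(CDS) == PROTEIN)
```

If the two lengths agree, the gene occupies a single uninterrupted interval with no intron between start and stop codon, which fits the mRNA and CDS sequences being identical in this record.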