; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc05g0133541 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc05g0133541
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr05:14316771..14317250
RNA-Seq ExpressionCmc05g0133541
SyntenyCmc05g0133541
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]2.1e-7186.62Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        DDG+EDPLT+KQAMNDVD DQ IKAM+LEMESMY NSVWTLVD P+D+KPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQKEG+DYE TFSPVAM+KSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
        RILLSIATFY+YEI +MDVKT FLNGNLEESIYMVQPEGFI + QEQKVC+LQKSIY
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

KAA0032291.1 gag/pol protein [Cucumis melo var. makuwa]9.9e-6989.86Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        DDGIEDPLT+KQAMNDVDCDQ IKAMDLEMESMY+NSVW LVDQP++++ IGCKWIYKRKRDQTGKVQTFKARLVAKGYTQ+EGIDYE TFSPVAMIKSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQK
        RILLSIATFYDYEI  MDVKT FLNGNLEESIYMVQ EGFIQKGQEQK
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQK

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-7389.17Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        +D IEDPLTFKQA NDVD DQ IKAMDL+MESMY+NSVWTLVDQPN+I+PIGCKWIYKRKRDQT KVQTF+ARLVAKGYTQKEGIDYE TFSP+AMIKSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
        RILLSIATFYDYEI +MDVKT FLNGNLEESIYMVQPEGFIQKGQEQKVC+LQKSIY
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

KAA0051500.1 gag/pol protein [Cucumis melo var. makuwa]3.4e-6987.92Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        +DGIEDPLT+KQ MNDVDCDQ IK MDLEMESMY+NSVWTLVDQPN++KPI CKW+YKRKRDQ GKVQTF+ARLVAKGYTQKEGIDYE TFSPVAMIKSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKV
        RI+LSIATFYDYEI  MDVKT FLNGNL+ESIYMVQPEGFIQKGQEQKV
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKV

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-7188.54Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        DDGIEDPLT+K AMNDVD DQ IKAMDLEMESMY+NSVWTLVDQPND+KPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQKEGIDYE  FS  AMIKSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
        RILLSIATFYDYEI +MDVKT FLN NLEESIYMVQPE FIQKGQEQK+C+LQKSIY
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

TrEMBL top hitse value%identityAlignment
A0A5A7SRW5 Gag/pol protein4.8e-6989.86Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        DDGIEDPLT+KQAMNDVDCDQ IKAMDLEMESMY+NSVW LVDQP++++ IGCKWIYKRKRDQTGKVQTFKARLVAKGYTQ+EGIDYE TFSPVAMIKSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQK
        RILLSIATFYDYEI  MDVKT FLNGNLEESIYMVQ EGFIQKGQEQK
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQK

A0A5A7TTA2 Gag/pol protein1.9e-7389.17Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        +D IEDPLTFKQA NDVD DQ IKAMDL+MESMY+NSVWTLVDQPN+I+PIGCKWIYKRKRDQT KVQTF+ARLVAKGYTQKEGIDYE TFSP+AMIKSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
        RILLSIATFYDYEI +MDVKT FLNGNLEESIYMVQPEGFIQKGQEQKVC+LQKSIY
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

A0A5A7U8L5 Gag/pol protein1.6e-6987.92Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        +DGIEDPLT+KQ MNDVDCDQ IK MDLEMESMY+NSVWTLVDQPN++KPI CKW+YKRKRDQ GKVQTF+ARLVAKGYTQKEGIDYE TFSPVAMIKSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKV
        RI+LSIATFYDYEI  MDVKT FLNGNL+ESIYMVQPEGFIQKGQEQKV
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKV

A0A5D3BX45 Gag/pol protein2.3e-7188.54Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        DDGIEDPLT+K AMNDVD DQ IKAMDLEMESMY+NSVWTLVDQPND+KPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQKEGIDYE  FS  AMIKSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
        RILLSIATFYDYEI +MDVKT FLN NLEESIYMVQPE FIQKGQEQK+C+LQKSIY
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

E2GK51 Gag/pol protein (Fragment)1.0e-7186.62Show/hide
Query:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI
        DDG+EDPLT+KQAMNDVD DQ IKAM+LEMESMY NSVWTLVD P+D+KPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQKEG+DYE TFSPVAM+KSI
Subjt:  DDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSI

Query:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
        RILLSIATFY+YEI +MDVKT FLNGNLEESIYMVQPEGFI + QEQKVC+LQKSIY
Subjt:  RILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.3e-2539.07Show/hide
Query:  PLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILLSI
        P +F +     D     +A++ E+ +   N+ WT+  +P +   +  +W++  K ++ G    +KARLVA+G+TQK  IDYE TF+PVA I S R +LS+
Subjt:  PLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILLSI

Query:  ATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
           Y+ ++ +MDVKT FLNG L+E IYM  P+G         VC+L K+IY
Subjt:  ATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.7e-3647.17Show/hide
Query:  MSDDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIK
        +SDD   +P + K+ ++  + +QL+KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KARLV KG+ QK+GID++  FSPV  + 
Subjt:  MSDDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIK

Query:  SIRILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
        SIR +LS+A   D E+ ++DVKT FL+G+LEE IYM QPEGF   G++  VC+L KS+Y
Subjt:  SIRILLSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

P92520 Uncharacterized mitochondrial protein AtMg008204.7e-1341.67Show/hide
Query:  KAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILLSIA
        +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q+EGI +  T+SPV    +IR +L++A
Subjt:  KAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-2741.18Show/hide
Query:  DPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLV-DQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILL
        +P T  QA+ D   ++   AM  E+ +   N  W LV   P+ +  +GC+WI+ +K +  G +  +KARLVAKGY Q+ G+DY  TFSPV    SIRI+L
Subjt:  DPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLV-DQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILL

Query:  SIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
         +A    + I ++DV   FL G L + +YM QP GFI K +   VC+L+K++Y
Subjt:  SIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.0e-2842.48Show/hide
Query:  DPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLV-DQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILL
        +P T  QAM D   D+  +AM  E+ +   N  W LV   P  +  +GC+WI+ +K +  G +  +KARLVAKGY Q+ G+DY  TFSPV    SIRI+L
Subjt:  DPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLV-DQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILL

Query:  SIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY
         +A    + I ++DV   FL G L + +YM QP GF+ K +   VC L+K+IY
Subjt:  SIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.9e-3044.53Show/hide
Query:  AMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILLSIATFYDYEILKMDVKTVFL
        AMD E+ +M     W +   P + KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EGID+  TFSPV  + S++++L+I+  Y++ + ++D+   FL
Subjt:  AMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILLSIATFYDYEILKMDVKTVFL

Query:  NGNLEESIYMVQPEGFIQKGQE----QKVCELQKSIY
        NG+L+E IYM  P G+  +  +      VC L+KSIY
Subjt:  NGNLEESIYMVQPEGFIQKGQE----QKVCELQKSIY

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.3e-1441.67Show/hide
Query:  KAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILLSIA
        +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q+EGI +  T+SPV    +IR +L++A
Subjt:  KAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGATGATGGCATAGAGGATCCATTGACCTTTAAACAGGCAATGAATGATGTGGATTGTGACCAATTGATCAAAGCCATGGACCTCGAAATGGAATCTATG
TATGCCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATATAAAACCGATTGGTTGTAAATGGATCTATAAGAGAAAACGAGACCAAACTGGTAAAGTACAG
ACTTTCAAAGCTCGACTTGTGGCAAAAGGTTATACACAAAAGGAGGGAATAGATTATGAAGCAACTTTCTCTCCTGTTGCCATGATAAAGTCGATTAGAATACTC
TTATCTATCGCCACTTTTTATGATTATGAAATTTTGAAGATGGATGTCAAGACAGTTTTTTTGAACGGTAATCTTGAGGAGAGTATTTATATGGTCCAACCAGAG
GGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTGTGAGCTTCAAAAATCCATTTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGATGATGGCATAGAGGATCCATTGACCTTTAAACAGGCAATGAATGATGTGGATTGTGACCAATTGATCAAAGCCATGGACCTCGAAATGGAATCTATG
TATGCCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATATAAAACCGATTGGTTGTAAATGGATCTATAAGAGAAAACGAGACCAAACTGGTAAAGTACAG
ACTTTCAAAGCTCGACTTGTGGCAAAAGGTTATACACAAAAGGAGGGAATAGATTATGAAGCAACTTTCTCTCCTGTTGCCATGATAAAGTCGATTAGAATACTC
TTATCTATCGCCACTTTTTATGATTATGAAATTTTGAAGATGGATGTCAAGACAGTTTTTTTGAACGGTAATCTTGAGGAGAGTATTTATATGGTCCAACCAGAG
GGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTGTGAGCTTCAAAAATCCATTTATTAA
Protein sequenceShow/hide protein sequence
MSDDGIEDPLTFKQAMNDVDCDQLIKAMDLEMESMYANSVWTLVDQPNDIKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQKEGIDYEATFSPVAMIKSIRIL
LSIATFYDYEILKMDVKTVFLNGNLEESIYMVQPEGFIQKGQEQKVCELQKSIY