CuGenDBv2

Cmc08g0224261 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0224261
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr08:14152282..14152761
RNA-Seq Expression: Cmc08g0224261
Synteny: Cmc08g0224261
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
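The genome location can be cross-checked against the sequence lengths listed later in this record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper, not a CuGenDB function.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("CMiso1.1chr08:14152282..14152761")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the span is 480 bp, which equals the length of the CDS in this record (159 codons plus a stop codon).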


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 1.7e-68; identity: 86.16%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDILLIGNDVGYLT+VK WLAAQFQMKD  EAQYVLGIQIIRDRKNK LALSQATYI K+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQEV+D+RR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YAS VGSLMY ML TRPDICYAVGI+SRYQSNP LDH  AVKI+LKYLR TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

KAA0043583.1 putative Integrase core domain [Cucumis melo var. makuwa] (e-value: 1.8e-70; identity: 88.68%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDILLIGNDVGYLT+VK WLAA+FQMKD  EAQYVLGIQIIRDRKNK LALSQATYI KMLVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQEV+D+RR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YASVVGSLMYVMLYTRPDICYAVGI+SRYQSNP LDH  AVKIILKYLR TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

KAA0059556.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 3.4e-69; identity: 87.42%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        MDDILLIGNDVGYLT+VK WLAAQFQMKD  EAQYVLGIQIIRDRKNK LALSQATYI KMLVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQEV+D+RR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YAS VGSLMY ML TRPDICYAVGI+SRYQSNP LDH  AVKIILKYL+ TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

KAA0063766.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 9.9e-69; identity: 87.42%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDILLIGNDVGYLT+VK WLAAQFQMKD  EAQYVLGIQIIRD KNK LALSQATYI KMLVRY MQNSKKGLLPFR+GVHLSKEQ PKTPQ+V+DIRR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YAS VGSLMYVMLYTRPDICYAV I+SRYQSNPRLDH  AVKIILKYLR TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

TYK05518.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 9.9e-69; identity: 87.42%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDILLIGNDVGYLT+VK WLAAQFQMKD  EAQYVLGIQIIRD KNK LALSQATYI KMLVRY MQNSKKGLLPFR+GVHLSKEQ PKTPQ+V+DIRR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YAS VGSLMYVMLYTRPDICYAV I+SRYQSNPRLDH  AVKIILKYLR TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
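The % identity values can be reproduced from the alignment midline (the line between Query and Subjt), where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The following is a minimal sketch for the first GenBank hit (KAA0025945.1), assuming identity is computed as identical positions divided by alignment length; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline_blocks):
    """Return (identical, aligned, percent) for BLAST-style midline blocks.

    Assumption: every character of the midline is one alignment column,
    and only letters denote identical residues.
    """
    midline = "".join(midline_blocks)
    identical = sum(ch.isalpha() for ch in midline)
    aligned = len(midline)
    return identical, aligned, round(100 * identical / aligned, 2)


# Midline blocks copied from the KAA0025945.1 alignment in this record.
MIDLINE = [
    "+DDILLIGNDVGYLT+VK WLAAQFQMKD  EAQYVLGIQIIRDRKNK LALSQATYI K+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQEV+D+RR",
    "I YAS VGSLMY ML TRPDICYAVGI+SRYQSNP LDH  AVKI+LKYLR TRDYMLV",
]

identical, aligned, pct = percent_identity(MIDLINE)
print(identical, aligned, pct)  # 137 159 86.16
```

The result, 137 identical positions out of 159, agrees with the 86.16% listed for this hit.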

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TJH9 Putative Integrase core domain (e-value: 8.7e-71; identity: 88.68%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDILLIGNDVGYLT+VK WLAA+FQMKD  EAQYVLGIQIIRDRKNK LALSQATYI KMLVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQEV+D+RR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YASVVGSLMYVMLYTRPDICYAVGI+SRYQSNP LDH  AVKIILKYLR TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

A0A5A7TZD0 Gag/pol protein (e-value: 8.2e-69; identity: 86.16%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDILLIGNDVGYLT+VK WLAAQFQMKD  EAQYVLGIQIIRDRKNK LALSQATYI K+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQEV+D+RR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YAS VGSLMY ML TRPDICYAVGI+SRYQSNP LDH  AVKI+LKYLR TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

A0A5A7UZF3 Gag/pol protein (e-value: 1.6e-69; identity: 87.42%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        MDDILLIGNDVGYLT+VK WLAAQFQMKD  EAQYVLGIQIIRDRKNK LALSQATYI KMLVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQEV+D+RR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YAS VGSLMY ML TRPDICYAVGI+SRYQSNP LDH  AVKIILKYL+ TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

A0A5A7VBG2 Gag/pol protein (e-value: 4.8e-69; identity: 87.42%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDILLIGNDVGYLT+VK WLAAQFQMKD  EAQYVLGIQIIRD KNK LALSQATYI KMLVRY MQNSKKGLLPFR+GVHLSKEQ PKTPQ+V+DIRR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YAS VGSLMYVMLYTRPDICYAV I+SRYQSNPRLDH  AVKIILKYLR TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

A0A5D3C2T5 Gag/pol protein (e-value: 4.8e-69; identity: 87.42%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDILLIGNDVGYLT+VK WLAAQFQMKD  EAQYVLGIQIIRD KNK LALSQATYI KMLVRY MQNSKKGLLPFR+GVHLSKEQ PKTPQ+V+DIRR
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
        I YAS VGSLMYVMLYTRPDICYAV I+SRYQSNPRLDH  AVKIILKYLR TRDYMLV
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value: 1.9e-14; identity: 30.86%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVH---LSKEQFPKTPQEVKD
        +DD+++   D+  + N K +L  +F+M D  E ++ +GI+I  + +   + LSQ+ Y+ K+L ++ M+N      P    ++   L+ ++   TP     
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVH---LSKEQFPKTPQEVKD

Query:  IRRIFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
               S++G LMY+ML TRPD+  AV I+SRY S    +    +K +L+YL+ T D  L+
Subjt:  IRRIFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 9.4e-30; identity: 44.44%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DD+L++G D G +  +K  L+  F MKD   AQ +LG++I+R+R ++ L LSQ  YI ++L R+ M+N+K    P    + LSK+  P T +E  ++ +
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRIT
        + Y+S VGSLMY M+ TRPDI +AVG++SR+  NP  +H  AVK IL+YLR T
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRIT

P25600 Putative transposon Ty5-1 protein YCL074W (e-value: 1.8e-09; identity: 31.01%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DD+L+          VK  L   + MKD  +    LG+  I    N  + LS   YI K      +   K    P  +    SK  F  T   +KDI  
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYML
          Y S+VG L++     RPDI Y V ++SR+   PR  H  + + +L+YL  TR   L
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYML

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 4.2e-14; identity: 36.54%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDIL+ GND   L N    L+ +F +KD  E  Y LGI+    R    L LSQ  YI  +L R  M  +K    P      LS     K     +    
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDY
          Y  +VGSL Y + +TRPDI YAV  +S++   P  +H  A+K IL+YL  T ++
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 2.5e-14; identity: 34.62%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDIL+ GND   L +    L+ +F +K+  +  Y LGI+    R  + L LSQ  Y   +L R  M  +K    P      L+     K P   +    
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDY
          Y  +VGSL Y + +TRPD+ YAV  +S+Y   P  DH  A+K +L+YL  T D+
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDY

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value: 3.2e-09; identity: 28.1%)
Query:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR
        +DDI++  N+   +  +K+ L + F+++D    +Y LG++I R      + + Q  Y   +L    +   K   +P    V  S      +  +  D + 
Subjt:  MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRR

Query:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRIT
          Y  ++G LMY+ + TR DI +AV  +S++   PRL H  AV  IL Y++ T
Subjt:  IFYASVVGSLMYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRIT


Sequences
CDS sequence
ATGGACGATATCCTCCTCATTGGGAATGATGTGGGATACCTTACTAACGTTAAAACTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTCAGAGAGGCACAATATGTTCT
TGGGATCCAAATCATAAGGGATCGCAAGAACAAAATGCTAGCACTATCTCAAGCAACCTATATCTGCAAAATGTTGGTTCGATATTTGATGCAAAACTCTAAGAAGGGTT
TATTACCTTTCAGACATGGGGTTCACTTGTCTAAGGAACAGTTTCCTAAGACACCTCAAGAAGTTAAGGATATAAGACGTATTTTCTATGCCTCAGTTGTGGGCAGCTTA
ATGTATGTTATGCTCTACACTAGGCCAGACATTTGTTATGCAGTGGGAATAATCAGTAGGTATCAGTCCAACCCAAGGTTAGACCACTCGATGGCTGTTAAAATTATTCT
CAAGTATCTTAGGATAACGAGAGACTACATGCTTGTGTAG
mRNA sequence
ATGGACGATATCCTCCTCATTGGGAATGATGTGGGATACCTTACTAACGTTAAAACTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTCAGAGAGGCACAATATGTTCT
TGGGATCCAAATCATAAGGGATCGCAAGAACAAAATGCTAGCACTATCTCAAGCAACCTATATCTGCAAAATGTTGGTTCGATATTTGATGCAAAACTCTAAGAAGGGTT
TATTACCTTTCAGACATGGGGTTCACTTGTCTAAGGAACAGTTTCCTAAGACACCTCAAGAAGTTAAGGATATAAGACGTATTTTCTATGCCTCAGTTGTGGGCAGCTTA
ATGTATGTTATGCTCTACACTAGGCCAGACATTTGTTATGCAGTGGGAATAATCAGTAGGTATCAGTCCAACCCAAGGTTAGACCACTCGATGGCTGTTAAAATTATTCT
CAAGTATCTTAGGATAACGAGAGACTACATGCTTGTGTAG
Protein sequence
MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRRIFYASVVGSL
MYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV
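
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the `translate` function and the constant names are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein copied from this record (line breaks removed).
CDS = (
    "ATGGACGATATCCTCCTCATTGGGAATGATGTGGGATACCTTACTAACGTTAAAACTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTCAGAGAGGCACAATATGTTCT"
    "TGGGATCCAAATCATAAGGGATCGCAAGAACAAAATGCTAGCACTATCTCAAGCAACCTATATCTGCAAAATGTTGGTTCGATATTTGATGCAAAACTCTAAGAAGGGTT"
    "TATTACCTTTCAGACATGGGGTTCACTTGTCTAAGGAACAGTTTCCTAAGACACCTCAAGAAGTTAAGGATATAAGACGTATTTTCTATGCCTCAGTTGTGGGCAGCTTA"
    "ATGTATGTTATGCTCTACACTAGGCCAGACATTTGTTATGCAGTGGGAATAATCAGTAGGTATCAGTCCAACCCAAGGTTAGACCACTCGATGGCTGTTAAAATTATTCT"
    "CAAGTATCTTAGGATAACGAGAGACTACATGCTTGTGTAG"
)
PROTEIN = (
    "MDDILLIGNDVGYLTNVKTWLAAQFQMKDFREAQYVLGIQIIRDRKNKMLALSQATYICKMLVRYLMQNSKKGLLPFRHGVHLSKEQFPKTPQEVKDIRRIFYASVVGSL"
    "MYVMLYTRPDICYAVGIISRYQSNPRLDHSMAVKIILKYLRITRDYMLV"
)

translated = translate(CDS)
print(len(CDS), len(PROTEIN))
print(translated == PROTEIN + "*")  # True
```

The 480-nt CDS translates to the 159-residue protein followed by a single stop codon (TAG).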