; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0227771 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0227771
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr08:21115808..21116065
RNA-Seq ExpressionCmc08g0227771
SyntenyCmc08g0227771
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.2e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

A0A5A7TWB9 Gag/pol protein1.2e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

A0A5A7V4M1 Gag/pol protein1.2e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

A0A5D3CPJ6 Gag/pol protein1.2e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

A0A5D3DS88 Gag/pol protein1.2e-3188.75Show/hide
Query:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        +++LKSIRILL  AAYFDYEIWQMDVKTAFLNGNL+ETIYMQQPEG IIP QEQK+CKLNRSIYGLKQASRSWNIRFDTA
Subjt:  LSLLKSIRILL--AAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.3e-1351.95Show/hide
Query:  LKSIRILLAAYFDY--EIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA
        + S R +L+    Y  ++ QMDVKTAFLNG LKE IYM+ P+G  I      VCKLN++IYGLKQA+R W   F+ A
Subjt:  LKSIRILLAAYFDY--EIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-1755.26Show/hide
Query:  LKSIRIL--LAAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDT
        + SIR +  LAA  D E+ Q+DVKTAFL+G+L+E IYM+QPEG  +  ++  VCKLN+S+YGLKQA R W ++FD+
Subjt:  LKSIRIL--LAAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDT

Q12491 Transposon Ty2-B Gag-Pol polyprotein8.6e-0636.99Show/hide
Query:  HLLSLLKSIRILLAAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSW
        H  +L+ S+ I L    DY I Q+D+ +A+L  ++KE +Y++ P  L + D   K+ +L +S+YGLKQ+  +W
Subjt:  HLLSLLKSIRILLAAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.1e-1148.57Show/hide
Query:  SIRILLAAYFD--YEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNI
        SIRI+L    D  + I Q+DV  AFL G L + +YM QP G I  D+   VCKL +++YGLKQA R+W +
Subjt:  SIRILLAAYFD--YEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-1145.95Show/hide
Query:  SIRILLAAYFD--YEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDT
        SIRI+L    D  + I Q+DV  AFL G L + +YM QP G +  D+   VC+L ++IYGLKQA R+W +   T
Subjt:  SIRILLAAYFD--YEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDT

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-1143.59Show/hide
Query:  LKSIRILLA--AYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQE----QKVCKLNRSIYGLKQASRSWNIRF
        L S++++LA  A +++ + Q+D+  AFLNG+L E IYM+ P G      +      VC L +SIYGLKQASR W ++F
Subjt:  LKSIRILLA--AYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQE----QKVCKLNRSIYGLKQASRSWNIRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAGACTTTCTCACCTGTTGTCATTGTTAAAGTCTATTCGAATACTTTTGGCTGCATATTTTGACTATGAGATTTGGCAAATGGATGTAAAGACTGCCTTTTTGAA
TGGCAATCTTAAGGAGACCATCTATATGCAACAACCAGAAGGATTAATAATTCCAGATCAAGAGCAAAAGGTTTGCAAGCTTAATCGTTCTATTTATGGATTGAAACAAG
CTTCTCGATCTTGGAACATAAGATTTGATACCGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAGACTTTCTCACCTGTTGTCATTGTTAAAGTCTATTCGAATACTTTTGGCTGCATATTTTGACTATGAGATTTGGCAAATGGATGTAAAGACTGCCTTTTTGAA
TGGCAATCTTAAGGAGACCATCTATATGCAACAACCAGAAGGATTAATAATTCCAGATCAAGAGCAAAAGGTTTGCAAGCTTAATCGTTCTATTTATGGATTGAAACAAG
CTTCTCGATCTTGGAACATAAGATTTGATACCGCATAA
Protein sequenceShow/hide protein sequence
MRRLSHLLSLLKSIRILLAAYFDYEIWQMDVKTAFLNGNLKETIYMQQPEGLIIPDQEQKVCKLNRSIYGLKQASRSWNIRFDTA