CuGenDBv2

Cmc01g0026801 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0026801
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon 17.6
Genome location: CMiso1.1chr01:27925990..27926421
RNA-Seq Expression: Cmc01g0026801
Synteny: Cmc01g0026801
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
    IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0063033.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa] (e value: 2.1e-46; identity: 66.43%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFMRLMNQ+L PFLNQFI+VYFDDIL+YSSS+EDHL+H+  LF +L E EL IN KKC FL   I FLGF+I    I I+PKK++ I SW
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
        P P   K++Q FLGL SFY+RFIRNFS+I AP  NCLKKG+F+
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ

KAE8652794.1 hypothetical protein Csa_022828 [Cucumis sativus] (e value: 5.8e-68; identity: 89.51%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFMRLMNQILQPFLNQFI+VYFDDILIYSSS+EDHLKHI LLFT LQENELQINLKKCEFLCYSIHFL FII+C+GIS+DPKKID+ISSW
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
        PQPKTPKDIQCFLGL SFYRRFI+NFSTIAAP  NCLKKGSFQ
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ

XP_031120876.1 uncharacterized protein LOC116024113 [Ipomoea triloba] (e value: 1.3e-43; identity: 59.44%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFMRLMNQ+LQPF+ +F++VYFDDILIYS + E+HL H+H +F  L+EN+L INLKKC FL   + FLG+++  +GI +D  KI+ I  W
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
        P PKT  +++ F GL SFYRRFIR+FST+AAP  NC+K+G F+
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ

XP_038989842.1 uncharacterized protein LOC120113111 [Phoenix dactylifera] (e value: 9.9e-44; identity: 58.74%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFMRLMNQI +P++ +F++VYFDDIL+YS  +E HL+H+  +F  LQ+ +L +NL+KC+FL  S+ FLG+++  NGI +DPKKI+ ISSW
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
        P P+T  +I+ F GL +FYRRFIR+FS+I AP  NCLK GS+Q
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ

XP_040245606.1 uncharacterized protein LOC109732219 isoform X1 [Aegilops tauschii subsp. strangulata] (e value: 2.9e-43; identity: 58.04%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFM LMNQ+L+PFL+ F++VYFDDILIYS  +++H  HI  +  +L+ENEL +NLKKC FL   + FLGF+I C+GI +D  K++ I  W
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
        P PKT  +++ F GL +FYRRF++NFSTI AP   CLKKG FQ
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ

TrEMBL top hits (e value, % identity, alignment)
A0A5A7V4Q2 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 1.0e-46; identity: 66.43%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFMRLMNQ+L PFLNQFI+VYFDDIL+YSSS+EDHL+H+  LF +L E EL IN KKC FL   I FLGF+I    I I+PKK++ I SW
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
        P P   K++Q FLGL SFY+RFIRNFS+I AP  NCLKKG+F+
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ

A0A5D3CXW2 DNA/RNA polymerases superfamily protein (e value: 4.1e-43; identity: 64.34%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFMRLMNQ+L PFLNQFI+VYFDDIL+ SSS+EDHL+H+  LF +L E EL IN KK  FL   I FLGF+I    I I+PKK++ I SW
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
        P P   K++Q FLGL SFY+RFIRNFS+I AP  N LKKG+F+
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ

A0A5D3DSW8 DNA/RNA polymerases superfamily protein (e value: 4.1e-43; identity: 61.54%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFMRLMNQ+L PFLN+F++VYFDDILIYS  K++H+ H+  LF +L E EL IN KKC FL   I FLGFII    IS++PKK + I +W
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
        P P + K+IQ FLGL SFYR+FI+NFS+I  P   CLKKG+F+
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ

A0A5E4FUK3 PREDICTED: reverse mRNAase (Fragment) (e value: 2.6e-42; identity: 57.75%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFMRLMNQ+L+PF+  F++VYFDDILIYS++KE+HL H+  +  +L+EN+L +NLKKC F    + FLGF++  NGI +D +KI  I  W
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSF
        P PKT  +++ F GL +FYRRF+R+FS+IAAP   CLKKG F
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSF

A0A5E4GNP8 PREDICTED: reverse mRNAase (Fragment) (e value: 2.6e-42; identity: 57.75%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGLSN PSTFMRLMNQ+L+PF+  F++VYFDDILIYS++KE+HL H+  +  +L+EN+L +NLKKC F    + FLGF++  NGI +D +KI  I  W
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSF
        P PKT  +++ F GL +FYRRF+R+FS+IAAP   CLKKG F
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSF

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 3.7e-33; identity: 48.2%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGL N P+TF R MN IL+P LN+  +VY DDI+++S+S ++HL+ + L+F  L +  L++ L KCEFL     FLG ++  +GI  +P+KI+ I  +
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKK
        P P  PK+I+ FLGLT +YR+FI NF+ IA P   CLKK
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKK

P10401 Retrovirus-related Pol polyprotein from transposon gypsy (e value: 1.7e-22; identity: 34.78%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        + FGL N  S F R ++ +L+  + +   VY DD++I+S ++ DH++HI  +   L +  ++++ +K  F   S+ +LGFI++ +G   DP+K+  I  +
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLK
        P+P     ++ FLGL S+YR FI++F+ IA P  + LK
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLK

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 1.4e-32; identity: 49.64%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGL N P+TF R MN IL+P LN+  +VY DDI+I+S+S  +HL  I L+FT L +  L++ L KCEFL    +FLG I+  +GI  +P K+  I S+
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKK
        P P   K+I+ FLGLT +YR+FI N++ IA P  +CLKK
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKK

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 1.5e-26; identity: 39.26%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        + FGL N P+ F R+++ IL+  + +   VY DDI+++S   + H K++ L+   L +  LQ+NL+K  FL   + FLG+I+  +GI  DPKK+  IS  
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSIN
        P P + K+++ FLG+TS+YR+FI++++ +A P  N
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSIN

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 7.2e-21; identity: 43.18%)
Query:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW
        M FGL N PSTF R M    +    +F+ VY DDILI+S S E+H KH+  +   L+   L +  KKC+F      FLG+ I    I+    K   I  +
Subjt:  MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSW

Query:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAP
        P PKT K  Q FLG+ ++YRRFI N S IA P
Subjt:  PQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAP

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 5.5e-16; identity: 37%)
Query:  LKHIHLLFTILQENELQINLKKCEFLCYSIHFLG--FIINCNGISIDPKKIDTISSWPQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
        + H+ ++  I ++++   N KKC F    I +LG   II+  G+S DP K++ +  WP+PK   +++ FLGLT +YRRF++N+  I  P    LKK S +
Subjt:  LKHIHLLFTILQENELQINLKKCEFLCYSIHFLG--FIINCNGISIDPKKIDTISSWPQPKTPKDIQCFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
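
The % identity values reported for the hits in this section can be related to the middle line of each alignment, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The following is a minimal sketch, assuming identity is the number of identical positions divided by the number of aligned columns. The function name `percent_identity` is hypothetical and not part of CuGenDB or BLAST, and the denominator used by a given pipeline may differ when gaps are present.

```python
def percent_identity(midline: str, aligned_columns: int) -> float:
    """Percent identity from a BLAST-style alignment midline.

    Letters in the midline mark identical residues; '+' marks a
    conservative substitution and a space marks a mismatch.
    The number of aligned columns is passed explicitly because
    leading or trailing spaces in the midline are easily lost.
    """
    if aligned_columns <= 0:
        raise ValueError("aligned_columns must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_columns, 2)


# First 10 columns of the midline of the first GenBank hit:
# 8 identical residues out of 10 columns.
print(percent_identity("M FGLSN PS", 10))  # 80.0
```

Under this assumption, 128 identical positions over 143 aligned columns gives 89.51, which is consistent with the value listed for KAE8652794.1.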


Sequences
CDS sequence
ATGTCTTTTGGTCTCTCAAATCCCCCTAGCACCTTTATGCGTTTAATGAACCAAATCCTCCAACCTTTTTTGAATCAATTCATTATGGTATACTTTGATGACATACTCAT
CTATAGCTCCTCTAAAGAAGACCACCTCAAACACATCCATTTACTTTTTACTATCTTACAAGAAAATGAGTTGCAAATTAACCTTAAAAAGTGTGAATTTTTGTGTTATA
GCATTCATTTTCTTGGCTTTATCATAAATTGTAATGGTATTTCAATTGATCCTAAAAAGATCGATACTATTAGTTCTTGGCCTCAACCAAAAACTCCAAAAGATATTCAA
TGCTTTTTAGGCTTAACTTCCTTTTATAGAAGGTTTATAAGGAATTTCAGCACTATTGCAGCCCCCTCGATAAATTGTTTAAAAAAGGGTAGTTTCCAATAG
mRNA sequence
ATGTCTTTTGGTCTCTCAAATCCCCCTAGCACCTTTATGCGTTTAATGAACCAAATCCTCCAACCTTTTTTGAATCAATTCATTATGGTATACTTTGATGACATACTCAT
CTATAGCTCCTCTAAAGAAGACCACCTCAAACACATCCATTTACTTTTTACTATCTTACAAGAAAATGAGTTGCAAATTAACCTTAAAAAGTGTGAATTTTTGTGTTATA
GCATTCATTTTCTTGGCTTTATCATAAATTGTAATGGTATTTCAATTGATCCTAAAAAGATCGATACTATTAGTTCTTGGCCTCAACCAAAAACTCCAAAAGATATTCAA
TGCTTTTTAGGCTTAACTTCCTTTTATAGAAGGTTTATAAGGAATTTCAGCACTATTGCAGCCCCCTCGATAAATTGTTTAAAAAAGGGTAGTTTCCAATAG
Protein sequence
MSFGLSNPPSTFMRLMNQILQPFLNQFIMVYFDDILIYSSSKEDHLKHIHLLFTILQENELQINLKKCEFLCYSIHFLGFIINCNGISIDPKKIDTISSWPQPKTPKDIQ
CFLGLTSFYRRFIRNFSTIAAPSINCLKKGSFQ
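
The genome location, CDS, and protein sequence can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and the standard genetic code. The helper names `span_length` and `translate` are hypothetical and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def span_length(location: str) -> int:
    """Length in bp of a 'chrom:start..end' span (1-based, inclusive)."""
    _, coords = location.rsplit(":", 1)
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


length = span_length("CMiso1.1chr01:27925990..27926421")
print(length)           # 432
print(length // 3 - 1)  # 143 codons before the stop codon
# First 30 nt of the CDS above:
print(translate("ATGTCTTTTGGTCTCTCAAATCCCCCTAGC"))  # MSFGLSNPPS
```

A 432 bp span corresponds to 143 codons plus one stop codon, which agrees with the 143-residue protein sequence listed above.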