CuGenDBv2

CmoCh04G021110 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G021110
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Ribonuclease H-like domain, reverse transcriptase, RNA-dependent DNA polymerase
Genome location: Cmo_Chr04:13579804..13580483
RNA-Seq Expression: CmoCh04G021110
Synteny: CmoCh04G021110
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
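
The genome location above can be split into a sequence ID and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span, so both end points are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'seqid:start..end'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("Cmo_Chr04:13579804..13580483")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 680 bp on Cmo_Chr04.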


Homology
GenBank top hits: e value, % identity, alignment
EEC84282.1 hypothetical protein OsI_30754 [Oryza sativa Indica Group] (e value: 3.6e-34; identity: 60%)
Query:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL
        A+A   Q + K+    LT   +            LKKN+ G VIKHKARL+ KGYVQ+QGV+FEEVFAPVA+L TVR+ILA+AA  +WEV HLD+K+ FL
Subjt:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL

Query:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
        NGDL+EEVYVAQPEGFV + EEH V +LSK LYGL
Subjt:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL

KAB8107251.1 hypothetical protein EE612_041900 [Oryza sativa] (e value: 3.6e-34; identity: 60%)
Query:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL
        A+A   Q + K+    LT   +            LKKN+ G VIKHKARL+ KGYVQ+QGV+FEEVFAPVA+L TVR+ILA+AA  +WEV HLD+K+ FL
Subjt:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL

Query:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
        NGDL+EEVYVAQPEGFV + EEH V +LSK LYGL
Subjt:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL

KAG7538346.1 Ribonuclease H-like superfamily [Arabidopsis suecica] (e value: 5.2e-33; identity: 56.25%)
Query:  TVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEE
        +++K+    L EK              +KKN++G + K+KARL+ KGYVQ+ G++FEEVFAPVA+L T+RL++ALAA H WE+ HLD+KT FL+G+L+E+
Subjt:  TVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEE

Query:  VYVAQPEGFVIKAEEHKVYKLSKTLYGL
        VYV QPEGF +K EEHKVYKLSK LYGL
Subjt:  VYVAQPEGFVIKAEEHKVYKLSKTLYGL

KAG7553951.1 Zinc finger CCHC-type [Arabidopsis suecica] (e value: 5.2e-33; identity: 56.25%)
Query:  TVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEE
        +++K+    L EK              +KKN++G + K+KARL+ KGYVQ+ G++FEEVFAPVA+L T+RL++ALAA H WE+ HLD+KT FL+G+L+E+
Subjt:  TVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEE

Query:  VYVAQPEGFVIKAEEHKVYKLSKTLYGL
        VYV QPEGF +K EEHKVYKLSK LYGL
Subjt:  VYVAQPEGFVIKAEEHKVYKLSKTLYGL

KAG7559162.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa] (e value: 5.2e-33; identity: 56.25%)
Query:  TVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEE
        +++K+    L EK              +KKN++G + K+KARL+ KGYVQ+ G++FEEVFAPVA+L T+RL++ALAA H WE+ HLD+KT FL+G+L+E+
Subjt:  TVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEE

Query:  VYVAQPEGFVIKAEEHKVYKLSKTLYGL
        VYV QPEGF +K EEHKVYKLSK LYGL
Subjt:  VYVAQPEGFVIKAEEHKVYKLSKTLYGL

TrEMBL top hits: e value, % identity, alignment
A0A0P0XB91 Os08g0125300 protein (e value: 1.7e-34; identity: 60%)
Query:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL
        A+A   Q + K+    LT   +            LKKN+ G VIKHKARL+ KGYVQ+QGV+FEEVFAPVA+L TVR+ILA+AA  +WEV HLD+K+ FL
Subjt:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL

Query:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
        NGDL+EEVYVAQPEGFV + EEH V +LSK LYGL
Subjt:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL

A0A251U1A0 Putative zinc finger, CCHC-type (e value: 5.1e-34; identity: 55.32%)
Query:  ERKDKMALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLD
        E   + A+ A  +++ K+    LT+  S + A        LKK++ GNV KHKARL+ KGYVQQ+GV+FE+ FAPVA++ TVRLI+A+AA   W V HLD
Subjt:  ERKDKMALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLD

Query:  IKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
        +K+ FLNGDLQEEVYV QPEGF +K +EH VYKL K LYGL
Subjt:  IKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL

B8BDZ6 Uncharacterized protein (e value: 1.7e-34; identity: 60%)
Query:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL
        A+A   Q + K+    LT   +            LKKN+ G VIKHKARL+ KGYVQ+QGV+FEEVFAPVA+L TVR+ILA+AA  +WEV HLD+K+ FL
Subjt:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL

Query:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
        NGDL+EEVYVAQPEGFV + EEH V +LSK LYGL
Subjt:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL

Q0J8A6 Os08g0125300 protein (e value: 1.7e-34; identity: 60%)
Query:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL
        A+A   Q + K+    LT   +            LKKN+ G VIKHKARL+ KGYVQ+QGV+FEEVFAPVA+L TVR+ILA+AA  +WEV HLD+K+ FL
Subjt:  ALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFL

Query:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
        NGDL+EEVYVAQPEGFV + EEH V +LSK LYGL
Subjt:  NGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL

Q10F84 Gag-pol polyprotein (e value: 2.5e-33; identity: 71.29%)
Query:  LKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYG
        LKKN+ G VIKHKARL+ KGYVQ+QGV+F+EVFAPVA+L TVR IL +A   +W+V HLD+K+ FLNGDL+EEVYV+QPEGFV K +EH VYKLSK LYG
Subjt:  LKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYG

Query:  L
        L
Subjt:  L

SwissProt top hits: e value, % identity, alignment
P04146 Copia protein (e value: 3.4e-19; identity: 39.5%)
Query:  LTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGF
        +T++   K   ++     +K N  GN I++KARL+ +G+ Q+  +++EE FAPVA++ + R IL+L  Q+  +V  +D+KT FLNG L+EE+Y+  P+G 
Subjt:  LTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGF

Query:  VIKAEEHKVYKLSKTLYGL
         I      V KL+K +YGL
Subjt:  VIKAEEHKVYKLSKTLYGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.4e-25; identity: 40.14%)
Query:  EERKDKMALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHL
        E+ +   A+    +++ K+    L E    K   +      LKK+ +  ++++KARL+ KG+ Q++G++F+E+F+PV K+ ++R IL+LAA    EV  L
Subjt:  EERKDKMALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHL

Query:  DIKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
        D+KT FL+GDL+EE+Y+ QPEGF +  ++H V KL+K+LYGL
Subjt:  DIKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.4e-19; identity: 43%)
Query:  KKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
        K NS+G++ ++KARL+ KGY Q+ G+++ E F+PV K  ++R++L +A    W +  LD+   FL G L ++VY++QP GF+ K   + V KL K LYGL
Subjt:  KKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.3e-18; identity: 43%)
Query:  KKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
        K NS+G++ ++KARL+ KGY Q+ G+++ E F+PV K  ++R++L +A    W +  LD+   FL G L +EVY++QP GFV K     V +L K +YGL
Subjt:  KKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL

Arabidopsis top hits: e value, % identity, alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 1.8e-20; identity: 44.76%)
Query:  LKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGFVIKAEE----HKVYKLSK
        +K NS+G + ++KARL+ KGY QQ+G++F E F+PV KL +V+LILA++A + + +  LDI   FLNGDL EE+Y+  P G+  +  +    + V  L K
Subjt:  LKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGFVIKAEE----HKVYKLSK

Query:  TLYGL
        ++YGL
Subjt:  TLYGL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 3.7e-05; identity: 42%)
Query:  KKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQ
        K +S+G + + KARL+ KG+ Q++G+ F E ++PV +  T+R IL +A Q
Subjt:  KKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQ
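
In BLAST-style alignments such as those above, the middle line shows a letter where the two residues are identical and `+` where the substitution is conservative. The percent identity can therefore be recomputed from the middle line alone. The following is a minimal sketch, assuming an ungapped alignment so that the aligned length equals the query segment length; `percent_identity` is a hypothetical helper, not part of any database tooling.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of aligned positions whose midline character is a letter.

    Letters mark identical residues; '+' and spaces are not counted.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Query segment and middle line of the ATMG00820.1 alignment.
query = "KKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAAQ"
midline = "K +S+G + + KARL+ KG+ Q++G+ F E ++PV +  T+R IL +A Q"

print(percent_identity(midline, len(query)))
```

For this hit the middle line contains 21 identical residues over 50 aligned positions, which agrees with the listed 42% identity.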


Sequences
CDS sequence
ATGCGTGTCAACTTACAAGCACAAGGCATGTGGGATGTCATCGAGTATGGTGATGTTGAGGAGCGTAAGGATAAGATGGCTCTTGCCGCCATCTACCAAACAGTCTCGAA
GGACGTTCTTCTCATGTTGACAGAGAAGGACTCGACAAAGGCAGCATGGGAGACGCTGCAAACAATGCATTTGAAGAAGAATAGTGAAGGAAATGTCATCAAACATAAAG
CAAGACTCATGGAAAAGGGATACGTGCAACAACAAGGAGTTAATTTTGAGGAGGTTTTCGCGCCTGTTGCTAAACTAGGCACCGTAAGGTTGATTCTTGCTCTCGCAGCT
CAACACAAATGGGAGGTCCCTCACTTGGACATCAAAACAACATTCCTAAATGGTGACCTCCAAGAAGAAGTGTATGTTGCCCAACCTGAAGGGTTCGTCATTAAAGCCGA
AGAACACAAAGTGTACAAGTTGTCAAAGACCCTGTATGGTCTATAG
mRNA sequence
ATGCGTGTCAACTTACAAGCACAAGGCATGTGGGATGTCATCGAGTATGGTGATGTTGAGGAGCGTAAGGATAAGATGGCTCTTGCCGCCATCTACCAAACAGTCTCGAA
GGACGTTCTTCTCATGTTGACAGAGAAGGACTCGACAAAGGCAGCATGGGAGACGCTGCAAACAATGCATTTGAAGAAGAATAGTGAAGGAAATGTCATCAAACATAAAG
CAAGACTCATGGAAAAGGGATACGTGCAACAACAAGGAGTTAATTTTGAGGAGGTTTTCGCGCCTGTTGCTAAACTAGGCACCGTAAGGTTGATTCTTGCTCTCGCAGCT
CAACACAAATGGGAGGTCCCTCACTTGGACATCAAAACAACATTCCTAAATGGTGACCTCCAAGAAGAAGTGTATGTTGCCCAACCTGAAGGGTTCGTCATTAAAGCCGA
AGAACACAAAGTGTACAAGTTGTCAAAGACCCTGTATGGTCTATAG
Protein sequence
MRVNLQAQGMWDVIEYGDVEERKDKMALAAIYQTVSKDVLLMLTEKDSTKAAWETLQTMHLKKNSEGNVIKHKARLMEKGYVQQQGVNFEEVFAPVAKLGTVRLILALAA
QHKWEVPHLDIKTTFLNGDLQEEVYVAQPEGFVIKAEEHKVYKLSKTLYGL
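
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (the record does not state which translation table was used); `translate` is a hypothetical helper, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGCGTGTCAACTTACAAGCACAAGGCATG"
print(translate(cds_start))  # MRVNLQAQGM
```

Passing the full CDS (with its line breaks) to `translate` should give a 161-residue product, since the 486 nt sequence ends in the stop codon TAG.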