; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh00G001430 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh00G001430
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr00:25788762..25789304
RNA-Seq ExpressionCmoCh00G001430
SyntenyCmoCh00G001430
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055932.1 pol protein [Cucumis melo var. makuwa]7.5e-6568.85Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        + HG W +L +V+D R+   S+ +  VV+E+ DVFPDELP LPP +++DFAIELEP TTPIS+APYRMAP ELKELK QLQ+LLD GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDG++RLCIDYRELNKVT+KN Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRI++   P+     R    + V+MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

KAA0056047.1 reverse transcriptase [Cucumis melo var. makuwa]9.8e-6567.76Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        +  G W +LG+V+DTR+   S+ +  VV+E+ DVFPDELP LPP +++DFAIELEP T PIS+APYRMAPVELKELK QLQ+LLD GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDG++RLCIDYRELNKVT+KN+Y LP+IDDLFDQLQGA+VFSKIDL+SGYHQLRI++   P+     R    + ++MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

KAA0063793.1 pol protein [Cucumis melo var. makuwa]3.7e-6467.76Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        + HG W +L +V+D R+   S+ +  VV+E+ DVFPD+LP LPP +++DFAIELEP T PIS+APYRMAP ELKELK QLQ+LLD GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDG++RLCIDYRELNKVT+KN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRI++   P+     R    + V+MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

XP_022931734.1 uncharacterized protein LOC111437896 [Cucurbita moschata]9.8e-6569.4Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        +  GAWA+L +V    K +  V +V VV EF+DVFP+ELP LPP +++DF I+LEP TTPISK PYRMAP ELKELK QLQ+LLD GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDGT+RLCIDYRELNKVTIKNKY LPRIDDLFDQLQGA+VFSKIDLRSGYHQ+RI+E   P+     R    + ++MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

XP_022962669.1 uncharacterized protein LOC111463090 [Cucurbita moschata]4.9e-6470.49Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        +GHGA A L +V++  +    V  V VV EF DVFPD+LP LPP ++++F IELEP TTPISKAPYRMAP ELKELK QLQ+LL+ GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDGTLRLCIDYRELNKVTIKNKY LPRIDDLFDQLQGA+VFSKIDLRSGYHQ+R+KE   P+     R    + V+MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

TrEMBL top hitse value%identityAlignment
A0A5A7UJ81 Reverse transcriptase3.6e-6568.85Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        + HG W +L +V+D R+   S+ +  VV+E+ DVFPDELP LPP +++DFAIELEP TTPIS+APYRMAP ELKELK QLQ+LLD GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDG++RLCIDYRELNKVT+KN Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRI++   P+     R    + V+MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

A0A5A7UN68 Reverse transcriptase4.8e-6567.76Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        +  G W +LG+V+DTR+   S+ +  VV+E+ DVFPDELP LPP +++DFAIELEP T PIS+APYRMAPVELKELK QLQ+LLD GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDG++RLCIDYRELNKVT+KN+Y LP+IDDLFDQLQGA+VFSKIDL+SGYHQLRI++   P+     R    + ++MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

A0A5A7V6R2 Reverse transcriptase1.8e-6467.76Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        + HG W +L +V+D R+   S+ +  VV+E+ DVFPD+LP LPP +++DFAIELEP T PIS+APYRMAP ELKELK QLQ+LLD GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDG++RLCIDYRELNKVT+KN+Y LPRIDDLFDQLQGA+VFSKIDLRSGYHQLRI++   P+     R    + V+MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

A0A6J1EV26 Reverse transcriptase4.8e-6569.4Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        +  GAWA+L +V    K +  V +V VV EF+DVFP+ELP LPP +++DF I+LEP TTPISK PYRMAP ELKELK QLQ+LLD GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDGT+RLCIDYRELNKVTIKNKY LPRIDDLFDQLQGA+VFSKIDLRSGYHQ+RI+E   P+     R    + ++MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

A0A6J1HHR1 uncharacterized protein LOC1114630902.4e-6470.49Show/hide
Query:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV
        +GHGA A L +V++  +    V  V VV EF DVFPD+LP LPP ++++F IELEP TTPISKAPYRMAP ELKELK QLQ+LL+ GFIRPSVSPWGAPV
Subjt:  MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPV

Query:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS
        L VKKKDGTLRLCIDYRELNKVTIKNKY LPRIDDLFDQLQGA+VFSKIDLRSGYHQ+R+KE   P+     R    + V+MS
Subjt:  LSVKKKDGTLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPR-----RLSGADMVIMS

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.5e-2034.78Show/hide
Query:  VVQEFKDVFPD-ELPRLP-PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDGTLRLCIDYRELNKVTI
        + +EFKD+  +    +LP P+K ++F +EL      +    Y + P +++ + +++   L  G IR S +    PV+ V KK+GTLR+ +DY+ LNK   
Subjt:  VVQEFKDVFPD-ELPRLP-PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDGTLRLCIDYRELNKVTI

Query:  KNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKE
         N Y LP I+ L  ++QG+++F+K+DL+S YH +R+++
Subjt:  KNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKE

P0CT41 Transposon Tf2-12 polyprotein4.5e-2034.78Show/hide
Query:  VVQEFKDVFPD-ELPRLP-PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDGTLRLCIDYRELNKVTI
        + +EFKD+  +    +LP P+K ++F +EL      +    Y + P +++ + +++   L  G IR S +    PV+ V KK+GTLR+ +DY+ LNK   
Subjt:  VVQEFKDVFPD-ELPRLP-PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDGTLRLCIDYRELNKVTI

Query:  KNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKE
         N Y LP I+ L  ++QG+++F+K+DL+S YH +R+++
Subjt:  KNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKE

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.5e-2540.67Show/hide
Query:  TRKDTRSVMNVLVVQEFKDVFPDELPRLP------PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDG
        + KDT   + V + Q+++++  ++LP  P      PVK     IE++P        PY +     +E+ + +Q LLD  FI PS SP  +PV+ V KKDG
Subjt:  TRKDTRSVMNVLVVQEFKDVFPDELPRLP------PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDG

Query:  TLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQL
        T RLC+DYR LNK TI + + LPRID+L  ++  A +F+ +DL SGYHQ+
Subjt:  TLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQL

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.5e-2540.67Show/hide
Query:  TRKDTRSVMNVLVVQEFKDVFPDELPRLP------PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDG
        + KDT   + V + Q+++++  ++LP  P      PVK     IE++P        PY +     +E+ + +Q LLD  FI PS SP  +PV+ V KKDG
Subjt:  TRKDTRSVMNVLVVQEFKDVFPDELPRLP------PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDG

Query:  TLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQL
        T RLC+DYR LNK TI + + LPRID+L  ++  A +F+ +DL SGYHQ+
Subjt:  TLRLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQL

Q9UR07 Transposon Tf2-11 polyprotein4.5e-2034.78Show/hide
Query:  VVQEFKDVFPD-ELPRLP-PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDGTLRLCIDYRELNKVTI
        + +EFKD+  +    +LP P+K ++F +EL      +    Y + P +++ + +++   L  G IR S +    PV+ V KK+GTLR+ +DY+ LNK   
Subjt:  VVQEFKDVFPD-ELPRLP-PVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDGTLRLCIDYRELNKVTI

Query:  KNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKE
         N Y LP I+ L  ++QG+++F+K+DL+S YH +R+++
Subjt:  KNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCACGGAGCTTGGGCCCTATTGGGCAACGTAATGGACACGAGGAAAGATACTCGTAGTGTGATGAACGTGCTTGTAGTACAAGAGTTTAAAGATGTGTTTCCAGA
TGAACTGCCTAGGTTACCACCCGTCAAGGACATGGACTTTGCAATCGAACTGGAACCTAGTACAACTCCAATCTCCAAGGCACCTTACAGAATGGCTCCTGTCGAGTTGA
AGGAACTTAAGGAACAATTGCAGGATCTCTTAGATTGGGGTTTTATCAGACCAAGTGTATCGCCATGGGGAGCCCCAGTGCTTTCTGTCAAGAAGAAGGATGGGACCTTG
AGATTGTGCATTGACTATAGGGAATTGAATAAAGTAACAATAAAAAACAAATACTCGTTGCCTCGCATTGATGATCTATTTGATCAATTGCAAGGTGCTTCAGTATTTTC
AAAAATTGACCTACGATCGGGATACCATCAGTTGAGGATAAAGGAGGTGATATCCCCAAGACGGCTTTCAGGAGCCGATATGGTCATTATGAGTTTACCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCACGGAGCTTGGGCCCTATTGGGCAACGTAATGGACACGAGGAAAGATACTCGTAGTGTGATGAACGTGCTTGTAGTACAAGAGTTTAAAGATGTGTTTCCAGA
TGAACTGCCTAGGTTACCACCCGTCAAGGACATGGACTTTGCAATCGAACTGGAACCTAGTACAACTCCAATCTCCAAGGCACCTTACAGAATGGCTCCTGTCGAGTTGA
AGGAACTTAAGGAACAATTGCAGGATCTCTTAGATTGGGGTTTTATCAGACCAAGTGTATCGCCATGGGGAGCCCCAGTGCTTTCTGTCAAGAAGAAGGATGGGACCTTG
AGATTGTGCATTGACTATAGGGAATTGAATAAAGTAACAATAAAAAACAAATACTCGTTGCCTCGCATTGATGATCTATTTGATCAATTGCAAGGTGCTTCAGTATTTTC
AAAAATTGACCTACGATCGGGATACCATCAGTTGAGGATAAAGGAGGTGATATCCCCAAGACGGCTTTCAGGAGCCGATATGGTCATTATGAGTTTACCGTGA
Protein sequenceShow/hide protein sequence
MGHGAWALLGNVMDTRKDTRSVMNVLVVQEFKDVFPDELPRLPPVKDMDFAIELEPSTTPISKAPYRMAPVELKELKEQLQDLLDWGFIRPSVSPWGAPVLSVKKKDGTL
RLCIDYRELNKVTIKNKYSLPRIDDLFDQLQGASVFSKIDLRSGYHQLRIKEVISPRRLSGADMVIMSLP