; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g28540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g28540
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:20530274..20534862
RNA-Seq ExpressionMoc08g28540
SyntenyMoc08g28540
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO64558.1 hypothetical protein CCACVL1_21676 [Corchorus capsularis]1.6e-2233.19Show/hide
Query:  EGRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGD------MASNLR-RVGRCTLKSLFPISENKEVRLLKPTT----NI
        E   WRFTG YG PD   R  +W L+R LH  +   W IGGD NE+    +  G          N R  +  C L  L  +     +R  +        +
Subjt:  EGRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGD------MASNLR-RVGRCTLKSLFPISENKEVRLLKPTT----NI

Query:  PLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRV
           +  L+    A+L  L    E+YWRQR+++ W K GDRN S+FH+ AS RK +N V  I    G W T+ E IE IF  YF+ +FTS+   + ++D +
Subjt:  PLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRV

Query:  LHCIPRRVTPEFITDTKHWDIPKLWQ
             R    + I   K W   ++W+
Subjt:  LHCIPRRVTPEFITDTKHWDIPKLWQ

XP_012472541.1 PREDICTED: uncharacterized protein LOC105789722 [Gossypium raimondii]1.1e-2131.22Show/hide
Query:  EGRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLRRVGR-------CTLKSLFPISE------------NKEVR
        EGR+WR TG YG P++  R+ +W+L+R L+   +  WV+ GD NE++   + KG +    R + R       C+L  +    +            N   R
Subjt:  EGRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLRRVGR-------CTLKSLFPISE------------NKEVR

Query:  LLKPTTNIPLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNP
        L +   +  +L   +  KL+ ++D   D EE+YW QR+R +W K  DRN  +FH+ A+ R ++NKV+ + D +GN   N + +  + ++YF S+FT+ N 
Subjt:  LLKPTTNIPLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNP

Query:  S--VGILDRVLHCIPRRVTPE
        S    IL+ +  CI +++  +
Subjt:  S--VGILDRVLHCIPRRVTPE

XP_022158772.1 uncharacterized protein LOC111025237 [Momordica charantia]6.2e-2249.54Show/hide
Query:  LNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLH
        L+F +IH +E DL  LL+ EE++W+QRSR+ W KWGD N  WFH++A++RK  N + GI D  G W   P+ I NIF  YFQ IFTST+P    +D +  
Subjt:  LNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLH

Query:  CIPRRVTPE
         IP R+T E
Subjt:  CIPRRVTPE

XP_024196188.1 uncharacterized protein LOC112199393 [Rosa chinensis]6.6e-2432.64Show/hide
Query:  LGPSMEGRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLRRV----------------GRCTL-----------
        +G S + + WRFTG YGQP  + R  +W L+R L  H    WVIGGDLNE+      +G +  ++R++                G+  L           
Subjt:  LGPSMEGRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLRRV----------------GRCTL-----------

Query:  -KSLFP--------------------ISENKE-----VRLLKPTTNIPLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRK
         +++FP                    +  N E     + +   TT I  +  EL  +LEA L+ LL +E ++WRQR++  W + GD N  +FH+RAS RK
Subjt:  -KSLFP--------------------ISENKE-----VRLLKPTTNIPLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRK

Query:  KKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNP
        KKN + G+ D  G W T+ E +E I   YF  +FTS+ P
Subjt:  KKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNP

XP_028113864.1 uncharacterized protein LOC114311894 [Camellia sinensis]6.2e-2228.57Show/hide
Query:  LGDLVLVGTSKRKGNGEELGPSMEG-RSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLRR--------VGRCTL
        +G+L +   S  KG+ + +  +  G  SW+FTG YG P+   R  +W+L+R L       WV  GD NE+    + K  MA+  +R        +  C L
Subjt:  LGDLVLVGTSKRKGNGEELGPSMEG-RSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLRR--------VGRCTL

Query:  KSL-----------------------FPISENKEVRLLKPTTNIPL---LNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRK
          L                          S N ++  ++    +P+    +   +  ++ ++D LL++E V W QR+R +W K  DRN ++FH +AS R+
Subjt:  KSL-----------------------FPISENKEVRLLKPTTNIPL---LNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRK

Query:  KKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLHCI--------PRRVTPEFITDTKH
        KK  + G+ + +G W + P  +E I  +YFQ +FT+ NP    +D V+ CI         +R+T  F++D  H
Subjt:  KKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLHCI--------PRRVTPEFITDTKH

TrEMBL top hitse value%identityAlignment
A0A1R3H2G0 Uncharacterized protein7.8e-2333.19Show/hide
Query:  EGRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGD------MASNLR-RVGRCTLKSLFPISENKEVRLLKPTT----NI
        E   WRFTG YG PD   R  +W L+R LH  +   W IGGD NE+    +  G          N R  +  C L  L  +     +R  +        +
Subjt:  EGRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGD------MASNLR-RVGRCTLKSLFPISENKEVRLLKPTT----NI

Query:  PLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRV
           +  L+    A+L  L    E+YWRQR+++ W K GDRN S+FH+ AS RK +N V  I    G W T+ E IE IF  YF+ +FTS+   + ++D +
Subjt:  PLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRV

Query:  LHCIPRRVTPEFITDTKHWDIPKLWQ
             R    + I   K W   ++W+
Subjt:  LHCIPRRVTPEFITDTKHWDIPKLWQ

A0A5B6VYT4 Reverse transcriptase6.6e-2227.78Show/hide
Query:  GRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNEL-SQVPKAKGDMASN-------------------------------------LRRV
        G  WR TG YG P ++ R  +WDL+RHLH  +   W++ GD NE+ S   K +G + S                                        R+
Subjt:  GRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNEL-SQVPKAKGDMASN-------------------------------------LRRV

Query:  GRCTLKSLFPISENKEVRLLKPTTNIPLL-------------------------------NFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMS
         R  L   +    N  V  L        L                               N E I +++ DL+   D+EE++W QR+R +W K GD+N S
Subjt:  GRCTLKSLFPISENKEVRLLKPTTNIPLL-------------------------------NFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMS

Query:  WFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLHCIPRRVTPEFITD-TKHWDIPKLWQFVNS
        +FHR A  R K+N++ GI D +G WVT  + + N+  +YF  +FT++N S G+  R+L  + +R+T +   +  + +   ++WQ V S
Subjt:  WFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLHCIPRRVTPEFITD-TKHWDIPKLWQFVNS

A0A6J1DY29 uncharacterized protein LOC1110252373.0e-2249.54Show/hide
Query:  LNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLH
        L+F +IH +E DL  LL+ EE++W+QRSR+ W KWGD N  WFH++A++RK  N + GI D  G W   P+ I NIF  YFQ IFTST+P    +D +  
Subjt:  LNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLH

Query:  CIPRRVTPE
         IP R+T E
Subjt:  CIPRRVTPE

A0A7N2KWB9 zf-RVT domain-containing protein3.0e-2234.38Show/hide
Query:  GLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLRRVGRCTLKSLFPISENKEVRLLKPTTNIPLLNFELIHKLEADLDRLL
        G YG+ ++Q R  TWDL++HL     A W+  GD NE+                            SE KE  L            E I +++ +++ +L
Subjt:  GLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLRRVGRCTLKSLFPISENKEVRLLKPTTNIPLLNFELIHKLEADLDRLL

Query:  DEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLHCIPRRVTPE
        + +E++WRQRSR +W K GD+N  +FH+RAS R++KN + GI D  G W  + +GI     +YF  +F STNPS   ++ VL+ + + VTPE
Subjt:  DEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLHCIPRRVTPE

A0A803P941 Uncharacterized protein8.7e-2228.16Show/hide
Query:  GRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLR-------------------------------------RVG
        G SWRFTG YG PD   RK TW L+  L    +  W+ GGD NE+    + K   +S  +                                     R+ 
Subjt:  GRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLR-------------------------------------RVG

Query:  RCTLKSLFPISENKEVRLLKPT-----------TNIPLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDT
         C  + L   ++ K+  L   T           +++  ++++   ++E DL+   D+EE+ W+QRSR  W   GDRN  +FH +AS RKKKN + G+ D 
Subjt:  RCTLKSLFPISENKEVRLLKPT-----------TNIPLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDT

Query:  KGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLHCIPRRVT
        +  W    + IE I   Y+  +F+S+ P+  +++ +  C+P R++
Subjt:  KGNWVTNPEGIENIFSNYFQSIFTSTNPSVGILDRVLHCIPRRVT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGGGTACCAGAAGATCGGCGAGAGGGTCCGCAGAAAGGAGGTAACGCCGGATTTGATGATGGCCAAACCCCTGCGACGGCACCGGAGCAACAGGTTGCT
TCTAGAATGGAATCACTATTGAATACTGCTGACTTACCAATTTGGGCTGGCTTATTGATGGACAGCTGTCAAACTGCTGAAGGAGAAAATCAAGAGAGGCATTTA
CCACCTATAGCTGTAATCCATGCCCCACCTACATCAAACAAAGTTGCGAGGCCCATGGCGGCTCCAACTGAATCAAAGAAAGTTATGTCATGGAAAAAAGGGGCT
CGTGGAAATATGAGATCAAATGAGGAATGTCTCGGAGATCTAGTCTTAGTTGGAACTTCTAAAAGGAAAGGCAATGGAGAGGAACTGGGTCCTTCGATGGAGGGT
AGAAGTTGGCGGTTCACAGGTTTGTATGGTCAACCAGACCAACAAGCTAGGAAATTCACATGGGATTTGATTCGACACTTACATTGTCATGATGAAGCTGCTTGG
GTGATCGGGGGTGACCTCAACGAGCTCAGTCAAGTTCCTAAGGCAAAAGGAGATATGGCTTCAAATTTGAGGAGAGTTGGGCGATGCACCCTAAAGAGTTTGTTC
CCCATATCAGAAAACAAAGAAGTGCGATTGCTGAAGCCAACAACCAACATCCCCCTGCTAAACTTTGAGCTCATCCACAAACTGGAGGCTGATCTTGACAGACTT
CTTGACGAGGAGGAAGTGTACTGGAGGCAGAGATCCAGGAAAAGCTGGTTCAAATGGGGCGATAGAAACATGTCGTGGTTTCATAGAAGGGCTTCCATTCGAAAA
AAGAAGAATAAGGTGAAGGGTATCTCAGACACTAAAGGAAATTGGGTGACTAATCCTGAGGGGATAGAGAACATTTTCTCTAACTATTTCCAATCGATTTTCACC
TCCACCAATCCCTCAGTTGGTATTCTGGATCGTGTCCTCCACTGCATCCCTCGAAGAGTCACTCCAGAGTTTATTACTGACACGAAGCATTGGGATATCCCGAAG
TTATGGCAGTTTGTTAACAGTGACTTAGAGAATGACTTCAACCAATCTGTCCAAGACCGTTTCCTTCTTCTTCGTAAGGTTTTATCTGCGGAGGATTTTATGTTG
GTATGTGTTAGCTGTTGGATAATCTGGACGAACAGAAATTCAATCAGACTTAGCAAACCCGTACCTGATCCTAATACCATATGTGAATGGATCCATTCCTATATG
AAGGAGATTGTGGGTGGATTCGCTGCTAGACAAGAGGAGGAGAAGTGCAGATTTACGCCCTGCCCTGAGGAAGGTGAAGCTCGCCGGAAATGGACACCACCACCG
GTAGGATGGGTAAAATTAAATGTTGATGCTGCGTCTGAGCTTAAGGCTATTCGGGAGAGTCTCCAATTAGTGGCTCACCTGAATGTTCATTTGGTACAAATCCAG
TCTGATTCAGCCCAGGCAATAGATCTGACCTTGAAGAGGAGGGTGAATAATTTGGAACCTGGAATATGGGTGGAAGACATCATTGATTTGGCGAAGGCTTTCCCC
CAAGTTGTGTTCTTGCATGTGCGAAGGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGGGGTACCAGAAGATCGGCGAGAGGGTCCGCAGAAAGGAGGTAACGCCGGATTTGATGATGGCCAAACCCCTGCGACGGCACCGGAGCAACAGGTTGCT
TCTAGAATGGAATCACTATTGAATACTGCTGACTTACCAATTTGGGCTGGCTTATTGATGGACAGCTGTCAAACTGCTGAAGGAGAAAATCAAGAGAGGCATTTA
CCACCTATAGCTGTAATCCATGCCCCACCTACATCAAACAAAGTTGCGAGGCCCATGGCGGCTCCAACTGAATCAAAGAAAGTTATGTCATGGAAAAAAGGGGCT
CGTGGAAATATGAGATCAAATGAGGAATGTCTCGGAGATCTAGTCTTAGTTGGAACTTCTAAAAGGAAAGGCAATGGAGAGGAACTGGGTCCTTCGATGGAGGGT
AGAAGTTGGCGGTTCACAGGTTTGTATGGTCAACCAGACCAACAAGCTAGGAAATTCACATGGGATTTGATTCGACACTTACATTGTCATGATGAAGCTGCTTGG
GTGATCGGGGGTGACCTCAACGAGCTCAGTCAAGTTCCTAAGGCAAAAGGAGATATGGCTTCAAATTTGAGGAGAGTTGGGCGATGCACCCTAAAGAGTTTGTTC
CCCATATCAGAAAACAAAGAAGTGCGATTGCTGAAGCCAACAACCAACATCCCCCTGCTAAACTTTGAGCTCATCCACAAACTGGAGGCTGATCTTGACAGACTT
CTTGACGAGGAGGAAGTGTACTGGAGGCAGAGATCCAGGAAAAGCTGGTTCAAATGGGGCGATAGAAACATGTCGTGGTTTCATAGAAGGGCTTCCATTCGAAAA
AAGAAGAATAAGGTGAAGGGTATCTCAGACACTAAAGGAAATTGGGTGACTAATCCTGAGGGGATAGAGAACATTTTCTCTAACTATTTCCAATCGATTTTCACC
TCCACCAATCCCTCAGTTGGTATTCTGGATCGTGTCCTCCACTGCATCCCTCGAAGAGTCACTCCAGAGTTTATTACTGACACGAAGCATTGGGATATCCCGAAG
TTATGGCAGTTTGTTAACAGTGACTTAGAGAATGACTTCAACCAATCTGTCCAAGACCGTTTCCTTCTTCTTCGTAAGGTTTTATCTGCGGAGGATTTTATGTTG
GTATGTGTTAGCTGTTGGATAATCTGGACGAACAGAAATTCAATCAGACTTAGCAAACCCGTACCTGATCCTAATACCATATGTGAATGGATCCATTCCTATATG
AAGGAGATTGTGGGTGGATTCGCTGCTAGACAAGAGGAGGAGAAGTGCAGATTTACGCCCTGCCCTGAGGAAGGTGAAGCTCGCCGGAAATGGACACCACCACCG
GTAGGATGGGTAAAATTAAATGTTGATGCTGCGTCTGAGCTTAAGGCTATTCGGGAGAGTCTCCAATTAGTGGCTCACCTGAATGTTCATTTGGTACAAATCCAG
TCTGATTCAGCCCAGGCAATAGATCTGACCTTGAAGAGGAGGGTGAATAATTTGGAACCTGGAATATGGGTGGAAGACATCATTGATTTGGCGAAGGCTTTCCCC
CAAGTTGTGTTCTTGCATGTGCGAAGGAATTGA
Protein sequenceShow/hide protein sequence
MKGVPEDRREGPQKGGNAGFDDGQTPATAPEQQVASRMESLLNTADLPIWAGLLMDSCQTAEGENQERHLPPIAVIHAPPTSNKVARPMAAPTESKKVMSWKKGA
RGNMRSNEECLGDLVLVGTSKRKGNGEELGPSMEGRSWRFTGLYGQPDQQARKFTWDLIRHLHCHDEAAWVIGGDLNELSQVPKAKGDMASNLRRVGRCTLKSLF
PISENKEVRLLKPTTNIPLLNFELIHKLEADLDRLLDEEEVYWRQRSRKSWFKWGDRNMSWFHRRASIRKKKNKVKGISDTKGNWVTNPEGIENIFSNYFQSIFT
STNPSVGILDRVLHCIPRRVTPEFITDTKHWDIPKLWQFVNSDLENDFNQSVQDRFLLLRKVLSAEDFMLVCVSCWIIWTNRNSIRLSKPVPDPNTICEWIHSYM
KEIVGGFAARQEEEKCRFTPCPEEGEARRKWTPPPVGWVKLNVDAASELKAIRESLQLVAHLNVHLVQIQSDSAQAIDLTLKRRVNNLEPGIWVEDIIDLAKAFP
QVVFLHVRRN