; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g25190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g25190
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:17849282..17853383
RNA-Seq ExpressionMoc01g25190
SyntenyMoc01g25190
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]5.1e-5841.11Show/hide
Query:  VLSSPILPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP--SSKGSSSSGELERWWKGMW
        +LSSP LP+ ++V +L++ E   W+ DV+   FTPDEA  ILSI IGRG+EED LIW++EKTGVY+V+SGYKVAL   P   +  SSSS E+  WW G W
Subjt:  VLSSPILPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP--SSKGSSSSGELERWWKGMW

Query:  QMRIPSKIKVFLWRLCLDRLPTEANLSKRGVDVQNTC-----------------------------------------NERAFKPE-------------N
        +M IP+KIKVFLWRLCLDRLPT  NLSKRGV++ N C                                         +E   K +              
Subjt:  QMRIPSKIKVFLWRLCLDRLPTEANLSKRGVDVQNTC-----------------------------------------NERAFKPE-------------N

Query:  REPDRYLTRLLPLFRLGDKCL-----YDEQYLATKRNQMA-----------TPFD------NSDASFSSANLSAGLGIIIRNFKGQVMASATMYLDHVQS
        R    +      +F++G + +     Y  ++   K N +             P D      N+DASF +++  AGLGIII N +GQVMA+AT YL+++QS
Subjt:  REPDRYLTRLLPLFRLGDKCL-----YDEQYLATKRNQMA-----------TPFD------NSDASFSSANLSAGLGIIIRNFKGQVMASATMYLDHVQS

Query:  IDEAEAFAVTEGLRLAAEIDIR---MDLPKTGEIVRNAKRHRSSMVHASFNHTRRGGNEA
        +D AEA A  EGL+LA+EI +     DL +TGEIV  AK   +  +HASFN  +R GN+A
Subjt:  IDEAEAFAVTEGLRLAAEIDIR---MDLPKTGEIVRNAKRHRSSMVHASFNHTRRGGNEA

XP_023923926.1 uncharacterized protein LOC112035327 [Quercus suber]2.8e-2432.08Show/hide
Query:  PILPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP-SSKGSSSSGELERWWKGMWQMRIP
        P L  D KV +LI+     WK + +   F P E + IL I +      D +IW H  +G YT  S YK+ ++ +  SS G S     +++WKG+WQ+R+P
Subjt:  PILPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP-SSKGSSSSGELERWWKGMWQMRIP

Query:  SKIKVFLWRLCLDRLPTEANLSKRGVDVQNTCNERAFKPENREPDRYLTRLLPLFRLGDKCLYDEQYLATKRNQMATPFDNSDASFSSANLSAGLGIIIR
        +KI+ F+WR+C + LPT  NL +R +     C      PE    D   T  L  F    +     Q +  K N  AT F +S          AG+G+I R
Subjt:  SKIKVFLWRLCLDRLPTEANLSKRGVDVQNTCNERAFKPENREPDRYLTRLLPLFRLGDKCLYDEQYLATKRNQMATPFDNSDASFSSANLSAGLGIIIR

Query:  NFKGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEIDI
        N  G+ + + +  +   QS+ + EA A  + ++ A EI +
Subjt:  NFKGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEIDI

XP_030958760.1 uncharacterized protein LOC115980671 [Quercus lobata]3.1e-2348.8Show/hide
Query:  VETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP-SSKGSSSSGE-LERWWKGMWQMRIPSKIKVFL
        V  LI+ +L +W+ D++   F P EA++IL+I I     ED LIW   + GV++VKS Y VALN    S++G SS G+ LER WK +W + IPSKIK+F 
Subjt:  VETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP-SSKGSSSSGE-LERWWKGMWQMRIPSKIKVFL

Query:  WRLCLDRLPTEANLSKRGVDVQNTC
        WR CLD LPT  NL KRG+     C
Subjt:  WRLCLDRLPTEANLSKRGVDVQNTC

XP_030969741.1 uncharacterized protein LOC115990018 [Quercus lobata]3.1e-2329.67Show/hide
Query:  PILPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQPSSKGSSSSGELE-RWWKGMWQMRIP
        P L  D KV TLI+++   WK + +   F P EA  IL I +      D +IW H  +G++T  S YK+ ++   SS   SS+ E + ++WKG+WQ+R+P
Subjt:  PILPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQPSSKGSSSSGELE-RWWKGMWQMRIP

Query:  SKIKVFLWRLCLDRLPTEANLSKRGVDVQNTCNERAFKPEN-----------------------REPDR--YLTRLLPLFRLGDKCLYDEQY------LA
        +KI+ F+W +C + LPT  NL +R +    +C      PE+                         P R    T LL  F    +    E +      L 
Subjt:  SKIKVFLWRLCLDRLPTEANLSKRGVDVQNTCNERAFKPEN-----------------------REPDR--YLTRLLPLFRLGDKCLYDEQY------LA

Query:  TKRNQMATPFDNSDASFSSANLSAGLGIIIRNFKGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEIDI-RMDLPKTGEIVRNAKRHRSSMVHASFNH
         +RN  A  F +     +S   SAG+GII RN  G+ + + +  +   QS+ + EA A  + ++ A EI + R+ +     ++ NA  H +  + ASF +
Subjt:  TKRNQMATPFDNSDASFSSANLSAGLGIIIRNFKGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEIDI-RMDLPKTGEIVRNAKRHRSSMVHASFNH

XP_030970961.1 uncharacterized protein LOC115991405 [Quercus lobata]3.1e-2348.8Show/hide
Query:  VETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP-SSKGSSSSGE-LERWWKGMWQMRIPSKIKVFL
        V  LI+ +L +W+ D++   F P EA++IL+I I     ED LIW   + GV++VKS Y VALN    S++G SS G+ LER WK +W + IPSKIK+F 
Subjt:  VETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP-SSKGSSSSGE-LERWWKGMWQMRIPSKIKVFL

Query:  WRLCLDRLPTEANLSKRGVDVQNTC
        WR CLD LPT  NL KRG+     C
Subjt:  WRLCLDRLPTEANLSKRGVDVQNTC

TrEMBL top hitse value%identityAlignment
A0A2N9I609 Uncharacterized protein6.4e-2225.22Show/hide
Query:  LPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQPSS-KGSSSSGELERWWKGMWQMRIPSK
        LP D  V  LI  E   W  ++I   F P EA+ ILS+ +   +  D+L+W  EK+G Y+V+S Y++    +P+   GSS++     +WK +W +++P K
Subjt:  LPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQPSS-KGSSSSGELERWWKGMWQMRIPSK

Query:  IKVFLWRLCLDRLPTEANLSKRGVDVQNTCNERAFKPENREPDRY-LTRLLPLF------RLGDKCLYDE------------------------------
        I+ FLWR+C + LPT  NL +R +     C     + E+     +   +LLPL+      R   +C Y                                
Subjt:  IKVFLWRLCLDRLPTEANLSKRGVDVQNTCNERAFKPENREPDRY-LTRLLPLF------RLGDKCLYDE------------------------------

Query:  ------QYLATKRNQMATPFD---NSDASFSSANLSAGLGIIIRNFKGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEIDIR---------------
              Q    +  Q  T F+   N DA+   ++ +  +G+IIR+  G  + +   +   +  +D+AEA AV E ++LA ++ +                
Subjt:  ------QYLATKRNQMATPFD---NSDASFSSANLSAGLGIIIRNFKGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEIDIR---------------

Query:  ----MDLPKTGEIVRNAKRHRSSMVHASFNHTRRGGN
            +     G+I+++  +  SS+    F+H  R  N
Subjt:  ----MDLPKTGEIVRNAKRHRSSMVHASFNHTRRGGN

A0A2N9IYL5 RNase H domain-containing protein8.9e-2429.55Show/hide
Query:  VETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNK-QPSSKGSSSSGELERWWKGMWQMRIPSKIKVFLW
        V  LI+ +L  WK  ++   F P EA +IL I +   S  D L+W   K GVY+V+SGYK  LN+      GSS    + + WK +W + +P KI+ FLW
Subjt:  VETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNK-QPSSKGSSSSGELERWWKGMWQMRIPSKIKVFLW

Query:  RLCLDRLPTEANLSKRGVDVQNTCNERAFKPENREPDRYLTRLL-PLFR---LGDKCLYDEQYLA--TKRNQMATPFDNSDASFSSANLSAGLGIIIRNF
        R C + LPT++NL  R +    +C   +++ E+     +  +++ P+++    G + L     L       Q +   +   A FS  N +AG+G+I+RN 
Subjt:  RLCLDRLPTEANLSKRGVDVQNTCNERAFKPENREPDRYLTRLL-PLFR---LGDKCLYDEQYLA--TKRNQMATPFDNSDASFSSANLSAGLGIIIRNF

Query:  KGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEID----------------IRMDLPKT---GEIVRNAKRHRSSMVHASFNHTRRGGN
        +G+VM S +  +    S++  EA A    ++ A ++                 + +  P T   G I+ + K+   S+    F HT+R GN
Subjt:  KGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEID----------------IRMDLPKT---GEIVRNAKRHRSSMVHASFNHTRRGGN

A0A5B6VYT4 Reverse transcriptase3.7e-2232.11Show/hide
Query:  TKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP--SSKGSSSSGELERWWKGMWQMRIPSKIKV
        T V  LI++    WK+DVI      D+A  ILSI + R   ED+L+W ++ TG+YTVKSGY+V +  +P  +S      G+ + ++K +W+++IPSKIK+
Subjt:  TKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP--SSKGSSSSGELERWWKGMWQMRIPSKIKV

Query:  FLWRLCLDRLPTEANLSKRGVDVQNTCNERAFKPENREP-----DRYLTRLLPLFRLGDKCLYDEQYLATK-RNQMATPFD----NSDASFSSANLSAGL
         LWRL  + +P   NL KR + ++  C      PE+ +      +RY      +  L   C+      +TK       P D      DASF   + SA  
Subjt:  FLWRLCLDRLPTEANLSKRGVDVQNTCNERAFKPENREP-----DRYLTRLLPLFRLGDKCLYDEQYLATK-RNQMATPFD----NSDASFSSANLSAGL

Query:  GIIIRNFKGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEIDIR
         ++ RN KG VM +       V     A+A A    +  A ++  R
Subjt:  GIIIRNFKGQVMASATMYLDHVQSIDEAEAFAVTEGLRLAAEIDIR

A0A6J1CNZ5 uncharacterized protein LOC1110132205.7e-2358Show/hide
Query:  FTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQPSSKGSSSSGE-LERWWKGMWQMRIPSKIKVFLWRLCLDRLPTEANLSKRGVDV
        FT DE  +ILSI +G G   D LIW+ EK G+ TVKS YK+A  + P +  S+S  E L +WWK +WQ+ +PSKIKVF WR CLDRLPT ANL  RGVDV
Subjt:  FTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQPSSKGSSSSGE-LERWWKGMWQMRIPSKIKVFLWRLCLDRLPTEANLSKRGVDV

A0A6J1DAR4 uncharacterized protein LOC1110189542.5e-5841.11Show/hide
Query:  VLSSPILPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP--SSKGSSSSGELERWWKGMW
        +LSSP LP+ ++V +L++ E   W+ DV+   FTPDEA  ILSI IGRG+EED LIW++EKTGVY+V+SGYKVAL   P   +  SSSS E+  WW G W
Subjt:  VLSSPILPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQP--SSKGSSSSGELERWWKGMW

Query:  QMRIPSKIKVFLWRLCLDRLPTEANLSKRGVDVQNTC-----------------------------------------NERAFKPE-------------N
        +M IP+KIKVFLWRLCLDRLPT  NLSKRGV++ N C                                         +E   K +              
Subjt:  QMRIPSKIKVFLWRLCLDRLPTEANLSKRGVDVQNTC-----------------------------------------NERAFKPE-------------N

Query:  REPDRYLTRLLPLFRLGDKCL-----YDEQYLATKRNQMA-----------TPFD------NSDASFSSANLSAGLGIIIRNFKGQVMASATMYLDHVQS
        R    +      +F++G + +     Y  ++   K N +             P D      N+DASF +++  AGLGIII N +GQVMA+AT YL+++QS
Subjt:  REPDRYLTRLLPLFRLGDKCL-----YDEQYLATKRNQMA-----------TPFD------NSDASFSSANLSAGLGIIIRNFKGQVMASATMYLDHVQS

Query:  IDEAEAFAVTEGLRLAAEIDIR---MDLPKTGEIVRNAKRHRSSMVHASFNHTRRGGNEA
        +D AEA A  EGL+LA+EI +     DL +TGEIV  AK   +  +HASFN  +R GN+A
Subjt:  IDEAEAFAVTEGLRLAAEIDIR---MDLPKTGEIVRNAKRHRSSMVHASFNHTRRGGNEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1)3.5e-0440.32Show/hide
Query:  VETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVA
        V  LI++    WK D +     P +   IL I   R    D   W H K+G YTVKSGY VA
Subjt:  VETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVA

AT3G09510.1 Ribonuclease H-like superfamily protein1.5e-0728.7Show/hide
Query:  WKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQPSSKGSSSS---GELERWWKGMWQMRIPSKIKVFLWRLCLDRLPT
        W +  I       +   I  I + +  + D +IW++  TG YTV+SGY + L   PS+   + +   G ++   + +W + I  K+K FLWR     L T
Subjt:  WKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQPSSKGSSSS---GELERWWKGMWQMRIPSKIKVFLWRLCLDRLPT

Query:  EANLSKRGVDVQNTC
           L+ RG+ +  +C
Subjt:  EANLSKRGVDVQNTC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCTCTCCTCGCCAATTCTCCCAGTGGATACGAAGGTTGAGACTCTAATTAACAAAGAACTAGCTCAATGGAAGGAAGATGTGATTATTGGTGCTTTTACACCAGA
TGAAGCTAATAGTATTTTATCCATCCTTATTGGTCGTGGATCTGAGGAGGACATGCTAATTTGGCACCATGAGAAAACAGGAGTTTATACTGTAAAAAGCGGTTACAAGG
TTGCTCTGAATAAGCAACCCAGTTCCAAAGGTTCGTCTTCTTCAGGGGAGTTGGAAAGGTGGTGGAAAGGCATGTGGCAAATGAGGATCCCCTCTAAAATCAAGGTTTTC
TTATGGCGGCTTTGTTTGGACAGACTCCCTACTGAAGCAAATCTTTCGAAAAGAGGTGTGGACGTTCAGAATACTTGCAATGAAAGGGCTTTCAAGCCGGAAAATAGGGA
ACCGGATCGATACCTGACTAGGCTCCTTCCTTTATTTCGACTTGGAGACAAATGCTTGTATGATGAACAATACCTAGCAACCAAGAGAAATCAGATGGCAACCCCTTTTG
ATAACTCAGATGCTTCTTTTTCTTCCGCTAATCTAAGTGCAGGGCTGGGAATAATTATTCGAAATTTTAAAGGACAGGTGATGGCAAGTGCTACGATGTACCTCGATCAT
GTGCAATCGATCGACGAAGCTGAAGCTTTTGCGGTGACAGAGGGCTTGAGGCTTGCAGCAGAAATCGACATTCGCATGGATCTTCCGAAAACAGGGGAAATAGTGAGGAA
TGCAAAGAGACACCGGTCTTCGATGGTGCATGCATCTTTTAATCACACTAGAAGAGGAGGGAATGAAGCTTTTTTGAGGGAGGATTTTCTGGCGGATTTGGCTTTGCAAG
TGGGTCGGGAGTGTGTGGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCTCTCCTCGCCAATTCTCCCAGTGGATACGAAGGTTGAGACTCTAATTAACAAAGAACTAGCTCAATGGAAGGAAGATGTGATTATTGGTGCTTTTACACCAGA
TGAAGCTAATAGTATTTTATCCATCCTTATTGGTCGTGGATCTGAGGAGGACATGCTAATTTGGCACCATGAGAAAACAGGAGTTTATACTGTAAAAAGCGGTTACAAGG
TTGCTCTGAATAAGCAACCCAGTTCCAAAGGTTCGTCTTCTTCAGGGGAGTTGGAAAGGTGGTGGAAAGGCATGTGGCAAATGAGGATCCCCTCTAAAATCAAGGTTTTC
TTATGGCGGCTTTGTTTGGACAGACTCCCTACTGAAGCAAATCTTTCGAAAAGAGGTGTGGACGTTCAGAATACTTGCAATGAAAGGGCTTTCAAGCCGGAAAATAGGGA
ACCGGATCGATACCTGACTAGGCTCCTTCCTTTATTTCGACTTGGAGACAAATGCTTGTATGATGAACAATACCTAGCAACCAAGAGAAATCAGATGGCAACCCCTTTTG
ATAACTCAGATGCTTCTTTTTCTTCCGCTAATCTAAGTGCAGGGCTGGGAATAATTATTCGAAATTTTAAAGGACAGGTGATGGCAAGTGCTACGATGTACCTCGATCAT
GTGCAATCGATCGACGAAGCTGAAGCTTTTGCGGTGACAGAGGGCTTGAGGCTTGCAGCAGAAATCGACATTCGCATGGATCTTCCGAAAACAGGGGAAATAGTGAGGAA
TGCAAAGAGACACCGGTCTTCGATGGTGCATGCATCTTTTAATCACACTAGAAGAGGAGGGAATGAAGCTTTTTTGAGGGAGGATTTTCTGGCGGATTTGGCTTTGCAAG
TGGGTCGGGAGTGTGTGGTTTGA
Protein sequenceShow/hide protein sequence
MVLSSPILPVDTKVETLINKELAQWKEDVIIGAFTPDEANSILSILIGRGSEEDMLIWHHEKTGVYTVKSGYKVALNKQPSSKGSSSSGELERWWKGMWQMRIPSKIKVF
LWRLCLDRLPTEANLSKRGVDVQNTCNERAFKPENREPDRYLTRLLPLFRLGDKCLYDEQYLATKRNQMATPFDNSDASFSSANLSAGLGIIIRNFKGQVMASATMYLDH
VQSIDEAEAFAVTEGLRLAAEIDIRMDLPKTGEIVRNAKRHRSSMVHASFNHTRRGGNEAFLREDFLADLALQVGRECVV