; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019103 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019103
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold20:1486448..1486892
RNA-Seq ExpressionMS019103
SyntenyMS019103
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]3.9e-4355.56Show/hide
Query:  QATPCYSMSCFRLPKNLIHEINSLIARFWWDNDRENK-IHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        QA P YSMSCFR+PK L  E+N ++ARFWW   ++ + IH + W+ LCK K  GG+GFRDLE FN+ALLAKQ WRIL  P S++A++ + RY     FL 
Subjt:  QATPCYSMSCFRLPKNLIHEINSLIARFWWDNDRENK-IHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFP
        A+VG NPSFI RSL WG+ LL +G RWR+GNG ++++Y D W P
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFP

XP_022145148.1 uncharacterized protein LOC111014662 [Momordica charantia]1.1e-4561.27Show/hide
Query:  QATPCYSMSCFRLPKNLIHEINSLIARFWW-DNDRENKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        QA P YSMSCFR P NL  EINSL ARFWW  NDRE KIH   WK LC  K QGG+GF+DL +FN+A+LAKQ W+I+  PNS+L +VL+G+YFK G F++
Subjt:  QATPCYSMSCFRLPKNLIHEINSLIARFWW-DNDRENKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNW
        A++G NPSF+ RS+LWG+ L  +G RWRIGNG +V I  D W
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNW

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]6.8e-5668.24Show/hide
Query:  QATPCYSMSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        QA PCY+MSCFRLPK LI E + + ARFWW + +E+ KIH + W +L  PKC+GGMGFRDLELFNKALLAKQ WRILN PNSML++VLKGRYFKD +F+ 
Subjt:  QATPCYSMSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS
        AK+ GNPS+I RS+LWGR LL +G RWRIGNG +V IY DNW PN P+
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS

XP_022159001.1 uncharacterized protein LOC111025446 [Momordica charantia]1.1e-4865.96Show/hide
Query:  MSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLSAKVGGNP
        MSCFRLPK  I+E++ + ARFWW +  E+ KIH +GW TL KPK  GGMGFRDLELFNKALLAKQ WRILN P+S+LA+VLKGRYF++   L A++ G+P
Subjt:  MSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLSAKVGGNP

Query:  SFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS
        SFI RSL+WG  LL +G RWRIGNG  V +YRDNW PN PS
Subjt:  SFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS

XP_030505362.1 uncharacterized protein LOC115720349 [Cannabis sativa]1.7e-4356.94Show/hide
Query:  QATPCYSMSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        QA P YSMSCFRLPK LIH ++SL A FWW + +EN KIH   W  LCKPK +GG+GFR L  FN+ALLAKQGWR+++ P+S+LA+VLK  Y+ + +FL 
Subjt:  QATPCYSMSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFP
        AK     S I + + WGR ++ EG RWR+GNG TVRI+ D W P
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFP

TrEMBL top hitse value%identityAlignment
A0A6J1CV63 uncharacterized protein LOC1110146625.3e-4661.27Show/hide
Query:  QATPCYSMSCFRLPKNLIHEINSLIARFWW-DNDRENKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        QA P YSMSCFR P NL  EINSL ARFWW  NDRE KIH   WK LC  K QGG+GF+DL +FN+A+LAKQ W+I+  PNS+L +VL+G+YFK G F++
Subjt:  QATPCYSMSCFRLPKNLIHEINSLIARFWW-DNDRENKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNW
        A++G NPSF+ RS+LWG+ L  +G RWRIGNG +V I  D W
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNW

A0A6J1DAR4 uncharacterized protein LOC1110189543.3e-5668.24Show/hide
Query:  QATPCYSMSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        QA PCY+MSCFRLPK LI E + + ARFWW + +E+ KIH + W +L  PKC+GGMGFRDLELFNKALLAKQ WRILN PNSML++VLKGRYFKD +F+ 
Subjt:  QATPCYSMSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS
        AK+ GNPS+I RS+LWGR LL +G RWRIGNG +V IY DNW PN P+
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS

A0A6J1E2L3 uncharacterized protein LOC1110254465.1e-4965.96Show/hide
Query:  MSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLSAKVGGNP
        MSCFRLPK  I+E++ + ARFWW +  E+ KIH +GW TL KPK  GGMGFRDLELFNKALLAKQ WRILN P+S+LA+VLKGRYF++   L A++ G+P
Subjt:  MSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLSAKVGGNP

Query:  SFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS
        SFI RSL+WG  LL +G RWRIGNG  V +YRDNW PN PS
Subjt:  SFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS

A0A803NGK5 Uncharacterized protein1.7e-4458.33Show/hide
Query:  QATPCYSMSCFRLPKNLIHEINSLIARFWW-DNDRENKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        QA P Y MSCFRLPK LI++I+ ++ARFWW  +D ++KIH   WK LCKPK +GGMGF+DLE FN+ALLAKQGW+I+N P+SMLA+VLK  Y+ + +FL 
Subjt:  QATPCYSMSCFRLPKNLIHEINSLIARFWW-DNDRENKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFP
        AKVGG  SF+ RS+LWGR ++ +G RWR+  G  + I  D W P
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFP

A0A803PAN7 Uncharacterized protein2.6e-4557.93Show/hide
Query:  QATPCYSMSCFRLPKNLIHEINSLIARFWW-DNDRENKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        QA P Y MSCFRLPK LI +I++++ARFWW  ++ +NKIH   W+ LCKPK +GGMGF+DLE FN++LLAKQGW+I+N P+S+LAQVLK  YF + TF+ 
Subjt:  QATPCYSMSCFRLPKNLIHEINSLIARFWW-DNDRENKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPN
        AK+GG  SF+ RS+LWGR ++  G RWR+ +G  VRI  D W P+
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPN

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.8e-1934.23Show/hide
Query:  PCYSMSCFRLPKNLIHEINSLIARFWWDNDRE-NKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRY----FKDGTFL
        P +SMS   LP+++++ ++ L   F W +  E  K H + W  +C PK +GG+G R  +  N+AL++K GWR+L E NS+   VL+ +Y     +D  +L
Subjt:  PCYSMSCFRLPKNLIHEINSLIARFWWDNDRE-NKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRY----FKDGTFL

Query:  SAKVGGNPSFILRSLLWG-RSLLTEGGRWRIGNGGTVRIYRDNWFPNYP
          K  G+ S   RS+  G R +++ G  W  G+G  +R + D W    P
Subjt:  SAKVGGNPSFILRSLLWG-RSLLTEGGRWRIGNGGTVRIYRDNWFPNYP

P93295 Uncharacterized mitochondrial protein AtMg003102.3e-3042.96Show/hide
Query:  ATPCYSMSCFRLPKNLIHEINSLIARFWWDN-DRENKIHRIGWKTLCKPK-CQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        A P Y+MSCFRL K L  ++ S +  FWW + + + KI  + W+ LCK K   GG+GFRDL  FN+ALLAKQ +RI+++P+++L+++L+ RYF   + + 
Subjt:  ATPCYSMSCFRLPKNLIHEINSLIARFWWDN-DRENKIHRIGWKTLCKPK-CQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNW
          VG  PS+  RS++ GR LL+ G    IG+G   +++ D W
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNW

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein4.0e-0645Show/hide
Query:  LKGRYFKDGTFLSAKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYP
        +K RYFKD + L AKV    S+   SLL G +LL +G R  IG+G  +RI  DN   ++P
Subjt:  LKGRYFKDGTFLSAKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYP

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.9e-0834.58Show/hide
Query:  MSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEP---NSMLAQVLKGRYFKDGTFLSAKV-
        MS FRLP   I EI+S+ + F W     N K  ++ W  +C PK +GG+G R L+  NK       W I       + M  ++LK R    G F+   + 
Subjt:  MSCFRLPKNLIHEINSLIARFWWDNDREN-KIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEP---NSMLAQVLKGRYFKDGTFLSAKV-

Query:  -GGNPSF
         G N SF
Subjt:  -GGNPSF

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.0e-0628.97Show/hide
Query:  LCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKD------GTFLSAKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRD
        +C PK +GG+G R    +N  L  K  WR+ +   S+     +  + +         F +++   + S+  + LL  R L     R  IGNG T R + D
Subjt:  LCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKD------GTFLSAKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRD

Query:  NWFPNYP
        NW P  P
Subjt:  NWFPNYP

AT4G29090.1 Ribonuclease H-like superfamily protein8.6e-3342.86Show/hide
Query:  ATPCYSMSCFRLPKNLIHEINSLIARFWWDNDRENK-IHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLSA
        A P Y+M+CF LPK +  +I S++A FWW N +E K +H   W  L   K +GG+GF+D+E FN ALL KQ WR+L+ P S++A+V K RYF     L+A
Subjt:  ATPCYSMSCFRLPKNLIHEINSLIARFWWDNDRENK-IHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLSA

Query:  KVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS
         +G  PSF+ +S+   + +L +G R  +GNG  + I+R  W  + P+
Subjt:  KVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-3142.96Show/hide
Query:  ATPCYSMSCFRLPKNLIHEINSLIARFWWDN-DRENKIHRIGWKTLCKPK-CQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS
        A P Y+MSCFRL K L  ++ S +  FWW + + + KI  + W+ LCK K   GG+GFRDL  FN+ALLAKQ +RI+++P+++L+++L+ RYF   + + 
Subjt:  ATPCYSMSCFRLPKNLIHEINSLIARFWWDN-DRENKIHRIGWKTLCKPK-CQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLS

Query:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNW
          VG  PS+  RS++ GR LL+ G    IG+G   +++ D W
Subjt:  AKVGGNPSFILRSLLWGRSLLTEGGRWRIGNGGTVRIYRDNW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CAGGCAACCCCTTGTTACTCTATGTCATGTTTCAGGCTCCCCAAAAATCTTATTCATGAGATCAATTCCTTAATAGCTCGGTTTTGGTGGGACAATGATAGGGAGAATAA
AATCCATAGGATTGGATGGAAGACCTTATGTAAGCCAAAATGTCAAGGGGGAATGGGATTCCGTGACCTTGAACTTTTCAATAAGGCTTTGCTCGCTAAGCAGGGATGGA
GAATATTGAATGAACCTAACTCTATGTTGGCGCAGGTCCTTAAGGGACGATACTTCAAAGACGGTACTTTCCTTTCGGCCAAGGTGGGTGGGAACCCATCCTTTATCTTG
AGAAGCTTGTTGTGGGGAAGATCGTTGCTCACAGAGGGGGGTCGTTGGCGCATTGGGAATGGGGGGACGGTGAGGATTTATAGGGATAACTGGTTTCCAAATTATCCATC
C
mRNA sequenceShow/hide mRNA sequence
CAGGCAACCCCTTGTTACTCTATGTCATGTTTCAGGCTCCCCAAAAATCTTATTCATGAGATCAATTCCTTAATAGCTCGGTTTTGGTGGGACAATGATAGGGAGAATAA
AATCCATAGGATTGGATGGAAGACCTTATGTAAGCCAAAATGTCAAGGGGGAATGGGATTCCGTGACCTTGAACTTTTCAATAAGGCTTTGCTCGCTAAGCAGGGATGGA
GAATATTGAATGAACCTAACTCTATGTTGGCGCAGGTCCTTAAGGGACGATACTTCAAAGACGGTACTTTCCTTTCGGCCAAGGTGGGTGGGAACCCATCCTTTATCTTG
AGAAGCTTGTTGTGGGGAAGATCGTTGCTCACAGAGGGGGGTCGTTGGCGCATTGGGAATGGGGGGACGGTGAGGATTTATAGGGATAACTGGTTTCCAAATTATCCATC
C
Protein sequenceShow/hide protein sequence
QATPCYSMSCFRLPKNLIHEINSLIARFWWDNDRENKIHRIGWKTLCKPKCQGGMGFRDLELFNKALLAKQGWRILNEPNSMLAQVLKGRYFKDGTFLSAKVGGNPSFIL
RSLLWGRSLLTEGGRWRIGNGGTVRIYRDNWFPNYPS