; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g09800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g09800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:7143598..7144775
RNA-Seq ExpressionMoc06g09800
SyntenyMoc06g09800
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG50387.1 hypothetical protein EZV62_022911 [Acer yangbiense]1.3e-1627.07Show/hide
Query:  DKLIWHYDRRGQYNIKSGYRLAQRGIVHGLSSPSSLES---RLQW------------WKGFG----NSNSLVKLRFSDGGYALTVYQRCSKFSH-LLHLL
        D L WH+D+RG Y ++SGY++A   +  G+ S S   S   R  W            WK F        +L + R     Y     Q C+  S  + H+L
Subjt:  DKLIWHYDRRGQYNIKSGYRLAQRGIVHGLSSPSSLES---RLQW------------WKGFG----NSNSLVKLRFSDGGYALTVYQRCSKFSH-LLHLL

Query:  W----------------VLARSDIGHFMRVWTDMVSWQHIGAIVMLL-----WAIWNARNQTPQHFSLGGLLSDIV------------------------
        W                V+ R  +  F  +   +  W+ + ++V  L     W +W  RN    H S G   +D+V                        
Subjt:  W----------------VLARSDIGHFMRVWTDMVSWQHIGAIVMLL-----WAIWNARNQTPQHFSLGGLLSDIV------------------------

Query:  --SCASP-QGE----CGCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEV
          S  +P +GE    C  SFQ  S   GV VIIRD  G        P+     V+ +E  A  EGI LA++ G     IE+D+  +  LL+   V   E+
Subjt:  --SCASP-QGE----CGCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEV

Query:  GTLYFVIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVD
        G +     L L +    +S+   RR+ N+ AH +AQ AL+     +W EE P +I+ +   D
Subjt:  GTLYFVIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVD

XP_022135942.1 uncharacterized protein LOC111007775 [Momordica charantia]6.4e-2734.35Show/hide
Query:  DMVSWQHIGAIVMLLWAIWNARNQTPQHFSLGGLLSDIVSCASP------------------QGECG------------------CSFQKESFVVGVSVI
        D + W  +  + + LWAIWNARNQ+      G LL ++V   +                    G  G                   +F+K +F  G+ ++
Subjt:  DMVSWQHIGAIVMLLWAIWNARNQTPQHFSLGGLLSDIVSCASP------------------QGECG------------------CSFQKESFVVGVSVI

Query:  IRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHGMWVSFSFTRRDGNAAAH
        IRDS   V L+ +  +    DV   E  A  EG+ LA+EAG I FQIETDS ++FNLL  DC D+ E+G L   I+  +SS  +   FSF  R+GN+ AH
Subjt:  IRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHGMWVSFSFTRRDGNAAAH

Query:  LLAQLALTSPHLQIWVEEWPDEISLMFAVD
         LA++ + S    +WVEEW  ++S + A D
Subjt:  LLAQLALTSPHLQIWVEEWPDEISLMFAVD

XP_022139684.1 uncharacterized protein LOC111010533 [Momordica charantia]2.2e-1429.65Show/hide
Query:  HFMRVWTDMVSWQHIGAIVMLLWAIWNARN------QTPQHFSLGGLLSDIVS-----------------------------------CASPQG----EC
        + +R W DM++W+    +V+ LW++WN RN      +  +   L G +S  ++                                   C + +G    + 
Subjt:  HFMRVWTDMVSWQHIGAIVMLLWAIWNARN------QTPQHFSLGGLLSDIVS-----------------------------------CASPQG----EC

Query:  GCSFQKESFVVGVSV-IIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHG
          SF    F  G+ V IIRD  G V  +  + L     VD  E  A  EG+ +A+E G     +ETDSLRI+NL   D     + G++   +K  L++  
Subjt:  GCSFQKESFVVGVSV-IIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHG

Query:  MWVSFSFTRRDGNAAAHLLAQLALTS
        + VS+SFT+R GN  AHLLA+ AL S
Subjt:  MWVSFSFTRRDGNAAAHLLAQLALTS

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]4.3e-1524.74Show/hide
Query:  DSTLAIRRLPIPLGVSPLDKLIWHYDRRGQYNIKSGYRLA--QRGIVHGLSSPSSLESRLQWWKGFGNSNSLVKLR----------------FSDGGYAL
        D    I  +PI  G    D+LIW+Y++ G Y+++SGY++A      V   SS SS E R  WW GF   +   K++                 S  G  +
Subjt:  DSTLAIRRLPIPLGVSPLDKLIWHYDRRGQYNIKSGYRLA--QRGIVHGLSSPSSLESRLQWWKGFGNSNSLVKLR----------------FSDGGYAL

Query:  TVYQRCSKF-----SHLLHLLWVL-------ARSDIGH-----FMRVWTDMVSWQHIGAIVMLLWAIWNARN------QTPQHFSLGGLL----------
        T    C  F        +HL W+          S  G       +R   + +S      + +++W +WN RN       T   F +G  L          
Subjt:  TVYQRCSKF-----SHLLHLLWVL-------ARSDIGH-----FMRVWTDMVSWQHIGAIVMLLWAIWNARN------QTPQHFSLGGLL----------

Query:  ----------------SDIVSCASPQG----ECGCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDS
                        ++I+     +G        SF       G+ +II +  G V     + L   + VD  E  A  EG+ LA E G          
Subjt:  ----------------SDIVSCASPQG----ECGCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDS

Query:  LRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVDC
              +     D  E G +    K F  +  +  SF+F +R+GN AAH+LA+ AL      IW+E+WP E+     ++C
Subjt:  LRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVDC

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia]9.8e-9269.85Show/hide
Query:  RCSKFSHLLHLLWVLARSDIGHFMRVWTDMVSWQHIGAIVMLLWAIWNARNQTPQHFSLGGLLSDIVS----------------------CASP------
        +CSKFSH LHLLWV   SD+GHFMRVWTD+VSWQHIGAIV+LLWAIWNARNQTPQHFSLGG LSD+VS                      C  P      
Subjt:  RCSKFSHLLHLLWVLARSDIGHFMRVWTDMVSWQHIGAIVMLLWAIWNARNQTPQHFSLGGLLSDIVS----------------------CASP------

Query:  -------QGECGCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYF
               +     +F+KESFV GV VIIRDS GLVYLT +R LARA DVDWVEG AVYEGILLAVEAGFIRFQIETDSLRIFNLLT DCVDD EVG L  
Subjt:  -------QGECGCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYF

Query:  VIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVDCSSCS
        VIKLFLSSH   VSFSFT R+GNA AHLLAQLALTSPHLQIWVEEWPDEIS + AVDC S S
Subjt:  VIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVDCSSCS

TrEMBL top hitse value%identityAlignment
A0A2N9FT78 RNase H domain-containing protein2.6e-1324.43Show/hide
Query:  LPIPLGV-SPLDKLIWHYDRRGQYNIKSGYRLAQR-------------------GIVHGLSSPSSLESRLQWWKGFGNS----NSLVKLRFSDGGYALTV
        L IPL   SP DKLIWH  + G+++++SGY L  +                    ++  +  P+ + S +  W+    S      L++ +  D  +  + 
Subjt:  LPIPLGV-SPLDKLIWHYDRRGQYNIKSGYRLAQR-------------------GIVHGLSSPSSLESRLQWWKGFGNS----NSLVKLRFSDGGYALTV

Query:  YQRCSKFSH------LLHLLWVLARSDIGHFMRV----WTDMVSWQHIGAI---------VMLLWAIWNARNQTPQHF-SLGGLLSDIVSCASPQGEC--
                H      LL+  W  ++       +V    +T++V  + +G +          M+ W +W+ RN+   H  S G + +   S   P+     
Subjt:  YQRCSKFSH------LLHLLWVLARSDIGHFMRV----WTDMVSWQHIGAI---------VMLLWAIWNARNQTPQHF-SLGGLLSDIVSCASPQGEC--

Query:  -------------GCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTL
                     G  FQ E+ + G+ V++RD  G+V  T+ + +  +   + +E  A    I  A+E G +  Q E DS  I   L+   V     G +
Subjt:  -------------GCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTL

Query:  YFVIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEI
            K  L  H     F+ TRR GN+ AH LA+ AL  P+  +W+E+ P +I
Subjt:  YFVIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEI

A0A6J1C467 uncharacterized protein LOC1110077753.1e-2734.35Show/hide
Query:  DMVSWQHIGAIVMLLWAIWNARNQTPQHFSLGGLLSDIVSCASP------------------QGECG------------------CSFQKESFVVGVSVI
        D + W  +  + + LWAIWNARNQ+      G LL ++V   +                    G  G                   +F+K +F  G+ ++
Subjt:  DMVSWQHIGAIVMLLWAIWNARNQTPQHFSLGGLLSDIVSCASP------------------QGECG------------------CSFQKESFVVGVSVI

Query:  IRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHGMWVSFSFTRRDGNAAAH
        IRDS   V L+ +  +    DV   E  A  EG+ LA+EAG I FQIETDS ++FNLL  DC D+ E+G L   I+  +SS  +   FSF  R+GN+ AH
Subjt:  IRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHGMWVSFSFTRRDGNAAAH

Query:  LLAQLALTSPHLQIWVEEWPDEISLMFAVD
         LA++ + S    +WVEEW  ++S + A D
Subjt:  LLAQLALTSPHLQIWVEEWPDEISLMFAVD

A0A6J1CDQ4 uncharacterized protein LOC1110105331.0e-1429.65Show/hide
Query:  HFMRVWTDMVSWQHIGAIVMLLWAIWNARN------QTPQHFSLGGLLSDIVS-----------------------------------CASPQG----EC
        + +R W DM++W+    +V+ LW++WN RN      +  +   L G +S  ++                                   C + +G    + 
Subjt:  HFMRVWTDMVSWQHIGAIVMLLWAIWNARN------QTPQHFSLGGLLSDIVS-----------------------------------CASPQG----EC

Query:  GCSFQKESFVVGVSV-IIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHG
          SF    F  G+ V IIRD  G V  +  + L     VD  E  A  EG+ +A+E G     +ETDSLRI+NL   D     + G++   +K  L++  
Subjt:  GCSFQKESFVVGVSV-IIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHG

Query:  MWVSFSFTRRDGNAAAHLLAQLALTS
        + VS+SFT+R GN  AHLLA+ AL S
Subjt:  MWVSFSFTRRDGNAAAHLLAQLALTS

A0A6J1CIF1 uncharacterized protein LOC1110112378.8e-1435.17Show/hide
Query:  SFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHGMWV
        SF       G+ +IIR+  G V  +  + L   + VD  E     EG+ LA + G     +ETDS RIFNL +    D  E G +    K F  +  +  
Subjt:  SFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHGMWV

Query:  SFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVDC
        SF+F +R+GN AAH+LA+ AL      IW+E+WP E+     ++C
Subjt:  SFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVDC

A0A6J1DBJ7 uncharacterized protein LOC1110189734.7e-9269.85Show/hide
Query:  RCSKFSHLLHLLWVLARSDIGHFMRVWTDMVSWQHIGAIVMLLWAIWNARNQTPQHFSLGGLLSDIVS----------------------CASP------
        +CSKFSH LHLLWV   SD+GHFMRVWTD+VSWQHIGAIV+LLWAIWNARNQTPQHFSLGG LSD+VS                      C  P      
Subjt:  RCSKFSHLLHLLWVLARSDIGHFMRVWTDMVSWQHIGAIVMLLWAIWNARNQTPQHFSLGGLLSDIVS----------------------CASP------

Query:  -------QGECGCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYF
               +     +F+KESFV GV VIIRDS GLVYLT +R LARA DVDWVEG AVYEGILLAVEAGFIRFQIETDSLRIFNLLT DCVDD EVG L  
Subjt:  -------QGECGCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLAVEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYF

Query:  VIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVDCSSCS
        VIKLFLSSH   VSFSFT R+GNA AHLLAQLALTSPHLQIWVEEWPDEIS + AVDC S S
Subjt:  VIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVDCSSCS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGAGAAAATTCGACAGCACTTTAGCGATAAGGAGACTTCCTATTCCGCTTGGTGTTAGTCCGCTGGATAAACTCATCTGGCATTATGACAGAAGGGGACAATACAA
TATTAAAAGTGGGTACCGTCTTGCACAGCGGGGCATCGTTCATGGTTTGTCTTCCCCTTCTTCCCTCGAGTCACGGTTACAGTGGTGGAAGGGTTTTGGAAACTCCAACT
CCCTAGTAAAATTAAGATTTTCGGATGGAGGTTATGCCTTGACCGTTTACCAACGGTGCTCTAAATTCTCTCATCTTCTCCATCTTTTGTGGGTGCTGGCGAGATCTGAT
ATAGGTCATTTTATGCGGGTGTGGACAGATATGGTTTCTTGGCAGCATATTGGAGCTATTGTGATGTTGTTGTGGGCTATTTGGAATGCGCGTAATCAAACTCCCCAACA
TTTCTCGTTGGGTGGCCTTTTGTCGGACATTGTTTCTTGCGCCTCTCCTCAAGGTGAATGTGGATGCAGCTTTCAGAAAGAATCTTTTGTAGTAGGGGTGAGTGTAATTA
TTCGGGATTCGGTCGGTCTTGTGTATCTTACGGTCGTTCGTCCGCTTGCTCGTGCTCGTGATGTTGATTGGGTGGAAGGCTCTGCGGTTTATGAGGGTATTCTCCTTGCT
GTGGAAGCAGGTTTTATTCGGTTCCAGATCGAAACAGATTCACTCCGGATTTTTAATTTACTGACGATAGATTGTGTGGATGATTTTGAAGTTGGGACTCTCTACTTTGT
CATCAAGCTTTTTTTGTCTTCTCATGGTATGTGGGTTTCTTTTAGTTTTACACGTAGGGACGGTAACGCTGCAGCCCATCTGTTAGCTCAACTGGCATTGACTTCACCAC
ACCTTCAAATCTGGGTGGAGGAATGGCCTGATGAGATCTCTTTGATGTTCGCGGTGGATTGTAGTTCCTGTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGAGAAAATTCGACAGCACTTTAGCGATAAGGAGACTTCCTATTCCGCTTGGTGTTAGTCCGCTGGATAAACTCATCTGGCATTATGACAGAAGGGGACAATACAA
TATTAAAAGTGGGTACCGTCTTGCACAGCGGGGCATCGTTCATGGTTTGTCTTCCCCTTCTTCCCTCGAGTCACGGTTACAGTGGTGGAAGGGTTTTGGAAACTCCAACT
CCCTAGTAAAATTAAGATTTTCGGATGGAGGTTATGCCTTGACCGTTTACCAACGGTGCTCTAAATTCTCTCATCTTCTCCATCTTTTGTGGGTGCTGGCGAGATCTGAT
ATAGGTCATTTTATGCGGGTGTGGACAGATATGGTTTCTTGGCAGCATATTGGAGCTATTGTGATGTTGTTGTGGGCTATTTGGAATGCGCGTAATCAAACTCCCCAACA
TTTCTCGTTGGGTGGCCTTTTGTCGGACATTGTTTCTTGCGCCTCTCCTCAAGGTGAATGTGGATGCAGCTTTCAGAAAGAATCTTTTGTAGTAGGGGTGAGTGTAATTA
TTCGGGATTCGGTCGGTCTTGTGTATCTTACGGTCGTTCGTCCGCTTGCTCGTGCTCGTGATGTTGATTGGGTGGAAGGCTCTGCGGTTTATGAGGGTATTCTCCTTGCT
GTGGAAGCAGGTTTTATTCGGTTCCAGATCGAAACAGATTCACTCCGGATTTTTAATTTACTGACGATAGATTGTGTGGATGATTTTGAAGTTGGGACTCTCTACTTTGT
CATCAAGCTTTTTTTGTCTTCTCATGGTATGTGGGTTTCTTTTAGTTTTACACGTAGGGACGGTAACGCTGCAGCCCATCTGTTAGCTCAACTGGCATTGACTTCACCAC
ACCTTCAAATCTGGGTGGAGGAATGGCCTGATGAGATCTCTTTGATGTTCGCGGTGGATTGTAGTTCCTGTTCTTAA
Protein sequenceShow/hide protein sequence
MWRKFDSTLAIRRLPIPLGVSPLDKLIWHYDRRGQYNIKSGYRLAQRGIVHGLSSPSSLESRLQWWKGFGNSNSLVKLRFSDGGYALTVYQRCSKFSHLLHLLWVLARSD
IGHFMRVWTDMVSWQHIGAIVMLLWAIWNARNQTPQHFSLGGLLSDIVSCASPQGECGCSFQKESFVVGVSVIIRDSVGLVYLTVVRPLARARDVDWVEGSAVYEGILLA
VEAGFIRFQIETDSLRIFNLLTIDCVDDFEVGTLYFVIKLFLSSHGMWVSFSFTRRDGNAAAHLLAQLALTSPHLQIWVEEWPDEISLMFAVDCSSCS