; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g31270 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g31270
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRNase H domain-containing protein
Genome locationchr8:22405380..22412018
RNA-Seq ExpressionMoc08g31270
SyntenyMoc08g31270
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599033.1 hypothetical protein SDJN03_08811, partial [Cucurbita argyrosperma subsp. sororia]6.3e-11665.12Show/hide
Query:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL
        F  +TSINGC FN YW SSFH+ TLK   LDS+CSRF L CYSSRK+ KG+S S  LD+KP  ME + GDF +                    G+ ICDL
Subjt:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL

Query:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL
        PVS+YK  SLPKDT EYL+SVGLKNALYTIKAADMRPD+F SLVPCT  D ATSLKGEAS Q A KKRSRE          ++S+ + +     +S   L
Subjt:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL

Query:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK
         +      L+        SN      E C LEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLKYAL+KGFTRIHVQGDSK
Subjt:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK

Query:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        LVCMQVQGLWKVKNENI+ELCNEV+KLKDKFLSF+ISHVLRNLNSEADAQANLA+TL DGE  E+EE
Subjt:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

XP_022946833.1 uncharacterized protein LOC111450776 [Cucurbita moschata]2.4e-11564.85Show/hide
Query:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL
        F  +TSI+GC FN YW SSFH+ TLK   LDS+CSRF L CYSSRK+ KG+S S  LD+KP  ME + GDF +                    G+ ICDL
Subjt:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL

Query:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL
        PVS+YK  SLPKDT EYL+SVGLKNALYTIKAADMRPD+F SLVPCT  D AT+LKGEAS Q A KKRSRE          ++S+ + +     +S   L
Subjt:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL

Query:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK
         +      L+        SN      E CFLEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLKYAL+KGFTRIHVQGDSK
Subjt:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK

Query:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        LVCMQVQGLWKVKNENI+ELCNEV+KLKDKFLSF+ISHVLRNLNSEADAQANLA+TL DGE  E+EE
Subjt:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

XP_022999715.1 uncharacterized protein LOC111493983 [Cucurbita maxima]3.7e-11665.12Show/hide
Query:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL
        F  +TSI+GC FNPYW SS H+ TLK   LDS+CSRF L CYSSRK+ KG+S S  LD++P  ME + GDF +                    G+ ICDL
Subjt:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL

Query:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL
        PVS+YK  SLPKDT+EYLASVGLKNALYTI+AADMRPD+F SLVPCT  D ATSLKGEAS Q A KKRSRE          ++S+ + +     +S   L
Subjt:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL

Query:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK
         +      L+        SN      E CFLEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLKYAL+KGFTRIHVQGDSK
Subjt:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK

Query:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        LVCMQVQGLWKVKNENISELCNEV+KLKDKFLSF+ISHVLRNLNSEADAQANLAI+L DGE  E+EE
Subjt:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

XP_023547239.1 uncharacterized protein LOC111806115 [Cucurbita pepo subsp. pepo]3.4e-11765.4Show/hide
Query:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL
        F  +TSI+GC FNPYW SSFH+ TLK   LDS+CSRF L CYSSRK+ KG+S S  LD+KP  ME + GDF +                    G+ ICDL
Subjt:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL

Query:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL
        PVS+YK  SLPKDT EYLASVGLKNALYTIKAADMRPD+F SLVPCT  D ATSLKGEAS Q A KKRSRE          ++S+ + +     +S   L
Subjt:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL

Query:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK
         +      L+        SN      E CFLEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLKYAL+KGFTRIHVQGDSK
Subjt:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK

Query:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        LVCMQVQGLWKVKNENI+ELCNEV+KLKDKFLSF+ISHVLRNLNSEADA+ANLA+TL DGE  E+EE
Subjt:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

XP_038889960.1 uncharacterized protein LOC120079705 [Benincasa hispida]1.4e-11866.49Show/hide
Query:  TTSINGCFNPYWRSSF--HNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDLPV
        +TSING  N YW SSF  HN  +K  A+DS+CSRF L CYSSRK+ K +S S KLD++P + E E GDF +                    G+ ICDLPV
Subjt:  TTSINGCFNPYWRSSF--HNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDLPV

Query:  SMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFLCR
        S+YK  SLPKDT+EYLASVGLKNALYTIKAADMRPD+FGSLVPCT  DG TS+KGEAS Q A KKR REA         ++S+ +         SS L  
Subjt:  SMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFLCR

Query:  TTLFGSLDP-----KFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQG
        T    S DP     K  +   S+ LS  +E CFLEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLKYALQKGFTRIHVQG
Subjt:  TTLFGSLDP-----KFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQG

Query:  DSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        DSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSF+I+HVLRNLNSEADAQANLAITLADGEV E+E+
Subjt:  DSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

TrEMBL top hitse value%identityAlignment
A0A0A0KTZ9 RNase H domain-containing protein2.3e-11160.89Show/hide
Query:  FSKEPAFFLPTTSINGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSS---RKIAKGSSRSWKLDTKPSSMELEKGDFLL------------------
        F +    F  +TSI+GC N YW SSFHN  +K  ALDS+CSRF L CYS+   RK  K +S S KLD++P  +E E GDF +                  
Subjt:  FSKEPAFFLPTTSINGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSS---RKIAKGSSRSWKLDTKPSSMELEKGDFLL------------------

Query:  -YGTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVE
          G+ ICDLPVS++K  SLPKDTEEYLASVGLKNALYTIKAADMRPD+FGSL PCT   G TSL GE S Q A KKRSREA                 + 
Subjt:  -YGTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVE

Query:  PFSSSSSFLCRTTLFGSLDP-----KFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYAL
        P +  S+ L  T      DP     K  +   S+ +S  +E CFLEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLK AL
Subjt:  PFSSSSSFLCRTTLFGSLDP-----KFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYAL

Query:  QKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        +KGFTRIHVQGDSKLVCMQVQGLWK K+EN+SELCNEV KLK+KFLSF+++HVLR+LNSEADAQANLA+TLA+GEV E+E+
Subjt:  QKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

A0A1S3CPT7 uncharacterized protein LOC103503315 isoform X18.3e-11461.58Show/hide
Query:  FSKEPAFFLPTTSINGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSS---RKIAKGSSRSWKLDTKPSSMELEKGDFLL------------------
        F +    F  +TSI+GC NPYW S+FHN  +K  ALDS+CSRF L CYS+   RK  K +S S KLD++P  ME E GDF +                  
Subjt:  FSKEPAFFLPTTSINGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSS---RKIAKGSSRSWKLDTKPSSMELEKGDFLL------------------

Query:  -YGTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVE
          G+ ICDLPVS++K  SLPKD+EEYLAS+GLKNALYTIKAADMRPD+FGSLVPCT  DG  SL GE S Q A KKRSREA  +             NV 
Subjt:  -YGTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVE

Query:  PFSSSSSFLCRTTLFGSLDPKFFEHYTSN----MLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQ
             SS L  T++  + +    +H         LS   E CFLEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLK+AL+
Subjt:  PFSSSSSFLCRTTLFGSLDPKFFEHYTSN----MLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQ

Query:  KGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        KGFTRIHVQGDSKLVCMQVQGLWK KNENISELCNEV+KLK+KFLSF+++HVLR+LNSEADAQANLA+TLADGE+ E E+
Subjt:  KGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

A0A5A7TCE2 RNase H family protein, putative isoform 28.3e-11461.58Show/hide
Query:  FSKEPAFFLPTTSINGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSS---RKIAKGSSRSWKLDTKPSSMELEKGDFLL------------------
        F +    F  +TSI+GC NPYW S+FHN  +K  ALDS+CSRF L CYS+   RK  K +S S KLD++P  ME E GDF +                  
Subjt:  FSKEPAFFLPTTSINGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSS---RKIAKGSSRSWKLDTKPSSMELEKGDFLL------------------

Query:  -YGTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVE
          G+ ICDLPVS++K  SLPKD+EEYLAS+GLKNALYTIKAADMRPD+FGSLVPCT  DG  SL GE S Q A KKRSREA  +             NV 
Subjt:  -YGTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVE

Query:  PFSSSSSFLCRTTLFGSLDPKFFEHYTSN----MLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQ
             SS L  T++  + +    +H         LS   E CFLEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLK+AL+
Subjt:  PFSSSSSFLCRTTLFGSLDPKFFEHYTSN----MLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQ

Query:  KGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        KGFTRIHVQGDSKLVCMQVQGLWK KNENISELCNEV+KLK+KFLSF+++HVLR+LNSEADAQANLA+TLADGE+ E E+
Subjt:  KGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

A0A6J1G4Y7 uncharacterized protein LOC1114507761.2e-11564.85Show/hide
Query:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL
        F  +TSI+GC FN YW SSFH+ TLK   LDS+CSRF L CYSSRK+ KG+S S  LD+KP  ME + GDF +                    G+ ICDL
Subjt:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL

Query:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL
        PVS+YK  SLPKDT EYL+SVGLKNALYTIKAADMRPD+F SLVPCT  D AT+LKGEAS Q A KKRSRE          ++S+ + +     +S   L
Subjt:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL

Query:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK
         +      L+        SN      E CFLEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLKYAL+KGFTRIHVQGDSK
Subjt:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK

Query:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        LVCMQVQGLWKVKNENI+ELCNEV+KLKDKFLSF+ISHVLRNLNSEADAQANLA+TL DGE  E+EE
Subjt:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

A0A6J1KGD7 uncharacterized protein LOC1114939831.8e-11665.12Show/hide
Query:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL
        F  +TSI+GC FNPYW SS H+ TLK   LDS+CSRF L CYSSRK+ KG+S S  LD++P  ME + GDF +                    G+ ICDL
Subjt:  FLPTTSINGC-FNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDL

Query:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL
        PVS+YK  SLPKDT+EYLASVGLKNALYTI+AADMRPD+F SLVPCT  D ATSLKGEAS Q A KKRSRE          ++S+ + +     +S   L
Subjt:  PVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCT--DGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFL

Query:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK
         +      L+        SN      E CFLEFDGA KGN GQA AGAVLRAHDGSVICRLREGLG ATNN+AEYRAILLGLKYAL+KGFTRIHVQGDSK
Subjt:  CRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSK

Query:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE
        LVCMQVQGLWKVKNENISELCNEV+KLKDKFLSF+ISHVLRNLNSEADAQANLAI+L DGE  E+EE
Subjt:  LVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEVLEYEE

SwissProt top hitse value%identityAlignment
F9VN79 Ribonuclease HI3.3e-0636.67Show/hide
Query:  ATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITL
        +TNN+AEY  ++  ++  L+ G +   ++GDS+LV  Q+ G +KVK + I  L  + I+LK K L+  +  V R  N EAD  + +A  L
Subjt:  ATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITL

P64956 Uncharacterized protein Mb2253c1.9e-1439.53Show/hide
Query:  LEFDGALKGNLGQARAGAVLRAHDGS-VICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD
        +E DG  +GN G A  GAV+   D S V+   ++ +G ATNN+AEYR ++ GL  A++ G T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGALKGNLGQARAGAVLRAHDGS-VICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD

Query:  KFLSFQISHVLRNLNSEADAQANLAITLA
        +F       V R  N+ AD  AN A+  A
Subjt:  KFLSFQISHVLRNLNSEADAQANLAITLA

P9WLH4 Uncharacterized protein MT22871.9e-1439.53Show/hide
Query:  LEFDGALKGNLGQARAGAVLRAHDGS-VICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD
        +E DG  +GN G A  GAV+   D S V+   ++ +G ATNN+AEYR ++ GL  A++ G T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGALKGNLGQARAGAVLRAHDGS-VICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD

Query:  KFLSFQISHVLRNLNSEADAQANLAITLA
        +F       V R  N+ AD  AN A+  A
Subjt:  KFLSFQISHVLRNLNSEADAQANLAITLA

P9WLH5 Bifunctional protein Rv2228c1.9e-1439.53Show/hide
Query:  LEFDGALKGNLGQARAGAVLRAHDGS-VICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD
        +E DG  +GN G A  GAV+   D S V+   ++ +G ATNN+AEYR ++ GL  A++ G T   V  DSKLV  Q+ G WKVK+ ++ +L  +   L  
Subjt:  LEFDGALKGNLGQARAGAVLRAHDGS-VICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKD

Query:  KFLSFQISHVLRNLNSEADAQANLAITLA
        +F       V R  N+ AD  AN A+  A
Subjt:  KFLSFQISHVLRNLNSEADAQANLAITLA

Q9HSF6 Ribonuclease HI8.5e-1539.02Show/hide
Query:  FDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFL
        FDGA +GN G A  G VL + DG ++    + +G ATNN AEY A++  L+ A   GF  I ++GDS+LV  Q+ G W   + ++        +L   F 
Subjt:  FDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFL

Query:  SFQISHVLRNLNSEADAQANLAI
         + I+HV R  N  ADA AN A+
Subjt:  SFQISHVLRNLNSEADAQANLAI

Arabidopsis top hitse value%identityAlignment
AT1G24090.1 RNase H family protein5.1e-6341.94Show/hide
Query:  LKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDLPVSMYKRPSLPKDTEEYLASVGLK
        LK  A+ SV    ++H YSSR  +K         T  S+++ EK  F +                    G+ + DLPVS+YK  SLPKDTEEYL+SVGLK
Subjt:  LKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLL-------------------YGTGICDLPVSMYKRPSLPKDTEEYLASVGLK

Query:  NALYTIKAADMRPDIFGSLVPC--TDGATSLKGEASDQAATKKRSREAFATLL---HLFHLLSQKLPNVEPFSSSSSFLCRTTLFGSLDPKFFEHYTSNM
          LY+++A+D++ D+FG+L PC   + A      + D+  ++ +S++     L    + +   +KL  VEP +  S                        
Subjt:  NALYTIKAADMRPDIFGSLVPC--TDGATSLKGEASDQAATKKRSREAFATLL---HLFHLLSQKLPNVEPFSSSSSFLCRTTLFGSLDPKFFEHYTSNM

Query:  LSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELC
             E CF+EFDGA KGN G + A AVL+  DGS+ICR+R+GLG ATNN AEY A++LGLKYA++KG+  I V+GDSKLVCMQ++G WKV +E +++L 
Subjt:  LSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELC

Query:  NEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEV
         E   L +K +SF+ISHVLRNLN++AD QANLA+ L +GEV
Subjt:  NEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEV

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.2e-5343.17Show/hide
Query:  GTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPC-TDGATSLKGEASDQAATKKRSREAFATLLHLFH----LLSQKLPNV
        G+ +    +S+YK    PK  E+ L+S G+KNAL+++ A+ ++ D FG L+PC     +S +GE+ ++++  KR ++  +     F         K+ N 
Subjt:  GTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPC-TDGATSLKGEASDQAATKKRSREAFATLLHLFH----LLSQKLPNV

Query:  EPFSSSSSFLCRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGF
              SS L RT +                  R  + C +EFDGA KGN G+A AGAVLRA D SV+  LREG+G ATNN+AEYRA+LLGL+ AL KGF
Subjt:  EPFSSSSSFLCRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGF

Query:  TRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGE
          +HV GDS LVCMQVQG WK  +  ++ELC +  +L + F +F I H+ R  NSEAD QAN AI LADG+
Subjt:  TRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGE

AT5G51080.1 RNase H family protein1.1e-5740.17Show/hide
Query:  NGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSR-KIAKGSSRSWKLDTKPSSME------LEKGDFL-LY----------GTGICDLPVSMYKRPS
        N CF    +SS   A++ V+         ++HCYSSR K AK       +    S  E      + KGD + +Y          G+ + D PVS+YK  S
Subjt:  NGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSR-KIAKGSSRSWKLDTKPSSME------LEKGDFL-LY----------GTGICDLPVSMYKRPS

Query:  LPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCTDGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFLCRTTLFGSLDP
        L KDTEE L++VGLK  LY  +A D++ D+FG+L PC            DQ  +   S E              KL  +EP + +S              
Subjt:  LPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCTDGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFLCRTTLFGSLDP

Query:  KFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWK
                       E C +EFDGA KGN G + A AVL+  DGS+I ++R+GLG ATNN AEY  ++LGLK+A++KG+T+I V+ DSKLVCMQ++G WK
Subjt:  KFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWK

Query:  VKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEV
        V +E +S+L  E  +L DK LSF+ISHVLR+LNS+AD QAN+A  L++GEV
Subjt:  VKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEV

AT5G51080.2 RNase H family protein1.1e-5740.17Show/hide
Query:  NGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSR-KIAKGSSRSWKLDTKPSSME------LEKGDFL-LY----------GTGICDLPVSMYKRPS
        N CF    +SS   A++ V+         ++HCYSSR K AK       +    S  E      + KGD + +Y          G+ + D PVS+YK  S
Subjt:  NGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSR-KIAKGSSRSWKLDTKPSSME------LEKGDFL-LY----------GTGICDLPVSMYKRPS

Query:  LPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCTDGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFLCRTTLFGSLDP
        L KDTEE L++VGLK  LY  +A D++ D+FG+L PC            DQ  +   S E              KL  +EP + +S              
Subjt:  LPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCTDGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFLCRTTLFGSLDP

Query:  KFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWK
                       E C +EFDGA KGN G + A AVL+  DGS+I ++R+GLG ATNN AEY  ++LGLK+A++KG+T+I V+ DSKLVCMQ++G WK
Subjt:  KFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWK

Query:  VKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEV
        V +E +S+L  E  +L DK LSF+ISHVLR+LNS+AD QAN+A  L++GEV
Subjt:  VKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEV

AT5G51080.3 RNase H family protein4.3e-5444.57Show/hide
Query:  GTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCTDGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSS
        G+ + D PVS+YK  SL KDTEE L++VGLK  LY  +A D++ D+FG+L PC            DQ  +   S E              KL  +EP + 
Subjt:  GTGICDLPVSMYKRPSLPKDTEEYLASVGLKNALYTIKAADMRPDIFGSLVPCTDGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSS

Query:  SSSFLCRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHV
        +S                             E C +EFDGA KGN G + A AVL+  DGS+I ++R+GLG ATNN AEY  ++LGLK+A++KG+T+I V
Subjt:  SSSFLCRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEFDGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHV

Query:  QGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEV
        + DSKLVCMQ++G WKV +E +S+L  E  +L DK LSF+ISHVLR+LNS+AD QAN+A  L++GEV
Subjt:  QGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNLNSEADAQANLAITLADGEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTTTCAAAAGAACCAGCGTTTTTTTTGCCTACGACCTCCATTAATGGCTGCTTCAATCCCTACTGGAGGTCAAGCTTTCACAATGCCACCCTTAAGGTTGCTGC
TTTAGATTCCGTGTGCTCCAGATTCAATCTGCACTGCTATTCGTCCCGAAAAATTGCCAAGGGCAGTTCTCGTTCGTGGAAGTTAGATACTAAACCTTCTTCCATGGAAT
TGGAGAAAGGCGACTTCTTGTTGTACGGAACGGGGATATGTGATCTTCCTGTTAGCATGTATAAAAGACCCTCATTGCCTAAAGACACTGAGGAATATCTTGCTTCCGTT
GGACTTAAGAATGCTCTATACACTATTAAAGCTGCAGATATGAGGCCTGATATTTTCGGTTCGCTTGTGCCTTGCACTGATGGAGCTACTTCTCTCAAAGGTGAAGCTTC
TGACCAGGCAGCCACAAAGAAGAGATCAAGAGAAGCTTTTGCCACACTGCTACATTTGTTTCATCTCCTATCTCAAAAGCTTCCAAACGTTGAACCATTCTCATCAAGTT
CTTCATTTCTCTGTCGAACAACCCTCTTCGGAAGCCTAGATCCCAAGTTCTTTGAACATTACACCAGCAATATGCTATCACGGCTTCAGGAATGTTGCTTTCTAGAATTC
GATGGTGCTTTGAAAGGAAATCTTGGGCAAGCTAGAGCAGGAGCTGTTCTGCGAGCACATGATGGAAGTGTGATATGTAGACTGCGTGAAGGCTTAGGTACGGCAACCAA
TAATATTGCTGAATATCGAGCTATTCTTTTAGGATTGAAGTATGCTCTTCAGAAAGGTTTTACTAGGATCCATGTCCAAGGTGACTCCAAACTTGTCTGTATGCAGGTCC
AAGGATTGTGGAAGGTAAAAAATGAGAACATCTCTGAGTTATGTAATGAAGTAATCAAGCTGAAGGATAAATTTCTCTCGTTCCAGATCAGTCATGTACTACGGAATCTT
AATTCTGAAGCCGATGCTCAAGCAAACTTGGCTATCACTCTTGCTGATGGCGAAGTCCTGGAGTATGAAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCATTTTCAAAAGAACCAGCGTTTTTTTTGCCTACGACCTCCATTAATGGCTGCTTCAATCCCTACTGGAGGTCAAGCTTTCACAATGCCACCCTTAAGGTTGCTGC
TTTAGATTCCGTGTGCTCCAGATTCAATCTGCACTGCTATTCGTCCCGAAAAATTGCCAAGGGCAGTTCTCGTTCGTGGAAGTTAGATACTAAACCTTCTTCCATGGAAT
TGGAGAAAGGCGACTTCTTGTTGTACGGAACGGGGATATGTGATCTTCCTGTTAGCATGTATAAAAGACCCTCATTGCCTAAAGACACTGAGGAATATCTTGCTTCCGTT
GGACTTAAGAATGCTCTATACACTATTAAAGCTGCAGATATGAGGCCTGATATTTTCGGTTCGCTTGTGCCTTGCACTGATGGAGCTACTTCTCTCAAAGGTGAAGCTTC
TGACCAGGCAGCCACAAAGAAGAGATCAAGAGAAGCTTTTGCCACACTGCTACATTTGTTTCATCTCCTATCTCAAAAGCTTCCAAACGTTGAACCATTCTCATCAAGTT
CTTCATTTCTCTGTCGAACAACCCTCTTCGGAAGCCTAGATCCCAAGTTCTTTGAACATTACACCAGCAATATGCTATCACGGCTTCAGGAATGTTGCTTTCTAGAATTC
GATGGTGCTTTGAAAGGAAATCTTGGGCAAGCTAGAGCAGGAGCTGTTCTGCGAGCACATGATGGAAGTGTGATATGTAGACTGCGTGAAGGCTTAGGTACGGCAACCAA
TAATATTGCTGAATATCGAGCTATTCTTTTAGGATTGAAGTATGCTCTTCAGAAAGGTTTTACTAGGATCCATGTCCAAGGTGACTCCAAACTTGTCTGTATGCAGGTCC
AAGGATTGTGGAAGGTAAAAAATGAGAACATCTCTGAGTTATGTAATGAAGTAATCAAGCTGAAGGATAAATTTCTCTCGTTCCAGATCAGTCATGTACTACGGAATCTT
AATTCTGAAGCCGATGCTCAAGCAAACTTGGCTATCACTCTTGCTGATGGCGAAGTCCTGGAGTATGAAGAATAG
Protein sequenceShow/hide protein sequence
MPFSKEPAFFLPTTSINGCFNPYWRSSFHNATLKVAALDSVCSRFNLHCYSSRKIAKGSSRSWKLDTKPSSMELEKGDFLLYGTGICDLPVSMYKRPSLPKDTEEYLASV
GLKNALYTIKAADMRPDIFGSLVPCTDGATSLKGEASDQAATKKRSREAFATLLHLFHLLSQKLPNVEPFSSSSSFLCRTTLFGSLDPKFFEHYTSNMLSRLQECCFLEF
DGALKGNLGQARAGAVLRAHDGSVICRLREGLGTATNNIAEYRAILLGLKYALQKGFTRIHVQGDSKLVCMQVQGLWKVKNENISELCNEVIKLKDKFLSFQISHVLRNL
NSEADAQANLAITLADGEVLEYEE