; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013299 (gene) of Snake gourd v1 genome

Gene IDTan0013299
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG07:61443403..61481990
RNA-Seq ExpressionTan0013299
SyntenyTan0013299
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588976.1 hypothetical protein SDJN03_17541, partial [Cucurbita argyrosperma subsp. sororia]4.7e-12179.09Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA
        MEGDKDAYYVV+KGDVFGFYRS K+LEAQAG LIFDPNATIYKGYHLSKE EQYLASHGL+SATYSISAANVT DLFGKLLACPYE+PS +RGKMAE+Y 
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA

Query:  EAKRPQQVHET-----------VDST--------------------YFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL
        +AKR + V  T            DS+                    YFLEFDGASKGNPGLAGAGAVLRA++GTT+CRLQEGVGIATNNVAEYRAVILGL
Subjt:  EAKRPQQVHET-----------VDST--------------------YFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL

Query:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK
        KHALKSGFKHI VRGDSKLVCMQVQGLWKLKN NMA LCKVAK+LKDKFVSFEINHIPREQNSDADALANRAI+LRDG+VVEDCKHK
Subjt:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK

XP_022928161.1 uncharacterized protein LOC111435064 [Cucurbita moschata]1.2e-12179.44Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA
        MEGDKDAYYVV+KGDVFGFYRS K+LEAQAG LIFDPNATIYKGYHLSKE EQYLASHGL+SATYSISAANVT DLFGKLLACPYEQPS +RG+MAE+Y 
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA

Query:  EAKRPQQVHET-----------VDST--------------------YFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL
        +AKR + V  T            DS+                    YFLEFDGASKGNPGLAGAGAVLRA++GTT+CRLQEGVGIATNNVAEYRAVILGL
Subjt:  EAKRPQQVHET-----------VDST--------------------YFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL

Query:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK
        KHALKSGFKHI VRGDSKLVCMQVQGLWKLKN NMA LCKVAK+LKDKFVSFEINHIPREQNSDADALANRAIHLRDG+VVEDCKHK
Subjt:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK

XP_022989146.1 uncharacterized protein LOC111486303 [Cucurbita maxima]2.1e-12179.44Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA
        MEGDKDAYYVV+KGDVFGFYRS K+LEAQAG LIFDPNATIYKGYHLSKE EQYLASHGL+SATYSISAANVT DLFGKLLACPYEQPS +RGKMAE+Y 
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA

Query:  EAKRPQQVHET----VDS---------------------------TYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL
        +AKR + V  T    VD+                            YFLEFDGASKGNPGLAGAGAVLRA++GTT+CRLQEGVGIATNNVAEYRAVILGL
Subjt:  EAKRPQQVHET----VDS---------------------------TYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL

Query:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK
        KHALKSGFKHI VRGDSKLVCMQVQGLWKLKN NMA LCKVAK+LKDKFVSFEINHIPREQNSDADALANRAI+LRDG+VVEDCKHK
Subjt:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK

XP_023529762.1 uncharacterized protein LOC111792488 [Cucurbita pepo subsp. pepo]6.2e-12179.44Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA
        MEGDKDAYYVV+KGDVFGFYRS K+LEAQAG LIFDPNATIYKGYHLSKE EQYLASHGLQSATYSISAANVT DLFGKLLACPYEQPS + GKMAE+Y 
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA

Query:  EAKRPQQVHET-----------VDS--------------------TYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL
        +AKR + V  T            DS                     YFLEFDGASKGNPGLAGAGAVLRA++GTT+CRLQEGVGIATNNVAEYRAVILGL
Subjt:  EAKRPQQVHET-----------VDS--------------------TYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL

Query:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK
        KHALKSGFKHI VRGDSKLVCMQVQGLWKLKN NMA LCKVAK+LKDKFVSFEINHIPREQNSDADALANRAI+LRDG+VVEDCKHK
Subjt:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK

XP_038906129.1 uncharacterized protein LOC120092011 isoform X3 [Benincasa hispida]1.9e-12273.9Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYE--------------
        MEGDKDAYYVVQKGDVFGFYRSLK+L+AQAG LIFDPNATIYKGYHLSKE E YLASHGLQSATYSISAANVTKDLFGKLLACPYE              
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYE--------------

Query:  ------------------------------------------------QPSVSRGKMAEDYAEAKRPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLR
                                                        QPSVSRGKMAEDY+EA R QQ HETV  TYFLEFDGASKGNPGLAGAGAVLR
Subjt:  ------------------------------------------------QPSVSRGKMAEDYAEAKRPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLR

Query:  ASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALA
        A++G+TVCRLQEGVG+ATNNVAEYRAVILGLKHALK G KHICV+GDSKLVCMQVQGLWKLKN NMANLCKVAKELKDKFVSFEI+HIPREQNSDAD LA
Subjt:  ASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALA

Query:  NRAIHLRDGVVVEDCKHK
        NRAIHLRDGVVVEDCKHK
Subjt:  NRAIHLRDGVVVEDCKHK

TrEMBL top hitse value%identityAlignment
A0A0A0LHV8 RNase H domain-containing protein1.9e-11280.16Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLAC-PYEQPSVSRGKMAEDY
        MEGDKDAYYVV KGDVFGFYR+ K+L    G   FDP+ATIYKGYHLSKE E+YL +HGLQSATYSISAANVTKDLFGK+L C PYEQPS +RGKMAE+Y
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLAC-PYEQPSVSRGKMAEDY

Query:  AEAKRPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKL
        ++A+R ++V E  + TYFLEFDGASKGNPGLAGAGAVLRA++G+TVC+LQEGVGIAT NVAEYRAVILGLKHALK+G KHI V+GDSKLVCMQVQGLWKL
Subjt:  AEAKRPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKL

Query:  KNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK
        KNPNMA  CKVAKELKDKFVSFEI+H PR+QNSDADALAN AI L+DGVVVEDC HK
Subjt:  KNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK

A0A1S3B7N6 uncharacterized protein LOC103487060 isoform X12.4e-11581.71Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLAC-PYEQPSVSRGKMAEDY
        MEGDKDAYYVVQKGDVFGFYRS K+L  Q G   FDPNATIYKGYHLSKE E+YL SHGLQSATYSISAANVTKDLFGK+L C PYEQPS +RGKMAE+Y
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLAC-PYEQPSVSRGKMAEDY

Query:  AEAKRPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKL
         +A+R ++V +  + TYFLEFDGASKGNPGLAGAGA+LRA++G+TVCRLQEGVGIAT NVAEYRA+ILGLKHALK+G KHI V+GDSKLVCMQVQGLWKL
Subjt:  AEAKRPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKL

Query:  KNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK
        KN NMA LCKVAKELKDKFVSFEI+H+PR +NSDADALANRAIHL+DGVVVEDC HK
Subjt:  KNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK

A0A6J1D7M4 uncharacterized protein LOC1110183841.0e-10860.75Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQP------------
        MEGDKDAYYVVQKG V GFY+SLK+ EAQ G  IFDPNATIYKGYHLSKE EQYLASHGLQSATYSISAANVT+DLFGKLLAC YEQP            
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQP------------

Query:  -----------------------------------------------SVSRGKMAEDYAEAKRPQQ--------------------------VHETVD--
                                                       S SRGKMAE Y+ AKRP Q                           HETV+  
Subjt:  -----------------------------------------------SVSRGKMAEDYAEAKRPQQ--------------------------VHETVD--

Query:  -----------------------------STYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRG
                                      TYFLEFDGASKGNPGLAGAGAVLRA +G+TVCRLQEGVGIATNNVAEYRAVILGLKHALK+GFKHI V+G
Subjt:  -----------------------------STYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRG

Query:  DSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK
        DSKLVCMQVQGLWK+KNPNM  LCKVAKELKDKF SFEI+HIPREQNSDADALANRAIHLRDGVVVEDCKHK
Subjt:  DSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK

A0A6J1EJI9 uncharacterized protein LOC1114350646.0e-12279.44Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA
        MEGDKDAYYVV+KGDVFGFYRS K+LEAQAG LIFDPNATIYKGYHLSKE EQYLASHGL+SATYSISAANVT DLFGKLLACPYEQPS +RG+MAE+Y 
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA

Query:  EAKRPQQVHET-----------VDST--------------------YFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL
        +AKR + V  T            DS+                    YFLEFDGASKGNPGLAGAGAVLRA++GTT+CRLQEGVGIATNNVAEYRAVILGL
Subjt:  EAKRPQQVHET-----------VDST--------------------YFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL

Query:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK
        KHALKSGFKHI VRGDSKLVCMQVQGLWKLKN NMA LCKVAK+LKDKFVSFEINHIPREQNSDADALANRAIHLRDG+VVEDCKHK
Subjt:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK

A0A6J1JJ82 uncharacterized protein LOC1114863031.0e-12179.44Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA
        MEGDKDAYYVV+KGDVFGFYRS K+LEAQAG LIFDPNATIYKGYHLSKE EQYLASHGL+SATYSISAANVT DLFGKLLACPYEQPS +RGKMAE+Y 
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA

Query:  EAKRPQQVHET----VDS---------------------------TYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL
        +AKR + V  T    VD+                            YFLEFDGASKGNPGLAGAGAVLRA++GTT+CRLQEGVGIATNNVAEYRAVILGL
Subjt:  EAKRPQQVHET----VDS---------------------------TYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGL

Query:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK
        KHALKSGFKHI VRGDSKLVCMQVQGLWKLKN NMA LCKVAK+LKDKFVSFEINHIPREQNSDADALANRAI+LRDG+VVEDCKHK
Subjt:  KHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK

SwissProt top hitse value%identityAlignment
P54162 14.7 kDa ribonuclease H-like protein8.6e-0934.13Show/hide
Query:  DGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVS
        DGAS GNPG +G G  ++         +   +G+ TN  AE+ A+I G+K     G++ +  R DS +V  +   L  +KN       +    LK  F  
Subjt:  DGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVS

Query:  FEINHIPREQNSDADALANRAIHLRD
        F I  IP +QN  AD LA  AI L +
Subjt:  FEINHIPREQNSDADALANRAIHLRD

P64956 Uncharacterized protein Mb2253c1.3e-2044.44Show/hide
Query:  LEFDGASKGNPGLAGAGAVL-RASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKD
        +E DG S+GNPG AG GAV+  A + T +   ++ +G ATNNVAEYR +I GL  A+K G     V  DSKLV  Q+ G WK+K+P++  L   A+ L  
Subjt:  LEFDGASKGNPGLAGAGAVL-RASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKD

Query:  KFVSFEINHIPREQNSDADALANRAI
        +F       +PR +N+ AD LAN A+
Subjt:  KFVSFEINHIPREQNSDADALANRAI

P9WLH4 Uncharacterized protein MT22871.3e-2044.44Show/hide
Query:  LEFDGASKGNPGLAGAGAVL-RASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKD
        +E DG S+GNPG AG GAV+  A + T +   ++ +G ATNNVAEYR +I GL  A+K G     V  DSKLV  Q+ G WK+K+P++  L   A+ L  
Subjt:  LEFDGASKGNPGLAGAGAVL-RASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKD

Query:  KFVSFEINHIPREQNSDADALANRAI
        +F       +PR +N+ AD LAN A+
Subjt:  KFVSFEINHIPREQNSDADALANRAI

P9WLH5 Bifunctional protein Rv2228c1.3e-2044.44Show/hide
Query:  LEFDGASKGNPGLAGAGAVL-RASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKD
        +E DG S+GNPG AG GAV+  A + T +   ++ +G ATNNVAEYR +I GL  A+K G     V  DSKLV  Q+ G WK+K+P++  L   A+ L  
Subjt:  LEFDGASKGNPGLAGAGAVL-RASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKD

Query:  KFVSFEINHIPREQNSDADALANRAI
        +F       +PR +N+ AD LAN A+
Subjt:  KFVSFEINHIPREQNSDADALANRAI

Q9HSF6 Ribonuclease HI2.9e-2036.04Show/hide
Query:  EVEQYLASHGLQSATYSISAANVTKDLF----GKLLACPYEQPSVSRGKMAEDYAEAKRPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTV
        E +   A   L  A  S S  N   +L+    G   A  Y    V +G    D     +P +            FDGAS+GNPG A  G VL + +G  V
Subjt:  EVEQYLASHGLQSATYSISAANVTKDLF----GKLLACPYEQPSVSRGKMAEDYAEAKRPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTV

Query:  CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAI
            + +G ATNN AEY A+I  L+ A   GF  I +RGDS+LV  Q+ G W   +P++      A+EL   F  + I H+PR  N  ADALAN A+
Subjt:  CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAI

Arabidopsis top hitse value%identityAlignment
AT1G24090.1 RNase H family protein6.7e-6545.26Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAED--
        ++ +KDA++VV+KGDV G Y+ L D +AQ G  +FD   ++YKGY L K+ E+YL+S GL+   YS+ A+++  D+FG L  C +++P+    K++ED  
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAED--

Query:  --------------------YAEAKRPQQVHETV---DSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKS
                            Y   ++  +V  +    D T F+EFDGASKGNPGL+GA AVL+  +G+ +CR+++G+GIATNN AEY A+ILGLK+A++ 
Subjt:  --------------------YAEAKRPQQVHETV---DSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKS

Query:  GFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVV
        G+K+I V+GDSKLVCMQ++G WK+ +  +A L K AK L +K VSFEI+H+ R  N+DAD  AN A+ L +G V
Subjt:  GFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVV

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.2e-6747.16Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA
        ME +KDA+Y+V+KGD+ G YRSL + + QAG  +  P  ++YKGY   K  E  L+S G+++A +S++A++V  D FGKL+ CP +QPS S+G+     +
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA

Query:  EAKRPQQVHETVDSTY-----------------------------------FLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAV
         +KR Q +      ++                                    +EFDGASKGNPG AGAGAVLRAS+ + +  L+EGVG ATNNVAEYRA+
Subjt:  EAKRPQQVHETVDSTY-----------------------------------FLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAV

Query:  ILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDG
        +LGL+ AL  GFK++ V GDS LVCMQVQG WK  +P MA LCK AKEL + F +F+I HI RE+NS+AD  AN AI L DG
Subjt:  ILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDG

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.2e-6747.16Show/hide
Query:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA
        ME +KDA+Y+V+KGD+ G YRSL + + QAG  +  P  ++YKGY   K  E  L+S G+++A +S++A++V  D FGKL+ CP +QPS S+G+     +
Subjt:  MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYA

Query:  EAKRPQQVHETVDSTY-----------------------------------FLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAV
         +KR Q +      ++                                    +EFDGASKGNPG AGAGAVLRAS+ + +  L+EGVG ATNNVAEYRA+
Subjt:  EAKRPQQVHETVDSTY-----------------------------------FLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAV

Query:  ILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDG
        +LGL+ AL  GFK++ V GDS LVCMQVQG WK  +P MA LCK AKEL + F +F+I HI RE+NS+AD  AN AI L DG
Subjt:  ILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDG

AT5G51080.1 RNase H family protein6.5e-6045.93Show/hide
Query:  DKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYAEAK
        +KDA++VV+KGD+ G Y+ L D +AQ G  ++DP  ++YKGY L K+ E+ L++ GL+   Y   A ++ +D+FG L  C ++    S     E  AE  
Subjt:  DKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYAEAK

Query:  RPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPN
          +   +T   T  +EFDGASKGNPGL+GA AVL+  +G+ + ++++G+GIATNN AEY  +ILGLKHA++ G+  I V+ DSKLVCMQ++G WK+ +  
Subjt:  RPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPN

Query:  MANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVV
        ++ L K AK+L DK +SFEI+H+ R  NSDAD  AN A  L +G V
Subjt:  MANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVV

AT5G51080.2 RNase H family protein6.5e-6045.93Show/hide
Query:  DKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYAEAK
        +KDA++VV+KGD+ G Y+ L D +AQ G  ++DP  ++YKGY L K+ E+ L++ GL+   Y   A ++ +D+FG L  C ++    S     E  AE  
Subjt:  DKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYAEAK

Query:  RPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPN
          +   +T   T  +EFDGASKGNPGL+GA AVL+  +G+ + ++++G+GIATNN AEY  +ILGLKHA++ G+  I V+ DSKLVCMQ++G WK+ +  
Subjt:  RPQQVHETVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPN

Query:  MANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVV
        ++ L K AK+L DK +SFEI+H+ R  NSDAD  AN A  L +G V
Subjt:  MANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDGVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGAGATAAAGATGCCTACTATGTTGTACAGAAAGGGGATGTTTTTGGATTTTACAGGAGCTTGAAGGACCTCGAGGCTCAAGCTGGATATTTGATATTTGATCC
TAATGCAACGATCTACAAAGGGTATCACTTATCTAAAGAAGTAGAGCAGTACCTTGCATCACATGGACTTCAGAGTGCAACTTACTCTATAAGTGCTGCTAATGTGACAA
AGGATCTATTTGGAAAACTACTCGCTTGTCCTTATGAGCAGCCATCTGTTTCTAGAGGAAAAATGGCTGAAGATTACGCTGAAGCTAAGAGACCACAGCAAGTCCATGAG
ACTGTTGATAGTACCTATTTTCTTGAGTTTGATGGTGCTTCAAAGGGGAATCCTGGGCTAGCAGGTGCTGGAGCTGTTTTACGTGCTAGCAATGGAACTACGGTTTGTAG
GTTGCAAGAAGGGGTTGGGATTGCTACAAATAATGTTGCTGAATATCGTGCTGTAATTCTAGGACTGAAACATGCTCTAAAGAGTGGCTTTAAACACATTTGTGTGCGAG
GAGACTCCAAGCTTGTTTGTATGCAGGTTCAGGGTCTATGGAAGCTCAAAAATCCAAATATGGCTAATTTGTGTAAAGTGGCAAAGGAGCTCAAGGATAAGTTTGTGTCA
TTTGAGATCAACCATATCCCCAGGGAACAAAATTCCGATGCCGATGCCCTAGCAAACCGTGCTATACATCTCCGAGATGGAGTTGTAGTAGAAGACTGCAAGCATAAATG
A
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGAGATAAAGATGCCTACTATGTTGTACAGAAAGGGGATGTTTTTGGATTTTACAGGAGCTTGAAGGACCTCGAGGCTCAAGCTGGATATTTGATATTTGATCC
TAATGCAACGATCTACAAAGGGTATCACTTATCTAAAGAAGTAGAGCAGTACCTTGCATCACATGGACTTCAGAGTGCAACTTACTCTATAAGTGCTGCTAATGTGACAA
AGGATCTATTTGGAAAACTACTCGCTTGTCCTTATGAGCAGCCATCTGTTTCTAGAGGAAAAATGGCTGAAGATTACGCTGAAGCTAAGAGACCACAGCAAGTCCATGAG
ACTGTTGATAGTACCTATTTTCTTGAGTTTGATGGTGCTTCAAAGGGGAATCCTGGGCTAGCAGGTGCTGGAGCTGTTTTACGTGCTAGCAATGGAACTACGGTTTGTAG
GTTGCAAGAAGGGGTTGGGATTGCTACAAATAATGTTGCTGAATATCGTGCTGTAATTCTAGGACTGAAACATGCTCTAAAGAGTGGCTTTAAACACATTTGTGTGCGAG
GAGACTCCAAGCTTGTTTGTATGCAGGTTCAGGGTCTATGGAAGCTCAAAAATCCAAATATGGCTAATTTGTGTAAAGTGGCAAAGGAGCTCAAGGATAAGTTTGTGTCA
TTTGAGATCAACCATATCCCCAGGGAACAAAATTCCGATGCCGATGCCCTAGCAAACCGTGCTATACATCTCCGAGATGGAGTTGTAGTAGAAGACTGCAAGCATAAATG
A
Protein sequenceShow/hide protein sequence
MEGDKDAYYVVQKGDVFGFYRSLKDLEAQAGYLIFDPNATIYKGYHLSKEVEQYLASHGLQSATYSISAANVTKDLFGKLLACPYEQPSVSRGKMAEDYAEAKRPQQVHE
TVDSTYFLEFDGASKGNPGLAGAGAVLRASNGTTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNPNMANLCKVAKELKDKFVS
FEINHIPREQNSDADALANRAIHLRDGVVVEDCKHK