CuGenDBv2

Lag0038242 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038242
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr2:14183926..14186140
RNA-Seq Expression: Lag0038242
Synteny: Lag0038242
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
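The genome location field uses a `chromosome:start..end` string. A minimal sketch of how such a string could be split into its parts is given here; the function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The page itself does not state the convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

if __name__ == "__main__":
    print(parse_location("chr2:14183926..14186140"))
```

Under the inclusive-coordinate assumption, the locus listed above spans 2215 bases.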


Homology
GenBank top hits (e value, % identity, alignment)
XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus] (e value: 4.0e-37, % identity: 22.71)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLVSARLSSMEFGL-------------------------------FRNILQDFERAYGQSINYSK
        +NG+ +G + PS G+RQGDPLSPYLFLIC  G SALL  A   S   GL                                +NI   +    GQ IN++K
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLVSARLSSMEFGL-------------------------------FRNILQDFERAYGQSINYSK

Query:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPS--------TFHRGFPKVYC-LKS-RPSAQSFGG--VLMESNVECISNDGRICV----------
        S++ FS N   D R      ++M+ T+++ +Y GLP          F     KV+  L S RP   S GG  +L+++ ++ +      C           
Subjt:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPS--------TFHRGFPKVYC-LKS-RPSAQSFGG--VLMESNVECISNDGRICV----------

Query:  -------------SQRSISWDSFALIQD------------------------------------------------------------FWKGLVWGMDLL
                     S+R I W ++  I                                                               W+ +VWG  LL
Subjt:  -------------SQRSISWDSFALIQD------------------------------------------------------------FWKGLVWGMDLL

Query:  KCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPINSLVH---------GFGITMVKGS-
          GLR+ +GN QS   F+DPWL  P +F  I++     ++  V E+IT    W+   + +  +  D+ +I  +P++   H           G   VK   
Subjt:  KCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPINSLVH---------GFGITMVKGS-

Query:  ---TLLRVDLARNS-----SWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAILNPPMMRNLWD
           T L  D++ +S      WWK  W  ++P K+ +F W+ +H  +PT   L    + +  NCP+C    ++  HA+F C   +E+W ++  P +    +
Subjt:  ---TLLRVDLARNS-----SWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAILNPPMMRNLWD

Query:  QMDIKD--------------------RWQGFSE--EPLRRFRKANP--------------KGGSVVQ----TRDDIINLIHKSEK----TIMHTDASDKK
        ++  KD                     W  ++E  + + + ++  P              K   V +    +R D IN  H+ E+    + +  DA+  K
Subjt:  QMDIKD--------------------RWQGFSE--EPLRRFRKANP--------------KGGSVVQ----TRDDIINLIHKSEK----TIMHTDASDKK

Query:  --------GSLLEVHNLSTMVNKSPLE-------AEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFEYI
                 +++  +N +      PLE       AEA+A++ G++ A++       + +DS +L++ +  E +  + +   + + +  +  F  +   ++
Subjt:  --------GSLLEVHNLSTMVNKSPLE-------AEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFEYI

Query:  GRKFNVFAHRMACVGLSSSPCLWLDNYLEWM
         R  NV AH +A         L LD  + W+
Subjt:  GRKFNVFAHRMACVGLSSSPCLWLDNYLEWM

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 5.4e-58, % identity: 28.44)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALL----VSARLSSMEFG---------------------------LFRNILQDFERAYGQSINYSK
        +NG   G  +PS GIRQGDPLSPYLFL+C  GLSAL+     S RL+ + F                              R +L  + RA GQ IN+SK
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALL----VSARLSSMEFG---------------------------LFRNILQDFERAYGQSINYSK

Query:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRG----------------FPKVYCLKSRPSAQSFGGVLMESNVECISNDGRICVSQ--RS
        S ++FS NV  + +QYL  I+++++    G+Y GLPS F R                 +PK     +    + F   L+  +V        + VS+  + 
Subjt:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRG----------------FPKVYCLKSRPSAQSFGGVLMESNVECISNDGRICVSQ--RS

Query:  ISWDSFALIQD--------FWKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDV
          +   +L+Q         FWKG +WG DLL  GLR  +GN  +I  F DPWLP P+TFK +  +   L D TV  FIT    WD+  ++      D D+
Subjt:  ISWDSFALIQD--------FWKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDV

Query:  IKALPINS-------LVH-----------GFGITMVKGSTLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEE
        I ++PI+S       L H           G+ + M               + W  +WK+ VP K+K+F+W+S H  IPT  NL    +  L  C +C + 
Subjt:  IKALPINS-------LVH-----------GFGITMVKGSTLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEE

Query:  METTDHALFQCSRVREIWAILNPPM-MRNLWDQMDIKDRWQGFSE--EPLRRFRKANPKGGSVVQTRDDIIN-------------------------LIH
         E+  HA F C R R+IW  L P +   +  D +   + W   +E  EP +    A   G  +   R+ +I+                         + +
Subjt:  METTDHALFQCSRVREIWAILNPPM-MRNLWDQMDIKDRWQGFSE--EPLRRFRKANPKGGSVVQTRDDIIN-------------------------LIH

Query:  KSEKT-------------------IMHTDAS-------------DKKGSLLEVHNLSTMVNKSPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMI
         S +T                    ++TDA+             D   SL+   ++      SPL AE   +LEG++ A + N   L + SDSL  I++I
Subjt:  KSEKT-------------------IMHTDAS-------------DKKGSLLEVHNLSTMVNKSPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMI

Query:  KEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGLS--SSPCLWLDNYLEWMVSL
        + E+         + EI+     F  ++F +  R+ N  AH +A  G++  S+   WL N+  W++ L
Subjt:  KEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGLS--SSPCLWLDNYLEWMVSL

XP_030478262.1 uncharacterized protein LOC115695328 [Cannabis sativa] (e value: 1.6e-38, % identity: 28.19)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLVSARLSSMEFGL-------------------------------FRNILQDFERAYGQSINYSK
        +NGE  G ++PS G+RQGDPLSPYLFLIC+ GLS LL S   S+   GL                                  +L  + +A GQ +N +K
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLVSARLSSMEFGL-------------------------------FRNILQDFERAYGQSINYSK

Query:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRGFPKVYC-LKSR---------PSAQSFGG--VLMESNVECISNDGRICV----------
        S++ FS N    ++ +    + M +++    Y GLP+   R   +++  +K R             S GG  VL+++ V+ I      C           
Subjt:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRGFPKVYC-LKSR---------PSAQSFGG--VLMESNVECISNDGRICV----------

Query:  -------------SQRSISWDSFALIQDF---------------------------WKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQ
                         I W S+ L+                              W+G+ WG +LL  GLR  +GN   +    D W+P    FK +S 
Subjt:  -------------SQRSISWDSFALIQDF---------------------------WKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQ

Query:  SEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPINSLVHG---------FGITMVKGSTLLRVDLA--RNSS-------WWKRVWKMRVPNK
        S P   +  V+ FIT   +W++P LN Y   IDVD I A+P++   +           GI  V     L  +LA  RNSS       WWK  W + +P+K
Subjt:  SEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPINSLVHG---------FGITMVKGSTLLRVDLA--RNSS-------WWKRVWKMRVPNK

Query:  VKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIW
        VK+F W+   N++P  T L    V     C +C    E+  HA+F C++ + +W
Subjt:  VKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIW

XP_030483444.1 uncharacterized protein LOC115700033 [Cannabis sativa] (e value: 1.6e-41, % identity: 24.88)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLI-CTGLSALLVSAR----------------LSSMEFG-----LFR-------NILQDFE---RAYGQSINYSK
        +NG+ +G + P+ G+RQGDPLSPYLFLI   G SAL+  A                 +S + F      +F+       +I ++F       GQ INY+K
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLI-CTGLSALLVSAR----------------LSSMEFG-----LFR-------NILQDFE---RAYGQSINYSK

Query:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRGFPKVYCLKSRPSAQSFGGVLMESNVECISNDGRICVSQRSISWDSFALIQDFWKGLVW
        S+++FS N P   R    + ++M +T+ + +Y GLP    R      C+ ++        VL            R    Q  +     +     W+G+VW
Subjt:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRGFPKVYCLKSRPSAQSFGGVLMESNVECISNDGRICVSQRSISWDSFALIQDFWKGLVW

Query:  GMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPINSLVHG--------------
        G +LL  GLR+ +GN  +  +F+DPW+P P +F  I++    L    V+E I    QW+   +++     D  +I ++PI    H               
Subjt:  GMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPINSLVHG--------------

Query:  --FGITMVKGSTLLRVDLARN--SSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAILNPPMM
           G  +  G   L+   +    ++WW   W +++P K+  F WK FH  +PT ++L+        NCP+     +T  H +F CS  + IW  ++ P +
Subjt:  --FGITMVKGSTLLRVDLARN--SSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAILNPPMM

Query:  RNLWDQMDIKDRWQGFSE----EPLRR---------FRKANPKGGSVVQT-----------RDDIINLIHKSEKTIMHTDASDKKGSLLEVHNLSTMVNK
         +  + +  K      SE    E   +         F +     G  ++T            ++I    +K++K +     S++  +  EV +    V+ 
Subjt:  RNLWDQMDIKDRWQGFSE----EPLRR---------FRKANPKGGSVVQT-----------RDDIINLIHKSEKTIMHTDASDKKGSLLEVHNLSTMVNK

Query:  ----------------------------------SPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLT
                                          S  +AEAVA+L G+  A+ + +    +FSDSL+L+  ++ +    + +     +IK  L+SF  ++
Subjt:  ----------------------------------SPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLT

Query:  FEYIGRKFNVFAHRMACVGLSSSPCLWLDNYLEWM
          ++ R FNV AHR+A         L +DN L WM
Subjt:  FEYIGRKFNVFAHRMACVGLSSSPCLWLDNYLEWM

XP_030508858.1 uncharacterized protein LOC115723499 [Cannabis sativa] (e value: 2.4e-37, % identity: 27.65)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLVSARLSSMEFGL-------------------------------FRNILQDFERAYGQSINYSK
        +NG+  G + P+ GIRQGDPLSPYLFLIC  GLS LL S        GL                                + +L  + RA GQ+IN  K
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLVSARLSSMEFGL-------------------------------FRNILQDFERAYGQSINYSK

Query:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPS--------TFHRGFPKVYCLKSRPSAQSF--GG--VLMESNVECISNDGRICV----------
         ++ FS N    ++ +   +++M +      Y GLPS         F     K++ L S    Q F  GG  VL+++ V+ I      C           
Subjt:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPS--------TFHRGFPKVYCLKSRPSAQSF--GG--VLMESNVECISNDGRICV----------

Query:  -SQRSISW-----DSFALIQD------------------------------FWKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPM
         S  +  W      ++ L++                                W+ +VWG +LL  GLR  +G+   I    DPWLP  + F   S  +  
Subjt:  -SQRSISW-----DSFALIQD------------------------------FWKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPM

Query:  LKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPI------NSLVHG---FGITMVKGSTLLRVDLARN---------SSWWKRVWKMRVPNKVKLF
             V + I    QWD+  ++    Q ++D I ++P+      ++L+      G  MVK        LA +          +WW + WK+++P+K+++F
Subjt:  LKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPI------NSLVHG---FGITMVKGSTLLRVDLARN---------SSWWKRVWKMRVPNKVKLF

Query:  VWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAI
        VWK FHN+IP    L   H+     CP+C+++ ET  HALF C+R +E+W +
Subjt:  VWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAI

TrEMBL top hits (e value, % identity, alignment)
A0A2N9J2A6 Reverse transcriptase domain-containing protein (e value: 5.9e-42, % identity: 29.7)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLVSARLSSM-------------------------------EFGLFRNILQDFERAYGQSINYSK
        +NGE  GF+KP+ G+RQ DPLSP+LFLIC  GLSALL  A   S+                               E G  ++IL  +ERA GQ IN  K
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLVSARLSSM-------------------------------EFGLFRNILQDFERAYGQSINYSK

Query:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRG------------FPKVYCLKSRPSAQSFGGVLMESNVECISNDGRICVSQRSISWDSF
        +++ FS N PLD+R  + N+    VT     Y GLP    R               ++   K +  +Q+   VL+++ ++ I      C    +   D  
Subjt:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRG------------FPKVYCLKSRPSAQSFGGVLMESNVECISNDGRICVSQRSISWDSF

Query:  -ALIQDFWKGLVWGM----------------------DLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFI-TPSLQWDIPKLN
         ++   FW    WG                       + LK GLR  +G+ +SI+I+ D WLP PST++VIS    + + ATV + I + S+ W+   ++
Subjt:  -ALIQDFWKGLVWGM----------------------DLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFI-TPSLQWDIPKLN

Query:  KYLVQIDVDVIKALPINS-------LVHGFGITMVKGSTLLRVDLARN--------------SSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHH
        +     D+++I ++P++        + HG     +   +  +V L+R               S  W  VW  +VP KV+LF+WK+    +PT   L+   
Subjt:  KYLVQIDVDVIKALPINS-------LVHGFGITMVKGSTLLRVDLARN--------------SSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHH

Query:  VPVLGNCPMCQEEMETTDHALFQCSRVREIW
        +  L  CP C EE ET DH L+ C  V+++W
Subjt:  VPVLGNCPMCQEEMETTDHALFQCSRVREIW

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 2.6e-58, % identity: 28.44)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALL----VSARLSSMEFG---------------------------LFRNILQDFERAYGQSINYSK
        +NG   G  +PS GIRQGDPLSPYLFL+C  GLSAL+     S RL+ + F                              R +L  + RA GQ IN+SK
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALL----VSARLSSMEFG---------------------------LFRNILQDFERAYGQSINYSK

Query:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRG----------------FPKVYCLKSRPSAQSFGGVLMESNVECISNDGRICVSQ--RS
        S ++FS NV  + +QYL  I+++++    G+Y GLPS F R                 +PK     +    + F   L+  +V        + VS+  + 
Subjt:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRG----------------FPKVYCLKSRPSAQSFGGVLMESNVECISNDGRICVSQ--RS

Query:  ISWDSFALIQD--------FWKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDV
          +   +L+Q         FWKG +WG DLL  GLR  +GN  +I  F DPWLP P+TFK +  +   L D TV  FIT    WD+  ++      D D+
Subjt:  ISWDSFALIQD--------FWKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDV

Query:  IKALPINS-------LVH-----------GFGITMVKGSTLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEE
        I ++PI+S       L H           G+ + M               + W  +WK+ VP K+K+F+W+S H  IPT  NL    +  L  C +C + 
Subjt:  IKALPINS-------LVH-----------GFGITMVKGSTLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEE

Query:  METTDHALFQCSRVREIWAILNPPM-MRNLWDQMDIKDRWQGFSE--EPLRRFRKANPKGGSVVQTRDDIIN-------------------------LIH
         E+  HA F C R R+IW  L P +   +  D +   + W   +E  EP +    A   G  +   R+ +I+                         + +
Subjt:  METTDHALFQCSRVREIWAILNPPM-MRNLWDQMDIKDRWQGFSE--EPLRRFRKANPKGGSVVQTRDDIIN-------------------------LIH

Query:  KSEKT-------------------IMHTDAS-------------DKKGSLLEVHNLSTMVNKSPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMI
         S +T                    ++TDA+             D   SL+   ++      SPL AE   +LEG++ A + N   L + SDSL  I++I
Subjt:  KSEKT-------------------IMHTDAS-------------DKKGSLLEVHNLSTMVNKSPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMI

Query:  KEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGLS--SSPCLWLDNYLEWMVSL
        + E+         + EI+     F  ++F +  R+ N  AH +A  G++  S+   WL N+  W++ L
Subjt:  KEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGLS--SSPCLWLDNYLEWMVSL

A0A803NM27 Uncharacterized protein (e value: 2.0e-37, % identity: 22.42)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLV----------------SARLSSMEF---------------GLFRNILQDFERAYGQSINYSK
        +NG   G +KP  G+RQGDPLSPYLFLIC+ GLS LL                 S  +S + F               G  + +L  + +A GQ +N  K
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSALLV----------------SARLSSMEF---------------GLFRNILQDFERAYGQSINYSK

Query:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRGFPKVY-CLKSR---------PSAQSFGG--VLMESNVECISNDGRICV----------
        S++ FS N    S+Q   NI+ M + +   SY GLP+   R   +++  +K R             S GG  VL+++ ++ I      C           
Subjt:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRGFPKVY-CLKSR---------PSAQSFGG--VLMESNVECISNDGRICV----------

Query:  -------------SQRSISWDSFALI------------------------------------------------QDF------------WKGLVWGMDLL
                      ++ I W  +  +                                                 DF            W+G  WG +LL
Subjt:  -------------SQRSISWDSFALI------------------------------------------------QDF------------WKGLVWGMDLL

Query:  KCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPINSLVH-----------------GFG
        K GLR  +GN   I    DPW+P  S F  I  +       TV ++ITP  +W++ KLN      DV+ I +LP++   H                 G+ 
Subjt:  KCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKALPINSLVH-----------------GFG

Query:  ITMVKGSTLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAILNPPMMRNLWDQ
        +  +          + N SWWK  W++++P KVKLF WK+ HN++P  + L+        +C +C    E+  HA+F C   R +W I            
Subjt:  ITMVKGSTLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAILNPPMMRNLWDQ

Query:  MDIKD--------------------RWQGFSEEPLRRFRKANPKGGSVVQTR-DDIINLIHKSEKTIMHT------------------------------
        M I+D                     W  +S+       K  P+  SV+  +    ++    +++  +H                               
Subjt:  MDIKD--------------------RWQGFSEEPLRRFRKANPKGGSVVQTR-DDIINLIHKSEKTIMHT------------------------------

Query:  DASDKK---GSLL--EVHNLSTMVNK------SPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFE
        D + K+   G+++     N+   ++        P + EA  +   ++ AR LN     + +DSL L+  +++           +++++T L+    +   
Subjt:  DASDKK---GSLL--EVHNLSTMVNK------SPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFE

Query:  YIGRKFNVFAHRMACVGL-SSSPCLWLDNYLEWMVSLSLQERSLF
        ++ R  N  AH +A   L   + C WL+++   ++S+ +++ SLF
Subjt:  YIGRKFNVFAHRMACVGL-SSSPCLWLDNYLEWMVSLSLQERSLF

A0A803PC16 Uncharacterized protein (e value: 2.3e-38, % identity: 25.55)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSAL-------------------------------LVSARLSSMEFGLFRNILQDFERAYGQSINYSK
        +NG + G + P+ GIRQGDPLSP+LF++C  GLS L                               LV  R ++      +  L  + RA GQS+N  K
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT-GLSAL-------------------------------LVSARLSSMEFGLFRNILQDFERAYGQSINYSK

Query:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPS--------TFHRGFPKVYCLKSRPSAQ--SFGG---VLMESNVECISNDGRI---CVSQRSIS
        S++ FS N  L  +  +  I  M +      Y GLPS         F     K++ L S    Q  S GG   +L +     +SN   +    +  R   
Subjt:  SMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPS--------TFHRGFPKVYCLKSRPSAQ--SFGG---VLMESNVECISNDGRI---CVSQRSIS

Query:  WDSF--ALIQDF----WKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKAL
          SF  A +  +    W+G+VWG +LL  GLR  +GN + I     PWLP  ++FK + Q         V++ I    QW+   LN   +  DV +I+++
Subjt:  WDSF--ALIQDF----WKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFITPSLQWDIPKLNKYLVQIDVDVIKAL

Query:  PINSLVH---------GFGITMVKGSTLLRVDLARN---------SSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETT
        P+  + H           G+  VK   LL   L              WWK+ W +++P+K+++F+W++ H+ +P    L   H+    +C +C    ET 
Subjt:  PINSLVH---------GFGITMVKGSTLLRVDLARN---------SSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETT

Query:  DHALFQCSRVREIWAILNPPMMRNLWDQMDIKDRWQGFSEEPLRRFRKANPKGGSVVQTRDDIINLI-HKSEKTIMHTDASDKKGSLLEVHNLSTMVNKS
         HALF C R          P     +  + + +   G +    +     NP  G +    D  IN + +KS    +  D S +  + +           +
Subjt:  DHALFQCSRVREIWAILNPPMMRNLWDQMDIKDRWQGFSEEPLRRFRKANPKGGSVVQTRDDIINLI-HKSEKTIMHTDASDKKGSLLEVHNLSTMVNKS

Query:  PLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGLS-SSPCLWLD
        P   E +A++  ++  + L +    I +DSL+++K +    +        L  I   +++F      ++ R  N  AH +A   LS  S C W++
Subjt:  PLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGLS-SSPCLWLD

A0A803QP51 Uncharacterized protein (e value: 6.7e-38, % identity: 24.75)
Query:  GFIKPSLGIRQGDPLSPYLFLICT-GLSALL-------------VSAR---LSSMEF---------------GLFRNILQDFERAYGQSINYSKSMVIFS
        G I P  G+RQGDPL PY+FLIC+ GLS LL             VS R   +S + F               G  +  L  + RA GQ +N  KS++ FS
Subjt:  GFIKPSLGIRQGDPLSPYLFLICT-GLSALL-------------VSAR---LSSMEF---------------GLFRNILQDFERAYGQSINYSKSMVIFS

Query:  RNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRGFP--------------------------KVYCLKSRPSAQSFGGVLMESNVECISNDGRI----
         N P   +    NI+ M ++     Y GLP+   R                             K+ C         F  V+  +    +    RI    
Subjt:  RNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRGFP--------------------------KVYCLKSRPSAQSFGGVLMESNVECISNDGRI----

Query:  ------CVSQRSISWDSFALIQD------FWKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDA--TVTEFITPSLQWDIPK
               +  R  S + F   Q        W+G+ WG DLL  GLR  +G+  S+    DPW+P  S F  I      L D    V+ +I+   +W++  
Subjt:  ------CVSQRSISWDSFALIQD------FWKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDA--TVTEFITPSLQWDIPK

Query:  LNKYLVQIDVDVIKALPINS-------LVH-----------GFGITMVKGSTLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHV
        L +    +D+D I ++P+ S       ++H           G+ +      + L        +WWK  W +++ +KVK+FVWK+FH +IPT  +L N  +
Subjt:  LNKYLVQIDVDVIKALPINS-------LVH-----------GFGITMVKGSTLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHV

Query:  PVLGNCPMCQEEMETTDHALFQCSRVREIWAILNPPMMRNLWDQMDIKDRWQGFSEEPLRRFRKANPKGGSVVQTRDDIINLIHKSEKTIMHTDASDKKG
             C +C    E+ +HALF     +++W         +      IKD     S       +  +P    +    D   ++ H   K  +     +  G
Subjt:  PVLGNCPMCQEEMETTDHALFQCSRVREIWAILNPPMMRNLWDQMDIKDRWQGFSEEPLRRFRKANPKGGSVVQTRDDIINLIHKSEKTIMHTDASDKKG

Query:  SLLEVHNLSTMVNKSPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGL
         ++  ++          E EA A+L G+  A   N+    + SDSL L+  I  +          + +IK  L+   ++   Y+ R  N  AH +A   L
Subjt:  SLLEVHNLSTMVNKSPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGL

Query:  -SSSPCLWLD
             C+W +
Subjt:  -SSSPCLWLD

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 2.6e-07, % identity: 22.77)
Query:  WKGLVWGM-DLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFI-TPSLQWDIPKLNKYLVQIDVDVIKALPINSLVHGF-----
        W+ +  G+ D++  G+    G+ Q I  + D W+      ++ +   P   D  V + +  P   WD  K++ Y        ++A+ ++ LV G      
Subjt:  WKGLVWGM-DLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFI-TPSLQWDIPKLNKYLVQIDVDVIKALPINSLVHGF-----

Query:  ------GITMVKGS----TLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAIL
              G   V+ +    T+  V     +S++  +WK+RVP +VK F+W   + ++ T       H+     C +C+  +E+  H L  C     IW  +
Subjt:  ------GITMVKGS----TLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIWAIL

Query:  NP
         P
Subjt:  NP

P92555 Uncharacterized mitochondrial protein AtMg01250 (e value: 2.7e-04, % identity: 63.33)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT
        +NG   G + PS G+RQGDPLSPYLF++CT
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT
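The % identity column can be checked against the alignment midline: letters on the middle line mark identical residues, while `+` marks a conservative substitution and a blank marks a mismatch or gap. The sketch below assumes that convention (the usual BLAST-style one) and uses a hypothetical helper name, `percent_identity`; it is an illustration, not the database's own calculation.

```python
def percent_identity(query, midline):
    """Estimate % identity from one alignment segment.

    Assumptions: the midline shows a letter at every identical
    position, and every column of the query line (including gap
    characters) counts toward the alignment length.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment segment")
    # Trailing blanks of the midline are often trimmed; pad them back.
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for char in padded if char.isalpha())
    return round(100.0 * identical / columns, 2)

if __name__ == "__main__":
    query = "MNGESFGFIKPSLGIRQGDPLSPYLFLICT"
    midline = "+NG   G + PS G+RQGDPLSPYLF++CT"
    print(percent_identity(query, midline))  # 63.33
```

For the P92555 segment this gives 19 identical positions over 30 columns, which agrees with the 63.33 reported for that hit.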

Arabidopsis top hits (e value, % identity, alignment)
AT1G10000.1 Ribonuclease H-like superfamily protein (e value: 1.8e-06, % identity: 21.85)
Query:  KVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIW------------------------------------AILNPPMMR
        K+KLF+WK+   ++P    L   H+    +C  C    ET+ H LF C    ++W                                    A L P +  
Subjt:  KVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIW------------------------------------AILNPPMMR

Query:  NLWDQMD------------------IKD--RWQGFSEEPLRRFRKANPKGGSVVQTRDDII----------NLIHKSEKTIMHTDASDKKGSLLEVHNLS
        ++W   +                  ++D   WQ  ++  L + R A P+  S      D +          + +  S      T  S+K     E+   S
Subjt:  NLWDQMD------------------IKD--RWQGFSEEPLRRFRKANPKGGSVVQTRDDII----------NLIHKSEKTIMHTDASDKKGSLLEVHNLS

Query:  TMVNK--SPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGLSSSPCLW
            +  SPL AEA A+   +  A  L    L + SDS +++  +   +   + I   L EI++  N F++++F++I R  N  A   A + L  S  + 
Subjt:  TMVNK--SPLEAEAVAVLEGIRLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGLSSSPCLW

Query:  LD
        LD
Subjt:  LD

AT3G25270.1 Ribonuclease H-like superfamily protein (e value: 4.6e-07, % identity: 31.25)
Query:  RVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIW
        ++WK++   K+K F+WK    ++ T  NL   H+     C  C +E ET+ H  F C   +++W
Subjt:  RVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDHALFQCSRVREIW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 1.9e-05, % identity: 63.33)
Query:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT
        +NG   G + PS G+RQGDPLSPYLF++CT
Subjt:  MNGESFGFIKPSLGIRQGDPLSPYLFLICT


Sequences
CDS sequence
ATGAATGGGGAATCATTTGGGTTCATCAAACCGTCTCTTGGGATTAGGCAAGGGGATCCTTTGTCACCTTACCTTTTCCTCATTTGTACAGGTCTCTCTGCTCTTTTGGT
TTCAGCTAGATTAAGTTCCATGGAGTTTGGACTATTTCGGAATATTTTACAGGATTTTGAAAGAGCATATGGCCAGTCGATCAACTATTCCAAATCAATGGTGATATTCT
CAAGAAATGTTCCGCTGGATTCAAGGCAATATCTGGGTAACATTGTCTCAATGAGAGTGACTAAATCATTAGGATCATACCAAGGACTGCCATCTACCTTTCACAGAGGA
TTCCCAAAGGTATACTGCTTAAAATCTCGGCCCTCTGCGCAAAGTTTTGGTGGGGTTCTAATGGAGAGCAACGTCGAATGCATTAGCAACGATGGGAGAATTTGTGTCAG
CCAAAGGAGTATTAGCTGGGACAGTTTCGCCCTCATCCAAGACTTTTGGAAAGGGTTGGTTTGGGGGATGGATCTTTTAAAGTGTGGTCTTAGGAAGAATTTAGGGAACG
ACCAGTCAATTTATATTTTCCAGGATCCATGGCTCCCTTGCCCGTCTACTTTTAAGGTGATCTCCCAATCTGAACCAATGTTGAAGGATGCGACTGTGACGGAATTTATT
ACGCCATCTCTTCAATGGGACATACCCAAACTTAACAAATATTTGGTGCAAATAGATGTGGATGTCATTAAAGCCTTGCCGATTAATAGTTTAGTACATGGATTTGGCAT
TACGATGGTAAAGGGGAGTACTCTGTTAAGAGTGGATTTGGCCAGGAATAGCTCTTGGTGGAAAAGAGTATGGAAGATGAGAGTTCCTAATAAAGTTAAACTTTTCGTCT
GGAAATCTTTTCATAACTCAATTCCGACCATGACCAATCTCTGGAATCATCATGTTCCTGTTTTGGGGAACTGTCCGATGTGCCAGGAAGAGATGGAAACTACGGATCAT
GCATTGTTCCAATGTTCGAGGGTTAGGGAGATTTGGGCCATTCTTAATCCGCCAATGATGAGGAACCTCTGGGACCAAATGGATATCAAAGATCGGTGGCAAGGATTTTC
TGAAGAACCATTACGGAGGTTCCGGAAGGCTAATCCAAAAGGTGGATCTGTTGTTCAGACGAGGGATGATATCATTAATCTTATTCATAAGAGCGAAAAGACTATTATGC
ATACAGATGCTTCTGATAAAAAGGGTTCTCTTCTGGAAGTGCATAACTTATCCACTATGGTGAACAAGTCTCCCTTAGAAGCGGAAGCGGTGGCGGTCCTAGAAGGGATA
CGTCTGGCCCGAAGTTTGAATGTGGTTTGTTTATCTATTTTCTCAGATTCTCTGGCATTGATTAAGATGATTAAAGAGGAAATGCAGGGGGAGGACTGCATTGCAGCGAC
TTTATGGGAGATCAAAACAAGTCTAAATTCCTTTCAGAATTTAACTTTTGAGTATATTGGTCGGAAATTTAATGTATTTGCTCATAGAATGGCTTGTGTAGGTTTATCAT
CTAGTCCATGTTTGTGGTTAGATAATTATCTTGAGTGGATGGTGTCATTATCGCTTCAAGAGCGATCGTTATTTTGTACCCCGGGGTCCTGA
mRNA sequence
ATGAATGGGGAATCATTTGGGTTCATCAAACCGTCTCTTGGGATTAGGCAAGGGGATCCTTTGTCACCTTACCTTTTCCTCATTTGTACAGGTCTCTCTGCTCTTTTGGT
TTCAGCTAGATTAAGTTCCATGGAGTTTGGACTATTTCGGAATATTTTACAGGATTTTGAAAGAGCATATGGCCAGTCGATCAACTATTCCAAATCAATGGTGATATTCT
CAAGAAATGTTCCGCTGGATTCAAGGCAATATCTGGGTAACATTGTCTCAATGAGAGTGACTAAATCATTAGGATCATACCAAGGACTGCCATCTACCTTTCACAGAGGA
TTCCCAAAGGTATACTGCTTAAAATCTCGGCCCTCTGCGCAAAGTTTTGGTGGGGTTCTAATGGAGAGCAACGTCGAATGCATTAGCAACGATGGGAGAATTTGTGTCAG
CCAAAGGAGTATTAGCTGGGACAGTTTCGCCCTCATCCAAGACTTTTGGAAAGGGTTGGTTTGGGGGATGGATCTTTTAAAGTGTGGTCTTAGGAAGAATTTAGGGAACG
ACCAGTCAATTTATATTTTCCAGGATCCATGGCTCCCTTGCCCGTCTACTTTTAAGGTGATCTCCCAATCTGAACCAATGTTGAAGGATGCGACTGTGACGGAATTTATT
ACGCCATCTCTTCAATGGGACATACCCAAACTTAACAAATATTTGGTGCAAATAGATGTGGATGTCATTAAAGCCTTGCCGATTAATAGTTTAGTACATGGATTTGGCAT
TACGATGGTAAAGGGGAGTACTCTGTTAAGAGTGGATTTGGCCAGGAATAGCTCTTGGTGGAAAAGAGTATGGAAGATGAGAGTTCCTAATAAAGTTAAACTTTTCGTCT
GGAAATCTTTTCATAACTCAATTCCGACCATGACCAATCTCTGGAATCATCATGTTCCTGTTTTGGGGAACTGTCCGATGTGCCAGGAAGAGATGGAAACTACGGATCAT
GCATTGTTCCAATGTTCGAGGGTTAGGGAGATTTGGGCCATTCTTAATCCGCCAATGATGAGGAACCTCTGGGACCAAATGGATATCAAAGATCGGTGGCAAGGATTTTC
TGAAGAACCATTACGGAGGTTCCGGAAGGCTAATCCAAAAGGTGGATCTGTTGTTCAGACGAGGGATGATATCATTAATCTTATTCATAAGAGCGAAAAGACTATTATGC
ATACAGATGCTTCTGATAAAAAGGGTTCTCTTCTGGAAGTGCATAACTTATCCACTATGGTGAACAAGTCTCCCTTAGAAGCGGAAGCGGTGGCGGTCCTAGAAGGGATA
CGTCTGGCCCGAAGTTTGAATGTGGTTTGTTTATCTATTTTCTCAGATTCTCTGGCATTGATTAAGATGATTAAAGAGGAAATGCAGGGGGAGGACTGCATTGCAGCGAC
TTTATGGGAGATCAAAACAAGTCTAAATTCCTTTCAGAATTTAACTTTTGAGTATATTGGTCGGAAATTTAATGTATTTGCTCATAGAATGGCTTGTGTAGGTTTATCAT
CTAGTCCATGTTTGTGGTTAGATAATTATCTTGAGTGGATGGTGTCATTATCGCTTCAAGAGCGATCGTTATTTTGTACCCCGGGGTCCTGA
Protein sequence
MNGESFGFIKPSLGIRQGDPLSPYLFLICTGLSALLVSARLSSMEFGLFRNILQDFERAYGQSINYSKSMVIFSRNVPLDSRQYLGNIVSMRVTKSLGSYQGLPSTFHRG
FPKVYCLKSRPSAQSFGGVLMESNVECISNDGRICVSQRSISWDSFALIQDFWKGLVWGMDLLKCGLRKNLGNDQSIYIFQDPWLPCPSTFKVISQSEPMLKDATVTEFI
TPSLQWDIPKLNKYLVQIDVDVIKALPINSLVHGFGITMVKGSTLLRVDLARNSSWWKRVWKMRVPNKVKLFVWKSFHNSIPTMTNLWNHHVPVLGNCPMCQEEMETTDH
ALFQCSRVREIWAILNPPMMRNLWDQMDIKDRWQGFSEEPLRRFRKANPKGGSVVQTRDDIINLIHKSEKTIMHTDASDKKGSLLEVHNLSTMVNKSPLEAEAVAVLEGI
RLARSLNVVCLSIFSDSLALIKMIKEEMQGEDCIAATLWEIKTSLNSFQNLTFEYIGRKFNVFAHRMACVGLSSSPCLWLDNYLEWMVSLSLQERSLFCTPGS
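The CDS and protein sequences listed above can be cross-checked by translating the CDS in frame. The following is a minimal sketch that assumes the standard nuclear genetic code and reading frame 0; the names `CODON_TABLE` and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest). Assumption: the standard nuclear
# code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS from frame 0, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from the pasted sequence.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

if __name__ == "__main__":
    # First seven codons of the CDS listed on this page.
    print(translate("ATGAATGGGGAATCATTTGGG"))  # MNGESFG
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick way to confirm that the two records belong together.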