; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy04g002960 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy04g002960
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr04:23476171..23480700
RNA-Seq ExpressionLcy04g002960
SyntenyLcy04g002960
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4303756.1 unnamed protein product [Prunus armeniaca]1.8e-3628.64Show/hide
Query:  SPYQFLVCVEGSRFPVIVHPTLKGRRVNELLLDNGSWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKE-----
        +PY F         P +VH       ++ LL    +W +D+I   F+P +   IL+IP+S     D++IW+    G ++V+S Y LA ++ +  E     
Subjt:  SPYQFLVCVEGSRFPVIVHPTLKGRRVNELLLDNGSWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKE-----

Query:  -NSGSCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMV
            S S  QSS W S+W ++  P+ K  +W+   N +  + N++ +R+     C  C    E+ +HIF+ C  AR     +F + + L  S  A  + +
Subjt:  -NSGSCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMV

Query:  DCWD-----IIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQ-LLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWN
          W      +  S H+    +     LW IW  RN A     +AD  + +L  +K+  E +  +       +Q  S     +W   P   LK+N DA+W+
Subjt:  DCWD-----IIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQ-LLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWN

Query:  DDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVS
             GGVGW+IRDS G L+ A  +      S  ++E+  I+  L     +NF   + +IVE+D+ + I +LN      S+ + +V DI  L A +  VS
Subjt:  DDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVS

Query:  VVKCPRQSNRVA
         V  PR  N+ A
Subjt:  VVKCPRQSNRVA

CAB4303756.1 unnamed protein product [Prunus armeniaca]1.4e-1248.65Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG
        M+KA+D VEW F+  M+ K+GF  +  + I  C+ +VS+SV+ING    +  P+RG+RQGDPLSP+ FL+C +G
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG

OMO52016.1 reverse transcriptase [Corchorus capsularis]1.4e-3624.01Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--SRF------------------PVI
        +SKAYD VEW F+ + +  MGF +R I+ +M CVE+VS+SV++N +  E F PTRGIRQGDPLSPY FL+C+EG  S F                  P +
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--SRF------------------PVI

Query:  VH----------------------------PTLKGRRVN------------ELLLDN-------------------------------------------
         H                                G+++N             L + N                                           
Subjt:  VH----------------------------PTLKGRRVN------------ELLLDN-------------------------------------------

Query:  ------------------GSWNEDMIKSLFI-------------------PADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENS
                          G  N D+ K  +I                     D   IL + +     +D +IWN    G FSVKSAY++A ++++  +  
Subjt:  ------------------GSWNEDMIKSLFI-------------------PADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENS

Query:  GSCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCW
         S   P+   W  +W   I+P+     W++I N +P+K N+  + +D+  +CE+C    ES  H+F+RCK +  V +   P +   L    ++    D W
Subjt:  GSCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCW

Query:  D-IIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGV
        D  I    S  + +K+ + LW IW+ RN+A           L +Q+                            W   P+  +K+N+D ++       G+
Subjt:  D-IIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGV

Query:  GWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQS
        G +IRDS+G +++   R +     +     AE+   L  +  +   G    I E+D++  I  +N +     E   L+ +I DLA   +  +     R++
Subjt:  GWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQS

Query:  NRVA
        N +A
Subjt:  NRVA

OMO88292.1 reverse transcriptase [Corchorus capsularis]1.4e-3625.28Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVE--------------------------G
        +SKAYD VEW+F+ E +  MGF  + ++ +M+CV SVS+SV++N E  + F P+RG+ QGDPLSPY F++C+E                          G
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVE--------------------------G

Query:  SRFPVIVHPTLKGRRVNE--------------------------------------------LLLDNGSWNEDMIKSLFIPADVEDILAIPLSNGRDKDE
        S    +    L  R+V E                                            +  +N SWN DM+ + F   D   IL++ +      D 
Subjt:  SRFPVIVHPTLKGRRVNE--------------------------------------------LLLDNGSWNEDMIKSLFIPADVEDILAIPLSNGRDKDE

Query:  IIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSSR---WTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFW
        +IWN +  G FSV+SAY++A  ++      G    P  SR   W  IW  ++ P+ K   W+++ N +P+K  ++ + +D++  CE+C   + S  H+F+
Subjt:  IIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSSR---WTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFW

Query:  RCKRARGVRNS---FFPNLMDLLFSGRANLEMVDCWD-IIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEI
         C+ +  V  S   + P  ++   S        D W+  +    +    +K   +LW +   RN++      +    L++ +  ++E Q    + +    
Subjt:  RCKRARGVRNS---FFPNLMDLLFSGRANLEMVDCWD-IIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEI

Query:  QLESLRNHE--NWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIK
         +++L  H+   W+  P   LK+NSDAS        G+G +IR+S+G +V++  R++   + V     AE+   L  +  +   G    + E+D++  I 
Subjt:  QLESLRNHE--NWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIK

Query:  LLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA
         +        E   L+ +I DLA   +  +     R++N +A
Subjt:  LLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]6.2e-3728.07Show/hide
Query:  VNELLLDNGSWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSSRWTSIWDLNIIPRAKIGLWK
        V EL+ +   W ED+I   F P D E I+ IPL     +D++IW+ D KG++SVKS Y +A  I  +     SCS    + W  IW L I  + KI LW+
Subjt:  VNELLLDNGSWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSSRWTSIWDLNIIPRAKIGLWK

Query:  IIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLILWNIWSFRNQA
           + +P+  N+  K++    +C+ C C  E+ SH    C RAR +    + NL + L  G    ++V         H+K E  ++  +LW IW  RN+ 
Subjt:  IIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLILWNIWSFRNQA

Query:  TISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEM
            K+ +  +++   +  +E  +K   P +      +    + WS  P  + K+N DA+ + + ++ G+G ++RDS G+   A  + +    SV + E 
Subjt:  TISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEM

Query:  AEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA
          ++  L     ++    +  I E+D++EVI L+N +   ++E   L+ DI++     +       PR  N  A
Subjt:  AEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA

XP_030934661.1 uncharacterized protein LOC115960098 [Quercus lobata]2.5e-3824.92Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--------------------------
        MSKAYD VEW ++ +++ KMGF    +  +M CV +VS+S++INGE     +P RGIRQGDPLSPY FL+C EG                          
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--------------------------

Query:  ------------------------------------------------------SRFPVIVHPTLKG-------RRVNELLLDNGSW-------------
                                                              +++  +  P   G       ++ N+ +L    W             
Subjt:  ------------------------------------------------------SRFPVIVHPTLKG-------RRVNELLLDNGSW-------------

Query:  ---------------------------------NEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQ
                                         N+++I + F+P +   I AIPLS G  +D   W ++  G +SVKS YHL  N+  E +     S P 
Subjt:  ---------------------------------NEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQ

Query:  SSR--WTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCWDIIRS
        S+R  W  +W+L +  R K  +W+   + +PSK N++ +++ I+  C  C    ES  H  W C     V N  F  L        + L+++      RS
Subjt:  SSR--WTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCWDIIRS

Query:  HHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRD
        HH     +   +I+  IW+ RN+  +    A    L Q    +L + ++ +   L   +  S      W   P N++K+N D +   D    G+G IIR+
Subjt:  HHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRD

Query:  SSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA
          G ++ A  + I    SV+++E+   +  L       F   + V+VE D+  +IK L+S     S    ++ DI+ +++ +  V      RQ NRVA
Subjt:  SSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA

TrEMBL top hitse value%identityAlignment
A0A2N9FQ02 Reverse transcriptase domain-containing protein3.4e-4128Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--------SRFPVIVHPTL--KGRRV
        MSKAYD VEW+++  ++ KMGF  R I+ IM+C+ +VS+S+++NGE     KP+RG+RQGDPLSPY FL+CVEG        S    +   TL  KG R+
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--------SRFPVIVHPTL--KGRRV

Query:  NELLLDNGSWNEDMIKSLFIPADVED-------ILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSSRWTSIWDLNIIPRA
              + S        LF  A   D       +L    ++G  +D   W     G ++V+S Y    N+ ++ + S S S+P    W  +W L +  + 
Subjt:  NELLLDNGSWNEDMIKSLFIPADVED-------ILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSSRWTSIWDLNIIPRA

Query:  KIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRA----------RGVRNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAE
        ++ +W+ +K+ +PSK N++ K +  +  CE+C    E   H  W C             +G R S F N  +L             W  +R+     + E
Subjt:  KIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRA----------RGVRNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAE

Query:  KIGLILWNIWSFRNQATISK--KRADE-----RQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDS
              W IW  RN+  + +  ++AD      R+ LQ      E Q  +TL     +     +   +W        K+N D +   +    G+G I+R++
Subjt:  KIGLILWNIWSFRNQATISK--KRADE-----RQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDS

Query:  SGSLVLARCRRITRKWSVKLLEMAEIKERLMTYL-----SSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKAL
         G  +    ++I    SV+  E   ++  +   L        F G S +I   DA+   K   ++   I EDK L
Subjt:  SGSLVLARCRRITRKWSVKLLEMAEIKERLMTYL-----SSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKAL

A0A2N9HLU1 Reverse transcriptase domain-containing protein1.9e-3927.46Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--------------------------
        MSKAYD VEW F+ +++ KMGF ++ ++ IM C+ +VS   +INGE      P RG+RQGDP+SPY FL+C EG                          
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--------------------------

Query:  --SRF--PVIVHPTLKGRRVNEL-------------------------------LLDNGSWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKG
          S+F    +V+ + K R+   L                               L    +WN D I++ F+P DV+ IL IPLS+    D++ W     G
Subjt:  --SRF--PVIVHPTLKGRRVNEL-------------------------------LLDNGSWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKG

Query:  FFSVKSAYHLATNIIQEK--ENSGSCSKPQ-SSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVR
         ++++S Y L   +++E+   N GS  + +    W  IW L    + K  +W+  +  +P+K  +  +++  +  C+ C    E   H  W+C     V 
Subjt:  FFSVKSAYHLATNIIQEK--ENSGSCSKPQ-SSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVR

Query:  NSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSL
         S  P + +     +      D    +  + +    EK  ++ W IW  RNQ+ +    AD  QL       LE+  + T    +E  ++S     +W+ 
Subjt:  NSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNHENWSL

Query:  RPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLN
           N  K+N D +   D   GG+G +IRD SG ++    +R+    SV L+E    K R +T+      G + V +E D   VI+ L+
Subjt:  RPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLN

A0A803NML1 Uncharacterized protein7.1e-4726.54Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG-SRF-------------------PVI
        MSKA+D VEW ++  ++ KMGF +  I+ IMRC+ + SFS  +NG+      P RG+RQGDPLSPY FL+C EG SRF                   P +
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG-SRF-------------------PVI

Query:  VH-----------------------------------PTLKGRR-----------------------------------VNELLLDNGSWNEDMIKSLFI
         H                                     LKG R                                   V+  + DN  WN  ++ S F 
Subjt:  VH-----------------------------------PTLKGRR-----------------------------------VNELLLDNGSWNEDMIKSLFI

Query:  PADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINH
          D++ IL+IPL+     D ++W+  P G +SVK+ +HLAT +  E +NS S S  QS  W   W+L + P+ +I  WK+ +N +P+   +  +++  + 
Subjt:  PADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINH

Query:  VCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLE
         C +C    ES  H  + CK A+ +   +  +   + FS   N+   D    + + + + + E +  +LW IW+ RN+     +      ++Q   +  E
Subjt:  VCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLE

Query:  DQRKRTLPMLAEIQLESL----RNH------------ENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKE
        D  K      A++Q+ S+    R+H            + W     N  KLN DA+ N + +  G+G I+RD  G+++ A  + +   +    +E   +  
Subjt:  DQRKRTLPMLAEIQLESL----RNH------------ENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKE

Query:  RLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA
         +     S F    +  +E DA  V   LN    D+S    L++DI  L +    V V    R +N+ A
Subjt:  RLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA

A0A803PDG2 Uncharacterized protein4.2e-3927.63Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEGSRFPVIVHPTLKGR------RVNELL
        MS A+D VE S+I  +++KMGF  R I+ IM+C+ S SFS  +NGE      P+RG+RQGDPLSPY FL+C EG    +    +L G         N+ L
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEGSRFPVIVHPTLKGR------RVNELL

Query:  LDNGSWN-EDMIKSL--------------FIPADVEDI------------------LAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSG
        L   +W   +M  SL              F+ +++                     L   + NG   D +IW+    G ++VKS +H  T++  E  +  
Subjt:  LDNGSWN-EDMIKSL--------------FIPADVEDI------------------LAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSG

Query:  SCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSF-FPNLMDLLFSGRANLEMVDCW
        S S      W   W+L +  + KI  W++I+N +P    +  K++  +  C +C    ES  H  ++C  A+ V   F FP    + FS   N+   D  
Subjt:  SCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSF-FPNLMDLLFSGRANLEMVDCW

Query:  DIIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKR----------TLPM-LAEIQLESLRNHENWSLRPLNFLKLNSDAS
          + +  S+ E E + +I+W IW+ + +    + R D   L       + + +K           T+P     +  E +     W LR +  LK N DA+
Subjt:  DIIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKR----------TLPM-LAEIQLESLRNHENWSLRPLNFLKLNSDAS

Query:  WNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEV
         N   +V GVG I+++  G +++A  + +   +    +E A+     + ++S       V +VEADA+ V   LN E  D+S    L+ D+  L +    
Subjt:  WNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEV

Query:  VSVVKCPRQSNRVA
        + +    R +N+ A
Subjt:  VSVVKCPRQSNRVA

A0A803PEK6 Uncharacterized protein1.9e-3927.51Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--------------------------
        MSKAYD +EW F+  +L K+GFG   +  I+ CV S S+++   G+      PTRGIRQG PLSPY F++C EG                          
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEG--------------------------

Query:  ------------------------------SRFPVIVHPTLKGRRVNELLLDN-GSWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSV
                                      + F    HP L   +V+ L+      W+ ++I  LF   D   IL IPL      D +IW+ D  G +SV
Subjt:  ------------------------------SRFPVIVHPTLKGRRVNELLLDN-GSWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSV

Query:  KSAYHLATNIIQEKENSGSCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNL
        KSAY    NI+Q+     S  +  S+ W S+W L I P+ K  +W+   N +P+  +++ KR++++ +C  C+   E+  H    C+      N      
Subjt:  KSAYHLATNIIQEKENSGSCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGVRNSFFPNL

Query:  MDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLI---LWNIWSFRNQATISKKRADERQLLQQIKRTLEDQR--KRTLPMLAEIQLESLRNHENWSLRP
           +  G    E     D  +S  S A++EK GLI    W IW  RN    + K      ++      L   R  + +L   +E         E+W    
Subjt:  MDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLI---LWNIWSFRNQATISKKRADERQLLQQIKRTLEDQR--KRTLPMLAEIQLESLRNHENWSLRP

Query:  LNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVK--LLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKAL
         N +K+N DA+  +     G G ++RDS G  VL   R I ++  V   L E   ++E L    + +       ++E D++ V++ + S    IS    +
Subjt:  LNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVK--LLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKAL

Query:  VLDIEDLAAKVEVVSVVKCPRQSNRVA
        + D ++L A +  VS+    R  N VA
Subjt:  VLDIEDLAAKVEVVSVVKCPRQSNRVA

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein6.5e-0527.18Show/hide
Query:  KAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEGSRFPVIVHPTLKGRRVNELLLDNGSWNE
        KA+D ++  F+ ++L + G     +N I         ++ +NGE  E      G RQG PLSPY F + +E     +     +KG ++ +  +      +
Subjt:  KAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEGSRFPVIVHPTLKGRRVNELLLDNGSWNE

Query:  DMI
        DMI
Subjt:  DMI

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM1.3e-0527.62Show/hide
Query:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEGSRFPVIVHPTLKGRRVNELLLDNGSW
        +SKA+D +  + I + L   G     ++ +    E    S+  +G S EEF P RG++QGDPLSP  F + ++     +   P+  G +V   + +  ++
Subjt:  MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEGSRFPVIVHPTLKGRRVNELLLDNGSW

Query:  NEDMI
         +D++
Subjt:  NEDMI

Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.1e-1325.91Show/hide
Query:  CEMCRCGKESASHIFWRCKRARGV--------------RNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLI-------LWNIWSFRNQAT
        C  C   +E+ +H+ ++C  AR V               +S + NL  +L     NLE+              E  K+G I       LW +W  RN+  
Subjt:  CEMCRCGKESASHIFWRCKRARGV--------------RNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLI-------LWNIWSFRNQAT

Query:  ISKKRADERQLLQQIKRTLED-QRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEM
           K  D  ++L++     E+   +R L   A            W   P  ++K N+DA+W  +    G+GWI+R+ SG ++    R + R  +V   E+
Subjt:  ISKKRADERQLLQQIKRTLED-QRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEM

Query:  AEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA
          ++  ++T    N++    +I E+DA  ++ LLNS+    +   AL  DI+ L    E V     PR  N+VA
Subjt:  AEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA

AT3G09510.1 Ribonuclease H-like superfamily protein1.5e-2025.77Show/hide
Query:  PVIVHPTLKGRRVNELLLDNGS---WNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSS--RWT
        P+    T K   +N L    GS   W++  I      +D   I  I L+  +  D+IIWN +  G ++V+S Y L T+      N  + + P  S    T
Subjt:  PVIVHPTLKGRRVNELLLDNGS---WNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSS--RWT

Query:  SIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASH---------IFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCWDII
         IW+L I+P+ K  LW+ +   + +   +  + M I+  C  C    ES +H         + WR   +  +RN    N  +   S   N       D  
Subjt:  SIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASH---------IFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCWDII

Query:  RSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLED------QRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWN-DDLEV
         S   K        ++W IW  RN    +K R    + +   K    D        K+T     +I      N   W   P  ++K N DA ++   LE 
Subjt:  RSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLED------QRKRTLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWN-DDLEV

Query:  GGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLA
         G GWIIR+  G+ +     ++    +   LE AE K  L     +  RG + V +E D   +I L+N     IS   +L   +ED++
Subjt:  GGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLA

AT4G03566.1 unknown protein1.9e-0429.03Show/hide
Query:  ILWNIWSFRNQATISKKRADERQLL-QQIKRTLEDQRKRTLPMLAEIQLE-SLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSG
        +LW IW  RN   ++  + D   L+    + +    + RT+P  +  +L  + R    W+  PL+ LK N   SW +D+ + G  WI+R+  G
Subjt:  ILWNIWSFRNQATISKKRADERQLL-QQIKRTLEDQRKRTLPMLAEIQLE-SLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSG

AT4G29090.1 Ribonuclease H-like superfamily protein9.6e-2824.74Show/hide
Query:  RVNELLLDNG-SWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQ-SSRWTSIWDLNIIPRAKIG
        +V++L+ ++G  W +D+I+ LF   + + I  +     R  D   W+    G ++VKS Y + T II ++ +    S+P  +  +  IW     P+ +  
Subjt:  RVNELLLDNG-SWNEDMIKSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQ-SSRWTSIWDLNIIPRAKIG

Query:  LWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGV--------------RNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEA
        LWK + N +P    +  + +     C  C   KE+ +H+ ++C  AR                 +S + NL  +   G  N +    W+          +
Subjt:  LWKIIKNFIPSKPNVQIKRMDINHVCEMCRCGKESASHIFWRCKRARGV--------------RNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEA

Query:  EKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNH-ENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLV
        + +  +LW +W  RN+     +  + +++L++ +  LE+ R RT       + +  R+    W   P  ++K N+DA+WN D E  G+GW++R+  G + 
Subjt:  EKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKRTLPMLAEIQLESLRNH-ENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLV

Query:  LARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA
            R + +  SV  LE AE++      LS +    + VI E+D+  +I++LN++    S  K  + D++ L ++   V  V  PR+ N +A
Subjt:  LARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVIKLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVA

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.5e-0456.25Show/hide
Query:  MINGESKEEFKPTRGIRQGDPLSPYQFLVCVE
        +ING  +    P+RG+RQGDPLSPY F++C E
Subjt:  MINGESKEEFKPTRGIRQGDPLSPYQFLVCVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAAAGCGTACGACTGTGTGGAATGGAGCTTCATTAATGAGATGCTATCCAAGATGGGCTTCGGTAACAGGTGCATAAACAACATTATGAGATGCGTGGAA
TCTGTATCCTTTTCAGTTATGATAAACGGCGAGTCGAAGGAGGAATTTAAGCCTACCCGGGGAATCAGACAAGGGGATCCTCTATCTCCTTATCAGTTCTTGGTG
TGTGTTGAAGGTAGCAGATTTCCGGTCATAGTCCACCCGACGTTGAAAGGTAGAAGAGTGAATGAGCTCCTTTTGGATAATGGCAGCTGGAATGAAGATATGATC
AAAAGCCTTTTCATTCCTGCGGATGTCGAAGACATTCTGGCTATTCCTTTGAGTAATGGTAGAGATAAGGATGAAATCATCTGGAATGTCGATCCTAAAGGCTTT
TTCAGTGTAAAAAGTGCTTATCACCTAGCAACCAACATCATCCAAGAGAAGGAGAATTCAGGATCTTGTTCTAAACCTCAAAGCTCTAGATGGACGTCAATCTGG
GACCTCAATATTATCCCAAGGGCAAAGATAGGGCTGTGGAAGATCATAAAAAATTTTATCCCTTCTAAACCTAACGTTCAAATTAAAAGAATGGATATTAATCAT
GTTTGTGAAATGTGCAGGTGTGGTAAGGAGAGTGCTAGTCATATCTTTTGGAGATGCAAGCGGGCAAGGGGCGTTCGGAACAGTTTTTTCCCAAATCTGATGGAC
CTTCTGTTTTCTGGCAGGGCTAATTTGGAGATGGTAGATTGTTGGGATATCATTCGCTCACATCACAGTAAAGCAGAAGCAGAGAAGATAGGGCTTATCCTATGG
AATATTTGGAGTTTTAGAAACCAGGCAACAATCTCCAAAAAGCGTGCAGATGAGCGACAGCTTCTTCAACAGATTAAAAGAACCTTGGAAGATCAAAGGAAGCGT
ACCTTGCCTATGCTTGCGGAGATTCAGCTGGAGAGCCTGCGAAATCATGAGAATTGGTCCCTCCGTCCTCTGAATTTTCTCAAGCTGAACTCGGATGCCTCTTGG
AACGATGACCTTGAGGTTGGGGGCGTCGGCTGGATCATCCGTGATTCATCAGGATCTCTGGTCCTCGCTAGGTGTAGAAGAATTACTCGTAAATGGAGTGTGAAA
TTATTGGAGATGGCGGAAATTAAAGAAAGGTTGATGACGTACCTTTCCTCTAATTTTCGTGGGCGTTCGGTAGTGATTGTTGAAGCAGATGCCATGGAGGTGATC
AAATTGTTGAATTCTGAGTGTTGTGACATTTCTGAAGACAAGGCGCTAGTGCTTGACATTGAGGATCTGGCAGCTAAAGTGGAAGTTGTTTCCGTCGTGAAATGC
CCAAGACAGAGTAATCGTGTGGCCATTTTTTGGCGCGAGCAGCGGCGGGCTTTCCTCCGACCATTCTCATGCATCTTCTTCATCCACATGAAGACAGCAAAGAGG
TGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAAAGCGTACGACTGTGTGGAATGGAGCTTCATTAATGAGATGCTATCCAAGATGGGCTTCGGTAACAGGTGCATAAACAACATTATGAGATGCGTGGAA
TCTGTATCCTTTTCAGTTATGATAAACGGCGAGTCGAAGGAGGAATTTAAGCCTACCCGGGGAATCAGACAAGGGGATCCTCTATCTCCTTATCAGTTCTTGGTG
TGTGTTGAAGGTAGCAGATTTCCGGTCATAGTCCACCCGACGTTGAAAGGTAGAAGAGTGAATGAGCTCCTTTTGGATAATGGCAGCTGGAATGAAGATATGATC
AAAAGCCTTTTCATTCCTGCGGATGTCGAAGACATTCTGGCTATTCCTTTGAGTAATGGTAGAGATAAGGATGAAATCATCTGGAATGTCGATCCTAAAGGCTTT
TTCAGTGTAAAAAGTGCTTATCACCTAGCAACCAACATCATCCAAGAGAAGGAGAATTCAGGATCTTGTTCTAAACCTCAAAGCTCTAGATGGACGTCAATCTGG
GACCTCAATATTATCCCAAGGGCAAAGATAGGGCTGTGGAAGATCATAAAAAATTTTATCCCTTCTAAACCTAACGTTCAAATTAAAAGAATGGATATTAATCAT
GTTTGTGAAATGTGCAGGTGTGGTAAGGAGAGTGCTAGTCATATCTTTTGGAGATGCAAGCGGGCAAGGGGCGTTCGGAACAGTTTTTTCCCAAATCTGATGGAC
CTTCTGTTTTCTGGCAGGGCTAATTTGGAGATGGTAGATTGTTGGGATATCATTCGCTCACATCACAGTAAAGCAGAAGCAGAGAAGATAGGGCTTATCCTATGG
AATATTTGGAGTTTTAGAAACCAGGCAACAATCTCCAAAAAGCGTGCAGATGAGCGACAGCTTCTTCAACAGATTAAAAGAACCTTGGAAGATCAAAGGAAGCGT
ACCTTGCCTATGCTTGCGGAGATTCAGCTGGAGAGCCTGCGAAATCATGAGAATTGGTCCCTCCGTCCTCTGAATTTTCTCAAGCTGAACTCGGATGCCTCTTGG
AACGATGACCTTGAGGTTGGGGGCGTCGGCTGGATCATCCGTGATTCATCAGGATCTCTGGTCCTCGCTAGGTGTAGAAGAATTACTCGTAAATGGAGTGTGAAA
TTATTGGAGATGGCGGAAATTAAAGAAAGGTTGATGACGTACCTTTCCTCTAATTTTCGTGGGCGTTCGGTAGTGATTGTTGAAGCAGATGCCATGGAGGTGATC
AAATTGTTGAATTCTGAGTGTTGTGACATTTCTGAAGACAAGGCGCTAGTGCTTGACATTGAGGATCTGGCAGCTAAAGTGGAAGTTGTTTCCGTCGTGAAATGC
CCAAGACAGAGTAATCGTGTGGCCATTTTTTGGCGCGAGCAGCGGCGGGCTTTCCTCCGACCATTCTCATGCATCTTCTTCATCCACATGAAGACAGCAAAGAGG
TGA
Protein sequenceShow/hide protein sequence
MSKAYDCVEWSFINEMLSKMGFGNRCINNIMRCVESVSFSVMINGESKEEFKPTRGIRQGDPLSPYQFLVCVEGSRFPVIVHPTLKGRRVNELLLDNGSWNEDMI
KSLFIPADVEDILAIPLSNGRDKDEIIWNVDPKGFFSVKSAYHLATNIIQEKENSGSCSKPQSSRWTSIWDLNIIPRAKIGLWKIIKNFIPSKPNVQIKRMDINH
VCEMCRCGKESASHIFWRCKRARGVRNSFFPNLMDLLFSGRANLEMVDCWDIIRSHHSKAEAEKIGLILWNIWSFRNQATISKKRADERQLLQQIKRTLEDQRKR
TLPMLAEIQLESLRNHENWSLRPLNFLKLNSDASWNDDLEVGGVGWIIRDSSGSLVLARCRRITRKWSVKLLEMAEIKERLMTYLSSNFRGRSVVIVEADAMEVI
KLLNSECCDISEDKALVLDIEDLAAKVEVVSVVKCPRQSNRVAIFWREQRRAFLRPFSCIFFIHMKTAKR