CuGenDBv2

Lcy12g001060 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy12g001060
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr12:1557732..1561001
RNA-Seq Expression: Lcy12g001060
Synteny: Lcy12g001060
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits (e value, % identity, alignment)
CAB4263564.1 unnamed protein product [Prunus armeniaca]; e value 8.6e-121; identity 33.15%
Query:  SQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGP
        S + VA   V  +V   VS+     LL  ++R E+E AL  IGP+KAPGPDG+ ALFYQ++W ++G + S LCL +LNG   V + N T + LIPK+  P
Subjt:  SQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGP

Query:  TSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTG--NVAMKLDMSKAYDR-------------
        T V ++RPISLCNV+YKI +K +ANRLK VL  V+S+ QSAF+P R+I DN +  F+ +H L  K  GKTG   + +KLDM+KAYDR             
Subjt:  TSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTG--NVAMKLDMSKAYDR-------------

Query:  -------------------------------------------------------------------------------------------------RDC
                                                                                                          + 
Subjt:  -------------------------------------------------------------------------------------------------RDC

Query:  LTIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAV
        L +K +  +YE ASGQ +N  KS    S +T   +Q  IR  L V       RYLGLP+   + K  +FR+V+DRVW  + GW+ KL S  GKEVLIK+V
Subjt:  LTIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAV

Query:  AQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRL-----------SRDQYFCTGSFL
         Q IP+Y+MS F+LPV LC EI S+ A FWW   D G+ IHW++W  +C+H   GG+GFR+L  FNQA+L KQ WRL            + +YF    FL
Subjt:  AQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRL-----------SRDQYFCTGSFL

Query:  RATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRSESVSSLIMEDGSWDVEKVRGEFLQDDEEHILAIPLSGQR
         A+ GS P + W+SLLWGR L + G+RWR+G+G  + I+GDPW+P      +  I     +  V  L    G WDV KV   F   + E IL+IPL G  
Subjt:  RATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRSESVSSLIMEDGSWDVEKVRGEFLQDDEEHILAIPLSGQR

Query:  EDDEIYWVPDGKGRFSVKSAY--ALGVSLVEDASGMIGILWITSHGLKTM--TWKMMGHTSFYFC------GIFGSSEI-----------------PRSF
         D  I W     GR+SVKS Y  AL    +E+ S   G+   +S  LK+    WK+                I  S E+                 P +F
Subjt:  EDDEIYWVPDGKGRFSVKSAY--ALGVSLVEDASGMIGILWITSHGLKTM--TWKMMGHTSFYFC------GIFGSSEI-----------------PRSF

Query:  ARESLAK-GGW-----------------------------DW-----------------------GLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAI
           +LA  G W                              W                       G G ++RD  G L+GA        + V   E+ A+
Subjt:  ARESLAK-GGW-----------------------------DW-----------------------GLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAI

Query:  RAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHIRRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGD
        + G+    +M P  L +ESDSL A++++N  +E L     +   V++L    A    +HI R  N  A  +AR ++ +    +W    P WL++A+  D
Subjt:  RAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHIRRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGD

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]; e value 8.6e-121; identity 36.85%
Query:  SAVGVIELRASEMVLDNGSQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGS
        S  G+ ++  S       S +   +  VL ++  TV+   N +L++EFTR E+E AL  + PTKAPGPDG++A+F+Q++W+++G++   + L +LN   S
Subjt:  SAVGVIELRASEMVLDNGSQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGS

Query:  VKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKA
        + E+N+T++ L+PKI+ PT + DFRPISLCNVVYK+ +K +ANRLK +L  ++S+ QSAF+ GRLI+DN ++ F+ +H L  K  GK G  A+KLDMSKA
Subjt:  VKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKA

Query:  YDR-------------------------------------------------------------------------------------------------
        YDR                                                                                                 
Subjt:  YDR-------------------------------------------------------------------------------------------------

Query:  -------------RDCLTIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWK
                     ++C T+  +L +YE ASGQ IN DKS    S NT    +  +   LG +Q +   +YLGLPS   +SK  IF  V++RV + L GWK
Subjt:  -------------RDCLTIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWK

Query:  EKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRL-------
        EKL S+GG+E+LIKAVAQ IPTYTMSCF++P +LC EI ++   FWWG      KI W SW +LC+    GGMGFR+L  FN AMLAKQ WRL       
Subjt:  EKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRL-------

Query:  ----SRDQYFCTGSFLRATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRS-ESVSSLI-MEDGSWDVEKVRGE
             + +Y+  G   +A LG++P Y WRS+  G  + ++G RWRVGNG  I IW D W+P     KV+           VS+LI  E   W  + VR  
Subjt:  ----SRDQYFCTGSFLRATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRS-ESVSSLI-MEDGSWDVEKVRGE

Query:  FLQDDEEHILAIPLSGQREDDEIYWVPDGKGRFSVKSAYALGVSLVEDASGMIGILWITSHGLKTMTWKMMGH
        FL  +   IL+IPLS    +D+I WV + KG FSVKSAY + V ++++    + +   +S   +++ W+ + H
Subjt:  FLQDDEEHILAIPLSGQREDDEIYWVPDGKGRFSVKSAYALGVSLVEDASGMIGILWITSHGLKTMTWKMMGH

XP_023897447.1 uncharacterized protein LOC112009345 [Quercus suber]; e value 5.9e-122; identity 32.49%
Query:  VLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPI
        +L +V P VS   N+ L   F   EV +ALK + P K+PGP G+  LF+Q  W  IG+  + + L  LN G +  + N TH+VLIPK + P ++ D+RPI
Subjt:  VLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPI

Query:  SLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDR------------------------
        SLCNV++KI +K IANRLK +L S++SD QSAFV GRLI+DN ++ F+ +H ++ K  G+ G +A+KLDMSKAYDR                        
Subjt:  SLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDR------------------------

Query:  --------------------------------------------------------------------------------------RDCLTIKSVLHIYE
                                                                                               DC T++ +  +YE
Subjt:  --------------------------------------------------------------------------------------RDCLTIKSVLHIYE

Query:  LASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSC
         ASGQ +N  K+    S NTS  +Q+ IR   G        +YLGLPS   R+K   F +++D++ K L GWK KL    GKEVLIKAVAQ IPTYTMSC
Subjt:  LASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSC

Query:  FKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRLS-----------RDQYFCTGSFLRATLGSNPLYA
        FK+P SLC E+ S+   FWWG     +K+ W SW++LC     GGMGF+ L  FN A+LAKQ WRL            + +YF +  FL AT G+NP Y 
Subjt:  FKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRLS-----------RDQYFCTGSFLRATLGSNPLYA

Query:  WRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRSES-VSSLIMEDG-SWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVP
        WRS+L  + + ++G RWRVG+G +I++WGD W+PR  + +V+  R  +  E+ VS  I ++   W  E +R  F   D E IL IPLS +  +D + W  
Subjt:  WRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRSES-VSSLIMEDG-SWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVP

Query:  DGKGRFSVKSAYALGVSLVEDASGMIGILWITSHGLKTMTWKMM-----GHTSF--------------YFCG---IFGSSE---------------IPRS
           G FSV+SAY + + L +  + +      +S+      WK +      H SF              ++C    I   +E               +  S
Subjt:  DGKGRFSVKSAYALGVSLVEDASGMIGILWITSHGLKTMTWKMM-----GHTSF--------------YFCG---IFGSSE---------------IPRS

Query:  FARESL------AKGGWD---------------------WGLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAI
         A ES+          W+                      G+G V+RD  G ++ A    +   LG   +E++A  AGL    +MG   +++E DSL  +
Subjt:  FARESL------AKGGWD---------------------WGLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAI

Query:  NLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHIRRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGD
          L G+  + + +  + + +Q   S    V   H++R +N  A +LA+ A+S   S +W    PC + + +  D
Subjt:  NLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHIRRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGD

XP_023899813.1 uncharacterized protein LOC112011695 [Quercus suber]; e value 9.5e-120; identity 31.65%
Query:  CWLRMRHIDDKGRGRNGWCGGTGIQSGFMPEHLSA---VGVIELRASEMVLDNGSQIRVA-----VDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKS
        CWL+      + R R  W       + F     SA      IE    E+V+D  S++  +        +L S+ P VS   N  L R FT  EV  ALK 
Subjt:  CWLRMRHIDDKGRGRNGWCGGTGIQSGFMPEHLSA---VGVIELRASEMVLDNGSQIRVA-----VDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKS

Query:  IGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSA
        + P KAPGPDG+  +F+Q+ W   G+  +   L  LN G S    N+TH+VLIPKI+ P +V DFRPISLCNV YKI +KAI NRLK  L S++S+ QSA
Subjt:  IGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSA

Query:  FVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDM----------------------------SKAYDRRD-----------CLTIKSVLHIYELAS
        FV GRLI+DN ++ F+ +H ++ K  G  G +A+KLDM                            S  +   D           C  ++ VL  YE AS
Subjt:  FVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDM----------------------------SKAYDRRD-----------CLTIKSVLHIYELAS

Query:  GQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKL
        GQ +N  K+    S NT   +Q+ I+   G        +YLGLPS   R+K   F  ++++V + L GWKEKL S  GKEVLIKAVAQ IPTYTMSCFK+
Subjt:  GQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKL

Query:  PVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRLS-----------RDQYFCTGSFLRATLGSNPLYAWRS
        P SLC+E+ ++  NFWWG      KI W  W ++C     GGMGF++L  FN A+LAKQ WRL            + +YF    F+ A++G+NP Y WRS
Subjt:  PVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRLS-----------RDQYFCTGSFLRATLGSNPLYAWRS

Query:  LLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRSES-VSSLIMED-GSWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVPDGK
        L+  + L K+G+RWRVGNG SIR+W D W+P     KV   R  +++++ V  LI ED   W    V   FL    + I +IP+S +   D++ W     
Subjt:  LLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRSES-VSSLIMED-GSWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVPDGK

Query:  GRFSVKSAYALGVSLVEDASG-------MIGILW------ITSHGLKTMTWKM-------------------------------MGHTSF--------YF
        G+F+V+SAY L ++     SG        +   W         H ++   W++                               +GH  +        + 
Subjt:  GRFSVKSAYALGVSLVEDASG-------MIGILW------ITSHGLKTMTWKM-------------------------------MGHTSF--------YF

Query:  C-------------------------GIFGSSEIPRSFA--------RESLAKGG----------W--------------------------DW------
        C                         G  G  ++  +          R  +  GG          W                           W      
Subjt:  C-------------------------GIFGSSEIPRSFA--------RESLAKGG----------W--------------------------DW------

Query:  -----------------GLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKL
                         G+G V+RD  G L  A C  +T  +G    E +A  AGLL   ++G   +++E  SLA  N L  +    + +  V   +  +
Subjt:  -----------------GLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKL

Query:  SSCLAGVQFKHIRRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGD
        S     V + HIRR  N+ A +LA+ A        W    PC++++A+  D
Subjt:  SSCLAGVQFKHIRRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGD

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]; e value 9.5e-120; identity 37.16%
Query:  HIDDKGRGRNGWCGGTGIQSGFMPEHLSAVGVIELRASEMVLDNGSQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNA
        H     R R  +  G     G   E+L  VG +     + +   G+  +  ++  L +V   V+    + L  +FT  EV+AAL  +GPTKAPGPDG+NA
Subjt:  HIDDKGRGRNGWCGGTGIQSGFMPEHLSAVGVIELRASEMVLDNGSQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNA

Query:  LFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIG
        LFYQ+ W ++GD   +  L  LN G  + E+N T++VLIPK++ P  + +FRPISLCNV+YKI +K +ANRLK VL  ++S  QSAFVPGRLI+DN ++ 
Subjt:  LFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIG

Query:  FQCLHALASKTRGKTGNVAMKLDMSKAYDR-----------------------RDCL-------------------------------------------
        ++ LH + ++ +GK G+VA+KLD+SKAYDR                         C+                                           
Subjt:  FQCLHALASKTRGKTGNVAMKLDMSKAYDR-----------------------RDCL-------------------------------------------

Query:  --------------------------------------------TIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGL
                                                    TI  +L IYE ASGQ IN +KS    S NTS G +  I   LGV +     +YLGL
Subjt:  --------------------------------------------TIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGL

Query:  PSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGM
        P+   R+K   F  ++DRVWK LQGWK  L S  GKE+LIKAVAQ IPTYTMS F++P+ LC+E+ +LCA FWWG     +KIHW+SW++L      GGM
Subjt:  PSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGM

Query:  GFRDLHVFNQAMLAKQCWRL-----------SRDQYFCTGSFLRATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRD
        GFRDL  FN AMLAKQ WRL            + +YF   SFL A    N  + WRSL+  + + + G  WRVGNG+SI    D W+P    +KVL    
Subjt:  GFRDLHVFNQAMLAKQCWRL-----------SRDQYFCTGSFLRATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRD

Query:  EIRSES-VSSLI-MEDGSWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVPDGKGRFSVKSAYALGVSLVEDASGMIGILWITSHGLKTMTWKM
           SE  V+ LI  E   W+ E++R  F +D+ E I  IPLS +   D I+W+   +G FSVKSAY +   ++ DA+     + + +  + +  WK+
Subjt:  EIRSES-VSSLI-MEDGSWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVPDGKGRFSVKSAYALGVSLVEDASGMIGILWITSHGLKTMTWKM

TrEMBL top hits (e value, % identity, alignment)
A0A2N9GPZ7 Reverse transcriptase domain-containing protein; e value 1.7e-122; identity 36.38%
Query:  HIDDKGRGRNGWCGGTGIQSGFMPEHLSAVGVIELRASEMVLDNGSQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNA
        H     R R     G   + G      + +  I +   + +  + +    ++ TVL+ +   V+ A N  L  EFT+ EV  ALK + PTKAPGPDG++A
Subjt:  HIDDKGRGRNGWCGGTGIQSGFMPEHLSAVGVIELRASEMVLDNGSQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNA

Query:  LFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIG
        +FYQ +WD++G E +   L IL+ G  ++++N TH+ LIPK++ P ++ DFRPISLCNV+YKI +K +ANRLK VL  V+S+ QSAFVPGRLI+DN ++ 
Subjt:  LFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIG

Query:  FQCLHALASKTRGKTGNVAMKLDMSKAYDR----------------------------------------------------------------------
        F+ +H+++ K +GK G +A+KLDMSKAYDR                                                                      
Subjt:  FQCLHALASKTRGKTGNVAMKLDMSKAYDR----------------------------------------------------------------------

Query:  ----------------------------------------RDCLTIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGL
                                                 +C  + ++L  YE ASGQ +N  K+    +++TS GM++ I+    V +  S  +YLGL
Subjt:  ----------------------------------------RDCLTIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGL

Query:  PSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGM
        PS   RSK   F  ++ RVW+ + GWKEK  S  G+EVLIKAVAQ+IPTY+MSCFKLP SLCN++N++ +NFWWG  D  +K HW  W++LC     GG+
Subjt:  PSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGM

Query:  GFRDLHVFNQAMLAKQCWRLSRDQ-----------YFCTGSFLRATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRD
        GFRDL  FN A+LAKQ WR  + Q           YF  G F+ A LG+ P YAWRS+   R + + G++W +G+G S++I  DPW+P + + K +  + 
Subjt:  GFRDLHVFNQAMLAKQCWRLSRDQ-----------YFCTGSFLRATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRD

Query:  EI-RSESVSSLIMEDG-SWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVPDGKGRFSVKSAYAL
         +   E+VS LI ED  +W+V+ +   F + + + I AIPL  +++ D ++W     G F+VKSAY L
Subjt:  EI-RSESVSSLIMEDG-SWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVPDGKGRFSVKSAYAL

A0A2N9GQ35 Reverse transcriptase domain-containing protein; e value 1.4e-121; identity 37.96%
Query:  VDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDF
        +++ L  +   V+A  N  LL +FT  EV AAL+ + PTKAPGPDG++A+FYQ +W+++G E +   L I++ G  + ++N TH+ L+PKI  P  + DF
Subjt:  VDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDF

Query:  RPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDRRD-------------------
        RPI+LCNV+YKI +K +ANRLK +L  +VS+ QSAFVPGRLI+DN ++ F+ +H+++ K  G+ G +A+KLDMSKAYDR +                   
Subjt:  RPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDRRD-------------------

Query:  -------------------------------------------------------------------------------------------CLTIKSVLH
                                                                                                   C T+ ++L 
Subjt:  -------------------------------------------------------------------------------------------CLTIKSVLH

Query:  IYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYT
         YE ASGQ +N  K+    ++NT+  M+Q I++   V +  S  +YLGLPS   RSK + F +V+ RVW+ + GWKEK  S  G+E+L+KAVAQ+IPTYT
Subjt:  IYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYT

Query:  MSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRLSRDQ-----------YFCTGSFLRATLGSNP
        MSCFKLP SLCN++NS+ +NFWWG  D  +K HW  WN++C+  V GG+GFRD+ +FN+A+LAKQ WR  + Q           YF   SFL A +   P
Subjt:  MSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRLSRDQ-----------YFCTGSFLRATLGSNP

Query:  LYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIP-RTDNDKVLGIRDEIRSESVSSLIMEDG-SWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIY
         YAWRSL+  R +   G+RW +GNG ++RI  DPW+P  T +   L +++   +E VS+LI  +G  W VE VR  F + +   I +IPL  + ++D ++
Subjt:  LYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIP-RTDNDKVLGIRDEIRSESVSSLIMEDG-SWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIY

Query:  WVPDGKGRFSVKSAYALGV
        W     G F+V+SAY + V
Subjt:  WVPDGKGRFSVKSAYALGV

A0A2N9HTH6 Reverse transcriptase domain-containing protein; e value 2.3e-124; identity 39.06%
Query:  VDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDF
        ++  L  +   V+ A N  L+ +FT  EV  ALK + PTKAPGPDG++A+FYQ +W+++G E +   L IL+ G  ++++N TH+VLIPKI+ P  + D+
Subjt:  VDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDF

Query:  RPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDR---------------------
        RPI+LCNV+YKI +K +ANRLK VL  V+S+ QSAFVPGRLI+DN ++ F+ LH+++ K RGK G +A+KLDMSKAYDR                     
Subjt:  RPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDR---------------------

Query:  -----------------------------------------------------------------------------------------RDCLTIKSVLH
                                                                                                  +C  +  +L 
Subjt:  -----------------------------------------------------------------------------------------RDCLTIKSVLH

Query:  IYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYT
        IYE ASGQ +N  K+    ++NTS  M+Q+I++   V++  S  +YLGLPS   RSK   F  ++DRVW+ + GWKEKL S  G+E+LIKAVAQ+IPTY 
Subjt:  IYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYT

Query:  MSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRLSRD-----------QYFCTGSFLRATLGSNP
        MSCFKLP SLCNE+NS+ +NFWWG    G+ +HW  W +LC     GG+GFRDL  FN A+LAKQ WR+ +            +YF T +F+ ATLG+ P
Subjt:  MSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRLSRD-----------QYFCTGSFLRATLGSNP

Query:  LYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKV-LGIRDEIRSESVSSLIMEDGS-WDVEKVRGEFLQDDEEHILAIPLSGQREDDEIY
         YAWRS+   R +   G+RW +G+G S+ IW DPW+P   +  V    +     E VS LI+ D   W+VE ++  F + +   I++IPL  ++  D ++
Subjt:  LYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKV-LGIRDEIRSESVSSLIMEDGS-WDVEKVRGEFLQDDEEHILAIPLSGQREDDEIY

Query:  WVPDGKGRFSVKSAYAL
        W     G FSVKSAY L
Subjt:  WVPDGKGRFSVKSAYAL

A0A2N9IPS8 Reverse transcriptase domain-containing protein; e value 1.7e-122; identity 36.38%
Query:  HIDDKGRGRNGWCGGTGIQSGFMPEHLSAVGVIELRASEMVLDNGSQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNA
        H     R R     G   + G      + +  I +   + +  + +    ++ TVL+ +   V+ A N  L  EFT+ EV  ALK + PTKAPGPDG++A
Subjt:  HIDDKGRGRNGWCGGTGIQSGFMPEHLSAVGVIELRASEMVLDNGSQIRVAVDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNA

Query:  LFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIG
        +FYQ +WD++G E +   L IL+ G  ++++N TH+ LIPK++ P ++ DFRPISLCNV+YKI +K +ANRLK VL  V+S+ QSAFVPGRLI+DN ++ 
Subjt:  LFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIG

Query:  FQCLHALASKTRGKTGNVAMKLDMSKAYDR----------------------------------------------------------------------
        F+ +H+++ K +GK G +A+KLDMSKAYDR                                                                      
Subjt:  FQCLHALASKTRGKTGNVAMKLDMSKAYDR----------------------------------------------------------------------

Query:  ----------------------------------------RDCLTIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGL
                                                 +C  + ++L  YE ASGQ +N  K+    +++TS GM++ I+    V +  S  +YLGL
Subjt:  ----------------------------------------RDCLTIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGL

Query:  PSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGM
        PS   RSK   F  ++ RVW+ + GWKEK  S  G+EVLIKAVAQ+IPTY+MSCFKLP SLCN++N++ +NFWWG  D  +K HW  W++LC     GG+
Subjt:  PSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGM

Query:  GFRDLHVFNQAMLAKQCWRLSRDQ-----------YFCTGSFLRATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRD
        GFRDL  FN A+LAKQ WR  + Q           YF  G F+ A LG+ P YAWRS+   R + + G++W +G+G S++I  DPW+P + + K +  + 
Subjt:  GFRDLHVFNQAMLAKQCWRLSRDQ-----------YFCTGSFLRATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRD

Query:  EI-RSESVSSLIMEDG-SWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVPDGKGRFSVKSAYAL
         +   E+VS LI ED  +W+V+ +   F + + + I AIPL  +++ D ++W     G F+VKSAY L
Subjt:  EI-RSESVSSLIMEDG-SWDVEKVRGEFLQDDEEHILAIPLSGQREDDEIYWVPDGKGRFSVKSAYAL

A0A7N2L6Z9 Reverse transcriptase domain-containing protein; e value 1.5e-123; identity 32.6%
Query:  VDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDF
        VD V  ++   ++   N  L R FTR E+  ALK I PTK+PGPDG++A+F+Q++WD++G   S + L +LN G S+  +N+T++VLIPK   P  + DF
Subjt:  VDTVLRSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDF

Query:  RPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYD----------------------
        RPISLCNV+YK+ +K +ANRLK  L  ++++ QSAF   RLI+DN +I ++ +H L  K  GK   +A KLDMSKA+D                      
Subjt:  RPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYD----------------------

Query:  ----------------------------------------------------------------------------------------RRDCLTIKSVLH
                                                                                                R +C  +K +L 
Subjt:  ----------------------------------------------------------------------------------------RRDCLTIKSVLH

Query:  IYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYT
         YE ASGQ +N DKS    S NT+P +++ I + LG +Q S   +YLGLPS   RSK ++F  +++RV   L GWK KL S GGKE+LIKAVAQ IPTYT
Subjt:  IYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYT

Query:  MSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCW-----------RLSRDQYFCTGSFLRATLGSNP
        MSCF LP SLC+E+  +  NFWWG  +   K+ W SW ++C+   LGG+GFR+LH FN A+LAKQ W           R+ + +YF  G  L A+LGSNP
Subjt:  MSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCW-----------RLSRDQYFCTGSFLRATLGSNP

Query:  LYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVL-GIRDEIRSESVSSLIMEDGS-WDVEKVRGEFLQDDEEHILAIPLSGQREDDEIY
         Y WRS+     + K+G RWRVGNG  I IW D W+P     KV+   R       VSSLI  D   W ++ +R  FL  D E IL IPLS    DD I 
Subjt:  LYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVL-GIRDEIRSESVSSLIMEDGS-WDVEKVRGEFLQDDEEHILAIPLSGQREDDEIY

Query:  WVPDGKGRFSVKSAYALGVSLVEDA------SG-----MIGILW--------------ITSHGLKTM-TWKMMGHTSFYFC-------------------
        W+ + KG FSVKSAY + V+L+E A      SG     +   LW                 +GL TM    M G  +  FC                   
Subjt:  WVPDGKGRFSVKSAYALGVSLVEDA------SG-----MIGILW--------------ITSHGLKTM-TWKMMGHTSFYFC-------------------

Query:  -------------GIFGSSEIPRSFARESLAKG-------------------------------------------------------------GW----
                     G+   S   ++ A   LA                                                               GW    
Subjt:  -------------GIFGSSEIPRSFARESLAKG-------------------------------------------------------------GW----

Query:  ------------------DWGLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVD---ENLTLVRFVA
                            G+G V+RD  G +I A C+ +      +  E+ AI  GLL   EM  P++M+ESD+L+AI  +N  +   E   LV  + 
Subjt:  ------------------DWGLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVD---ENLTLVRFVA

Query:  MEVQKLSSCLAGVQFKHIRRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGD
              S C     F +++R  N+VA  LA+ A SN  S +W    P ++   I  D
Subjt:  MEVQKLSSCLAGVQFKHIRRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGD

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein; e value 3.3e-14; identity 31.32%
Query:  VDTVLRSVT-PTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKI-RGPTSVQ
        +DT L + T P ++  + ++L R  T +E+ A + S+   K+PGPDG  A FYQR+ + +      L   I   G       +  ++LIPK  R  T  +
Subjt:  VDTVLRSVT-PTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKI-RGPTSVQ

Query:  DFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGN-VAMKLDMSKAYDR
        +FRPISL N+  KI  K +ANR++  +  ++  +Q  F+PG     N     + ++ +    R K  N V + +D  KA+D+
Subjt:  DFRPISLCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGN-VAMKLDMSKAYDR

P0C2F6 Putative ribonuclease H protein At1g65750; e value 7.0e-25; identity 28.2%
Query:  FRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQA
        F  + +RV   + GW+EK  S  G+  L KAV  ++P ++MS   LP S+ N ++ L   F WGST   +K H   W+++C     GG+G R     N+A
Subjt:  FRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQA

Query:  MLAKQCWRLSRDQYFCTGSFLRATLGSNPL-------------YAWRSLLWG-RVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRSESVS
        +++K  WRL +++       L+       +               WRS+  G R +   GV W  G+G  IR W D W+       +L + +  R     
Subjt:  MLAKQCWRLSRDQYFCTGSFLRATLGSNPL-------------YAWRSLLWG-RVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRSESVS

Query:  SLIMED-----GSWDVEKVRGEFLQDDEEHILAIPL---SGQREDDEIYWVPDGKGRFSVKSAYAL
        +++ +D       WD  K+      +    + A+ L   +G R  D + W     G+FSV+SAY +
Subjt:  SLIMED-----GSWDVEKVRGEFLQDDEEHILAIPL---SGQREDDEIYWVPDGKGRFSVKSAYAL

P11369 LINE-1 retrotransposable element ORF2 protein; e value 1.2e-13; identity 26.47%
Query:  RSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPK-IRGPTSVQDFRPIS
        R   P ++  Q   L    +  E+EA + S+   K+PGPDG +A FYQ   + +      L   I   G       +  + LIPK  + PT +++FRPIS
Subjt:  RSVTPTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPK-IRGPTSVQDFRPIS

Query:  LCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDRRDCLTIKSVLHIYELASGQVINYDK
        L N+  KI  K +ANR++  + +++  +Q  F+PG     N       +H + +K + K  ++ + LD  KA+D+     +  VL    +  G  +N  K
Subjt:  LCNVVYKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDRRDCLTIKSVLHIYELASGQVINYDK

Query:  SMF---MVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKE-KLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSL
        +++   + +   +    + I    G  Q   L  YL             F  V + + + ++  KE K   IG +EV I  +A  +  Y          L
Subjt:  SMF---MVSRNTSPGMQQFIRHTLGVVQTSSLGRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKE-KLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSL

Query:  CNEINS
         N INS
Subjt:  CNEINS

P14381 Transposon TX1 uncharacterized 149 kDa protein; e value 7.5e-19; identity 32.83%
Query:  PTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVV
        P VS  + + L    T  E+  AL+ +   K+PG DG+   F+Q  WD +G +   +       G       +  L L+PK      ++++RP+SL +  
Subjt:  PTVSAAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVV

Query:  YKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDRRDCLTIKSVLHIYELASGQVINYDKSMF
        YKI AKAI+ RLK VL  V+  +QS  VPGR I DN  +    LH      R       + LD  KA+DR D   +   L  Y     Q + Y K+M+
Subjt:  YKICAKAIANRLKGVLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDRRDCLTIKSVLHIYELASGQVINYDKSMF

P93295 Uncharacterized mitochondrial protein AtMg00310; e value 7.5e-27; identity 40.85%
Query:  IPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTV-LGGMGFRDLHVFNQAMLAKQCW-----------RLSRDQYFCTGSFLRA
        +P Y MSCF+L   LC ++ S    FWW S +  +KI W +W +LC+     GG+GFRDL  FNQA+LAKQ +           RL R +YF   S +  
Subjt:  IPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTV-LGGMGFRDLHVFNQAMLAKQCW-----------RLSRDQYFCTGSFLRA

Query:  TLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWI
        ++G+ P YAWRS++ GR L  +G+   +G+G   ++W D WI
Subjt:  TLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWI

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein; e value 5.3e-12; identity 40%
Query:  EVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKI
        E+ AA+ ++   KAPGPD   A F+   W ++ D T A        G  +K  N T + LIPK+ G   +  FRP+S C VVYKI
Subjt:  EVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKI

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e value 5.0e-10; identity 31.25%
Query:  GLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHIRRGQN
        G+GW+LR+  G ++  G  ++     V   E++A+R  +L +      R++ ESD+ A +NLLN  D+    ++    ++Q+L      V+F+   RG N
Subjt:  GLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHIRRGQN

Query:  LVADMLARRAMS-NGISGIWFSGFPCWL
         VAD +AR ++S +      FS  P WL
Subjt:  LVADMLARRAMS-NGISGIWFSGFPCWL

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e value 1.2e-08; identity 25.53%
Query:  GGWDWGLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHI
        G  D GL W++R+  G  +  GC    G   +   E  A+   +    ++G  R+  E D++    L+   + N  L R+    +Q+ S     V+F   
Subjt:  GGWDWGLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHI

Query:  RRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGDKQ
         R QN+  D+LA++A++N I+   +   P +L+  +  D +
Subjt:  RRGQNLVADMLARRAMSNGISGIWFSGFPCWLLEAIEGDKQ

AT4G29090.1 Ribonuclease H-like superfamily protein | 2.1e-32 | 32.35
Query:  IPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRL-----------SRDQYFCTGSFLRAT
        +PTYTM+CF LP ++C +I S+ A+FWW +    + +HW++W+ L  +   GG+GF+D+  FN A+L KQ WR+            + +YF     L A 
Subjt:  IPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRDLHVFNQAMLAKQCWRL-----------SRDQYFCTGSFLRAT

Query:  LGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIR----DEIRSES----VSSLIMEDG-SWDVEKVRGEFLQDDEEHILAI
        LGS P + W+S+   + + +QG R  VGNG  I IW   W+        L ++     E  S S    VS LI E G  W  + +   F + + + I  +
Subjt:  LGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIR----DEIRSES----VSSLIMEDG-SWDVEKVRGEFLQDDEEHILAI

Query:  PLSGQREDDEIYWVPDGKGRFSVKSAYALGVSLVEDAS
           G+R  D   W     G ++VKS Y +   ++   S
Subjt:  PLSGQREDDEIYWVPDGKGRFSVKSAYALGVSLVEDAS

AT4G29090.1 Ribonuclease H-like superfamily protein | 5.2e-07 | 31.25
Query:  GLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHIRRGQN
        G+GWVLR+  G +   G  ++  L  V   E++A+R  +L++       ++ ESDS   I +LN  DE    ++    ++Q+L S    V+F  I R  N
Subjt:  GLGWVLRDHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHIRRGQN

Query:  LVADMLARRAMS
         +A+ +AR ++S
Subjt:  LVADMLARRAMS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 5.3e-28 | 40.85
Query:  IPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTV-LGGMGFRDLHVFNQAMLAKQCW-----------RLSRDQYFCTGSFLRA
        +P Y MSCF+L   LC ++ S    FWW S +  +KI W +W +LC+     GG+GFRDL  FNQA+LAKQ +           RL R +YF   S +  
Subjt:  IPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTV-LGGMGFRDLHVFNQAMLAKQCW-----------RLSRDQYFCTGSFLRA

Query:  TLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWI
        ++G+ P YAWRS++ GR L  +G+   +G+G   ++W D WI
Subjt:  TLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWI


Sequences
CDS sequence
ATGGGGGCTCTCTCCAGAGGGCCATCAAGTGTCAAGAGGAGGAGATTCGTAGGTTGGAGTCTGAAAAACCAGAAAATTGGGAGGCGAATTGGGTGGAGGCTGAAAAGGAG
TTGGAGTCTTTGCTGGTTGAGAATGAGGCATATTGACGACAAAGGTCGAGGGAGGAATGGTTGGTGTGGGGGGACAGGAATTCAAAGTGGTTTCATGCCCGAGCATCTCA
GCGCCGTCGGCGTAATAGAATTGAGGGCCTCAGAGATGGTGCTGGACAATGGCAGTCAGATCCGAGTGGCAGTTGATACGGTGCTTCGAAGTGTCACTCCTACAGTTAGT
GCTGCACAAAACCAGGCTTTACTCAGGGAGTTCACTCGGGCTGAGGTAGAGGCAGCCTTGAAAAGCATAGGCCCCACGAAGGCTCCAGGCCCCGATGGGGTTAACGCTTT
ATTTTACCAACGACATTGGGACTTGATCGGTGACGAGACTTCTGCGCTTTGCCTAGGTATTCTGAATGGTGGGGGCTCAGTAAAGGAGTTGAATCAAACACACTTGGTGC
TGATCCCGAAAATAAGGGGTCCTACGTCTGTTCAGGACTTTCGTCCAATCAGCCTGTGCAACGTGGTCTATAAAATTTGTGCCAAGGCAATCGCGAATCGCCTAAAAGGG
GTCCTACATTCAGTAGTATCTGATGAGCAATCGGCTTTCGTGCCTGGGCGATTGATTTCAGATAATGCGGTCATCGGTTTTCAGTGCTTGCATGCGCTGGCTAGTAAAAC
GAGAGGAAAAACTGGTAATGTGGCGATGAAATTGGATATGAGCAAGGCCTATGACCGGCGTGACTGTTTGACCATTAAATCTGTGTTACATATTTACGAGTTGGCTTCCG
GACAGGTTATTAATTATGATAAATCTATGTTCATGGTTAGCAGGAACACTAGCCCGGGCATGCAACAATTTATTAGGCACACATTGGGGGTGGTCCAGACGAGTTCGCTT
GGTCGTTATTTGGGGCTCCCTTCGCAAAATGCTCGGTCAAAGTGTGTAATCTTTCGATCAGTTAGGGATCGGGTTTGGAAGGTTCTACAGGGGTGGAAGGAAAAGCTTTT
TTCGATAGGTGGGAAGGAGGTCCTCATAAAGGCTGTTGCCCAGACAATTCCCACCTATACGATGAGCTGTTTTAAGCTGCCAGTGTCCCTATGTAATGAGATAAATAGTT
TGTGTGCCAATTTTTGGTGGGGGTCAACAGATGCTGGACAAAAGATCCATTGGCGGAGTTGGAACCGGTTGTGTCGCCATACGGTGTTGGGTGGTATGGGGTTCCGTGAT
CTACATGTTTTTAATCAGGCTATGCTGGCTAAGCAGTGTTGGCGGCTGTCCCGGGACCAGTATTTTTGTACGGGTTCTTTCCTTCGGGCAACATTGGGGTCGAACCCGTT
GTATGCATGGAGGAGCCTACTGTGGGGTCGGGTACTATTCAAACAAGGTGTAAGGTGGAGAGTTGGTAACGGCCATAGTATCCGGATATGGGGGGACCCGTGGATTCCGA
GGACCGATAACGACAAGGTTTTGGGGATTCGAGATGAGATTCGTTCAGAGTCGGTGTCCTCGTTGATTATGGAGGATGGGTCATGGGATGTAGAGAAGGTGAGGGGGGAA
TTTCTGCAGGACGATGAGGAACATATTTTGGCAATACCATTAAGTGGTCAGAGAGAGGATGATGAGATATATTGGGTGCCAGATGGTAAGGGGCGTTTCTCTGTGAAAAG
TGCGTATGCGCTTGGGGTGAGTTTGGTTGAAGATGCGTCTGGTATGATTGGGATCCTCTGGATCACTTCGCATGGGTTAAAGACCATGACGTGGAAGATGATGGGGCATA
CTTCGTTCTACTTCTGTGGCATATTTGGGAGTTCAGAAATCCCAAGATCTTTCGCAAGGGAGAGCTTGGCTAAAGGAGGCTGGGATTGGGGTTTGGGTTGGGTGTTGCGG
GACCATGTTGGAAATTTGATAGGGGCGGGGTGTGAGTCTGTTACAGGGCTCCTCGGGGTGGACACGTTGGAAATGCAGGCAATTCGAGCTGGATTATTGGCTGTTGGTGA
GATGGGGCCTCCTCGATTGATGGTGGAGTCAGATAGTTTGGCTGCTATAAATCTCTTGAATGGTGTGGATGAGAACCTCACCTTAGTTCGCTTTGTGGCTATGGAGGTGC
AGAAGCTTTCTTCTTGTTTGGCGGGAGTCCAGTTTAAGCATATTCGGCGAGGCCAAAATTTGGTGGCTGATATGTTGGCGCGACGAGCAATGAGTAATGGGATTTCTGGC
ATTTGGTTTTCGGGGTTCCCATGCTGGCTGCTTGAAGCTATAGAGGGTGACAAGCAAAAGTTGGTGTAA
mRNA sequence
ATGGGGGCTCTCTCCAGAGGGCCATCAAGTGTCAAGAGGAGGAGATTCGTAGGTTGGAGTCTGAAAAACCAGAAAATTGGGAGGCGAATTGGGTGGAGGCTGAAAAGGAG
TTGGAGTCTTTGCTGGTTGAGAATGAGGCATATTGACGACAAAGGTCGAGGGAGGAATGGTTGGTGTGGGGGGACAGGAATTCAAAGTGGTTTCATGCCCGAGCATCTCA
GCGCCGTCGGCGTAATAGAATTGAGGGCCTCAGAGATGGTGCTGGACAATGGCAGTCAGATCCGAGTGGCAGTTGATACGGTGCTTCGAAGTGTCACTCCTACAGTTAGT
GCTGCACAAAACCAGGCTTTACTCAGGGAGTTCACTCGGGCTGAGGTAGAGGCAGCCTTGAAAAGCATAGGCCCCACGAAGGCTCCAGGCCCCGATGGGGTTAACGCTTT
ATTTTACCAACGACATTGGGACTTGATCGGTGACGAGACTTCTGCGCTTTGCCTAGGTATTCTGAATGGTGGGGGCTCAGTAAAGGAGTTGAATCAAACACACTTGGTGC
TGATCCCGAAAATAAGGGGTCCTACGTCTGTTCAGGACTTTCGTCCAATCAGCCTGTGCAACGTGGTCTATAAAATTTGTGCCAAGGCAATCGCGAATCGCCTAAAAGGG
GTCCTACATTCAGTAGTATCTGATGAGCAATCGGCTTTCGTGCCTGGGCGATTGATTTCAGATAATGCGGTCATCGGTTTTCAGTGCTTGCATGCGCTGGCTAGTAAAAC
GAGAGGAAAAACTGGTAATGTGGCGATGAAATTGGATATGAGCAAGGCCTATGACCGGCGTGACTGTTTGACCATTAAATCTGTGTTACATATTTACGAGTTGGCTTCCG
GACAGGTTATTAATTATGATAAATCTATGTTCATGGTTAGCAGGAACACTAGCCCGGGCATGCAACAATTTATTAGGCACACATTGGGGGTGGTCCAGACGAGTTCGCTT
GGTCGTTATTTGGGGCTCCCTTCGCAAAATGCTCGGTCAAAGTGTGTAATCTTTCGATCAGTTAGGGATCGGGTTTGGAAGGTTCTACAGGGGTGGAAGGAAAAGCTTTT
TTCGATAGGTGGGAAGGAGGTCCTCATAAAGGCTGTTGCCCAGACAATTCCCACCTATACGATGAGCTGTTTTAAGCTGCCAGTGTCCCTATGTAATGAGATAAATAGTT
TGTGTGCCAATTTTTGGTGGGGGTCAACAGATGCTGGACAAAAGATCCATTGGCGGAGTTGGAACCGGTTGTGTCGCCATACGGTGTTGGGTGGTATGGGGTTCCGTGAT
CTACATGTTTTTAATCAGGCTATGCTGGCTAAGCAGTGTTGGCGGCTGTCCCGGGACCAGTATTTTTGTACGGGTTCTTTCCTTCGGGCAACATTGGGGTCGAACCCGTT
GTATGCATGGAGGAGCCTACTGTGGGGTCGGGTACTATTCAAACAAGGTGTAAGGTGGAGAGTTGGTAACGGCCATAGTATCCGGATATGGGGGGACCCGTGGATTCCGA
GGACCGATAACGACAAGGTTTTGGGGATTCGAGATGAGATTCGTTCAGAGTCGGTGTCCTCGTTGATTATGGAGGATGGGTCATGGGATGTAGAGAAGGTGAGGGGGGAA
TTTCTGCAGGACGATGAGGAACATATTTTGGCAATACCATTAAGTGGTCAGAGAGAGGATGATGAGATATATTGGGTGCCAGATGGTAAGGGGCGTTTCTCTGTGAAAAG
TGCGTATGCGCTTGGGGTGAGTTTGGTTGAAGATGCGTCTGGTATGATTGGGATCCTCTGGATCACTTCGCATGGGTTAAAGACCATGACGTGGAAGATGATGGGGCATA
CTTCGTTCTACTTCTGTGGCATATTTGGGAGTTCAGAAATCCCAAGATCTTTCGCAAGGGAGAGCTTGGCTAAAGGAGGCTGGGATTGGGGTTTGGGTTGGGTGTTGCGG
GACCATGTTGGAAATTTGATAGGGGCGGGGTGTGAGTCTGTTACAGGGCTCCTCGGGGTGGACACGTTGGAAATGCAGGCAATTCGAGCTGGATTATTGGCTGTTGGTGA
GATGGGGCCTCCTCGATTGATGGTGGAGTCAGATAGTTTGGCTGCTATAAATCTCTTGAATGGTGTGGATGAGAACCTCACCTTAGTTCGCTTTGTGGCTATGGAGGTGC
AGAAGCTTTCTTCTTGTTTGGCGGGAGTCCAGTTTAAGCATATTCGGCGAGGCCAAAATTTGGTGGCTGATATGTTGGCGCGACGAGCAATGAGTAATGGGATTTCTGGC
ATTTGGTTTTCGGGGTTCCCATGCTGGCTGCTTGAAGCTATAGAGGGTGACAAGCAAAAGTTGGTGTAA
Protein sequence
MGALSRGPSSVKRRRFVGWSLKNQKIGRRIGWRLKRSWSLCWLRMRHIDDKGRGRNGWCGGTGIQSGFMPEHLSAVGVIELRASEMVLDNGSQIRVAVDTVLRSVTPTVS
AAQNQALLREFTRAEVEAALKSIGPTKAPGPDGVNALFYQRHWDLIGDETSALCLGILNGGGSVKELNQTHLVLIPKIRGPTSVQDFRPISLCNVVYKICAKAIANRLKG
VLHSVVSDEQSAFVPGRLISDNAVIGFQCLHALASKTRGKTGNVAMKLDMSKAYDRRDCLTIKSVLHIYELASGQVINYDKSMFMVSRNTSPGMQQFIRHTLGVVQTSSL
GRYLGLPSQNARSKCVIFRSVRDRVWKVLQGWKEKLFSIGGKEVLIKAVAQTIPTYTMSCFKLPVSLCNEINSLCANFWWGSTDAGQKIHWRSWNRLCRHTVLGGMGFRD
LHVFNQAMLAKQCWRLSRDQYFCTGSFLRATLGSNPLYAWRSLLWGRVLFKQGVRWRVGNGHSIRIWGDPWIPRTDNDKVLGIRDEIRSESVSSLIMEDGSWDVEKVRGE
FLQDDEEHILAIPLSGQREDDEIYWVPDGKGRFSVKSAYALGVSLVEDASGMIGILWITSHGLKTMTWKMMGHTSFYFCGIFGSSEIPRSFARESLAKGGWDWGLGWVLR
DHVGNLIGAGCESVTGLLGVDTLEMQAIRAGLLAVGEMGPPRLMVESDSLAAINLLNGVDENLTLVRFVAMEVQKLSSCLAGVQFKHIRRGQNLVADMLARRAMSNGISG
IWFSGFPCWLLEAIEGDKQKLV