CuGenDBv2

Lag0030672 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030672
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr11:313249..316697
RNA-Seq Expression: Lag0030672
Synteny: Lag0030672
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
BBN69274.1 TatD related DNase [Prunus dulcis] (e value: 1.0e-134; identity: 38.86%)
Query:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI
        GLV K+D EKA+D V+W ++D ++  KGFG KWR WI GCL SAN+SI+ING+PRGK  ASRGLRQGDPLSPFLF ++ D  SR++ +A+   L+ G   
Subjt:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI

Query:  DSESPSINHIQFADDTILFTQFEDD--LSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP
          +   ++H+QFADDTI F   +++  L+   MLK+  D    SG  IN  K+ ILGIN     L   A   GC++G WP  YLGLPL G+    +FW P
Subjt:  DSESPSINHIQFADDTILFTQFEDD--LSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP

Query:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA
        +++K+EKRL  W  + +SKGGRLTL+QA LS++P+YY+SL+++P  +  ++E + R FLW+G    ++ HLVRW ++    ++GGLG+ S+ ++N+AL A
Subjt:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA

Query:  KWSWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE
        KW WRF  E N+LW + IK+KYG  S      +      + PW+ I K +          +GNG +  FW D W+    L   +P LY LS  K   +  
Subjt:  KWSWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE

Query:  AW----NESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISL-SHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELA
        AW    +E    WD    RNL + E+ E   L   L ++ L     D+  W +     FS KS  S +    S++ P   + IW+   P KI+FF+W  A
Subjt:  AW----NESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISL-SHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELA

Query:  HKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKE
        +  INT + +QRR P + LSP WC  C+ + E+ DH+F  C ++ K W R+  + G    +P    ++L+  L         G+L   +  A FW +W E
Subjt:  HKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKE

Query:  RNRRVFQG-IDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSLLNSWESLL
        RNRR+FQG   +  +  +D + ++A  W  +S     Y Y++++    ++L
Subjt:  RNRRVFQG-IDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSLLNSWESLL

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.1e-136; identity: 39.22%)
Query:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI
        G VIKLDIEKAFDK++W ++D +L+ K +  KWR  I  C+SS  YSILINGRPRG+I  SRG+RQGDPLSPF+F++ MD  SRLL        I G + 
Subjt:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI

Query:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP
           SP++N  HI FADD ++F +  DD    ++  ++  FE+ASG NINL K+ I  INV     +  A   G   G  P+SYLG+PL G  +  +FW  
Subjt:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP

Query:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA
        +++KI+K+LSNW  S +SKGGR+TL+ +TL +LP Y +S++++PK I  +IE+ +R FLW G +   +  L+RWN+++ P +KGGLG+HS+   N ALL 
Subjt:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA

Query:  KWSWRFNQERNALWRKFIKAKYGAP---SKVSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE
        KW W+F  E++ LW++ I +KY      S  S    + +  PWK + +        I+ ++ +G +  FW D W  N PL    P L+ LS  K  +VKE
Subjt:  KWSWRFNQERNALWRKFIKAKYGAP---SKVSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE

Query:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK
         WN S N W L + R L+D E + W  +   L +   +    K  W LN N+IF T S+   +++   + +   P+L K +W+  +PKK KFF+W L H 
Subjt:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK

Query:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN
         INT + LQ+R+P  +LSP WCY+C  S E  +H+F  CP++ + WS+      W+ S P ++  ++ ++       N KGL+  N      W +W ERN
Subjt:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN

Query:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL
         R+F+  + +    ++  +     W   S L ++Y   S+
Subjt:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 5.4e-136; identity: 39.22%)
Query:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI
        G VIKLDIEKAFDK++W ++D +L+ K +  KWR+ I  C+SS  YSILINGRPRG+I  +RG+RQGDPLSPF+F++ MD  S LL      G I G   
Subjt:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI

Query:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP
            P++N  HI FADD ++F + ++D    ++  ++  FE+ASG NINL K+ I  INV            G   G+ P++YLG+PL G  +  +FW  
Subjt:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP

Query:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA
        I++KI+K+LS+W  S +SKGGR+TL+ +TL +LP Y LS++++PK I  +IE+ +R FLW G +   +  L+RWN+++ P +KGGLG+HS+   N ALL 
Subjt:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA

Query:  KWSWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE
        KW W+F  E+  LW++ I +KY         S+   + +  PWK +          I  ++ +G +  FW D W  N+PL    P L+ LS  K  +VK+
Subjt:  KWSWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE

Query:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK
         WN SL  W++ + R L+D E + W  +   L +        K  W LN N+IF T S+   LS+ + + +   PSL K +W+ ++PKK KFF+W L H 
Subjt:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK

Query:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN
         INT + LQ+R+P  +LSP WCY+C  S E  +H+F  CP++ + WS+      W+ S P N V  LA  +        KGL+  N      W +W ERN
Subjt:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN

Query:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL
         R+F+     F   ++ ++     W   S L ++Y   S+
Subjt:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.7e-134; identity: 38.91%)
Query:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI
        G VIKLDIEKAFDK++W ++D +L+ K +  KWR+ I  C+SS  YSILINGRPRG+I  +RG+RQGDPLS F+F++ MD  S LL      G I G   
Subjt:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI

Query:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP
            P++N  HI FADD ++F + ++D    ++  ++  FE+ASG NINL K+ I  INV            G   G+ P++YLG+PL G  +  +FW  
Subjt:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP

Query:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA
        I++KI+K+LS+W  S +SKGGR+TL+ +TL +LP Y LS++++PK I  +IE+ +R FLW G +   +  L+RWN+++ P +KGGLG+H +   N ALL 
Subjt:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA

Query:  KWSWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE
        KW W+F  E+  LW++ I +KY         S+   + +  PWK +          I  ++ +G +  FW D W  N+PL    P L+ LS  K  +VK+
Subjt:  KWSWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE

Query:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK
         WN SL  W++ + R L+D E + W  +   L +        K  W LN N+IF T S+   LS+ + + +   PSL K +W+ ++PKK KFF+W L H 
Subjt:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK

Query:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN
         INT + LQ+R+P  +LSP WCY+C  S E  +H+F  CP++ + WS+      W+ S P N V  LA  +        KGL+  N      W +W ERN
Subjt:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN

Query:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL
         R+F+     F   ++ ++     W   S L ++Y   S+
Subjt:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 1.5e-133; identity: 38.89%)
Query:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI
        G+V K+D EKA+D VDW +LD +L  KGF  KWR WIRGCLSS++++IL+NG  +G + ASRGLRQGDPLSPFLF ++ D  SR+L +AE  GL +GF +
Subjt:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI

Query:  DSESPSINHIQFADDTILFTQ--FEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP
          +   ++ +QFADDTI F++   E   + K +L V   F   SG  INL K+ I GIN    +L   A    C++ EWP SYLGLPL G+     FW P
Subjt:  DSESPSINHIQFADDTILFTQ--FEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP

Query:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA
        ++E+I +RL  W  +++S GGR+TL+Q+ LS++P+Y+LSL+++P  I ++IE M R FLW G    +  HLVRW  +  P + GGLG   +  +N ALL 
Subjt:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA

Query:  KWSWRFNQERNALWRKFIKAKYGAPSKVSKASLA---KSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE
        KW WRF +ER+ LW K I + YG       A++      + PWK I +     S  +   +GNG    FW D W  N  L +++  LYR+  VK  TV  
Subjt:  KWSWRFNQERNALWRKFIKAKYGAPSKVSKASLA---KSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE

Query:  AWNESLNF-WDLKLGRNLKDLEVDEWAALSTDLDSIS-LSHCEDKWKWPLNHNSIFSTKS--LLSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAH
            S    W+L   RNL D E+D    L + L S+       D   W L+ + +F+ KS  L      N  L  P  AK +W    P K+K   W +AH
Subjt:  AWNESLNF-WDLKLGRNLKDLEVDEWAALSTDLDSIS-LSHCEDKWKWPLNHNSIFSTKS--LLSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAH

Query:  KAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKER
          +NT + LQ R PY SL P WC LC+ + ES DH+F  CP     W+++ K  G     P +  D+L     G   +     LW        WI+W+ER
Subjt:  KAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKER

Query:  NRRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSLLNSWESL
        N+R+F+    S +  +D++++Y+  W   S          +  +W  +
Subjt:  NRRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSLLNSWESL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein (e value: 5.2e-137; identity: 39.22%)
Query:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI
        G VIKLDIEKAFDK++W ++D +L+ K +  KWR  I  C+SS  YSILINGRPRG+I  SRG+RQGDPLSPF+F++ MD  SRLL        I G + 
Subjt:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI

Query:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP
           SP++N  HI FADD ++F +  DD    ++  ++  FE+ASG NINL K+ I  INV     +  A   G   G  P+SYLG+PL G  +  +FW  
Subjt:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP

Query:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA
        +++KI+K+LSNW  S +SKGGR+TL+ +TL +LP Y +S++++PK I  +IE+ +R FLW G +   +  L+RWN+++ P +KGGLG+HS+   N ALL 
Subjt:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA

Query:  KWSWRFNQERNALWRKFIKAKYGAP---SKVSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE
        KW W+F  E++ LW++ I +KY      S  S    + +  PWK + +        I+ ++ +G +  FW D W  N PL    P L+ LS  K  +VKE
Subjt:  KWSWRFNQERNALWRKFIKAKYGAP---SKVSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE

Query:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK
         WN S N W L + R L+D E + W  +   L +   +    K  W LN N+IF T S+   +++   + +   P+L K +W+  +PKK KFF+W L H 
Subjt:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK

Query:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN
         INT + LQ+R+P  +LSP WCY+C  S E  +H+F  CP++ + WS+      W+ S P ++  ++ ++       N KGL+  N      W +W ERN
Subjt:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN

Query:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL
         R+F+  + +    ++  +     W   S L ++Y   S+
Subjt:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein (e value: 2.6e-136; identity: 39.22%)
Query:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI
        G VIKLDIEKAFDK++W ++D +L+ K +  KWR+ I  C+SS  YSILINGRPRG+I  +RG+RQGDPLSPF+F++ MD  S LL      G I G   
Subjt:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI

Query:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP
            P++N  HI FADD ++F + ++D    ++  ++  FE+ASG NINL K+ I  INV            G   G+ P++YLG+PL G  +  +FW  
Subjt:  DSESPSIN--HIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLP

Query:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA
        I++KI+K+LS+W  S +SKGGR+TL+ +TL +LP Y LS++++PK I  +IE+ +R FLW G +   +  L+RWN+++ P +KGGLG+HS+   N ALL 
Subjt:  IIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLA

Query:  KWSWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE
        KW W+F  E+  LW++ I +KY         S+   + +  PWK +          I  ++ +G +  FW D W  N+PL    P L+ LS  K  +VK+
Subjt:  KWSWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE

Query:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK
         WN SL  W++ + R L+D E + W  +   L +        K  W LN N+IF T S+   LS+ + + +   PSL K +W+ ++PKK KFF+W L H 
Subjt:  AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSL---LSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHK

Query:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN
         INT + LQ+R+P  +LSP WCY+C  S E  +H+F  CP++ + WS+      W+ S P N V  LA  +        KGL+  N      W +W ERN
Subjt:  AINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERN

Query:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL
         R+F+     F   ++ ++     W   S L ++Y   S+
Subjt:  RRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSL

A0A803P465 Uncharacterized protein (e value: 8.7e-140; identity: 40.92%)
Query:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI
        GLV K+D EKA+D+V+W ++D +L  KGFG  WR WI+GC+SS ++S+ IN  PRGK   SRGLRQGDPLSPFLF ++ D   R+  KA S G I GF +
Subjt:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI

Query:  DSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLPII
          E   ++H+QFADDTI F   E++ S   +L+VVE F A SG  INL K+++LGI ++  I+   A   GC++G WP  YLG+PL GS     FW P++
Subjt:  DSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLPII

Query:  EKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKW
        +K  KRL  W  +F+SKGGRLTL+Q+ LS+LP Y+LSL++ P+ +   +E M R FLW+G   +  +HLV W+++  P  +GGLG+  +  +NK+LL KW
Subjt:  EKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKW

Query:  SWRFNQERNALWRKFIKAKYGAPSKV---SKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE--
         WRF  E+N+LW + + ++YG    +    K S    KGPW++I   +      +  +LG G    FW D WI++ PLV+ +P L  +S  +   +KE  
Subjt:  SWRFNQERNALWRKFIKAKYGAPSKV---SKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKE--

Query:  -------AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSIS-LSHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLW
                W ES   W+ K  RNL D E+    +L   ++ +  LS  ED   W  + + +FS KS  S M  N S       K +W+   P K+K F W
Subjt:  -------AWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSIS-LSHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLW

Query:  ELAHKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWIL
         LA + +N  + +Q+R P+L +SPGWC  C+ S ES  H+F  C F  + W  +   FG S  +P ++  ++AS L+G   +  K  LW +   A  W +
Subjt:  ELAHKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWIL

Query:  WKERNRRVFQG
        W ERN R+F+G
Subjt:  WKERNRRVFQG

A0A803P8A0 Uncharacterized protein (e value: 9.2e-142; identity: 38.02%)
Query:  SISLGKISLNRQDLRFPHGLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSF
        S+ L   ++     R   G V+K+D EKA+D+VDW +LD +L  KGFG +WR WIRGC+SS ++SI +NGR RGK H SRGLRQGDPLSPFLF ++ D  
Subjt:  SISLGKISLNRQDLRFPHGLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSF

Query:  SRLLTKAESDGLIKGFRIDSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYL
         R++ KA       GF+I  ++  ++H+QFADDT+ F + ED L  + ++K+VE F   SG  +NL+K+++LGI +    +   A+  GC++G+WP +YL
Subjt:  SRLLTKAESDGLIKGFRIDSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYL

Query:  GLPLNGSSNLRDFWLPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKG
        G+PL GS   + FW P+++K  KR+  W  SF+S+GGRLTL+Q+ LS+LP YYLSL+++PK +  E+E M R F W+GG+     HLV W+++  P  +G
Subjt:  GLPLNGSSNLRDFWLPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKG

Query:  GLGLHSMIDKNKALLAKWSWRFNQERNALWRKFIKAKYGAPSKV--SKASLAKS-KGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKY
        GL +  +  +NK LL KW WRF  E N+LW K IK++YG       +K  +  S +GPW +I   +H     +  ++GNG    FW D WI    L  ++
Subjt:  GLGLHSMIDKNKALLAKWSWRFNQERNALWRKFIKAKYGAPSKV--SKASLAKS-KGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKY

Query:  PLLYRLSHVKAATVKEAWNES------LNFWDLKLGRNLKDLEVDEWAALSTDLDSIS-LSHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKI
        P L  LS  K A+++E   ++      +  WD K  RN+ D E+     L   L+ +  LS  +D   W  +   IFS+KS  S  T      E S  KI
Subjt:  PLLYRLSHVKAATVKEAWNES------LNFWDLKLGRNLKDLEVDEWAALSTDLDSIS-LSHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKI

Query:  IWRDNYPKKIKFFLWELAHKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIK
        +W++  P K+K F W +A   +N    LQ++ P+L +SPGWC  C+ S E   H+F +C  A   W  +   F    ++P ++  +L S +     +N  
Subjt:  IWRDNYPKKIKFFLWELAHKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIK

Query:  GLLWMNIARAFFWILWKERNRRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSLLNSWESLL
          LW     +  W++W ERN R F+G   S +  ++ + ++  +W   +    + S+  L+  W SLL
Subjt:  GLLWMNIARAFFWILWKERNRRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSLLNSWESLL

A0A803QEA6 Uncharacterized protein (e value: 7.3e-139; identity: 38%)
Query:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI
        G V+K+D EKA+D+VDW +LD +L  KGFG +WR WIRGC+SS ++SI INGR RGK + SRGLRQGDPLSPFLF MI D   R++ KA     + GF+I
Subjt:  GLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRI

Query:  DSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLPII
          +   ++H+QFADDT+ F   +D++S + ++KVV+ F   SG  +NL+K+++LGI +    +   A   GC++G WP +YLG+ L GS   R FW P++
Subjt:  DSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLPII

Query:  EKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKW
        +K  KR+  W  SF+S+GGRLTL+Q+ LS+LP YYLSL++ PK +  E+E M R F W+GG+     HLV W+++  P  +GGL +  +  +NK LL KW
Subjt:  EKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKW

Query:  SWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKEAW
         WRF  E N+LW K IK++YG         +      +GPWK+I   +      +  ++GNG    FW D W+  + L  ++P L  +S  K  +++E  
Subjt:  SWRFNQERNALWRKFIKAKYGAPSK---VSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKEAW

Query:  NES------LNFWDLKLGRNLKDLEVDEWAALSTDLDSIS-LSHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELA
         +       +  WDL   RN+ D E+     L   L+ +  L+  ED   W  +   IFS+KS  S  T   +  E    KI+W+   P K+K F W +A
Subjt:  NES------LNFWDLKLGRNLKDLEVDEWAALSTDLDSIS-LSHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELA

Query:  HKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKE
           +N    LQ++ P++S+SPGWC  C+ S E   H+F  C  A   W  +   F    ++P ++  +L S + G   +     LW     +  W++W E
Subjt:  HKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKE

Query:  RNRRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSLLNSWESLL
        RN R+F+G + S +  +D + ++  SW   +    + S+  L+  W +LL
Subjt:  RNRRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSLLNSWESLL

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 1.2e-21; identity: 23.1%)
Query:  GWKEIREFLKISISLGKISLNRQDLRFPHGLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLS
        GW  IR+ + +   + +        +  + ++I +D EKAFDK+   ++ + L   G    +   IR        +I++NG+         G RQG PLS
Subjt:  GWKEIREFLKISISLGKISLNRQDLRFPHGLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLS

Query:  PFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSG
        P LF ++++  +R + + +    IKG ++  E   ++   FADD I++ +    +SA+++LK++ +F   SG  IN+ K++    N              
Subjt:  PFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSG

Query:  CKLGEWPSSYLGLPLNGSSN--LRDFWLPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSL--YQLPKKIGTEIESMFRAFLWKGGNRNRSQ
          +      YLG+ L        ++ + P++++I++  + W +   S  GR+ +++  +     Y  +    +LP    TE+E     F+W     N+ +
Subjt:  CKLGEWPSSYLGLPLNGSSN--LRDFWLPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSL--YQLPKKIGTEIESMFRAFLWKGGNRNRSQ

Query:  HLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERN
          +  + L      GG+ L       KA + K +W + Q R+
Subjt:  HLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERN

P08548 LINE-1 reverse transcriptase homolog (e value: 1.8e-17; identity: 22.29%)
Query:  GWKEIREFLKISISLGKISLNRQDLRFPHGLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLS
        GW  IR+ + +   + K       L+    +++ +D EKAFD +   ++ R L   G    +   I    S    +I++NG          G RQG PLS
Subjt:  GWKEIREFLKISISLGKISLNRQDLRFPHGLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLS

Query:  PFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSG
        P LF ++M+  +  + + ++   IKG  I SE   ++   FADD I++ +   D + K +L+V++++   SG  IN HK+        +   +       
Subjt:  PFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSG

Query:  CKLGEWPSSYLGLPLNGSSNLRDFWLPIIEKIEKRL----SNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQL--PKKIGTEIESMFRAFLWKGGNRNR
          +      YLG+ L  + +++D +    E + K +    + W +   S  GR+ +++ ++     Y  +   +  P     ++E +   F+W     N+
Subjt:  CKLGEWPSSYLGLPLNGSSNLRDFWLPIIEKIEKRL----SNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQL--PKKIGTEIESMFRAFLWKGGNRNR

Query:  SQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERNA-LWRK
         +  +    L      GG+ L  +    K+++ K +W +++ R   +W +
Subjt:  SQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERNA-LWRK

P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 2.3e-41; identity: 26.84%)
Query:  RDFWLPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDK
        +D +  I+E++  R+S W    +S  GRLTL +A LS++P + +S   LP+ I   ++ + R FLW      + QHLV+W+K+  P  +GGLG+ +    
Subjt:  RDFWLPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDK

Query:  NKALLAKWSWRFNQERNALWRKFIKAKYGAPSKVSKASLAKSKGPWKNILKH-----HHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSH
        N+AL++K  WR  QE+N+LW   ++ KY    ++  +     KG W +  +        ++S  +    G+G +  FWTD W++  PL+ +     R + 
Subjt:  NKALLAKWSWRFNQERNALWRKFIKAKYGAPSKVSKASLAKSKGPWKNILKH-----HHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSH

Query:  VKAATVKEAWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSIS-----LSHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKI---IWRDNYP
              K+ W           GR     ++D +   +T L+  +     ++   D+  W  + +  FS +S    +T +  +  P++A     +W+   P
Subjt:  VKAATVKEAWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDSIS-----LSHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKI---IWRDNYP

Query:  KKIKFFLWELAHKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRI----SKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLL
        +++K FLW + ++A+ T+E   RR  +LS S   C +C+   ES  H+   CP     W R+     +   +S SL   L D L         +  + + 
Subjt:  KKIKFFLWELAHKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFAWKFWSRI----SKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLL

Query:  WMNIARAFFWILWKERNRRVF
        W  I     W  WK R   +F
Subjt:  WMNIARAFFWILWKERNRRVF

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 8.6e-20; identity: 22.03%)
Query:  GWKEIREFLKISISLGKISLNRQDLRFPHGLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLS
        GW  IR+ + +   + K       L+  + ++I LD EKAFDK+   ++ ++L   G    + + I+   S    +I +NG     I    G RQG PLS
Subjt:  GWKEIREFLKISISLGKISLNRQDLRFPHGLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLS

Query:  PFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSG
        P+LF ++++  +R + + +    IKG +I  E   I+ +  ADD I++   +   S + +L ++  F    G  IN +K+            +     + 
Subjt:  PFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSG

Query:  CKLGEWPSSYLGLPLNGSSNLRDFW----LPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGT----EIESMFRAFLWKGGNR
          +      YLG+ L  +  ++D +      + ++I++ L  W     S  GR+ +++  +  LP        +P KI T    E+E     F+W     
Subjt:  CKLGEWPSSYLGLPLNGSSNLRDFW----LPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGT----EIESMFRAFLWKGGNR

Query:  NRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERNA-LWRKFIKAKYGAPSKVSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGET
          ++ L++  +       GG+ +  +    +A++ K +W + ++R    W +                    + P  N   + HLI  + A  +    ++
Subjt:  NRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERNA-LWRKFIKAKYGAPSKVSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGET

Query:  FF----WTDPWINNTPLVTKYPLLYRLSHVKAATVKEAW--NESLNFWDLKLGRNLKDLEVDE
         F    W + W+ +   +   P L   + VK+  +KE     E+L   + K+G++L+D+   E
Subjt:  FF----WTDPWINNTPLVTKYPLLYRLSHVKAATVKEAW--NESLNFWDLKLGRNLKDLEVDE

P92555 Uncharacterized mitochondrial protein AtMg01250 (e value: 2.4e-14; identity: 52.94%)
Query:  LINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDT
        +ING P+G +  SRGLRQGDPLSP+LFI+  +  S L  +A+  G + G R+ + SP INH+ FADDT
Subjt:  LINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDT

Arabidopsis top hits (e value, % identity, alignment)
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 8.0e-29; identity: 26.95%)
Query:  GEWPSSYLGLPLNGSSNLRDFWLPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNK
        G  P  YLGLPL         + P++EKI  R+  W +  +S  GRL L+ + + +L  +++S ++LP     EI+S+  +FLW G   N  +  V W+ 
Subjt:  GEWPSSYLGLPLNGSSNLRDFWLPIIEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNK

Query:  LLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERNALWRKFIKAKYGAPSKVSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPW--INN
        +  P D+GGLG+ S+ + NK   + WS   N    + W                         WK ILKH  L S  + H + NG  T FW D W  I  
Subjt:  LLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERNALWRKFIKAKYGAPSKVSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPW--INN

Query:  TPLVTKYPLLYRLSHVKAATVKEAWNESLNFWDLKLGRNLKD--LEVDEWAALSTDLDSISLSHCEDKWKWPLN---HNSIFSTKSLLSDMTGNISLIEP
           VT +     +     A+V EA        + +  R+  D  L +++  A   ++    L+  ED  +W  N       F+TK   +      +  EP
Subjt:  TPLVTKYPLLYRLSHVKAATVKEAWNESLNFWDLKLGRNLKD--LEVDEWAALSTDLDSISLSHCEDKWKWPLN---HNSIFSTKSLLSDMTGNISLIEP

Query:  SLA----KIIWRDNYPKKIKFFLWELAHKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFA
         L     K +W  +   K     W      + T +   R + + + +   C LC    E++DH+F TCP++
Subjt:  SLA----KIIWRDNYPKKIKFFLWELAHKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFATCPFA

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 2.1e-21; identity: 24.74%)
Query:  LPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERNALWRKFIKAKYGAPSKVSKAS
        LPTY ++ + LPK +  +I S+   F W+     +  H   W+ L     +GG+G   +   N ALL K  WR      +L  K  K++Y   S    A 
Subjt:  LPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERNALWRKFIKAKYGAPSKVSKAS

Query:  L-AKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKEAWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDS
        L ++    WK+I     ++       +GNG +   W   W+++ P  +    + R+   + A+V    +  L   DL +  + ++   D    L  +++ 
Subjt:  L-AKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKEAWNESLNFWDLKLGRNLKDLEVDEWAALSTDLDS

Query:  ISLSHCE-------DKWKWPLNHNSIFSTKS-------LLSDMTGNISLIEPSLAKI---IWRDNYPKKIKFFLWELAHKAINTKEVLQRRMPYLSLSPG
          +           D + W    +  ++ KS       +++  +    + EPSL  I   IW+     KI+ FLW+    ++     L  R  +LS    
Subjt:  ISLSHCE-------DKWKWPLNHNSIFSTKS-------LLSDMTGNISLIEPSLAKI---IWRDNYPKKIKFFLWELAHKAINTKEVLQRRMPYLSLSPG

Query:  WCYLCRSSDESQDHIFATCPFAWKFW--SRISKSFG--WSMSLPGNLVDILASVLLGHPFTNIKGLL--WMNIARAFFWILWKERNRRVFQG
         C  C S  E+ +H+   C FA   W  S I    G  W+ S+  NL  +  ++  G+P       L  W+       W LWK RN  VF+G
Subjt:  WCYLCRSSDESQDHIFATCPFAWKFW--SRISKSFG--WSMSLPGNLVDILASVLLGHPFTNIKGLL--WMNIARAFFWILWKERNRRVFQG

AT5G43680.1 unknown protein (e value: 9.2e-17; identity: 62.9%)
Query:  MLGISYGELFLLIGATAAFIGPKDLPVMARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQI
        MLG+SYGEL L++GATAA +GPKDLP++AR  GR+ GRAIGY+ +ARG  D VM+Q Q ++I
Subjt:  MLGISYGELFLLIGATAAFIGPKDLPVMARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQI

AT5G43680.2 unknown protein (e value: 9.2e-17; identity: 62.9%)
Query:  MLGISYGELFLLIGATAAFIGPKDLPVMARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQI
        MLG+SYGEL L++GATAA +GPKDLP++AR  GR+ GRAIGY+ +ARG  D VM+Q Q ++I
Subjt:  MLGISYGELFLLIGATAAFIGPKDLPVMARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 1.7e-15; identity: 52.94%)
Query:  LINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDT
        +ING P+G +  SRGLRQGDPLSP+LFI+  +  S L  +A+  G + G R+ + SP INH+ FADDT
Subjt:  LINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKAESDGLIKGFRIDSESPSINHIQFADDT


Sequences
CDS sequence
ATGCTGGGCATTTCATATGGAGAACTCTTTCTCCTTATTGGAGCTACTGCTGCCTTCATTGGGCCAAAGGATCTCCCAGTGATGGCAAGAATGGCAGGAAGGATGGCTGG
CAGAGCAATAGGATATGTTCAGTTAGCTCGAGGCCAATTTGACTCTGTCATGCAACAAACCCAAGCTCGCCAGATTGGTTCATCTGATGATTCAAGAAATTGGTCACTAG
AGTCCTCTGGTTGTTTCACGTTGAAATCATTATCTCGTCAACTCGTTTCTTCCTCACCATTGGCTAATGATTTGTTTAGGAATTATGGTTGGAAAGAAATTAGAGAATTT
TTGAAGATAAGTATCTCCCTTGGCAAGATCAGTTTGAATCGGCAAGACTTAAGGTTTCCTCATGGTTTAGTCATTAAACTTGACATCGAAAAAGCTTTTGACAAGGTCGA
TTGGAATTACCTCGATCGTATTCTTTTAGCCAAAGGATTTGGTAACAAGTGGAGATCCTGGATTAGAGGCTGTTTATCTTCAGCAAATTATTCGATTCTCATAAATGGAC
GCCCCAGAGGTAAAATCCATGCTTCTCGAGGATTAAGACAAGGAGATCCACTGTCTCCTTTTCTTTTTATCATGATTATGGATAGTTTTAGTCGTTTGCTCACCAAAGCT
GAATCCGACGGGCTCATTAAAGGTTTCAGAATTGATTCAGAATCCCCTAGCATCAACCACATTCAATTCGCCGATGACACCATTTTATTCACACAGTTTGAAGATGATTT
ATCAGCAAAATCCATGCTCAAAGTGGTGGAGGATTTTGAAGCAGCTTCTGGACAAAACATTAACCTTCATAAAACTGAGATTCTGGGCATAAATGTGGAGCATAGTATTC
TGGAGATTTTTGCTCATCATTCTGGATGCAAATTGGGAGAGTGGCCTAGTTCATACTTGGGTCTGCCTTTAAATGGCTCTTCAAATCTTAGAGACTTTTGGCTGCCGATT
ATTGAAAAGATTGAAAAGAGGCTTTCAAATTGGGGGTCATCTTTTATATCAAAAGGGGGTAGGCTCACTCTCCTTCAAGCTACCTTATCAAATCTTCCCACATATTATCT
ATCTTTATATCAACTTCCCAAGAAAATTGGTACGGAGATAGAGAGTATGTTTAGGGCTTTTCTGTGGAAAGGGGGAAATCGCAACAGAAGCCAGCATCTAGTCCGATGGA
ACAAACTTCTCTATCCTATTGACAAAGGAGGTTTGGGCTTACATTCCATGATTGACAAAAACAAAGCCCTCTTGGCAAAATGGAGTTGGAGATTCAATCAAGAAAGGAAT
GCTTTGTGGAGAAAGTTCATAAAGGCTAAATATGGAGCCCCCTCAAAAGTTAGCAAGGCTTCCCTTGCTAAATCAAAGGGTCCCTGGAAGAATATTTTGAAACATCATCA
TCTTATATCTAGTCGCATAGCCCACCAGCTTGGCAATGGAGGGGAGACTTTTTTCTGGACGGACCCTTGGATAAACAACACCCCACTGGTCACGAAATATCCCCTTCTCT
ACCGCCTTTCTCATGTCAAGGCAGCCACAGTCAAAGAGGCATGGAACGAATCGTTAAATTTTTGGGACCTCAAACTGGGTAGAAATTTGAAAGATTTGGAGGTGGATGAA
TGGGCTGCTTTGAGCACGGATCTTGACTCCATTTCCTTATCTCATTGTGAAGACAAATGGAAATGGCCCCTCAATCATAACAGTATTTTTTCCACCAAATCTCTCCTCTC
TGATATGACGGGCAATATTAGCCTCATTGAGCCTTCCTTGGCAAAGATTATTTGGAGAGACAACTACCCAAAGAAGATAAAATTCTTTTTATGGGAACTAGCCCACAAAG
CCATCAACACCAAGGAGGTTCTTCAGAGACGCATGCCTTACTTGTCTCTCTCCCCCGGTTGGTGTTACTTATGTAGATCTAGTGACGAATCTCAAGATCACATTTTTGCA
ACATGCCCCTTTGCTTGGAAGTTCTGGAGTAGGATCTCTAAATCTTTTGGGTGGTCCATGTCTCTACCGGGTAATTTGGTGGATATCCTTGCCTCCGTTCTATTGGGACA
CCCTTTCACCAATATCAAAGGCCTTCTATGGATGAACATCGCTAGAGCCTTCTTTTGGATCTTATGGAAAGAAAGGAACAGAAGAGTTTTCCAAGGAATCGATCTCTCTT
TTGATGGTTTTTTCGACATTGTTATATACTATGCTATATCTTGGTGCAAACTCTCTCCCCTCCTTGCTTCGTACTCATATACTTCCCTCTTAAACAGTTGGGAAAGTCTT
TTGTAA
mRNA sequence
ATGCTGGGCATTTCATATGGAGAACTCTTTCTCCTTATTGGAGCTACTGCTGCCTTCATTGGGCCAAAGGATCTCCCAGTGATGGCAAGAATGGCAGGAAGGATGGCTGG
CAGAGCAATAGGATATGTTCAGTTAGCTCGAGGCCAATTTGACTCTGTCATGCAACAAACCCAAGCTCGCCAGATTGGTTCATCTGATGATTCAAGAAATTGGTCACTAG
AGTCCTCTGGTTGTTTCACGTTGAAATCATTATCTCGTCAACTCGTTTCTTCCTCACCATTGGCTAATGATTTGTTTAGGAATTATGGTTGGAAAGAAATTAGAGAATTT
TTGAAGATAAGTATCTCCCTTGGCAAGATCAGTTTGAATCGGCAAGACTTAAGGTTTCCTCATGGTTTAGTCATTAAACTTGACATCGAAAAAGCTTTTGACAAGGTCGA
TTGGAATTACCTCGATCGTATTCTTTTAGCCAAAGGATTTGGTAACAAGTGGAGATCCTGGATTAGAGGCTGTTTATCTTCAGCAAATTATTCGATTCTCATAAATGGAC
GCCCCAGAGGTAAAATCCATGCTTCTCGAGGATTAAGACAAGGAGATCCACTGTCTCCTTTTCTTTTTATCATGATTATGGATAGTTTTAGTCGTTTGCTCACCAAAGCT
GAATCCGACGGGCTCATTAAAGGTTTCAGAATTGATTCAGAATCCCCTAGCATCAACCACATTCAATTCGCCGATGACACCATTTTATTCACACAGTTTGAAGATGATTT
ATCAGCAAAATCCATGCTCAAAGTGGTGGAGGATTTTGAAGCAGCTTCTGGACAAAACATTAACCTTCATAAAACTGAGATTCTGGGCATAAATGTGGAGCATAGTATTC
TGGAGATTTTTGCTCATCATTCTGGATGCAAATTGGGAGAGTGGCCTAGTTCATACTTGGGTCTGCCTTTAAATGGCTCTTCAAATCTTAGAGACTTTTGGCTGCCGATT
ATTGAAAAGATTGAAAAGAGGCTTTCAAATTGGGGGTCATCTTTTATATCAAAAGGGGGTAGGCTCACTCTCCTTCAAGCTACCTTATCAAATCTTCCCACATATTATCT
ATCTTTATATCAACTTCCCAAGAAAATTGGTACGGAGATAGAGAGTATGTTTAGGGCTTTTCTGTGGAAAGGGGGAAATCGCAACAGAAGCCAGCATCTAGTCCGATGGA
ACAAACTTCTCTATCCTATTGACAAAGGAGGTTTGGGCTTACATTCCATGATTGACAAAAACAAAGCCCTCTTGGCAAAATGGAGTTGGAGATTCAATCAAGAAAGGAAT
GCTTTGTGGAGAAAGTTCATAAAGGCTAAATATGGAGCCCCCTCAAAAGTTAGCAAGGCTTCCCTTGCTAAATCAAAGGGTCCCTGGAAGAATATTTTGAAACATCATCA
TCTTATATCTAGTCGCATAGCCCACCAGCTTGGCAATGGAGGGGAGACTTTTTTCTGGACGGACCCTTGGATAAACAACACCCCACTGGTCACGAAATATCCCCTTCTCT
ACCGCCTTTCTCATGTCAAGGCAGCCACAGTCAAAGAGGCATGGAACGAATCGTTAAATTTTTGGGACCTCAAACTGGGTAGAAATTTGAAAGATTTGGAGGTGGATGAA
TGGGCTGCTTTGAGCACGGATCTTGACTCCATTTCCTTATCTCATTGTGAAGACAAATGGAAATGGCCCCTCAATCATAACAGTATTTTTTCCACCAAATCTCTCCTCTC
TGATATGACGGGCAATATTAGCCTCATTGAGCCTTCCTTGGCAAAGATTATTTGGAGAGACAACTACCCAAAGAAGATAAAATTCTTTTTATGGGAACTAGCCCACAAAG
CCATCAACACCAAGGAGGTTCTTCAGAGACGCATGCCTTACTTGTCTCTCTCCCCCGGTTGGTGTTACTTATGTAGATCTAGTGACGAATCTCAAGATCACATTTTTGCA
ACATGCCCCTTTGCTTGGAAGTTCTGGAGTAGGATCTCTAAATCTTTTGGGTGGTCCATGTCTCTACCGGGTAATTTGGTGGATATCCTTGCCTCCGTTCTATTGGGACA
CCCTTTCACCAATATCAAAGGCCTTCTATGGATGAACATCGCTAGAGCCTTCTTTTGGATCTTATGGAAAGAAAGGAACAGAAGAGTTTTCCAAGGAATCGATCTCTCTT
TTGATGGTTTTTTCGACATTGTTATATACTATGCTATATCTTGGTGCAAACTCTCTCCCCTCCTTGCTTCGTACTCATATACTTCCCTCTTAAACAGTTGGGAAAGTCTT
TTGTAA
Protein sequence
MLGISYGELFLLIGATAAFIGPKDLPVMARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQIGSSDDSRNWSLESSGCFTLKSLSRQLVSSSPLANDLFRNYGWKEIREF
LKISISLGKISLNRQDLRFPHGLVIKLDIEKAFDKVDWNYLDRILLAKGFGNKWRSWIRGCLSSANYSILINGRPRGKIHASRGLRQGDPLSPFLFIMIMDSFSRLLTKA
ESDGLIKGFRIDSESPSINHIQFADDTILFTQFEDDLSAKSMLKVVEDFEAASGQNINLHKTEILGINVEHSILEIFAHHSGCKLGEWPSSYLGLPLNGSSNLRDFWLPI
IEKIEKRLSNWGSSFISKGGRLTLLQATLSNLPTYYLSLYQLPKKIGTEIESMFRAFLWKGGNRNRSQHLVRWNKLLYPIDKGGLGLHSMIDKNKALLAKWSWRFNQERN
ALWRKFIKAKYGAPSKVSKASLAKSKGPWKNILKHHHLISSRIAHQLGNGGETFFWTDPWINNTPLVTKYPLLYRLSHVKAATVKEAWNESLNFWDLKLGRNLKDLEVDE
WAALSTDLDSISLSHCEDKWKWPLNHNSIFSTKSLLSDMTGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQRRMPYLSLSPGWCYLCRSSDESQDHIFA
TCPFAWKFWSRISKSFGWSMSLPGNLVDILASVLLGHPFTNIKGLLWMNIARAFFWILWKERNRRVFQGIDLSFDGFFDIVIYYAISWCKLSPLLASYSYTSLLNSWESL
L