; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010859 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010859
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr1:8062816..8064963
RNA-Seq ExpressionLag0010859
SyntenyLag0010859
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65957.1 hypothetical protein VITISV_035610 [Vitis vinifera]1.2e-7230.77Show/hide
Query:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-------------------
        N RG+G  +KR ++K FL    P +V++QETK    DR  + S+W+ RN  W  L A G+SGGI+++W+   +++ ++  G                   
Subjt:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-------------------

Query:  -------------------------------------ISTCLDGLMRNRMGKSLKS----------------------------SIVPR-----YSDKFI
                                             I  C + L   R+  S+K                              +  R     YS+++ 
Subjt:  -------------------------------------ISTCLDGLMRNRMGKSLKS----------------------------SIVPR-----YSDKFI

Query:  NLEARKLD----RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTE
         L  + L     R TSDH+P++L     KWGP+PFRFENMWL H SF      WW+     G  GH F+ KL+ LK +L+ WN + FG    ++  +L +
Subjt:  NLEARKLD----RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTE

Query:  ISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDGSKNK-----DVLAAFP-------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSP
        I+  DS+E+ G +SP  L QR  L+ E      L N D  K +     + L A P       + +DW+P+    A  +E  F E E+  AI  +  DK+P
Subjt:  ISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDGSKNK-----DVLAAFP-------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSP

Query:  GSDGFTSEFFKKCWNILKDDIMRMFQDFLRM-------------------------------------------------------------AYVEGRQI
        G DGFT   F+ CW+++K+D++R+F +F R                                                              A+V+GRQI
Subjt:  GSDGFTSEFFKKCWNILKDDIMRMFQDFLRM-------------------------------------------------------------AYVEGRQI

Query:  LDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING
        LDA LIANEI+DE  R   +G+V K+D EKA+D V W++LDH++  KGF  KWR WIRGCLSS +++IL+NG
Subjt:  LDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING

RVW55793.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.2e-7229.89Show/hide
Query:  MPIPSISRKIKTTQKKAKLVRELAGLSSSV--NYNKSSTKHSRGKKTSHDYYLRNVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWS
        +P+     +  T +K  K+   L  L   +  +  K     +R   ++      N RG+G  +KR  ++ FL    P +V+LQETK E  DR ++ S+W 
Subjt:  MPIPSISRKIKTTQKKAKLVRELAGLSSSV--NYNKSSTKHSRGKKTSHDYYLRNVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWS

Query:  GRNISWAFLEASGSSGGIIIMWNDPSIITTDITKGIST---------------------------------------------CLDG------LMRNRMG
        G+++ W  L A G+SGGI+I+W+      ++   G  +                                             C+ G       +  +MG
Subjt:  GRNISWAFLEASGSSGGIIIMWNDPSIITTDITKGIST---------------------------------------------CLDG------LMRNRMG

Query:  KSLKSSIVPRYSDKFINLEARKLDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFG
         S + ++  R  D+FI  E+  L R TSDH P+ L      WGP+PFRFENMWL H  F      WW+   + G  GH F+ KLK +K +L+ WNT VFG
Subjt:  KSLKSSIVPRYSDKFINLEARKLDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFG

Query:  CNTTKRNQLLTEISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDG--SKNKDVLAAFPDE----------------------------------
            ++  +LT++  +D IE+ G+++ ++L   R L+ + L    L+ + G   K ++V+ +   E                                  
Subjt:  CNTTKRNQLLTEISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDG--SKNKDVLAAFPDE----------------------------------

Query:  -IDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDF-----------------------------------
         IDW P+    A+ ++  F+E EV  A+  L  +K+PG DGFT   +++CW+++K+D+MR+F +F                                   
Subjt:  -IDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDF-----------------------------------

Query:  ---------------LRM-----------AYVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSS
                       LR            A+VEGRQILDA LIANE++DE  R   +G+V K+D EKA+D V+W +LDH+L  KGF  KWRSW+RGCLSS
Subjt:  ---------------LRM-----------AYVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSS

Query:  ANYSILING
        ++++IL+NG
Subjt:  ANYSILING

RVW90400.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]8.0e-7231.08Show/hide
Query:  GIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------LMR
        G+G  +KR ++K+FL    P +V++QETK E  DR ++ S+WS RN  WA L ASG+SGGI+I+W+   +   ++  G     I   +DG        + 
Subjt:  GIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------LMR

Query:  NRMGKSLKSSIVPRYSD------KFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKN
             +L+       SD      +  N   ++LD                     R TSDH+P++L     KWGP+PFRFENMWL+H SF      WW  
Subjt:  NRMGKSLKSSIVPRYSD------KFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKN

Query:  TPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISILDSIEETGHISPIQLSQRRSLKVEILNL--------------------------
            G  GH F+ KL+ +K +L+ WN   FG  + K+  +L  ++  DS+E+ G +S  +L QR   K E+  L                          
Subjt:  TPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISILDSIEETGHISPIQLSQRRSLKVEILNL--------------------------

Query:  --------------AALRNKDG-------SKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEF
                        L N+ G       S  +++L  F              + +DW+P+    A  +E  FTE E++ AI  +  DK+PG DGFT   
Subjt:  --------------AALRNKDG-------SKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEF

Query:  FKKCWNILKDDIMRMFQDFLRM-------------------------------------------------------------AYVEGRQILDASLIANE
        F+ CW+++K+D++R+F +F R                                                              A+V+GRQILDA LIANE
Subjt:  FKKCWNILKDDIMRMFQDFLRM-------------------------------------------------------------AYVEGRQILDASLIANE

Query:  IIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING
        I+DE  R   +G+V K+D EKA+D V W++LDH+L  KGF  +WR W+RGCLSS +Y++L+NG
Subjt:  IIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING

RVW96808.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.2e-7230.03Show/hide
Query:  IMPIPSISRKIKTTQKKAKLVRELAGLSSSV--NYNKSSTKHSRGKKTSHDYYLRNVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLW
        I+P+     +  T  K  K+   L  L   +     K + + ++    +      N RG+G  +KR +++ FL    P++V+LQETK E  DR  + S+W
Subjt:  IMPIPSISRKIKTTQKKAKLVRELAGLSSSV--NYNKSSTKHSRGKKTSHDYYLRNVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLW

Query:  SGRNISWAFLEASGSSGGIIIMWNDPSI-----------ITTDITKGISTCLDGLMRNR--MGKSLKSSIVPRYSDKFI----------NLEARKLDRPT
        +GR + W  L A G+SGGI+I+W+               +T     G   CL   +RN      ++++  + +  D+F+                L R T
Subjt:  SGRNISWAFLEASGSSGGIIIMWNDPSI-----------ITTDITKGISTCLDGLMRNR--MGKSLKSSIVPRYSDKFI----------NLEARKLDRPT

Query:  SDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISILDSIEETGHISP
        SDH P+ L     KWGP+PFRFENMWL H  F      WW+     G  GH F+ KLK +K++L+ WN   FG    ++  +LT++S +D IE+ G+++P
Subjt:  SDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISILDSIEETGHISP

Query:  IQLSQRRSLKVEILNLAALRN-------------KDGSKN-----------------------------------KDVLAAF-------------PDEID
          L   R+L+ + L    L+              K+G  N                                   ++++  F              + ID
Subjt:  IQLSQRRSLKVEILNLAALRN-------------KDGSKN-----------------------------------KDVLAAF-------------PDEID

Query:  WNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDF--------------------------------------
        W P+     V ++ +FTE EV  A+  L  +K+PG DGFT   +++CW+++K+D+ R+F +F                                      
Subjt:  WNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDF--------------------------------------

Query:  ------------LRM-----------AYVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANY
                    LR            A+VEGR ILDA LIANE++DE  R   +G+V K+D EKA+D VDW +LDH+L  KGF  KWRSWIRGCLSS+++
Subjt:  ------------LRM-----------AYVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANY

Query:  SILING
        +IL+NG
Subjt:  SILING

RVX11280.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.2e-7232.45Show/hide
Query:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------
        NVRG+G   KR ++K FL    P +V++QETK E  DR  + S+W+ RN  W  L ASG+SGGI+I+W+  ++   ++  G     +   LDG       
Subjt:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------

Query:  ------------------------------------LMRNRMGKSLKSSIVP--RYSDKFI------------------NLE----ARKLD---------
                                            ++R    K   SS+ P  R  D FI                  N++     ++LD         
Subjt:  ------------------------------------LMRNRMGKSLKSSIVP--RYSDKFI------------------NLE----ARKLD---------

Query:  ------------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTE
                    R TSDH+P+++      WGP+PFRFENMWL+H +F      WW      G  GH F+ +L+ +K +L+ WN   FG    K+  +L +
Subjt:  ------------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTE

Query:  ISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDG-SKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDK
        ++  D+IE+ G ++P  L  R+ +K E+ N   L  K+  S  +++L  F              + +DW+P+    A+ ++  FTE E+  AI  L  DK
Subjt:  ISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDG-SKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDK

Query:  SPGSDGFTSEFFKKCWNILKDD----------------IMRMFQDFLRMAYVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHI
        +PG DGFT   F++CW+++K+D                ++     + + A+V+GRQILDA LIANEI+DE  R   +G+V K+D EKA+D V W++LDH+
Subjt:  SPGSDGFTSEFFKKCWNILKDD----------------IMRMFQDFLRMAYVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHI

Query:  LLAKGFGNKWRSWIRGCLSSANYSILINGA
        L  KGF  +WR W+ GCLSS +Y+IL+NG+
Subjt:  LLAKGFGNKWRSWIRGCLSSANYSILINGA

TrEMBL top hitse value%identityAlignment
A0A438C7C8 Transposon TX1 uncharacterized 149 kDa protein3.3e-7129.98Show/hide
Query:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKGI------------------
        NVRG+G   KR +IK FL    P +V++QETK E  DR ++ S+W+ RN  W  L A G+SGGI+ +W+   +   ++  G                   
Subjt:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKGI------------------

Query:  ---------------------------------------------------------------STCLDGLMRNR--MGKSLKSSIVPRYSDKFINLE---
                                                                       S  LD  +RN      +++ S V +  D+F+      
Subjt:  ---------------------------------------------------------------STCLDGLMRNR--MGKSLKSSIVPRYSDKFINLE---

Query:  -------ARKLDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTE
                  L R TSDH+P+ L      WGP+PFRFENMWL+H SF      WW+     G  GH F+ +L+ +K + + WN   FG    K+  +L +
Subjt:  -------ARKLDRPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTE

Query:  ISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDGSKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKS
        ++ LD+IE+ G ++  +L  R+ +K        + N   S  +++L  F              + +DW+P+    A+ +   FTE E+  AI  +  DK+
Subjt:  ISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDGSKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKS

Query:  PGSDGFTSEFFKKCWNILKDDIMRMFQDFLRM-----------------------------------AYVEGRQILDASLIANEIIDEWNRKHIKGLVIK
        PG DGFT   F+ CW+++K+D++R+F +F R                                    A+V+GRQI+DA LIANEI+DE  R   +G+V K
Subjt:  PGSDGFTSEFFKKCWNILKDDIMRMFQDFLRM-----------------------------------AYVEGRQILDASLIANEIIDEWNRKHIKGLVIK

Query:  LDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILINGALEVKFMLLE
        +D EKA+D V W++LD +L  KGF  KWR W+ GCLSS +Y++L+NG+++ +  +LE
Subjt:  LDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILINGALEVKFMLLE

A0A438DRE2 LINE-1 retrotransposable element ORF2 protein3.3e-7133.96Show/hide
Query:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------
        N RG+G  +KR ++K FL    P +V+ QETK E  DR  + S+W+ RN  WA L A G+SGGI+I+W+   +   ++  G     I   L+G       
Subjt:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------

Query:  ---------LMRN-------------RMGKSLKSSIVPRYSDKFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWGPSPFRFE
                 L ++             R+   L+S+    +S+  +N   ++LD                     R TSDH+P++L     KWGP+PFRFE
Subjt:  ---------LMRN-------------RMGKSLKSSIVPRYSDKFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWGPSPFRFE

Query:  NMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISILDSIEETGHISPIQLSQRRSLKVEILN-LAALRNK
        NMWL+H SF      WW+     G  GH F+ KL+ +K +L+ WN   FG                               + +S+K EIL     L   
Subjt:  NMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISILDSIEETGHISPIQLSQRRSLKVEILN-LAALRNK

Query:  DGSKNKDVLAAFPDEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDFL---------------RMAYV
           ++  V     + +DW+P+    A  +E  FTE E++ AI  +  DK+PG DGFT   F+ CW ++K+D++++    L               + A+V
Subjt:  DGSKNKDVLAAFPDEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDFL---------------RMAYV

Query:  EGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING
        +GRQILDA LIANEI+DE  R   +G+V K+D EKA+D V W++LDH+L  KGFG +WR W+RGCLSS ++++L+NG
Subjt:  EGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING

A0A438I181 Transposon TX1 uncharacterized 149 kDa protein3.9e-7231.08Show/hide
Query:  GIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------LMR
        G+G  +KR ++K+FL    P +V++QETK E  DR ++ S+WS RN  WA L ASG+SGGI+I+W+   +   ++  G     I   +DG        + 
Subjt:  GIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------LMR

Query:  NRMGKSLKSSIVPRYSD------KFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKN
             +L+       SD      +  N   ++LD                     R TSDH+P++L     KWGP+PFRFENMWL+H SF      WW  
Subjt:  NRMGKSLKSSIVPRYSD------KFINLEARKLD---------------------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKN

Query:  TPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISILDSIEETGHISPIQLSQRRSLKVEILNL--------------------------
            G  GH F+ KL+ +K +L+ WN   FG  + K+  +L  ++  DS+E+ G +S  +L QR   K E+  L                          
Subjt:  TPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISILDSIEETGHISPIQLSQRRSLKVEILNL--------------------------

Query:  --------------AALRNKDG-------SKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEF
                        L N+ G       S  +++L  F              + +DW+P+    A  +E  FTE E++ AI  +  DK+PG DGFT   
Subjt:  --------------AALRNKDG-------SKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEF

Query:  FKKCWNILKDDIMRMFQDFLRM-------------------------------------------------------------AYVEGRQILDASLIANE
        F+ CW+++K+D++R+F +F R                                                              A+V+GRQILDA LIANE
Subjt:  FKKCWNILKDDIMRMFQDFLRM-------------------------------------------------------------AYVEGRQILDASLIANE

Query:  IIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING
        I+DE  R   +G+V K+D EKA+D V W++LDH+L  KGF  +WR W+RGCLSS +Y++L+NG
Subjt:  IIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING

A0A438I2T6 Transposon TX1 uncharacterized 149 kDa protein2.5e-7130.85Show/hide
Query:  GIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG----------
        G+G  +KR ++K+FL    P +V++QETK E  DR ++ S+WS RN  WA L ASG+SGGI+I+W+   +   ++  G     I   +DG          
Subjt:  GIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG----------

Query:  ---------------------------------LMRNRMGKSLKSSIVP--RYSDKFI----------------------NLEARKLD------------
                                         ++R    K   S + P  +  D+FI                      N   ++LD            
Subjt:  ---------------------------------LMRNRMGKSLKSSIVP--RYSDKFI----------------------NLEARKLD------------

Query:  ---------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISI
                 R TSDH+P++L     KWGP+PFRFENMWL+H SF      WW      G  GH F+ KL+ +K +L+ WN   FG  + K+  +L  ++ 
Subjt:  ---------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISI

Query:  LDSIEETGHISPIQLSQRRSLKVEILNL----------------------------------------AALRNKDG-------SKNKDVLAAFP------
         DS+E+ G +S   L QR   K E+  L                                          L N+ G       S  +++L  F       
Subjt:  LDSIEETGHISPIQLSQRRSLKVEILNL----------------------------------------AALRNKDG-------SKNKDVLAAFP------

Query:  -------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDFLRM---------------------AYV
               + +DW+P+    A  +E  FTE E++ AI  +  DK+PG DGFT   F+ CW+++K+D++R+F +F R                      A+V
Subjt:  -------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDFLRM---------------------AYV

Query:  EGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING
        +GRQILDA LIANEI+DE  R   +G+V K+D EKA+D + W++LDH+L  KGF  +WR W+RGCLSS +Y++L+NG
Subjt:  EGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILING

A0A438JQP8 LINE-1 retrotransposable element ORF2 protein6.0e-7332.45Show/hide
Query:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------
        NVRG+G   KR ++K FL    P +V++QETK E  DR  + S+W+ RN  W  L ASG+SGGI+I+W+  ++   ++  G     +   LDG       
Subjt:  NVRGIGDWQKRALIKSFLLKFTPSLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKG-----ISTCLDG-------

Query:  ------------------------------------LMRNRMGKSLKSSIVP--RYSDKFI------------------NLE----ARKLD---------
                                            ++R    K   SS+ P  R  D FI                  N++     ++LD         
Subjt:  ------------------------------------LMRNRMGKSLKSSIVP--RYSDKFI------------------NLE----ARKLD---------

Query:  ------------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTE
                    R TSDH+P+++      WGP+PFRFENMWL+H +F      WW      G  GH F+ +L+ +K +L+ WN   FG    K+  +L +
Subjt:  ------------RPTSDHYPLMLTMGSGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTE

Query:  ISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDG-SKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDK
        ++  D+IE+ G ++P  L  R+ +K E+ N   L  K+  S  +++L  F              + +DW+P+    A+ ++  FTE E+  AI  L  DK
Subjt:  ISILDSIEETGHISPIQLSQRRSLKVEILNLAALRNKDG-SKNKDVLAAFP-------------DEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDK

Query:  SPGSDGFTSEFFKKCWNILKDD----------------IMRMFQDFLRMAYVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHI
        +PG DGFT   F++CW+++K+D                ++     + + A+V+GRQILDA LIANEI+DE  R   +G+V K+D EKA+D V W++LDH+
Subjt:  SPGSDGFTSEFFKKCWNILKDD----------------IMRMFQDFLRMAYVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHI

Query:  LLAKGFGNKWRSWIRGCLSSANYSILINGA
        L  KGF  +WR W+ GCLSS +Y+IL+NG+
Subjt:  LLAKGFGNKWRSWIRGCLSSANYSILINGA

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein7.9e-0622.84Show/hide
Query:  MEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDFLRMA---------------------------------------------
        +E   T  E+  A++L+  +KSPG DG T EFF+  W+ L  D  R+  +  +                                               
Subjt:  MEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDFLRMA---------------------------------------------

Query:  ----------------YVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILINGAL
                         V GR I D   +  +++    R  +    + LD EKAFD+VD  YL   L A  FG ++  +++   +SA   + IN +L
Subjt:  ----------------YVEGRQILDASLIANEIIDEWNRKHIKGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILINGAL

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.6e-0634.92Show/hide
Query:  AYVEGRQILDASLIANEIIDEWNRKH-IKG-LVIKLDIEKAFDKVDWNYLDHILLAKGFGNKW
        +++ GR   D  +   E +    RK  +KG +++KLD+EKA+D++ W+YL+  L++ GF   W
Subjt:  AYVEGRQILDASLIANEIIDEWNRKH-IKG-LVIKLDIEKAFDKVDWNYLDHILLAKGFGNKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACCATGAAGATCAAGAAGGAAATTGTACCTATCAGGCATCTCCAGAAAAGGCCCTTCAAATATTAGTCCCTTGGTTGGAAAAATACCGCATGTGTATTATGCC
CATTCCCTCCATAAGCAGAAAGATCAAAACAACCCAAAAGAAAGCTAAATTAGTGAGAGAGTTGGCTGGCCTTTCCTCCTCTGTTAACTATAACAAATCTTCCACAAAGC
ATTCTAGGGGGAAGAAGACATCTCATGATTATTATCTCCGGAACGTCAGAGGTATTGGAGATTGGCAGAAAAGGGCTCTTATTAAATCCTTCCTATTGAAATTTACTCCG
TCTCTGGTCATTCTCCAAGAAACGAAGCTTGAGTATGTCGACCGCAATATTATTAAGTCTCTCTGGAGTGGCAGGAACATCAGTTGGGCTTTCTTGGAAGCTTCCGGTTC
TTCGGGTGGTATCATCATTATGTGGAATGATCCTTCGATTATCACCACAGACATTACGAAAGGGATTTCAACGTGTCTCGATGGTCTTATGAGAAATCGAATGGGAAAGT
CACTAAAAAGTTCTATTGTCCCAAGGTATTCTGATAAGTTCATAAATCTGGAGGCTAGGAAACTTGACCGACCAACATCTGACCATTACCCATTGATGCTTACCATGGGC
AGTGGCAAATGGGGGCCATCCCCTTTCCGTTTTGAGAATATGTGGTTGAAACACCGCTCTTTTTTGCCCCTGATTGATTACTGGTGGAAGAATACTCCTATGAGAGGCGG
GCCGGGCCATGGATTTATCATGAAATTGAAAGATTTGAAAGTGAGACTCCGCTCTTGGAATACGGATGTTTTTGGGTGTAACACTACCAAAAGAAACCAATTGCTGACAG
AGATATCGATCCTTGATAGCATAGAAGAGACAGGACATATTTCCCCAATCCAGCTCTCGCAGCGCAGATCTTTGAAAGTCGAAATCCTCAACCTCGCTGCCTTGAGGAAC
AAAGATGGAAGCAAAAATAAAGATGTATTGGCAGCCTTCCCAGATGAAATTGATTGGAATCCCTTACAAAACCACCTTGCTGTGGATATGGAAGGGACATTCACAGAGAC
AGAAGTTTGGAATGCTATTAAGCTTCTAGGCAGTGACAAATCACCAGGCTCGGATGGTTTTACATCAGAATTCTTTAAAAAATGTTGGAACATCCTCAAAGACGATATTA
TGAGAATGTTCCAAGATTTTTTAAGAATGGCTTACGTGGAAGGTAGACAAATCCTAGATGCATCCCTTATCGCCAATGAAATTATTGATGAGTGGAATCGCAAGCATATC
AAAGGTTTAGTCATTAAACTTGACATTGAAAAAGCCTTTGATAAGGTTGATTGGAATTACCTTGATCATATTCTCTTAGCCAAAGGATTCGGTAACAAGTGGAGATCCTG
GATTAGAGGTTGTTTATCTTCAGCAAATTATTCGATTCTCATTAATGGCGCCCTAGAGGTAAAATTCATGCTTCTCGAGGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGACCATGAAGATCAAGAAGGAAATTGTACCTATCAGGCATCTCCAGAAAAGGCCCTTCAAATATTAGTCCCTTGGTTGGAAAAATACCGCATGTGTATTATGCC
CATTCCCTCCATAAGCAGAAAGATCAAAACAACCCAAAAGAAAGCTAAATTAGTGAGAGAGTTGGCTGGCCTTTCCTCCTCTGTTAACTATAACAAATCTTCCACAAAGC
ATTCTAGGGGGAAGAAGACATCTCATGATTATTATCTCCGGAACGTCAGAGGTATTGGAGATTGGCAGAAAAGGGCTCTTATTAAATCCTTCCTATTGAAATTTACTCCG
TCTCTGGTCATTCTCCAAGAAACGAAGCTTGAGTATGTCGACCGCAATATTATTAAGTCTCTCTGGAGTGGCAGGAACATCAGTTGGGCTTTCTTGGAAGCTTCCGGTTC
TTCGGGTGGTATCATCATTATGTGGAATGATCCTTCGATTATCACCACAGACATTACGAAAGGGATTTCAACGTGTCTCGATGGTCTTATGAGAAATCGAATGGGAAAGT
CACTAAAAAGTTCTATTGTCCCAAGGTATTCTGATAAGTTCATAAATCTGGAGGCTAGGAAACTTGACCGACCAACATCTGACCATTACCCATTGATGCTTACCATGGGC
AGTGGCAAATGGGGGCCATCCCCTTTCCGTTTTGAGAATATGTGGTTGAAACACCGCTCTTTTTTGCCCCTGATTGATTACTGGTGGAAGAATACTCCTATGAGAGGCGG
GCCGGGCCATGGATTTATCATGAAATTGAAAGATTTGAAAGTGAGACTCCGCTCTTGGAATACGGATGTTTTTGGGTGTAACACTACCAAAAGAAACCAATTGCTGACAG
AGATATCGATCCTTGATAGCATAGAAGAGACAGGACATATTTCCCCAATCCAGCTCTCGCAGCGCAGATCTTTGAAAGTCGAAATCCTCAACCTCGCTGCCTTGAGGAAC
AAAGATGGAAGCAAAAATAAAGATGTATTGGCAGCCTTCCCAGATGAAATTGATTGGAATCCCTTACAAAACCACCTTGCTGTGGATATGGAAGGGACATTCACAGAGAC
AGAAGTTTGGAATGCTATTAAGCTTCTAGGCAGTGACAAATCACCAGGCTCGGATGGTTTTACATCAGAATTCTTTAAAAAATGTTGGAACATCCTCAAAGACGATATTA
TGAGAATGTTCCAAGATTTTTTAAGAATGGCTTACGTGGAAGGTAGACAAATCCTAGATGCATCCCTTATCGCCAATGAAATTATTGATGAGTGGAATCGCAAGCATATC
AAAGGTTTAGTCATTAAACTTGACATTGAAAAAGCCTTTGATAAGGTTGATTGGAATTACCTTGATCATATTCTCTTAGCCAAAGGATTCGGTAACAAGTGGAGATCCTG
GATTAGAGGTTGTTTATCTTCAGCAAATTATTCGATTCTCATTAATGGCGCCCTAGAGGTAAAATTCATGCTTCTCGAGGATTAA
Protein sequenceShow/hide protein sequence
MEDHEDQEGNCTYQASPEKALQILVPWLEKYRMCIMPIPSISRKIKTTQKKAKLVRELAGLSSSVNYNKSSTKHSRGKKTSHDYYLRNVRGIGDWQKRALIKSFLLKFTP
SLVILQETKLEYVDRNIIKSLWSGRNISWAFLEASGSSGGIIIMWNDPSIITTDITKGISTCLDGLMRNRMGKSLKSSIVPRYSDKFINLEARKLDRPTSDHYPLMLTMG
SGKWGPSPFRFENMWLKHRSFLPLIDYWWKNTPMRGGPGHGFIMKLKDLKVRLRSWNTDVFGCNTTKRNQLLTEISILDSIEETGHISPIQLSQRRSLKVEILNLAALRN
KDGSKNKDVLAAFPDEIDWNPLQNHLAVDMEGTFTETEVWNAIKLLGSDKSPGSDGFTSEFFKKCWNILKDDIMRMFQDFLRMAYVEGRQILDASLIANEIIDEWNRKHI
KGLVIKLDIEKAFDKVDWNYLDHILLAKGFGNKWRSWIRGCLSSANYSILINGALEVKFMLLED