; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011096 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011096
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr1:14356716..14358471
RNA-Seq ExpressionLag0011096
SyntenyLag0011096
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65957.1 hypothetical protein VITISV_035610 [Vitis vinifera]9.6e-9035.92Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQEL---------------------------------------------NDL
        +W+   +   + V+G+FS+SV  +  G  + W+++VYGP +   R  FW EL                                             N+L
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQEL---------------------------------------------NDL

Query:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF
          PP+ +  F+WS M E P   R+DRFL S +W + F +   +                          R ENMWL  P+FK    +WW+E    GW G 
Subjt:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF

Query:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSE---LLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLD
        KFM KL+ LK  ++ WNK  FG++I++K+  L  I   D  E++G LS +L  +R  +++E   +LD+S       ++EEI+ ++ KLY       + ++
Subjt:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSE---LLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLD

Query:  GLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKNASRVKDFRPISL
        GL WSPI   + + LE  F E+EI +AI  +  +K+P PDG                            IIN+ TN ++I L+PKK  A ++ ++RPISL
Subjt:  GLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKNASRVKDFRPISL

Query:  VTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQS
        +T LYKIIAKVL  RL+ +L  TI ++Q AFV GRQILDA+L+A+E V+++K S  +G + K+D EKAY+ VSW FLD V++ KGF  +W  WI+GCL S
Subjt:  VTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQS

Query:  TNFSIMMNEKPRGKIIASRGIRQGDSLSP
         +F+I++N   +G + ASRG+RQGD LSP
Subjt:  TNFSIMMNEKPRGKIIASRGIRQGDSLSP

CAN79190.1 hypothetical protein VITISV_000232 [Vitis vinifera]2.4e-8834.67Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQEL---------------------------------------------NDL
        MW+   +   + V+G+FS+SV  +  G  + W+++VYGP +   R  FW+EL                                             N+L
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQEL---------------------------------------------NDL

Query:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF
          PP+ +  F+WS M E P   R+DRFL S +W + F +   D                          R ENMWL  P+FK +   WW+E    GW G 
Subjt:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF

Query:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLD------------------------------------
        KFM KL+ LK  ++ WNK  FG++I++K+  L  I   D  E++G LS +L  +R   K EL +                                    
Subjt:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLD------------------------------------

Query:  --SSLLALEPE----------LEEEIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI-------------
            +  LE E          ++EEI+ ++ KLY       + ++GL WSPI   + + LE  FTE+EI +AI  +  +K+P PDG              
Subjt:  --SSLLALEPE----------LEEEIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI-------------

Query:  --------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISK
                      IIN+ TN ++I L+PKK  A ++ D+RPISL+T LYKIIAKVLA RL+ VL  TI ++Q AFV GRQILDA+L+A+E V+++K S 
Subjt:  --------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISK

Query:  KKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP
        ++G + K+D EKAY+ VSW FLD V++ KGF  RW  WI+GCL S +F+I++N   +G + ASRG+RQGD LSP
Subjt:  KKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP

RVW53010.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.4e-8836.12Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L
        +W+   M   + V+G+FS+S+  +  G    W+++VYGP N   R  FW EL+D                                             L
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L

Query:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKE----------------------YKLD----RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF
           P+ +  ++WS M E P   R+DRFL S +W + F +                      +K      R ENMWL+  +FK N  +WW E    GW G 
Subjt:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKE----------------------YKLD----RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF

Query:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLS
        KFM KL+++K  ++ WNK  FG +  KK+  L  +   D  E++G LS +L  +R   K E        LE E +EEI+ ++ KLY    +  + ++GL 
Subjt:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLS

Query:  WSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTP
        WSPI+  + + LE  FTE+EI++AI  +  +K+P PDG                            IIN+ TN ++I L+PKK  + R+ DFRPISL+T 
Subjt:  WSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTP

Query:  LYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNF
        LYKIIAKVLA RL+ VL  TI ++Q AFV GRQILDA+L+A+E V++++ + ++G + K+D EKAY+ +SW FLD VL++KGF  RW  W++GCL S ++
Subjt:  LYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNF

Query:  SIMMNEKPRGKIIASRGIRQGDSLSP
        ++++N   +G + ASRG+RQGD LSP
Subjt:  SIMMNEKPRGKIIASRGIRQGDSLSP

RVW64166.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]3.1e-8836.6Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND------------LTLPPMENGRFSWSRMGERPAASRIDRFLLSKQW
        +W+  ++   + V+G+FS+SV  S  G    WI++VYGP +   R  FW EL D            L  PP+ N  F+WS M E P   R+DRFL S +W
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND------------LTLPPMENGRFSWSRMGERPAASRIDRFLLSKQW

Query:  VETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGFKFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLN
           F +   +                          R ENMWL+  NFK N   WW      GW G KFM +L+ +K  ++ WNK  FG + +KK++ LN
Subjt:  VETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGFKFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLN

Query:  RIEEVDVFEEQGNLSFQLKEERLKIKSELLDSSL------------------------------------------------LALEPELEEEIVGFYSKL
         +   D  E++G L+  L  +R+  K EL +  L                                                L     + EEI+ ++ KL
Subjt:  RIEEVDVFEEQGNLSFQLKEERLKIKSELLDSSL------------------------------------------------LALEPELEEEIVGFYSKL

Query:  YTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKN
        YT+     + ++GL WSPI E +   LE  FTE+EI +AI  L  +K+P PDG                            IIN+ TN ++I LIPKK  
Subjt:  YTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKN

Query:  ASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRE
        + R+ DFRPISL+T LYKIIAKVL+ RL+ VL  TI  +Q AFV GRQILDA+L+A+E V +R+   ++G + K+D EKAY+ V W FLD VL+ KGF  
Subjt:  ASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRE

Query:  RWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP
        RW  W+ GCL S +F+I++N   +G + ASRG+RQGD LSP
Subjt:  RWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP

RVX07754.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]5.1e-9134.91Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L
        +W+       + V+G+FS++V  +   +   W+TSVYGP     R  FW EL D                                             L
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L

Query:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF
          PP+ N  F+WS M   P   R+DRFL S +W   F +   +                          R ENMWL  P FK     WW+E T++GW G 
Subjt:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF

Query:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLD-----------------SSLLALEPE-------LEE
        KFM KLK +K  ++ WN  VFG++ ++K+  L  +  +D  E++GNL+ +L  ER+  + EL D                  SL++   E       + E
Subjt:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLD-----------------SSLLALEPE-------LEE

Query:  EIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTY
        EIV F+  LY+  +   + ++G+ W+PI E +  WL+R F+E+E+  A+  L   K+P PDG                            +IN+ TN T+
Subjt:  EIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTY

Query:  ICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDE
        I ++PKK    ++ D+RPISLVT LYKIIAKVL+ RL++VL  TI  SQ AFV GRQILDA+L+A+E V++++ S ++G + K+D EKAY+ V W FLD 
Subjt:  ICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDE

Query:  VLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP
        VL+ KGF ++W  W++GCL S++F+I++N   +G + ASRG+RQGD LSP
Subjt:  VLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP

TrEMBL top hitse value%identityAlignment
A0A438EZ36 LINE-1 retrotransposable element ORF2 protein1.1e-8836.12Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L
        +W+   M   + V+G+FS+S+  +  G    W+++VYGP N   R  FW EL+D                                             L
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L

Query:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKE----------------------YKLD----RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF
           P+ +  ++WS M E P   R+DRFL S +W + F +                      +K      R ENMWL+  +FK N  +WW E    GW G 
Subjt:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKE----------------------YKLD----RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF

Query:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLS
        KFM KL+++K  ++ WNK  FG +  KK+  L  +   D  E++G LS +L  +R   K E        LE E +EEI+ ++ KLY    +  + ++GL 
Subjt:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLS

Query:  WSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTP
        WSPI+  + + LE  FTE+EI++AI  +  +K+P PDG                            IIN+ TN ++I L+PKK  + R+ DFRPISL+T 
Subjt:  WSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTP

Query:  LYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNF
        LYKIIAKVLA RL+ VL  TI ++Q AFV GRQILDA+L+A+E V++++ + ++G + K+D EKAY+ +SW FLD VL++KGF  RW  W++GCL S ++
Subjt:  LYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNF

Query:  SIMMNEKPRGKIIASRGIRQGDSLSP
        ++++N   +G + ASRG+RQGD LSP
Subjt:  SIMMNEKPRGKIIASRGIRQGDSLSP

A0A803QEA6 Uncharacterized protein1.8e-8935.37Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELNDLTL-------------------------------------------
        +W+   + V DS+VG FSISVL + +G+   W + VYGPC+ + R  FW EL  L+                                            
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELNDLTL-------------------------------------------

Query:  --PPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF
          P +ENG F+WS     P  SR+DRFL +  W   F   + +                          R +N WL+  +F    E+WWKE+   GW G 
Subjt:  --PPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF

Query:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSE---------------------------------LLDS--
        KFM+KLK+L+G ++ W+K  FG    KK A   R+  +D  E     +  L +ER K+K E                                 LL++  
Subjt:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSE---------------------------------LLDS--

Query:  -------------SLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGII------------
                      ++  E E+ EE++ F+SKLYT + ++   ++G+ W  I ES+   LE  F E+E+   + +   NK+P PDG              
Subjt:  -------------SLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGII------------

Query:  ---------------INKRTNGTYICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISK
                       I    N T+ICLIPK+ N+ +VKDFRPISL+T +YKIIAK LA RL+ VL  TIS +Q+AFV GRQILD++L+A+E+VED +   
Subjt:  ---------------INKRTNGTYICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISK

Query:  KKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP
        +KGF+LK+D EKAY++V W FLD VL+ KGF ERW  WI+GC+ ST+FSI +N + RGK   SRG+RQGD LSP
Subjt:  KKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP

A5BSZ3 Reverse transcriptase domain-containing protein4.7e-9035.92Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQEL---------------------------------------------NDL
        +W+   +   + V+G+FS+SV  +  G  + W+++VYGP +   R  FW EL                                             N+L
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQEL---------------------------------------------NDL

Query:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF
          PP+ +  F+WS M E P   R+DRFL S +W + F +   +                          R ENMWL  P+FK    +WW+E    GW G 
Subjt:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLD--------------------------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF

Query:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSE---LLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLD
        KFM KL+ LK  ++ WNK  FG++I++K+  L  I   D  E++G LS +L  +R  +++E   +LD+S       ++EEI+ ++ KLY       + ++
Subjt:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSE---LLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLD

Query:  GLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKNASRVKDFRPISL
        GL WSPI   + + LE  F E+EI +AI  +  +K+P PDG                            IIN+ TN ++I L+PKK  A ++ ++RPISL
Subjt:  GLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI---------------------------IINKRTNGTYICLIPKKKNASRVKDFRPISL

Query:  VTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQS
        +T LYKIIAKVL  RL+ +L  TI ++Q AFV GRQILDA+L+A+E V+++K S  +G + K+D EKAY+ VSW FLD V++ KGF  +W  WI+GCL S
Subjt:  VTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQS

Query:  TNFSIMMNEKPRGKIIASRGIRQGDSLSP
         +F+I++N   +G + ASRG+RQGD LSP
Subjt:  TNFSIMMNEKPRGKIIASRGIRQGDSLSP

M5VS59 Reverse transcriptase domain-containing protein (Fragment)5.0e-9236.41Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L
        +W    + V DS+VG FS+S+          W++ +YGPC  RERN FW+EL D                                             L
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L

Query:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYK---------------LD-----------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF
          P + N  F+WS + E     R+DRFL+S  W + F  Y+               LD           R ENMWL  P+F   ++ WW E  + GW G+
Subjt:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYK---------------LD-----------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF

Query:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEER----LKI--------------------------------------
        KFM +LK+LK  ++ W+KE FG++    +    R+  +D  E    L   L+ ER    LKI                                      
Subjt:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEER----LKI--------------------------------------

Query:  ------KSELLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI-------------
              K E+ D  ++ ++  +E E++ F+  LY+ +K V + ++GL+W PI +    WLER F  +E+ +A+   G +KSP PDG              
Subjt:  ------KSELLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI-------------

Query:  --------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISK
                      I+N  TN T+ICLIPKK N+ +V D RPISLVT LYK+I+KVLA RL+EVL  TIS SQ AFV  RQILDA+LVA+E VE+ +  K
Subjt:  --------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISK

Query:  KKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP
        +KG + K+D EKAY+ V W F+D+VL  KGF  +W GWI GCL+S NFSIM+N KPRGK  ASRG+RQGD LSP
Subjt:  KKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP

M5WPQ5 Reverse transcriptase domain-containing protein2.6e-9336.59Show/hide
Query:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L
        +W    + V DS+VG FS+S+          W++ +YGPC  RERN FW+EL D                                             L
Subjt:  MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELND---------------------------------------------L

Query:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYK---------------LD-----------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF
          P + N  F+WS + E     R+DRFL+S  W E F  Y+               LD           R ENMWL  P+FK  ++ WW E  + GW G+
Subjt:  TLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYK---------------LD-----------RIENMWLEDPNFKGNVEKWWKEQTLKGWAGF

Query:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEER----LKI--------------------------------------
        KFM +LK+LK  ++ W+KE FG++    +    R+  +D  E    L   L+ ER    LKI                                      
Subjt:  KFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEER----LKI--------------------------------------

Query:  ------KSELLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI-------------
              K E+ D  ++ ++  +E E++ F+  LY+ +K V + ++GL+W PI +    WLER F  +E+ +A+ + G +KSP PDG              
Subjt:  ------KSELLDSSLLALEPELEEEIVGFYSKLYTDDKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI-------------

Query:  --------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISK
                      I+N  TN T+ICLIPKK N+ +V D+RPISLVT LYK+I+KVLA RL+EVL  TIS SQ AFV  RQILDA+LVA+E VE+ +  K
Subjt:  --------------IINKRTNGTYICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISK

Query:  KKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP
        +KG + K+D EKAY+ V W F+D+V+  KGF  +W GWI GCL+S NFSIM+N KPRGK  ASRG+RQGD LSP
Subjt:  KKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.4e-1126.15Show/hide
Query:  KEERLKIKSELLDSSLLALEP-ELEEEIVGFYSKLYTDD----KKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGII------
        K E+ +I +   D   +  +P E++  I  +Y  LY +     +++   LD  +   + +     L R  T  EI   I +L   KSP PDG        
Subjt:  KEERLKIKSELLDSSLLALEP-ELEEEIVGFYSKLYTDD----KKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGII------

Query:  ---------------INKR---TNGTY---ICLIPKK-KNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESV
                       I K     N  Y   I LIPK  ++ ++ ++FRPISL+    KI+ K+LA R+++ +   I + Q  F+ G Q    I  +   +
Subjt:  ---------------INKR---TNGTY---ICLIPKK-KNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESV

Query:  EDRKISKKKG-FILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSPL
        +    +K K   I+ +D EKA++K+   F+ + L   G    +   I+        +I++N +         G RQG  LSPL
Subjt:  EDRKISKKKG-FILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSPL

P08548 LINE-1 reverse transcriptase homolog4.4e-1325.79Show/hide
Query:  NKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLDSSLLALEP-ELEEEIVGFYSKLYTDD----KKVRFTLDGLSWSPIEESNKAW
        NK +   I   K     +I ++D  +   NL+ + K  +  I S    +  +  +P E+++ +  +Y KLY+      K++   L+      + +     
Subjt:  NKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLDSSLLALEP-ELEEEIVGFYSKLYTDD----KKVRFTLDGLSWSPIEESNKAW

Query:  LERRFTEDEIHRAIKNLGNNKSPRPDG---------------IIINKRTN--------GTY----ICLIPKK-KNASRVKDFRPISLVTPLYKIIAKVLA
        L R  +  EI   I+NL   KSP PDG               I++N   N         T+    I LIPK  K+ +R +++RPISL+    KI+ K+L 
Subjt:  LERRFTEDEIHRAIKNLGNNKSPRPDG---------------IIINKRTN--------GTY----ICLIPKK-KNASRVKDFRPISLVTPLYKIIAKVLA

Query:  ERLKEVLLATISNSQAAFVHGRQILDAILVASESVED-RKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPR
         R+++ +   I + Q  F+ G Q    I  +   ++   K+  K   IL +D EKA++ +   F+   LK  G    +   I+        +I++N    
Subjt:  ERLKEVLLATISNSQAAFVHGRQILDAILVASESVED-RKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPR

Query:  GKIIASRGIRQGDSLSPL
               G RQG  LSPL
Subjt:  GKIIASRGIRQGDSLSPL

P11369 LINE-1 retrotransposable element ORF2 protein1.1e-1429.14Show/hide
Query:  KIKSELLDSSLLALEP-ELEEEIVGFYSKLYTD-----DKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI-----------
        KI++E  D   +  +P E++  I  FY +LY+      D+  +F LD      + +     L    +  EI   I +L   KSP PDG            
Subjt:  KIKSELLDSSLLALEP-ELEEEIVGFYSKLYTD-----DKKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGI-----------

Query:  ---IINK-----RTNGTY--------ICLIPK-KKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVE-DR
           I++K        GT         I LIPK +K+ +++++FRPISL+    KI+ K+LA R++E + A I   Q  F+ G Q    I  +   +    
Subjt:  ---IINK-----RTNGTY--------ICLIPK-KKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVE-DR

Query:  KISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP
        K+  K   I+ LD EKA++K+   F+ +VL+  G +  +   IK        +I +N +    I    G RQG  LSP
Subjt:  KISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSP

P14381 Transposon TX1 uncharacterized 149 kDa protein1.1e-1628.08Show/hide
Query:  EPE-LEEEIVGFYSKLYTDD----KKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGIIIN---------------------KR
        +PE + +    FY  L++ D           DGL    + E  K  LE   T DE+ +A++ + +NKSP  DG+ I                      K+
Subjt:  EPE-LEEEIVGFYSKLYTDD----KKVRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGIIIN---------------------KR

Query:  TNGTYIC------LIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKA
              C      L+PKK +   +K++RP+SL++  YKI+AK ++ RLK VL   I   Q+  V GR I D + +  + +   + +      L LD EKA
Subjt:  TNGTYIC------LIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKA

Query:  YEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLS
        +++V   +L   L+   F  ++ G++K    S    + +N      +   RG+RQG  LS
Subjt:  YEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLS

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)7.3e-0826.85Show/hide
Query:  LIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVL
        LIPK  +     ++RPI++ + L +++ ++LA+RL+  +    +    A + G  +   +L     +  R+  +K   ++ LD+ KA++ VS   +   L
Subjt:  LIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAFVHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVL

Query:  KLKGFRERWGGWIKGCLQSTNFSIMMNE-KPRGKIIASRGIRQGDSLSP
        +  G  E    +I G L  +  +I +       KI   RG++QGD LSP
Subjt:  KLKGFRERWGGWIKGCLQSTNFSIMMNE-KPRGKIIASRGIRQGDSLSP

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.8e-0940.74Show/hide
Query:  LAERLKEVLLATISNSQAAFVHGRQILDAILVASESVED--RKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERW
        + ERLK ++   I  +QA+F+ GR   D I+   E+V    RK   K   +LKLDLEKAY+++ W +L++ L   GF E W
Subjt:  LAERLKEVLLATISNSQAAFVHGRQILDAILVASESVED--RKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGAAAAGGATGTGATGGAGGTCAGGGACTCGGTCGTAGGCGCCTTTTCCATATCTGTCCTCTGTAGTTTTCAGGGGCAGGCTAAGGGATGGATCACAAGTGTGTA
TGGTCCTTGTAATTTGCGTGAGAGGAATTACTTTTGGCAGGAGCTTAATGATTTGACCCTCCCTCCTATGGAAAATGGTAGATTCTCCTGGTCAAGAATGGGGGAGAGAC
CTGCGGCGTCCCGAATAGACAGGTTTCTCCTCTCAAAGCAATGGGTGGAAACCTTCAAGGAATATAAGCTGGATAGAATCGAGAATATGTGGCTAGAGGACCCCAACTTT
AAAGGCAATGTAGAGAAATGGTGGAAGGAGCAAACTCTGAAAGGTTGGGCTGGTTTTAAATTCATGGAGAAATTAAAGGTGTTAAAAGGATGCATTCGTAGATGGAATAA
GGAGGTTTTCGGCAACATTATAGATAAGAAGCAAGCTTGTCTAAATCGAATTGAAGAAGTTGATGTGTTCGAGGAGCAAGGCAATCTCAGCTTCCAGCTAAAAGAGGAAA
GACTGAAAATCAAATCAGAGTTGCTAGATAGTAGCCTTTTAGCCTTGGAGCCTGAACTTGAAGAGGAAATTGTTGGCTTCTACAGCAAATTATACACTGATGATAAGAAA
GTTAGATTCACCCTGGATGGTCTCTCGTGGAGTCCTATTGAAGAGAGCAACAAAGCTTGGTTGGAAAGAAGATTTACCGAGGATGAAATTCACAGAGCGATAAAAAATCT
GGGCAACAACAAATCTCCGAGGCCGGATGGGATCATCATCAACAAGCGTACCAATGGAACCTATATATGCCTCATCCCCAAAAAGAAGAATGCCTCGAGGGTAAAAGATT
TTAGACCTATAAGCTTGGTGACACCTCTTTATAAAATAATAGCAAAAGTCCTTGCAGAGAGATTGAAAGAAGTCCTCCTAGCAACTATTAGCAACAGTCAAGCAGCGTTT
GTCCATGGGAGACAAATCCTTGATGCCATTCTTGTTGCCTCTGAATCGGTTGAAGACCGAAAAATATCGAAGAAGAAAGGTTTTATTCTTAAATTGGACCTTGAAAAAGC
TTATGAAAAGGTCAGTTGGGTCTTCCTTGATGAAGTCTTGAAGCTGAAAGGGTTTAGGGAAAGGTGGGGAGGGTGGATTAAAGGTTGTCTTCAAAGCACAAACTTCTCCA
TCATGATGAACGAAAAACCGAGAGGGAAAATCATAGCATCTAGAGGAATTAGACAAGGGGATTCACTCTCCCCCCTTCCTCTTTACGATTGTGGGCGATGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGGAAAAGGATGTGATGGAGGTCAGGGACTCGGTCGTAGGCGCCTTTTCCATATCTGTCCTCTGTAGTTTTCAGGGGCAGGCTAAGGGATGGATCACAAGTGTGTA
TGGTCCTTGTAATTTGCGTGAGAGGAATTACTTTTGGCAGGAGCTTAATGATTTGACCCTCCCTCCTATGGAAAATGGTAGATTCTCCTGGTCAAGAATGGGGGAGAGAC
CTGCGGCGTCCCGAATAGACAGGTTTCTCCTCTCAAAGCAATGGGTGGAAACCTTCAAGGAATATAAGCTGGATAGAATCGAGAATATGTGGCTAGAGGACCCCAACTTT
AAAGGCAATGTAGAGAAATGGTGGAAGGAGCAAACTCTGAAAGGTTGGGCTGGTTTTAAATTCATGGAGAAATTAAAGGTGTTAAAAGGATGCATTCGTAGATGGAATAA
GGAGGTTTTCGGCAACATTATAGATAAGAAGCAAGCTTGTCTAAATCGAATTGAAGAAGTTGATGTGTTCGAGGAGCAAGGCAATCTCAGCTTCCAGCTAAAAGAGGAAA
GACTGAAAATCAAATCAGAGTTGCTAGATAGTAGCCTTTTAGCCTTGGAGCCTGAACTTGAAGAGGAAATTGTTGGCTTCTACAGCAAATTATACACTGATGATAAGAAA
GTTAGATTCACCCTGGATGGTCTCTCGTGGAGTCCTATTGAAGAGAGCAACAAAGCTTGGTTGGAAAGAAGATTTACCGAGGATGAAATTCACAGAGCGATAAAAAATCT
GGGCAACAACAAATCTCCGAGGCCGGATGGGATCATCATCAACAAGCGTACCAATGGAACCTATATATGCCTCATCCCCAAAAAGAAGAATGCCTCGAGGGTAAAAGATT
TTAGACCTATAAGCTTGGTGACACCTCTTTATAAAATAATAGCAAAAGTCCTTGCAGAGAGATTGAAAGAAGTCCTCCTAGCAACTATTAGCAACAGTCAAGCAGCGTTT
GTCCATGGGAGACAAATCCTTGATGCCATTCTTGTTGCCTCTGAATCGGTTGAAGACCGAAAAATATCGAAGAAGAAAGGTTTTATTCTTAAATTGGACCTTGAAAAAGC
TTATGAAAAGGTCAGTTGGGTCTTCCTTGATGAAGTCTTGAAGCTGAAAGGGTTTAGGGAAAGGTGGGGAGGGTGGATTAAAGGTTGTCTTCAAAGCACAAACTTCTCCA
TCATGATGAACGAAAAACCGAGAGGGAAAATCATAGCATCTAGAGGAATTAGACAAGGGGATTCACTCTCCCCCCTTCCTCTTTACGATTGTGGGCGATGCTCTTAG
Protein sequenceShow/hide protein sequence
MWEKDVMEVRDSVVGAFSISVLCSFQGQAKGWITSVYGPCNLRERNYFWQELNDLTLPPMENGRFSWSRMGERPAASRIDRFLLSKQWVETFKEYKLDRIENMWLEDPNF
KGNVEKWWKEQTLKGWAGFKFMEKLKVLKGCIRRWNKEVFGNIIDKKQACLNRIEEVDVFEEQGNLSFQLKEERLKIKSELLDSSLLALEPELEEEIVGFYSKLYTDDKK
VRFTLDGLSWSPIEESNKAWLERRFTEDEIHRAIKNLGNNKSPRPDGIIINKRTNGTYICLIPKKKNASRVKDFRPISLVTPLYKIIAKVLAERLKEVLLATISNSQAAF
VHGRQILDAILVASESVEDRKISKKKGFILKLDLEKAYEKVSWVFLDEVLKLKGFRERWGGWIKGCLQSTNFSIMMNEKPRGKIIASRGIRQGDSLSPLPLYDCGRCS