; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024214 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024214
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr10:1288170..1292477
RNA-Seq ExpressionLag0024214
SyntenyLag0024214
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN75040.1 hypothetical protein VITISV_026478 [Vitis vinifera]1.8e-11429.82Show/hide
Query:  IRRESLSKPTESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV-----------------
        I+ E      E++    DRR + S+W  +   W +  A  ++GG +++W  + +     V G+F V +K         W+T V                 
Subjt:  IRRESLSKPTESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV-----------------

Query:  -------------------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWSRLDP-------------DFKKRVESWWSALNLSGWAGYRF
                                 L  TR++   + F++FI ++ LIDPP+ N  FTWS +                FK++   WW      GW G++F
Subjt:  -------------------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWSRLDP-------------DFKKRVESWWSALNLSGWAGYRF

Query:  MEKMKGLKIKIKEWNKESY--------------------------------------------------------------EGDENTASFHKWASTMRNR
        M K+K +K K+KEWN  ++                                                              EGD N+  FH+ A+  R+R
Subjt:  MEKMKGLKIKIKEWNKESY--------------------------------------------------------------EGDENTASFHKWASTMRNR

Query:  AHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD-----------------
          I  L ++ G+ L + EDI +E++ F+  LY + +   +  EGIDW  I  +    L+  F+EEE+ +AV  +   K+PG D                 
Subjt:  AHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD-----------------

Query:  ---------------------------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKK
                                                           AKVL+ RL+KVLH TISD Q AFV+GR ILDA+L+A+E V++ R   ++
Subjt:  ---------------------------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKK

Query:  GVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQV
        G++ K+D EKAYD V W FLD VL+ KGF + WR WI GCL++S+F++++NG         RGLRQGDPLSP +FT+V DV+++ +    E  +  G+ V
Subjt:  GVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQV

Query:  GKDKVAVSLLQYADDTLVFCPNNAEDIGNW----------------------------WDMLNIILAALSYKLSR-------ISLGGKYRRKALWEPLIE
        G+D+  VSLLQ+ADDT+ F   + E + N                              ++L+ + +    ++S        + LGG  +    W+P++E
Subjt:  GKDKVAVSLLQYADDTLVFCPNNAEDIGNW----------------------------WDMLNIILAALSYKLSR-------ISLGGKYRRKALWEPLIE

Query:  RLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWL
        R+  +LD W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +E + R+F+W+G       +LV+WE  + P +LGGLG G +  RN AL+ KWL
Subjt:  RLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWL

Query:  WRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK
        WRF +E+  LW  +I  IYG  P GW +  + +     PW AIA+
Subjt:  WRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK

RVW20303.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]5.2e-11431.62Show/hide
Query:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGVL--NATR--------------------
        E++ A  DRR + S+W++R+  W    A  ++GGILVMW    +  +  VLG+F V +K    G  + W++ +   N+T                     
Subjt:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGVL--NATR--------------------

Query:  --------ISRCS------------KTFNKFIDDTDLIDPPMVNGQFTWS---------RLD--------------------------------------
                I RCS            K  + FI + +LIDPP+ +  FTWS         RLD                                      
Subjt:  --------ISRCS------------KTFNKFIDDTDLIDPPMVNGQFTWS---------RLD--------------------------------------

Query:  ---------------PDFKKRVESWWSALNLSGWAGYRFMEKMKGLKIKIKEWNKESY------------------------------------------
                       P FK+   SWW      GW G++FM K++ LK K+KEWNK ++                                          
Subjt:  ---------------PDFKKRVESWWSALNLSGWAGYRFMEKMKGLKIKIKEWNKESY------------------------------------------

Query:  --------------------EGDENTASFHKWASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMS
                            EGD N+  FHK A+  RNR  I +LEN+ G +L + + I++E+L ++ KLY       + +EG+DW+ I  +    LE  
Subjt:  --------------------EGDENTASFHKWASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMS

Query:  FSEEEIFKAVQGMGNLKSPGSDAKVLA----------------------ERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLD
        F+EEEI KA+  M   K+PG D   +A                       RL+ VLH TI   Q AFVQGRQILDA+L+A+E V++ +   ++GV+ K+D
Subjt:  FSEEEIFKAVQGMGNLKSPGSDAKVLA----------------------ERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLD

Query:  LEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAV
         EKAYD VSW+FLD V+E KGF   WRKWI GCL++ +F++++NG         RGLRQGDPLSP +FTIV DV+++ +    E+ +  G++VG+++  V
Subjt:  LEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAV

Query:  SLLQYADDTLVFCPNNAED----------------------------IGNWWDMLNIILAALSYKLS-------RISLGGKYRRKALWEPLIERLRVKLD
        S LQ+ADDT+ F     ED                            I    D L+ +   L  K S        + LGG  +    W+P+IER+  +LD
Subjt:  SLLQYADDTLVFCPNNAED----------------------------IGNWWDMLNIILAALSYKLS-------RISLGGKYRRKALWEPLIERLRVKLD

Query:  NWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEK
         W+K  +S GGR+TL ++ L  +P Y  SL + P SV   +E + RDF+W+G       +LV WE        GGLG+G +  RN AL+ KWLWR+ +E 
Subjt:  NWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEK

Query:  DSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK
         +LW  +I  IYG    GW +  + +     PW AIA+
Subjt:  DSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK

RVW99790.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]3.7e-11230.35Show/hide
Query:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV---------------------------
        E++    DRR++ S+WS R+  W +  A  ++GGIL++W    +  +  VLG+F V IK    G    W++ V                           
Subjt:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV---------------------------

Query:  ---------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWS---------RLD--------------------------------------
                       L  +R+S C K F++FI D +LID P+ +  +TWS         RLD                                      
Subjt:  ---------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWS---------RLD--------------------------------------

Query:  ---------------PDFKKRVESWWSALNLSGWAGYRFMEKMKGLKIKIKEWNKESY------------------------------------------
                         FK+    WWS    +GW G++FM K++ +K K+KEWNK S+                                          
Subjt:  ---------------PDFKKRVESWWSALNLSGWAGYRFMEKMKGLKIKIKEWNKESY------------------------------------------

Query:  --------------------EGDENTASFHKWASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMS
                            EGD N+  FHK A+  RNR  I  LEN++G +L + E I++E+L ++ KLY       + +EG+DW+ ID +    LE  
Subjt:  --------------------EGDENTASFHKWASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMS

Query:  FSEEEIFKAVQGMGNLKSPGSD--------------------------------------------------------------------AKVLAERLKK
        F+EEEI+KA+  M   K+PG D                                                                    AKVLA RL+ 
Subjt:  FSEEEIFKAVQGMGNLKSPGSD--------------------------------------------------------------------AKVLAERLKK

Query:  VLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------P
        VLH TI   Q AFVQGRQILDA+L+A+E V++ R   ++GV+ K+D EKAYD VSW+FLD VLE+KGF   WRKW+ GCL++ +++V++NG         
Subjt:  VLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------P

Query:  RGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNII--LAALSYKLSR------------
        RGLRQGDPLSP +FTIV DV+++ +    E+ +L G++VG+++  VS LQ+ADDT+ F     ED+     +L +   ++ L   L +            
Subjt:  RGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNII--LAALSYKLSR------------

Query:  ---------------------ISLGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGT
                             + LGG  +    W+P+IER+  +LD W+K  +S GGR+TL Q+ L  +P Y  SL K P SV   +E + R+F+W+G  
Subjt:  ---------------------ISLGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGT

Query:  YKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIA
             +LV W+    P   GGLG G +  RN AL+ KWLWR+ +E  +LW  +I  IYG    GW      +     PW AIA
Subjt:  YKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIA

RVX14115.1 Protein SWEETIE [Vitis vinifera]2.0e-11832.12Show/hide
Query:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV---------------------------
        E++    DRR + S+W++R+  W +     ++GGIL++W    +  +  VLG+F V IK    G    W++ V                           
Subjt:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV---------------------------

Query:  ---------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWS--RLD-----------------------------PDFKKRVESWWSALNL
                       L  +R++   K F+ FI D +LID P+ +  FTWS  ++D                             P FK+    WW     
Subjt:  ---------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWS--RLD-----------------------------PDFKKRVESWWSALNL

Query:  SGWAGYRFMEKMKGLKIKIKEWNKESY--------------------------------------------------------------EGDENTASFHK
        +GW G++FM K++ +K K+K WNK S+                                                              EGD N+  FHK
Subjt:  SGWAGYRFMEKMKGLKIKIKEWNKESY--------------------------------------------------------------EGDENTASFHK

Query:  WASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD---------
         A+  RNR  I  LEN+NG ++ + E I++E+L ++ KLY       + +EG+DW+ I  +    LE  F+EEEIFKA+  M   K+PG D         
Subjt:  WASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD---------

Query:  --------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEF
                                        AKVLA R++ VLH TI   Q AFVQGRQILDA+L+A+E V++ R  +++GV+ K+D EKAYD VSW+F
Subjt:  --------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEF

Query:  LDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVF
        LD VLE+KGFG  WRKW+ GCL++ +F+V++NG         RGLRQGDPLSP +FTIV DV+++ +    EK +L G++VG+++  VS LQ+ADDT+ F
Subjt:  LDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVF

Query:  CPNNAEDIGNWWDMLNII--LAALSYKLSR---------------------------------ISLGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGR
          +  ED+    ++L +   ++ L   L +                                 + LGG  +    W+P+IER+  +LD W+K  +S GGR
Subjt:  CPNNAEDIGNWWDMLNII--LAALSYKLSR---------------------------------ISLGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGR

Query:  VTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIY
        +TL Q+ L  +P Y  SL K P SV   +E + RDF+W+G       +LV W+    P   GGLG G +  RN AL+ KWLWR+ +E  +LW  +I  IY
Subjt:  VTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIY

Query:  GVDPFGWKSGELTKRRGNSPWTAIA
        G    GW    + +     PW AIA
Subjt:  GVDPFGWKSGELTKRRGNSPWTAIA

RVX16773.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.5e-11331.69Show/hide
Query:  PTESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV-------------------------
        P    L     ++  S+W  +   W++  A  ++GGI+++W          VLG+F V +K         W+T V                         
Subjt:  PTESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV-------------------------

Query:  -----------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWSR---------------LDPDFKKRVESWWSALNLSGWAGYRFMEKMKG
                         +  +R++   + F++FI ++ L+DPP+ N  FTWS                L P+FK++   WW    + GW G++FM K+K 
Subjt:  -----------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWSR---------------LDPDFKKRVESWWSALNLSGWAGYRFMEKMKG

Query:  LKIKIKEWN--------------------------------------------------------------KESYEGDENTASFHKWASTMRNRAHISLL
        +K K+KEWN                                                              K   E D N+  FH+ A+  R+R  I  L
Subjt:  LKIKIKEWN--------------------------------------------------------------KESYEGDENTASFHKWASTMRNRAHISLL

Query:  ENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSDA------------------KVLA
         ++ G  L + E I +E++ F+  LY +     + +EGIDWA I  +    L+  FSEEE+  AV  +   K+PG D                   +VL+
Subjt:  ENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSDA------------------KVLA

Query:  ERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK---
         RL+KVLH TI   Q AFV+GRQILDA+L+A+E V++ R   + GV+ K+D EKAYD V W FLD VL+ KGF + WR W+ GCL++S+F++++NG    
Subjt:  ERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK---

Query:  ----PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNI--------------------
             RGLRQGDPLSP +FT+V DV+++ +    E  I   + VG+D+  VSLLQ+ADDT+ F   + + + N   +L +                    
Subjt:  ----PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNI--------------------

Query:  ----ILAALSYKLS-RIS----------LGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFI
            +L++L+  L  R+S          LGG  +    W+P++ER+  +LD W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +E + RDF+
Subjt:  ----ILAALSYKLS-RIS----------LGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFI

Query:  WNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK
        W+G       +L++WE  + P ++GGLG G    RN AL+ KWLWRF +E+  LW  +IA IYG  P GW +    +     PW AIA+
Subjt:  WNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK

TrEMBL top hitse value%identityAlignment
A0A438BYX6 Transposon TX1 uncharacterized 149 kDa protein2.3e-11229.51Show/hide
Query:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV---------------------------
        E++    DRR + S+W+ R+  W++  A  ++GGIL++W   ++  +  V+G+F V +K +  G    W++ V                           
Subjt:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV---------------------------

Query:  ---------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWS---------RLD--------------------------------------
                       +  + ++   + F+ FI + +L+DPP+ N  FTWS         RLD                                      
Subjt:  ---------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWS---------RLD--------------------------------------

Query:  ---------------PDFKKRVESWWSALNLSGWAGYRFMEKMKGLKIKIKEWNKESY--------------------------------EGDENTASFH
                        +FK+    WWS     GW G++FM +++ +K K+KEWNK S+                                EGD N+  +H
Subjt:  ---------------PDFKKRVESWWSALNLSGWAGYRFMEKMKGLKIKIKEWNKESY--------------------------------EGDENTASFH

Query:  KWASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD--------
        K A+  RNR +I  LEN+ G +L + E I +E+L ++ KLY       + +EG+DW+ I  +    LE  F+EEEI KA+  +   K+PG D        
Subjt:  KWASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD--------

Query:  ------------------------------------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAV
                                                                    AKVL+ RL+ VLH TI   Q AFVQGRQILDA+L+A+E V
Subjt:  ------------------------------------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAV

Query:  EDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLE
        ++ R   ++GV+ K+D EKAYD V W+FLD +LE KGF   WRKW++GCL++ +F++++NG         RGLRQGDPLSP +FT+V DV+++ +    E
Subjt:  EDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLE

Query:  KKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYK-----------------LSRIS------------------LGGKYRR
        + +L G++VG+++  VS LQ+ADD + F  +  E++     +L +       K                 LSR++                  LGG  + 
Subjt:  KKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYK-----------------LSRIS------------------LGGKYRR

Query:  KALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQR
           W+P++ER+  +LD W+K  +S GGR+TL Q+ L  +P Y  SL K P +V   +E + RDF+W+G       +LV+W+    P  +GGLG+G + +R
Subjt:  KALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQR

Query:  NDALITKWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK
        N AL+ KWLWR+ +E  +LW  +I  IYG    GW +  + +     PW AIA+
Subjt:  NDALITKWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK

A0A438CAL5 Transposon TX1 uncharacterized 149 kDa protein2.5e-11431.62Show/hide
Query:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGVL--NATR--------------------
        E++ A  DRR + S+W++R+  W    A  ++GGILVMW    +  +  VLG+F V +K    G  + W++ +   N+T                     
Subjt:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGVL--NATR--------------------

Query:  --------ISRCS------------KTFNKFIDDTDLIDPPMVNGQFTWS---------RLD--------------------------------------
                I RCS            K  + FI + +LIDPP+ +  FTWS         RLD                                      
Subjt:  --------ISRCS------------KTFNKFIDDTDLIDPPMVNGQFTWS---------RLD--------------------------------------

Query:  ---------------PDFKKRVESWWSALNLSGWAGYRFMEKMKGLKIKIKEWNKESY------------------------------------------
                       P FK+   SWW      GW G++FM K++ LK K+KEWNK ++                                          
Subjt:  ---------------PDFKKRVESWWSALNLSGWAGYRFMEKMKGLKIKIKEWNKESY------------------------------------------

Query:  --------------------EGDENTASFHKWASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMS
                            EGD N+  FHK A+  RNR  I +LEN+ G +L + + I++E+L ++ KLY       + +EG+DW+ I  +    LE  
Subjt:  --------------------EGDENTASFHKWASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMS

Query:  FSEEEIFKAVQGMGNLKSPGSDAKVLA----------------------ERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLD
        F+EEEI KA+  M   K+PG D   +A                       RL+ VLH TI   Q AFVQGRQILDA+L+A+E V++ +   ++GV+ K+D
Subjt:  FSEEEIFKAVQGMGNLKSPGSDAKVLA----------------------ERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLD

Query:  LEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAV
         EKAYD VSW+FLD V+E KGF   WRKWI GCL++ +F++++NG         RGLRQGDPLSP +FTIV DV+++ +    E+ +  G++VG+++  V
Subjt:  LEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAV

Query:  SLLQYADDTLVFCPNNAED----------------------------IGNWWDMLNIILAALSYKLS-------RISLGGKYRRKALWEPLIERLRVKLD
        S LQ+ADDT+ F     ED                            I    D L+ +   L  K S        + LGG  +    W+P+IER+  +LD
Subjt:  SLLQYADDTLVFCPNNAED----------------------------IGNWWDMLNIILAALSYKLS-------RISLGGKYRRKALWEPLIERLRVKLD

Query:  NWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEK
         W+K  +S GGR+TL ++ L  +P Y  SL + P SV   +E + RDF+W+G       +LV WE        GGLG+G +  RN AL+ KWLWR+ +E 
Subjt:  NWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEK

Query:  DSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK
         +LW  +I  IYG    GW +  + +     PW AIA+
Subjt:  DSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK

A0A438JYU3 Protein SWEETIE9.8e-11932.12Show/hide
Query:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV---------------------------
        E++    DRR + S+W++R+  W +     ++GGIL++W    +  +  VLG+F V IK    G    W++ V                           
Subjt:  ESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV---------------------------

Query:  ---------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWS--RLD-----------------------------PDFKKRVESWWSALNL
                       L  +R++   K F+ FI D +LID P+ +  FTWS  ++D                             P FK+    WW     
Subjt:  ---------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWS--RLD-----------------------------PDFKKRVESWWSALNL

Query:  SGWAGYRFMEKMKGLKIKIKEWNKESY--------------------------------------------------------------EGDENTASFHK
        +GW G++FM K++ +K K+K WNK S+                                                              EGD N+  FHK
Subjt:  SGWAGYRFMEKMKGLKIKIKEWNKESY--------------------------------------------------------------EGDENTASFHK

Query:  WASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD---------
         A+  RNR  I  LEN+NG ++ + E I++E+L ++ KLY       + +EG+DW+ I  +    LE  F+EEEIFKA+  M   K+PG D         
Subjt:  WASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD---------

Query:  --------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEF
                                        AKVLA R++ VLH TI   Q AFVQGRQILDA+L+A+E V++ R  +++GV+ K+D EKAYD VSW+F
Subjt:  --------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEF

Query:  LDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVF
        LD VLE+KGFG  WRKW+ GCL++ +F+V++NG         RGLRQGDPLSP +FTIV DV+++ +    EK +L G++VG+++  VS LQ+ADDT+ F
Subjt:  LDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVF

Query:  CPNNAEDIGNWWDMLNII--LAALSYKLSR---------------------------------ISLGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGR
          +  ED+    ++L +   ++ L   L +                                 + LGG  +    W+P+IER+  +LD W+K  +S GGR
Subjt:  CPNNAEDIGNWWDMLNII--LAALSYKLSR---------------------------------ISLGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGR

Query:  VTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIY
        +TL Q+ L  +P Y  SL K P SV   +E + RDF+W+G       +LV W+    P   GGLG G +  RN AL+ KWLWR+ +E  +LW  +I  IY
Subjt:  VTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIY

Query:  GVDPFGWKSGELTKRRGNSPWTAIA
        G    GW    + +     PW AIA
Subjt:  GVDPFGWKSGELTKRRGNSPWTAIA

A0A438K6E3 LINE-1 retrotransposable element ORF2 protein7.3e-11431.69Show/hide
Query:  PTESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV-------------------------
        P    L     ++  S+W  +   W++  A  ++GGI+++W          VLG+F V +K         W+T V                         
Subjt:  PTESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV-------------------------

Query:  -----------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWSR---------------LDPDFKKRVESWWSALNLSGWAGYRFMEKMKG
                         +  +R++   + F++FI ++ L+DPP+ N  FTWS                L P+FK++   WW    + GW G++FM K+K 
Subjt:  -----------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWSR---------------LDPDFKKRVESWWSALNLSGWAGYRFMEKMKG

Query:  LKIKIKEWN--------------------------------------------------------------KESYEGDENTASFHKWASTMRNRAHISLL
        +K K+KEWN                                                              K   E D N+  FH+ A+  R+R  I  L
Subjt:  LKIKIKEWN--------------------------------------------------------------KESYEGDENTASFHKWASTMRNRAHISLL

Query:  ENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSDA------------------KVLA
         ++ G  L + E I +E++ F+  LY +     + +EGIDWA I  +    L+  FSEEE+  AV  +   K+PG D                   +VL+
Subjt:  ENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSDA------------------KVLA

Query:  ERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK---
         RL+KVLH TI   Q AFV+GRQILDA+L+A+E V++ R   + GV+ K+D EKAYD V W FLD VL+ KGF + WR W+ GCL++S+F++++NG    
Subjt:  ERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK---

Query:  ----PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNI--------------------
             RGLRQGDPLSP +FT+V DV+++ +    E  I   + VG+D+  VSLLQ+ADDT+ F   + + + N   +L +                    
Subjt:  ----PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNI--------------------

Query:  ----ILAALSYKLS-RIS----------LGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFI
            +L++L+  L  R+S          LGG  +    W+P++ER+  +LD W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +E + RDF+
Subjt:  ----ILAALSYKLS-RIS----------LGGKYRRKALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFI

Query:  WNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK
        W+G       +L++WE  + P ++GGLG G    RN AL+ KWLWRF +E+  LW  +IA IYG  P GW +    +     PW AIA+
Subjt:  WNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK

A5BV95 Reverse transcriptase domain-containing protein8.6e-11529.82Show/hide
Query:  IRRESLSKPTESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV-----------------
        I+ E      E++    DRR + S+W  +   W +  A  ++GG +++W  + +     V G+F V +K         W+T V                 
Subjt:  IRRESLSKPTESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKENVVVVDISVLGAFPVLIKCTFLGRFKGWVTGV-----------------

Query:  -------------------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWSRLDP-------------DFKKRVESWWSALNLSGWAGYRF
                                 L  TR++   + F++FI ++ LIDPP+ N  FTWS +                FK++   WW      GW G++F
Subjt:  -------------------------LNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWSRLDP-------------DFKKRVESWWSALNLSGWAGYRF

Query:  MEKMKGLKIKIKEWNKESY--------------------------------------------------------------EGDENTASFHKWASTMRNR
        M K+K +K K+KEWN  ++                                                              EGD N+  FH+ A+  R+R
Subjt:  MEKMKGLKIKIKEWNKESY--------------------------------------------------------------EGDENTASFHKWASTMRNR

Query:  AHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD-----------------
          I  L ++ G+ L + EDI +E++ F+  LY + +   +  EGIDW  I  +    L+  F+EEE+ +AV  +   K+PG D                 
Subjt:  AHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSD-----------------

Query:  ---------------------------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKK
                                                           AKVL+ RL+KVLH TISD Q AFV+GR ILDA+L+A+E V++ R   ++
Subjt:  ---------------------------------------------------AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKK

Query:  GVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQV
        G++ K+D EKAYD V W FLD VL+ KGF + WR WI GCL++S+F++++NG         RGLRQGDPLSP +FT+V DV+++ +    E  +  G+ V
Subjt:  GVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQV

Query:  GKDKVAVSLLQYADDTLVFCPNNAEDIGNW----------------------------WDMLNIILAALSYKLSR-------ISLGGKYRRKALWEPLIE
        G+D+  VSLLQ+ADDT+ F   + E + N                              ++L+ + +    ++S        + LGG  +    W+P++E
Subjt:  GKDKVAVSLLQYADDTLVFCPNNAEDIGNW----------------------------WDMLNIILAALSYKLSR-------ISLGGKYRRKALWEPLIE

Query:  RLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWL
        R+  +LD W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +E + R+F+W+G       +LV+WE  + P +LGGLG G +  RN AL+ KWL
Subjt:  RLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWL

Query:  WRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK
        WRF +E+  LW  +I  IYG  P GW +  + +     PW AIA+
Subjt:  WRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIAK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.3e-2224.38Show/hide
Query:  KVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDY-RAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIIN
        K+LA R+++ +   I   Q+ F+ G Q    I  +   ++   RA  K  V++ +D EKA+DK+   F+   L   G   ++ K I    +    ++I+N
Subjt:  KVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDY-RAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIIN

Query:  GK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYKLS-----
        G+         G RQG PLSP +F IV +V+A++++   ++K + G Q+GK++V +SL  +ADD +V+  N      N   +++       YK++     
Subjt:  GK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYKLS-----

Query:  -------------------------RIS-LGGKYRR------KALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSL--LKAPKSVTKA
                                 RI  LG +  R      K  ++PL++ ++   + W+  P S  GR+ + +  +     Y F+   +K P +    
Subjt:  -------------------------RIS-LGGKYRR------KALWEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSL--LKAPKSVTKA

Query:  MENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKD
        +E  +  FIWN    +   +++  +      K GG+ +        A +TK  W + Q +D
Subjt:  MENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKD

P08548 LINE-1 reverse transcriptase homolog2.0e-2023.42Show/hide
Query:  KVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKG-VLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIIN
        K+L  R+++ +   I   Q+ F+ G Q    I  +   ++    +K K  ++L +D EKA+D +   F+   L+  G    + K I    +    ++I+N
Subjt:  KVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKG-VLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIIN

Query:  GKP-------RGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYKLS-----
        G          G RQG PLSP +F IV +V+A +++   E+K + G  +G +++ +SL  +ADD +V+  N  +      +++        YK++     
Subjt:  GKP-------RGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYKLS-----

Query:  -------------------------RISLGGKYRRKAL-------WEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSL----LKAPKSVT
                                 ++   G Y  K +       +E L + +   ++ W+  P S  GR+ + +  ++ +P+ +++     +KAP S  
Subjt:  -------------------------RISLGGKYRRKAL-------WEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSL----LKAPKSVT

Query:  KAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKD
        K +E I   FIWN    +    L+  +  A  I L  L    LY ++  + T W W  ++E D
Subjt:  KAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKD

P0C2F6 Putative ribonuclease H protein At1g657506.3e-2236.05Show/hide
Query:  LIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALIT
        ++ER+  ++  WR+  +S  GR+TLT+ +L+S+P +  S +  P+S+   ++ +SR F+W     K   +LVKW     P K GGLGV A    N ALI+
Subjt:  LIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALIT

Query:  KWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIA
        K  WR  QEK+SLW  ++   Y V         + K   +S W +IA
Subjt:  KWLWRFSQEKDSLWRSMIAGIYGVDPFGWKSGELTKRRGNSPWTAIA

P11369 LINE-1 retrotransposable element ORF2 protein2.8e-2221.65Show/hide
Query:  RNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYER-----DINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSDA--------
        R++  I+ + N+ GDI T  E+I+  +  FY +LY       D   +F L+      ++    D L    S +EI   +  +   KSPG D         
Subjt:  RNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYER-----DINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSDA--------

Query:  -------------------------------------------------------------KVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAV
                                                                     K+LA R+++ +   I   Q+ F+ G Q    I  +   +
Subjt:  -------------------------------------------------------------KVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAV

Query:  EDYRAIKKKG-VLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCL
             +K K  +++ LD EKA+DK+   F+  VLE  G    +   I    +    ++ +NG+         G RQG PLSP +F IV +V+A++++   
Subjt:  EDYRAIKKKG-VLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGK-------PRGLRQGDPLSPSVFTIVGDVIAKSVQFCL

Query:  EKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYKL--------------------------SRISLGGKYRRKAL-----
        ++K + G Q+GK++V +SLL  ADD +V+  +         +++N     + YK+                          S ++   KY    L     
Subjt:  EKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYKL--------------------------SRISLGGKYRRKAL-----

Query:  ------WEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSL--LKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVG
              ++ L + ++  L  W+  P S  GR+ + +  +     Y F+   +K P      +E     F+WN    +   +L+K + T+  I +  L   
Subjt:  ------WEPLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSL--LKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVG

Query:  ALYQRNDALITKWLWRFSQEKD
         LY R   + T W W   ++ D
Subjt:  ALYQRNDALITKWLWRFSQEKD

P14381 Transposon TX1 uncharacterized 149 kDa protein1.3e-1124.4Show/hide
Query:  AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWI------TGCLNNSN
        AK ++ RLK VL   I   Q   V GR I D + +  + +   R        L LD EKA+D+V  ++L   L+   FG  +  ++        CL   N
Subjt:  AKVLAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWI------TGCLNNSN

Query:  FSVIIN-GKPRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYKLSRISLG
        +S+       RG+RQG PLS  +++     +A     CL +K L G  + +  + V L  YADD ++    +  D+    +   +  AA S +++     
Subjt:  FSVIIN-GKPRGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYKLSRISLG

Query:  GKYR------------RKALWEPLI-----------------------ERLRVKLDNWRKFP--ISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAME
        G               R   WE  I                       E +  +L  W+ F   +S  GR  +   ++ S   Y    L   +     ++
Subjt:  GKYR------------RKALWEPLI-----------------------ERLRVKLDNWRKFP--ISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAME

Query:  NISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGV
            DF+W       G + V    ++LP+K GG GV
Subjt:  NISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGV

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.7e-1125.22Show/hide
Query:  PLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRN----
        PL+E++RV++  W    +S  GR+ L  ++++S+  +  S  + P +  K +++I   F+W+G         V W     P   GGLG+ +L + N    
Subjt:  PLIERLRVKLDNWRKFPISKGGRVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRN----

Query:  -----DALITKWLWR
             +  +  W+W+
Subjt:  -----DALITKWLWR

AT4G20520.1 RNA binding;RNA-directed DNA polymerases9.7e-1043.37Show/hide
Query:  LAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGV----LLKLDLEKAYDKVSWEFLDVVLELKGFGKLW
        + ERLK ++   I   Q +F+ GR   D I+   EAV   R  +KKGV    LLKLDLEKAYD++ W++L+  L   GF ++W
Subjt:  LAERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGV----LLKLDLEKAYDKVSWEFLDVVLELKGFGKLW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.2e-0539.71Show/hide
Query:  IINGKP-------RGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDT
        IING P       RGLRQGDPLSP +F +  +V++   +   E+  L G +V  +   ++ L +ADDT
Subjt:  IINGKP-------RGLRQGDPLSPSVFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACACAGAAAGATGGATATGTCCTTTTAAACCAAAAGCCTGCTGCGGGGAAATAAGGAAGATTGATTGGAGCATGGTGATAGTTATTACGAAAAGGGACTTCCA
TGACGACTGGGGCCGAATTCTAGAAGTTATGCAACAACAATTAATGGAGCCGTTAGTCATAAATCCCTTTCAACCGGATAAGGCGCTGCTTAAATGCCCTTCGAATGAGT
TGACAGAGCTTTTAACAAAAAACAGGGGTTGGGTAAGCTTTGGTCGTCTGATTTTGAAACTTGAGAAATGGGATAAGTTGAAACATGGAAAGCTCAACACAGTGCCCAAC
TACGCGAAGGTGAAAATCAAAGACGGAAAGGAGATTTTTCACGTGCAGATTGTCACGTTTCATGACGGTCAAATGTTGGTTGACAGAGATGCCGGAATCCATGGCAGCTT
CTCGCCGGAGGCAGCCCACGCCTTCCACAGGGGCCCTTCGGATTTGGACTTTAGTCCAATGGATACTTGGCGAATCGAGAACGATTCGGATTACCCGATGGTTAATACCA
AAGATACTCCTGCAGTTAATGAAGGTACTGGGGACAGCTGGAAAATAATAATGAAAGTTTTGAAATGTCCCGCCAACTGGAGCTTGAAAATGAAAGGCCAAGACGCCAAT
CCAGCAAGTAGAGAACTGGAAGACGACTCTGAAGACACAGAGGGCTTCGATGAGAAAAGTATAGCGGATATCCCAGAAGACTACCAAATGTGCTTTCTGAAAGAAGGAGA
AGACGTTGGGACTTCTCAATCCGAAAGCCTTGAAGAAATACCTAATTCCCTCCAGCCGGAGAATTGCATAAGAAGAGAAAGCCTAAGCAAGCCTACGGAATCTCGACTGG
CTATGATAGATAGGAGGGTGATTAAGTCGTTGTGGAGCTCTAGACACTTTGGATGGCTATCCTTTGACGCTTACAATTCGGCTGGGGGTATCCTCGTGATGTGGAAAGAA
AATGTGGTGGTTGTTGATATCTCTGTCTTGGGGGCCTTCCCTGTCTTGATAAAGTGTACCTTTCTTGGGAGATTCAAGGGGTGGGTGACGGGCGTACTTAATGCCACTCG
CATCTCTAGATGTTCAAAGACGTTTAATAAATTTATTGATGATACAGATTTGATAGATCCTCCGATGGTTAATGGCCAATTTACTTGGTCTAGATTGGATCCCGACTTCA
AGAAGCGTGTGGAATCTTGGTGGTCAGCTTTAAATCTGTCTGGTTGGGCTGGTTACAGATTTATGGAAAAAATGAAAGGGCTAAAGATAAAGATCAAGGAGTGGAACAAA
GAATCATATGAAGGTGATGAGAATACTGCCTCTTTTCACAAATGGGCTTCGACAATGAGAAACAGAGCTCACATCTCTCTGTTAGAAAATGATAATGGGGATATTCTTAC
CTCTGAAGAGGACATTGAAAAGGAGGTTTTGGGCTTCTACAATAAGCTCTATGAGAGAGATATCAATCCTCGATTCACTTTAGAAGGGATCGATTGGGCCTCAATTGACG
TCCAACATAGGGATATGTTGGAAATGAGCTTTAGTGAAGAGGAGATTTTTAAAGCAGTGCAAGGAATGGGCAATCTGAAATCTCCAGGATCAGATGCTAAGGTTTTGGCG
GAAAGATTAAAGAAGGTTCTGCATCTAACCATCAGTGATTGCCAAATGGCTTTTGTGCAAGGTAGGCAAATTCTCGATGCCATTTTAGTGGCCTCCGAAGCAGTTGAAGA
CTATCGAGCAATTAAGAAGAAAGGTGTGTTGTTGAAATTAGATCTTGAAAAAGCCTACGACAAAGTTAGTTGGGAGTTCTTAGATGTTGTGCTTGAGTTAAAAGGGTTTG
GCAAGCTTTGGAGGAAGTGGATTACAGGATGCCTCAACAACTCTAACTTCTCTGTGATTATTAATGGGAAGCCTAGAGGGTTGAGACAAGGAGACCCGCTCTCTCCCTCC
GTTTTCACAATTGTGGGTGATGTTATAGCCAAATCTGTCCAATTTTGCTTAGAGAAGAAGATCCTGAATGGATGGCAGGTTGGTAAGGATAAGGTGGCAGTGTCTTTGCT
CCAGTATGCCGATGATACTCTTGTTTTTTGTCCAAACAATGCTGAAGACATAGGAAATTGGTGGGACATGCTGAATATTATCCTTGCAGCCCTTTCCTATAAATTATCTC
GGATTTCCCTTGGGGGAAAATACCGCCGGAAAGCCTTGTGGGAACCTTTGATAGAAAGACTCAGAGTCAAGCTTGACAATTGGCGAAAATTTCCTATATCCAAGGGTGGG
AGAGTGACATTAACTCAAACTATTCTCAATAGTATACCTCAATACCTCTTCTCCCTCTTAAAAGCTCCTAAATCTGTTACCAAAGCCATGGAGAACATCAGCAGAGACTT
TATATGGAATGGTGGGACTTATAAGCCGGGTAGCAATCTTGTTAAATGGGAATGGACTGCTCTCCCCATCAAGCTTGGTGGTTTAGGGGTGGGAGCCTTATATCAGAGAA
ATGATGCCCTAATCACTAAATGGCTTTGGAGATTTTCCCAGGAGAAAGACTCCTTGTGGAGATCAATGATCGCTGGTATTTATGGAGTTGACCCCTTCGGCTGGAAATCT
GGTGAACTAACTAAAAGGAGAGGCAACAGTCCTTGGACAGCCATTGCTAAAAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGACACAGAAAGATGGATATGTCCTTTTAAACCAAAAGCCTGCTGCGGGGAAATAAGGAAGATTGATTGGAGCATGGTGATAGTTATTACGAAAAGGGACTTCCA
TGACGACTGGGGCCGAATTCTAGAAGTTATGCAACAACAATTAATGGAGCCGTTAGTCATAAATCCCTTTCAACCGGATAAGGCGCTGCTTAAATGCCCTTCGAATGAGT
TGACAGAGCTTTTAACAAAAAACAGGGGTTGGGTAAGCTTTGGTCGTCTGATTTTGAAACTTGAGAAATGGGATAAGTTGAAACATGGAAAGCTCAACACAGTGCCCAAC
TACGCGAAGGTGAAAATCAAAGACGGAAAGGAGATTTTTCACGTGCAGATTGTCACGTTTCATGACGGTCAAATGTTGGTTGACAGAGATGCCGGAATCCATGGCAGCTT
CTCGCCGGAGGCAGCCCACGCCTTCCACAGGGGCCCTTCGGATTTGGACTTTAGTCCAATGGATACTTGGCGAATCGAGAACGATTCGGATTACCCGATGGTTAATACCA
AAGATACTCCTGCAGTTAATGAAGGTACTGGGGACAGCTGGAAAATAATAATGAAAGTTTTGAAATGTCCCGCCAACTGGAGCTTGAAAATGAAAGGCCAAGACGCCAAT
CCAGCAAGTAGAGAACTGGAAGACGACTCTGAAGACACAGAGGGCTTCGATGAGAAAAGTATAGCGGATATCCCAGAAGACTACCAAATGTGCTTTCTGAAAGAAGGAGA
AGACGTTGGGACTTCTCAATCCGAAAGCCTTGAAGAAATACCTAATTCCCTCCAGCCGGAGAATTGCATAAGAAGAGAAAGCCTAAGCAAGCCTACGGAATCTCGACTGG
CTATGATAGATAGGAGGGTGATTAAGTCGTTGTGGAGCTCTAGACACTTTGGATGGCTATCCTTTGACGCTTACAATTCGGCTGGGGGTATCCTCGTGATGTGGAAAGAA
AATGTGGTGGTTGTTGATATCTCTGTCTTGGGGGCCTTCCCTGTCTTGATAAAGTGTACCTTTCTTGGGAGATTCAAGGGGTGGGTGACGGGCGTACTTAATGCCACTCG
CATCTCTAGATGTTCAAAGACGTTTAATAAATTTATTGATGATACAGATTTGATAGATCCTCCGATGGTTAATGGCCAATTTACTTGGTCTAGATTGGATCCCGACTTCA
AGAAGCGTGTGGAATCTTGGTGGTCAGCTTTAAATCTGTCTGGTTGGGCTGGTTACAGATTTATGGAAAAAATGAAAGGGCTAAAGATAAAGATCAAGGAGTGGAACAAA
GAATCATATGAAGGTGATGAGAATACTGCCTCTTTTCACAAATGGGCTTCGACAATGAGAAACAGAGCTCACATCTCTCTGTTAGAAAATGATAATGGGGATATTCTTAC
CTCTGAAGAGGACATTGAAAAGGAGGTTTTGGGCTTCTACAATAAGCTCTATGAGAGAGATATCAATCCTCGATTCACTTTAGAAGGGATCGATTGGGCCTCAATTGACG
TCCAACATAGGGATATGTTGGAAATGAGCTTTAGTGAAGAGGAGATTTTTAAAGCAGTGCAAGGAATGGGCAATCTGAAATCTCCAGGATCAGATGCTAAGGTTTTGGCG
GAAAGATTAAAGAAGGTTCTGCATCTAACCATCAGTGATTGCCAAATGGCTTTTGTGCAAGGTAGGCAAATTCTCGATGCCATTTTAGTGGCCTCCGAAGCAGTTGAAGA
CTATCGAGCAATTAAGAAGAAAGGTGTGTTGTTGAAATTAGATCTTGAAAAAGCCTACGACAAAGTTAGTTGGGAGTTCTTAGATGTTGTGCTTGAGTTAAAAGGGTTTG
GCAAGCTTTGGAGGAAGTGGATTACAGGATGCCTCAACAACTCTAACTTCTCTGTGATTATTAATGGGAAGCCTAGAGGGTTGAGACAAGGAGACCCGCTCTCTCCCTCC
GTTTTCACAATTGTGGGTGATGTTATAGCCAAATCTGTCCAATTTTGCTTAGAGAAGAAGATCCTGAATGGATGGCAGGTTGGTAAGGATAAGGTGGCAGTGTCTTTGCT
CCAGTATGCCGATGATACTCTTGTTTTTTGTCCAAACAATGCTGAAGACATAGGAAATTGGTGGGACATGCTGAATATTATCCTTGCAGCCCTTTCCTATAAATTATCTC
GGATTTCCCTTGGGGGAAAATACCGCCGGAAAGCCTTGTGGGAACCTTTGATAGAAAGACTCAGAGTCAAGCTTGACAATTGGCGAAAATTTCCTATATCCAAGGGTGGG
AGAGTGACATTAACTCAAACTATTCTCAATAGTATACCTCAATACCTCTTCTCCCTCTTAAAAGCTCCTAAATCTGTTACCAAAGCCATGGAGAACATCAGCAGAGACTT
TATATGGAATGGTGGGACTTATAAGCCGGGTAGCAATCTTGTTAAATGGGAATGGACTGCTCTCCCCATCAAGCTTGGTGGTTTAGGGGTGGGAGCCTTATATCAGAGAA
ATGATGCCCTAATCACTAAATGGCTTTGGAGATTTTCCCAGGAGAAAGACTCCTTGTGGAGATCAATGATCGCTGGTATTTATGGAGTTGACCCCTTCGGCTGGAAATCT
GGTGAACTAACTAAAAGGAGAGGCAACAGTCCTTGGACAGCCATTGCTAAAAACTAA
Protein sequenceShow/hide protein sequence
MEDTERWICPFKPKACCGEIRKIDWSMVIVITKRDFHDDWGRILEVMQQQLMEPLVINPFQPDKALLKCPSNELTELLTKNRGWVSFGRLILKLEKWDKLKHGKLNTVPN
YAKVKIKDGKEIFHVQIVTFHDGQMLVDRDAGIHGSFSPEAAHAFHRGPSDLDFSPMDTWRIENDSDYPMVNTKDTPAVNEGTGDSWKIIMKVLKCPANWSLKMKGQDAN
PASRELEDDSEDTEGFDEKSIADIPEDYQMCFLKEGEDVGTSQSESLEEIPNSLQPENCIRRESLSKPTESRLAMIDRRVIKSLWSSRHFGWLSFDAYNSAGGILVMWKE
NVVVVDISVLGAFPVLIKCTFLGRFKGWVTGVLNATRISRCSKTFNKFIDDTDLIDPPMVNGQFTWSRLDPDFKKRVESWWSALNLSGWAGYRFMEKMKGLKIKIKEWNK
ESYEGDENTASFHKWASTMRNRAHISLLENDNGDILTSEEDIEKEVLGFYNKLYERDINPRFTLEGIDWASIDVQHRDMLEMSFSEEEIFKAVQGMGNLKSPGSDAKVLA
ERLKKVLHLTISDCQMAFVQGRQILDAILVASEAVEDYRAIKKKGVLLKLDLEKAYDKVSWEFLDVVLELKGFGKLWRKWITGCLNNSNFSVIINGKPRGLRQGDPLSPS
VFTIVGDVIAKSVQFCLEKKILNGWQVGKDKVAVSLLQYADDTLVFCPNNAEDIGNWWDMLNIILAALSYKLSRISLGGKYRRKALWEPLIERLRVKLDNWRKFPISKGG
RVTLTQTILNSIPQYLFSLLKAPKSVTKAMENISRDFIWNGGTYKPGSNLVKWEWTALPIKLGGLGVGALYQRNDALITKWLWRFSQEKDSLWRSMIAGIYGVDPFGWKS
GELTKRRGNSPWTAIAKN