; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035668 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035668
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr3:26669623..26673903
RNA-Seq ExpressionLag0035668
SyntenyLag0035668
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0050789 - regulation of biological process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN75040.1 hypothetical protein VITISV_026478 [Vitis vinifera]1.1e-14437.75Show/hide
Query:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGL---------------------------------------------
        + W++EGD N+KFFHR+ + RR +  I  L+S  G +L    DI  E ++F+  L                                             
Subjt:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGL---------------------------------------------

Query:  --------FTI-----------------------DSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE
                FTI                       + VIN + N T+I L+PKK  +  ++DYRPISL+   YKIIA+VLS RL+KVLH TIS++Q AFVE
Subjt:  --------FTI-----------------------DSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE

Query:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI
        GR ILDA LIANE++++       G++ K+D EKA+D VDWGFLD +L+ KGF   WR WI GCLSS++++I++NG  +G +  SRG+RQGDPLSPFLF 
Subjt:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI

Query:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW
        LV+D LSR+L      G      +G+    V+ LQFADDT+ FS  + + L+NL  ++ +F   SGL IN  KS + G+   QE ++ LA  F CRV  W
Subjt:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW

Query:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM
        P SYLGLPLGGNPK+  FW PVVE+I  +L  W+  +LS GGR TLIQS LS++P YFLSLF++P       EK+ R+FLW GA      H V W     
Subjt:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM

Query:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT
        P   GGLG G I  RN +LL KW+WR+  E   LW +VI + Y     +PN W      R S + PWK I+++     P VR ++GNG  I FW D+W  
Subjt:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT

Query:  DIDFATSFPLIYRLSNNAHATVADLW---NSENNDWDLGLRRCLKDSEIADWASLAHILQAFAPKRTNDSWQWSLDPSKSY----------SVRSMMHFL
                   + LS++   TV   +   +  +N       + L  S++         + A     TND  Q    P KS           +  S+ H  
Subjt:  DIDFATSFPLIYRLSNNAHATVADLW---NSENNDWDLGLRRCLKDSEIADWASLAHILQAFAPKRTNDSWQWSLDPSKSY----------SVRSMMHFL

Query:  KY-------WNFILKEFGWNMIMPGSMQAILSVVFSGHPFKGNAETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSHPFKDYS
         +       WN + K  G + + P S + +L + F G       +TLW     +  W +W ERN RIF D   S   L DL +F +  W  CS  F+   
Subjt:  KY-------WNFILKEFGWNMIMPGSMQAILSVVFSGHPFKGNAETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSHPFKDYS

Query:  LDFLTLNW
        L+ + LNW
Subjt:  LDFLTLNW

CAN82685.1 hypothetical protein VITISV_000485 [Vitis vinifera]1.3e-14742.47Show/hide
Query:  DIEAEFLDFYNGLFTIDSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDW
        D+   FL+F+      + VIN + N T+I L+PKK  +  ++DYRPISL+   YKIIA+VLS RL+KVLH TIS +Q AFVEGR ILDA LIANE++++ 
Subjt:  DIEAEFLDFYNGLFTIDSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDW

Query:  HSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGK
           +  G++ K+D EKA+D VDWGFLD +L+ K F   WR WI GCLSS++++I++NG  +G +  SRG+RQGDPLSPFLF LV+D LSR+L      G 
Subjt:  HSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGK

Query:  IASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKSPLF
             +G+    V+ LQFADDT+ FS  + + L+NL  ++ +F   S L IN  KS + G+   QE ++ LA  F CRV  WP SYLGLPLGGNPK+  F
Subjt:  IASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKSPLF

Query:  WQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNES
        W  VVE+I  +L  W+  +LS GGR TLIQS LS++P YF+SLF++P       EK+ R+FLW GA      H V W     P   GGLG G I  RN +
Subjt:  WQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNES

Query:  LLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDIDFATSFPLIYRLSNNA
        LL KW+WR+  E   LW +VI + Y     +PN W      R S + PWK I+++     P VR ++GNG  I FW D+W  +    + F  +YR+    
Subjt:  LLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDIDFATSFPLIYRLSNNA

Query:  HATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA--FAPKRTNDSWQWSLDPSKSYSVRSMMHFL-KYWN---FILKEFGWNMIMPGSMQAIL
        + TV++ L NS    W+L  RR L DSEI     L   L +  F+P    DS  WSL  S  +SV+S    L K  N   F+  +F W+  +P  ++A+ 
Subjt:  HATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA--FAPKRTNDSWQWSLDPSKSYSVRSMMHFL-KYWN---FILKEFGWNMIMPGSMQAIL

Query:  SVVFSGHPFK-----------------------GNA---ETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSHPFKDYSLDFLT
         +V  G  FK                       GN+   +TLW     +  W +W ERN RIF D   S   L DL +F +  W  CS  F+   L+ L 
Subjt:  SVVFSGHPFK-----------------------GNA---ETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSHPFKDYSLDFLT

Query:  LNWK
        LNW+
Subjt:  LNWK

KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.4e-14336.32Show/hide
Query:  WLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFT---------IDS---------------------------------
        WL+EGDEN+ FFHRI +AR+++N I E+   EG    +++ I + F+ F++ +F          ID+                                 
Subjt:  WLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFT---------IDS---------------------------------

Query:  -----------------------------------VINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEG
                                           VIN  +N TYI LIPKK D     D+RPISL    YKIIA+ LSNRLK  L +TISENQ+AFV+ 
Subjt:  -----------------------------------VINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEG

Query:  RQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFIL
        RQI DA L+ANE ++ W  KK +G I+KLD+EKAFD ++W F+D +L  K F +LWRKWI GC+S+  YSII+NGRP+G+I  +RG+RQGDPLSPFLF++
Subjt:  RQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFIL

Query:  VSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWP
          D LSRLLSH    G I       S+  ++H+ FADD LLF   N   L NL   + +FE ASGL IN  KS L+ + VS+    + A  +G    + P
Subjt:  VSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWP

Query:  SSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMP
         SYLG+PLGGNPKS LFW  V EKI  KL+NW+Y  +SKGGR TLI+STLS++P Y LS+F+ P+ T K+ EK  R+FLW+G  S+ G H ++W K    
Subjt:  SSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMP

Query:  IVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYP-NWPMCADSRSFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDI
           GGLGI  +   N++LL+KW+WRYL E  +LW+++I+ KY  +  YP + P    S + KAPW+ I       + +    L NG+ ISFW+  W+ + 
Subjt:  IVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYP-NWPMCADSRSFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDI

Query:  DFATSFPLIYRLSNNAHATVADLWNSENNDWDLGLRRCLKDSEIADWASLAHILQAFAPKRTNDSWQWSLDPSKSYSVRSMMHFLK--------------
           T++P ++ LS +   TV D WN+ +N W +  RR L D E   WA +  IL        +    W  D   S+S+ S    +               
Subjt:  DFATSFPLIYRLSNNAHATVADLWNSENNDWDLGLRRCLKDSEIADWASLAHILQAFAPKRTNDSWQWSLDPSKSYSVRSMMHFLK--------------

Query:  ---YWN----FILKEFGWNMI-------------MPG-------------------------------------------SMQAILSVVF--SGHPFKGN
            W       +K F W +I             MP                                            S   +  V F  S H F  N
Subjt:  ---YWN----FILKEFGWNMI-------------MPG-------------------------------------------SMQAILSVVF--SGHPFKGN

Query:  AETLWLAFNRSFFWSLWCERNGRIFS--DTHSSFADLLDLAIFNALYWCKCSHPFKDYSLDFLTLNWKSF
         + ++     + FW +WCERN RIF     H + A++ +        WC     F++YS   + LN  +F
Subjt:  AETLWLAFNRSFFWSLWCERNGRIFS--DTHSSFADLLDLAIFNALYWCKCSHPFKDYSLDFLTLNWKSF

RVW45791.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]7.9e-14837.35Show/hide
Query:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFT-----------ID------------------------------
        + W++EGD N+KFFHR+ + RR +  I  L+S  G +L     I  E ++F+  L++           ID                              
Subjt:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFT-----------ID------------------------------

Query:  -----------------------------------SVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE
                                            VIN + N T+I ++PKK     ++DYRPISL+   YKIIA+VLS RL+KVLH TI  +Q AFVE
Subjt:  -----------------------------------SVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE

Query:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI
        GRQILDA LIANE++++       G++ K+D EKA+D V+WGFLD +L+ KGF   WR W+ GCLSS++++I++NG  +G +  SRG+RQGDPLSPFLF 
Subjt:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI

Query:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW
        LV+D LSRL+      G      +G+    V+ LQFADDT+ FS  +   L+NL  ++ +F   SGL IN  KS + G+   QE ++ LA    CRV  W
Subjt:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW

Query:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM
        P SYLGLPLGGNPK+  FW PVVE+I  +L  W+  +LS GGR TLIQS LS++P YFLSLF++P       EK+ RDFLW GA+     H + W     
Subjt:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM

Query:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT
        P   GGLG G    RN +LL KW+WR+  E   LW +VI + Y     +PN W      R S + PWK I+++     P VR ++GNG  I FW D+W  
Subjt:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT

Query:  DIDFATSFPLIYRLSNNAHATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA-FAPKRTNDSWQWSLDPSKSYSVRSMMHFLKYWN----FIL
        +      F  +YR+ +  + TV++ L NS    W+    R L DSEI     L   L +      ++DS  WSL  S S+SV+S  + L   +    F+ 
Subjt:  DIDFATSFPLIYRLSNNAHATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA-FAPKRTNDSWQWSLDPSKSYSVRSMMHFLKYWN----FIL

Query:  KEFGWNMIMPGSMQAI--------------------LSVVFSGHPFKGNAETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSH
         +F W+  +P  ++A+                    L + F G       + LW     +  W +W ERN RIF D   +   + DL  F +  W  C+ 
Subjt:  KEFGWNMIMPGSMQAI--------------------LSVVFSGHPFKGNAETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSH

Query:  PFKDYSLDFLTLNW
         F+   L  L +NW
Subjt:  PFKDYSLDFLTLNW

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]3.2e-14140.47Show/hide
Query:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGL---------------------------------------------
        + W++EGD N+KFFHR+ + RR +  I  L+S  G +L    DI  E ++F+  L                                             
Subjt:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGL---------------------------------------------

Query:  --------FTI-----------------------DSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE
                FTI                       + VIN + N T+I L+PKK  +  ++DYRPISL+   YKIIA+VLS RL+KVLH TIS++Q AFVE
Subjt:  --------FTI-----------------------DSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE

Query:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI
        GR ILDA LIANE++++       G++ K+D EKA+D VDWGFLD +L+ KGF   WR WI GCLSS++++I++NG  +G +  SRG+RQGDPLSPFLF 
Subjt:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI

Query:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW
        LV+D LSR+L      G      +G+    V+ LQFADDT+ FS  + + L+NL  ++ +F   SGL IN  KS + G+   QE ++ LA  F CRV  W
Subjt:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW

Query:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM
        P SYLGLPLGGNPK+  FW PVVE+I  +L  W+  +LS GGR TLIQS LS++P YFLSLF++P       EK+ R+FLW GA      H V W     
Subjt:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM

Query:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT
        P   GGLG G I  RN +LL KW+WR+  E   LW +VI + Y     +PN W      R S + PWK I+++     P VR ++GNG  I FW D+W  
Subjt:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT

Query:  DIDFATSFPLIYRLSNNAHATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA--FAPKRTNDSWQWSLDPSKSYSVRSMMHFL-KYWN---FI
        +    + F  +YR+ +  + TV++ L NS    W+L  RR L DSEI     L   L +  F P    DS  WSL  S  ++V+S    L K  N   F+
Subjt:  DIDFATSFPLIYRLSNNAHATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA--FAPKRTNDSWQWSLDPSKSYSVRSMMHFL-KYWN---FI

Query:  LKEFGWNMIMPGSMQAILSVVFSG
          +F W+  +P  ++A+  +V  G
Subjt:  LKEFGWNMIMPGSMQAILSVVFSG

TrEMBL top hitse value%identityAlignment
A0A438ED82 LINE-1 retrotransposable element ORF2 protein3.8e-14837.35Show/hide
Query:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFT-----------ID------------------------------
        + W++EGD N+KFFHR+ + RR +  I  L+S  G +L     I  E ++F+  L++           ID                              
Subjt:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFT-----------ID------------------------------

Query:  -----------------------------------SVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE
                                            VIN + N T+I ++PKK     ++DYRPISL+   YKIIA+VLS RL+KVLH TI  +Q AFVE
Subjt:  -----------------------------------SVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE

Query:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI
        GRQILDA LIANE++++       G++ K+D EKA+D V+WGFLD +L+ KGF   WR W+ GCLSS++++I++NG  +G +  SRG+RQGDPLSPFLF 
Subjt:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI

Query:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW
        LV+D LSRL+      G      +G+    V+ LQFADDT+ FS  +   L+NL  ++ +F   SGL IN  KS + G+   QE ++ LA    CRV  W
Subjt:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW

Query:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM
        P SYLGLPLGGNPK+  FW PVVE+I  +L  W+  +LS GGR TLIQS LS++P YFLSLF++P       EK+ RDFLW GA+     H + W     
Subjt:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM

Query:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT
        P   GGLG G    RN +LL KW+WR+  E   LW +VI + Y     +PN W      R S + PWK I+++     P VR ++GNG  I FW D+W  
Subjt:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT

Query:  DIDFATSFPLIYRLSNNAHATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA-FAPKRTNDSWQWSLDPSKSYSVRSMMHFLKYWN----FIL
        +      F  +YR+ +  + TV++ L NS    W+    R L DSEI     L   L +      ++DS  WSL  S S+SV+S  + L   +    F+ 
Subjt:  DIDFATSFPLIYRLSNNAHATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA-FAPKRTNDSWQWSLDPSKSYSVRSMMHFLKYWN----FIL

Query:  KEFGWNMIMPGSMQAI--------------------LSVVFSGHPFKGNAETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSH
         +F W+  +P  ++A+                    L + F G       + LW     +  W +W ERN RIF D   +   + DL  F +  W  C+ 
Subjt:  KEFGWNMIMPGSMQAI--------------------LSVVFSGHPFKGNAETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSH

Query:  PFKDYSLDFLTLNW
         F+   L  L +NW
Subjt:  PFKDYSLDFLTLNW

A0A5A7TTK1 LINE-1 retrotransposable element ORF2 protein1.7e-14336.32Show/hide
Query:  WLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFT---------IDS---------------------------------
        WL+EGDEN+ FFHRI +AR+++N I E+   EG    +++ I + F+ F++ +F          ID+                                 
Subjt:  WLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFT---------IDS---------------------------------

Query:  -----------------------------------VINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEG
                                           VIN  +N TYI LIPKK D     D+RPISL    YKIIA+ LSNRLK  L +TISENQ+AFV+ 
Subjt:  -----------------------------------VINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEG

Query:  RQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFIL
        RQI DA L+ANE ++ W  KK +G I+KLD+EKAFD ++W F+D +L  K F +LWRKWI GC+S+  YSII+NGRP+G+I  +RG+RQGDPLSPFLF++
Subjt:  RQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFIL

Query:  VSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWP
          D LSRLLSH    G I       S+  ++H+ FADD LLF   N   L NL   + +FE ASGL IN  KS L+ + VS+    + A  +G    + P
Subjt:  VSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWP

Query:  SSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMP
         SYLG+PLGGNPKS LFW  V EKI  KL+NW+Y  +SKGGR TLI+STLS++P Y LS+F+ P+ T K+ EK  R+FLW+G  S+ G H ++W K    
Subjt:  SSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMP

Query:  IVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYP-NWPMCADSRSFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDI
           GGLGI  +   N++LL+KW+WRYL E  +LW+++I+ KY  +  YP + P    S + KAPW+ I       + +    L NG+ ISFW+  W+ + 
Subjt:  IVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYP-NWPMCADSRSFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDI

Query:  DFATSFPLIYRLSNNAHATVADLWNSENNDWDLGLRRCLKDSEIADWASLAHILQAFAPKRTNDSWQWSLDPSKSYSVRSMMHFLK--------------
           T++P ++ LS +   TV D WN+ +N W +  RR L D E   WA +  IL        +    W  D   S+S+ S    +               
Subjt:  DFATSFPLIYRLSNNAHATVADLWNSENNDWDLGLRRCLKDSEIADWASLAHILQAFAPKRTNDSWQWSLDPSKSYSVRSMMHFLK--------------

Query:  ---YWN----FILKEFGWNMI-------------MPG-------------------------------------------SMQAILSVVF--SGHPFKGN
            W       +K F W +I             MP                                            S   +  V F  S H F  N
Subjt:  ---YWN----FILKEFGWNMI-------------MPG-------------------------------------------SMQAILSVVF--SGHPFKGN

Query:  AETLWLAFNRSFFWSLWCERNGRIFS--DTHSSFADLLDLAIFNALYWCKCSHPFKDYSLDFLTLNWKSF
         + ++     + FW +WCERN RIF     H + A++ +        WC     F++YS   + LN  +F
Subjt:  AETLWLAFNRSFFWSLWCERNGRIFS--DTHSSFADLLDLAIFNALYWCKCSHPFKDYSLDFLTLNWKSF

A0A803P465 Uncharacterized protein2.2e-14841.25Show/hide
Query:  GWWM----SWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFTI-----------------------------------
        G WM     W +EGD N++FFH +L+AR+ +NTIS +   +G+ L    +I  E + F++ L+T                                    
Subjt:  GWWM----SWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFTI-----------------------------------

Query:  -----------------------------------------DSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISE
                                                 D  I  ++NET+ICLIPKK+ +  V DYRPISLI   YKIIA++LS RL+ VL  TI E
Subjt:  -----------------------------------------DSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISE

Query:  NQMAFVEGRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDP
         Q AFVEGRQILD+ LIANE +ED+ S+   GL+ K+D EKA+D+V+W F+D +L  KGFG +WRKWI GC+SS ++S+ IN  PRGK   SRG+RQGDP
Subjt:  NQMAFVEGRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDP

Query:  LSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKF
        LSPFLF LV+D L R+ +     G I+   +GK  + V+HLQFADDT+ F   N Q+L  L  ++  F   SGL IN SKS+LLG+ + +E ++ LAR+ 
Subjt:  LSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKF

Query:  GCRVGTWPSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNV
        GC VG+WP  YLG+PLGG+P+   FW+PV++K   +L  W+  FLSKGGR TLIQS LS++P+YFLSLF+ P    K+ EK++RDFLWEG++S+GG H V
Subjt:  GCRVGTWPSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNV

Query:  SWAKTQMPIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPNWPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFW
        +W +   P   GGLGIG ++ RN+SLL KW+WR+  E+ SLW +V+ ++Y   D    W     SR S K PW++IS L       V   LG G+ I FW
Subjt:  SWAKTQMPIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPNWPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFW

Query:  HDIWATDIDFATSFPLIYRLSNNAHATVADLWNSEN------NDWDLGLRRCLKDSEIADWASLAHILQAFAPKR----TNDSWQWSLDPSKSYSVRSMM
         D+W  D    ++FP +  +S   +  + +L   E         W+   RR L D E+    SL  ++Q     R    + DS  W  DPS  +S +S  
Subjt:  HDIWATDIDFATSFPLIYRLSNNAHATVADLWNSEN------NDWDLGLRRCLKDSEIADWASLAHILQAFAPKR----TNDSWQWSLDPSKSYSVRSMM

Query:  HFL
         ++
Subjt:  HFL

A5AY60 Reverse transcriptase domain-containing protein6.5e-14842.47Show/hide
Query:  DIEAEFLDFYNGLFTIDSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDW
        D+   FL+F+      + VIN + N T+I L+PKK  +  ++DYRPISL+   YKIIA+VLS RL+KVLH TIS +Q AFVEGR ILDA LIANE++++ 
Subjt:  DIEAEFLDFYNGLFTIDSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDW

Query:  HSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGK
           +  G++ K+D EKA+D VDWGFLD +L+ K F   WR WI GCLSS++++I++NG  +G +  SRG+RQGDPLSPFLF LV+D LSR+L      G 
Subjt:  HSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGK

Query:  IASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKSPLF
             +G+    V+ LQFADDT+ FS  + + L+NL  ++ +F   S L IN  KS + G+   QE ++ LA  F CRV  WP SYLGLPLGGNPK+  F
Subjt:  IASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKSPLF

Query:  WQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNES
        W  VVE+I  +L  W+  +LS GGR TLIQS LS++P YF+SLF++P       EK+ R+FLW GA      H V W     P   GGLG G I  RN +
Subjt:  WQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNES

Query:  LLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDIDFATSFPLIYRLSNNA
        LL KW+WR+  E   LW +VI + Y     +PN W      R S + PWK I+++     P VR ++GNG  I FW D+W  +    + F  +YR+    
Subjt:  LLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDIDFATSFPLIYRLSNNA

Query:  HATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA--FAPKRTNDSWQWSLDPSKSYSVRSMMHFL-KYWN---FILKEFGWNMIMPGSMQAIL
        + TV++ L NS    W+L  RR L DSEI     L   L +  F+P    DS  WSL  S  +SV+S    L K  N   F+  +F W+  +P  ++A+ 
Subjt:  HATVAD-LWNSENNDWDLGLRRCLKDSEIADWASLAHILQA--FAPKRTNDSWQWSLDPSKSYSVRSMMHFL-KYWN---FILKEFGWNMIMPGSMQAIL

Query:  SVVFSGHPFK-----------------------GNA---ETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSHPFKDYSLDFLT
         +V  G  FK                       GN+   +TLW     +  W +W ERN RIF D   S   L DL +F +  W  CS  F+   L+ L 
Subjt:  SVVFSGHPFK-----------------------GNA---ETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSHPFKDYSLDFLT

Query:  LNWK
        LNW+
Subjt:  LNWK

A5BV95 Reverse transcriptase domain-containing protein5.1e-14537.75Show/hide
Query:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGL---------------------------------------------
        + W++EGD N+KFFHR+ + RR +  I  L+S  G +L    DI  E ++F+  L                                             
Subjt:  MSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGL---------------------------------------------

Query:  --------FTI-----------------------DSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE
                FTI                       + VIN + N T+I L+PKK  +  ++DYRPISL+   YKIIA+VLS RL+KVLH TIS++Q AFVE
Subjt:  --------FTI-----------------------DSVINVALNETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVE

Query:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI
        GR ILDA LIANE++++       G++ K+D EKA+D VDWGFLD +L+ KGF   WR WI GCLSS++++I++NG  +G +  SRG+RQGDPLSPFLF 
Subjt:  GRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFI

Query:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW
        LV+D LSR+L      G      +G+    V+ LQFADDT+ FS  + + L+NL  ++ +F   SGL IN  KS + G+   QE ++ LA  F CRV  W
Subjt:  LVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTW

Query:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM
        P SYLGLPLGGNPK+  FW PVVE+I  +L  W+  +LS GGR TLIQS LS++P YFLSLF++P       EK+ R+FLW GA      H V W     
Subjt:  PSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQM

Query:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT
        P   GGLG G I  RN +LL KW+WR+  E   LW +VI + Y     +PN W      R S + PWK I+++     P VR ++GNG  I FW D+W  
Subjt:  PIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPN-WPMCADSR-SFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWAT

Query:  DIDFATSFPLIYRLSNNAHATVADLW---NSENNDWDLGLRRCLKDSEIADWASLAHILQAFAPKRTNDSWQWSLDPSKSY----------SVRSMMHFL
                   + LS++   TV   +   +  +N       + L  S++         + A     TND  Q    P KS           +  S+ H  
Subjt:  DIDFATSFPLIYRLSNNAHATVADLW---NSENNDWDLGLRRCLKDSEIADWASLAHILQAFAPKRTNDSWQWSLDPSKSY----------SVRSMMHFL

Query:  KY-------WNFILKEFGWNMIMPGSMQAILSVVFSGHPFKGNAETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSHPFKDYS
         +       WN + K  G + + P S + +L + F G       +TLW     +  W +W ERN RIF D   S   L DL +F +  W  CS  F+   
Subjt:  KY-------WNFILKEFGWNMIMPGSMQAILSVVFSGHPFKGNAETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSHPFKDYS

Query:  LDFLTLNW
        L+ + LNW
Subjt:  LDFLTLNW

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.3e-2724.87Show/hide
Query:  DSVINVALNETYICLIPKK-VDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQ-ILDASLIANEIIEDWHSKKNRGLIIKLDL
        + ++  +  E  I LIPK   D     ++RPISL+    KI+ ++L+NR+++ +   I  +Q+ F+ G Q   +     N I     +K    +II +D 
Subjt:  DSVINVALNETYICLIPKK-VDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQ-ILDASLIANEIIEDWHSKKNRGLIIKLDL

Query:  EKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTVN
        EKAFDK+   F+   L   G   ++ K I         +II+NG+         G RQG PLSP LF +V + L+R +     +  I    +GK  + ++
Subjt:  EKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTVN

Query:  HLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKSPLF---WQPVVEKIHHK
           FADD +++      + +NL  LI  F   SG  IN  KS+      +++  + +  +    + +    YLG+ L  + K  LF   ++P++++I   
Subjt:  HLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKSPLF---WQPVVEKIHHK

Query:  LHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSL--FRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSKWIW
         + W+    S  GR  +++  +    IY  +    ++P       EK    F+W   ++      ++ +        GG+ + + K   ++ ++K  W
Subjt:  LHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSL--FRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSKWIW

P08548 LINE-1 reverse transcriptase homolog5.7e-2425.77Show/hide
Query:  ETYICLIPKK-VDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDWHSKKNRG-LIIKLDLEKAFDKVDW
        E  I LIPK   D     +YRPISL+    KI+ ++L+NR+++ +   I  +Q+ F+ G Q       +  +I+  +  KN+  +I+ +D EKAFD +  
Subjt:  ETYICLIPKK-VDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDWHSKKNRG-LIIKLDLEKAFDKVDW

Query:  GFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSR-GIRQGDPLSPFLFILVSDCLSRLLSH-----GAHLGKIASHCIGKSSLTVNHLQ
         F+   L+  G    + K I    S    +II+NG  + K  P R G RQG PLSP LF +V + L+  +       G H+G          S  +    
Subjt:  GFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSR-GIRQGDPLSPFLFILVSDCLSRLLSH-----GAHLGKIASHCIGKSSLTVNHLQ

Query:  FADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKSPLF---WQPVVEKIHHKLHN
        FADD +++      +   L  +I  + N SG  IN  KS       + +    +       V      YLG+ L  + K  L+   ++ + ++I   ++ 
Subjt:  FADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKSPLF---WQPVVEKIHHKLHN

Query:  WQYFFLSKGGRHTLIQSTLSNMPIYFLSL--FRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSK--WIWRYL
        W+    S  GR  +++ ++    IY  +    + P    K  EKI+  F+W   K       ++          GG+ + +++   +S++ K  W W + 
Subjt:  WQYFFLSKGGRHTLIQSTLSNMPIYFLSL--FRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSK--WIWRYL

Query:  CEEGSLW-----QQVIKAKYYHL
          E  +W     Q++  A Y++L
Subjt:  CEEGSLW-----QQVIKAKYYHL

P0C2F6 Putative ribonuclease H protein At1g657501.4e-2229.56Show/hide
Query:  VVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLS
        ++E++  ++  W+   LS  GR TL ++ LS+MP++ +S   +P   +   +++ R FLW         H V W+K   P   GGLG+   K  N +L+S
Subjt:  VVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLS

Query:  KWIWRYLCEEGSLWQQVIKAKYYHLDAYPNWPMCADSRSFKAPWKEIS-RLNIMVKPHVRRILGNGNHISFWHDIWATDIDFATSFPLIYRLSNNAHAT-
        K  WR L E+ SLW  V++ K YH+    +        S+ + W+ I+  L  +V   V  I G+G  I FW D W       +  PL+  L N    T 
Subjt:  KWIWRYLCEEGSLWQQVIKAKYYHLDAYPNWPMCADSRSFKAPWKEIS-RLNIMVKPHVRRILGNGNHISFWHDIWATDIDFATSFPLIYRLSNNAHAT-

Query:  -----VADLWNSENNDWDLGLRRCLKDSEIADWASL---AHILQAFAPKRTNDSWQWSLDPSKSYSVRSMMHFL
               DLW      WD        D    +   L   A +L      R   SW++S D    +SVRS    L
Subjt:  -----VADLWNSENNDWDLGLRRCLKDSEIADWASL---AHILQAFAPKRTNDSWQWSLDPSKSYSVRSMMHFL

P11369 LINE-1 retrotransposable element ORF2 protein1.6e-2625.18Show/hide
Query:  IDSVINVALNETYICLIPK-KVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDWHSKKNRG-LIIKLD
        ++  +  +  E  I LIPK + D   + ++RPISL+    KI+ ++L+NR+++ + + I  +Q+ F+ G Q       +  +I   +  K++  +II LD
Subjt:  IDSVINVALNETYICLIPK-KVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDWHSKKNRG-LIIKLD

Query:  LEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTV
         EKAFDK+   F+  +L   G    +   I    S    +I +NG     I    G RQG PLSP+LF +V + L+R +     +  I    IGK  + +
Subjt:  LEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTV

Query:  NHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKS--PLFWQPVVEKIHHK
        + L  ADD +++ +    +   L NLI+ F    G  IN +KS       +++   ++       + T    YLG+ L    K      ++ + ++I   
Subjt:  NHLQFADDTLLFSTMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKS--PLFWQPVVEKIHHK

Query:  LHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSL--FRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSK--WIW
        L  W+    S  GR  +++  +    IY  +    ++PTQ     E  +  F+W   K       +   +T      GG+ + ++K    +++ K  W W
Subjt:  LHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSL--FRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSK--WIW

Query:  RYLCEEGSLWQQV
         Y   +   W ++
Subjt:  RYLCEEGSLWQQV

P14381 Transposon TX1 uncharacterized 149 kDa protein2.0e-2928.92Show/hide
Query:  ICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDWHSKKNRGL---IIKLDLEKAFDKVDWGF
        + L+PKK D R + ++RP+SL+   YKI+A+ +S RLK VL   I  +Q   V GR I D   +  +++   H  +  GL    + LD EKAFD+VD  +
Subjt:  ICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDWHSKKNRGL---IIKLDLEKAFDKVDWGF

Query:  LDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLF
        L   L+A  FG  +  ++    +SA   + IN      +   RG+RQG PLS  L+ L  +    LL       ++    + +  + V    +ADD +L 
Subjt:  LDAILRAKGFGLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLF

Query:  STMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPS---SYLGLPLGGN--PKSPLFWQPVVEKIHHKLHNWQYF--
        +  +   LE       ++  AS   IN+SKS   G+     +++ L   F  R  +W S    YLG+ L     P S  F + + E +  +L  W+ F  
Subjt:  STMNSQALENLFNLIHIFENASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPS---SYLGLPLGGN--PKSPLFWQPVVEKIHHKLHNWQYF--

Query:  FLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQ
         LS  GR  +I   +++   Y L       + +   ++ L DFLW G       H VS   + +P+  GG G+  I+ +  +   + I RYL  + S   
Subjt:  FLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQ

Query:  QVIKAKYY
          + + +Y
Subjt:  QVIKAKYY

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.2e-0730.89Show/hide
Query:  RLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDWHSKKN-RG-LIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGR--
        RLK ++ + I   Q +F+ GR   D  +   E +     KK  +G +++KLDLEKA+D++ W +L+  L + GF  +W   I+     A       GR  
Subjt:  RLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDWHSKKN-RG-LIIKLDLEKAFDKVDWGFLDAILRAKGFGLLWRKWISGCLSSANYSIIINGR--

Query:  --PRGKIIPSR-GIRQGDPLSPF
           R ++   R G R  D  +PF
Subjt:  --PRGKIIPSR-GIRQGDPLSPF

AT4G29090.1 Ribonuclease H-like superfamily protein6.9e-1726.38Show/hide
Query:  MPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPNWP
        +P Y ++ F +P    K    +L DF W   +   G+H  +W         GG+G  +I+  N +LL K +WR L    SL  +V K++Y+H     N P
Subjt:  MPIYFLSLFRMPTQTVKSFEKILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPNWP

Query:  MCADSRSFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDIDFATSFPLIYRLSNNAHAT------VADLWNSENNDWDLGLRRCLKDSEIADW
        +     SF   WK I     +++   R ++GNG  I  W   W  D   A++   + R+    +A+      V+DL +    +W   +   L   E+   
Subjt:  MCADSRSFKAPWKEISRLNIMVKPHVRRILGNGNHISFWHDIWATDIDFATSFPLIYRLSNNAHAT------VADLWNSENNDWDLGLRRCLKDSEIADW

Query:  ASLAHILQAFAP--KRTNDSWQWSLDPSKSYSVRS
             ++    P  +R  DS+ W    S  Y+V+S
Subjt:  ASLAHILQAFAP--KRTNDSWQWSLDPSKSYSVRS

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.1e-1351.47Show/hide
Query:  IINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDT
        IING P+G + PSRG+RQGDPLSP+LFIL ++ LS L       G++    +  +S  +NHL FADDT
Subjt:  IINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCTCATGGCACAATCGGTGGTCATTTTCGTCACTCTTCTTAGGGGTGATCATCGGTTGGTCGGAGTCGGTTTTGGACAAAAACCGACTCCGACCACCGACACATC
GGTTTTGATTGGTTGGTGGATGTCGTGGCTTCAGGAGGGTGATGAAAACACAAAATTTTTCCACCGCATTCTCTCAGCTCGTCGACGGAAAAATACAATCTCTGAATTGT
TATCACGAGAAGGTAATAGTCTCCTTACAGATAATGATATAGAGGCTGAGTTTCTGGATTTCTATAATGGCCTATTCACTATCGACAGTGTTATAAATGTAGCTCTTAAT
GAGACGTATATTTGTTTGATTCCAAAAAAAGTGGATGCTAGAACAGTGAATGATTACAGACCTATCAGTCTAATACCATGTGCATATAAGATCATTGCAAGAGTTTTATC
CAATCGTCTCAAGAAAGTCCTCCATTCTACCATATCTGAAAATCAAATGGCGTTTGTAGAAGGAAGGCAAATTTTAGATGCTTCTCTTATAGCAAATGAAATCATTGAAG
ATTGGCACTCAAAAAAAAATCGAGGTTTGATCATTAAACTAGATTTGGAGAAAGCTTTTGACAAGGTTGATTGGGGCTTCCTAGATGCTATTCTCCGAGCCAAAGGGTTC
GGTTTACTTTGGAGGAAATGGATTTCGGGTTGTTTATCTAGTGCAAACTATTCTATTATTATAAATGGAAGACCCCGAGGTAAAATCATTCCCTCAAGAGGCATTAGACA
AGGAGATCCGCTATCTCCTTTTCTATTTATTTTGGTTTCGGATTGCCTTAGTCGTCTTTTATCTCATGGAGCTCATTTGGGAAAGATCGCTTCTCACTGCATTGGCAAAT
CTTCTCTCACAGTCAATCATCTACAATTTGCTGATGATACATTATTATTCTCAACCATGAATTCGCAAGCCTTGGAAAATCTCTTTAATCTCATTCATATATTTGAAAAT
GCATCAGGTCTTAATATTAATTACAGCAAAAGTGAGCTTTTGGGAGTTCAAGTTTCTCAAGAAGAGATGAATGACTTGGCTAGAAAATTTGGATGTCGAGTGGGTACATG
GCCATCATCTTATCTTGGACTTCCATTAGGAGGCAACCCAAAAAGTCCTCTGTTTTGGCAACCAGTTGTGGAAAAGATTCATCATAAACTTCATAATTGGCAATATTTCT
TCCTCTCCAAAGGAGGTAGGCATACTCTTATACAATCCACTCTCTCCAATATGCCTATATATTTTTTATCTTTGTTCAGAATGCCAACTCAAACTGTAAAATCTTTTGAA
AAAATCCTAAGGGATTTTCTTTGGGAAGGTGCAAAAAGTAATGGTGGATTACACAATGTTAGTTGGGCCAAAACTCAAATGCCAATTGTTTTTGGAGGTTTAGGCATTGG
AAATATCAAGCAGAGAAATGAATCGCTTTTATCTAAATGGATTTGGCGATATTTATGTGAGGAAGGCTCTTTGTGGCAGCAGGTTATTAAAGCAAAATATTATCATCTAG
ATGCTTATCCAAATTGGCCTATGTGTGCTGACTCTCGGTCTTTCAAAGCTCCATGGAAGGAGATCTCAAGATTGAACATTATGGTAAAGCCACATGTTCGAAGAATTCTT
GGTAATGGTAATCATATTTCTTTTTGGCATGATATCTGGGCTACTGATATAGACTTTGCTACAAGTTTTCCCCTGATTTATAGGCTTTCTAATAATGCTCATGCAACGGT
GGCTGATTTATGGAATTCGGAAAATAATGATTGGGATTTGGGTCTCAGAAGATGTCTTAAAGACTCAGAGATTGCGGATTGGGCATCTCTTGCCCATATATTACAAGCTT
TTGCTCCAAAGAGGACGAATGATTCATGGCAATGGTCTTTAGATCCATCTAAAAGCTACTCGGTTCGATCTATGATGCATTTTTTGAAGTATTGGAACTTTATCTTGAAG
GAGTTTGGCTGGAATATGATTATGCCTGGATCTATGCAAGCTATTCTATCTGTGGTTTTTTCGGGCCATCCTTTCAAAGGAAATGCGGAAACTCTTTGGCTAGCCTTTAA
TCGTTCTTTCTTTTGGTCTTTATGGTGCGAAAGAAATGGAAGGATTTTCAGCGACACTCATTCATCTTTTGCCGACCTTCTGGATTTAGCTATTTTTAATGCTCTTTATT
GGTGTAAATGTTCACATCCTTTTAAAGATTATAGTCTTGATTTTTTGACTCTCAATTGGAAGTCCTTTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATCTCATGGCACAATCGGTGGTCATTTTCGTCACTCTTCTTAGGGGTGATCATCGGTTGGTCGGAGTCGGTTTTGGACAAAAACCGACTCCGACCACCGACACATC
GGTTTTGATTGGTTGGTGGATGTCGTGGCTTCAGGAGGGTGATGAAAACACAAAATTTTTCCACCGCATTCTCTCAGCTCGTCGACGGAAAAATACAATCTCTGAATTGT
TATCACGAGAAGGTAATAGTCTCCTTACAGATAATGATATAGAGGCTGAGTTTCTGGATTTCTATAATGGCCTATTCACTATCGACAGTGTTATAAATGTAGCTCTTAAT
GAGACGTATATTTGTTTGATTCCAAAAAAAGTGGATGCTAGAACAGTGAATGATTACAGACCTATCAGTCTAATACCATGTGCATATAAGATCATTGCAAGAGTTTTATC
CAATCGTCTCAAGAAAGTCCTCCATTCTACCATATCTGAAAATCAAATGGCGTTTGTAGAAGGAAGGCAAATTTTAGATGCTTCTCTTATAGCAAATGAAATCATTGAAG
ATTGGCACTCAAAAAAAAATCGAGGTTTGATCATTAAACTAGATTTGGAGAAAGCTTTTGACAAGGTTGATTGGGGCTTCCTAGATGCTATTCTCCGAGCCAAAGGGTTC
GGTTTACTTTGGAGGAAATGGATTTCGGGTTGTTTATCTAGTGCAAACTATTCTATTATTATAAATGGAAGACCCCGAGGTAAAATCATTCCCTCAAGAGGCATTAGACA
AGGAGATCCGCTATCTCCTTTTCTATTTATTTTGGTTTCGGATTGCCTTAGTCGTCTTTTATCTCATGGAGCTCATTTGGGAAAGATCGCTTCTCACTGCATTGGCAAAT
CTTCTCTCACAGTCAATCATCTACAATTTGCTGATGATACATTATTATTCTCAACCATGAATTCGCAAGCCTTGGAAAATCTCTTTAATCTCATTCATATATTTGAAAAT
GCATCAGGTCTTAATATTAATTACAGCAAAAGTGAGCTTTTGGGAGTTCAAGTTTCTCAAGAAGAGATGAATGACTTGGCTAGAAAATTTGGATGTCGAGTGGGTACATG
GCCATCATCTTATCTTGGACTTCCATTAGGAGGCAACCCAAAAAGTCCTCTGTTTTGGCAACCAGTTGTGGAAAAGATTCATCATAAACTTCATAATTGGCAATATTTCT
TCCTCTCCAAAGGAGGTAGGCATACTCTTATACAATCCACTCTCTCCAATATGCCTATATATTTTTTATCTTTGTTCAGAATGCCAACTCAAACTGTAAAATCTTTTGAA
AAAATCCTAAGGGATTTTCTTTGGGAAGGTGCAAAAAGTAATGGTGGATTACACAATGTTAGTTGGGCCAAAACTCAAATGCCAATTGTTTTTGGAGGTTTAGGCATTGG
AAATATCAAGCAGAGAAATGAATCGCTTTTATCTAAATGGATTTGGCGATATTTATGTGAGGAAGGCTCTTTGTGGCAGCAGGTTATTAAAGCAAAATATTATCATCTAG
ATGCTTATCCAAATTGGCCTATGTGTGCTGACTCTCGGTCTTTCAAAGCTCCATGGAAGGAGATCTCAAGATTGAACATTATGGTAAAGCCACATGTTCGAAGAATTCTT
GGTAATGGTAATCATATTTCTTTTTGGCATGATATCTGGGCTACTGATATAGACTTTGCTACAAGTTTTCCCCTGATTTATAGGCTTTCTAATAATGCTCATGCAACGGT
GGCTGATTTATGGAATTCGGAAAATAATGATTGGGATTTGGGTCTCAGAAGATGTCTTAAAGACTCAGAGATTGCGGATTGGGCATCTCTTGCCCATATATTACAAGCTT
TTGCTCCAAAGAGGACGAATGATTCATGGCAATGGTCTTTAGATCCATCTAAAAGCTACTCGGTTCGATCTATGATGCATTTTTTGAAGTATTGGAACTTTATCTTGAAG
GAGTTTGGCTGGAATATGATTATGCCTGGATCTATGCAAGCTATTCTATCTGTGGTTTTTTCGGGCCATCCTTTCAAAGGAAATGCGGAAACTCTTTGGCTAGCCTTTAA
TCGTTCTTTCTTTTGGTCTTTATGGTGCGAAAGAAATGGAAGGATTTTCAGCGACACTCATTCATCTTTTGCCGACCTTCTGGATTTAGCTATTTTTAATGCTCTTTATT
GGTGTAAATGTTCACATCCTTTTAAAGATTATAGTCTTGATTTTTTGACTCTCAATTGGAAGTCCTTTTTGTAA
Protein sequenceShow/hide protein sequence
MYLMAQSVVIFVTLLRGDHRLVGVGFGQKPTPTTDTSVLIGWWMSWLQEGDENTKFFHRILSARRRKNTISELLSREGNSLLTDNDIEAEFLDFYNGLFTIDSVINVALN
ETYICLIPKKVDARTVNDYRPISLIPCAYKIIARVLSNRLKKVLHSTISENQMAFVEGRQILDASLIANEIIEDWHSKKNRGLIIKLDLEKAFDKVDWGFLDAILRAKGF
GLLWRKWISGCLSSANYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHGAHLGKIASHCIGKSSLTVNHLQFADDTLLFSTMNSQALENLFNLIHIFEN
ASGLNINYSKSELLGVQVSQEEMNDLARKFGCRVGTWPSSYLGLPLGGNPKSPLFWQPVVEKIHHKLHNWQYFFLSKGGRHTLIQSTLSNMPIYFLSLFRMPTQTVKSFE
KILRDFLWEGAKSNGGLHNVSWAKTQMPIVFGGLGIGNIKQRNESLLSKWIWRYLCEEGSLWQQVIKAKYYHLDAYPNWPMCADSRSFKAPWKEISRLNIMVKPHVRRIL
GNGNHISFWHDIWATDIDFATSFPLIYRLSNNAHATVADLWNSENNDWDLGLRRCLKDSEIADWASLAHILQAFAPKRTNDSWQWSLDPSKSYSVRSMMHFLKYWNFILK
EFGWNMIMPGSMQAILSVVFSGHPFKGNAETLWLAFNRSFFWSLWCERNGRIFSDTHSSFADLLDLAIFNALYWCKCSHPFKDYSLDFLTLNWKSFL