CuGenDBv2

CsGy5G015745 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy5G015745
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Reverse transcriptase domain-containing protein
Genome location: Gy14Chr5:21931282..21935484
RNA-Seq Expression: CsGy5G015745
Synteny: CsGy5G015745
Gene Ontology terms: GO:0050789 - regulation of biological process (biological process)
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR026960 - Reverse transcriptase zinc-binding domain
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
                  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera] | e value: 9.30e-292 | % identity: 39.78
Query:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR
        P+    R  FW+ELS + GL +  WCVGGDFNV+R  +EK  G+R T SM  F++ I + +L+D+P R+  F+WS   V     +LDRFL S  W + F 
Subjt:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR

Query:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF
        +     LPR TSDH+PI+L+     WGPTPFRFEN WL H  F +N   WW   + +GW    FM KL+ +KA LK WNK +FG +  +K+ ++  + +F
Subjt:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF

Query:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV
        DSLE+   L+   + +R   +G L +LI +E+  W QK+++ W++EG+ NS FFH+  + R+++  I  L + +G  +   + I +EIL +F  LY +  
Subjt:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV

Query:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS
           +  +GL+W  +S + +  LE+PFTE+EI +A+F M   K+ GPDG T   ++  W ++K DLV+VF +F ++G+IN+  N ++I L+PKK  + R+S
Subjt:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS

Query:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW
        DFRPISLITSLYKII+KVLA R++ VL   I+ +Q AFV+GRQILDA+L A+E+VDE    G +GV+ K+D EKAYD V W FLD ++++KGFG RWRKW
Subjt:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW

Query:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI
        + GCLS+ +F+++VNG  +G + A RG+RQ DPL+PFLFTIV D L+ ++    E+  L GF        ++HLQ+ADDT+ FSS  + ++     V+ +
Subjt:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI

Query:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS
        F   +GL +N  K+++ GINL  + L+  +E   C     P  YLG  +G        W+ + ER   + D W+   LS GGR+TL+QS L  +PCY  S
Subjt:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS

Query:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS
        L + P  +  ++E+M R F+W+G       HLVNW+    P   GGLG G    +N+ALL KW WR+ +E ++LW ++I++IYG   NGW   N  R   
Subjt:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS

Query:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAAS---ALEILH--
           W  I    + F  F+ FV+G G +I+FW D W   + L  ++P L  +  +K A ++     T   SWN   RRN+ D+EI +      +L+ LH  
Subjt:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAAS---ALEILH--

Query:  SWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLF
        S  P +R+    W  + +G FT KS FL L++ S +  +   + +W  ++P KVK F+W +A++ +NT++ LQ +     LSP +C LC K  E +DHLF
Subjt:  SWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLF

Query:  LHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNY
        LHC  T    + LF    ++   P  I   +    N  G+S +G +LW+ A  +++W +W+ERN+RIF+D+  + +  W  +    S+W+   +K F   
Subjt:  LHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNY

Query:  SLSMIFNNWKA
         L+M+  +W A
Subjt:  SLSMIFNNWKA

CAN68165.1 hypothetical protein VITISV_008538 [Vitis vinifera] | e value: 1.04e-287 | % identity: 39.63
Query:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR
        P+S   R  FW+ELS ++GL +  WCVGGDFNV+R  +EK  G R T SM   ++ I E +L+D P R+  F+WS     P   +LDRFL S  W + F 
Subjt:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR

Query:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF
        +   E LPR TSDH+PI+L+     WGPTPFRFEN WL H  F +    WW   + DGW    FM KL+ LKA LK WNK  FG++  +K+ ++  I +F
Subjt:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF

Query:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV
        DS+E+   L+   + +R   +G L +LI +E+  W QK+++ W++EG+ NS  FH+  + R+++  I  L +  G  L     I +EIL +F  LY +  
Subjt:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV

Query:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS
           +  +GL+W  +S + ++ LE+PFTE+EI +A+F M    + GPDG T   ++  W+++K DLVRVF +F ++G+IN+  N ++I L+PKK  A ++S
Subjt:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS

Query:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW
        ++RPISLITSLYKII+KVLA RL+ +L   I+ +Q AFV+GRQILDA+L A+E+VDE    G +GV+ K+D EKAYD V W FLD +M+ KGF    RKW
Subjt:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW

Query:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI
        I  CLS+ +F+I+VNG  +G +   RG+RQ DPL+PFLFTIV D  + ++    E+    GF        ++HLQ+ADDT+ FSS  + +L     V+ +
Subjt:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI

Query:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS
        F   +GL +N  K+++ GINL  D L   +E   C     P  YLG  +G        W+ + ER   + D W+   LS GGR+TL+QS L  +PCY  S
Subjt:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS

Query:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS
        L + P  +  R+E++ R F+W+G       HLV+W+        GGLG+G    +N ALL KW WR+ +E ++LW ++I++IYG   NGW      R   
Subjt:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS

Query:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAASALEIL-HSWAP
           W  I +  + F  F+ F++G G +I+FW+D W   ++L  +FP L  + ++K   ++     T   SWN   RRN+ D+EI    S ++ L H    
Subjt:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAASALEIL-HSWAP

Query:  TERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP
            D   W  + +G FT KS FL L++ S   +V   + +W +++P K+KFF+W +A++ +NT++ LQ +     LSP +C LC +  E +DHLFLHC 
Subjt:  TERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP

Query:  FTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWW
         T    + LF +  ++   P  +   +    N  G S +G +LW+ A  ++LW +W+ERN+RIF+D+  + ++ W ++   AS W
Subjt:  FTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWW

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 4.14e-287 | % identity: 40.65
Query:  RDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFREVSVER
        R  FW+EL  LYGL    WCVGGDFNV+R ++EK   TR T +M  F+  I E  L+D P RN  F+WS     P   +LDRFL S  W  FF +   E 
Subjt:  RDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFREVSVER

Query:  LPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSFDSLEES
        LPR TSDH PI L+     WGPTPFRFEN WL H  F +    WW     +GW    FM KLK +K+ LK WN  TFG++  +K++++  ++  D +E+ 
Subjt:  LPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSFDSLEES

Query:  SCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRVSSPFIC
          LN   V ER   R  L D++ KE+  W QKS++ W++EG+ NS FFHR  + R+S+  I SL+S  G+TL   ++I +EI++FF  LY   V   +  
Subjt:  SCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRVSSPFIC

Query:  DGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVSDFRPIS
        +G++W  +S +    L+ PFTE+E+R AVF +   K+ GPDG T   Y++ W+++K DL+RVF +F  NGVIN+  N T+I L+PKK ++ ++SD+RPIS
Subjt:  DGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVSDFRPIS

Query:  LITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKWIWGCLS
        L+TSLYKII+KVL+ RL+KVL   I+DSQ AFVEGR ILDA+L A+EVVDE    G +G++ K+D EKAYD VDW FLD +++ KGF ++WR WI GCLS
Subjt:  LITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKWIWGCLS

Query:  TTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNIFLVRAG
        +++F+I+VNG  +G + A RG+RQ DPL+PFLFT+V D L+ ++    E     GF        ++ LQ+ADDT+ FS     +L+N   ++ +F   +G
Subjt:  TTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNIFLVRAG

Query:  LSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSLAQAPV
        L +N  K+++ GIN   + L+  +    C V   P  YLG  +G        W+ + ER   + D W+   LS GGR+TL+QS L+ +P Y  SL + P 
Subjt:  LSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSLAQAPV

Query:  GIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKSHRLWAG
         I +++E+M R F+W+G       HLV WE  S P   GGLG G    +NIALL KW WRF +E + LW ++I +IYG   NGW      R      W  
Subjt:  GIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKSHRLWAG

Query:  ILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCWCTATH-SWNLGLRRNMLDNEI------ANAASALEILHSWAPT
        I +  + F  F   V+G G +I+FW+D W   ++L  +F +L+ +   K   V++    +   +WNL  RRN+ D+EI       ++ S++    S A  
Subjt:  ILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCWCTATH-SWNLGLRRNMLDNEI------ANAASALEILHSWAPT

Query:  ERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCPF
           DS  W  + +G FT KS FL L+K S  I     + +W +K+P KVK   W +A+  +NT++KLQ +     L P  C LC  + E +DHLFLHCP 
Subjt:  ERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCPF

Query:  TRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNYSLSMI
        T      LF +  L+   P   +  ++      G S +G  LW+ A  +L+W +W+ERN RIF+D+  S ++ W ++   ++ W++  +  F    L++I
Subjt:  TRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNYSLSMI

Query:  FNNWKAI
          NW  +
Subjt:  FNNWKAI

RVW70234.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 2.99e-288 | % identity: 38.79
Query:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR
        P++   R +FW+E+  L+GL   +WCVGGDFNV+R   EK  G+  T SM  F+  I E +L D P RN  F+WS     P   +LDRFL S  W   F 
Subjt:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR

Query:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF
        +   E LPR TSDH+PI+L      WGPTPFRFEN WL HH F ++  +WW   E +GW    FM KL+ +KA LK WNK TFG +  +K+ ++D+I + 
Subjt:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF

Query:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV
        D++E+   L      +R   +G L +LI +E+  W QK+K+ W++EG+ NS  FH+  + R++K+ +  L +  G  L + + I +EIL +F  LY +  
Subjt:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV

Query:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS
           +  +G++W  +S + ++ L++ F E EI  A+F +   K+ GPDG T   ++  W+++K DLVRVF +F  +G+IN+  N ++I L+PKK ++ ++S
Subjt:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS

Query:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW
        DFRPISLIT LYKII+KVL+ RL+ VL   I+ +Q AFV+GRQILDA+L A+E+VDE    G +GV+ K+D EKAYD V W FLD +++ KGF  +WR W
Subjt:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW

Query:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI
        + GCLS+ +++I+VNG  +G + A RG+RQ DPL+PFLFTIV D L+ ++    E+  L GF        +THLQ+ADDT+LF++  +  L+    ++ +
Subjt:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI

Query:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS
        F   +GL +N  K++L GINL  + L+  +    C   + P  YLG  +G        W+ + ER   + D W+    S GGR+TL+ S L+ +P Y  S
Subjt:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS

Query:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS
        L + P  +  ++E+M R F+W+G       HLV WE    P   GGLG G    +N ALL KW WRF +E TSLW ++I++IYG   NGW      R   
Subjt:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS

Query:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCWCTAT-HSWNLGLRRNMLDNEIANAASALEILHSW-AP
           W  I +  + F  ++ F++G G +I+FW+D W   + L +++P LF + ++K   +   + ++   SWN   RRN+ D+EI +    +  L      
Subjt:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCWCTAT-HSWNLGLRRNMLDNEIANAASALEILHSW-AP

Query:  TERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP
        T  +D+  W    +G FT KS F  L++   +      + +WK+++P KVK F+W + ++ +NT++ LQ +  +  +SP +C LC +  E  DH+FLHC 
Subjt:  TERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP

Query:  FTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGY--SPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWS
         T    + LF +  ++   P  I   M   +N++G+  S +G ILW+ A+ +L+W +W ERN+RIF+D+  +  + W  +   AS+W+
Subjt:  FTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGY--SPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWS

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 1.13e-291 | % identity: 39.87
Query:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR
        P++   R   W+ELS + GL +  WCVGGDFNV+R  +EK  G+R T SM  F++ I + +L+D+P R+  F+WS   V P   +LDRFL S  W + F 
Subjt:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR

Query:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF
        +     LPR TSDH+PI+L+     WGPTPFRFEN WL H  F +N   WW   + +GW    FM KL+ +KA LK WNK +FG +  +K+ ++  + +F
Subjt:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF

Query:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV
        DSLE+   L+   + +R   +G L +LI +E+  W QK+++ W++EG+ NS FFH+  + R+++  I  L + +G+ +   + I +EIL +F  LY +  
Subjt:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV

Query:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS
           +  +GL+W  +S + +  LE+PFTE+EI +A+F M   K+ GPDG T   ++  W ++K DLV+VF +F ++G+IN+  N ++I L+PKK  + R+S
Subjt:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS

Query:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW
        DFRPISLITSLYKII+KVLA R+++VL   I+ +Q AFV+GRQILDA+L A+E+VDE    G +GV+ K+D EKAYD V W FLD +M++KGFG RWRKW
Subjt:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW

Query:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI
        + GCLS+ +F+++VNG  +G + A RG+RQ DPL+PFLFTIV D L+ ++    E+  L GF        ++HLQ+ADDT+ FSS  + ++     V+ +
Subjt:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI

Query:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS
        F   +GL +N  K+++ GINL  + L+  +E   C     P  YLG  +G        W+ + ER   + D W+   LS GGR+TL+QS L  +PCY  S
Subjt:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS

Query:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS
        L + P  +  ++E+M R F+W+G       HLVNW+    P   GGLG G    +N+ALL KW WR+ +E ++LW ++I++IYG   NGW   N  R   
Subjt:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS

Query:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAASALEI---LH--
           W  I    + F  F+ FV+G G +I+FW D W   + L  ++P L  +  +K A ++     T   SWN   RRN+ D+EI +    ++    LH  
Subjt:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAASALEI---LH--

Query:  SWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLF
        S  P +R+    W  + +G FT KS FL L++ S +  +   + +W  ++P KVK F+W +A++ +NT++ LQ +     LSP +C LC K  E +DHLF
Subjt:  SWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLF

Query:  LHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNY
        LHC  T    + LF    ++   P  I   +    N  G+S +G +LW+ A  +L+W +W+ERN+RIF+D+  + +  W  +    S+W+   +K F   
Subjt:  LHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNY

Query:  SLSMIFNNWKA
         L+M+  +W A
Subjt:  SLSMIFNNWKA

TrEMBL top hits (e value, % identity, alignment)
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein | e value: 2.00e-287 | % identity: 40.65
Query:  RDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFREVSVER
        R  FW+EL  LYGL    WCVGGDFNV+R ++EK   TR T +M  F+  I E  L+D P RN  F+WS     P   +LDRFL S  W  FF +   E 
Subjt:  RDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFREVSVER

Query:  LPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSFDSLEES
        LPR TSDH PI L+     WGPTPFRFEN WL H  F +    WW     +GW    FM KLK +K+ LK WN  TFG++  +K++++  ++  D +E+ 
Subjt:  LPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSFDSLEES

Query:  SCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRVSSPFIC
          LN   V ER   R  L D++ KE+  W QKS++ W++EG+ NS FFHR  + R+S+  I SL+S  G+TL   ++I +EI++FF  LY   V   +  
Subjt:  SCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRVSSPFIC

Query:  DGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVSDFRPIS
        +G++W  +S +    L+ PFTE+E+R AVF +   K+ GPDG T   Y++ W+++K DL+RVF +F  NGVIN+  N T+I L+PKK ++ ++SD+RPIS
Subjt:  DGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVSDFRPIS

Query:  LITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKWIWGCLS
        L+TSLYKII+KVL+ RL+KVL   I+DSQ AFVEGR ILDA+L A+EVVDE    G +G++ K+D EKAYD VDW FLD +++ KGF ++WR WI GCLS
Subjt:  LITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKWIWGCLS

Query:  TTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNIFLVRAG
        +++F+I+VNG  +G + A RG+RQ DPL+PFLFT+V D L+ ++    E     GF        ++ LQ+ADDT+ FS     +L+N   ++ +F   +G
Subjt:  TTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNIFLVRAG

Query:  LSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSLAQAPV
        L +N  K+++ GIN   + L+  +    C V   P  YLG  +G        W+ + ER   + D W+   LS GGR+TL+QS L+ +P Y  SL + P 
Subjt:  LSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSLAQAPV

Query:  GIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKSHRLWAG
         I +++E+M R F+W+G       HLV WE  S P   GGLG G    +NIALL KW WRF +E + LW ++I +IYG   NGW      R      W  
Subjt:  GIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKSHRLWAG

Query:  ILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCWCTATH-SWNLGLRRNMLDNEI------ANAASALEILHSWAPT
        I +  + F  F   V+G G +I+FW+D W   ++L  +F +L+ +   K   V++    +   +WNL  RRN+ D+EI       ++ S++    S A  
Subjt:  ILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCWCTATH-SWNLGLRRNMLDNEI------ANAASALEILHSWAPT

Query:  ERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCPF
           DS  W  + +G FT KS FL L+K S  I     + +W +K+P KVK   W +A+  +NT++KLQ +     L P  C LC  + E +DHLFLHCP 
Subjt:  ERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCPF

Query:  TRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNYSLSMI
        T      LF +  L+   P   +  ++      G S +G  LW+ A  +L+W +W+ERN RIF+D+  S ++ W ++   ++ W++  +  F    L++I
Subjt:  TRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNYSLSMI

Query:  FNNWKAI
          NW  +
Subjt:  FNNWKAI

A0A438GDE7 LINE-1 retrotransposable element ORF2 protein | e value: 5.46e-292 | % identity: 39.87
Query:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR
        P++   R   W+ELS + GL +  WCVGGDFNV+R  +EK  G+R T SM  F++ I + +L+D+P R+  F+WS   V P   +LDRFL S  W + F 
Subjt:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR

Query:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF
        +     LPR TSDH+PI+L+     WGPTPFRFEN WL H  F +N   WW   + +GW    FM KL+ +KA LK WNK +FG +  +K+ ++  + +F
Subjt:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF

Query:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV
        DSLE+   L+   + +R   +G L +LI +E+  W QK+++ W++EG+ NS FFH+  + R+++  I  L + +G+ +   + I +EIL +F  LY +  
Subjt:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV

Query:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS
           +  +GL+W  +S + +  LE+PFTE+EI +A+F M   K+ GPDG T   ++  W ++K DLV+VF +F ++G+IN+  N ++I L+PKK  + R+S
Subjt:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS

Query:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW
        DFRPISLITSLYKII+KVLA R+++VL   I+ +Q AFV+GRQILDA+L A+E+VDE    G +GV+ K+D EKAYD V W FLD +M++KGFG RWRKW
Subjt:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW

Query:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI
        + GCLS+ +F+++VNG  +G + A RG+RQ DPL+PFLFTIV D L+ ++    E+  L GF        ++HLQ+ADDT+ FSS  + ++     V+ +
Subjt:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI

Query:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS
        F   +GL +N  K+++ GINL  + L+  +E   C     P  YLG  +G        W+ + ER   + D W+   LS GGR+TL+QS L  +PCY  S
Subjt:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS

Query:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS
        L + P  +  ++E+M R F+W+G       HLVNW+    P   GGLG G    +N+ALL KW WR+ +E ++LW ++I++IYG   NGW   N  R   
Subjt:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS

Query:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAASALEI---LH--
           W  I    + F  F+ FV+G G +I+FW D W   + L  ++P L  +  +K A ++     T   SWN   RRN+ D+EI +    ++    LH  
Subjt:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAASALEI---LH--

Query:  SWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLF
        S  P +R+    W  + +G FT KS FL L++ S +  +   + +W  ++P KVK F+W +A++ +NT++ LQ +     LSP +C LC K  E +DHLF
Subjt:  SWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLF

Query:  LHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNY
        LHC  T    + LF    ++   P  I   +    N  G+S +G +LW+ A  +L+W +W+ERN+RIF+D+  + +  W  +    S+W+   +K F   
Subjt:  LHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNY

Query:  SLSMIFNNWKA
         L+M+  +W A
Subjt:  SLSMIFNNWKA

A0A438GDF3 LINE-1 retrotransposable element ORF2 protein | e value: 1.45e-288 | % identity: 38.79
Query:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR
        P++   R +FW+E+  L+GL   +WCVGGDFNV+R   EK  G+  T SM  F+  I E +L D P RN  F+WS     P   +LDRFL S  W   F 
Subjt:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR

Query:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF
        +   E LPR TSDH+PI+L      WGPTPFRFEN WL HH F ++  +WW   E +GW    FM KL+ +KA LK WNK TFG +  +K+ ++D+I + 
Subjt:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF

Query:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV
        D++E+   L      +R   +G L +LI +E+  W QK+K+ W++EG+ NS  FH+  + R++K+ +  L +  G  L + + I +EIL +F  LY +  
Subjt:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV

Query:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS
           +  +G++W  +S + ++ L++ F E EI  A+F +   K+ GPDG T   ++  W+++K DLVRVF +F  +G+IN+  N ++I L+PKK ++ ++S
Subjt:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS

Query:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW
        DFRPISLIT LYKII+KVL+ RL+ VL   I+ +Q AFV+GRQILDA+L A+E+VDE    G +GV+ K+D EKAYD V W FLD +++ KGF  +WR W
Subjt:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW

Query:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI
        + GCLS+ +++I+VNG  +G + A RG+RQ DPL+PFLFTIV D L+ ++    E+  L GF        +THLQ+ADDT+LF++  +  L+    ++ +
Subjt:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI

Query:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS
        F   +GL +N  K++L GINL  + L+  +    C   + P  YLG  +G        W+ + ER   + D W+    S GGR+TL+ S L+ +P Y  S
Subjt:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS

Query:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS
        L + P  +  ++E+M R F+W+G       HLV WE    P   GGLG G    +N ALL KW WRF +E TSLW ++I++IYG   NGW      R   
Subjt:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS

Query:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCWCTAT-HSWNLGLRRNMLDNEIANAASALEILHSW-AP
           W  I +  + F  ++ F++G G +I+FW+D W   + L +++P LF + ++K   +   + ++   SWN   RRN+ D+EI +    +  L      
Subjt:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCWCTAT-HSWNLGLRRNMLDNEIANAASALEILHSW-AP

Query:  TERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP
        T  +D+  W    +G FT KS F  L++   +      + +WK+++P KVK F+W + ++ +NT++ LQ +  +  +SP +C LC +  E  DH+FLHC 
Subjt:  TERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP

Query:  FTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGY--SPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWS
         T    + LF +  ++   P  I   M   +N++G+  S +G ILW+ A+ +L+W +W ERN+RIF+D+  +  + W  +   AS+W+
Subjt:  FTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGY--SPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWS

A5BCI7 Reverse transcriptase domain-containing protein | e value: 4.50e-292 | % identity: 39.78
Query:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR
        P+    R  FW+ELS + GL +  WCVGGDFNV+R  +EK  G+R T SM  F++ I + +L+D+P R+  F+WS   V     +LDRFL S  W + F 
Subjt:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR

Query:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF
        +     LPR TSDH+PI+L+     WGPTPFRFEN WL H  F +N   WW   + +GW    FM KL+ +KA LK WNK +FG +  +K+ ++  + +F
Subjt:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF

Query:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV
        DSLE+   L+   + +R   +G L +LI +E+  W QK+++ W++EG+ NS FFH+  + R+++  I  L + +G  +   + I +EIL +F  LY +  
Subjt:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV

Query:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS
           +  +GL+W  +S + +  LE+PFTE+EI +A+F M   K+ GPDG T   ++  W ++K DLV+VF +F ++G+IN+  N ++I L+PKK  + R+S
Subjt:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS

Query:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW
        DFRPISLITSLYKII+KVLA R++ VL   I+ +Q AFV+GRQILDA+L A+E+VDE    G +GV+ K+D EKAYD V W FLD ++++KGFG RWRKW
Subjt:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW

Query:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI
        + GCLS+ +F+++VNG  +G + A RG+RQ DPL+PFLFTIV D L+ ++    E+  L GF        ++HLQ+ADDT+ FSS  + ++     V+ +
Subjt:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI

Query:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS
        F   +GL +N  K+++ GINL  + L+  +E   C     P  YLG  +G        W+ + ER   + D W+   LS GGR+TL+QS L  +PCY  S
Subjt:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS

Query:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS
        L + P  +  ++E+M R F+W+G       HLVNW+    P   GGLG G    +N+ALL KW WR+ +E ++LW ++I++IYG   NGW   N  R   
Subjt:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS

Query:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAAS---ALEILH--
           W  I    + F  F+ FV+G G +I+FW D W   + L  ++P L  +  +K A ++     T   SWN   RRN+ D+EI +      +L+ LH  
Subjt:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAAS---ALEILH--

Query:  SWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLF
        S  P +R+    W  + +G FT KS FL L++ S +  +   + +W  ++P KVK F+W +A++ +NT++ LQ +     LSP +C LC K  E +DHLF
Subjt:  SWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLF

Query:  LHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNY
        LHC  T    + LF    ++   P  I   +    N  G+S +G +LW+ A  +++W +W+ERN+RIF+D+  + +  W  +    S+W+   +K F   
Subjt:  LHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCNY

Query:  SLSMIFNNWKA
         L+M+  +W A
Subjt:  SLSMIFNNWKA
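
The unlabeled middle line of each alignment block marks identical residues with a letter and conservative substitutions with "+". As a minimal sketch, assuming that BLAST-style convention holds for this listing, the block above can be tallied as follows. The function name `midline_stats` is hypothetical, and the result covers a single block only, so it is not expected to equal the whole-alignment %identity reported for the hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one alignment block.

    `query` is the residue string from a 'Query:' line and `midline`
    is the match line beneath it, both with the 8-character label
    prefix removed.
    """
    # Trailing spaces are often trimmed in text dumps, so pad the match line.
    midline = midline.ljust(len(query))
    identities = sum(1 for c in midline if c.isalpha())  # letter = identical residue
    conservative = midline.count("+")                    # '+' = similar residue
    aligned = len(query)                                 # includes any gap columns
    return {
        "aligned": aligned,
        "identities": identities,
        "positives": identities + conservative,
        "pct_identity": round(100.0 * identities / aligned, 2),
    }


# Last block of the A5BCI7 alignment, copied from the listing above.
stats = midline_stats("SLSMIFNNWKA", " L+M+  +W A")
print(stats)
```

Summing `identities` and `aligned` over every block of a hit, then dividing once, would give a figure comparable to the %identity column.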

A5BPI6 Uncharacterized protein | 5.04e-288 | 39.63
Query:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR
        P+S   R  FW+ELS ++GL +  WCVGGDFNV+R  +EK  G R T SM   ++ I E +L+D P R+  F+WS     P   +LDRFL S  W + F 
Subjt:  PSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFR

Query:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF
        +   E LPR TSDH+PI+L+     WGPTPFRFEN WL H  F +    WW   + DGW    FM KL+ LKA LK WNK  FG++  +K+ ++  I +F
Subjt:  EVSVERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSF

Query:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV
        DS+E+   L+   + +R   +G L +LI +E+  W QK+++ W++EG+ NS  FH+  + R+++  I  L +  G  L     I +EIL +F  LY +  
Subjt:  DSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRV

Query:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS
           +  +GL+W  +S + ++ LE+PFTE+EI +A+F M    + GPDG T   ++  W+++K DLVRVF +F ++G+IN+  N ++I L+PKK  A ++S
Subjt:  SSPFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVS

Query:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW
        ++RPISLITSLYKII+KVLA RL+ +L   I+ +Q AFV+GRQILDA+L A+E+VDE    G +GV+ K+D EKAYD V W FLD +M+ KGF    RKW
Subjt:  DFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKW

Query:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI
        I  CLS+ +F+I+VNG  +G +   RG+RQ DPL+PFLFTIV D  + ++    E+    GF        ++HLQ+ADDT+ FSS  + +L     V+ +
Subjt:  IWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNI

Query:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS
        F   +GL +N  K+++ GINL  D L   +E   C     P  YLG  +G        W+ + ER   + D W+   LS GGR+TL+QS L  +PCY  S
Subjt:  FLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFS

Query:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS
        L + P  +  R+E++ R F+W+G       HLV+W+        GGLG+G    +N ALL KW WR+ +E ++LW ++I++IYG   NGW      R   
Subjt:  LAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKS

Query:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAASALEIL-HSWAP
           W  I +  + F  F+ F++G G +I+FW+D W   ++L  +FP L  + ++K   ++     T   SWN   RRN+ D+EI    S ++ L H    
Subjt:  HRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCW-CTATHSWNLGLRRNMLDNEIANAASALEIL-HSWAP

Query:  TERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP
            D   W  + +G FT KS FL L++ S   +V   + +W +++P K+KFF+W +A++ +NT++ LQ +     LSP +C LC +  E +DHLFLHC 
Subjt:  TERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP

Query:  FTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWW
         T    + LF +  ++   P  +   +    N  G S +G +LW+ A  ++LW +W+ERN+RIF+D+  + ++ W ++   AS W
Subjt:  FTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWW

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 8.3e-48 | 22.11
Query:  LSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDI----PFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFREVSVERLPR
        LS L    + +  + GDFN    + ++S+  +  +     N+ + + DL+DI      ++  +++  S      SK+D  + SK  +   +   +  +  
Subjt:  LSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDI----PFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFREVSVERLPR

Query:  TTSDHFPIILKM--------GAHSWGPTPFRFENAWLDHH------LFFKNVENWWGSLEADGWPIFSFM--EKLKGLKAILKSWNKETFGNIFSQKQVL
          SDH  I L++         + +W        + W+ +       +FF+  EN   +   + W  F  +   K   L A  +   +     + SQ + L
Subjt:  TTSDHFPIILKM--------GAHSWGPTPFRFENAWLDHH------LFFKNVENWWGSLEADGWPIFSFM--EKLKGLKAILKSWNKETFGNIFSQKQVL

Query:  IDKINSFDSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFS
                  +E +    +  +E    R  L ++  ++    I +S+  +     +      R +  ++ K+ I ++ +  G       EI   I  ++ 
Subjt:  IDKINSFDSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFS

Query:  MLYGTRVSS----PFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICL
         LY  ++ +        D      L+ ++   L  P T  EI   +  +   KS GPDG T EFY++    L P L+++FQ   K G++     E  I L
Subjt:  MLYGTRVSS----PFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICL

Query:  IPKK-KEAGRVSDFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMI
        IPK  ++  +  +FRPISL+    KI++K+LA+R+++ +  +I+  Q+ F+ G Q    I  +  V+   +  + +  V++ +D EKA+DK+   F+   
Subjt:  IPKK-KEAGRVSDFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMI

Query:  MKLKGFGKRWRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWE
        +   G    + K I         +II+NG+       K G RQ  PL+P LF IV   L  L     +++ ++G       E++    +ADD +++    
Subjt:  MKLKGFGKRWRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWE

Query:  DGNLENWWKVVNIFLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNR--KEMWNNLEERFRHKFDRWRNVSLSKGGRLT
          + +N  K+++ F   +G  +N  K+     N +    +        ++ +   KYLG  + R      KE +  L +  +   ++W+N+  S  GR+ 
Subjt:  DGNLENWWKVVNIFLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNR--KEMWNNLEERFRHKFDRWRNVSLSKGGRLT

Query:  LVQSVLNSLPCYLFSL--AQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTK--WFWRFSKEETSLWRR
        +V+  +     Y F+    + P+     LE+   KF+W     N     +     S     GG+ +  F+    A +TK  W+W +   +   W R
Subjt:  LVQSVLNSLPCYLFSL--AQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTK--WFWRFSKEETSLWRR

P08548 LINE-1 reverse transcriptase homolog | 1.8e-55 | 24.09
Query:  LSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDI--PFRNGRFSWS-RSGVRPAASKLDRFLLSKPWMEFFREVSVERLPRT
        L+ +  L +    V GDFN    + ++SS  + ++ +L  N+ I+ LDL DI   F   +  ++  S      SK+D  L  K  +  F+++ +  +P  
Subjt:  LSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDI--PFRNGRFSWS-RSGVRPAASKLDRFLLSKPWMEFFREVSVERLPRT

Query:  TSDHFPIILKMG--------AHSWGPTPFRFENAWLDHHL------FFK-------NVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQ
         SDH  I +++           +W       ++ W+   +      F +       N +N W + +A          K   L+A LK   +E   N+   
Subjt:  TSDHFPIILKMG--------AHSWGPTPFRFENAWLDHHL------FFK-------NVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQ

Query:  KQVLIDKINSFDSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEIL
         + L          EE S    +  KE    R  L ++  K     I KSK  +  +  +           ++ KS+ISS+ + + +      EI   + 
Subjt:  KQVLIDKINSFDSLEESSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEIL

Query:  SFFSMLYGTRVSS----PFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNET
         ++  LY  +  +        +  +   LS ++  +L  P +  EI   + ++   KS GPDG T EFY+     L P L+ +FQ+  K G++     E 
Subjt:  SFFSMLYGTRVSS----PFICDGLNWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNET

Query:  YICLIPKK-KEAGRVSDFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWS-LRGRKGVLLKLDLEKAYDKVDWSF
         I LIPK  K+  R  ++RPISL+    KI++K+L +R+++ +  II+  Q+ F+ G Q    I  +  V+   + L+ +  ++L +D EKA+D +   F
Subjt:  YICLIPKK-KEAGRVSDFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWS-LRGRKGVLLKLDLEKAYDKVDWSF

Query:  LDMIMKLKGFGKRWRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLF
        +   +K  G    + K I    S    +II+NG        + G RQ  PL+P LF IV + L   I    E+++++G H    +E++    +ADD +++
Subjt:  LDMIMKLKGFGKRWRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLF

Query:  SSWEDGNLENWWKVVNIFLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHN--RKEMWNNLEERFRHKFDRWRNVSLSKG
              +     +V+  +   +G  +N  K+       +N       +S   +V     KYLG  + +      KE +  L +      ++W+N+  S  
Subjt:  SSWEDGNLENWWKVVNIFLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHN--RKEMWNNLEERFRHKFDRWRNVSLSKG

Query:  GRLTLVQSVLNSLPCYLFSL--AQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFR--QKNIALLTKWFWRFSKEETSLWRR
        GR+ +V+  +     Y F+    +AP+     LE++I  F+W          L+     S     GG+ +   R   K+I + T W+W     E  +W R
Subjt:  GRLTLVQSVLNSLPCYLFSL--AQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFR--QKNIALLTKWFWRFSKEETSLWRR

Query:  L
        +
Subjt:  L

P0C2F6 Putative ribonuclease H protein At1g65750 | 8.1e-35 | 25.83
Query:  KEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSLAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQK
        K+ +  + ER   +   WR  +LS  GRLTL ++VL+S+P +  S    P  I+NRL+Q+ R F+W   +     HLV W    +P   GGLG+ + +  
Subjt:  KEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSLAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQK

Query:  NIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKSHRLWAGI-LKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAE----KFPNLFS
        N AL++K  WR  +E+ SLW  ++   Y + E   S     +G     W  I +  +++  +   ++ G G +I+FW D+W   + L E    + P    
Subjt:  NIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKSHRLWAGI-LKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAE----KFPNLFS

Query:  LALNKEAYV-ADCWCTA------THSWNLGLRRNMLDNEIANAASALEILHSWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRS---PNIAVPLIRQIW
          + K+ ++    W  A      T++  L LR  +LD                  T   D L W  + +G F+ +S +  LT      PN+A      +W
Subjt:  LALNKEAYV-ADCWCTA------THSWNLGLRRNMLDNEIANAASALEILHSWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRS---PNIAVPLIRQIW

Query:  KNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNI
        K ++P++VK FLW +  +++ T E+  ++    L + ++C +C    E + H+   CP        +      +      +  W+ + L  R  S   +I
Subjt:  KNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNI

Query:  LWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCN
         W      ++W  WK R   IF +     D     V+    W    Y  H  N
Subjt:  LWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCN

P11369 LINE-1 retrotransposable element ORF2 protein | 1.1e-47 | 23.11
Query:  GDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDI-----PFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFREVSVERLPRTTSDH--FPIILKM
        GDFN      ++S   +  R  ++   +++++DL DI     P   G   +  S      SK+D  +  K  +  ++  ++E +P   SDH    +I   
Subjt:  GDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDI-----PFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFREVSVERLPRTTSDH--FPIILKM

Query:  GAHSWGPT-PFRFENAWLDHHLFFKNVEN------WWGSLEADGWP-IFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSFDSLEESSCLNEAN
          ++  PT  ++  N  L+  L  + ++        +   EA  +P ++  M+     K I  S +K+      +    L   + + +  +E++    + 
Subjt:  GAHSWGPT-PFRFENAWLDHHLFFKNVEN------WWGSLEADGWP-IFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSFDSLEESSCLNEAN

Query:  VKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRVSS----PFICDGL
         +E    RG +  +  +     I +++  +  +  +      R     + K +I+ + +  G      +EI + I SF+  LY T++ +        D  
Subjt:  VKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRVSS----PFICDGL

Query:  NWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPK-KKEAGRVSDFRPISLI
            L+    + L +P + KEI   +  +   KS GPDG + EFY+     L P L ++F      G +     E  I LIPK +K+  ++ +FRPISL+
Subjt:  NWRGLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPK-KKEAGRVSDFRPISLI

Query:  TSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKWIWGCLST
            KI++K+LA+R+++ + +II+  Q+ F+ G Q    I  +  V+   + L+ +  +++ LD EKA+DK+   F+  +++  G    +   I    S 
Subjt:  TSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKWIWGCLST

Query:  TNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNIFLVRAGL
           +I VNG     I  K G RQ  PL+P+LF IV   L  L     +++ ++G       E++     ADD +++ S    +      ++N F    G 
Subjt:  TNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNIFLVRAGL

Query:  SLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWN----NLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSL--
         +N  K+       +        E+T  S+     KYLG ++ +    K++++    +L++  +    RW+++  S  GR+ +V+  +     Y F+   
Subjt:  SLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWN----NLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSL--

Query:  AQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFR--QKNIALLTKWFWRFSKEETSLWRRL
         + P    N LE  I KFVW          L+  + TS     GG+ +   +   + I + T W+W +   +   W R+
Subjt:  AQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFR--QKNIALLTKWFWRFSKEETSLWRRL

P14381 Transposon TX1 uncharacterized 149 kDa protein | 1.1e-52 | 24.84
Query:  PSSYKRRDQFWMELSSLYGLCNDN--WCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVR-----PAASKLDRFLLSK
        P++   R +F+  LS+     + +    +GGDFN      +++   +   S      LI    LVD+       + + + VR      + S++DR  +S 
Subjt:  PSSYKRRDQFWMELSSLYGLCNDN--WCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVR-----PAASKLDRFLLSK

Query:  PWMEFFREVSVERLPRTTSDHFPIILKMGAHSWGPTP--FRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGL-------KAILKSWNKETFG
          M   +  ++   P   SDH  + L+M      P    + F N+ L+   F K+V + W      GW   +F ++   L       K  LK   +E   
Subjt:  PWMEFFREVSVERLPRTTSDHFPIILKMGAHSWGPTP--FRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGL-------KAILKSWNKETFG

Query:  NIFSQKQVLIDKIN-SFDSLEE--SSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTE
        ++  Q+   I+ +N     LE+  S   ++A   E    + AL ++  ++ +    +S++  L + +  S FF+     + ++  I+ L + DG  L   
Subjt:  NIFSQKQVLIDKIN-SFDSLEE--SSCLNEANVKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTE

Query:  KEIVDEILSFFSMLYGTRVSSPFICDGLNWRGL---SLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVI
        + I D   SF+  L+     SP  C+ L W GL   S +    LE P T  E+ +A+  M   KS G DG+T EF++  W+ L PD  RV  + FK G +
Subjt:  KEIVDEILSFFSMLYGTRVSSPFICDGLNWRGL---SLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVI

Query:  NRRCNETYICLIPKKKEAGRVSDFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDK
           C    + L+PKK +   + ++RP+SL+++ YKI++K ++ RLK VL  +I+  Q   V GR I D +    +++      G     L LD EKA+D+
Subjt:  NRRCNETYICLIPKKKEAGRVSDFRPISLITSLYKIISKVLASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDK

Query:  VDWSFLDMIMKLKGFGKRWRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYAD
        VD  +L   ++   FG ++  ++    ++    + +N      +   RG+RQ  PL+  L+++  +   CL+     ++ L G   +     +    YAD
Subjt:  VDWSFLDMIMKLKGFGKRWRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYAD

Query:  DTLLFSSWEDGNLENWWKVVNIFLVRAGLSLNKAKTS-LIGINLSNDDLAPFSESTGCSVDNLPFKYLG-FSIGRGHNRKEMWNNLEERFRHKFDRWRNV
        D +L +  +  +LE   +   ++   +   +N +K+S L+  +L  D L P       S ++   KYLG +     +   + +  LEE    +  +W+  
Subjt:  DTLLFSSWEDGNLENWWKVVNIFLVRAGLSLNKAKTS-LIGINLSNDDLAPFSESTGCSVDNLPFKYLG-FSIGRGHNRKEMWNNLEERFRHKFDRWRNV

Query:  S--LSKGGRLTLVQSVLNSLPCYLFSLAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQK
        +  LS  GR  ++  ++ S   Y           I ++++ +  F+W G       H V+   +S P   GG G+   R +
Subjt:  S--LSKGGRLTLVQSVLNSLPCYLFSLAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQK

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 1.1e-39 | 29.7
Query:  GDFNVVRWLNEKSSGTR---PTRSMLRFNNLIEELDLVDIPFRNGRFSWS-RSGVRPAASKLDRFLLSKPWMEFFREVSVERLPRTTSDHFP-IILKMGA
        GDF+ +   ++  S  +   P R +  F N + + DLVDIP R   ++WS      P   KLDR + +  W   F            SDH P II+    
Subjt:  GDFNVVRWLNEKSSGTR---PTRSMLRFNNLIEELDLVDIPFRNGRFSWS-RSGVRPAASKLDRFLLSKPWMEFFREVSVERLPRTTSDHFP-IILKMGA

Query:  HSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSFDSL-----EESSCLNEANVKERE
               FR+ +    H  F  ++   W      G  +FS  E LK  K   K  N++ FGNI  + +  +D + S  S       +S    E   +++ 
Subjt:  HSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSFDSL-----EESSCLNEANVKERE

Query:  NCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYG------TRVSSPFICDGLNWR
        N   A L      +  + QKS++ WL++G+ N+ FFH+ + A ++K++I  L   D   +    ++ + I+++++ L G      T  S   I D   +R
Subjt:  NCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYG------TRVSSPFICDGLNWR

Query:  GLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVSDFRPISLITSLY
              S L   P ++KEI  AVF M   K+ GPD  T EF+ +SW ++K   +   ++FF+ G + +R N T I LIPK     ++S FRP+S  T +Y
Subjt:  GLSLQDSNLLEAPFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVSDFRPISLITSLY

Query:  KIIS
        KII+
Subjt:  KIIS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 7.3e-23 | 23.75
Query:  LSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSLAQAPVGIINRLEQMIRKFV
        + ++D A    S   +   LP +YLG  +         +  L E+ R +  +W    LS  GRL L+ SV++SL  +  S  + P   I  ++ +   F+
Subjt:  LSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGHNRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSLAQAPVGIINRLEQMIRKFV

Query:  WTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKSHRLWAGILKHKEIFFNFSAF
        W+G   N     V W     P   GGLGI S ++ N       FW  S   T                 W            +W  ILKH+ +   F   
Subjt:  WTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKWFWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKSHRLWAGILKHKEIFFNFSAF

Query:  VLGKGTKIKFWKDKWCVVETLAE--KFPNLFSLALNKEAYVADCWCTATHSWNLGLRRNMLDNEIANAASALEILHSWAPTERNDSLKWIPN---INGNF
         +  G+   FW D W  +  L +         + +   A VA+         N   RR+  D  +       E+ H    T   D+++W  N       F
Subjt:  VLGKGTKIKFWKDKWCVVETLAE--KFPNLFSLALNKEAYVADCWCTATHSWNLGLRRNMLDNEIANAASALEILHSWAPTERNDSLKWIPN---INGNF

Query:  TTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCPFTRKASYTLFGIFDLEL
         TK T+     R P + V   + +W +    K     W      L T +++   +     + S C LC    E  DHLF  CP++ +  +     F L L
Subjt:  TTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCPFTRKASYTLFGIFDLEL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 1.3e-08 | 39.76
Query:  LASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGV----LLKLDLEKAYDKVDWSFLDMIMKLKGFGKRW
        +  RLK ++ ++I  +Q +F+ GR   D I+   E V   S+R +KGV    LLKLDLEKAYD++ W +L+  +   GF + W
Subjt:  LASRLKKVLPSIINDSQMAFVEGRQILDAILTASEVVDEWSLRGRKGV----LLKLDLEKAYDKVDWSFLDMIMKLKGFGKRW

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 2.3e-08 | 23
Query:  ILKHKEIFFNFSAFVLGKGTKIKFWKDKWC----VVETLAEKFPNLFSLALNKEAYVADCWCTATHSWNLGLRRNMLDNEIANAASALEILHSWAPTERN
        +++ K     F    +G G    FW D W     ++  L    P    L + ++A V +   +    W L   R+        A +   + H    +   
Subjt:  ILKHKEIFFNFSAFVLGKGTKIKFWKDKWC----VVETLAEKFPNLFSLALNKEAYVADCWCTATHSWNLGLRRNMLDNEIANAASALEILHSWAPTERN

Query:  DSLKWIPNING----NFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP
        DS  W  N  G    +F+++ T+  +   SP   VP  + +W  +   +     W      L T ++L+    N    PS   LC+  +E   HLF  C 
Subjt:  DSLKWIPNING----NFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLCAKDEEMLDHLFLHCP

Query:  FTRKASYTLFGIFDLE------LCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHT
        F    S  ++  F  +        LP+    W+++ L  R +S     + K   +S ++ +WKERN+RIF    +S  S    +  T
Subjt:  FTRKASYTLFGIFDLE------LCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHT

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 1.6e-09 | 39.71
Query:  IVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDT
        I+NG P+G +   RG+RQ DPL+P+LF +  + L+ L     E+  L G    N +  + HL +ADDT
Subjt:  IVNGRPRGKIIAKRGIRQSDPLAPFLFTIVGDALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDT


Sequences
CDS sequence
ATGGATAACAGGTGTTTACGGCCCTCATCTTATAAACGTAGAGACCAATTCTGGATGGAGCTTTCTAGCCTATATGGCTTGTGCAACGATAACTGGTGCGTAGGGGGTGA
CTTCAATGTAGTCAGATGGTTGAATGAGAAGAGCTCTGGCACTCGCCCGACTAGAAGTATGCTTCGTTTTAACAACCTGATTGAGGAATTGGACCTGGTGGATATTCCCT
TCCGAAACGGAAGGTTCTCTTGGTCCCGATCAGGGGTTAGGCCCGCTGCTTCAAAGCTCGACAGATTCCTTCTTTCCAAGCCTTGGATGGAGTTCTTTAGAGAAGTAAGT
GTGGAAAGACTTCCTCGTACCACATCGGATCACTTTCCGATAATTTTAAAAATGGGGGCTCATTCCTGGGGTCCCACACCTTTTAGATTTGAAAATGCTTGGCTGGATCA
TCATCTTTTCTTTAAAAATGTTGAGAACTGGTGGGGCAGTTTGGAGGCTGATGGCTGGCCTATATTCTCTTTCATGGAAAAGCTAAAAGGTCTGAAAGCCATACTAAAAA
GTTGGAATAAGGAGACTTTTGGTAATATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTCTTTTGACTCACTTGAAGAGTCAAGCTGTCTCAACGAGGCAAAT
GTGAAGGAAAGAGAAAACTGTAGAGGGGCTCTGCTTGATTTGATTGCGAAAGAGCAGAAGTTGTGGATTCAGAAGTCGAAGCTTCATTGGCTTAGGGAGGGGGAGGAAAA
CTCAAGCTTCTTTCATAGATGGGTTTCGGCTCGCAAAAGTAAGAGTATTATTTCCTCCTTGGTTAGTATTGATGGGAAGACTCTTGTCACAGAGAAGGAGATTGTGGATG
AGATCCTTAGCTTCTTTTCAATGTTATATGGCACAAGGGTCTCCTCGCCGTTTATTTGCGACGGTCTTAATTGGAGAGGCCTTAGCTTACAGGATTCGAATTTACTTGAG
GCTCCTTTTACCGAAAAAGAAATTAGAGAAGCTGTTTTTGATATGGGTTGTCTCAAGTCCCTTGGCCCTGATGGCATGACTGGAGAGTTTTATAAAAAATCATGGAACAT
TCTGAAGCCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAGTTATTAATAGAAGATGTAACGAGACTTATATTTGCCTCATCCCCAAAAAGAAAGAGGCGG
GCCGTGTCAGTGACTTCAGACCAATTAGCTTGATTACCTCCTTGTATAAAATTATCTCCAAGGTGCTTGCTTCAAGGCTTAAAAAAGTTCTTCCTTCGATAATTAATGAC
TCTCAAATGGCTTTTGTGGAGGGAAGGCAAATCCTTGATGCTATCTTAACTGCTTCCGAGGTTGTTGACGAATGGTCTTTAAGAGGCAGAAAAGGCGTGCTTTTGAAGCT
CGACCTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGATCATGAAACTTAAAGGCTTTGGTAAGAGATGGAGGAAATGGATTTGGGGATGCTTGTCGA
CAACTAATTTCTCCATAATTGTCAATGGGAGGCCTAGAGGGAAAATTATTGCTAAAAGGGGCATTCGTCAAAGTGATCCTCTTGCTCCTTTTCTTTTTACGATTGTGGGA
GATGCTCTTAATTGCCTTATCCACTACTGTAATGAGAAAAGGAGTTTAAGAGGCTTTCATTTTGAGAACCTGACAGAAGATTTAACCCATCTTCAGTACGCAGACGACAC
GCTTCTTTTCTCTTCCTGGGAGGATGGAAATCTAGAGAACTGGTGGAAGGTGGTTAATATCTTCCTTGTGAGAGCCGGTCTTTCCCTTAACAAAGCTAAAACATCCTTGA
TTGGCATTAACCTTAGCAATGATGACTTAGCTCCTTTTAGTGAATCTACGGGATGCTCGGTTGATAATCTTCCCTTTAAATATTTGGGCTTCTCTATTGGAAGGGGTCAT
AATAGAAAAGAGATGTGGAACAATCTTGAAGAGAGATTCAGACACAAATTTGATAGGTGGAGGAATGTATCCCTCTCCAAAGGGGGTAGACTAACTCTGGTGCAATCAGT
TCTCAACAGCCTCCCTTGTTATCTCTTCTCCCTTGCTCAAGCTCCAGTTGGCATTATTAATAGATTGGAACAGATGATCAGGAAGTTTGTTTGGACAGGTGGATCTACGA
ATCCAATTGCTCATCTCGTCAACTGGGAATGCACTTCCGCCCCAACTTGTTACGGTGGTCTTGGGATTGGCTCTTTTAGGCAAAAGAATATTGCTCTTCTCACTAAGTGG
TTTTGGAGGTTTAGCAAGGAAGAAACCTCTTTATGGAGGCGATTAATTGTGGCCATCTATGGTTTAGATGAGAATGGGTGGTCTACCAAAAATCCAAACAGGGGAAAATC
TCATAGATTATGGGCTGGTATTTTAAAGCATAAGGAGATATTCTTCAATTTTTCTGCTTTTGTGTTGGGAAAAGGAACAAAAATCAAATTTTGGAAGGATAAATGGTGTG
TCGTGGAAACACTTGCAGAAAAATTCCCTAACTTGTTCTCTTTGGCGCTAAATAAGGAAGCTTATGTGGCTGATTGCTGGTGTACTGCTACTCATTCTTGGAATTTGGGC
CTTAGAAGAAATATGCTCGACAACGAGATTGCCAATGCAGCCTCAGCTTTAGAAATTCTTCACTCGTGGGCCCCCACTGAAAGGAATGATAGTCTTAAATGGATTCCTAA
CATAAATGGCAACTTCACTACAAAATCTACTTTTCTTAACTTAACTAAGAGATCTCCCAACATTGCCGTTCCCTTGATTCGTCAGATTTGGAAGAATAAAATCCCGAAGA
AGGTGAAGTTTTTCTTATGGTCACTTGCTTACAGAAGCCTCAACACCCATGAGAAACTACAAAAAAAAATTCAGAACACTTTGCTTAGCCCCTCGATGTGTTGCCTATGC
GCTAAAGATGAGGAAATGTTGGATCATTTATTTCTACATTGTCCCTTCACAAGAAAAGCTTCGTACACTCTGTTTGGTATTTTCGATTTGGAGCTTTGCCTTCCTAGCAA
GATTGATAGATGGATGATTGAAGGTCTTAACTTTAGAGGTTACAGCCCTAAAGGAAACATCTTATGGAAATGCGCGACGCGTTCCCTTTTGTGGAGCATTTGGAAAGAAA
GGAATAGCAGAATCTTTGACGATAGATTTAATTCTTTTGATTCTTTTTGGGCTGTGGTTCAACACACAGCCTCTTGGTGGAGTACGAATTACACCAAACACTTTTGTAAT
TATAGCCTTTCTATGATTTTCAACAATTGGAAGGCCATTATGTCTTAG
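
The CDS above and the protein sequence listed for this gene should correspond codon by codon. A minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1) applies; the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling. Only the first 60 bases and the final line of the CDS are used here, copied verbatim from the listing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the CDS (20 codons).
cds_start = "ATGGATAACAGGTGTTTACGGCCCTCATCTTATAAACGTAGAGACCAATTCTGGATGGAG"
# Final line of the CDS, which happens to begin on a codon boundary.
cds_end = "TATAGCCTTTCTATGATTTTCAACAATTGGAAGGCCATTATGTCTTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Joining all CDS lines into one string and passing it to `translate` would allow a comparison against the complete protein sequence.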
mRNA sequence
ATTCATTGTAAACTCAATGCTGGTTTCTGTGGATGGATAACAGGTGTTTACGGCCCTCATCTTATAAACGTAGAGACCAATTCTGGATGGAGCTTTCTAGCCTATATGGC
TTGTGCAACGATAACTGGTGCGTAGGGGGTGACTTCAATGTAGTCAGATGGTTGAATGAGAAGAGCTCTGGCACTCGCCCGACTAGAAGTATGCTTCGTTTTAACAACCT
GATTGAGGAATTGGACCTGGTGGATATTCCCTTCCGAAACGGAAGGTTCTCTTGGTCCCGATCAGGGGTTAGGCCCGCTGCTTCAAAGCTCGACAGATTCCTTCTTTCCA
AGCCTTGGATGGAGTTCTTTAGAGAAGTAAGTGTGGAAAGACTTCCTCGTACCACATCGGATCACTTTCCGATAATTTTAAAAATGGGGGCTCATTCCTGGGGTCCCACA
CCTTTTAGATTTGAAAATGCTTGGCTGGATCATCATCTTTTCTTTAAAAATGTTGAGAACTGGTGGGGCAGTTTGGAGGCTGATGGCTGGCCTATATTCTCTTTCATGGA
AAAGCTAAAAGGTCTGAAAGCCATACTAAAAAGTTGGAATAAGGAGACTTTTGGTAATATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTCTTTTGACTCAC
TTGAAGAGTCAAGCTGTCTCAACGAGGCAAATGTGAAGGAAAGAGAAAACTGTAGAGGGGCTCTGCTTGATTTGATTGCGAAAGAGCAGAAGTTGTGGATTCAGAAGTCG
AAGCTTCATTGGCTTAGGGAGGGGGAGGAAAACTCAAGCTTCTTTCATAGATGGGTTTCGGCTCGCAAAAGTAAGAGTATTATTTCCTCCTTGGTTAGTATTGATGGGAA
GACTCTTGTCACAGAGAAGGAGATTGTGGATGAGATCCTTAGCTTCTTTTCAATGTTATATGGCACAAGGGTCTCCTCGCCGTTTATTTGCGACGGTCTTAATTGGAGAG
GCCTTAGCTTACAGGATTCGAATTTACTTGAGGCTCCTTTTACCGAAAAAGAAATTAGAGAAGCTGTTTTTGATATGGGTTGTCTCAAGTCCCTTGGCCCTGATGGCATG
ACTGGAGAGTTTTATAAAAAATCATGGAACATTCTGAAGCCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAGTTATTAATAGAAGATGTAACGAGACTTA
TATTTGCCTCATCCCCAAAAAGAAAGAGGCGGGCCGTGTCAGTGACTTCAGACCAATTAGCTTGATTACCTCCTTGTATAAAATTATCTCCAAGGTGCTTGCTTCAAGGC
TTAAAAAAGTTCTTCCTTCGATAATTAATGACTCTCAAATGGCTTTTGTGGAGGGAAGGCAAATCCTTGATGCTATCTTAACTGCTTCCGAGGTTGTTGACGAATGGTCT
TTAAGAGGCAGAAAAGGCGTGCTTTTGAAGCTCGACCTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGATCATGAAACTTAAAGGCTTTGGTAAGAG
ATGGAGGAAATGGATTTGGGGATGCTTGTCGACAACTAATTTCTCCATAATTGTCAATGGGAGGCCTAGAGGGAAAATTATTGCTAAAAGGGGCATTCGTCAAAGTGATC
CTCTTGCTCCTTTTCTTTTTACGATTGTGGGAGATGCTCTTAATTGCCTTATCCACTACTGTAATGAGAAAAGGAGTTTAAGAGGCTTTCATTTTGAGAACCTGACAGAA
GATTTAACCCATCTTCAGTACGCAGACGACACGCTTCTTTTCTCTTCCTGGGAGGATGGAAATCTAGAGAACTGGTGGAAGGTGGTTAATATCTTCCTTGTGAGAGCCGG
TCTTTCCCTTAACAAAGCTAAAACATCCTTGATTGGCATTAACCTTAGCAATGATGACTTAGCTCCTTTTAGTGAATCTACGGGATGCTCGGTTGATAATCTTCCCTTTA
AATATTTGGGCTTCTCTATTGGAAGGGGTCATAATAGAAAAGAGATGTGGAACAATCTTGAAGAGAGATTCAGACACAAATTTGATAGGTGGAGGAATGTATCCCTCTCC
AAAGGGGGTAGACTAACTCTGGTGCAATCAGTTCTCAACAGCCTCCCTTGTTATCTCTTCTCCCTTGCTCAAGCTCCAGTTGGCATTATTAATAGATTGGAACAGATGAT
CAGGAAGTTTGTTTGGACAGGTGGATCTACGAATCCAATTGCTCATCTCGTCAACTGGGAATGCACTTCCGCCCCAACTTGTTACGGTGGTCTTGGGATTGGCTCTTTTA
GGCAAAAGAATATTGCTCTTCTCACTAAGTGGTTTTGGAGGTTTAGCAAGGAAGAAACCTCTTTATGGAGGCGATTAATTGTGGCCATCTATGGTTTAGATGAGAATGGG
TGGTCTACCAAAAATCCAAACAGGGGAAAATCTCATAGATTATGGGCTGGTATTTTAAAGCATAAGGAGATATTCTTCAATTTTTCTGCTTTTGTGTTGGGAAAAGGAAC
AAAAATCAAATTTTGGAAGGATAAATGGTGTGTCGTGGAAACACTTGCAGAAAAATTCCCTAACTTGTTCTCTTTGGCGCTAAATAAGGAAGCTTATGTGGCTGATTGCT
GGTGTACTGCTACTCATTCTTGGAATTTGGGCCTTAGAAGAAATATGCTCGACAACGAGATTGCCAATGCAGCCTCAGCTTTAGAAATTCTTCACTCGTGGGCCCCCACT
GAAAGGAATGATAGTCTTAAATGGATTCCTAACATAAATGGCAACTTCACTACAAAATCTACTTTTCTTAACTTAACTAAGAGATCTCCCAACATTGCCGTTCCCTTGAT
TCGTCAGATTTGGAAGAATAAAATCCCGAAGAAGGTGAAGTTTTTCTTATGGTCACTTGCTTACAGAAGCCTCAACACCCATGAGAAACTACAAAAAAAAATTCAGAACA
CTTTGCTTAGCCCCTCGATGTGTTGCCTATGCGCTAAAGATGAGGAAATGTTGGATCATTTATTTCTACATTGTCCCTTCACAAGAAAAGCTTCGTACACTCTGTTTGGT
ATTTTCGATTTGGAGCTTTGCCTTCCTAGCAAGATTGATAGATGGATGATTGAAGGTCTTAACTTTAGAGGTTACAGCCCTAAAGGAAACATCTTATGGAAATGCGCGAC
GCGTTCCCTTTTGTGGAGCATTTGGAAAGAAAGGAATAGCAGAATCTTTGACGATAGATTTAATTCTTTTGATTCTTTTTGGGCTGTGGTTCAACACACAGCCTCTTGGT
GGAGTACGAATTACACCAAACACTTTTGTAATTATAGCCTTTCTATGATTTTCAACAATTGGAAGGCCATTATGTCTTAGTTCCTTAGCTTCTTCCGAGGAGGGCCCTCT
CATCCCTCGCCCTTAGGCTGTTCTGTTTTGTTATATGAATATAATTGTCTCTTATCAAAAAAAAAAAAAAAAATTGACGTATCAATGAATTTTATCATGTTTGTATATCT
ATGTCTGTTTTATCGATCAGGAAGAATAGTGCAATTAAAACCAAATACAACATATAACAAGGCAATTTGGCTTTGGACGAATCACCAAACACCGATTAGATCTAAAAATA
GGAGAGAACAAACGATCATGCAAAGAGGACAAATATAGGGGAGACAAGCCCATGAAGAGAAGCTTCCACTAAAGGTTAGAATAGCAAATCAGCGACCCATGAAACAAAAA
CCATATGAAAATACTAACTCAACTAGGGCATTTGGAATTCCACAAGGCCTCATAAACAAACTTATTCATAAAAAGAGGTAGACAAGAGCTTGGAAAGGGAGCTAACAAAA
AAATTTCAGAAGGATCCAATTTCCGACTTTCACAAATAGGTTGGTAAGCCACTCTATTACCCGATAAGAGCAGAAAAACAAGAAATTTTCAATCTGTCACGTTTGTCTAA
AAAAGGCAATCCATGATAAGAGTTTTTATTTTATAGACTGTGCTTCGTTAATTCTCCTATAATAGCAACTCAAAAATCTACCTCAACCTCAAGGTAAACATTAACTTTAA
CTTTAGAAATGTGTGTCAGTTGAACAAATTGACAATATAGATATCCACAAATGTGAGAAAGGAACTTCATGCTTTTTTTAATGAAAAGAACTAGACTTTCATGAGGAAAA
ATTAAAGAAAGACATAGGCATAC
Protein sequence
MDNRCLRPSSYKRRDQFWMELSSLYGLCNDNWCVGGDFNVVRWLNEKSSGTRPTRSMLRFNNLIEELDLVDIPFRNGRFSWSRSGVRPAASKLDRFLLSKPWMEFFREVS
VERLPRTTSDHFPIILKMGAHSWGPTPFRFENAWLDHHLFFKNVENWWGSLEADGWPIFSFMEKLKGLKAILKSWNKETFGNIFSQKQVLIDKINSFDSLEESSCLNEAN
VKERENCRGALLDLIAKEQKLWIQKSKLHWLREGEENSSFFHRWVSARKSKSIISSLVSIDGKTLVTEKEIVDEILSFFSMLYGTRVSSPFICDGLNWRGLSLQDSNLLE
APFTEKEIREAVFDMGCLKSLGPDGMTGEFYKKSWNILKPDLVRVFQDFFKNGVINRRCNETYICLIPKKKEAGRVSDFRPISLITSLYKIISKVLASRLKKVLPSIIND
SQMAFVEGRQILDAILTASEVVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMIMKLKGFGKRWRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQSDPLAPFLFTIVG
DALNCLIHYCNEKRSLRGFHFENLTEDLTHLQYADDTLLFSSWEDGNLENWWKVVNIFLVRAGLSLNKAKTSLIGINLSNDDLAPFSESTGCSVDNLPFKYLGFSIGRGH
NRKEMWNNLEERFRHKFDRWRNVSLSKGGRLTLVQSVLNSLPCYLFSLAQAPVGIINRLEQMIRKFVWTGGSTNPIAHLVNWECTSAPTCYGGLGIGSFRQKNIALLTKW
FWRFSKEETSLWRRLIVAIYGLDENGWSTKNPNRGKSHRLWAGILKHKEIFFNFSAFVLGKGTKIKFWKDKWCVVETLAEKFPNLFSLALNKEAYVADCWCTATHSWNLG
LRRNMLDNEIANAASALEILHSWAPTERNDSLKWIPNINGNFTTKSTFLNLTKRSPNIAVPLIRQIWKNKIPKKVKFFLWSLAYRSLNTHEKLQKKIQNTLLSPSMCCLC
AKDEEMLDHLFLHCPFTRKASYTLFGIFDLELCLPSKIDRWMIEGLNFRGYSPKGNILWKCATRSLLWSIWKERNSRIFDDRFNSFDSFWAVVQHTASWWSTNYTKHFCN
YSLSMIFNNWKAIMS