CuGenDBv2

Lag0029486 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029486
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr8:39433974..39439116
RNA-Seq Expression: Lag0029486
Synteny: Lag0029486
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR005135 - Endonuclease/exonuclease/phosphatase
    IPR025558 - Domain of unknown function DUF4283
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]; e value: 2.0e-189; % identity: 36.4
Query:  LQREETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLG
        L  +ETKK     R + S+W++ +  W +L + GASGGILI+W   + S +E + G FS+SI   +    S WLSA+YGP+  A R +FW EL D+AGL 
Subjt:  LQREETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLG

Query:  GENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSF
           W +GGDFNV R S EK  G   T SM+ F+ +I+D  LID PL++  +TWS+   N  C  +DRFL ++     F  +    L R TSDH+P  L  
Subjt:  GENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSF

Query:  GDLSWGPCPF-----WLKIDSFSGLMDNWWSQNTIQGWPGHGFMMKL-----KLKL----------------------LDDTEDMVPLSTEQISSRRLLR
            WGP PF     WL+  SF      WW +    GW GH FM KL     KLK+                       D  E    LS E ++ R L +
Subjt:  GDLSWGPCPF-----WLKIDSFSGLMDNWWSQNTIQGWPGHGFMMKL-----KLKL----------------------LDDTEDMVPLSTEQISSRRLLR

Query:  EQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMG
         ++E+L  +E I+W Q  ++ W+KEGD N++FFH++   R+ + FI E+ +  G  +     I+ E L +++ L+T   G  +    +DW PIS   +  
Subjt:  EQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMG

Query:  LEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSD
        LE+ F+E+E+++A+  +   K+PGPDGFT+  F+  W  IK D++ +  +F+ +GIIN S N ++I L+PKK  ++ +SDFRPISLI   YKIIA+VL+ 
Subjt:  LEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSD

Query:  RLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGK
        R++ VL  TI   Q AFV  RQILDA LIANE++D+   S ++GVV K+D EKA+D V WDFLD +L+ KGFG+ WRKW+ GCLSSV++++++NG  +G 
Subjt:  RLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGK

Query:  IIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKS-------
        +  SRG+RQGDP  PFLF +V+D LSR+L  +   + +    +G +   V+HLQFADDT+ FS   ++ +  + +++ +F   SGL +N  KS       
Subjt:  IIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKS-------

Query:  ------------------------------------------------------------GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFW
                                                                    GR TL Q+ L+  P Y+LSLFK+P  VA  ++++ RDF W
Subjt:  ------------------------------------------------------------GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFW

Query:  EGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKR
         G       H +NW  V  P   GG+G G    RN+ALL KW+WR+  E ++LW+++I++  Y S       + I + SH+ PW+ I       S   + 
Subjt:  EGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKR

Query:  RLGNGLDTSFWHDSWLSCGILATNFPRLYR-LTDRPRSLVGETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNCT-DSWIWPLESSNIFSVK
         +GNG    FW D W     L   +PRL R +TD+   +          +W+ + RRNL+D E E+   L   L  + + +   D   W L  S +F+VK
Subjt:  RLGNGLDTSFWHDSWLSCGILATNFPRLYR-LTDRPRSLVGETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNCT-DSWIWPLESSNIFSVK

Query:  SLMEDLVDYPNMANDL-YKVIWTDFYPKKIKIF
        S    L  Y         K +W    P K+K F
Subjt:  SLMEDLVDYPNMANDL-YKVIWTDFYPKKIKIF

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 4.8e-191; % identity: 29.03
Query:  ITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKN--GFFVEVNQVQNSGNRQRILIPSENNKQGWFSFFSLI---S
        +TE  ++ S S+ ++  +L W+ + F  ++ +  +  FF++ R ++  +W+ K  NK+      E+ ++ N G +  IL+P   +  GW SF +LI   S
Subjt:  ITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKN--GFFVEVNQVQNSGNRQRILIPSENNKQGWFSFFSLI---S

Query:  DYPAEAHRQPTKPTPIS-FKDILQS------KPPTATITPPLKEPSKEPLASTIDE-------------------EWQEIIVLQRSNLHDDWPSIHQSLV
          P +  R   +  P+S F D   S      K     ++   ++ +K+   +T D+                    +++ +++ R   HDDW  I  SL 
Subjt:  DYPAEAHRQPTKPTPIS-FKDILQS------KPPTATITPPLKEPSKEPLASTIDE-------------------EWQEIIVLQRSNLHDDWPSIHQSLV

Query:  AGQVLRCSINPFQANKAMLHV---YDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSN
            +  S  PFQA+KA+L +   + + + +N  A + W+ +G +++KF    ++      + PSYGGW+    +P  LW    F+ IG +CGGF++ + 
Subjt:  AGQVLRCSINPFQANKAMLHV---YDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSN

Query:  LTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTV------QIKGISGNSQRI-----------------------------------------G
         T +M    +A+IK+R N  GF+PA++ L TD  G+   V      + + +   + R+                                          
Subjt:  LTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTV------QIKGISGNSQRI-----------------------------------------G

Query:  LINDGIPNMTYQT----SPSPIDTLPPLQQKL------------------------------------------------THNTSSPKLLEPPQIPPYPS
        + N    +++Y T    + S      P  Q+L                                                T   +  K LE   I     
Subjt:  LINDGIPNMTYQT----SPSPIDTLPPLQQKL------------------------------------------------THNTSSPKLLEPPQIPPYPS

Query:  PRPSPTPNMKSPT-------------------------NTFPNCLQHLAPIL---------SKHGLCIMAIPTVPKSSK---------------------
         R SP    K+                           +   N    + PI          + HGL      T   +SK                     
Subjt:  PRPSPTPNMKSPT-------------------------NTFPNCLQHLAPIL---------SKHGLCIMAIPTVPKSSK---------------------

Query:  ----------KKKLATTGKKPKLQREETKKSLI------------------CSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSL
                    K A TG + ++ R   +K +I                   S     + S  ++       +G  GGIL++W +  F V +   G +S+
Subjt:  ----------KKKLATTGKKPKLQREETKKSLI------------------CSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSL

Query:  SIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENH
        S++I +  N ++WL+++YGP ++ DR++ W EL  L  L   NW++ GDFN+ RW  E +      R+M  FN +I+   LID P  N  +TWS+   N 
Subjt:  SIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENH

Query:  YCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFWLKIDS-----FSGLMDNWWSQNTIQGWPGHGFMMKL-------------
          S +DRFL++    N FG+     L+R  SDH+P  L    + WGPCPF L   S     F     NWW+ +   G+PG+ F+  L             
Subjt:  YCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFWLKIDS-----FSGLMDNWWSQNTIQGWPGHGFMMKL-------------

Query:  --------------KLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTD
                      ++ ++D  E    +ST     R  L+  +  +   +   WHQ  +  W   GDEN  +FHRI    +RKN I  I    G SL + 
Subjt:  --------------KLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTD

Query:  NDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVS
        +DI   F+  +Q ++TK+     +  N+ W PIS      L   F E E+   + S    K+PGPDG+T+ F+K  W  +K D++ + +DF+  GI+N +
Subjt:  NDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVS

Query:  LNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDW
        +N T+I LI KK      SD+RPISL    YKI+A+ L++RLK  LP TIAENQMAF+  RQI DA LIANE ID W     KG V+KLD+EKAFDK+ W
Subjt:  LNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDW

Query:  DFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTL
         F+D +L  K F   WRKWI  C+S+V YSI++NG P+G+I   RGIRQGDP  PF+F+L  D LSRLLSH  +   I      N    ++HL FADD L
Subjt:  DFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTL

Query:  LFSIFCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------
        +F    +  L N+   + +FE ASGL  N SKS                                                                   
Subjt:  LFSIFCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------

Query:  GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVA
        GR TL +A LSS PTY LS FK P  V K ++K +RDF W GS      H INW     P  +GG+GI   ++ N ALL KW+WR+ +E NSLW K I A
Subjt:  GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVA

Query:  KYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLND
        KY  +    + P + + SS  SPW  I    D   S++     +G   SFWH  W +   L+   PRLY L++   + V E W      W++  RR LN+
Subjt:  KYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLND

Query:  VETEEWMALSLILCAISLQN--CTDSWIWPLESS--NIFSVKSLMEDLVDYPNMAN--DLYKVIWTDFYPKKIKIFYGSLV
         E + W ++ + L  I      C  +W  P +S    + S K +       P   N     K +W    P+K K F  ++V
Subjt:  VETEEWMALSLILCAISLQN--CTDSWIWPLESS--NIFSVKSLMEDLVDYPNMAN--DLYKVIWTDFYPKKIKIFYGSLV

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]; e value: 2.9e-188; % identity: 35.61
Query:  LSKHGLCIMAIPTVPKSSKKKK-----LATTGKKPKLQREETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHI
        ++K  + I++  T    SKKK+        + K   +  +ETKK     R + S+W++ +  W +L + GASGGILI+W   + S +E + G FS+SI  
Subjt:  LSKHGLCIMAIPTVPKSSKKKK-----LATTGKKPKLQREETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHI

Query:  FMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSL
         +    S WLSA+YGP+  A R + W EL D+AGL    W +GGDFNV R S EK  G  +T SM+ F+ +I+D  LID PL++  +TWS+   N  C  
Subjt:  FMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSL

Query:  IDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPF-----WLKIDSFSGLMDNWWSQNTIQGWPGHGFMMKL-----KLKL--------
        +DRFL ++     F  +    L R TSDH+P  L      WGP PF     WL+  SF      WW +    GW GH FM KL     KLK+        
Subjt:  IDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPF-----WLKIDSFSGLMDNWWSQNTIQGWPGHGFMMKL-----KLKL--------

Query:  --------------LDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIE
                       D  E    LS E ++ R + + ++E+L  +E I+W Q  ++ W+KEGD N+KFFH++   R+ + FI E+ +  G  +     I+
Subjt:  --------------LDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIE

Query:  AEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKT
         E L +++ L+T   G  +    +DW PIS   ++ LE+ F+E+E+ +A+  +   K+PGPDGFT+  F+  W  IK D++ +  +F+ +GIIN S N +
Subjt:  AEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKT

Query:  YICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLD
        +I L+PKK  ++ +SDFRPISLI   YKIIA+VL+ R++ VL  TI   Q AFV  RQILDA LIANE++D+   S ++GVV K+D EKA+D V WDFLD
Subjt:  YICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLD

Query:  AILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSI
         +++ KGFG+ WRKW+ GCLSSV++++++NG  +G +  SRG+RQGDP  PFLF +V+D LSR+L  +   + +    +G +   V+HLQFADDT+ FS 
Subjt:  AILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSI

Query:  FCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------GRHT
          ++ +  + +++ +F   SGL +N  KS                                                                   GR T
Subjt:  FCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------GRHT

Query:  LTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYN
        L Q+ L+  P Y+LSLFK+P  VA  ++++ RDF W G       H +NW  V  P   GG+G G    RN+ALL KW+WR+  E ++LW+++I++  Y 
Subjt:  LTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYN

Query:  SELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYR-LTDRPRSLVGETWIASQTAWDLSLRRNLNDVET
        S       + I + SH+ PW+ I       S   +  +GNG    FW D W     L   +PRL R +TD+   +          +W+ + RRNL+D E 
Subjt:  SELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYR-LTDRPRSLVGETWIASQTAWDLSLRRNLNDVET

Query:  EEWMALSLILCAISLQNCT-DSWIWPLESSNIFSVKSLMEDLVDYPNMANDLY--KVIWTDFYPKKIKIF
        E+   L      + + +   D   W L SS +F+VKS    L  Y +++  ++  K +W    P K+K F
Subjt:  EEWMALSLILCAISLQNCT-DSWIWPLESSNIFSVKSLMEDLVDYPNMANDLY--KVIWTDFYPKKIKIF

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 1.6e-205; % identity: 31.28
Query:  RSISIDRKNFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKNGFFVEVNQVQNSGNRQRI
        RS  I+RK F +  D++ + +   +TE   + + SI +S + L W+ S+  +++ +P S++FF + R   + +WI K  N  G   E+ +V +   +  I
Subjt:  RSISIDRKNFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKNGFFVEVNQVQNSGNRQRI

Query:  LIPSENNKQGWFSFFSLISDYPAEAHRQPTKPTPISFKDILQSKPPTATITPPL--------------------------------KEPSKEPLASTIDE
        L+P    K  W SF S+I+  P    +  T+P        L    P   ++PP+                                +     P  S    
Subjt:  LIPSENNKQGWFSFFSLISDYPAEAHRQPTKPTPISFKDILQSKPPTATITPPL--------------------------------KEPSKEPLASTIDE

Query:  EWQEIIVLQRSNLHDDWPSIHQSLVAGQVLRCSINPFQANKAMLHVYDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPT
          +  +VL R   HDDW  I Q+L        + N F A K ++H      A  LC    WT +GK+ ++F     ++     L PSYGGW     +P  
Subjt:  EWQEIIVLQRSNLHDDWPSIHQSLVAGQVLRCSINPFQANKAMLHVYDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPT

Query:  LWTERIFRFIGDSCGGFVETSNLTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTVQI------------------------------------
        LW    F+ IG +CGG ++ +  T       EA++KIR N SGF+PA VK+  D  G++  VQ+                                    
Subjt:  LWTERIFRFIGDSCGGFVETSNLTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTVQI------------------------------------

Query:  ---------------KGISGNSQRI------GLINDGIPNMTYQTSPSPIDTLPPLQQKL--THNTSSPKLL----------------------------
                         ISG+ + I       L +  I    Y TSP+ ++        L  T N S  K+L                            
Subjt:  ---------------KGISGNSQRI------GLINDGIPNMTYQTSPSPIDTLPPLQQKL--THNTSSPKLL----------------------------

Query:  -EPPQIPPYPSP-------RPSPTPNMKSPT-----------------------------NTFPNCLQHLAPIL--SKHGLCIMA----IPTVPKSSKKK
         +P +   + SP        P   P   SP                              N     LQ +A  L  SK GL +      +P +  S   +
Subjt:  -EPPQIPPYPSP-------RPSPTPNMKSPT-----------------------------NTFPNCLQHLAPIL--SKHGLCIMA----IPTVPKSSKKK

Query:  --------------KLATTGKKPKLQREETKKSLICSRIIKSLWSSSH-------------------------IGW---------TSLDSVGA-------
                            + P+L+  + +KS     +       SH                         + W            DS GA       
Subjt:  --------------KLATTGKKPKLQREETKKSLICSRIIKSLWSSSH-------------------------IGW---------TSLDSVGA-------

Query:  -------SGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRS
               +GGILI+W     S+    +G FSLS + F + N S+WL+ +YGP +  +R   W +LH+L  L    WI+GGD NV R   E +     + S
Subjt:  -------SGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRS

Query:  MRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFWLKI-----DSFSGLM
          + N +I++  LID PL N  YTWS+       S +DRFL        F       L R TSDH+P  C  S   L WGP PF L         F   M
Subjt:  MRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFWLKI-----DSFSGLM

Query:  DNWWSQNTIQGWPGHGFMMKLK---------------------------LKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEG
        + WW  +   G PG  F+ +LK                           +  +D  E   PLS E+ + R  L+ ++ DLS +E  +W Q  K  WLKEG
Subjt:  DNWWSQNTIQGWPGHGFMMKLK---------------------------LKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEG

Query:  DENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLF--TKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPG
        DEN+ FFHRI ++R+++N I EI   EG+   T+N+I   F+  +  ++  +  +   FI  N++W PI  S    L A FSE+E+   +KS   +K+PG
Subjt:  DENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLF--TKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPG

Query:  PDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQIL
        PDGF + FFK  WH +K DI+ + +DF+  G+IN ++N TYI LI KK D     DFRPISL    YK IA+ LS+RLK+ LP TI+ NQ+AF+ NRQI 
Subjt:  PDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQIL

Query:  DASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPFP-FLFILVSDC
        DA L+ANE +D W +   KG ++KLD+EKAFD ++W+F+D +L+   +   WRKWI GC+S+V YSII+NGKP+G+I  +RG+RQGDP   FLF++  D 
Subjt:  DASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPFP-FLFILVSDC

Query:  LSRLLSH----SANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKSGRHTLTQAVLSSKPTYYLSLFKLPRKV
        LSRLLSH     A    I+ H +  ++L V  L     + LF    +D +          +L++    + SK GR TL ++ LSS P Y LS+F+ P   
Subjt:  LSRLLSH----SANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKSGRHTLTQAVLSSKPTYYLSLFKLPRKV

Query:  AKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFI
         K ++KL+R+F W+GS G  G H INW+ V  P   GG+GI   Q  N ALL+KW+WR+  E NSLW +LI  K Y  + P   PS I  SS K+PWR I
Subjt:  AKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFI

Query:  TSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNCTDSWIW
         + ID   S     L NG   SFW+ +W   G L+T +PRL+ L+    S + + W ++   W+++ RR LND E   W  +   L  +          W
Subjt:  TSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNCTDSWIW

Query:  PLESSNIFSVKSLMEDLVDYP--NMAN---DLYKVIWTDFYPKKIKIFYGSLV
          +S   FS+ S    +   P  ++AN    L  +IW    P KIK F   LV
Subjt:  PLESSNIFSVKSLMEDLVDYP--NMAN---DLYKVIWTDFYPKKIKIFYGSLV

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 1.3e-191; % identity: 29.03
Query:  ITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKN--GFFVEVNQVQNSGNRQRILIPSENNKQGWFSFFSLI---S
        +TE  ++ S S+ ++  +L W+ + F  ++ +  +  FF++ R ++  +W+ K  NK+      E+ ++ N G +  IL+P   +  GW SF +LI   S
Subjt:  ITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKN--GFFVEVNQVQNSGNRQRILIPSENNKQGWFSFFSLI---S

Query:  DYPAEAHRQPTKPTPIS-FKDILQS------KPPTATITPPLKEPSKEPLASTIDE-------------------EWQEIIVLQRSNLHDDWPSIHQSLV
          P +  R   +  P+S F D   S      K     ++   ++ +K+   +T D+                    +++ +++ R   HDDW  I  SL 
Subjt:  DYPAEAHRQPTKPTPIS-FKDILQS------KPPTATITPPLKEPSKEPLASTIDE-------------------EWQEIIVLQRSNLHDDWPSIHQSLV

Query:  AGQVLRCSINPFQANKAMLHV---YDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSN
            +  S  PFQA+KA+L +   + + + +N  A + W+ +G +++KF    ++      + PSYGGW+    +P  LW    F+ IG +CGGF++ + 
Subjt:  AGQVLRCSINPFQANKAMLHV---YDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSN

Query:  LTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTV------QIKGISGNSQRI-----------------------------------------G
         T +M    +A+IK+R N  GF+PA++ L TD  G+   V      + + +   + R+                                          
Subjt:  LTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTV------QIKGISGNSQRI-----------------------------------------G

Query:  LINDGIPNMTYQT----SPSPIDTLPPLQQKL------------------------------------------------THNTSSPKLLEPPQIPPYPS
        + N    +++Y T    + S      P  Q+L                                                T   +  K LE   I     
Subjt:  LINDGIPNMTYQT----SPSPIDTLPPLQQKL------------------------------------------------THNTSSPKLLEPPQIPPYPS

Query:  PRPSPTPNMKSPT-------------------------NTFPNCLQHLAPIL---------SKHGLCIMAIPTVPKSSK---------------------
         R SP    K+                           +   N    + PI          + HGL      T   +SK                     
Subjt:  PRPSPTPNMKSPT-------------------------NTFPNCLQHLAPIL---------SKHGLCIMAIPTVPKSSK---------------------

Query:  ----------KKKLATTGKKPKLQREETKKSLI------------------CSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSL
                    K A TG + ++ R   +K +I                   S     + S  ++       +G  GGIL++W +  F V +   G +S+
Subjt:  ----------KKKLATTGKKPKLQREETKKSLI------------------CSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSL

Query:  SIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENH
        S++I +  N ++WL+++YGP ++ DR++ W EL  L  L   NW++ GDFN+ RW  E +      R+M  FN +I+   LID P  N  +TWS+   N 
Subjt:  SIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENH

Query:  YCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFWLKIDS-----FSGLMDNWWSQNTIQGWPGHGFMMKL-------------
          S +DRFL++    N FG+     L+R  SDH+P  L    + WGPCPF L   S     F     NWW+ +   G+PG+ F+  L             
Subjt:  YCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFWLKIDS-----FSGLMDNWWSQNTIQGWPGHGFMMKL-------------

Query:  --------------KLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTD
                      ++ ++D  E    +ST     R  L+  +  +   +   WHQ  +  W   GDEN  +FHRI    +RKN I  I    G SL + 
Subjt:  --------------KLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTD

Query:  NDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVS
        +DI   F+  +Q ++TK+     +  N+ W PIS      L   F E E+   + S    K+PGPDG+T+ F+K  W  +K D++ + +DF+  GI+N +
Subjt:  NDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVS

Query:  LNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDW
        +N T+I LI KK      SD+RPISL    YKI+A+ L++RLK  LP TIAENQMAF+  RQI DA LIANE+ID W     KG V+KLD+EKAFDK+ W
Subjt:  LNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDW

Query:  DFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTL
         F+D +L  K F   WRKWI  C+S+V YSI++NG P+G+I   RGIRQGDP  PF+F+L  D LSRLLSH  +   I      N++  ++HL FADD L
Subjt:  DFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTL

Query:  LFSIFCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------
        +F    +  L N+   + +FE ASGL  N SKS                                                                   
Subjt:  LFSIFCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------

Query:  GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVA
        GR TL +A LSS PTY LS FK P  V K ++K +RDF W GS      H INW     P  +GG+GI   ++ N ALL KW+WR+ +E NSLW K I A
Subjt:  GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVA

Query:  KYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLND
        KY  +    + P + + SS  SPW  I    D   S++     +G   SFWH  W +   L+   PRLY L++   + V E W      W++  RR LN+
Subjt:  KYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLND

Query:  VETEEWMALSLILCAISLQN--CTDSWIWPLESS--NIFSVKSLMEDLVDYPNMAN--DLYKVIWTDFYPKKIKIFYGSLV
         E + W ++ + L  I      C  +W  P +S    + S K +       P   N     K +W    P+K K F  ++V
Subjt:  VETEEWMALSLILCAISLQN--CTDSWIWPLESS--NIFSVKSLMEDLVDYPNMAN--DLYKVIWTDFYPKKIKIFYGSLV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7US62 LINE-1 retrotransposable element ORF2 protein; e value: 2.3e-191; % identity: 29.03
Query:  ITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKN--GFFVEVNQVQNSGNRQRILIPSENNKQGWFSFFSLI---S
        +TE  ++ S S+ ++  +L W+ + F  ++ +  +  FF++ R ++  +W+ K  NK+      E+ ++ N G +  IL+P   +  GW SF +LI   S
Subjt:  ITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKN--GFFVEVNQVQNSGNRQRILIPSENNKQGWFSFFSLI---S

Query:  DYPAEAHRQPTKPTPIS-FKDILQS------KPPTATITPPLKEPSKEPLASTIDE-------------------EWQEIIVLQRSNLHDDWPSIHQSLV
          P +  R   +  P+S F D   S      K     ++   ++ +K+   +T D+                    +++ +++ R   HDDW  I  SL 
Subjt:  DYPAEAHRQPTKPTPIS-FKDILQS------KPPTATITPPLKEPSKEPLASTIDE-------------------EWQEIIVLQRSNLHDDWPSIHQSLV

Query:  AGQVLRCSINPFQANKAMLHV---YDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSN
            +  S  PFQA+KA+L +   + + + +N  A + W+ +G +++KF    ++      + PSYGGW+    +P  LW    F+ IG +CGGF++ + 
Subjt:  AGQVLRCSINPFQANKAMLHV---YDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSN

Query:  LTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTV------QIKGISGNSQRI-----------------------------------------G
         T +M    +A+IK+R N  GF+PA++ L TD  G+   V      + + +   + R+                                          
Subjt:  LTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTV------QIKGISGNSQRI-----------------------------------------G

Query:  LINDGIPNMTYQT----SPSPIDTLPPLQQKL------------------------------------------------THNTSSPKLLEPPQIPPYPS
        + N    +++Y T    + S      P  Q+L                                                T   +  K LE   I     
Subjt:  LINDGIPNMTYQT----SPSPIDTLPPLQQKL------------------------------------------------THNTSSPKLLEPPQIPPYPS

Query:  PRPSPTPNMKSPT-------------------------NTFPNCLQHLAPIL---------SKHGLCIMAIPTVPKSSK---------------------
         R SP    K+                           +   N    + PI          + HGL      T   +SK                     
Subjt:  PRPSPTPNMKSPT-------------------------NTFPNCLQHLAPIL---------SKHGLCIMAIPTVPKSSK---------------------

Query:  ----------KKKLATTGKKPKLQREETKKSLI------------------CSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSL
                    K A TG + ++ R   +K +I                   S     + S  ++       +G  GGIL++W +  F V +   G +S+
Subjt:  ----------KKKLATTGKKPKLQREETKKSLI------------------CSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSL

Query:  SIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENH
        S++I +  N ++WL+++YGP ++ DR++ W EL  L  L   NW++ GDFN+ RW  E +      R+M  FN +I+   LID P  N  +TWS+   N 
Subjt:  SIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENH

Query:  YCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFWLKIDS-----FSGLMDNWWSQNTIQGWPGHGFMMKL-------------
          S +DRFL++    N FG+     L+R  SDH+P  L    + WGPCPF L   S     F     NWW+ +   G+PG+ F+  L             
Subjt:  YCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFWLKIDS-----FSGLMDNWWSQNTIQGWPGHGFMMKL-------------

Query:  --------------KLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTD
                      ++ ++D  E    +ST     R  L+  +  +   +   WHQ  +  W   GDEN  +FHRI    +RKN I  I    G SL + 
Subjt:  --------------KLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTD

Query:  NDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVS
        +DI   F+  +Q ++TK+     +  N+ W PIS      L   F E E+   + S    K+PGPDG+T+ F+K  W  +K D++ + +DF+  GI+N +
Subjt:  NDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVS

Query:  LNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDW
        +N T+I LI KK      SD+RPISL    YKI+A+ L++RLK  LP TIAENQMAF+  RQI DA LIANE ID W     KG V+KLD+EKAFDK+ W
Subjt:  LNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDW

Query:  DFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTL
         F+D +L  K F   WRKWI  C+S+V YSI++NG P+G+I   RGIRQGDP  PF+F+L  D LSRLLSH  +   I      N    ++HL FADD L
Subjt:  DFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTL

Query:  LFSIFCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------
        +F    +  L N+   + +FE ASGL  N SKS                                                                   
Subjt:  LFSIFCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------

Query:  GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVA
        GR TL +A LSS PTY LS FK P  V K ++K +RDF W GS      H INW     P  +GG+GI   ++ N ALL KW+WR+ +E NSLW K I A
Subjt:  GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVA

Query:  KYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLND
        KY  +    + P + + SS  SPW  I    D   S++     +G   SFWH  W +   L+   PRLY L++   + V E W      W++  RR LN+
Subjt:  KYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLND

Query:  VETEEWMALSLILCAISLQN--CTDSWIWPLESS--NIFSVKSLMEDLVDYPNMAN--DLYKVIWTDFYPKKIKIFYGSLV
         E + W ++ + L  I      C  +W  P +S    + S K +       P   N     K +W    P+K K F  ++V
Subjt:  VETEEWMALSLILCAISLQN--CTDSWIWPLESS--NIFSVKSLMEDLVDYPNMAN--DLYKVIWTDFYPKKIKIFYGSLV

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein; e value: 7.5e-206; % identity: 31.28
Query:  RSISIDRKNFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKNGFFVEVNQVQNSGNRQRI
        RS  I+RK F +  D++ + +   +TE   + + SI +S + L W+ S+  +++ +P S++FF + R   + +WI K  N  G   E+ +V +   +  I
Subjt:  RSISIDRKNFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKNGFFVEVNQVQNSGNRQRI

Query:  LIPSENNKQGWFSFFSLISDYPAEAHRQPTKPTPISFKDILQSKPPTATITPPL--------------------------------KEPSKEPLASTIDE
        L+P    K  W SF S+I+  P    +  T+P        L    P   ++PP+                                +     P  S    
Subjt:  LIPSENNKQGWFSFFSLISDYPAEAHRQPTKPTPISFKDILQSKPPTATITPPL--------------------------------KEPSKEPLASTIDE

Query:  EWQEIIVLQRSNLHDDWPSIHQSLVAGQVLRCSINPFQANKAMLHVYDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPT
          +  +VL R   HDDW  I Q+L        + N F A K ++H      A  LC    WT +GK+ ++F     ++     L PSYGGW     +P  
Subjt:  EWQEIIVLQRSNLHDDWPSIHQSLVAGQVLRCSINPFQANKAMLHVYDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPT

Query:  LWTERIFRFIGDSCGGFVETSNLTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTVQI------------------------------------
        LW    F+ IG +CGG ++ +  T       EA++KIR N SGF+PA VK+  D  G++  VQ+                                    
Subjt:  LWTERIFRFIGDSCGGFVETSNLTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTVQI------------------------------------

Query:  ---------------KGISGNSQRI------GLINDGIPNMTYQTSPSPIDTLPPLQQKL--THNTSSPKLL----------------------------
                         ISG+ + I       L +  I    Y TSP+ ++        L  T N S  K+L                            
Subjt:  ---------------KGISGNSQRI------GLINDGIPNMTYQTSPSPIDTLPPLQQKL--THNTSSPKLL----------------------------

Query:  -EPPQIPPYPSP-------RPSPTPNMKSPT-----------------------------NTFPNCLQHLAPIL--SKHGLCIMA----IPTVPKSSKKK
         +P +   + SP        P   P   SP                              N     LQ +A  L  SK GL +      +P +  S   +
Subjt:  -EPPQIPPYPSP-------RPSPTPNMKSPT-----------------------------NTFPNCLQHLAPIL--SKHGLCIMA----IPTVPKSSKKK

Query:  --------------KLATTGKKPKLQREETKKSLICSRIIKSLWSSSH-------------------------IGW---------TSLDSVGA-------
                            + P+L+  + +KS     +       SH                         + W            DS GA       
Subjt:  --------------KLATTGKKPKLQREETKKSLICSRIIKSLWSSSH-------------------------IGW---------TSLDSVGA-------

Query:  -------SGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRS
               +GGILI+W     S+    +G FSLS + F + N S+WL+ +YGP +  +R   W +LH+L  L    WI+GGD NV R   E +     + S
Subjt:  -------SGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRS

Query:  MRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFWLKI-----DSFSGLM
          + N +I++  LID PL N  YTWS+       S +DRFL        F       L R TSDH+P  C  S   L WGP PF L         F   M
Subjt:  MRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYP--CTLSFGDLSWGPCPFWLKI-----DSFSGLM

Query:  DNWWSQNTIQGWPGHGFMMKLK---------------------------LKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEG
        + WW  +   G PG  F+ +LK                           +  +D  E   PLS E+ + R  L+ ++ DLS +E  +W Q  K  WLKEG
Subjt:  DNWWSQNTIQGWPGHGFMMKLK---------------------------LKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEG

Query:  DENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLF--TKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPG
        DEN+ FFHRI ++R+++N I EI   EG+   T+N+I   F+  +  ++  +  +   FI  N++W PI  S    L A FSE+E+   +KS   +K+PG
Subjt:  DENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLF--TKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPG

Query:  PDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQIL
        PDGF + FFK  WH +K DI+ + +DF+  G+IN ++N TYI LI KK D     DFRPISL    YK IA+ LS+RLK+ LP TI+ NQ+AF+ NRQI 
Subjt:  PDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQIL

Query:  DASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPFP-FLFILVSDC
        DA L+ANE +D W +   KG ++KLD+EKAFD ++W+F+D +L+   +   WRKWI GC+S+V YSII+NGKP+G+I  +RG+RQGDP   FLF++  D 
Subjt:  DASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPFP-FLFILVSDC

Query:  LSRLLSH----SANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKSGRHTLTQAVLSSKPTYYLSLFKLPRKV
        LSRLLSH     A    I+ H +  ++L V  L     + LF    +D +          +L++    + SK GR TL ++ LSS P Y LS+F+ P   
Subjt:  LSRLLSH----SANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKSGRHTLTQAVLSSKPTYYLSLFKLPRKV

Query:  AKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFI
         K ++KL+R+F W+GS G  G H INW+ V  P   GG+GI   Q  N ALL+KW+WR+  E NSLW +LI  K Y  + P   PS I  SS K+PWR I
Subjt:  AKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFI

Query:  TSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNCTDSWIW
         + ID   S     L NG   SFW+ +W   G L+T +PRL+ L+    S + + W ++   W+++ RR LND E   W  +   L  +          W
Subjt:  TSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNCTDSWIW

Query:  PLESSNIFSVKSLMEDLVDYP--NMAN---DLYKVIWTDFYPKKIKIFYGSLV
          +S   FS+ S    +   P  ++AN    L  +IW    P KIK F   LV
Subjt:  PLESSNIFSVKSLMEDLVDYP--NMAN---DLYKVIWTDFYPKKIKIFYGSLV

A0A5D3CA17    LINE-1 retrotransposable element ORF2 protein    6.2e-192    29.03
Query:  ITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKN--GFFVEVNQVQNSGNRQRILIPSENNKQGWFSFFSLI---S
        +TE  ++ S S+ ++  +L W+ + F  ++ +  +  FF++ R ++  +W+ K  NK+      E+ ++ N G +  IL+P   +  GW SF +LI   S
Subjt:  ITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKN--GFFVEVNQVQNSGNRQRILIPSENNKQGWFSFFSLI---S

Query:  DYPAEAHRQPTKPTPIS-FKDILQS------KPPTATITPPLKEPSKEPLASTIDE-------------------EWQEIIVLQRSNLHDDWPSIHQSLV
          P +  R   +  P+S F D   S      K     ++   ++ +K+   +T D+                    +++ +++ R   HDDW  I  SL 
Subjt:  DYPAEAHRQPTKPTPIS-FKDILQS------KPPTATITPPLKEPSKEPLASTIDE-------------------EWQEIIVLQRSNLHDDWPSIHQSLV

Query:  AGQVLRCSINPFQANKAMLHV---YDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSN
            +  S  PFQA+KA+L +   + + + +N  A + W+ +G +++KF    ++      + PSYGGW+    +P  LW    F+ IG +CGGF++ + 
Subjt:  AGQVLRCSINPFQANKAMLHV---YDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSN

Query:  LTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTV------QIKGISGNSQRI-----------------------------------------G
         T +M    +A+IK+R N  GF+PA++ L TD  G+   V      + + +   + R+                                          
Subjt:  LTNRMIIATEARIKIRPNSSGFIPAAVKLPTDLAGDELTV------QIKGISGNSQRI-----------------------------------------G

Query:  LINDGIPNMTYQT----SPSPIDTLPPLQQKL------------------------------------------------THNTSSPKLLEPPQIPPYPS
        + N    +++Y T    + S      P  Q+L                                                T   +  K LE   I     
Subjt:  LINDGIPNMTYQT----SPSPIDTLPPLQQKL------------------------------------------------THNTSSPKLLEPPQIPPYPS

Query:  PRPSPTPNMKSPT-------------------------NTFPNCLQHLAPIL---------SKHGLCIMAIPTVPKSSK---------------------
         R SP    K+                           +   N    + PI          + HGL      T   +SK                     
Subjt:  PRPSPTPNMKSPT-------------------------NTFPNCLQHLAPIL---------SKHGLCIMAIPTVPKSSK---------------------

Query:  ----------KKKLATTGKKPKLQREETKKSLI------------------CSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSL
                    K A TG + ++ R   +K +I                   S     + S  ++       +G  GGIL++W +  F V +   G +S+
Subjt:  ----------KKKLATTGKKPKLQREETKKSLI------------------CSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSL

Query:  SIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENH
        S++I +  N ++WL+++YGP ++ DR++ W EL  L  L   NW++ GDFN+ RW  E +      R+M  FN +I+   LID P  N  +TWS+   N 
Subjt:  SIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENH

Query:  YCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFWLKIDS-----FSGLMDNWWSQNTIQGWPGHGFMMKL-------------
          S +DRFL++    N FG+     L+R  SDH+P  L    + WGPCPF L   S     F     NWW+ +   G+PG+ F+  L             
Subjt:  YCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPFWLKIDS-----FSGLMDNWWSQNTIQGWPGHGFMMKL-------------

Query:  --------------KLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTD
                      ++ ++D  E    +ST     R  L+  +  +   +   WHQ  +  W   GDEN  +FHRI    +RKN I  I    G SL + 
Subjt:  --------------KLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTD

Query:  NDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVS
        +DI   F+  +Q ++TK+     +  N+ W PIS      L   F E E+   + S    K+PGPDG+T+ F+K  W  +K D++ + +DF+  GI+N +
Subjt:  NDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVS

Query:  LNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDW
        +N T+I LI KK      SD+RPISL    YKI+A+ L++RLK  LP TIAENQMAF+  RQI DA LIANE+ID W     KG V+KLD+EKAFDK+ W
Subjt:  LNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDW

Query:  DFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTL
         F+D +L  K F   WRKWI  C+S+V YSI++NG P+G+I   RGIRQGDP  PF+F+L  D LSRLLSH  +   I      N++  ++HL FADD L
Subjt:  DFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTL

Query:  LFSIFCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------
        +F    +  L N+   + +FE ASGL  N SKS                                                                   
Subjt:  LFSIFCKDALANMFDIVKIFELASGLNINYSKS-------------------------------------------------------------------

Query:  GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVA
        GR TL +A LSS PTY LS FK P  V K ++K +RDF W GS      H INW     P  +GG+GI   ++ N ALL KW+WR+ +E NSLW K I A
Subjt:  GRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVA

Query:  KYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLND
        KY  +    + P + + SS  SPW  I    D   S++     +G   SFWH  W +   L+   PRLY L++   + V E W      W++  RR LN+
Subjt:  KYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLND

Query:  VETEEWMALSLILCAISLQN--CTDSWIWPLESS--NIFSVKSLMEDLVDYPNMAN--DLYKVIWTDFYPKKIKIFYGSLV
         E + W ++ + L  I      C  +W  P +S    + S K +       P   N     K +W    P+K K F  ++V
Subjt:  VETEEWMALSLILCAISLQN--CTDSWIWPLESS--NIFSVKSLMEDLVDYPNMAN--DLYKVIWTDFYPKKIKIFYGSLV

M5VS59    Reverse transcriptase domain-containing protein (Fragment)    8.6e-194    37.71
Query:  ETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENW
        ETKK  +  +++  +W S    W    S+G SGGI ++W+    SV +++ G FS+SI I       +WLS IYGP R  +R+ FW EL DL G  G+ W
Subjt:  ETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENW

Query:  ILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLS
         LGGDFNV R+S EKS+   VT+SMR FN +I + +L D  L N  +TWS+  EN  C  +DRFL++ +  + F   R   L R+TSDH P  L    + 
Subjt:  ILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLS

Query:  WGPCPF-----WLKIDSFSGLMDNWWSQNTIQGWPGHGFMMKLK---------------------------LKLLDDTEDMVPLSTEQISSRRLLREQIE
        WGP PF     WL    F   +  WW ++ I GW G+ FM +LK                           L +LD  E    L     S R  L  +I 
Subjt:  WGPCPF-----WLKIDSFSGLMDNWWSQNTIQGWPGHGFMMKLK---------------------------LKLLDDTEDMVPLSTEQISSRRLLREQIE

Query:  DLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAV
        DL+ +E + W Q  K+ W +EGD NTKFFHR+    +++N+I ++   +   +  D +IE E + F++ L++ ++   +    ++WCPIS  ++  LE  
Subjt:  DLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAV

Query:  FSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKM
        F  +EV +AV   G  KSPGPDGF++ FF+  W  +K D+M +M+DF+ +GI+N   N+T+ICLIPKK ++  V+D RPISL+   YK+I++VL+ RL+ 
Subjt:  FSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKM

Query:  VLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPS
        VL +TI+++Q AFV  RQILDA L+ANE++++     +KG+V K+D EKA+D V+W+F+D +L  KGFG+ WR WI GCL SVN+SI+INGKPRGK   S
Subjt:  VLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPS

Query:  RGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNIN---------------
        RG+RQGDP  PFLF LVSD LSR++  + +++ +     G+  + V+HLQFADDT+      ++   N+  ++K+F   SG+ IN               
Subjt:  RGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNIN---------------

Query:  ----------------------------------------------------YSKSGRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSR
                                                             SK GR TL QAVLSS P+YY+SLFK+P  VA  +++L R+F WEG  
Subjt:  ----------------------------------------------------YSKSGRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSR

Query:  GDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGN
             H + W  V      GG+GIG+ + RN AL AKW+WRF  E NSLW+++I +K Y  +        I K S ++PWR I+   +      +  +GN
Subjt:  GDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGN

Query:  GLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVG--ETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNC-TDSWIWPLESSNIFSVKSLM
        G    FW D WL  GIL   FPRL  L+ R    +            WD   RRNL++ E  E + L  IL  + L     D   W +E    FS KS  
Subjt:  GLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVG--ETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNC-TDSWIWPLESSNIFSVKSLM

Query:  EDLVDYPNMANDLYKVIWTDFYPKKIKIF
          L+         +  IW    P KI+ F
Subjt:  EDLVDYPNMANDLYKVIWTDFYPKKIKIF

M5XHS0    Reverse transcriptase domain-containing protein (Fragment)    1.2e-190    37.77
Query:  RIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVT
        +++  +W S    W    S+G SGGI ++W+    SV +++ G FS+SI I       +WLS IYGP R  +R+ FW EL DL G  G+ W LGGDFNV 
Subjt:  RIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADRSEFWNELHDLAGLGGENWILGGDFNVT

Query:  RWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPF---
        R+S EKS+   VT+SMR FN +I + +L D  L N  +TWS+  EN  C  +DRFL++ +  + F   R   L R+TSDH P  L    + WGP PF   
Subjt:  RWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDLSWGPCPF---

Query:  --WLKIDSFSGLMDNWWSQNTIQGWPGHGFM---------------MKLKLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKE
          WL    F   +  WW ++ I GW G+ FM                + +L +LD  E    L     S R  L  +I DL+ +E + W Q  K+ W +E
Subjt:  --WLKIDSFSGLMDNWWSQNTIQGWPGHGFM---------------MKLKLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKE

Query:  GDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGP
        GD NTKFFHR+ +  +++N+I ++   +   +  D +IE E + F++ L++ ++   +    ++WCPIS  ++  LE  F  +EV +AV   G  KSPGP
Subjt:  GDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGP

Query:  DGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILD
        DGF++ FF+  W  +K D+M +M+DF+ +GI+N   N+T+ICLIPKK ++  V+D+RPISL+   YK+I++VL  RL+ VL +TI+++Q AFV  RQILD
Subjt:  DGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILD

Query:  ASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCL
        A L+ANE++++     +KG+V K+D EKA+D V+W+F+D +L  KGFG  WR WI GCL SVN+SI+INGKPRGK   SRG+RQGDP  PFLF LVSD L
Subjt:  ASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCL

Query:  SRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNIN------------------------------------
        SR++  + +++ +     G+  + V+HLQFADDT+      ++   N+  ++K+F   SG+ IN                                    
Subjt:  SRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNIN------------------------------------

Query:  -------------------------------YSKSGRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGG
                                        SK GR TL QAVLSS P+YY+SLFK+P  VA  +++L R+F WEG       H + W  V      GG
Subjt:  -------------------------------YSKSGRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGG

Query:  IGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNF
        +GIG+ + RN AL AKW+WRF  E NSLW+++I +K Y  +        I K S ++PWR I+   +      +  +GNG    FW D WL  GIL   F
Subjt:  IGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNF

Query:  PRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNC-TDSWIWPLESSNIFSVKSLMEDLVDYPNMANDLYKVIWTDFYP
        PRL  L+ R +S                 +RNL++ E  E + L  IL  + L     D   W +E    FS KS    L+         +  IW    P
Subjt:  PRLYRLTDRPRSLVGETWIASQTAWDLSLRRNLNDVETEEWMALSLILCAISLQNC-TDSWIWPLESSNIFSVKSLMEDLVDYPNMANDLYKVIWTDFYP

Query:  KKIKIF
         KI+ F
Subjt:  KKIKIF

SwissProt top hits    e value    % identity    Alignment
O00370    LINE-1 retrotransposable element ORF2 protein    1.0e-26    25.32
Query:  RIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLF-TKDRGTRFIPTNVDWCP---ISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTV
        R++  ++ KN I  I + +G+      +I+     +Y+ L+  K      + T +D      ++  +   L    +  E+   + SL T KSPGPDGFT 
Subjt:  RIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLF-TKDRGTRFIPTNVDWCP---ISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTV

Query:  EFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKK-LDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLI
        EF++     +   ++ + +     GI+  S  +  I LIPK   D     +FRPISL+    KI+ ++L++R++  +   I  +Q+ F+   Q       
Subjt:  EFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKK-LDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLI

Query:  ANELIDDWNLSHKKG-VVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRL
        +  +I   N +  K  V+I +D EKAFDK+   F+   L   G   ++ K I         +II+NG+         G RQG P  P LF +V + L+R 
Subjt:  ANELIDDWNLSHKKG-VVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRL

Query:  LSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKS-------GRHTLTQAV------LSSKPTYYLSLFK
        +     +  I    +G   + ++   FADD +++      +  N+  ++  F   SG  IN  KS        R T +Q +      ++SK   YL + +
Subjt:  LSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKS-------GRHTLTQAV------LSSKPTYYLSLFK

Query:  LPRKVAKTLDKLFRDFFWEGSRGDGGMHNI--NWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRF
        L R V     + ++    E         NI  +W             +G      +A+L K I+RF
Subjt:  LPRKVAKTLDKLFRDFFWEGSRGDGGMHNI--NWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRF

P08548    LINE-1 reverse transcriptase homolog    7.9e-27    23.47
Query:  IYGPSRHADRSEFWNE-LHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDT-----PLQNGCYTWSSCGENHYCSLIDRFLM
        IY P+ +A   +F  E L D++ L     I+ GDFN      ++S  + +++ +   N  I    L D      P +   YT+ S     Y S ID  L 
Subjt:  IYGPSRHADRSEFWNE-LHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDT-----PLQNGCYTWSSCGENHYCSLIDRFLM

Query:  TDTCLNKFGVARFLRLDRVTSDHYPCTLSFGD--------LSWGPCPFWLK----IDSFSGLMDNWWSQNTIQG------WPG------------HGFMM
          + L+KF     +    + SDH+   +   +         +W      LK    ID     +  +  QN  Q       W                F+ 
Subjt:  TDTCLNKFGVARFLRLDRVTSDHYPCTLSFGD--------LSWGPCPFWLK----IDSFSGLMDNWWSQNTIQG------WPG------------HGFMM

Query:  KLK----------LKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDND
        K +          LK L+  E   P  + +    + +R ++ ++  +  I      K  + ++ ++  K    +   ++ K+ IS I  R GN   T + 
Subjt:  KLK----------LKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDND

Query:  IEAEFL--GFYQTLFT-KDRGTRFIPTNVDWC---PISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGI
         E + +   +Y+ L++ K    + I   ++ C    +S  +   L    S  E+   +++L   KSPGPDGFT EF++     +   ++ + ++    GI
Subjt:  IEAEFL--GFYQTLFT-KDRGTRFIPTNVDWC---PISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGI

Query:  INVSLNKTYICLIPKK-LDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWN-LSHKKGVVIKLDLEKA
        +  +  +  I LIPK   D     ++RPISL+    KI+ ++L++R++  +   I  +Q+ F+   Q       +  +I   N L +K  +++ +D EKA
Subjt:  INVSLNKTYICLIPKK-LDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWN-LSHKKGVVIKLDLEKA

Query:  FDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQ
        FD +   F+   L+  G    + K I    S    +II+NG          G RQG P  P LF +V + L+  +     +  I    IG+  + ++   
Subjt:  FDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQ

Query:  FADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKS
        FADD +++    +D+   + +++K +   SG  IN  KS
Subjt:  FADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKS

P0C2F6    Putative ribonuclease H protein At1g65750    7.9e-27    33.21
Query:  SKSGRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKL
        S +GR TLT+AVLSS P + +S   LP+ +   LD+L R F W  +      H + W+ V  P   GG+G+   ++ N AL++K  WR L E+NSLW  +
Subjt:  SKSGRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKL

Query:  IVAKYYNSEL-PSLWPSIIQKSSHKSPWRFITSTI-DLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLR
        +  KY+  E+  S W  +I K S  S WR I   + D+VS  V    G+G    FW D W+S G          R TD    +  + WI  +  WD +  
Subjt:  IVAKYYNSEL-PSLWPSIIQKSSHKSPWRFITSTI-DLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLR

Query:  RNLNDVETEEWMALSLILCAISL-QNCTDSWIWPLESSNIFSVKSLME----DLVDYPNMANDLYKVIWTDFYPKKIKIF
            D  T     L L    + L     D   W       FSV+S  E    D V  PNMA+  +  +W    P+++K F
Subjt:  RNLNDVETEEWMALSLILCAISL-QNCTDSWIWPLESSNIFSVKSLME----DLVDYPNMANDLYKVIWTDFYPKKIKIF

P11369    LINE-1 retrotransposable element ORF2 protein    4.6e-27    26.02
Query:  RIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTK-----DRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFT
        R+    + K  I++I + +G+      +I+     FY+ L++      D   +F+        ++  Q   L +  S  E+   + SL T KSPGPDGF+
Subjt:  RIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTK-----DRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFT

Query:  VEFFKFSWHTIKHDIMTMMEDFYN----TGIINVSLNKTYICLIPK-KLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQ--
         EF++    T K D++ ++   ++     G +  S  +  I LIPK + D   + +FRPISL+    KI+ ++L++R++  + + I  +Q+ F+   Q  
Subjt:  VEFFKFSWHTIKHDIMTMMEDFYN----TGIINVSLNKTYICLIPK-KLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQ--

Query:  -ILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILV
          +  S+     I+   L  K  ++I LD EKAFDK+   F+  +L+  G    +   I    S    +I +NG+    I    G RQG P  P+LF +V
Subjt:  -ILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPF-PFLFILV

Query:  SDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKSGRHTLTQAVLSSKPTYYLSLFKLPRKVA
         + L+R +     +  I    IG   + ++ L  ADD +++    K++   + +++  F    G  IN +KS     T+   + K     + F +     
Subjt:  SDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASGLNINYSKSGRHTLTQAVLSSKPTYYLSLFKLPRKVA

Query:  K----TLDKLFRDFF
        K    TL K  +D +
Subjt:  K----TLDKLFRDFF

P14381    Transposon TX1 uncharacterized 149 kDa protein    1.4e-39    24.86
Query:  SHIGWTSLDSVGASGGILIMWS---EPE-FSVKETIQGLFSLSIHIFMADN-FSFWLSAIYGPSRHADRSEFWNELHDLAGL--GGENWILGGDFNVTRW
        +H+ WTS        G++ ++S   +PE  S    I G     +H+ + ++  ++ L  +Y P+   +R+ F+  L          E  I+GGDFN T  
Subjt:  SHIGWTSLDSVGASGGILIMWS---EPE-FSVKETIQGLFSLSIHIFMADN-FSFWLSAIYGPSRHADRSEFWNELHDLAGL--GGENWILGGDFNVTRW

Query:  SWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNG----CYTWSSCGENHYC-SLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDL-SWGPCP
        + +++  +    S  +  + IA + L+D   +       +T+    + H   S IDR  ++   +++   +  +RL    SDH   +L      S     
Subjt:  SWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNG----CYTWSSCGENHYC-SLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDL-SWGPCP

Query:  FW------LKIDSFS-GLMDNW--WSQ-----NTIQGWPGHGFMMKLKLKLL----------DDTEDMVPLSTEQISSRRLLR---------EQIEDLSA
        +W      L+ + F+  + D W  W        T+  W   G   K+ LKLL              ++  L+ E +   + L          E +E   A
Subjt:  FW------LKIDSFS-GLMDNW--WSQ-----NTIQGWPGHGFMMKLKLKLL----------DDTEDMVPLSTEQISSRRLLR---------EQIEDLSA

Query:  QEHIYWHQC------CKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCP--------IS
          ++   Q        ++  L + D  ++FF+ +   +  +  I+ + + +G  L     I      FYQ LF+ D      P + D C         +S
Subjt:  QEHIYWHQC------CKLNWLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCP--------IS

Query:  DSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKII
        + +   LE   + DE+ QA++ +  +KSPG DG T+EFF+F W T+  D   ++ + +  G + +S  +  + L+PKK D + + ++RP+SL+   YKI+
Subjt:  DSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKII

Query:  ARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIIN
        A+ +S RLK VL   I  +Q   V  R I D   +  +L+     +      + LD EKAFD+VD  +L   LQA  FG  +  ++    +S    + IN
Subjt:  ARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKVDWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIIN

Query:  GKPRGKIIPSRGIRQGDPF----------PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASG
              +   RG+RQG P           PFL  L+   L+ L+    +M  ++S              +ADD +L +    D L    +  +++  AS 
Subjt:  GKPRGKIIPSRGIRQGDPF----------PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDALANMFDIVKIFELASG

Query:  LNINYSKS
          IN+SKS
Subjt:  LNINYSKS

Arabidopsis top hits    e value    % identity    Alignment
AT1G43760.1    DNAse I-like superfamily protein    3.4e-25    26.12
Query:  RSMRIFNQWIADYHLIDTPLQNGCYTWSS-CGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDL------SWGPCPFWLKIDSFSG
        R +  F   + D  L+D P +   YTWS+   +N     +DR +      + F  A  +      SDH PC +   +L       +    F     +F  
Subjt:  RSMRIFNQWIADYHLIDTPLQNGCYTWSS-CGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPCTLSFGDL------SWGPCPFWLKIDSFSG

Query:  LMDNWWSQNTIQGWPGHGFMMKLKLK----------------LLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHI--------------YWHQCCKLN
         +   W +    G   H F +   LK                +   T++ +  S E I S +LL    + L   EH+              ++ Q  ++ 
Subjt:  LMDNWWSQNTIQGWPGHGFMMKLKLK----------------LLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHI--------------YWHQCCKLN

Query:  WLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNV----DWCPI--SDSQSMGLEAVFSEDEVYQAVK
        WL++GD NT+FFH+++ A + KN I  +   +   +     ++   + +Y  L   D      P +V    D  P   +D+ +  L A+ S+ E+  AV 
Subjt:  WLKEGDENTKFFHRIMAARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNV----DWCPI--SDSQSMGLEAVFSEDEVYQAVK

Query:  SLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKII
        ++  +K+PGPD FT EFF  SW  +K   +  +++F+ TG +    N T I LIPK      +S FRP+S     YKII
Subjt:  SLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMMEDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKII

AT4G20520.1    RNA binding;RNA-directed DNA polymerases    1.6e-06    35.8
Query:  DRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGV----VIKLDLEKAFDKVDWDFLDAILQAKGFGLVW
        +RLK ++ + I   Q +F+  R   D  +   E +   ++  KKGV    ++KLDLEKA+D++ WD+L+  L + GF  VW
Subjt:  DRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGV----VIKLDLEKAFDKVDWDFLDAILQAKGFGLVW

AT4G29090.1    Ribonuclease H-like superfamily protein    1.6e-14    30.77
Query:  PTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPS
        PTY ++ F LP+ V K +  +  DF+W   +   GMH   W  +      GGIG  + +  NLALL K +WR L    SL  K+  ++Y++   P   P 
Subjt:  PTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWPS

Query:  IIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWL
            S     W+ I ++ +++    +  +GNG D   W   WL
Subjt:  IIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWL

ATMG00310.1    RNA-directed DNA polymerase (reverse transcriptase)-related family protein    4.5e-09    26.53
Query:  PTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATV-QLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWP
        P Y +S F+L + + K L     +F+W        +  + W  + +     GG+G  +    N ALLAK  +R +H+ ++L  +L+ ++Y+         
Subjt:  PTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATV-QLPHLMGGIGIGNFQNRNLALLAKWIWRFLHEENSLWYKLIVAKYYNSELPSLWP

Query:  SIIQKSSHKSP---WRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWL
        S+++ S    P   WR I    +L+S  + R +G+G+ T  W D W+
Subjt:  SIIQKSSHKSP---WRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWL

ATMG01250.1    RNA-directed DNA polymerase (reverse transcriptase)    3.6e-11    48.53
Query:  IINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDT
        IING P+G + PSRG+RQGDP  P+LFIL ++ LS L   +    R+    + N+   +NHL FADDT
Subjt:  IINGKPRGKIIPSRGIRQGDPF-PFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDT


Sequences
CDS sequence
ATGAAAACCACAGAGACTCACCCGATACAACCATGGAATCATTCTACCCGATCTATCTCCATCGATAGGAAAAATTTCACAATAGCTTTTGATGAACATTTTAGAGGAAG
TAGAGCCAAGATCACTGAATATAGCAGATACTCATCTCATTCGATTTCTCTTTCTTGGAAATCTCTAAAATGGCTTGCCTCATCTTTCAACACCATTGTTCACTCACCAT
GTTCGCACAAGTTTTTCTCGGACTTAAGGAGCGACAACTATACTCTTTGGATCGAAAAGTTGAATAACAAGAATGGCTTTTTTGTTGAAGTCAATCAAGTGCAAAATTCT
GGTAATCGACAAAGAATACTAATCCCATCGGAAAATAACAAACAAGGATGGTTTTCCTTCTTCTCGTTAATCTCCGATTACCCTGCTGAAGCCCATCGACAACCCACAAA
GCCAACTCCTATATCATTCAAGGACATCCTCCAATCAAAGCCACCAACTGCTACCATTACTCCTCCCTTGAAAGAGCCCTCAAAGGAGCCTTTAGCCTCCACAATTGATG
AAGAATGGCAAGAGATAATTGTTCTCCAACGAAGCAATCTACATGATGACTGGCCGAGCATTCATCAATCACTTGTTGCCGGCCAAGTACTTCGATGCAGCATCAATCCT
TTCCAGGCAAATAAAGCCATGCTCCATGTTTATGATCGAGCCATTGCTACAAATTTATGTGCTCTCTCCGATTGGACCCTAATTGGTAAGCATAAGATGAAATTTTATCC
TTTAACCACTTCTGCTGCTCAACAGGATATTTTGACACCATCTTATGGAGGTTGGATTGAGATCTCTTCTCTTCCCCCTACCTTATGGACTGAGCGTATTTTTCGTTTTA
TTGGGGATTCTTGCGGCGGCTTTGTGGAGACTTCTAACCTCACCAATCGAATGATAATAGCAACTGAGGCTAGGATAAAAATTCGACCAAACTCTTCTGGTTTCATTCCC
GCCGCCGTTAAACTACCAACAGACTTGGCCGGCGATGAACTCACTGTGCAAATCAAAGGCATATCCGGCAACTCTCAGAGAATCGGCCTCATTAATGATGGAATACCTAA
TATGACGTATCAGACTTCCCCTTCCCCGATTGACACATTGCCTCCACTACAGCAGAAGCTTACCCATAATACTTCCTCTCCCAAACTATTGGAACCCCCACAAATCCCAC
CCTACCCTTCCCCACGGCCTTCACCAACACCAAATATGAAGTCTCCAACAAATACATTTCCCAATTGCCTACAACACTTAGCCCCAATCTTAAGTAAGCATGGTCTCTGT
ATTATGGCTATACCGACAGTACCAAAGTCAAGTAAAAAGAAAAAATTGGCAACTACAGGCAAGAAACCTAAACTACAAAGGGAGGAGACTAAAAAATCATTGATTTGCAG
TCGGATTATTAAATCTCTTTGGAGCTCTTCTCATATTGGCTGGACTTCTCTTGACTCAGTGGGCGCCTCTGGAGGCATTCTTATTATGTGGAGCGAACCAGAATTTTCAG
TAAAGGAGACTATTCAAGGTCTTTTCTCTCTCTCTATTCATATCTTTATGGCTGATAATTTCTCTTTTTGGCTATCGGCTATTTATGGCCCTTCTAGACATGCTGATAGA
TCAGAATTCTGGAATGAACTACATGACTTGGCTGGATTAGGTGGTGAAAATTGGATTCTTGGAGGAGATTTTAATGTCACCCGTTGGTCATGGGAAAAATCGCATGGTCG
ACCAGTGACTAGGAGTATGCGCATTTTCAACCAATGGATTGCTGACTATCATCTTATAGACACTCCTTTACAGAATGGTTGCTATACGTGGTCCAGTTGCGGTGAAAATC
ATTATTGCTCATTGATTGATCGATTCTTAATGACGGATACCTGTCTCAATAAATTTGGTGTAGCTCGTTTTCTTCGACTCGATAGGGTTACATCGGACCATTACCCTTGT
ACTCTATCTTTTGGAGATCTCTCTTGGGGTCCTTGCCCCTTTTGGTTGAAAATAGACTCTTTTAGTGGTCTTATGGATAATTGGTGGTCTCAAAACACCATTCAGGGTTG
GCCAGGCCATGGGTTTATGATGAAGCTTAAGTTAAAATTGTTAGATGATACAGAAGACATGGTTCCTTTATCTACGGAACAAATATCCTCGAGAAGATTATTGCGTGAAC
AAATTGAGGATTTATCAGCTCAAGAACACATTTATTGGCATCAATGCTGTAAATTGAACTGGTTAAAAGAAGGGGATGAAAATACAAAATTTTTCCATAGAATTATGGCT
GCTCGTAAAAGAAAGAATTTCATCTCAGAGATCTTGTCTAGAGAAGGAAATAGCCTCTTTACAGATAATGACATTGAAGCGGAGTTCCTTGGTTTTTATCAAACTTTATT
TACAAAGGACCGTGGCACTAGATTTATACCAACTAATGTTGATTGGTGTCCAATTAGTGACTCGCAATCGATGGGATTGGAGGCGGTTTTTTCTGAAGATGAGGTTTATC
AAGCGGTTAAATCTCTAGGCACGAGTAAATCTCCAGGTCCAGATGGTTTTACAGTAGAATTCTTCAAATTTTCTTGGCATACCATCAAACACGATATTATGACTATGATG
GAGGATTTCTATAATACAGGTATTATCAATGTCTCTTTGAATAAAACTTATATCTGCCTTATTCCAAAGAAATTAGACGCTAAATCAGTATCAGACTTCCGACCGATTAG
TCTAATTCCATGTGCATACAAGATCATAGCTCGAGTTTTGTCCGATAGATTGAAAATGGTCTTGCCATCCACTATTGCGGAAAATCAAATGGCCTTTGTGGCTAACAGAC
AAATTCTTGATGCTTCTCTTATAGCAAATGAACTGATTGATGATTGGAATTTATCTCATAAAAAAGGTGTGGTAATTAAGTTAGATCTTGAAAAGGCTTTTGATAAAGTC
GATTGGGACTTTCTGGATGCAATTCTTCAAGCCAAGGGTTTTGGATTGGTATGGAGGAAATGGATATATGGATGTCTTTCTAGTGTTAACTACTCTATCATTATCAATGG
AAAACCACGAGGCAAGATCATTCCCTCTCGAGGCATTCGTCAAGGGGACCCCTTCCCCTTCCTTTTTATCTTGGTATCAGATTGTCTAAGTCGTTTATTATCTCACAGTG
CTAATATGGATCGAATTGTCTCACATCCAATTGGAAATTCACATCTTTATGTGAATCATTTACAATTCGCTGATGATACTTTATTATTCTCCATCTTTTGTAAGGATGCT
TTGGCCAACATGTTCGATATTGTTAAAATTTTTGAGCTAGCTTCTGGATTGAATATTAATTATTCCAAGAGTGGTCGACATACTCTTACTCAAGCAGTTCTTTCCAGTAA
GCCAACATATTATCTATCCTTATTCAAACTGCCGAGAAAGGTTGCAAAAACTCTTGATAAGTTGTTTCGTGATTTCTTTTGGGAGGGATCTAGAGGTGATGGTGGTATGC
ACAATATTAACTGGGCAACAGTACAACTTCCACATTTGATGGGGGGTATTGGTATTGGTAACTTTCAAAATCGCAATCTTGCTCTTCTTGCAAAGTGGATTTGGAGATTT
TTACATGAGGAAAATTCTCTATGGTATAAGCTGATTGTAGCTAAATATTATAACTCTGAGTTGCCTAGTCTTTGGCCTAGCATTATTCAAAAAAGTTCTCACAAATCTCC
TTGGCGATTCATTACGTCTACTATTGACCTTGTATCTTCACGTGTTAAAAGAAGGTTGGGTAATGGTCTTGATACTTCTTTCTGGCATGATTCATGGTTAAGTTGTGGTA
TTTTGGCTACAAATTTTCCTCGCCTTTATCGTTTAACAGATCGTCCGAGGAGTTTGGTTGGTGAAACATGGATTGCTTCTCAAACAGCATGGGATCTGAGTCTTCGGCGT
AATTTAAATGATGTAGAGACAGAGGAATGGATGGCTTTATCACTTATTCTTTGCGCCATCAGCTTACAGAACTGTACTGATTCCTGGATTTGGCCTTTGGAATCGTCCAA
TATTTTTTCTGTTAAATCTCTCATGGAAGACTTAGTAGACTATCCGAATATGGCAAATGATCTATATAAGGTCATTTGGACAGATTTTTATCCAAAGAAGATCAAGATTT
TTTATGGGAGCTTAGTCATGGTGCTATTAATACTGCTGATCGACTTCAACGACGAATGCCTCATTTTCATTTGTCTCCATCTTGGTGCATAA
mRNA sequence
ATGAAAACCACAGAGACTCACCCGATACAACCATGGAATCATTCTACCCGATCTATCTCCATCGATAGGAAAAATTTCACAATAGCTTTTGATGAACATTTTAGAGGAAG
TAGAGCCAAGATCACTGAATATAGCAGATACTCATCTCATTCGATTTCTCTTTCTTGGAAATCTCTAAAATGGCTTGCCTCATCTTTCAACACCATTGTTCACTCACCAT
GTTCGCACAAGTTTTTCTCGGACTTAAGGAGCGACAACTATACTCTTTGGATCGAAAAGTTGAATAACAAGAATGGCTTTTTTGTTGAAGTCAATCAAGTGCAAAATTCT
GGTAATCGACAAAGAATACTAATCCCATCGGAAAATAACAAACAAGGATGGTTTTCCTTCTTCTCGTTAATCTCCGATTACCCTGCTGAAGCCCATCGACAACCCACAAA
GCCAACTCCTATATCATTCAAGGACATCCTCCAATCAAAGCCACCAACTGCTACCATTACTCCTCCCTTGAAAGAGCCCTCAAAGGAGCCTTTAGCCTCCACAATTGATG
AAGAATGGCAAGAGATAATTGTTCTCCAACGAAGCAATCTACATGATGACTGGCCGAGCATTCATCAATCACTTGTTGCCGGCCAAGTACTTCGATGCAGCATCAATCCT
TTCCAGGCAAATAAAGCCATGCTCCATGTTTATGATCGAGCCATTGCTACAAATTTATGTGCTCTCTCCGATTGGACCCTAATTGGTAAGCATAAGATGAAATTTTATCC
TTTAACCACTTCTGCTGCTCAACAGGATATTTTGACACCATCTTATGGAGGTTGGATTGAGATCTCTTCTCTTCCCCCTACCTTATGGACTGAGCGTATTTTTCGTTTTA
TTGGGGATTCTTGCGGCGGCTTTGTGGAGACTTCTAACCTCACCAATCGAATGATAATAGCAACTGAGGCTAGGATAAAAATTCGACCAAACTCTTCTGGTTTCATTCCC
GCCGCCGTTAAACTACCAACAGACTTGGCCGGCGATGAACTCACTGTGCAAATCAAAGGCATATCCGGCAACTCTCAGAGAATCGGCCTCATTAATGATGGAATACCTAA
TATGACGTATCAGACTTCCCCTTCCCCGATTGACACATTGCCTCCACTACAGCAGAAGCTTACCCATAATACTTCCTCTCCCAAACTATTGGAACCCCCACAAATCCCAC
CCTACCCTTCCCCACGGCCTTCACCAACACCAAATATGAAGTCTCCAACAAATACATTTCCCAATTGCCTACAACACTTAGCCCCAATCTTAAGTAAGCATGGTCTCTGT
ATTATGGCTATACCGACAGTACCAAAGTCAAGTAAAAAGAAAAAATTGGCAACTACAGGCAAGAAACCTAAACTACAAAGGGAGGAGACTAAAAAATCATTGATTTGCAG
TCGGATTATTAAATCTCTTTGGAGCTCTTCTCATATTGGCTGGACTTCTCTTGACTCAGTGGGCGCCTCTGGAGGCATTCTTATTATGTGGAGCGAACCAGAATTTTCAG
TAAAGGAGACTATTCAAGGTCTTTTCTCTCTCTCTATTCATATCTTTATGGCTGATAATTTCTCTTTTTGGCTATCGGCTATTTATGGCCCTTCTAGACATGCTGATAGA
TCAGAATTCTGGAATGAACTACATGACTTGGCTGGATTAGGTGGTGAAAATTGGATTCTTGGAGGAGATTTTAATGTCACCCGTTGGTCATGGGAAAAATCGCATGGTCG
ACCAGTGACTAGGAGTATGCGCATTTTCAACCAATGGATTGCTGACTATCATCTTATAGACACTCCTTTACAGAATGGTTGCTATACGTGGTCCAGTTGCGGTGAAAATC
ATTATTGCTCATTGATTGATCGATTCTTAATGACGGATACCTGTCTCAATAAATTTGGTGTAGCTCGTTTTCTTCGACTCGATAGGGTTACATCGGACCATTACCCTTGT
ACTCTATCTTTTGGAGATCTCTCTTGGGGTCCTTGCCCCTTTTGGTTGAAAATAGACTCTTTTAGTGGTCTTATGGATAATTGGTGGTCTCAAAACACCATTCAGGGTTG
GCCAGGCCATGGGTTTATGATGAAGCTTAAGTTAAAATTGTTAGATGATACAGAAGACATGGTTCCTTTATCTACGGAACAAATATCCTCGAGAAGATTATTGCGTGAAC
AAATTGAGGATTTATCAGCTCAAGAACACATTTATTGGCATCAATGCTGTAAATTGAACTGGTTAAAAGAAGGGGATGAAAATACAAAATTTTTCCATAGAATTATGGCT
GCTCGTAAAAGAAAGAATTTCATCTCAGAGATCTTGTCTAGAGAAGGAAATAGCCTCTTTACAGATAATGACATTGAAGCGGAGTTCCTTGGTTTTTATCAAACTTTATT
TACAAAGGACCGTGGCACTAGATTTATACCAACTAATGTTGATTGGTGTCCAATTAGTGACTCGCAATCGATGGGATTGGAGGCGGTTTTTTCTGAAGATGAGGTTTATC
AAGCGGTTAAATCTCTAGGCACGAGTAAATCTCCAGGTCCAGATGGTTTTACAGTAGAATTCTTCAAATTTTCTTGGCATACCATCAAACACGATATTATGACTATGATG
GAGGATTTCTATAATACAGGTATTATCAATGTCTCTTTGAATAAAACTTATATCTGCCTTATTCCAAAGAAATTAGACGCTAAATCAGTATCAGACTTCCGACCGATTAG
TCTAATTCCATGTGCATACAAGATCATAGCTCGAGTTTTGTCCGATAGATTGAAAATGGTCTTGCCATCCACTATTGCGGAAAATCAAATGGCCTTTGTGGCTAACAGAC
AAATTCTTGATGCTTCTCTTATAGCAAATGAACTGATTGATGATTGGAATTTATCTCATAAAAAAGGTGTGGTAATTAAGTTAGATCTTGAAAAGGCTTTTGATAAAGTC
GATTGGGACTTTCTGGATGCAATTCTTCAAGCCAAGGGTTTTGGATTGGTATGGAGGAAATGGATATATGGATGTCTTTCTAGTGTTAACTACTCTATCATTATCAATGG
AAAACCACGAGGCAAGATCATTCCCTCTCGAGGCATTCGTCAAGGGGACCCCTTCCCCTTCCTTTTTATCTTGGTATCAGATTGTCTAAGTCGTTTATTATCTCACAGTG
CTAATATGGATCGAATTGTCTCACATCCAATTGGAAATTCACATCTTTATGTGAATCATTTACAATTCGCTGATGATACTTTATTATTCTCCATCTTTTGTAAGGATGCT
TTGGCCAACATGTTCGATATTGTTAAAATTTTTGAGCTAGCTTCTGGATTGAATATTAATTATTCCAAGAGTGGTCGACATACTCTTACTCAAGCAGTTCTTTCCAGTAA
GCCAACATATTATCTATCCTTATTCAAACTGCCGAGAAAGGTTGCAAAAACTCTTGATAAGTTGTTTCGTGATTTCTTTTGGGAGGGATCTAGAGGTGATGGTGGTATGC
ACAATATTAACTGGGCAACAGTACAACTTCCACATTTGATGGGGGGTATTGGTATTGGTAACTTTCAAAATCGCAATCTTGCTCTTCTTGCAAAGTGGATTTGGAGATTT
TTACATGAGGAAAATTCTCTATGGTATAAGCTGATTGTAGCTAAATATTATAACTCTGAGTTGCCTAGTCTTTGGCCTAGCATTATTCAAAAAAGTTCTCACAAATCTCC
TTGGCGATTCATTACGTCTACTATTGACCTTGTATCTTCACGTGTTAAAAGAAGGTTGGGTAATGGTCTTGATACTTCTTTCTGGCATGATTCATGGTTAAGTTGTGGTA
TTTTGGCTACAAATTTTCCTCGCCTTTATCGTTTAACAGATCGTCCGAGGAGTTTGGTTGGTGAAACATGGATTGCTTCTCAAACAGCATGGGATCTGAGTCTTCGGCGT
AATTTAAATGATGTAGAGACAGAGGAATGGATGGCTTTATCACTTATTCTTTGCGCCATCAGCTTACAGAACTGTACTGATTCCTGGATTTGGCCTTTGGAATCGTCCAA
TATTTTTTCTGTTAAATCTCTCATGGAAGACTTAGTAGACTATCCGAATATGGCAAATGATCTATATAAGGTCATTTGGACAGATTTTTATCCAAAGAAGATCAAGATTT
TTTATGGGAGCTTAGTCATGGTGCTATTAATACTGCTGATCGACTTCAACGACGAATGCCTCATTTTCATTTGTCTCCATCTTGGTGCATAA
Protein sequence
MKTTETHPIQPWNHSTRSISIDRKNFTIAFDEHFRGSRAKITEYSRYSSHSISLSWKSLKWLASSFNTIVHSPCSHKFFSDLRSDNYTLWIEKLNNKNGFFVEVNQVQNS
GNRQRILIPSENNKQGWFSFFSLISDYPAEAHRQPTKPTPISFKDILQSKPPTATITPPLKEPSKEPLASTIDEEWQEIIVLQRSNLHDDWPSIHQSLVAGQVLRCSINP
FQANKAMLHVYDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSNLTNRMIIATEARIKIRPNSSGFIP
AAVKLPTDLAGDELTVQIKGISGNSQRIGLINDGIPNMTYQTSPSPIDTLPPLQQKLTHNTSSPKLLEPPQIPPYPSPRPSPTPNMKSPTNTFPNCLQHLAPILSKHGLC
IMAIPTVPKSSKKKKLATTGKKPKLQREETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADNFSFWLSAIYGPSRHADR
SEFWNELHDLAGLGGENWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTDTCLNKFGVARFLRLDRVTSDHYPC
TLSFGDLSWGPCPFWLKIDSFSGLMDNWWSQNTIQGWPGHGFMMKLKLKLLDDTEDMVPLSTEQISSRRLLREQIEDLSAQEHIYWHQCCKLNWLKEGDENTKFFHRIMA
ARKRKNFISEILSREGNSLFTDNDIEAEFLGFYQTLFTKDRGTRFIPTNVDWCPISDSQSMGLEAVFSEDEVYQAVKSLGTSKSPGPDGFTVEFFKFSWHTIKHDIMTMM
EDFYNTGIINVSLNKTYICLIPKKLDAKSVSDFRPISLIPCAYKIIARVLSDRLKMVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSHKKGVVIKLDLEKAFDKV
DWDFLDAILQAKGFGLVWRKWIYGCLSSVNYSIIINGKPRGKIIPSRGIRQGDPFPFLFILVSDCLSRLLSHSANMDRIVSHPIGNSHLYVNHLQFADDTLLFSIFCKDA
LANMFDIVKIFELASGLNINYSKSGRHTLTQAVLSSKPTYYLSLFKLPRKVAKTLDKLFRDFFWEGSRGDGGMHNINWATVQLPHLMGGIGIGNFQNRNLALLAKWIWRF
LHEENSLWYKLIVAKYYNSELPSLWPSIIQKSSHKSPWRFITSTIDLVSSRVKRRLGNGLDTSFWHDSWLSCGILATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRR
NLNDVETEEWMALSLILCAISLQNCTDSWIWPLESSNIFSVKSLMEDLVDYPNMANDLYKVIWTDFYPKKIKIFYGSLVMVLLILLIDFNDECLIFICLHLGA