; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g002930 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g002930
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr06:2783479..2795321
RNA-Seq ExpressionLcy06g002930
SyntenyLcy06g002930
Gene Ontology termsGO:0050794 - regulation of cellular process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.0e-28047.23Show/hide
Query:  MWDELCCKITDHIKGCYSMSVHVSLSDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGL
        MW++    I    KG +S+S+ V  ++G  WWLS IYGPA+R++R  FW EL  L  +C   W+LGGDFNV R+  ET++ +PA LSM+ FN+FI++  L
Subjt:  MWDELCCKITDHIKGCYSMSVHVSLSDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGL

Query:  IDPPLVNGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGF
        IDPPL N  YTWSNLR +  +SRLDRFLF+  W   F  H SK L+R TSDHFPI+L++S+ +WGP PFRF N +L +  +  N+E WW +    GY G+
Subjt:  IDPPLVNGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGF

Query:  SFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARR
        SFM RLK LA  +K W        +  K+  I EID+ID LE+ G   +I    R +LKADL Q  L EA+ W Q+CK++W+++GDENS+FFHK+CTAR+
Subjt:  SFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARR

Query:  RRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILK
        ++  I ++I+  G + ++D+ +    I HF  IY  N+ ++  + NLDW PI+      L  PF E E++  +KS   NKAPGPDG+ ++F++K WS +K
Subjt:  RRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILK

Query:  PSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSK
         +I  +F DF  +  +N+VVN T I LI KK      +DFRPISLTT++YK++AK LA+RLK TL DTIS +Q AFV+ RQI++AIL+ANE +DFWR  K
Subjt:  PSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSK

Query:  TKGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGV
         +G +IKLDIEKAFDK++W FID VL+ K Y   WR+ I +CISSV YSI++NG+PRG I+  RGIRQGDPLSPF+FVLAMDYLS+L+     K  ++GV
Subjt:  TKGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGV

Query:  VMG-DISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTI
            ++++TH+LFADDIL+FV+D +  + ++  I+  FE+ASGL INLSKST+  IN+   R   IA  WG     LP +YLG+PLGG P ++ FW   +
Subjt:  VMG-DISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTI

Query:  EKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKW
        +KIQ+++ NW++  LSKGGR+TLI S L S+P+Y +SVFK P  I  ++E     FLW+G S+    +L+RW  + SPK +GGLGIH + STN ALL KW
Subjt:  EKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKW

Query:  IWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAW
        +W+F TE+  LW++ I +KY  +   SFPS  +FSS+ SPW A+++  S F+ N  W+V +G+ I FW DNW+   PL     RL+ LS+NK  +V+E W
Subjt:  IWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAW

Query:  LNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF-YSPGETILNNLWKADIPKKIK
              W+    RPL D E   W+ +   LP P   RG     W  + +  F T   +  +  AP  P  + P   +   LWK + PKK K
Subjt:  LNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF-YSPGETILNNLWKADIPKKIK

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.8e-27242.23Show/hide
Query:  EKFTLSVDLGSLSPISDAPISSPE------NTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKP
        ++  LSVD+G +SP+     S         N  +P  ++     +   N ++      D+  S    +  G  +     + ++                 
Subjt:  EKFTLSVDLGSLSPISDAPISSPE------NTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKP

Query:  NDESFKKKLNDWLTENDFCLVP--TKSVSGLFYFVILTETKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELCCKITDHIKGCYSMSVHVSL
         D +FK+KL  WL EN+  L P  T  V    YF ++   +  +++                    LG  GGIL++WD+   K+ D   G YS+S+++  
Subjt:  NDESFKKKLNDWLTENDFCLVP--TKSVSGLFYFVILTETKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELCCKITDHIKGCYSMSVHVSL

Query:  SDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLVNGSYTWSNLRNRPVMSRLD
        ++G  WWL+ +YGP +  DR   W EL  L  LC  NWL+ GDFN+ R+  ET++ S  K +M NFNNFI+   LIDPP +N ++TWSNLR  P  SRLD
Subjt:  SDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLVNGSYTWSNLRNRPVMSRLD

Query:  RFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFK
        RFL S  W   F  H S+ L RN SDHFPILL++    WGPCPFR +N  L +  F  N   WWN +  +G+PG++F+  L  L++ +K+W+ +    + 
Subjt:  RFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFK

Query:  EKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYE
          K+ L+ EID ID LE  G +       R SLK+DL      +A+ W+QR ++ W   GDEN+++FH++CT  +R+N I  +    G+S+ S + +   
Subjt:  EKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYE

Query:  VINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNI
         I+HF  IY      E ++ NL W PI+    S L  PF E E+   I S  + KAPGPDG+T+ F KK W  LK  +++VF DF ++  VN  VN+T I
Subjt:  VINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNI

Query:  ALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKGVIIKLDIEKAFDKISWDFIDCV
        ALI KK      SD+RPISLTTSLYKI+AK LA RLK  L DTI+ NQ AF++ RQI+DAIL+ANE +D W+  K KG ++KLDIEKAFDKISW FID +
Subjt:  ALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKGVIIKLDIEKAFDKISWDFIDCV

Query:  LLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD-ISITHLLFADDILLFVQDDE
        L  K +P+ WR+WIKACIS+V YSI+LNG P+G I+A+RGIRQGDPLSPF+FVLAMDYLS+L+   E KG + GV   +  +I+HLLFADD+L+FV+D+E
Subjt:  LLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD-ISITHLLFADDILLFVQDDE

Query:  KAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQ
        + + ++   +  FE ASGL  N SKST+S IN++  RT  IA F+G  +  LP+ YLGVPLGG P++  FW  TIE I ++++ W++  +SKGGRLTL++
Subjt:  KAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQ

Query:  SVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH
        + L+S+P Y LS FKAPVS+   +E+    FLW G+     ++L+ W I +SPK  GGLGI K+K TN+ALL KW+WR+  E  SLW+K I AKY+ +H 
Subjt:  SVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH

Query:  NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNE
           P   R SS+ SPW AI K +  + +   W   +G S+ FWH  W    PL     RLY LS+ +S TV+E W      WN +PRRPL +RE Q+W+ 
Subjt:  NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNE

Query:  MTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK
        +   LP   + RG     W  S+   ++   A+ +       P  +  E  L +LW++ IP+K K
Subjt:  MTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]7.1e-26240.86Show/hide
Query:  HKKVNLVAPSGPVYQDINGEK--FTLSVDLGSLSPISDAPISSPENTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDN
        +K V +  P   V  D +  K   +L+VDLG L P  D   S  ++  S  A  V    + ++ E+ +  +  ++  ++  ++    P+  +        
Subjt:  HKKVNLVAPSGPVYQDINGEK--FTLSVDLGSLSPISDAPISSPENTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDN

Query:  SPRKDPQEVIEHGKPNDESFKKKLNDWLTENDFCLVPTKSVSGLFYF--VILTE----TKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELC
           K+         P+ E+FKK+L  WL +N   L      SG      V+L +     K+TN  KRIIKSLW S S+NWIA +A GSSGGILI+WD   
Subjt:  SPRKDPQEVIEHGKPNDESFKKKLNDWLTENDFCLVPTKSVSGLFYF--VILTE----TKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELC

Query:  CKITDHIKGCYSMSVHVSLSDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLV
          +    +G +S+S +  L++  +WWL+G+YGP +RR+R  FW EL++L  L    W+LGGD NV R   E++S+  +  + +  NNFI++  LIDPPL 
Subjt:  CKITDHIKGCYSMSVHVSLSDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLV

Query:  NGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSS--TWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMG
        N  +TWSNLRN P  SR+DRFL++ +W   F  H ++ L R+TSDHFP++ + S+   +WGP PFR ++  L +  F  N+ +WW +++ +GYPGFSF+ 
Subjt:  NGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSS--TWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMG

Query:  RLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQ
        RLK LA  +K W+     S    K  +I E+D ID  E    L    S+ R +LKADL + +L E+++W QR KKLWL +GDENS+FFH++C++R++R+ 
Subjt:  RLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQ

Query:  IHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETE-WIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSI
        IHE+  + GS   ++N +    I  FS IY ++ +++   + NLDW PI     S L +PF E E+ G I S    K PGPDGF I F K  W       
Subjt:  IHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETE-WIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSI

Query:  MSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG
                                                                 LK TL +TIS NQ AFV+ RQI+DAIL+ANE VD+W+V K KG
Subjt:  MSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG

Query:  VIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVM-
         I+KLDIEKAFD ++ DFID VL  K +PN WR+WI+ CIS+V+YS+I+NG+P+G I+A RG+RQGDPLSPFLFV+AMDYLS+L+   E  G + GV + 
Subjt:  VIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVM-

Query:  GDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKI
        G+ +I+H+LFADDILLF++D++  ++++   +  FE ASGL+INL KS +  +N++ +R  + A FWG   HSLP++YLGVPLGG PK+  FW    +KI
Subjt:  GDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKI

Query:  QRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWR
        Q++++NW++  +SKGGRLTLI+S L+S+P+Y LSVF+AP   C  +E++  KFLW GN+ S  S+L+ W  VS  K EGGLGI ++  TN+ALL KW+WR
Subjt:  QRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWR

Query:  FFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNL
        + +E  +LWR+ I  KY        PS+   S+S++PW +I      F +N  W++ NG  I FW+ NWS  G L     RL+ L+ +K ++V++AW   
Subjt:  FFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNL

Query:  DRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK
        D  WN   RR L DRE  +W ++  +LP P S RGS    W+   + SFS   A+ ++     +    P   +L  +WK+ IP KIK
Subjt:  DRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.7e-26346.27Show/hide
Query:  MWDELCCKITDHIKGCYSMSVHVSLSDG---FTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIAD
        MWD+L   +TD I+G +S+S++++  DG     WWLS IYGP+  R+RK FW EL DL   C   WLL GDFNV R+PSETS+ +P+K SM+ FN FIAD
Subjt:  MWDELCCKITDHIKGCYSMSVHVSLSDG---FTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIAD

Query:  TGLIDPPLVNGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGY
        + LIDPPL N  +TWSNLR  PV+SR+DRFL++ NW   F  H+SK LSR TSDHFPI+L++S  +WGP PF+  N  L    F +N+  WW +    G+
Subjt:  TGLIDPPLVNGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGY

Query:  PGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCT
        PGFSFM +LK L+  +++ +  N     E K   I EID ID LE+ G L +  S  R  LKAD+  +   EA+ W Q+ K+LW+ +GDEN++FFHK+C+
Subjt:  PGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCT

Query:  ARRRRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYE-ANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFW
        AR+RR+ I  + S  G    ++  +    ++HF  IY+   +E+ W++ NL+W+PI+ +    L S FTEEE+   + +   NK+P              
Subjt:  ARRRRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYE-ANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFW

Query:  SILKPSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFW
                        ++TV+  +N TNIALI KK      +D+RPISLTTS+YK++AKV+AERLK TL  T++ NQ AFV+ RQI DAIL+ANE +D+W
Subjt:  SILKPSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFW

Query:  RVSKTKGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGL
        R  K +G +IKLDIEKAFDK++W FID +L+ KGYP  WR WI+ACISSV YSII+NG+PRG IQ  RGIRQGDP+SPF+FVLAMDY+S+L+ +  +K  
Subjt:  RVSKTKGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGL

Query:  LSGVVM-GDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFW
        + GV + G+I++THLLFADDILLFV+DDE +I+++  II  F+ ASGL INL+KST+S IN+   RT  IA  WG  +  LPI YLGVPLGG      FW
Subjt:  LSGVVM-GDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFW

Query:  VPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEAL
            EKI +++ +W++  LSKGG++TLI+S L S+P Y LS+FKAPVS C  +E+    FLW     +   +LV W  ++S K +GGLGI ++K TN AL
Subjt:  VPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEAL

Query:  LLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTV
        L KW+WR+  E+  LW+K I+AKY S      P     SSSRSPWF+I K    F  +  W+++NG+S  FWH +W    PL     RLY LS+NK  ++
Subjt:  LLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTV

Query:  EEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKK
         + W N    W+  PRR L + E+  W E+   +       G D   W  + +G ++    +  L            +    NLWK  IPKK
Subjt:  EEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKK

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.4e-27242.23Show/hide
Query:  EKFTLSVDLGSLSPISDAPISSPE------NTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKP
        ++  LSVD+G +SP+     S         N  +P  ++     +   N ++      D+  S    +  G  +     + ++                 
Subjt:  EKFTLSVDLGSLSPISDAPISSPE------NTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKP

Query:  NDESFKKKLNDWLTENDFCLVP--TKSVSGLFYFVILTETKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELCCKITDHIKGCYSMSVHVSL
         D +FK+KL  WL EN+  L P  T  V    YF ++   +  +++                    LG  GGIL++WD+   K+ D   G YS+S+++  
Subjt:  NDESFKKKLNDWLTENDFCLVP--TKSVSGLFYFVILTETKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELCCKITDHIKGCYSMSVHVSL

Query:  SDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLVNGSYTWSNLRNRPVMSRLD
        ++G  WWL+ +YGP +  DR   W EL  L  LC  NWL+ GDFN+ R+  ET++ S  K +M NFNNFI+   LIDPP +N ++TWSNLR  P  SRLD
Subjt:  SDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLVNGSYTWSNLRNRPVMSRLD

Query:  RFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFK
        RFL S  W   F  H S+ L RN SDHFPILL++    WGPCPFR +N  L +  F  N   WWN +  +G+PG++F+  L  L++ +K+W+ +    + 
Subjt:  RFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFK

Query:  EKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYE
          K+ L+ EID ID LE  G +       R SLK+DL      +A+ W+QR ++ W   GDEN+++FH++CT  +R+N I  +    G+S+ S + +   
Subjt:  EKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYE

Query:  VINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNI
         I+HF  IY      E ++ NL W PI+    S L  PF E E+   I S  + KAPGPDG+T+ F KK W  LK  +++VF DF ++  VN  VN+T I
Subjt:  VINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNI

Query:  ALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKGVIIKLDIEKAFDKISWDFIDCV
        ALI KK      SD+RPISLTTSLYKI+AK LA RLK  L DTI+ NQ AF++ RQI+DAIL+ANE++D W+  K KG ++KLDIEKAFDKISW FID +
Subjt:  ALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKGVIIKLDIEKAFDKISWDFIDCV

Query:  LLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD-ISITHLLFADDILLFVQDDE
        L  K +P+ WR+WIKACIS+V YSI+LNG P+G I+A+RGIRQGDPLSPF+FVLAMDYLS+L+   E KG + GV   +  +I+HLLFADD+L+FV+D+E
Subjt:  LLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD-ISITHLLFADDILLFVQDDE

Query:  KAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQ
        + + ++   +  FE ASGL  N SKST+S IN++  RT  IA F+G  +  LP+ YLGVPLGG P++  FW  TIE I ++++ W++  +SKGGRLTL++
Subjt:  KAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQ

Query:  SVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH
        + L+S+P Y LS FKAPVS+   +E+    FLW G+     ++L+ W I +SPK  GGLGI K+K TN+ALL KW+WR+  E  SLW+K I AKY+ +H 
Subjt:  SVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH

Query:  NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNE
           P   R SS+ SPW AI K +  + +   W   +G S+ FWH  W    PL     RLY LS+ +S TV+E W      WN +PRRPL +RE Q+W+ 
Subjt:  NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNE

Query:  MTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK
        +   LP   + RG     W  S+   ++   A+ +       P  +  E  L +LW++ IP+K K
Subjt:  MTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein9.6e-28147.23Show/hide
Query:  MWDELCCKITDHIKGCYSMSVHVSLSDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGL
        MW++    I    KG +S+S+ V  ++G  WWLS IYGPA+R++R  FW EL  L  +C   W+LGGDFNV R+  ET++ +PA LSM+ FN+FI++  L
Subjt:  MWDELCCKITDHIKGCYSMSVHVSLSDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGL

Query:  IDPPLVNGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGF
        IDPPL N  YTWSNLR +  +SRLDRFLF+  W   F  H SK L+R TSDHFPI+L++S+ +WGP PFRF N +L +  +  N+E WW +    GY G+
Subjt:  IDPPLVNGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGF

Query:  SFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARR
        SFM RLK LA  +K W        +  K+  I EID+ID LE+ G   +I    R +LKADL Q  L EA+ W Q+CK++W+++GDENS+FFHK+CTAR+
Subjt:  SFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARR

Query:  RRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILK
        ++  I ++I+  G + ++D+ +    I HF  IY  N+ ++  + NLDW PI+      L  PF E E++  +KS   NKAPGPDG+ ++F++K WS +K
Subjt:  RRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILK

Query:  PSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSK
         +I  +F DF  +  +N+VVN T I LI KK      +DFRPISLTT++YK++AK LA+RLK TL DTIS +Q AFV+ RQI++AIL+ANE +DFWR  K
Subjt:  PSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSK

Query:  TKGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGV
         +G +IKLDIEKAFDK++W FID VL+ K Y   WR+ I +CISSV YSI++NG+PRG I+  RGIRQGDPLSPF+FVLAMDYLS+L+     K  ++GV
Subjt:  TKGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGV

Query:  VMG-DISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTI
            ++++TH+LFADDIL+FV+D +  + ++  I+  FE+ASGL INLSKST+  IN+   R   IA  WG     LP +YLG+PLGG P ++ FW   +
Subjt:  VMG-DISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTI

Query:  EKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKW
        +KIQ+++ NW++  LSKGGR+TLI S L S+P+Y +SVFK P  I  ++E     FLW+G S+    +L+RW  + SPK +GGLGIH + STN ALL KW
Subjt:  EKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKW

Query:  IWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAW
        +W+F TE+  LW++ I +KY  +   SFPS  +FSS+ SPW A+++  S F+ N  W+V +G+ I FW DNW+   PL     RL+ LS+NK  +V+E W
Subjt:  IWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAW

Query:  LNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF-YSPGETILNNLWKADIPKKIK
              W+    RPL D E   W+ +   LP P   RG     W  + +  F T   +  +  AP  P  + P   +   LWK + PKK K
Subjt:  LNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF-YSPGETILNNLWKADIPKKIK

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein2.8e-27242.23Show/hide
Query:  EKFTLSVDLGSLSPISDAPISSPE------NTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKP
        ++  LSVD+G +SP+     S         N  +P  ++     +   N ++      D+  S    +  G  +     + ++                 
Subjt:  EKFTLSVDLGSLSPISDAPISSPE------NTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKP

Query:  NDESFKKKLNDWLTENDFCLVP--TKSVSGLFYFVILTETKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELCCKITDHIKGCYSMSVHVSL
         D +FK+KL  WL EN+  L P  T  V    YF ++   +  +++                    LG  GGIL++WD+   K+ D   G YS+S+++  
Subjt:  NDESFKKKLNDWLTENDFCLVP--TKSVSGLFYFVILTETKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELCCKITDHIKGCYSMSVHVSL

Query:  SDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLVNGSYTWSNLRNRPVMSRLD
        ++G  WWL+ +YGP +  DR   W EL  L  LC  NWL+ GDFN+ R+  ET++ S  K +M NFNNFI+   LIDPP +N ++TWSNLR  P  SRLD
Subjt:  SDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLVNGSYTWSNLRNRPVMSRLD

Query:  RFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFK
        RFL S  W   F  H S+ L RN SDHFPILL++    WGPCPFR +N  L +  F  N   WWN +  +G+PG++F+  L  L++ +K+W+ +    + 
Subjt:  RFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFK

Query:  EKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYE
          K+ L+ EID ID LE  G +       R SLK+DL      +A+ W+QR ++ W   GDEN+++FH++CT  +R+N I  +    G+S+ S + +   
Subjt:  EKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYE

Query:  VINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNI
         I+HF  IY      E ++ NL W PI+    S L  PF E E+   I S  + KAPGPDG+T+ F KK W  LK  +++VF DF ++  VN  VN+T I
Subjt:  VINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNI

Query:  ALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKGVIIKLDIEKAFDKISWDFIDCV
        ALI KK      SD+RPISLTTSLYKI+AK LA RLK  L DTI+ NQ AF++ RQI+DAIL+ANE +D W+  K KG ++KLDIEKAFDKISW FID +
Subjt:  ALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKGVIIKLDIEKAFDKISWDFIDCV

Query:  LLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD-ISITHLLFADDILLFVQDDE
        L  K +P+ WR+WIKACIS+V YSI+LNG P+G I+A+RGIRQGDPLSPF+FVLAMDYLS+L+   E KG + GV   +  +I+HLLFADD+L+FV+D+E
Subjt:  LLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD-ISITHLLFADDILLFVQDDE

Query:  KAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQ
        + + ++   +  FE ASGL  N SKST+S IN++  RT  IA F+G  +  LP+ YLGVPLGG P++  FW  TIE I ++++ W++  +SKGGRLTL++
Subjt:  KAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQ

Query:  SVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH
        + L+S+P Y LS FKAPVS+   +E+    FLW G+     ++L+ W I +SPK  GGLGI K+K TN+ALL KW+WR+  E  SLW+K I AKY+ +H 
Subjt:  SVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH

Query:  NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNE
           P   R SS+ SPW AI K +  + +   W   +G S+ FWH  W    PL     RLY LS+ +S TV+E W      WN +PRRPL +RE Q+W+ 
Subjt:  NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNE

Query:  MTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK
        +   LP   + RG     W  S+   ++   A+ +       P  +  E  L +LW++ IP+K K
Subjt:  MTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein3.4e-26240.86Show/hide
Query:  HKKVNLVAPSGPVYQDINGEK--FTLSVDLGSLSPISDAPISSPENTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDN
        +K V +  P   V  D +  K   +L+VDLG L P  D   S  ++  S  A  V    + ++ E+ +  +  ++  ++  ++    P+  +        
Subjt:  HKKVNLVAPSGPVYQDINGEK--FTLSVDLGSLSPISDAPISSPENTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDN

Query:  SPRKDPQEVIEHGKPNDESFKKKLNDWLTENDFCLVPTKSVSGLFYF--VILTE----TKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELC
           K+         P+ E+FKK+L  WL +N   L      SG      V+L +     K+TN  KRIIKSLW S S+NWIA +A GSSGGILI+WD   
Subjt:  SPRKDPQEVIEHGKPNDESFKKKLNDWLTENDFCLVPTKSVSGLFYF--VILTE----TKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELC

Query:  CKITDHIKGCYSMSVHVSLSDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLV
          +    +G +S+S +  L++  +WWL+G+YGP +RR+R  FW EL++L  L    W+LGGD NV R   E++S+  +  + +  NNFI++  LIDPPL 
Subjt:  CKITDHIKGCYSMSVHVSLSDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLV

Query:  NGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSS--TWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMG
        N  +TWSNLRN P  SR+DRFL++ +W   F  H ++ L R+TSDHFP++ + S+   +WGP PFR ++  L +  F  N+ +WW +++ +GYPGFSF+ 
Subjt:  NGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSS--TWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMG

Query:  RLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQ
        RLK LA  +K W+     S    K  +I E+D ID  E    L    S+ R +LKADL + +L E+++W QR KKLWL +GDENS+FFH++C++R++R+ 
Subjt:  RLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQ

Query:  IHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETE-WIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSI
        IHE+  + GS   ++N +    I  FS IY ++ +++   + NLDW PI     S L +PF E E+ G I S    K PGPDGF I F K  W       
Subjt:  IHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETE-WIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSI

Query:  MSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG
                                                                 LK TL +TIS NQ AFV+ RQI+DAIL+ANE VD+W+V K KG
Subjt:  MSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG

Query:  VIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVM-
         I+KLDIEKAFD ++ DFID VL  K +PN WR+WI+ CIS+V+YS+I+NG+P+G I+A RG+RQGDPLSPFLFV+AMDYLS+L+   E  G + GV + 
Subjt:  VIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVM-

Query:  GDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKI
        G+ +I+H+LFADDILLF++D++  ++++   +  FE ASGL+INL KS +  +N++ +R  + A FWG   HSLP++YLGVPLGG PK+  FW    +KI
Subjt:  GDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKI

Query:  QRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWR
        Q++++NW++  +SKGGRLTLI+S L+S+P+Y LSVF+AP   C  +E++  KFLW GN+ S  S+L+ W  VS  K EGGLGI ++  TN+ALL KW+WR
Subjt:  QRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWR

Query:  FFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNL
        + +E  +LWR+ I  KY        PS+   S+S++PW +I      F +N  W++ NG  I FW+ NWS  G L     RL+ L+ +K ++V++AW   
Subjt:  FFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNL

Query:  DRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK
        D  WN   RR L DRE  +W ++  +LP P S RGS    W+   + SFS   A+ ++     +    P   +L  +WK+ IP KIK
Subjt:  DRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein1.8e-26346.27Show/hide
Query:  MWDELCCKITDHIKGCYSMSVHVSLSDG---FTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIAD
        MWD+L   +TD I+G +S+S++++  DG     WWLS IYGP+  R+RK FW EL DL   C   WLL GDFNV R+PSETS+ +P+K SM+ FN FIAD
Subjt:  MWDELCCKITDHIKGCYSMSVHVSLSDG---FTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIAD

Query:  TGLIDPPLVNGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGY
        + LIDPPL N  +TWSNLR  PV+SR+DRFL++ NW   F  H+SK LSR TSDHFPI+L++S  +WGP PF+  N  L    F +N+  WW +    G+
Subjt:  TGLIDPPLVNGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGY

Query:  PGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCT
        PGFSFM +LK L+  +++ +  N     E K   I EID ID LE+ G L +  S  R  LKAD+  +   EA+ W Q+ K+LW+ +GDEN++FFHK+C+
Subjt:  PGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCT

Query:  ARRRRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYE-ANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFW
        AR+RR+ I  + S  G    ++  +    ++HF  IY+   +E+ W++ NL+W+PI+ +    L S FTEEE+   + +   NK+P              
Subjt:  ARRRRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYE-ANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFW

Query:  SILKPSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFW
                        ++TV+  +N TNIALI KK      +D+RPISLTTS+YK++AKV+AERLK TL  T++ NQ AFV+ RQI DAIL+ANE +D+W
Subjt:  SILKPSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFW

Query:  RVSKTKGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGL
        R  K +G +IKLDIEKAFDK++W FID +L+ KGYP  WR WI+ACISSV YSII+NG+PRG IQ  RGIRQGDP+SPF+FVLAMDY+S+L+ +  +K  
Subjt:  RVSKTKGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGL

Query:  LSGVVM-GDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFW
        + GV + G+I++THLLFADDILLFV+DDE +I+++  II  F+ ASGL INL+KST+S IN+   RT  IA  WG  +  LPI YLGVPLGG      FW
Subjt:  LSGVVM-GDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFW

Query:  VPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEAL
            EKI +++ +W++  LSKGG++TLI+S L S+P Y LS+FKAPVS C  +E+    FLW     +   +LV W  ++S K +GGLGI ++K TN AL
Subjt:  VPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEAL

Query:  LLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTV
        L KW+WR+  E+  LW+K I+AKY S      P     SSSRSPWF+I K    F  +  W+++NG+S  FWH +W    PL     RLY LS+NK  ++
Subjt:  LLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTV

Query:  EEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKK
         + W N    W+  PRR L + E+  W E+   +       G D   W  + +G ++    +  L            +    NLWK  IPKK
Subjt:  EEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKK

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein1.6e-27242.23Show/hide
Query:  EKFTLSVDLGSLSPISDAPISSPE------NTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKP
        ++  LSVD+G +SP+     S         N  +P  ++     +   N ++      D+  S    +  G  +     + ++                 
Subjt:  EKFTLSVDLGSLSPISDAPISSPE------NTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKP

Query:  NDESFKKKLNDWLTENDFCLVP--TKSVSGLFYFVILTETKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELCCKITDHIKGCYSMSVHVSL
         D +FK+KL  WL EN+  L P  T  V    YF ++   +  +++                    LG  GGIL++WD+   K+ D   G YS+S+++  
Subjt:  NDESFKKKLNDWLTENDFCLVP--TKSVSGLFYFVILTETKLTNVSKRIIKSLWSSISVNWIALDALGSSGGILIMWDELCCKITDHIKGCYSMSVHVSL

Query:  SDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLVNGSYTWSNLRNRPVMSRLD
        ++G  WWL+ +YGP +  DR   W EL  L  LC  NWL+ GDFN+ R+  ET++ S  K +M NFNNFI+   LIDPP +N ++TWSNLR  P  SRLD
Subjt:  SDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLVNGSYTWSNLRNRPVMSRLD

Query:  RFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFK
        RFL S  W   F  H S+ L RN SDHFPILL++    WGPCPFR +N  L +  F  N   WWN +  +G+PG++F+  L  L++ +K+W+ +    + 
Subjt:  RFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFK

Query:  EKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYE
          K+ L+ EID ID LE  G +       R SLK+DL      +A+ W+QR ++ W   GDEN+++FH++CT  +R+N I  +    G+S+ S + +   
Subjt:  EKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYE

Query:  VINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNI
         I+HF  IY      E ++ NL W PI+    S L  PF E E+   I S  + KAPGPDG+T+ F KK W  LK  +++VF DF ++  VN  VN+T I
Subjt:  VINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNI

Query:  ALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKGVIIKLDIEKAFDKISWDFIDCV
        ALI KK      SD+RPISLTTSLYKI+AK LA RLK  L DTI+ NQ AF++ RQI+DAIL+ANE++D W+  K KG ++KLDIEKAFDKISW FID +
Subjt:  ALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKGVIIKLDIEKAFDKISWDFIDCV

Query:  LLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD-ISITHLLFADDILLFVQDDE
        L  K +P+ WR+WIKACIS+V YSI+LNG P+G I+A+RGIRQGDPLSPF+FVLAMDYLS+L+   E KG + GV   +  +I+HLLFADD+L+FV+D+E
Subjt:  LLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD-ISITHLLFADDILLFVQDDE

Query:  KAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQ
        + + ++   +  FE ASGL  N SKST+S IN++  RT  IA F+G  +  LP+ YLGVPLGG P++  FW  TIE I ++++ W++  +SKGGRLTL++
Subjt:  KAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQ

Query:  SVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH
        + L+S+P Y LS FKAPVS+   +E+    FLW G+     ++L+ W I +SPK  GGLGI K+K TN+ALL KW+WR+  E  SLW+K I AKY+ +H 
Subjt:  SVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH

Query:  NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNE
           P   R SS+ SPW AI K +  + +   W   +G S+ FWH  W    PL     RLY LS+ +S TV+E W      WN +PRRPL +RE Q+W+ 
Subjt:  NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNE

Query:  MTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK
        +   LP   + RG     W  S+   ++   A+ +       P  +  E  L +LW++ IP+K K
Subjt:  MTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.4e-4723.72Show/hide
Query:  IYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLV----NGSYTWSNLRNRPVMSRLDRFLFSP
        IY P     R F  + L DL      + L+ GDFN      + S+        +  N+ +  T LID        +  YT+ +  +    S++D  + S 
Subjt:  IYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLV----NGSYTWSNLRNRPVMSRLDRFLFSP

Query:  NWCQKFHEHHSKRLSRNTSDHFPILLD--------ASSSTWGPCPFRFDNYFLDNNSFVSNVEQWW--NDAVCSGY----PGFSFMGRLKLLARKVKDWK
            K     ++ ++   SDH  I L+        + S+TW       ++Y++ +N   + ++ ++  N+   + Y      F  + R K +A  +  +K
Subjt:  NWCQKFHEHHSKRLSRNTSDHFPILLD--------ASSSTWGPCPFRFDNYFLDNNSFVSNVEQWW--NDAVCSGY----PGFSFMGRLKLLARKVKDWK

Query:  SSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLK-ADLQQT--ALLEARYW-NQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGG
             S  +     + E+++ +   S        + +R  LK  + Q+T   + E+R W  +R  K+     D   A   ++   +R +NQI  + +  G
Subjt:  SSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLK-ADLQQT--ALLEARYW-NQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGG

Query:  SSIVSDNMMEYEVINHFSAIY----EANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHD
                ++  +  ++  +Y    E  +E +  +       +N + + +L  P T  E+   I S+   K+PGPDGFT EF +++   L P ++ +F  
Subjt:  SSIVSDNMMEYEVINHFSAIY----EANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHD

Query:  FFRSKTVNRVVNHTNIALIPKKSM-AGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG-VIIK
          +   +       +I LIPK         +FRPISL     KIL K+LA R++  ++  I  +Q  F+   Q    I  +  ++     +K K  VII 
Subjt:  FFRSKTVNRVVNHTNIALIPKKSM-AGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG-VIIK

Query:  LDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGDISI
        +D EKAFDKI   F+   L   G    + + I+A     + +IILNG+       K G RQG PLSP LF + ++ L++ I   ++   + G+ +G   +
Subjt:  LDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGDISI

Query:  THLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKN--TQFWVPTIEKIQRR
           LFADD+++++++   + +++  +I +F   SG +IN+ KS     N   Q  + I         S  I YLG+ L    K+   + + P +++I+  
Subjt:  THLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKN--TQFWVPTIEKIQRR

Query:  IHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSV--FKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRF
         + W+ +  S  GR+ +++  +    +Y  +    K P++    +E+   KF+W+          +   I+S     GG+ +   K   +A + K  W +
Subjt:  IHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSV--FKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRF

Query:  F
        +
Subjt:  F

P08548 LINE-1 reverse transcriptase homolog2.1e-4322.89Show/hide
Query:  IYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLV----NGSYTWSNLRNRPVMSRLDRFLFSP
        IY P      +F    L D+  L     ++ GDFN      + SS       + + N+ I    L D           YT+ +  +    S++D  L   
Subjt:  IYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLIDPPLV----NGSYTWSNLRNRPVMSRLDRFLFSP

Query:  NWCQK----------FHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNS
        +   K          F +HH  ++  N + +    L   + TW     + +N  L +   +  +++     +       +    L   A+ V   K    
Subjt:  NWCQK----------FHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNS

Query:  ESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLK-ADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKV---------CTARRRRNQIHELISK
        ++F +K     TE + +++L  MG+L  +      + K +  ++   + A       K++        S FF K+          T ++R   +   I  
Subjt:  ESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLK-ADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKV---------CTARRRRNQIHELISK

Query:  GGSSIVSDNMMEYEVINH-----FSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSV
        G   I +D     +++N      +S  YE  +E +  +       ++   +  L  P +  E+   I+++   K+PGPDGFT EF + F   L P ++++
Subjt:  GGSSIVSDNMMEYEVINH-----FSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSV

Query:  FHDFFRSKTVNRVVNHTNIALIPKKSM-AGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG-V
        F +  +   +       NI LIPK         ++RPISL     KIL K+L  R++  ++  I  +Q  F+   Q    I  +  ++      K K  +
Subjt:  FHDFFRSKTVNRVVNHTNIALIPKKSM-AGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG-V

Query:  IIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD
        I+ +D EKAFD I   F+   L   G   T+ + I+A  S  + +IILNG    +   + G RQG PLSP LF + M+ L+  I   E+K  + G+ +G 
Subjt:  IIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGD

Query:  ISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTV---SGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKN--TQFWVPTI
          I   LFADD+++++++   +   +  +IK + N SG +IN  KS     +  N  E+   D   F         + YLGV L    K+   + +    
Subjt:  ISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTV---SGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKN--TQFWVPTI

Query:  EKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSV--FKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLL
        ++I   ++ W+ +  S  GR+ +++  +    +Y  +    KAP+S    +E+I+  F+W+          +   ++S+    GG+ +  ++   +++++
Subjt:  EKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSV--FKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLL

Query:  KWIWRFF-TEEKSLWRK
        K  W +    E  +W +
Subjt:  KWIWRFF-TEEKSLWRK

P0C2F6 Putative ribonuclease H protein At1g657502.4e-3132.66Show/hide
Query:  IEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLK
        +E++  R+  WR  +LS  GRLTL ++VL+SMP++ +S    P SI NR++Q+   FLW   +     +LV+W  V SPK EGGLG+   KS N AL+ K
Subjt:  IEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLK

Query:  WIWRFFTEEKSLWRKFISAKYS----SDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLT
          WR   E+ SLW   +  KY      D     P  S  S+ RS   AI  L+        W   +G+ I FW D W    PL  + D   + +   ++ 
Subjt:  WIWRFFTEEKSLWRKFISAKYS----SDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLT

Query:  VEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGS-DVHRWLASEDGSFSTKVARSVLLV-APPRPFYSPGETILNNLWKADIPKKIK
         ++ W+   R W+F    P      +    +     + D   G+ D   W  S+DG FS + A  +L V   PRP  +   +  N LWK  +P+++K
Subjt:  VEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGS-DVHRWLASEDGSFSTKVARSVLLV-APPRPFYSPGETILNNLWKADIPKKIK

P11369 LINE-1 retrotransposable element ORF2 protein7.3e-4427.86Show/hide
Query:  INIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNIALIPK-KSMAGHISDFRPISLTTSLY
        +N D +  L SP + +E+   I S+   K+PGPDGF+ EF + F   L P +  +FH      T+        I LIPK +     I +FRPISL     
Subjt:  INIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNIALIPK-KSMAGHISDFRPISLTTSLY

Query:  KILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG-VIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYS
        KIL K+LA R++  ++  I  +Q  F+   Q    I  +  ++ +    K K  +II LD EKAFDKI   F+  VL   G    +   IKA  S    +
Subjt:  KILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTKG-VIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYS

Query:  IILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSK
        I +NG+    I  K G RQG PLSP+LF + ++ L++ I   ++   + G+ +G   +   L ADD+++++ D + +   +  +I SF    G +IN +K
Subjt:  IILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGDISITHLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSK

Query:  STVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKN--TQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSV--FKAPVSIC
        S        +Q   +I         +  I YLGV L    K+   + +    ++I+  +  W+ +  S  GR+ +++  +    +Y  +    K P    
Subjt:  STVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKN--TQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSV--FKAPVSIC

Query:  NRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEK
        N +E  + KF+W+        +L++       +  GG+ +  +K    A+++K  W ++ + +
Subjt:  NRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEK

P14381 Transposon TX1 uncharacterized 149 kDa protein3.6e-4325.42Show/hide
Query:  GFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDN--WLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLID------PPLVNGSYTWSNLRNRP
        G T+ L  +Y P    +R  F+  L         +   ++GGDFN      + +       S       IA   L+D      P  V  ++T+  +R+  
Subjt:  GFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDN--WLLGGDFNVFRYPSETSSISPAKLSMKNFNNFIADTGLID------PPLVNGSYTWSNLRNRP

Query:  V-MSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGP--CPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFS----------FMGR-
        V  SR+DR   S +   +  +  + RL+   SDH  + L  S +   P    + F+N  L++  F  +V   W      G+  F            +G+ 
Subjt:  V-MSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGP--CPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFS----------FMGR-

Query:  -LKLLARKVKDWKSSNSESFKEKKRVLITEID-RIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRN
         LKLL ++     S    +  E     + +++ R+   E      +     RK    +++Q    +AR    R +   L D D  S FF+ +   +  R 
Subjt:  -LKLLARKVKDWKSSNSESFKEKKRVLITEID-RIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRN

Query:  QIHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETEWIVTNL-DWAPINID-LISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKP
        QI  L ++ G+ +     +     + +  ++  +  +      L D  P+  +     L +P T +E+   ++ + HNK+PG DG TIEF + FW  L P
Subjt:  QIHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETEWIVTNL-DWAPINID-LISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKP

Query:  SIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKT
            V  + F+   +        ++L+PKK     I ++RP+SL ++ YKI+AK ++ RLK  L + I  +QS  V  R I D + L  +++ F R +  
Subjt:  SIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKT

Query:  KGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVV
            + LD EKAFD++   ++   L    +   +  ++K   +S    + +N      +   RG+RQG PLS  L+ LA++    L+     +  L+G+V
Subjt:  KGVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVV

Query:  M--GDISITHLLFADDILLFVQ---DDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDI--ARFWGCCSHSLPIAYLGVPLGG--IPKNT
        +   D+ +    +ADD++L  Q   D E+A E      + +  AS  RIN SKS  SG+ L      D     F      S  I YLGV L     P + 
Subjt:  M--GDISITHLLFADDILLFVQ---DDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDI--ARFWGCCSHSLPIAYLGVPLGG--IPKNT

Query:  QFWVPTIEKIQRRIHNWRFVS--LSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKS
         F +   E +  R+  W+  +  LS  GR  +I  ++ S   Y L           ++++ L  FLW G       + V   + S P  EGG G+  I+S
Subjt:  QFWVPTIEKIQRRIHNWRFVS--LSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKS

Query:  TNEALLLKWIWRF-FTEEKSLWRKFISAKY
              L+ I R+ + +    W    S+ Y
Subjt:  TNEALLLKWIWRF-FTEEKSLWRKFISAKY

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.6e-3327.48Show/hide
Query:  LLGGDFNVFRYPSETSSISPAKLSMK---NFNNFIADTGLIDPPLVNGSYTWSNLR-NRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFP--ILL
        +L GDF+     S+  S+    + M+    F N + D+ L+D P     YTWSN + + P++ +LDR + + +W   F    +       SDH P  I+L
Subjt:  LLGGDFNVFRYPSETSSISPAKLSMK---NFNNFIADTGLIDPPLVNGSYTWSNLR-NRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFP--ILL

Query:  DASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKS
        +        C FR+ ++   + +F+ ++   W + +  G   FS    LK  A K K  K  N + F   +      +D ++S++S    +   S  R  
Subjt:  DASSSTWGPCPFRFDNYFLDNNSFVSNVEQWWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKS

Query:  LKADLQQ---TALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYEANQE---TEWIVTNLDWAP
          A  +     A LE+ ++ Q+ +  WL DGD N+ FFHKV  A + +N I  L       + +   ++  ++ +++ +  ++ +    + +    D  P
Subjt:  LKADLQQ---TALLEARYWNQRCKKLWLNDGDENSAFFHKVCTARRRRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYEANQE---TEWIVTNLDWAP

Query:  I--NIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSL
           N  L S L +  +++E+   + ++  NKAPGPD FT EF  + W ++K S ++   +FFR+  + +  N T I LIPK +    +S FRP+S  T +
Subjt:  I--NIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWSILKPSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSL

Query:  YKIL
        YKI+
Subjt:  YKIL

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.5e-2328.24Show/hide
Query:  DIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHS
        DI   +   S +LP+ YLG+PL      T  + P +EKI+ RI  W    LS  GRL LI SV++S+  + +S F+ P +    ++ I   FLW G   +
Subjt:  DIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHS

Query:  GPSNLVRWEIVSSPKAEGGLGIHKIKSTNE---------ALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANF
             V W  V +PK EGGLGI  +K  N+           L  W+W+   + ++L   F+                                       
Subjt:  GPSNLVRWEIVSSPKAEGGLGIHKIKSTNE---------ALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANF

Query:  RWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFD
        + ++ NG +  FW DNWS +G L  V      +    +L    A    + V N RPRR   D
Subjt:  RWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFD

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.1e-0938.27Show/hide
Query:  LAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSK-TKG-VIIKLDIEKAFDKISWDFIDCVLLNKGYPNTW
        + ERLKP + + I   Q++F+  R  +D I+   E V   R  K  KG +++KLD+EKA+D+I WD+++  L++ G+P  W
Subjt:  LAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSK-TKG-VIIKLDIEKAFDKISWDFIDCVLLNKGYPNTW

AT4G29090.1 Ribonuclease H-like superfamily protein8.9e-1329.86Show/hide
Query:  SMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFP
        ++P Y ++ F  P ++C ++  +L  F W     +   +   W+ +S  KAEGG+G   I++ N ALL K +WR  +  +SL  K   ++Y    H S P
Subjt:  SMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFP

Query:  SSSRFSSSRS-PWFAISKLQSPFFANFRWEVRNGKSILFWHDNW
         ++   S  S  W +I   Q       R  V NG+ I+ W   W
Subjt:  SSSRFSSSRS-PWFAISKLQSPFFANFRWEVRNGKSILFWHDNW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)9.9e-1250.75Show/hide
Query:  ILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGDIS--ITHLLFADD
        I+NG P+G +   RG+RQGDPLSP+LF+L  + LS L   A+++G L G+ + + S  I HLLFADD
Subjt:  ILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGDIS--ITHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCGAAATTCTGAAAAGCCCAAATCCGGCAAAGGTAAGGAGATATATCGACCCAAATCCACTACTGGAATAAAAATATCTGAGCCCGCCCAAGAGAAAGTC
AATCTCGTTGCACCTTTTGGCCCAGTTGAGCCCGCCCACAAGAAAGTCAATCTCGTTGCTCCTTCTGGCCCAGTTTACCAAGATATTAATGGTGAAAAGTTTACT
TTAAGTGTTGATTTGGGCTCATTGTCCCCTATATCTGATGCACCCATTTCAAGTCCAGAAAATACTCCATCTCCAAAAGCCCACACTGTGATTGAACCTCCTTCA
GCAATTATTAATGAAAGTCTCAAATTTTTGGTTTCTCCGGACAAAATGGATAGTACGGGTGAAGACAGTTTGAATGGCACTCCTAGATTCAAGAACATTGAAGCA
GTCATTGATGATAATAGTCCTCGGAAGGATCCTCAAGAAGTTATAGAACATGGCAAGCCGAATGACGAAAGTTTCAAGAAGAAGCTCAACGACTGGCTGACTGAA
AACGATTTTTGTCTTGTTCCTACTAAATCTGTTTCGGGTTTATTTTATTTTGTCATTCTTACTGAAACTAAATTGACTAACGTCAGCAAGCGTATTATCAAATCG
TTATGGAGTTCTATTAGTGTCAATTGGATTGCTCTGGACGCGCTTGGTTCTTCAGGAGGTATTCTTATTATGTGGGATGAATTATGTTGCAAAATTACTGATCAT
ATTAAGGGCTGCTATTCAATGTCTGTTCACGTTTCGCTCTCCGATGGTTTTACCTGGTGGTTATCTGGTATCTATGGTCCTGCACGTAGAAGAGATCGAAAGTTT
TTTTGGAGGGAGCTCTATGATCTCTTCGGCTTATGTGGTGATAATTGGCTTCTTGGTGGTGACTTTAATGTGTTCAGGTATCCTTCTGAGACCTCATCAATCAGT
CCGGCTAAATTAAGCATGAAAAATTTTAATAATTTCATTGCGGACACTGGGCTTATTGATCCTCCTCTTGTTAATGGCTCTTATACTTGGTCGAACCTTCGAAAT
AGGCCGGTTATGTCTCGCCTTGATAGATTCTTGTTTTCTCCTAACTGGTGTCAAAAATTTCACGAGCATCACTCCAAAAGGCTCTCTCGTAATACCTCAGACCAC
TTCCCTATTCTGCTGGATGCCTCTTCTTCTACATGGGGTCCTTGCCCTTTTCGTTTTGATAATTATTTCTTGGACAATAATTCCTTCGTCAGCAATGTTGAGCAA
TGGTGGAATGATGCTGTTTGCAGTGGCTACCCAGGTTTCTCTTTTATGGGAAGACTTAAGCTTCTAGCTCGGAAAGTCAAAGATTGGAAGTCTTCCAATTCCGAA
TCTTTCAAAGAAAAGAAAAGGGTCTTAATAACAGAAATAGATCGCATTGATTCGCTGGAATCCATGGGTTATTTGGATGATATTGCTAGCTCTCTCAGAAAATCG
CTAAAAGCTGATCTCCAGCAAACTGCTCTTTTAGAAGCTCGCTATTGGAATCAGCGTTGCAAAAAGCTCTGGTTAAATGACGGGGACGAAAACTCAGCATTTTTT
CATAAAGTTTGTACTGCCCGCCGCCGAAGGAATCAAATTCATGAGCTCATTTCCAAAGGAGGGAGTAGCATTGTCTCTGATAATATGATGGAATATGAAGTGATC
AATCATTTCTCAGCTATCTATGAGGCTAATCAGGAAACTGAATGGATTGTGACTAATCTTGACTGGGCTCCCATTAACATTGATTTGATCAGTACTTTGATATCT
CCTTTTACAGAGGAAGAAGTTTTTGGCTGCATCAAGTCTATTGGTCATAATAAGGCTCCGGGTCCTGATGGCTTCACTATTGAATTTATCAAGAAATTTTGGAGC
ATTCTAAAGCCATCTATCATGTCTGTTTTCCATGATTTTTTTCGTAGTAAGACTGTCAATCGGGTTGTAAACCACACAAATATCGCGCTCATCCCCAAAAAGTCT
ATGGCTGGCCATATTTCAGATTTCCGCCCAATTAGCCTTACCACTTCTCTCTATAAAATTCTTGCCAAGGTCTTGGCGGAGCGTTTGAAGCCCACTCTAGAAGAT
ACAATCAGCTTAAATCAATCAGCCTTTGTTCGCAAAAGACAAATCTCTGATGCTATTTTGTTAGCTAACGAAATGGTGGACTTTTGGCGTGTCTCTAAGACTAAA
GGTGTTATTATAAAGCTCGATATTGAGAAAGCCTTTGACAAAATTAGTTGGGACTTCATTGATTGTGTTCTTCTCAACAAAGGTTACCCCAACACTTGGAGAGAA
TGGATTAAAGCATGTATCTCCTCAGTCTCTTATTCTATTATTCTGAATGGTAAACCTCGAGGCAACATTCAAGCTAAGAGAGGTATCAGACAAGGTGACCCTTTA
TCTCCCTTTCTTTTTGTTCTAGCCATGGATTATCTTAGTAAATTAATTGAGGCTGCTGAAAAGAAAGGTCTTTTATCAGGGGTAGTCATGGGAGATATCTCTATC
ACTCATCTCCTTTTTGCTGATGACATTTTACTTTTTGTCCAAGATGATGAGAAGGCCATTGAAAGCATGTTCTACATTATTAAATCTTTTGAAAATGCCTCTGGT
CTTCGGATTAATCTTTCCAAATCTACTGTTTCCGGTATAAACCTGACGGAACAAAGGACTACTGATATTGCTCGCTTTTGGGGTTGTTGCTCTCATTCTCTGCCA
ATTGCTTATCTTGGTGTCCCTTTAGGCGGCATTCCAAAAAATACTCAGTTTTGGGTGCCCACGATTGAGAAGATTCAGAGACGAATTCACAATTGGCGGTTTGTT
TCTCTTTCTAAGGGAGGTCGTCTTACTCTTATTCAATCGGTTCTTAACAGTATGCCTCTATATGTTCTCTCTGTGTTCAAAGCGCCGGTTTCTATATGCAACAGA
GTCGAACAAATCCTTCATAAATTTCTTTGGGATGGAAATTCTCATTCAGGGCCCTCAAATTTAGTGAGATGGGAAATCGTATCATCCCCAAAGGCAGAAGGCGGT
TTGGGCATTCACAAAATCAAAAGCACGAATGAAGCTCTCCTCCTTAAATGGATATGGCGTTTTTTCACCGAGGAAAAATCTCTTTGGAGGAAATTCATAAGTGCC
AAATATTCCAGCGATCATCACAATAGTTTTCCCTCTAGTAGCAGATTCTCTAGCTCCAGATCTCCGTGGTTTGCTATTTCAAAGCTTCAGTCTCCTTTCTTCGCA
AATTTCAGATGGGAGGTGCGCAACGGTAAATCCATTCTCTTTTGGCATGATAACTGGTCTGTTCTTGGTCCTTTGAAATATGTTAATGATCGCCTCTATCAGTTA
TCTTCAAACAAAAGTCTCACAGTTGAGGAAGCTTGGTTGAATTTGGATAGAGTATGGAATTTTCGTCCTCGTCGGCCTCTTTTTGATAGAGAGGTTCAAAGTTGG
AATGAGATGACTAGGCTTTTACCCATTCCAGATTCTTTTCGTGGTTCTGATGTTCATCGTTGGCTAGCTTCCGAAGACGGTTCCTTCTCCACAAAAGTTGCTCGA
TCCGTCCTTTTGGTTGCTCCTCCTAGACCTTTTTATAGCCCTGGAGAAACAATTCTCAACAACCTTTGGAAAGCTGATATCCCTAAAAAAATAAAGAATGAGTTC
CGAAAGTATCGATCATCTATTCATTCATTGTTGCTGTGTGTCCTTTCTTCGAAACAAGAATTAATGGCTTCTTCTCCTGAATTCGATTTTCCTTATCACTCTGAT
AGTGATGAAGGGCATTCCCGTTCCCCTTCCCCAATTAGTTCCGAAGAATATTTTGAATATGGTATCCGTGGAACCAATAAGGAAGAGGAAAAAGAATATTCTCGT
GCTAAGCATGAATCTCAGGGTTTTGATGTACCTATATTTCCTGGCACGGATGCTTTTGGTTTTATTACGCCCGTGAATGATTATACTAGCTCAGAGCTCCTAGCA
TGCACAGATGAAGCTATTAAACATTACAACAGGGAAAATGGTACAAATTTCGAAGTTGTGGAAATTGTAAAGGCAAATCTTTCATGGGGTTCTAAGTATTTTATA
ACCTTTGAGGCCAAACATGTTGGAAGTACCTCAGGTTACCCTACCACAACTTTTCAAGCAGAAGTGGTTTCTAGAATTCCTGACATAAAGAGTCCTAGTCGTACA
GTATCTTCCTTTGCCGTTTCTCTCTTTCGATCGTCTGAGCTTCTAATGGCGGCGACGGGCATCTGCGTGTGTGACGGTGGTGGGATCTCACGCATACTAACCCAA
AATATCTGTTATTTCCATTGTGGCATTTATGGATGCATGCTTCTGAAAGAGGCTTATATGGGATTAGATGACTTTGCCGAAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCGAAATTCTGAAAAGCCCAAATCCGGCAAAGGTAAGGAGATATATCGACCCAAATCCACTACTGGAATAAAAATATCTGAGCCCGCCCAAGAGAAAGTC
AATCTCGTTGCACCTTTTGGCCCAGTTGAGCCCGCCCACAAGAAAGTCAATCTCGTTGCTCCTTCTGGCCCAGTTTACCAAGATATTAATGGTGAAAAGTTTACT
TTAAGTGTTGATTTGGGCTCATTGTCCCCTATATCTGATGCACCCATTTCAAGTCCAGAAAATACTCCATCTCCAAAAGCCCACACTGTGATTGAACCTCCTTCA
GCAATTATTAATGAAAGTCTCAAATTTTTGGTTTCTCCGGACAAAATGGATAGTACGGGTGAAGACAGTTTGAATGGCACTCCTAGATTCAAGAACATTGAAGCA
GTCATTGATGATAATAGTCCTCGGAAGGATCCTCAAGAAGTTATAGAACATGGCAAGCCGAATGACGAAAGTTTCAAGAAGAAGCTCAACGACTGGCTGACTGAA
AACGATTTTTGTCTTGTTCCTACTAAATCTGTTTCGGGTTTATTTTATTTTGTCATTCTTACTGAAACTAAATTGACTAACGTCAGCAAGCGTATTATCAAATCG
TTATGGAGTTCTATTAGTGTCAATTGGATTGCTCTGGACGCGCTTGGTTCTTCAGGAGGTATTCTTATTATGTGGGATGAATTATGTTGCAAAATTACTGATCAT
ATTAAGGGCTGCTATTCAATGTCTGTTCACGTTTCGCTCTCCGATGGTTTTACCTGGTGGTTATCTGGTATCTATGGTCCTGCACGTAGAAGAGATCGAAAGTTT
TTTTGGAGGGAGCTCTATGATCTCTTCGGCTTATGTGGTGATAATTGGCTTCTTGGTGGTGACTTTAATGTGTTCAGGTATCCTTCTGAGACCTCATCAATCAGT
CCGGCTAAATTAAGCATGAAAAATTTTAATAATTTCATTGCGGACACTGGGCTTATTGATCCTCCTCTTGTTAATGGCTCTTATACTTGGTCGAACCTTCGAAAT
AGGCCGGTTATGTCTCGCCTTGATAGATTCTTGTTTTCTCCTAACTGGTGTCAAAAATTTCACGAGCATCACTCCAAAAGGCTCTCTCGTAATACCTCAGACCAC
TTCCCTATTCTGCTGGATGCCTCTTCTTCTACATGGGGTCCTTGCCCTTTTCGTTTTGATAATTATTTCTTGGACAATAATTCCTTCGTCAGCAATGTTGAGCAA
TGGTGGAATGATGCTGTTTGCAGTGGCTACCCAGGTTTCTCTTTTATGGGAAGACTTAAGCTTCTAGCTCGGAAAGTCAAAGATTGGAAGTCTTCCAATTCCGAA
TCTTTCAAAGAAAAGAAAAGGGTCTTAATAACAGAAATAGATCGCATTGATTCGCTGGAATCCATGGGTTATTTGGATGATATTGCTAGCTCTCTCAGAAAATCG
CTAAAAGCTGATCTCCAGCAAACTGCTCTTTTAGAAGCTCGCTATTGGAATCAGCGTTGCAAAAAGCTCTGGTTAAATGACGGGGACGAAAACTCAGCATTTTTT
CATAAAGTTTGTACTGCCCGCCGCCGAAGGAATCAAATTCATGAGCTCATTTCCAAAGGAGGGAGTAGCATTGTCTCTGATAATATGATGGAATATGAAGTGATC
AATCATTTCTCAGCTATCTATGAGGCTAATCAGGAAACTGAATGGATTGTGACTAATCTTGACTGGGCTCCCATTAACATTGATTTGATCAGTACTTTGATATCT
CCTTTTACAGAGGAAGAAGTTTTTGGCTGCATCAAGTCTATTGGTCATAATAAGGCTCCGGGTCCTGATGGCTTCACTATTGAATTTATCAAGAAATTTTGGAGC
ATTCTAAAGCCATCTATCATGTCTGTTTTCCATGATTTTTTTCGTAGTAAGACTGTCAATCGGGTTGTAAACCACACAAATATCGCGCTCATCCCCAAAAAGTCT
ATGGCTGGCCATATTTCAGATTTCCGCCCAATTAGCCTTACCACTTCTCTCTATAAAATTCTTGCCAAGGTCTTGGCGGAGCGTTTGAAGCCCACTCTAGAAGAT
ACAATCAGCTTAAATCAATCAGCCTTTGTTCGCAAAAGACAAATCTCTGATGCTATTTTGTTAGCTAACGAAATGGTGGACTTTTGGCGTGTCTCTAAGACTAAA
GGTGTTATTATAAAGCTCGATATTGAGAAAGCCTTTGACAAAATTAGTTGGGACTTCATTGATTGTGTTCTTCTCAACAAAGGTTACCCCAACACTTGGAGAGAA
TGGATTAAAGCATGTATCTCCTCAGTCTCTTATTCTATTATTCTGAATGGTAAACCTCGAGGCAACATTCAAGCTAAGAGAGGTATCAGACAAGGTGACCCTTTA
TCTCCCTTTCTTTTTGTTCTAGCCATGGATTATCTTAGTAAATTAATTGAGGCTGCTGAAAAGAAAGGTCTTTTATCAGGGGTAGTCATGGGAGATATCTCTATC
ACTCATCTCCTTTTTGCTGATGACATTTTACTTTTTGTCCAAGATGATGAGAAGGCCATTGAAAGCATGTTCTACATTATTAAATCTTTTGAAAATGCCTCTGGT
CTTCGGATTAATCTTTCCAAATCTACTGTTTCCGGTATAAACCTGACGGAACAAAGGACTACTGATATTGCTCGCTTTTGGGGTTGTTGCTCTCATTCTCTGCCA
ATTGCTTATCTTGGTGTCCCTTTAGGCGGCATTCCAAAAAATACTCAGTTTTGGGTGCCCACGATTGAGAAGATTCAGAGACGAATTCACAATTGGCGGTTTGTT
TCTCTTTCTAAGGGAGGTCGTCTTACTCTTATTCAATCGGTTCTTAACAGTATGCCTCTATATGTTCTCTCTGTGTTCAAAGCGCCGGTTTCTATATGCAACAGA
GTCGAACAAATCCTTCATAAATTTCTTTGGGATGGAAATTCTCATTCAGGGCCCTCAAATTTAGTGAGATGGGAAATCGTATCATCCCCAAAGGCAGAAGGCGGT
TTGGGCATTCACAAAATCAAAAGCACGAATGAAGCTCTCCTCCTTAAATGGATATGGCGTTTTTTCACCGAGGAAAAATCTCTTTGGAGGAAATTCATAAGTGCC
AAATATTCCAGCGATCATCACAATAGTTTTCCCTCTAGTAGCAGATTCTCTAGCTCCAGATCTCCGTGGTTTGCTATTTCAAAGCTTCAGTCTCCTTTCTTCGCA
AATTTCAGATGGGAGGTGCGCAACGGTAAATCCATTCTCTTTTGGCATGATAACTGGTCTGTTCTTGGTCCTTTGAAATATGTTAATGATCGCCTCTATCAGTTA
TCTTCAAACAAAAGTCTCACAGTTGAGGAAGCTTGGTTGAATTTGGATAGAGTATGGAATTTTCGTCCTCGTCGGCCTCTTTTTGATAGAGAGGTTCAAAGTTGG
AATGAGATGACTAGGCTTTTACCCATTCCAGATTCTTTTCGTGGTTCTGATGTTCATCGTTGGCTAGCTTCCGAAGACGGTTCCTTCTCCACAAAAGTTGCTCGA
TCCGTCCTTTTGGTTGCTCCTCCTAGACCTTTTTATAGCCCTGGAGAAACAATTCTCAACAACCTTTGGAAAGCTGATATCCCTAAAAAAATAAAGAATGAGTTC
CGAAAGTATCGATCATCTATTCATTCATTGTTGCTGTGTGTCCTTTCTTCGAAACAAGAATTAATGGCTTCTTCTCCTGAATTCGATTTTCCTTATCACTCTGAT
AGTGATGAAGGGCATTCCCGTTCCCCTTCCCCAATTAGTTCCGAAGAATATTTTGAATATGGTATCCGTGGAACCAATAAGGAAGAGGAAAAAGAATATTCTCGT
GCTAAGCATGAATCTCAGGGTTTTGATGTACCTATATTTCCTGGCACGGATGCTTTTGGTTTTATTACGCCCGTGAATGATTATACTAGCTCAGAGCTCCTAGCA
TGCACAGATGAAGCTATTAAACATTACAACAGGGAAAATGGTACAAATTTCGAAGTTGTGGAAATTGTAAAGGCAAATCTTTCATGGGGTTCTAAGTATTTTATA
ACCTTTGAGGCCAAACATGTTGGAAGTACCTCAGGTTACCCTACCACAACTTTTCAAGCAGAAGTGGTTTCTAGAATTCCTGACATAAAGAGTCCTAGTCGTACA
GTATCTTCCTTTGCCGTTTCTCTCTTTCGATCGTCTGAGCTTCTAATGGCGGCGACGGGCATCTGCGTGTGTGACGGTGGTGGGATCTCACGCATACTAACCCAA
AATATCTGTTATTTCCATTGTGGCATTTATGGATGCATGCTTCTGAAAGAGGCTTATATGGGATTAGATGACTTTGCCGAAGTTTAA
Protein sequenceShow/hide protein sequence
MGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQEKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEKFTLSVDLGSLSPISDAPISSPENTPSPKAHTVIEPPS
AIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKPNDESFKKKLNDWLTENDFCLVPTKSVSGLFYFVILTETKLTNVSKRIIKS
LWSSISVNWIALDALGSSGGILIMWDELCCKITDHIKGCYSMSVHVSLSDGFTWWLSGIYGPARRRDRKFFWRELYDLFGLCGDNWLLGGDFNVFRYPSETSSIS
PAKLSMKNFNNFIADTGLIDPPLVNGSYTWSNLRNRPVMSRLDRFLFSPNWCQKFHEHHSKRLSRNTSDHFPILLDASSSTWGPCPFRFDNYFLDNNSFVSNVEQ
WWNDAVCSGYPGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWLNDGDENSAFF
HKVCTARRRRNQIHELISKGGSSIVSDNMMEYEVINHFSAIYEANQETEWIVTNLDWAPINIDLISTLISPFTEEEVFGCIKSIGHNKAPGPDGFTIEFIKKFWS
ILKPSIMSVFHDFFRSKTVNRVVNHTNIALIPKKSMAGHISDFRPISLTTSLYKILAKVLAERLKPTLEDTISLNQSAFVRKRQISDAILLANEMVDFWRVSKTK
GVIIKLDIEKAFDKISWDFIDCVLLNKGYPNTWREWIKACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSKLIEAAEKKGLLSGVVMGDISI
THLLFADDILLFVQDDEKAIESMFYIIKSFENASGLRINLSKSTVSGINLTEQRTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFV
SLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISA
KYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSW
NEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIKNEFRKYRSSIHSLLLCVLSSKQELMASSPEFDFPYHSD
SDEGHSRSPSPISSEEYFEYGIRGTNKEEEKEYSRAKHESQGFDVPIFPGTDAFGFITPVNDYTSSELLACTDEAIKHYNRENGTNFEVVEIVKANLSWGSKYFI
TFEAKHVGSTSGYPTTTFQAEVVSRIPDIKSPSRTVSSFAVSLFRSSELLMAATGICVCDGGGISRILTQNICYFHCGIYGCMLLKEAYMGLDDFAEV