; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0009309 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0009309
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr06:3277034..3284144
RNA-Seq ExpressionPay0009309
SyntenyPay0009309
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0082.53Show/hide
Query:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD
        MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRD EHCIWIRKTRNGKGC AEIFRVD
Subjt:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD

Query:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP
        +KNRKSCILVPEG EKS WVSFLSMITPKVE KAK RP FLPRSSPE   S PIDYHKRSYAKAV+EGR SISSDSS+SYA   SSDSS SSGNSP + P
Subjt:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP

Query:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI
         PV LENTVVLVRRFFHDDW KILQNLRKQ EESFTYNAFHAEK LVHFNSNVPANLLCQN+GWTTVGKYTVRFEKW PASHASPKLIPSYGGWTTFRGI
Subjt:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI

Query:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD
        PL+L NM TFQQIGKACGGLIKVAEETKTA NLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQ +THSEGKWLMERNVRLHGTFKRQAAASFD+FNPD
Subjt:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD

Query:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF
        SEQFLFDG+EAISPDL NT SGSRKSISPEQPS LKSVIIKPA+ ATSP +LNEEVVNDNSLHATA KSK KI   ISND SLDKGKQKVDIPSQLT AF
Subjt:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF

Query:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL
        I  KPKRKVSFNSP NKT FFNP+SAPANH     SPEKK+RVS+ERSVKKKS+   PK RANQG+    TQPL+++AHD++ASKKGLSLTVDLGNLPVL
Subjt:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL

Query:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP
        DPSKSFEDHHSSDNAEVIDITNTE+VPETPELKMTDPEK  S PEVN+RKQKHSHRRRHYYRKKED EKD+NSE FKNQLV WLKENG       +  S 
Subjt:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP

Query:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL
        D                  S   + S   +      S+GGILILWD  HHSLLSQEEG FSL+ANF S NNSWWLTGLYGPV+RRERLN W DLHNL HL
Subjt:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL

Query:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE
        NS PWI+GGDLN +RMREESTAVT S+HSSNMLN+FIS N LIDPPL+NNRYTWSNLR PPTFSRLDRFLYN  WE+LFNPHITRTLPRPTSDHFPLVCE
Subjt:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE

Query:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----
        DS+ T+ WGPAPFRLNSI LNDPEFKRNMERWWELS Q GHPGF FIQRLKSLAN IKPWQKEKF S++SAKENII+EVD+IDK ELDT L  E++    
Subjt:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----

Query:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS
                             KK+WLKEGDEN AFFHRICSSRQKRN+IHEIQDE+GS QNTN +ISLAFVN+F+++YR STK +PLFI+NL+W PIDYS
Subjt:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS

Query:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK
        +WS LCAPF EEEIKGVI SF+GNKAPGPDGFPISFFKSYW LLKEDIL IFKDF+EKGVINKNMNNT+IALI KKK+YSHPKDFRPISLTTSIYK IAK
Subjt:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK

Query:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK
        TLSN+LKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNL+W+FID VLKK NYP SWR+WIRGCISNVTYSIIVNGK
Subjt:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK

Query:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP
        PQGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHLESTGAIKGVCL  DCNISHILFADDILLFVEDN  +LNNLRMAISLFEKASGLKINLSKSA+VP
Subjt:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP

Query:  VNVPWPRALDCASS
        VNV W RAL+CASS
Subjt:  VNVPWPRALDCASS

KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]0.0e+0064.82Show/hide
Query:  MITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFPSPVSLENTVVLVRRFFHDDWYKIL
        MITPKVE K K RP+FLPRSSPE   S PIDYHKRSYAK VTEGRP  +SDSS+SY    SSDSSHSSGNS  + PSP  LENTVVLVRRFFHDDW KIL
Subjt:  MITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFPSPVSLENTVVLVRRFFHDDWYKIL

Query:  QNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGIPLNLCNMKTFQQIGKACGGLIKVA
        QNLRKQ EESFTYNAFHAEKALVHFNSN+P NLLCQN+GWTTVGKY+VRFEKW+PA HA+PKLIPSYGGWTTF+                          
Subjt:  QNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGIPLNLCNMKTFQQIGKACGGLIKVA

Query:  EETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPDSEQ----FLFDGMEAISPDLQNTF
           + ++ L+E                                                          +D+F+ + E      LFDG EAISPD  +T 
Subjt:  EETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPDSEQ----FLFDGMEAISPDLQNTF

Query:  SGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAFILDKPKRKVSFNSPGNKTNF
        S SRKS +P+QPS LKSVIIKP + ATSP  LNEEVVND++LHATA KS+ +I   I ND  LDKGKQKVDI      A  L+KPKRKVSFNSP NKTN 
Subjt:  SGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAFILDKPKRKVSFNSPGNKTNF

Query:  FNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAEVIDI
        FNP+SAPANHS S+SSPEKKQ+VS+ERS+KKKS+   P     Q +    TQP++++AHD+ ASKKGLSL V+LG+LPVLDPSKSFEDHHSS NAEVIDI
Subjt:  FNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAEVIDI

Query:  TNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDS-------NSEVF-KNQLVAW------LKENGALIKNTLISYSPDFVILT
        TNTE+VPETPE+KM   E   S  E N+RK KH HRRR+YYRKK    + S       + +++ K +L+ W           ALIKN +ISYSPDFVILT
Subjt:  TNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDS-------NSEVF-KNQLVAW------LKENGALIKNTLISYSPDFVILT

Query:  ETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTAN-FLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHLNSFPW
        ET LK  NK+I+KS WPSNSI WIVKNA  SSGGILILWD   HSLLSQEE +FSL+AN FL++N+SWWLTGLYGP +RR+R++FW DLHNL HLNSFPW
Subjt:  ETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTAN-FLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHLNSFPW

Query:  ILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCEDSSPT
         L  DLN IRMREE+T++ SS+HSS MLNNFIS N LIDPPL+NNR+TWSNLR P TFSR+DRFLYN +WE LF+PH TRTLPRPTSDHFPLVCEDS+P 
Subjt:  ILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCEDSSPT

Query:  VSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDTKKIWLKEGD
        + WGPAPFRLNSI LNDPEFKRNMERWWE S Q GHPGF+FIQRLKSLAN IKPWQKEK HS++ AKE II+EVD+IDKKELDT L Q+++         
Subjt:  VSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDTKKIWLKEGD

Query:  ENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPD
                       R +  + +  D S + +   I           Y+SSTK++PLFI+NL W PI++SEW  LCAPFLEEEIKGVINSF+G KAP PD
Subjt:  ENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPD

Query:  GFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDA
        GFPISFFKSYW LLKEDI+ IFKDF+EKGVINKNMNNT+IALI KKK+YSHPKDFRPISLTTSIYKIIAKTLSN+LK TLP TISGNQLAFIKNRQITDA
Subjt:  GFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDA

Query:  ILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLS
        ILMANEA+DYWKVKKIKGFILKLDIEK F NL+WDFID+VL KKN+P SWR+WIRGCISNVTYS+I+NG+PQGRIKANRGLRQGDPLSPFLFVIAMDY S
Subjt:  ILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLS

Query:  RLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVP
        RLLSHLE++GAIKGV L  +CNISHILFADDILLFVEDN  +LNNL MA+SLFEKASGLKINL KSA+VPVNV   RA +CAS W I C SL LSYLGVP
Subjt:  RLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVP

Query:  LGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLG
        LGG+  S                                                                        KGSHLI W+ V K KEEGGLG
Subjt:  LGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLG

Query:  ISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRL
        ISRLQ+TN+ALL KWLWRY SEPN+LWR+LIQ KY+ KHPGD+PSN SSSSSKAPWRSII+N +WFK NQ WDL NGDQISFW+SNWS EGCLSTAYPRL
Subjt:  ISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRL

Query:  FALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKRCISHQPELSVASPLSKLLDLIWKSFI
        FAL++DKE S+KD WN+I+NQW I FRR LNDRE   W++IL  LP PR NRGSSKPTWIPD  KSFSIASAK  IS Q + +      KLL++IWKS I
Subjt:  FALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKRCISHQPELSVASPLSKLLDLIWKSFI

Query:  PMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNFAHFSDDFEALISFFLSLNQSLPKHK
        PMKIKFFMWCLIQR+++T EVIQQRM N  LQPNWCVLC KD+ESG HLFL+CD VKPLWSLL ++LNFA  +DDFEAL SFFLSL  SLPKHK
Subjt:  PMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNFAHFSDDFEALISFFLSLNQSLPKHK

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0072Show/hide
Query:  YFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVDNK
        +FKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSPRDLDWIR TLKSLI TP++NRFFLE RDSE  IWIRKTRN KGC AEIFRVD K
Subjt:  YFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVDNK

Query:  NRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFPSP
        NRKSCILVPEGP+KSGWVSFLSMITPKVE KAK RP+FLPR+SP+   S PIDYHKRSYAKAVTEGRP  +SDSS+SY   DSSDSSHSS NS  + PS 
Subjt:  NRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFPSP

Query:  VSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGIPL
          LENTVV+VRRFFHDDW+KILQNLRKQ EESFTYNAFHAEKALVHF+SN+PANLLCQN+GW+TVGKY+VRFEKW+P  HA+PKLIPSYGGWTTFRGIPL
Subjt:  VSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGIPL

Query:  NLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPDSE
        +L NM TFQQIGKAC GLIKVAEET++A NLIEA++K+RYNYSGFLPA V+IFD EGNKF VQ +TH EGKWL+ERNVRLHGTFKRQAAASFD+FNP+SE
Subjt:  NLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPDSE

Query:  QFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAFIL
        QF F+G EAISPD  +T S  RKS +P+QPS LKSVIIKP R+AT P  LNEE+VND++LHATA KSK +I   ISND  LDKGKQKVDI  Q   A  L
Subjt:  QFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAFIL

Query:  DKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVLDP
        DK KRKVSFNSP NKTN FNP+SAPANHSPS++SPEKKQ+VS+ERS+KKKS+ T P S+ANQ +    TQP++I+AHD +A+KKGLSLTVDLG+LP LDP
Subjt:  DKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVLDP

Query:  SKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLIS---YS
        +KS EDHH+SDNAEV+DITNTE+VPETPE+KM   E   S  E N+RK KH H+R++YYRKKE+ EKD +SE FK QLV+WLK+NG  +     S    +
Subjt:  SKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLIS---YS

Query:  PDFVILTE--TRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNN-SWWLTGLYGPVQRRERLNFWTDLHN
           V+L +  + LK  NK+I+KSLWPSNSI WI KNA  SSGGILILWD  +HSLLSQEEG+FSL+ANFL +NN SWWLTGLYGPV+RRER++FW +LHN
Subjt:  PDFVILTE--TRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNN-SWWLTGLYGPVQRRERLNFWTDLHN

Query:  LLHLNSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFP
        L HLNSFPWILGGDLN IRMREEST+V SS+H+S MLNNFIS N LIDPPL+NNR+TWSNLR PPTFSR+DRFLYN +WE LF+PH TRTLPR TSDHFP
Subjt:  LLHLNSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFP

Query:  LVCEDSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT
        LVCEDS+P +SWGP PFRLNSI L+DPEFKRNM RWWE S Q G+PGF+FIQRLKSLANFIKPWQKEK HS++ AKE II+EVD+IDKKELDT L QE++
Subjt:  LVCEDSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT

Query:  -------------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKP
                                 KK+WL+EGDEN +FFHRICSSRQKR+ IHEIQDE+GS QNTN SIS AF+ +F+++YRSSTK++PLFI+NL W P
Subjt:  -------------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKP

Query:  IDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYK
        I  SEWS LCAPFLE EIKGVINSF+G K PGPDGFPISFFKS+W                                                       
Subjt:  IDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYK

Query:  IIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSII
                 LK TLP+TISGNQLAF+KNRQITDAILMANEA+DYWKVKKIKGFILKLDIEKAFDNL+ DFID VL+KKN+P  WR+WIRGCISNVTYS+I
Subjt:  IIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSII

Query:  VNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKS
        +NG+PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLES+GAIKGV L  +CNISHILFADDILLF+EDN  +L NLRMA+SLFE+ASGLKINL KS
Subjt:  VNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKS

Query:  AMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEK
        A+VPVNV   RA +CAS W I C SLPLSYLGVPLGGNPKS  FWRN+ED+I KKL+NWKYA ISKGGRLTLIKSTL+S+PIYQLSVFQAP  T KNIEK
Subjt:  AMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEK

Query:  LWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWF
        LWR+FLWKG+   +GSHLI W+ V+K KEEGGLGISRL +TN+ALL KWLWRY SEPN+LWR+LIQ KY+ K PGD+PSNISSS+SKAPWRSII++++WF
Subjt:  LWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWF

Query:  KRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKS
        K NQ WDL NGDQISFW+SNWS EG LSTAYPRLFAL++DKE S+KD WN+ +NQW I FRR LNDRE   W++IL  LP PR+NRGSSKPTWIPDS  S
Subjt:  KRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKS

Query:  FSIASAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPN
        FSIASAK  IS Q + +   P +KLL++IWKS IPMKIKFFMWCLIQR++NT EVIQQ+MPN  LQPN
Subjt:  FSIASAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPN

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0079.21Show/hide
Query:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD
        MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRD EHCIWIRKTRNGKGC AEIFRVD
Subjt:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD

Query:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP
        +KNRKSCILVPEG EKS WVSFLSMITPKVE KAK RP FLPRSSPE   S PIDYHKRSYAKAV+EGR SISSDSS+SYA   SSDSS SSGNSP + P
Subjt:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP

Query:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI
         PV LENTVVLVRRFFHDDW KILQNLRKQ EESFTYNAFHAEK LVHFNSNVPANLLCQN+GWTTVGKYTVRFEKW PASHASPKLIPSYGGWTTFRGI
Subjt:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI

Query:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD
        PL+L NM TFQQIGKACGGLIKVAEETKTA NLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQ +THSEGKWLMERNVRLHGTFKRQAAASFD+FNPD
Subjt:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD

Query:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF
        SEQFLFDG+EAISPDL NT SGSRKSISPEQPS LKSVIIKPA+ ATSP +LNEEVVNDNSLHATA KSK KI   ISND SLDKGKQKVDIPSQLT AF
Subjt:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF

Query:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL
        I  KPKRKVSFNSP NKT FFNP+SAPANH     SPEKK+RVS+ERSVKKKS+   PK RANQG+    TQPL+++AHD++ASKKGLSLTVDLGNLPVL
Subjt:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL

Query:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP
        DPSKSFEDHHSSDNAEVIDITNTE+VPETPELKMTDPEK  S PEVN+RKQKHSHRRRHYYRKKED EKD+NSE FKNQLV WLKENG       +  S 
Subjt:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP

Query:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL
        D                  S   + S   +      S+GGILILWD  HHSLLSQEEG FSL+ANF S NNSWWLTGLYGPV+RRERLN W DLHNL HL
Subjt:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL

Query:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE
        NS PWI+GGDLN +RMREESTAVT S+HSSNMLN+FIS N LIDPPL+NNRYTWSNLR PPTFSRLDRFLYN  WE+LFNPHITRTLPRPTSDHFPLVCE
Subjt:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE

Query:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----
        DS+ T+ WGPAPFRLNSI LNDPEFKRNMERWWELS Q GHPGF FIQRLKSLAN IKPWQKEKF S++SAKENII+EVD+IDK ELDT L  E++    
Subjt:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----

Query:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS
                             KK+WLKEGDEN AFFHRICSSRQKRN+IHEIQDE+GS QNTN +ISLAFVN+F+++YR STK +PLFI+NL+W PIDYS
Subjt:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS

Query:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK
        +WS LCAPF EEEIKGVI SF+GNKAPGPDGFPISFFKSYW LLKEDIL IFKDF+EKGVINKNMNNT+IALI KKK+YSHPKDFRPISLTTSIYK IAK
Subjt:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK

Query:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK
        TLSN+LKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNL+W+FID VLKK NYP SWR+WIRGCISNVTYSIIVNGK
Subjt:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK

Query:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP
        PQGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHLESTGAIKG                                                        
Subjt:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP

Query:  VNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRR
                        I C +LPL+YLGVPLGGNPKS  FWRNIEDRI KKLSNWKYAHISKGGRLTLIKSTL+S+PIY+LSVFQAP STYKNIEKLWR 
Subjt:  VNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRR

Query:  FLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQ
        FLWKGSC  KGSHLI WSIVTKPKEEGGLGISRLQ+TNQALL KWLWRY+SEPNSLWR+LI +KY+ KHPGDLPSNISSSSSKAPWRSIINN +WFK NQ
Subjt:  FLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQ

Query:  GWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIA
        GWDL NGDQISFW+SNWS EGCLSTAYPRLFALS+DKESSIKDVWNS NNQWEI FRR LNDRELSTWQ+IL NLP+ RTNRG SKPTWIPDSKK FSIA
Subjt:  GWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIA

Query:  SAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNFA
        SAK CISHQP+ SVA+P  KLL+LIWK+ +PMKIKFFMWCL+QRKLNT EV         LQPNWCVLCKK SE+GAHLFL CD+VKPLWSLL ++LNFA
Subjt:  SAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNFA

Query:  HFSDDFEALISFFLSLNQSLPKHKIVNCGVIAVLWCIWSERNNRTFDNLSYQKTIINLWEDCKILIGNWSSRDPTFKNYSASTIALNLNA
          SDDFEA+ SFFLSLNQSLPKHK+V CG+IA+LW IW+ERNNR FD LSYQK+I NLWEDCKILIGNW SRDPTFKNYSA+TIALNLNA
Subjt:  HFSDDFEALISFFLSLNQSLPKHKIVNCGVIAVLWCIWSERNNRTFDNLSYQKTIINLWEDCKILIGNWSSRDPTFKNYSASTIALNLNA

TYK05808.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0069.8Show/hide
Query:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD
        MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRD EHCIWIRKTRNGKGC AEIFRVD
Subjt:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD

Query:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP
        +KNRKSCILVPEGPEKSG VSFLSMITPKVE KAK RP+FLPRSSPE   S PIDYHKRSY KAV++GR SISSDSS+SY    SSDSS SSGNSP + P
Subjt:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP

Query:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI
         PV LENTVVL                                 AL+HFNSNVPANLLCQN+GWTTV KY VR                           
Subjt:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI

Query:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD
                                                                                                            
Subjt:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD

Query:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF
          + LFDG+EAISPDL NT SGSRKS S EQPS LKSVIIKPARDATSP +LNEEVVNDNSLHAT IKS+ KI   ISND SLDKGKQKVDIPSQLT AF
Subjt:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF

Query:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL
        I DKPKRKVSFNSP NKT FFN +SAP NHSP +SSPEKKQRVS+ERSVKKKS+   PKSRANQG+    TQPL+++AHD++ASKKGLSLTVDLGNLPVL
Subjt:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL

Query:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP
        DPSKSFEDHHSSDNAEVIDITNTE+VPETPELKMTDPEK  S PEVN+RKQKHSHRRRHYYRKKED EKD+NSE FKNQLV WLKENG  +     S   
Subjt:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP

Query:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL
             T T   F            +SI WIVKNAIDSSGGILILWD  HHSLL                                               
Subjt:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL

Query:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE
                GDLN +RMREESTAVTSS+HSSNMLNNFIS N LIDPPL+NNRYTWSNLR PPTFSRLDRFLYN  WE LFNPHITRTL RPTSDHFPLVCE
Subjt:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE

Query:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----
        DS+ T+ WGPAPFRLNSI LNDP+FKRNMERWWELS Q GHPGF+FI+RLKSLAN IKPWQKEKFHS++SAKENII+EVD+IDK ELDT L QE++    
Subjt:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----

Query:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS
                             KK+WLKEGDEN AFFHRICSSRQKRN+IHEIQDE+GS QNTN +ISLAFVN+F+ +YR STK +PLFI+NL+W PIDYS
Subjt:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS

Query:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK
        +WS LCAPFLEEEIKGVI SF+GNKAPGPDGFPISFFKSYW LLKEDIL IFKDF+EKG                                     IIAK
Subjt:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK

Query:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK
        TLSN+LKLTLPDTISGNQLAFIKNRQITDAIL ANEALDYWKVKKIK FILKLDIEKAFDNL+WDFIDFVLKKKNYP SWR+WIRGCISNVTYSIIVN K
Subjt:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK

Query:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP
        PQ RIKANRGLRQGDPLSPFLFV AMDYLSRLLSHLES+GAIKGVCL  DCNISHILFADDILLFVEDN  +LNNLRMA+SLFEKASGLKINLSKSAMVP
Subjt:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP

Query:  VNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRR
        VNV W RAL+CASSW I C +LPL+YLGVPLGGNPKS  FWRNIEDRI KKL+NWKYAHISKGGRLTLIKSTL+S+ IYQLSVFQAP STYKNIEKLWR 
Subjt:  VNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRR

Query:  FLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQ
        FLWKGS   KGSHLI WSIVTK KEEGGLGISRLQ+ NQALL KWLWRY+SEPNSLWR+LI +KY+ KHPGD+PSNISSSSSKAPW+SIINN +WFK NQ
Subjt:  FLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQ

Query:  GWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIA
        GWDL N DQISFW+SNWS EGCLSTAYPRLFALSIDK+SSIKDVWNS NNQWEI FRR LNDRELSTWQ IL NL +PRTNRG SKPTWIPDSKK FSIA
Subjt:  GWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIA

Query:  SAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLI
        SAK CISHQP+ SVA+P  KLLDLIWK+ +PMKIKFFMWCL+
Subjt:  SAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLI

TrEMBL top hitse value%identityAlignment
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein0.0e+0082.53Show/hide
Query:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD
        MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRD EHCIWIRKTRNGKGC AEIFRVD
Subjt:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD

Query:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP
        +KNRKSCILVPEG EKS WVSFLSMITPKVE KAK RP FLPRSSPE   S PIDYHKRSYAKAV+EGR SISSDSS+SYA   SSDSS SSGNSP + P
Subjt:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP

Query:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI
         PV LENTVVLVRRFFHDDW KILQNLRKQ EESFTYNAFHAEK LVHFNSNVPANLLCQN+GWTTVGKYTVRFEKW PASHASPKLIPSYGGWTTFRGI
Subjt:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI

Query:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD
        PL+L NM TFQQIGKACGGLIKVAEETKTA NLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQ +THSEGKWLMERNVRLHGTFKRQAAASFD+FNPD
Subjt:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD

Query:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF
        SEQFLFDG+EAISPDL NT SGSRKSISPEQPS LKSVIIKPA+ ATSP +LNEEVVNDNSLHATA KSK KI   ISND SLDKGKQKVDIPSQLT AF
Subjt:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF

Query:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL
        I  KPKRKVSFNSP NKT FFNP+SAPANH     SPEKK+RVS+ERSVKKKS+   PK RANQG+    TQPL+++AHD++ASKKGLSLTVDLGNLPVL
Subjt:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL

Query:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP
        DPSKSFEDHHSSDNAEVIDITNTE+VPETPELKMTDPEK  S PEVN+RKQKHSHRRRHYYRKKED EKD+NSE FKNQLV WLKENG       +  S 
Subjt:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP

Query:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL
        D                  S   + S   +      S+GGILILWD  HHSLLSQEEG FSL+ANF S NNSWWLTGLYGPV+RRERLN W DLHNL HL
Subjt:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL

Query:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE
        NS PWI+GGDLN +RMREESTAVT S+HSSNMLN+FIS N LIDPPL+NNRYTWSNLR PPTFSRLDRFLYN  WE+LFNPHITRTLPRPTSDHFPLVCE
Subjt:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE

Query:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----
        DS+ T+ WGPAPFRLNSI LNDPEFKRNMERWWELS Q GHPGF FIQRLKSLAN IKPWQKEKF S++SAKENII+EVD+IDK ELDT L  E++    
Subjt:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----

Query:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS
                             KK+WLKEGDEN AFFHRICSSRQKRN+IHEIQDE+GS QNTN +ISLAFVN+F+++YR STK +PLFI+NL+W PIDYS
Subjt:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS

Query:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK
        +WS LCAPF EEEIKGVI SF+GNKAPGPDGFPISFFKSYW LLKEDIL IFKDF+EKGVINKNMNNT+IALI KKK+YSHPKDFRPISLTTSIYK IAK
Subjt:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK

Query:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK
        TLSN+LKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNL+W+FID VLKK NYP SWR+WIRGCISNVTYSIIVNGK
Subjt:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK

Query:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP
        PQGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHLESTGAIKGVCL  DCNISHILFADDILLFVEDN  +LNNLRMAISLFEKASGLKINLSKSA+VP
Subjt:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP

Query:  VNVPWPRALDCASS
        VNV W RAL+CASS
Subjt:  VNVPWPRALDCASS

A0A5A7UV84 Reverse transcriptase domain-containing protein0.0e+0064.82Show/hide
Query:  MITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFPSPVSLENTVVLVRRFFHDDWYKIL
        MITPKVE K K RP+FLPRSSPE   S PIDYHKRSYAK VTEGRP  +SDSS+SY    SSDSSHSSGNS  + PSP  LENTVVLVRRFFHDDW KIL
Subjt:  MITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFPSPVSLENTVVLVRRFFHDDWYKIL

Query:  QNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGIPLNLCNMKTFQQIGKACGGLIKVA
        QNLRKQ EESFTYNAFHAEKALVHFNSN+P NLLCQN+GWTTVGKY+VRFEKW+PA HA+PKLIPSYGGWTTF+                          
Subjt:  QNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGIPLNLCNMKTFQQIGKACGGLIKVA

Query:  EETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPDSEQ----FLFDGMEAISPDLQNTF
           + ++ L+E                                                          +D+F+ + E      LFDG EAISPD  +T 
Subjt:  EETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPDSEQ----FLFDGMEAISPDLQNTF

Query:  SGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAFILDKPKRKVSFNSPGNKTNF
        S SRKS +P+QPS LKSVIIKP + ATSP  LNEEVVND++LHATA KS+ +I   I ND  LDKGKQKVDI      A  L+KPKRKVSFNSP NKTN 
Subjt:  SGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAFILDKPKRKVSFNSPGNKTNF

Query:  FNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAEVIDI
        FNP+SAPANHS S+SSPEKKQ+VS+ERS+KKKS+   P     Q +    TQP++++AHD+ ASKKGLSL V+LG+LPVLDPSKSFEDHHSS NAEVIDI
Subjt:  FNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAEVIDI

Query:  TNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDS-------NSEVF-KNQLVAW------LKENGALIKNTLISYSPDFVILT
        TNTE+VPETPE+KM   E   S  E N+RK KH HRRR+YYRKK    + S       + +++ K +L+ W           ALIKN +ISYSPDFVILT
Subjt:  TNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDS-------NSEVF-KNQLVAW------LKENGALIKNTLISYSPDFVILT

Query:  ETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTAN-FLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHLNSFPW
        ET LK  NK+I+KS WPSNSI WIVKNA  SSGGILILWD   HSLLSQEE +FSL+AN FL++N+SWWLTGLYGP +RR+R++FW DLHNL HLNSFPW
Subjt:  ETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTAN-FLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHLNSFPW

Query:  ILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCEDSSPT
         L  DLN IRMREE+T++ SS+HSS MLNNFIS N LIDPPL+NNR+TWSNLR P TFSR+DRFLYN +WE LF+PH TRTLPRPTSDHFPLVCEDS+P 
Subjt:  ILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCEDSSPT

Query:  VSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDTKKIWLKEGD
        + WGPAPFRLNSI LNDPEFKRNMERWWE S Q GHPGF+FIQRLKSLAN IKPWQKEK HS++ AKE II+EVD+IDKKELDT L Q+++         
Subjt:  VSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDTKKIWLKEGD

Query:  ENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPD
                       R +  + +  D S + +   I           Y+SSTK++PLFI+NL W PI++SEW  LCAPFLEEEIKGVINSF+G KAP PD
Subjt:  ENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPD

Query:  GFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDA
        GFPISFFKSYW LLKEDI+ IFKDF+EKGVINKNMNNT+IALI KKK+YSHPKDFRPISLTTSIYKIIAKTLSN+LK TLP TISGNQLAFIKNRQITDA
Subjt:  GFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDA

Query:  ILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLS
        ILMANEA+DYWKVKKIKGFILKLDIEK F NL+WDFID+VL KKN+P SWR+WIRGCISNVTYS+I+NG+PQGRIKANRGLRQGDPLSPFLFVIAMDY S
Subjt:  ILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLS

Query:  RLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVP
        RLLSHLE++GAIKGV L  +CNISHILFADDILLFVEDN  +LNNL MA+SLFEKASGLKINL KSA+VPVNV   RA +CAS W I C SL LSYLGVP
Subjt:  RLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVP

Query:  LGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLG
        LGG+  S                                                                        KGSHLI W+ V K KEEGGLG
Subjt:  LGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLG

Query:  ISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRL
        ISRLQ+TN+ALL KWLWRY SEPN+LWR+LIQ KY+ KHPGD+PSN SSSSSKAPWRSII+N +WFK NQ WDL NGDQISFW+SNWS EGCLSTAYPRL
Subjt:  ISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRL

Query:  FALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKRCISHQPELSVASPLSKLLDLIWKSFI
        FAL++DKE S+KD WN+I+NQW I FRR LNDRE   W++IL  LP PR NRGSSKPTWIPD  KSFSIASAK  IS Q + +      KLL++IWKS I
Subjt:  FALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKRCISHQPELSVASPLSKLLDLIWKSFI

Query:  PMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNFAHFSDDFEALISFFLSLNQSLPKHK
        PMKIKFFMWCLIQR+++T EVIQQRM N  LQPNWCVLC KD+ESG HLFL+CD VKPLWSLL ++LNFA  +DDFEAL SFFLSL  SLPKHK
Subjt:  PMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNFAHFSDDFEALISFFLSLNQSLPKHK

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein0.0e+0079.21Show/hide
Query:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD
        MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRD EHCIWIRKTRNGKGC AEIFRVD
Subjt:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD

Query:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP
        +KNRKSCILVPEG EKS WVSFLSMITPKVE KAK RP FLPRSSPE   S PIDYHKRSYAKAV+EGR SISSDSS+SYA   SSDSS SSGNSP + P
Subjt:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP

Query:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI
         PV LENTVVLVRRFFHDDW KILQNLRKQ EESFTYNAFHAEK LVHFNSNVPANLLCQN+GWTTVGKYTVRFEKW PASHASPKLIPSYGGWTTFRGI
Subjt:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI

Query:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD
        PL+L NM TFQQIGKACGGLIKVAEETKTA NLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQ +THSEGKWLMERNVRLHGTFKRQAAASFD+FNPD
Subjt:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD

Query:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF
        SEQFLFDG+EAISPDL NT SGSRKSISPEQPS LKSVIIKPA+ ATSP +LNEEVVNDNSLHATA KSK KI   ISND SLDKGKQKVDIPSQLT AF
Subjt:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF

Query:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL
        I  KPKRKVSFNSP NKT FFNP+SAPANH     SPEKK+RVS+ERSVKKKS+   PK RANQG+    TQPL+++AHD++ASKKGLSLTVDLGNLPVL
Subjt:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL

Query:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP
        DPSKSFEDHHSSDNAEVIDITNTE+VPETPELKMTDPEK  S PEVN+RKQKHSHRRRHYYRKKED EKD+NSE FKNQLV WLKENG       +  S 
Subjt:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP

Query:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL
        D                  S   + S   +      S+GGILILWD  HHSLLSQEEG FSL+ANF S NNSWWLTGLYGPV+RRERLN W DLHNL HL
Subjt:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL

Query:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE
        NS PWI+GGDLN +RMREESTAVT S+HSSNMLN+FIS N LIDPPL+NNRYTWSNLR PPTFSRLDRFLYN  WE+LFNPHITRTLPRPTSDHFPLVCE
Subjt:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE

Query:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----
        DS+ T+ WGPAPFRLNSI LNDPEFKRNMERWWELS Q GHPGF FIQRLKSLAN IKPWQKEKF S++SAKENII+EVD+IDK ELDT L  E++    
Subjt:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----

Query:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS
                             KK+WLKEGDEN AFFHRICSSRQKRN+IHEIQDE+GS QNTN +ISLAFVN+F+++YR STK +PLFI+NL+W PIDYS
Subjt:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS

Query:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK
        +WS LCAPF EEEIKGVI SF+GNKAPGPDGFPISFFKSYW LLKEDIL IFKDF+EKGVINKNMNNT+IALI KKK+YSHPKDFRPISLTTSIYK IAK
Subjt:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK

Query:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK
        TLSN+LKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNL+W+FID VLKK NYP SWR+WIRGCISNVTYSIIVNGK
Subjt:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK

Query:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP
        PQGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHLESTGAIKG                                                        
Subjt:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP

Query:  VNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRR
                        I C +LPL+YLGVPLGGNPKS  FWRNIEDRI KKLSNWKYAHISKGGRLTLIKSTL+S+PIY+LSVFQAP STYKNIEKLWR 
Subjt:  VNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRR

Query:  FLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQ
        FLWKGSC  KGSHLI WSIVTKPKEEGGLGISRLQ+TNQALL KWLWRY+SEPNSLWR+LI +KY+ KHPGDLPSNISSSSSKAPWRSIINN +WFK NQ
Subjt:  FLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQ

Query:  GWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIA
        GWDL NGDQISFW+SNWS EGCLSTAYPRLFALS+DKESSIKDVWNS NNQWEI FRR LNDRELSTWQ+IL NLP+ RTNRG SKPTWIPDSKK FSIA
Subjt:  GWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIA

Query:  SAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNFA
        SAK CISHQP+ SVA+P  KLL+LIWK+ +PMKIKFFMWCL+QRKLNT EV         LQPNWCVLCKK SE+GAHLFL CD+VKPLWSLL ++LNFA
Subjt:  SAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNFA

Query:  HFSDDFEALISFFLSLNQSLPKHKIVNCGVIAVLWCIWSERNNRTFDNLSYQKTIINLWEDCKILIGNWSSRDPTFKNYSASTIALNLNA
          SDDFEA+ SFFLSLNQSLPKHK+V CG+IA+LW IW+ERNNR FD LSYQK+I NLWEDCKILIGNW SRDPTFKNYSA+TIALNLNA
Subjt:  HFSDDFEALISFFLSLNQSLPKHKIVNCGVIAVLWCIWSERNNRTFDNLSYQKTIINLWEDCKILIGNWSSRDPTFKNYSASTIALNLNA

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein0.0e+0072Show/hide
Query:  YFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVDNK
        +FKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSPRDLDWIR TLKSLI TP++NRFFLE RDSE  IWIRKTRN KGC AEIFRVD K
Subjt:  YFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVDNK

Query:  NRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFPSP
        NRKSCILVPEGP+KSGWVSFLSMITPKVE KAK RP+FLPR+SP+   S PIDYHKRSYAKAVTEGRP  +SDSS+SY   DSSDSSHSS NS  + PS 
Subjt:  NRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFPSP

Query:  VSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGIPL
          LENTVV+VRRFFHDDW+KILQNLRKQ EESFTYNAFHAEKALVHF+SN+PANLLCQN+GW+TVGKY+VRFEKW+P  HA+PKLIPSYGGWTTFRGIPL
Subjt:  VSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGIPL

Query:  NLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPDSE
        +L NM TFQQIGKAC GLIKVAEET++A NLIEA++K+RYNYSGFLPA V+IFD EGNKF VQ +TH EGKWL+ERNVRLHGTFKRQAAASFD+FNP+SE
Subjt:  NLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPDSE

Query:  QFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAFIL
        QF F+G EAISPD  +T S  RKS +P+QPS LKSVIIKP R+AT P  LNEE+VND++LHATA KSK +I   ISND  LDKGKQKVDI  Q   A  L
Subjt:  QFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAFIL

Query:  DKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVLDP
        DK KRKVSFNSP NKTN FNP+SAPANHSPS++SPEKKQ+VS+ERS+KKKS+ T P S+ANQ +    TQP++I+AHD +A+KKGLSLTVDLG+LP LDP
Subjt:  DKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVLDP

Query:  SKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLIS---YS
        +KS EDHH+SDNAEV+DITNTE+VPETPE+KM   E   S  E N+RK KH H+R++YYRKKE+ EKD +SE FK QLV+WLK+NG  +     S    +
Subjt:  SKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLIS---YS

Query:  PDFVILTE--TRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNN-SWWLTGLYGPVQRRERLNFWTDLHN
           V+L +  + LK  NK+I+KSLWPSNSI WI KNA  SSGGILILWD  +HSLLSQEEG+FSL+ANFL +NN SWWLTGLYGPV+RRER++FW +LHN
Subjt:  PDFVILTE--TRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNN-SWWLTGLYGPVQRRERLNFWTDLHN

Query:  LLHLNSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFP
        L HLNSFPWILGGDLN IRMREEST+V SS+H+S MLNNFIS N LIDPPL+NNR+TWSNLR PPTFSR+DRFLYN +WE LF+PH TRTLPR TSDHFP
Subjt:  LLHLNSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFP

Query:  LVCEDSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT
        LVCEDS+P +SWGP PFRLNSI L+DPEFKRNM RWWE S Q G+PGF+FIQRLKSLANFIKPWQKEK HS++ AKE II+EVD+IDKKELDT L QE++
Subjt:  LVCEDSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT

Query:  -------------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKP
                                 KK+WL+EGDEN +FFHRICSSRQKR+ IHEIQDE+GS QNTN SIS AF+ +F+++YRSSTK++PLFI+NL W P
Subjt:  -------------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKP

Query:  IDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYK
        I  SEWS LCAPFLE EIKGVINSF+G K PGPDGFPISFFKS+W                                                       
Subjt:  IDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYK

Query:  IIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSII
                 LK TLP+TISGNQLAF+KNRQITDAILMANEA+DYWKVKKIKGFILKLDIEKAFDNL+ DFID VL+KKN+P  WR+WIRGCISNVTYS+I
Subjt:  IIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSII

Query:  VNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKS
        +NG+PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLES+GAIKGV L  +CNISHILFADDILLF+EDN  +L NLRMA+SLFE+ASGLKINL KS
Subjt:  VNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKS

Query:  AMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEK
        A+VPVNV   RA +CAS W I C SLPLSYLGVPLGGNPKS  FWRN+ED+I KKL+NWKYA ISKGGRLTLIKSTL+S+PIYQLSVFQAP  T KNIEK
Subjt:  AMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEK

Query:  LWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWF
        LWR+FLWKG+   +GSHLI W+ V+K KEEGGLGISRL +TN+ALL KWLWRY SEPN+LWR+LIQ KY+ K PGD+PSNISSS+SKAPWRSII++++WF
Subjt:  LWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWF

Query:  KRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKS
        K NQ WDL NGDQISFW+SNWS EG LSTAYPRLFAL++DKE S+KD WN+ +NQW I FRR LNDRE   W++IL  LP PR+NRGSSKPTWIPDS  S
Subjt:  KRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKS

Query:  FSIASAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPN
        FSIASAK  IS Q + +   P +KLL++IWKS IPMKIKFFMWCLIQR++NT EVIQQ+MPN  LQPN
Subjt:  FSIASAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPN

A0A5D3C3M3 LINE-1 retrotransposable element ORF2 protein0.0e+0069.8Show/hide
Query:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD
        MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRD EHCIWIRKTRNGKGC AEIFRVD
Subjt:  MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVD

Query:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP
        +KNRKSCILVPEGPEKSG VSFLSMITPKVE KAK RP+FLPRSSPE   S PIDYHKRSY KAV++GR SISSDSS+SY    SSDSS SSGNSP + P
Subjt:  NKNRKSCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFP

Query:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI
         PV LENTVVL                                 AL+HFNSNVPANLLCQN+GWTTV KY VR                           
Subjt:  SPVSLENTVVLVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGI

Query:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD
                                                                                                            
Subjt:  PLNLCNMKTFQQIGKACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPD

Query:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF
          + LFDG+EAISPDL NT SGSRKS S EQPS LKSVIIKPARDATSP +LNEEVVNDNSLHAT IKS+ KI   ISND SLDKGKQKVDIPSQLT AF
Subjt:  SEQFLFDGMEAISPDLQNTFSGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAF

Query:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL
        I DKPKRKVSFNSP NKT FFN +SAP NHSP +SSPEKKQRVS+ERSVKKKS+   PKSRANQG+    TQPL+++AHD++ASKKGLSLTVDLGNLPVL
Subjt:  ILDKPKRKVSFNSPGNKTNFFNPNSAPANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVL

Query:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP
        DPSKSFEDHHSSDNAEVIDITNTE+VPETPELKMTDPEK  S PEVN+RKQKHSHRRRHYYRKKED EKD+NSE FKNQLV WLKENG  +     S   
Subjt:  DPSKSFEDHHSSDNAEVIDITNTEMVPETPELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSP

Query:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL
             T T   F            +SI WIVKNAIDSSGGILILWD  HHSLL                                               
Subjt:  DFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAIDSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL

Query:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE
                GDLN +RMREESTAVTSS+HSSNMLNNFIS N LIDPPL+NNRYTWSNLR PPTFSRLDRFLYN  WE LFNPHITRTL RPTSDHFPLVCE
Subjt:  NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCE

Query:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----
        DS+ T+ WGPAPFRLNSI LNDP+FKRNMERWWELS Q GHPGF+FI+RLKSLAN IKPWQKEKFHS++SAKENII+EVD+IDK ELDT L QE++    
Subjt:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDT----

Query:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS
                             KK+WLKEGDEN AFFHRICSSRQKRN+IHEIQDE+GS QNTN +ISLAFVN+F+ +YR STK +PLFI+NL+W PIDYS
Subjt:  ---------------------KKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKWKPIDYS

Query:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK
        +WS LCAPFLEEEIKGVI SF+GNKAPGPDGFPISFFKSYW LLKEDIL IFKDF+EKG                                     IIAK
Subjt:  EWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAK

Query:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK
        TLSN+LKLTLPDTISGNQLAFIKNRQITDAIL ANEALDYWKVKKIK FILKLDIEKAFDNL+WDFIDFVLKKKNYP SWR+WIRGCISNVTYSIIVN K
Subjt:  TLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGK

Query:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP
        PQ RIKANRGLRQGDPLSPFLFV AMDYLSRLLSHLES+GAIKGVCL  DCNISHILFADDILLFVEDN  +LNNLRMA+SLFEKASGLKINLSKSAMVP
Subjt:  PQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVP

Query:  VNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRR
        VNV W RAL+CASSW I C +LPL+YLGVPLGGNPKS  FWRNIEDRI KKL+NWKYAHISKGGRLTLIKSTL+S+ IYQLSVFQAP STYKNIEKLWR 
Subjt:  VNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRR

Query:  FLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQ
        FLWKGS   KGSHLI WSIVTK KEEGGLGISRLQ+ NQALL KWLWRY+SEPNSLWR+LI +KY+ KHPGD+PSNISSSSSKAPW+SIINN +WFK NQ
Subjt:  FLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQ

Query:  GWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIA
        GWDL N DQISFW+SNWS EGCLSTAYPRLFALSIDK+SSIKDVWNS NNQWEI FRR LNDRELSTWQ IL NL +PRTNRG SKPTWIPDSKK FSIA
Subjt:  GWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIA

Query:  SAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLI
        SAK CISHQP+ SVA+P  KLLDLIWK+ +PMKIKFFMWCL+
Subjt:  SAKRCISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-4926.58Show/hide
Query:  RLKSLANFIKPWQK-EKFHSISSAKENIIKEVDAIDKKELDTLLCQEDTKKIWLKEG----DENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISL
        ++ +L + +K  +K E+ HS +S ++ I K    + + E    L + +  + W  E     D   A   R+   ++++N I  I+++ G        I  
Subjt:  RLKSLANFIKPWQK-EKFHSISSAKENIIKEVDAIDKKELDTLLCQEDTKKIWLKEG----DENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISL

Query:  AFVNYFTKLYRS---STKTNPLFIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNM
            Y+  LY +   + +    F+D      ++  E   L  P    EI  +INS    K+PGPDGF   F++ Y + L   +L +F+   ++G++  + 
Subjt:  AFVNYFTKLYRS---STKTNPLFIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNM

Query:  NNTFIALIAKK-KNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKG-FILKLDIEKAFDNLS
            I LI K  ++ +  ++FRPISL     KI+ K L+N+++  +   I  +Q+ FI   Q    I  +   + +    K K   I+ +D EKAFD + 
Subjt:  NNTFIALIAKK-KNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKG-FILKLDIEKAFDNLS

Query:  WDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDIL
          F+   L K      + + IR      T +II+NG+         G RQG PLSP LF I ++ L+R +   +    IKG+ LGK+  +   LFADD++
Subjt:  WDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDIL

Query:  LFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKS--KPFWRNIEDRIHKKLSNWKYAHIS
        +++E+      NL   IS F K SG KIN+ KS     N                  S  + YLG+ L  + K   K  ++ +   I +  + WK    S
Subjt:  LFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKS--KPFWRNIEDRIHKKLSNWKYAHIS

Query:  KGGRLTLIKSTLTSIPIYQLSV--FQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPK--WLWRYHSEPNSLW
          GR+ ++K  +    IY+ +    + P++ +  +EK   +F+W    N K + + K SI+++  + GG+ +   ++  +A + K  W W Y +     W
Subjt:  KGGRLTLIKSTLTSIPIYQLSV--FQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPK--WLWRYHSEPNSLW

Query:  RK
         +
Subjt:  RK

P08548 LINE-1 reverse transcriptase homolog1.8e-4525.74Show/hide
Query:  QRLKSLANFIKPWQKEKFHSISSAKENIIKEVDA-IDKKELDTLLCQEDTKKIWLKEG----DENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISIS
        + + +L   +K  +KE+  +   ++   I ++ A +++ E   ++ Q +  K W  E     D+  A    +   ++ +++I  I++ +         I 
Subjt:  QRLKSLANFIKPWQKEKFHSISSAKENIIKEVDA-IDKKELDTLLCQEDTKKIWLKEG----DENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISIS

Query:  LAFVNYFTKLYR---SSTKTNPLFIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKN
             Y+ KLY     + K    +++      +   E   L  P    EI   I +    K+PGPDGF   F++++ + L   +L +F++  ++G++   
Subjt:  LAFVNYFTKLYR---SSTKTNPLFIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKN

Query:  MNNTFIALIAKK-KNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYW-KVKKIKGFILKLDIEKAFDNL
             I LI K  K+ +  +++RPISL     KI+ K L+N+++  +   I  +Q+ FI   Q    I  +   + +  K+K     IL +D EKAFDN+
Subjt:  MNNTFIALIAKK-KNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYW-KVKKIKGFILKLDIEKAFDNL

Query:  SWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDI
           F+   LKK     ++ + I    S  T +II+NG          G RQG PLSP LF I M+ L+     +    AIKG+ +G +  I   LFADD+
Subjt:  SWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDI

Query:  LLFVEDNAFYLNNLRMAISLFEKASGLKINLSKS-AMVPVNVPWPRALDCASSWDIPCQSLP--LSYLGVPLGGNPKS--KPFWRNIEDRIHKKLSNWKY
        ++++E+       L   I  +   SG KIN  KS A +  N       +      IP   +P  + YLGV L  + K   K  +  +   I + ++ WK 
Subjt:  LLFVEDNAFYLNNLRMAISLFEKASGLKINLSKS-AMVPVNVPWPRALDCASSWDIPCQSLP--LSYLGVPLGGNPKS--KPFWRNIEDRIHKKLSNWKY

Query:  AHISKGGRLTLIKSTLTSIPIYQLSV--FQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPN-
           S  GR+ ++K ++    IY  +    +APLS +K++EK+   F+W    N K   + K ++++   + GG+ +  L++  ++++ K  W +H     
Subjt:  AHISKGGRLTLIKSTLTSIPIYQLSV--FQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPN-

Query:  SLWRKL
         +W ++
Subjt:  SLWRKL

P0C2F6 Putative ribonuclease H protein At1g657507.8e-3328.29Show/hide
Query:  SKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQI
        +K  +  I +R+  ++S W+   +S  GRLTL K+ L+S+P++ +S    P S    +++L R FLW  +   K  HL+KWS V  PK+EGGLG+   + 
Subjt:  SKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQI

Query:  TNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSI-INNSEWFKRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSI
         N+AL+ K  WR   E NSLW  ++Q KY      D    I   S  + WRSI I   +      GW   +G QI FW   W +   L            
Subjt:  TNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSI-INNSEWFKRNQGWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSI

Query:  DKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKR--CISHQPELSVASPLSKLLDLIWKSFIPMK
        D   + KD+W      W+ A    ++    +  +  L  + +        + +W       FS+ SA     +   P  ++AS      + +WK  +P +
Subjt:  DKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKR--CISHQPELSVASPLSKLLDLIWKSFIPMK

Query:  IKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLW
        +K F+W +  + + T E   +R  + +   N C +CK   ES  H+   C     +W
Subjt:  IKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLW

P11369 LINE-1 retrotransposable element ORF2 protein1.5e-4426.05Show/hide
Query:  SLANFIKPWQKEKFHSIS-SAKENIIKEVDAIDKKELDTLLCQEDTKKIWLKEG----DENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFV
        SL   +K  +K++ +S   S ++ IIK    I++ E    + + +  + W  E     D+  A   R+    + + +I++I++E G        I     
Subjt:  SLANFIKPWQKEKFHSIS-SAKENIIKEVDAIDKKELDTLLCQEDTKKIWLKEG----DENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFV

Query:  NYFTKLYRSSTKTNPL-----FIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMN
        +++ +LY  STK   L     F+D  +   ++  +   L +P   +EI+ VINS    K+PGPDGF   F++++    KED++ I    + K  +   + 
Subjt:  NYFTKLYRSSTKTNPL-----FIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMN

Query:  NTF----IALIAK-KKNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYW-KVKKIKGFILKLDIEKAFD
        N+F    I LI K +K+ +  ++FRPISL     KI+ K L+N+++  +   I  +Q+ FI   Q    I  +   + Y  K+K     I+ LD EKAFD
Subjt:  NTF----IALIAK-KKNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYW-KVKKIKGFILKLDIEKAFD

Query:  NLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFAD
         +   F+  VL++      +   I+   S    +I VNG+    I    G RQG PLSP+LF I ++ L+R +   +    IKG+ +GK+  +   L AD
Subjt:  NLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFAD

Query:  DILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKS--KPFWRNIEDRIHKKLSNWKYA
        D+++++ D       L   I+ F +  G KIN +KS             +   +      +  + YLGV L    K      +++++  I + L  WK  
Subjt:  DILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKS--KPFWRNIEDRIHKKLSNWKYA

Query:  HISKGGRLTLIKSTLTSIPIYQLSV--FQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPK--WLWRYHSEPN
          S  GR+ ++K  +    IY+ +    + P   +  +E    +F+W    N K   + K S++   +  GG+ +  L++  +A++ K  W W Y     
Subjt:  HISKGGRLTLIKSTLTSIPIYQLSV--FQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPK--WLWRYHSEPN

Query:  SLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEW
          W ++   +      G L  +  + + +    SI NN  W
Subjt:  SLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEW

P14381 Transposon TX1 uncharacterized 149 kDa protein2.2e-3522.9Show/hide
Query:  SNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL--NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYT--WSNLRIPP---
        S  ++ L  +Y P    ER  F+  L   +    +    I+GGD N      +         S ++L   I+  SL+D     N  T  ++ +R+     
Subjt:  SNNSWWLTGLYGPVQRRERLNFWTDLHNLLHL--NSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISINSLIDPPLSNNRYT--WSNLRIPP---

Query:  TFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCEDSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLAN-FIKPW
        + SR+DR   + +  ++     +     P SDH  +    S        A +  N+ +L D  F +++   W    +     FA + +   +    +K  
Subjt:  TFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCEDSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLAN-FIKPW

Query:  QKEKFHSISSAKENIIKEVDAIDKKELD-----------TLLCQEDTKKIWLK--------------------EGDENFAFFHRICSSRQKRNIIHEIQD
         +E   S+S  +     E++A++ + LD            L C+   +K  L+                    + D    FF+ +   +  R  I  +  
Subjt:  QKEKFHSISSAKENIIKEVDAIDKKELD-----------TLLCQEDTKKIWLK--------------------EGDENFAFFHRICSSRQKRNIIHEIQD

Query:  EDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKW---KPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAI
        EDG+      +I     +++  L+ S    +P   + L W     +       L  P   +E+   +     NK+PG DG  I FF+ +W  L  D   +
Subjt:  EDGSNQNTNISISLAFVNYFTKLYRSSTKTNPLFIDNLKW---KPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAI

Query:  FKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFIL
          + ++KG +  +     ++L+ KK +    K++RP+SL ++ YKI+AK +S +LK  L + I  +Q   +  R I D + +  + L + +   +    L
Subjt:  FKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFIL

Query:  KLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMD-YLSRLLSHLESTGAIKGVCLGKD
         LD EKAFD +   ++   L+  ++ P +  +++   ++    + +N      +   RG+RQG PLS  L+ +A++ +L  L   L  TG    V    D
Subjt:  KLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMD-YLSRLLSHLESTGAIKGVCLGKD

Query:  CNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGN--PKSKPFWRNIEDRI
          +    +ADD++L  +D    L   +    ++  AS  +IN SKS+ +         L  A   DI  +S  + YLGV L     P S+ F   +E+ +
Subjt:  CNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLKINLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGN--PKSKPFWRNIEDRI

Query:  HKKLSNWK-YAHI-SKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWL
          +L  WK +A + S  GR  +I   + S   Y+L            I++    FLW       G H +   + + P +EGG G+  ++        + +
Subjt:  HKKLSNWK-YAHI-SKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWL

Query:  WRY-HSEPNSLWRKLIQLKYQ
         RY +++P+  W  L    Y+
Subjt:  WRY-HSEPNSLWRKLIQLKYQ

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.3e-2626.43Show/hide
Query:  ILGGDLNAIRMREESTAVTSSTHSSNMLNNF---ISINSLIDPPLSNNRYTWSNLRIP-PTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFP-LVCE
        IL GD + I    +  +V  ++     L  F   +  + L+D P     YTWSN +   P   +LDR + N +W   F   I        SDH P ++  
Subjt:  ILGGDLNAIRMREESTAVTSSTHSSNMLNNF---ISINSLIDPPLSNNRYTWSNLRIP-PTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFP-LVCE

Query:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKEL----DTL------
        ++ P  S     FR  S +   P F  ++   WE     G   F+  + LK+     K   ++ F +I    +  +  +++I  + L    D+L      
Subjt:  DSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFAFIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKEL----DTL------

Query:  --------------LCQEDTKKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRS-STKTNPLFIDNLK-WKPIDY
                        ++ ++  WL++GD N  FFH++  + Q +N+I  ++ +D         +    V Y+T L  S S    P  +  +K   P   
Subjt:  --------------LCQEDTKKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFTKLYRS-STKTNPLFIDNLK-WKPIDY

Query:  SEW--SPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKI
        ++   S L A   ++EI   + +   NKAPGPD F   FF   W ++K+  +A  K+F+  G + K  N T I LI K         FRP+S  T +YKI
Subjt:  SEW--SPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYSHPKDFRPISLTTSIYKI

Query:  I
        I
Subjt:  I

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.3e-1924.07Show/hide
Query:  DCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNP
        D   S+     +LP+ YLG+PL     +   +  + ++I  ++  W   H+S  GRL LI S + S+  + +S F+ P +  K I+ +   FLW G    
Subjt:  DCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNP

Query:  KGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQ
             + WS V  PK+EGGLGI  L+  N+               S W               +  N +  S    W+ I+ +          D+ NG  
Subjt:  KGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQ

Query:  ISFWFSNWSTEGCL--STAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKR--C
         SFWF NWS  G L   T +     + I   +S+ +    +N++     RR+ +D  L   + ++  +       G     W  +        + K    
Subjt:  ISFWFSNWSTEGCL--STAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKR--C

Query:  ISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQC
         + +P+L V          +W S    K     W  I+ +L T +   + +   A   + CVLC    E+  HLF  C
Subjt:  ISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQC

AT4G29090.1 Ribonuclease H-like superfamily protein6.4e-2222.6Show/hide
Query:  SIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLP
        ++P Y ++ F  P +  K I  +   F W+     KG H   W  ++  K EGG+G   ++  N ALL K +WR  S P SL  K+ + +Y   H  D  
Subjt:  SIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLP

Query:  SNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQISFWFSNWSTEGCLSTAY------PRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTW
        +    S     W+SI  + E  ++     + NG+ I  W   W      S A       P+ +A S+     + D+ +    +W    R+++ +      
Subjt:  SNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQISFWFSNWSTEGCLSTAY------PRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTW

Query:  QRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKRCISH------QPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQ
        +R L     P   R     TW   S   +++ S    ++        P+      L+ +   IWKS    KI+ F+W  +   L  +  +  R  +   +
Subjt:  QRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKRCISH------QPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQ

Query:  PNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNF-AHFSDDFEALISFFLSLNQSLPKHKIVNCGVIAVLWCIWSERNNRTF
         + C+ C    E+  HL  +C   +  W++    +     ++D     + +  +L    P+ +  +  V  +LW +W  RN   F
Subjt:  PNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNF-AHFSDDFEALISFFLSLNQSLPKHKIVNCGVIAVLWCIWSERNNRTF

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.8e-0928.47Show/hide
Query:  SIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKE-EGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDL
        ++P+Y +S F+      K +      F W    N +    + W  + K KE +GGLG   L   NQALL K  +R   +P++L  +L++ +Y   H   +
Subjt:  SIPIYQLSVFQAPLSTYKNIEKLWRRFLWKGSCNPKGSHLIKWSIVTKPKE-EGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDL

Query:  PSNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQISFWFSNW
          ++ +  S A WRSII+  E   R     + +G     W   W
Subjt:  PSNISSSSSKAPWRSIINNSEWFKRNQGWDLKNGDQISFWFSNW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.3e-1146.27Show/hide
Query:  IVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDC-NISHILFADD
        I+NG PQG +  +RGLRQGDPLSP+LF++  + LS L    +  G + G+ +  +   I+H+LFADD
Subjt:  IVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDC-NISHILFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTACTTCAAATCACTTCCTAGATCCTGCAAAATAGAAAGAAAAGAATTTGTCCTTCTCCTTGACAAGTATGCAAAACACACTCATTATTGGCTTACCGAA
ACAGGAGCTCACAAAGCCTTTTCCATTGAAGTTTCCCCAAGAGATTTAGATTGGATAAGAAGCACTCTTAAATCACTGATTGAAACTCCAAGCTCGAATCGCTTC
TTCCTTGAAAATCGTGATTCCGAGCATTGCATCTGGATTAGGAAAACAAGAAATGGTAAAGGATGTATTGCAGAAATTTTCAGAGTGGATAACAAAAATAGAAAA
TCTTGCATCCTAGTTCCGGAAGGTCCGGAGAAAAGTGGGTGGGTTTCCTTTTTATCAATGATCACTCCAAAAGTAGAAACAAAAGCAAAGATAAGACCATCATTC
TTACCAAGAAGCAGCCCTGAAATCCACTCTTCATCTCCCATCGATTACCACAAACGTTCATATGCCAAAGCAGTCACTGAAGGAAGACCTTCCATTTCAAGCGAC
TCAAGTGAATCTTATGCATCAAGTGATTCAAGTGATTCAAGCCATTCTTCAGGTAACAGCCCACGCAACTTCCCATCCCCTGTCTCACTAGAAAATACAGTGGTG
TTAGTAAGACGTTTTTTTCATGATGACTGGTACAAAATCCTTCAAAATTTGAGGAAGCAAATCGAGGAATCTTTTACCTACAACGCTTTCCATGCTGAAAAGGCC
CTGGTGCACTTCAACTCAAATGTACCAGCGAATCTTCTCTGTCAAAACAGAGGGTGGACAACCGTTGGGAAGTACACGGTCAGGTTTGAAAAATGGAATCCTGCT
TCCCATGCCTCTCCAAAACTCATTCCTAGCTATGGGGGGTGGACAACCTTCAGGGGAATTCCTCTTAACTTATGCAACATGAAGACCTTTCAACAAATCGGGAAA
GCATGTGGAGGTTTGATTAAAGTAGCCGAGGAAACAAAGACAGCAAGCAACCTGATTGAAGCTAAATTAAAAATTAGATACAACTATTCAGGCTTTTTACCAGCT
TATGTGAAGATCTTCGATCAAGAAGGTAACAAATTTGTTGTTCAAACTATCACTCACTCAGAAGGAAAATGGCTAATGGAAAGGAATGTTAGATTGCACGGTACC
TTCAAGAGGCAAGCTGCGGCTTCTTTCGATGAATTCAATCCCGATTCAGAGCAATTCCTGTTCGATGGCATGGAGGCCATTTCACCGGATCTTCAGAACACCTTC
TCCGGCAGCCGTAAAAGCATATCACCGGAGCAGCCATCTACATTAAAATCAGTCATTATTAAACCTGCCAGAGATGCCACGTCGCCACCTTCTTTAAATGAAGAG
GTAGTTAATGATAACAGTTTGCATGCAACGGCTATTAAATCGAAGGGAAAGATTTCATATAGGATATCAAATGATTGCTCTTTGGATAAAGGGAAGCAGAAGGTT
GACATTCCATCCCAACTAACTCCTGCATTTATTTTGGATAAGCCAAAAAGAAAAGTATCCTTTAATTCCCCCGGTAATAAAACCAATTTTTTTAATCCGAACTCT
GCTCCAGCCAATCATTCCCCTTCTGTTAGCTCCCCGGAGAAAAAACAAAGAGTCAGCAAAGAGAGAAGTGTTAAGAAGAAATCAACAATTACTCCGCCTAAATCA
AGAGCCAATCAGGGCCAAGATGCTTCTAACACTCAACCTCTTAAAATTATAGCTCATGATATGAATGCTTCTAAAAAAGGTCTCTCTCTCACAGTGGACCTGGGA
AATCTACCAGTTTTAGATCCGAGTAAATCCTTCGAAGATCACCACAGCTCTGACAATGCAGAAGTCATAGACATCACGAACACAGAGATGGTTCCAGAGACACCT
GAATTGAAGATGACAGATCCAGAGAAACCAAAGTCCCCTCCGGAAGTCAACCATAGAAAGCAAAAACACTCTCATCGAAGAAGACACTACTATAGAAAAAAGGAA
GACACTGAGAAGGATTCAAATTCAGAAGTCTTCAAAAATCAACTGGTTGCTTGGCTAAAGGAAAATGGAGCCTTAATAAAAAATACTTTAATTTCCTATTCCCCC
GACTTTGTGATCCTCACTGAAACGAGGCTCAAATTCATAAATAAGAAAATTGTTAAGTCACTTTGGCCTTCAAACAGCATAAAGTGGATTGTGAAAAATGCTATA
GACAGTTCAGGAGGGATTCTGATTTTATGGGATGATCTTCATCATTCTCTTTTGAGTCAAGAGGAAGGGATGTTCAGCCTTACTGCAAACTTTCTGTCCTCTAAT
AATTCATGGTGGTTAACAGGCTTATATGGTCCGGTCCAAAGAAGGGAAAGATTAAATTTCTGGACTGATTTACATAATCTCCTTCACCTTAATTCTTTTCCTTGG
ATATTAGGGGGAGATCTGAATGCAATCAGAATGAGAGAGGAATCAACAGCAGTCACTTCCTCTACTCATAGTTCCAACATGCTTAACAACTTCATATCAATCAAT
TCCCTGATTGACCCTCCCCTGTCCAACAACAGATACACTTGGTCCAACTTGAGAATTCCTCCAACCTTTTCCCGTCTTGACAGATTCCTATACAATCCTAATTGG
GAAGTCCTCTTCAACCCTCATATCACTAGAACTCTTCCTCGACCTACCTCGGACCACTTTCCTTTAGTCTGTGAAGATTCATCCCCCACTGTGAGTTGGGGCCCT
GCTCCATTTAGATTAAACTCCATAGTCTTAAACGACCCTGAATTCAAAAGAAATATGGAAAGATGGTGGGAATTATCAGCCCAAGAGGGCCATCCCGGTTTTGCT
TTCATTCAAAGGCTCAAGTCCTTAGCTAATTTCATCAAACCTTGGCAAAAAGAGAAATTTCATTCCATCTCTTCTGCTAAAGAAAACATAATCAAGGAAGTGGAT
GCCATTGATAAGAAGGAACTGGACACCCTGTTGTGTCAGGAGGACACAAAAAAGATTTGGCTTAAAGAGGGAGACGAAAATTTTGCCTTCTTTCACCGAATTTGT
TCTTCCAGACAGAAGAGAAATATAATTCACGAAATTCAGGACGAAGATGGCTCCAATCAGAATACAAACATTAGCATTTCCCTCGCCTTTGTCAATTATTTTACA
AAGCTCTACAGGAGTTCAACCAAAACCAATCCCCTCTTCATTGATAACCTCAAATGGAAGCCAATTGATTACTCGGAGTGGTCCCCTCTTTGTGCCCCCTTCTTG
GAGGAAGAAATAAAAGGGGTCATCAACTCCTTTGAAGGAAATAAGGCCCCTGGTCCAGACGGATTCCCAATTTCCTTCTTTAAGTCCTATTGGAAACTTCTAAAA
GAGGACATTCTAGCCATTTTCAAGGATTTCTACGAGAAGGGAGTGATCAATAAGAATATGAACAACACTTTCATAGCGTTAATTGCAAAGAAGAAGAATTATTCT
CATCCAAAGGACTTCAGACCTATCAGCCTAACAACATCTATCTACAAGATTATTGCTAAGACTCTTTCCAACAAGTTAAAGCTCACCCTTCCTGATACTATCTCA
GGTAATCAACTGGCTTTTATCAAAAATCGTCAAATCACAGATGCCATCTTAATGGCAAATGAAGCTTTAGACTATTGGAAAGTGAAAAAAATCAAAGGATTTATC
TTGAAGCTGGATATTGAAAAGGCTTTTGATAATCTGAGCTGGGATTTCATTGATTTCGTTCTTAAGAAAAAGAATTACCCTCCTTCCTGGAGGCAGTGGATTAGA
GGCTGCATAAGCAATGTAACCTATTCCATCATAGTCAATGGAAAGCCCCAAGGGAGAATTAAGGCAAATAGGGGACTAAGACAAGGTGATCCTCTCTCCCCTTTC
CTTTTCGTTATAGCTATGGACTATCTTAGTAGACTCTTGAGCCATTTGGAGTCCACTGGTGCCATCAAAGGTGTGTGCCTTGGAAAGGATTGCAACATATCTCAT
ATCCTCTTCGCTGATGACATTCTTCTTTTTGTTGAAGATAATGCCTTTTATCTGAACAATCTTAGAATGGCTATTTCTTTGTTCGAAAAGGCCTCGGGGCTCAAA
ATCAATTTGTCTAAGTCAGCAATGGTTCCAGTTAATGTCCCTTGGCCTAGAGCTTTGGATTGTGCTTCTTCTTGGGATATTCCTTGCCAGTCGCTTCCGCTATCC
TACTTAGGAGTCCCTCTCGGTGGAAACCCAAAATCCAAACCTTTCTGGAGGAATATTGAGGACAGAATTCATAAAAAACTTAGCAACTGGAAATACGCGCACATC
TCAAAAGGTGGAAGACTCACGCTAATCAAGTCTACTCTCACTAGTATTCCGATCTATCAGCTTTCTGTTTTTCAAGCTCCTCTCTCCACGTATAAGAACATTGAA
AAACTTTGGAGAAGATTCCTTTGGAAAGGTAGCTGCAATCCAAAAGGGTCTCACCTAATCAAATGGTCAATAGTTACAAAGCCTAAAGAAGAGGGTGGGCTGGGC
ATTTCGAGACTCCAAATTACAAATCAAGCTCTATTACCGAAGTGGCTTTGGCGTTACCATTCGGAGCCTAATTCCCTTTGGAGGAAACTAATCCAGCTAAAATAT
CAAAGTAAACACCCTGGGGACTTACCTTCAAATATTTCCTCTAGTTCCTCTAAAGCCCCGTGGCGATCTATCATTAACAACAGTGAATGGTTCAAAAGAAATCAA
GGTTGGGATTTAAAAAATGGAGATCAAATTTCATTCTGGTTTTCTAACTGGTCTACAGAAGGCTGTCTATCTACTGCCTATCCCAGACTATTTGCTCTCTCTATC
GACAAAGAATCCTCAATCAAAGATGTGTGGAACTCAATTAACAATCAATGGGAAATTGCTTTTCGAAGAAATTTGAATGATAGAGAATTAAGTACTTGGCAGAGA
ATTTTAGGGAATCTTCCAGTTCCTAGAACAAACAGAGGTTCAAGCAAACCTACCTGGATTCCCGACAGCAAGAAATCTTTCTCTATCGCCTCTGCAAAACGCTGT
ATCTCCCACCAGCCGGAGCTTTCGGTAGCGTCTCCTCTATCTAAGCTGCTGGATCTTATTTGGAAATCTTTCATTCCTATGAAGATAAAATTCTTCATGTGGTGC
CTGATTCAAAGAAAGCTAAACACATCGGAAGTTATCCAGCAAAGAATGCCAAATTTGGCTCTTCAACCAAATTGGTGCGTCCTCTGCAAAAAAGATAGTGAATCG
GGAGCTCACCTGTTCCTTCAATGCGATATGGTGAAACCCCTGTGGTCTCTGCTCCAGCAAGCTCTCAACTTCGCCCATTTTTCCGATGATTTTGAAGCGTTGATC
TCCTTCTTCCTCTCCCTAAATCAGTCCCTCCCGAAGCACAAGATCGTTAATTGCGGTGTGATCGCTGTCCTTTGGTGCATCTGGTCAGAGAGAAATAATAGAACT
TTTGATAATTTAAGTTATCAAAAAACTATTATTAATTTATGGGAAGATTGCAAAATTCTCATAGGAAATTGGAGTAGTAGGGATCCTACTTTTAAAAATTATTCA
GCATCTACCATAGCTCTTAATCTTAATGCACTATCCAAACTCACAAAACCCTCATCAGAAAATTCCTTGGCATCTTTGCCAGCCCACGTTGTGGACTTTAGTGGT
GGCTGGTCGTCGATTCCCAATTCACATTCCTTCCGTACATTACGTAAATTAACGCACTTAGGGTGTACACTTGTTGTGAAGGAGAGTCATGTACTTGGTATACCA
CTTCCTTCCTTTGATCGAGTCATAGTCTATAGGAAAAGGAAATTGAATGGCAACCAACATTCAGACAAACATGATCAAAAGGGATATCATTTTTGTTTGATAAGA
GCGAATAAGCAACTAGAAAAAGACAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTACTTCAAATCACTTCCTAGATCCTGCAAAATAGAAAGAAAAGAATTTGTCCTTCTCCTTGACAAGTATGCAAAACACACTCATTATTGGCTTACCGAA
ACAGGAGCTCACAAAGCCTTTTCCATTGAAGTTTCCCCAAGAGATTTAGATTGGATAAGAAGCACTCTTAAATCACTGATTGAAACTCCAAGCTCGAATCGCTTC
TTCCTTGAAAATCGTGATTCCGAGCATTGCATCTGGATTAGGAAAACAAGAAATGGTAAAGGATGTATTGCAGAAATTTTCAGAGTGGATAACAAAAATAGAAAA
TCTTGCATCCTAGTTCCGGAAGGTCCGGAGAAAAGTGGGTGGGTTTCCTTTTTATCAATGATCACTCCAAAAGTAGAAACAAAAGCAAAGATAAGACCATCATTC
TTACCAAGAAGCAGCCCTGAAATCCACTCTTCATCTCCCATCGATTACCACAAACGTTCATATGCCAAAGCAGTCACTGAAGGAAGACCTTCCATTTCAAGCGAC
TCAAGTGAATCTTATGCATCAAGTGATTCAAGTGATTCAAGCCATTCTTCAGGTAACAGCCCACGCAACTTCCCATCCCCTGTCTCACTAGAAAATACAGTGGTG
TTAGTAAGACGTTTTTTTCATGATGACTGGTACAAAATCCTTCAAAATTTGAGGAAGCAAATCGAGGAATCTTTTACCTACAACGCTTTCCATGCTGAAAAGGCC
CTGGTGCACTTCAACTCAAATGTACCAGCGAATCTTCTCTGTCAAAACAGAGGGTGGACAACCGTTGGGAAGTACACGGTCAGGTTTGAAAAATGGAATCCTGCT
TCCCATGCCTCTCCAAAACTCATTCCTAGCTATGGGGGGTGGACAACCTTCAGGGGAATTCCTCTTAACTTATGCAACATGAAGACCTTTCAACAAATCGGGAAA
GCATGTGGAGGTTTGATTAAAGTAGCCGAGGAAACAAAGACAGCAAGCAACCTGATTGAAGCTAAATTAAAAATTAGATACAACTATTCAGGCTTTTTACCAGCT
TATGTGAAGATCTTCGATCAAGAAGGTAACAAATTTGTTGTTCAAACTATCACTCACTCAGAAGGAAAATGGCTAATGGAAAGGAATGTTAGATTGCACGGTACC
TTCAAGAGGCAAGCTGCGGCTTCTTTCGATGAATTCAATCCCGATTCAGAGCAATTCCTGTTCGATGGCATGGAGGCCATTTCACCGGATCTTCAGAACACCTTC
TCCGGCAGCCGTAAAAGCATATCACCGGAGCAGCCATCTACATTAAAATCAGTCATTATTAAACCTGCCAGAGATGCCACGTCGCCACCTTCTTTAAATGAAGAG
GTAGTTAATGATAACAGTTTGCATGCAACGGCTATTAAATCGAAGGGAAAGATTTCATATAGGATATCAAATGATTGCTCTTTGGATAAAGGGAAGCAGAAGGTT
GACATTCCATCCCAACTAACTCCTGCATTTATTTTGGATAAGCCAAAAAGAAAAGTATCCTTTAATTCCCCCGGTAATAAAACCAATTTTTTTAATCCGAACTCT
GCTCCAGCCAATCATTCCCCTTCTGTTAGCTCCCCGGAGAAAAAACAAAGAGTCAGCAAAGAGAGAAGTGTTAAGAAGAAATCAACAATTACTCCGCCTAAATCA
AGAGCCAATCAGGGCCAAGATGCTTCTAACACTCAACCTCTTAAAATTATAGCTCATGATATGAATGCTTCTAAAAAAGGTCTCTCTCTCACAGTGGACCTGGGA
AATCTACCAGTTTTAGATCCGAGTAAATCCTTCGAAGATCACCACAGCTCTGACAATGCAGAAGTCATAGACATCACGAACACAGAGATGGTTCCAGAGACACCT
GAATTGAAGATGACAGATCCAGAGAAACCAAAGTCCCCTCCGGAAGTCAACCATAGAAAGCAAAAACACTCTCATCGAAGAAGACACTACTATAGAAAAAAGGAA
GACACTGAGAAGGATTCAAATTCAGAAGTCTTCAAAAATCAACTGGTTGCTTGGCTAAAGGAAAATGGAGCCTTAATAAAAAATACTTTAATTTCCTATTCCCCC
GACTTTGTGATCCTCACTGAAACGAGGCTCAAATTCATAAATAAGAAAATTGTTAAGTCACTTTGGCCTTCAAACAGCATAAAGTGGATTGTGAAAAATGCTATA
GACAGTTCAGGAGGGATTCTGATTTTATGGGATGATCTTCATCATTCTCTTTTGAGTCAAGAGGAAGGGATGTTCAGCCTTACTGCAAACTTTCTGTCCTCTAAT
AATTCATGGTGGTTAACAGGCTTATATGGTCCGGTCCAAAGAAGGGAAAGATTAAATTTCTGGACTGATTTACATAATCTCCTTCACCTTAATTCTTTTCCTTGG
ATATTAGGGGGAGATCTGAATGCAATCAGAATGAGAGAGGAATCAACAGCAGTCACTTCCTCTACTCATAGTTCCAACATGCTTAACAACTTCATATCAATCAAT
TCCCTGATTGACCCTCCCCTGTCCAACAACAGATACACTTGGTCCAACTTGAGAATTCCTCCAACCTTTTCCCGTCTTGACAGATTCCTATACAATCCTAATTGG
GAAGTCCTCTTCAACCCTCATATCACTAGAACTCTTCCTCGACCTACCTCGGACCACTTTCCTTTAGTCTGTGAAGATTCATCCCCCACTGTGAGTTGGGGCCCT
GCTCCATTTAGATTAAACTCCATAGTCTTAAACGACCCTGAATTCAAAAGAAATATGGAAAGATGGTGGGAATTATCAGCCCAAGAGGGCCATCCCGGTTTTGCT
TTCATTCAAAGGCTCAAGTCCTTAGCTAATTTCATCAAACCTTGGCAAAAAGAGAAATTTCATTCCATCTCTTCTGCTAAAGAAAACATAATCAAGGAAGTGGAT
GCCATTGATAAGAAGGAACTGGACACCCTGTTGTGTCAGGAGGACACAAAAAAGATTTGGCTTAAAGAGGGAGACGAAAATTTTGCCTTCTTTCACCGAATTTGT
TCTTCCAGACAGAAGAGAAATATAATTCACGAAATTCAGGACGAAGATGGCTCCAATCAGAATACAAACATTAGCATTTCCCTCGCCTTTGTCAATTATTTTACA
AAGCTCTACAGGAGTTCAACCAAAACCAATCCCCTCTTCATTGATAACCTCAAATGGAAGCCAATTGATTACTCGGAGTGGTCCCCTCTTTGTGCCCCCTTCTTG
GAGGAAGAAATAAAAGGGGTCATCAACTCCTTTGAAGGAAATAAGGCCCCTGGTCCAGACGGATTCCCAATTTCCTTCTTTAAGTCCTATTGGAAACTTCTAAAA
GAGGACATTCTAGCCATTTTCAAGGATTTCTACGAGAAGGGAGTGATCAATAAGAATATGAACAACACTTTCATAGCGTTAATTGCAAAGAAGAAGAATTATTCT
CATCCAAAGGACTTCAGACCTATCAGCCTAACAACATCTATCTACAAGATTATTGCTAAGACTCTTTCCAACAAGTTAAAGCTCACCCTTCCTGATACTATCTCA
GGTAATCAACTGGCTTTTATCAAAAATCGTCAAATCACAGATGCCATCTTAATGGCAAATGAAGCTTTAGACTATTGGAAAGTGAAAAAAATCAAAGGATTTATC
TTGAAGCTGGATATTGAAAAGGCTTTTGATAATCTGAGCTGGGATTTCATTGATTTCGTTCTTAAGAAAAAGAATTACCCTCCTTCCTGGAGGCAGTGGATTAGA
GGCTGCATAAGCAATGTAACCTATTCCATCATAGTCAATGGAAAGCCCCAAGGGAGAATTAAGGCAAATAGGGGACTAAGACAAGGTGATCCTCTCTCCCCTTTC
CTTTTCGTTATAGCTATGGACTATCTTAGTAGACTCTTGAGCCATTTGGAGTCCACTGGTGCCATCAAAGGTGTGTGCCTTGGAAAGGATTGCAACATATCTCAT
ATCCTCTTCGCTGATGACATTCTTCTTTTTGTTGAAGATAATGCCTTTTATCTGAACAATCTTAGAATGGCTATTTCTTTGTTCGAAAAGGCCTCGGGGCTCAAA
ATCAATTTGTCTAAGTCAGCAATGGTTCCAGTTAATGTCCCTTGGCCTAGAGCTTTGGATTGTGCTTCTTCTTGGGATATTCCTTGCCAGTCGCTTCCGCTATCC
TACTTAGGAGTCCCTCTCGGTGGAAACCCAAAATCCAAACCTTTCTGGAGGAATATTGAGGACAGAATTCATAAAAAACTTAGCAACTGGAAATACGCGCACATC
TCAAAAGGTGGAAGACTCACGCTAATCAAGTCTACTCTCACTAGTATTCCGATCTATCAGCTTTCTGTTTTTCAAGCTCCTCTCTCCACGTATAAGAACATTGAA
AAACTTTGGAGAAGATTCCTTTGGAAAGGTAGCTGCAATCCAAAAGGGTCTCACCTAATCAAATGGTCAATAGTTACAAAGCCTAAAGAAGAGGGTGGGCTGGGC
ATTTCGAGACTCCAAATTACAAATCAAGCTCTATTACCGAAGTGGCTTTGGCGTTACCATTCGGAGCCTAATTCCCTTTGGAGGAAACTAATCCAGCTAAAATAT
CAAAGTAAACACCCTGGGGACTTACCTTCAAATATTTCCTCTAGTTCCTCTAAAGCCCCGTGGCGATCTATCATTAACAACAGTGAATGGTTCAAAAGAAATCAA
GGTTGGGATTTAAAAAATGGAGATCAAATTTCATTCTGGTTTTCTAACTGGTCTACAGAAGGCTGTCTATCTACTGCCTATCCCAGACTATTTGCTCTCTCTATC
GACAAAGAATCCTCAATCAAAGATGTGTGGAACTCAATTAACAATCAATGGGAAATTGCTTTTCGAAGAAATTTGAATGATAGAGAATTAAGTACTTGGCAGAGA
ATTTTAGGGAATCTTCCAGTTCCTAGAACAAACAGAGGTTCAAGCAAACCTACCTGGATTCCCGACAGCAAGAAATCTTTCTCTATCGCCTCTGCAAAACGCTGT
ATCTCCCACCAGCCGGAGCTTTCGGTAGCGTCTCCTCTATCTAAGCTGCTGGATCTTATTTGGAAATCTTTCATTCCTATGAAGATAAAATTCTTCATGTGGTGC
CTGATTCAAAGAAAGCTAAACACATCGGAAGTTATCCAGCAAAGAATGCCAAATTTGGCTCTTCAACCAAATTGGTGCGTCCTCTGCAAAAAAGATAGTGAATCG
GGAGCTCACCTGTTCCTTCAATGCGATATGGTGAAACCCCTGTGGTCTCTGCTCCAGCAAGCTCTCAACTTCGCCCATTTTTCCGATGATTTTGAAGCGTTGATC
TCCTTCTTCCTCTCCCTAAATCAGTCCCTCCCGAAGCACAAGATCGTTAATTGCGGTGTGATCGCTGTCCTTTGGTGCATCTGGTCAGAGAGAAATAATAGAACT
TTTGATAATTTAAGTTATCAAAAAACTATTATTAATTTATGGGAAGATTGCAAAATTCTCATAGGAAATTGGAGTAGTAGGGATCCTACTTTTAAAAATTATTCA
GCATCTACCATAGCTCTTAATCTTAATGCACTATCCAAACTCACAAAACCCTCATCAGAAAATTCCTTGGCATCTTTGCCAGCCCACGTTGTGGACTTTAGTGGT
GGCTGGTCGTCGATTCCCAATTCACATTCCTTCCGTACATTACGTAAATTAACGCACTTAGGGTGTACACTTGTTGTGAAGGAGAGTCATGTACTTGGTATACCA
CTTCCTTCCTTTGATCGAGTCATAGTCTATAGGAAAAGGAAATTGAATGGCAACCAACATTCAGACAAACATGATCAAAAGGGATATCATTTTTGTTTGATAAGA
GCGAATAAGCAACTAGAAAAAGACAAATGA
Protein sequenceShow/hide protein sequence
MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCIAEIFRVDNKNRK
SCILVPEGPEKSGWVSFLSMITPKVETKAKIRPSFLPRSSPEIHSSSPIDYHKRSYAKAVTEGRPSISSDSSESYASSDSSDSSHSSGNSPRNFPSPVSLENTVV
LVRRFFHDDWYKILQNLRKQIEESFTYNAFHAEKALVHFNSNVPANLLCQNRGWTTVGKYTVRFEKWNPASHASPKLIPSYGGWTTFRGIPLNLCNMKTFQQIGK
ACGGLIKVAEETKTASNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQTITHSEGKWLMERNVRLHGTFKRQAAASFDEFNPDSEQFLFDGMEAISPDLQNTF
SGSRKSISPEQPSTLKSVIIKPARDATSPPSLNEEVVNDNSLHATAIKSKGKISYRISNDCSLDKGKQKVDIPSQLTPAFILDKPKRKVSFNSPGNKTNFFNPNS
APANHSPSVSSPEKKQRVSKERSVKKKSTITPPKSRANQGQDASNTQPLKIIAHDMNASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAEVIDITNTEMVPETP
ELKMTDPEKPKSPPEVNHRKQKHSHRRRHYYRKKEDTEKDSNSEVFKNQLVAWLKENGALIKNTLISYSPDFVILTETRLKFINKKIVKSLWPSNSIKWIVKNAI
DSSGGILILWDDLHHSLLSQEEGMFSLTANFLSSNNSWWLTGLYGPVQRRERLNFWTDLHNLLHLNSFPWILGGDLNAIRMREESTAVTSSTHSSNMLNNFISIN
SLIDPPLSNNRYTWSNLRIPPTFSRLDRFLYNPNWEVLFNPHITRTLPRPTSDHFPLVCEDSSPTVSWGPAPFRLNSIVLNDPEFKRNMERWWELSAQEGHPGFA
FIQRLKSLANFIKPWQKEKFHSISSAKENIIKEVDAIDKKELDTLLCQEDTKKIWLKEGDENFAFFHRICSSRQKRNIIHEIQDEDGSNQNTNISISLAFVNYFT
KLYRSSTKTNPLFIDNLKWKPIDYSEWSPLCAPFLEEEIKGVINSFEGNKAPGPDGFPISFFKSYWKLLKEDILAIFKDFYEKGVINKNMNNTFIALIAKKKNYS
HPKDFRPISLTTSIYKIIAKTLSNKLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLSWDFIDFVLKKKNYPPSWRQWIR
GCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESTGAIKGVCLGKDCNISHILFADDILLFVEDNAFYLNNLRMAISLFEKASGLK
INLSKSAMVPVNVPWPRALDCASSWDIPCQSLPLSYLGVPLGGNPKSKPFWRNIEDRIHKKLSNWKYAHISKGGRLTLIKSTLTSIPIYQLSVFQAPLSTYKNIE
KLWRRFLWKGSCNPKGSHLIKWSIVTKPKEEGGLGISRLQITNQALLPKWLWRYHSEPNSLWRKLIQLKYQSKHPGDLPSNISSSSSKAPWRSIINNSEWFKRNQ
GWDLKNGDQISFWFSNWSTEGCLSTAYPRLFALSIDKESSIKDVWNSINNQWEIAFRRNLNDRELSTWQRILGNLPVPRTNRGSSKPTWIPDSKKSFSIASAKRC
ISHQPELSVASPLSKLLDLIWKSFIPMKIKFFMWCLIQRKLNTSEVIQQRMPNLALQPNWCVLCKKDSESGAHLFLQCDMVKPLWSLLQQALNFAHFSDDFEALI
SFFLSLNQSLPKHKIVNCGVIAVLWCIWSERNNRTFDNLSYQKTIINLWEDCKILIGNWSSRDPTFKNYSASTIALNLNALSKLTKPSSENSLASLPAHVVDFSG
GWSSIPNSHSFRTLRKLTHLGCTLVVKESHVLGIPLPSFDRVIVYRKRKLNGNQHSDKHDQKGYHFCLIRANKQLEKDK