; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0014496 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0014496
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr12:24134624..24140530
RNA-Seq ExpressionPay0014496
SyntenyPay0014496
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]0.0e+0072.94Show/hide
Query:  MITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLLENTLVIVRRFFHDDWHKILQNL
        MITPKVEVK KTRPTFLPR+SP+ +LSPPIDYHKRSYAK VTEGRPF TSDSSDSY SSDSSHSSGNSFCDSPS DLLENT+V+VRRFFHDDW KILQNL
Subjt:  MITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLLENTLVIVRRFFHDDWHKILQNL

Query:  RKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEET
        RKQT+ESFTYNAFHAEKALVHF+SNIP NLLCQNKGW+TVGKYSVRFEKWSPAYHATPKLIPSYGGWTTF+                             
Subjt:  RKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEET

Query:  RSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISPDFLSTSSDDR
        R++  L+E                                                         +DDF+   E       F+GSEAISPDFLSTSS  R
Subjt:  RSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISPDFLSTSSDDR

Query:  KSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPN
        KSSTPDQPSALKSVIIKPD+ AT P++LNEE+VNDSNLHA ANKS+LEILSGI NDGVLDKGKQKVDIQL PNSALNL+K KRKVSFNSPSNKTNIFNP+
Subjt:  KSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPN

Query:  SAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHSSDNAEVIDITNTE
        SAPANHS SL+SPEKKQKVSRERSIKKKSS  QP     QNKGV ITQPIQ+VAHD +A+KKGLSL V+LGDLP LDP+KS EDHHSS NAEVIDITNTE
Subjt:  SAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHSSDNAEVIDITNTE

Query:  VVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLALIKNTIIS
        VV ETPEMKMPVNENSNSSSEANYRKPKHVH+R+YYYRKK  K        E        K +L++W             ++    S +  ALIKN IIS
Subjt:  VVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLALIKNTIIS

Query:  YSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP------------LHN
        YSPDFVILTET LKITNKRIIKS WPSNSINWI KNASGSSGGILILWDAQ+HSLLSQEE +FSLSANF LNNNSSWWLTGLYGP            LHN
Subjt:  YSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP------------LHN

Query:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP
        LQHLNSFPW L  DLNVIRMREE+TS+LSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNP TFSRIDRFLYNSSW+NLFSPHTTRTLPR TSDHFP
Subjt:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP

Query:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES
        LVCEDSNPKL WGP PFRLNSI L+DPEFKRNM RWWEN +Q GHP FSFIQRLKSLAN IKPWQKEKLHSL YAKE IIREVDSIDKKELDTPL+Q+ES
Subjt:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES

Query:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP
        NRRLALKA+LS+LSLKESQF                       C                                    IY+SSTKSDPLFIENLDWNP
Subjt:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP

Query:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPKDFRPISLTTSIYK
        I  SEW HLCAPFLE EIKGVINSFDGKK P PDGFPISFFKS+W+LLKEDIMDIFKDF++                  K DYSHPKDFRPISLTTSIYK
Subjt:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPKDFRPISLTTSIYK

Query:  IIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVI
        IIAKTLSNRLKTTLP TISGNQLAF+KNRQITDAILMANEAVDYWKVKKIKGFILKLDIEK F NLNWDFID VL KKNFPN WRKWIRGCISNVTYSVI
Subjt:  IIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVI

Query:  INGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKS
        INGRPQGRIKANRGLRQGDPLSPFLFVIAMDY SRLLSHLE+SGAIKGVSLN NCNISHILFADDILLF+EDNDCFL NL MALSLFE+ASGLKINLLKS
Subjt:  INGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKS

Query:  ALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEK
        ALVPVNVSLNRAKECASFWGISCHSL LSYLGVPLGG+                                                              
Subjt:  ALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEK

Query:  LWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKS
                  ++ SHLINWTKV KSKEEGGLGISRL VTNKALLSKWLWRY SEPNALWRRLIQCKYKGK PGDIPSN SSS+SKA WRSIID+ DWFKS
Subjt:  LWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKS

Query:  NQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFS
        NQSWDLNNGDQISFWYSNWSQEG LSTAYPRLFALTLDKEISVKDAWNT DNQW I FRRELNDRERCNWEKILEILPTPR NRGSSKPTWIPD N SFS
Subjt:  NQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFS

Query:  IASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLN
        IAS K+LIS QLDQT GD R KLLEIIWKS+IPMKIKFFMWCLIQRRI+TMEVIQQ+M NTLLQPNWCVLCNKD+E+GNHLFLRC+AVKPLWS L  SLN
Subjt:  IASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLN

Query:  FALGSDEFEDLFSFFHSLNCSFPKHK
        FAL +D+FE LFSFF SL CS PKHK
Subjt:  FALGSDEFEDLFSFFHSLNCSFPKHK

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0093.47Show/hide
Query:  YFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQK
        +FKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQK
Subjt:  YFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQK

Query:  NRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLL
        NRKSCILV EGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDC+LSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSS NSFCDSPS DLL
Subjt:  NRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLL

Query:  ENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLW
        ENT+VIVRRFFHDDWHKILQNLRKQT+ESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSP YHATPKLIPSYGGWTTFRGIPLHLW
Subjt:  ENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLW

Query:  NMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPE-EWLIERSVRLHGTFKRQAAASFDDFNPESEQFF
        NMMTFQQIGKAC GLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKF VQVVTHPE +WLIER+VRLHGTFKRQAAASFDDFNPESEQFF
Subjt:  NMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPE-EWLIERSVRLHGTFKRQAAASFDDFNPESEQFF

Query:  FEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKS
        FEGSEAISPDFLSTSSD RKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHA ANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKS
Subjt:  FEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKS

Query:  KRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKS
        KRKVSFNSPSNKTNIFNP+SAPANHSPSLNSPEKKQKVSRERSIKKKSS TQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKS
Subjt:  KRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKS

Query:  LEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTN
        LEDHH+SDNAEV+DITNTEVV ETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLK+NGLKLSTDTDSSGATTSTN
Subjt:  LEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTN

Query:  VLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP----
        VL    N              + LKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP    
Subjt:  VLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP----

Query:  --------LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTR
                LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSH+SRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSW+NLFSPHTTR
Subjt:  --------LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTR

Query:  TLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKK
        TLPRSTSDHFPLVCEDSNPKLSWGP+PFRLNSITLSDPEFKRNMGRWWEN IQAG+P FSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKK
Subjt:  TLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKK

Query:  ELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSD
        ELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDE+GSIQNTNNSIS AFIKFFSRIYRSSTKSD
Subjt:  ELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSD

Query:  PLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDKCDYSHPKDFRPISLTTSIYKIIAKTLS
        PLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHW                                            
Subjt:  PLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDKCDYSHPKDFRPISLTTSIYKIIAKTLS

Query:  NRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQG
          LKTTLP+TISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLN DFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQG
Subjt:  NRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQG

Query:  RIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNV
        RIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNV
Subjt:  RIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNV

Query:  SLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLW
        SL RAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLW
Subjt:  SLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLW

Query:  KGNN--EWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKSNQSWD
        KGNN  E SHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKA WRSIIDSTDWFKSNQSWD
Subjt:  KGNN--EWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKSNQSWD

Query:  LNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTK
        LNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIAS K
Subjt:  LNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTK

Query:  VLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPN
        VLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPN
Subjt:  VLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPN

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0073.72Show/hide
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSPRDLDWIR TLKSLI TP++NRFFLE RD E  IWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFD
         KNRKSCILV EG +KS WVSFLSMITPKVEVKAKTRP FLPR+SP+ +LSPPIDYHKRSYAKAV+EGR   +SDSSDSY SSDSS SSGNS CDSP   
Subjt:  QKNRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFD

Query:  LLENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLH
        LLENT+V+VRRFFHDDW KILQNLRKQT+ESFTYNAFHAEK LVHF+SN+PANLLCQNKGW+TVGKY+VRFEKW+PA HA+PKLIPSYGGWTTFRGIPLH
Subjt:  LLENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPE-EWLIERSVRLHGTFKRQAAASFDDFNPESEQ
        LWNMMTFQQIGKACGGLIKVAEET++A+NLIEA++K+RYNYSGFLPA V+IFD EGNKF VQVVTH E +WL+ER+VRLHGTFKRQAAASFDDFNP+SEQ
Subjt:  LWNMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPE-EWLIERSVRLHGTFKRQAAASFDDFNPESEQ

Query:  FFFEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLD
        F F+G EAISPD L+T S  RKS +P+QPSALKSVIIKP + AT P+ LNEE+VND++LHA ANKSKL+ILSGISNDG LDKGKQKVDI  Q  SA    
Subjt:  FFFEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPN
        K KRKVSFNSPSNKT  FNP+SAPANH     SPEKK++VSRERS+KKKSS  QP  +ANQ KG  ITQP+Q+VAHD DA+KKGLSLTVDLG+LP LDP+
Subjt:  KSKRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPN

Query:  KSLEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS
        KS EDHHSSDNAEVIDITNTEVV ETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +SEAFK QLV+WLKENGLKLS DTDSSGATTS
Subjt:  KSLEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS

Query:  TNVLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP--
        TN L                                                S+GGILILWDAQ+HSLLSQEEG FSLSANF   NN SWWLTGLYGP  
Subjt:  TNVLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP--

Query:  ----------LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHT
                  LHNL HLNS PWI+GGDLNV+RMREEST+V  SSHSS MLN+FISNNLLIDPPLTNNR+TWSNLRNPPTFSR+DRFLYNS W+ LF+PH 
Subjt:  ----------LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHT

Query:  TRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSID
        TRTLPR TSDHFPLVCEDS   L WGP PFRLNSI L+DPEFKRNM RWWE  +Q GHP F FIQRLKSLAN IKPWQKEK  SLT AKE IIREVDSID
Subjt:  TRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSID

Query:  KKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTK
        K ELDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR+ IHEIQDE+GSIQNTNN+IS+AF+  FSRIYR STK
Subjt:  KKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTK

Query:  SDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPK
         DPLFIENL+WNPI  S+WS LCAPF E EIKGVI SFDG K PGPDGFPISFFKS+W+LLKEDI+DIFKDF++                  K DYSHPK
Subjt:  SDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPK

Query:  DFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKW
        DFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQITDAILMANEA+DYWKVKKIKGFILKLDIEKAFDNLNW+FID VL+K N+PN WRKW
Subjt:  DFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKW

Query:  IRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLF
        IRGCISNVTYS+I+NG+PQGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHLES+GAIKG                                       
Subjt:  IRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLF

Query:  ERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
                                        GI CH+LPL+YLGVPLGGNPKSNLFWRN+ED+IQKKL+NWKYA ISKGGRLTLIKSTLSSLPIY+LSV
Subjt:  ERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV

Query:  FQAPSLTCKNIEKLWRKFLWKGN--NEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSK
        FQAPS T KNIEKLWR FLWKG+   + SHLINW+ V+K KEEGGLGISRL VTN+ALLSKWLWRY SEPN+LWRRLI  KYKGK PGD+PSNISSS+SK
Subjt:  FQAPSLTCKNIEKLWRKFLWKGN--NEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSK

Query:  ALWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRG
        A WRSII++ DWFKSNQ WDLNNGDQISFWYSNWS EG LSTAYPRLFAL++DKE S+KD WN+ +NQW I FRR+LNDRE   W+KILE LP  R+NRG
Subjt:  ALWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRG

Query:  SSKPTWIPDSNNSFSIASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRC
         SKPTWIPDS   FSIAS K  IS Q D++  +PR KLL +IWK+ +PMKIKFFMWCL+QR++NTMEV       TLLQPNWCVLC K SETG HLFL C
Subjt:  SSKPTWIPDSNNSFSIASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRC

Query:  EAVKPLWSFLQHSLNFALGSDEFEDLFSFFHSLNCSFPKHKVIFCGIIALFWEIWCERNFRIFGTSSSHKTIANMWEDCKILIGNWCSRDPHFKNYSAAT
        + VKPLWS L  SLNFA  SD+FE +FSFF SLN S PKHKV+FCG+IA+ W IW ERN RIF T S  K+IAN+WEDCKILIGNW SRDP FKNYSAAT
Subjt:  EAVKPLWSFLQHSLNFALGSDEFEDLFSFFHSLNCSFPKHKVIFCGIIALFWEIWCERNFRIFGTSSSHKTIANMWEDCKILIGNWCSRDPHFKNYSAAT

Query:  IALNLSTFCN
        IALNL+ FCN
Subjt:  IALNLSTFCN

TYK05808.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0067.65Show/hide
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSPRDLDWIR TLKSLI TP++NRFFLE RD E  IWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFD
         KNRKSCILV EGP+KSG VSFLSMITPKVEVKAKTRPTFLPR+SP+ +LSPPIDYHKRSY KAV++GR   +SDSSDSY SSDSS SSGNS CDSP   
Subjt:  QKNRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFD

Query:  LLENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLH
        LLENT+V+                                 AL+HF+SN+PANLLCQNKGW+TV KY VR                              
Subjt:  LLENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQF
                                                                                                          + 
Subjt:  LWNMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQF

Query:  FFEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDK
         F+G EAISPD L+T S  RKS++ +QPSALKSVIIKP R+AT P+ LNEE+VND++LHA   KS+L+ILSGISNDG LDKGKQKVDI  Q  SA   DK
Subjt:  FFEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDK

Query:  SKRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNK
         KRKVSFNSPSNKT  FN +SAP NHSP L+SPEKKQ+VSRERS+KKKSS  QP S+ANQ KG  ITQP+Q+VAHD DA+KKGLSLTVDLG+LP LDP+K
Subjt:  SKRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNK

Query:  SLEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTST
        S EDHHSSDNAEVIDITNTEVV ETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +SEAFK QLV+WLKENGLKLSTDTDSSGATTST
Subjt:  SLEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTST

Query:  NVLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPLHN
        N L             F  L                   +SI+WI KNA  SSGGILILWDAQ+HSLL                                
Subjt:  NVLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPLHN

Query:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP
                    GDLNV+RMREEST+V SSSHSS MLNNFISNNLLIDPPLTNNR+TWSNLRNPPTFSR+DRFLYNS W+ LF+PH TRTL R TSDHFP
Subjt:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP

Query:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES
        LVCEDS   L WGP PFRLNSI L+DP+FKRNM RWWE  +Q GHP FSFI+RLKSLAN IKPWQKEK HSLT AKE IIREVDSIDK ELDTPL+QEES
Subjt:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES

Query:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP
        NRRLALKA+LS+LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR+ IHEIQDE+GSIQNTNN+IS+AF+  FS IYR STK DPLFIENL+WNP
Subjt:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP

Query:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDKCDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTI
        I  S+WS LCAPFLE EIKGVI SFDG K PGPDGFPISFFKS+W+LLKEDI+DIFKDF++K                    IIAKTLSNRLK TLPDTI
Subjt:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDKCDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTI

Query:  SGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQG
        SGNQLAF+KNRQITDAIL ANEA+DYWKVKKIK FILKLDIEKAFDNLNWDFID VL+KKN+PN WRKWIRGCISNVTYS+I+N +PQ RIKANRGLRQG
Subjt:  SGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQG

Query:  DPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASF
        DPLSPFLFV AMDYLSRLLSHLESSGAIKGV L  +CNISHILFADDILLF+EDND FL NLRMALSLFE+ASGLKINL KSA+VPVNVS +RA ECAS 
Subjt:  DPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASF

Query:  WGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGN--NEWSHL
        WGISCH+LPL+YLGVPLGGNPKSN+FWRN+ED+IQKKLNNWKYA ISKGGRLTLIKSTLSSL IYQLSVFQAP  T KNIEKLWR FLWKG+   + SHL
Subjt:  WGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGN--NEWSHL

Query:  INWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKSNQSWDLNNGDQISFWY
        INW+ V+K KEEGGLGISRL V N+ALLSKWLWRY SEPN+LWRRLI  KYKGK PGDIPSNISSS+SKA W+SII++ DWFKSNQ WDLNN DQISFWY
Subjt:  INWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKSNQSWDLNNGDQISFWY

Query:  SNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTKVLISRQLDQTP
        SNWS EG LSTAYPRLFAL++DK+ S+KD WN+ +NQW I FRR+LNDRE   W+ ILE L  PR+NRG SKPTWIPDS   FSIAS K  IS Q D++ 
Subjt:  SNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTKVLISRQLDQTP

Query:  GDPRAKLLEIIWKSSIPMKIKFFMWCLI
         +PR KLL++IWK+ +PMKIKFFMWCL+
Subjt:  GDPRAKLLEIIWKSSIPMKIKFFMWCLI

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa]0.0e+0072.94Show/hide
Query:  MITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLLENTLVIVRRFFHDDWHKILQNL
        MITPKVEVK KTRPTFLPR+SP+ +LSPPIDYHKRSYAKAVTEGRPF TSDSSDSY SSDSSHSSGNSFCDSPS DLLENT+V+VRRFFHDDW KILQNL
Subjt:  MITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLLENTLVIVRRFFHDDWHKILQNL

Query:  RKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEET
        RKQT+ESFTYNAFHAEKALVHF+SNIP NLLCQNKGW+TVGKYSVRFEKWSPAYHATPKLIPSYGGWTTF+                             
Subjt:  RKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEET

Query:  RSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISPDFLSTSSDDR
        R++  L+E                                                         +DDF+   E       F+GSEAISPDFLSTSS  R
Subjt:  RSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISPDFLSTSSDDR

Query:  KSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPN
        KSSTPDQPSALKSVIIKPD+ AT P++LNEE+VNDSNLHA ANKS+LEILSGI NDGVLDKGKQKVDIQL PNSALNL+K KRKVSFNSPSNKTNIFNP+
Subjt:  KSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPN

Query:  SAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHSSDNAEVIDITNTE
        SAPANHS SL+SPEKKQKVSRERSIKKKSS  QP     QNKGV ITQPIQ+VAHD +A+KKGLSL V+LGDLP LDP+KS EDHHSS NAEVIDITNTE
Subjt:  SAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHSSDNAEVIDITNTE

Query:  VVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLALIKNTIIS
        VV ETPEMKMPVNENSNSSSEANYRKPKHVH+R+YYYRKK  K        E        K +L++W             ++    S +  ALIKN IIS
Subjt:  VVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLALIKNTIIS

Query:  YSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP------------LHN
        YSPDFVILTET LKITNKRIIKS WPSNSINWI KNASGSSGGILILWDAQ+HSLLSQEE +FSLSANF LNNNSSWWLTGLYGP            LHN
Subjt:  YSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP------------LHN

Query:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP
        LQHLNSFPW L  DLNVIRMREE+TS+LSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNP TFSRIDRFLYNSSW+NLFSPHTTRTLPR TSDHFP
Subjt:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP

Query:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES
        LVCEDSNPKL WGP PFRLNSI L+DPEFKRNM RWWEN +Q GH  FSFIQRLKSLAN IKPWQKEKLHSL YAKE IIREVDSIDKKELDTPL+Q+ES
Subjt:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES

Query:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP
        NRRLALKA+LS+LSLKESQF                       C                                    IY+SSTKSDPLFIENLDWNP
Subjt:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP

Query:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPKDFRPISLTTSIYK
        I  SEW HLCAPFLE EIKGVINSFDGKK P PDGFPISFFKS+W+LLKEDIMDIFKDF++                  K DYSHPKDFRPISLTTSIYK
Subjt:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPKDFRPISLTTSIYK

Query:  IIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVI
        IIAKTLSNRLKTTLP TISGNQLAF+KNRQITDAILMANEAVDYWKVKKIKGFILKLDIEK F NLNWDFID VL KKNFPN WRKWIRGCISNVTYSVI
Subjt:  IIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVI

Query:  INGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKS
        INGRPQGRIKANRGLRQGDPLSPFLFVIAMDY SRLLSHLE+SGAIKGVSLN NCNISHILFADDILLF+EDNDCFL NL MALSLFE+ASGLKINLLKS
Subjt:  INGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKS

Query:  ALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEK
        ALVPVNVSLNRAKECASFWGISCHSL LSYLGVPLGG+                                                              
Subjt:  ALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEK

Query:  LWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKS
                  ++ SHLINWTKV KSKEEGGLGISRL VTNKALLSKWLWRY SEPNALWRRLIQCKYKGK PGDIPSN SSS+SKA WRSIID+ DWFKS
Subjt:  LWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKS

Query:  NQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFS
        NQSWDLNNGDQISFWYSNWSQEG LSTAYPRLFALTLDKEISVKDAWNT DNQW I FRRELNDRERCNWEKILEILPTPR NRGSSKPTWIPD N SFS
Subjt:  NQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFS

Query:  IASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLN
        IAS K+LIS QLDQT GD R KLLEIIWKS+IPMKIKFFMWCLIQRRI+TMEVIQQ+M NTLLQPNWCVLCNKD+E+GNHLFLRC+AVKPLWS L  SLN
Subjt:  IASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLN

Query:  FALGSDEFEDLFSFFHSLNCSFPKHK
        FAL +D+FE LFSFF SL CS PKHK
Subjt:  FALGSDEFEDLFSFFHSLNCSFPKHK

TrEMBL top hitse value%identityAlignment
A0A5A7UV84 Reverse transcriptase domain-containing protein0.0e+0072.94Show/hide
Query:  MITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLLENTLVIVRRFFHDDWHKILQNL
        MITPKVEVK KTRPTFLPR+SP+ +LSPPIDYHKRSYAK VTEGRPF TSDSSDSY SSDSSHSSGNSFCDSPS DLLENT+V+VRRFFHDDW KILQNL
Subjt:  MITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLLENTLVIVRRFFHDDWHKILQNL

Query:  RKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEET
        RKQT+ESFTYNAFHAEKALVHF+SNIP NLLCQNKGW+TVGKYSVRFEKWSPAYHATPKLIPSYGGWTTF+                             
Subjt:  RKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEET

Query:  RSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISPDFLSTSSDDR
        R++  L+E                                                         +DDF+   E       F+GSEAISPDFLSTSS  R
Subjt:  RSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISPDFLSTSSDDR

Query:  KSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPN
        KSSTPDQPSALKSVIIKPD+ AT P++LNEE+VNDSNLHA ANKS+LEILSGI NDGVLDKGKQKVDIQL PNSALNL+K KRKVSFNSPSNKTNIFNP+
Subjt:  KSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPN

Query:  SAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHSSDNAEVIDITNTE
        SAPANHS SL+SPEKKQKVSRERSIKKKSS  QP     QNKGV ITQPIQ+VAHD +A+KKGLSL V+LGDLP LDP+KS EDHHSS NAEVIDITNTE
Subjt:  SAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHSSDNAEVIDITNTE

Query:  VVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLALIKNTIIS
        VV ETPEMKMPVNENSNSSSEANYRKPKHVH+R+YYYRKK  K        E        K +L++W             ++    S +  ALIKN IIS
Subjt:  VVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLALIKNTIIS

Query:  YSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP------------LHN
        YSPDFVILTET LKITNKRIIKS WPSNSINWI KNASGSSGGILILWDAQ+HSLLSQEE +FSLSANF LNNNSSWWLTGLYGP            LHN
Subjt:  YSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP------------LHN

Query:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP
        LQHLNSFPW L  DLNVIRMREE+TS+LSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNP TFSRIDRFLYNSSW+NLFSPHTTRTLPR TSDHFP
Subjt:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP

Query:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES
        LVCEDSNPKL WGP PFRLNSI L+DPEFKRNM RWWEN +Q GHP FSFIQRLKSLAN IKPWQKEKLHSL YAKE IIREVDSIDKKELDTPL+Q+ES
Subjt:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES

Query:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP
        NRRLALKA+LS+LSLKESQF                       C                                    IY+SSTKSDPLFIENLDWNP
Subjt:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP

Query:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPKDFRPISLTTSIYK
        I  SEW HLCAPFLE EIKGVINSFDGKK P PDGFPISFFKS+W+LLKEDIMDIFKDF++                  K DYSHPKDFRPISLTTSIYK
Subjt:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPKDFRPISLTTSIYK

Query:  IIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVI
        IIAKTLSNRLKTTLP TISGNQLAF+KNRQITDAILMANEAVDYWKVKKIKGFILKLDIEK F NLNWDFID VL KKNFPN WRKWIRGCISNVTYSVI
Subjt:  IIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVI

Query:  INGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKS
        INGRPQGRIKANRGLRQGDPLSPFLFVIAMDY SRLLSHLE+SGAIKGVSLN NCNISHILFADDILLF+EDNDCFL NL MALSLFE+ASGLKINLLKS
Subjt:  INGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKS

Query:  ALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEK
        ALVPVNVSLNRAKECASFWGISCHSL LSYLGVPLGG+                                                              
Subjt:  ALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEK

Query:  LWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKS
                  ++ SHLINWTKV KSKEEGGLGISRL VTNKALLSKWLWRY SEPNALWRRLIQCKYKGK PGDIPSN SSS+SKA WRSIID+ DWFKS
Subjt:  LWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKS

Query:  NQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFS
        NQSWDLNNGDQISFWYSNWSQEG LSTAYPRLFALTLDKEISVKDAWNT DNQW I FRRELNDRERCNWEKILEILPTPR NRGSSKPTWIPD N SFS
Subjt:  NQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFS

Query:  IASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLN
        IAS K+LIS QLDQT GD R KLLEIIWKS+IPMKIKFFMWCLIQRRI+TMEVIQQ+M NTLLQPNWCVLCNKD+E+GNHLFLRC+AVKPLWS L  SLN
Subjt:  IASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLN

Query:  FALGSDEFEDLFSFFHSLNCSFPKHK
        FAL +D+FE LFSFF SL CS PKHK
Subjt:  FALGSDEFEDLFSFFHSLNCSFPKHK

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein0.0e+0073.72Show/hide
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSPRDLDWIR TLKSLI TP++NRFFLE RD E  IWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFD
         KNRKSCILV EG +KS WVSFLSMITPKVEVKAKTRP FLPR+SP+ +LSPPIDYHKRSYAKAV+EGR   +SDSSDSY SSDSS SSGNS CDSP   
Subjt:  QKNRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFD

Query:  LLENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLH
        LLENT+V+VRRFFHDDW KILQNLRKQT+ESFTYNAFHAEK LVHF+SN+PANLLCQNKGW+TVGKY+VRFEKW+PA HA+PKLIPSYGGWTTFRGIPLH
Subjt:  LLENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPE-EWLIERSVRLHGTFKRQAAASFDDFNPESEQ
        LWNMMTFQQIGKACGGLIKVAEET++A+NLIEA++K+RYNYSGFLPA V+IFD EGNKF VQVVTH E +WL+ER+VRLHGTFKRQAAASFDDFNP+SEQ
Subjt:  LWNMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPE-EWLIERSVRLHGTFKRQAAASFDDFNPESEQ

Query:  FFFEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLD
        F F+G EAISPD L+T S  RKS +P+QPSALKSVIIKP + AT P+ LNEE+VND++LHA ANKSKL+ILSGISNDG LDKGKQKVDI  Q  SA    
Subjt:  FFFEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPN
        K KRKVSFNSPSNKT  FNP+SAPANH     SPEKK++VSRERS+KKKSS  QP  +ANQ KG  ITQP+Q+VAHD DA+KKGLSLTVDLG+LP LDP+
Subjt:  KSKRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPN

Query:  KSLEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS
        KS EDHHSSDNAEVIDITNTEVV ETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +SEAFK QLV+WLKENGLKLS DTDSSGATTS
Subjt:  KSLEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS

Query:  TNVLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP--
        TN L                                                S+GGILILWDAQ+HSLLSQEEG FSLSANF   NN SWWLTGLYGP  
Subjt:  TNVLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP--

Query:  ----------LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHT
                  LHNL HLNS PWI+GGDLNV+RMREEST+V  SSHSS MLN+FISNNLLIDPPLTNNR+TWSNLRNPPTFSR+DRFLYNS W+ LF+PH 
Subjt:  ----------LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHT

Query:  TRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSID
        TRTLPR TSDHFPLVCEDS   L WGP PFRLNSI L+DPEFKRNM RWWE  +Q GHP F FIQRLKSLAN IKPWQKEK  SLT AKE IIREVDSID
Subjt:  TRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSID

Query:  KKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTK
        K ELDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR+ IHEIQDE+GSIQNTNN+IS+AF+  FSRIYR STK
Subjt:  KKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTK

Query:  SDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPK
         DPLFIENL+WNPI  S+WS LCAPF E EIKGVI SFDG K PGPDGFPISFFKS+W+LLKEDI+DIFKDF++                  K DYSHPK
Subjt:  SDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPK

Query:  DFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKW
        DFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQITDAILMANEA+DYWKVKKIKGFILKLDIEKAFDNLNW+FID VL+K N+PN WRKW
Subjt:  DFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKW

Query:  IRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLF
        IRGCISNVTYS+I+NG+PQGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHLES+GAIKG                                       
Subjt:  IRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLF

Query:  ERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
                                        GI CH+LPL+YLGVPLGGNPKSNLFWRN+ED+IQKKL+NWKYA ISKGGRLTLIKSTLSSLPIY+LSV
Subjt:  ERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV

Query:  FQAPSLTCKNIEKLWRKFLWKGN--NEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSK
        FQAPS T KNIEKLWR FLWKG+   + SHLINW+ V+K KEEGGLGISRL VTN+ALLSKWLWRY SEPN+LWRRLI  KYKGK PGD+PSNISSS+SK
Subjt:  FQAPSLTCKNIEKLWRKFLWKGN--NEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSK

Query:  ALWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRG
        A WRSII++ DWFKSNQ WDLNNGDQISFWYSNWS EG LSTAYPRLFAL++DKE S+KD WN+ +NQW I FRR+LNDRE   W+KILE LP  R+NRG
Subjt:  ALWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRG

Query:  SSKPTWIPDSNNSFSIASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRC
         SKPTWIPDS   FSIAS K  IS Q D++  +PR KLL +IWK+ +PMKIKFFMWCL+QR++NTMEV       TLLQPNWCVLC K SETG HLFL C
Subjt:  SSKPTWIPDSNNSFSIASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRC

Query:  EAVKPLWSFLQHSLNFALGSDEFEDLFSFFHSLNCSFPKHKVIFCGIIALFWEIWCERNFRIFGTSSSHKTIANMWEDCKILIGNWCSRDPHFKNYSAAT
        + VKPLWS L  SLNFA  SD+FE +FSFF SLN S PKHKV+FCG+IA+ W IW ERN RIF T S  K+IAN+WEDCKILIGNW SRDP FKNYSAAT
Subjt:  EAVKPLWSFLQHSLNFALGSDEFEDLFSFFHSLNCSFPKHKVIFCGIIALFWEIWCERNFRIFGTSSSHKTIANMWEDCKILIGNWCSRDPHFKNYSAAT

Query:  IALNLSTFCN
        IALNL+ FCN
Subjt:  IALNLSTFCN

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein0.0e+0093.47Show/hide
Query:  YFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQK
        +FKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQK
Subjt:  YFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQK

Query:  NRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLL
        NRKSCILV EGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDC+LSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSS NSFCDSPS DLL
Subjt:  NRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLL

Query:  ENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLW
        ENT+VIVRRFFHDDWHKILQNLRKQT+ESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSP YHATPKLIPSYGGWTTFRGIPLHLW
Subjt:  ENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLW

Query:  NMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPE-EWLIERSVRLHGTFKRQAAASFDDFNPESEQFF
        NMMTFQQIGKAC GLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKF VQVVTHPE +WLIER+VRLHGTFKRQAAASFDDFNPESEQFF
Subjt:  NMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPE-EWLIERSVRLHGTFKRQAAASFDDFNPESEQFF

Query:  FEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKS
        FEGSEAISPDFLSTSSD RKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHA ANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKS
Subjt:  FEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKS

Query:  KRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKS
        KRKVSFNSPSNKTNIFNP+SAPANHSPSLNSPEKKQKVSRERSIKKKSS TQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKS
Subjt:  KRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKS

Query:  LEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTN
        LEDHH+SDNAEV+DITNTEVV ETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLK+NGLKLSTDTDSSGATTSTN
Subjt:  LEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTN

Query:  VLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP----
        VL    N              + LKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP    
Subjt:  VLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP----

Query:  --------LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTR
                LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSH+SRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSW+NLFSPHTTR
Subjt:  --------LHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTR

Query:  TLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKK
        TLPRSTSDHFPLVCEDSNPKLSWGP+PFRLNSITLSDPEFKRNMGRWWEN IQAG+P FSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKK
Subjt:  TLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKK

Query:  ELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSD
        ELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDE+GSIQNTNNSIS AFIKFFSRIYRSSTKSD
Subjt:  ELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSD

Query:  PLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDKCDYSHPKDFRPISLTTSIYKIIAKTLS
        PLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHW                                            
Subjt:  PLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDKCDYSHPKDFRPISLTTSIYKIIAKTLS

Query:  NRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQG
          LKTTLP+TISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLN DFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQG
Subjt:  NRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQG

Query:  RIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNV
        RIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNV
Subjt:  RIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNV

Query:  SLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLW
        SL RAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLW
Subjt:  SLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLW

Query:  KGNN--EWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKSNQSWD
        KGNN  E SHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKA WRSIIDSTDWFKSNQSWD
Subjt:  KGNN--EWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKSNQSWD

Query:  LNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTK
        LNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIAS K
Subjt:  LNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTK

Query:  VLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPN
        VLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPN
Subjt:  VLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPN

A0A5D3C3M3 LINE-1 retrotransposable element ORF2 protein0.0e+0067.65Show/hide
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSPRDLDWIR TLKSLI TP++NRFFLE RD E  IWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFD
         KNRKSCILV EGP+KSG VSFLSMITPKVEVKAKTRPTFLPR+SP+ +LSPPIDYHKRSY KAV++GR   +SDSSDSY SSDSS SSGNS CDSP   
Subjt:  QKNRKSCILVLEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFD

Query:  LLENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLH
        LLENT+V+                                 AL+HF+SN+PANLLCQNKGW+TV KY VR                              
Subjt:  LLENTLVIVRRFFHDDWHKILQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQF
                                                                                                          + 
Subjt:  LWNMMTFQQIGKACGGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQF

Query:  FFEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDK
         F+G EAISPD L+T S  RKS++ +QPSALKSVIIKP R+AT P+ LNEE+VND++LHA   KS+L+ILSGISNDG LDKGKQKVDI  Q  SA   DK
Subjt:  FFEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDK

Query:  SKRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNK
         KRKVSFNSPSNKT  FN +SAP NHSP L+SPEKKQ+VSRERS+KKKSS  QP S+ANQ KG  ITQP+Q+VAHD DA+KKGLSLTVDLG+LP LDP+K
Subjt:  SKRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNK

Query:  SLEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTST
        S EDHHSSDNAEVIDITNTEVV ETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +SEAFK QLV+WLKENGLKLSTDTDSSGATTST
Subjt:  SLEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTST

Query:  NVLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPLHN
        N L             F  L                   +SI+WI KNA  SSGGILILWDAQ+HSLL                                
Subjt:  NVLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPLHN

Query:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP
                    GDLNV+RMREEST+V SSSHSS MLNNFISNNLLIDPPLTNNR+TWSNLRNPPTFSR+DRFLYNS W+ LF+PH TRTL R TSDHFP
Subjt:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP

Query:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES
        LVCEDS   L WGP PFRLNSI L+DP+FKRNM RWWE  +Q GHP FSFI+RLKSLAN IKPWQKEK HSLT AKE IIREVDSIDK ELDTPL+QEES
Subjt:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES

Query:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP
        NRRLALKA+LS+LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR+ IHEIQDE+GSIQNTNN+IS+AF+  FS IYR STK DPLFIENL+WNP
Subjt:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP

Query:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDKCDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTI
        I  S+WS LCAPFLE EIKGVI SFDG K PGPDGFPISFFKS+W+LLKEDI+DIFKDF++K                    IIAKTLSNRLK TLPDTI
Subjt:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDKCDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTI

Query:  SGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQG
        SGNQLAF+KNRQITDAIL ANEA+DYWKVKKIK FILKLDIEKAFDNLNWDFID VL+KKN+PN WRKWIRGCISNVTYS+I+N +PQ RIKANRGLRQG
Subjt:  SGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQG

Query:  DPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASF
        DPLSPFLFV AMDYLSRLLSHLESSGAIKGV L  +CNISHILFADDILLF+EDND FL NLRMALSLFE+ASGLKINL KSA+VPVNVS +RA ECAS 
Subjt:  DPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASF

Query:  WGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGN--NEWSHL
        WGISCH+LPL+YLGVPLGGNPKSN+FWRN+ED+IQKKLNNWKYA ISKGGRLTLIKSTLSSL IYQLSVFQAP  T KNIEKLWR FLWKG+   + SHL
Subjt:  WGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGN--NEWSHL

Query:  INWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKSNQSWDLNNGDQISFWY
        INW+ V+K KEEGGLGISRL V N+ALLSKWLWRY SEPN+LWRRLI  KYKGK PGDIPSNISSS+SKA W+SII++ DWFKSNQ WDLNN DQISFWY
Subjt:  INWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKSNQSWDLNNGDQISFWY

Query:  SNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTKVLISRQLDQTP
        SNWS EG LSTAYPRLFAL++DK+ S+KD WN+ +NQW I FRR+LNDRE   W+ ILE L  PR+NRG SKPTWIPDS   FSIAS K  IS Q D++ 
Subjt:  SNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTKVLISRQLDQTP

Query:  GDPRAKLLEIIWKSSIPMKIKFFMWCLI
         +PR KLL++IWK+ +PMKIKFFMWCL+
Subjt:  GDPRAKLLEIIWKSSIPMKIKFFMWCLI

A0A5D3CI86 Reverse transcriptase domain-containing protein0.0e+0072.94Show/hide
Query:  MITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLLENTLVIVRRFFHDDWHKILQNL
        MITPKVEVK KTRPTFLPR+SP+ +LSPPIDYHKRSYAKAVTEGRPF TSDSSDSY SSDSSHSSGNSFCDSPS DLLENT+V+VRRFFHDDW KILQNL
Subjt:  MITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLLENTLVIVRRFFHDDWHKILQNL

Query:  RKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEET
        RKQT+ESFTYNAFHAEKALVHF+SNIP NLLCQNKGW+TVGKYSVRFEKWSPAYHATPKLIPSYGGWTTF+                             
Subjt:  RKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEET

Query:  RSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISPDFLSTSSDDR
        R++  L+E                                                         +DDF+   E       F+GSEAISPDFLSTSS  R
Subjt:  RSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISPDFLSTSSDDR

Query:  KSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPN
        KSSTPDQPSALKSVIIKPD+ AT P++LNEE+VNDSNLHA ANKS+LEILSGI NDGVLDKGKQKVDIQL PNSALNL+K KRKVSFNSPSNKTNIFNP+
Subjt:  KSSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPN

Query:  SAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHSSDNAEVIDITNTE
        SAPANHS SL+SPEKKQKVSRERSIKKKSS  QP     QNKGV ITQPIQ+VAHD +A+KKGLSL V+LGDLP LDP+KS EDHHSS NAEVIDITNTE
Subjt:  SAPANHSPSLNSPEKKQKVSRERSIKKKSSFTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHSSDNAEVIDITNTE

Query:  VVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLALIKNTIIS
        VV ETPEMKMPVNENSNSSSEANYRKPKHVH+R+YYYRKK  K        E        K +L++W             ++    S +  ALIKN IIS
Subjt:  VVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLALIKNTIIS

Query:  YSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP------------LHN
        YSPDFVILTET LKITNKRIIKS WPSNSINWI KNASGSSGGILILWDAQ+HSLLSQEE +FSLSANF LNNNSSWWLTGLYGP            LHN
Subjt:  YSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGP------------LHN

Query:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP
        LQHLNSFPW L  DLNVIRMREE+TS+LSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNP TFSRIDRFLYNSSW+NLFSPHTTRTLPR TSDHFP
Subjt:  LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP

Query:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES
        LVCEDSNPKL WGP PFRLNSI L+DPEFKRNM RWWEN +Q GH  FSFIQRLKSLAN IKPWQKEKLHSL YAKE IIREVDSIDKKELDTPL+Q+ES
Subjt:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEES

Query:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP
        NRRLALKA+LS+LSLKESQF                       C                                    IY+SSTKSDPLFIENLDWNP
Subjt:  NRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP

Query:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPKDFRPISLTTSIYK
        I  SEW HLCAPFLE EIKGVINSFDGKK P PDGFPISFFKS+W+LLKEDIMDIFKDF++                  K DYSHPKDFRPISLTTSIYK
Subjt:  IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYD------------------KCDYSHPKDFRPISLTTSIYK

Query:  IIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVI
        IIAKTLSNRLKTTLP TISGNQLAF+KNRQITDAILMANEAVDYWKVKKIKGFILKLDIEK F NLNWDFID VL KKNFPN WRKWIRGCISNVTYSVI
Subjt:  IIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVI

Query:  INGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKS
        INGRPQGRIKANRGLRQGDPLSPFLFVIAMDY SRLLSHLE+SGAIKGVSLN NCNISHILFADDILLF+EDNDCFL NL MALSLFE+ASGLKINLLKS
Subjt:  INGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKS

Query:  ALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEK
        ALVPVNVSLNRAKECASFWGISCHSL LSYLGVPLGG+                                                              
Subjt:  ALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEK

Query:  LWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKS
                  ++ SHLINWTKV KSKEEGGLGISRL VTNKALLSKWLWRY SEPNALWRRLIQCKYKGK PGDIPSN SSS+SKA WRSIID+ DWFKS
Subjt:  LWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKS

Query:  NQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFS
        NQSWDLNNGDQISFWYSNWSQEG LSTAYPRLFALTLDKEISVKDAWNT DNQW I FRRELNDRERCNWEKILEILPTPR NRGSSKPTWIPD N SFS
Subjt:  NQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFS

Query:  IASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLN
        IAS K+LIS QLDQT GD R KLLEIIWKS+IPMKIKFFMWCLIQRRI+TMEVIQQ+M NTLLQPNWCVLCNKD+E+GNHLFLRC+AVKPLWS L  SLN
Subjt:  IASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLN

Query:  FALGSDEFEDLFSFFHSLNCSFPKHK
        FAL +D+FE LFSFF SL CS PKHK
Subjt:  FALGSDEFEDLFSFFHSLNCSFPKHK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-4324.31Show/hide
Query:  TFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWG-PVPFRLNSITLSDP--------------EFKRNMGRWWENLIQAGHPRFSF
        T+S+ID  +   S   L     T  +    SDH  +  E     L+      ++LN++ L+D               E   N    ++NL  A    F  
Subjt:  TFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWG-PVPFRLNSITLSDP--------------EFKRNMGRWWENLIQAGHPRFSF

Query:  IQRLKSLA--NFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRR---LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICS
        + R K +A   + +  ++ K+ +LT        ++  ++K+E     T  +++RR     ++A+L E+  +++      ++  +    ++      R+  
Subjt:  IQRLKSLA--NFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRR---LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICS

Query:  SRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRS---STKSDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFK
         +++++ I  I+++KG I      I     +++  +Y +   + +    F++      +   E   L  P    EI  +INS   KK+PGPDGF   F++
Subjt:  SRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRS---STKSDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFK

Query:  SHWYLLKEDIMDIFK----------DFYDKC---------DYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEA
         +   L   ++ +F+           FY+           D +  ++FRPISL     KI+ K L+NR++  +   I  +Q+ F+   Q    I  +   
Subjt:  SHWYLLKEDIMDIFK----------DFYDKC---------DYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEA

Query:  VDYWKVKKIKG-FILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHL
        + +    K K   I+ +D EKAFD +   F+   L K      + K IR      T ++I+NG+         G RQG PLSP LF I ++ L+R    +
Subjt:  VDYWKVKKIKG-FILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHL

Query:  ESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPK
             IKG+ L G   +   LFADD+++++E+     +NL   +S F + SG KIN+ KS     N +     +       +  S  + YLG+ L  + K
Subjt:  ESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPK

Query:  SNLFWRNVE---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSLTCKNIEKLWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISR
         +LF  N +    +I++  N WK    S  GR+ ++K  +    IY+ +    + P      +EK   KF+W   N+    I  + +S+  + GG+ +  
Subjt:  SNLFWRNVE---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSLTCKNIEKLWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISR

Query:  LNVTNKALLSK--WLWRYLSEPNALWRR
          +  KA ++K  W W Y +     W R
Subjt:  LNVTNKALLSK--WLWRYLSEPNALWRR

P08548 LINE-1 reverse transcriptase homolog4.0e-4224.33Show/hide
Query:  LTGLYGPLHN-----------LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLT----NNRFTWSNLRNPPTFSRIDRFLY
        +  +Y P HN           + +L S   I+ GD N      + +S    S     LN+ I +  L D   T       +T+ +  +  T+S+ID  L 
Subjt:  LTGLYGPLHN-----------LQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLT----NNRFTWSNLRNPPTFSRIDRFLY

Query:  NSSWKNLFSPHTTRTLPRSTSDHFPLVCE-DSNPKLSWGPVPFRLNSITLSD----PEFKRNMGRW----------WENLIQAGHP--RFSFIQRLKSLA
        + S  NL        +P   SDH  +  E ++N  L      ++LN++ L D     E K+ + ++          ++NL        R  FI    +L 
Subjt:  NSSWKNLFSPHTTRTLPRSTSDHFPLVCE-DSNPKLSWGPVPFRLNSITLSD----PEFKRNMGRW----------WENLIQAGHP--RFSFIQRLKSLA

Query:  NFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRR--LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEI
         F+K  ++E++++L       +  +  ++K+E   P   + S R+    ++A+L+E+  K       ++K  +  + ++       +   ++ +S I  I
Subjt:  NFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRR--LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEI

Query:  QDEKGSIQNTNNSISIAFIKFFSRIYR---SSTKSDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIM
        ++    I    + I     +++ ++Y     + K    ++E      ++  E   L  P    EI   I +   KK+PGPDGF   F+++    L   ++
Subjt:  QDEKGSIQNTNNSISIAFIKFFSRIYR---SSTKSDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIM

Query:  DIFKD----------FYDK---------CDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYW-KVKKIK
        ++F++          FY+           D +  +++RPISL     KI+ K L+NR++  +   I  +Q+ F+   Q    I  +   + +  K+K   
Subjt:  DIFKD----------FYDK---------CDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYW-KVKKIK

Query:  GFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSL
          IL +D EKAFDN+   F+   L+K      + K I    S  T ++I+NG          G RQG PLSP LF I M+ L+     +    AIKG+ +
Subjt:  GFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSL

Query:  NGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVE--
         G+  I   LFADD+++++E+       L   +  +   SG KIN  KS       +    K        +     + YLGV L  + K +L+  N E  
Subjt:  NGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVE--

Query:  -DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSLTCKNIEKLWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSK
          +I + +N WK    S  GR+ ++K ++    IY  +    +AP    K++EK+   F+W   N+    I  T +S   + GG+ +  L +  K+++ K
Subjt:  -DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSLTCKNIEKLWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSK

Query:  --WLWRYLSEPNALWRRL
          W W    E + +W R+
Subjt:  --WLWRYLSEPNALWRRL

P0C2F6 Putative ribonuclease H protein At1g657504.5e-3026.88Show/hide
Query:  DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGNNE--WSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKW
        +++  +++ W+   +S  GRLTL K+ LSS+P++ +S    P      +++L R FLW    E    HL+ W+KV   K+EGGLG+      N+AL+SK 
Subjt:  DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGNNE--WSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKW

Query:  LWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSI-IDSTDWFKSNQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDA
         WR L E N+LW  ++Q KY      D    I   +  + WRSI I   D       W   +G QI FW   W   G+           T    +  KD 
Subjt:  LWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKALWRSI-IDSTDWFKSNQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDA

Query:  WNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQR
        W      W+     +++     N    L  +          + +W    +  FS+ S   +++  +D+ P    A     +WK  +P ++K F+W +  +
Subjt:  WNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQR

Query:  RINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLW
         + T E   ++    L   N C +C    E+  H+   C A   +W
Subjt:  RINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRCEAVKPLW

P11369 LINE-1 retrotransposable element ORF2 protein9.1e-4723.9Show/hide
Query:  TFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDH------FPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLAN
        TFS+ID  + + +  N +       +P   SDH      F     +  P  +W     +LN+  L+D   K  + +  ++ ++      +      +L +
Subjt:  TFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDH------FPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLAN

Query:  FIKPWQKEKLHSLTYAKE--------AIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRS
         +K + + KL +L+ +K+        ++   + +++KKE ++P  +      + L+ +++++  + +     + +  +  + ++      R+    + + 
Subjt:  FIKPWQKEKLHSLTYAKE--------AIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRS

Query:  FIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPL-----FIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWY
         I++I++EKG I      I      F+ R+Y  STK + L     F++      +   +  HL +P    EI+ VINS   KK+PGPDGF   F+++   
Subjt:  FIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPL-----FIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWY

Query:  LLKEDIMDIFKDFYDKC-----------------------DYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEA
          KED++ I    + K                        D +  ++FRPISL     KI+ K L+NR++  +   I  +Q+ F+   Q    I  +   
Subjt:  LLKEDIMDIFKDFYDKC-----------------------DYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEA

Query:  VDYW-KVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHL
        + Y  K+K     I+ LD EKAFD +   F+  VLE+     P+   I+   S    ++ +NG     I    G RQG PLSP+LF I ++ L+R +   
Subjt:  VDYW-KVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHL

Query:  ESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPK
        +    IKG+ + G   +   L ADD++++I D     + L   ++ F    G KIN  KS       +    KE       S  +  + YLGV L    K
Subjt:  ESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPK

Query:  S--NLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSLTCKNIEKLWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRL
           +  +++++ +I++ L  WK    S  GR+ ++K  +    IY+ +    + P+     +E    KF+W  NN+   +   + +   +  GG+ +  L
Subjt:  S--NLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSLTCKNIEKLWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRL

Query:  NVTNKALLSK--WLWRYLSEPNALWRRL
         +  +A++ K  W W Y       W R+
Subjt:  NVTNKALLSK--WLWRYLSEPNALWRRL

P14381 Transposon TX1 uncharacterized 149 kDa protein4.1e-3123.62Show/hide
Query:  ILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNN----RFTWSNLRN-PPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDH----FP
        I+GGD N      +         S  +L   I++  L+D     N     FT+  +R+   + SRIDR   +S   +     T R  P   SDH      
Subjt:  ILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNN----RFTWSNLRN-PPTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDH----FP

Query:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLAN-FIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEE
        +    S PK ++    +  N+  L D  F +++   W    +A    F+ + +   +    +K   +E   S++  + A I  ++  +  +L+  L+  E
Subjt:  LVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLAN-FIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEE

Query:  SN----RRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIEN
                L  K  L  +  ++++  + R++   L + D  S FF+ +   +  R  I  +  E G+      +I      F+  ++     S     E 
Subjt:  SN----RRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIEN

Query:  LDWNPIASS-EWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDK------C------------DYSHPKDFRPISL
         D  P+ S      L  P    E+   +      K+PG DG  I FF+  W  L  D   +  + + K      C            D    K++RP+SL
Subjt:  LDWNPIASS-EWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDK------C------------DYSHPKDFRPISL

Query:  TTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISN
         ++ YKI+AK +S RLK+ L + I  +Q   V  R I D + +  + + + +   +    L LD EKAFD ++  ++   L+  +F   +  +++   ++
Subjt:  TTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISN

Query:  VTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLK
            V IN      +   RG+RQG PLS  L+ +A++    LL    +   +K      +  +    +ADD++L  +D    L+  +    ++  AS  +
Subjt:  VTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLK

Query:  INLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGN--PKSNLFWRNVEDKIQKKLNNWK-YAQI-SKGGRLTLIKSTLSSLPIYQLSVFQA
        IN  KS+ + +  SL       +F  IS  S  + YLGV L     P S  F   +E+ +  +L  WK +A++ S  GR  +I   ++S   Y+L     
Subjt:  INLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGN--PKSNLFWRNVEDKIQKKLNNWK-YAQI-SKGGRLTLIKSTLSSLPIYQLSVFQA

Query:  PSLTCKNIEKLWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYL-SEPNALWRRLIQCKYK
               I++    FLW G     H ++    S   +EGG G+  +         + + RYL ++P+  W  L    Y+
Subjt:  PSLTCKNIEKLWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYL-SEPNALWRRLIQCKYK

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.0e-2224.69Show/hide
Query:  ILGGDLNVIRMREESTSVLSSSHSSRMLNNF---ISNNLLIDPPLTNNRFTWSNLRNP-PTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP-LVCE
        IL GD + I    +  SVL +S   R L  F   + ++ L+D P     +TWSN ++  P   ++DR + N  W + F            SDH P ++  
Subjt:  ILGGDLNVIRMREESTSVLSSSHSSRMLNNF---ISNNLLIDPPLTNNRFTWSNLRNP-PTFSRIDRFLYNSSWKNLFSPHTTRTLPRSTSDHFP-LVCE

Query:  DSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRRL
        ++ PK S     FR  S   + P F  ++   WE  I  G   FS  + LK+     K   ++   ++ +  +  +  ++SI  + L  P         +
Subjt:  DSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRRL

Query:  ALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQ-DEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENL----DWN
        A K      +  ES F+ Q+++  WL++GD N+ FFH++  + Q ++ I  ++ D+   ++N    +    + +++ +  S   SD L  +++    D +
Subjt:  ALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQ-DEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENL----DWN

Query:  PIASSEW--SHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFY------------------DKCDYSHPKDFRPISLTTS
        P   ++   S L A   + EI   + +    K PGPD F   FF   W+++K+  +   K+F+                            FRP+S  T 
Subjt:  PIASSEW--SHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFY------------------DKCDYSHPKDFRPISLTTS

Query:  IYKII
        +YKII
Subjt:  IYKII

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.8e-1924.93Show/hide
Query:  SLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGN--NEWSHLINWTKV
        +LP+ YLG+PL     +   +  + +KI+ ++  W    +S  GRL LI S + SL  + +S F+ PS   K I+ +   FLW G   N     + W+ V
Subjt:  SLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGN--NEWSHLINWTKV

Query:  SKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKA--LWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNWS
           K+EGGLGI  L   NK               + W                  +IS +T+    +W+ I+            D++NG   SFW+ NWS
Subjt:  SKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKA--LWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNWS

Query:  QEGRL--STAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTKVLISRQLDQTPGD
        + GRL   T +     + +    SV +A        N   RR  +D      E ++  +       G     W  + +      +TK     +      +
Subjt:  QEGRL--STAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTKVLISRQLDQTPGD

Query:  PRAKL--LEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRC
        P+ K+   + +W S    K     W  I+ R+ T +   + +       + CVLC+   ET +HLF  C
Subjt:  PRAKL--LEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETGNHLFLRC

AT4G29090.1 Ribonuclease H-like superfamily protein1.4e-2324.94Show/hide
Query:  SLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGNNE--WSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIP
        +LP Y ++ F  P   CK I  +   F W+   E    H   W  +S  K EGG+G   +   N ALL K +WR LS P +L  ++ + +Y   F    P
Subjt:  SLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGNNE--WSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDIP

Query:  SNIS-SSTSKALWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNW------SQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCN
         N    S    +W+SI  S +  +      + NG+ I  W   W      S   R+    P+ +A ++   + V D  +    +W    R+++ +     
Subjt:  SNIS-SSTSKALWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNW------SQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERCN

Query:  WEKILEILPTPRSNRGSSKPTWIPDSNNSFSIAS-----TKVLISRQLDQTPGDPRAK-LLEIIWKSSIPMKIKFFMW-CLIQRRINTMEVIQQKMPNTL
         E+ L     P   R     TW   S+  +++ S     T+++  R   Q   +P    + + IWKS    KI+ F+W CL     N++ V        L
Subjt:  WEKILEILPTPRSNRGSSKPTWIPDSNNSFSIAS-----TKVLISRQLDQTPGDPRAK-LLEIIWKSSIPMKIKFFMW-CLIQRRINTMEVIQQKMPNTL

Query:  LQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLNFALGSDEFEDLF---SFFHSLNCSFPKHKVIFCGIIALFWEIWCERNFRIF
         + + C+ C    ET NHL  +C   +  W+    S+   LG +  + ++    +  +L    P+ +     +  L W +W  RN  +F
Subjt:  LQPNWCVLCNKDSETGNHLFLRCEAVKPLWSFLQHSLNFALGSDEFEDLF---SFFHSLNCSFPKHKVIFCGIIALFWEIWCERNFRIF

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.8e-1131.03Show/hide
Query:  SLPIYQLSVFQAPSLTCKNIEKLWRKFLWKG--NNEWSHLINWTKVSKSKE-EGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDI
        +LP+Y +S F+   L CK +     +F W    N      + W K+ KSKE +GGLG   L   N+ALL+K  +R + +P+ L  RL++ +Y   FP   
Subjt:  SLPIYQLSVFQAPSLTCKNIEKLWRKFLWKG--NNEWSHLINWTKVSKSKE-EGGLGISRLNVTNKALLSKWLWRYLSEPNALWRRLIQCKYKGKFPGDI

Query:  PSNISSSTSKA-LWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNW
            S  T  +  WRSII   +         + +G     W   W
Subjt:  PSNISSSTSKA-LWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.4e-1249.25Show/hide
Query:  IINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNC-NISHILFADD
        IING PQG +  +RGLRQGDPLSP+LF++  + LS L    +  G + G+ ++ N   I+H+LFADD
Subjt:  IINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNC-NISHILFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTACTTCAAATCACTTCCTAGATCATGCAAGGTTGAAAGAAAGGAATTTGTCCTTCACCTTGACAAATATTCGAAGCATACTCACTATTGGTTAACCGAAACTGG
AGCTCACAAGGCCTTCTCCATCGAAGTTTCCCCAAGAGACTTAGACTGGATAAGATGTACTCTGAAATCATTGATAGCAACTCCAAATACAAACCGTTTCTTCCTTGAGA
CCCGTGACTCTGAGCAACGCATCTGGATCAGGAAAACAAGAAACAGTAAAGGATGTACTGCAGAAATTTTTAGAGTTGATCAAAAAAACAGAAAATCATGTATTCTAGTT
CTGGAAGGCCCTGATAAAAGTGGCTGGGTTTCCTTCCTGTCTATGATTACCCCAAAAGTGGAAGTGAAAGCAAAAACAAGACCAACCTTTTTGCCAAGGACCAGTCCTGA
TTGTCAACTATCTCCTCCCATTGACTACCACAAACGATCATATGCAAAAGCTGTCACTGAAGGAAGACCTTTTGCTACAAGTGACTCAAGTGACTCTTATGATTCAAGCG
ATTCAAGCCATTCATCAGGTAATAGCTTTTGCGACTCTCCCTCTTTTGACCTTCTTGAAAATACATTGGTGATAGTTAGACGTTTCTTTCATGATGACTGGCATAAAATC
CTTCAAAACCTGAGGAAACAAACAGATGAATCTTTCACTTACAATGCTTTCCATGCTGAAAAGGCTTTAGTTCATTTTAGTTCAAACATACCCGCAAATCTCCTCTGCCA
AAACAAAGGATGGTCCACTGTAGGGAAGTACTCGGTAAGATTTGAAAAATGGTCCCCTGCGTACCACGCCACTCCAAAACTCATTCCTAGTTATGGAGGATGGACAACTT
TCAGAGGAATTCCACTACACTTGTGGAATATGATGACTTTTCAACAAATTGGGAAAGCCTGCGGAGGTTTGATTAAAGTGGCTGAGGAAACAAGATCAGCAAAAAACCTG
ATAGAAGCAAGGATAAAAGTCAGATACAATTACTCAGGTTTCTTACCAGCAAATGTAAGGATTTTTGATAATGAAGGAAACAAATTTTCCGTTCAAGTAGTTACTCATCC
AGAAGAATGGTTAATAGAAAGGAGTGTCAGATTACATGGTACCTTCAAGAGACAAGCTGCAGCCTCCTTTGATGACTTCAATCCTGAATCAGAACAATTCTTCTTCGAAG
GATCGGAGGCCATATCGCCGGACTTTCTTTCCACCAGCTCCGACGACCGTAAAAGCAGCACACCGGATCAGCCATCAGCATTAAAATCTGTTATCATTAAACCTGACAGA
AATGCCACGTTGCCAAGCTTCTTAAATGAAGAGTTAGTTAATGATAGTAATTTGCATGCAATGGCTAATAAATCCAAGTTAGAGATTTTATCTGGGATATCAAATGATGG
CGTATTGGACAAAGGAAAACAGAAGGTTGACATTCAGCTTCAACCCAATTCAGCATTAAATTTGGACAAATCCAAAAGGAAAGTCTCCTTTAATTCTCCCAGTAATAAAA
CCAACATCTTCAACCCGAATTCTGCCCCAGCCAATCATTCTCCATCATTGAATTCCCCTGAGAAAAAACAGAAAGTTAGTAGAGAGAGAAGTATCAAGAAGAAATCGTCT
TTCACGCAGCCGAATTCAAAAGCCAATCAGAACAAAGGTGTATTCATTACTCAACCAATTCAAATTGTGGCACACGATCGGGATGCTGCTAAAAAAGGTCTCTCTCTCAC
TGTTGATCTGGGAGATCTGCCAGCTTTGGATCCAAATAAATCCCTTGAAGACCATCACAGCTCTGACAATGCAGAAGTTATTGATATAACAAACACTGAAGTGGTTACTG
AGACACCTGAAATGAAAATGCCAGTTAATGAGAATTCAAATTCTTCTTCTGAAGCCAACTACAGGAAACCAAAACATGTTCATAAAAGAAAATATTACTACAGAAAAAAA
GAAGAAAAGGAGAAGGATCCGGACTCAGAGGCCTTCAAAAAACAACTCGTTTCCTGGTTAAAGGAAAATGGTCTGAAACTCTCTACGGACACTGACTCTTCAGGTGCAAC
TACTTCAACAAATGTTTTAGCCCTAATAAAAAATACAATAATTTCATACTCCCCTGATTTTGTGATTTTGACTGAAACTAGGCTTAAGATCACAAACAAGAGAATCATAA
AGTCCCTTTGGCCCTCTAATAGCATTAATTGGATTGCGAAAAATGCTTCTGGTAGTTCTGGAGGGATCTTAATTCTTTGGGATGCCCAGAATCACTCTCTTTTAAGTCAA
GAGGAAGGGCTCTTTAGCCTATCAGCAAATTTTTTGCTCAACAACAATTCGTCCTGGTGGTTAACAGGTCTTTATGGTCCATTACATAATCTTCAACATCTTAATTCCTT
CCCCTGGATTTTAGGAGGTGATCTTAACGTCATCAGAATGAGAGAGGAATCAACGTCAGTCTTGAGCTCTTCTCACAGCTCCAGAATGTTAAACAATTTCATCTCCAACA
ATCTTCTGATAGATCCTCCTCTCACAAACAATAGATTCACTTGGTCAAACTTACGGAATCCTCCTACCTTTTCCCGAATTGATAGATTCCTTTACAATTCAAGTTGGAAA
AATCTCTTCAGTCCCCATACTACAAGGACCCTTCCTAGATCTACTTCAGACCACTTTCCTCTGGTCTGTGAAGATTCCAACCCCAAACTTAGTTGGGGTCCTGTCCCGTT
CCGTTTAAACTCCATAACTCTAAGTGACCCAGAATTCAAAAGAAACATGGGAAGATGGTGGGAAAACTTGATCCAAGCTGGTCACCCAAGATTCTCTTTCATTCAAAGGC
TAAAGTCTTTAGCAAATTTTATCAAACCTTGGCAAAAGGAGAAATTACACTCTCTCACCTATGCTAAAGAAGCCATTATTAGGGAAGTGGACTCTATTGACAAAAAGGAA
TTGGATACTCCTCTGACACAGGAGGAAAGTAATCGCCGATTAGCTCTAAAAGCTGATCTCAGTGAGTTATCCCTAAAGGAGTCCCAATTCTGGTATCAAAGGGCTAAAAA
GCTTTGGCTTAGGGAGGGAGATGAAAATTCCTCCTTCTTTCATAGAATTTGCTCATCAAGACAAAAGAGAAGTTTCATTCATGAAATCCAGGATGAAAAAGGCTCGATTC
AGAATACAAACAACAGTATATCAATTGCTTTTATAAAATTCTTTTCAAGGATTTATAGAAGTTCTACAAAAAGTGATCCTCTGTTTATTGAAAATCTTGATTGGAACCCG
ATTGCGTCTTCTGAGTGGTCACACCTTTGTGCCCCTTTTTTGGAAGGAGAGATTAAAGGGGTTATCAACTCTTTTGATGGAAAAAAGACTCCTGGTCCAGACGGCTTCCC
TATCTCCTTCTTTAAATCTCACTGGTATCTTCTAAAAGAGGACATCATGGATATATTCAAGGATTTTTATGACAAATGTGACTATTCTCACCCAAAAGACTTCAGACCAA
TCAGCCTAACAACGTCCATCTACAAGATCATTGCCAAAACTCTTTCAAACAGGTTAAAAACCACCCTTCCTGACACCATCTCAGGAAACCAGCTAGCTTTTGTCAAGAAT
CGCCAAATAACTGATGCTATCCTAATGGCGAACGAAGCTGTGGATTATTGGAAGGTGAAGAAGATTAAGGGCTTTATTTTGAAGCTTGACATCGAAAAGGCTTTTGACAA
TCTAAACTGGGATTTCATTGATAATGTTCTGGAGAAAAAGAATTTTCCAAATCCTTGGAGAAAGTGGATAAGAGGATGTATAAGCAACGTCACTTACTCTGTTATTATCA
ACGGAAGACCCCAAGGTCGAATCAAAGCTAACAGAGGTCTTAGACAAGGTGATCCCCTTTCCCCCTTCCTGTTTGTTATTGCCATGGATTACCTTAGTCGTCTCTTATCC
CATCTGGAAAGTTCTGGTGCCATTAAAGGGGTCTCTCTCAACGGTAATTGCAACATCTCCCACATCCTTTTCGCTGATGATATTCTTCTTTTTATAGAAGATAATGATTG
TTTCCTAAAAAACCTTAGAATGGCTTTATCTCTGTTTGAAAGAGCTTCGGGTCTCAAAATAAATTTATTGAAATCAGCTTTGGTACCAGTGAATGTGTCCTTGAATAGAG
CTAAAGAATGTGCTTCTTTTTGGGGTATTTCTTGCCATTCTCTCCCCCTTTCCTATTTGGGAGTTCCTCTTGGTGGTAATCCAAAATCCAACCTTTTTTGGCGCAACGTT
GAAGATAAGATCCAAAAAAAGCTCAATAATTGGAAATATGCACAGATATCAAAAGGTGGAAGACTCACTTTAATCAAGTCTACCCTTAGCAGTCTTCCTATATATCAACT
ATCTGTTTTCCAAGCTCCTTCCTTGACGTGCAAAAACATTGAAAAATTATGGAGAAAGTTCCTTTGGAAAGGTAATAACGAATGGTCTCACCTAATCAACTGGACTAAAG
TCTCTAAATCTAAAGAGGAGGGTGGTCTGGGGATCTCAAGGCTCAATGTGACAAACAAAGCCCTCTTATCTAAATGGCTTTGGCGTTATCTCTCAGAACCTAATGCCCTT
TGGAGGAGGCTAATTCAATGCAAATATAAAGGCAAATTTCCAGGGGACATTCCATCAAACATCTCCTCTAGTACTTCTAAAGCCCTGTGGAGATCTATCATCGACAGCAC
CGATTGGTTCAAAAGTAATCAAAGTTGGGATTTGAATAATGGAGACCAAATCTCCTTCTGGTATTCTAATTGGTCTCAAGAAGGACGCCTTTCAACTGCCTATCCTAGAC
TTTTTGCTCTTACTCTTGACAAAGAAATCTCAGTAAAAGATGCGTGGAACACATTCGACAACCAATGGAATATAATTTTTAGAAGAGAGCTGAACGATAGAGAAAGATGC
AATTGGGAGAAAATTTTAGAGATTCTCCCTACTCCGAGATCTAACAGAGGGTCAAGTAAGCCTACCTGGATTCCTGACAGCAACAACTCTTTCTCCATTGCCTCCACAAA
AGTCTTGATCTCTCGGCAGCTGGATCAAACCCCAGGGGACCCTCGAGCAAAGCTTCTTGAGATCATTTGGAAGTCCAGCATTCCAATGAAGATCAAATTCTTCATGTGGT
GCCTGATACAAAGAAGAATAAATACGATGGAGGTTATTCAGCAAAAAATGCCCAACACTCTCCTTCAACCCAACTGGTGCGTGTTGTGCAACAAAGACAGTGAAACAGGA
AATCACCTTTTTCTTCGATGCGAAGCTGTGAAACCCTTATGGTCCTTTCTCCAGCACTCCCTCAATTTCGCTCTTGGTTCCGATGAATTTGAGGATCTGTTCTCCTTCTT
CCACTCTCTAAATTGCTCCTTCCCGAAACACAAGGTTATTTTTTGTGGAATTATAGCTTTATTTTGGGAAATTTGGTGTGAAAGGAACTTTAGAATTTTTGGAACCTCTA
GCTCTCATAAAACAATTGCTAATATGTGGGAGGACTGTAAAATCCTTATAGGCAATTGGTGCAGTAGGGATCCCCATTTTAAAAATTATTCGGCTGCTACAATTGCTTTA
AACCTTAGCACCTTCTGTAATTATATCTTGGACTTTTCTCTAGCCCCTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTACTTCAAATCACTTCCTAGATCATGCAAGGTTGAAAGAAAGGAATTTGTCCTTCACCTTGACAAATATTCGAAGCATACTCACTATTGGTTAACCGAAACTGG
AGCTCACAAGGCCTTCTCCATCGAAGTTTCCCCAAGAGACTTAGACTGGATAAGATGTACTCTGAAATCATTGATAGCAACTCCAAATACAAACCGTTTCTTCCTTGAGA
CCCGTGACTCTGAGCAACGCATCTGGATCAGGAAAACAAGAAACAGTAAAGGATGTACTGCAGAAATTTTTAGAGTTGATCAAAAAAACAGAAAATCATGTATTCTAGTT
CTGGAAGGCCCTGATAAAAGTGGCTGGGTTTCCTTCCTGTCTATGATTACCCCAAAAGTGGAAGTGAAAGCAAAAACAAGACCAACCTTTTTGCCAAGGACCAGTCCTGA
TTGTCAACTATCTCCTCCCATTGACTACCACAAACGATCATATGCAAAAGCTGTCACTGAAGGAAGACCTTTTGCTACAAGTGACTCAAGTGACTCTTATGATTCAAGCG
ATTCAAGCCATTCATCAGGTAATAGCTTTTGCGACTCTCCCTCTTTTGACCTTCTTGAAAATACATTGGTGATAGTTAGACGTTTCTTTCATGATGACTGGCATAAAATC
CTTCAAAACCTGAGGAAACAAACAGATGAATCTTTCACTTACAATGCTTTCCATGCTGAAAAGGCTTTAGTTCATTTTAGTTCAAACATACCCGCAAATCTCCTCTGCCA
AAACAAAGGATGGTCCACTGTAGGGAAGTACTCGGTAAGATTTGAAAAATGGTCCCCTGCGTACCACGCCACTCCAAAACTCATTCCTAGTTATGGAGGATGGACAACTT
TCAGAGGAATTCCACTACACTTGTGGAATATGATGACTTTTCAACAAATTGGGAAAGCCTGCGGAGGTTTGATTAAAGTGGCTGAGGAAACAAGATCAGCAAAAAACCTG
ATAGAAGCAAGGATAAAAGTCAGATACAATTACTCAGGTTTCTTACCAGCAAATGTAAGGATTTTTGATAATGAAGGAAACAAATTTTCCGTTCAAGTAGTTACTCATCC
AGAAGAATGGTTAATAGAAAGGAGTGTCAGATTACATGGTACCTTCAAGAGACAAGCTGCAGCCTCCTTTGATGACTTCAATCCTGAATCAGAACAATTCTTCTTCGAAG
GATCGGAGGCCATATCGCCGGACTTTCTTTCCACCAGCTCCGACGACCGTAAAAGCAGCACACCGGATCAGCCATCAGCATTAAAATCTGTTATCATTAAACCTGACAGA
AATGCCACGTTGCCAAGCTTCTTAAATGAAGAGTTAGTTAATGATAGTAATTTGCATGCAATGGCTAATAAATCCAAGTTAGAGATTTTATCTGGGATATCAAATGATGG
CGTATTGGACAAAGGAAAACAGAAGGTTGACATTCAGCTTCAACCCAATTCAGCATTAAATTTGGACAAATCCAAAAGGAAAGTCTCCTTTAATTCTCCCAGTAATAAAA
CCAACATCTTCAACCCGAATTCTGCCCCAGCCAATCATTCTCCATCATTGAATTCCCCTGAGAAAAAACAGAAAGTTAGTAGAGAGAGAAGTATCAAGAAGAAATCGTCT
TTCACGCAGCCGAATTCAAAAGCCAATCAGAACAAAGGTGTATTCATTACTCAACCAATTCAAATTGTGGCACACGATCGGGATGCTGCTAAAAAAGGTCTCTCTCTCAC
TGTTGATCTGGGAGATCTGCCAGCTTTGGATCCAAATAAATCCCTTGAAGACCATCACAGCTCTGACAATGCAGAAGTTATTGATATAACAAACACTGAAGTGGTTACTG
AGACACCTGAAATGAAAATGCCAGTTAATGAGAATTCAAATTCTTCTTCTGAAGCCAACTACAGGAAACCAAAACATGTTCATAAAAGAAAATATTACTACAGAAAAAAA
GAAGAAAAGGAGAAGGATCCGGACTCAGAGGCCTTCAAAAAACAACTCGTTTCCTGGTTAAAGGAAAATGGTCTGAAACTCTCTACGGACACTGACTCTTCAGGTGCAAC
TACTTCAACAAATGTTTTAGCCCTAATAAAAAATACAATAATTTCATACTCCCCTGATTTTGTGATTTTGACTGAAACTAGGCTTAAGATCACAAACAAGAGAATCATAA
AGTCCCTTTGGCCCTCTAATAGCATTAATTGGATTGCGAAAAATGCTTCTGGTAGTTCTGGAGGGATCTTAATTCTTTGGGATGCCCAGAATCACTCTCTTTTAAGTCAA
GAGGAAGGGCTCTTTAGCCTATCAGCAAATTTTTTGCTCAACAACAATTCGTCCTGGTGGTTAACAGGTCTTTATGGTCCATTACATAATCTTCAACATCTTAATTCCTT
CCCCTGGATTTTAGGAGGTGATCTTAACGTCATCAGAATGAGAGAGGAATCAACGTCAGTCTTGAGCTCTTCTCACAGCTCCAGAATGTTAAACAATTTCATCTCCAACA
ATCTTCTGATAGATCCTCCTCTCACAAACAATAGATTCACTTGGTCAAACTTACGGAATCCTCCTACCTTTTCCCGAATTGATAGATTCCTTTACAATTCAAGTTGGAAA
AATCTCTTCAGTCCCCATACTACAAGGACCCTTCCTAGATCTACTTCAGACCACTTTCCTCTGGTCTGTGAAGATTCCAACCCCAAACTTAGTTGGGGTCCTGTCCCGTT
CCGTTTAAACTCCATAACTCTAAGTGACCCAGAATTCAAAAGAAACATGGGAAGATGGTGGGAAAACTTGATCCAAGCTGGTCACCCAAGATTCTCTTTCATTCAAAGGC
TAAAGTCTTTAGCAAATTTTATCAAACCTTGGCAAAAGGAGAAATTACACTCTCTCACCTATGCTAAAGAAGCCATTATTAGGGAAGTGGACTCTATTGACAAAAAGGAA
TTGGATACTCCTCTGACACAGGAGGAAAGTAATCGCCGATTAGCTCTAAAAGCTGATCTCAGTGAGTTATCCCTAAAGGAGTCCCAATTCTGGTATCAAAGGGCTAAAAA
GCTTTGGCTTAGGGAGGGAGATGAAAATTCCTCCTTCTTTCATAGAATTTGCTCATCAAGACAAAAGAGAAGTTTCATTCATGAAATCCAGGATGAAAAAGGCTCGATTC
AGAATACAAACAACAGTATATCAATTGCTTTTATAAAATTCTTTTCAAGGATTTATAGAAGTTCTACAAAAAGTGATCCTCTGTTTATTGAAAATCTTGATTGGAACCCG
ATTGCGTCTTCTGAGTGGTCACACCTTTGTGCCCCTTTTTTGGAAGGAGAGATTAAAGGGGTTATCAACTCTTTTGATGGAAAAAAGACTCCTGGTCCAGACGGCTTCCC
TATCTCCTTCTTTAAATCTCACTGGTATCTTCTAAAAGAGGACATCATGGATATATTCAAGGATTTTTATGACAAATGTGACTATTCTCACCCAAAAGACTTCAGACCAA
TCAGCCTAACAACGTCCATCTACAAGATCATTGCCAAAACTCTTTCAAACAGGTTAAAAACCACCCTTCCTGACACCATCTCAGGAAACCAGCTAGCTTTTGTCAAGAAT
CGCCAAATAACTGATGCTATCCTAATGGCGAACGAAGCTGTGGATTATTGGAAGGTGAAGAAGATTAAGGGCTTTATTTTGAAGCTTGACATCGAAAAGGCTTTTGACAA
TCTAAACTGGGATTTCATTGATAATGTTCTGGAGAAAAAGAATTTTCCAAATCCTTGGAGAAAGTGGATAAGAGGATGTATAAGCAACGTCACTTACTCTGTTATTATCA
ACGGAAGACCCCAAGGTCGAATCAAAGCTAACAGAGGTCTTAGACAAGGTGATCCCCTTTCCCCCTTCCTGTTTGTTATTGCCATGGATTACCTTAGTCGTCTCTTATCC
CATCTGGAAAGTTCTGGTGCCATTAAAGGGGTCTCTCTCAACGGTAATTGCAACATCTCCCACATCCTTTTCGCTGATGATATTCTTCTTTTTATAGAAGATAATGATTG
TTTCCTAAAAAACCTTAGAATGGCTTTATCTCTGTTTGAAAGAGCTTCGGGTCTCAAAATAAATTTATTGAAATCAGCTTTGGTACCAGTGAATGTGTCCTTGAATAGAG
CTAAAGAATGTGCTTCTTTTTGGGGTATTTCTTGCCATTCTCTCCCCCTTTCCTATTTGGGAGTTCCTCTTGGTGGTAATCCAAAATCCAACCTTTTTTGGCGCAACGTT
GAAGATAAGATCCAAAAAAAGCTCAATAATTGGAAATATGCACAGATATCAAAAGGTGGAAGACTCACTTTAATCAAGTCTACCCTTAGCAGTCTTCCTATATATCAACT
ATCTGTTTTCCAAGCTCCTTCCTTGACGTGCAAAAACATTGAAAAATTATGGAGAAAGTTCCTTTGGAAAGGTAATAACGAATGGTCTCACCTAATCAACTGGACTAAAG
TCTCTAAATCTAAAGAGGAGGGTGGTCTGGGGATCTCAAGGCTCAATGTGACAAACAAAGCCCTCTTATCTAAATGGCTTTGGCGTTATCTCTCAGAACCTAATGCCCTT
TGGAGGAGGCTAATTCAATGCAAATATAAAGGCAAATTTCCAGGGGACATTCCATCAAACATCTCCTCTAGTACTTCTAAAGCCCTGTGGAGATCTATCATCGACAGCAC
CGATTGGTTCAAAAGTAATCAAAGTTGGGATTTGAATAATGGAGACCAAATCTCCTTCTGGTATTCTAATTGGTCTCAAGAAGGACGCCTTTCAACTGCCTATCCTAGAC
TTTTTGCTCTTACTCTTGACAAAGAAATCTCAGTAAAAGATGCGTGGAACACATTCGACAACCAATGGAATATAATTTTTAGAAGAGAGCTGAACGATAGAGAAAGATGC
AATTGGGAGAAAATTTTAGAGATTCTCCCTACTCCGAGATCTAACAGAGGGTCAAGTAAGCCTACCTGGATTCCTGACAGCAACAACTCTTTCTCCATTGCCTCCACAAA
AGTCTTGATCTCTCGGCAGCTGGATCAAACCCCAGGGGACCCTCGAGCAAAGCTTCTTGAGATCATTTGGAAGTCCAGCATTCCAATGAAGATCAAATTCTTCATGTGGT
GCCTGATACAAAGAAGAATAAATACGATGGAGGTTATTCAGCAAAAAATGCCCAACACTCTCCTTCAACCCAACTGGTGCGTGTTGTGCAACAAAGACAGTGAAACAGGA
AATCACCTTTTTCTTCGATGCGAAGCTGTGAAACCCTTATGGTCCTTTCTCCAGCACTCCCTCAATTTCGCTCTTGGTTCCGATGAATTTGAGGATCTGTTCTCCTTCTT
CCACTCTCTAAATTGCTCCTTCCCGAAACACAAGGTTATTTTTTGTGGAATTATAGCTTTATTTTGGGAAATTTGGTGTGAAAGGAACTTTAGAATTTTTGGAACCTCTA
GCTCTCATAAAACAATTGCTAATATGTGGGAGGACTGTAAAATCCTTATAGGCAATTGGTGCAGTAGGGATCCCCATTTTAAAAATTATTCGGCTGCTACAATTGCTTTA
AACCTTAGCACCTTCTGTAATTATATCTTGGACTTTTCTCTAGCCCCTCATTAA
Protein sequenceShow/hide protein sequence
MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQKNRKSCILV
LEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCQLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSGNSFCDSPSFDLLENTLVIVRRFFHDDWHKI
LQNLRKQTDESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEETRSAKNL
IEARIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEEWLIERSVRLHGTFKRQAAASFDDFNPESEQFFFEGSEAISPDFLSTSSDDRKSSTPDQPSALKSVIIKPDR
NATLPSFLNEELVNDSNLHAMANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPNSAPANHSPSLNSPEKKQKVSRERSIKKKSS
FTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHSSDNAEVIDITNTEVVTETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKK
EEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQ
EEGLFSLSANFLLNNNSSWWLTGLYGPLHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWK
NLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSITLSDPEFKRNMGRWWENLIQAGHPRFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKE
LDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEKGSIQNTNNSISIAFIKFFSRIYRSSTKSDPLFIENLDWNP
IASSEWSHLCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWYLLKEDIMDIFKDFYDKCDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKN
RQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNWDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLS
HLESSGAIKGVSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNV
EDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGNNEWSHLINWTKVSKSKEEGGLGISRLNVTNKALLSKWLWRYLSEPNAL
WRRLIQCKYKGKFPGDIPSNISSSTSKALWRSIIDSTDWFKSNQSWDLNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRERC
NWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASTKVLISRQLDQTPGDPRAKLLEIIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPNWCVLCNKDSETG
NHLFLRCEAVKPLWSFLQHSLNFALGSDEFEDLFSFFHSLNCSFPKHKVIFCGIIALFWEIWCERNFRIFGTSSSHKTIANMWEDCKILIGNWCSRDPHFKNYSAATIAL
NLSTFCNYILDFSLAPH