CuGenDBv2

IVF0009816 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0009816
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr09:6721985..6725623
RNA-Seq Expression: IVF0009816
Synteny: IVF0009816
Gene Ontology terms: NA
InterPro domains:
  IPR025558 - Domain of unknown function DUF4283
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 0.0; identity: 74.46%)
Query:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR
        AHKAFSIEVSPRDLDWIR TLKSLI TPS+NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILV EG +KS WVSFLSMITPKVEVKAKTR
Subjt:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR

Query:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF
        P FLPR SP+ RLSPPIDYHKRSYAKAV+EGR   +SD SDSY S+DSS SSGNS CD P P LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAF
Subjt:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF

Query:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV
        HAEK LVHF                   KY+VRFEKW+PA HA+PKLIPSYGGWTTFRGIPLH+WNMMTFQQIGKA GGLIKVAEET++A+NLIEAK+K+
Subjt:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV

Query:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII
        RYNYSGFLPA V+IFD EGNKF VQVVTH EGKWL+ERNV+LHGTFKRQAAASFDDFNP+SEQF F+G EAIS D L+T S  RKS + +QPSALKSVII
Subjt:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII

Query:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK
        KP + ATSP+ LNEEVVND++LH TANKS+L+IL GISNDG LDKGKQKVDI  Q  SA    K KRKVSFNSPSNKT  FNPDSAPANHSP     EKK
Subjt:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK

Query:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS
        ++VSRERS+KKKSS+IQP  +ANQ KG  ITQP+Q+VAHD +ASKKGLSLTVDLG+LP LDP+KSFEDHHSSDN EVIDITNT VVPETPE+KM   E S
Subjt:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS

Query:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS
        NSS E NYRK KH H+R++YYRKKE+KEKD +SEAFK QLV+WLKENGLKLS DTDSSGATTSTN L +QL S                           
Subjt:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS

Query:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN
         S+GGILILWDAQ+HSLLSQEEG FSLS  F   NNS  WLTGLYGPVKRRER++ W +LHNL HLNS PWI+GGDLNV+RMREEST+V  SSHSS MLN
Subjt:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN

Query:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE
        +FISNNLLIDPPLTNN +TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTLPR TSDHFPLVCEDS   L WGP PFRLNSIAL+DPEFKRNM RWWE
Subjt:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE

Query:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS
         S+Q+GHPGF FIQRLKSLANLIKPWQKEK  S T AKE IIREVDSIDK +LDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+
Subjt:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS

Query:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST------------------------------FSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI
        FFHRI SSRQKR+ IHEIQDEEGS QNTNN+IS                               +S+WS LCA F E EIKGVI SFDG K PGPDGFPI
Subjt:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST------------------------------FSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI

Query:  SFFKSYWYLLKEDIMD
        SFFKSYW+LLKEDI+D
Subjt:  SFFKSYWYLLKEDIMD

KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa] (e value: 0.0; identity: 69.43%)
Query:  MITPKVEVKAKTRPTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNL
        MITPKVEVK KTRPTFLPR SP+ RLSPPIDYHKRSYAK VTEGRPF TSD SDSY S+DSSHSSGNSFCD PSPDLLENTVV+VRRFFHDDW KILQNL
Subjt:  MITPKVEVKAKTRPTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNL

Query:  RKQTEESFTYNAFHAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEET
        RKQTEESFTYNAFHAEKALVHF                   KYSVRFEKWSPAYHATPKLIPSYGGWTTF+                             
Subjt:  RKQTEESFTYNAFHAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEET

Query:  RSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQF----FFEGSEAISLDFLSTSSDD
        R++  L+E                                                          +DDF+   E       F+GSEAIS DFLSTSS  
Subjt:  RSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQF----FFEGSEAISLDFLSTSSDD

Query:  RKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNP
        RKSST DQPSALKSVIIKPD+AATSP++LNEEVVNDSNLH TANKSRLEIL GI NDGVLDKGKQKVDIQL PNSALNL+K KRKVSFNSPSNKTNIFNP
Subjt:  RKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNP

Query:  DSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNT
        DSAPANHS  LSSPEKKQKVSRERSIKKKSSSIQP     QNKGV ITQPIQ+VAHD EASKKGLSL V+LGDLP LDP+KSFEDHHSS N EVIDITNT
Subjt:  DSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNT

Query:  VVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEA--------FKKQLVSWLKENGLKLSTDTDSSGATTSTN---VLLNQLS
         VVPETPEMKMPVNENSNSSSEANYRKPKHVH+R+YYYRKK  K +               K +L++W        S       A  S +   V+L +  
Subjt:  VVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEA--------FKKQLVSWLKENGLKLSTDTDSSGATTSTN---VLLNQLS

Query:  SGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWI
          LKITNKRIIKS WPSN+INWI KNASGSSGGILILWDAQ+HSLLSQEE  FSLS  F LNNNSS WLTGLYGP KRR+RIHFWA+LHNLQHLNSFPW 
Subjt:  SGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWI

Query:  LGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKL
        L  DLNVIRMREE+TS+LSSSHSSRMLNNFISNNLLIDPPLTNN FTWSNLRNP TFSRIDRFLYNSSWENLFSPHTTRTLPR TSDHFPLVCEDSNPKL
Subjt:  LGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKL

Query:  SWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADL
         WGP PFRLNSIAL+DPEFKRNM RWWENS+Q+GHPGFSFIQRLKSLAN IKPWQKEKLHS  YAKETIIREVDSIDKK+LDTPL+Q+ESNRRLALKA+L
Subjt:  SWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADL

Query:  SELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSISTFSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPISF
        S+LSLKESQF   ++                   S++    FI          +N + +   FSEW HLCA FLE EIKGVINSFDGKK P PDGFPISF
Subjt:  SELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSISTFSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPISF

Query:  FKSYWYLLKEDIMD
        FKSYW+LLKEDIMD
Subjt:  FKSYWYLLKEDIMD

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 0.0; identity: 90.33%)
Query:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR
        AHKAFSIEVSPRDLDWIRCTLKSLIATP+TNRFFLETRDSEQ IWIRKTRNSKGCTAEIFRVDQKNRKSCILV EGPDKSGWVSFLSMITPKVEVKAKTR
Subjt:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR

Query:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF
        PTFLPR SPDCRLSPPIDYHKRSYAKAVTEGRPFATSD SDSYDS+DSSHSS NSFCD PS DLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF
Subjt:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF

Query:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV
        HAEKALVHF                   KYSVRFEKWSP YHATPKLIPSYGGWTTFRGIPLH+WNMMTFQQIGKA  GLIKVAEETRSAKNLIEA+IKV
Subjt:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV

Query:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII
        RYNYSGFLPANVRIFDNEGNKF VQVVTHPEGKWLIERNV+LHGTFKRQAAASFDDFNPESEQFFFEGSEAIS DFLSTSSD RKSST DQPSALKSVII
Subjt:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII

Query:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK
        KPDR AT PSFLNEE+VNDSNLH TANKS+LEIL GISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSP L+SPEKK
Subjt:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK

Query:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS
        QKVSRERSIKKKSSS QPNSKANQNKGVFITQPIQIVAHDR+A+KKGLSLTVDLGDLPALDPNKS EDHH+SDN EV+DITNT VVPETPEMKMPVNENS
Subjt:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS

Query:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS
        NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLK+NGLKLSTDTDSSGATTSTNVLLNQ++SGLKITNKRIIKSLWPSN+INWIAKNAS
Subjt:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS

Query:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN
        GSSGGILILWDAQNHSLLSQEEG FSLS  FLLNNNSS WLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSH+SRMLN
Subjt:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN

Query:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE
        NFISNNLLIDPPLTNN FTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGP+PFRLNSI LSDPEFKRNMGRWWE
Subjt:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE

Query:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS
        NSIQ G+PGFSFIQRLKSLAN IKPWQKEKLHS TYAKE IIREVDSIDKK+LDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS
Subjt:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS

Query:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSISTF------------------------------SEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI
        FFHRI SSRQKRSFIHEIQDEEGS QNTNNSIST                               SEWSHLCA FLEGEIKGVINSFDGKKTPGPDGFPI
Subjt:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSISTF------------------------------SEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI

Query:  SFFKSYW
        SFFKS+W
Subjt:  SFFKSYW

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 0.0; identity: 74.46%)
Query:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR
        AHKAFSIEVSPRDLDWIR TLKSLI TPS+NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILV EG +KS WVSFLSMITPKVEVKAKTR
Subjt:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR

Query:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF
        P FLPR SP+ RLSPPIDYHKRSYAKAV+EGR   +SD SDSY S+DSS SSGNS CD P P LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAF
Subjt:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF

Query:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV
        HAEK LVHF                   KY+VRFEKW+PA HA+PKLIPSYGGWTTFRGIPLH+WNMMTFQQIGKA GGLIKVAEET++A+NLIEAK+K+
Subjt:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV

Query:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII
        RYNYSGFLPA V+IFD EGNKF VQVVTH EGKWL+ERNV+LHGTFKRQAAASFDDFNP+SEQF F+G EAIS D L+T S  RKS + +QPSALKSVII
Subjt:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII

Query:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK
        KP + ATSP+ LNEEVVND++LH TANKS+L+IL GISNDG LDKGKQKVDI  Q  SA    K KRKVSFNSPSNKT  FNPDSAPANHSP     EKK
Subjt:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK

Query:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS
        ++VSRERS+KKKSS+IQP  +ANQ KG  ITQP+Q+VAHD +ASKKGLSLTVDLG+LP LDP+KSFEDHHSSDN EVIDITNT VVPETPE+KM   E S
Subjt:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS

Query:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS
        NSS E NYRK KH H+R++YYRKKE+KEKD +SEAFK QLV+WLKENGLKLS DTDSSGATTSTN L +QL S                           
Subjt:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS

Query:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN
         S+GGILILWDAQ+HSLLSQEEG FSLS  F   NNS  WLTGLYGPVKRRER++ W +LHNL HLNS PWI+GGDLNV+RMREEST+V  SSHSS MLN
Subjt:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN

Query:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE
        +FISNNLLIDPPLTNN +TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTLPR TSDHFPLVCEDS   L WGP PFRLNSIAL+DPEFKRNM RWWE
Subjt:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE

Query:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS
         S+Q+GHPGF FIQRLKSLANLIKPWQKEK  S T AKE IIREVDSIDK +LDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+
Subjt:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS

Query:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST------------------------------FSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI
        FFHRI SSRQKR+ IHEIQDEEGS QNTNN+IS                               +S+WS LCA F E EIKGVI SFDG K PGPDGFPI
Subjt:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST------------------------------FSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI

Query:  SFFKSYWYLLKEDIMD
        SFFKSYW+LLKEDI+D
Subjt:  SFFKSYWYLLKEDIMD

TYK05719.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 0.0; identity: 100%)
Query:  MWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQ
        MWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQ
Subjt:  MWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQ

Query:  FFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLD
        FFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLD
Subjt:  FFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPN
        KSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPN
Subjt:  KSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPN

Query:  KSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS
        KSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS
Subjt:  KSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS

Query:  TNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNL
        TNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNL
Subjt:  TNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNL

Query:  QHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPL
        QHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPL
Subjt:  QHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPL

Query:  VCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESN
        VCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESN
Subjt:  VCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESN

Query:  RRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST
        RRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST
Subjt:  RRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein (e value: 0.0e+00; identity: 74.46%)
Query:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR
        AHKAFSIEVSPRDLDWIR TLKSLI TPS+NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILV EG +KS WVSFLSMITPKVEVKAKTR
Subjt:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR

Query:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF
        P FLPR SP+ RLSPPIDYHKRSYAKAV+EGR   +SD SDSY S+DSS SSGNS CD P P LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAF
Subjt:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF

Query:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV
        HAEK LVHF                   KY+VRFEKW+PA HA+PKLIPSYGGWTTFRGIPLH+WNMMTFQQIGKA GGLIKVAEET++A+NLIEAK+K+
Subjt:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV

Query:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII
        RYNYSGFLPA V+IFD EGNKF VQVVTH EGKWL+ERNV+LHGTFKRQAAASFDDFNP+SEQF F+G EAIS D L+T S  RKS + +QPSALKSVII
Subjt:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII

Query:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK
        KP + ATSP+ LNEEVVND++LH TANKS+L+IL GISNDG LDKGKQKVDI  Q  SA    K KRKVSFNSPSNKT  FNPDSAPANH     SPEKK
Subjt:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK

Query:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS
        ++VSRERS+KKKSS+IQP  +ANQ KG  ITQP+Q+VAHD +ASKKGLSLTVDLG+LP LDP+KSFEDHHSSDN EVIDITNT VVPETPE+KM   E S
Subjt:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS

Query:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS
        NSS E NYRK KH H+R++YYRKKE+KEKD +SEAFK QLV+WLKENGLKLS DTDSSGATTSTN L +QL S                           
Subjt:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS

Query:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN
         S+GGILILWDAQ+HSLLSQEEG FSLS  F   NN S WLTGLYGPVKRRER++ W +LHNL HLNS PWI+GGDLNV+RMREEST+V  SSHSS MLN
Subjt:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN

Query:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE
        +FISNNLLIDPPLTNN +TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTLPR TSDHFPLVCEDS   L WGP PFRLNSIAL+DPEFKRNM RWWE
Subjt:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE

Query:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS
         S+Q+GHPGF FIQRLKSLANLIKPWQKEK  S T AKE IIREVDSIDK +LDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+
Subjt:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS

Query:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIS------------------------------TFSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI
        FFHRI SSRQKR+ IHEIQDEEGS QNTNN+IS                               +S+WS LCA F E EIKGVI SFDG K PGPDGFPI
Subjt:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIS------------------------------TFSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI

Query:  SFFKSYWYLLKEDIMD
        SFFKSYW+LLKEDI+D
Subjt:  SFFKSYWYLLKEDIMD

A0A5A7UV84 Reverse transcriptase domain-containing protein (e value: 0.0e+00; identity: 69.46%)
Query:  MITPKVEVKAKTRPTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNL
        MITPKVEVK KTRPTFLPR SP+ RLSPPIDYHKRSYAK VTEGRPF TSD SDSY S+DSSHSSGNSFCD PSPDLLENTVV+VRRFFHDDW KILQNL
Subjt:  MITPKVEVKAKTRPTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNL

Query:  RKQTEESFTYNAFHAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEET
        RKQTEESFTYNAFHAEKALVHF                   KYSVRFEKWSPAYHATPKLIPSYGGWTTF+                             
Subjt:  RKQTEESFTYNAFHAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEET

Query:  RSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISLDFLSTSSDD
        R++  L+E                                                          +DDF+   E       F+GSEAIS DFLSTSS  
Subjt:  RSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQ----FFFEGSEAISLDFLSTSSDD

Query:  RKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNP
        RKSST DQPSALKSVIIKPD+AATSP++LNEEVVNDSNLH TANKSRLEIL GI NDGVLDKGKQKVDIQL PNSALNL+K KRKVSFNSPSNKTNIFNP
Subjt:  RKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNP

Query:  DSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNT
        DSAPANHS  LSSPEKKQKVSRERSIKKKSSSIQP     QNKGV ITQPIQ+VAHD EASKKGLSL V+LGDLP LDP+KSFEDHHSS N EVIDITNT
Subjt:  DSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNT

Query:  VVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGL----KLSTDTDSSGATTSTNVLLNQL
         VVPETPEMKMPVNENSNSSSEANYRKPKHVH+R+YYYRKK  K        E        K +L++W    GL    K +   ++  + +   V+L + 
Subjt:  VVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK--------EKDPDSEAFKKQLVSWLKENGL----KLSTDTDSSGATTSTNVLLNQL

Query:  SSGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPW
         + LKITNKRIIKS WPSN+INWI KNASGSSGGILILWDAQ+HSLLSQEE  FSLS  F LNNNSS WLTGLYGP KRR+RIHFWA+LHNLQHLNSFPW
Subjt:  SSGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPW

Query:  ILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPK
         L  DLNVIRMREE+TS+LSSSHSSRMLNNFISNNLLIDPPLTNN FTWSNLRNP TFSRIDRFLYNSSWENLFSPHTTRTLPR TSDHFPLVCEDSNPK
Subjt:  ILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPK

Query:  LSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKAD
        L WGP PFRLNSIAL+DPEFKRNM RWWENS+Q+GHPGFSFIQRLKSLAN IKPWQKEKLHS  YAKETIIREVDSIDKK+LDTPL+Q+ESNRRLALKA+
Subjt:  LSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKAD

Query:  LSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSISTFSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPIS
        LS+LSLKESQF   ++                   S++    FI          +N + +   FSEW HLCA FLE EIKGVINSFDGKK P PDGFPIS
Subjt:  LSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSISTFSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPIS

Query:  FFKSYWYLLKEDIMD
        FFKSYW+LLKEDIMD
Subjt:  FFKSYWYLLKEDIMD

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein (e value: 0.0e+00; identity: 74.46%)
Query:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR
        AHKAFSIEVSPRDLDWIR TLKSLI TPS+NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILV EG +KS WVSFLSMITPKVEVKAKTR
Subjt:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR

Query:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF
        P FLPR SP+ RLSPPIDYHKRSYAKAV+EGR   +SD SDSY S+DSS SSGNS CD P P LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAF
Subjt:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF

Query:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV
        HAEK LVHF                   KY+VRFEKW+PA HA+PKLIPSYGGWTTFRGIPLH+WNMMTFQQIGKA GGLIKVAEET++A+NLIEAK+K+
Subjt:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV

Query:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII
        RYNYSGFLPA V+IFD EGNKF VQVVTH EGKWL+ERNV+LHGTFKRQAAASFDDFNP+SEQF F+G EAIS D L+T S  RKS + +QPSALKSVII
Subjt:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII

Query:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK
        KP + ATSP+ LNEEVVND++LH TANKS+L+IL GISNDG LDKGKQKVDI  Q  SA    K KRKVSFNSPSNKT  FNPDSAPANH     SPEKK
Subjt:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK

Query:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS
        ++VSRERS+KKKSS+IQP  +ANQ KG  ITQP+Q+VAHD +ASKKGLSLTVDLG+LP LDP+KSFEDHHSSDN EVIDITNT VVPETPE+KM   E S
Subjt:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS

Query:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS
        NSS E NYRK KH H+R++YYRKKE+KEKD +SEAFK QLV+WLKENGLKLS DTDSSGATTSTN L +QL S                           
Subjt:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS

Query:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN
         S+GGILILWDAQ+HSLLSQEEG FSLS  F   NN S WLTGLYGPVKRRER++ W +LHNL HLNS PWI+GGDLNV+RMREEST+V  SSHSS MLN
Subjt:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN

Query:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE
        +FISNNLLIDPPLTNN +TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTLPR TSDHFPLVCEDS   L WGP PFRLNSIAL+DPEFKRNM RWWE
Subjt:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE

Query:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS
         S+Q+GHPGF FIQRLKSLANLIKPWQKEK  S T AKE IIREVDSIDK +LDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+
Subjt:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS

Query:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIS------------------------------TFSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI
        FFHRI SSRQKR+ IHEIQDEEGS QNTNN+IS                               +S+WS LCA F E EIKGVI SFDG K PGPDGFPI
Subjt:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIS------------------------------TFSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI

Query:  SFFKSYWYLLKEDIMD
        SFFKSYW+LLKEDI+D
Subjt:  SFFKSYWYLLKEDIMD

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein (e value: 0.0e+00; identity: 90.33%)
Query:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR
        AHKAFSIEVSPRDLDWIRCTLKSLIATP+TNRFFLETRDSEQ IWIRKTRNSKGCTAEIFRVDQKNRKSCILV EGPDKSGWVSFLSMITPKVEVKAKTR
Subjt:  AHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTR

Query:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF
        PTFLPR SPDCRLSPPIDYHKRSYAKAVTEGRPFATSD SDSYDS+DSSHSS NSFCD PS DLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF
Subjt:  PTFLPRISPDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAF

Query:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV
        HAEKALVHF                   KYSVRFEKWSP YHATPKLIPSYGGWTTFRGIPLH+WNMMTFQQIGKA  GLIKVAEETRSAKNLIEA+IKV
Subjt:  HAEKALVHFR------------------KYSVRFEKWSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKV

Query:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII
        RYNYSGFLPANVRIFDNEGNKF VQVVTHPEGKWLIERNV+LHGTFKRQAAASFDDFNPESEQFFFEGSEAIS DFLSTSSD RKSST DQPSALKSVII
Subjt:  RYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVII

Query:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK
        KPDR AT PSFLNEE+VNDSNLH TANKS+LEIL GISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSP L+SPEKK
Subjt:  KPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKK

Query:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS
        QKVSRERSIKKKSSS QPNSKANQNKGVFITQPIQIVAHDR+A+KKGLSLTVDLGDLPALDPNKS EDHH+SDN EV+DITNT VVPETPEMKMPVNENS
Subjt:  QKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENS

Query:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS
        NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLK+NGLKLSTDTDSSGATTSTNVLLNQ++SGLKITNKRIIKSLWPSN+INWIAKNAS
Subjt:  NSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNAS

Query:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN
        GSSGGILILWDAQNHSLLSQEEG FSLS  FLLNNNSS WLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSH+SRMLN
Subjt:  GSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLN

Query:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE
        NFISNNLLIDPPLTNN FTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGP+PFRLNSI LSDPEFKRNMGRWWE
Subjt:  NFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE

Query:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS
        NSIQ G+PGFSFIQRLKSLAN IKPWQKEKLHS TYAKE IIREVDSIDKK+LDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS
Subjt:  NSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSS

Query:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST------------------------------FSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI
        FFHRI SSRQKRSFIHEIQDEEGS QNTNNSIST                               SEWSHLCA FLEGEIKGVINSFDGKKTPGPDGFPI
Subjt:  FFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST------------------------------FSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPI

Query:  SFFKSYW
        SFFKS+W
Subjt:  SFFKSYW

A0A5D3C5P2    LINE-1 retrotransposable element ORF2 protein    0.0e+00    100
Query:  MWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQ
        MWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQ
Subjt:  MWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTFKRQAAASFDDFNPESEQ

Query:  FFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLD
        FFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLD
Subjt:  FFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPN
        KSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPN
Subjt:  KSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPN

Query:  KSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS
        KSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS
Subjt:  KSFEDHHSSDNVEVIDITNTVVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTS

Query:  TNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNL
        TNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNL
Subjt:  TNVLLNQLSSGLKITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNL

Query:  QHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPL
        QHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPL
Subjt:  QHLNSFPWILGGDLNVIRMREESTSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPL

Query:  VCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESN
        VCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESN
Subjt:  VCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESN

Query:  RRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST
        RRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST
Subjt:  RRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQDEEGSTQNTNNSIST

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT1G43760.1    DNAse I-like superfamily protein    4.9e-16    24.29
Query:  ILGGDLNVIRMREESTSVLSSSHSSRMLNNF---ISNNLLIDPPLTNNTFTWSNLRNP-PTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFP-LVCE
        IL GD + I    +  SVL +S   R L  F   + ++ L+D P     +TWSN ++  P   ++DR + N  W + F            SDH P ++  
Subjt:  ILGGDLNVIRMREESTSVLSSSHSSRMLNNF---ISNNLLIDPPLTNNTFTWSNLRNP-PTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFP-LVCE

Query:  DSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRL
        ++ PK S     FR  S   + P F  ++   WE  I  G   FS  + LK+     K   ++   +  +  +  +  ++SI  + L  P         +
Subjt:  DSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRL

Query:  ALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQ-DEEGSTQN-----------------------TNNSISTFSE-----
        A K      +  ES F+ Q+++  WL++GD N+ FFH++  + Q ++ I  ++ D++   +N                       T +S+    +     
Subjt:  ALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRIFSSRQKRSFIHEIQ-DEEGSTQN-----------------------TNNSISTFSE-----

Query:  -----WSHLCASFLEGEIKGVINSFDGKKTPGPDGFPISFFKSYWYLLKE
              S L A   + EI   + +    K PGPD F   FF   W+++K+
Subjt:  -----WSHLCASFLEGEIKGVINSFDGKKTPGPDGFPISFFKSYWYLLKE


Sequences
CDS sequence
ATGCAAGCTCACAAGGCTTTCTCCATCGAAGTTTCCCCAAGAGACTTAGACTGGATAAGATGTACTCTGAAATCATTGATCGCAACTCCAAGTACGAACCGTTTCTTCCT
TGAGACCCGTGACTCTGAGCAATGCATCTGGATCAGGAAAACAAGAAACAGTAAAGGATGTACTGCAGAAATTTTTAGAGTTGATCAAAAAAACAGAAAATCATGTATTC
TAGTTTCGGAAGGTCCTGATAAAAGTGGCTGGGTTTCCTTCTTGTCCATGATTACCCCAAAAGTGGAAGTGAAAGCAAAAACAAGACCAACCTTTTTGCCAAGGATCAGT
CCTGATTGTCGACTATCTCCTCCCATTGACTACCACAAACGATCATACGCAAAAGCTGTCACTGAAGGAAGACCTTTTGCCACAAGTGACTTAAGTGACTCTTACGATTC
AAACGATTCAAGCCATTCATCAGGTAATAGTTTTTGTGACCCTCCCTCACCTGATCTTCTTGAAAATACAGTGGTGATAGTTAGACGATTCTTTCATGATGACTGGCATA
AAATCCTTCAAAACCTGAGGAAACAAACAGAGGAATCTTTCACTTACAATGCTTTCCATGCTGAAAAGGCTTTAGTTCATTTTAGGAAGTACTCGGTAAGATTTGAAAAA
TGGTCCCCTGCGTACCACGCCACTCCAAAACTTATTCCTAGTTATGGAGGATGGACAACTTTCAGAGGAATTCCGCTACACATGTGGAATATGATGACTTTTCAACAAAT
TGGGAAAGCCTACGGAGGTTTGATTAAAGTGGCTGAGGAAACAAGATCAGCAAAAAACCTGATAGAAGCAAAGATAAAAGTCAGATACAATTACTCAGGTTTCTTACCAG
CAAATGTAAGGATTTTCGATAATGAAGGAAACAAATTTTCCGTTCAAGTAGTTACTCACCCAGAAGGCAAATGGTTAATAGAAAGGAATGTCAAATTACATGGTACCTTC
AAGAGACAAGCTGCAGCCTCCTTTGATGACTTCAATCCTGAATCAGAACAATTCTTTTTCGAAGGATCGGAGGCCATATCGCTGGACTTTCTTTCCACCAGCTCCGACGA
CCGTAAAAGCAGCACATCGGATCAGCCATCTGCATTAAAATCTGTTATCATTAAACCTGACAGAGCTGCCACGTCGCCAAGCTTCTTAAATGAAGAGGTAGTTAATGATA
GTAATTTGCATGGAACGGCTAATAAATCCAGGTTAGAGATATTATATGGGATATCAAATGATGGCGTATTGGACAAAGGAAAACAGAAGGTTGACATTCAGCTTCAACCC
AATTCAGCATTAAATTTGGACAAATCCAAAAGGAAAGTCTCCTTTAATTCTCCCAGTAATAAAACCAACATCTTTAACCCGGATTCTGCTCCAGCCAATCATTCTCCATT
ATTGAGTTCCCCTGAGAAAAAACAGAAAGTTAGTAGAGAGAGAAGTATCAAGAAGAAATCATCCTCCATTCAGCCGAATTCAAAAGCCAATCAGAACAAAGGTGTATTCA
TTACTCAACCAATTCAAATTGTGGCACACGATCGGGAAGCTTCCAAAAAAGGTCTCTCTCTCACTGTAGATCTGGGAGATCTGCCAGCTTTGGATCCAAATAAATCCTTT
GAAGACCATCACAGCTCTGACAATGTAGAAGTTATTGATATAACAAACACTGTAGTGGTTCCTGAGACACCTGAAATGAAAATGCCAGTTAATGAGAATTCAAATTCTTC
TTCTGAAGCCAACTACAGGAAACCAAAACATGTTCATAAAAGAAAATATTACTACAGGAAAAAAGAAGAAAAGGAGAAGGATCCGGACTCAGAGGCCTTCAAAAAACAAC
TTGTTTCCTGGTTAAAGGAAAATGGTCTGAAACTATCTACGGACACTGACTCTTCAGGTGCAACTACTTCAACAAATGTTTTGTTAAATCAATTGAGTTCTGGGCTTAAG
ATCACAAACAAGAGAATCATAAAGTCCCTTTGGCCCTCTAATAACATTAATTGGATTGCTAAAAATGCTTCTGGTAGTTCTGGAGGGATCTTAATTCTATGGGATGCTCA
GAATCATTCTCTTTTAAGTCAAGAGGAAGGGTTTTTTAGCCTATCACCAATTTTTTTGCTCAACAACAATTCGTCCCGGTGGTTAACAGGTCTTTACGGTCCAGTCAAAA
GGAGGGAAAGAATTCATTTCTGGGCGGAGTTACATAATCTTCAACATCTTAATTCCTTCCCTTGGATTTTAGGAGGTGATCTTAATGTCATCAGAATGAGAGAGGAATCA
ACGTCAGTTTTGAGCTCTTCTCACAGCTCCAGAATGTTGAACAATTTCATCTCCAACAATCTTCTGATAGATCCTCCTCTCACAAACAATACATTCACATGGTCAAACTT
AAGGAATCCCCCTACCTTTTCCCGAATTGATAGATTCCTTTACAATTCAAGTTGGGAAAATCTCTTCAGCCCCCATACTACAAGGACCCTTCCTAGATCTACTTCAGACC
ACTTTCCTCTGGTCTGTGAAGATTCCAACCCCAAACTCAGTTGGGGTCCTGTCCCGTTCCGTTTAAACTCCATAGCTCTAAGTGACCCAGAATTCAAAAGAAACATGGGA
AGATGGTGGGAAAACTCGATCCAAGACGGTCACCCCGGATTCTCTTTCATCCAAAGGTTAAAGTCTTTAGCAAATCTTATTAAACCTTGGCAAAAGGAGAAATTACACTC
TTTCACCTATGCTAAAGAAACCATTATAAGAGAAGTGGACTCTATTGACAAAAAGAAATTGGATACTCCTTTGACACAGGAGGAAAGTAATCGTCGATTAGCTCTAAAAG
CTGATCTCAGTGAGTTATCCCTCAAGGAGTCCCAATTCTGGTATCAAAGGGCTAAAAAGCTTTGGCTTAGGGAGGGAGATGAAAATTCCTCCTTCTTTCATAGAATTTTC
TCATCAAGACAAAAGAGAAGTTTCATTCATGAAATCCAGGATGAAGAAGGCTCGACTCAGAATACAAACAATAGTATATCAACTTTTTCTGAGTGGTCACACCTTTGTGC
CTCTTTTTTAGAAGGAGAGATTAAAGGGGTTATCAACTCTTTTGATGGAAAAAAGACTCCTGGTCCAGACGGCTTCCCTATTTCCTTCTTTAAATCTTACTGGTATCTTC
TAAAAGAGGACATCATGGATGAAGGGATGAAAACCCTTTACAGCGGAAAATATACAGAAACTCAATTTTAA
mRNA sequence
ATGCAAGCTCACAAGGCTTTCTCCATCGAAGTTTCCCCAAGAGACTTAGACTGGATAAGATGTACTCTGAAATCATTGATCGCAACTCCAAGTACGAACCGTTTCTTCCT
TGAGACCCGTGACTCTGAGCAATGCATCTGGATCAGGAAAACAAGAAACAGTAAAGGATGTACTGCAGAAATTTTTAGAGTTGATCAAAAAAACAGAAAATCATGTATTC
TAGTTTCGGAAGGTCCTGATAAAAGTGGCTGGGTTTCCTTCTTGTCCATGATTACCCCAAAAGTGGAAGTGAAAGCAAAAACAAGACCAACCTTTTTGCCAAGGATCAGT
CCTGATTGTCGACTATCTCCTCCCATTGACTACCACAAACGATCATACGCAAAAGCTGTCACTGAAGGAAGACCTTTTGCCACAAGTGACTTAAGTGACTCTTACGATTC
AAACGATTCAAGCCATTCATCAGGTAATAGTTTTTGTGACCCTCCCTCACCTGATCTTCTTGAAAATACAGTGGTGATAGTTAGACGATTCTTTCATGATGACTGGCATA
AAATCCTTCAAAACCTGAGGAAACAAACAGAGGAATCTTTCACTTACAATGCTTTCCATGCTGAAAAGGCTTTAGTTCATTTTAGGAAGTACTCGGTAAGATTTGAAAAA
TGGTCCCCTGCGTACCACGCCACTCCAAAACTTATTCCTAGTTATGGAGGATGGACAACTTTCAGAGGAATTCCGCTACACATGTGGAATATGATGACTTTTCAACAAAT
TGGGAAAGCCTACGGAGGTTTGATTAAAGTGGCTGAGGAAACAAGATCAGCAAAAAACCTGATAGAAGCAAAGATAAAAGTCAGATACAATTACTCAGGTTTCTTACCAG
CAAATGTAAGGATTTTCGATAATGAAGGAAACAAATTTTCCGTTCAAGTAGTTACTCACCCAGAAGGCAAATGGTTAATAGAAAGGAATGTCAAATTACATGGTACCTTC
AAGAGACAAGCTGCAGCCTCCTTTGATGACTTCAATCCTGAATCAGAACAATTCTTTTTCGAAGGATCGGAGGCCATATCGCTGGACTTTCTTTCCACCAGCTCCGACGA
CCGTAAAAGCAGCACATCGGATCAGCCATCTGCATTAAAATCTGTTATCATTAAACCTGACAGAGCTGCCACGTCGCCAAGCTTCTTAAATGAAGAGGTAGTTAATGATA
GTAATTTGCATGGAACGGCTAATAAATCCAGGTTAGAGATATTATATGGGATATCAAATGATGGCGTATTGGACAAAGGAAAACAGAAGGTTGACATTCAGCTTCAACCC
AATTCAGCATTAAATTTGGACAAATCCAAAAGGAAAGTCTCCTTTAATTCTCCCAGTAATAAAACCAACATCTTTAACCCGGATTCTGCTCCAGCCAATCATTCTCCATT
ATTGAGTTCCCCTGAGAAAAAACAGAAAGTTAGTAGAGAGAGAAGTATCAAGAAGAAATCATCCTCCATTCAGCCGAATTCAAAAGCCAATCAGAACAAAGGTGTATTCA
TTACTCAACCAATTCAAATTGTGGCACACGATCGGGAAGCTTCCAAAAAAGGTCTCTCTCTCACTGTAGATCTGGGAGATCTGCCAGCTTTGGATCCAAATAAATCCTTT
GAAGACCATCACAGCTCTGACAATGTAGAAGTTATTGATATAACAAACACTGTAGTGGTTCCTGAGACACCTGAAATGAAAATGCCAGTTAATGAGAATTCAAATTCTTC
TTCTGAAGCCAACTACAGGAAACCAAAACATGTTCATAAAAGAAAATATTACTACAGGAAAAAAGAAGAAAAGGAGAAGGATCCGGACTCAGAGGCCTTCAAAAAACAAC
TTGTTTCCTGGTTAAAGGAAAATGGTCTGAAACTATCTACGGACACTGACTCTTCAGGTGCAACTACTTCAACAAATGTTTTGTTAAATCAATTGAGTTCTGGGCTTAAG
ATCACAAACAAGAGAATCATAAAGTCCCTTTGGCCCTCTAATAACATTAATTGGATTGCTAAAAATGCTTCTGGTAGTTCTGGAGGGATCTTAATTCTATGGGATGCTCA
GAATCATTCTCTTTTAAGTCAAGAGGAAGGGTTTTTTAGCCTATCACCAATTTTTTTGCTCAACAACAATTCGTCCCGGTGGTTAACAGGTCTTTACGGTCCAGTCAAAA
GGAGGGAAAGAATTCATTTCTGGGCGGAGTTACATAATCTTCAACATCTTAATTCCTTCCCTTGGATTTTAGGAGGTGATCTTAATGTCATCAGAATGAGAGAGGAATCA
ACGTCAGTTTTGAGCTCTTCTCACAGCTCCAGAATGTTGAACAATTTCATCTCCAACAATCTTCTGATAGATCCTCCTCTCACAAACAATACATTCACATGGTCAAACTT
AAGGAATCCCCCTACCTTTTCCCGAATTGATAGATTCCTTTACAATTCAAGTTGGGAAAATCTCTTCAGCCCCCATACTACAAGGACCCTTCCTAGATCTACTTCAGACC
ACTTTCCTCTGGTCTGTGAAGATTCCAACCCCAAACTCAGTTGGGGTCCTGTCCCGTTCCGTTTAAACTCCATAGCTCTAAGTGACCCAGAATTCAAAAGAAACATGGGA
AGATGGTGGGAAAACTCGATCCAAGACGGTCACCCCGGATTCTCTTTCATCCAAAGGTTAAAGTCTTTAGCAAATCTTATTAAACCTTGGCAAAAGGAGAAATTACACTC
TTTCACCTATGCTAAAGAAACCATTATAAGAGAAGTGGACTCTATTGACAAAAAGAAATTGGATACTCCTTTGACACAGGAGGAAAGTAATCGTCGATTAGCTCTAAAAG
CTGATCTCAGTGAGTTATCCCTCAAGGAGTCCCAATTCTGGTATCAAAGGGCTAAAAAGCTTTGGCTTAGGGAGGGAGATGAAAATTCCTCCTTCTTTCATAGAATTTTC
TCATCAAGACAAAAGAGAAGTTTCATTCATGAAATCCAGGATGAAGAAGGCTCGACTCAGAATACAAACAATAGTATATCAACTTTTTCTGAGTGGTCACACCTTTGTGC
CTCTTTTTTAGAAGGAGAGATTAAAGGGGTTATCAACTCTTTTGATGGAAAAAAGACTCCTGGTCCAGACGGCTTCCCTATTTCCTTCTTTAAATCTTACTGGTATCTTC
TAAAAGAGGACATCATGGATGAAGGGATGAAAACCCTTTACAGCGGAAAATATACAGAAACTCAATTTTAA
Protein sequence
MQAHKAFSIEVSPRDLDWIRCTLKSLIATPSTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVSEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRIS
PDCRLSPPIDYHKRSYAKAVTEGRPFATSDLSDSYDSNDSSHSSGNSFCDPPSPDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFRKYSVRFEK
WSPAYHATPKLIPSYGGWTTFRGIPLHMWNMMTFQQIGKAYGGLIKVAEETRSAKNLIEAKIKVRYNYSGFLPANVRIFDNEGNKFSVQVVTHPEGKWLIERNVKLHGTF
KRQAAASFDDFNPESEQFFFEGSEAISLDFLSTSSDDRKSSTSDQPSALKSVIIKPDRAATSPSFLNEEVVNDSNLHGTANKSRLEILYGISNDGVLDKGKQKVDIQLQP
NSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPLLSSPEKKQKVSRERSIKKKSSSIQPNSKANQNKGVFITQPIQIVAHDREASKKGLSLTVDLGDLPALDPNKSF
EDHHSSDNVEVIDITNTVVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKENGLKLSTDTDSSGATTSTNVLLNQLSSGLK
ITNKRIIKSLWPSNNINWIAKNASGSSGGILILWDAQNHSLLSQEEGFFSLSPIFLLNNNSSRWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREES
TSVLSSSHSSRMLNNFISNNLLIDPPLTNNTFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMG
RWWENSIQDGHPGFSFIQRLKSLANLIKPWQKEKLHSFTYAKETIIREVDSIDKKKLDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRIF
SSRQKRSFIHEIQDEEGSTQNTNNSISTFSEWSHLCASFLEGEIKGVINSFDGKKTPGPDGFPISFFKSYWYLLKEDIMDEGMKTLYSGKYTETQF