CuGenDBv2

Pay0016528 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0016528
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr01:16728284..16732548
RNA-Seq Expression: Pay0016528
Synteny: Pay0016528
Gene Ontology terms: NA
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR026960 - Reverse transcriptase zinc-binding domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]; e value 0.0e+00; identity 65.55%
Query:  NSW--DYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQF----------------------------------------------VVMGDFNAIRVHSEA
        N W  +YSCSYSNSGVGRIWVMWKK RF F THVMDE+F                                              VVMGDFNAIRVHSEA
Subjt:  NSW--DYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQF----------------------------------------------VVMGDFNAIRVHSEA

Query:  FGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWV
        FGGSPIQGEME+FD AI                    + GSGMLRRLDRVLVND+WLSA PTM +NVLPWGISDHSPILFYPSFQ+NS+VVSFRFFNHWV
Subjt:  FGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWV

Query:  EDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQASLATETFWTA------------
        E+PSFIEVV RMWS HEGVS LVSLMRNLHHLKPILRR+FGRHIKSLSEE+ IAKEAMDIAQREVERNP+SDVLSRQASLATETFWTA            
Subjt:  EDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQASLATETFWTA------------

Query:  ----------NTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLISREEVRRVL
                  NTAFFHRSVRSR+SRNSLLSLVDSDGSRV SHDGVAQMAVNYFSNSLGSQEIGYRELSP            ECCQALQL ISREEVRRVL
Subjt:  ----------NTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLISREEVRRVL

Query:  FSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQ
        FSMDSGKAPG DGFSVGF+KGAW              F TCYLP+GVNATAITLIPKH GA+RLEDFRPISCCNVLYKCISKILADRL +WLPSFISSNQ
Subjt:  FSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQ

Query:  YAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------KDVRQGNPL
         AFI GRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNW+FLFGLLIAIGTPLKF                            K +RQG+PL
Subjt:  YAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------KDVRQGNPL

Query:  SPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVR
        SPFLFVMVMEVLSRMLNKIPQSF+FHHRCEK                               KFGE SGLFANPRKS IFV GVNNE AS LAAC+G   
Subjt:  SPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVR

Query:  GNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------
           P            LRS DCAP+IQ            +LSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN                           
Subjt:  GNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------

Query:  ------EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRWGAILEQVGERV
              EGGL IRDGPSWNIA+TLKIL   LT+LGSLWVAW+EAYILKG+SLWDVDSRVGRSWCLRAILRKREK+KHH                 VGERV
Subjt:  ------EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRWGAILEQVGERV

Query:  LYDAASRREAKLSDFIGSDGEWLWSR--------------------------------GGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAI
        LYDAASRREAKLSDFI  +GEWLW R                                GGFSIASAW+AI PRGGRVLWDGL+WGG NIPKHSFCAWLAI
Subjt:  LYDAASRREAKLSDFIGSDGEWLWSR--------------------------------GGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAI

Query:  KDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRSK----LWC---------RNHR
        KDRL TRDRLHRWDS +P+SCILCQGGVESRDHLFFS          V +IM SSHRIGHWGVELSWICH+GIGKGVR K    LWC         RNHR
Subjt:  KDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRSK----LWC---------RNHR

Query:  LHGGQARDHIVLFHLICTWIRARAGSWREDADLPF
        LHGG+ARD I+LFHLICTWIRARAGSWREDA LPF
Subjt:  LHGGQARDHIVLFHLICTWIRARAGSWREDADLPF

KAA0057642.1 reverse transcriptase [Cucumis melo var. makuwa]; e value 0.0e+00; identity 73.56%
Query:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKW LSIIEGSP   +         VD             +TR+REGNF SVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FFFSTHVMDEQF----VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLP
        F FSTHV DEQF    VVM DFNAIR HSEA GGSPIQGEMEDFD AI                    + GSGM+RRLDRVL+NDDWLSA PTMLVNVLP
Subjt:  FFFSTHVMDEQF----VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLP

Query:  WGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNP
        WGISDHSPIL YPSFQ NSKVVSFR FNHWV+DPSF+                               RRFGRHI+SLSEE+RIAKEAMDIAQREVERNP
Subjt:  WGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNP

Query:  MSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLI
        MSDVLSRQASLATETFWTA      R  + R  RN L  +VDS  SRV SHDGVAQMAVNYFSNSLGSQEIGYREL+P            ECCQALQ+ I
Subjt:  MSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLI

Query:  SREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVW
        SREEVRRVLFSMDSGKAPG DGFSVGFFKGAW              F T YLPVGVNATAITLIPKHNGA+RLEDFRPISCCNVLYKCISKILADRL VW
Subjt:  SREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVW

Query:  LPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------
        LPSFISSNQ AFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNW+FLFGL I+I TPLKF                            
Subjt:  LPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------

Query:  KDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDC
        K VRQG+PLS FLFVMVMEVLSRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKS IF+AGVNNENAS+LAACMGFVRGNLP+RYLGLPLLTGRLRSNDC
Subjt:  KDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDC

Query:  APMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWD
         P+IQ            +LSFAGRLQLV SVL SLQVYWA VFVLPAYVHN EGGL IRDG +W  ASTLKILWLMLT+ GSLWVAWVEAY+LKGRSLWD
Subjt:  APMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWD

Query:  VDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGR
        VDSRVGRSWCLRAILRK+EKLK HVRMKVGNGNR       W   GAILEQVGERVLYDAASRREA LS+FIG DGEWLW RGGFSIASAW+AI PRGGR
Subjt:  VDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGR

Query:  VLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKG
        VLWDGL+WGG NIPKHSFCAWLAIKDRLGTRDR HRWDS VP+SCILC+GG+ESRDHLFFS          VLRIMASSHRIGHWGVELSWICHQGI KG
Subjt:  VLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKG

Query:  VRSK----LWC---------RNHRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF
        VR K    LWC         RNHRLHGGQA D IV+FHLICTWIRARAGSWREDA LPF
Subjt:  VRSK----LWC---------RNHRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF

TYK19523.1 reverse transcriptase [Cucumis melo var. makuwa]; e value 0.0e+00; identity 71.48%
Query:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKW LSIIEGSP   +         VD             +TR+REGNF SVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FFFSTHVMDEQF----VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLP
        F FSTHV DEQF    VVM DFNAIR HSEA GGSPIQGEMEDFD AI                    + GSGM+RRLDRVL+NDDWLSA PTMLVNVLP
Subjt:  FFFSTHVMDEQF----VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLP

Query:  WGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNP
        WGISDHSPIL YPSFQ NSKV                       S HEGVSPLV LMRNL+ LKPILRRRFGRHI+SLSEE+RIAKEAMDIAQRE++   
Subjt:  WGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNP

Query:  MSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLI
                        W        RS+      NSL+       + V SHDGVAQMAVNYFSNSLGSQEIGYREL+P            ECCQALQ+ I
Subjt:  MSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLI

Query:  SREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVW
        SREEVRRVLFSMDSGKAPG DGFSVGFFKGAW              F T YLPVGVNATAITLIPKHNGA+RLEDFRPISCCNVLYKCISKILADRL VW
Subjt:  SREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVW

Query:  LPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------
        LPSFISSNQ AFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNW+FLFGL I+I TPLKF                            
Subjt:  LPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------

Query:  KDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDC
        K VRQG+PLS FLFVMVMEVLSRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKS IF+AGVNNENAS+LAACMGFVRGNLP+RYLGLPLLTGRLRSNDC
Subjt:  KDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDC

Query:  APMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWD
         P+IQ            +LSFAGRLQLV SVL SLQVYWA VFVLPAYVHN EGGL IRDG +W  ASTLKILWLMLT+ GSLWVAWVEAY+LKGRSLWD
Subjt:  APMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWD

Query:  VDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGR
        VDSRVGRSWCLRAILRK+EKLK HVRMKVGNGNR       W   GAILE+VGERVLYDAASRREA LS+FIG DGEWLW RGGFSIASAW+AI PRGGR
Subjt:  VDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGR

Query:  VLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKG
        VLWDGL+WGG NIPKHSFCAWLAIKDRLGTRDR HRWDS VP+SCILC+GG+ESRDHLFFS          VLRIMASSHRIGHWGVELSWICHQGI KG
Subjt:  VLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKG

Query:  VRSK----LWC---------RNHRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF
        VR K    LWC         RNHRLHGGQA D IV+FHLICTWIRARAGSWREDA LPF
Subjt:  VRSK----LWC---------RNHRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]; e value 0.0e+00; identity 61.89%
Query:  KPISLDLANKKHRRLSYARVCIELKGGSNMPAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSKCSRSVESKTIQEEVVHKGDGLDSEPCGEVVLE
        KPISLD A KK RRLSYARVC+EL+GGSNM AEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHS SKCSRSVESKTIQEEVVHKGD +D E CGEVVLE
Subjt:  KPISLDLANKKHRRLSYARVCIELKGGSNMPAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSKCSRSVESKTIQEEVVHKGDGLDSEPCGEVVLE

Query:  SFKQLEEGEIRSSPNRHNSQVEKGVGKSDDFTLVTRKKSELVSVRDRGKSMEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQVDEDTD----------
        SFKQ+E+GEIRSSPNRH+SQVEKGVGKSD+FTLVTRKKSELVS+RDRGKSMEVIMPNSFGSLLEVGDADKW LSIIEGS PPLQVDE TD          
Subjt:  SFKQLEEGEIRSSPNRHNSQVEKGVGKSDDFTLVTRKKSELVSVRDRGKSMEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQVDEDTD----------

Query:  -------TRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQF----------------------------------------
               TR+REGNF SVSRRF NSWDYSCSYSNSGVGRIWVMWKKNRF FSTHVMDEQF                                        
Subjt:  -------TRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQF----------------------------------------

Query:  ------VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSPIL
              VVM DFNAIRVHSEAF GSPIQGEMEDF+ AI                    + GSGMLRRLDRVLVNDDWLS  PTMLVNVLPWGISDH PIL
Subjt:  ------VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSPIL

Query:  FYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQAS
        FYPSFQ ++KVVSFRFFNHWVEDPSFIEVV RMWS HEGVSPLV LMRNLH LKPILRRRFGRHIK LSEE+RI KEAMDIAQRE               
Subjt:  FYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQAS

Query:  LATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLISREEVRRVLF
                                                      MAVNYF NSLGSQEIGYRELSP            ECCQALQL ISREEVRRVLF
Subjt:  LATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLISREEVRRVLF

Query:  SMDSGKAPGSDGFSVGFFKGAWFVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQYAFILGRSIIENILL
        SMDSGKAPG DGFSV              VG+NATAITLIPKHNGA+RLEDF PISC NVLYKCISKILADRL VWLPSFISSNQ AFI GRSIIENILL
Subjt:  SMDSGKAPGSDGFSVGFFKGAWFVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQYAFILGRSIIENILL

Query:  CQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKFKDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK------------
        CQEL+           C          +     F  G           K VRQG+PLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK            
Subjt:  CQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKFKDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK------------

Query:  -------------------KFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSF
                           KFGELSGLFANPRKS IFVAGVNNENAS LA CMGF RGNLP+RYLGLPLLTGRLRSNDCAP+IQ            +LSF
Subjt:  -------------------KFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSF

Query:  AGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVE
        AGRLQLVRSVLRSLQVYWASVFVLPAYVHN                                 EGG  IRDGPSWNIASTLKILWLMLT+ GSLWVAWVE
Subjt:  AGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVE

Query:  AYILKGRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRWGAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSR-GGFSIASAWKAIHPRG
        AYILKGRSLWDVDSRVGRSWCL AILR      H     V    R    L  + ERV       +E      +     W+  R GGFSI+SAW+AI PRG
Subjt:  AYILKGRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRWGAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSR-GGFSIASAWKAIHPRG

Query:  GRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIG
        GRVLWD                                             GGVESRDHLFFS          VLRIMASS+RIGHWGVELSWICHQGIG
Subjt:  GRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIG

Query:  KGVRSKL
        KGVR KL
Subjt:  KGVRSKL

XP_008452126.1 PREDICTED: uncharacterized protein LOC103493225 [Cucumis melo]; e value 0.0e+00; identity 71.07%
Query:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKW LSIIEGSP   +         VD             +TR+REGNF SVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FFFSTHVMDEQF-----------VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPT
        F FSTHV DEQF           VVM DFNAIR HSEA GGSPIQGEMEDFD AI                    + GSGM+RRLDRVL+NDDWLSA PT
Subjt:  FFFSTHVMDEQF-----------VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPT

Query:  MLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQ
        +                                                                           RFGRHI+SLSEE+RIAKEAMDIAQ
Subjt:  MLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQ

Query:  REVERNPMSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECC
        REVERNPMSDVLSRQASLATETFWTA      R  + R  RN L  +VDS  SRV SHDGVAQMAVNYFSNSLGSQEIGYREL+P            ECC
Subjt:  REVERNPMSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECC

Query:  QALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKIL
        QALQ+ ISREEVRRVLFSMDSGKAPG DGFSVGFFKGAW              F T YLPVGVNATAITLIPKHNGA+RLEDFRPISCCNVLYKCISKIL
Subjt:  QALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKIL

Query:  ADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKFKDVRQGNPLSPFLFVMVMEVL
        ADRL VWLPSFISSNQ AFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNW+FLFGL I+I TPLK K VRQG+PLS FLFVMVMEVL
Subjt:  ADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKFKDVRQGNPLSPFLFVMVMEVL

Query:  SRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSF
        SRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKS IF+AGVNNENAS+LAACMGFVRGNLP+RYLGLPLLTGRLRSNDC P+IQ            +LSF
Subjt:  SRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSF

Query:  AGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWDVDSRVGRSWCLRAILRKREKLK
        AGRLQLV SVL SLQVYWA VFVLPAYVHNEGGL IRDG +W  ASTLKILWLMLT+ GSLWVAWVEAY+LKGRSLWDVDSRVGRSWCLRAILRK+EKLK
Subjt:  AGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWDVDSRVGRSWCLRAILRKREKLK

Query:  HHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWL
         HVRMKVGNGNR       W   GAILE+VGERVLYDAASRREA LS+FIG DGEWLW RGGFSIASAW+AI PRGGRVLWDGL+WGG NIPKHSFCAWL
Subjt:  HHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWL

Query:  AIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRSK----LWC---------RN
        AIKDRLGTRDR HRWDS VP+SCILC+GG+ESRDHLFFS          VLRIMASSHRIGHWGVELSWICHQGI KGVR K    LWC         RN
Subjt:  AIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRSK----LWC---------RN

Query:  HRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF
        HRLHGGQA D IV+FHLICTWIRARAGSWREDA LPF
Subjt:  HRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BSI8 uncharacterized protein LOC103493225; e value 0.0e+00; identity 71.07%
Query:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKW LSIIEGSP   +         VD             +TR+REGNF SVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FFFSTHVMDEQF-----------VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPT
        F FSTHV DEQF           VVM DFNAIR HSEA GGSPIQGEMEDFD AI                    + GSGM+RRLDRVL+NDDWLSA PT
Subjt:  FFFSTHVMDEQF-----------VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPT

Query:  MLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQ
        +                                                                           RFGRHI+SLSEE+RIAKEAMDIAQ
Subjt:  MLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQ

Query:  REVERNPMSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECC
        REVERNPMSDVLSRQASLATETFWTA      R  + R  RN L  +VDS  SRV SHDGVAQMAVNYFSNSLGSQEIGYREL+P            ECC
Subjt:  REVERNPMSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECC

Query:  QALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKIL
        QALQ+ ISREEVRRVLFSMDSGKAPG DGFSVGFFKGAW              F T YLPVGVNATAITLIPKHNGA+RLEDFRPISCCNVLYKCISKIL
Subjt:  QALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKIL

Query:  ADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKFKDVRQGNPLSPFLFVMVMEVL
        ADRL VWLPSFISSNQ AFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNW+FLFGL I+I TPLK K VRQG+PLS FLFVMVMEVL
Subjt:  ADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKFKDVRQGNPLSPFLFVMVMEVL

Query:  SRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSF
        SRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKS IF+AGVNNENAS+LAACMGFVRGNLP+RYLGLPLLTGRLRSNDC P+IQ            +LSF
Subjt:  SRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSF

Query:  AGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWDVDSRVGRSWCLRAILRKREKLK
        AGRLQLV SVL SLQVYWA VFVLPAYVHNEGGL IRDG +W  ASTLKILWLMLT+ GSLWVAWVEAY+LKGRSLWDVDSRVGRSWCLRAILRK+EKLK
Subjt:  AGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWDVDSRVGRSWCLRAILRKREKLK

Query:  HHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWL
         HVRMKVGNGNR       W   GAILE+VGERVLYDAASRREA LS+FIG DGEWLW RGGFSIASAW+AI PRGGRVLWDGL+WGG NIPKHSFCAWL
Subjt:  HHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWL

Query:  AIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRSK----LWC---------RN
        AIKDRLGTRDR HRWDS VP+SCILC+GG+ESRDHLFFS          VLRIMASSHRIGHWGVELSWICHQGI KGVR K    LWC         RN
Subjt:  AIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRSK----LWC---------RN

Query:  HRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF
        HRLHGGQA D IV+FHLICTWIRARAGSWREDA LPF
Subjt:  HRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF

A0A5A7TZS0 Reverse transcriptase domain-containing protein; e value 0.0e+00; identity 65.55%
Query:  NSW--DYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQF----------------------------------------------VVMGDFNAIRVHSEA
        N W  +YSCSYSNSGVGRIWVMWKK RF F THVMDE+F                                              VVMGDFNAIRVHSEA
Subjt:  NSW--DYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQF----------------------------------------------VVMGDFNAIRVHSEA

Query:  FGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWV
        FGGSPIQGEME+FD AI                    + GSGMLRRLDRVLVND+WLSA PTM +NVLPWGISDHSPILFYPSFQ+NS+VVSFRFFNHWV
Subjt:  FGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSPILFYPSFQLNSKVVSFRFFNHWV

Query:  EDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQASLATETFWTA------------
        E+PSFIEVV RMWS HEGVS LVSLMRNLHHLKPILRR+FGRHIKSLSEE+ IAKEAMDIAQREVERNP+SDVLSRQASLATETFWTA            
Subjt:  EDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQASLATETFWTA------------

Query:  ----------NTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLISREEVRRVL
                  NTAFFHRSVRSR+SRNSLLSLVDSDGSRV SHDGVAQMAVNYFSNSLGSQEIGYRELSP            ECCQALQL ISREEVRRVL
Subjt:  ----------NTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLISREEVRRVL

Query:  FSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQ
        FSMDSGKAPG DGFSVGF+KGAW              F TCYLP+GVNATAITLIPKH GA+RLEDFRPISCCNVLYKCISKILADRL +WLPSFISSNQ
Subjt:  FSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQ

Query:  YAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------KDVRQGNPL
         AFI GRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNW+FLFGLLIAIGTPLKF                            K +RQG+PL
Subjt:  YAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------KDVRQGNPL

Query:  SPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVR
        SPFLFVMVMEVLSRMLNKIPQSF+FHHRCEK                               KFGE SGLFANPRKS IFV GVNNE AS LAAC+G   
Subjt:  SPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVR

Query:  GNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------
           P            LRS DCAP+IQ            +LSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN                           
Subjt:  GNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------

Query:  ------EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRWGAILEQVGERV
              EGGL IRDGPSWNIA+TLKIL   LT+LGSLWVAW+EAYILKG+SLWDVDSRVGRSWCLRAILRKREK+KHH                 VGERV
Subjt:  ------EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRWGAILEQVGERV

Query:  LYDAASRREAKLSDFIGSDGEWLWSR--------------------------------GGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAI
        LYDAASRREAKLSDFI  +GEWLW R                                GGFSIASAW+AI PRGGRVLWDGL+WGG NIPKHSFCAWLAI
Subjt:  LYDAASRREAKLSDFIGSDGEWLWSR--------------------------------GGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAI

Query:  KDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRSK----LWC---------RNHR
        KDRL TRDRLHRWDS +P+SCILCQGGVESRDHLFFS          V +IM SSHRIGHWGVELSWICH+GIGKGVR K    LWC         RNHR
Subjt:  KDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKGVRSK----LWC---------RNHR

Query:  LHGGQARDHIVLFHLICTWIRARAGSWREDADLPF
        LHGG+ARD I+LFHLICTWIRARAGSWREDA LPF
Subjt:  LHGGQARDHIVLFHLICTWIRARAGSWREDADLPF

A0A5A7UP65 Reverse transcriptase; e value 0.0e+00; identity 73.56%
Query:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKW LSIIEGSP   +         VD             +TR+REGNF SVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FFFSTHVMDEQF----VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLP
        F FSTHV DEQF    VVM DFNAIR HSEA GGSPIQGEMEDFD AI                    + GSGM+RRLDRVL+NDDWLSA PTMLVNVLP
Subjt:  FFFSTHVMDEQF----VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLP

Query:  WGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNP
        WGISDHSPIL YPSFQ NSKVVSFR FNHWV+DPSF+                               RRFGRHI+SLSEE+RIAKEAMDIAQREVERNP
Subjt:  WGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNP

Query:  MSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLI
        MSDVLSRQASLATETFWTA      R  + R  RN L  +VDS  SRV SHDGVAQMAVNYFSNSLGSQEIGYREL+P            ECCQALQ+ I
Subjt:  MSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLI

Query:  SREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVW
        SREEVRRVLFSMDSGKAPG DGFSVGFFKGAW              F T YLPVGVNATAITLIPKHNGA+RLEDFRPISCCNVLYKCISKILADRL VW
Subjt:  SREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVW

Query:  LPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------
        LPSFISSNQ AFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNW+FLFGL I+I TPLKF                            
Subjt:  LPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------

Query:  KDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDC
        K VRQG+PLS FLFVMVMEVLSRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKS IF+AGVNNENAS+LAACMGFVRGNLP+RYLGLPLLTGRLRSNDC
Subjt:  KDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDC

Query:  APMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWD
         P+IQ            +LSFAGRLQLV SVL SLQVYWA VFVLPAYVHN EGGL IRDG +W  ASTLKILWLMLT+ GSLWVAWVEAY+LKGRSLWD
Subjt:  APMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWD

Query:  VDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGR
        VDSRVGRSWCLRAILRK+EKLK HVRMKVGNGNR       W   GAILEQVGERVLYDAASRREA LS+FIG DGEWLW RGGFSIASAW+AI PRGGR
Subjt:  VDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGR

Query:  VLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKG
        VLWDGL+WGG NIPKHSFCAWLAIKDRLGTRDR HRWDS VP+SCILC+GG+ESRDHLFFS          VLRIMASSHRIGHWGVELSWICHQGI KG
Subjt:  VLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKG

Query:  VRSK----LWC---------RNHRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF
        VR K    LWC         RNHRLHGGQA D IV+FHLICTWIRARAGSWREDA LPF
Subjt:  VRSK----LWC---------RNHRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF

A0A5D3D7P6 Reverse transcriptase; e value 0.0e+00; identity 71.48%
Query:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKW LSIIEGSP   +         VD             +TR+REGNF SVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQ---------VDE----------DTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FFFSTHVMDEQF----VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLP
        F FSTHV DEQF    VVM DFNAIR HSEA GGSPIQGEMEDFD AI                    + GSGM+RRLDRVL+NDDWLSA PTMLVNVLP
Subjt:  FFFSTHVMDEQF----VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLP

Query:  WGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNP
        WGISDHSPIL YPSFQ NSKV                       S HEGVSPLV LMRNL+ LKPILRRRFGRHI+SLSEE+RIAKEAMDIAQRE++   
Subjt:  WGISDHSPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNP

Query:  MSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLI
                        W        RS+      NSL+       + V SHDGVAQMAVNYFSNSLGSQEIGYREL+P            ECCQALQ+ I
Subjt:  MSDVLSRQASLATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLI

Query:  SREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVW
        SREEVRRVLFSMDSGKAPG DGFSVGFFKGAW              F T YLPVGVNATAITLIPKHNGA+RLEDFRPISCCNVLYKCISKILADRL VW
Subjt:  SREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVW

Query:  LPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------
        LPSFISSNQ AFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNW+FLFGL I+I TPLKF                            
Subjt:  LPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKF----------------------------

Query:  KDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDC
        K VRQG+PLS FLFVMVMEVLSRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKS IF+AGVNNENAS+LAACMGFVRGNLP+RYLGLPLLTGRLRSNDC
Subjt:  KDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDC

Query:  APMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWD
         P+IQ            +LSFAGRLQLV SVL SLQVYWA VFVLPAYVHN EGGL IRDG +W  ASTLKILWLMLT+ GSLWVAWVEAY+LKGRSLWD
Subjt:  APMIQ------------LLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWD

Query:  VDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGR
        VDSRVGRSWCLRAILRK+EKLK HVRMKVGNGNR       W   GAILE+VGERVLYDAASRREA LS+FIG DGEWLW RGGFSIASAW+AI PRGGR
Subjt:  VDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNR-------W---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGR

Query:  VLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKG
        VLWDGL+WGG NIPKHSFCAWLAIKDRLGTRDR HRWDS VP+SCILC+GG+ESRDHLFFS          VLRIMASSHRIGHWGVELSWICHQGI KG
Subjt:  VLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIGKG

Query:  VRSK----LWC---------RNHRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF
        VR K    LWC         RNHRLHGGQA D IV+FHLICTWIRARAGSWREDA LPF
Subjt:  VRSK----LWC---------RNHRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF

A0A5D3DXE4 Reverse transcriptase domain-containing protein | e value: 0.0e+00 | %identity: 61.89
Query:  KPISLDLANKKHRRLSYARVCIELKGGSNMPAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSKCSRSVESKTIQEEVVHKGDGLDSEPCGEVVLE
        KPISLD A KK RRLSYARVC+EL+GGSNM AEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHS SKCSRSVESKTIQEEVVHKGD +D E CGEVVLE
Subjt:  KPISLDLANKKHRRLSYARVCIELKGGSNMPAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSKCSRSVESKTIQEEVVHKGDGLDSEPCGEVVLE

Query:  SFKQLEEGEIRSSPNRHNSQVEKGVGKSDDFTLVTRKKSELVSVRDRGKSMEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQVDEDTD----------
        SFKQ+E+GEIRSSPNRH+SQVEKGVGKSD+FTLVTRKKSELVS+RDRGKSMEVIMPNSFGSLLEVGDADKW LSIIEGS PPLQVDE TD          
Subjt:  SFKQLEEGEIRSSPNRHNSQVEKGVGKSDDFTLVTRKKSELVSVRDRGKSMEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQVDEDTD----------

Query:  -------TRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQF----------------------------------------
               TR+REGNF SVSRRF NSWDYSCSYSNSGVGRIWVMWKKNRF FSTHVMDEQF                                        
Subjt:  -------TRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQF----------------------------------------

Query:  ------VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSPIL
              VVM DFNAIRVHSEAF GSPIQGEMEDF+ AI                    + GSGMLRRLDRVLVNDDWLS  PTMLVNVLPWGISDH PIL
Subjt:  ------VVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAI--------------------LHGSGMLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSPIL

Query:  FYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQAS
        FYPSFQ ++KVVSFRFFNHWVEDPSFIEVV RMWS HEGVSPLV LMRNLH LKPILRRRFGRHIK LSEE+RI KEAMDIAQRE               
Subjt:  FYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQAS

Query:  LATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLISREEVRRVLF
                                                      MAVNYF NSLGSQEIGYRELSP            ECCQALQL ISREEVRRVLF
Subjt:  LATETFWTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSP------------ECCQALQLLISREEVRRVLF

Query:  SMDSGKAPGSDGFSVGFFKGAWFVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQYAFILGRSIIENILL
        SMDSGKAPG DGFSV              VG+NATAITLIPKHNGA+RLEDF PISC NVLYKCISKILADRL VWLPSFISSNQ AFI GRSIIENILL
Subjt:  SMDSGKAPGSDGFSVGFFKGAWFVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQYAFILGRSIIENILL

Query:  CQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKFKDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK------------
        CQEL+           C          +     F  G           K VRQG+PLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK            
Subjt:  CQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGTPLKFKDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK------------

Query:  -------------------KFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSF
                           KFGELSGLFANPRKS IFVAGVNNENAS LA CMGF RGNLP+RYLGLPLLTGRLRSNDCAP+IQ            +LSF
Subjt:  -------------------KFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQ------------LLSF

Query:  AGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVE
        AGRLQLVRSVLRSLQVYWASVFVLPAYVHN                                 EGG  IRDGPSWNIASTLKILWLMLT+ GSLWVAWVE
Subjt:  AGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVE

Query:  AYILKGRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRWGAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSR-GGFSIASAWKAIHPRG
        AYILKGRSLWDVDSRVGRSWCL AILR      H     V    R    L  + ERV       +E      +     W+  R GGFSI+SAW+AI PRG
Subjt:  AYILKGRSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNGNRWGAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSR-GGFSIASAWKAIHPRG

Query:  GRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIG
        GRVLWD                                             GGVESRDHLFFS          VLRIMASS+RIGHWGVELSWICHQGIG
Subjt:  GRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS----------VLRIMASSHRIGHWGVELSWICHQGIG

Query:  KGVRSKL
        KGVR KL
Subjt:  KGVRSKL

SwissProt top hits | e value | %identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 7.4e-17 | %identity: 24.85
Query:  LSPECCQALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKG------AWFVTCY--------LPVGVNATAITLIPK-HNGAKRLEDFRPISCCNVLY
        L+ E  ++L   I+  E+  ++ S+ + K+PG DGF+  F++        + +  +        LP      +I LIPK      + E+FRPIS  N+  
Subjt:  LSPECCQALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKG------AWFVTCY--------LPVGVNATAITLIPK-HNGAKRLEDFRPISCCNVLY

Query:  KCISKILADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGT------------------
        K ++KILA+R+   +   I  +Q  FI G     NI     ++   +    K    + +D +KA+D +   F+   L  +G                   
Subjt:  KCISKILADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGT------------------

Query:  -----------PLKFKDVRQGNPLSPFLFVMVMEVLSRMLNKIPQ-----------------------------SFQFHHRCEKKFGELSGLFANPRKSF
                   PLK    RQG PLSP LF +V+EVL+R + +  +                             S Q   +    F ++SG   N +KS 
Subjt:  -----------PLKFKDVRQGNPLSPFLFVMVMEVLSRMLNKIPQ-----------------------------SFQFHHRCEKKFGELSGLFANPRKSF

Query:  IFVAGVNNENASQLAACMGFVRGNLPIRYLGLPL
         F+   N +  SQ+   + F   +  I+YLG+ L
Subjt:  IFVAGVNNENASQLAACMGFVRGNLPIRYLGLPL

P08548 LINE-1 reverse transcriptase homolog | e value: 1.5e-14 | %identity: 24.7
Query:  LSPECCQALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKGAWFVTCYLPVGVN----------------ATAITLIPK-HNGAKRLEDFRPISCCNV
        LS +  + L   IS  E+   + ++   K+PG DGF+  F++   F    +P+ +N                   ITLIPK      R E++RPIS  N+
Subjt:  LSPECCQALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKGAWFVTCYLPVGVN----------------ATAITLIPK-HNGAKRLEDFRPISCCNV

Query:  LYKCISKILADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGT----------------
          K ++KIL +R+   +   I  +Q  FI G     NI     ++   +    K    L +D +KA+D++   F+   L  IG                 
Subjt:  LYKCISKILADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIGT----------------

Query:  -------------PLKFKDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCE-----------------------------KKFGELSGLFANPRK
                     PL+    RQG PLSP LF +VMEVL+  + +       H   E                             K++  +SG   N  K
Subjt:  -------------PLKFKDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCE-----------------------------KKFGELSGLFANPRK

Query:  SFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPL
        S  F+   NN+    +   + F      ++YLG+ L
Subjt:  SFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPL

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 1.6e-16 | %identity: 26.4
Query:  ISREEVRRVLFSMDSGKAPGSDGFSVGFFK--------------GAWFVTCYLPVGVNATAITLIPK-HNGAKRLEDFRPISCCNVLYKCISKILADRLH
        IS +E+  V+ S+ + K+PG DGFS  F++                  V   LP       ITLIPK      ++E+FRPIS  N+  K ++KILA+R+ 
Subjt:  ISREEVRRVLFSMDSGKAPGSDGFSVGFFK--------------GAWFVTCYLPVGVNATAITLIPK-HNGAKRLEDFRPISCCNVLYKCISKILADRLH

Query:  VWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIG-----------------------------TP
          + + I  +Q  FI G     NI     ++   +    K    + +D +KA+D +   F+  +L   G                              P
Subjt:  VWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGLLIAIG-----------------------------TP

Query:  LKFKDVRQGNPLSPFLFVMVMEVLSRML--NKIPQSFQFHHRCEK---------------------------KFGELSGLFANPRKSFIFVAGVNNENAS
        LK    RQG PLSP+LF +V+EVL+R +   K  +  Q      K                            FGE+ G   N  KS  F+   N +   
Subjt:  LKFKDVRQGNPLSPFLFVMVMEVLSRML--NKIPQSFQFHHRCEK---------------------------KFGELSGLFANPRKSFIFVAGVNNENAS

Query:  QLAACMGFVRGNLPIRYLGLPL
        ++     F      I+YLG+ L
Subjt:  QLAACMGFVRGNLPIRYLGLPL

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 4.6e-19 | %identity: 21.83
Query:  RLDRVLVNDDWLSASPTMLVNVLPWGISDHSPILFYPSFQLN-SKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHL-KPILRRRFGRH
        R+DR+ ++   +S + +  + + P+  SDH+ +    S   +  K   + F N  +ED  F + V   W          + +     + K  L+     +
Subjt:  RLDRVLVNDDWLSASPTMLVNVLPWGISDHSPILFYPSFQLN-SKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHL-KPILRRRFGRH

Query:  IKSLSEEMRIAKEA-----MDIAQR---EVERNPMSDVLSRQASLATETFWTANTA-----------------FFHRSVRSRMSRNSLLSLVDSDGSRVF
         KS+S +     EA     +D+ QR     ++    + L R+ +L       A  A                 FF+   + + +R  +  L   DG+ + 
Subjt:  IKSLSEEMRIAKEA-----MDIAQR---EVERNPMSDVLSRQASLATETFWTANTA-----------------FFHRSVRSRMSRNSLLSLVDSDGSRVF

Query:  SHDGVAQMAVNYFSNSLGSQEIGYRELSPECC---------------QALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------
          + +   A +++ N      I     SP+ C               + L+  I+ +E+ + L  M   K+PG DG ++ FF+  W              
Subjt:  SHDGVAQMAVNYFSNSLGSQEIGYRELSPECC---------------QALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKGAW--------------

Query:  FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVD
        F    LP+      ++L+PK    + ++++RP+S  +  YK ++K ++ RL   L   I  +Q   + GR+I +N+ L ++L+  +   +G     L +D
Subjt:  FVTCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVD

Query:  LQKAYDSVNWNFLFGLLIA---------------------------IGTPLKF-KDVRQGNPLSPFLFVMVMEVLSRMLNK
         +KA+D V+  +L G L A                           +  PL F + VRQG PLS  L+ + +E    +L K
Subjt:  LQKAYDSVNWNFLFGLLIA---------------------------IGTPLKF-KDVRQGNPLSPFLFVMVMEVLSRMLNK

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM | e value: 5.6e-09 | %identity: 29.94
Query:  LPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHV---WLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQ
        LP  +       IPK   AKR +DFRPIS  +VL + ++ ILA RL+    W P      Q  F+      +N  +  +LV  +     +      +D+ 
Subjt:  LPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHV---WLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQ

Query:  KAYDSVNWNFLFGLLIAIGTPLKFKD----------------------------VRQGNPLSPFLFVMVMEVLSRML
        KA+DS++   ++  L A G P  F D                            V+QG+PLSP LF +VM+ L R L
Subjt:  KAYDSVNWNFLFGLLIAIGTPLKFKD----------------------------VRQGNPLSPFLFVMVMEVLSRML

Arabidopsis top hits | e value | %identity | Alignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 7.5e-17 | %identity: 30.41
Query:  RSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNG-------NRW---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWS------RGGFSIAS
        R+ W ++S    SW  R + + RE  +  V   VG+G       + W   G +++ VG           +A        D  ++W          FS A 
Subjt:  RSLWDVDSRVGRSWCLRAILRKREKLKHHVRMKVGNG-------NRW---GAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWS------RGGFSIAS

Query:  AWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFF
           A+HP+   V W   +W   ++PKH+F  W+   +RL TRDRL  W   +P  C+LC    ESR HLFF
Subjt:  AWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFF

AT1G43760.1 DNAse I-like superfamily protein | e value: 2.3e-34 | %identity: 27.27
Query:  LSIIEGSPPPLQVDEDTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQFVVMGDFNAIRVHSEAF----------GGS
        +S + GSP  L     +     E N  ++      SW    +Y  S +GRIW++W  +         D+  +++GDF+ I   S+ +          G  
Subjt:  LSIIEGSPPPLQVDEDTDTRIREGNFASVSRRFGNSWDYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQFVVMGDFNAIRVHSEAF----------GGS

Query:  PIQGEMEDFDPAILHGSG-------------MLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSP-ILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVF
          Q  + D D   +   G             ++R+LDR + N DW S+ P+ +      G+SDHSP I+   +    SK   FR+F+     P+F+  + 
Subjt:  PIQGEMEDFDPAILHGSG-------------MLRRLDRVLVNDDWLSASPTMLVNVLPWGISDHSP-ILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVF

Query:  RMWSSHEGV-SPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQA----------SLATETFW------------T
          W     V S + SL  +L   K   +    +   ++  + + A ++++  Q ++  NP SD L R            + A E+F+             
Subjt:  RMWSSHEGV-SPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQA----------SLATETFW------------T

Query:  ANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQE--------IGYRELSPECC-----QALQLLISREEVRRVLFSMDSGKA
        ANT FFH+ + +  ++N +  L   D  RV +   V +M V Y+++ LGS             +++ P  C       L  L S +E+   +F+M   KA
Subjt:  ANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQE--------IGYRELSPECC-----QALQLLISREEVRRVLFSMDSGKA

Query:  PGSDGFSVGFFKGAWFV--------------TCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCIS
        PG D F+  FF  +WFV              T +L    NATAITLIPK  G  +L  FRP+SCC V+YK I+
Subjt:  PGSDGFSVGFFKGAWFV--------------TCYLPVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCIS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 7.3e-28 | %identity: 29.97
Query:  VAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQL------------LSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLD--
        +AGV + + + +     F  G LP+RYLGLPLLT ++ ++D  P+++             LSFAGRLQL+ SV+ SL  +W S F LP+    E  +D  
Subjt:  VAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDCAPMIQL------------LSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLD--

Query:  ----IRDGPSWNIASTLKILWLMLTS---LGSLWVAWVEAYILKGRSLWDVDSRVG-RSWCLRAILRKREKLKHHVRMKVGNG-------NRW---GAIL
            +  GP  N     K+ W  + +    G L +  ++    KG S W +       SW  + IL+ R      V+  + NG       + W   G ++
Subjt:  ----IRDGPSWNIASTLKILWLMLTS---LGSLWVAWVEAYILKGRSLWDVDSRVG-RSWCLRAILRKREKLKHHVRMKVGNG-------NRW---GAIL

Query:  EQVGERVLYDA-----ASRREA---------------KLSDFIG-------SDGE--WLWSRGG------FSIASAWKAIHPRGGRVLWDGLMWGGRNIP
        +  G R   D      AS  EA               ++ D I        + GE    W   G      F+    W A      +V W   +W     P
Subjt:  EQVGERVLYDA-----ASRREA---------------KLSDFIG-------SDGE--WLWSRGG------FSIASAWKAIHPRGGRVLWDGLMWGGRNIP

Query:  KHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS
        K+S  AW+AIK+RL T DR+  W++    SC+LC   VE+RDHLFF+
Subjt:  KHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFFS

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 1.9e-15 | %identity: 38.04
Query:  DGEWLWS------RGGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFF
        D  +LW          FS    W A+HP+   V W   +W   ++PKH+F  W+   +RL TRDRL  W   +P  C+LC    +SR HLFF
Subjt:  DGEWLWS------RGGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFF

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 3.2e-15 | %identity: 41.3
Query:  DGEWLWSRG------GFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFF
        D  ++W  G      GFS A+ W  ++P G +V W   +W    IPKH+F +W+ I+ RL TRD+L  W   VP  C+LC    E+R HLFF
Subjt:  DGEWLWSRG------GFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPMSCILCQGGVESRDHLFF


Sequences
CDS sequence
ATGGAGTTATGGACAGAAGCTGGGTTGGCTGTTGTAGCGAGTGCTGTGGGGAAACCTATATCTTTAGATTTGGCTAATAAGAAGCATCGTAGGCTCTCATATGCTCGTGT
TTGTATTGAATTAAAGGGAGGGTCAAATATGCCTGCTGAAATTACTGTTAGTCTCAGAGGAGTGGATTTTAATGTTTCGGTTAATTATGAGTGGAAACCACGGAAGTGTA
ATTTGTGTTGTGCCTTTGGGCATTCTGGTAGTAAGTGTTCTAGAAGTGTGGAGAGTAAAACCATACAGGAGGAGGTTGTGCACAAGGGGGATGGATTAGACAGTGAACCT
TGTGGGGAAGTTGTTCTTGAATCGTTCAAACAGTTAGAGGAAGGTGAAATTAGGAGCTCTCCTAATAGACATAACAGCCAAGTGGAGAAGGGGGTGGGTAAAAGTGATGA
TTTTACCCTTGTAACTCGTAAGAAGAGTGAGTTAGTCTCTGTTAGAGATCGTGGAAAGAGTATGGAGGTGATTATGCCAAACTCTTTTGGTAGTCTCTTGGAGGTGGGTG
ACGCTGACAAGTGGGTACTATCTATAATAGAGGGTTCACCGCCACCCTTACAGGTTGATGAAGATACTGATACTAGAATTCGAGAAGGTAATTTTGCTTCTGTTTCTAGA
AGGTTTGGTAACTCTTGGGATTACTCTTGTAGCTACAGTAATAGTGGTGTTGGTCGGATTTGGGTGATGTGGAAGAAGAATCGTTTTTTTTTCTCTACTCATGTGATGGA
CGAGCAGTTTGTTGTCATGGGAGATTTTAATGCTATTAGAGTTCATTCTGAAGCATTTGGGGGATCTCCTATTCAGGGTGAGATGGAGGATTTTGATCCAGCTATTCTTC
ATGGGTCTGGGATGTTGCGTCGTCTGGATCGTGTTTTAGTGAATGATGATTGGTTATCTGCATCGCCTACGATGTTGGTTAATGTGCTTCCATGGGGTATTTCTGATCAT
TCTCCTATTTTATTTTATCCTAGCTTTCAGCTAAATAGCAAAGTGGTGTCTTTTCGGTTCTTCAATCATTGGGTGGAGGATCCATCCTTTATTGAGGTTGTTTTTAGGAT
GTGGAGTAGTCATGAGGGTGTCTCTCCGCTAGTGAGCCTCATGAGAAACCTTCATCATCTCAAACCTATCCTCCGTAGACGGTTTGGTAGACACATCAAGAGTCTTAGTG
AGGAGATGCGCATTGCTAAGGAGGCCATGGATATAGCTCAGAGAGAGGTAGAACGTAACCCAATGTCAGATGTTTTGAGTCGCCAAGCAAGCCTTGCTACTGAGACTTTC
TGGACAGCAAATACGGCTTTTTTCCATCGATCCGTCCGCTCTCGTATGAGTCGTAATTCACTACTTTCTCTAGTTGATTCCGATGGATCCAGGGTGTTTTCACATGATGG
GGTGGCTCAGATGGCAGTTAATTATTTTAGTAACAGTTTGGGGTCCCAGGAGATTGGCTATAGAGAATTGTCCCCAGAGTGTTGTCAGGCGTTACAGTTACTTATTAGTC
GGGAGGAAGTTAGGAGGGTCTTATTCTCTATGGATAGTGGAAAGGCTCCCGGTTCTGATGGGTTCTCTGTAGGTTTCTTCAAAGGTGCCTGGTTTGTGACCTGTTATCTT
CCAGTAGGAGTTAATGCTACTGCTATTACCCTCATTCCTAAACATAATGGGGCTAAGCGTTTGGAGGACTTTCGACCTATTTCTTGTTGTAATGTGTTATATAAATGCAT
TTCTAAAATTTTGGCTGATAGACTTCATGTGTGGCTTCCTTCTTTTATCAGTAGTAACCAGTATGCTTTTATACTTGGGAGGAGTATTATCGAGAACATCCTGCTTTGTC
AAGAACTGGTAGGAGGTTATCATCTTAATTCTGGTAAACCTCGATGTACTTTGAAAGTTGATCTTCAAAAAGCATATGACTCTGTTAATTGGAATTTTCTGTTTGGTTTG
CTGATTGCTATTGGTACTCCTTTGAAGTTTAAGGATGTAAGACAAGGTAATCCTTTATCTCCTTTTCTTTTTGTTATGGTGATGGAAGTTCTTTCTCGTATGTTGAACAA
GATTCCTCAGAGTTTTCAATTTCACCATCGTTGTGAAAAGAAGTTTGGTGAGCTTTCAGGTTTGTTTGCAAATCCTAGGAAAAGCTTTATTTTTGTTGCAGGAGTTAATA
ATGAGAATGCTTCTCAGCTGGCTGCTTGTATGGGTTTTGTCCGTGGAAATCTCCCTATTCGTTATCTTGGGCTTCCTCTTCTTACTGGTCGGTTACGTTCTAATGATTGT
GCTCCTATGATTCAGCTTCTTTCGTTTGCTGGTAGACTGCAGCTTGTTCGTTCTGTGCTTCGCAGCCTTCAAGTTTACTGGGCTAGTGTGTTTGTTCTTCCTGCGTATGT
GCATAATGAGGGCGGTCTTGATATTCGAGATGGCCCTTCTTGGAATATTGCGAGTACTTTGAAGATTTTGTGGCTTATGTTGACAAGTTTGGGTTCTCTTTGGGTGGCTT
GGGTGGAGGCTTATATACTAAAGGGGAGGTCATTGTGGGATGTGGATAGTAGAGTGGGTCGTTCTTGGTGTCTTCGAGCGATCTTACGTAAGCGAGAGAAGTTGAAGCAT
CATGTGAGGATGAAGGTGGGAAATGGCAATAGATGGGGTGCTATTCTGGAGCAGGTTGGGGAGAGAGTGCTTTATGATGCAGCAAGTCGGAGGGAGGCTAAACTTTCTGA
CTTTATTGGCTCAGATGGAGAATGGCTTTGGTCGCGAGGTGGTTTCTCTATTGCAAGTGCATGGAAAGCTATTCATCCTAGGGGTGGTCGGGTTCTTTGGGATGGTTTAA
TGTGGGGTGGGAGAAATATCCCAAAACATTCCTTTTGTGCGTGGTTGGCCATTAAAGATAGGTTGGGCACTAGAGATAGATTGCATAGGTGGGATAGTTTGGTACCGATG
TCGTGCATTCTATGTCAGGGGGGTGTGGAGTCTCGCGATCACTTATTCTTTTCAGTTCTTCGGATCATGGCTTCCTCACATAGGATTGGGCATTGGGGGGTTGAGTTGTC
TTGGATTTGTCATCAGGGTATTGGGAAGGGTGTGAGGAGTAAGTTGTGGTGCCGGAATCATCGGCTACATGGTGGTCAGGCTCGTGATCATATTGTCCTTTTCCATCTTA
TTTGTACGTGGATTCGTGCTCGTGCTGGCTCTTGGCGCGAGGATGCTGATCTTCCTTTTTAA
mRNA sequence
ATGGAGTTATGGACAGAAGCTGGGTTGGCTGTTGTAGCGAGTGCTGTGGGGAAACCTATATCTTTAGATTTGGCTAATAAGAAGCATCGTAGGCTCTCATATGCTCGTGT
TTGTATTGAATTAAAGGGAGGGTCAAATATGCCTGCTGAAATTACTGTTAGTCTCAGAGGAGTGGATTTTAATGTTTCGGTTAATTATGAGTGGAAACCACGGAAGTGTA
ATTTGTGTTGTGCCTTTGGGCATTCTGGTAGTAAGTGTTCTAGAAGTGTGGAGAGTAAAACCATACAGGAGGAGGTTGTGCACAAGGGGGATGGATTAGACAGTGAACCT
TGTGGGGAAGTTGTTCTTGAATCGTTCAAACAGTTAGAGGAAGGTGAAATTAGGAGCTCTCCTAATAGACATAACAGCCAAGTGGAGAAGGGGGTGGGTAAAAGTGATGA
TTTTACCCTTGTAACTCGTAAGAAGAGTGAGTTAGTCTCTGTTAGAGATCGTGGAAAGAGTATGGAGGTGATTATGCCAAACTCTTTTGGTAGTCTCTTGGAGGTGGGTG
ACGCTGACAAGTGGGTACTATCTATAATAGAGGGTTCACCGCCACCCTTACAGGTTGATGAAGATACTGATACTAGAATTCGAGAAGGTAATTTTGCTTCTGTTTCTAGA
AGGTTTGGTAACTCTTGGGATTACTCTTGTAGCTACAGTAATAGTGGTGTTGGTCGGATTTGGGTGATGTGGAAGAAGAATCGTTTTTTTTTCTCTACTCATGTGATGGA
CGAGCAGTTTGTTGTCATGGGAGATTTTAATGCTATTAGAGTTCATTCTGAAGCATTTGGGGGATCTCCTATTCAGGGTGAGATGGAGGATTTTGATCCAGCTATTCTTC
ATGGGTCTGGGATGTTGCGTCGTCTGGATCGTGTTTTAGTGAATGATGATTGGTTATCTGCATCGCCTACGATGTTGGTTAATGTGCTTCCATGGGGTATTTCTGATCAT
TCTCCTATTTTATTTTATCCTAGCTTTCAGCTAAATAGCAAAGTGGTGTCTTTTCGGTTCTTCAATCATTGGGTGGAGGATCCATCCTTTATTGAGGTTGTTTTTAGGAT
GTGGAGTAGTCATGAGGGTGTCTCTCCGCTAGTGAGCCTCATGAGAAACCTTCATCATCTCAAACCTATCCTCCGTAGACGGTTTGGTAGACACATCAAGAGTCTTAGTG
AGGAGATGCGCATTGCTAAGGAGGCCATGGATATAGCTCAGAGAGAGGTAGAACGTAACCCAATGTCAGATGTTTTGAGTCGCCAAGCAAGCCTTGCTACTGAGACTTTC
TGGACAGCAAATACGGCTTTTTTCCATCGATCCGTCCGCTCTCGTATGAGTCGTAATTCACTACTTTCTCTAGTTGATTCCGATGGATCCAGGGTGTTTTCACATGATGG
GGTGGCTCAGATGGCAGTTAATTATTTTAGTAACAGTTTGGGGTCCCAGGAGATTGGCTATAGAGAATTGTCCCCAGAGTGTTGTCAGGCGTTACAGTTACTTATTAGTC
GGGAGGAAGTTAGGAGGGTCTTATTCTCTATGGATAGTGGAAAGGCTCCCGGTTCTGATGGGTTCTCTGTAGGTTTCTTCAAAGGTGCCTGGTTTGTGACCTGTTATCTT
CCAGTAGGAGTTAATGCTACTGCTATTACCCTCATTCCTAAACATAATGGGGCTAAGCGTTTGGAGGACTTTCGACCTATTTCTTGTTGTAATGTGTTATATAAATGCAT
TTCTAAAATTTTGGCTGATAGACTTCATGTGTGGCTTCCTTCTTTTATCAGTAGTAACCAGTATGCTTTTATACTTGGGAGGAGTATTATCGAGAACATCCTGCTTTGTC
AAGAACTGGTAGGAGGTTATCATCTTAATTCTGGTAAACCTCGATGTACTTTGAAAGTTGATCTTCAAAAAGCATATGACTCTGTTAATTGGAATTTTCTGTTTGGTTTG
CTGATTGCTATTGGTACTCCTTTGAAGTTTAAGGATGTAAGACAAGGTAATCCTTTATCTCCTTTTCTTTTTGTTATGGTGATGGAAGTTCTTTCTCGTATGTTGAACAA
GATTCCTCAGAGTTTTCAATTTCACCATCGTTGTGAAAAGAAGTTTGGTGAGCTTTCAGGTTTGTTTGCAAATCCTAGGAAAAGCTTTATTTTTGTTGCAGGAGTTAATA
ATGAGAATGCTTCTCAGCTGGCTGCTTGTATGGGTTTTGTCCGTGGAAATCTCCCTATTCGTTATCTTGGGCTTCCTCTTCTTACTGGTCGGTTACGTTCTAATGATTGT
GCTCCTATGATTCAGCTTCTTTCGTTTGCTGGTAGACTGCAGCTTGTTCGTTCTGTGCTTCGCAGCCTTCAAGTTTACTGGGCTAGTGTGTTTGTTCTTCCTGCGTATGT
GCATAATGAGGGCGGTCTTGATATTCGAGATGGCCCTTCTTGGAATATTGCGAGTACTTTGAAGATTTTGTGGCTTATGTTGACAAGTTTGGGTTCTCTTTGGGTGGCTT
GGGTGGAGGCTTATATACTAAAGGGGAGGTCATTGTGGGATGTGGATAGTAGAGTGGGTCGTTCTTGGTGTCTTCGAGCGATCTTACGTAAGCGAGAGAAGTTGAAGCAT
CATGTGAGGATGAAGGTGGGAAATGGCAATAGATGGGGTGCTATTCTGGAGCAGGTTGGGGAGAGAGTGCTTTATGATGCAGCAAGTCGGAGGGAGGCTAAACTTTCTGA
CTTTATTGGCTCAGATGGAGAATGGCTTTGGTCGCGAGGTGGTTTCTCTATTGCAAGTGCATGGAAAGCTATTCATCCTAGGGGTGGTCGGGTTCTTTGGGATGGTTTAA
TGTGGGGTGGGAGAAATATCCCAAAACATTCCTTTTGTGCGTGGTTGGCCATTAAAGATAGGTTGGGCACTAGAGATAGATTGCATAGGTGGGATAGTTTGGTACCGATG
TCGTGCATTCTATGTCAGGGGGGTGTGGAGTCTCGCGATCACTTATTCTTTTCAGTTCTTCGGATCATGGCTTCCTCACATAGGATTGGGCATTGGGGGGTTGAGTTGTC
TTGGATTTGTCATCAGGGTATTGGGAAGGGTGTGAGGAGTAAGTTGTGGTGCCGGAATCATCGGCTACATGGTGGTCAGGCTCGTGATCATATTGTCCTTTTCCATCTTA
TTTGTACGTGGATTCGTGCTCGTGCTGGCTCTTGGCGCGAGGATGCTGATCTTCCTTTTTAA
Protein sequence
MELWTEAGLAVVASAVGKPISLDLANKKHRRLSYARVCIELKGGSNMPAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSGSKCSRSVESKTIQEEVVHKGDGLDSEP
CGEVVLESFKQLEEGEIRSSPNRHNSQVEKGVGKSDDFTLVTRKKSELVSVRDRGKSMEVIMPNSFGSLLEVGDADKWVLSIIEGSPPPLQVDEDTDTRIREGNFASVSR
RFGNSWDYSCSYSNSGVGRIWVMWKKNRFFFSTHVMDEQFVVMGDFNAIRVHSEAFGGSPIQGEMEDFDPAILHGSGMLRRLDRVLVNDDWLSASPTMLVNVLPWGISDH
SPILFYPSFQLNSKVVSFRFFNHWVEDPSFIEVVFRMWSSHEGVSPLVSLMRNLHHLKPILRRRFGRHIKSLSEEMRIAKEAMDIAQREVERNPMSDVLSRQASLATETF
WTANTAFFHRSVRSRMSRNSLLSLVDSDGSRVFSHDGVAQMAVNYFSNSLGSQEIGYRELSPECCQALQLLISREEVRRVLFSMDSGKAPGSDGFSVGFFKGAWFVTCYL
PVGVNATAITLIPKHNGAKRLEDFRPISCCNVLYKCISKILADRLHVWLPSFISSNQYAFILGRSIIENILLCQELVGGYHLNSGKPRCTLKVDLQKAYDSVNWNFLFGL
LIAIGTPLKFKDVRQGNPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSFIFVAGVNNENASQLAACMGFVRGNLPIRYLGLPLLTGRLRSNDC
APMIQLLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLDIRDGPSWNIASTLKILWLMLTSLGSLWVAWVEAYILKGRSLWDVDSRVGRSWCLRAILRKREKLKH
HVRMKVGNGNRWGAILEQVGERVLYDAASRREAKLSDFIGSDGEWLWSRGGFSIASAWKAIHPRGGRVLWDGLMWGGRNIPKHSFCAWLAIKDRLGTRDRLHRWDSLVPM
SCILCQGGVESRDHLFFSVLRIMASSHRIGHWGVELSWICHQGIGKGVRSKLWCRNHRLHGGQARDHIVLFHLICTWIRARAGSWREDADLPF