; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0017120 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0017120
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr07:20811932..20816035
RNA-Seq ExpressionPay0017120
SyntenyPay0017120
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026071.1 uncharacterized protein E6C27_scaffold581G00620 [Cucumis melo var. makuwa]0.0e+0073.63Show/hide
Query:  MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDSHVEKGVGKSD
        MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDS VEKGVGKSD
Subjt:  MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDSHVEKGVGKSD

Query:  EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD-----------------TRVREGNFDSVSRRFSNSWDYS
        EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD                 TRVREGNFDSVSRRFSNSWDYS
Subjt:  EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD-----------------TRVREGNFDSVSRRFSNSWDYS

Query:  CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS-----------------------------------GVVMEDFNAIRVHSEAFRGSPIQG
        CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS                                   GVVMEDFNAIRVHSEAFRGSPIQG
Subjt:  CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS-----------------------------------GVVMEDFNAIRVHSEAFRGSPIQG

Query:  EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV
        EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV
Subjt:  EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV

Query:  VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSR
        VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQRE                                            
Subjt:  VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSR

Query:  MSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGINATAI
                 MAVNYFRNSLGSQEIG RELSPVIDDIVQFQWSEEC QALQLPISREEVRRVLFSMDSGKAPGPDGFSV              +GINATAI
Subjt:  MSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGINATAI

Query:  TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLG
        TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQE +     +   P   +   L+  +         G
Subjt:  TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLG

Query:  VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSSIFVAGVNNENASHLA
        VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK                               KFGELSGLFANPRKSSIFVAGVNNENASHLA
Subjt:  VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSSIFVAGVNNENASHLA

Query:  TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN--------------------
        TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN                    
Subjt:  TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN--------------------

Query:  -------------EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQ
                     EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVG                   CL  W    AIL  
Subjt:  -------------EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQ

Query:  VGERV----LYDAASRGRLDFLTLLAQME--YGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRW
           R     L+   S   +D    + ++         RVWVRGRQGGFSISSAWEAIRPRGGRVLWD     W
Subjt:  VGERV----LYDAASRGRLDFLTLLAQME--YGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRW

KAA0057642.1 reverse transcriptase [Cucumis melo var. makuwa]0.0e+0071.09Show/hide
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKWALSIIEGS                   S  +      +TRVREGNFDSVSRRF NSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT
        FSFSTHV DEQF+TGT       VVMEDFNAIR HSEA  GSPIQGEMEDF++AIRDADLVEPSVQGNWFTWTSKVQGSGM+RRLDRVL+NDDWLS WPT
Subjt:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT

Query:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ
        MLVNVLPWGISDH PIL YPSFQ ++KVVSFR FNHWV+DPSF+                               RRFGRHI+ LSEEVRI KEAMDIAQ
Subjt:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ

Query:  REVERNPMSDSAVRLEEASLRQKS-----RIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQAL
        REVERNPMSD  V   +ASL  ++     R+   + G +N  F   SV SR+S +  ++ MAVNYF NSLGSQEIG REL+P+IDDIVQFQWSEEC QAL
Subjt:  REVERNPMSDSAVRLEEASLRQKS-----RIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQAL

Query:  QLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADR
        Q+PISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSV+GEDF              +G+NATAITLIPKHNGAERLEDF PISC NVLYKCISKILADR
Subjt:  QLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADR

Query:  LRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIF--------------------CL----------------
        LRVWLPSFISSNQSAFI GRSIIENILLCQE VGGYHLNSGKPRCTLKVDLQKAYDSVNW F                    C+                
Subjt:  LRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIF--------------------CL----------------

Query:  -----GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLR
             GVRQGDPLS FLFVMVMEVLSRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKSSIF+AGVNNENAS LA CMGF RGNLPVRYLGLPLLTGRLR
Subjt:  -----GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLR

Query:  SNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGR
        SNDC PLIQRITS+IRS +ARVLSFAGRLQLV SVL SLQVYWA VFVLPAYVHN EGG GIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGR
Subjt:  SNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGR

Query:  SLWDVDSRVG------------EKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSIS
        SLWDVDSRVG            EKLKQHVRM VGNGNRCRV LDPWLQ GAILEQVGERVLYDAASR         A +    G D  W+  R GGFSI+
Subjt:  SLWDVDSRVG------------EKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSIS

Query:  SAWEAIRPRGGRVLWD---------------------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVE
        SAWEAIRPRGGRVLWD                           DR HRWDSSVP SCILC+GG+ESRDHLFFSC FGGDVWSRVLRIMASS+RIGHWGVE
Subjt:  SAWEAIRPRGGRVLWD---------------------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVE

Query:  LSWIS------------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLICTWIHARAGS
        LSWI             W VLWCATIYFIWNERNHRLHGGQA DPI++FHLICTWI ARAGS
Subjt:  LSWIS------------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLICTWIHARAGS

TYK19523.1 reverse transcriptase [Cucumis melo var. makuwa]0.0e+0070.67Show/hide
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKWALSIIEGS                   S  +      +TRVREGNFDSVSRRF NSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT
        FSFSTHV DEQF+TGT       VVMEDFNAIR HSEA  GSPIQGEMEDF++AIRDADLVEPSVQ NWFTWTSKVQGSGM+RRLDRVL+NDDWLS WPT
Subjt:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT

Query:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ
        MLVNVLPWGISDH PIL YPSFQ ++KV                       SRHEGVSPLVRLMRNL+LLKPILRRRFGRHI+ LSEEVRI KEAMDIAQ
Subjt:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ

Query:  REVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPIS
        RE++   ++     L E S                    +  V++ +S +  ++ MAVNYF NSLGSQEIG REL+P+IDDIVQFQWSEEC QALQ+PIS
Subjt:  REVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPIS

Query:  REEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWL
        REEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSV+GEDF              +G+NATAITLIPKHNGAERLEDF PISC NVLYKCISKILADRLRVWL
Subjt:  REEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWL

Query:  PSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIF--------------------CL---------------------
        PSFISSNQSAFI GRSIIENILLCQE VGGYHLNSGKPRCTLKVDLQKAYDSVNW F                    C+                     
Subjt:  PSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIF--------------------CL---------------------

Query:  GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCA
        GVRQGDPLS FLFVMVMEVLSRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKSSIF+AGVNNENAS LA CMGF RGNLPVRYLGLPLLTGRLRSNDC 
Subjt:  GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCA

Query:  PLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDV
        PLIQRITS+IRS +ARVLSFAGRLQLV SVL SLQVYWA VFVLPAYVHN EGG GIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDV
Subjt:  PLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDV

Query:  DSRVG------------EKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEA
        DSRVG            EKLKQHVRM VGNGNRCRV LDPWLQ GAILE+VGERVLYDAASR         A +    G D  W+  R GGFSI+SAWEA
Subjt:  DSRVG------------EKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEA

Query:  IRPRGGRVLWD---------------------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWIS
        IRPRGGRVLWD                           DR HRWDSSVP SCILC+GG+ESRDHLFFSC FGGDVWSRVLRIMASS+RIGHWGVELSWI 
Subjt:  IRPRGGRVLWD---------------------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWIS

Query:  ------------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLICTWIHARAGS
                    W VLWCATIYFIWNERNHRLHGGQA DPI++FHLICTWI ARAGS
Subjt:  ------------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLICTWIHARAGS

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]0.0e+0073.76Show/hide
Query:  MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDSHVEKGVGKSD
        MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDS VEKGVGKSD
Subjt:  MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDSHVEKGVGKSD

Query:  EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD-----------------TRVREGNFDSVSRRFSNSWDYS
        EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD                 TRVREGNFDSVSRRFSNSWDYS
Subjt:  EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD-----------------TRVREGNFDSVSRRFSNSWDYS

Query:  CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS-----------------------------------GVVMEDFNAIRVHSEAFRGSPIQG
        CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS                                   GVVMEDFNAIRVHSEAFRGSPIQG
Subjt:  CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS-----------------------------------GVVMEDFNAIRVHSEAFRGSPIQG

Query:  EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV
        EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV
Subjt:  EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV

Query:  VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSR
        VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQRE                                            
Subjt:  VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSR

Query:  MSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGINATAI
                 MAVNYFRNSLGSQEIG RELSPVIDDIVQFQWSEEC QALQLPISREEVRRVLFSMDSGKAPGPDGFSV              +GINATAI
Subjt:  MSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGINATAI

Query:  TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLG
        TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQE +     +   P   +   L+  +         G
Subjt:  TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLG

Query:  VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSSIFVAGVNNENASHLA
        VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK                               KFGELSGLFANPRKSSIFVAGVNNENASHLA
Subjt:  VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSSIFVAGVNNENASHLA

Query:  TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN--------------------
        TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN                    
Subjt:  TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN--------------------

Query:  -------------EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQ
                     EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVG                   CL  W    AIL  
Subjt:  -------------EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQ

Query:  VGERV----LYDAASRGRLDFLTLLAQME--YGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLF
           R     L+   S   +D    + ++         RVWVRGRQGGFSISSAWEAIRPRGGRVLWD                  GGVESRDHLFFSCLF
Subjt:  VGERV----LYDAASRGRLDFLTLLAQME--YGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLF

Query:  GGDVWSRVLRIMASSYRIGHWGVELSWI
        GGDVWSRVLRIMASSYRIGHWGVELSWI
Subjt:  GGDVWSRVLRIMASSYRIGHWGVELSWI

XP_008452126.1 PREDICTED: uncharacterized protein LOC103493225 [Cucumis melo]0.0e+0069.09Show/hide
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKWALSIIEGS                   S  +      +TRVREGNFDSVSRRF NSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT
        FSFSTHV DEQF+T     +   VVMEDFNAIR HSEA  GSPIQGEMEDF++AIRDADLVEPSVQ NWFTWTSKVQGSGM+RRLDRVL+NDDWLS WPT
Subjt:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT

Query:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ
        +                                                                           RFGRHI+ LSEEVRI KEAMDIAQ
Subjt:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ

Query:  REVERNPMSDSAVRLEEASLRQKS-----RIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQAL
        REVERNPMSD  V   +ASL  ++     R+   + G +N  F   SV SR+S +  ++ MAVNYF NSLGSQEIG REL+P+IDDIVQFQWSEEC QAL
Subjt:  REVERNPMSDSAVRLEEASLRQKS-----RIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQAL

Query:  QLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADR
        Q+PISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSV+GEDF              +G+NATAITLIPKHNGAERLEDF PISC NVLYKCISKILADR
Subjt:  QLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADR

Query:  LRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCL------------GVRQGDPLSPFLFVMVMEVLSRML
        LRVWLPSFISSNQSAFI GRSIIENILLCQE VGGYHLNSGKPRCTLKVDLQKAYDSVNW F              GVRQGDPLS FLFVMVMEVLSRML
Subjt:  LRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCL------------GVRQGDPLSPFLFVMVMEVLSRML

Query:  NKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRL
        NKIPQSFQFHHRCEK+FGELSGLFANPRKSSIF+AGVNNENAS LA CMGF RGNLPVRYLGLPLLTGRLRSNDC PLIQRITS+IRS +ARVLSFAGRL
Subjt:  NKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRL

Query:  QLVRSVLRSLQVYWASVFVLPAYVHNEGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVG------------EKLKQHVR
        QLV SVL SLQVYWA VFVLPAYVHNEGG GIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDVDSRVG            EKLKQHVR
Subjt:  QLVRSVLRSLQVYWASVFVLPAYVHNEGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVG------------EKLKQHVR

Query:  MMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWD--------------
        M VGNGNRCRV LDPWLQ GAILE+VGERVLYDAASR         A +    G D  W+  R GGFSI+SAWEAIRPRGGRVLWD              
Subjt:  MMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWD--------------

Query:  -------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWIS------------WHVLWCATIYFIW
                     DR HRWDSSVP SCILC+GG+ESRDHLFFSC FGGDVWSRVLRIMASS+RIGHWGVELSWI             W VLWCATIYFIW
Subjt:  -------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWIS------------WHVLWCATIYFIW

Query:  NERNHRLHGGQARDPIILFHLICTWIHARAGS
        NERNHRLHGGQA DPI++FHLICTWI ARAGS
Subjt:  NERNHRLHGGQARDPIILFHLICTWIHARAGS

TrEMBL top hitse value%identityAlignment
A0A1S3BSI8 uncharacterized protein LOC1034932250.0e+0069.09Show/hide
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKWALSIIEGS                   S  +      +TRVREGNFDSVSRRF NSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT
        FSFSTHV DEQF+T     +   VVMEDFNAIR HSEA  GSPIQGEMEDF++AIRDADLVEPSVQ NWFTWTSKVQGSGM+RRLDRVL+NDDWLS WPT
Subjt:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT

Query:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ
        +                                                                           RFGRHI+ LSEEVRI KEAMDIAQ
Subjt:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ

Query:  REVERNPMSDSAVRLEEASLRQKS-----RIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQAL
        REVERNPMSD  V   +ASL  ++     R+   + G +N  F   SV SR+S +  ++ MAVNYF NSLGSQEIG REL+P+IDDIVQFQWSEEC QAL
Subjt:  REVERNPMSDSAVRLEEASLRQKS-----RIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQAL

Query:  QLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADR
        Q+PISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSV+GEDF              +G+NATAITLIPKHNGAERLEDF PISC NVLYKCISKILADR
Subjt:  QLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADR

Query:  LRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCL------------GVRQGDPLSPFLFVMVMEVLSRML
        LRVWLPSFISSNQSAFI GRSIIENILLCQE VGGYHLNSGKPRCTLKVDLQKAYDSVNW F              GVRQGDPLS FLFVMVMEVLSRML
Subjt:  LRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCL------------GVRQGDPLSPFLFVMVMEVLSRML

Query:  NKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRL
        NKIPQSFQFHHRCEK+FGELSGLFANPRKSSIF+AGVNNENAS LA CMGF RGNLPVRYLGLPLLTGRLRSNDC PLIQRITS+IRS +ARVLSFAGRL
Subjt:  NKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRL

Query:  QLVRSVLRSLQVYWASVFVLPAYVHNEGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVG------------EKLKQHVR
        QLV SVL SLQVYWA VFVLPAYVHNEGG GIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDVDSRVG            EKLKQHVR
Subjt:  QLVRSVLRSLQVYWASVFVLPAYVHNEGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVG------------EKLKQHVR

Query:  MMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWD--------------
        M VGNGNRCRV LDPWLQ GAILE+VGERVLYDAASR         A +    G D  W+  R GGFSI+SAWEAIRPRGGRVLWD              
Subjt:  MMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWD--------------

Query:  -------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWIS------------WHVLWCATIYFIW
                     DR HRWDSSVP SCILC+GG+ESRDHLFFSC FGGDVWSRVLRIMASS+RIGHWGVELSWI             W VLWCATIYFIW
Subjt:  -------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWIS------------WHVLWCATIYFIW

Query:  NERNHRLHGGQARDPIILFHLICTWIHARAGS
        NERNHRLHGGQA DPI++FHLICTWI ARAGS
Subjt:  NERNHRLHGGQARDPIILFHLICTWIHARAGS

A0A5A7SPE5 Reverse transcriptase domain-containing protein0.0e+0073.63Show/hide
Query:  MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDSHVEKGVGKSD
        MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDS VEKGVGKSD
Subjt:  MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDSHVEKGVGKSD

Query:  EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD-----------------TRVREGNFDSVSRRFSNSWDYS
        EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD                 TRVREGNFDSVSRRFSNSWDYS
Subjt:  EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD-----------------TRVREGNFDSVSRRFSNSWDYS

Query:  CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS-----------------------------------GVVMEDFNAIRVHSEAFRGSPIQG
        CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS                                   GVVMEDFNAIRVHSEAFRGSPIQG
Subjt:  CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS-----------------------------------GVVMEDFNAIRVHSEAFRGSPIQG

Query:  EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV
        EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV
Subjt:  EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV

Query:  VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSR
        VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQRE                                            
Subjt:  VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSR

Query:  MSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGINATAI
                 MAVNYFRNSLGSQEIG RELSPVIDDIVQFQWSEEC QALQLPISREEVRRVLFSMDSGKAPGPDGFSV              +GINATAI
Subjt:  MSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGINATAI

Query:  TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLG
        TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQE +     +   P   +   L+  +         G
Subjt:  TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLG

Query:  VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSSIFVAGVNNENASHLA
        VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK                               KFGELSGLFANPRKSSIFVAGVNNENASHLA
Subjt:  VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSSIFVAGVNNENASHLA

Query:  TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN--------------------
        TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN                    
Subjt:  TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN--------------------

Query:  -------------EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQ
                     EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVG                   CL  W    AIL  
Subjt:  -------------EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQ

Query:  VGERV----LYDAASRGRLDFLTLLAQME--YGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRW
           R     L+   S   +D    + ++         RVWVRGRQGGFSISSAWEAIRPRGGRVLWD     W
Subjt:  VGERV----LYDAASRGRLDFLTLLAQME--YGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRW

A0A5A7UP65 Reverse transcriptase0.0e+0071.09Show/hide
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKWALSIIEGS                   S  +      +TRVREGNFDSVSRRF NSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT
        FSFSTHV DEQF+TGT       VVMEDFNAIR HSEA  GSPIQGEMEDF++AIRDADLVEPSVQGNWFTWTSKVQGSGM+RRLDRVL+NDDWLS WPT
Subjt:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT

Query:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ
        MLVNVLPWGISDH PIL YPSFQ ++KVVSFR FNHWV+DPSF+                               RRFGRHI+ LSEEVRI KEAMDIAQ
Subjt:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ

Query:  REVERNPMSDSAVRLEEASLRQKS-----RIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQAL
        REVERNPMSD  V   +ASL  ++     R+   + G +N  F   SV SR+S +  ++ MAVNYF NSLGSQEIG REL+P+IDDIVQFQWSEEC QAL
Subjt:  REVERNPMSDSAVRLEEASLRQKS-----RIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQAL

Query:  QLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADR
        Q+PISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSV+GEDF              +G+NATAITLIPKHNGAERLEDF PISC NVLYKCISKILADR
Subjt:  QLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADR

Query:  LRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIF--------------------CL----------------
        LRVWLPSFISSNQSAFI GRSIIENILLCQE VGGYHLNSGKPRCTLKVDLQKAYDSVNW F                    C+                
Subjt:  LRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIF--------------------CL----------------

Query:  -----GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLR
             GVRQGDPLS FLFVMVMEVLSRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKSSIF+AGVNNENAS LA CMGF RGNLPVRYLGLPLLTGRLR
Subjt:  -----GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLR

Query:  SNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGR
        SNDC PLIQRITS+IRS +ARVLSFAGRLQLV SVL SLQVYWA VFVLPAYVHN EGG GIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGR
Subjt:  SNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGR

Query:  SLWDVDSRVG------------EKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSIS
        SLWDVDSRVG            EKLKQHVRM VGNGNRCRV LDPWLQ GAILEQVGERVLYDAASR         A +    G D  W+  R GGFSI+
Subjt:  SLWDVDSRVG------------EKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSIS

Query:  SAWEAIRPRGGRVLWD---------------------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVE
        SAWEAIRPRGGRVLWD                           DR HRWDSSVP SCILC+GG+ESRDHLFFSC FGGDVWSRVLRIMASS+RIGHWGVE
Subjt:  SAWEAIRPRGGRVLWD---------------------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVE

Query:  LSWIS------------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLICTWIHARAGS
        LSWI             W VLWCATIYFIWNERNHRLHGGQA DPI++FHLICTWI ARAGS
Subjt:  LSWIS------------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLICTWIHARAGS

A0A5D3D7P6 Reverse transcriptase0.0e+0070.67Show/hide
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR
        MEVI PNSFGSLLEVGD DKWALSIIEGS                   S  +      +TRVREGNFDSVSRRF NSWDYSCSYSNSGVGRIWVMWKKNR
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGS-------------------SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNR

Query:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT
        FSFSTHV DEQF+TGT       VVMEDFNAIR HSEA  GSPIQGEMEDF++AIRDADLVEPSVQ NWFTWTSKVQGSGM+RRLDRVL+NDDWLS WPT
Subjt:  FSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPT

Query:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ
        MLVNVLPWGISDH PIL YPSFQ ++KV                       SRHEGVSPLVRLMRNL+LLKPILRRRFGRHI+ LSEEVRI KEAMDIAQ
Subjt:  MLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQ

Query:  REVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPIS
        RE++   ++     L E S                    +  V++ +S +  ++ MAVNYF NSLGSQEIG REL+P+IDDIVQFQWSEEC QALQ+PIS
Subjt:  REVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPIS

Query:  REEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWL
        REEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSV+GEDF              +G+NATAITLIPKHNGAERLEDF PISC NVLYKCISKILADRLRVWL
Subjt:  REEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWL

Query:  PSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIF--------------------CL---------------------
        PSFISSNQSAFI GRSIIENILLCQE VGGYHLNSGKPRCTLKVDLQKAYDSVNW F                    C+                     
Subjt:  PSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIF--------------------CL---------------------

Query:  GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCA
        GVRQGDPLS FLFVMVMEVLSRMLNKIPQSFQFHHRCEK+FGELSGLFANPRKSSIF+AGVNNENAS LA CMGF RGNLPVRYLGLPLLTGRLRSNDC 
Subjt:  GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCA

Query:  PLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDV
        PLIQRITS+IRS +ARVLSFAGRLQLV SVL SLQVYWA VFVLPAYVHN EGG GIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDV
Subjt:  PLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDV

Query:  DSRVG------------EKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEA
        DSRVG            EKLKQHVRM VGNGNRCRV LDPWLQ GAILE+VGERVLYDAASR         A +    G D  W+  R GGFSI+SAWEA
Subjt:  DSRVG------------EKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEA

Query:  IRPRGGRVLWD---------------------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWIS
        IRPRGGRVLWD                           DR HRWDSSVP SCILC+GG+ESRDHLFFSC FGGDVWSRVLRIMASS+RIGHWGVELSWI 
Subjt:  IRPRGGRVLWD---------------------------DRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWIS

Query:  ------------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLICTWIHARAGS
                    W VLWCATIYFIWNERNHRLHGGQA DPI++FHLICTWI ARAGS
Subjt:  ------------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLICTWIHARAGS

A0A5D3DXE4 Reverse transcriptase domain-containing protein0.0e+0073.76Show/hide
Query:  MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDSHVEKGVGKSD
        MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDS VEKGVGKSD
Subjt:  MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDSHVEKGVGKSD

Query:  EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD-----------------TRVREGNFDSVSRRFSNSWDYS
        EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD                 TRVREGNFDSVSRRFSNSWDYS
Subjt:  EFTLVTRKKSELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTD-----------------TRVREGNFDSVSRRFSNSWDYS

Query:  CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS-----------------------------------GVVMEDFNAIRVHSEAFRGSPIQG
        CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS                                   GVVMEDFNAIRVHSEAFRGSPIQG
Subjt:  CSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLS-----------------------------------GVVMEDFNAIRVHSEAFRGSPIQG

Query:  EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV
        EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV
Subjt:  EMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFIEV

Query:  VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSR
        VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQRE                                            
Subjt:  VTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAFFHRSVRSR

Query:  MSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGINATAI
                 MAVNYFRNSLGSQEIG RELSPVIDDIVQFQWSEEC QALQLPISREEVRRVLFSMDSGKAPGPDGFSV              +GINATAI
Subjt:  MSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGINATAI

Query:  TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLG
        TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQE +     +   P   +   L+  +         G
Subjt:  TLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLG

Query:  VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSSIFVAGVNNENASHLA
        VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK                               KFGELSGLFANPRKSSIFVAGVNNENASHLA
Subjt:  VRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCEK-------------------------------KFGELSGLFANPRKSSIFVAGVNNENASHLA

Query:  TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN--------------------
        TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN                    
Subjt:  TCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN--------------------

Query:  -------------EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQ
                     EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVG                   CL  W    AIL  
Subjt:  -------------EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILEQ

Query:  VGERV----LYDAASRGRLDFLTLLAQME--YGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLF
           R     L+   S   +D    + ++         RVWVRGRQGGFSISSAWEAIRPRGGRVLWD                  GGVESRDHLFFSCLF
Subjt:  VGERV----LYDAASRGRLDFLTLLAQME--YGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLF

Query:  GGDVWSRVLRIMASSYRIGHWGVELSWI
        GGDVWSRVLRIMASSYRIGHWGVELSWI
Subjt:  GGDVWSRVLRIMASSYRIGHWGVELSWI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.3e-2122.93Show/hide
Query:  YFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFF------------KGAWSVVGEDFL--GINATA
        Y+++   ++     E+   +D     + ++E  ++L  PI+  E+  ++ S+ + K+PGPDGF+  F+            K   S+  E  L       +
Subjt:  YFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFF------------KGAWSVVGEDFL--GINATA

Query:  ITLIPK-HNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFC
        I LIPK      + E+F PIS  N+  K ++KILA+R++  +   I  +Q  FIPG     NI      +   +    K    + +D +KA+D +   F 
Subjt:  ITLIPK-HNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFC

Query:  L-----------------------------------------GVRQGDPLSPFLFVMVMEVLSRMLNKIPQ-----------------------------
        L                                         G RQG PLSP LF +V+EVL+R + +  +                             
Subjt:  L-----------------------------------------GVRQGDPLSPFLFVMVMEVLSRMLNKIPQ-----------------------------

Query:  SFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLL--TGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLV
        S Q   +    F ++SG   N +KS  F+   N +  S +   + F   +  ++YLG+ L      L   +  PL++ I      W     S+ GR+ +V
Subjt:  SFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLL--TGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLV

Query:  RSVLRSLQVY
        +  +    +Y
Subjt:  RSVLRSLQVY

P08548 LINE-1 reverse transcriptase homolog9.3e-1721.71Show/hide
Query:  YFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGI--------------NATA
        Y++     +    +E+   ++     + S++  + L  PIS  E+   + ++   K+PGPDGF+  F++     +    L +                  
Subjt:  YFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGI--------------NATA

Query:  ITLIPK-HNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFC
        ITLIPK      R E++ PIS  N+  K ++KIL +R++  +   I  +Q  FIPG     NI      +   +    K    L +D +KA+D++   F 
Subjt:  ITLIPK-HNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFC

Query:  L-----------------------------------------GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCE--------------------
        +                                         G RQG PLSP LF +VMEVL+  + +       H   E                    
Subjt:  L-----------------------------------------GVRQGDPLSPFLFVMVMEVLSRMLNKIPQSFQFHHRCE--------------------

Query:  ---------KKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLL--TGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLV
                 K++  +SG   N  KS  F+   NN+    +   + F      ++YLG+ L      L   +   L + I   +  W     S+ GR+ +V
Subjt:  ---------KKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLL--TGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLV

Query:  RSVLRSLQVY
        +  +    +Y
Subjt:  RSVLRSLQVY

P0C2F6 Putative ribonuclease H protein At1g657502.9e-1025.98Show/hide
Query:  LPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGG
        +P+L  R+  +    +++R++S++  W  + LSFAGRL L ++VL S+ V+  S  +LP  + N                                 EGG
Subjt:  LPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGG

Query:  FGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEA------------YILKG--RSLW-DVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILE-Q
         G+R   S N A   K+ W +L    SLW   ++              I KG   S W  +   + + +   V  + G+G + R   D W+ G  +LE  
Subjt:  FGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEA------------YILKG--RSLW-DVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGAILE-Q

Query:  VGER
         GER
Subjt:  VGER

P11369 LINE-1 retrotransposable element ORF2 protein1.6e-1327.92Show/hide
Query:  LQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGI------------------NATAITLIPK-HNGAERLEDFHPISCYNVLYKCIS
        L  PIS +E+  V+ S+ + K+PGPDGFS  F++       ED + I                      ITLIPK      ++E+F PIS  N+  K ++
Subjt:  LQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGI------------------NATAITLIPK-HNGAERLEDFHPISCYNVLYKCIS

Query:  KILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLGVRQGDPLSPFLFVMVMEVLSRMLNKI
        KILA+R++  + + I  +Q  FIPG     NI      +   +    K    + +D +KA+D +   F + V +   +      M+  + S+ +  I
Subjt:  KILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLGVRQGDPLSPFLFVMVMEVLSRMLNKI

P14381 Transposon TX1 uncharacterized 149 kDa protein6.9e-2022.07Show/hide
Query:  FTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHS-NKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMR---
        FT+     G     R+DR+ ++   +S   +  + + P+  SDH  +    S   S  K   + F N  +ED  F + V   W            +    
Subjt:  FTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHS-NKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMR---

Query:  ---NLHLLKPILRRRFGRHI--KRLSEEVRIVKEAMDIAQR---------EVERNPMSDSAVRLEEASLR---QKSRIRWLKLGDQNTAFFHRSVRSRMS
            +HL   +L + + + +  +R +E   +  E +D+ QR         + E     ++   +E+   R    +SR++ L   D+ + FF+   + + +
Subjt:  ---NLHLLKPILRRRFGRHI--KRLSEEVRIVKEAMDIAQR---------EVERNPMSDSAVRLEEASLR---QKSRIRWLKLGDQNTAFFHRSVRSRMS

Query:  HNSLLSLMAVN--------YFRNSLGSQEIGCRELSPVIDDIVQFQW------SEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVV
           +  L A +          R+   S         P+  D  +  W      SE   + L+ PI+ +E+ + L  M   K+PG DG ++ FF+  W  +
Subjt:  HNSLLSLMAVN--------YFRNSLGSQEIGCRELSPVIDDIVQFQW------SEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVV

Query:  GEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLN
        G DF              L      ++L+PK      ++++ P+S  +  YK ++K ++ RL+  L   I  +QS  +PGR+I +N+ L ++ +  +   
Subjt:  GEDF--------------LGINATAITLIPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLN

Query:  SGKPRCTLKVDLQKAYDSVNWIFCLGVRQGDPLSP
        +G     L +D +KA+D V+  + +G  Q     P
Subjt:  SGKPRCTLKVDLQKAYDSVNWIFCLGVRQGDPLSP

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein1.1e-1530.73Show/hide
Query:  VVMEDFNAIRVHSEAFRGSP----IQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYP-ILF
        +V+ DFN I   +E +   P    +QG +ED    +RD+DLV+   +G  +TW++  Q + +LR+LDR +VN  WL+T+PT      P   SDH   ++ 
Subjt:  VVMEDFNAIRVHSEAFRGSP----IQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYP-ILF

Query:  YPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSE-EVRIVKEAMDIAQR--EVERNPMSDSAVRL
          +    +K  SF++F+     P FI  +   W +   V   +  +    LLK   +   G + +  S  + +++    D   R   V R   +  A  L
Subjt:  YPSFQHSNKVVSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSE-EVRIVKEAMDIAQR--EVERNPMSDSAVRL

Query:  EEASLRQKSRIRWLKLGD
         E+  +QKSRI+WLK GD
Subjt:  EEASLRQKSRIRWLKLGD

AT1G43760.1 DNAse I-like superfamily protein1.2e-4328.72Show/hide
Query:  SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAF----RGSPI
        SP   V  G+ +   E N  ++      SW    +Y  S +GRIW++W     S S  V  +       TD L  +++ DF+ I   S+ +       P+
Subjt:  SPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLTDLLSGVVMEDFNAIRVHSEAF----RGSPI

Query:  QGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFI
        +G +E+F   +RD+DLV+   +G  +TW++    + ++R+LDR + N DW S++P+ +      G+SDH P +             FR+F+     P+F+
Subjt:  QGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKVVSFRFFNHWVEDPSFI

Query:  EVVTRMWSRHEGV-SPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLE--------------EASLRQKSRIRWL
          +T  W     V S +  L  +L   K   +    +    +  + +   ++++  Q ++  NP SDS  R+E              E+  RQKSRI+WL
Subjt:  EVVTRMWSRHEGV-SPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLE--------------EASLRQKSRIRWL

Query:  KLGDQNTAFFHRSVRSRMSHNSLLSL----------------MAVNYFRNSLGS-QEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMD
        + GD NT FFH+ + +  + N +  L                M V Y+ + LGS  +I   +    I DI  F+ ++     L    S +E+   +F+M 
Subjt:  KLGDQNTAFFHRSVRSRMSHNSLLSL----------------MAVNYFRNSLGS-QEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMD

Query:  SGKAPGPDGFSVGFFKGAWSVVGEDFLG--------------INATAITLIPKHNGAERLEDFHPISCYNVLYKCIS
          KAPGPD F+  FF  +W VV +  +                NATAITLIPK  G ++L  F P+SC  V+YK I+
Subjt:  SGKAPGPDGFSVGFFKGAWSVVGEDFLG--------------INATAITLIPKHNGAERLEDFHPISCYNVLYKCIS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.5e-3027.45Show/hide
Query:  VAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNE-----G
        +AGV + + + +     FA G LPVRYLGLPLLT ++ ++D  PL+++I  +I  W AR LSFAGRLQL+ SV+ SL  +W S F LP+    E      
Subjt:  VAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNE-----G

Query:  GFGIRDGPSWNIASTLKILW----------------LMLTNSGSLWV--------AWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPW
         F +  GP  N     K+ W                L   N GS W         +W+   ILK R+L              V+  + NG+      D W
Subjt:  GFGIRDGPSWNIASTLKILW----------------LMLTNSGSLWV--------AWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPW

Query:  LQGGAILEQVGER----------------VLYDAASRGRLDFL----TLLAQMEY-GFGHDRVWVRGRQGG------FSISSAWEAIRPRGGRVLW----
         + G +++  G R                V+     R R D L     ++A++ + G       VR +  G      F+    W A R    +V W    
Subjt:  LQGGAILEQVGER----------------VLYDAASRGRLDFL----TLLAQMEY-GFGHDRVWVRGRQGG------FSISSAWEAIRPRGGRVLW----

Query:  -----------------------DDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWISWHVLWCATIYFIWN
                                DR+  W++   +SC+LC   VE+RDHLFF+C +  +V         + Y                 +  T++ +W 
Subjt:  -----------------------DDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWISWHVLWCATIYFIWN

Query:  ERNHRLHG
        ERN R HG
Subjt:  ERNHRLHG

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.3e-1725.18Show/hide
Query:  EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKG------RSLWDVDSRVGEK------------LKQHVRMMVGNGNRCRVCLDPWLQGG
        EGG G+R    WN    LK++W + +  GSLWV W   + L+G         W     V +              ++ +R  +GNG   R   D W   G
Subjt:  EGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKG------RSLWDVDSRVGEK------------LKQHVRMMVGNGNRCRVCLDPWLQGG

Query:  AILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFG
         +L  +G      AA   RL                R+ +       S++ A + I               W   V   C LC    E+RDHL   C F 
Subjt:  AILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFG

Query:  GDVWSRV-LRIMASSYRIGHWGVELSWIS----------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLI
          +WS V  R+   S    +W   L WI             ++  + +Y  W +RN+ LH     DP  +F +I
Subjt:  GDVWSRV-LRIMASSYRIGHWGVELSWIS----------WHVLWCATIYFIWNERNHRLHGGQARDPIILFHLI

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.2e-1230.83Show/hide
Query:  AWEAIRPRGGRVLWDDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWISW--------------HVLWCATI
        AW  +  R       DRL  W  S+P  C+LC    +SR HLFF C F G VW    R   +S  +      +  ++W               + + + +
Subjt:  AWEAIRPRGGRVLWDDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIMASSYRIGHWGVELSWISW--------------HVLWCATI

Query:  YFIWNERNHRLHGGQARDPIILFHLICTWIHAR
        Y IW ERN RLH G +R    +   I   I AR
Subjt:  YFIWNERNHRLHGGQARDPIILFHLICTWIHAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGCTGAAATTACTGTTAGTCTCAGAGGAGTAGATTTTAATGTTTCGGTTAATTATGAGTGGAAACCACGGAAGTGTAATTTGTGTTGTGCCTTTGGGCATTCTAG
TAGTAAGTGTTCTAGAAGTGTGGAGAGCAAAACCATTCAGGAGGAGGTTGTGCACAAGGGGGATGATGTAGACTTTGAACTTTGTGGGGAAGTTGTTCTTGAATCGTTCA
AACAGGTAGAGAAAGGTGAAATTAGGAGCTCTCCTAATAGGCATGACAGCCATGTAGAGAAGGGGGTGGGTAAAAGTGATGAGTTTACCCTTGTAACTCGCAAGAAGAGT
GAGTTAGTCTCTATTAGAGATCGTGGAAAGAGTATGGAGGTGATTATGCCAAACTCTTTTGGTAGTCTTTTGGAGGTGGGTGATGCTGACAAGTGGGCACTATCTATAAT
AGAGGGTTCATCGCCACCTTTACAGGTTGATGAAGGTACTGATACTAGAGTTCGAGAAGGTAATTTTGATTCTGTTTCTAGAAGGTTCAGTAACTCTTGGGATTACTCTT
GTAGTTACAGTAATAGTGGTGTTGGTCGGATTTGGGTGATGTGGAAGAAGAATCGTTTTTCTTTCTCTACTCATGTGATGGATGAGCAGTTTGTTACAGGTACGTTAACA
GATCTTTTATCCGGGGTTGTCATGGAAGATTTTAATGCTATTAGAGTTCATTCTGAAGCATTTAGGGGATCTCCTATTCAGGGTGAGATGGAGGATTTTAATCTGGCTAT
TCGTGATGCTGATTTAGTAGAGCCTTCGGTGCAGGGGAACTGGTTTACTTGGACTAGTAAAGTTCAGGGGTCTGGGATGTTGCGTCGTCTGGATCGTGTTTTAGTGAATG
ATGATTGGTTATCTACATGGCCTACTATGTTGGTAAATGTGCTTCCATGGGGTATCTCTGATCATTATCCTATTTTATTTTATCCTAGCTTTCAGCATAGTAACAAAGTG
GTGTCTTTTCGGTTCTTCAATCATTGGGTGGAGGATCCATCCTTTATTGAGGTGGTTACTAGGATGTGGAGTCGTCATGAGGGTGTCTCTCCGCTAGTGAGACTCATGAG
AAACCTTCATCTTCTCAAACCTATTCTCCGTAGACGATTTGGTAGACACATCAAAAGACTTAGTGAGGAGGTGCGCATTGTTAAGGAGGCCATGGATATAGCTCAAAGAG
AGGTTGAACGTAACCCAATGTCGGATTCAGCAGTTAGATTGGAGGAAGCATCGCTTCGGCAGAAATCCAGAATTCGATGGTTAAAGCTGGGTGATCAGAATACGGCTTTT
TTCCATCGATCCGTCCGCTCTCGTATGAGTCATAATTCTCTTCTTTCTCTAATGGCAGTTAATTATTTTAGAAACAGTTTGGGATCCCAGGAGATTGGCTGTAGAGAATT
GTCCCCAGTTATTGATGATATTGTTCAGTTTCAGTGGTCTGAGGAGTGTTATCAGGCGTTACAGTTACCTATAAGTCGAGAGGAAGTTAGGAGGGTCTTATTCTCTATGG
ATAGTGGAAAGGCTCCCGGTCCTGATGGGTTCTCTGTAGGTTTCTTCAAAGGTGCTTGGTCAGTGGTTGGGGAAGATTTTTTAGGAATTAATGCTACTGCTATCACCCTC
ATTCCTAAACATAATGGGGCTGAGCGTCTGGAGGACTTTCATCCTATTTCTTGCTATAATGTGTTATATAAATGCATTTCTAAAATTTTGGCTGATAGACTTCGTGTGTG
GCTTCCTTCTTTTATCAGTAGTAACCAGTCTGCTTTTATACCTGGGAGGAGTATTATCGAGAACATCCTGCTTTGTCAGGAACCGGTAGGAGGTTATCATCTTAATTCTG
GTAAACCTCGGTGTACTTTGAAAGTTGATCTTCAAAAAGCATATGACTCTGTTAATTGGATTTTCTGTTTGGGTGTAAGACAAGGTGATCCTTTATCTCCTTTTCTCTTT
GTTATGGTGATGGAAGTTCTTTCTCGTATGTTGAACAAGATTCCACAGAGTTTTCAATTTCACCATCGTTGTGAAAAGAAGTTTGGTGAGCTTTCAGGTTTGTTTGCAAA
TCCTAGGAAAAGCTCTATTTTTGTTGCAGGAGTTAATAATGAGAATGCTTCTCATCTGGCTACTTGTATGGGTTTTGCCCGTGGAAATCTCCCTGTTCGTTATCTTGGCC
TTCCTCTTCTTACTGGTCGGTTACGTTCTAATGATTGTGCTCCTTTGATTCAGCGTATCACTAGCCAGATTCGTTCTTGGAATGCTCGAGTTCTTTCGTTTGCTGGTAGA
CTGCAGCTTGTTCGTTCTGTGCTTCGTAGCCTTCAAGTTTACTGGGCTAGTGTGTTTGTTCTTCCTGCGTATGTGCATAATGAGGGTGGTTTTGGTATTCGTGATGGCCC
TTCTTGGAATATAGCTAGTACTTTGAAGATTTTGTGGCTTATGTTGACAAATTCGGGTTCTCTTTGGGTGGCTTGGGTGGAGGCTTATATACTAAAGGGGAGGTCATTGT
GGGATGTGGATAGTAGAGTGGGAGAGAAGCTGAAGCAGCATGTAAGGATGATGGTGGGAAATGGCAATAGATGTAGAGTTTGCCTTGATCCTTGGTTGCAGGGGGGTGCC
ATTCTAGAGCAGGTTGGGGAGAGAGTGCTTTATGATGCAGCAAGTCGGGGAAGGCTAGACTTTCTGACTTTATTGGCCCAGATGGAGTATGGCTTTGGCCACGATCGGGT
CTGGGTTCGTGGTCGTCAGGGTGGTTTCTCTATTTCAAGTGCATGGGAAGCTATTCGTCCTAGGGGTGGTCGGGTTCTTTGGGATGATAGATTGCATAGGTGGGATAGTT
CTGTACCGACGTCGTGCATTCTATGTCAGGGGGGTGTGGAGTCTCGCGATCACTTATTCTTTTCGTGTCTGTTTGGGGGGGATGTTTGGTCTAGGGTTCTTCGGATCATG
GCTTCCTCATATAGGATTGGGCATTGGGGGGTTGAGTTGTCTTGGATTTCTTGGCATGTTCTTTGGTGCGCAACTATATATTTTATTTGGAACGAGCGGAATCATCGGTT
ACATGGTGGTCAGGCTCGGGATCCTATTATCCTTTTCCATCTTATTTGTACGTGGATTCATGCTCGTGCTGGCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGCTGAAATTACTGTTAGTCTCAGAGGAGTAGATTTTAATGTTTCGGTTAATTATGAGTGGAAACCACGGAAGTGTAATTTGTGTTGTGCCTTTGGGCATTCTAG
TAGTAAGTGTTCTAGAAGTGTGGAGAGCAAAACCATTCAGGAGGAGGTTGTGCACAAGGGGGATGATGTAGACTTTGAACTTTGTGGGGAAGTTGTTCTTGAATCGTTCA
AACAGGTAGAGAAAGGTGAAATTAGGAGCTCTCCTAATAGGCATGACAGCCATGTAGAGAAGGGGGTGGGTAAAAGTGATGAGTTTACCCTTGTAACTCGCAAGAAGAGT
GAGTTAGTCTCTATTAGAGATCGTGGAAAGAGTATGGAGGTGATTATGCCAAACTCTTTTGGTAGTCTTTTGGAGGTGGGTGATGCTGACAAGTGGGCACTATCTATAAT
AGAGGGTTCATCGCCACCTTTACAGGTTGATGAAGGTACTGATACTAGAGTTCGAGAAGGTAATTTTGATTCTGTTTCTAGAAGGTTCAGTAACTCTTGGGATTACTCTT
GTAGTTACAGTAATAGTGGTGTTGGTCGGATTTGGGTGATGTGGAAGAAGAATCGTTTTTCTTTCTCTACTCATGTGATGGATGAGCAGTTTGTTACAGGTACGTTAACA
GATCTTTTATCCGGGGTTGTCATGGAAGATTTTAATGCTATTAGAGTTCATTCTGAAGCATTTAGGGGATCTCCTATTCAGGGTGAGATGGAGGATTTTAATCTGGCTAT
TCGTGATGCTGATTTAGTAGAGCCTTCGGTGCAGGGGAACTGGTTTACTTGGACTAGTAAAGTTCAGGGGTCTGGGATGTTGCGTCGTCTGGATCGTGTTTTAGTGAATG
ATGATTGGTTATCTACATGGCCTACTATGTTGGTAAATGTGCTTCCATGGGGTATCTCTGATCATTATCCTATTTTATTTTATCCTAGCTTTCAGCATAGTAACAAAGTG
GTGTCTTTTCGGTTCTTCAATCATTGGGTGGAGGATCCATCCTTTATTGAGGTGGTTACTAGGATGTGGAGTCGTCATGAGGGTGTCTCTCCGCTAGTGAGACTCATGAG
AAACCTTCATCTTCTCAAACCTATTCTCCGTAGACGATTTGGTAGACACATCAAAAGACTTAGTGAGGAGGTGCGCATTGTTAAGGAGGCCATGGATATAGCTCAAAGAG
AGGTTGAACGTAACCCAATGTCGGATTCAGCAGTTAGATTGGAGGAAGCATCGCTTCGGCAGAAATCCAGAATTCGATGGTTAAAGCTGGGTGATCAGAATACGGCTTTT
TTCCATCGATCCGTCCGCTCTCGTATGAGTCATAATTCTCTTCTTTCTCTAATGGCAGTTAATTATTTTAGAAACAGTTTGGGATCCCAGGAGATTGGCTGTAGAGAATT
GTCCCCAGTTATTGATGATATTGTTCAGTTTCAGTGGTCTGAGGAGTGTTATCAGGCGTTACAGTTACCTATAAGTCGAGAGGAAGTTAGGAGGGTCTTATTCTCTATGG
ATAGTGGAAAGGCTCCCGGTCCTGATGGGTTCTCTGTAGGTTTCTTCAAAGGTGCTTGGTCAGTGGTTGGGGAAGATTTTTTAGGAATTAATGCTACTGCTATCACCCTC
ATTCCTAAACATAATGGGGCTGAGCGTCTGGAGGACTTTCATCCTATTTCTTGCTATAATGTGTTATATAAATGCATTTCTAAAATTTTGGCTGATAGACTTCGTGTGTG
GCTTCCTTCTTTTATCAGTAGTAACCAGTCTGCTTTTATACCTGGGAGGAGTATTATCGAGAACATCCTGCTTTGTCAGGAACCGGTAGGAGGTTATCATCTTAATTCTG
GTAAACCTCGGTGTACTTTGAAAGTTGATCTTCAAAAAGCATATGACTCTGTTAATTGGATTTTCTGTTTGGGTGTAAGACAAGGTGATCCTTTATCTCCTTTTCTCTTT
GTTATGGTGATGGAAGTTCTTTCTCGTATGTTGAACAAGATTCCACAGAGTTTTCAATTTCACCATCGTTGTGAAAAGAAGTTTGGTGAGCTTTCAGGTTTGTTTGCAAA
TCCTAGGAAAAGCTCTATTTTTGTTGCAGGAGTTAATAATGAGAATGCTTCTCATCTGGCTACTTGTATGGGTTTTGCCCGTGGAAATCTCCCTGTTCGTTATCTTGGCC
TTCCTCTTCTTACTGGTCGGTTACGTTCTAATGATTGTGCTCCTTTGATTCAGCGTATCACTAGCCAGATTCGTTCTTGGAATGCTCGAGTTCTTTCGTTTGCTGGTAGA
CTGCAGCTTGTTCGTTCTGTGCTTCGTAGCCTTCAAGTTTACTGGGCTAGTGTGTTTGTTCTTCCTGCGTATGTGCATAATGAGGGTGGTTTTGGTATTCGTGATGGCCC
TTCTTGGAATATAGCTAGTACTTTGAAGATTTTGTGGCTTATGTTGACAAATTCGGGTTCTCTTTGGGTGGCTTGGGTGGAGGCTTATATACTAAAGGGGAGGTCATTGT
GGGATGTGGATAGTAGAGTGGGAGAGAAGCTGAAGCAGCATGTAAGGATGATGGTGGGAAATGGCAATAGATGTAGAGTTTGCCTTGATCCTTGGTTGCAGGGGGGTGCC
ATTCTAGAGCAGGTTGGGGAGAGAGTGCTTTATGATGCAGCAAGTCGGGGAAGGCTAGACTTTCTGACTTTATTGGCCCAGATGGAGTATGGCTTTGGCCACGATCGGGT
CTGGGTTCGTGGTCGTCAGGGTGGTTTCTCTATTTCAAGTGCATGGGAAGCTATTCGTCCTAGGGGTGGTCGGGTTCTTTGGGATGATAGATTGCATAGGTGGGATAGTT
CTGTACCGACGTCGTGCATTCTATGTCAGGGGGGTGTGGAGTCTCGCGATCACTTATTCTTTTCGTGTCTGTTTGGGGGGGATGTTTGGTCTAGGGTTCTTCGGATCATG
GCTTCCTCATATAGGATTGGGCATTGGGGGGTTGAGTTGTCTTGGATTTCTTGGCATGTTCTTTGGTGCGCAACTATATATTTTATTTGGAACGAGCGGAATCATCGGTT
ACATGGTGGTCAGGCTCGGGATCCTATTATCCTTTTCCATCTTATTTGTACGTGGATTCATGCTCGTGCTGGCTCTTGA
Protein sequenceShow/hide protein sequence
MSAEITVSLRGVDFNVSVNYEWKPRKCNLCCAFGHSSSKCSRSVESKTIQEEVVHKGDDVDFELCGEVVLESFKQVEKGEIRSSPNRHDSHVEKGVGKSDEFTLVTRKKS
ELVSIRDRGKSMEVIMPNSFGSLLEVGDADKWALSIIEGSSPPLQVDEGTDTRVREGNFDSVSRRFSNSWDYSCSYSNSGVGRIWVMWKKNRFSFSTHVMDEQFVTGTLT
DLLSGVVMEDFNAIRVHSEAFRGSPIQGEMEDFNLAIRDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNDDWLSTWPTMLVNVLPWGISDHYPILFYPSFQHSNKV
VSFRFFNHWVEDPSFIEVVTRMWSRHEGVSPLVRLMRNLHLLKPILRRRFGRHIKRLSEEVRIVKEAMDIAQREVERNPMSDSAVRLEEASLRQKSRIRWLKLGDQNTAF
FHRSVRSRMSHNSLLSLMAVNYFRNSLGSQEIGCRELSPVIDDIVQFQWSEECYQALQLPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVGEDFLGINATAITL
IPKHNGAERLEDFHPISCYNVLYKCISKILADRLRVWLPSFISSNQSAFIPGRSIIENILLCQEPVGGYHLNSGKPRCTLKVDLQKAYDSVNWIFCLGVRQGDPLSPFLF
VMVMEVLSRMLNKIPQSFQFHHRCEKKFGELSGLFANPRKSSIFVAGVNNENASHLATCMGFARGNLPVRYLGLPLLTGRLRSNDCAPLIQRITSQIRSWNARVLSFAGR
LQLVRSVLRSLQVYWASVFVLPAYVHNEGGFGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGEKLKQHVRMMVGNGNRCRVCLDPWLQGGA
ILEQVGERVLYDAASRGRLDFLTLLAQMEYGFGHDRVWVRGRQGGFSISSAWEAIRPRGGRVLWDDRLHRWDSSVPTSCILCQGGVESRDHLFFSCLFGGDVWSRVLRIM
ASSYRIGHWGVELSWISWHVLWCATIYFIWNERNHRLHGGQARDPIILFHLICTWIHARAGS