CuGenDBv2

Pay0004189 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0004189
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr09:13504764..13508534
RNA-Seq Expression: Pay0004189
Synteny: Pay0004189
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR026960 - Reverse transcriptase zinc-binding domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (hit; e value; % identity; alignment)
KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]; e value: 0.0e+00; identity: 61.76%
Query:  NSW--DYSCSYSNSDVGRIW-----------------------------------------GDVEEESFFFFYSFEITSAWSRPGVVMGDFNAIRVHSEA
        N W  +YSCSYSNS VGRIW                                             E  F +   FEITSAWS  GVVMGDFNAIRVHSEA
Subjt:  NSW--DYSCSYSNSDVGRIW-----------------------------------------GDVEEESFFFFYSFEITSAWSRPGVVMGDFNAIRVHSEA

Query:  FGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTIR------------------------------------
        FGGSPIQGEME+FDLAI DADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVN++W SAWPT+R                                    
Subjt:  FGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTIR------------------------------------

Query:  ---------------------------------------FGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKS
                                               FGRHIKSLSE+V  AK AMD+AQREVERNP+SDVLS QA LATETFWT VRLEEASLRQKS
Subjt:  ---------------------------------------FGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKS

Query:  RIRWLKLGDQNTAFFHRS-----------------------------MAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVL
        ++RWL LGDQNTAFFHRS                             MAVNYFSNSLGSQEIGYRELSP+IDDIVQFQWSE+CCQALQLPISREEVRRVL
Subjt:  RIRWLKLGDQNTAFFHRS-----------------------------MAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVL

Query:  FSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQ
        FSMDSGKAPGPDGFS GF+KGAWSVVGEDF              +GVNATAITLIPKH GAERLEDFRPISCCN LYKCISKILADRLR WLPSFISSNQ
Subjt:  FSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQ

Query:  SAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPL
        SAFIPGRSIIENILLCQELVGGYHLNSGKP CTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK                             KG+RQ DPL
Subjt:  SAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPL

Query:  STFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-------------------------------KFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVR
        S FLFVMVMEVLSRMLNKIPQSF FHHRCEK                               KFGE S LFANPRKSSIFV GVNNE ASHLAAC+G   
Subjt:  STFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-------------------------------KFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVR

Query:  GNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------
           P            LRS D APLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN                           
Subjt:  GNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------

Query:  ------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWLPEV
              EGGLGIRDGPSWNIA+TLKIL   LTN GSLWVAW+EAYILKG+SLWDVDSRVG+SWCLRAILRKREK+KH V  +                 V
Subjt:  ------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWLPEV

Query:  MYDVASRRKARLSDFIDSDGEWLWPR--------------VSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAI
        +YD ASRR+A+LSDFID +GEWLWPR              VSPCLSVSDSWVWVPGR+GGFSIASAWEA+ PRGGRVLWDGLLW GGNIPKH FCAWLAI
Subjt:  MYDVASRRKARLSDFIDSDGEWLWPR--------------VSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAI

Query:  KDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI------KVLGRRNHR
        KDRL T DRLHRWDSS+P+SCILCQGGVESRDHLFFSC FGG               +G +G   SW+   G+G G    L  V+       +   RNHR
Subjt:  KDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI------KVLGRRNHR

Query:  LHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
        LHGG+ARDPI+LFHLIC+WIRARAGSWREDAHLPF
Subjt:  LHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

KAA0057642.1 reverse transcriptase [Cucumis melo var. makuwa]; e value: 0.0e+00; identity: 71.05%
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES
        MEVI PNSFGSLLEVGD DKWALSIIEGSP     +++KSKAVVDFLGSSSVGFCC LETRVREGNFDSVSRRFGNSWDYSCSYSNS VGRIW  + +++
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES

Query:  FFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTI-----
         F F +      +    VVM DFNAIR HSEA GGSPIQGEMEDFD+AI DADLVEPSVQGNWFTWTSKVQGSGM+RRLDRVL+N+DW SAWPT+     
Subjt:  FFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTI-----

Query:  ---------------------------------------RFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQK
                                               RFGRHI+SLSE+VR AK AMD+AQREVERNPMSDVLS QA LATETFWT VRLE+   R  
Subjt:  ---------------------------------------RFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQK

Query:  SRIRWLKLGDQNTAFFH---RSMAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSV
          + ++     +    H     MAVNYFSNSLGSQEIGYREL+P+IDDIVQFQWSE+CCQALQ+PISREEVRRVLFSMDSGKAPGPDGFS GFFKGAWSV
Subjt:  SRIRWLKLGDQNTAFFH---RSMAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSV

Query:  VGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHL
        +GEDF              +GVNATAITLIPKHNGAERLEDFRPISCCN LYKCISKILADRLR WLPSFISSNQSAFI GRSIIENILLCQELVGGYHL
Subjt:  VGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHL

Query:  NSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHF
        NSGKP CTLKVDLQKAYDSVNWDFLFGL I+I TPLK                             KGVRQ DPLS FLFVMVMEVLSRMLNKIPQSF F
Subjt:  NSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHF

Query:  HHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRS
        HHRCEK+FGELS LFANPRKSSIF+AGVNNENAS LAACMGFVRGNLPVRYLGLPLL GRLRSND  PLIQRITSRIRS +ARVLSFAGRLQLV SVL S
Subjt:  HHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRS

Query:  LQVYWASVFVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNS
        LQVYWA VFVLPAYVHN EGGLGIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDVDSRVG+SWCLRAILRK+EKLK  VRMKVGNGN 
Subjt:  LQVYWASVFVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNS

Query:  CRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKH
        CRVWLDPWL            V+YD ASRR+A LS+FI  DGEWLWP                  RGGFSIASAWEA+RPRGGRVLWDGLLW GGNIPKH
Subjt:  CRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKH

Query:  FFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI------K
         FCAWLAIKDRLGT DR HRWDSSVP+SCILC+GG+ESRDHLFFSC FGG               +G +G   SW+   G+  G    L  V+       
Subjt:  FFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI------K

Query:  VLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
        +   RNHRLHGGQA DPIV+FHLIC+WIRARAGSWREDA+LPF
Subjt:  VLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

KAA0062318.1 uncharacterized protein E6C27_scaffold154G00690 [Cucumis melo var. makuwa]; e value: 0.0e+00; identity: 70.3%
Query:  MLRRLDRVLVNEDWFSAWPTI---------------------------------------------------------------------------RFGR
        MLRRLDRVLVNEDWFSAWPT+                                                                           RFGR
Subjt:  MLRRLDRVLVNEDWFSAWPTI---------------------------------------------------------------------------RFGR

Query:  HIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKSRIRWLKLGDQNTAFFHRSMAVNYFSNSL-------GSQEIGY
        HIKSLSE+VRNAKAA+DLAQREVERNPMSDVLSHQAGL+TETFWT VRLEEASLRQKSRIRWLKLGDQNTAFFHRS+      N+L       G++EIGY
Subjt:  HIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKSRIRWLKLGDQNTAFFHRSMAVNYFSNSL-------GSQEIGY

Query:  RELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDFLGVNATAITLIPKHNGAERLEDFRPISCCNALYK
        RELSPVIDDIVQFQWSE+CCQALQLPISREEVRRVLFSMDSGKAPGPDGFSA              +GVNATAITLIPKHNGAERLEDFRPISCCNALYK
Subjt:  RELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDFLGVNATAITLIPKHNGAERLEDFRPISCCNALYK

Query:  CISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK----------------
        CISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKP CTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK                
Subjt:  CISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK----------------

Query:  -------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-------------------------------KFGELSSLFANPRKSS
                     KGVRQ DPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEK                               KFGELS LFANPRKSS
Subjt:  -------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-------------------------------KFGELSSLFANPRKSS

Query:  IFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-----
        IFVAGVNNENASHLA CMGFVRGNL VRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLV SVLRS QVYWASVFVLPAYVHN     
Subjt:  IFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-----

Query:  ----------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHL
                                    EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHL
Subjt:  ----------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHL

Query:  VRMKVGNGNSCRVWLDPWLPE----------VMYDVASRRKARLSDFIDSDGEWLWPR--------------VSPCLSVSDSWVWVPGRRGGFSIASAWE
        VRMKVGNGNS RVWLDPWLPE          VMYD ASRRKARLSDFID DGEWLWPR              VSPCLSVSDSWVWVPGRRGGFSIASAWE
Subjt:  VRMKVGNGNSCRVWLDPWLPE----------VMYDVASRRKARLSDFIDSDGEWLWPR--------------VSPCLSVSDSWVWVPGRRGGFSIASAWE

Query:  AVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGGMFGLGFFGSWVPLIGLGIGGLSCLGF
        AVRPRGGRVLWDGLLW GGNI KHFFCAWLAIKDRLGTIDRLHRWDSSVPM CI                       L FFGSWVPLIGLGIGGLSCLGF
Subjt:  AVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGGMFGLGFFGSWVPLIGLGIGGLSCLGF

Query:  VIKVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
        VIK             ARDPIVLFHLICSWIRARAGSWR+DAHLPF
Subjt:  VIKVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

TYK19523.1 reverse transcriptase [Cucumis melo var. makuwa]; e value: 0.0e+00; identity: 68.61%
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES
        MEVI PNSFGSLLEVGD DKWALSIIEGSP     +++KSKAVVDFLGSSSVGFCC LETRVREGNFDSVSRRFGNSWDYSCSYSNS VGRIW  + +++
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES

Query:  FFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTI-----
         F F +      +    VVM DFNAIR HSEA GGSPIQGEMEDFD+AI DADLVEPSVQ NWFTWTSKVQGSGM+RRLDRVL+N+DW SAWPT+     
Subjt:  FFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTI-----

Query:  -----------------------------------------------RFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRL
                                                       RFGRHI+SLSE+VR AK AMD+AQRE++                   W  + L
Subjt:  -----------------------------------------------RFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRL

Query:  EEASLRQKSRIRWLKLGDQNTAFFHRSMAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFK
         E S    S ++         A     MAVNYFSNSLGSQEIGYREL+P+IDDIVQFQWSE+CCQALQ+PISREEVRRVLFSMDSGKAPGPDGFS GFFK
Subjt:  EEASLRQKSRIRWLKLGDQNTAFFHRSMAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFK

Query:  GAWSVVGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELV
        GAWSV+GEDF              +GVNATAITLIPKHNGAERLEDFRPISCCN LYKCISKILADRLR WLPSFISSNQSAFI GRSIIENILLCQELV
Subjt:  GAWSVVGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELV

Query:  GGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIP
        GGYHLNSGKP CTLKVDLQKAYDSVNWDFLFGL I+I TPLK                             KGVRQ DPLS FLFVMVMEVLSRMLNKIP
Subjt:  GGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIP

Query:  QSFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVR
        QSF FHHRCEK+FGELS LFANPRKSSIF+AGVNNENAS LAACMGFVRGNLPVRYLGLPLL GRLRSND  PLIQRITSRIRS +ARVLSFAGRLQLV 
Subjt:  QSFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVR

Query:  SVLRSLQVYWASVFVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKV
        SVL SLQVYWA VFVLPAYVHN EGGLGIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDVDSRVG+SWCLRAILRK+EKLK  VRMKV
Subjt:  SVLRSLQVYWASVFVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKV

Query:  GNGNSCRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGG
        GNGN CRVWLDPWL            V+YD ASRR+A LS+FI  DGEWLWP                  RGGFSIASAWEA+RPRGGRVLWDGLLW GG
Subjt:  GNGNSCRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGG

Query:  NIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI--
        NIPKH FCAWLAIKDRLGT DR HRWDSSVP+SCILC+GG+ESRDHLFFSC FGG               +G +G   SW+   G+  G    L  V+  
Subjt:  NIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI--

Query:  ----KVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
             +   RNHRLHGGQA DPIV+FHLIC+WIRARAGSWREDA+LPF
Subjt:  ----KVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

XP_008452126.1 PREDICTED: uncharacterized protein LOC103493225 [Cucumis melo]; e value: 0.0e+00; identity: 76.31%
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES
        MEVI PNSFGSLLEVGD DKWALSIIEGSP     +++KSKAVVDFLGSSSVGFCC LETRVREGNFDSVSRRFGNSWDYSCSYSNS VGRIW   ++  
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES

Query:  FFFFYSFE----ITSAWS--RPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPT
        F F         IT  +    P VVM DFNAIR HSEA GGSPIQGEMEDFD+AI DADLVEPSVQ NWFTWTSKVQGSGM+RRLDRVL+N+DW SAWPT
Subjt:  FFFFYSFE----ITSAWS--RPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPT

Query:  IRFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKSRIRWLKLGDQNTAFFH---RSMAVNYFSNSLGSQEIG
        IRFGRHI+SLSE+VR AK AMD+AQREVERNPMSDVLS QA LATETFWT VRLE+   R    + ++     +    H     MAVNYFSNSLGSQEIG
Subjt:  IRFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKSRIRWLKLGDQNTAFFH---RSMAVNYFSNSLGSQEIG

Query:  YRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAER
        YREL+P+IDDIVQFQWSE+CCQALQ+PISREEVRRVLFSMDSGKAPGPDGFS GFFKGAWSV+GEDF              +GVNATAITLIPKHNGAER
Subjt:  YRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAER

Query:  LEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLKK
        LEDFRPISCCN LYKCISKILADRLR WLPSFISSNQSAFI GRSIIENILLCQELVGGYHLNSGKP CTLKVDLQKAYDSVNWDFLFGL I+I TPLKK
Subjt:  LEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLKK

Query:  GVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYA
        GVRQ DPLS FLFVMVMEVLSRMLNKIPQSF FHHRCEK+FGELS LFANPRKSSIF+AGVNNENAS LAACMGFVRGNLPVRYLGLPLL GRLRSND  
Subjt:  GVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYA

Query:  PLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVD
        PLIQRITSRIRS +ARVLSFAGRLQLV SVL SLQVYWA VFVLPAYVHNEGGLGIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDVD
Subjt:  PLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVD

Query:  SRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGG
        SRVG+SWCLRAILRK+EKLK  VRMKVGNGN CRVWLDPWL            V+YD ASRR+A LS+FI  DGEWLWP                  RGG
Subjt:  SRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGG

Query:  FSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGF
        FSIASAWEA+RPRGGRVLWDGLLW GGNIPKH FCAWLAIKDRLGT DR HRWDSSVP+SCILC+GG+ESRDHLFFSC FGG               +G 
Subjt:  FSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGF

Query:  FG---SWVPLIGLGIGGLSCLGFVI------KVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
        +G   SW+   G+  G    L  V+       +   RNHRLHGGQA DPIV+FHLIC+WIRARAGSWREDA+LPF
Subjt:  FG---SWVPLIGLGIGGLSCLGFVI------KVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

TrEMBL top hits (hit; e value; % identity; alignment)
A0A1S3BSI8 uncharacterized protein LOC103493225; e value: 0.0e+00; identity: 76.31%
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES
        MEVI PNSFGSLLEVGD DKWALSIIEGSP     +++KSKAVVDFLGSSSVGFCC LETRVREGNFDSVSRRFGNSWDYSCSYSNS VGRIW   ++  
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES

Query:  FFFFYSFE----ITSAWS--RPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPT
        F F         IT  +    P VVM DFNAIR HSEA GGSPIQGEMEDFD+AI DADLVEPSVQ NWFTWTSKVQGSGM+RRLDRVL+N+DW SAWPT
Subjt:  FFFFYSFE----ITSAWS--RPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPT

Query:  IRFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKSRIRWLKLGDQNTAFFH---RSMAVNYFSNSLGSQEIG
        IRFGRHI+SLSE+VR AK AMD+AQREVERNPMSDVLS QA LATETFWT VRLE+   R    + ++     +    H     MAVNYFSNSLGSQEIG
Subjt:  IRFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKSRIRWLKLGDQNTAFFH---RSMAVNYFSNSLGSQEIG

Query:  YRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAER
        YREL+P+IDDIVQFQWSE+CCQALQ+PISREEVRRVLFSMDSGKAPGPDGFS GFFKGAWSV+GEDF              +GVNATAITLIPKHNGAER
Subjt:  YRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAER

Query:  LEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLKK
        LEDFRPISCCN LYKCISKILADRLR WLPSFISSNQSAFI GRSIIENILLCQELVGGYHLNSGKP CTLKVDLQKAYDSVNWDFLFGL I+I TPLKK
Subjt:  LEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLKK

Query:  GVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYA
        GVRQ DPLS FLFVMVMEVLSRMLNKIPQSF FHHRCEK+FGELS LFANPRKSSIF+AGVNNENAS LAACMGFVRGNLPVRYLGLPLL GRLRSND  
Subjt:  GVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYA

Query:  PLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVD
        PLIQRITSRIRS +ARVLSFAGRLQLV SVL SLQVYWA VFVLPAYVHNEGGLGIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDVD
Subjt:  PLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVD

Query:  SRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGG
        SRVG+SWCLRAILRK+EKLK  VRMKVGNGN CRVWLDPWL            V+YD ASRR+A LS+FI  DGEWLWP                  RGG
Subjt:  SRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGG

Query:  FSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGF
        FSIASAWEA+RPRGGRVLWDGLLW GGNIPKH FCAWLAIKDRLGT DR HRWDSSVP+SCILC+GG+ESRDHLFFSC FGG               +G 
Subjt:  FSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGF

Query:  FG---SWVPLIGLGIGGLSCLGFVI------KVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
        +G   SW+   G+  G    L  V+       +   RNHRLHGGQA DPIV+FHLIC+WIRARAGSWREDA+LPF
Subjt:  FG---SWVPLIGLGIGGLSCLGFVI------KVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

A0A5A7TZS0 Reverse transcriptase domain-containing protein; e value: 0.0e+00; identity: 61.76%
Query:  NSW--DYSCSYSNSDVGRIW-----------------------------------------GDVEEESFFFFYSFEITSAWSRPGVVMGDFNAIRVHSEA
        N W  +YSCSYSNS VGRIW                                             E  F +   FEITSAWS  GVVMGDFNAIRVHSEA
Subjt:  NSW--DYSCSYSNSDVGRIW-----------------------------------------GDVEEESFFFFYSFEITSAWSRPGVVMGDFNAIRVHSEA

Query:  FGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTIR------------------------------------
        FGGSPIQGEME+FDLAI DADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVN++W SAWPT+R                                    
Subjt:  FGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTIR------------------------------------

Query:  ---------------------------------------FGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKS
                                               FGRHIKSLSE+V  AK AMD+AQREVERNP+SDVLS QA LATETFWT VRLEEASLRQKS
Subjt:  ---------------------------------------FGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKS

Query:  RIRWLKLGDQNTAFFHRS-----------------------------MAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVL
        ++RWL LGDQNTAFFHRS                             MAVNYFSNSLGSQEIGYRELSP+IDDIVQFQWSE+CCQALQLPISREEVRRVL
Subjt:  RIRWLKLGDQNTAFFHRS-----------------------------MAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVL

Query:  FSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQ
        FSMDSGKAPGPDGFS GF+KGAWSVVGEDF              +GVNATAITLIPKH GAERLEDFRPISCCN LYKCISKILADRLR WLPSFISSNQ
Subjt:  FSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQ

Query:  SAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPL
        SAFIPGRSIIENILLCQELVGGYHLNSGKP CTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK                             KG+RQ DPL
Subjt:  SAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPL

Query:  STFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-------------------------------KFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVR
        S FLFVMVMEVLSRMLNKIPQSF FHHRCEK                               KFGE S LFANPRKSSIFV GVNNE ASHLAAC+G   
Subjt:  STFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-------------------------------KFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVR

Query:  GNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------
           P            LRS D APLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN                           
Subjt:  GNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------

Query:  ------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWLPEV
              EGGLGIRDGPSWNIA+TLKIL   LTN GSLWVAW+EAYILKG+SLWDVDSRVG+SWCLRAILRKREK+KH V  +                 V
Subjt:  ------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWLPEV

Query:  MYDVASRRKARLSDFIDSDGEWLWPR--------------VSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAI
        +YD ASRR+A+LSDFID +GEWLWPR              VSPCLSVSDSWVWVPGR+GGFSIASAWEA+ PRGGRVLWDGLLW GGNIPKH FCAWLAI
Subjt:  MYDVASRRKARLSDFIDSDGEWLWPR--------------VSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAI

Query:  KDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI------KVLGRRNHR
        KDRL T DRLHRWDSS+P+SCILCQGGVESRDHLFFSC FGG               +G +G   SW+   G+G G    L  V+       +   RNHR
Subjt:  KDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI------KVLGRRNHR

Query:  LHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
        LHGG+ARDPI+LFHLIC+WIRARAGSWREDAHLPF
Subjt:  LHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

A0A5A7UP65 Reverse transcriptase; e value: 0.0e+00; identity: 71.05%
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES
        MEVI PNSFGSLLEVGD DKWALSIIEGSP     +++KSKAVVDFLGSSSVGFCC LETRVREGNFDSVSRRFGNSWDYSCSYSNS VGRIW  + +++
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES

Query:  FFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTI-----
         F F +      +    VVM DFNAIR HSEA GGSPIQGEMEDFD+AI DADLVEPSVQGNWFTWTSKVQGSGM+RRLDRVL+N+DW SAWPT+     
Subjt:  FFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTI-----

Query:  ---------------------------------------RFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQK
                                               RFGRHI+SLSE+VR AK AMD+AQREVERNPMSDVLS QA LATETFWT VRLE+   R  
Subjt:  ---------------------------------------RFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQK

Query:  SRIRWLKLGDQNTAFFH---RSMAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSV
          + ++     +    H     MAVNYFSNSLGSQEIGYREL+P+IDDIVQFQWSE+CCQALQ+PISREEVRRVLFSMDSGKAPGPDGFS GFFKGAWSV
Subjt:  SRIRWLKLGDQNTAFFH---RSMAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSV

Query:  VGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHL
        +GEDF              +GVNATAITLIPKHNGAERLEDFRPISCCN LYKCISKILADRLR WLPSFISSNQSAFI GRSIIENILLCQELVGGYHL
Subjt:  VGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHL

Query:  NSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHF
        NSGKP CTLKVDLQKAYDSVNWDFLFGL I+I TPLK                             KGVRQ DPLS FLFVMVMEVLSRMLNKIPQSF F
Subjt:  NSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHF

Query:  HHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRS
        HHRCEK+FGELS LFANPRKSSIF+AGVNNENAS LAACMGFVRGNLPVRYLGLPLL GRLRSND  PLIQRITSRIRS +ARVLSFAGRLQLV SVL S
Subjt:  HHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRS

Query:  LQVYWASVFVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNS
        LQVYWA VFVLPAYVHN EGGLGIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDVDSRVG+SWCLRAILRK+EKLK  VRMKVGNGN 
Subjt:  LQVYWASVFVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNS

Query:  CRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKH
        CRVWLDPWL            V+YD ASRR+A LS+FI  DGEWLWP                  RGGFSIASAWEA+RPRGGRVLWDGLLW GGNIPKH
Subjt:  CRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKH

Query:  FFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI------K
         FCAWLAIKDRLGT DR HRWDSSVP+SCILC+GG+ESRDHLFFSC FGG               +G +G   SW+   G+  G    L  V+       
Subjt:  FFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI------K

Query:  VLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
        +   RNHRLHGGQA DPIV+FHLIC+WIRARAGSWREDA+LPF
Subjt:  VLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

A0A5A7V3Z0 Reverse transcriptase domain-containing protein; e value: 0.0e+00; identity: 70.3%
Query:  MLRRLDRVLVNEDWFSAWPTI---------------------------------------------------------------------------RFGR
        MLRRLDRVLVNEDWFSAWPT+                                                                           RFGR
Subjt:  MLRRLDRVLVNEDWFSAWPTI---------------------------------------------------------------------------RFGR

Query:  HIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKSRIRWLKLGDQNTAFFHRSMAVNYFSNSL-------GSQEIGY
        HIKSLSE+VRNAKAA+DLAQREVERNPMSDVLSHQAGL+TETFWT VRLEEASLRQKSRIRWLKLGDQNTAFFHRS+      N+L       G++EIGY
Subjt:  HIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRLEEASLRQKSRIRWLKLGDQNTAFFHRSMAVNYFSNSL-------GSQEIGY

Query:  RELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDFLGVNATAITLIPKHNGAERLEDFRPISCCNALYK
        RELSPVIDDIVQFQWSE+CCQALQLPISREEVRRVLFSMDSGKAPGPDGFSA              +GVNATAITLIPKHNGAERLEDFRPISCCNALYK
Subjt:  RELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDFLGVNATAITLIPKHNGAERLEDFRPISCCNALYK

Query:  CISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK----------------
        CISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKP CTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK                
Subjt:  CISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK----------------

Query:  -------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-------------------------------KFGELSSLFANPRKSS
                     KGVRQ DPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEK                               KFGELS LFANPRKSS
Subjt:  -------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-------------------------------KFGELSSLFANPRKSS

Query:  IFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-----
        IFVAGVNNENASHLA CMGFVRGNL VRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLV SVLRS QVYWASVFVLPAYVHN     
Subjt:  IFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN-----

Query:  ----------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHL
                                    EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHL
Subjt:  ----------------------------EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHL

Query:  VRMKVGNGNSCRVWLDPWLPE----------VMYDVASRRKARLSDFIDSDGEWLWPR--------------VSPCLSVSDSWVWVPGRRGGFSIASAWE
        VRMKVGNGNS RVWLDPWLPE          VMYD ASRRKARLSDFID DGEWLWPR              VSPCLSVSDSWVWVPGRRGGFSIASAWE
Subjt:  VRMKVGNGNSCRVWLDPWLPE----------VMYDVASRRKARLSDFIDSDGEWLWPR--------------VSPCLSVSDSWVWVPGRRGGFSIASAWE

Query:  AVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGGMFGLGFFGSWVPLIGLGIGGLSCLGF
        AVRPRGGRVLWDGLLW GGNI KHFFCAWLAIKDRLGTIDRLHRWDSSVPM CI                       L FFGSWVPLIGLGIGGLSCLGF
Subjt:  AVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGGMFGLGFFGSWVPLIGLGIGGLSCLGF

Query:  VIKVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
        VIK             ARDPIVLFHLICSWIRARAGSWR+DAHLPF
Subjt:  VIKVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

A0A5D3D7P6 Reverse transcriptase; e value: 0.0e+00; identity: 68.61%
Query:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES
        MEVI PNSFGSLLEVGD DKWALSIIEGSP     +++KSKAVVDFLGSSSVGFCC LETRVREGNFDSVSRRFGNSWDYSCSYSNS VGRIW  + +++
Subjt:  MEVIMPNSFGSLLEVGDADKWALSIIEGSPPP---LQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEES

Query:  FFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTI-----
         F F +      +    VVM DFNAIR HSEA GGSPIQGEMEDFD+AI DADLVEPSVQ NWFTWTSKVQGSGM+RRLDRVL+N+DW SAWPT+     
Subjt:  FFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTI-----

Query:  -----------------------------------------------RFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRL
                                                       RFGRHI+SLSE+VR AK AMD+AQRE++                   W  + L
Subjt:  -----------------------------------------------RFGRHIKSLSEKVRNAKAAMDLAQREVERNPMSDVLSHQAGLATETFWTVVRL

Query:  EEASLRQKSRIRWLKLGDQNTAFFHRSMAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFK
         E S    S ++         A     MAVNYFSNSLGSQEIGYREL+P+IDDIVQFQWSE+CCQALQ+PISREEVRRVLFSMDSGKAPGPDGFS GFFK
Subjt:  EEASLRQKSRIRWLKLGDQNTAFFHRSMAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFK

Query:  GAWSVVGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELV
        GAWSV+GEDF              +GVNATAITLIPKHNGAERLEDFRPISCCN LYKCISKILADRLR WLPSFISSNQSAFI GRSIIENILLCQELV
Subjt:  GAWSVVGEDF--------------LGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELV

Query:  GGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIP
        GGYHLNSGKP CTLKVDLQKAYDSVNWDFLFGL I+I TPLK                             KGVRQ DPLS FLFVMVMEVLSRMLNKIP
Subjt:  GGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLK-----------------------------KGVRQDDPLSTFLFVMVMEVLSRMLNKIP

Query:  QSFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVR
        QSF FHHRCEK+FGELS LFANPRKSSIF+AGVNNENAS LAACMGFVRGNLPVRYLGLPLL GRLRSND  PLIQRITSRIRS +ARVLSFAGRLQLV 
Subjt:  QSFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVR

Query:  SVLRSLQVYWASVFVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKV
        SVL SLQVYWA VFVLPAYVHN EGGLGIRDG +W  ASTLKILWLMLTNSGSLWVAWVEAY+LKGRSLWDVDSRVG+SWCLRAILRK+EKLK  VRMKV
Subjt:  SVLRSLQVYWASVFVLPAYVHN-EGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKV

Query:  GNGNSCRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGG
        GNGN CRVWLDPWL            V+YD ASRR+A LS+FI  DGEWLWP                  RGGFSIASAWEA+RPRGGRVLWDGLLW GG
Subjt:  GNGNSCRVWLDPWL----------PEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGGFSIASAWEAVRPRGGRVLWDGLLWSGG

Query:  NIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI--
        NIPKH FCAWLAIKDRLGT DR HRWDSSVP+SCILC+GG+ESRDHLFFSC FGG               +G +G   SW+   G+  G    L  V+  
Subjt:  NIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG------------MFGLGFFG---SWVPLIGLGIGGLSCLGFVI--

Query:  ----KVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
             +   RNHRLHGGQA DPIV+FHLIC+WIRARAGSWREDA+LPF
Subjt:  ----KVLGRRNHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF

SwissProt top hits    e value    %identity    Alignment
O00370 LINE-1 retrotransposable element ORF2 protein    5.8e-23    23.41
Query:  YFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFF------------KGAWSVVGEDFL--GVNATA
        Y+ +   ++     E+   +D     + +++  ++L  PI+  E+  ++ S+ + K+PGPDGF+A F+            K   S+  E  L       +
Subjt:  YFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFF------------KGAWSVVGEDFL--GVNATA

Query:  ITLIPK-HNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFL
        I LIPK      + E+FRPIS  N   K ++KILA+R++  +   I  +Q  FIPG     NI     ++   +    K    + +D +KA+D +   F+
Subjt:  ITLIPK-HNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFL

Query:  FGLLIAIGT-----------------------------PLKKGVRQDDPLSTFLFVMVMEVLSRMLNKIPQ-----------------------------
           L  +G                              PLK G RQ  PLS  LF +V+EVL+R + +  +                             
Subjt:  FGLLIAIGT-----------------------------PLKKGVRQDDPLSTFLFVMVMEVLSRMLNKIPQ-----------------------------

Query:  SFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLL--AGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLV
        S     +    F ++S    N +KS  F+   N +  S +   + F   +  ++YLG+ L      L   +Y PL++ I      W     S+ GR+ +V
Subjt:  SFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLPVRYLGLPLL--AGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLV

Query:  RSVLRSLQVY
        +  +    +Y
Subjt:  RSVLRSLQVY

P08548 LINE-1 reverse transcriptase homolog    1.8e-19    24.08
Query:  SEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFK-----------GAWSVVGEDFLGVNA---TAITLIPK-HNGAERLEDFRPISCCNALYK
        S+K  + L  PIS  E+   + ++   K+PGPDGF++ F++             +  + ++ +  N      ITLIPK      R E++RPIS  N   K
Subjt:  SEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFK-----------GAWSVVGEDFLGVNA---TAITLIPK-HNGAERLEDFRPISCCNALYK

Query:  CISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGT-------------------
         ++KIL +R++  +   I  +Q  FIPG     NI     ++   +    K    L +D +KA+D++   F+   L  IG                    
Subjt:  CISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIGT-------------------

Query:  ----------PLKKGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCE-----------------------------KKFGELSSLFANPRKSSIF
                  PL+ G RQ  PLS  LF +VMEVL+  + +       H   E                             K++  +S    N  KS  F
Subjt:  ----------PLKKGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCE-----------------------------KKFGELSSLFANPRKSSIF

Query:  VAGVNNENASHLAACMGFVRGNLPVRYLGLPLL--AGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVY
        +   NN+    +   + F      ++YLG+ L      L   +Y  L + I   +  W     S+ GR+ +V+  +    +Y
Subjt:  VAGVNNENASHLAACMGFVRGNLPVRYLGLPLL--AGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVY

P0C2F6 Putative ribonuclease H protein At1g65750    2.1e-20    23.41
Query:  LPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGG
        +P+L  R+  + +  +++R++SR+  W  + LSFAGRL L ++VL S+ V+  S  +LP  + N                                 EGG
Subjt:  LPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHN---------------------------------EGG

Query:  LGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGR---SLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWLP-EVMYDVA
        LG+R   S N A   K+ W +L    SLW   ++     G    S W +      S      +  R+ + H V    G+G   R W D W+  + + ++ 
Subjt:  LGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGR---SLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWLP-EVMYDVA

Query:  SRRKARLSDFIDSDGEWL------WPRVSP-----------------CLSVSDSWVWVPGRRGGFSIASAWEAVR----PRGGRVLWDGLLWSGGNIPKH
        +  +    D + +   W+      + ++ P                      D   W   + G FS+ SA+E +     PR     +   LW      + 
Subjt:  SRRKARLSDFIDSDGEWL------WPRVSP-----------------CLSVSDSWVWVPGRRGGFSIASAWEAVR----PRGGRVLWDGLLWSGGNIPKH

Query:  FFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSC
            WL     + T +  HR   S    C +C+GGVES  H+   C
Subjt:  FFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSC

P11369 LINE-1 retrotransposable element ORF2 protein    3.2e-21    24.8
Query:  LQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDFLGV------------------NATAITLIPK-HNGAERLEDFRPISCCNALYKCIS
        L  PIS +E+  V+ S+ + K+PGPDGFSA F++       ED + +                      ITLIPK      ++E+FRPIS  N   K ++
Subjt:  LQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDFLGV------------------NATAITLIPK-HNGAERLEDFRPISCCNALYKCIS

Query:  KILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIG-----------------------
        KILA+R++  + + I  +Q  FIPG     NI     ++   +    K    + +D +KA+D +   F+  +L   G                       
Subjt:  KILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIAIG-----------------------

Query:  ------TPLKKGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-----------------------------KFGELSSLFANPRKSSIFVAG
               PLK G RQ  PLS +LF +V+EVL+R + +  +        E+                              FGE+     N  KS  F+  
Subjt:  ------TPLKKGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEK-----------------------------KFGELSSLFANPRKSSIFVAG

Query:  VNNENASHLAACMGFVRGNLPVRYLGLPLL--AGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVY
         N +    +     F      ++YLG+ L      L   ++  L + I   +R W     S+ GR+ +V+  +    +Y
Subjt:  VNNENASHLAACMGFVRGNLPVRYLGLPLL--AGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVY

P14381 Transposon TX1 uncharacterized 149 kDa protein    7.1e-21    24.08
Query:  PVIDDIVQFQW------SEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAE
        P+  D  +  W      SE+  + L+ PI+ +E+ + L  M   K+PG DG +  FF+  W  +G DF              L      ++L+PK     
Subjt:  PVIDDIVQFQW------SEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDF--------------LGVNATAITLIPKHNGAE

Query:  RLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIA------
         ++++RP+S  +  YK ++K ++ RL++ L   I  +QS  +PGR+I +N+ L ++L+  +   +G     L +D +KA+D V+  +L G L A      
Subjt:  RLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGKPCCTLKVDLQKAYDSVNWDFLFGLLIA------

Query:  ---------------------IGTPLK--KGVRQDDPLSTFLFVMVMEVLSRMLNK-------------------------IPQSFHFHHR---CEKKFG
                             +  PL   +GVRQ  PLS  L+ + +E    +L K                         + Q      R   C++ + 
Subjt:  ---------------------IGTPLK--KGVRQDDPLSTFLFVMVMEVLSRMLNK-------------------------IPQSFHFHHR---CEKKFG

Query:  ELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLP-VRYLGLPLLAGRLR-SNDYAPLIQRITSRIRSWT--ARVLSFAGRLQLVRSVLRSLQVYW
          SS   N  KSS  + G  +     L      +      ++YLG+ L A     S ++  L + + +R+  W   A+VLS  GR  ++  ++ S Q+++
Subjt:  ELSSLFANPRKSSIFVAGVNNENASHLAACMGFVRGNLP-VRYLGLPLLAGRLR-SNDYAPLIQRITSRIRSWT--ARVLSFAGRLQLVRSVLRSLQVYW

Query:  ASVFVLP
          + + P
Subjt:  ASVFVLP

Arabidopsis top hits    e value    %identity    Alignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    8.1e-20    31.25
Query:  RSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWLPE-VMYDVASRRKARLSDF-IDSDGEWLWPRVSPCLSVSDSWVW---VPGRR
        R+ W ++S    SW  R + + RE  +  V   VG+G + + W D W     + D+      +     ID+ G      +  C    DS++W   +    
Subjt:  RSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWLPE-VMYDVASRRKARLSDF-IDSDGEWLWPRVSPCLSVSDSWVW---VPGRR

Query:  GGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGGMFGLGFFG
          FS A    A+ P+   V W   +W   ++PKH F  W+   +RL T DRL  W  S+P  C+LC    ESR HLFF C F G     F G
Subjt:  GGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGGMFGLGFFG

AT1G43760.1 DNAse I-like superfamily protein    3.4e-34    26.92
Query:  SWDYSCSYSNSDVGRIWGDVEEESFFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFG----GSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQG
        SW    +Y  S++GRIW  V + S     S  +     +  +++GDF+ I   S+ +       P++G +E+F   + D+DLV+   +G  +TW++    
Subjt:  SWDYSCSYSNSDVGRIWGDVEEESFFFFYSFEITSAWSRPGVVMGDFNAIRVHSEAFG----GSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQG

Query:  SGMLRRLDRVLVNEDWFSAWPT------------------------------------------------------IRFGRHIKSLSEKVRNAKAAMDLA
        + ++R+LDR + N DWFS++P+                                                      I  G H+ SL E ++ AK    L 
Subjt:  SGMLRRLDRVLVNEDWFSAWPT------------------------------------------------------IRFGRHIKSLSEKVRNAKAAMDLA

Query:  QRE-------VERNPMSDVLSHQAGLATETFWTVVRLE--------------EASLRQKSRIRWLKLGDQNTAFFH------------------------
         R+         +  +  + S Q+ L T    ++ R+E              E+  RQKSRI+WL+ GD NT FFH                        
Subjt:  QRE-------VERNPMSDVLSHQAGLATETFWTVVRLE--------------EASLRQKSRIRWLKLGDQNTAFFH------------------------

Query:  -----RSMAVNYFSNSLGS-QEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDFLGV------
             + M V Y+++ LGS  +I   +    I DI  F+ ++     L    S +E+   +F+M   KAPGPD F+A FF  +W VV +  +        
Subjt:  -----RSMAVNYFSNSLGS-QEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDSGKAPGPDGFSAGFFKGAWSVVGEDFLGV------

Query:  --------NATAITLIPKHNGAERLEDFRPISCCNALYKCIS
                NATAITLIPK  G ++L  FRP+SCC  +YK I+
Subjt:  --------NATAITLIPKHNGAERLEDFRPISCCNALYKCIS

AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    7.8e-23    29.85
Query:  ILRKREKLKHLVRMKVGNGNSCRVWLDPW-----LPEVMYDVASRR-----KARLSDFIDSDGEWLWP--RVSPC---------------LSVSDSWVWV
        +L  R   +  V+  +GNG     W D W     L +VM D  SR       AR+ + +  +G W  P  R +P                 ++ DS+ WV
Subjt:  ILRKREKLKHLVRMKVGNGNSCRVWLDPW-----LPEVMYDVASRR-----KARLSDFIDSDGEWLWP--RVSPC---------------LSVSDSWVWV

Query:  PGR--RGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG-MFGLGF---
         G     GFS A  W+A+RPR   + W   +W  G +PKH F  W++  DRL T  RL  W       C LC    ESRDHL FSC F   ++ L F   
Subjt:  PGR--RGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGG-MFGLGF---

Query:  ------FGSWVPLIGLG----------IGGLSCLGFVIKVLGRRNHRLHGGQARDPIVLFHLICSWIR
              F SW  L+             +  +S    +  +  +RN+ LH      PI++F ++   IR
Subjt:  ------FGSWVPLIGLG----------IGGLSCLGFVIKVLGRRNHRLHGGQARDPIVLFHLICSWIR

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    1.0e-38    31.36
Query:  VAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNE----GG
        +AGV + + + +     F  G LPVRYLGLPLL  ++ ++DY PL+++I  RI  WTAR LSFAGRLQL+ SV+ SL  +W S F LP+    E      
Subjt:  VAGVNNENASHLAACMGFVRGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNE----GG

Query:  LGIRDGPSWNIASTLKILWLML---TNSGSLWVAWVEAYILKGRSLWDVDSRVG-KSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPW---------
          +  GP  N     K+ W  +    + G L +  ++    KG S W +       SW  + IL+ R      V+  + NG++   W D W         
Subjt:  LGIRDGPSWNIASTLKILWLML---TNSGSLWVAWVEAYILKGRSLWDVDSRVG-KSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPW---------

Query:  ----------------LPEVMYDVASRRK-----ARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGR---RGGFSIASAWEAVRPRGGRVLWDGLLWSG
                        + E + +   RR       R+ D I         R     S  D+  W       +  F+    W A R    +V W   +W  
Subjt:  ----------------LPEVMYDVASRRK-----ARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGR---RGGFSIASAWEAVRPRGGRVLWDGLLWSG

Query:  GNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSF
           PK+   AW+AIK+RL T DR+  W++    SC+LC   VE+RDHLFF+C +
Subjt:  GNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSF

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    7.8e-23    32.34
Query:  SWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPW-----LPEVMYDVASR-----RKARLSDFIDSDGEW----------LWPRVSPCLSVS-------
        SW  ++I + R   +  V  KVG+G +C  W + W     L  +  D+  R     R A ++D +  DG W          +   +  CL +S       
Subjt:  SWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPW-----LPEVMYDVASR-----RKARLSDFIDSDGEW----------LWPRVSPCLSVS-------

Query:  ----DSWVWVPG---RRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFG
            D ++W  G      GFS A+ W  + P G +V W   +W  G IPKH F +W+ I+ RL T D+L  W   VP  C+LC    E+R HLFF C F 
Subjt:  ----DSWVWVPG---RRGGFSIASAWEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFG

Query:  G
        G
Subjt:  G


Sequences
CDS sequence
ATGGAGGTGATTATGCCAAACTCTTTTGGTAGTCTTTTGGAGGTGGGTGATGCTGACAAGTGGGCATTATCTATAATAGAGGGTTCACCGCCACCCTTACAGATAAAGAG
TAAGGCAGTCGTTGATTTTCTGGGGTCTTCCTCAGTAGGGTTTTGTTGCCTCTTGGAGACTAGAGTTCGAGAAGGTAATTTTGATTCTGTTTCTAGAAGATTTGGTAATT
CTTGGGATTACTCATGTAGTTACAGTAATAGTGATGTTGGTCGGATTTGGGGTGATGTGGAAGAAGAATCGTTTTTCTTTTTCTACTCATTTGAGATTACTTCTGCGTGG
TCGAGACCAGGGGTTGTCATGGGTGACTTTAATGCTATTAGAGTTCATTCTGAAGCATTTGGGGGATCTCCTATTCAGGGCGAGATGGAGGACTTTGATTTGGCTATTAG
CGATGCTGACTTAGTAGAGCCTTCGGTGCAGGGAAATTGGTTTACTTGGACTAGTAAAGTTCAGGGGTCTGGGATGTTGCGTCGTCTGGATCGTGTTTTGGTGAATGAAG
ATTGGTTTTCTGCATGGCCTACCATACGGTTTGGTAGACACATAAAGAGTCTTAGTGAGAAGGTACGCAATGCTAAGGCAGCCATGGATTTAGCCCAGAGAGAGGTAGAA
CGTAATCCTATGTCGGATGTTTTGAGTCACCAAGCAGGTCTTGCTACGGAGACTTTCTGGACAGTAGTTAGATTGGAGGAAGCCTCGCTTCGGCAGAAATCCAGAATTCG
ATGGTTAAAGTTGGGTGATCAGAATACGGCTTTTTTCCATCGATCAATGGCGGTTAATTATTTTAGTAACAGTTTGGGATCCCAGGAGATTGGCTATAGAGAATTGTCCC
CAGTCATTGATGATATTGTTCAGTTTCAGTGGTCTGAGAAGTGTTGTCAGGCATTACAGTTACCTATTAGTAGGGAGGAAGTTAGGAGAGTCTTATTCTCTATGGATAGT
GGAAAGGCCCCCGGTCCTGATGGATTCTCTGCAGGTTTCTTTAAAGGTGCCTGGTCTGTGGTTGGGGAAGATTTTTTAGGAGTTAATGCTACTGCTATCACCCTCATTCC
TAAACATAATGGGGCGGAGCGTCTGGAGGACTTTCGTCCTATTTCTTGTTGTAATGCGTTATATAAATGCATTTCTAAAATTCTGGCTGATAGACTTCGTGCGTGGCTTC
CTTCTTTTATCAGTAGTAATCAGTCTGCTTTTATTCCTGGGAGGAGTATTATCGAGAACATTCTGCTTTGTCAGGAACTGGTGGGTGGATATCATCTTAACTCCGGTAAG
CCTTGTTGTACTTTGAAAGTTGATCTTCAAAAAGCATATGACTCTGTTAATTGGGATTTTCTGTTTGGTTTGTTGATTGCTATTGGTACTCCGTTGAAGAAGGGTGTAAG
ACAAGATGATCCTTTATCTACCTTTCTCTTTGTTATGGTGATGGAAGTTCTTTCTCGTATGTTGAATAAGATCCCTCAGAGTTTTCATTTTCACCATCGTTGTGAGAAGA
AGTTTGGTGAGCTTTCAAGTTTGTTCGCAAATCCTAGGAAAAGCTCTATTTTTGTTGCAGGAGTTAATAATGAGAATGCTTCTCATCTGGCTGCTTGTATGGGTTTTGTC
CGTGGTAATCTCCCTGTTCGTTATCTTGGCCTTCCTCTTTTGGCGGGTCGATTACGTTCTAATGATTATGCTCCTCTGATTCAGCGTATCACTAGCAGGATTCGTTCTTG
GACCGCTCGAGTTCTTTCGTTTGCAGGTAGACTGCAGCTTGTTCGTTCTGTGCTTCGTAGTCTTCAAGTGTACTGGGCTAGTGTGTTTGTTCTTCCTGCGTATGTGCACA
ATGAGGGCGGTCTTGGTATTCGAGATGGCCCTTCTTGGAATATTGCGAGTACTTTGAAGATCTTGTGGCTTATGTTAACAAATTCGGGGTCTCTTTGGGTGGCTTGGGTG
GAAGCTTATATACTGAAAGGGAGGTCATTGTGGGATGTGGATAGTAGAGTGGGTAAATCTTGGTGTCTTCGGGCGATCTTACGTAAGCGAGAGAAGCTGAAGCATCTTGT
AAGGATGAAGGTGGGAAATGGTAATAGTTGTAGAGTTTGGCTTGATCCGTGGTTGCCGGAGGTGATGTATGATGTAGCTAGTCGGAGGAAGGCTAGACTCTCTGACTTTA
TTGACTCAGATGGAGAATGGCTTTGGCCTCGAGTCAGTCCGTGTCTTAGTGTTAGTGATAGTTGGGTATGGGTTCCTGGTCGTCGGGGTGGTTTCTCTATTGCAAGTGCA
TGGGAAGCTGTTCGTCCTAGGGGTGGTCGGGTTCTATGGGATGGTTTATTGTGGAGTGGGGGAAATATCCCAAAACATTTCTTCTGTGCGTGGTTGGCTATTAAAGATAG
GTTGGGTACTATAGATAGATTGCATAGGTGGGATAGTTCAGTTCCGATGTCATGCATTCTATGTCAGGGGGGTGTGGAGTCTCGCGATCACTTATTCTTTTCGTGTTCGT
TTGGGGGGATGTTTGGTCTAGGGTTCTTCGGATCATGGGTTCCTCTCATAGGATTGGGCATTGGAGGGTTGAGTTGTCTTGGATTTGTCATTAAGGTATTGGGAAGGAGA
AATCATCGGTTACATGGTGGTCAGGCTCGTGATCCTATTGTCCTTTTTCATCTTATTTGTTCGTGGATTCGTGCTCGTGCTGGATCGTGGAGAGAGGATGCTCATCTACC
TTTTTAA
mRNA sequence
ATGGAGGTGATTATGCCAAACTCTTTTGGTAGTCTTTTGGAGGTGGGTGATGCTGACAAGTGGGCATTATCTATAATAGAGGGTTCACCGCCACCCTTACAGATAAAGAG
TAAGGCAGTCGTTGATTTTCTGGGGTCTTCCTCAGTAGGGTTTTGTTGCCTCTTGGAGACTAGAGTTCGAGAAGGTAATTTTGATTCTGTTTCTAGAAGATTTGGTAATT
CTTGGGATTACTCATGTAGTTACAGTAATAGTGATGTTGGTCGGATTTGGGGTGATGTGGAAGAAGAATCGTTTTTCTTTTTCTACTCATTTGAGATTACTTCTGCGTGG
TCGAGACCAGGGGTTGTCATGGGTGACTTTAATGCTATTAGAGTTCATTCTGAAGCATTTGGGGGATCTCCTATTCAGGGCGAGATGGAGGACTTTGATTTGGCTATTAG
CGATGCTGACTTAGTAGAGCCTTCGGTGCAGGGAAATTGGTTTACTTGGACTAGTAAAGTTCAGGGGTCTGGGATGTTGCGTCGTCTGGATCGTGTTTTGGTGAATGAAG
ATTGGTTTTCTGCATGGCCTACCATACGGTTTGGTAGACACATAAAGAGTCTTAGTGAGAAGGTACGCAATGCTAAGGCAGCCATGGATTTAGCCCAGAGAGAGGTAGAA
CGTAATCCTATGTCGGATGTTTTGAGTCACCAAGCAGGTCTTGCTACGGAGACTTTCTGGACAGTAGTTAGATTGGAGGAAGCCTCGCTTCGGCAGAAATCCAGAATTCG
ATGGTTAAAGTTGGGTGATCAGAATACGGCTTTTTTCCATCGATCAATGGCGGTTAATTATTTTAGTAACAGTTTGGGATCCCAGGAGATTGGCTATAGAGAATTGTCCC
CAGTCATTGATGATATTGTTCAGTTTCAGTGGTCTGAGAAGTGTTGTCAGGCATTACAGTTACCTATTAGTAGGGAGGAAGTTAGGAGAGTCTTATTCTCTATGGATAGT
GGAAAGGCCCCCGGTCCTGATGGATTCTCTGCAGGTTTCTTTAAAGGTGCCTGGTCTGTGGTTGGGGAAGATTTTTTAGGAGTTAATGCTACTGCTATCACCCTCATTCC
TAAACATAATGGGGCGGAGCGTCTGGAGGACTTTCGTCCTATTTCTTGTTGTAATGCGTTATATAAATGCATTTCTAAAATTCTGGCTGATAGACTTCGTGCGTGGCTTC
CTTCTTTTATCAGTAGTAATCAGTCTGCTTTTATTCCTGGGAGGAGTATTATCGAGAACATTCTGCTTTGTCAGGAACTGGTGGGTGGATATCATCTTAACTCCGGTAAG
CCTTGTTGTACTTTGAAAGTTGATCTTCAAAAAGCATATGACTCTGTTAATTGGGATTTTCTGTTTGGTTTGTTGATTGCTATTGGTACTCCGTTGAAGAAGGGTGTAAG
ACAAGATGATCCTTTATCTACCTTTCTCTTTGTTATGGTGATGGAAGTTCTTTCTCGTATGTTGAATAAGATCCCTCAGAGTTTTCATTTTCACCATCGTTGTGAGAAGA
AGTTTGGTGAGCTTTCAAGTTTGTTCGCAAATCCTAGGAAAAGCTCTATTTTTGTTGCAGGAGTTAATAATGAGAATGCTTCTCATCTGGCTGCTTGTATGGGTTTTGTC
CGTGGTAATCTCCCTGTTCGTTATCTTGGCCTTCCTCTTTTGGCGGGTCGATTACGTTCTAATGATTATGCTCCTCTGATTCAGCGTATCACTAGCAGGATTCGTTCTTG
GACCGCTCGAGTTCTTTCGTTTGCAGGTAGACTGCAGCTTGTTCGTTCTGTGCTTCGTAGTCTTCAAGTGTACTGGGCTAGTGTGTTTGTTCTTCCTGCGTATGTGCACA
ATGAGGGCGGTCTTGGTATTCGAGATGGCCCTTCTTGGAATATTGCGAGTACTTTGAAGATCTTGTGGCTTATGTTAACAAATTCGGGGTCTCTTTGGGTGGCTTGGGTG
GAAGCTTATATACTGAAAGGGAGGTCATTGTGGGATGTGGATAGTAGAGTGGGTAAATCTTGGTGTCTTCGGGCGATCTTACGTAAGCGAGAGAAGCTGAAGCATCTTGT
AAGGATGAAGGTGGGAAATGGTAATAGTTGTAGAGTTTGGCTTGATCCGTGGTTGCCGGAGGTGATGTATGATGTAGCTAGTCGGAGGAAGGCTAGACTCTCTGACTTTA
TTGACTCAGATGGAGAATGGCTTTGGCCTCGAGTCAGTCCGTGTCTTAGTGTTAGTGATAGTTGGGTATGGGTTCCTGGTCGTCGGGGTGGTTTCTCTATTGCAAGTGCA
TGGGAAGCTGTTCGTCCTAGGGGTGGTCGGGTTCTATGGGATGGTTTATTGTGGAGTGGGGGAAATATCCCAAAACATTTCTTCTGTGCGTGGTTGGCTATTAAAGATAG
GTTGGGTACTATAGATAGATTGCATAGGTGGGATAGTTCAGTTCCGATGTCATGCATTCTATGTCAGGGGGGTGTGGAGTCTCGCGATCACTTATTCTTTTCGTGTTCGT
TTGGGGGGATGTTTGGTCTAGGGTTCTTCGGATCATGGGTTCCTCTCATAGGATTGGGCATTGGAGGGTTGAGTTGTCTTGGATTTGTCATTAAGGTATTGGGAAGGAGA
AATCATCGGTTACATGGTGGTCAGGCTCGTGATCCTATTGTCCTTTTTCATCTTATTTGTTCGTGGATTCGTGCTCGTGCTGGATCGTGGAGAGAGGATGCTCATCTACC
TTTTTAA
Protein sequence
MEVIMPNSFGSLLEVGDADKWALSIIEGSPPPLQIKSKAVVDFLGSSSVGFCCLLETRVREGNFDSVSRRFGNSWDYSCSYSNSDVGRIWGDVEEESFFFFYSFEITSAW
SRPGVVMGDFNAIRVHSEAFGGSPIQGEMEDFDLAISDADLVEPSVQGNWFTWTSKVQGSGMLRRLDRVLVNEDWFSAWPTIRFGRHIKSLSEKVRNAKAAMDLAQREVE
RNPMSDVLSHQAGLATETFWTVVRLEEASLRQKSRIRWLKLGDQNTAFFHRSMAVNYFSNSLGSQEIGYRELSPVIDDIVQFQWSEKCCQALQLPISREEVRRVLFSMDS
GKAPGPDGFSAGFFKGAWSVVGEDFLGVNATAITLIPKHNGAERLEDFRPISCCNALYKCISKILADRLRAWLPSFISSNQSAFIPGRSIIENILLCQELVGGYHLNSGK
PCCTLKVDLQKAYDSVNWDFLFGLLIAIGTPLKKGVRQDDPLSTFLFVMVMEVLSRMLNKIPQSFHFHHRCEKKFGELSSLFANPRKSSIFVAGVNNENASHLAACMGFV
RGNLPVRYLGLPLLAGRLRSNDYAPLIQRITSRIRSWTARVLSFAGRLQLVRSVLRSLQVYWASVFVLPAYVHNEGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWV
EAYILKGRSLWDVDSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSCRVWLDPWLPEVMYDVASRRKARLSDFIDSDGEWLWPRVSPCLSVSDSWVWVPGRRGGFSIASA
WEAVRPRGGRVLWDGLLWSGGNIPKHFFCAWLAIKDRLGTIDRLHRWDSSVPMSCILCQGGVESRDHLFFSCSFGGMFGLGFFGSWVPLIGLGIGGLSCLGFVIKVLGRR
NHRLHGGQARDPIVLFHLICSWIRARAGSWREDAHLPF
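
The protein sequence above corresponds to a codon-by-codon translation of the CDS sequence. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the table-building code are illustrative and not part of the database record.

```python
# Standard genetic code (assumed here), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A/C/G/T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    # Walk over complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
print(translate("ATGGAGGTGATTATGCCAAACTCTTTTGGT"))
```

Passing the full CDS text (line breaks included) to `translate` should give a string that can be compared directly with the listed protein sequence.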