CuGenDBv2

CSPI05G11940 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G11940
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr5:10612081..10615418
RNA-Seq Expression: CSPI05G11940
Synteny: CSPI05G11940
Gene Ontology terms:
GO:0009987 - cellular process (biological process)
GO:0005488 - binding (molecular function)
InterPro domains:
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa] | e value: 1.4e-237 | identity: 60.86%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMGG RYF+SIID FSRKVW+YPLKQKDE F KFLEWKKQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC
                                    SPST L+LKTPQE                                          GVKGYK+WC+EKG+NKC
Subjt:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC

Query:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR
        IISRDV                        TEVR+  EVRPSI LD   +Q PLVS+ E T QSEFDGIQSQQERILIDE    EESSSNNDLQNYQLTR
Subjt:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR

Query:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKSAKGYTQKEGVDF
        DRVQRER APIRYGYADLVAYALTCAADSIEA+PLTFEE IV DSKKQWKDAME ELFSL KNQTWSLVPKP NQKLIQSKWIYKIK   G   K     
Subjt:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKSAKGYTQKEGVDF

Query:  NEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDAC
                     RLIL I VHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKS+YGLKQSPRQWYI FDTFILKQGF+RNSYDAC
Subjt:  NEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDAC

Query:  VYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD-----------------------------------------------LASHLKL
        VYWK SQ+GT+IYLLLYVDDMILVSKDYA IC+LKKQLS+E+EMKD                                               LASH +L
Subjt:  VYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD-----------------------------------------------LASHLKL

Query:  SSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKSTLLEGFTNADNNVADL
        SSSQCPVT+QER+EMSNIPYCNAVGS MYLMICTRPDLGYAMS                          SASVSLC+ +DCDKSTLLEGFT+AD   ADL
Subjt:  SSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKSTLLEGFTNADNNVADL

Query:  AKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK
         KRRSLSGH FRLYGN+VSWKV LQPVVALSTTESEYISLGEA KEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKN SHHER K
Subjt:  AKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 1.9e-253 | identity: 62.72%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMGG RYF+SIID FSRKVW+YPLKQKDE F KFLEWKKQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC
                                    SPST L+LKTPQE                                          GVKGYK+WC+EKG+NKC
Subjt:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC

Query:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR
        IISRDV                        TEVR+  EVRPSI LD   +Q PLVS+ E T QSEFDGIQSQQERILIDEG   EESSSNNDLQNYQLTR
Subjt:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR

Query:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------
        DRVQRER APIRYGYADLVAYALTCAADSIEA+PLTFEE IV DSKKQWKDAME ELFSL KNQTWSLVPKP NQKLIQSKWIYKIK             
Subjt:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------

Query:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT
           AKGYTQKEGVDF+EIFSPVVRHSSIRLIL I VHFDMFIEQMDVTT FLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKS+YGLKQSPRQWYIRFDT
Subjt:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT

Query:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------
        FILKQGF+RNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+E+EMKD                                       
Subjt:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------

Query:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKST
                LASH +LSSSQCPVT+QER+EMSNIPYCNAVGS MYLMICTRPDLGYAMS                          SASVSLC+ +DCDKST
Subjt:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKST

Query:  LLEGFTNADNNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHER
        LLEGFT+AD   ADL KRRSLSGH FRLYGN+VSWKV LQPVVALSTTESEYISLGEA KEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKN SHHER
Subjt:  LLEGFTNADNNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHER

Query:  GK
         K
Subjt:  GK

KAA0067243.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa] | e value: 2.2e-201 | identity: 62.15%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMG  RYF+SIID FSRKVW+YPLKQKDE F KFLEW+KQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  --SPSTTLDLKTPQE---------------GVKGY---KIWCLEKGINKCIISRDVE--TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQS
          SPS+ L+LKTPQ+               G   Y   K   L K   KC+     +   EVR+ LEVRP + LD   DQ PLVS+ E T QSEFD +QS
Subjt:  --SPSTTLDLKTPQE---------------GVKGY---KIWCLEKGINKCIISRDVE--TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQS

Query:  QQERILIDEGLHSEESSSNNDLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPK
        QQERILIDEG   EE+ SNNDLQNYQLTRDRVQRER       +ADLVAYALTCAA+SIEA+ LTFEE IV +SKKQWKDAMEAELFSL KNQTWSLVPK
Subjt:  QQERILIDEGLHSEESSSNNDLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPK

Query:  PHNQKLIQSKWIYKIKS---------------AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVK
        P NQKLIQSKWIYKIK                AKGYTQKEGVDF+EIFSPVVRHSSIRLIL IVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVK
Subjt:  PHNQKLIQSKWIYKIKS---------------AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVK

Query:  GKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD----------
        GKEDMV RLHKS+YGLKQSPRQWYIRFDTFILKQGF+RNSYDACVYWKLSQ+GT+IYLLLYVDDMILVSKDY EIC+LKKQLS+E+EMKD          
Subjt:  GKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD----------

Query:  ------LASHLKLSSSQCPVTEQERLE----MSNIP-YCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSG
              LASH +LSSSQCPVT+QER +    MS I  + +  G   +  +  +  L Y   SASVSLC+ +DCDKSTLLEGFT+A N VADL KRRSLSG
Subjt:  ------LASHLKLSSSQCPVTEQERLE----MSNIP-YCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSG

Query:  HAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK
        H FRLYGN+VSWKV LQP+ ALSTTESEYISLGEA                            SAIHLAKN SHHER K
Subjt:  HAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa] | e value: 1.9e-253 | identity: 62.72%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMGG RYF+SIID FSRKVW+YPLKQKDE F KFLEWKKQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC
                                    SPST L+LKTPQE                                          GVKGYK+WC+EKG+NKC
Subjt:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC

Query:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR
        IISRDV                        TEVR+  EVRPSI LD   +Q PLVS+ E T QSEFDGIQSQQERILIDEG   EESSSNNDLQNYQLTR
Subjt:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR

Query:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------
        DRVQRER APIRYGYADLVAYALTCAADSIEA+PLTFEE IV DSKKQWKDAME ELFSL KNQTWSLVPKP NQKLIQSKWIYKIK             
Subjt:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------

Query:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT
           AKGYTQKEGVDF+EIFSPVVRHSSIRLIL I VHFDMFIEQMDVTT FLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKS+YGLKQSPRQWYIRFDT
Subjt:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT

Query:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------
        FILKQGF+RNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+E+EMKD                                       
Subjt:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------

Query:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKST
                LASH +LSSSQCPVT+QER+EMSNIPYCNAVGS MYLMICTRPDLGYAMS                          SASVSLC+ +DCDKST
Subjt:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKST

Query:  LLEGFTNADNNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHER
        LLEGFT+AD   ADL KRRSLSGH FRLYGN+VSWKV LQPVVALSTTESEYISLGEA KEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKN SHHER
Subjt:  LLEGFTNADNNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHER

Query:  GK
         K
Subjt:  GK

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 7.1e-237 | identity: 61.34%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMGG RYF+SIID FSRKVW+YPLKQKDE F KFLEWKKQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC
                                    SPST L+LKTPQE                                          GVKGYK+WC+EKG+NKC
Subjt:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC

Query:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR
        IISRDV                        TEVR+  EVRPSI LD   +Q PLVS+ E T QSEFDGIQSQQERILIDEG   EESSSNNDLQNYQLTR
Subjt:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR

Query:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------
        DRVQRER APIRYGYADLVAYALTCAADSIEA+PLTFEE IV DSKKQWKDAME ELFSL KNQTWSLVPKP NQKLIQSKWIYKIK             
Subjt:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------

Query:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT
           AKGYTQKEGVDF+EIFSPVVRHSSIRLIL I VHFDMFIEQMDVTT FLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKS+YGLKQSPRQWYIRFDT
Subjt:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT

Query:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------
        FILKQGF+RNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+E+EMKD                                       
Subjt:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------

Query:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSGHAF
                LASH +LSSSQCPVT+QER+EMSNIPYCNAVGS MYLMICTRPDLGYAMSSASVSLC+ +DCDKSTLLEGFT+AD   ADL KR  L     
Subjt:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSGHAF

Query:  RLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK
                          L    +EYISLGEA KEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKN SHHER K
Subjt:  RLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class | e value: 7.0e-238 | identity: 60.86%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMGG RYF+SIID FSRKVW+YPLKQKDE F KFLEWKKQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC
                                    SPST L+LKTPQE                                          GVKGYK+WC+EKG+NKC
Subjt:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC

Query:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR
        IISRDV                        TEVR+  EVRPSI LD   +Q PLVS+ E T QSEFDGIQSQQERILIDE    EESSSNNDLQNYQLTR
Subjt:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR

Query:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKSAKGYTQKEGVDF
        DRVQRER APIRYGYADLVAYALTCAADSIEA+PLTFEE IV DSKKQWKDAME ELFSL KNQTWSLVPKP NQKLIQSKWIYKIK   G   K     
Subjt:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKSAKGYTQKEGVDF

Query:  NEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDAC
                     RLIL I VHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKS+YGLKQSPRQWYI FDTFILKQGF+RNSYDAC
Subjt:  NEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDAC

Query:  VYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD-----------------------------------------------LASHLKL
        VYWK SQ+GT+IYLLLYVDDMILVSKDYA IC+LKKQLS+E+EMKD                                               LASH +L
Subjt:  VYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD-----------------------------------------------LASHLKL

Query:  SSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKSTLLEGFTNADNNVADL
        SSSQCPVT+QER+EMSNIPYCNAVGS MYLMICTRPDLGYAMS                          SASVSLC+ +DCDKSTLLEGFT+AD   ADL
Subjt:  SSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKSTLLEGFTNADNNVADL

Query:  AKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK
         KRRSLSGH FRLYGN+VSWKV LQPVVALSTTESEYISLGEA KEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKN SHHER K
Subjt:  AKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK

A0A5A7UB25 Putative gag-pol polyprotein | e value: 9.0e-254 | identity: 62.72%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMGG RYF+SIID FSRKVW+YPLKQKDE F KFLEWKKQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC
                                    SPST L+LKTPQE                                          GVKGYK+WC+EKG+NKC
Subjt:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC

Query:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR
        IISRDV                        TEVR+  EVRPSI LD   +Q PLVS+ E T QSEFDGIQSQQERILIDEG   EESSSNNDLQNYQLTR
Subjt:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR

Query:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------
        DRVQRER APIRYGYADLVAYALTCAADSIEA+PLTFEE IV DSKKQWKDAME ELFSL KNQTWSLVPKP NQKLIQSKWIYKIK             
Subjt:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------

Query:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT
           AKGYTQKEGVDF+EIFSPVVRHSSIRLIL I VHFDMFIEQMDVTT FLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKS+YGLKQSPRQWYIRFDT
Subjt:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT

Query:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------
        FILKQGF+RNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+E+EMKD                                       
Subjt:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------

Query:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKST
                LASH +LSSSQCPVT+QER+EMSNIPYCNAVGS MYLMICTRPDLGYAMS                          SASVSLC+ +DCDKST
Subjt:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKST

Query:  LLEGFTNADNNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHER
        LLEGFT+AD   ADL KRRSLSGH FRLYGN+VSWKV LQPVVALSTTESEYISLGEA KEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKN SHHER
Subjt:  LLEGFTNADNNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHER

Query:  GK
         K
Subjt:  GK

A0A5A7VGM4 Retrotransposon protein, putative, Ty1-copia sub-class | e value: 1.0e-201 | identity: 62.15%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMG  RYF+SIID FSRKVW+YPLKQKDE F KFLEW+KQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  --SPSTTLDLKTPQE---------------GVKGY---KIWCLEKGINKCIISRDVE--TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQS
          SPS+ L+LKTPQ+               G   Y   K   L K   KC+     +   EVR+ LEVRP + LD   DQ PLVS+ E T QSEFD +QS
Subjt:  --SPSTTLDLKTPQE---------------GVKGY---KIWCLEKGINKCIISRDVE--TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQS

Query:  QQERILIDEGLHSEESSSNNDLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPK
        QQERILIDEG   EE+ SNNDLQNYQLTRDRVQRER       +ADLVAYALTCAA+SIEA+ LTFEE IV +SKKQWKDAMEAELFSL KNQTWSLVPK
Subjt:  QQERILIDEGLHSEESSSNNDLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPK

Query:  PHNQKLIQSKWIYKIKS---------------AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVK
        P NQKLIQSKWIYKIK                AKGYTQKEGVDF+EIFSPVVRHSSIRLIL IVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVK
Subjt:  PHNQKLIQSKWIYKIKS---------------AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVK

Query:  GKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD----------
        GKEDMV RLHKS+YGLKQSPRQWYIRFDTFILKQGF+RNSYDACVYWKLSQ+GT+IYLLLYVDDMILVSKDY EIC+LKKQLS+E+EMKD          
Subjt:  GKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD----------

Query:  ------LASHLKLSSSQCPVTEQERLE----MSNIP-YCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSG
              LASH +LSSSQCPVT+QER +    MS I  + +  G   +  +  +  L Y   SASVSLC+ +DCDKSTLLEGFT+A N VADL KRRSLSG
Subjt:  ------LASHLKLSSSQCPVTEQERLE----MSNIP-YCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSG

Query:  HAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK
        H FRLYGN+VSWKV LQP+ ALSTTESEYISLGEA                            SAIHLAKN SHHER K
Subjt:  HAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK

A0A5D3CTV2 Putative polyprotein | e value: 9.0e-254 | identity: 62.72%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMGG RYF+SIID FSRKVW+YPLKQKDE F KFLEWKKQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC
                                    SPST L+LKTPQE                                          GVKGYK+WC+EKG+NKC
Subjt:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC

Query:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR
        IISRDV                        TEVR+  EVRPSI LD   +Q PLVS+ E T QSEFDGIQSQQERILIDEG   EESSSNNDLQNYQLTR
Subjt:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR

Query:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------
        DRVQRER APIRYGYADLVAYALTCAADSIEA+PLTFEE IV DSKKQWKDAME ELFSL KNQTWSLVPKP NQKLIQSKWIYKIK             
Subjt:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------

Query:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT
           AKGYTQKEGVDF+EIFSPVVRHSSIRLIL I VHFDMFIEQMDVTT FLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKS+YGLKQSPRQWYIRFDT
Subjt:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT

Query:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------
        FILKQGF+RNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+E+EMKD                                       
Subjt:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------

Query:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKST
                LASH +LSSSQCPVT+QER+EMSNIPYCNAVGS MYLMICTRPDLGYAMS                          SASVSLC+ +DCDKST
Subjt:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMS--------------------------SASVSLCFIKDCDKST

Query:  LLEGFTNADNNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHER
        LLEGFT+AD   ADL KRRSLSGH FRLYGN+VSWKV LQPVVALSTTESEYISLGEA KEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKN SHHER
Subjt:  LLEGFTNADNNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHER

Query:  GK
         K
Subjt:  GK

A0A5D3DNU1 Putative gag-pol polyprotein | e value: 3.5e-237 | identity: 61.34%
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------
        SMGG RYF+SIID FSRKVW+YPLKQKDE F KFLEWKKQVENQTGR+VK                                                  
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK--------------------------------------------------

Query:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC
                                    SPST L+LKTPQE                                          GVKGYK+WC+EKG+NKC
Subjt:  ----------------------------SPSTTLDLKTPQE------------------------------------------GVKGYKIWCLEKGINKC

Query:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR
        IISRDV                        TEVR+  EVRPSI LD   +Q PLVS+ E T QSEFDGIQSQQERILIDEG   EESSSNNDLQNYQLTR
Subjt:  IISRDVE-----------------------TEVRVDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTR

Query:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------
        DRVQRER APIRYGYADLVAYALTCAADSIEA+PLTFEE IV DSKKQWKDAME ELFSL KNQTWSLVPKP NQKLIQSKWIYKIK             
Subjt:  DRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS------------

Query:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT
           AKGYTQKEGVDF+EIFSPVVRHSSIRLIL I VHFDMFIEQMDVTT FLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKS+YGLKQSPRQWYIRFDT
Subjt:  ---AKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDT

Query:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------
        FILKQGF+RNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+E+EMKD                                       
Subjt:  FILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------

Query:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSGHAF
                LASH +LSSSQCPVT+QER+EMSNIPYCNAVGS MYLMICTRPDLGYAMSSASVSLC+ +DCDKSTLLEGFT+AD   ADL KR  L     
Subjt:  --------LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSGHAF

Query:  RLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK
                          L    +EYISLGEA KEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKN SHHER K
Subjt:  RLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 3.5e-53 | identity: 28.7%
Query:  HQSEFDGIQSQQERILIDEGLHSEESSSNNDLQN--YQLTRDRVQRERQAP-IRYGYADLVAYALTCAADSI-EAQPLTFEETIVYDSKKQWKDAMEAEL
        H +E  G  +  E    +   H +E   +N  +N   ++   R +R +  P I Y   D     +   A +I    P +F+E    D K  W++A+  EL
Subjt:  HQSEFDGIQSQQERILIDEGLHSEESSSNNDLQN--YQLTRDRVQRERQAP-IRYGYADLVAYALTCAADSI-EAQPLTFEETIVYDSKKQWKDAMEAEL

Query:  FSLQKNQTWSLVPKPHNQKLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEE
         + + N TW++  +P N+ ++ S+W++ +K               A+G+TQK  +D+ E F+PV R SS R IL +V+ +++ + QMDV T FL+G L+E
Subjt:  FSLQKNQTWSLVPKPHNQKLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEE

Query:  VIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRGTF---IYLLLYVDDMILVSKDYAEICQLKKQLSS
         IYM  P+G  +    D VC+L+K+IYGLKQ+ R W+  F+  + +  F  +S D C+Y  +  +G     IY+LLYVDD+++ + D   +   K+ L  
Subjt:  VIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRGTF---IYLLLYVDDMILVSKDYAEICQLKKQLSS

Query:  EYEMKDL-------ASHLKLSSSQCPVTE----QERLEMSNIPYCNAV---------------------------GSTMYLMICTRPD------------
        ++ M DL          +++   +  +++    ++ L   N+  CNAV                           G  MY+M+CTRPD            
Subjt:  EYEMKDL-------ASHLKLSSSQCPVTE----QERLEMSNIPYCNAV---------------------------GSTMYLMICTRPD------------

Query:  --------------LGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSGHAFRLYG-NIVSWKVTLQPVVALSTTESEYISLGEAAKEAV
                      L Y   +  + L F K+      + G+ ++D   +++  R+S +G+ F+++  N++ W    Q  VA S+TE+EY++L EA +EA+
Subjt:  --------------LGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSGHAFRLYG-NIVSWKVTLQPVVALSTTESEYISLGEAAKEAV

Query:  WLKRIVGELLSQEFIPI-IHCDSQSAIHLAKNLSHHERGK
        WLK ++  +  +   PI I+ D+Q  I +A N S H+R K
Subjt:  WLKRIVGELLSQEFIPI-IHCDSQSAIHLAKNLSHHERGK

P0CV72 Secreted RxLR effector protein 161 | e value: 5.0e-15 | identity: 38.97%
Query:  MSNIPYCNAVGSTMYLMICTRPDLG--------------------------YAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSGHAFRLY
        M N+PY +AVG+ MYLM+ TRPDL                           Y  S+ +  L F +    +  L G+++AD    D+  RRS SG+ F+L 
Subjt:  MSNIPYCNAVGSTMYLMICTRPDLG--------------------------YAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSGHAFRLY

Query:  GNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWL
        G  VSW+   Q  VALS+TE EY++L EA +EAVWL
Subjt:  GNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.4e-94 | identity: 31.78%
Query:  MSSMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK------------------------------------------------
        + SMGG +YF++ ID  SRK+W+Y LK KD+ F+ F ++   VE +TGR++K                                                
Subjt:  MSSMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVK------------------------------------------------

Query:  ------------------------------SPSTTLDLKTPQE---------------GVK------------------------------GYKIWCLEK
                                      SPS  L  + P+                G +                              GY++W   K
Subjt:  ------------------------------SPSTTLDLKTPQE---------------GVK------------------------------GYKIWCLEK

Query:  GINKCIISRDV---ETEVR--VDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQER--------ILIDEGLHSEESSSNNDLQNYQLTRDRVQR
           K I SRDV   E+EVR   D+  +   G+  +   +P  S    + +S  D +  Q E+          +DEG+   E  +  + Q+  L R   +R
Subjt:  GINKCIISRDV---ETEVR--VDLEVRPSIGLDVSSDQLPLVSQTEATHQSEFDGIQSQQER--------ILIDEGLHSEESSSNNDLQNYQLTRDRVQR

Query:  ERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS--------------AKG
         R    RY   + V         S + +P + +E + +  K Q   AM+ E+ SLQKN T+ LV  P  ++ ++ KW++K+K                KG
Subjt:  ERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIKS--------------AKG

Query:  YTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQG
        + QK+G+DF+EIFSPVV+ +SIR IL +    D+ +EQ+DV T FLHG+LEE IYM QP+G+EV GK+ MVC+L+KS+YGLKQ+PRQWY++FD+F+  Q 
Subjt:  YTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQG

Query:  FYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------------
        + +   D CVY+K      FI LLLYVDDM++V KD   I +LK  LS  ++MKD                                             
Subjt:  FYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKD---------------------------------------------

Query:  --LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMSSAS---------------VSLCFIK----DC----DKSTLLEGFTNAD
          LA HLKLS   CP T +E+  M+ +PY +AVGS MY M+CTRPD+ +A+   S                 L +++    DC        +L+G+T+AD
Subjt:  --LASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTMYLMICTRPDLGYAMSSAS---------------VSLCFIK----DC----DKSTLLEGFTNAD

Query:  NNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK
            D+  R+S +G+ F   G  +SW+  LQ  VALSTTE+EYI+  E  KE +WLKR + EL   +   +++CDSQSAI L+KN  +H R K
Subjt:  NNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNLSHHERGK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    1.2e-40    28.29
Query:  EAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQ-KLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIFSPVVRHSSIRL
        E++P T  + +     ++W++AM +E+ +   N TW LVP P +   ++  +WI+  K               AKGY Q+ G+D+ E FSPV++ +SIR+
Subjt:  EAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLVPKPHNQ-KLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIFSPVVRHSSIRL

Query:  ILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRG-TFIYL
        +L + V     I Q+DV   FL G L + +YM+QP G+  K + + VC+L K++YGLKQ+PR WY+    ++L  GF  +  D  ++  + QRG + +Y+
Subjt:  ILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRG-TFIYL

Query:  LLYVDDMILVSKDYAEICQLKKQLSSEYEMKD--------------LASHLKLSSSQ------------------CPVTEQERLEMSN-------IPYCN
        L+YVDD+++   D   +      LS  + +KD              + + L LS  +                   P+    +L + +         Y  
Subjt:  LLYVDDMILVSKDYAEICQLKKQLSSEYEMKD--------------LASHLKLSSSQ------------------CPVTEQERLEMSN-------IPYCN

Query:  AVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTL---LEGFTNADNNVADLAKRRSLSGHAFR--------------------LYGNIVSWKVTLQ
         VGS  YL   TRPD+ YA++  S  +    +     L   L       N+   L K  +LS HA+                     L  + +SW    Q
Subjt:  AVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTL---LEGFTNADNNVADLAKRRSLSGHAFR--------------------LYGNIVSWKVTLQ

Query:  PVVALSTTESEYISLGEAAKEAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNLSHHERGK
          V  S+TE+EY S+   + E  W+  ++ EL +     P+I+CD+  A +L  N   H R K
Subjt:  PVVALSTTESEYISLGEAAKEAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNLSHHERGK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    1.7e-02    43.59
Query:  RYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQ
        RY++  +D F+R  W+YPLKQK +  E F+ +K  +EN+
Subjt:  RYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.9e-43    29.75
Query:  AYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLV-PKPHNQKLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIF
        +YA + AA+S   +P T  + +  D   +W+ AM +E+ +   N TW LV P P +  ++  +WI+  K               AKGY Q+ G+D+ E F
Subjt:  AYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLV-PKPHNQKLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIF

Query:  SPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWK
        SPV++ +SIR++L + V     I Q+DV   FL G L + +YM+QP G+  K + D VCRL K+IYGLKQ+PR WY+   T++L  GF  +  D  ++  
Subjt:  SPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWK

Query:  LSQRG-TFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMK-----------------------------DLASHLKLSSSQ---CPVTEQERLEM---
        + QRG + IY+L+YVDD+++   D   +      LS  + +K                             DL +   + +++    P+    +L +   
Subjt:  LSQRG-TFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMK-----------------------------DLASHLKLSSSQ---CPVTEQERLEM---

Query:  SNIP----YCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTL---LEGFTNADNNVADLAKRRSLSGHAFR--------------------LY
        + +P    Y   VGS  YL   TRPDL YA++  S  +    D   + L   L       ++   L K  +LS HA+                     L 
Subjt:  SNIP----YCNAVGSTMYLMICTRPDLGYAMSSASVSLCFIKDCDKSTL---LEGFTNADNNVADLAKRRSLSGHAFR--------------------LY

Query:  GNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNLSHHERGK
         + +SW    Q  V  S+TE+EY S+   + E  W+  ++ EL +     P+I+CD+  A +L  N   H R K
Subjt:  GNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNLSHHERGK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.3e-02    40.91
Query:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQ
        S+   RY++  +D F+R  W+YPLKQK +  + F+ +K  VEN+
Subjt:  SMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQ

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    7.0e-49    29.28
Query:  QSQQERILIDEGLHSEESSSNNDLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLV
        ++++   L D   HS  S + +D+  +      +  E+ +P+ + +       L C A + E  P T+ E   +     W  AM+ E+ +++   TW + 
Subjt:  QSQQERILIDEGLHSEESSSNNDLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQKNQTWSLV

Query:  PKPHNQKLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEV
          P N+K I  KW+YKIK               AKGYTQ+EG+DF E FSPV + +S++LIL I   ++  + Q+D++  FL+G+L+E IYM  P GY  
Subjt:  PKPHNQKLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEV

Query:  KGKEDM----VCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKDLAS--
        +  + +    VC L KSIYGLKQ+ RQW+++F   ++  GF ++  D   + K++    F+ +L+YVDD+I+ S + A + +LK QL S ++++DL    
Subjt:  KGKEDM----VCRLHKSIYGLKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKDLAS--

Query:  ---HLKLSSSQCPVTEQER--------------LEMSNIP--------------------YCNAVGSTMYLMICTRPDLGYAMSSAS-----------VS
            L+++ S   +   +R               + S++P                    Y   +G  MYL I TR D+ +A++  S            +
Subjt:  ---HLKLSSSQCPVTEQER--------------LEMSNIP--------------------YCNAVGSTMYLMICTRPDLGYAMSSAS-----------VS

Query:  LCFIKDCDKSTLLEGFTNADNNVADLA------------KRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGEL-LSQE
        +  I    K T+ +G   +      L              RRS +G+   L  +++SWK   Q VV+ S+ E+EY +L  A  E +WL +   EL L   
Subjt:  LCFIKDCDKSTLLEGFTNADNNVADLA------------KRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGEL-LSQE

Query:  FIPIIHCDSQSAIHLAKNLSHHERGK
           ++ CD+ +AIH+A N   HER K
Subjt:  FIPIIHCDSQSAIHLAKNLSHHERGK

ATMG00810.1 DNA/RNA polymerases superfamily protein    1.8e-07    28.32
Query:  IYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKDLA--------------SHLKLS--------------------SSQCPVTEQERLEMSNIP----Y
        +YLLLYVDD++L       +  L  QLSS + MKDL               S L LS                    S+  P+     +  +  P    +
Subjt:  IYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKDLA--------------SHLKLS--------------------SSQCPVTEQERLEMSNIP----Y

Query:  CNAVGSTMYLMICTRPDLGYAMSSASVSL--CFIKDCD---------KSTLLEGF---TNADNNVADL---------AKRRSLSGHAFRLYGNIVSWKVT
         + VG+  YL + TRPD+ YA++     +    + D D         K T+  G     N+  NV            + RRS +G    L  NI+SW   
Subjt:  CNAVGSTMYLMICTRPDLGYAMSSASVSL--CFIKDCD---------KSTLLEGF---TNADNNVADL---------AKRRSLSGHAFRLYGNIVSWKVT

Query:  LQPVVALSTTESEYISLGEAAKEAVW
         QP V+ S+TE+EY +L   A E  W
Subjt:  LQPVVALSTTESEYISLGEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    3.4e-11    35
Query:  YALTCAADSIEAQPLTFEETIVYDSKKQ-WKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIFS
        Y+LT    +I+ +P    +++++  K   W  AM+ EL +L +N+TW LVP P NQ ++  KW++K K               AKG+ Q+EG+ F E +S
Subjt:  YALTCAADSIEAQPLTFEETIVYDSKKQ-WKDAMEAELFSLQKNQTWSLVPKPHNQKLIQSKWIYKIK--------------SAKGYTQKEGVDFNEIFS

Query:  PVVRHSSIRLILFIVVHFDM
        PVVR ++IR IL +    ++
Subjt:  PVVRHSSIRLILFIVVHFDM


Sequences
CDS sequence
ATGTCTTCTATGGGAGGTTTGAGATACTTCTTGTCTATCATTGACGCCTTTTCAAGGAAGGTATGGATGTATCCATTGAAACAAAAGGATGAAACTTTTGAGAAATTCCT
TGAATGGAAGAAGCAAGTTGAAAATCAAACAGGTAGGGAGGTCAAGAGTCCTTCTACAACTTTAGACTTAAAGACTCCTCAAGAGGGTGTCAAAGGTTATAAAATTTGGT
GCTTGGAGAAGGGGATAAATAAATGCATTATCAGTAGAGATGTTGAGACAGAGGTCAGAGTTGATTTAGAAGTACGACCATCAATTGGCTTAGATGTATCTAGTGATCAG
TTACCACTAGTTTCACAGACAGAGGCTACACACCAGTCTGAATTTGATGGTATACAGTCTCAACAGGAGAGGATTTTGATTGATGAGGGACTTCACAGTGAAGAAAGCTC
AAGTAATAATGACCTACAAAACTATCAGCTTACTCGGGACAGGGTTCAGAGAGAAAGACAGGCTCCTATAAGGTATGGTTACGCTGACTTAGTTGCTTATGCTCTTACTT
GCGCAGCTGATAGTATTGAAGCACAGCCTCTTACTTTTGAAGAGACAATCGTATATGATTCTAAGAAACAATGGAAGGATGCTATGGAGGCAGAGTTGTTCTCTTTACAA
AAGAATCAGACATGGTCATTGGTTCCAAAGCCTCATAACCAGAAGCTCATTCAATCAAAGTGGATTTACAAAATCAAGTCAGCAAAGGGCTACACTCAGAAGGAGGGAGT
TGATTTTAATGAGATTTTTTCTCCGGTGGTGAGGCATTCGTCCATTAGATTAATTTTATTTATTGTTGTTCACTTTGATATGTTCATTGAACAAATGGACGTCACCACAA
CATTTCTTCATGGAGAATTGGAGGAGGTGATTTACATGGCTCAACCTAAGGGCTATGAGGTGAAGGGTAAGGAAGACATGGTTTGTCGTCTTCATAAGTCCATTTATGGA
CTTAAACAATCTCCAAGACAGTGGTATATCAGGTTTGATACTTTCATTCTAAAGCAGGGATTTTACAGGAACTCATATGATGCTTGTGTTTACTGGAAACTATCTCAGAG
AGGTACATTTATCTATCTACTGTTGTATGTAGATGATATGATACTAGTGTCTAAGGATTATGCTGAAATCTGTCAACTCAAGAAACAGTTAAGTAGTGAGTATGAAATGA
AAGATTTAGCATCTCATTTAAAACTTTCTTCGTCTCAATGTCCTGTTACTGAACAAGAAAGGTTAGAGATGTCTAATATTCCATATTGTAATGCTGTTGGAAGTACTATG
TACCTGATGATTTGTACTAGGCCTGACTTGGGTTATGCTATGAGTAGTGCCAGTGTATCATTGTGTTTTATTAAGGATTGTGATAAGTCAACATTGTTGGAAGGCTTCAC
AAATGCAGACAACAATGTTGCAGATCTTGCTAAAAGAAGGTCACTATCAGGTCACGCTTTTCGCTTGTATGGTAATATTGTCAGTTGGAAAGTTACCCTACAACCAGTTG
TTGCTTTGTCGACTACTGAGTCAGAATATATTTCTCTTGGTGAAGCAGCGAAGGAAGCAGTGTGGTTGAAAAGAATTGTTGGTGAGTTGTTATCGCAGGAGTTTATTCCT
ATCATCCATTGTGATAGCCAGAGTGCTATTCATCTTGCGAAAAATCTATCTCATCATGAACGGGGTAAGATCGGGTGGATAGCTGGGAACATAGGGTACAAGATGGAATT
CACTCCTACCCGCTTTAGGGTTAGTAGATAG
mRNA sequence
ATGTCTTCTATGGGAGGTTTGAGATACTTCTTGTCTATCATTGACGCCTTTTCAAGGAAGGTATGGATGTATCCATTGAAACAAAAGGATGAAACTTTTGAGAAATTCCT
TGAATGGAAGAAGCAAGTTGAAAATCAAACAGGTAGGGAGGTCAAGAGTCCTTCTACAACTTTAGACTTAAAGACTCCTCAAGAGGGTGTCAAAGGTTATAAAATTTGGT
GCTTGGAGAAGGGGATAAATAAATGCATTATCAGTAGAGATGTTGAGACAGAGGTCAGAGTTGATTTAGAAGTACGACCATCAATTGGCTTAGATGTATCTAGTGATCAG
TTACCACTAGTTTCACAGACAGAGGCTACACACCAGTCTGAATTTGATGGTATACAGTCTCAACAGGAGAGGATTTTGATTGATGAGGGACTTCACAGTGAAGAAAGCTC
AAGTAATAATGACCTACAAAACTATCAGCTTACTCGGGACAGGGTTCAGAGAGAAAGACAGGCTCCTATAAGGTATGGTTACGCTGACTTAGTTGCTTATGCTCTTACTT
GCGCAGCTGATAGTATTGAAGCACAGCCTCTTACTTTTGAAGAGACAATCGTATATGATTCTAAGAAACAATGGAAGGATGCTATGGAGGCAGAGTTGTTCTCTTTACAA
AAGAATCAGACATGGTCATTGGTTCCAAAGCCTCATAACCAGAAGCTCATTCAATCAAAGTGGATTTACAAAATCAAGTCAGCAAAGGGCTACACTCAGAAGGAGGGAGT
TGATTTTAATGAGATTTTTTCTCCGGTGGTGAGGCATTCGTCCATTAGATTAATTTTATTTATTGTTGTTCACTTTGATATGTTCATTGAACAAATGGACGTCACCACAA
CATTTCTTCATGGAGAATTGGAGGAGGTGATTTACATGGCTCAACCTAAGGGCTATGAGGTGAAGGGTAAGGAAGACATGGTTTGTCGTCTTCATAAGTCCATTTATGGA
CTTAAACAATCTCCAAGACAGTGGTATATCAGGTTTGATACTTTCATTCTAAAGCAGGGATTTTACAGGAACTCATATGATGCTTGTGTTTACTGGAAACTATCTCAGAG
AGGTACATTTATCTATCTACTGTTGTATGTAGATGATATGATACTAGTGTCTAAGGATTATGCTGAAATCTGTCAACTCAAGAAACAGTTAAGTAGTGAGTATGAAATGA
AAGATTTAGCATCTCATTTAAAACTTTCTTCGTCTCAATGTCCTGTTACTGAACAAGAAAGGTTAGAGATGTCTAATATTCCATATTGTAATGCTGTTGGAAGTACTATG
TACCTGATGATTTGTACTAGGCCTGACTTGGGTTATGCTATGAGTAGTGCCAGTGTATCATTGTGTTTTATTAAGGATTGTGATAAGTCAACATTGTTGGAAGGCTTCAC
AAATGCAGACAACAATGTTGCAGATCTTGCTAAAAGAAGGTCACTATCAGGTCACGCTTTTCGCTTGTATGGTAATATTGTCAGTTGGAAAGTTACCCTACAACCAGTTG
TTGCTTTGTCGACTACTGAGTCAGAATATATTTCTCTTGGTGAAGCAGCGAAGGAAGCAGTGTGGTTGAAAAGAATTGTTGGTGAGTTGTTATCGCAGGAGTTTATTCCT
ATCATCCATTGTGATAGCCAGAGTGCTATTCATCTTGCGAAAAATCTATCTCATCATGAACGGGGTAAGATCGGGTGGATAGCTGGGAACATAGGGTACAAGATGGAATT
CACTCCTACCCGCTTTAGGGTTAGTAGATAG
Protein sequence
MSSMGGLRYFLSIIDAFSRKVWMYPLKQKDETFEKFLEWKKQVENQTGREVKSPSTTLDLKTPQEGVKGYKIWCLEKGINKCIISRDVETEVRVDLEVRPSIGLDVSSDQ
LPLVSQTEATHQSEFDGIQSQQERILIDEGLHSEESSSNNDLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAADSIEAQPLTFEETIVYDSKKQWKDAMEAELFSLQ
KNQTWSLVPKPHNQKLIQSKWIYKIKSAKGYTQKEGVDFNEIFSPVVRHSSIRLILFIVVHFDMFIEQMDVTTTFLHGELEEVIYMAQPKGYEVKGKEDMVCRLHKSIYG
LKQSPRQWYIRFDTFILKQGFYRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEYEMKDLASHLKLSSSQCPVTEQERLEMSNIPYCNAVGSTM
YLMICTRPDLGYAMSSASVSLCFIKDCDKSTLLEGFTNADNNVADLAKRRSLSGHAFRLYGNIVSWKVTLQPVVALSTTESEYISLGEAAKEAVWLKRIVGELLSQEFIP
IIHCDSQSAIHLAKNLSHHERGKIGWIAGNIGYKMEFTPTRFRVSR