; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0017821 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0017821
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionPol
Genome locationchr03:8590846..8592317
RNA-Seq ExpressionPay0017821
SyntenyPay0017821
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAK84483.1 putative copia-like polyprotein [Solanum lycopersicum]3.9e-16864.01Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MSTGI +EHPVAHVHTQNGLAES IK LQLIARPLLMR KL +S+WGHAILHAA+L                    EPNISHLR+FGCAVYVPI PPQR 
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNET-NSTLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT
        KMGPQRRLGIYV +ESPSII+Y EP+ GD+F  RFADCHF+E+   TLGG  K L  EI WN S LSHLDPRT QCE EVQ+II+LQ++ANQ+PDAFT+ 
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNET-NSTLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT

Query:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ-------------------------------ETQVY---------EDNNEISINYTMTGKRWNR
         ++TKSYIPA NAP R++IPT Q+   NES    K+                               ETQV            NNEIS+NY  T K WNR
Subjt:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ-------------------------------ETQVY---------EDNNEISINYTMTGKRWNR

Query:  INVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFS
         NVV+DNIFAYNVA  I+ ++ED EPKSV ECR R DWPKWK+AIQAEL SL KRE FGP++ TP+G+KPVG+KW+FVRKRNE  EV RYKARL AQGFS
Subjt:  INVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFS

Query:  QRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG
        QRPGIDY ETYSPVVDAIT RYLI+LAV E L+M LMDVVT YLYGSL++ I++KIPE FK+PE+Y  +SRE CSIKLQ+SLYGLKQSG
Subjt:  QRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG

CAN60947.1 hypothetical protein VITISV_015758 [Vitis vinifera]2.5e-16763.11Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MS GINIEHPVAH HTQNGLAESFIK LQLIARPLLM+ KLP S WGHAI+H A+L                    +PNI HLRIFGCAVYVPI P Q  
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT
        KMGPQRRLG+YV F+SPSII Y EPL GDVFTARFADCHFNE+     G  K +     EI+W  S ++HLDPRT QCELEVQ+IIHLQ++ NQ+PDAF 
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT

Query:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK-------------------------------------QETQVYEDNNEISINYTMTGKRWNRI
        DTKKVTKS+IPAAN P+RI++P  Q+   NES +R K                                     +E QV E N EIS++Y  TG++W+R 
Subjt:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK-------------------------------------QETQVYEDNNEISINYTMTGKRWNRI

Query:  NVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQ
        N+V++NIFA+ VA +II  +ED EP++V+ECR+R DWPKWKE IQAELNSLTKREVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKARL AQGFSQ
Subjt:  NVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQ

Query:  RPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG
        RPGIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+T YLYGS++N+IY+KIPEGFK+P++ N+  R + SIKLQRSLYGLKQSG
Subjt:  RPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG

RVW26456.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.1e-16659.39Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MS GINIEHPVAH HTQNGLAESFIK LQLIARPLLM+ KLP S WGHAI+HAA+L                    +PNISHLRIFGCAVYVPI P QR 
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT
        KMGPQRRLG+YV F+SPSII+Y EPL GDVFTARFADCHFNE+     G  K +     EI+W  S ++HLDPRT QCELEVQ+IIHLQ++ NQ+PDAF 
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT

Query:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK------------------------------------------QETQVYE--------------
        DTKKVTKS+IPAAN P+RI++P  Q+   NES +R K                                          +E Q+ +              
Subjt:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK------------------------------------------QETQVYE--------------

Query:  ---------------DNNEISINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKG
                       +N EIS++Y  TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DWPKWKEAIQAELNSLTKREVFGPVV TP+ 
Subjt:  ---------------DNNEISINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKG

Query:  VKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYN
        VKPVG+KWVFVRKRNENNE+ RYKARL AQGFSQRPGIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+T YLYGS++N+IY+KIPEGFK+P++ N
Subjt:  VKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYN

Query:  SNSRELCSIKLQRSLYGLKQSG
        +  R + SIKLQRSLYGLKQSG
Subjt:  SNSRELCSIKLQRSLYGLKQSG

RVW51062.1 Copia protein [Vitis vinifera]3.6e-16659.39Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MS GINIEHPVAH HTQNGLAESFIK LQLIARPLLM+ KLP S WGHAI+H A+L                    +PNISHLRIFGCAVYVPI P QR 
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT
        KMGPQRRLG+YV F+SPSII+Y EPL GDVFTARFADCHFNE+     G  K +     EI+W  S ++HLDPRT QCELEVQ+IIHLQ++ANQ+PDAF 
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT

Query:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK------------------------------------------QETQVYE--------------
        DTKKVTKS+IPAAN P+RI++P  Q+   NES +R K                                          +E Q+ +              
Subjt:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK------------------------------------------QETQVYE--------------

Query:  ---------------DNNEISINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKG
                       +N EIS++Y  TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DWPKWKEAIQAELNSLTKREVFGPVV TP+ 
Subjt:  ---------------DNNEISINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKG

Query:  VKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYN
        VKPVG+KWVFVRKRNENNE+ RYKARL AQGFSQRPGIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+T YLYGS++N+IY+KIPEGFK+P++ N
Subjt:  VKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYN

Query:  SNSRELCSIKLQRSLYGLKQSG
        +  R + SIKLQRSLYGLKQSG
Subjt:  SNSRELCSIKLQRSLYGLKQSG

RVX16158.1 Copia protein [Vitis vinifera]4.7e-16658.65Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MS GINIEHPVAH HTQNGLAESFIK LQLIARPLLM+ KLP S WGHAI+HAA+L                    +PNISHLRIFGCAVYVPI P QR 
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT
        KMGPQRRLG+YV F+SPSIIRY EPL GDVFTARFADCHFNE+     G  K +     EI+W  S ++HLDPRT QCELEVQ+IIHLQ++ANQ+PDAF 
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT

Query:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK------------------------------------------QETQVYE--------------
        DTKKVTKS+IPAAN P+RI++P  Q+   NES +R K                                          +E Q+ +              
Subjt:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK------------------------------------------QETQVYE--------------

Query:  -------------------------DNNEISINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREV
                                 +N EIS++Y  TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DWPKWKEAIQAELNSLTKREV
Subjt:  -------------------------DNNEISINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREV

Query:  FGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIP
        FGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKARL AQGFSQRPGIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+T YLYGS++N+IY+KIP
Subjt:  FGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIP

Query:  EGFKIPESYNSNSRELCSIKLQRSLYGLKQSG
        EGFK+P++ N+  R + SIKLQRSLYGLKQSG
Subjt:  EGFKIPESYNSNSRELCSIKLQRSLYGLKQSG

TrEMBL top hitse value%identityAlignment
A0A1U9WYB0 Pol5.1e-16662.47Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MS GI +EHPVA+VHTQNGLAES IK LQLIARPLLMR+KLP S WGHAILHAA+L                    EPNISHLRIFGCAVYVPI PPQR 
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETN-STLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT
        K+ PQRRLGIYV +ESPSII+Y EPL GD+FTARFADCHF+E++  TLGG   + + EI W +S L+ LD RTK+CELEVQ+IIHLQ +ANQ+PDAF D+
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETN-STLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT

Query:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ----------------------------------ETQVYEDNN--EISINYTMTGKRWNRINVV
        K+VTKS+IPA NAP +I++P  Q+ + NES  R K+                                  +  + E+N   E SIN+  + K WNR  ++
Subjt:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ----------------------------------ETQVYEDNN--EISINYTMTGKRWNRINVV

Query:  VDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPG
        +DNIFAYN+A +I+ E+ED+EP+SVDECR R DWPKWK+AIQ+EL+SL KR+VFGPVV TP+GVKPVG+KWVFVRKRNE NE+ RYKARL AQGFSQRPG
Subjt:  VDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPG

Query:  IDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG
         DY+ETYSPVVDAIT RYL+S AV E LDM LMDVVT YLYG+LEN+IYI+IPEGF +PE+  SN + + SIKLQRSLYGLKQSG
Subjt:  IDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG

A0A1U9WYD5 Pol1.1e-16361.86Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MS GI +EHPVA+VHTQNGLAES IK LQLIARPLLMR+KLP S WGHAILHAA+L                    EPNISHLRIFGCAVYVPI P QR 
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETN-STLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT
        K+G QRRLGIYV +ESPSII+Y EPL GD+FTARFADCHF+E++  TLGG   + + EI W +S L+ LD RTK+CELEVQ+IIHLQ +ANQ+PDAF D+
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETN-STLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT

Query:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ----------------------------------ETQVYEDNNEI--SINYTMTGKRWNRINVV
        K+VTKS+IPA NAP +I++P +Q+ + NES  R K+                                  +  + E+N  I  SIN+  + K WNR  ++
Subjt:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ----------------------------------ETQVYEDNNEI--SINYTMTGKRWNRINVV

Query:  VDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPG
        +DNIFAYN+A +I+ E+ED+EP+SVDECR R DWPKWK+AIQ+EL+SL KR+VFGPVV TP+GVKPVG+KWVFVRKRNE NE+ RYKARL AQGFSQRP 
Subjt:  VDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPG

Query:  IDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG
         DY+ETYSPVVDAIT RYL+S AV E LDM LMDVVT YLYG+LEN+IY++IPEGF +PE+  SN + + SIKLQRSLYGLKQSG
Subjt:  IDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG

A0A1U9WYE0 Pol8.1e-16461.86Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MS GI +EHPVA+VHTQNGLAES IK LQLIARPLLMR+KLP S WGHAILHAA+L                    EPNISHLRIFGCAVYVPI PPQR 
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETN-STLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT
        K+G QRRLGIYV +ESPSII+Y EPL GD+FTARFADCHF+E++ STLGG   + + EI W +S L+ LD RTK+CELEVQ+IIHLQ +ANQ+PDAF D+
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETN-STLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT

Query:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ----------------------------------ETQVYEDNNEI--SINYTMTGKRWNRINVV
        K+VTKS+IPA NAP +I++P +Q+ + NES  R K+                                  +  + E+N  I  SIN+  + K WNR  ++
Subjt:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ----------------------------------ETQVYEDNNEI--SINYTMTGKRWNRINVV

Query:  VDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPG
        +DNIFAYN+A +I+ E+ED+E +SVDECR R DWPKWK+AIQ+EL+SL KR+VFGPVV TP+GVKPVG+KWVFVRKRNE NE+ RYKARL AQGFSQRPG
Subjt:  VDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPG

Query:  IDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG
         DY+ETYSPVVDAIT RYL+S AV E LDM LMDVVT YLYG+LEN+IY++I EGF +PE+  SN + + SIKLQ+SLYGLKQSG
Subjt:  IDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG

A5B4M9 Uncharacterized protein1.2e-16763.11Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MS GINIEHPVAH HTQNGLAESFIK LQLIARPLLM+ KLP S WGHAI+H A+L                    +PNI HLRIFGCAVYVPI P Q  
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT
        KMGPQRRLG+YV F+SPSII Y EPL GDVFTARFADCHFNE+     G  K +     EI+W  S ++HLDPRT QCELEVQ+IIHLQ++ NQ+PDAF 
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGIKKL---ENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFT

Query:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK-------------------------------------QETQVYEDNNEISINYTMTGKRWNRI
        DTKKVTKS+IPAAN P+RI++P  Q+   NES +R K                                     +E QV E N EIS++Y  TG++W+R 
Subjt:  DTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQK-------------------------------------QETQVYEDNNEISINYTMTGKRWNRI

Query:  NVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQ
        N+V++NIFA+ VA +II  +ED EP++V+ECR+R DWPKWKE IQAELNSLTKREVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKARL AQGFSQ
Subjt:  NVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQ

Query:  RPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG
        RPGIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+T YLYGS++N+IY+KIPEGFK+P++ N+  R + SIKLQRSLYGLKQSG
Subjt:  RPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG

Q949J4 Putative copia-like polyprotein1.9e-16864.01Show/hide
Query:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA
        MSTGI +EHPVAHVHTQNGLAES IK LQLIARPLLMR KL +S+WGHAILHAA+L                    EPNISHLR+FGCAVYVPI PPQR 
Subjt:  MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL--------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNET-NSTLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT
        KMGPQRRLGIYV +ESPSII+Y EP+ GD+F  RFADCHF+E+   TLGG  K L  EI WN S LSHLDPRT QCE EVQ+II+LQ++ANQ+PDAFT+ 
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNET-NSTLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDT

Query:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ-------------------------------ETQVY---------EDNNEISINYTMTGKRWNR
         ++TKSYIPA NAP R++IPT Q+   NES    K+                               ETQV            NNEIS+NY  T K WNR
Subjt:  KKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQ-------------------------------ETQVY---------EDNNEISINYTMTGKRWNR

Query:  INVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFS
         NVV+DNIFAYNVA  I+ ++ED EPKSV ECR R DWPKWK+AIQAEL SL KRE FGP++ TP+G+KPVG+KW+FVRKRNE  EV RYKARL AQGFS
Subjt:  INVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFS

Query:  QRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG
        QRPGIDY ETYSPVVDAIT RYLI+LAV E L+M LMDVVT YLYGSL++ I++KIPE FK+PE+Y  +SRE CSIKLQ+SLYGLKQSG
Subjt:  QRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-3227.59Show/hide
Query:  GINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL-----------------------EPNISHLRIFGCAVYVPIFPPQRA
        GI+    V H    NG++E  I+ +   AR ++  AKL  S WG A+L A  L                       +P + HLR+FG  VYV I   ++ 
Subjt:  GINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL-----------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGI----------KKLENEIAWNVS---LLSHLDPRTKQCELEVQKIIHLQS
        K   +    I+V +E P+  +  + +      AR  D   +ETN      +          K+ EN+   N S   + +     +K+C+  +Q +    S
Subjt:  KMGPQRRLGIYVEFESPSIIRYREPLMGDVFTARFADCHFNETNSTLGGGI----------KKLENEIAWNVS---LLSHLDPRTKQCELEVQKIIHLQS

Query:  VANQMPDAFTDTKKVTKSYIP-AANAPSRIEIPTQQVDT----INESTLRQK----------------QETQVYEDNNEISINYTMTGKRWNRINVVVDN
          ++  +   D++K+ ++  P  +     I+      ++    +NES  R++                +E++  E   EI I+          IN   + 
Subjt:  VANQMPDAFTDTKKVTKSYIP-AANAPSRIEIPTQQVDT----INESTLRQK----------------QETQVYEDNNEISINYTMTGKRWNRINVVVDN

Query:  I-----FAYNVAHNII-------HENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLF
        +      +YN   N +       H   +  P S DE + R D   W+EAI  ELN+      +  +   P+    V  +WVF  K NE     RYKARL 
Subjt:  I-----FAYNVAHNII-------HENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLF

Query:  AQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQS
        A+GF+Q+  IDYEET++PV    + R+++SL +  NL +H MDV T +L G+L+ EIY+++P+G       + NS  +C  KL +++YGLKQ+
Subjt:  AQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-3427.49Show/hide
Query:  STGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL---------------------EPNISHLRIFGCAVYVPIFPPQRA
        S GI  E  V      NG+AE   + +    R +L  AKLP S WG A+  A  L                     E + SHL++FGC  +  +   QR 
Subjt:  STGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASL---------------------EPNISHLRIFGCAVYVPIFPPQRA

Query:  KMGPQRRLGIYVEFESPSI-IRYREPLMGDVFTARFADCHFNETN-STLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTD
        K+  +    I++ +       R  +P+   V  +R  D  F E+   T     +K++N I                    +   + + S +N    A + 
Subjt:  KMGPQRRLGIYVEFESPSI-IRYREPLMGDVFTARFADCHFNETN-STLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTD

Query:  TKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQETQVYEDNNEI--SINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKD
        T +V++        P  +    +Q + ++E     +  TQ  E +  +  S    +  +R+     V+               ++D EP+S+ E  +  +
Subjt:  TKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQETQVYEDNNEI--SINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKD

Query:  WPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLM
          +  +A+Q E+ SL K   +  +V  PKG +P+  KWVF  K++ + ++ RYKARL  +GF Q+ GID++E +SPVV   ++R ++SLA   +L++  +
Subjt:  WPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLM

Query:  DVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQS
        DV T +L+G LE EIY++ PEGF++          +C  KL +SLYGLKQ+
Subjt:  DVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQS

P92520 Uncharacterized mitochondrial protein AtMg008209.1e-1141.75Show/hide
Query:  EPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLI
        EPKSV      KD P W +A+Q EL++L++ + +  +V  P     +G KWVF  K + +  + R KARL A+GF Q  GI + ETYSPVV   T+R ++
Subjt:  EPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLI

Query:  SLA
        ++A
Subjt:  SLA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-2332.38Show/hide
Query:  QVYEDNNEISINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFV
        Q+  +NN+  +N    G R  +  ++  N   Y++A ++  E+   EP++  +    KD  +W+ A+ +E+N+      +  V   P  V  VG +W+F 
Subjt:  QVYEDNNEISINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFV

Query:  RKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKL
        +K N +  + RYKARL A+G++QRPG+DY ET+SPV+ + ++R ++ +AV  +  +  +DV   +L G+L +++Y+  P GF   +  N     +C  KL
Subjt:  RKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKL

Query:  QRSLYGLKQS
        +++LYGLKQ+
Subjt:  QRSLYGLKQS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.9e-2334.9Show/hide
Query:  KWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDV
        +W++A+ +E+N+      +  V   P  V  VG +W+F +K N +  + RYKARL A+G++QRPG+DY ET+SPV+ + ++R ++ +AV  +  +  +DV
Subjt:  KWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDV

Query:  VTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQS
           +L G+L +E+Y+  P GF          R     +L++++YGLKQ+
Subjt:  VTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.6e-2332.32Show/hide
Query:  EPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLI
        EP + +E    K++  W  A+  E+ ++     +  +   P   KP+G KWV+  K N +  + RYKARL A+G++Q+ GID+ ET+SPV    +++ ++
Subjt:  EPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLI

Query:  SLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQS
        +++   N  +H +D+   +L G L+ EIY+K+P G+   +  +     +C +K  +S+YGLKQ+
Subjt:  SLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.4e-1241.75Show/hide
Query:  EPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLI
        EPKSV      KD P W +A+Q EL++L++ + +  +V  P     +G KWVF  K + +  + R KARL A+GF Q  GI + ETYSPVV   T+R ++
Subjt:  EPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLFAQGFSQRPGIDYEETYSPVVDAITLRYLI

Query:  SLA
        ++A
Subjt:  SLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAACTGGGATAAATATTGAACATCCTGTAGCTCATGTTCATACACAAAATGGTTTGGCAGAATCATTTATCAAGCATTTACAATTGATTGCCAGACCATTACTTAT
GAGAGCAAAACTTCCATTATCTATATGGGGTCATGCTATTTTGCATGCAGCGTCGCTCGAACCAAATATTTCTCATTTGCGAATTTTTGGTTGTGCAGTATATGTTCCAA
TTTTCCCACCACAACGTGCTAAAATGGGACCTCAAAGAAGGTTAGGAATATATGTCGAATTTGAATCCCCATCAATTATTAGATATCGTGAACCCTTAATGGGAGATGTA
TTTACTGCACGATTCGCTGATTGTCATTTTAATGAGACAAATTCAACATTAGGGGGAGGAATTAAGAAGTTGGAAAATGAAATTGCCTGGAATGTATCGTTATTGTCTCA
TTTAGATCCTCGTACAAAGCAATGTGAACTAGAAGTTCAAAAGATAATTCATTTACAAAGTGTAGCAAACCAAATGCCAGATGCATTTACAGATACTAAGAAAGTGACCA
AGTCATATATTCCAGCTGCAAATGCTCCATCTAGAATTGAAATCCCAACACAGCAAGTTGATACAATTAATGAATCAACGTTGCGCCAAAAACAAGAGACTCAAGTATAT
GAAGATAACAATGAGATCTCGATAAATTATACCATGACAGGAAAAAGGTGGAATAGAATTAATGTAGTTGTGGACAACATTTTTGCGTATAATGTTGCACATAATATCAT
TCATGAAAATGAGGATTATGAACCTAAATCTGTTGACGAATGTCGTAATAGAAAGGATTGGCCCAAGTGGAAAGAAGCCATCCAAGCAGAACTAAACTCACTCACGAAAC
GTGAAGTTTTTGGACCTGTAGTTTATACACCTAAAGGTGTAAAACCTGTCGGATTTAAATGGGTATTTGTACGTAAGCGAAATGAAAATAATGAGGTCACTAGATATAAA
GCACGACTTTTTGCACAAGGATTCTCTCAAAGACCAGGCATTGATTATGAGGAAACATATTCTCCTGTGGTGGATGCTATTACATTAAGATATTTAATTAGTCTGGCTGT
ATGTGAAAATCTTGATATGCATCTTATGGATGTAGTTACAACATACTTATATGGATCTTTGGAAAATGAAATCTATATAAAAATCCCTGAAGGATTTAAGATACCTGAAT
CATATAATTCAAACTCTAGAGAATTATGTTCAATCAAATTACAAAGATCATTATATGGATTGAAACAATCAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAACTGGGATAAATATTGAACATCCTGTAGCTCATGTTCATACACAAAATGGTTTGGCAGAATCATTTATCAAGCATTTACAATTGATTGCCAGACCATTACTTAT
GAGAGCAAAACTTCCATTATCTATATGGGGTCATGCTATTTTGCATGCAGCGTCGCTCGAACCAAATATTTCTCATTTGCGAATTTTTGGTTGTGCAGTATATGTTCCAA
TTTTCCCACCACAACGTGCTAAAATGGGACCTCAAAGAAGGTTAGGAATATATGTCGAATTTGAATCCCCATCAATTATTAGATATCGTGAACCCTTAATGGGAGATGTA
TTTACTGCACGATTCGCTGATTGTCATTTTAATGAGACAAATTCAACATTAGGGGGAGGAATTAAGAAGTTGGAAAATGAAATTGCCTGGAATGTATCGTTATTGTCTCA
TTTAGATCCTCGTACAAAGCAATGTGAACTAGAAGTTCAAAAGATAATTCATTTACAAAGTGTAGCAAACCAAATGCCAGATGCATTTACAGATACTAAGAAAGTGACCA
AGTCATATATTCCAGCTGCAAATGCTCCATCTAGAATTGAAATCCCAACACAGCAAGTTGATACAATTAATGAATCAACGTTGCGCCAAAAACAAGAGACTCAAGTATAT
GAAGATAACAATGAGATCTCGATAAATTATACCATGACAGGAAAAAGGTGGAATAGAATTAATGTAGTTGTGGACAACATTTTTGCGTATAATGTTGCACATAATATCAT
TCATGAAAATGAGGATTATGAACCTAAATCTGTTGACGAATGTCGTAATAGAAAGGATTGGCCCAAGTGGAAAGAAGCCATCCAAGCAGAACTAAACTCACTCACGAAAC
GTGAAGTTTTTGGACCTGTAGTTTATACACCTAAAGGTGTAAAACCTGTCGGATTTAAATGGGTATTTGTACGTAAGCGAAATGAAAATAATGAGGTCACTAGATATAAA
GCACGACTTTTTGCACAAGGATTCTCTCAAAGACCAGGCATTGATTATGAGGAAACATATTCTCCTGTGGTGGATGCTATTACATTAAGATATTTAATTAGTCTGGCTGT
ATGTGAAAATCTTGATATGCATCTTATGGATGTAGTTACAACATACTTATATGGATCTTTGGAAAATGAAATCTATATAAAAATCCCTGAAGGATTTAAGATACCTGAAT
CATATAATTCAAACTCTAGAGAATTATGTTCAATCAAATTACAAAGATCATTATATGGATTGAAACAATCAGGATGA
Protein sequenceShow/hide protein sequence
MSTGINIEHPVAHVHTQNGLAESFIKHLQLIARPLLMRAKLPLSIWGHAILHAASLEPNISHLRIFGCAVYVPIFPPQRAKMGPQRRLGIYVEFESPSIIRYREPLMGDV
FTARFADCHFNETNSTLGGGIKKLENEIAWNVSLLSHLDPRTKQCELEVQKIIHLQSVANQMPDAFTDTKKVTKSYIPAANAPSRIEIPTQQVDTINESTLRQKQETQVY
EDNNEISINYTMTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWPKWKEAIQAELNSLTKREVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYK
ARLFAQGFSQRPGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTTYLYGSLENEIYIKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSG