CuGenDBv2

Cucsat.G19894 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G19894
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ctg4:2840161..2843728
RNA-Seq Expression: Cucsat.G19894
Synteny: Cucsat.G19894
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0032986.1 putative copia-type polyprotein [Cucumis melo var. makuwa] (e-value 4.48e-98, identity 87.06%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGE E YKARLV KGYSQRKG DYDEVF PVARL+TIRLLIALAAQNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+LKKAL+GLKQAPRMWN+ INKYFLDN YLRCPYEHS YIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH

KAA0038955.1 lectin receptor kinase [Cucumis melo var. makuwa] (e-value 1.02e-99, identity 87.06%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGEVE YKARLV KGYSQRK  DYDEVF PVARL+TIRLLIALAAQNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+L+K L+GLKQAPRMWN+IINKYFLDN YLRCPYEHSLYIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH

TYK11189.1 reverse transcriptase [Cucumis melo var. makuwa] (e-value 6.16e-96, identity 85.29%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGEVE YKARLV KGYSQRK  DYDEVF  VARL+TIRLLIALA QNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+L+K L+GLKQAPRMWN+ INKYFLDN YLRCPYEHSLYIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH

TYK18672.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e-value 1.27e-96, identity 88.24%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGEVE YKARLV KGYSQRKG DYDEVF PVARL+TIRLLIALAAQNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+LKKAL+GLKQAPRMWN+ INKYFLDN YLRCPYEHSLYIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH

TYK21979.1 putative copia-type polyprotein [Cucumis melo var. makuwa] (e-value 1.39e-99, identity 88.24%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGEVE YKARLV KGYSQRKG DYDEVF PVARL+TIRLLIALAAQNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+LKKAL+GLKQAPRMWN+ INKYFLDN YLRCPYEHSLYIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
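
The % identity values listed for these hits can be checked against the alignment midlines. The following is a minimal sketch, assuming that % identity means identical columns divided by alignment length and that a letter in the midline row marks an identical residue (`+` and blanks mark non-identical columns). The function name `percent_identity` is hypothetical and not part of the database.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity implied by one BLAST-style alignment block.

    Assumption: a letter in the midline marks an identical column;
    '+' (conservative substitution) and ' ' (mismatch) do not count.
    """
    if not query:
        raise ValueError("empty query row")
    # Trailing blanks of the midline are often lost in copied text,
    # so pad it back to the width of the query row.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# First 20 alignment columns of the KAA0032986.1 hit.
query_excerpt = "MDEEIKAMKKNDRWKLSTLP"
midline_excerpt = "MDEEIKA+KKND W+LSTLP"
print(percent_identity(query_excerpt, midline_excerpt))  # 85.0
```

Counting the same way over both blocks of the KAA0032986.1 alignment (170 columns) gives 148 identical columns, or 87.06%, which agrees with the value listed for that hit.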

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7SSL5 Putative copia-type polyprotein (e-value 2.17e-98, identity 87.06%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGE E YKARLV KGYSQRKG DYDEVF PVARL+TIRLLIALAAQNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+LKKAL+GLKQAPRMWN+ INKYFLDN YLRCPYEHS YIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH

A0A5A7TC35 Lectin receptor kinase (e-value 4.93e-100, identity 87.06%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGEVE YKARLV KGYSQRK  DYDEVF PVARL+TIRLLIALAAQNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+L+K L+GLKQAPRMWN+IINKYFLDN YLRCPYEHSLYIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH

A0A5D3CLU6 Reverse transcriptase (e-value 2.98e-96, identity 85.29%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGEVE YKARLV KGYSQRK  DYDEVF  VARL+TIRLLIALA QNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+L+K L+GLKQAPRMWN+ INKYFLDN YLRCPYEHSLYIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH

A0A5D3D557 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value 6.14e-97, identity 88.24%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGEVE YKARLV KGYSQRKG DYDEVF PVARL+TIRLLIALAAQNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+LKKAL+GLKQAPRMWN+ INKYFLDN YLRCPYEHSLYIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH

A0A5D3DF53 Putative copia-type polyprotein (e-value 6.74e-100, identity 88.24%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MDEEIKA+KKND W+LSTLPNGKKAVGVKWVFKIKRNEKGEVE YKARLV KGYSQRKG DYDEVF PVARL+TIRLLIALAAQNNWKI QMDVKSAF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
        GYLEE+VYLEQP GY VKGQEDKVL+LKKAL+GLKQAPRMWN+ INKYFLDN YLRCPYEHSLYIK N H
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value 3.1e-31, identity 39.39%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        ++ E+ A K N+ W ++  P  K  V  +WVF +K NE G    YKARLV +G++Q+   DY+E F PVAR+ + R +++L  Q N K+ QMDVK+AF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYI
        G L+E++Y+  P G  +    D V +L KA++GLKQA R W  +  +   + E++    +  +YI
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value 2.9e-37, identity 43.37%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        M EE+++++KN  +KL  LP GK+ +  KWVFK+K++   ++  YKARLV KG+ Q+KG D+DE+F+PV ++ +IR +++LAA  + ++ Q+DVK+AF +
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIK
        G LEE++Y+EQP G+ V G++  V +L K+L+GLKQAPR W    + +     YL+   +  +Y K
Subjt:  GYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIK

P92520 Uncharacterized mitochondrial protein AtMg00820 (e-value 2.1e-11, identity 40.48%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQ
        M EE+ A+ +N  W L   P  +  +G KWVFK K +  G ++  KARLV KG+ Q +G  + E ++PV R  TIR ++ +A Q
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value 4.5e-30, identity 40.96%)
Query:  MDEEIKAMKKNDRWKL-STLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFF
        M  EI A   N  W L    P+    VG +W+F  K N  G +  YKARLV KGY+QR G DY E F+PV +  +IR+++ +A   +W I Q+DV +AF 
Subjt:  MDEEIKAMKKNDRWKL-STLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFF

Query:  NGYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYI
         G L + VY+ QP G+  K + + V +L+KAL+GLKQAPR W   +  Y L   ++    + SL++
Subjt:  NGYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value 2.0e-30, identity 41.57%)
Query:  MDEEIKAMKKNDRWKL-STLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFF
        M  EI A   N  W L    P     VG +W+F  K N  G +  YKARLV KGY+QR G DY E F+PV +  +IR+++ +A   +W I Q+DV +AF 
Subjt:  MDEEIKAMKKNDRWKL-STLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFF

Query:  NGYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYI
         G L ++VY+ QP G+  K + D V RL+KA++GLKQAPR W   +  Y L   ++    + SL++
Subjt:  NGYLEEKVYLEQPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYI

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value 6.2e-35, identity 40%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN
        MD+EI AM+    W++ TLP  KK +G KWV+KIK N  G +E YKARLV KGY+Q++G D+ E F+PV +L +++L++A++A  N+ + Q+D+ +AF N
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFN

Query:  GYLEEKVYLEQPLGYFVKGQE----DKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIK
        G L+E++Y++ P GY  +  +    + V  LKK+++GLKQA R W    +   +   +++   +H+ ++K
Subjt:  GYLEEKVYLEQPLGYFVKGQE----DKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e-value 1.5e-12, identity 40.48%)
Query:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQ
        M EE+ A+ +N  W L   P  +  +G KWVFK K +  G ++  KARLV KG+ Q +G  + E ++PV R  TIR ++ +A Q
Subjt:  MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQ


Sequences
CDS sequence
ATGGATGAAGAGATAAAAGCCATGAAAAAGAATGATAGGTGGAAACTTTCTACTCTTCCAAATGGAAAGAAAGCAGTAGGTGTCAAATGGGTGTTCAAGATAAAAAGAAA
TGAAAAAGGAGAAGTGGAGACATACAAAGCAAGATTAGTTCCAAAAGGATATTCTCAAAGAAAAGGCTTTGATTACGATGAAGTGTTTACTCCAGTTGCTCGTTTGAAAA
CCATAAGATTGTTAATTGCGCTTGCTGCTCAAAATAATTGGAAGATCTCTCAGATGGATGTCAAATCAGCATTTTTTAATGGATATCTAGAAGAAAAAGTCTACTTAGAA
CAACCTCTTGGTTATTTTGTGAAAGGTCAAGAGGATAAAGTTCTAAGATTGAAGAAGGCATTACACGGATTGAAACAAGCACCAAGAATGTGGAATAACATAATCAACAA
ATATTTCCTTGATAATGAGTATTTGAGGTGCCCTTATGAACATTCCCTTTATATTAAGACTAATGATCATTGA
mRNA sequence
ATGGATGAAGAGATAAAAGCCATGAAAAAGAATGATAGGTGGAAACTTTCTACTCTTCCAAATGGAAAGAAAGCAGTAGGTGTCAAATGGGTGTTCAAGATAAAAAGAAA
TGAAAAAGGAGAAGTGGAGACATACAAAGCAAGATTAGTTCCAAAAGGATATTCTCAAAGAAAAGGCTTTGATTACGATGAAGTGTTTACTCCAGTTGCTCGTTTGAAAA
CCATAAGATTGTTAATTGCGCTTGCTGCTCAAAATAATTGGAAGATCTCTCAGATGGATGTCAAATCAGCATTTTTTAATGGATATCTAGAAGAAAAAGTCTACTTAGAA
CAACCTCTTGGTTATTTTGTGAAAGGTCAAGAGGATAAAGTTCTAAGATTGAAGAAGGCATTACACGGATTGAAACAAGCACCAAGAATGTGGAATAACATAATCAACAA
ATATTTCCTTGATAATGAGTATTTGAGGTGCCCTTATGAACATTCCCTTTATATTAAGACTAATGATCATTGA
Protein sequence
MDEEIKAMKKNDRWKLSTLPNGKKAVGVKWVFKIKRNEKGEVETYKARLVPKGYSQRKGFDYDEVFTPVARLKTIRLLIALAAQNNWKISQMDVKSAFFNGYLEEKVYLE
QPLGYFVKGQEDKVLRLKKALHGLKQAPRMWNNIINKYFLDNEYLRCPYEHSLYIKTNDH
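
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as 'N'
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 45 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGATGAAGAGATAAAAGCCATGAAAAAGAATGATAGGTGGAAA"))  # MDEEIKAMKKNDRWK
print(translate("ACTAATGATCATTGA"))  # TNDH
```

Both excerpts match the start (MDEEIKAMKKNDRWK) and the end (TNDH) of the protein sequence; passing the full CDS to `translate` allows the complete protein to be compared in the same way.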