CuGenDBv2

Pay0014952 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0014952
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Unknown protein
Genome location: chr12:16536502..16537443
RNA-Seq Expression: Pay0014952
Synteny: Pay0014952
Gene Ontology terms: NA
InterPro domains: NA
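
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the string can be split into its parts and the gene span computed. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("chr12:16536502..16537443")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)  # chr12 16536502 16537443 942
```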


Homology
GenBank top hits: e value, % identity, alignment
KAE8652887.1 hypothetical protein Csa_017671 [Cucumis sativus] (e value: 2.6e-87; % identity: 84.77)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
        MKKLYRK GTVHPSP +ISDHLSFLPT ILTL++ALSL DREVLAYLISSCSNDFT V NSS+HRGKA H KHAA M G DHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVT-----EGGEEEAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYE+KLAE+KVGKNNKKERKKRN+ G VSGPGEGKG+EAA K EEW+VT     EGGEE AEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVT-----EGGEEEAEKGPVRRIVSLLGEKIWGSWN

KAG7019402.1 hypothetical protein SDJN02_18363, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.9e-76; % identity: 75.5)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
        MKKLYR+ GTVHPSPP+ISDHLSFLPTAILTL++ALS +DRE+LAYLISS SNDFT V+N S+HRGKAAH K AA  +GSDHPP FSC CF+CYTSYWVR
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNS---SGTVSGPGEGKGAEAAAKVEEWKVTE-----GGEEEAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQ+IHEIIDAYE+ LAE+K GKNNKKERKKRN+   SG+VS PG+GKG+E A KVEE +VTE     GGE EAEKG VR IVS +GEKIWG WN
Subjt:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNS---SGTVSGPGEGKGAEAAAKVEEWKVTE-----GGEEEAEKGPVRRIVSLLGEKIWGSWN

XP_004150423.1 uncharacterized protein LOC101221021 [Cucumis sativus] (e value: 2.6e-87; % identity: 84.77)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
        MKKLYRK GTVHPSP +ISDHLSFLPT ILTL++ALSL DREVLAYLISSCSNDFT V NSS+HRGKA H KHAA M G DHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVT-----EGGEEEAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYE+KLAE+KVGKNNKKERKKRN+ G VSGPGEGKG+EAA K EEW+VT     EGGEE AEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVT-----EGGEEEAEKGPVRRIVSLLGEKIWGSWN

XP_008458978.1 PREDICTED: uncharacterized protein LOC103498228 [Cucumis melo] (e value: 2.3e-104; % identity: 100)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
        MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN

XP_038894832.1 uncharacterized protein LOC120083238 [Benincasa hispida] (e value: 2.6e-79; % identity: 80.3)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHK-HAAPMAGSDHPPAFSCYCFQCYTSYWV
        MKKLYRK GTVHPSPP+ISDHLSFLPTAILTL++ALSL+DREVLAYLISSCSNDFT V+N S HRGKAAH K  AA   GSDHPPAFSC CF+CYTSYWV
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHK-HAAPMAGSDHPPAFSCYCFQCYTSYWV

Query:  RWDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVT-----EGGEEEAEKGPVRRIVSLLGEKIWGSWN
        RWDSSPNRQLIHEIIDAYE+KLAE+K GKNNKKERKKRN SG VSGPGEGK +E AA+ EE +VT     EGGEEE EKG VRRIVS +GE+IWGSWN
Subjt:  RWDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVT-----EGGEEEAEKGPVRRIVSLLGEKIWGSWN
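
In this extract the Subjt rows repeat the Query rows, so the middle row of each alignment is the only place where differences from the hit are visible. The sketch below assumes the usual BLAST-style convention for that middle row: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `midline_stats` is hypothetical, and the example string is only the first ten columns of the first alignment, not a full hit.

```python
def midline_stats(midline):
    """Count identical and conservative columns in a BLAST-style midline."""
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    length = len(midline)
    return {
        "length": length,
        "identical": identical,
        "positives": identical + conservative,
        # Percent identity as identical columns over alignment columns.
        "percent_identity": 100 * identical / length,
    }


# First ten midline columns of the KAE8652887.1 alignment (query MKKLYRKTGT).
stats = midline_stats("MKKLYRK GT")
print(stats["identical"], stats["length"], stats["percent_identity"])  # 9 10 90.0
```

Applying the same function to a full hit requires concatenating the midline rows of every block with their original spacing intact, since spaces carry meaning.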

TrEMBL top hits: e value, % identity, alignment
A0A0A0LVV3 Uncharacterized protein (e value: 1.2e-87; % identity: 84.77)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
        MKKLYRK GTVHPSP +ISDHLSFLPT ILTL++ALSL DREVLAYLISSCSNDFT V NSS+HRGKA H KHAA M G DHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVT-----EGGEEEAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYE+KLAE+KVGKNNKKERKKRN+ G VSGPGEGKG+EAA K EEW+VT     EGGEE AEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVT-----EGGEEEAEKGPVRRIVSLLGEKIWGSWN

A0A1S3C9P0 uncharacterized protein LOC103498228 (e value: 1.1e-104; % identity: 100)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
        MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN

A0A5D3CKA3 Uncharacterized protein (e value: 1.1e-104; % identity: 100)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
        MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN

A0A6J1ENT0 uncharacterized protein LOC111434229 (e value: 6.4e-76; % identity: 75.5)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
        MKKLYR+ GTVHPSPP+ISDHLSFLPTAILTL++ALS +DRE+LAYLISS SNDFT V+N S HRGKAAH K AA  +GSDHPP FSC CF+CYTSYWVR
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNS---SGTVSGPGEGKGAEAAAKVEEWKVTE-----GGEEEAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQ+IHEIIDAYE+ LAE+K GKNNKKERKKRN+   SG+VS PG+GKG+E A KVEE +VTE     GGE EAEKG VR IVS +GEKIWG WN
Subjt:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNS---SGTVSGPGEGKGAEAAAKVEEWKVTE-----GGEEEAEKGPVRRIVSLLGEKIWGSWN

A0A6J1KLN6 uncharacterized protein LOC111495687 (e value: 5.1e-73; % identity: 72.91)
Query:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR
        M KLYR+ GTVHPSPP+ISDHLSFLPTAILTL++ALS +DRE+LAYLISS SNDFT V+N S HRGKAA  K AA  +GSDHPPAFSC CF+CYTSYWVR
Subjt:  MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNS---SGTVSGPGEGKGAEAAAKVEEWKVTE--------GGEEEAEKGPVRRIVSLLGEKIWG
        WDSSPNRQ+IHEIIDAYE+ LAE+K GKNNKKERKKRN+   SG+VS  G+GKG+E A KVEE +VTE        GGE EAEKG VR IV  +GEKIWG
Subjt:  WDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRNS---SGTVSGPGEGKGAEAAAKVEEWKVTE--------GGEEEAEKGPVRRIVSLLGEKIWG

Query:  SWN
         WN
Subjt:  SWN

SwissProt top hits: e value, % identity, alignment
No hits found
Arabidopsis top hits: e value, % identity, alignment
AT1G12020.1 unknown protein (e value: 3.2e-35; % identity: 45.45)
Query:  MKKLYRKTGTVHPSPPLI--SDH-LSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSY
        MKKLYRK GTVHPSPP I  +DH L+ LP AI +L++ LS +DREVLAYLIS+ S  ++   N ++   K   HK A      +H P F C CF CYTSY
Subjt:  MKKLYRKTGTVHPSPPLI--SDH-LSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEDKLAETKVGKNN---KKERKKR---NSSGTVSGPGEGKGAEAAAKVEEWKVT-----------------EGGEE-----
        WVRWDSSP+RQLIHEIIDA+ED L + K  K N   KK+R+KR   +SS   S       +E  +++ E  V                   GG E     
Subjt:  WVRWDSSPNRQLIHEIIDAYEDKLAETKVGKNN---KKERKKR---NSSGTVSGPGEGKGAEAAAKVEEWKVT-----------------EGGEE-----

Query:  ---------EAEKGPVRRIVSLLGEKIWGSW
                 E EKG VRR VS +GEK++G W
Subjt:  ---------EAEKGPVRRIVSLLGEKIWGSW

AT1G24270.1 unknown protein (e value: 3.2e-19; % identity: 43.45)
Query:  KTGTVHPSPPLIS-------DHLS---FLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTS
        K G VHPSPPL S       D LS    L +AIL L S LS +D EVLAYLI+   N    V        KA               P   C CF CYTS
Subjt:  KTGTVHPSPPLIS-------DHLS---FLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTS

Query:  YWVRWDSSPNRQLIHEIIDAYEDKLAETKV-----GKNNKKERKK
        YW +WDSS NR+LI++II+A+ED L   ++      K NKK  KK
Subjt:  YWVRWDSSPNRQLIHEIIDAYEDKLAETKV-----GKNNKKERKK

AT1G62422.1 unknown protein (e value: 3.0e-33; % identity: 47.32)
Query:  MKKLYRKTGTVHPSPP----LISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTS
        MKKL RK GTVHPSPP         LS LP AIL+L +ALS++DREVLAYLIS+ S D   +  S   + K  +H          H P F C CF CYTS
Subjt:  MKKLYRKTGTVHPSPP----LISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTS

Query:  YWVRWDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRN--SSGTVSGPGEGKGAEAAAKVEEWKVTE-------GGEE-EAEKGPVRRIVSLLGEK
        YWVRWD+SP RQLIHEIIDAYED L      K  KK+R+KR+  +SG V+  G  + +E  +   E+   +       GGEE E EKG V +++S +G++
Subjt:  YWVRWDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKERKKRN--SSGTVSGPGEGKGAEAAAKVEEWKVTE-------GGEE-EAEKGPVRRIVSLLGEK

Query:  IWGSW
          G W
Subjt:  IWGSW

AT5G13090.1 unknown protein (e value: 3.8e-20; % identity: 36.17)
Query:  RKTGTVHPSPP---------LISDHLS-----------FLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPA
        +K G V+PSPP           S+HL+            LP  IL L S LS ++REVLAYLI+  +      ++SS ++ K   +K +     +  PP 
Subjt:  RKTGTVHPSPP---------LISDHLS-----------FLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPA

Query:  FSCYCFQCYTSYWVRWDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKER-KKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAE
        F C CF CYT+YW RWDSSPNR+LIHEII+A+E+   E      +K +R KK+   G      + K A         +VT+ G+++++
Subjt:  FSCYCFQCYTSYWVRWDSSPNRQLIHEIIDAYEDKLAETKVGKNNKKER-KKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAE


Sequences
CDS sequence
ATGAAGAAGCTCTACCGCAAAACAGGAACCGTCCACCCTTCTCCCCCTCTCATCTCCGATCATCTCTCCTTTCTCCCCACTGCCATCCTCACCCTCTCTTCCGCCCTCTC
TCTCCAGGACCGGGAGGTTTTAGCCTACCTTATCTCCTCCTGCTCCAACGACTTCACCCCCGTCCACAATTCCTCCACCCACCGCGGCAAGGCAGCTCACCACAAACACG
CCGCCCCCATGGCCGGTTCGGATCACCCCCCGGCCTTCTCCTGCTACTGTTTCCAATGCTACACCAGCTACTGGGTCAGATGGGATTCCTCACCCAATCGGCAACTGATT
CACGAAATCATCGACGCTTATGAAGACAAATTGGCCGAGACCAAAGTTGGGAAGAACAATAAGAAAGAGAGAAAGAAGCGAAATAGTAGCGGGACGGTTTCCGGTCCGGG
TGAGGGGAAAGGGGCTGAAGCGGCGGCGAAGGTAGAAGAGTGGAAGGTGACGGAAGGCGGCGAGGAGGAGGCGGAGAAAGGGCCGGTGAGAAGGATTGTGAGTTTGCTAG
GGGAAAAAATTTGGGGAAGTTGGAATTAA
mRNA sequence
CAACCCTTTCCTCCCTCTCTCGCTCTCTCTTTTCTTTTTCTTTTTCTTTTTCGTCCCCCATGAAGAAGCTCTACCGCAAAACAGGAACCGTCCACCCTTCTCCCCCTCTC
ATCTCCGATCATCTCTCCTTTCTCCCCACTGCCATCCTCACCCTCTCTTCCGCCCTCTCTCTCCAGGACCGGGAGGTTTTAGCCTACCTTATCTCCTCCTGCTCCAACGA
CTTCACCCCCGTCCACAATTCCTCCACCCACCGCGGCAAGGCAGCTCACCACAAACACGCCGCCCCCATGGCCGGTTCGGATCACCCCCCGGCCTTCTCCTGCTACTGTT
TCCAATGCTACACCAGCTACTGGGTCAGATGGGATTCCTCACCCAATCGGCAACTGATTCACGAAATCATCGACGCTTATGAAGACAAATTGGCCGAGACCAAAGTTGGG
AAGAACAATAAGAAAGAGAGAAAGAAGCGAAATAGTAGCGGGACGGTTTCCGGTCCGGGTGAGGGGAAAGGGGCTGAAGCGGCGGCGAAGGTAGAAGAGTGGAAGGTGAC
GGAAGGCGGCGAGGAGGAGGCGGAGAAAGGGCCGGTGAGAAGGATTGTGAGTTTGCTAGGGGAAAAAATTTGGGGAAGTTGGAATTAAGTGATTGGATTTTGCACTAATG
GATGCATTTGGAGGTTTTGGGTTTTTATATATGTTTGTGAATTAATTAGTTTGCAGTAATTAGGAAGGAATTGAAGAAGAAGAAGAAGAAGAAAAGGGTGTTCTTATAAA
CATAATTACAATACCATCTTCTTCTTTTTCTTTGTTTTGATTCAGGTTATGGTTGTTAGTAAGTTGTATATAGAATCTTATGTTTTTCATGTACAAATTTATCAATATAT
ATATATATATATAAAGAAATTAGAATTTTTATTTAGTATATATATGTGTGTTTGTTTTCAAT
Protein sequence
MKKLYRKTGTVHPSPPLISDHLSFLPTAILTLSSALSLQDREVLAYLISSCSNDFTPVHNSSTHRGKAAHHKHAAPMAGSDHPPAFSCYCFQCYTSYWVRWDSSPNRQLI
HEIIDAYEDKLAETKVGKNNKKERKKRNSSGTVSGPGEGKGAEAAAKVEEWKVTEGGEEEAEKGPVRRIVSLLGEKIWGSWN
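
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not name explicitly. The function name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code (NCBI translation table 1),
# ordered by codon with each position cycling through T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGAAGAAGCTCTACCGCAAAACAGGAACC"  # first 30 nt of the CDS
cds_end = "AGTTGGAATTAA"  # last 12 nt of the CDS, including the stop codon
print(translate(cds_start))  # MKKLYRKTGT
print(translate(cds_end))    # SWN
```

Both fragments agree with the start (MKKLYRKTGT) and end (SWN) of the listed protein sequence. Passing the full CDS to `translate` would allow a comparison against the complete protein.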