; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0012325 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0012325
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotransposon protein
Genome locationchr01:10374461..10376646
RNA-Seq ExpressionPI0012325
SyntenyPI0012325
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa]1.5e-9879.65Show/hide
Query:  TKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFM
        TKH WTTIE+EALVECLLQLVE+G WR DNGTFK GYLVQVQKLMKEKI  SNIQVTPNLES VKILKKQYT I EMMGPVCSGF WN+ERKCI+AEK +
Subjt:  TKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFM

Query:  FDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSKK
         +DWVKGH NA+ LLNKPFPYFYDLEIVFGRDRATGG+CKT  EMGSQTA+D E+ DM INLEDFDIPNPHGLEPPSGEDM STPTSMAHDA SFRPSKK
Subjt:  FDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSKK

Query:  MWSYSGDLMNTFRKNGNRIIPTQTII
          SYS DLM+TFR     ++P  T++
Subjt:  MWSYSGDLMNTFRKNGNRIIPTQTII

TYJ96933.1 retrotransposon protein [Cucumis melo var. makuwa]1.9e-9681.31Show/hide
Query:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF
        +TKHRWTTI+++ALVECLLQLVEEGGWRA+N TFK  YLVQVQKLMKEKIP SNIQVT NLESRVK LKKQYTAI +MMGP CS FGWNEERKCI+AEK 
Subjt:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF

Query:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK
        +FDDWVKGH NA+GLLNKPF YFYDLEIVFGRD+ATGGRCK   EM SQTA+D E+ DMDINLEDFDIPNPHGLEPPSGEDM ST  SM HDA S RPSK
Subjt:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK

Query:  KMWSYSGDLMNTFR
        K  SY GDLM+TFR
Subjt:  KMWSYSGDLMNTFR

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa]1.3e-9779.2Show/hide
Query:  TKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFM
        TKH WTTIE+EALVECLLQLVE+G WR DNGTFK GYLVQVQKLMKEKI  SNIQVTPNLES VKILKKQYT I EMMGPVCSGF WN+ERKCI+AEK +
Subjt:  TKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFM

Query:  FDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSKK
         +DWVKGH NA+ LLNKPFPYFYDLEIVFGRDRATGG+CKT  EMGSQTA+D E+ DM INLEDFDIPNPHGLEPPSGEDM STPTSMAHDA S RPSKK
Subjt:  FDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSKK

Query:  MWSYSGDLMNTFRKNGNRIIPTQTII
          SYS DLM+TFR     ++P  T++
Subjt:  MWSYSGDLMNTFRKNGNRIIPTQTII

XP_008455678.1 PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo]1.9e-9681.31Show/hide
Query:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF
        +TKHRWTTI+++ALVECLLQLVEEGGWRA+N TFK  YLVQVQKLMKEKIP SNIQVT NLESRVK LKKQYTAI +MMGP CS FGWNEERKCI+AEK 
Subjt:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF

Query:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK
        +FDDWVKGH NA+GLLNKPF YFYDLEIVFGRD+ATGGRCK   EM SQTA+D E+ DMDINLEDFDIPNPHGLEPPSGEDM ST  SM HDA S RPSK
Subjt:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK

Query:  KMWSYSGDLMNTFR
        K  SY GDLM+TFR
Subjt:  KMWSYSGDLMNTFR

XP_031741735.1 uncharacterized protein At2g29880-like [Cucumis sativus]3.3e-8581.87Show/hide
Query:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF
        +TKHRWTTI             EEGGWRA NGTFK GYLVQVQKLMKEKIPGSNIQVTPNLE RVKILKKQYTAIVEMMGP CS FGWNE+RKCI+AEKF
Subjt:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF

Query:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDA
        +FDD VKGH NA+GLLNKPFPYFYDLEIVFGRDRATGGRCKT  EM S   +DIE+ DMDINLEDFDIPNPHGLEPPSGEDMSST TSMAHDA
Subjt:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDA

TrEMBL top hitse value%identityAlignment
A0A1S3C252 uncharacterized protein At2g29880-like9.1e-9781.31Show/hide
Query:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF
        +TKHRWTTI+++ALVECLLQLVEEGGWRA+N TFK  YLVQVQKLMKEKIP SNIQVT NLESRVK LKKQYTAI +MMGP CS FGWNEERKCI+AEK 
Subjt:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF

Query:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK
        +FDDWVKGH NA+GLLNKPF YFYDLEIVFGRD+ATGGRCK   EM SQTA+D E+ DMDINLEDFDIPNPHGLEPPSGEDM ST  SM HDA S RPSK
Subjt:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK

Query:  KMWSYSGDLMNTFR
        K  SY GDLM+TFR
Subjt:  KMWSYSGDLMNTFR

A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein7.4e-9979.65Show/hide
Query:  TKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFM
        TKH WTTIE+EALVECLLQLVE+G WR DNGTFK GYLVQVQKLMKEKI  SNIQVTPNLES VKILKKQYT I EMMGPVCSGF WN+ERKCI+AEK +
Subjt:  TKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFM

Query:  FDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSKK
         +DWVKGH NA+ LLNKPFPYFYDLEIVFGRDRATGG+CKT  EMGSQTA+D E+ DM INLEDFDIPNPHGLEPPSGEDM STPTSMAHDA SFRPSKK
Subjt:  FDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSKK

Query:  MWSYSGDLMNTFRKNGNRIIPTQTII
          SYS DLM+TFR     ++P  T++
Subjt:  MWSYSGDLMNTFRKNGNRIIPTQTII

A0A5D3BC95 Retrotransposon protein9.1e-9781.31Show/hide
Query:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF
        +TKHRWTTI+++ALVECLLQLVEEGGWRA+N TFK  YLVQVQKLMKEKIP SNIQVT NLESRVK LKKQYTAI +MMGP CS FGWNEERKCI+AEK 
Subjt:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF

Query:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK
        +FDDWVKGH NA+GLLNKPF YFYDLEIVFGRD+ATGGRCK   EM SQTA+D E+ DMDINLEDFDIPNPHGLEPPSGEDM ST  SM HDA S RPSK
Subjt:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK

Query:  KMWSYSGDLMNTFR
        K  SY GDLM+TFR
Subjt:  KMWSYSGDLMNTFR

A0A5D3C7T4 Uncharacterized protein1.6e-8573.36Show/hide
Query:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF
        +TKHRWTTIE+E LVECLLQLVEEGGWRADNGTFKLGYL                              KQYTAI EMMGP CSGFGWNE +KCI+ EK 
Subjt:  STKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKF

Query:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK
        +FDDWVKGH NAQGLLNKPFPYFYDLE+VFGRDRATGGRCKT  EM SQTA+D E+ DMDINLEDFDIPNPHGLEPPSGEDM STPTSM HDA S RPSK
Subjt:  MFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSK

Query:  KMWSYSGDLMNTFR
        K  SYSGDLM+TFR
Subjt:  KMWSYSGDLMNTFR

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein6.3e-9879.2Show/hide
Query:  TKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFM
        TKH WTTIE+EALVECLLQLVE+G WR DNGTFK GYLVQVQKLMKEKI  SNIQVTPNLES VKILKKQYT I EMMGPVCSGF WN+ERKCI+AEK +
Subjt:  TKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFM

Query:  FDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSKK
         +DWVKGH NA+ LLNKPFPYFYDLEIVFGRDRATGG+CKT  EMGSQTA+D E+ DM INLEDFDIPNPHGLEPPSGEDM STPTSMAHDA S RPSKK
Subjt:  FDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDARSFRPSKK

Query:  MWSYSGDLMNTFRKNGNRIIPTQTII
          SYS DLM+TFR     ++P  T++
Subjt:  MWSYSGDLMNTFRKNGNRIIPTQTII

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02210.1 unknown protein3.2e-0926.99Show/hide
Query:  GTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFMFDDWVKGHSNAQGLLNKPFPYFYDLEIVFG
        G F+     ++  L   K   SN  V   L++R K L++Q+ AI  ++     GF W+ ER+ + A+  ++ D++K H +A+  + +P PY+ DL ++ G
Subjt:  GTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFMFDDWVKGHSNAQGLLNKPFPYFYDLEIVFG

Query:  RDRATGGRCKTT-----AEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGED-MSSTPTS
                C         E   Q  K     D+ I+ E+ D  N    +P +  D +++T TS
Subjt:  RDRATGGRCKTT-----AEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGED-MSSTPTS

AT4G02210.2 unknown protein3.2e-0926.99Show/hide
Query:  GTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFMFDDWVKGHSNAQGLLNKPFPYFYDLEIVFG
        G F+     ++  L   K   SN  V   L++R K L++Q+ AI  ++     GF W+ ER+ + A+  ++ D++K H +A+  + +P PY+ DL ++ G
Subjt:  GTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFMFDDWVKGHSNAQGLLNKPFPYFYDLEIVFG

Query:  RDRATGGRCKTT-----AEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGED-MSSTPTS
                C         E   Q  K     D+ I+ E+ D  N    +P +  D +++T TS
Subjt:  RDRATGGRCKTT-----AEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGED-MSSTPTS

AT5G27260.1 unknown protein8.9e-1229.14Show/hide
Query:  WTTIENEALVECLLQLVEEGGWRADNGTF-KLGYLVQ-VQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFMFD
        W+  E + LV+ L++ +    WR  NGT  KL    + + ++ KE     N     +  SR+K LK QY + +++     SGFGW+   K   A   ++ 
Subjt:  WTTIENEALVECLLQLVEEGGWRADNGTF-KLGYLVQ-VQKLMKEKIPGSNIQVTPNLESRVKILKKQYTAIVEMMGPVCSGFGWNEERKCIKAEKFMFD

Query:  DWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATG----GRCKTTAEMGSQTAKDIEKYDMD--INLEDFDIPNPH
        D++K H N + L    F +F +L+I+FG   ATG    G C +T  +  +  ++  K  +D   N+ ++D    H
Subjt:  DWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATG----GRCKTTAEMGSQTAKDIEKYDMD--INLEDFDIPNPH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAAAACAGGAATTTGGCTCTTTTACACTTTACGAAACAATTTACCTTATTCAATCATTAATAACTGTTGTTTTCTTCTTTTCCATTGCATTATAGCTTCATTATT
CTCCACTAAGCATCGCTGGACAACCATTGAGAATGAGGCATTAGTTGAGTGCTTACTACAACTAGTAGAGGAAGGTGGCTGGAGGGCTGATAATGGGACATTCAAACTGG
GATATTTGGTACAAGTACAAAAACTAATGAAAGAAAAAATTCCTGGAAGCAACATACAAGTAACTCCAAATCTAGAGTCGAGAGTGAAAATTTTGAAGAAGCAATACACT
GCCATAGTAGAGATGATGGGCCCAGTGTGTAGTGGGTTTGGCTGGAATGAGGAACGAAAGTGCATTAAGGCAGAGAAATTCATGTTCGATGACTGGGTTAAGGGACACTC
CAATGCTCAAGGCCTATTGAACAAACCATTTCCTTACTTCTATGACTTGGAAATTGTGTTTGGTAGAGATAGGGCTACTGGTGGTAGATGTAAGACAACCGCTGAAATGG
GCTCACAGACTGCAAAAGATATTGAGAAATATGACATGGACATTAATCTCGAAGACTTTGATATTCCAAATCCACATGGACTCGAGCCACCATCGGGGGAAGACATGTCA
TCCACTCCAACAAGTATGGCACATGATGCAAGATCATTTAGGCCAAGTAAGAAAATGTGGTCATATTCAGGGGACCTCATGAACACATTTCGCAAAAATGGAAATAGAAT
TATCCCTACACAAACAATTATATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTAAAACAGGAATTTGGCTCTTTTACACTTTACGAAACAATTTACCTTATTCAATCATTAATAACTGTTGTTTTCTTCTTTTCCATTGCATTATAGCTTCATTATT
CTCCACTAAGCATCGCTGGACAACCATTGAGAATGAGGCATTAGTTGAGTGCTTACTACAACTAGTAGAGGAAGGTGGCTGGAGGGCTGATAATGGGACATTCAAACTGG
GATATTTGGTACAAGTACAAAAACTAATGAAAGAAAAAATTCCTGGAAGCAACATACAAGTAACTCCAAATCTAGAGTCGAGAGTGAAAATTTTGAAGAAGCAATACACT
GCCATAGTAGAGATGATGGGCCCAGTGTGTAGTGGGTTTGGCTGGAATGAGGAACGAAAGTGCATTAAGGCAGAGAAATTCATGTTCGATGACTGGGTTAAGGGACACTC
CAATGCTCAAGGCCTATTGAACAAACCATTTCCTTACTTCTATGACTTGGAAATTGTGTTTGGTAGAGATAGGGCTACTGGTGGTAGATGTAAGACAACCGCTGAAATGG
GCTCACAGACTGCAAAAGATATTGAGAAATATGACATGGACATTAATCTCGAAGACTTTGATATTCCAAATCCACATGGACTCGAGCCACCATCGGGGGAAGACATGTCA
TCCACTCCAACAAGTATGGCACATGATGCAAGATCATTTAGGCCAAGTAAGAAAATGTGGTCATATTCAGGGGACCTCATGAACACATTTCGCAAAAATGGAAATAGAAT
TATCCCTACACAAACAATTATATGTTGA
Protein sequenceShow/hide protein sequence
MRKTGIWLFYTLRNNLPYSIINNCCFLLFHCIIASLFSTKHRWTTIENEALVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKIPGSNIQVTPNLESRVKILKKQYT
AIVEMMGPVCSGFGWNEERKCIKAEKFMFDDWVKGHSNAQGLLNKPFPYFYDLEIVFGRDRATGGRCKTTAEMGSQTAKDIEKYDMDINLEDFDIPNPHGLEPPSGEDMS
STPTSMAHDARSFRPSKKMWSYSGDLMNTFRKNGNRIIPTQTIIC