CuGenDBv2

Pay0021179 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0021179
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Transposase
Genome location: chr09:11470779..11471664
RNA-Seq Expression: Pay0021179
Synteny: Pay0021179
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
                     GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains: IPR029480 - Transposase-associated domain
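
The genome location field appears to follow a `chromosome:start..end` layout. The Python sketch below splits such a string and computes the length of the interval. The function names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that this page does not state.

```python
import re

# Assumed layout of the "Genome location" field: <chromosome>:<start>..<end>
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr09:11470779..11471664")
print(chrom, start, end, span_length(start, end))
# prints: chr09 11470779 11471664 886
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention should be confirmed before relying on the number.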


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046884.1 transposase [Cucumis melo var. makuwa]; e value 9.6e-141; identity 85.08%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDIG
        MDRSWMHKSRL KDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR            +ESYKIWFWHGE QLPESSLYEESSKFDTHMYE ND+G
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDIG

Query:  SINEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIH
         INEMIEVAHEEYSKDPNEFEK LNDAEK LYEGCKKFTKLSTLVKLYNLKVRY WS+I+FSELLKTLKEILPT NEI TS+YEAKKTLGALGMSYEKIH
Subjt:  SINEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIH

Query:  ACPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        ACPNDCCLYRKEH NATECPECGESRWK              KVVWYFPPIPRFKRLFRSI+NAKNLIWHSNERVI GKLRH ADSPAWKLIDLK
Subjt:  ACPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

TYK03264.1 transposase [Cucumis melo var. makuwa]; e value 9.6e-141; identity 85.08%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDIG
        MDRSWMHKSRL KDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR            +ESYKIWFWHGE QLPESSLYEESSKFDTHMYE ND+G
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDIG

Query:  SINEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIH
         INEMIEVAHEEYSKDPNEFEK LNDAEK LYEGCKKFTKLSTLVKLYNLKVRY WS+I+FSELLKTLKEILPT NEI TS+YEAKKTLGALGMSYEKIH
Subjt:  SINEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIH

Query:  ACPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        ACPNDCCLYRKEH NATECPECGESRWK              KVVWYFPPIPRFKRLFRSI+NAKNLIWHSNERVI GKLRH ADSPAWKLIDLK
Subjt:  ACPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

TYK13644.1 transposase [Cucumis melo var. makuwa]; e value 4.8e-140; identity 84.69%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS
        MD SWMHKSRLSKDY LGVENFISFGFSNTKDASIRCPCLK GNCEKQSR            +ESYKIWFWHGEQLPESSLYEESSKFDTHMYEEND+GS
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS

Query:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA
        INEMIEVAHEEYSKDPNEFEK LNDA+KPLYEGCK FTKLSTLVKLYNLKVRY W +I+FSELLKTLKEI PTSNEI TSMYEAKKTLGALGMSYEKIHA
Subjt:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA

Query:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        CPNDCCLYRKEH NATECPECGESRWK              KVVWYFPPI RFKRLFRSIDNAKNLIW SNERVI+GKLRH ADSPAWKLIDLK
Subjt:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

XP_031742172.1 uncharacterized protein LOC116404095 [Cucumis sativus]; e value 2.2e-121; identity 71.77%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS
        MD+SWMHKSRLSKDYELGVENFI FGFSNT  + IRCPCLKCGNCEK +R            +ESYKIWFWHGE+LP SS Y+ESSKFD H  E++D+GS
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS

Query:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA
        + EMIEVAHEEYSKDP  FEK L DAEKPLYEGCKK+TKLSTLVKLYNLKVRY WS+ +FSELL+TLKEILPT+NE+  S+YEAKKTLGALGM YEKIHA
Subjt:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA

Query:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        CPN+CCLYRKE  NA ECPECG+SRWK              KV+WYFP IPRFKRLFRSI+  +NL WHS ER+ +GKLRH A+SPAWKL+D+K
Subjt:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

XP_031742381.1 uncharacterized protein LOC116404332 [Cucumis sativus]; e value 8.4e-121; identity 71.77%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS
        MD+SWMHKSRLSKDYELGVENFI FGFSNT  + IRCPCLKCGNCEK SR            +ESYKIWFWHGE+LP SS Y+ESSKFD H  E+ D+GS
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS

Query:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA
        + EMIEVAHEEYSKDP  FEK L DAEKPLYEGCKK+TKLSTLVKLYNLKVRY WS+ +FSELL+TLKEI+P +NE+  S+YEAKKTLGALGM YEKIHA
Subjt:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA

Query:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        CPN+CCLYRKE  NA EC ECG+SRWK              KV+WYFP IPRFKRLFRSI+ A+NL WHS ER+ +GKLRH ADSPAWKL+D+K
Subjt:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TUX7 Transposase; e value 4.6e-141; identity 85.08%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDIG
        MDRSWMHKSRL KDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR            +ESYKIWFWHGE QLPESSLYEESSKFDTHMYE ND+G
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDIG

Query:  SINEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIH
         INEMIEVAHEEYSKDPNEFEK LNDAEK LYEGCKKFTKLSTLVKLYNLKVRY WS+I+FSELLKTLKEILPT NEI TS+YEAKKTLGALGMSYEKIH
Subjt:  SINEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIH

Query:  ACPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        ACPNDCCLYRKEH NATECPECGESRWK              KVVWYFPPIPRFKRLFRSI+NAKNLIWHSNERVI GKLRH ADSPAWKLIDLK
Subjt:  ACPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

A0A5A7U2S8 Transposase; e value 9.1e-121; identity 71.43%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS
        MD+ WMHKSRLSK+YELGVE+FI+FGFSNT  + IRCPCLKCGNCEK SR            +ESYKIWFWHG+     S Y ESSKFDTH  EEND+GS
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS

Query:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA
        + E+IEVAHEEYSKDPN FEK L DAEKPLYEGCKK+TKLSTLVKLYNLK RY W++I+FSELLKTLKEILPT+NE+  S+YEAKKTLGALGM YE+IHA
Subjt:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA

Query:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        CPN+CCLYRKE  NATECPECG+SRWK              KV+WYFPPIPRFKRLFRSI+ A+NL WH++ER+ +GKLRH ADSPAWKL+D K
Subjt:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

A0A5A7UL63 Transposase; e value 4.0e-116; identity 76.14%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS
        MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCP LKCGNCEKQSR            +ESYK+WFWHGEQLPES LYEESSKFDT+MYEEND+GS
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS

Query:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA
        INEMIEVAHEEYSKDPNEF K LNDAEKPLYEGCKKFTKLSTLVKLYNL+VRY WS+I+FSELLKTLKEILPTSNEI TSMYEAKKTLGAL MSYEKIHA
Subjt:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA

Query:  CPNDCCLYRKEHVNATECPECGESRWKKV-----VWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        CPN+CCLYRKEH NATECP+CGESRWK           PPIP FKRLFR                         +SPA KLIDLK
Subjt:  CPNDCCLYRKEHVNATECPECGESRWKKV-----VWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

A0A5D3BVS7 Transposase; e value 4.6e-141; identity 85.08%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDIG
        MDRSWMHKSRL KDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR            +ESYKIWFWHGE QLPESSLYEESSKFDTHMYE ND+G
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDIG

Query:  SINEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIH
         INEMIEVAHEEYSKDPNEFEK LNDAEK LYEGCKKFTKLSTLVKLYNLKVRY WS+I+FSELLKTLKEILPT NEI TS+YEAKKTLGALGMSYEKIH
Subjt:  SINEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIH

Query:  ACPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        ACPNDCCLYRKEH NATECPECGESRWK              KVVWYFPPIPRFKRLFRSI+NAKNLIWHSNERVI GKLRH ADSPAWKLIDLK
Subjt:  ACPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

A0A5D3CRI9 Transposase; e value 2.3e-140; identity 84.69%
Query:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS
        MD SWMHKSRLSKDY LGVENFISFGFSNTKDASIRCPCLK GNCEKQSR            +ESYKIWFWHGEQLPESSLYEESSKFDTHMYEEND+GS
Subjt:  MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSR------------NESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGS

Query:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA
        INEMIEVAHEEYSKDPNEFEK LNDA+KPLYEGCK FTKLSTLVKLYNLKVRY W +I+FSELLKTLKEI PTSNEI TSMYEAKKTLGALGMSYEKIHA
Subjt:  INEMIEVAHEEYSKDPNEFEKFLNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHA

Query:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
        CPNDCCLYRKEH NATECPECGESRWK              KVVWYFPPI RFKRLFRSIDNAKNLIW SNERVI+GKLRH ADSPAWKLIDLK
Subjt:  CPNDCCLYRKEHVNATECPECGESRWK--------------KVVWYFPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGATAGATCATGGATGCACAAGAGTAGGTTATCGAAAGATTATGAATTGGGTGTAGAAAACTTCATAAGTTTTGGATTTTCTAATACAAAAGATGCCTCTATTCGTTG
TCCTTGTTTGAAATGTGGGAATTGTGAAAAGCAAAGTCGTAATGAAAGTTATAAAATTTGGTTTTGGCATGGTGAACAACTACCTGAGTCATCCTTATACGAGGAATCTT
CTAAGTTTGACACTCATATGTATGAAGAGAATGATATTGGAAGTATAAATGAAATGATTGAAGTTGCTCATGAGGAGTATTCAAAAGATCCAAATGAATTTGAGAAATTT
CTTAATGATGCCGAAAAACCACTATATGAAGGATGCAAAAAATTCACCAAGTTGTCCACATTAGTGAAGTTGTATAATTTAAAAGTTAGGTATAGATGGAGTAATATTAA
CTTTTCGGAACTACTGAAAACTTTAAAGGAAATTTTGCCTACTTCCAATGAGATCTCAACATCCATGTATGAAGCGAAGAAAACTTTAGGTGCATTAGGAATGAGTTATG
AAAAGATTCATGCATGCCCTAATGATTGTTGTTTATATAGAAAAGAACATGTTAATGCAACTGAATGTCCTGAATGTGGTGAATCAAGGTGGAAAAAAGTTGTATGGTAT
TTTCCACCGATTCCACGTTTCAAAAGATTGTTTCGAAGTATTGACAATGCTAAAAATTTGATTTGGCATTCTAATGAACGAGTAATTAATGGAAAATTACGACATCTTGC
AGACTCTCCAGCTTGGAAATTAATAGATTTGAAGTAG
mRNA sequence
ATGGATAGATCATGGATGCACAAGAGTAGGTTATCGAAAGATTATGAATTGGGTGTAGAAAACTTCATAAGTTTTGGATTTTCTAATACAAAAGATGCCTCTATTCGTTG
TCCTTGTTTGAAATGTGGGAATTGTGAAAAGCAAAGTCGTAATGAAAGTTATAAAATTTGGTTTTGGCATGGTGAACAACTACCTGAGTCATCCTTATACGAGGAATCTT
CTAAGTTTGACACTCATATGTATGAAGAGAATGATATTGGAAGTATAAATGAAATGATTGAAGTTGCTCATGAGGAGTATTCAAAAGATCCAAATGAATTTGAGAAATTT
CTTAATGATGCCGAAAAACCACTATATGAAGGATGCAAAAAATTCACCAAGTTGTCCACATTAGTGAAGTTGTATAATTTAAAAGTTAGGTATAGATGGAGTAATATTAA
CTTTTCGGAACTACTGAAAACTTTAAAGGAAATTTTGCCTACTTCCAATGAGATCTCAACATCCATGTATGAAGCGAAGAAAACTTTAGGTGCATTAGGAATGAGTTATG
AAAAGATTCATGCATGCCCTAATGATTGTTGTTTATATAGAAAAGAACATGTTAATGCAACTGAATGTCCTGAATGTGGTGAATCAAGGTGGAAAAAAGTTGTATGGTAT
TTTCCACCGATTCCACGTTTCAAAAGATTGTTTCGAAGTATTGACAATGCTAAAAATTTGATTTGGCATTCTAATGAACGAGTAATTAATGGAAAATTACGACATCTTGC
AGACTCTCCAGCTTGGAAATTAATAGATTTGAAGTAG
Protein sequence
MDRSWMHKSRLSKDYELGVENFISFGFSNTKDASIRCPCLKCGNCEKQSRNESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDIGSINEMIEVAHEEYSKDPNEFEKF
LNDAEKPLYEGCKKFTKLSTLVKLYNLKVRYRWSNINFSELLKTLKEILPTSNEISTSMYEAKKTLGALGMSYEKIHACPNDCCLYRKEHVNATECPECGESRWKKVVWY
FPPIPRFKRLFRSIDNAKNLIWHSNERVINGKLRHLADSPAWKLIDLK
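
The opening and closing codons of the CDS can be checked against the protein sequence with a short translation routine. The Python sketch below assumes the standard genetic code (NCBI translation table 1). The page does not state which table was used, and the function name `translate` is hypothetical. Only the first ten codons and the last four codons of the CDS listed above are used as input.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# ASSUMPTION: the record was translated with this table; the page does not say.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the CDS shown above.
print(translate("ATGGATAGATCATGGATGCACAAGAGTAGG"))  # prints: MDRSWMHKSR
# Last four codons of the CDS shown above, ending in the TAG stop codon.
print(translate("GATTTGAAGTAG"))  # prints: DLK
```

Both fragments agree with the start (`MDRSWMHKSR`) and end (`DLK`) of the protein sequence. Passing the full CDS to the same function would extend the check to every residue.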