; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G03950 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G03950
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTYR_PHOSPHATASE_2 domain-containing protein
Genome locationChr6:3694446..3697998
RNA-Seq ExpressionCSPI06G03950
SyntenyCSPI06G03950
Gene Ontology termsGO:0006470 - protein dephosphorylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008138 - protein tyrosine/serine/threonine phosphatase activity (molecular function)
InterPro domainsIPR000340 - Dual specificity phosphatase, catalytic domain
IPR000387 - Tyrosine specific protein phosphatases domain
IPR020422 - Dual specificity protein phosphatase domain
IPR029021 - Protein-tyrosine phosphatase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032936.1 ynbD [Cucurbita argyrosperma subsp. argyrosperma]2.9e-12085.19Show/hide
Query:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
        M+KPGLS LIGLKA ALFS FLF RFYGFRLLS  FLYASLVS LVS+ASLPSINLPLLLGKKSDGTFP+WS++IFGPFL+FVR+LPSLRGLY +DDPYS
Subjt:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS

Query:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE
        EIC+G++VGGWPCSPDRLPPCNPA+VDCTCELPRCLE+SG GYLCVPTWDTRSPQP EIE AVRWIC+KRE KKPVFIHCAYGHGRSVAVTCA LVALG 
Subjt:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE

Query:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
        A+DWK+AEKI KEKR CIRMN SHRKALEEWSKHRLS PKK++
Subjt:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ

XP_004140900.1 uncharacterized protein LOC101207979 [Cucumis sativus]5.3e-138100Show/hide
Query:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
        MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
Subjt:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS

Query:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE
        EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE
Subjt:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE

Query:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
        AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
Subjt:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ

XP_008456599.1 PREDICTED: uncharacterized protein YnbD-like [Cucumis melo]3.0e-13395.88Show/hide
Query:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
        M+KPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
Subjt:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS

Query:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE
        EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLE+SGAGYLC+PTWDTRSPQPR+IELAVRWICRKREQKKPVFIHCAYGHGRSVAV CAALVALGE
Subjt:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE

Query:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
        AEDWKDAEKI KEKRPCIRMNSSHRKALEEWSK++LSAPKK++
Subjt:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ

XP_022134387.1 uncharacterized protein LOC111006659 [Momordica charantia]2.6e-12185.54Show/hide
Query:  VKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYSE
        +KPG+S LIGLKA ALFSLFLF RFYGFRLLSFQFLYASLVS LVSVASLPSINLPLLLGK++DG+FPIWS++IF PFL+FVR+LPSLRGLYR+DDPYSE
Subjt:  VKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYSE

Query:  ICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGEA
        IC+G+FVGGWP SPDRLPPCNPAI+DCTCELPRCLE+SG  YLC+PTWDTRSPQP  IE AVRW+CRKREQK+PVFIHCAYGHGRSVAVTCA LVALGEA
Subjt:  ICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGEA

Query:  EDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
        EDWK+AEK+ KEKRPCIRMNSSHRKALEEWSKHRLSAP K++
Subjt:  EDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ

XP_038886707.1 uncharacterized protein YnbD-like [Benincasa hispida]4.5e-12992.98Show/hide
Query:  VKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYSE
        +KPGLSSLIGLKA  LFSLFLF RFYGFRLLSFQFLYASLVS LVSVASLPSINLPLLLGKKSDGTFPIWS+IIFGPFLYFVRYLPSLRGLYR+DDPYSE
Subjt:  VKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYSE

Query:  ICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGEA
        IC+GLFVGGWP SPDRLPPCNPAIVDCTCELPRCL++SG+GYLCVPTWDTRSPQP+EIE AVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGEA
Subjt:  ICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGEA

Query:  EDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
        EDWKDAEKI KEKRPCIRMNSSHRKALEEWSKHRLSAPKK++
Subjt:  EDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ

TrEMBL top hitse value%identityAlignment
A0A0A0KE65 TYR_PHOSPHATASE_2 domain-containing protein2.6e-138100Show/hide
Query:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
        MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
Subjt:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS

Query:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE
        EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE
Subjt:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE

Query:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
        AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
Subjt:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ

A0A1S3C3L2 uncharacterized protein YnbD-like1.5e-13395.88Show/hide
Query:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
        M+KPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
Subjt:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS

Query:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE
        EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLE+SGAGYLC+PTWDTRSPQPR+IELAVRWICRKREQKKPVFIHCAYGHGRSVAV CAALVALGE
Subjt:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE

Query:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
        AEDWKDAEKI KEKRPCIRMNSSHRKALEEWSK++LSAPKK++
Subjt:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ

A0A6J1BZH1 uncharacterized protein LOC1110066591.3e-12185.54Show/hide
Query:  VKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYSE
        +KPG+S LIGLKA ALFSLFLF RFYGFRLLSFQFLYASLVS LVSVASLPSINLPLLLGK++DG+FPIWS++IF PFL+FVR+LPSLRGLYR+DDPYSE
Subjt:  VKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYSE

Query:  ICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGEA
        IC+G+FVGGWP SPDRLPPCNPAI+DCTCELPRCLE+SG  YLC+PTWDTRSPQP  IE AVRW+CRKREQK+PVFIHCAYGHGRSVAVTCA LVALGEA
Subjt:  ICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGEA

Query:  EDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
        EDWK+AEK+ KEKRPCIRMNSSHRKALEEWSKHRLSAP K++
Subjt:  EDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ

A0A6J1E1D8 uncharacterized protein LOC1114298964.1e-12085.6Show/hide
Query:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
        M+KPGLS LIGLKA ALFS FLF RFYGFRLLS  FLYASLVS LVS+ASLPSINLPLLLGKKSDGTFP+WS++IFGPFL+FVR+LPSLRGLY +DDPYS
Subjt:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS

Query:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE
        EIC+G++VGGWP SPDRLPPCNPA+VDCTCELPRCLE+SG GYLCVPTWDTRSPQP EIE AVRWIC+KRE KKPVFIHCAYGHGRSVAVTCA LVALG 
Subjt:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE

Query:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ
        AEDWK+AEKI KEKR CIRMN SHRKALEEWSKHRLSAPKK++
Subjt:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKRQ

A0A6J1JS23 uncharacterized protein LOC1114872995.4e-12085.95Show/hide
Query:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS
        M+KPGLS LIGLKA ALFS FLF RFYGFRLLS  FLYASLVS LVS+ASLPSINLPLLLGKKSDGTFP+WS++IFGPFL+FVR+LPSLRGLY KDDPYS
Subjt:  MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYS

Query:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE
        EIC+G++VGGWPCSPDRLPP  PA+VDCTCELPRCLE+SG GYLCVPTWDTRSPQP EIE+AVRWIC+KRE KKPVFIHCAYGHGRSVAVTCA LVALG 
Subjt:  EICDGLFVGGWPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGE

Query:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKR
        AEDWK+AEKI KEKR CIRMN SHRKALEEWSKHRLSAPKK+
Subjt:  AEDWKDAEKITKEKRPCIRMNSSHRKALEEWSKHRLSAPKKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAAACCAGGACTTTCATCTTTGATAGGGTTGAAGGCGGTAGCTTTATTCTCTCTATTTCTATTTTTTAGGTTCTATGGCTTCAGATTGTTATCATTTCAATTCTT
ATATGCTTCTTTAGTTTCTTTCTTGGTTTCAGTTGCTTCTCTTCCATCAATCAATCTTCCCCTGCTTTTGGGAAAGAAATCAGATGGGACTTTTCCCATTTGGTCCGTCA
TTATTTTTGGTCCATTTTTGTATTTTGTTCGATATCTTCCTTCATTGCGTGGATTGTACCGTAAAGATGATCCTTACAGTGAAATTTGTGATGGTTTATTTGTTGGGGGG
TGGCCTTGTTCACCTGATAGATTGCCTCCTTGTAATCCTGCCATTGTTGATTGTACTTGTGAATTGCCAAGGTGTTTAGAACTCTCTGGGGCGGGTTATTTGTGTGTTCC
AACTTGGGATACACGCTCTCCTCAGCCTCGAGAAATTGAATTGGCCGTTCGGTGGATTTGTAGAAAGAGAGAGCAGAAGAAGCCGGTGTTCATCCATTGTGCTTATGGTC
ATGGAAGAAGTGTTGCCGTTACGTGTGCAGCGTTAGTGGCTCTAGGAGAGGCAGAAGATTGGAAAGATGCAGAGAAGATAACAAAGGAAAAACGACCTTGCATCAGGATG
AATTCTTCTCATCGCAAAGCCTTGGAAGAATGGTCGAAACATCGGCTATCCGCTCCAAAGAAGAGACAATGA
mRNA sequenceShow/hide mRNA sequence
AAAAAATCCATCTTTTCTTCCATTGGCTTCGTTTCAACGAGGATTGGTGGGTCACAGTCCTCCTCAGCTTCCATGTTTCATATGATCGTAAGAAATTTCCAAATCGTAAC
AACCCATTTGATATTTTGATCTGAAATTTTTTCTAATCTAAGTTTCTGAGGTTTCGAAATGGTGAAACCAGGACTTTCATCTTTGATAGGGTTGAAGGCGGTAGCTTTAT
TCTCTCTATTTCTATTTTTTAGGTTCTATGGCTTCAGATTGTTATCATTTCAATTCTTATATGCTTCTTTAGTTTCTTTCTTGGTTTCAGTTGCTTCTCTTCCATCAATC
AATCTTCCCCTGCTTTTGGGAAAGAAATCAGATGGGACTTTTCCCATTTGGTCCGTCATTATTTTTGGTCCATTTTTGTATTTTGTTCGATATCTTCCTTCATTGCGTGG
ATTGTACCGTAAAGATGATCCTTACAGTGAAATTTGTGATGGTTTATTTGTTGGGGGGTGGCCTTGTTCACCTGATAGATTGCCTCCTTGTAATCCTGCCATTGTTGATT
GTACTTGTGAATTGCCAAGGTGTTTAGAACTCTCTGGGGCGGGTTATTTGTGTGTTCCAACTTGGGATACACGCTCTCCTCAGCCTCGAGAAATTGAATTGGCCGTTCGG
TGGATTTGTAGAAAGAGAGAGCAGAAGAAGCCGGTGTTCATCCATTGTGCTTATGGTCATGGAAGAAGTGTTGCCGTTACGTGTGCAGCGTTAGTGGCTCTAGGAGAGGC
AGAAGATTGGAAAGATGCAGAGAAGATAACAAAGGAAAAACGACCTTGCATCAGGATGAATTCTTCTCATCGCAAAGCCTTGGAAGAATGGTCGAAACATCGGCTATCCG
CTCCAAAGAAGAGACAATGATGTGAGTCCTAAGCTTCTTTCTGGTAGCTGAATAGCAAAAACATGAATCTAATAGGCCATTCCAAACCCCTCCCCTTCTAATCTTCCATA
TCTTATTGCTTGGATGAGCATTTGATGTATCATCATTGAGTACTAGTAATATTAATAGATAAACAAACTCTGTAACAAAAAGTTTTCATCTTTCTATCTTGGATTTAGTC
ATACAATTTCTGTGGCTTCCATTTGCCATTATTGCCCTTTGCCTGCAAGTTGGTTAGTTCAGCCTTGAAGTGTAGATATAGTCTTATAGACATGAATGAACATAGAAATC
GCTGAAATCTATGGAACTTTGATTCTGTAGCTAGTTGAATTTAAACCTATATATGCTTTTTACTGTATTCTTTTTCAATCTCATGATTCAGCTCAAGTTTGAGAAACAAC
TGCTCT
Protein sequenceShow/hide protein sequence
MVKPGLSSLIGLKAVALFSLFLFFRFYGFRLLSFQFLYASLVSFLVSVASLPSINLPLLLGKKSDGTFPIWSVIIFGPFLYFVRYLPSLRGLYRKDDPYSEICDGLFVGG
WPCSPDRLPPCNPAIVDCTCELPRCLELSGAGYLCVPTWDTRSPQPREIELAVRWICRKREQKKPVFIHCAYGHGRSVAVTCAALVALGEAEDWKDAEKITKEKRPCIRM
NSSHRKALEEWSKHRLSAPKKRQ