; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G2391 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G2391
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionHistidine phosphatase
Genome locationctg1002:4449101..4452435
RNA-Seq ExpressionCucsat.G2391
SyntenyCucsat.G2391
Gene Ontology termsNA
InterPro domainsIPR013078 - Histidine phosphatase superfamily, clade-1
IPR029033 - Histidine phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134818.1 uncharacterized protein At3g52155, chloroplastic [Cucumis sativus]4.42e-162100Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_008440939.1 PREDICTED: uncharacterized protein LOC103485205 [Cucumis melo]5.54e-15696.57Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSS R RNPIQW SAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGF EAEVHF+SSFYSIAAMDGQTADHLQQVIC+YSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_022132392.1 uncharacterized protein LOC111005263 [Momordica charantia]1.18e-14187.12Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MN SFCCN+IQQHPQ F++QH S S RGRNPIQW SAVIQTAE + ATEEA SQSESVARRLILLRHARSS QKLS+RDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETLK+MQ +VS F EAEVH ISSFYSIAAMDGQTA+HLQQVICNYSRNEI+TVMCMGHN+GWEEAASMFSGSSI+LKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPN+ S
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_022926854.1 uncharacterized protein At3g52155, chloroplastic isoform X1 [Cucurbita moschata]8.29e-14287.98Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+I QHPQ  K+QH SS  RGRN IQW S VIQTAES+VATEEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETL++MQEQV GF EAEVHFI SFYSIAAMDGQTA+HLQQVI NYSRN+I+TVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIV+PN RS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_038881679.1 uncharacterized protein At3g52155, chloroplastic [Benincasa hispida]4.56e-15594.85Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+IQQHPQLFK QHPSSSLR RNPIQW SAVIQTAES+VATEEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLK+MQ+QVSGF EAEVHFISSFYSIAAMDGQTA+HLQQVICNYSRNEI+TVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

TrEMBL top hitse value%identityAlignment
A0A0A0KHM4 Uncharacterized protein2.14e-162100Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A1S3B2X1 uncharacterized protein LOC1034852052.68e-15696.57Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSS R RNPIQW SAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGF EAEVHF+SSFYSIAAMDGQTADHLQQVIC+YSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A5A7SJV8 Uncharacterized protein2.68e-15696.57Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSS R RNPIQW SAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGF EAEVHF+SSFYSIAAMDGQTADHLQQVIC+YSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A6J1BTP9 uncharacterized protein LOC1110052635.70e-14287.12Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MN SFCCN+IQQHPQ F++QH S S RGRNPIQW SAVIQTAE + ATEEA SQSESVARRLILLRHARSS QKLS+RDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETLK+MQ +VS F EAEVH ISSFYSIAAMDGQTA+HLQQVICNYSRNEI+TVMCMGHN+GWEEAASMFSGSSI+LKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPN+ S
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A6J1EGB9 uncharacterized protein At3g52155, chloroplastic isoform X14.02e-14287.98Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+I QHPQ  K+QH SS  RGRN IQW S VIQTAES+VATEEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETL++MQEQV GF EAEVHFI SFYSIAAMDGQTA+HLQQVI NYSRN+I+TVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIV+PN RS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

SwissProt top hitse value%identityAlignment
Q94BY1 Uncharacterized protein At3g52155, chloroplastic1.8e-6969.63Show/hide
Query:  TAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFIS
        TA     ++ AAS S S++RRLILLRHA SS   LS+RDHDRPLSK G+ DA K+A  L  L W+P+LILSSDA RTRETLK MQ QV GF+EA VHFI 
Subjt:  TAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFIS

Query:  SFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS
        SFYSIAAMDGQTA+HLQ +I  YS  +I T+MCMGHNKGWEEAASM SG+SIKLKTCNAALL+A G SW+EAFAL+G GGWKL G+V P+S
Subjt:  SFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS

Arabidopsis top hitse value%identityAlignment
AT3G52155.1 Phosphoglycerate mutase family protein1.3e-7069.63Show/hide
Query:  TAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFIS
        TA     ++ AAS S S++RRLILLRHA SS   LS+RDHDRPLSK G+ DA K+A  L  L W+P+LILSSDA RTRETLK MQ QV GF+EA VHFI 
Subjt:  TAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLKLMQEQVSGFLEAEVHFIS

Query:  SFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS
        SFYSIAAMDGQTA+HLQ +I  YS  +I T+MCMGHNKGWEEAASM SG+SIKLKTCNAALL+A G SW+EAFAL+G GGWKL G+V P+S
Subjt:  SFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGCTTCATTTTGCTGTAATAACATTCAACAACATCCACAACTCTTTAAAGTTCAACACCCAAGCTCCTCCCTGAGGGGAAGAAACCCTATTCAATGGTCATCTGC
TGTTATTCAGACGGCGGAGAGCGAAGTGGCAACTGAGGAAGCTGCGTCCCAATCGGAATCTGTTGCTCGTCGCCTTATTCTGCTTCGTCATGCTAGGAGTTCACGGCAAA
AACTGTCAGTGCGAGATCATGATCGCCCCTTGAGTAAAGATGGAAAAGTTGATGCTATTAAAATTGCTCATAAACTCCAAGAATTGAGTTGGATCCCTGAACTTATTTTA
TCCAGTGATGCCAAGCGAACCAGAGAAACACTTAAGCTAATGCAGGAGCAAGTTAGTGGTTTTTTGGAAGCAGAGGTTCATTTCATTTCCAGTTTTTATTCTATCGCTGC
CATGGATGGTCAGACTGCGGATCACCTTCAGCAGGTTATCTGTAACTATTCAAGGAATGAGATAGTTACAGTCATGTGTATGGGACATAATAAAGGGTGGGAAGAGGCAG
CCTCAATGTTCAGTGGTTCCTCCATAAAACTGAAGACATGTAATGCTGCTTTGCTTGAGGCTTCAGGAAAATCATGGGATGAGGCATTTGCTTTGGCGGGACTAGGTGGA
TGGAAGCTTCATGGCATAGTAAAACCAAATAGTAGATCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGCTTCATTTTGCTGTAATAACATTCAACAACATCCACAACTCTTTAAAGTTCAACACCCAAGCTCCTCCCTGAGGGGAAGAAACCCTATTCAATGGTCATCTGC
TGTTATTCAGACGGCGGAGAGCGAAGTGGCAACTGAGGAAGCTGCGTCCCAATCGGAATCTGTTGCTCGTCGCCTTATTCTGCTTCGTCATGCTAGGAGTTCACGGCAAA
AACTGTCAGTGCGAGATCATGATCGCCCCTTGAGTAAAGATGGAAAAGTTGATGCTATTAAAATTGCTCATAAACTCCAAGAATTGAGTTGGATCCCTGAACTTATTTTA
TCCAGTGATGCCAAGCGAACCAGAGAAACACTTAAGCTAATGCAGGAGCAAGTTAGTGGTTTTTTGGAAGCAGAGGTTCATTTCATTTCCAGTTTTTATTCTATCGCTGC
CATGGATGGTCAGACTGCGGATCACCTTCAGCAGGTTATCTGTAACTATTCAAGGAATGAGATAGTTACAGTCATGTGTATGGGACATAATAAAGGGTGGGAAGAGGCAG
CCTCAATGTTCAGTGGTTCCTCCATAAAACTGAAGACATGTAATGCTGCTTTGCTTGAGGCTTCAGGAAAATCATGGGATGAGGCATTTGCTTTGGCGGGACTAGGTGGA
TGGAAGCTTCATGGCATAGTAAAACCAAATAGTAGATCGTAG
Protein sequenceShow/hide protein sequence
MNASFCCNNIQQHPQLFKVQHPSSSLRGRNPIQWSSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSVRDHDRPLSKDGKVDAIKIAHKLQELSWIPELIL
SSDAKRTRETLKLMQEQVSGFLEAEVHFISSFYSIAAMDGQTADHLQQVICNYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGG
WKLHGIVKPNSRS