; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0218121 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0218121
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionHistidine phosphatase
Genome locationCMiso1.1chr08:6530225..6533742
RNA-Seq ExpressionCmc08g0218121
SyntenyCmc08g0218121
Gene Ontology termsNA
InterPro domainsIPR013078 - Histidine phosphatase superfamily, clade-1
IPR029033 - Histidine phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134818.1 uncharacterized protein At3g52155, chloroplastic [Cucumis sativus]3.9e-12297.85Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSSLR RNPIQW SAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGF EAEVHFISSFYSIAAMDGQTADHLQQVIC+YSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_008440939.1 PREDICTED: uncharacterized protein LOC103485205 [Cucumis melo]5.4e-12498.71Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSS RRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHF+SSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_022132392.1 uncharacterized protein LOC111005263 [Momordica charantia]3.4e-11087.55Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MN SFCCN+IQQHPQ F++QH S S R RNPIQWPSAVIQTAE + ATEEA SQSESVARRLILLRHARSS QKLSLRDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETLK+MQ +VS FSEAEVH ISSFYSIAAMDGQTA+HLQQVIC+YSRNEI+TVMCMGHN+GWEEAASMFSGSSI+LKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPN+ S
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_022926854.1 uncharacterized protein At3g52155, chloroplastic isoform X1 [Cucurbita moschata]4.5e-11087.98Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+I QHPQ  K+QH SS  R RN IQWPS VIQTAES+VATEEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETL++MQEQV GFSEAEVHFI SFYSIAAMDGQTA+HLQQVI +YSRN+I+TVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIV+PN RS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_038881679.1 uncharacterized protein At3g52155, chloroplastic [Benincasa hispida]3.3e-12195.71Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+IQQHPQLFK QHPSSSLR RNPIQWPSAVIQTAES+VATEEA SQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLK+MQ+QVSGFSEAEVHFISSFYSIAAMDGQTA+HLQQVIC+YSRNEI+TVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

TrEMBL top hitse value%identityAlignment
A0A0A0KHM4 Uncharacterized protein1.9e-12297.85Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSSLR RNPIQW SAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGF EAEVHFISSFYSIAAMDGQTADHLQQVIC+YSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A1S3B2X1 uncharacterized protein LOC1034852052.6e-12498.71Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSS RRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHF+SSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A5A7SJV8 Uncharacterized protein2.6e-12498.71Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNNIQQHPQLFKVQHPSSS RRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHF+SSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A6J1BTP9 uncharacterized protein LOC1110052631.7e-11087.55Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MN SFCCN+IQQHPQ F++QH S S R RNPIQWPSAVIQTAE + ATEEA SQSESVARRLILLRHARSS QKLSLRDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETLK+MQ +VS FSEAEVH ISSFYSIAAMDGQTA+HLQQVIC+YSRNEI+TVMCMGHN+GWEEAASMFSGSSI+LKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPN+ S
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A6J1EGB9 uncharacterized protein At3g52155, chloroplastic isoform X12.2e-11087.98Show/hide
Query:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+I QHPQ  K+QH SS  R RN IQWPS VIQTAES+VATEEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETL++MQEQV GFSEAEVHFI SFYSIAAMDGQTA+HLQQVI +YSRN+I+TVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIV+PN RS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

SwissProt top hitse value%identityAlignment
Q94BY1 Uncharacterized protein At3g52155, chloroplastic2.3e-6970.16Show/hide
Query:  TAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFIS
        TA     ++ AAS S S++RRLILLRHA SS   LSLRDHDRPLSK G+ DA K+A  L  L W+P+LILSSDA RTRETLK MQ QV GF EA VHFI 
Subjt:  TAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFIS

Query:  SFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS
        SFYSIAAMDGQTA+HLQ +I  YS  +I T+MCMGHNKGWEEAASM SG+SIKLKTCNAALL+A G SW+EAFAL+G GGWKL G+V P+S
Subjt:  SFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS

Arabidopsis top hitse value%identityAlignment
AT3G52155.1 Phosphoglycerate mutase family protein1.7e-7070.16Show/hide
Query:  TAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFIS
        TA     ++ AAS S S++RRLILLRHA SS   LSLRDHDRPLSK G+ DA K+A  L  L W+P+LILSSDA RTRETLK MQ QV GF EA VHFI 
Subjt:  TAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLKLMQEQVSGFSEAEVHFIS

Query:  SFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS
        SFYSIAAMDGQTA+HLQ +I  YS  +I T+MCMGHNKGWEEAASM SG+SIKLKTCNAALL+A G SW+EAFAL+G GGWKL G+V P+S
Subjt:  SFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGCTTCATTTTGCTGTAATAACATTCAACAACATCCACAACTCTTTAAAGTTCAACATCCAAGCTCCTCCCTGAGGAGGAGAAACCCTATTCAATGGCCATCTGC
CGTTATTCAGACGGCGGAGAGCGAAGTGGCAACTGAGGAAGCTGCCTCCCAATCGGAATCTGTTGCTCGTCGCCTTATTCTGCTTCGTCATGCTAGGAGTTCACGGCAAA
AGCTGTCACTGCGAGATCATGATCGCCCCTTGAGTAAAGATGGGAAAGTTGATGCTATAAAAATTGCTCATAAACTCCAAGAATTGAGTTGGATCCCTGAACTTATTTTA
TCCAGTGATGCCAAGCGAACCAGAGAAACACTTAAGCTAATGCAGGAGCAAGTTAGTGGGTTTTCGGAAGCAGAGGTTCATTTCATTTCCAGTTTTTATTCTATTGCTGC
CATGGATGGTCAGACTGCGGATCACCTTCAGCAGGTTATCTGTGATTATTCGAGGAATGAGATAGTTACAGTCATGTGTATGGGACATAATAAAGGGTGGGAAGAGGCAG
CCTCAATGTTCAGTGGTTCCTCCATAAAACTGAAGACATGCAATGCTGCTTTGCTTGAGGCTTCAGGAAAATCATGGGATGAGGCATTTGCTTTGGCGGGACTAGGTGGA
TGGAAGCTTCATGGCATAGTAAAACCAAATAGTAGATCATAG
mRNA sequenceShow/hide mRNA sequence
ACAATTTTTCATCTAGAAAACTCCATTGAAGGGTACAAAAATAATTACTGCTAAGATTAAAATTTGTTAAGAGAGCCCAAAGCCTTATCTGTTTGTCTTCTTCTTCTTCC
TCTTCTTCTTCTTCCTCTTCTTCGAGCTCATCTACCATTCTCGTTCAGCACAACGAGGTTCAAGCGAGGAACTCTCGAGTTGTTCCTTCGAATTGATTCCGGTTCTGAAC
AATCTAATCATGGCAATGCATGCCCATTAAGCAAACCAGCTAGTACATAGCAGGTATTGTTCGTTTAGGAAAAATTAACCACCGATTATCACTGCTATTTCAAGCTTTAA
GCTCGCGATCGATTCTCCGAGCCTGCCCACTCGCCTCTGTCGCCTCTTGGGTTGAATCGAACACGTATCATGAACGCTTCATTTTGCTGTAATAACATTCAACAACATCC
ACAACTCTTTAAAGTTCAACATCCAAGCTCCTCCCTGAGGAGGAGAAACCCTATTCAATGGCCATCTGCCGTTATTCAGACGGCGGAGAGCGAAGTGGCAACTGAGGAAG
CTGCCTCCCAATCGGAATCTGTTGCTCGTCGCCTTATTCTGCTTCGTCATGCTAGGAGTTCACGGCAAAAGCTGTCACTGCGAGATCATGATCGCCCCTTGAGTAAAGAT
GGGAAAGTTGATGCTATAAAAATTGCTCATAAACTCCAAGAATTGAGTTGGATCCCTGAACTTATTTTATCCAGTGATGCCAAGCGAACCAGAGAAACACTTAAGCTAAT
GCAGGAGCAAGTTAGTGGGTTTTCGGAAGCAGAGGTTCATTTCATTTCCAGTTTTTATTCTATTGCTGCCATGGATGGTCAGACTGCGGATCACCTTCAGCAGGTTATCT
GTGATTATTCGAGGAATGAGATAGTTACAGTCATGTGTATGGGACATAATAAAGGGTGGGAAGAGGCAGCCTCAATGTTCAGTGGTTCCTCCATAAAACTGAAGACATGC
AATGCTGCTTTGCTTGAGGCTTCAGGAAAATCATGGGATGAGGCATTTGCTTTGGCGGGACTAGGTGGATGGAAGCTTCATGGCATAGTAAAACCAAATAGTAGATCATA
GAACCTTTTATAGTTTTCTAGAAAAGTTTAGCAAACAATAGCAACCAACCACTTGTTGCACCCGAATTCACCCTTAGAATGCTCATTTATCTAGATATAATAATTTTAAT
GAAATTTCAACCTCTTGGCTTCGAAACTTTTGCCAGTTAAACCTTTTTCGCTTATTCTTATCCTTTGTTATTTGATCATTTATCGTGTCAAAAGATTAAGACATGTTTGA
CAGCAATTTTAAAATTATTAATAAAATTGCTAAAATTTGTTTATCGTATT
Protein sequenceShow/hide protein sequence
MNASFCCNNIQQHPQLFKVQHPSSSLRRRNPIQWPSAVIQTAESEVATEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELIL
SSDAKRTRETLKLMQEQVSGFSEAEVHFISSFYSIAAMDGQTADHLQQVICDYSRNEIVTVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWDEAFALAGLGG
WKLHGIVKPNSRS