; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G003700 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G003700
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionHistidine phosphatase superfamily
Genome locationCmo_Chr07:1690160..1696271
RNA-Seq ExpressionCmoCh07G003700
SyntenyCmoCh07G003700
Gene Ontology termsNA
InterPro domainsIPR013078 - Histidine phosphatase superfamily, clade-1
IPR029033 - Histidine phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594724.1 putative protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]3.5e-151100Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEEAFALAGLGGWKLHGIPYLLVSSLTYADDSGSGYAYYRIEKWFWCLRCLWSFFLKNGELNF
        LLEASGKSWEEAFALAGLGGWKLHGIPYLLVSSLTYADDSGSGYAYYRIEKWFWCLRCLWSFFLKNGELNF
Subjt:  LLEASGKSWEEAFALAGLGGWKLHGIPYLLVSSLTYADDSGSGYAYYRIEKWFWCLRCLWSFFLKNGELNF

KAG7026693.1 hypothetical protein SDJN02_10696 [Cucurbita argyrosperma subsp. argyrosperma]9.2e-112100Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEE
        LLEASGKSWEE
Subjt:  LLEASGKSWEE

XP_022926854.1 uncharacterized protein At3g52155, chloroplastic isoform X1 [Cucurbita moschata]3.7e-121100Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEEAFALAGLGGWKLHGI
        LLEASGKSWEEAFALAGLGGWKLHGI
Subjt:  LLEASGKSWEEAFALAGLGGWKLHGI

XP_023003767.1 uncharacterized protein At3g52155, chloroplastic isoform X1 [Cucurbita maxima]3.3e-11798.23Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCNHI  HPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEA+VHFIPSFYSIAAMDGQTAEHLQQ ISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEEAFALAGLGGWKLHGI
        LLEASGKSWEEAFALAGLGGWKLHGI
Subjt:  LLEASGKSWEEAFALAGLGGWKLHGI

XP_023517120.1 uncharacterized protein At3g52155, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]2.4e-12099.12Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELNWIPELILSSDAKRTRETLQIMQEQ VGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEEAFALAGLGGWKLHGI
        LLEASGKSWEEAFALAGLGGW+LHGI
Subjt:  LLEASGKSWEEAFALAGLGGWKLHGI

TrEMBL top hitse value%identityAlignment
A0A0A0KHM4 Uncharacterized protein7.4e-10788.5Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCN+I QHPQ  K+QH SS  RGRN IQW S VIQTAES+VATEEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETL++MQEQV GF EAEVHFI SFYSIAAMDGQTA+HLQQVI NYSRN+I+TVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEEAFALAGLGGWKLHGI
        LLEASGKSW+EAFALAGLGGWKLHGI
Subjt:  LLEASGKSWEEAFALAGLGGWKLHGI

A0A1S3B2X1 uncharacterized protein LOC1034852059.7e-10788.05Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCN+I QHPQ  K+QH SS  R RN IQWPS VIQTAES+VATEEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETL++MQEQV GFSEAEVHF+ SFYSIAAMDGQTA+HLQQVI +YSRN+I+TVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEEAFALAGLGGWKLHGI
        LLEASGKSW EAFALAGLGGWKLHGI
Subjt:  LLEASGKSWEEAFALAGLGGWKLHGI

A0A5A7SJV8 Uncharacterized protein9.7e-10788.05Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCN+I QHPQ  K+QH SS  R RN IQWPS VIQTAES+VATEEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        EL+WIPELILSSDAKRTRETL++MQEQV GFSEAEVHF+ SFYSIAAMDGQTA+HLQQVI +YSRN+I+TVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEEAFALAGLGGWKLHGI
        LLEASGKSW EAFALAGLGGWKLHGI
Subjt:  LLEASGKSWEEAFALAGLGGWKLHGI

A0A6J1EGB9 uncharacterized protein At3g52155, chloroplastic isoform X11.8e-121100Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEEAFALAGLGGWKLHGI
        LLEASGKSWEEAFALAGLGGWKLHGI
Subjt:  LLEASGKSWEEAFALAGLGGWKLHGI

A0A6J1KNJ1 uncharacterized protein At3g52155, chloroplastic isoform X11.6e-11798.23Show/hide
Query:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
        MNASFCCNHI  HPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ
Subjt:  MNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWPSTVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQ

Query:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
        ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEA+VHFIPSFYSIAAMDGQTAEHLQQ ISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA
Subjt:  ELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAA

Query:  LLEASGKSWEEAFALAGLGGWKLHGI
        LLEASGKSWEEAFALAGLGGWKLHGI
Subjt:  LLEASGKSWEEAFALAGLGGWKLHGI

SwissProt top hitse value%identityAlignment
Q94BY1 Uncharacterized protein At3g52155, chloroplastic4.5e-6972.04Show/hide
Query:  TAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIP
        TA  +  ++ A S S S++RRLILLRHA SS   LS+RDHDRPLSK G+ DA K+AQ L  L W+P+LILSSDA RTRETL+ MQ QV GF EA VHFIP
Subjt:  TAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIP

Query:  SFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWEEAFALAGLGGWKLHGI
        SFYSIAAMDGQTAEHLQ +IS YS  DI T+MCMGHNKGWEEAASM SG+SIKLKTCNAALL+A G SWEEAFAL+G GGWKL G+
Subjt:  SFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWEEAFALAGLGGWKLHGI

Arabidopsis top hitse value%identityAlignment
AT3G52155.1 Phosphoglycerate mutase family protein3.2e-7072.04Show/hide
Query:  TAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIP
        TA  +  ++ A S S S++RRLILLRHA SS   LS+RDHDRPLSK G+ DA K+AQ L  L W+P+LILSSDA RTRETL+ MQ QV GF EA VHFIP
Subjt:  TAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIP

Query:  SFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWEEAFALAGLGGWKLHGI
        SFYSIAAMDGQTAEHLQ +IS YS  DI T+MCMGHNKGWEEAASM SG+SIKLKTCNAALL+A G SWEEAFAL+G GGWKL G+
Subjt:  SFYSIAAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWEEAFALAGLGGWKLHGI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAAAGGCAGCCAAGAATTAGCATCAAGAAAGCTACGACTTGATTCGAGCGAGGAACTCTCGAGTTGTTTCCAGAGATTGATTCAGCTTATTGACGATTTAAGCAT
GGCGATACATCCCATTGAGCGAATCAAACCAACTACGATGCGGCTGCTCATTCGGCTCTGTCGCCTCTCTGGTCGAATCGAACACATTCCGGAAGTGATTCGAAGATTCT
GTACAATGAATGCTTCATTTTGTTGTAATCACATTCATCAACACCCACAGTTTCTTAAAATTCAACATGCAAGCTCCTTCGCGAGGGGGAGAAACGCTATTCAATGGCCA
TCTACGGTTATTCAGACGGCGGAGAGCCAAGTGGCGACTGAGGAAGCTACCTCTCAATCGGAATCTGTTGCTCGTCGCCTTATTCTGCTTCGCCATGCCAGGAGTTCGCG
GCAAAAACTTTCAATGCGAGATCACGATCGCCCCTTGAGTAAAGATGGAAAAGTTGACGCTATTAAAATTGCTCAGAAACTACAAGAATTGAACTGGATCCCTGAACTTA
TTTTATCCAGTGATGCCAAGCGGACCAGAGAAACACTTCAGATAATGCAGGAGCAAGTTGTTGGTTTTTCGGAAGCTGAGGTTCATTTCATTCCCAGTTTTTATTCCATT
GCAGCCATGGATGGTCAAACTGCGGAGCACCTTCAGCAGGTTATCAGTAATTATTCGAGGAATGACATAATTACTGTCATGTGTATGGGACATAATAAAGGCTGGGAAGA
GGCAGCCTCGATGTTTAGTGGCTCCTCCATAAAACTGAAGACATGCAATGCTGCTTTGCTTGAGGCTTCGGGAAAATCATGGGAGGAGGCATTTGCTTTGGCGGGACTAG
GTGGGTGGAAGCTTCACGGCATACCGTATCTTTTGGTCTCCTCCTTGACTTATGCGGATGATTCTGGATCTGGATATGCTTACTACCGAATCGAGAAGTGGTTTTGGTGT
TTGAGATGCTTGTGGAGCTTCTTTTTGAAGAATGGTGAACTGAATTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAAAAGGCAGCCAAGAATTAGCATCAAGAAAGCTACGACTTGATTCGAGCGAGGAACTCTCGAGTTGTTTCCAGAGATTGATTCAGCTTATTGACGATTTAAGCAT
GGCGATACATCCCATTGAGCGAATCAAACCAACTACGATGCGGCTGCTCATTCGGCTCTGTCGCCTCTCTGGTCGAATCGAACACATTCCGGAAGTGATTCGAAGATTCT
GTACAATGAATGCTTCATTTTGTTGTAATCACATTCATCAACACCCACAGTTTCTTAAAATTCAACATGCAAGCTCCTTCGCGAGGGGGAGAAACGCTATTCAATGGCCA
TCTACGGTTATTCAGACGGCGGAGAGCCAAGTGGCGACTGAGGAAGCTACCTCTCAATCGGAATCTGTTGCTCGTCGCCTTATTCTGCTTCGCCATGCCAGGAGTTCGCG
GCAAAAACTTTCAATGCGAGATCACGATCGCCCCTTGAGTAAAGATGGAAAAGTTGACGCTATTAAAATTGCTCAGAAACTACAAGAATTGAACTGGATCCCTGAACTTA
TTTTATCCAGTGATGCCAAGCGGACCAGAGAAACACTTCAGATAATGCAGGAGCAAGTTGTTGGTTTTTCGGAAGCTGAGGTTCATTTCATTCCCAGTTTTTATTCCATT
GCAGCCATGGATGGTCAAACTGCGGAGCACCTTCAGCAGGTTATCAGTAATTATTCGAGGAATGACATAATTACTGTCATGTGTATGGGACATAATAAAGGCTGGGAAGA
GGCAGCCTCGATGTTTAGTGGCTCCTCCATAAAACTGAAGACATGCAATGCTGCTTTGCTTGAGGCTTCGGGAAAATCATGGGAGGAGGCATTTGCTTTGGCGGGACTAG
GTGGGTGGAAGCTTCACGGCATACCGTATCTTTTGGTCTCCTCCTTGACTTATGCGGATGATTCTGGATCTGGATATGCTTACTACCGAATCGAGAAGTGGTTTTGGTGT
TTGAGATGCTTGTGGAGCTTCTTTTTGAAGAATGGTGAACTGAATTTTTGA
Protein sequenceShow/hide protein sequence
MTKGSQELASRKLRLDSSEELSSCFQRLIQLIDDLSMAIHPIERIKPTTMRLLIRLCRLSGRIEHIPEVIRRFCTMNASFCCNHIHQHPQFLKIQHASSFARGRNAIQWP
STVIQTAESQVATEEATSQSESVARRLILLRHARSSRQKLSMRDHDRPLSKDGKVDAIKIAQKLQELNWIPELILSSDAKRTRETLQIMQEQVVGFSEAEVHFIPSFYSI
AAMDGQTAEHLQQVISNYSRNDIITVMCMGHNKGWEEAASMFSGSSIKLKTCNAALLEASGKSWEEAFALAGLGGWKLHGIPYLLVSSLTYADDSGSGYAYYRIEKWFWC
LRCLWSFFLKNGELNF