; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021974 (gene) of Snake gourd v1 genome

Gene IDTan0021974
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPhosphoglycerate mutase family protein
Genome locationLG03:76408301..76411397
RNA-Seq ExpressionTan0021974
SyntenyTan0021974
Gene Ontology termsNA
InterPro domainsIPR013078 - Histidine phosphatase superfamily, clade-1
IPR029033 - Histidine phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134818.1 uncharacterized protein At3g52155, chloroplastic [Cucumis sativus]1.1e-11692.7Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+IQQHPQ FK+QH SSS RGRNPIQW  AVIQTAESE+A EEAASQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        ELSWIPELILSSDAKRTRETL++MQEQVSGF EAEVHFISSFYSIAAMDGQTA+HLQQVI NYSRNEI+TVMCMGHNKGWEEAASMFSGSSIKL+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_008440939.1 PREDICTED: uncharacterized protein LOC103485205 [Cucumis melo]2.1e-11591.85Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+IQQHPQ FK+QH SSS R RNPIQW  AVIQTAESE+A EEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        ELSWIPELILSSDAKRTRETL++MQEQVSGFSEAEVHF+SSFYSIAAMDGQTA+HLQQVI +YSRNEI+TVMCMGHNKGWEEAASMFSGSSIKL+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_022926854.1 uncharacterized protein At3g52155, chloroplastic isoform X1 [Cucurbita moschata]9.6e-11390.56Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNHI QHPQF KIQHASS  RGRN IQW   VIQTAES++A EEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        EL+WIPELILSSDAKRTRETLQIMQEQV GFSEAEVHFI SFYSIAAMDGQTAEHLQQVISNYSRN+IITVMCMGHNKGWEEAASMFSGSSIKL+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIV+PN RS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_023517120.1 uncharacterized protein At3g52155, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]6.2e-11289.7Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNHI QHPQF KIQHASS  RGRN IQW   VIQTAES++A EEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        EL+WIPELILSSDAKRTRETLQIMQEQ  GFSEAEVHFI SFYSIAAMDGQTAEHLQQVISNYSRN+IITVMCMGHNKGWEEAASMFSGSSIKL+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGW+LHGIV+PN RS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

XP_038881679.1 uncharacterized protein At3g52155, chloroplastic [Benincasa hispida]8.4e-11793.56Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNHIQQHPQ FK QH SSS R RNPIQW  AVIQTAES++A EEA SQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        ELSWIPELILSSDAKRTRETL+IMQ+QVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVI NYSRNEIITVMCMGHNKGWEEAASMFSGSSIKL+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

TrEMBL top hitse value%identityAlignment
A0A0A0KHM4 Uncharacterized protein5.3e-11792.7Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+IQQHPQ FK+QH SSS RGRNPIQW  AVIQTAESE+A EEAASQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        ELSWIPELILSSDAKRTRETL++MQEQVSGF EAEVHFISSFYSIAAMDGQTA+HLQQVI NYSRNEI+TVMCMGHNKGWEEAASMFSGSSIKL+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A1S3B2X1 uncharacterized protein LOC1034852051.0e-11591.85Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+IQQHPQ FK+QH SSS R RNPIQW  AVIQTAESE+A EEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        ELSWIPELILSSDAKRTRETL++MQEQVSGFSEAEVHF+SSFYSIAAMDGQTA+HLQQVI +YSRNEI+TVMCMGHNKGWEEAASMFSGSSIKL+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A5A7SJV8 Uncharacterized protein1.0e-11591.85Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCN+IQQHPQ FK+QH SSS R RNPIQW  AVIQTAESE+A EEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        ELSWIPELILSSDAKRTRETL++MQEQVSGFSEAEVHF+SSFYSIAAMDGQTA+HLQQVI +YSRNEI+TVMCMGHNKGWEEAASMFSGSSIKL+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIVKPNSRS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A6J1BTP9 uncharacterized protein LOC1110052631.3e-11088.41Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MN SFCCNHIQQHPQFF+IQH+S S+RGRNPIQW  AVIQTAE + A EEA SQSESVARRLILLRHARSS QKLSLRDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        EL+WIPELILSSDAKRTRETL+IMQ +VS FSEAEVH ISSFYSIAAMDGQTAEHLQQVI NYSRNEIITVMCMGHN+GWEEAASMFSGSSI+L+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSWDEAFALAGLGGWKLHGIVKPN+ S
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

A0A6J1EGB9 uncharacterized protein At3g52155, chloroplastic isoform X14.6e-11390.56Show/hide
Query:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ
        MNASFCCNHI QHPQF KIQHASS  RGRN IQW   VIQTAES++A EEA SQSESVARRLILLRHARSSRQKLS+RDHDRPLSKDGKVDAIKIA KLQ
Subjt:  MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQ

Query:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA
        EL+WIPELILSSDAKRTRETLQIMQEQV GFSEAEVHFI SFYSIAAMDGQTAEHLQQVISNYSRN+IITVMCMGHNKGWEEAASMFSGSSIKL+TCNAA
Subjt:  ELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAA

Query:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS
        LLEASGKSW+EAFALAGLGGWKLHGIV+PN RS
Subjt:  LLEASGKSWDEAFALAGLGGWKLHGIVKPNSRS

SwissProt top hitse value%identityAlignment
Q94BY1 Uncharacterized protein At3g52155, chloroplastic2.3e-6970.16Show/hide
Query:  TAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFIS
        TA     ++ AAS S S++RRLILLRHA SS   LSLRDHDRPLSK G+ DA K+A  L  L W+P+LILSSDA RTRETL+ MQ QV GF EA VHFI 
Subjt:  TAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFIS

Query:  SFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS
        SFYSIAAMDGQTAEHLQ +IS YS  +I T+MCMGHNKGWEEAASM SG+SIKL+TCNAALL+A G SW+EAFAL+G GGWKL G+V P+S
Subjt:  SFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS

Arabidopsis top hitse value%identityAlignment
AT3G52155.1 Phosphoglycerate mutase family protein1.7e-7070.16Show/hide
Query:  TAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFIS
        TA     ++ AAS S S++RRLILLRHA SS   LSLRDHDRPLSK G+ DA K+A  L  L W+P+LILSSDA RTRETL+ MQ QV GF EA VHFI 
Subjt:  TAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELILSSDAKRTRETLQIMQEQVSGFSEAEVHFIS

Query:  SFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS
        SFYSIAAMDGQTAEHLQ +IS YS  +I T+MCMGHNKGWEEAASM SG+SIKL+TCNAALL+A G SW+EAFAL+G GGWKL G+V P+S
Subjt:  SFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGCTTCATTTTGCTGTAATCACATTCAACAACACCCACAATTCTTTAAAATTCAACATGCAAGCTCCTCCACGAGGGGGAGAAACCCTATTCAATGGCTACCTGC
AGTTATTCAGACAGCGGAGAGCGAATTGGCGAATGAGGAAGCTGCCTCTCAATCGGAATCTGTTGCTCGTCGCCTTATTCTGCTTCGTCATGCCAGGAGTTCGCGCCAAA
AGCTTTCACTGCGAGATCACGATCGCCCCTTGAGTAAAGATGGAAAAGTTGATGCTATTAAAATTGCTCATAAACTCCAAGAATTGAGTTGGATCCCTGAACTTATTTTA
TCCAGCGACGCCAAGCGCACAAGAGAAACACTTCAGATAATGCAGGAGCAAGTTAGTGGTTTTTCTGAAGCGGAGGTTCATTTCATTTCCAGTTTTTATTCCATTGCTGC
CATGGACGGCCAGACTGCGGAGCACCTTCAGCAGGTTATCAGTAATTATTCAAGGAATGAGATAATTACAGTCATGTGTATGGGACATAACAAAGGCTGGGAAGAGGCAG
CCTCAATGTTTAGTGGCTCCTCCATAAAACTGAGGACATGCAATGCTGCTTTGCTTGAGGCTTCTGGAAAATCTTGGGATGAGGCATTTGCTTTGGCGGGATTAGGTGGG
TGGAAGCTTCATGGCATAGTAAAACCAAATAGTAGATCATAG
mRNA sequenceShow/hide mRNA sequence
AGACAAACAAAAAATGATTAGGGAGTTCCAAAGCCTTATCTACTCGTCTTCTTCTTTGAGTTCGTCTACCTCTCTCGTTCAGTAGAACGAGGTTTCGAGCGAGGAACTCT
TGAGTTTCTCCTAGAGGTTGATTCAGTTTCTTGACAATTTAAGCATGGCAATACATTCCCATTGAGCAAACCAAACCAACTACGATGCATCAGGTATCCGATTAGGAAGA
ATCAACCACCGGTTATCAGTACTGTTTCAAGTTTTAAGCTCGCCGTCGCTTCTCCGAGCTCGGCTCTGTCGCCTCTTGGTCGAATCGAACACGCTTCATAAGTGATCGAA
AGATTCTGTACAATGAACGCTTCATTTTGCTGTAATCACATTCAACAACACCCACAATTCTTTAAAATTCAACATGCAAGCTCCTCCACGAGGGGGAGAAACCCTATTCA
ATGGCTACCTGCAGTTATTCAGACAGCGGAGAGCGAATTGGCGAATGAGGAAGCTGCCTCTCAATCGGAATCTGTTGCTCGTCGCCTTATTCTGCTTCGTCATGCCAGGA
GTTCGCGCCAAAAGCTTTCACTGCGAGATCACGATCGCCCCTTGAGTAAAGATGGAAAAGTTGATGCTATTAAAATTGCTCATAAACTCCAAGAATTGAGTTGGATCCCT
GAACTTATTTTATCCAGCGACGCCAAGCGCACAAGAGAAACACTTCAGATAATGCAGGAGCAAGTTAGTGGTTTTTCTGAAGCGGAGGTTCATTTCATTTCCAGTTTTTA
TTCCATTGCTGCCATGGACGGCCAGACTGCGGAGCACCTTCAGCAGGTTATCAGTAATTATTCAAGGAATGAGATAATTACAGTCATGTGTATGGGACATAACAAAGGCT
GGGAAGAGGCAGCCTCAATGTTTAGTGGCTCCTCCATAAAACTGAGGACATGCAATGCTGCTTTGCTTGAGGCTTCTGGAAAATCTTGGGATGAGGCATTTGCTTTGGCG
GGATTAGGTGGGTGGAAGCTTCATGGCATAGTAAAACCAAATAGTAGATCATAGAGCATTTTAAAGTTTTCTAGATAGTTTAGCACTTAGCAGTCACTCGTTGTACAAGA
ACTTTCACCCC
Protein sequenceShow/hide protein sequence
MNASFCCNHIQQHPQFFKIQHASSSTRGRNPIQWLPAVIQTAESELANEEAASQSESVARRLILLRHARSSRQKLSLRDHDRPLSKDGKVDAIKIAHKLQELSWIPELIL
SSDAKRTRETLQIMQEQVSGFSEAEVHFISSFYSIAAMDGQTAEHLQQVISNYSRNEIITVMCMGHNKGWEEAASMFSGSSIKLRTCNAALLEASGKSWDEAFALAGLGG
WKLHGIVKPNSRS