CuGenDBv2

MC01g1247 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g1247
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Phosphoglycerate mutase family protein
Genome location: MC01:17483992..17487763
RNA-Seq Expression: MC01g1247
Synteny: MC01g1247
Gene Ontology terms: NA
InterPro domains: IPR013078 - Histidine phosphatase superfamily, clade-1
                  IPR029033 - Histidine phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_004134818.1 uncharacterized protein At3g52155, chloroplastic [Cucumis sativus] | e value: 7.09e-141 | % identity: 86.75
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MN SFCCN+IQQHPQ F++QH S S RGRNPIQW SAVIQTAE + ATEEA SQSESVARRLILLRHARSS QKLS+RDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        EL+WIPELILSS DAKRTRETLK+MQ +VS F EAEVH ISSFYSIAAMDGQTA+HLQQVICNYSRNEI+TVMCMGHN+GWEEAASMFSGSSI+LKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
        ALLEASGKSWDEAFALAGLGGWKLHGIVKPN+ S
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS

XP_008440939.1 PREDICTED: uncharacterized protein LOC103485205 [Cucumis melo] | e value: 1.01e-140 | % identity: 86.32
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MN SFCCN+IQQHPQ F++QH S S R RNPIQWPSAVIQTAE + ATEEA SQSESVARRLILLRHARSS QKLSLRDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        EL+WIPELILSS DAKRTRETLK+MQ +VS FSEAEVH +SSFYSIAAMDGQTA+HLQQVIC+YSRNEI+TVMCMGHN+GWEEAASMFSGSSI+LKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
        ALLEASGKSW+EAFALAGLGGWKLHGIVKPN+ S
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS

XP_022132392.1 uncharacterized protein LOC111005263 [Momordica charantia] | e value: 1.95e-163 | % identity: 99.57
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        ELNWIPELILSS DAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
        ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS

XP_022926854.1 uncharacterized protein At3g52155, chloroplastic isoform X1 [Cucurbita moschata] | e value: 5.57e-138 | % identity: 86.58
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MN SFCCNHI QHPQF +IQH+S  +RGRN IQWPS VIQTAE Q ATEEA SQSESVARRLILLRHARSS QKLS+RDHDRPLSKDGK DAIKIA+KLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        ELNWIPELILSS DAKRTRETL+IMQ +V  FSEAEVH I SFYSIAAMDGQTAEHLQQVI NYSRN+IITVMCMGHN+GWEEAASMFSGSSI+LKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPN
        ALLEASGKSW+EAFALAGLGGWKLHGIV+PN
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPN

XP_038881679.1 uncharacterized protein At3g52155, chloroplastic [Benincasa hispida] | e value: 1.10e-144 | % identity: 89.74
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MN SFCCNHIQQHPQ F+ QH S S R RNPIQWPSAVIQTAE Q ATEEA SQSESVARRLILLRHARSS QKLSLRDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        EL+WIPELILSS DAKRTRETLKIMQ++VS FSEAEVH ISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHN+GWEEAASMFSGSSI+LKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
        ALLEASGKSWDEAFALAGLGGWKLHGIVKPN+ S
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KHM4 Uncharacterized protein | e value: 3.43e-141 | % identity: 86.75
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MN SFCCN+IQQHPQ F++QH S S RGRNPIQW SAVIQTAE + ATEEA SQSESVARRLILLRHARSS QKLS+RDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        EL+WIPELILSS DAKRTRETLK+MQ +VS F EAEVH ISSFYSIAAMDGQTA+HLQQVICNYSRNEI+TVMCMGHN+GWEEAASMFSGSSI+LKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
        ALLEASGKSWDEAFALAGLGGWKLHGIVKPN+ S
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS

A0A1S3B2X1 uncharacterized protein LOC103485205 | e value: 4.87e-141 | % identity: 86.32
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MN SFCCN+IQQHPQ F++QH S S R RNPIQWPSAVIQTAE + ATEEA SQSESVARRLILLRHARSS QKLSLRDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        EL+WIPELILSS DAKRTRETLK+MQ +VS FSEAEVH +SSFYSIAAMDGQTA+HLQQVIC+YSRNEI+TVMCMGHN+GWEEAASMFSGSSI+LKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
        ALLEASGKSW+EAFALAGLGGWKLHGIVKPN+ S
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS

A0A5A7SJV8 Uncharacterized protein | e value: 4.87e-141 | % identity: 86.32
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MN SFCCN+IQQHPQ F++QH S S R RNPIQWPSAVIQTAE + ATEEA SQSESVARRLILLRHARSS QKLSLRDHDRPLSKDGK DAIKIA KLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        EL+WIPELILSS DAKRTRETLK+MQ +VS FSEAEVH +SSFYSIAAMDGQTA+HLQQVIC+YSRNEI+TVMCMGHN+GWEEAASMFSGSSI+LKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
        ALLEASGKSW+EAFALAGLGGWKLHGIVKPN+ S
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS

A0A6J1BTP9 uncharacterized protein LOC111005263 | e value: 9.46e-164 | % identity: 99.57
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        ELNWIPELILSS DAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
        ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPNNSS

A0A6J1EGB9 uncharacterized protein At3g52155, chloroplastic isoform X1 | e value: 2.70e-138 | % identity: 86.58
Query:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ
        MN SFCCNHI QHPQF +IQH+S  +RGRN IQWPS VIQTAE Q ATEEA SQSESVARRLILLRHARSS QKLS+RDHDRPLSKDGK DAIKIA+KLQ
Subjt:  MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQ

Query:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA
        ELNWIPELILSS DAKRTRETL+IMQ +V  FSEAEVH I SFYSIAAMDGQTAEHLQQVI NYSRN+IITVMCMGHN+GWEEAASMFSGSSI+LKTCNA
Subjt:  ELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNA

Query:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPN
        ALLEASGKSW+EAFALAGLGGWKLHGIV+PN
Subjt:  ALLEASGKSWDEAFALAGLGGWKLHGIVKPN

SwissProt top hits (e value, % identity, alignment)
Q94BY1 Uncharacterized protein At3g52155, chloroplastic | e value: 5.8e-68 | % identity: 68.39
Query:  TAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLI
        TA  +A ++ A S S S++RRLILLRHA SSW  LSLRDHDRPLSK G+ADA K+A+ L  L W+P+LIL SSDA RTRETLK MQ +V  F EA VH I
Subjt:  TAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLI

Query:  SSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNNS
         SFYSIAAMDGQTAEHLQ +I  YS  +I T+MCMGHN+GWEEAASM SG+SI+LKTCNAALL+A G SW+EAFAL+G GGWKL G+V P++S
Subjt:  SSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNNS

Arabidopsis top hits (e value, % identity, alignment)
AT3G52155.1 Phosphoglycerate mutase family protein | e value: 4.1e-69 | % identity: 68.39
Query:  TAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLI
        TA  +A ++ A S S S++RRLILLRHA SSW  LSLRDHDRPLSK G+ADA K+A+ L  L W+P+LIL SSDA RTRETLK MQ +V  F EA VH I
Subjt:  TAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQELNWIPELILSSSDAKRTRETLKIMQNEVSSFSEAEVHLI

Query:  SSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNNS
         SFYSIAAMDGQTAEHLQ +I  YS  +I T+MCMGHN+GWEEAASM SG+SI+LKTCNAALL+A G SW+EAFAL+G GGWKL G+V P++S
Subjt:  SSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNAALLEASGKSWDEAFALAGLGGWKLHGIVKPNNS
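
The % identity values reported for the hits above can be cross-checked against the alignment midlines. In BLAST-style output a letter on the midline marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or gap. For the XP_022132392.1 hit, for example, the midline appears to contain a single blank across 234 aligned columns, and 233/234 is about 99.57. The sketch below is a minimal illustration, not the database's own method; the function name `percent_identity` is hypothetical, and it assumes each midline segment has already been trimmed to the aligned columns (the indentation under "Query:" removed).

```python
def percent_identity(midline_segments):
    """Percent identity computed from BLAST-style midline segments.

    Assumption: every segment covers exactly the aligned columns, with the
    leading indentation that lines it up under "Query:" already removed.
    Letters count as identities; '+' and blanks do not.
    """
    midline = "".join(midline_segments)
    if not midline:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Toy midline: 10 aligned columns, of which 8 are identical residues.
toy_midline = ["MN SFCCN+I"]
print(percent_identity(toy_midline))
```

Passing the three midline segments of a hit as a list gives the value for the whole alignment, since the denominator is the total number of aligned columns.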


Sequences
CDS sequence
ATGAACGGATCATTTTGCTGTAATCATATACAACAACATCCACAATTCTTTAGAATTCAACATTCGAGCTTCTCCTCGAGGGGAAGAAACCCTATTCAATGGCCATCTGC
AGTTATACAGACGGCGGAAGGCCAAGCGGCGACCGAGGAAGCTGACTCCCAATCGGAATCAGTTGCTCGTCGCCTTATTCTGCTTCGTCATGCCAGGAGTTCGTGGCAAA
AGCTCTCACTACGAGATCATGATCGCCCATTGAGTAAAGATGGAAAGGCTGATGCTATTAAAATTGCTCGTAAACTCCAAGAATTGAATTGGATCCCTGAACTTATTTTA
TCCAGCAGTGACGCCAAGCGAACCAGAGAAACGCTTAAGATAATGCAGAATGAAGTTAGCAGTTTTTCTGAAGCAGAGGTGCATCTCATTTCCAGTTTTTATTCCATTGC
TGCCATGGATGGTCAGACTGCGGAGCACCTTCAACAGGTTATCTGTAATTATTCGCGGAATGAGATAATTACAGTCATGTGTATGGGACATAATAGAGGGTGGGAAGAGG
CAGCCTCAATGTTTAGTGGCTCCTCCATTGAACTGAAGACATGCAATGCTGCTTTGCTTGAGGCTTCAGGAAAATCATGGGACGAGGCATTTGCTTTGGCAGGACTAGGT
GGGTGGAAGCTTCATGGCATAGTAAAACCAAATAACAGTTCATAG
mRNA sequence
CAGATTTGAGCTAAATCCGGGTGTTAAAAAATAGAAATGGCAGACAAAAAATACCATAAGCCTTATCTACTCGTCGTCTGCTGCTACGCCTGCAAGTTCGAATATGAATT
CCATTTTATTTTCTTGGAGAACTTAAGCATGGCAGTACCAGTACAGCAACCCAAATAGGCCGCTGGAGGCATTCCATTAGGGAGATTCGACAACGGAACCGATCACCAGT
ACTGTTTCAAGCTTCAAGTTCGCCGTCGCTTCTCCGACCTCAATCGGCCCTGTCGCCTCAGGTTGAAAGCACTTCATATAACTAATTGAAAGATTCTGTACAATGAACGG
ATCATTTTGCTGTAATCATATACAACAACATCCACAATTCTTTAGAATTCAACATTCGAGCTTCTCCTCGAGGGGAAGAAACCCTATTCAATGGCCATCTGCAGTTATAC
AGACGGCGGAAGGCCAAGCGGCGACCGAGGAAGCTGACTCCCAATCGGAATCAGTTGCTCGTCGCCTTATTCTGCTTCGTCATGCCAGGAGTTCGTGGCAAAAGCTCTCA
CTACGAGATCATGATCGCCCATTGAGTAAAGATGGAAAGGCTGATGCTATTAAAATTGCTCGTAAACTCCAAGAATTGAATTGGATCCCTGAACTTATTTTATCCAGCAG
TGACGCCAAGCGAACCAGAGAAACGCTTAAGATAATGCAGAATGAAGTTAGCAGTTTTTCTGAAGCAGAGGTGCATCTCATTTCCAGTTTTTATTCCATTGCTGCCATGG
ATGGTCAGACTGCGGAGCACCTTCAACAGGTTATCTGTAATTATTCGCGGAATGAGATAATTACAGTCATGTGTATGGGACATAATAGAGGGTGGGAAGAGGCAGCCTCA
ATGTTTAGTGGCTCCTCCATTGAACTGAAGACATGCAATGCTGCTTTGCTTGAGGCTTCAGGAAAATCATGGGACGAGGCATTTGCTTTGGCAGGACTAGGTGGGTGGAA
GCTTCATGGCATAGTAAAACCAAATAACAGTTCATAGAACGTCTTAAAAGTTTTTCTAGAGTTCTATCTATAGAATTTTTCCCTGGTGGGTTTGACAAAAATTCGACTGG
TCGGCCTTGTGCATATTTAGCACCGAAAAGTTTTACCATTTAACTTTTTCCTCTTAAGTCAGGATTCTTTGTGTCGGAATTAGAAAGCATCTTCATACATCAACAGTGTG
TTTTGATTTTGAATGGTTTTTTCTTTGATAGTTAAAAAGTGCTTGTTCGCTCAGCAGTAATTAAGAACTTGTTAAACTGCTCTTGGAGGATATTATTGTTAAGTGAAGTA
TTTTAACATGATTGTATAATAAGTGTTCAATATTTAAAATGCTACTCTGAAAGTAC
Protein sequence
MNGSFCCNHIQQHPQFFRIQHSSFSSRGRNPIQWPSAVIQTAEGQAATEEADSQSESVARRLILLRHARSSWQKLSLRDHDRPLSKDGKADAIKIARKLQELNWIPELIL
SSSDAKRTRETLKIMQNEVSSFSEAEVHLISSFYSIAAMDGQTAEHLQQVICNYSRNEIITVMCMGHNRGWEEAASMFSGSSIELKTCNAALLEASGKSWDEAFALAGLG
GWKLHGIVKPNNSS
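
The CDS and protein sequences above should correspond under the standard genetic code: the CDS opens with ATG AAC GGA TCA (M N G S) and closes with AGT TCA TAG (S S, then a stop). The following is a minimal sketch for checking that correspondence, assuming the standard nuclear codon table; the helper name `translate` is hypothetical and not part of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order for each
# of the three positions ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence wrapped over
    several lines can be pasted in directly. Codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First four codons of the CDS listed above.
print(translate("ATGAACGGATCA"))
```

Translating the full CDS and comparing the result with the listed protein sequence (after joining its wrapped lines) is a quick way to confirm the two records agree.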