; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0719 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0719
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description4HBT domain-containing protein
Genome locationMC05:5703350..5705324
RNA-Seq ExpressionMC05g0719
SyntenyMC05g0719
Gene Ontology termsGO:0042372 - phylloquinone biosynthetic process (biological process)
GO:0005777 - peroxisome (cellular component)
GO:0061522 - 1,4-dihydroxy-2-naphthoyl-CoA thioesterase activity (molecular function)
InterPro domainsIPR003736 - Phenylacetic acid degradation-related domain
IPR006683 - Thioesterase domain
IPR029069 - HotDog domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582134.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1, partial [Cucurbita argyrosperma subsp. sororia]2.82e-7272.73Show/hide
Query:  PPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPV
        PPS    T   LD  LHA GF+I+HVSP RV+GRL VS  CCQPFKVLHGGVSALIAESLAS+GAH ASGY+RVAGIHLSINHLK+A +GD+VLAEA PV
Subjt:  PPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPV

Query:  SVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        +VG+TIQVWDV+LWK   E++V +S++RVTL+CNL+VPKHA+NAA+ALK FAKL
Subjt:  SVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

XP_004147638.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1 [Cucumis sativus]3.07e-7171.14Show/hide
Query:  SNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKT
        SN    LDAPL ++GF++ HVSPH+VSGRL VSP CCQPFKVLHGGVSALIAESLASMGAH ASGY+RVAGIHLSINHLK+A +G++V+AEA PV+VG+T
Subjt:  SNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKT

Query:  IQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        IQVWDV+LWK   E +V +S++RVTL+ N+ VPKH ++AADALK+F+KL
Subjt:  IQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

XP_022138150.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like [Momordica charantia]3.75e-110100Show/hide
Query:  MSTSENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDV
        MSTSENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDV
Subjt:  MSTSENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDV

Query:  VLAEATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        VLAEATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
Subjt:  VLAEATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

XP_022849382.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like isoform X1 [Olea europaea var. sylvestris]5.30e-7369.38Show/hide
Query:  NKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEA
        N++PPSSSS+  A LDAPLHAIGF+I+ +SPH+V+G L V+P CCQPFKVLHGGVSALIAE+LAS+GAH+ASG+RRVAGIHLSINHLK+A+ GD+VLAEA
Subjt:  NKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEA

Query:  TPVSVGKTIQVWDVRLWKGEG---ESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        TPV+VGKTIQVW+V LWK +    E +  ISSSRVTL+CN+ VP+ A++A + LK++AKL
Subjt:  TPVSVGKTIQVWDVRLWKGEG---ESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

XP_022955680.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like [Cucurbita moschata]5.68e-7272.73Show/hide
Query:  PPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPV
        PPS    T   LD  LHA GF+I+HVSP RV+GRL VS  CCQPFKVLHGGVSALIAESLAS+GAH ASGY+RVAGIHLSINHLK+A +GD+VLAEA PV
Subjt:  PPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPV

Query:  SVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        +VG+TIQVWDV+LWK   E++V +S++RVTL+CNL VPKHA+NAA+ALK FAKL
Subjt:  SVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

TrEMBL top hitse value%identityAlignment
A0A0A0L876 4HBT domain-containing protein1.49e-7171.14Show/hide
Query:  SNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKT
        SN    LDAPL ++GF++ HVSPH+VSGRL VSP CCQPFKVLHGGVSALIAESLASMGAH ASGY+RVAGIHLSINHLK+A +G++V+AEA PV+VG+T
Subjt:  SNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKT

Query:  IQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        IQVWDV+LWK   E +V +S++RVTL+ N+ VPKH ++AADALK+F+KL
Subjt:  IQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

A0A1S3AYE8 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like2.10e-7167.9Show/hide
Query:  MSTSENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDV
        MS+++N  PP         LDAPL + GF+I  VSPH+V+GRL VS  CCQPFKVLHGGVSALIAESLASMGAH ASGY+RVAGIHLSINHLK+A +G++
Subjt:  MSTSENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDV

Query:  VLAEATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        V+AEA PV+VG+TIQVWDV+LWK   E +V +S++RVTL+CN+ VPKH QNAADALK+F+KL
Subjt:  VLAEATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

A0A6J1C8M1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like1.82e-110100Show/hide
Query:  MSTSENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDV
        MSTSENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDV
Subjt:  MSTSENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDV

Query:  VLAEATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        VLAEATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
Subjt:  VLAEATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

A0A6J1GVR9 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like2.75e-7272.73Show/hide
Query:  PPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPV
        PPS    T   LD  LHA GF+I+HVSP RV+GRL VS  CCQPFKVLHGGVSALIAESLAS+GAH ASGY+RVAGIHLSINHLK+A +GD+VLAEA PV
Subjt:  PPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPV

Query:  SVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        +VG+TIQVWDV+LWK   E++V +S++RVTL+CNL VPKHA+NAA+ALK FAKL
Subjt:  SVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

A0A6J1IQ50 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like4.55e-7170.44Show/hide
Query:  SENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLA
        S  + PP S   T   LD  LHA GF+I+HVSP RV+GRL VS  CCQPFKVLHGGVSALIAESLAS+GAH ASGY+RVAGIHLSINHLK+A +GD+V A
Subjt:  SENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLA

Query:  EATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        EA PV+VG+TIQVWDV+LWK   E++V +S++RVTL+CNL VPKHA NAA+ALK FAKL
Subjt:  EATPVSVGKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

SwissProt top hitse value%identityAlignment
P45083 Putative esterase HI_11611.3e-1034.21Show/hide
Query:  IGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASM-GAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDVRLWKGE
        +G +I       +   +PV     QPF VLHGGVS  +AE++ S+ G+      + V G+ ++ NHL+    G V  A ATP+++G+ IQVW + +    
Subjt:  IGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASM-GAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDVRLWKGE

Query:  GESRVYISSSRVTL
         E       SR+TL
Subjt:  GESRVYISSSRVTL

P77781 1,4-dihydroxy-2-naphthoyl-CoA hydrolase3.8e-1030.43Show/hide
Query:  IGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVAS-GYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDVRLWKGE
        +  + EH+    +   +PV     QPF +LHGG S ++AES+ S+  ++ + G ++V G+ ++ NH+++A  G  V     P+ +G   QVW + ++  +
Subjt:  IGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVAS-GYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDVRLWKGE

Query:  GESRVYISSSRVTLI
        G  R+  SS   T I
Subjt:  GESRVYISSSRVTLI

Q9FI76 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 21.3e-4257.34Show/hide
Query:  LDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDV
        +D PL  +GF  + +S  RVSG L ++  CCQPFKVLHGGVSALIAE+LAS+GA +ASG++RVAGIHLSI+HL+ A +G++V AE+ PVSVGK IQVW+V
Subjt:  LDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDV

Query:  RLWKGE----GESRVYISSSRVTLICNLSVPKHAQNAADALKR
        RLWK +     ++++ +S+SRVTL C L +P H ++A D LK+
Subjt:  RLWKGE----GESRVYISSSRVTLICNLSVPKHAQNAADALKR

Q9I3A4 Putative esterase PA16184.9e-1032.77Show/hide
Query:  SSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAH--VASGYRRVAGIHLSINHLKAAEVGDVVLAEATPV
        +S  N+  DL      +G + E      ++  +PV     QPF +LHGG S ++AESL SM ++  V +      G+ ++ NHL+    G V  A A  +
Subjt:  SSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAH--VASGYRRVAGIHLSINHLKAAEVGDVVLAEATPV

Query:  SVGKTIQVWDVRLWKGEGE
         +G+T  VWD+RL   +G+
Subjt:  SVGKTIQVWDVRLWKGEGE

Q9SX65 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 13.3e-5468.39Show/hide
Query:  SSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSV
        S+SSNT A +D PLH +GF+ + +SP R++GRLPVSP CCQPFKVLHGGVSALIAESLASMGAH+ASG++RVAGI LSINHLK+A++GD+V AEATPVS 
Subjt:  SSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSV

Query:  GKTIQVWDVRLWK---GEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        GKTIQVW+V+LWK    +  +++ ISSSRVTLICNL +P +A++AA+ LK  AKL
Subjt:  GKTIQVWDVRLWK---GEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

Arabidopsis top hitse value%identityAlignment
AT1G48320.1 Thioesterase superfamily protein2.3e-5568.39Show/hide
Query:  SSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSV
        S+SSNT A +D PLH +GF+ + +SP R++GRLPVSP CCQPFKVLHGGVSALIAESLASMGAH+ASG++RVAGI LSINHLK+A++GD+V AEATPVS 
Subjt:  SSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSV

Query:  GKTIQVWDVRLWK---GEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL
        GKTIQVW+V+LWK    +  +++ ISSSRVTLICNL +P +A++AA+ LK  AKL
Subjt:  GKTIQVWDVRLWK---GEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL

AT5G48950.1 Thioesterase superfamily protein9.2e-4457.34Show/hide
Query:  LDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDV
        +D PL  +GF  + +S  RVSG L ++  CCQPFKVLHGGVSALIAE+LAS+GA +ASG++RVAGIHLSI+HL+ A +G++V AE+ PVSVGK IQVW+V
Subjt:  LDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDV

Query:  RLWKGE----GESRVYISSSRVTLICNLSVPKHAQNAADALKR
        RLWK +     ++++ +S+SRVTL C L +P H ++A D LK+
Subjt:  RLWKGE----GESRVYISSSRVTLICNLSVPKHAQNAADALKR

AT5G48950.2 Thioesterase superfamily protein1.2e-3061.39Show/hide
Query:  LDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDV
        +D PL  +GF  + +S  RVSG L ++  CCQPFKVLHGGVSALIAE+LAS+GA +ASG++RVAGIHLSI+HL+ A +G++V AE+ PVSVGK IQ  D+
Subjt:  LDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSVGKTIQVWDV

Query:  R
        +
Subjt:  R


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACGTCGGAGAACAAGAAGCCGCCGTCCTCCTCCTCCAACACGGCGGCTGATCTCGACGCTCCGCTTCACGCCATCGGATTTCAGATCGAACACGTATCGCCTCA
CAGAGTCAGCGGCCGTCTCCCGGTTTCCCCAAATTGCTGCCAGCCGTTTAAAGTGCTGCACGGAGGAGTATCGGCGCTGATTGCGGAGTCGCTGGCGAGTATGGGCGCTC
ACGTCGCCTCCGGTTACCGGAGAGTGGCCGGAATCCATCTCAGCATCAACCACTTGAAGGCGGCGGAGGTCGGCGACGTCGTTTTGGCAGAAGCGACTCCGGTCTCCGTC
GGCAAAACCATTCAGGTATGGGATGTACGATTATGGAAGGGTGAGGGAGAAAGTAGAGTCTATATTTCCTCATCAAGGGTGACTCTCATATGCAATTTGTCTGTGCCAAA
ACATGCCCAAAATGCCGCTGATGCCCTCAAAAGGTTTGCAAAATTGTGA
mRNA sequenceShow/hide mRNA sequence
AATATAAATTACCTTTGCATTACGTTTAAGCAACCTCACCAATAAAATAGCGATAGTGAGAGATATTTTCAGGTAAGGAAAATAAAAGAAAATTTGCAAAAGAAGCTAGA
AAAATATTAACAAAGTGTTCAAAAAACATTTTTGGATTAAATTTGGTAAAGCATAAACTTTTTTGGTTTAATTTTTACAATTACGTCGTGGTTTCCATTATACTTCTACG
TTTATAAAATATAATAAGACCGGAAGACAAGTCGTATGTGGGGCATCCGATTCTCTAATTAAATTCCGCGTCTCGTCTCCAGTCTTCTCCACCAACTCCTCTCCTCTCCG
ATCTAATCCGCAAAAATCGTCCGCTCCGGACGTCGCACTCCGATCCAAAAAATGTCTACGTCGGAGAACAAGAAGCCGCCGTCCTCCTCCTCCAACACGGCGGCTGATCT
CGACGCTCCGCTTCACGCCATCGGATTTCAGATCGAACACGTATCGCCTCACAGAGTCAGCGGCCGTCTCCCGGTTTCCCCAAATTGCTGCCAGCCGTTTAAAGTGCTGC
ACGGAGGAGTATCGGCGCTGATTGCGGAGTCGCTGGCGAGTATGGGCGCTCACGTCGCCTCCGGTTACCGGAGAGTGGCCGGAATCCATCTCAGCATCAACCACTTGAAG
GCGGCGGAGGTCGGCGACGTCGTTTTGGCAGAAGCGACTCCGGTCTCCGTCGGCAAAACCATTCAGGTATGGGATGTACGATTATGGAAGGGTGAGGGAGAAAGTAGAGT
CTATATTTCCTCATCAAGGGTGACTCTCATATGCAATTTGTCTGTGCCAAAACATGCCCAAAATGCCGCTGATGCCCTCAAAAGGTTTGCAAAATTGTGATAAAATTATT
GTACTATATAGTATATATCTATATTTATTATATTTCCCACTTTAGAGGGTTATTTTGTATTTTAACTTATACTAGAAATAAATCAATCATACTTTAGACCATCTTTCATT
ATGGAATCTTAAAACTAAAATTTTT
Protein sequenceShow/hide protein sequence
MSTSENKKPPSSSSNTAADLDAPLHAIGFQIEHVSPHRVSGRLPVSPNCCQPFKVLHGGVSALIAESLASMGAHVASGYRRVAGIHLSINHLKAAEVGDVVLAEATPVSV
GKTIQVWDVRLWKGEGESRVYISSSRVTLICNLSVPKHAQNAADALKRFAKL