; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020028 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020028
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionThiamine thiazole synthase, chloroplastic
Genome locationtig00153446:1164418..1165907
RNA-Seq ExpressionSgr020028
SyntenySgr020028
Gene Ontology termsGO:0009228 - thiamine biosynthetic process (biological process)
GO:0052837 - thiazole biosynthetic process (biological process)
GO:0005829 - cytosol (cellular component)
GO:0009570 - chloroplast stroma (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016763 - transferase activity, transferring pentosyl groups (molecular function)
InterPro domainsIPR002922 - Thiazole biosynthetic enzyme Thi4 family
IPR027495 - Thiamine thiazole synthase
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031400.1 Thiamine thiazole synthase 2, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]2.3e-16686.89Show/hide
Query:  MASIASTLTTKLQRPSLFDSS---SSFHGTPLAPLPSLRLHPIQSNTGASLSISASAS-PPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIV
        MAS+ASTLTTKLQ PS   SS   SSFHGTPLAP PSLRL P    T AS SISASAS   PYDLN F FNPIKESIVSREMTRRYMTDMITYADTDV++
Subjt:  MASIASTLTTKLQRPSLFDSS---SSFHGTPLAPLPSLRLHPIQSNTGASLSISASAS-PPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIV

Query:  VGAGSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAE
        VGAGSAGLSCAYELSKNP +R+AIIEQSVSPGGGAWLGGQLFSAMVVRKPAH FLDE+GVEYDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAE
Subjt:  VGAGSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAE

Query:  DLIVKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVA
        DLIVKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTRE+VPGMIVTGMEVA
Subjt:  DLIVKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVA

Query:  EIDGAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVAD
        EIDGAPRMGPTFGAMMISGQKAAHLALKSLGLAN            IEDE A L+LA+ ESPEVAD
Subjt:  EIDGAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVAD

XP_004152916.1 thiamine thiazole synthase, chloroplastic [Cucumis sativus]9.5e-16886.85Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG
        MASIASTLTTKLQ+PSL    SSFHGTPLA   SLRL   +S    SL+ISASAS PPYDLN+F FNPI+ESIVSREMTRRYMTDMITYADTDV++VGAG
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
        SAGLSCAYELSKNP++R+AIIEQSVSPGGGAWLGGQLFSAMVVRKPAH FLDE+GVEYDEQD+YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
Subjt:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV

Query:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG
        KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVR TRE+VPGMIVTGMEVAEIDG
Subjt:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG

Query:  APRMGPTFGAMMISGQKAAHLALKSLGLANAI--DGRRKEHAVVIEDEEAGLLLASAESPEVADA
        APRMGPTFGAMMISGQKAAHLALKSLG AN+I  +G   E  V +E+EE  LLLA+ ESPEVADA
Subjt:  APRMGPTFGAMMISGQKAAHLALKSLGLANAI--DGRRKEHAVVIEDEEAGLLLASAESPEVADA

XP_008463440.1 PREDICTED: thiamine thiazole synthase, chloroplastic [Cucumis melo]1.2e-16787.12Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG
        MASIASTLTTKLQ PSL    SSFHGTPLA  PSLRL   +S    SL+ISASAS PPYDLN+F FNPI+ESIVSREMTRRYMTDMITYADTDVI+VGAG
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
        SAGLSCAYELSKNP++R+AIIEQSVSPGGGAWLGGQLFSAMVVRKPAH FLDE+GVEYDEQD+YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
Subjt:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV

Query:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG
        KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIG+IDSVPGMKALDMNTAEDAIVR TRE+VPGMIVTGMEVAEIDG
Subjt:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG

Query:  APRMGPTFGAMMISGQKAAHLALKSLGLANAI--DGRRKEHAVVIEDEEAGLLLASAESPEVADA
        APRMGPTFGAMMISGQKAAHLALKSLG AN+I  +G   E  V +E+EE  LLLA+ ESPEVADA
Subjt:  APRMGPTFGAMMISGQKAAHLALKSLGLANAI--DGRRKEHAVVIEDEEAGLLLASAESPEVADA

XP_022985241.1 thiamine thiazole synthase, chloroplastic-like [Cucurbita maxima]1.2e-16585.95Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG
        MASIAS+L TKL RPSL D  SSFHG+PL P PS RL P  +   ASLSISASA+ PPYDLN+F FNPI+ESIVSREMTRRYMTDMITYADTDVIVVGAG
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
        SAGLSCAYELSK+PSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDEL VEYDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
Subjt:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV

Query:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG
        K GRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVR TRE+VPGMIVTGMEVAEIDG
Subjt:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG

Query:  APRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA
        APRMGPTFGAMMISGQKAAHLALK+LG AN+ID + +     +E  E  ++LA+ E+PEVADA
Subjt:  APRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA

XP_038905461.1 thiamine thiazole synthase, chloroplastic [Benincasa hispida]3.0e-16988.49Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGA-SLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGA
        MASIASTLTTKLQRPSL    SSF+GTPLAP PSLRL    + T A SLSISASAS PPYDLN+F FNPIKESIVSREMTRRYMTDMITYADTDVI+VGA
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGA-SLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGA

Query:  GSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI
        GSAGLSCAYELSKNPS+R+AIIEQSVSPGGGAWLGGQLFSAMVVRKPAH FLDE+GVEYDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI
Subjt:  GSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI

Query:  VKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEID
        VKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIG+IDSVPGMKALDMNTAEDAIVR TRE+VPGMIVTGMEVAEID
Subjt:  VKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEID

Query:  GAPRMGPTFGAMMISGQKAAHLALKSLGLANAI-DGRRKEHAVVIEDEEAGLLLASAESPEVADA
        GAPRMGPTFGAMMISGQKAAHLALKSLG AN I DG   E  VV   EE  LLLA+ ESPEVADA
Subjt:  GAPRMGPTFGAMMISGQKAAHLALKSLGLANAI-DGRRKEHAVVIEDEEAGLLLASAESPEVADA

TrEMBL top hitse value%identityAlignment
A0A0A0L546 Thiamine thiazole synthase, chloroplastic4.6e-16886.85Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG
        MASIASTLTTKLQ+PSL    SSFHGTPLA   SLRL   +S    SL+ISASAS PPYDLN+F FNPI+ESIVSREMTRRYMTDMITYADTDV++VGAG
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
        SAGLSCAYELSKNP++R+AIIEQSVSPGGGAWLGGQLFSAMVVRKPAH FLDE+GVEYDEQD+YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
Subjt:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV

Query:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG
        KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVR TRE+VPGMIVTGMEVAEIDG
Subjt:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG

Query:  APRMGPTFGAMMISGQKAAHLALKSLGLANAI--DGRRKEHAVVIEDEEAGLLLASAESPEVADA
        APRMGPTFGAMMISGQKAAHLALKSLG AN+I  +G   E  V +E+EE  LLLA+ ESPEVADA
Subjt:  APRMGPTFGAMMISGQKAAHLALKSLGLANAI--DGRRKEHAVVIEDEEAGLLLASAESPEVADA

A0A1S3CJ81 Thiamine thiazole synthase, chloroplastic6.0e-16887.12Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG
        MASIASTLTTKLQ PSL    SSFHGTPLA  PSLRL   +S    SL+ISASAS PPYDLN+F FNPI+ESIVSREMTRRYMTDMITYADTDVI+VGAG
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
        SAGLSCAYELSKNP++R+AIIEQSVSPGGGAWLGGQLFSAMVVRKPAH FLDE+GVEYDEQD+YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
Subjt:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV

Query:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG
        KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIG+IDSVPGMKALDMNTAEDAIVR TRE+VPGMIVTGMEVAEIDG
Subjt:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG

Query:  APRMGPTFGAMMISGQKAAHLALKSLGLANAI--DGRRKEHAVVIEDEEAGLLLASAESPEVADA
        APRMGPTFGAMMISGQKAAHLALKSLG AN+I  +G   E  V +E+EE  LLLA+ ESPEVADA
Subjt:  APRMGPTFGAMMISGQKAAHLALKSLGLANAI--DGRRKEHAVVIEDEEAGLLLASAESPEVADA

A0A6J1EVU2 Thiamine thiazole synthase, chloroplastic1.8e-16485.4Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG
        MASIAS+L  KL RPSL +  SSFHG+PL P PS RL P  +   ASLSISASA+ PPYDLN+F FNPI+ESIVSREMTRRYMTDMITYADTDVIVVGAG
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
        SAGLSCAYELSK+PSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDEL VEYDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
Subjt:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV

Query:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG
        K GRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMN+AEDAIVR TRE+VPGMIVTGMEVAEIDG
Subjt:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG

Query:  APRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA
        APRMGPTFGAMMISGQKAAHLALK+LG  N+IDG         E E A ++LA+ E+PEVADA
Subjt:  APRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA

A0A6J1FWL6 Thiamine thiazole synthase, chloroplastic1.2e-16385.95Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLA-PLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGA
        MAS+ASTLTTKLQ PS     SSFHGTPLA P PSLRL     +T A+ SISAS    PYDLN F FNPIKESIVSREMTRRYMTDMITYADTDV++VGA
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLA-PLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGA

Query:  GSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI
        GSAGLSCAYELSKNP +R+AIIEQSVSPGGGAWLGGQLFSAMVVRKPAH FLDE+GVEYDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI
Subjt:  GSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI

Query:  VKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEID
        VKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTRE+VPGMIVTGMEVAEID
Subjt:  VKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEID

Query:  GAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVAD
        GAPRMGPTFGAMMISGQKAAHLALKSLGLAN            IEDE A L+LA+ ESPEVAD
Subjt:  GAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVAD

A0A6J1J7L4 Thiamine thiazole synthase, chloroplastic5.6e-16685.95Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG
        MASIAS+L TKL RPSL D  SSFHG+PL P PS RL P  +   ASLSISASA+ PPYDLN+F FNPI+ESIVSREMTRRYMTDMITYADTDVIVVGAG
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
        SAGLSCAYELSK+PSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDEL VEYDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
Subjt:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV

Query:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG
        K GRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVR TRE+VPGMIVTGMEVAEIDG
Subjt:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG

Query:  APRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA
        APRMGPTFGAMMISGQKAAHLALK+LG AN+ID + +     +E  E  ++LA+ E+PEVADA
Subjt:  APRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA

SwissProt top hitse value%identityAlignment
F6H7K5 Thiamine thiazole synthase 2, chloroplastic8.7e-15680.05Show/hide
Query:  MASIASTLT--TKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQ-SNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVV
        MAS+A+TLT  +   +P+ FD+ SSFHG+P+    S R+ PI+ S+   ++S+S   S  PYDL  F F PIKESIV+REMTRRYM DMITYADTDV++V
Subjt:  MASIASTLT--TKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQ-SNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVV

Query:  GAGSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAED
        GAGSAGLSCAYELSKNPS+RVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAH+FLDELG+EYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAED
Subjt:  GAGSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAED

Query:  LIVKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAE
        LIVK  RV GVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAE
Subjt:  LIVKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAE

Query:  IDGAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA
        IDGAPRMGPTFGAMMISGQKAAHLAL++LG  NAIDG   E     E  +  L+LA+AE+ E+ DA
Subjt:  IDGAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA

F6H9A9 Thiamine thiazole synthase 1, chloroplastic7.6e-15278.3Show/hide
Query:  MASIASTLTTKLQRPSLFD-SSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGA
        MA++ S++ +K  + S+FD   SSFHG P+A     RL P++S T  +L+++A+A   PYDL  F F PIKESIVSREMTRRYM DMITYADTDV+VVGA
Subjt:  MASIASTLTTKLQRPSLFD-SSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGA

Query:  GSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI
        GSAGLSCAYELSKNPSV+VAIIEQSVSPGGGAWLGGQLFS+MVVRKPAH FLDELG+EYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI
Subjt:  GSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI

Query:  VKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEID
        +K G+VGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRL+S+GMIDSVPGMKALDMNTAED IVRLTRE+VPGMIVTGMEVAEID
Subjt:  VKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEID

Query:  GAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA
        G+PRMGPTFGAMMISGQKAAHLALKSLGL NA+DG       +       +L A+A++ E+A+A
Subjt:  GAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA

O23787 Thiamine thiazole synthase, chloroplastic3.8e-15981.82Show/hide
Query:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG
        MAS A   +       LFD  SSFHG P++P   LRL PI+S+   +LSISASAS PPYDLN F F+PIKESIVSREMTRRYMTDMITYADTDV+VVGAG
Subjt:  MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
        SAGLSCAYELSKNP++++AIIEQSVSPGGGAWLGGQLFSAMVVRKPAH FLDELG++YDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV
Subjt:  SAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIV

Query:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG
        KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMI+ VPGMKALDMN+AEDAIVRLTRE+VPGMIVTGMEVAEIDG
Subjt:  KGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDG

Query:  APRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA
        APRMGPTFGAMMISGQKAAHLALKSLG  NA+DG        +      L+LA+A+S E ADA
Subjt:  APRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA

Q38709 Thiamine thiazole synthase, chloroplastic7.1e-15882.87Show/hide
Query:  IASTLTTKLQRPSLFD-SSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASP-PPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAGS
        +ASTLT+K QR +LF+ S+SSF+GTPLAP  S+R+ P  +  GA  SIS S +P PPYDL  FTF+PIKESIVSREMTRRYM DMITYADTDV+VVGAGS
Subjt:  IASTLTTKLQRPSLFD-SSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASP-PPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAGS

Query:  AGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK
        +GL C YELSKNPSV+VAIIEQSVSPGGGAWLGGQLFS MVVRKPAH FLDELG+EYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK
Subjt:  AGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK

Query:  GGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDGA
        GGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVK L+SIGMID+VPGMKALDMN AEDAIVRLTREIVPGMIVTGMEVAEIDGA
Subjt:  GGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDGA

Query:  PRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA
        PRMGPTFGAMMISGQKAAHLALK+LGL NA+DG    +   I  E   L+LA+A+S E+ADA
Subjt:  PRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA

Q38814 Thiamine thiazole synthase, chloroplastic1.0e-14877.26Show/hide
Query:  MASIASTLTTKLQRPS-LFDSSSSFHGTPLAPLP-SLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVG
        MA+IASTL+    +P  LFD  SSFHG+ ++  P S+ L P         S S  A+   YDLN FTF+PIKESIVSREMTRRYMTDMITYA+TDV+VVG
Subjt:  MASIASTLTTKLQRPS-LFDSSSSFHGTPLAPLP-SLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVG

Query:  AGSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDL
        AGSAGLS AYE+SKNP+V+VAIIEQSVSPGGGAWLGGQLFSAM+VRKPAH FLDE+GV YDEQD YVV+KHAALFTSTIMSKLLARPNVKLFNAVAAEDL
Subjt:  AGSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDL

Query:  IVKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEI
        IVKG RVGGVVTNWALV+ NH TQSCMDPNVMEAK+VVSSCGHDGPFGATGVKRLKSIGMID VPGMKALDMNTAEDAIVRLTRE+VPGMIVTGMEVAEI
Subjt:  IVKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEI

Query:  DGAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA
        DGAPRMGPTFGAMMISGQKA  LALK+LGL NAIDG       ++ +    L+LA+A+S E  DA
Subjt:  DGAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA

Arabidopsis top hitse value%identityAlignment
AT5G54770.1 thiazole biosynthetic enzyme, chloroplast (ARA6) (THI1) (THI4)7.3e-15077.26Show/hide
Query:  MASIASTLTTKLQRPS-LFDSSSSFHGTPLAPLP-SLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVG
        MA+IASTL+    +P  LFD  SSFHG+ ++  P S+ L P         S S  A+   YDLN FTF+PIKESIVSREMTRRYMTDMITYA+TDV+VVG
Subjt:  MASIASTLTTKLQRPS-LFDSSSSFHGTPLAPLP-SLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVG

Query:  AGSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDL
        AGSAGLS AYE+SKNP+V+VAIIEQSVSPGGGAWLGGQLFSAM+VRKPAH FLDE+GV YDEQD YVV+KHAALFTSTIMSKLLARPNVKLFNAVAAEDL
Subjt:  AGSAGLSCAYELSKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDL

Query:  IVKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEI
        IVKG RVGGVVTNWALV+ NH TQSCMDPNVMEAK+VVSSCGHDGPFGATGVKRLKSIGMID VPGMKALDMNTAEDAIVRLTRE+VPGMIVTGMEVAEI
Subjt:  IVKGGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEI

Query:  DGAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA
        DGAPRMGPTFGAMMISGQKA  LALK+LGL NAIDG       ++ +    L+LA+A+S E  DA
Subjt:  DGAPRMGPTFGAMMISGQKAAHLALKSLGLANAIDGRRKEHAVVIEDEEAGLLLASAESPEVADA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCATTGCCTCCACTCTCACCACCAAGCTTCAGAGGCCGTCCTTGTTCGACTCCTCCTCTTCCTTCCATGGCACCCCTTTGGCGCCTCTTCCTTCTCTCCGCCT
CCACCCCATCCAATCTAACACCGGCGCTTCCCTCTCCATCTCCGCCTCGGCCTCTCCTCCGCCCTACGATTTGAACAGGTTCACCTTCAACCCCATCAAAGAGTCCATCG
TCTCCCGCGAGATGACCCGCCGGTACATGACAGATATGATCACTTATGCTGATACTGATGTGATTGTCGTCGGCGCTGGCTCCGCCGGCCTCTCTTGTGCTTACGAGCTC
AGCAAAAACCCCTCCGTCCGCGTCGCCATCATCGAACAATCCGTCAGCCCCGGCGGCGGTGCATGGCTCGGCGGCCAGCTCTTCTCCGCCATGGTGGTTCGGAAGCCAGC
CCATTACTTCTTGGACGAGCTGGGCGTGGAGTACGACGAGCAAGACAACTACGTTGTAATCAAGCACGCGGCTCTGTTCACGTCGACGATCATGAGCAAGCTTCTGGCGC
GGCCGAACGTGAAGCTGTTCAACGCCGTGGCGGCGGAGGACTTGATCGTGAAAGGCGGAAGAGTCGGCGGCGTGGTGACGAACTGGGCGCTGGTGTCGATGAACCACGAC
ACGCAGTCCTGCATGGACCCGAACGTGATGGAGGCGAAGGTGGTGGTGAGCTCCTGTGGGCACGACGGGCCATTCGGCGCAACCGGAGTGAAGCGGCTGAAGAGCATCGG
GATGATCGACAGCGTCCCCGGAATGAAAGCTCTGGACATGAACACGGCGGAGGACGCCATCGTTAGACTCACCAGAGAGATTGTTCCGGGCATGATTGTTACAGGCATGG
AGGTTGCAGAGATCGACGGAGCTCCAAGAATGGGGCCGACGTTTGGGGCGATGATGATATCGGGGCAGAAAGCGGCGCACTTGGCGTTGAAGTCGTTGGGGCTGGCGAAC
GCCATAGATGGAAGAAGAAAGGAACATGCAGTGGTGATTGAGGATGAGGAGGCGGGGCTGCTGCTGGCGTCGGCGGAGTCGCCGGAGGTTGCAGATGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCATTGCCTCCACTCTCACCACCAAGCTTCAGAGGCCGTCCTTGTTCGACTCCTCCTCTTCCTTCCATGGCACCCCTTTGGCGCCTCTTCCTTCTCTCCGCCT
CCACCCCATCCAATCTAACACCGGCGCTTCCCTCTCCATCTCCGCCTCGGCCTCTCCTCCGCCCTACGATTTGAACAGGTTCACCTTCAACCCCATCAAAGAGTCCATCG
TCTCCCGCGAGATGACCCGCCGGTACATGACAGATATGATCACTTATGCTGATACTGATGTGATTGTCGTCGGCGCTGGCTCCGCCGGCCTCTCTTGTGCTTACGAGCTC
AGCAAAAACCCCTCCGTCCGCGTCGCCATCATCGAACAATCCGTCAGCCCCGGCGGCGGTGCATGGCTCGGCGGCCAGCTCTTCTCCGCCATGGTGGTTCGGAAGCCAGC
CCATTACTTCTTGGACGAGCTGGGCGTGGAGTACGACGAGCAAGACAACTACGTTGTAATCAAGCACGCGGCTCTGTTCACGTCGACGATCATGAGCAAGCTTCTGGCGC
GGCCGAACGTGAAGCTGTTCAACGCCGTGGCGGCGGAGGACTTGATCGTGAAAGGCGGAAGAGTCGGCGGCGTGGTGACGAACTGGGCGCTGGTGTCGATGAACCACGAC
ACGCAGTCCTGCATGGACCCGAACGTGATGGAGGCGAAGGTGGTGGTGAGCTCCTGTGGGCACGACGGGCCATTCGGCGCAACCGGAGTGAAGCGGCTGAAGAGCATCGG
GATGATCGACAGCGTCCCCGGAATGAAAGCTCTGGACATGAACACGGCGGAGGACGCCATCGTTAGACTCACCAGAGAGATTGTTCCGGGCATGATTGTTACAGGCATGG
AGGTTGCAGAGATCGACGGAGCTCCAAGAATGGGGCCGACGTTTGGGGCGATGATGATATCGGGGCAGAAAGCGGCGCACTTGGCGTTGAAGTCGTTGGGGCTGGCGAAC
GCCATAGATGGAAGAAGAAAGGAACATGCAGTGGTGATTGAGGATGAGGAGGCGGGGCTGCTGCTGGCGTCGGCGGAGTCGCCGGAGGTTGCAGATGCTTGA
Protein sequenceShow/hide protein sequence
MASIASTLTTKLQRPSLFDSSSSFHGTPLAPLPSLRLHPIQSNTGASLSISASASPPPYDLNRFTFNPIKESIVSREMTRRYMTDMITYADTDVIVVGAGSAGLSCAYEL
SKNPSVRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHYFLDELGVEYDEQDNYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKGGRVGGVVTNWALVSMNHD
TQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKSLGLAN
AIDGRRKEHAVVIEDEEAGLLLASAESPEVADA