; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G009100 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G009100
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionThiamine thiazole synthase, chloroplastic
Genome locationCmo_Chr16:5490878..5492427
RNA-Seq ExpressionCmoCh16G009100
SyntenyCmoCh16G009100
Gene Ontology termsGO:0009228 - thiamine biosynthetic process (biological process)
GO:0052837 - thiazole biosynthetic process (biological process)
GO:0005829 - cytosol (cellular component)
GO:0009570 - chloroplast stroma (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016763 - transferase activity, transferring pentosyl groups (molecular function)
InterPro domainsIPR002922 - Thiazole biosynthetic enzyme Thi4 family
IPR027495 - Thiamine thiazole synthase
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577434.1 Thiamine thiazole synthase 2, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]4.0e-17881.73Show/hide
Query:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
        MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
Subjt:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL

Query:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL
        SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAM                                                               
Subjt:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL

Query:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
               VVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
Subjt:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV

Query:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID-
        VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID 
Subjt:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID-

Query:  ------GEVEAAEMVLAAGETPEVADA
               EVEAAEMVLAAGETPEVADA
Subjt:  ------GEVEAAEMVLAAGETPEVADA

XP_022932142.1 thiamine thiazole synthase, chloroplastic-like [Cucurbita moschata]3.9e-18183.33Show/hide
Query:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
        MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
Subjt:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL

Query:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL
        SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAM                                                               
Subjt:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL

Query:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
               VVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
Subjt:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV

Query:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSIDG
        VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSIDG
Subjt:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSIDG

Query:  EVEAAEMVLAAGETPEVADA
        EVEAAEMVLAAGETPEVADA
Subjt:  EVEAAEMVLAAGETPEVADA

XP_022985241.1 thiamine thiazole synthase, chloroplastic-like [Cucurbita maxima]6.5e-17681.03Show/hide
Query:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
        MASIASSLP KLHRPSLL+SSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
Subjt:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL

Query:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL
        SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAM                                                               
Subjt:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL

Query:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
               VVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
Subjt:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV

Query:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID-
        VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMN+AEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGK NSID 
Subjt:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID-

Query:  -GE-----VEAAEMVLAAGETPEVADA
         GE     VEAAEMVLAAGETPEVADA
Subjt:  -GE-----VEAAEMVLAAGETPEVADA

XP_023520449.1 thiamine thiazole synthase, chloroplastic-like [Cucurbita pepo subsp. pepo]1.7e-17680.8Show/hide
Query:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
        MASIASSLP KLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
Subjt:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL

Query:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL
        SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAM                                                               
Subjt:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL

Query:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
               VVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLL+RPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAK+
Subjt:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV

Query:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID-
        VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGK NSID 
Subjt:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID-

Query:  ------GEVEAAEMVLAAGETPEVADA
               EVEAAEMVLAAGETPEVADA
Subjt:  ------GEVEAAEMVLAAGETPEVADA

XP_038905461.1 thiamine thiazole synthase, chloroplastic [Benincasa hispida]1.0e-15773.55Show/hide
Query:  MASIASSLPIKLHRPS-LLESSFHGSPLLPTPSFRLK---PTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAG
        MASIAS+L  KL RPS LL+SSF+G+PL P PS RLK    TTAA SLSISASA+QPPYDLNQFKFNPI+ESIVSREMTRRYMTDMITYADTDVI+VGAG
Subjt:  MASIASSLPIKLHRPS-LLESSFHGSPLLPTPSFRLK---PTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFR
        SAGLSCAYELSK+PS+R+AIIEQSVSPGGGAWLGGQLFSAM                                                           
Subjt:  SAGLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFR

Query:  TPRLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVM
                   VVRKPAH FLDE+ VEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK GRVGGVVTNWALVSMNHDTQSCMDPNVM
Subjt:  TPRLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVM

Query:  EAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTN
        EAKVVVSSCGHDGPFGATGVKRLKSIG+IDSVPGMKALDMN+AEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALK+LG+ N
Subjt:  EAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTN

Query:  SI-DGE------VEAAEMVLAAGETPEVADA
         I DGE      VE  E++LAAGE+PEVADA
Subjt:  SI-DGE------VEAAEMVLAAGETPEVADA

TrEMBL top hitse value%identityAlignment
A0A0A0L546 Thiamine thiazole synthase, chloroplastic1.0e-15572.09Show/hide
Query:  MASIASSLPIKLHRPSLL-ESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAG
        MASIAS+L  KL +PSLL +SSFHG+PL    S RLK +TAA SL+ISASA+QPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDV++VGAGSAG
Subjt:  MASIASSLPIKLHRPSLL-ESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAG

Query:  LSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPR
        LSCAYELSK+P++R+AIIEQSVSPGGGAWLGGQLFSAM                                                              
Subjt:  LSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPR

Query:  LNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAK
                VVRKPAH FLDE+ VEYDEQ+DYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK GRVGGVVTNWALVSMNHDTQSCMDPNVMEAK
Subjt:  LNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAK

Query:  VVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID
        VVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMN+AEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALK+LG+ NSI 
Subjt:  VVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID

Query:  GE---------VEAAEMVLAAGETPEVADA
         E         +E  E++LAAGE+PEVADA
Subjt:  GE---------VEAAEMVLAAGETPEVADA

A0A1S3CJ81 Thiamine thiazole synthase, chloroplastic6.1e-15672.33Show/hide
Query:  MASIASSLPIKLHRPSLL-ESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAG
        MASIAS+L  KL  PSLL +SSFHG+PL   PS RLK + AA SL+ISASA+QPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVI+VGAGSAG
Subjt:  MASIASSLPIKLHRPSLL-ESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAG

Query:  LSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPR
        LSCAYELSK+P++R+AIIEQSVSPGGGAWLGGQLFSAM                                                              
Subjt:  LSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPR

Query:  LNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAK
                VVRKPAH FLDE+ VEYDEQ+DYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK GRVGGVVTNWALVSMNHDTQSCMDPNVMEAK
Subjt:  LNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAK

Query:  VVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID
        VVVSSCGHDGPFGATGVKRLKSIG+IDSVPGMKALDMN+AEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALK+LG+ NSI 
Subjt:  VVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID

Query:  GE---------VEAAEMVLAAGETPEVADA
        GE         +E  E++LAAGE+PEVADA
Subjt:  GE---------VEAAEMVLAAGETPEVADA

A0A6J1EVU2 Thiamine thiazole synthase, chloroplastic1.9e-18183.33Show/hide
Query:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
        MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
Subjt:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL

Query:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL
        SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAM                                                               
Subjt:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL

Query:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
               VVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
Subjt:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV

Query:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSIDG
        VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSIDG
Subjt:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSIDG

Query:  EVEAAEMVLAAGETPEVADA
        EVEAAEMVLAAGETPEVADA
Subjt:  EVEAAEMVLAAGETPEVADA

A0A6J1FWL6 Thiamine thiazole synthase, chloroplastic1.3e-15071.8Show/hide
Query:  MASIASSLPIKLHRP--SLLESSFHGSPLL-PTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGS
        MAS+AS+L  KL  P  SLL+SSFHG+PL  P PS RLK T AA S+S S    Q PYDLN FKFNPI+ESIVSREMTRRYMTDMITYADTDV++VGAGS
Subjt:  MASIASSLPIKLHRP--SLLESSFHGSPLL-PTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGS

Query:  AGLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRT
        AGLSCAYELSK+P +R+AIIEQSVSPGGGAWLGGQLFSAM                                                            
Subjt:  AGLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRT

Query:  PRLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVME
                  VVRKPAH FLDE+ VEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK GRVGGVVTNWALVSMNHDTQSCMDPNVME
Subjt:  PRLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVME

Query:  AKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNS
        AKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMN+AEDAIVR TREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALK+LG  N 
Subjt:  AKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNS

Query:  IDGEVEAAEMVLAAGETPEVAD
        I  E EAA+++LAAGE+PEVAD
Subjt:  IDGEVEAAEMVLAAGETPEVAD

A0A6J1J7L4 Thiamine thiazole synthase, chloroplastic3.1e-17681.03Show/hide
Query:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
        MASIASSLP KLHRPSLL+SSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL
Subjt:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGL

Query:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL
        SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAM                                                               
Subjt:  SCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRL

Query:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
               VVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV
Subjt:  NLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKV

Query:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID-
        VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMN+AEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGK NSID 
Subjt:  VVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID-

Query:  -GE-----VEAAEMVLAAGETPEVADA
         GE     VEAAEMVLAAGETPEVADA
Subjt:  -GE-----VEAAEMVLAAGETPEVADA

SwissProt top hitse value%identityAlignment
C5X2M4 Thiamine thiazole synthase 2, chloroplastic2.0e-13564.42Show/hide
Query:  SLLESSFHGSPL------LPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGLSCAYELSKD
        SLL+SSF G+ L        +P+    P  A A  + S S++ PPYDL  F+F+PI+ES+VSREMTRRYMTDMITYADTDV++VGAGSAGLSCAYELSKD
Subjt:  SLLESSFHGSPL------LPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGLSCAYELSKD

Query:  PSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRLNLIGRFEVV
        PSV +AI+EQSVSPGGGAWLGGQLFSAM                                                                      VV
Subjt:  PSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRLNLIGRFEVV

Query:  RKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDG
        RKPAH FLDEL V YDE EDYVVIKHAALFTST+MS+LLARPNVKLFNAVA EDLIVK GRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDG
Subjt:  RKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDG

Query:  PFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSIDGEVEAAEMVL
        PFGATGVKRL+ IGMI  VPGMKALDMN+AED IVR TREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALG+ N++DG ++     L
Subjt:  PFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSIDGEVEAAEMVL

Query:  -----AAGETPEVADA
              A +  EV DA
Subjt:  -----AAGETPEVADA

F6H7K5 Thiamine thiazole synthase 2, chloroplastic1.6e-14065.03Show/hide
Query:  MASIASSLPIKLHRPSLL----ESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAG
        MAS+A++L      P       +SSFHGSP+    S+R+ P   +      + ++  PYDL  FKF PI+ESIV+REMTRRYM DMITYADTDV++VGAG
Subjt:  MASIASSLPIKLHRPSLL----ESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAG

Query:  SAGLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFR
        SAGLSCAYELSK+PS+RVAIIEQSVSPGGGAWLGGQLFSAM                                                           
Subjt:  SAGLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFR

Query:  TPRLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVM
                   VVRKPAH+FLDEL +EYDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKE RV GVVTNWALVSMNHDTQSCMDPNVM
Subjt:  TPRLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVM

Query:  EAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTN
        EAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMN+AEDAIVR TRE+VPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLAL+ALG+ N
Subjt:  EAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTN

Query:  SIDG-----EVEAAEMVLAAGETPEVADA
        +IDG     E    E++LAA ET E+ DA
Subjt:  SIDG-----EVEAAEMVLAAGETPEVADA

F6H9A9 Thiamine thiazole synthase 1, chloroplastic8.9e-13664.4Show/hide
Query:  MASIASSLPIKLHRPSLLE---SSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGS
        MA++ SS+  K  + S+ +   SSFHG P+      RL P   +  ++++ +AA  PYDL  FKF PI+ESIVSREMTRRYM DMITYADTDV+VVGAGS
Subjt:  MASIASSLPIKLHRPSLLE---SSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGS

Query:  AGLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRT
        AGLSCAYELSK+PSV+VAIIEQSVSPGGGAWLGGQLFS+M                                                            
Subjt:  AGLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRT

Query:  PRLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVME
                  VVRKPAH FLDEL +EYDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLI+KEG+VGGVVTNWALVSMNHDTQSCMDPNVME
Subjt:  PRLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVME

Query:  AKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNS
        AKVVVSSCGHDGPFGATGVKRL+S+GMIDSVPGMKALDMN+AED IVR TREVVPGMIVTGMEVAEIDG+PRMGPTFGAMMISGQKAAHLALK+LG  N+
Subjt:  AKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNS

Query:  IDGEVEA---AEMVL-AAGETPEVADA
        +DG        E+VL AA +  E+A+A
Subjt:  IDGEVEA---AEMVL-AAGETPEVADA

O23787 Thiamine thiazole synthase, chloroplastic1.5e-14366.82Show/hide
Query:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAA--SLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSA
        MAS A +  +      L +SSFHG+P+ P+   RL+P  ++   +LSISASA+ PPYDLN FKF+PI+ESIVSREMTRRYMTDMITYADTDV+VVGAGSA
Subjt:  MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAA--SLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSA

Query:  GLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTP
        GLSCAYELSK+P++++AIIEQSVSPGGGAWLGGQLFSAM                                                             
Subjt:  GLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTP

Query:  RLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEA
                 VVRKPAH FLDEL ++YDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK GRVGGVVTNWALVSMNHDTQSCMDPNVMEA
Subjt:  RLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEA

Query:  KVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSI
        KVVVSSCGHDGPFGATGVKRLKSIGMI+ VPGMKALDMNSAEDAIVR TREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALK+LG+ N++
Subjt:  KVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSI

Query:  DGEVEAA---EMVLAAGETPEVADA
        DG        E++LAA ++ E ADA
Subjt:  DGEVEAA---EMVLAAGETPEVADA

Q38709 Thiamine thiazole synthase, chloroplastic1.7e-13966.51Show/hide
Query:  IASSLPIKLHRPSLLE---SSFHGSPLLPTPSFRLKPTTAAASLSISASAA-QPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAG
        +AS+L  K  R +L E   SSF+G+PL P+ S R++PT A A  SIS S A  PPYDL  F F+PI+ESIVSREMTRRYM DMITYADTDV+VVGAGS+G
Subjt:  IASSLPIKLHRPSLLE---SSFHGSPLLPTPSFRLKPTTAAASLSISASAA-QPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAG

Query:  LSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPR
        L C YELSK+PSV+VAIIEQSVSPGGGAWLGGQLFS M                                                              
Subjt:  LSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPR

Query:  LNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAK
                VVRKPAH FLDEL +EYDEQ++YVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK GRVGGVVTNWALVSMNHDTQSCMDPNVMEAK
Subjt:  LNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAK

Query:  VVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID
        VVVSSCGHDGPFGATGVK L+SIGMID+VPGMKALDMN AEDAIVR TRE+VPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALG  N++D
Subjt:  VVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSID

Query:  GEVEAA---EMVLAAGETPEVADA
        G        E++LAA ++ E+ADA
Subjt:  GEVEAA---EMVLAAGETPEVADA

Arabidopsis top hitse value%identityAlignment
AT5G54770.1 thiazole biosynthetic enzyme, chloroplast (ARA6) (THI1) (THI4)1.6e-13564.24Show/hide
Query:  MASIASSLPIKLHRPS-LLESSFHGSPLLPTP-SFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSA
        MA+IAS+L +   +P  L +SSFHGS +   P S  LKP       S S  A    YDLN F F+PI+ESIVSREMTRRYMTDMITYA+TDV+VVGAGSA
Subjt:  MASIASSLPIKLHRPS-LLESSFHGSPLLPTP-SFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSA

Query:  GLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTP
        GLS AYE+SK+P+V+VAIIEQSVSPGGGAWLGGQLFSAM                                                             
Subjt:  GLSCAYELSKDPSVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTP

Query:  RLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEA
                 +VRKPAH FLDE+ V YDEQ+ YVV+KHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVK  RVGGVVTNWALV+ NH TQSCMDPNVMEA
Subjt:  RLNLIGRFEVVRKPAHYFLDELEVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEA

Query:  KVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSI
        K+VVSSCGHDGPFGATGVKRLKSIGMID VPGMKALDMN+AEDAIVR TREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKA  LALKALG  N+I
Subjt:  KVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSI

Query:  DGEVE---AAEMVLAAGETPEVADA
        DG +    + E+VLAA ++ E  DA
Subjt:  DGEVE---AAEMVLAAGETPEVADA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCATTGCCTCTTCTCTCCCCATCAAGCTCCATAGGCCCTCTTTGCTCGAATCCTCCTTCCATGGCTCCCCTTTGCTCCCTACTCCTTCTTTCCGCCTCAAACC
CACCACCGCTGCCGCCTCCCTCTCCATCTCCGCCTCCGCCGCTCAGCCTCCCTACGATTTGAACCAATTCAAATTCAACCCCATCAGGGAGTCCATCGTCTCCCGGGAGA
TGACCCGCCGCTACATGACCGACATGATCACTTACGCCGACACTGATGTCATTGTTGTCGGAGCTGGATCCGCCGGTCTCTCTTGCGCTTATGAACTCAGCAAGGACCCC
TCTGTTCGAGTCGCCATCATCGAACAATCCGTCAGCCCTGGCGGTGGTGCATGGCTTGGCGGCCAGCTCTTCTCTGCTATGGTAACAAATTCATTTCCCTTTCCATGTAT
ACCCATAACGTCTCATTTTCTTTTTTTTTTTTTTTTTCTTCCATATTTGGATGGATGCTCTGTTTTTTGTTCTTTCATTGGAAATACCCAAAATTTTGAAAAACCCACAC
AGATTTTCATGATTTGTACTCATGCATGTCGTTTTAGAACCCCGAGATTGAATTTAATTGGGAGATTTGAAGTGGTGCGGAAGCCAGCCCATTACTTCTTGGACGAGCTG
GAAGTGGAGTACGACGAACAAGAAGACTATGTCGTGATAAAGCACGCAGCTCTGTTCACATCAACGATCATGAGCAAGCTGTTGGCCCGGCCAAACGTGAAGCTGTTCAA
CGCCGTGGCGGCAGAGGATTTGATCGTGAAGGAAGGCAGAGTCGGCGGCGTGGTCACCAACTGGGCGCTGGTGTCCATGAACCACGACACACAGTCTTGCATGGACCCCA
ATGTGATGGAAGCTAAGGTCGTGGTGAGCTCATGCGGGCACGACGGACCATTCGGCGCCACGGGAGTGAAGCGGCTGAAGAGCATCGGGATGATCGACAGCGTGCCGGGA
ATGAAGGCGCTGGACATGAACTCGGCGGAGGATGCCATTGTTAGATTCACGAGAGAGGTGGTGCCGGGAATGATTGTGACAGGGATGGAGGTTGCGGAGATCGACGGAGC
GCCAAGAATGGGGCCGACGTTCGGGGCGATGATGATATCAGGGCAGAAGGCGGCGCACTTGGCGCTGAAGGCGTTGGGGAAGACGAACAGCATTGATGGGGAAGTGGAAG
CAGCAGAGATGGTACTGGCGGCGGGAGAGACGCCGGAGGTTGCAGATGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCATTGCCTCTTCTCTCCCCATCAAGCTCCATAGGCCCTCTTTGCTCGAATCCTCCTTCCATGGCTCCCCTTTGCTCCCTACTCCTTCTTTCCGCCTCAAACC
CACCACCGCTGCCGCCTCCCTCTCCATCTCCGCCTCCGCCGCTCAGCCTCCCTACGATTTGAACCAATTCAAATTCAACCCCATCAGGGAGTCCATCGTCTCCCGGGAGA
TGACCCGCCGCTACATGACCGACATGATCACTTACGCCGACACTGATGTCATTGTTGTCGGAGCTGGATCCGCCGGTCTCTCTTGCGCTTATGAACTCAGCAAGGACCCC
TCTGTTCGAGTCGCCATCATCGAACAATCCGTCAGCCCTGGCGGTGGTGCATGGCTTGGCGGCCAGCTCTTCTCTGCTATGGTAACAAATTCATTTCCCTTTCCATGTAT
ACCCATAACGTCTCATTTTCTTTTTTTTTTTTTTTTTCTTCCATATTTGGATGGATGCTCTGTTTTTTGTTCTTTCATTGGAAATACCCAAAATTTTGAAAAACCCACAC
AGATTTTCATGATTTGTACTCATGCATGTCGTTTTAGAACCCCGAGATTGAATTTAATTGGGAGATTTGAAGTGGTGCGGAAGCCAGCCCATTACTTCTTGGACGAGCTG
GAAGTGGAGTACGACGAACAAGAAGACTATGTCGTGATAAAGCACGCAGCTCTGTTCACATCAACGATCATGAGCAAGCTGTTGGCCCGGCCAAACGTGAAGCTGTTCAA
CGCCGTGGCGGCAGAGGATTTGATCGTGAAGGAAGGCAGAGTCGGCGGCGTGGTCACCAACTGGGCGCTGGTGTCCATGAACCACGACACACAGTCTTGCATGGACCCCA
ATGTGATGGAAGCTAAGGTCGTGGTGAGCTCATGCGGGCACGACGGACCATTCGGCGCCACGGGAGTGAAGCGGCTGAAGAGCATCGGGATGATCGACAGCGTGCCGGGA
ATGAAGGCGCTGGACATGAACTCGGCGGAGGATGCCATTGTTAGATTCACGAGAGAGGTGGTGCCGGGAATGATTGTGACAGGGATGGAGGTTGCGGAGATCGACGGAGC
GCCAAGAATGGGGCCGACGTTCGGGGCGATGATGATATCAGGGCAGAAGGCGGCGCACTTGGCGCTGAAGGCGTTGGGGAAGACGAACAGCATTGATGGGGAAGTGGAAG
CAGCAGAGATGGTACTGGCGGCGGGAGAGACGCCGGAGGTTGCAGATGCTTGAAGTTTCTCATTTAATTTGCTTTGGGGTTTGGGGTTGGTGTGTTTGTTTGTTCGTTTG
TAGTGAGTTTTGAGTGAAGAAGAAGAACGGGGTTTGGGTTTGGGGTTTTCATGTGATGCCTCTGTTGTTTTCGTAGCGTAATGGGTTGGGTTGGGGTTAGGGG
Protein sequenceShow/hide protein sequence
MASIASSLPIKLHRPSLLESSFHGSPLLPTPSFRLKPTTAAASLSISASAAQPPYDLNQFKFNPIRESIVSREMTRRYMTDMITYADTDVIVVGAGSAGLSCAYELSKDP
SVRVAIIEQSVSPGGGAWLGGQLFSAMVTNSFPFPCIPITSHFLFFFFFLPYLDGCSVFCSFIGNTQNFEKPTQIFMICTHACRFRTPRLNLIGRFEVVRKPAHYFLDEL
EVEYDEQEDYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKEGRVGGVVTNWALVSMNHDTQSCMDPNVMEAKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPG
MKALDMNSAEDAIVRFTREVVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGKTNSIDGEVEAAEMVLAAGETPEVADA