; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G197800 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G197800
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description3-dehydroquinate synthase homolog
Genome locationCla97Chr10:27532150..27543515
RNA-Seq ExpressionCla97C10G197800
SyntenyCla97C10G197800
Gene Ontology termsGO:0006006 - glucose metabolic process (biological process)
GO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0033499 - galactose catabolic process via UDP-galactose (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0004034 - aldose 1-epimerase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
GO:0030246 - carbohydrate binding (molecular function)
InterPro domainsIPR002812 - 3-dehydroquinate synthase
IPR008183 - Aldose 1-/Glucose-6-phosphate 1-epimerase
IPR011013 - Galactose mutarotase-like domain superfamily
IPR014718 - Glycoside hydrolase-type carbohydrate-binding


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8384719.1 hypothetical protein BUALT_Bualt04G0147500 [Buddleja alternifolia]3.4e-24555.82Show/hide
Query:  AAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNN
        AA    + +E + EI  +EL++GDL V FTN+GA IVSL+VP+K G   D+VLGYDS+++Y+ D + FG+IVGRVANRIGGAKFTLD   YKL  N+G N
Subjt:  AAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNN

Query:  TLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWNLGGHN-SGDILSNRLQI
        TL+GG +GF++V+W VTK++K   +P I F Y S + ++GFPG+L VTA Y L   N LK+ ++A+  +K TPVNLAQ+TYWNLGGHN S +IL ++LQ+
Subjt:  TLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWNLGGHN-SGDILSNRLQI

Query:  FGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGT--GEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGK
          S  T +DQYL+PTG+I PV  TP+DF  PH + SR +K+  GY++NY +DD     E  +K AAV+ D KSGR+L + T+ P VQFY+G+ + +V+GK
Subjt:  FGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGT--GEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGK

Query:  GGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVSPFLSKQRITYHKTPENLKLR
        GG +YQ +A + L  QGFP++VN  +FPS IV+  KPY    +F+     P          V K +   MA +  S+ V  F  K +    K        
Subjt:  GGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVSPFLSKQRITYHKTPENLKLR

Query:  PLILRDFGEAYAGECKSSNVSRLQCS-YASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSS-----------------IAL
           L DF          +  +R Q + +   +   S     K VWIW+E ++VMTAA ERGWSTFIF     +LA EWSS                 IAL
Subjt:  PLILRDFGEAYAGECKSSNVSRLQCS-YASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSS-----------------IAL

Query:  IHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVED
        ++PLFI+E G+FDG+   +AT  E+S+P+QLE+LQP +   + VVV+L DWQ+IPAENIVAA QG++KTVFAVSKT  EAQ F EALE GLGGV+LK ED
Subjt:  IHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVED

Query:  PEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTS
         EA+F++KDY DRRN   ++L LTKA +T I + GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLES+YI+SRPFRVNAGPVHAYVA+PG KTS
Subjt:  PEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTS

Query:  YLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALV-CPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIE
        YLSEL+AG EVI+VDQ G QRTAI+GRVKIETRQLILV+AK D D QT YSILLQNAETVAL+   G G+++KAIPVTSLK+GDEV LR+QG ARHTGIE
Subjt:  YLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALV-CPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIE

Query:  IQEFIVEK
        I+EFIVEK
Subjt:  IQEFIVEK

OMO79237.1 3-dehydroquinate synthase, prokaryotic-type [Corchorus capsularis]2.5e-27763.09Show/hide
Query:  EIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV
        E+ I+ELKRG++SVKFTNWGA+IVSL++P+KHGKL D+VLGYDS+++Y NDT+YFG +VGRVANRIGGAKFTLDG  YKL ANEG N LHGG +GFSDV+
Subjt:  EIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV

Query:  WKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQ---LKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQY
        W V +Y+ DG  P I FSY S+DG+EGFPG+L V+  Y+L  + +   L + + AKA+ KPTPVNLAQHTYWNLG H+SGDILS  + I+ +  T VD  
Subjt:  WKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQ---LKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQY

Query:  LIPTGKIEPVKGTPYDFLKPHT----VGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHA
        LIPTG+I    GTPYDFL  H     +GSRI +L KGYDINY LD      K+  AA V D K+GRM+ + TN PG+QFYTGN IKDVKGKGGF+Y+AHA
Subjt:  LIPTGKIEPVKGTPYDFLKPHT----VGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHA

Query:  ALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGE
         LCLETQGFPD+VNH NFPS I++P KPY H MLF+                             + S V PF                      R F E
Subjt:  ALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGE

Query:  AYAGECKSSNVSRLQCSYASSSSSMSPT--EASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEV
                ++     C+ A S S +S +  E SK VWIW+E  QVMTAAVERGW+TFIF+  N ELAN+WS+IA I PL IKE G+F+  G+ +AT+ EV
Subjt:  AYAGECKSSNVSRLQCSYASSSSSMSPT--EASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEV

Query:  SNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTK
        S P +L++LQP +     VV+DL DWQ+IPAENIVA FQGSQ TVFAVSK+  EAQ FLEALEHGLGGV+LK ED +AV  LK+YFDRRNE  N LSL+K
Subjt:  SNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTK

Query:  ATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIV
        AT+T++H  GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEG+ RTA+V
Subjt:  ATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIV

Query:  GRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        GRVKIETR LILV+AK DS++QT YSILLQNAETVAL+CP  G   +K  IPVTSLK GDEV LRLQG ARHTGIEIQEFIVE
Subjt:  GRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

QCD92574.1 aldose 1-epimerase [Vigna unguiculata]5.4e-22754.13Show/hide
Query:  MAKFFLALLCLIALAAFGFANG---YEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAK
        M K F+ L C++ LAA GF +G    +KK +I +FELK+GD S+K TNWGAT+VS+++P+K+G L D+VLGYDS + Y ND+SYFG+ VGRV NRIGGA+
Subjt:  MAKFFLALLCLIALAAFGFANG---YEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAK

Query:  FTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWN
        FTL+GV YKL+ANEGNNTLH G + FSDV+W V KY KDG  P+I FSY SFDG+ GFPGDLLVT  Y ++  N L + + AKALNKPTPVNL  H YWN
Subjt:  FTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWN

Query:  LGGHNSGDILSNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPK--GYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAP
        LG HNSG+IL+  +QIFGS++T++D +LIPTGK   VKGTPYDFL+PH VG RIN+LPK  GYDINY LD   G+  +K  A+V DKKSGR++++ TNAP
Subjt:  LGGHNSGDILSNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPK--GYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAP

Query:  GVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ----GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVS
        G+QFYT N++++ KGK     +A   LC        G     + HN P   + P                                       SSS+P  
Subjt:  GVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ----GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVS

Query:  PFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSI
                                                        S +S     + SK VWIW+                                 
Subjt:  PFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSI

Query:  ALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKV
                K+  V D + + +AT+ +VSNP++LE L+P +  A+ +VV+L DWQ+IPAENI+AAFQ SQKTV A+S    EAQ FLEALEHGL G+++K+
Subjt:  ALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKV

Query:  EDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGK
        ED E V +LK YFDRR E SNLLSLTKAT+T I VAGMGDRVCVDLCSLMRPGEGLL+GS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPG +
Subjt:  EDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGK

Query:  TSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHT
        TSYLSEL++GKEVIVVDQ+G QR AIVGRVKIE+R LILV+AK +SD QT  SILLQNAETVALVCP +GN   K  IPVTSLKVGDE+ LR+QG ARHT
Subjt:  TSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHT

Query:  GIEIQEFIVEK
        GIEIQEFIVEK
Subjt:  GIEIQEFIVEK

XP_004147467.1 uncharacterized protein LOC101203995 [Cucumis sativus]2.6e-22193.85Show/hide
Query:  MAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MAMA L SSSPVSPFLSKQRITY KTPENL LRPL+ RDFGEAYAGECKSS+VSRLQCSY SSSS MSP EASKGVWIWSECQQVMTAAVERGWSTFIFS
Subjt:  MAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE
        PHNTELA+EWSSIALIHPLFIKENGV DGE R IA+VVEVSNPQQLEQLQPA ASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQ FLE
Subjt:  PHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGSYARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida]7.0e-22796.22Show/hide
Query:  MAMALLSSSSPVSPFLSKQRIT-YHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIF
        M MALL SSSPVSPFLSKQRI+ YHKTPENL LRPLI RDFGEAYAGECKSSNVSRLQCSYAS  S+MSPTEASKGVWIWSECQQVMTAAVERGWSTFIF
Subjt:  MAMALLSSSSPVSPFLSKQRIT-YHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIF

Query:  SPHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFL
        SPHNTELA+EWSSIALIHPLFIKENGVFDGEGR IA+VVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQ FL
Subjt:  SPHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFL

Query:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
        EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
Subjt:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN

Query:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDE
        AGPVHAYVAVPGGKTSYLSEL AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK+IPVTSLKVGDE
Subjt:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hitse value%identityAlignment
A0A0A0LG24 Aldose 1-epimerase7.1e-20992.61Show/hide
Query:  MEPLKQIWRKAKSLHSWANMAKFFLALLCLIALAAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSY
        + P+KQIWR+A+ LHSWANMAK FLAL C+IALA FGFANGYEKKGEIGIFELKRGD SVKFTNWGATIVSLLVP+KHGKLDDVVLGYDSI+EYQNDT+Y
Subjt:  MEPLKQIWRKAKSLHSWANMAKFFLALLCLIALAAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSY

Query:  FGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA
        FGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDG+EGFPGDLLVTA Y+LIA NQLKLT+NAKA
Subjt:  FGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA

Query:  LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHD
        LNKPTPVNLAQHTYWNLGGHNSGDILSN LQIFGSRITVVD  LIPTGK+EPVKGTP+DFLKP TVGSRINKLPKGYDINYALDDGTGE+KLKKAAVVHD
Subjt:  LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHD

Query:  KKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTK
        KKSGRMLE+STN PGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTK
Subjt:  KKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTK

A0A1R3I9H2 3-dehydroquinate synthase, prokaryotic-type1.2e-27763.09Show/hide
Query:  EIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV
        E+ I+ELKRG++SVKFTNWGA+IVSL++P+KHGKL D+VLGYDS+++Y NDT+YFG +VGRVANRIGGAKFTLDG  YKL ANEG N LHGG +GFSDV+
Subjt:  EIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV

Query:  WKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQ---LKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQY
        W V +Y+ DG  P I FSY S+DG+EGFPG+L V+  Y+L  + +   L + + AKA+ KPTPVNLAQHTYWNLG H+SGDILS  + I+ +  T VD  
Subjt:  WKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQ---LKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQY

Query:  LIPTGKIEPVKGTPYDFLKPHT----VGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHA
        LIPTG+I    GTPYDFL  H     +GSRI +L KGYDINY LD      K+  AA V D K+GRM+ + TN PG+QFYTGN IKDVKGKGGF+Y+AHA
Subjt:  LIPTGKIEPVKGTPYDFLKPHT----VGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHA

Query:  ALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGE
         LCLETQGFPD+VNH NFPS I++P KPY H MLF+                             + S V PF                      R F E
Subjt:  ALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGE

Query:  AYAGECKSSNVSRLQCSYASSSSSMSPT--EASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEV
                ++     C+ A S S +S +  E SK VWIW+E  QVMTAAVERGW+TFIF+  N ELAN+WS+IA I PL IKE G+F+  G+ +AT+ EV
Subjt:  AYAGECKSSNVSRLQCSYASSSSSMSPT--EASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEV

Query:  SNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTK
        S P +L++LQP +     VV+DL DWQ+IPAENIVA FQGSQ TVFAVSK+  EAQ FLEALEHGLGGV+LK ED +AV  LK+YFDRRNE  N LSL+K
Subjt:  SNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTK

Query:  ATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIV
        AT+T++H  GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEG+ RTA+V
Subjt:  ATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIV

Query:  GRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        GRVKIETR LILV+AK DS++QT YSILLQNAETVAL+CP  G   +K  IPVTSLK GDEV LRLQG ARHTGIEIQEFIVE
Subjt:  GRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

A0A1S3B8Q7 3-dehydroquinate synthase homolog7.9e-21691.25Show/hide
Query:  MAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MA L SSSPVSP LSKQRITY KTPENL LRPLI R+FG+AYAGECKSS++SRLQCSY SSSS MSP E SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE
        PHNTELA+EW+SIA+IHPLFIKE+GV DGE R IA+VVE+SNPQQLEQLQPA ASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQ F E
Subjt:  PHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A4D6LXG6 Aldose 1-epimerase2.6e-22754.13Show/hide
Query:  MAKFFLALLCLIALAAFGFANG---YEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAK
        M K F+ L C++ LAA GF +G    +KK +I +FELK+GD S+K TNWGAT+VS+++P+K+G L D+VLGYDS + Y ND+SYFG+ VGRV NRIGGA+
Subjt:  MAKFFLALLCLIALAAFGFANG---YEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAK

Query:  FTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWN
        FTL+GV YKL+ANEGNNTLH G + FSDV+W V KY KDG  P+I FSY SFDG+ GFPGDLLVT  Y ++  N L + + AKALNKPTPVNL  H YWN
Subjt:  FTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWN

Query:  LGGHNSGDILSNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPK--GYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAP
        LG HNSG+IL+  +QIFGS++T++D +LIPTGK   VKGTPYDFL+PH VG RIN+LPK  GYDINY LD   G+  +K  A+V DKKSGR++++ TNAP
Subjt:  LGGHNSGDILSNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPK--GYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAP

Query:  GVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ----GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVS
        G+QFYT N++++ KGK     +A   LC        G     + HN P   + P                                       SSS+P  
Subjt:  GVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ----GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVS

Query:  PFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSI
                                                        S +S     + SK VWIW+                                 
Subjt:  PFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSI

Query:  ALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKV
                K+  V D + + +AT+ +VSNP++LE L+P +  A+ +VV+L DWQ+IPAENI+AAFQ SQKTV A+S    EAQ FLEALEHGL G+++K+
Subjt:  ALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKV

Query:  EDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGK
        ED E V +LK YFDRR E SNLLSLTKAT+T I VAGMGDRVCVDLCSLMRPGEGLL+GS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPG +
Subjt:  EDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGK

Query:  TSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHT
        TSYLSEL++GKEVIVVDQ+G QR AIVGRVKIE+R LILV+AK +SD QT  SILLQNAETVALVCP +GN   K  IPVTSLKVGDE+ LR+QG ARHT
Subjt:  TSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHT

Query:  GIEIQEFIVEK
        GIEIQEFIVEK
Subjt:  GIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein7.9e-21691.25Show/hide
Query:  MAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MA L SSSPVSP LSKQRITY KTPENL LRPLI R+FG+AYAGECKSS++SRLQCSY SSSS MSP E SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE
        PHNTELA+EW+SIA+IHPLFIKE+GV DGE R IA+VVE+SNPQQLEQLQPA ASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQ F E
Subjt:  PHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

SwissProt top hitse value%identityAlignment
Q5EA79 Galactose mutarotase1.3e-6942.99Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   L V   +WG TI +L V ++ G+  DVVLG+D +E Y     YFG++VGRVANRI    FTLDG  YKL  N G N+LHGG +GF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYL
        +W   +   +G    + FS  S DG+EG+PG+L V   Y+L      +L VN +A  ++ TPVNL  H+Y+NL G  S +I  + + I       VD+ L
Subjt:  VWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYL

Query:  IPTGKIEPVKGTPYDFLKPHTVGSRINKL-PKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALC
        IPTG+I  V+GT +D  KP  +G  + +    G+D N+ L    G  + +  A VH   SGR+LE+ T  PGVQFYTGN++   +KGK G  Y  H+  C
Subjt:  IPTGKIEPVKGTPYDFLKPHTVGSRINKL-PKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALC

Query:  LETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        LETQ +PDAVN  +FP  ++ P + Y+H   FKFS
Subjt:  LETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Q66HG4 Galactose mutarotase9.7e-7043.24Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   L+V   +WG TI +L V ++ GK  DVVLG+  +E Y     YFG++VGRVANRI   +FT+DG  Y L  N   N+LHGG RGF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGRSPQIV-----FSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITV
        +W          +PQ++     FS  S DG+EG+PG+L V   Y+L      +L VN +A  ++ TPVNL  H+Y+NL G  S DI  + + I       
Subjt:  VWKVTKYQKDGRSPQIV-----FSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITV

Query:  VDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQA
        VD+ LIPTG I PV+GT +D  KP  +G  +      G+D N+ L +     + K  A VH   SGR+LE+ T  PGVQFYTGN++   +KGK G VY  
Subjt:  VDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQA

Query:  HAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        H+  CLETQ +PDAVN   FP  ++ P + YNH   FKFS
Subjt:  HAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Q8K157 Galactose mutarotase7.7e-6741.76Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   LSV   +WG TI +L V ++ GK  DVVLG+  +E Y     YFG++VGRVANRI   +FT+ G  Y L  N   N+LHGG  GF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGRSPQIV-----FSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITV
        +W          +PQ++     F   S DG+EG+PG+L V   Y+L      +L +N +A  ++ TPVNL  H+Y+NL G  S +I  + + I       
Subjt:  VWKVTKYQKDGRSPQIV-----FSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITV

Query:  VDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQA
        VD+ LIPTG I PV+GT +D  KP  +G+ +      G+D N+ L +     + K  A V    SGR+LE+ T  PGVQFYTGN++   +KGK G VY  
Subjt:  VDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQA

Query:  HAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        H+ LCLETQ +PD+VN   FP  ++ P + YNH   FKFS
Subjt:  HAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Q96C23 Galactose mutarotase3.8e-6641.79Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   L V   +WG TI +L V ++ G+  DVVLG+  +E Y     YFG+++GRVANRI    F +DG  Y L  N+  N+LHGG RGF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYL
        +W   +   +G    + FS  S DG+EG+PG+L V   Y+L      +L VN +A  ++ TPVNL  H+Y+NL G  S +I  + + I       VD+ L
Subjt:  VWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYL

Query:  IPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALC
        IPTG++ PV+GT +D  KP  +G  +      G+D N+ L    G  +    A VH   SGR+LE+ T  PGVQFYTGN++   +KGK G VY  H+  C
Subjt:  IPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALC

Query:  LETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        LETQ +PDAVN   FP  ++ P + Y+H   FKFS
Subjt:  LETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Q9GKX6 Galactose mutarotase8.2e-6942.99Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   L V   +WG TI +L V ++ G+  DVVLG+  ++EY     YFG++VGRVANRI    FTLDG  YKL  N G N+LHGG RGF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYL
        +W   +   +G    I FS  S DG+EG+PG+L V   Y+L      +L VN +A  ++ TPVNL  H+Y+NL G  S +I  + + I       VD+ L
Subjt:  VWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKA-LNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYL

Query:  IPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALC
        IPTG+I PV+GT +D  KP  +G  + +    G+D N+ L       + +  A VH   SGR+LE+ T  PG+QFYTGN++   +KGK G VY  H+  C
Subjt:  IPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALC

Query:  LETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        LETQ +P+AVN  +FP  ++ P + YNH   F FS
Subjt:  LETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Arabidopsis top hitse value%identityAlignment
AT3G17940.1 Galactose mutarotase-like superfamily protein9.8e-10252.82Show/hide
Query:  EKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTS-YFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRG
        + K    IFEL  G + VK +N+G TI SL VP+K+GKL DVVLG+DS++ Y    + YFG IVGRVANRI   KF+L+GV Y L  N+  N+LHGG +G
Subjt:  EKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTS-YFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRG

Query:  FSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVD
        F   +W+V  +++DG  P I F Y S DG+EG+PG + VTA Y+L +   ++L + A   NK TP+NLAQHTYWNL GH+SG+IL +++QI+GS IT VD
Subjt:  FSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVD

Query:  QYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYK-LKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAA
        +Y +PTG+I PVKGTP+DF +   +G  I ++  GYD NY LD    E + LK AA + D  S R+L + TN PG+QFYTGNY+  V GKG  VY  HA 
Subjt:  QYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYK-LKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAA

Query:  LCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        +CLETQGFP+A+N  NFPS +V   + YNH MLF+FS
Subjt:  LCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink).2.6e-13962.26Show/hide
Query:  MAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MA+ L+SS S +    ++   ++    E L+L  L+L    +      K +   R+    ++S+  M+  +A K VWIW+ C++VMT AVERGW+TFIFS
Subjt:  MAMALLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE
          N +L+NEWSSIAL+  LFI+E  V DG G  +A+V EVS P++L  L   N   + +V+D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA+ FLE
Subjt:  PHNTELANEWSSIALIHPLFIKENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVG
        GPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YSI+LQNAETVALV P + N   + A+PVTSLK G
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVG

Query:  DEVFLRLQGEARHTGIEIQEFIVE
        D+V +RLQG ARHTGIEIQEFIVE
Subjt:  DEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).9.1e-14064.52Show/hide
Query:  TYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSIALIHPLFI
        +Y  T E L+L  L+L    +      K +   R+    ++S+  M+  +A K VWIW+ C++VMT AVERGW+TFIFS  N +L+NEWSSIAL+  LFI
Subjt:  TYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSIALIHPLFI

Query:  KENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQ
        +E  V DG G  +A+V EVS P++L  L   N   + +V+D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA+ FLEALEHGLGG+ILK ED +AV  
Subjt:  KENGVFDGEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQ

Query:  LKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELR
        LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR
Subjt:  LKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELR

Query:  AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEF
         G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YSI+LQNAETVALV P + N   + A+PVTSLK GD+V +RLQG ARHTGIEIQEF
Subjt:  AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEF

Query:  IVE
        IVE
Subjt:  IVE

AT3G47800.1 Galactose mutarotase-like superfamily protein1.1e-11659.21Show/hide
Query:  EIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV
        +I  ++L RG LSV FTN+GA + SLL+P++HGK DDVVLG+D+++ Y+NDT+YFG+IVGRVANRIGGAKF L+G LYK   NEG NTLHGG++GFSDV+
Subjt:  EIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV

Query:  WKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYLIP
        W V KY     +  I F+Y SFDG+EGFPG++ V   Y LI  N+L + + AK LNKPTP+NLA HTYWNL  HNSG+ILS+++Q+   +IT VD  LIP
Subjt:  WKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYLIP

Query:  TGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ
        TG+I  + GTPYDFL+P  +GSRI++LP GYDINY +D   G++ L+K AVV ++ +GR +E+ TN PGVQFYT N +K V GKG  VY+ +  LCLETQ
Subjt:  TGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ

Query:  GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        GFPD+VNH NFPS IV P + Y H+MLF+F+
Subjt:  GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

AT5G15140.1 Galactose mutarotase-like superfamily protein2.8e-11759.58Show/hide
Query:  KKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFS
        +K +IG++ELK+G+L+VKFTNWGA+I+SL  P+K+GK+DD+VLGYDS++ Y+ D  YFG+ VGRVANRIG  KF L+G  YK   N+G NTLHGG +GF 
Subjt:  KKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFS

Query:  DVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQY
        DVVW V K+Q DG+ P IVF++ S DGD+GFPG+L VT  Y L+ +N+L + + AK  +K TPVNLA H+YWNLGGHNSGDILS  +QI GS  T VD  
Subjt:  DVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLAQHTYWNLGGHNSGDILSNRLQIFGSRITVVDQY

Query:  LIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCL
        LIPTGKI PVKGT YDFL+   +   +  L  GYDINY LD      K++K   + DKKSGR +E+S N  G+QFYTG  +KDVKGK G VYQA   LCL
Subjt:  LIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCL

Query:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        ETQ +PDA+NH  FPS IV P K Y H MLFKFS
Subjt:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCGTTGAAGCAGATATGGAGGAAGGCAAAATCTCTTCACTCTTGGGCAAATATGGCCAAATTTTTCCTTGCTTTGCTCTGTCTTATTGCTTTAGCTGCT
TTTGGGTTTGCTAATGGCTATGAGAAGAAGGGGGAGATTGGGATTTTTGAGCTCAAGAGAGGTGACCTTTCAGTGAAATTTACCAACTGGGGTGCCACTATTGTG
TCTCTTCTTGTCCCAAACAAGCATGGGAAGTTGGATGATGTTGTTCTTGGGTATGATTCCATTGAGGAGTATCAGAATGATACATCGTATTTTGGTTCGATTGTT
GGAAGAGTTGCTAACCGAATCGGTGGTGCGAAATTTACTCTGGATGGAGTTCTATACAAGCTGATTGCTAATGAAGGCAACAACACACTTCATGGTGGCACTAGA
GGCTTCAGTGATGTTGTCTGGAAAGTGACCAAATATCAGAAAGATGGTAGATCTCCTCAAATCGTATTTTCCTACCGCAGTTTCGACGGCGACGAAGGTTTTCCT
GGTGATCTCTTGGTGACTGCCAAATACTCACTCATTGCAAACAACCAACTGAAGTTAACAGTGAATGCCAAAGCTCTGAACAAGCCTACTCCTGTAAATTTAGCT
CAACACACCTACTGGAATCTTGGTGGACATAACAGCGGTGATATTCTATCAAATCGTCTTCAGATCTTCGGATCCCGCATCACTGTCGTCGATCAATATCTCATT
CCTACAGGAAAGATAGAACCTGTCAAAGGAACTCCATATGATTTCCTCAAGCCCCACACGGTTGGAAGCAGAATAAACAAGCTACCAAAAGGCTATGACATAAAC
TATGCTCTCGATGATGGCACCGGGGAATATAAACTGAAGAAAGCAGCAGTCGTGCACGACAAGAAGTCGGGAAGAATGTTGGAGATATCGACAAATGCTCCAGGT
GTGCAGTTCTATACAGGAAACTATATAAAGGATGTAAAAGGAAAGGGAGGATTTGTGTACCAAGCTCATGCTGCGCTTTGTTTGGAGACTCAAGGCTTTCCTGAT
GCAGTGAATCACCACAATTTTCCTTCAACCATTGTAACTCCCAAGAAGCCTTACAATCACATTATGCTGTTTAAGTTCTCAACTAAAGGGCCATTGGGCTTTCAA
AGCGACGAGCGAAAGAGAGTTGGCAAGAAGAAAACGATGGCCATGGCCTTGCTCTCTTCCTCCTCGCCTGTTTCTCCATTTCTTTCCAAACAGCGCATCACCTAC
CACAAAACACCAGAGAATTTAAAACTCCGGCCCCTAATTTTGAGGGACTTTGGTGAAGCCTATGCTGGTGAATGTAAATCCTCGAATGTGAGTCGTTTACAGTGT
TCTTACGCTTCCTCGTCCTCTTCAATGTCTCCGACTGAGGCGTCGAAGGGGGTATGGATTTGGAGTGAGTGTCAGCAGGTTATGACGGCTGCGGTTGAGAGGGGA
TGGAGCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTAATGAATGGTCGTCAATTGCACTAATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTTGAT
GGCGAGGGTAGACCAATTGCCACAGTTGTTGAGGTCTCTAACCCCCAGCAGTTGGAGCAGCTCCAACCAGCAAATGCATCTGCAGACATTGTTGTTGTTGATTTA
CAAGACTGGCAGATAATACCTGCAGAGAATATTGTTGCAGCATTTCAGGGGAGTCAGAAAACAGTGTTTGCCGTCTCGAAAACTCCTATTGAAGCTCAAACCTTC
CTTGAGGCACTTGAACATGGTCTGGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGC
AATCTTTTAAGCTTGACTAAGGCTACTATTACTCAAATTCACGTTGCTGGAATGGGAGATCGAGTTTGCGTCGATCTCTGTAGTCTCATGAGACCCGGCGAAGGG
CTTCTTGTCGGGTCCTATGCCAGAGGCCTGTTTCTTGTTCACTCGGAATGCTTAGAATCAAATTATATTGCTAGCCGGCCATTTCGTGTCAATGCTGGACCTGTC
CATGCCTACGTAGCTGTTCCAGGAGGGAAAACTAGCTACCTTTCCGAGTTACGAGCAGGGAAAGAGGTAATTGTAGTTGATCAAGAAGGCAGACAGCGAACCGCT
ATTGTTGGACGTGTAAAGATTGAAACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAGACTCCTTACAGCATCCTTCTGCAGAATGCGGAA
ACGGTTGCATTAGTCTGCCCGGGACGAGGAAATGAGAAGAAAGCCATCCCAGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCA
AGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCGTTGAAGCAGATATGGAGGAAGGCAAAATCTCTTCACTCTTGGGCAAATATGGCCAAATTTTTCCTTGCTTTGCTCTGTCTTATTGCTTTAGCTGCT
TTTGGGTTTGCTAATGGCTATGAGAAGAAGGGGGAGATTGGGATTTTTGAGCTCAAGAGAGGTGACCTTTCAGTGAAATTTACCAACTGGGGTGCCACTATTGTG
TCTCTTCTTGTCCCAAACAAGCATGGGAAGTTGGATGATGTTGTTCTTGGGTATGATTCCATTGAGGAGTATCAGAATGATACATCGTATTTTGGTTCGATTGTT
GGAAGAGTTGCTAACCGAATCGGTGGTGCGAAATTTACTCTGGATGGAGTTCTATACAAGCTGATTGCTAATGAAGGCAACAACACACTTCATGGTGGCACTAGA
GGCTTCAGTGATGTTGTCTGGAAAGTGACCAAATATCAGAAAGATGGTAGATCTCCTCAAATCGTATTTTCCTACCGCAGTTTCGACGGCGACGAAGGTTTTCCT
GGTGATCTCTTGGTGACTGCCAAATACTCACTCATTGCAAACAACCAACTGAAGTTAACAGTGAATGCCAAAGCTCTGAACAAGCCTACTCCTGTAAATTTAGCT
CAACACACCTACTGGAATCTTGGTGGACATAACAGCGGTGATATTCTATCAAATCGTCTTCAGATCTTCGGATCCCGCATCACTGTCGTCGATCAATATCTCATT
CCTACAGGAAAGATAGAACCTGTCAAAGGAACTCCATATGATTTCCTCAAGCCCCACACGGTTGGAAGCAGAATAAACAAGCTACCAAAAGGCTATGACATAAAC
TATGCTCTCGATGATGGCACCGGGGAATATAAACTGAAGAAAGCAGCAGTCGTGCACGACAAGAAGTCGGGAAGAATGTTGGAGATATCGACAAATGCTCCAGGT
GTGCAGTTCTATACAGGAAACTATATAAAGGATGTAAAAGGAAAGGGAGGATTTGTGTACCAAGCTCATGCTGCGCTTTGTTTGGAGACTCAAGGCTTTCCTGAT
GCAGTGAATCACCACAATTTTCCTTCAACCATTGTAACTCCCAAGAAGCCTTACAATCACATTATGCTGTTTAAGTTCTCAACTAAAGGGCCATTGGGCTTTCAA
AGCGACGAGCGAAAGAGAGTTGGCAAGAAGAAAACGATGGCCATGGCCTTGCTCTCTTCCTCCTCGCCTGTTTCTCCATTTCTTTCCAAACAGCGCATCACCTAC
CACAAAACACCAGAGAATTTAAAACTCCGGCCCCTAATTTTGAGGGACTTTGGTGAAGCCTATGCTGGTGAATGTAAATCCTCGAATGTGAGTCGTTTACAGTGT
TCTTACGCTTCCTCGTCCTCTTCAATGTCTCCGACTGAGGCGTCGAAGGGGGTATGGATTTGGAGTGAGTGTCAGCAGGTTATGACGGCTGCGGTTGAGAGGGGA
TGGAGCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTAATGAATGGTCGTCAATTGCACTAATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTTGAT
GGCGAGGGTAGACCAATTGCCACAGTTGTTGAGGTCTCTAACCCCCAGCAGTTGGAGCAGCTCCAACCAGCAAATGCATCTGCAGACATTGTTGTTGTTGATTTA
CAAGACTGGCAGATAATACCTGCAGAGAATATTGTTGCAGCATTTCAGGGGAGTCAGAAAACAGTGTTTGCCGTCTCGAAAACTCCTATTGAAGCTCAAACCTTC
CTTGAGGCACTTGAACATGGTCTGGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGC
AATCTTTTAAGCTTGACTAAGGCTACTATTACTCAAATTCACGTTGCTGGAATGGGAGATCGAGTTTGCGTCGATCTCTGTAGTCTCATGAGACCCGGCGAAGGG
CTTCTTGTCGGGTCCTATGCCAGAGGCCTGTTTCTTGTTCACTCGGAATGCTTAGAATCAAATTATATTGCTAGCCGGCCATTTCGTGTCAATGCTGGACCTGTC
CATGCCTACGTAGCTGTTCCAGGAGGGAAAACTAGCTACCTTTCCGAGTTACGAGCAGGGAAAGAGGTAATTGTAGTTGATCAAGAAGGCAGACAGCGAACCGCT
ATTGTTGGACGTGTAAAGATTGAAACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAGACTCCTTACAGCATCCTTCTGCAGAATGCGGAA
ACGGTTGCATTAGTCTGCCCGGGACGAGGAAATGAGAAGAAAGCCATCCCAGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCA
AGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATGATGGTTGATCACCTTTTACCATTTGAATATGTTGTATATTTTATCTTTTGAAAACACTAGA
CTTTTGTTGGGTTGATTCGATAAGTATTTGGTTTTTAATTTTTTTAAAAAT
Protein sequenceShow/hide protein sequence
MEPLKQIWRKAKSLHSWANMAKFFLALLCLIALAAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPNKHGKLDDVVLGYDSIEEYQNDTSYFGSIV
GRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGRSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTVNAKALNKPTPVNLA
QHTYWNLGGHNSGDILSNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPG
VQFYTGNYIKDVKGKGGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMALLSSSSPVSPFLSKQRITY
HKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYASSSSSMSPTEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELANEWSSIALIHPLFIKENGVFD
GEGRPIATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEAS
NLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTA
IVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK