; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC10G200260 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC10G200260
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Description3-dehydroquinate synthase homolog
Genome locationCicolChr10:27043026..27054235
RNA-Seq ExpressionCcUC10G200260
SyntenyCcUC10G200260
Gene Ontology termsGO:0006006 - glucose metabolic process (biological process)
GO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0033499 - galactose catabolic process via UDP-galactose (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0004034 - aldose 1-epimerase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
GO:0030246 - carbohydrate binding (molecular function)
InterPro domainsIPR002812 - 3-dehydroquinate synthase
IPR008183 - Aldose 1-/Glucose-6-phosphate 1-epimerase
IPR011013 - Galactose mutarotase-like domain superfamily
IPR014718 - Glycoside hydrolase-type carbohydrate-binding


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8384719.1 hypothetical protein BUALT_Bualt04G0147500 [Buddleja alternifolia]4.0e-24656.06Show/hide
Query:  AAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNN
        AA    + +E + EI  +EL++GDL V FTN+GA IVSL+VPDK G   D+VLGYDS+++Y+ D + FG+IVGRVANRIGGAKFTLD   YKL  N+G N
Subjt:  AAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNN

Query:  TLHGGTRGFSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHN-SGDILLNRLQI
        TL+GG +GF++V+W VTK++K   +P I F Y S + ++GFPG+L VTA Y L   N LK+ ++A+  +K TPVNLAQ+TYWNLGGHN S +IL ++LQ+
Subjt:  TLHGGTRGFSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHN-SGDILLNRLQI

Query:  FGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGT--GEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGK
          S  T +DQYL+PTG+I PV  TP+DF  PH + SR +K+  GY++NY +DD     E  +K AAV+ D KSGR+L + T+ P VQFY+G+ + +V+GK
Subjt:  FGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGT--GEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGK

Query:  GGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVSPFLSKQRITYHKTPENLKLR
        GG +YQ +A + L  QGFP++VN  +FPS IV+  KPY    +F+     P          V K +   MA +  S+ V  F  K +    K        
Subjt:  GGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVSPFLSKQRITYHKTPENLKLR

Query:  PLILRDFGEAYAGECKSSNVSRLQCS-YAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSS-----------------IAL
           L DF          +  +R Q + +   +   S     K VWIW+E ++VMTAA ERGWSTFIF     +LA EWSS                 IAL
Subjt:  PLILRDFGEAYAGECKSSNVSRLQCS-YAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSS-----------------IAL

Query:  IHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVED
        ++PLFI+E G+FDG+    AT  E+S+P+QLE+LQP +   + VVV+L DWQ+IPAENIVAA QG++KTVFAVSKT  EAQ F EALE GLGGV+LK ED
Subjt:  IHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVED

Query:  PEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTS
         EA+F++KDY DRRN   ++L LTKA +T I + GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLES+YI+SRPFRVNAGPVHAYVA+PG KTS
Subjt:  PEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTS

Query:  YLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCS-GRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIE
        YLSEL+AG EVI+VDQ G QRTAI+GRVKIETRQLILV+AK D D QT YSILLQNAETVAL+ S G G+++KAIPVTSLK+GDEV LR+QG ARHTGIE
Subjt:  YLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCS-GRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIE

Query:  IQEFIVEK
        I+EFIVEK
Subjt:  IQEFIVEK

OMO79237.1 3-dehydroquinate synthase, prokaryotic-type [Corchorus capsularis]8.2e-27663.09Show/hide
Query:  EIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV
        E+ I+ELKRG++SVKFTNWGA+IVSL++PDKHGKL D+VLGYDS+++Y NDT+YFG +VGRVANRIGGAKFTLDG  YKL ANEG N LHGG +GFSDV+
Subjt:  EIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV

Query:  WKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQ---LKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQY
        W V +Y+ DG  P I FSY S+DG+EGFPG+L V+  Y+L  + +   L + M AKA+ KPTPVNLAQHTYWNLG H+SGDIL   + I+ +  T VD  
Subjt:  WKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQ---LKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQY

Query:  LIPTGKIEPVKGTPYDFLKPHT----VGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHA
        LIPTG+I    GTPYDFL  H     +GSRI +L KGYDINY LD      K+  AA V D K+GRM+ + TN PG+QFYTGN IKDVKGKGGF+Y+AHA
Subjt:  LIPTGKIEPVKGTPYDFLKPHT----VGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHA

Query:  ALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGE
         LCLETQGFPD+VNH NFPS I++P KPY H MLF+                             + S V PF                      R F E
Subjt:  ALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGE

Query:  AYAGECKSSNVSRLQCSYAASSS--SMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEV
                ++     C+ A S S  S S  E SK VWIW+E  QVMTAAVERGW+TFIF+  N ELA++WS+IA I PL IKE G+F+  G+  AT+ EV
Subjt:  AYAGECKSSNVSRLQCSYAASSS--SMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEV

Query:  SNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTK
        S P +L++LQP +     VV+DL DWQ+IPAENIVA FQGSQ TVFAVSK+  EAQ FLEALEHGLGGV+LK ED +AV  LK+YFDRRNE  N LSL+K
Subjt:  SNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTK

Query:  ATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIV
        AT+T++H  GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEG+ RTA+V
Subjt:  ATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIV

Query:  GRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        GRVKIETR LILV+AK DS++QT YSILLQNAETVAL+C   G   +K  IPVTSLK GDEV LRLQG ARHTGIEIQEFIVE
Subjt:  GRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

QCD92574.1 aldose 1-epimerase [Vigna unguiculata]5.4e-22754.25Show/hide
Query:  MAKFFLALLCLIALAAFGFANG---YEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAK
        M K F+ L C++ LAA GF +G    +KK +I +FELK+GD S+K TNWGAT+VS+++PDK+G L D+VLGYDS + Y ND+SYFG+ VGRV NRIGGA+
Subjt:  MAKFFLALLCLIALAAFGFANG---YEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAK

Query:  FTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWN
        FTL+GV YKL+ANEGNNTLH G + FSDV+W V KY KDG  P+I FSY SFDG+ GFPGDLLVT  Y ++  N L + M AKALNKPTPVNL  H YWN
Subjt:  FTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWN

Query:  LGGHNSGDILLNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPK--GYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAP
        LG HNSG+IL   +QIFGS++T++D +LIPTGK   VKGTPYDFL+PH VG RIN+LPK  GYDINY LD   G+  +K  A+V DKKSGR++++ TNAP
Subjt:  LGGHNSGDILLNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPK--GYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAP

Query:  GVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ----GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVS
        G+QFYT N++++ KGK     +A   LC        G     + HN P   + P            S+  P  F S +    GK+               
Subjt:  GVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ----GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVS

Query:  PFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSI
                                                                   SK VWIW+                                 
Subjt:  PFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSI

Query:  ALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKV
                K+  V D + +  AT+ +VSNP++LE L+P +  A+ +VV+L DWQ+IPAENI+AAFQ SQKTV A+S    EAQ FLEALEHGL G+++K+
Subjt:  ALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKV

Query:  EDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGK
        ED E V +LK YFDRR E SNLLSLTKAT+T I VAGMGDRVCVDLCSLMRPGEGLL+GS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPG +
Subjt:  EDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGK

Query:  TSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHT
        TSYLSEL++GKEVIVVDQ+G QR AIVGRVKIE+R LILV+AK +SD QT  SILLQNAETVALVC  +GN   K  IPVTSLKVGDE+ LR+QG ARHT
Subjt:  TSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHT

Query:  GIEIQEFIVEK
        GIEIQEFIVEK
Subjt:  GIEIQEFIVEK

XP_004147467.1 uncharacterized protein LOC101203995 [Cucumis sativus]6.4e-22093.38Show/hide
Query:  MAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MAMA L SSSPVSPFLSKQRITY KTPENL LRPL+ RDFGEAYAGECKSS+VSRLQCSY +SSS MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
Subjt:  MAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE
        PHNTELA EWSSIALIHPLFIKENGV DGE R  A+VVEVSNPQQLEQLQPA ASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQ FLE
Subjt:  PHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGSYARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRG-NEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVC G+G NEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRG-NEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida]8.6e-22595.27Show/hide
Query:  MAMAVLSSSSPVSPFLSKQRIT-YHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIF
        M MA+L SSSPVSPFLSKQRI+ YHKTPENL LRPLI RDFGEAYAGECKSSNVSRLQCSYA+  S+MSP EASKGVWIWSECQQVMTAAVERGWSTFIF
Subjt:  MAMAVLSSSSPVSPFLSKQRIT-YHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIF

Query:  SPHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFL
        SPHNTELADEWSSIALIHPLFIKENGVFDGEGR  A+VVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQ FL
Subjt:  SPHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFL

Query:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
        EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
Subjt:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN

Query:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGNEKKAIPVTSLKVGDE
        AGPVHAYVAVPGGKTSYLSEL AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVC GRGNEKK+IPVTSLKVGDE
Subjt:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hitse value%identityAlignment
A0A0A0LG24 Aldose 1-epimerase3.0e-20793.09Show/hide
Query:  LKQIWRKAKSLHSWANMAKFFLALLCLIALAAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGS
        +KQIWR+A+ LHSWANMAK FLAL C+IALA FGFANGYEKKGEIGIFELKRGD SVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSI+EYQNDT+YFGS
Subjt:  LKQIWRKAKSLHSWANMAKFFLALLCLIALAAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGS

Query:  IVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNK
        IVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDG SPQIVFSYRSFDG+EGFPGDLLVTA Y+LIA NQLKLTMNAKALNK
Subjt:  IVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNK

Query:  PTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKS
        PTPVNLAQHTYWNLGGHNSGDIL N LQIFGSRITVVD  LIPTGK+EPVKGTP+DFLKP TVGSRINKLPKGYDINYALDDGTGE+KLKKAAVVHDKKS
Subjt:  PTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKS

Query:  GRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTK
        GRMLE+STN PGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTK
Subjt:  GRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTK

A0A1R3I9H2 3-dehydroquinate synthase, prokaryotic-type4.0e-27663.09Show/hide
Query:  EIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV
        E+ I+ELKRG++SVKFTNWGA+IVSL++PDKHGKL D+VLGYDS+++Y NDT+YFG +VGRVANRIGGAKFTLDG  YKL ANEG N LHGG +GFSDV+
Subjt:  EIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV

Query:  WKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQ---LKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQY
        W V +Y+ DG  P I FSY S+DG+EGFPG+L V+  Y+L  + +   L + M AKA+ KPTPVNLAQHTYWNLG H+SGDIL   + I+ +  T VD  
Subjt:  WKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQ---LKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQY

Query:  LIPTGKIEPVKGTPYDFLKPHT----VGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHA
        LIPTG+I    GTPYDFL  H     +GSRI +L KGYDINY LD      K+  AA V D K+GRM+ + TN PG+QFYTGN IKDVKGKGGF+Y+AHA
Subjt:  LIPTGKIEPVKGTPYDFLKPHT----VGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHA

Query:  ALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGE
         LCLETQGFPD+VNH NFPS I++P KPY H MLF+                             + S V PF                      R F E
Subjt:  ALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGE

Query:  AYAGECKSSNVSRLQCSYAASSS--SMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEV
                ++     C+ A S S  S S  E SK VWIW+E  QVMTAAVERGW+TFIF+  N ELA++WS+IA I PL IKE G+F+  G+  AT+ EV
Subjt:  AYAGECKSSNVSRLQCSYAASSS--SMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEV

Query:  SNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTK
        S P +L++LQP +     VV+DL DWQ+IPAENIVA FQGSQ TVFAVSK+  EAQ FLEALEHGLGGV+LK ED +AV  LK+YFDRRNE  N LSL+K
Subjt:  SNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTK

Query:  ATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIV
        AT+T++H  GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEG+ RTA+V
Subjt:  ATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIV

Query:  GRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        GRVKIETR LILV+AK DS++QT YSILLQNAETVAL+C   G   +K  IPVTSLK GDEV LRLQG ARHTGIEIQEFIVE
Subjt:  GRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

A0A1S3B8Q7 3-dehydroquinate synthase homolog1.9e-21490.78Show/hide
Query:  MAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MA L SSSPVSP LSKQRITY KTPENL LRPLI R+FG+AYAGECKSS++SRLQCSY +SSS MSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE
        PHNTELA EW+SIA+IHPLFIKE+GV DGE R  A+VVE+SNPQQLEQLQPA ASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQ F E
Subjt:  PHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRG-NEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVC G+G NEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRG-NEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A4D6LXG6 Aldose 1-epimerase2.6e-22754.25Show/hide
Query:  MAKFFLALLCLIALAAFGFANG---YEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAK
        M K F+ L C++ LAA GF +G    +KK +I +FELK+GD S+K TNWGAT+VS+++PDK+G L D+VLGYDS + Y ND+SYFG+ VGRV NRIGGA+
Subjt:  MAKFFLALLCLIALAAFGFANG---YEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAK

Query:  FTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWN
        FTL+GV YKL+ANEGNNTLH G + FSDV+W V KY KDG  P+I FSY SFDG+ GFPGDLLVT  Y ++  N L + M AKALNKPTPVNL  H YWN
Subjt:  FTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWN

Query:  LGGHNSGDILLNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPK--GYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAP
        LG HNSG+IL   +QIFGS++T++D +LIPTGK   VKGTPYDFL+PH VG RIN+LPK  GYDINY LD   G+  +K  A+V DKKSGR++++ TNAP
Subjt:  LGGHNSGDILLNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPK--GYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAP

Query:  GVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ----GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVS
        G+QFYT N++++ KGK     +A   LC        G     + HN P   + P            S+  P  F S +    GK+               
Subjt:  GVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ----GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVS

Query:  PFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSI
                                                                   SK VWIW+                                 
Subjt:  PFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSI

Query:  ALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKV
                K+  V D + +  AT+ +VSNP++LE L+P +  A+ +VV+L DWQ+IPAENI+AAFQ SQKTV A+S    EAQ FLEALEHGL G+++K+
Subjt:  ALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKV

Query:  EDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGK
        ED E V +LK YFDRR E SNLLSLTKAT+T I VAGMGDRVCVDLCSLMRPGEGLL+GS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPG +
Subjt:  EDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGK

Query:  TSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHT
        TSYLSEL++GKEVIVVDQ+G QR AIVGRVKIE+R LILV+AK +SD QT  SILLQNAETVALVC  +GN   K  IPVTSLKVGDE+ LR+QG ARHT
Subjt:  TSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRGN--EKKAIPVTSLKVGDEVFLRLQGEARHT

Query:  GIEIQEFIVEK
        GIEIQEFIVEK
Subjt:  GIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein1.9e-21490.78Show/hide
Query:  MAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MA L SSSPVSP LSKQRITY KTPENL LRPLI R+FG+AYAGECKSS++SRLQCSY +SSS MSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE
        PHNTELA EW+SIA+IHPLFIKE+GV DGE R  A+VVE+SNPQQLEQLQPA ASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQ F E
Subjt:  PHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRG-NEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVC G+G NEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCSGRG-NEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

SwissProt top hitse value%identityAlignment
Q5EA79 Galactose mutarotase2.8e-6943.11Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   L V   +WG TI +L V D+ G+  DVVLG+D +E Y     YFG++VGRVANRI    FTLDG  YKL  N G N+LHGG +GF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLI
        +W   +   +G    + FS  S DG+EG+PG+L V   Y+L    +L +   A+A ++ TPVNL  H+Y+NL G  S +I  + + I       VD+ LI
Subjt:  VWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLI

Query:  PTGKIEPVKGTPYDFLKPHTVGSRINKL-PKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALCL
        PTG+I  V+GT +D  KP  +G  + +    G+D N+ L    G  + +  A VH   SGR+LE+ T  PGVQFYTGN++   +KGK G  Y  H+  CL
Subjt:  PTGKIEPVKGTPYDFLKPHTVGSRINKL-PKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALCL

Query:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        ETQ +PDAVN  +FP  ++ P + Y+H   FKFS
Subjt:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Q66HG4 Galactose mutarotase2.2e-6943.36Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   L+V   +WG TI +L V D+ GK  DVVLG+  +E Y     YFG++VGRVANRI   +FT+DG  Y L  N   N+LHGG RGF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGSSPQIV-----FSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVV
        +W          +PQ++     FS  S DG+EG+PG+L V   Y+L    +L +   A+A ++ TPVNL  H+Y+NL G  S DI  + + I       V
Subjt:  VWKVTKYQKDGSSPQIV-----FSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVV

Query:  DQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAH
        D+ LIPTG I PV+GT +D  KP  +G  +      G+D N+ L +     + K  A VH   SGR+LE+ T  PGVQFYTGN++   +KGK G VY  H
Subjt:  DQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAH

Query:  AALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        +  CLETQ +PDAVN   FP  ++ P + YNH   FKFS
Subjt:  AALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Q8K157 Galactose mutarotase1.0e-6642.18Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   LSV   +WG TI +L V D+ GK  DVVLG+  +E Y     YFG++VGRVANRI   +FT+ G  Y L  N   N+LHGG  GF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGSSPQIV-----FSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVV
        +W          +PQ++     F   S DG+EG+PG+L V   Y+L    +L +   A+A ++ TPVNL  H+Y+NL G  S +I  + + I       V
Subjt:  VWKVTKYQKDGSSPQIV-----FSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVV

Query:  DQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAH
        D+ LIPTG I PV+GT +D  KP  +G+ +      G+D N+ L +     + K  A V    SGR+LE+ T  PGVQFYTGN++   +KGK G VY  H
Subjt:  DQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAH

Query:  AALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        + LCLETQ +PD+VN   FP  ++ P + YNH   FKFS
Subjt:  AALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Q96C23 Galactose mutarotase8.5e-6641.92Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   L V   +WG TI +L V D+ G+  DVVLG+  +E Y     YFG+++GRVANRI    F +DG  Y L  N+  N+LHGG RGF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLI
        +W   +   +G    + FS  S DG+EG+PG+L V   Y+L    +L +   A+A ++ TPVNL  H+Y+NL G  S +I  + + I       VD+ LI
Subjt:  VWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLI

Query:  PTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALCL
        PTG++ PV+GT +D  KP  +G  +      G+D N+ L    G  +    A VH   SGR+LE+ T  PGVQFYTGN++   +KGK G VY  H+  CL
Subjt:  PTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALCL

Query:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        ETQ +PDAVN   FP  ++ P + Y+H   FKFS
Subjt:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Q9GKX6 Galactose mutarotase1.4e-6843.11Show/hide
Query:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV
        G +  F+L+   L V   +WG TI +L V D+ G+  DVVLG+  ++EY     YFG++VGRVANRI    FTLDG  YKL  N G N+LHGG RGF  V
Subjt:  GEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDV

Query:  VWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLI
        +W   +   +G    I FS  S DG+EG+PG+L V   Y+L    +L +   A+A ++ TPVNL  H+Y+NL G  S +I  + + I       VD+ LI
Subjt:  VWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLI

Query:  PTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALCL
        PTG+I PV+GT +D  KP  +G  + +    G+D N+ L       + +  A VH   SGR+LE+ T  PG+QFYTGN++   +KGK G VY  H+  CL
Subjt:  PTGKIEPVKGTPYDFLKPHTVGSRINKLP-KGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIK-DVKGKGGFVYQAHAALCL

Query:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        ETQ +P+AVN  +FP  ++ P + YNH   F FS
Subjt:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

Arabidopsis top hitse value%identityAlignment
AT3G17940.1 Galactose mutarotase-like superfamily protein4.4e-10253.41Show/hide
Query:  EKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTS-YFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRG
        + K    IFEL  G + VK +N+G TI SL VPDK+GKL DVVLG+DS++ Y    + YFG IVGRVANRI   KF+L+GV Y L  N+  N+LHGG +G
Subjt:  EKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTS-YFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRG

Query:  FSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVD
        F   +W+V  +++DG  P I F Y S DG+EG+PG + VTA Y+L +   ++L M A   NK TP+NLAQHTYWNL GH+SG+IL +++QI+GS IT VD
Subjt:  FSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVD

Query:  QYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYK-LKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAA
        +Y +PTG+I PVKGTP+DF +   +G  I ++  GYD NY LD    E + LK AA + D  S R+L + TN PG+QFYTGNY+  V GKG  VY  HA 
Subjt:  QYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYK-LKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAA

Query:  LCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        +CLETQGFP+A+N  NFPS +V   + YNH MLF+FS
Subjt:  LCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink).1.9e-13761.56Show/hide
Query:  MAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MA+ ++SS S +    ++   ++    E L+L  L+L    +      K +   R+    +AS+  M+ +  +K VWIW+ C++VMT AVERGW+TFIFS
Subjt:  MAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE
          N +L++EWSSIAL+  LFI+E  V DG G   A+V EVS P++L  L   N   + +V+D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA+ FLE
Subjt:  PHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCSGRGNE--KKAIPVTSLKVG
        GPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YSI+LQNAETVALV   + N   + A+PVTSLK G
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCSGRGNE--KKAIPVTSLKVG

Query:  DEVFLRLQGEARHTGIEIQEFIVE
        D+V +RLQG ARHTGIEIQEFIVE
Subjt:  DEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).3.8e-13864.02Show/hide
Query:  TYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIHPLFI
        +Y  T E L+L  L+L    +      K +   R+    +AS+  M+ +  +K VWIW+ C++VMT AVERGW+TFIFS  N +L++EWSSIAL+  LFI
Subjt:  TYHKTPENLKLRPLILRDFGEAYAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIHPLFI

Query:  KENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQ
        +E  V DG G   A+V EVS P++L  L   N   + +V+D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA+ FLEALEHGLGG+ILK ED +AV  
Subjt:  KENGVFDGEGRPTATVVEVSNPQQLEQLQPANASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQ

Query:  LKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELR
        LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR
Subjt:  LKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELR

Query:  AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCSGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEF
         G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YSI+LQNAETVALV   + N   + A+PVTSLK GD+V +RLQG ARHTGIEIQEF
Subjt:  AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCSGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEF

Query:  IVE
        IVE
Subjt:  IVE

AT3G47800.1 Galactose mutarotase-like superfamily protein6.3e-11759.52Show/hide
Query:  EIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV
        +I  ++L RG LSV FTN+GA + SLL+PD+HGK DDVVLG+D+++ Y+NDT+YFG+IVGRVANRIGGAKF L+G LYK   NEG NTLHGG++GFSDV+
Subjt:  EIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVV

Query:  WKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLIP
        W V KY     +  I F+Y SFDG+EGFPG++ V   Y LI  N+L + M AK LNKPTP+NLA HTYWNL  HNSG+IL +++Q+   +IT VD  LIP
Subjt:  WKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQYLIP

Query:  TGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ
        TG+I  + GTPYDFL+P  +GSRI++LP GYDINY +D   G++ L+K AVV ++ +GR +E+ TN PGVQFYT N +K V GKG  VY+ +  LCLETQ
Subjt:  TGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCLETQ

Query:  GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        GFPD+VNH NFPS IV P + Y H+MLF+F+
Subjt:  GFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS

AT5G15140.1 Galactose mutarotase-like superfamily protein3.7e-11759.88Show/hide
Query:  KKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFS
        +K +IG++ELK+G+L+VKFTNWGA+I+SL  PDK+GK+DD+VLGYDS++ Y+ D  YFG+ VGRVANRIG  KF L+G  YK   N+G NTLHGG +GF 
Subjt:  KKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVANRIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFS

Query:  DVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQY
        DVVW V K+Q DG  P IVF++ S DGD+GFPG+L VT  Y L+ +N+L + M AK  +K TPVNLA H+YWNLGGHNSGDIL   +QI GS  T VD  
Subjt:  DVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGHNSGDILLNRLQIFGSRITVVDQY

Query:  LIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCL
        LIPTGKI PVKGT YDFL+   +   +  L  GYDINY LD      K++K   + DKKSGR +E+S N  G+QFYTG  +KDVKGK G VYQA   LCL
Subjt:  LIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGKGGFVYQAHAALCL

Query:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS
        ETQ +PDA+NH  FPS IV P K Y H MLFKFS
Subjt:  ETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCGTTGAAGCAGATATGGAGGAAGGCAAAATCTCTTCACTCTTGGGCAAATATGGCCAAATTTTTCCTTGCTTTGCTCTGTCTTATTGCTTTAGCTGCTTTTGG
GTTTGCTAATGGCTATGAGAAGAAGGGGGAGATTGGGATTTTTGAGCTCAAGAGAGGTGACCTTTCAGTGAAATTTACTAACTGGGGTGCCACTATTGTGTCTCTTCTTG
TCCCAGACAAGCATGGGAAGTTGGATGATGTTGTTCTTGGGTATGATTCCATTGAGGAGTATCAGAATGATACATCGTATTTTGGTTCGATTGTTGGAAGAGTTGCTAAC
CGAATCGGTGGTGCGAAATTTACTCTGGATGGAGTTCTATACAAGCTAATTGCTAATGAAGGCAACAACACACTTCATGGTGGCACTAGAGGCTTCAGTGATGTTGTCTG
GAAAGTGACCAAATATCAGAAAGATGGTAGCTCTCCTCAAATCGTATTTTCCTACCGCAGTTTCGACGGCGACGAAGGTTTTCCTGGTGATCTCTTGGTGACTGCCAAAT
ACTCACTCATTGCAAACAACCAACTGAAGTTAACAATGAATGCCAAAGCTCTAAACAAGCCTACTCCTGTAAATTTAGCTCAACACACCTACTGGAATCTTGGTGGACAT
AACAGTGGTGATATTCTATTGAATCGTCTTCAGATCTTTGGATCCCGCATCACTGTCGTCGATCAATATCTCATTCCTACAGGAAAGATAGAACCTGTCAAAGGAACTCC
ATACGATTTCCTCAAGCCCCACACGGTTGGAAGCAGAATAAACAAGCTACCAAAAGGCTATGACATAAACTATGCTCTCGATGATGGCACCGGGGAATATAAACTGAAGA
AAGCAGCAGTCGTGCACGACAAGAAGTCGGGAAGAATGTTGGAGATATCGACAAATGCTCCCGGTGTGCAGTTCTATACAGGAAACTATATAAAGGATGTAAAAGGAAAG
GGAGGATTTGTGTACCAAGCTCATGCTGCGCTTTGTTTGGAGACTCAAGGCTTTCCTGATGCAGTGAATCACCACAATTTTCCTTCAACCATTGTAACTCCTAAGAAGCC
TTACAATCACATTATGCTGTTTAAGTTCTCAACTAAAGGGCCATTGGGCTTTCAAAGCGACGAGCGAAAGAGAGTTGGCAAGAAGAAAACGATGGCCATGGCCGTGCTCT
CTTCCTCCTCGCCTGTTTCTCCATTTCTTTCCAAACAGCGCATCACCTACCACAAAACACCAGAGAATTTAAAACTCCGGCCCCTAATTTTGAGGGATTTTGGTGAAGCC
TATGCTGGTGAATGTAAATCCTCGAATGTGAGTCGTTTACAGTGTTCTTACGCTGCCTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGAAGGGGGTATGGATTTGGAG
TGAGTGTCAGCAGGTTATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTGATGAATGGTCGTCAATTGCACTAATAC
ATCCTCTTTTTATTAAAGAGAATGGAGTTTTTGATGGTGAGGGTAGACCAACTGCCACAGTTGTTGAGGTCTCTAACCCCCAGCAGTTGGAGCAGCTCCAACCAGCAAAT
GCATCTGCAGACATTGTTGTTGTTGATTTACAAGACTGGCAGATAATACCTGCAGAGAATATTGTTGCAGCATTTCAGGGGAGTCAGAAAACAGTGTTTGCCGTCTCGAA
AACTCCTATTGAAGCTCAAACCTTCCTTGAGGCACTTGAACATGGTCTGGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGACTATTTTG
ACAGAAGAAATGAAGCTAGCAATCTTTTAAGCTTGACTAAGGCTACTATTACTCAAATTCACGTTGCTGGAATGGGAGATCGAGTTTGCGTCGATCTCTGTAGTCTCATG
AGACCCGGCGAAGGGCTTCTTGTCGGGTCCTATGCTAGAGGCCTGTTTCTTGTTCACTCGGAATGCTTAGAATCAAATTATATTGCTAGCCGGCCATTTCGTGTCAATGC
TGGACCTGTCCATGCCTACGTAGCTGTTCCAGGAGGGAAAACTAGCTACCTTTCTGAGTTACGAGCAGGGAAAGAGGTAATTGTAGTTGATCAAGAAGGCAGACAGCGAA
CCGCTATTGTTGGACGTGTAAAGATTGAAACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAGACTCCTTACAGCATCCTTCTGCAGAATGCGGAA
ACGGTCGCATTAGTCTGCTCGGGACGAGGAAATGAGAAGAAAGCCATCCCAGTTACCTCGCTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGACA
TACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCGTTGAAGCAGATATGGAGGAAGGCAAAATCTCTTCACTCTTGGGCAAATATGGCCAAATTTTTCCTTGCTTTGCTCTGTCTTATTGCTTTAGCTGCTTTTGG
GTTTGCTAATGGCTATGAGAAGAAGGGGGAGATTGGGATTTTTGAGCTCAAGAGAGGTGACCTTTCAGTGAAATTTACTAACTGGGGTGCCACTATTGTGTCTCTTCTTG
TCCCAGACAAGCATGGGAAGTTGGATGATGTTGTTCTTGGGTATGATTCCATTGAGGAGTATCAGAATGATACATCGTATTTTGGTTCGATTGTTGGAAGAGTTGCTAAC
CGAATCGGTGGTGCGAAATTTACTCTGGATGGAGTTCTATACAAGCTAATTGCTAATGAAGGCAACAACACACTTCATGGTGGCACTAGAGGCTTCAGTGATGTTGTCTG
GAAAGTGACCAAATATCAGAAAGATGGTAGCTCTCCTCAAATCGTATTTTCCTACCGCAGTTTCGACGGCGACGAAGGTTTTCCTGGTGATCTCTTGGTGACTGCCAAAT
ACTCACTCATTGCAAACAACCAACTGAAGTTAACAATGAATGCCAAAGCTCTAAACAAGCCTACTCCTGTAAATTTAGCTCAACACACCTACTGGAATCTTGGTGGACAT
AACAGTGGTGATATTCTATTGAATCGTCTTCAGATCTTTGGATCCCGCATCACTGTCGTCGATCAATATCTCATTCCTACAGGAAAGATAGAACCTGTCAAAGGAACTCC
ATACGATTTCCTCAAGCCCCACACGGTTGGAAGCAGAATAAACAAGCTACCAAAAGGCTATGACATAAACTATGCTCTCGATGATGGCACCGGGGAATATAAACTGAAGA
AAGCAGCAGTCGTGCACGACAAGAAGTCGGGAAGAATGTTGGAGATATCGACAAATGCTCCCGGTGTGCAGTTCTATACAGGAAACTATATAAAGGATGTAAAAGGAAAG
GGAGGATTTGTGTACCAAGCTCATGCTGCGCTTTGTTTGGAGACTCAAGGCTTTCCTGATGCAGTGAATCACCACAATTTTCCTTCAACCATTGTAACTCCTAAGAAGCC
TTACAATCACATTATGCTGTTTAAGTTCTCAACTAAAGGGCCATTGGGCTTTCAAAGCGACGAGCGAAAGAGAGTTGGCAAGAAGAAAACGATGGCCATGGCCGTGCTCT
CTTCCTCCTCGCCTGTTTCTCCATTTCTTTCCAAACAGCGCATCACCTACCACAAAACACCAGAGAATTTAAAACTCCGGCCCCTAATTTTGAGGGATTTTGGTGAAGCC
TATGCTGGTGAATGTAAATCCTCGAATGTGAGTCGTTTACAGTGTTCTTACGCTGCCTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGAAGGGGGTATGGATTTGGAG
TGAGTGTCAGCAGGTTATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTGATGAATGGTCGTCAATTGCACTAATAC
ATCCTCTTTTTATTAAAGAGAATGGAGTTTTTGATGGTGAGGGTAGACCAACTGCCACAGTTGTTGAGGTCTCTAACCCCCAGCAGTTGGAGCAGCTCCAACCAGCAAAT
GCATCTGCAGACATTGTTGTTGTTGATTTACAAGACTGGCAGATAATACCTGCAGAGAATATTGTTGCAGCATTTCAGGGGAGTCAGAAAACAGTGTTTGCCGTCTCGAA
AACTCCTATTGAAGCTCAAACCTTCCTTGAGGCACTTGAACATGGTCTGGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGACTATTTTG
ACAGAAGAAATGAAGCTAGCAATCTTTTAAGCTTGACTAAGGCTACTATTACTCAAATTCACGTTGCTGGAATGGGAGATCGAGTTTGCGTCGATCTCTGTAGTCTCATG
AGACCCGGCGAAGGGCTTCTTGTCGGGTCCTATGCTAGAGGCCTGTTTCTTGTTCACTCGGAATGCTTAGAATCAAATTATATTGCTAGCCGGCCATTTCGTGTCAATGC
TGGACCTGTCCATGCCTACGTAGCTGTTCCAGGAGGGAAAACTAGCTACCTTTCTGAGTTACGAGCAGGGAAAGAGGTAATTGTAGTTGATCAAGAAGGCAGACAGCGAA
CCGCTATTGTTGGACGTGTAAAGATTGAAACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAGACTCCTTACAGCATCCTTCTGCAGAATGCGGAA
ACGGTCGCATTAGTCTGCTCGGGACGAGGAAATGAGAAGAAAGCCATCCCAGTTACCTCGCTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGACA
TACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATGATGGTTGATCACCTTTTACCATTTGAATATGTGTATATTTATTTTATCTTTTGAAAACACTAGACTTTTGT
TGGGTTGATACGATAAGTATTTGGTTTTTAATTTGTTTAAAAAAT
Protein sequenceShow/hide protein sequence
MESLKQIWRKAKSLHSWANMAKFFLALLCLIALAAFGFANGYEKKGEIGIFELKRGDLSVKFTNWGATIVSLLVPDKHGKLDDVVLGYDSIEEYQNDTSYFGSIVGRVAN
RIGGAKFTLDGVLYKLIANEGNNTLHGGTRGFSDVVWKVTKYQKDGSSPQIVFSYRSFDGDEGFPGDLLVTAKYSLIANNQLKLTMNAKALNKPTPVNLAQHTYWNLGGH
NSGDILLNRLQIFGSRITVVDQYLIPTGKIEPVKGTPYDFLKPHTVGSRINKLPKGYDINYALDDGTGEYKLKKAAVVHDKKSGRMLEISTNAPGVQFYTGNYIKDVKGK
GGFVYQAHAALCLETQGFPDAVNHHNFPSTIVTPKKPYNHIMLFKFSTKGPLGFQSDERKRVGKKKTMAMAVLSSSSPVSPFLSKQRITYHKTPENLKLRPLILRDFGEA
YAGECKSSNVSRLQCSYAASSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIHPLFIKENGVFDGEGRPTATVVEVSNPQQLEQLQPAN
ASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQTFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLM
RPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAE
TVALVCSGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK