; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G00650 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G00650
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description3-dehydroquinate synthase, putative
Genome locationClcChr10:640201..650424
RNA-Seq ExpressionClc10G00650
SyntenyClc10G00650
Gene Ontology termsGO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0003924 - GTPase activity (molecular function)
GO:0005525 - GTP binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001806 - Small GTPase
IPR005225 - Small GTP-binding protein domain
IPR016037 - 3-dehydroquinate synthase AroB
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR030960 - 3-dehydroquinate synthase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF1873386.1 hypothetical protein Lal_00027424 [Lupinus albus]8.5e-27979.9Show/hide
Query:  VEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGN
        + FN    S  + R+      +SS+Q+MD    KT    PT V VDLGDRSYPIYIGSGLL+QP +LQRHVHGKRVL+VTN TVAPLYLDKV +ALT GN
Subjt:  VEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGN

Query:  PNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQ
         NVSVESV+LPDGE+YKDMDTLMKVFDKAIESRLDRR TFVALGGGVIGDMCG+AAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINH LGKNLIGAFYQ
Subjt:  PNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQ

Query:  PQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGY
        PQCV++DT+TLNTLP+RELASGFAEVIKYGLIRDA+FFEWQEKN+ +LMARDP ALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETG GY
Subjt:  PQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGY

Query:  GQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRS
        GQWLHGEAVA GTVMAVDMSYRLGWIDD+IV RV  ILKQAKLPT PPE++T++MFKS+MAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALD+TL +
Subjt:  GQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRS

Query:  FCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSY
        F + S+         Q T   S+ R        C  V D          +        DYVPTVFDNFSANVVVNGS VNLGLWDTAGQEDYNRLRPLSY
Subjt:  FCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSY

Query:  RGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIR
        RGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPI+LVGTKLDLRDDKQF IDHPGAVPI+ AQGEELRKLI APAYIECSSKTQ+NVKGVFDAAIR
Subjt:  RGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIR

Query:  VVLQPPKQKKKKSKAQKACSIL
        VVLQPPKQKKKK+KAQKACSIL
Subjt:  VVLQPPKQKKKKSKAQKACSIL

KAF3655633.1 3-dehydroquinate synthase, chloroplastic [Capsicum annuum]6.9e-27373.8Show/hide
Query:  LSSRSTRGKIL-ASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQ-----------------------------------------
        + S++ R K+L AS+A++MDQS  K    APT+VEVDLG+RSYPIYIGSGLLD+P++LQ                                         
Subjt:  LSSRSTRGKIL-ASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQ-----------------------------------------

Query:  ---RHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNF
           RH+HGKRVL+VTN  VAPLYLDK   ALT GNPNV+VESV+LPDGE++K+M+TLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAAS+LRGVNF
Subjt:  ---RHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNF

Query:  IQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCE
        IQIPTTVMAQVDSSVGGKTGINH LGKN+IGAFYQPQCV++DT+TLNTLPDRELASG AEVIKYGLIRDAEFFEWQE+NMP L+AR+PAA  YAIKRSCE
Subjt:  IQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCE

Query:  NKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKK
        NKAEVVS DEKESGLRATLNLGHTFGHA+ETGFGYGQWLHGEAVA GTVMAVD+S RLGWIDD++V RV  IL+QAKLPT+PPE+MTVEMFKSIMAVDKK
Subjt:  NKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKK

Query:  VADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTP-----------HSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERF
        VADG LRLILLKGPLG+CVFTGDYD+KALDETLR+F   ++      +  QG                 RR+  GFL                   NE F
Subjt:  VADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTP-----------HSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERF

Query:  KVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFF
        K   DYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFF
Subjt:  KVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFF

Query:  IDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        +DHPGAVPI+TAQGEELRK IGAP+YIECSSKTQQNVK VFDAAI+VVLQPPKQKKKK K+QKACSIL
Subjt:  IDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

KAF4377639.1 hypothetical protein G4B88_006919 [Cannabis sativa]3.0e-26873.7Show/hide
Query:  MATVPNSLCLSPATDLIVKSPLFKSAAVGCFFPIRNRNSVHLRSSS---TGVEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGD
        MA+  +  C S +T   + +P   S++   F  +   NS+ LRSSS   + +E  R   S L SRST  +I AS AQ+MDQS     S APTIVEVDLG+
Subjt:  MATVPNSLCLSPATDLIVKSPLFKSAAVGCFFPIRNRNSVHLRSSS---TGVEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGD

Query:  RSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIG
        RSYPIYIGSGLLDQPE+LQR                             GNPNV+VESV+LPDGE+YK+M+TLMKVFDKAIES+LDR+CTFVALGGGVIG
Subjt:  RSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIG

Query:  DMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLM
        DMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINH LGKNLIGAFYQPQCV++DT+TLNTLP+RELASG AEVIKYGLIRDAEFFEWQEKNM +L+
Subjt:  DMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLM

Query:  ARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPE
        ARDP A++YAIKRSCENKAEVVS DEKESGLRATLNLGHTFGHAIETG GYG+WLHGEAVA GTVMAVDMSYRLGWID+++V RV  IL+QAKLP APPE
Subjt:  ARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPE

Query:  SMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFS
         MTVEMFKS+MAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKAL+ETL +F K                   + ++S+     C  V D          
Subjt:  SMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFS

Query:  NNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRD
        +        DYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPI+LVGTKLDLRD
Subjt:  NNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRD

Query:  DKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        DKQFFIDHPGAVPI+TAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
Subjt:  DKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

KAF7136338.1 hypothetical protein RHSIM_Rhsim08G0032100 [Rhododendron simsii]1.6e-28281.53Show/hide
Query:  ASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDT
        +S+AQ+M+QS  +  SS PTIV+VDLG+RSYPIYIGSGLLDQP++LQRH+HGKRVL+VTN T+AP+YLDKV  A+T  NPNV VESV+LPDGEKYK+MD 
Subjt:  ASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDT

Query:  LMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELAS
        LMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYA+ASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCV++DTNTL+TLP+RELAS
Subjt:  LMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELAS

Query:  GFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSY
        GFAEVIKYGLIRDAEFFEWQEKNM +LM+RDP ALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETG+GYG WLHGEAVA G VMA+DMSY
Subjt:  GFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSY

Query:  RLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPH
        RLGWID++I+ R   ILKQAKLPTAPP++MTVEMFKS+MAVDKKVADGLLRLILLKGPLGNCVFTG+YDRKALDETL +FCK                  
Subjt:  RLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPH

Query:  SSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENV
           ++S+     C  V D          +        DYVPTVFDNFSANVVVNG+TVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENV
Subjt:  SSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENV

Query:  SKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSI
        SKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAP+YIECSSKTQ NVK VFDAAI+VVLQPPKQKKKKSKAQKACSI
Subjt:  SKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSI

Query:  L
        L
Subjt:  L

RXH74993.1 hypothetical protein DVH24_029714 [Malus domestica]2.9e-28778.97Show/hide
Query:  IRNRNSVHLRS---SSTGVEFNRNGFSGLSSRSTRG--KILASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVT
        +R+ N+  LR+   S++ +E  R   S L S + R   +I+ASSAQ++DQ + KT   APT+V+VDLG+RSYPIYIGSGLLDQPE+LQRHVHGKRVL+VT
Subjt:  IRNRNSVHLRS---SSTGVEFNRNGFSGLSSRSTRG--KILASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVT

Query:  NETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSV
        N  VAPLYLDKV EALT  NPNVSVESV+LPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSV
Subjt:  NETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSV

Query:  GGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGL
        GGKTGINH LGKNLIGAFYQPQCV++DT+TLNTLPDRELASG AEVIKYGLIRDA+FFEWQE+N+ +LMARDPAA+AYAIKRSCENKAEVVSLDEKE GL
Subjt:  GGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGL

Query:  RATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPL
        RATLNLGHTFGHAIETG GYG WLHGEAVA G VMAVDMSYRLGWIDD +V R   ILKQAKLP APPES+T+E F+S+MAVDKKVADGLLRLILLKGPL
Subjt:  RATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPL

Query:  GNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVN
        GNCVFTG+YDRKALDETL +FC                    + ++S+     C  V D          +        DYVPTVFDNFSANVVVNGSTVN
Subjt:  GNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVN

Query:  LGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAY
        LGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPI+LVGTKLDLRDDKQFFIDHPGAVPI+TAQGEELRKLIGAPAY
Subjt:  LGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAY

Query:  IECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        IECSSKTQQNVKGVFDAAIRVVLQPPKQKKKK K QKACSIL
Subjt:  IECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

TrEMBL top hitse value%identityAlignment
A0A3Q7F7L8 DHQ_synthase domain-containing protein7.2e-27677.24Show/hide
Query:  GLSSR-STRGKILASSA-QIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVES
        GL S+ +TR K+LA+SA ++MD S+ K  S APT+VEVDLG RSYPIYIG+GLLDQP++LQRH+HGKRVL+VTN TVAPLYLDK   ALT GNPNV+VES
Subjt:  GLSSR-STRGKILASSA-QIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVES

Query:  VVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVD
        V+LPDGE++K+M+TLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAAS+LRGVNFIQIPTTVMAQVDSSVGGKTGINH LGKN+IGAFYQPQCV++D
Subjt:  VVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVD

Query:  TNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGE
        T+TLNTLPDRELASG AEVIKYGLIRDAEFFEWQE+NMP L+ARDP A  YAIKRSCENKA+VVS DEKESG+RATLNLGHTFGHA+ETG GYGQWLHGE
Subjt:  TNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGE

Query:  AVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSID
        AVA GTVMAVDMS RLGWIDD++V RV  IL+QAKLPT+PPE+MTVEMFKSIMAVDKKVADG LRLILLKG LGNCVFTGDYD+KALDETL+      + 
Subjt:  AVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSID

Query:  VRPPQLPEQGTTPHSSRRLSSGFLSF----------------------CPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWD
        +   Q  ++    H S   S+GF S                       C  V D          +        DYVPTVFDNFSANVVVNGSTVNLGLWD
Subjt:  VRPPQLPEQGTTPHSSRRLSSGFLSF----------------------CPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWD

Query:  TAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSS
        TAGQEDYNRLRPLSYRGADVFILAFSLISKASYENV+KKWIPELKHYAPGVPIVLVGTKLDLRDDKQFF+DHPGAVPI+TAQGEELRK IGAPAYIECSS
Subjt:  TAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSS

Query:  KTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        KTQQNVK VFDAAI+VVLQPPKQKKKK K+QKACSIL
Subjt:  KTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

A0A498I1K6 DHQ_synthase domain-containing protein1.4e-28778.97Show/hide
Query:  IRNRNSVHLRS---SSTGVEFNRNGFSGLSSRSTRG--KILASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVT
        +R+ N+  LR+   S++ +E  R   S L S + R   +I+ASSAQ++DQ + KT   APT+V+VDLG+RSYPIYIGSGLLDQPE+LQRHVHGKRVL+VT
Subjt:  IRNRNSVHLRS---SSTGVEFNRNGFSGLSSRSTRG--KILASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVT

Query:  NETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSV
        N  VAPLYLDKV EALT  NPNVSVESV+LPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSV
Subjt:  NETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSV

Query:  GGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGL
        GGKTGINH LGKNLIGAFYQPQCV++DT+TLNTLPDRELASG AEVIKYGLIRDA+FFEWQE+N+ +LMARDPAA+AYAIKRSCENKAEVVSLDEKE GL
Subjt:  GGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGL

Query:  RATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPL
        RATLNLGHTFGHAIETG GYG WLHGEAVA G VMAVDMSYRLGWIDD +V R   ILKQAKLP APPES+T+E F+S+MAVDKKVADGLLRLILLKGPL
Subjt:  RATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPL

Query:  GNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVN
        GNCVFTG+YDRKALDETL +FC                    + ++S+     C  V D          +        DYVPTVFDNFSANVVVNGSTVN
Subjt:  GNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVN

Query:  LGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAY
        LGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPI+LVGTKLDLRDDKQFFIDHPGAVPI+TAQGEELRKLIGAPAY
Subjt:  LGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAY

Query:  IECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        IECSSKTQQNVKGVFDAAIRVVLQPPKQKKKK K QKACSIL
Subjt:  IECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

A0A6A5MGW1 DHQ_synthase domain-containing protein4.1e-27979.9Show/hide
Query:  VEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGN
        + FN    S  + R+      +SS+Q+MD    KT    PT V VDLGDRSYPIYIGSGLL+QP +LQRHVHGKRVL+VTN TVAPLYLDKV +ALT GN
Subjt:  VEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGN

Query:  PNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQ
         NVSVESV+LPDGE+YKDMDTLMKVFDKAIESRLDRR TFVALGGGVIGDMCG+AAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINH LGKNLIGAFYQ
Subjt:  PNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQ

Query:  PQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGY
        PQCV++DT+TLNTLP+RELASGFAEVIKYGLIRDA+FFEWQEKN+ +LMARDP ALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETG GY
Subjt:  PQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGY

Query:  GQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRS
        GQWLHGEAVA GTVMAVDMSYRLGWIDD+IV RV  ILKQAKLPT PPE++T++MFKS+MAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALD+TL +
Subjt:  GQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRS

Query:  FCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSY
        F + S+         Q T   S+ R        C  V D          +        DYVPTVFDNFSANVVVNGS VNLGLWDTAGQEDYNRLRPLSY
Subjt:  FCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSY

Query:  RGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIR
        RGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPI+LVGTKLDLRDDKQF IDHPGAVPI+ AQGEELRKLI APAYIECSSKTQ+NVKGVFDAAIR
Subjt:  RGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIR

Query:  VVLQPPKQKKKKSKAQKACSIL
        VVLQPPKQKKKK+KAQKACSIL
Subjt:  VVLQPPKQKKKKSKAQKACSIL

A0A7J6G6E0 DHQ_synthase domain-containing protein1.5e-26873.7Show/hide
Query:  MATVPNSLCLSPATDLIVKSPLFKSAAVGCFFPIRNRNSVHLRSSS---TGVEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGD
        MA+  +  C S +T   + +P   S++   F  +   NS+ LRSSS   + +E  R   S L SRST  +I AS AQ+MDQS     S APTIVEVDLG+
Subjt:  MATVPNSLCLSPATDLIVKSPLFKSAAVGCFFPIRNRNSVHLRSSS---TGVEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGD

Query:  RSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIG
        RSYPIYIGSGLLDQPE+LQR                             GNPNV+VESV+LPDGE+YK+M+TLMKVFDKAIES+LDR+CTFVALGGGVIG
Subjt:  RSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIG

Query:  DMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLM
        DMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINH LGKNLIGAFYQPQCV++DT+TLNTLP+RELASG AEVIKYGLIRDAEFFEWQEKNM +L+
Subjt:  DMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLM

Query:  ARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPE
        ARDP A++YAIKRSCENKAEVVS DEKESGLRATLNLGHTFGHAIETG GYG+WLHGEAVA GTVMAVDMSYRLGWID+++V RV  IL+QAKLP APPE
Subjt:  ARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPE

Query:  SMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFS
         MTVEMFKS+MAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKAL+ETL +F K                   + ++S+     C  V D          
Subjt:  SMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFS

Query:  NNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRD
        +        DYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPI+LVGTKLDLRD
Subjt:  NNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRD

Query:  DKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        DKQFFIDHPGAVPI+TAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
Subjt:  DKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

A0A803NPE2 Uncharacterized protein2.0e-27875.78Show/hide
Query:  MATVPNSLCLSPATDLIVKSPLFKSAAVGCFFPIRNRNSVHLRSSS---TGVEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGD
        MA+  +  C S +T   + +P   S++   F  +   NS+ LRSSS   + +E  R   S L SRST  +I AS AQ+MDQS     S APTIVEVDLG+
Subjt:  MATVPNSLCLSPATDLIVKSPLFKSAAVGCFFPIRNRNSVHLRSSS---TGVEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGD

Query:  RSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIG
        RSYPIYIGSGLLDQPE+LQRHVHGK+VL+VTN TVAPLYL+KV +ALT GNPNV+VESV+LPDGE+YK+M+TLMKVFDKAIES+LDR+CTFVALGGGVIG
Subjt:  RSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIG

Query:  DMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLM
        DMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINH LGKNLIGAFYQPQCVV+DT+TLNTLP+RELASG AEVIKYGLIRDAEFFEWQEKNM +L+
Subjt:  DMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLM

Query:  ARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPE
        ARDP A++YAIKRSCENKAEVVS DEKESGLRATLNLGHTFGHAIETG GYG+WLHGEAVA GTVMAVDMSYRLGWID+++V RV  IL+QAKLP APPE
Subjt:  ARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPE

Query:  SMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFS
         MTVEMFKS+MAVDKKVADGLLRLILLKGPLGNCVFT     K        +     +V P     +     +SR +       C  V D          
Subjt:  SMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFS

Query:  NNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRD
        +        DYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPI+LVGTKLDLRD
Subjt:  NNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRD

Query:  DKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        DKQFFIDHPGAVPI+TAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
Subjt:  DKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

SwissProt top hitse value%identityAlignment
B8GPV3 3-dehydroquinate synthase3.5e-12663.28Show/hide
Query:  VDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALG
        V+LGDRSYPI+IG GLLD P      + GKRV+IVTNETVAPLYLD++   L     +  VE+V+LPDGE+YK M+TL +V+   +E+R DR+ T VALG
Subjt:  VDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALG

Query:  GGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKN
        GGVIGD+ G+AAAS+ RGV+FIQ+PTT+++QVDSSVGGKTG+NH LGKN+IGAF+QP+CVV+DT+TL+TLPDREL +G AEVIKYGLI D  FF+W E N
Subjt:  GGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKN

Query:  MPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLP
        M  L+ARDP ALAYAI RSC +KA+VV+ DE+E G RA LNLGHTFGHAIETG GYG WLHGE VA G VMA  MS RLGW+D   + R  A++ +A LP
Subjt:  MPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLP

Query:  TAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETL
          PP  +T E F  +M+VDKKV DG LRL+LL+G +G  V T D+D  ALD TL
Subjt:  TAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETL

Q31DP9 3-dehydroquinate synthase2.1e-12361.3Show/hide
Query:  VDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALG
        V+LG+RSYPI+IG GLL Q E+ + +V G +VLIV+N TVAPLYL+K  +A +       V++V+LPDGE+YK++D L ++FD AIE+R DR+CTFVALG
Subjt:  VDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALG

Query:  GGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKN
        GGVIGDM G+AAAS+ RGVNFIQIPTT+++QVDSSVGGKTG+NH  GKN+IGAF+QP+CVV+DT+TLNTL DREL++G AEVIKYGLI D  FFEW E+N
Subjt:  GGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKN

Query:  MPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLP
        +P L+ARDP  LA AI+RSC+NKA +V+ DEKE+GLRA  NLGHTFGHAIE G GYG WLHGE V+ G + AV +S  +G +  T   R+ AIL+ A LP
Subjt:  MPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLP

Query:  TAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETL
          PP+ M+V+ F  +MA DKKV  G +RL+LLK  +G    TGDY  + L+ TL
Subjt:  TAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETL

Q8RU74 3-dehydroquinate synthase, chloroplastic2.6e-19083.08Show/hide
Query:  GLSSR-STRGKILASSA-QIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVES
        GL S+ +TR K+LA+SA ++MD S+ K  S APT+VEVDLG RSYPIYIG+GLLDQP++LQRH+HGKRVL+VTN TVAPLYLDK   ALT GNPNV+VES
Subjt:  GLSSR-STRGKILASSA-QIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVES

Query:  VVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVD
        V+LPDGE++K+M+TLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAAS+LRGVNFIQIPTTVMAQVDSSVGGKTGINH LGKN+IGAFYQPQCV++D
Subjt:  VVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVD

Query:  TNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGE
        T+TLNTLPDRELASG AEVIKYGLIRDAEFFEWQE+NMP L+ARDP A  YAIKRSCENKA+VVS DEKESG+RATLNLGHTFGHA+ETG GYGQWLHGE
Subjt:  TNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGE

Query:  AVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK
        AVA GTVMAVDMS RLGWIDD++V RV  IL+QAKLPT+PPE+MTVEMFKSIMAVDKKVADG LRLILLKG LGNCVFTGDYD+KALDETLR+F K
Subjt:  AVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK

Q8VYV7 3-dehydroquinate synthase, chloroplastic1.2e-19083.97Show/hide
Query:  SSRSTRGKILASSAQIMDQSAIKTDS-SAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVL
        S+  +R ++ A ++Q+M++      S S+PT+VEVDLGDRSYPIYIG+GLLD  E+LQRHVHGKRVL+VTN+ VAPLYLDK  +ALT GNPNV+VESV+L
Subjt:  SSRSTRGKILASSAQIMDQSAIKTDS-SAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVL

Query:  PDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNT
        PDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAAS+LRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCV+VDT+T
Subjt:  PDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNT

Query:  LNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVA
        LNTLPDRE+ASG AEVIKYGLIRDAEFFEWQEKN+ +L+ARDPAALA+AIKRSCENKA+VVS DEKESGLRATLNLGHTFGHAIETGFGYG+WLHGEAVA
Subjt:  LNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVA

Query:  VGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK
         GTVMAVDMSYRLGWID++IV RV  IL +AKLPT PPESMTV MFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDR+ALD TLR+F K
Subjt:  VGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK

U3KRF2 3-dehydroquinate synthase, chloroplastic7.4e-20180.13Show/hide
Query:  LSPATDLIVKSPLFKSAAVGCFFPIRNRNSVHLRSSSTGVEFNRNGFSGLSSRSTRGKI------LASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIY
        LSP+T    +  L K+         RN +S+ LR SS            LSS S  G+        +S+A +MD S  K  SSAPTIV+VDLGDRSYPIY
Subjt:  LSPATDLIVKSPLFKSAAVGCFFPIRNRNSVHLRSSSTGVEFNRNGFSGLSSRSTRGKI------LASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIY

Query:  IGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYA
        IGSGLLDQP++LQRHVHGKRVL+VTN TVAP+YLDKV  ALT GNPNVSVESV+LPDGEKYK+MDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYA
Subjt:  IGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYA

Query:  AASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAA
        AASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCV++DT+TLNTLPDRELASG AEV+KYGLIRDA FFEWQEKNMP+LMARDP+A
Subjt:  AASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAA

Query:  LAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEM
        LAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVA G VMAVDMSYRLGWID++IV+R   IL+QAKLPTAPPE+MTVEM
Subjt:  LAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEM

Query:  FKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK
        FKS+MAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETL +FCK
Subjt:  FKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK

Arabidopsis top hitse value%identityAlignment
AT2G17800.1 Arabidopsis RAC-like 12.1e-8997.56Show/hide
Query:  DYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHP
        DYVPTVFDNFSANVVVNG+TVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHP
Subjt:  DYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHP

Query:  GAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        GAVPI+TAQGEEL+KLIGAPAYIECSSKTQ+NVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
Subjt:  GAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

AT2G17800.2 Arabidopsis RAC-like 12.1e-8997.56Show/hide
Query:  DYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHP
        DYVPTVFDNFSANVVVNG+TVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHP
Subjt:  DYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHP

Query:  GAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        GAVPI+TAQGEEL+KLIGAPAYIECSSKTQ+NVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
Subjt:  GAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

AT3G51290.2 Protein of unknown function (DUF630) ;Protein of unknown function (DUF632)4.6e-8984.54Show/hide
Query:  GFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPE
        GF S C Q  +       +   ++     +DYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPE
Subjt:  GFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRGADVFILAFSLISKASYENVSKKWIPE

Query:  LKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL
        LKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPI+TAQGEELRK IGAP YIECSSKTQ+NVK VFDAAIRVVLQPPKQKKKKSKAQKACSIL
Subjt:  LKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKKSKAQKACSIL

AT5G66120.1 3-dehydroquinate synthase, putative7.1e-17589.25Show/hide
Query:  RHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQI
        RHVHGKRVL+VTN+ VAPLYLDK  +ALT GNPNV+VESV+LPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAAS+LRGVNFIQI
Subjt:  RHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQI

Query:  PTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKA
        PTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCV+VDT+TLNTLPDRE+ASG AEVIKYGLIRDAEFFEWQEKN+ +L+ARDPAALA+AIKRSCENKA
Subjt:  PTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKA

Query:  EVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVAD
        +VVS DEKESGLRATLNLGHTFGHAIETGFGYG+WLHGEAVA GTVMAVDMSYRLGWID++IV RV  IL +AKLPT PPESMTV MFKSIMAVDKKVAD
Subjt:  EVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVAD

Query:  GLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK
        GLLRLILLKGPLGNCVFTGDYDR+ALD TLR+F K
Subjt:  GLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK

AT5G66120.2 3-dehydroquinate synthase, putative8.4e-19283.97Show/hide
Query:  SSRSTRGKILASSAQIMDQSAIKTDS-SAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVL
        S+  +R ++ A ++Q+M++      S S+PT+VEVDLGDRSYPIYIG+GLLD  E+LQRHVHGKRVL+VTN+ VAPLYLDK  +ALT GNPNV+VESV+L
Subjt:  SSRSTRGKILASSAQIMDQSAIKTDS-SAPTIVEVDLGDRSYPIYIGSGLLDQPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVL

Query:  PDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNT
        PDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAAS+LRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCV+VDT+T
Subjt:  PDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTVMAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNT

Query:  LNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVA
        LNTLPDRE+ASG AEVIKYGLIRDAEFFEWQEKN+ +L+ARDPAALA+AIKRSCENKA+VVS DEKESGLRATLNLGHTFGHAIETGFGYG+WLHGEAVA
Subjt:  LNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRATLNLGHTFGHAIETGFGYGQWLHGEAVA

Query:  VGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK
         GTVMAVDMSYRLGWID++IV RV  IL +AKLPT PPESMTV MFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDR+ALD TLR+F K
Subjt:  VGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRKALDETLRSFCK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCGTCCCCAATTCTCTCTGTCTTTCCCCTGCCACTGACCTTATCGTCAAATCCCCACTCTTCAAATCCGCCGCCGTCGGCTGCTTTTTCCCGATCCGCAATCG
CAATTCGGTTCATCTCCGTTCTTCTTCAACTGGGGTTGAGTTCAATCGAAATGGGTTTTCTGGGTTGAGTTCGAGGTCGACGAGAGGGAAGATTTTGGCGAGTTCTGCTC
AGATTATGGATCAGTCGGCGATTAAAACGGATTCCTCTGCTCCGACGATTGTGGAAGTGGATTTGGGTGACCGGAGCTATCCGATTTATATTGGATCCGGTCTTCTCGAT
CAGCCTGAGATTCTCCAGAGGCATGTTCATGGGAAGAGAGTTCTTATAGTCACCAATGAAACTGTTGCACCATTGTATTTAGATAAAGTTACTGAAGCTTTAACTATTGG
GAATCCAAATGTTTCAGTGGAAAGTGTGGTTTTACCAGATGGTGAGAAGTACAAGGATATGGACACACTCATGAAAGTCTTTGACAAGGCTATTGAGTCGCGGCTCGATC
GGCGATGTACATTTGTGGCCCTTGGAGGTGGTGTCATTGGTGATATGTGTGGTTATGCTGCTGCTTCTTTCCTTCGAGGTGTTAACTTCATTCAGATCCCCACGACTGTT
ATGGCACAAGTGGACTCTTCTGTTGGGGGAAAGACGGGGATAAATCACCGTCTGGGGAAGAACTTGATTGGCGCATTTTATCAACCTCAATGTGTTGTTGTAGATACCAA
TACATTAAACACGCTACCTGATAGGGAATTGGCTTCTGGGTTTGCAGAGGTCATTAAGTATGGGCTAATTAGGGATGCCGAGTTTTTCGAATGGCAGGAAAAGAATATGC
CTTCATTAATGGCAAGGGATCCAGCCGCCTTAGCTTATGCGATAAAACGTTCGTGTGAGAACAAGGCTGAGGTTGTATCATTGGATGAGAAGGAAAGTGGACTAAGGGCA
ACTTTGAACTTGGGCCACACATTCGGACATGCTATAGAAACTGGGTTCGGGTACGGTCAGTGGCTCCATGGAGAAGCTGTTGCAGTTGGCACAGTAATGGCAGTTGACAT
GTCATATCGTCTTGGATGGATTGATGATACAATTGTAAGCAGAGTTCTTGCCATTCTAAAACAGGCAAAGTTGCCTACTGCACCTCCCGAAAGCATGACGGTGGAGATGT
TCAAATCGATAATGGCGGTTGACAAGAAGGTTGCTGACGGGCTACTTAGGCTAATTCTCTTGAAAGGTCCTCTAGGAAACTGTGTTTTCACTGGTGATTACGACAGGAAG
GCATTGGACGAGACGCTTCGTTCATTTTGCAAATGCTCTATTGACGTCCGTCCACCACAACTGCCGGAGCAAGGAACAACTCCACATTCATCACGGCGTCTAAGTAGTGG
GTTTCTTTCTTTTTGTCCTCAGGTTCACGATCTGAGGCCCCCCCCTCTTTCTGCTTTCTCTAACAATGAGCGCTTCAAGGTTCATCAAGATTATGTGCCTACAGTTTTCG
ACAATTTCAGTGCAAATGTGGTTGTTAATGGAAGCACTGTTAACCTAGGGTTATGGGATACAGCTGGACAGGAGGATTATAACCGGCTAAGGCCTTTGAGTTATCGTGGG
GCAGATGTTTTTATATTGGCATTCTCTCTCATAAGCAAGGCCAGCTATGAAAATGTTTCTAAAAAGTGGATTCCAGAGTTGAAGCATTATGCACCAGGAGTACCTATTGT
TCTGGTTGGAACTAAGCTTGATCTTCGAGATGATAAGCAGTTCTTCATTGATCATCCTGGTGCAGTTCCTATTTCAACAGCTCAGGGAGAGGAGCTTAGAAAGCTGATTG
GTGCTCCTGCATACATCGAGTGCAGTTCAAAAACTCAGCAGAATGTGAAGGGAGTTTTTGATGCAGCAATTAGGGTTGTACTTCAACCTCCAAAGCAGAAGAAAAAGAAG
AGCAAAGCTCAGAAAGCGTGCTCGATATTATGA
mRNA sequenceShow/hide mRNA sequence
CGCCACCACTGAAAGCCAACAACGCAAGCCCCTCCTCGATCTTCCCCTTCTTCTTCCTCACAATGCCTCATTTCTCCACCAACTCCTCCCATTTCTCCGCCGCCGACCCA
TTTCCGCCGCCGTAAATTTCTCTCAACACCACCCAGAAATTCCCATTCATGGCCACCGTCCCCAATTCTCTCTGTCTTTCCCCTGCCACTGACCTTATCGTCAAATCCCC
ACTCTTCAAATCCGCCGCCGTCGGCTGCTTTTTCCCGATCCGCAATCGCAATTCGGTTCATCTCCGTTCTTCTTCAACTGGGGTTGAGTTCAATCGAAATGGGTTTTCTG
GGTTGAGTTCGAGGTCGACGAGAGGGAAGATTTTGGCGAGTTCTGCTCAGATTATGGATCAGTCGGCGATTAAAACGGATTCCTCTGCTCCGACGATTGTGGAAGTGGAT
TTGGGTGACCGGAGCTATCCGATTTATATTGGATCCGGTCTTCTCGATCAGCCTGAGATTCTCCAGAGGCATGTTCATGGGAAGAGAGTTCTTATAGTCACCAATGAAAC
TGTTGCACCATTGTATTTAGATAAAGTTACTGAAGCTTTAACTATTGGGAATCCAAATGTTTCAGTGGAAAGTGTGGTTTTACCAGATGGTGAGAAGTACAAGGATATGG
ACACACTCATGAAAGTCTTTGACAAGGCTATTGAGTCGCGGCTCGATCGGCGATGTACATTTGTGGCCCTTGGAGGTGGTGTCATTGGTGATATGTGTGGTTATGCTGCT
GCTTCTTTCCTTCGAGGTGTTAACTTCATTCAGATCCCCACGACTGTTATGGCACAAGTGGACTCTTCTGTTGGGGGAAAGACGGGGATAAATCACCGTCTGGGGAAGAA
CTTGATTGGCGCATTTTATCAACCTCAATGTGTTGTTGTAGATACCAATACATTAAACACGCTACCTGATAGGGAATTGGCTTCTGGGTTTGCAGAGGTCATTAAGTATG
GGCTAATTAGGGATGCCGAGTTTTTCGAATGGCAGGAAAAGAATATGCCTTCATTAATGGCAAGGGATCCAGCCGCCTTAGCTTATGCGATAAAACGTTCGTGTGAGAAC
AAGGCTGAGGTTGTATCATTGGATGAGAAGGAAAGTGGACTAAGGGCAACTTTGAACTTGGGCCACACATTCGGACATGCTATAGAAACTGGGTTCGGGTACGGTCAGTG
GCTCCATGGAGAAGCTGTTGCAGTTGGCACAGTAATGGCAGTTGACATGTCATATCGTCTTGGATGGATTGATGATACAATTGTAAGCAGAGTTCTTGCCATTCTAAAAC
AGGCAAAGTTGCCTACTGCACCTCCCGAAAGCATGACGGTGGAGATGTTCAAATCGATAATGGCGGTTGACAAGAAGGTTGCTGACGGGCTACTTAGGCTAATTCTCTTG
AAAGGTCCTCTAGGAAACTGTGTTTTCACTGGTGATTACGACAGGAAGGCATTGGACGAGACGCTTCGTTCATTTTGCAAATGCTCTATTGACGTCCGTCCACCACAACT
GCCGGAGCAAGGAACAACTCCACATTCATCACGGCGTCTAAGTAGTGGGTTTCTTTCTTTTTGTCCTCAGGTTCACGATCTGAGGCCCCCCCCTCTTTCTGCTTTCTCTA
ACAATGAGCGCTTCAAGGTTCATCAAGATTATGTGCCTACAGTTTTCGACAATTTCAGTGCAAATGTGGTTGTTAATGGAAGCACTGTTAACCTAGGGTTATGGGATACA
GCTGGACAGGAGGATTATAACCGGCTAAGGCCTTTGAGTTATCGTGGGGCAGATGTTTTTATATTGGCATTCTCTCTCATAAGCAAGGCCAGCTATGAAAATGTTTCTAA
AAAGTGGATTCCAGAGTTGAAGCATTATGCACCAGGAGTACCTATTGTTCTGGTTGGAACTAAGCTTGATCTTCGAGATGATAAGCAGTTCTTCATTGATCATCCTGGTG
CAGTTCCTATTTCAACAGCTCAGGGAGAGGAGCTTAGAAAGCTGATTGGTGCTCCTGCATACATCGAGTGCAGTTCAAAAACTCAGCAGAATGTGAAGGGAGTTTTTGAT
GCAGCAATTAGGGTTGTACTTCAACCTCCAAAGCAGAAGAAAAAGAAGAGCAAAGCTCAGAAAGCGTGCTCGATATTATGAGCAAACTTCTAGTTGTGAAAACATGACAC
CACCCTTTGTCTGCCTCATCAACGGGTAGTTTGTTACTGCTTTCCTGGAAATGACGTGGTATCGAATTAAAAGACGACATGTTCCTCACCTTAATTAGCCTTTGGTTTTA
ATTATGTACTAATAGAGATTTGGATACTGGAATGCGACAACGGTTAGGAAGAAGCAATCTGTTTGGAAAGGGCAGTTGGAGTTTGTATGATGTGCTTGAAAAGTGTATGT
ATGGAGGGAAGAAGGGGAAGGATTTGTAATGCAAGTGTGTGCTTCAAAGAGAAACATTGGTGGATTCTTTTATTTATAAATTTGCATTCTGACTTGACACTAGTGGTTTT
GTTTTGTAGCATAGAGCCATAAATAGTTCTTTGGAGTTGTTTCTACTTCAAATGAAGCTGCCAAGCTTGAGTTGCTTGAGTTTTTAAGTTCTA
Protein sequenceShow/hide protein sequence
MATVPNSLCLSPATDLIVKSPLFKSAAVGCFFPIRNRNSVHLRSSSTGVEFNRNGFSGLSSRSTRGKILASSAQIMDQSAIKTDSSAPTIVEVDLGDRSYPIYIGSGLLD
QPEILQRHVHGKRVLIVTNETVAPLYLDKVTEALTIGNPNVSVESVVLPDGEKYKDMDTLMKVFDKAIESRLDRRCTFVALGGGVIGDMCGYAAASFLRGVNFIQIPTTV
MAQVDSSVGGKTGINHRLGKNLIGAFYQPQCVVVDTNTLNTLPDRELASGFAEVIKYGLIRDAEFFEWQEKNMPSLMARDPAALAYAIKRSCENKAEVVSLDEKESGLRA
TLNLGHTFGHAIETGFGYGQWLHGEAVAVGTVMAVDMSYRLGWIDDTIVSRVLAILKQAKLPTAPPESMTVEMFKSIMAVDKKVADGLLRLILLKGPLGNCVFTGDYDRK
ALDETLRSFCKCSIDVRPPQLPEQGTTPHSSRRLSSGFLSFCPQVHDLRPPPLSAFSNNERFKVHQDYVPTVFDNFSANVVVNGSTVNLGLWDTAGQEDYNRLRPLSYRG
ADVFILAFSLISKASYENVSKKWIPELKHYAPGVPIVLVGTKLDLRDDKQFFIDHPGAVPISTAQGEELRKLIGAPAYIECSSKTQQNVKGVFDAAIRVVLQPPKQKKKK
SKAQKACSIL