CuGenDBv2

Spg010834 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg010834
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: 3-dehydroquinate synthase homolog
Genome location: scaffold10:8877476..8885951
RNA-Seq Expression: Spg010834
Synteny: Spg010834
Gene Ontology terms:
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR002812 - 3-dehydroquinate synthase
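The genome location above uses a `scaffold:start..end` notation. As a minimal sketch, the string can be split into its parts and the gene span computed. This assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` is a hypothetical helper name and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold10:8877476..8885951")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under the inclusive-coordinate assumption the gene spans 8476 bp on scaffold10.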


Homology
GenBank top hits (e value, %identity, alignment)
KAG6588380.1 hypothetical protein SDJN03_16945, partial [Cucurbita argyrosperma subsp. sororia] (e value: 8.1e-192; %identity: 71.73)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        A+A +  SP SPL  KQRI  HK PDNL   ALISR FG A  GECKSL+ NRL CS  SSSSSMSPIEASK VWIWSE +QVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW
        N+ELADEWSS                                                                                          
Subjt:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW

Query:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
                  IALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATI  IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALV PGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata] (e value: 1.6e-192; %identity: 71.73)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        A+A +  SP SPL  KQRI  HK PD+L   ALISR FG A  GECKSLE NRL CS  SSSSSMSPIEASK VWIWSE +QVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW
        N+ELADEWSS                                                                                          
Subjt:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW

Query:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
                  IALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

XP_022970870.1 uncharacterized protein LOC111469713 [Cucurbita maxima] (e value: 2.5e-193; %identity: 72.5)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        AMAL+  S  SP   KQRI AH+ PDNL   ALISR FG A  GECKSLE NRL CS ASSSSSMSPIEASK VWIWS  RQVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW
        N+ELADEWSS                                                                                          
Subjt:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW

Query:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
                  IALI PLF+ EDGVFDGEGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo] (e value: 3.0e-194; %identity: 72.64)
Query:  MALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHN
        MAL+  SP S L  KQRI  HK PDNL   ALISR FG A  GECKSLE NRL CS  SSSSSMSPIEASK VWIWSE+RQVMTAAVERGWSTFIFSPHN
Subjt:  MALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHN

Query:  RELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLWL
        +ELADEWSS                                                                                           
Subjt:  RELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLWL

Query:  AGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE
                 IALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEALE
Subjt:  AGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE

Query:  HGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
        HGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
Subjt:  HGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV

Query:  HAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLR
        HAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLR
Subjt:  HAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLR

Query:  LQGEARHTGIEIQEFIVEK
        LQGEARHTGIEIQEFIVEK
Subjt:  LQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida] (e value: 3.0e-194; %identity: 72.66)
Query:  MAMALVC-SSPASPLLSKQRIN-AHKRPDNLA---LISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIF
        M MAL+C SSP SP LSKQRI+  HK P+NL    LISR FGEA AGECKS   +RL+CSYAS  S+MSP EASK VWIWSE +QVMTAAVERGWSTFIF
Subjt:  MAMALVC-SSPASPLLSKQRIN-AHKRPDNLA---LISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIF

Query:  SPHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGER
        SPHN ELADEWSS                                                                                       
Subjt:  SPHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGER

Query:  FLWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL
                     IALI PLF+KE+GVFDGEGRL+A+VVEVSN QQLEQLQP NASAD VVVDL+DWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIFL
Subjt:  FLWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL

Query:  EALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
        EALEHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
Subjt:  EALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN

Query:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDE
        AGPVHAYVAVPGGKTSYLSEL AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK+IPVTSLKVGDE
Subjt:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hits (e value, %identity, alignment)
A0A1S3B8Q7 3-dehydroquinate synthase homolog (e value: 2.5e-186; %identity: 70.17)
Query:  TMAMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFS
        TMA  L  SSP SPLLSKQRI   K P+NL    LISR+FG+A AGECKS + +RL+CSY SSSS MSPIE SK VWIWSE ++VMTAAVERGWSTFIFS
Subjt:  TMAMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFS

Query:  PHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERF
        PHN ELA EW+S                                                                                        
Subjt:  PHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERF

Query:  LWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
                    IA+I PLF+KEDGV DGE RL+A+VVE+SN QQLEQLQP  ASAD VVVDL+DWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  LWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein (e value: 2.5e-186; %identity: 70.17)
Query:  TMAMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFS
        TMA  L  SSP SPLLSKQRI   K P+NL    LISR+FG+A AGECKS + +RL+CSY SSSS MSPIE SK VWIWSE ++VMTAAVERGWSTFIFS
Subjt:  TMAMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFS

Query:  PHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERF
        PHN ELA EW+S                                                                                        
Subjt:  PHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERF

Query:  LWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
                    IA+I PLF+KEDGV DGE RL+A+VVE+SN QQLEQLQP  ASAD VVVDL+DWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  LWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A6J1BVA9 uncharacterized protein LOC111005050 isoform X2 (e value: 1.4e-181; %identity: 68.47)
Query:  MALVCSSPASP-LLSKQRINAHKRPDNLALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRE
        M  +C+SPASP LLSK RI         ALIS  FG+ +AG+CKS+ A  ++CS AS S   +P EASK VW+WSE+RQV+TAAVERGW+TF+FSPHNRE
Subjt:  MALVCSSPASP-LLSKQRINAHKRPDNLALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRE

Query:  LADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLWLAG
        LA +WSS                                                                                          +A 
Subjt:  LADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLWLAG

Query:  VCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHG
        +C+          LF+KEDG+FD EG L+ATV EVSN QQLEQLQPENAS DNVVVDL+DWQIIPAENIVAAFQGS+K VFAVSKTPIEAQIFLEALEHG
Subjt:  VCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHG

Query:  LGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA
        LGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKAT+TQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA
Subjt:  LGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA

Query:  YVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQ
        YVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSD+QTPY ILLQNAETVALVCPGRGNEKKAIPVTSLKVGD+VFLRLQ
Subjt:  YVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQ

Query:  GEARHTGIEIQEFIVEK
        GEARHTGIEIQEFIVEK
Subjt:  GEARHTGIEIQEFIVEK

A0A6J1EKW1 uncharacterized protein LOC111435491 (e value: 7.9e-193; %identity: 71.73)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        A+A +  SP SPL  KQRI  HK PD+L   ALISR FG A  GECKSLE NRL CS  SSSSSMSPIEASK VWIWSE +QVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW
        N+ELADEWSS                                                                                          
Subjt:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW

Query:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
                  IALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

A0A6J1I437 uncharacterized protein LOC111469713 (e value: 1.2e-193; %identity: 72.5)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        AMAL+  S  SP   KQRI AH+ PDNL   ALISR FG A  GECKSLE NRL CS ASSSSSMSPIEASK VWIWS  RQVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW
        N+ELADEWSS                                                                                          
Subjt:  NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLW

Query:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
                  IALI PLF+ EDGVFDGEGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  LAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

SwissProt top hits (e value, %identity, alignment)
A4G0J1 3-dehydroquinate synthase (e value: 4.1e-61; %identity: 47.37)
Query:  DNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVC
        D VV++  DW IIP ENI+A   G +  + +V     +A+   E LE G+ GV+L   D + V       +R N  S  L L  AT+T+I   G GDRVC
Subjt:  DNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVC

Query:  VDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAK
        +D CS+M  GEG+L+GSY+RG+FLVHSE +E+ Y+A+RPFRVNAGPVHAY+  P  KT YLS+L+AG +V+VV++ G  R +I+GRVKIE R L LV+A+
Subjt:  VDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAK

Query:  RDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
         + +       +LQNAET+ LV    G + K + V  LKVG +V ++    ARH G+ I+E IVEK
Subjt:  RDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A5UJ82 3-dehydroquinate synthase (e value: 4.1e-61; %identity: 44.56)
Query:  EGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRR
        EG+ +   VE++++   +      + AD +++   DW +IP ENI+A  Q +   + A       A++ +E LEHG  GVI +  D +   Q+K      
Subjt:  EGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRR

Query:  NEASNL-LSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIV
         +AS +   L  AT+T +   G GDRVCVD   +M+PGEG+L+GSY++ LFLVHSE LES Y+ASRPFRVNAGPV AYV VPG KT YLSEL AG EV++
Subjt:  NEASNL-LSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIV

Query:  VDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        V+ EG  RTA VGR KIE R LIL++A+    E      LLQNAET+ +V      +   + V  +K+GD+V + ++  ARH GI I E I+E+
Subjt:  VDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

O26680 3-dehydroquinate synthase (e value: 3.5e-65; %identity: 48.81)
Query:  GRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFD-RR
        GR +A  VE+ ++   E  +      D +++  +DW+IIP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD      
Subjt:  GRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFD-RR

Query:  NEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVV
        N  S    L  ATIT+I   G GDRVCVD CS+M  GEG+LVGSY++GLFLVHSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL  G EVI+V
Subjt:  NEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVV

Query:  DQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        D++GR R+AIVGRVKIE R L+LV+A+    E      LLQNAET+ LV     ++ + + V+ L  GD V +     ARH G+ I+E I+EK
Subjt:  DQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q2NI00 3-dehydroquinate synthase (e value: 1.5e-63; %identity: 45.21)
Query:  GRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRN
        G+ +A  VE++N+     +      AD V++  K+W++IP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D + + +L    ++ +
Subjt:  GRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRN

Query:  EASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVD
        + S    L  AT+T++   G+GDRVCVD CS+M  G+G+LVGS+A GLFLVHSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL AG EV+ ++
Subjt:  EASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVD

Query:  QEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
         +G   T IVGRVKIE R L+L++AK    + +    L+QNAET+ LV     ++ + I V+ LKVGD+V       ARH G+ I+E I+EK
Subjt:  QEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q58646 3-dehydroquinate synthase (e value: 1.1e-61; %identity: 47.76)
Query:  DNVVVDLKDWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRV
        DN++++ +DW IIP EN++A  F    K V +V+    EA++  E LE G  GV+L   + + + +L    +  N+    ++L  AT+T++   G GDRV
Subjt:  DNVVVDLKDWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRV

Query:  CVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQA
        C+D CSLM+ GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG +V++VD++G  R AIVGRVKIE R L+L++A
Subjt:  CVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQA

Query:  KRDSDEQTPYSILLQNAETVALVCPGRGNEK-KAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +   D       +LQNAET+ LV     NEK + I V  LK GD+V ++ +  ARH G+ I+E I+EK
Subjt:  KRDSDEQTPYSILLQNAETVALVCPGRGNEK-KAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Arabidopsis top hits (e value, %identity, alignment)
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink). (e value: 1.4e-125; %identity: 53.56)
Query:  SSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSL
        S+S+    +  +K+VWIW+  ++VMT AVERGW+TFIFS  NR+L++EWSS                                                 
Subjt:  SSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSL

Query:  RSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV
                                                           IAL+  LF++E  V DG G ++A+V EVS  ++L  L  EN   +N+V+
Subjt:  RSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV

Query:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS
        D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK  D  AV  LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCS
Subjt:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS

Query:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D
        LMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +
Subjt:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D

Query:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        E+T YSI+LQNAETVALV P + N   + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). (e value: 1.4e-125; %identity: 53.56)
Query:  SSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSL
        S+S+    +  +K+VWIW+  ++VMT AVERGW+TFIFS  NR+L++EWSS                                                 
Subjt:  SSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSL

Query:  RSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV
                                                           IAL+  LF++E  V DG G ++A+V EVS  ++L  L  EN   +N+V+
Subjt:  RSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLWLAGVCAILWAIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV

Query:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS
        D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK  D  AV  LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCS
Subjt:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS

Query:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D
        LMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +
Subjt:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D

Query:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        E+T YSI+LQNAETVALV P + N   + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
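Each alignment block above has a query line, a match line, and a subject line. In BLAST-style output the match line conventionally repeats the residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identities from one match line, assuming that convention holds for this page. `identity_from_match_line` is a hypothetical helper, and a single block is not expected to reproduce the %identity value reported for a hit, which covers the whole alignment.

```python
def identity_from_match_line(match_line, aligned_length):
    """Return (identical, positives, percent_identity) for one block.

    match_line: the middle line of a BLAST-style alignment block.
    aligned_length: number of aligned columns in the block.
    """
    # Letters on the match line mark identical residues.
    identical = sum(1 for ch in match_line if ch.isalpha())
    # '+' marks a conservative (positive-scoring) substitution.
    conservative = match_line.count("+")
    percent = 100.0 * identical / aligned_length
    return identical, identical + conservative, percent

# Last block of the first GenBank hit: every column is identical.
query = "RLQGEARHTGIEIQEFIVEK"
match = "RLQGEARHTGIEIQEFIVEK"
print(identity_from_match_line(match, len(query)))
```

To score a whole hit, the per-block counts would be summed before dividing by the total aligned length.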


Sequences
CDS sequence
ATGCTTTTTATGGTCATTTTTAAAAATTCCCGTACGATGGCCATGGCCTTGGTGTGTTCGTCACCTGCTTCTCCACTTCTTTCCAAACAGCGAATCAACGCCCACAAAAG
ACCAGATAATTTGGCCCTAATTTCAAGGAGTTTTGGCGAAGCCAATGCTGGTGAATGTAAATCTTTGGAGGCAAATCGTTTAGAGTGTTCTTACGCTTCCTCGTCTTCTT
CAATGTCTCCGATTGAGGCGTCGAAGGAGGTGTGGATTTGGAGTGAGCATCGGCAGGTTATGACGGCCGCGGTTGAGAGGGGCTGGAGCACCTTCATCTTCTCGCCTCAC
AATCGGGAGCTTGCTGATGAATGGTCCTCAGGATGGTCTATTGAAGCTTGGAAGGTAGAAACATTCAGGGATTTGAGGAGTTGCCCTTTGTTCTCCTTTGGCTTTCTTCA
TTCGTTATCTGATAGAGAAACCACATACGTTCTAGCCCTCTTGTCGTTGCTAGAGGGGGTTAGTCTGAGATCAGGTAGGAGGGATGCCTGCTTATGGAGTCCAATCCCTC
TGGGGGCTTTTCTTGCGGTGCAGGGAGATGATCGAGGAGTTCCTCCTCTATTTGCCGTTTTGGGAGAAAGATTTTTGTGGCTTGCTGGTGTTTGTGCTATTCTTTGGGCA
ATTGCACTAATACGTCCGCTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGTAGACTAATGGCCACAGTTGTTGAGGTTTCGAACCGCCAGCAATTGGAGCAGCT
TCAACCAGAAAATGCATCCGCAGACAATGTTGTTGTGGATCTAAAAGATTGGCAGATAATACCTGCAGAGAATATTGTTGCAGCATTTCAGGGGAGTCAGAAAACAGTGT
TTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCACTCGAACACGGTCTAGGTGGAGTTATTTTGAAAGTTGGAGATCCTGATGCTGTTTTCCAGCTA
AAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTTTGAGCTTGACTAAGGCTACTATAACTCAAATTCATGTTGCTGGAATGGGAGATCGAGTTTGTGTTGATCT
CTGTAGTCTCATGAGACCTGGCGAAGGACTTCTTGTCGGTTCTTATGCCAGAGGACTATTCCTTGTTCACTCAGAATGCTTAGAGTCAAATTACATTGCTAGCCGGCCTT
TTCGTGTCAATGCTGGACCTGTCCATGCCTACGTAGCTGTCCCGGGAGGGAAAACTAGCTACCTTTCCGAGTTACGAGCAGGCAAAGAGGTAATTGTAGTTGATCAAGAA
GGCAGACAACGAACCGCTATTGTTGGACGTGTAAAGATCGAAACTAGGCAGCTGATCCTCGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCATCCTCCT
GCAGAACGCGGAAACGGTTGCCTTAGTCTGCCCTGGTCGAGGCAATGAGAAGAAAGCCATCCCTGTTACTTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAG
GAGAAGCAAGGCATACAGGTATTGAAATCCAAGAGTTTATTGTGGAGAAATGA
mRNA sequence
ATGCTTTTTATGGTCATTTTTAAAAATTCCCGTACGATGGCCATGGCCTTGGTGTGTTCGTCACCTGCTTCTCCACTTCTTTCCAAACAGCGAATCAACGCCCACAAAAG
ACCAGATAATTTGGCCCTAATTTCAAGGAGTTTTGGCGAAGCCAATGCTGGTGAATGTAAATCTTTGGAGGCAAATCGTTTAGAGTGTTCTTACGCTTCCTCGTCTTCTT
CAATGTCTCCGATTGAGGCGTCGAAGGAGGTGTGGATTTGGAGTGAGCATCGGCAGGTTATGACGGCCGCGGTTGAGAGGGGCTGGAGCACCTTCATCTTCTCGCCTCAC
AATCGGGAGCTTGCTGATGAATGGTCCTCAGGATGGTCTATTGAAGCTTGGAAGGTAGAAACATTCAGGGATTTGAGGAGTTGCCCTTTGTTCTCCTTTGGCTTTCTTCA
TTCGTTATCTGATAGAGAAACCACATACGTTCTAGCCCTCTTGTCGTTGCTAGAGGGGGTTAGTCTGAGATCAGGTAGGAGGGATGCCTGCTTATGGAGTCCAATCCCTC
TGGGGGCTTTTCTTGCGGTGCAGGGAGATGATCGAGGAGTTCCTCCTCTATTTGCCGTTTTGGGAGAAAGATTTTTGTGGCTTGCTGGTGTTTGTGCTATTCTTTGGGCA
ATTGCACTAATACGTCCGCTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGTAGACTAATGGCCACAGTTGTTGAGGTTTCGAACCGCCAGCAATTGGAGCAGCT
TCAACCAGAAAATGCATCCGCAGACAATGTTGTTGTGGATCTAAAAGATTGGCAGATAATACCTGCAGAGAATATTGTTGCAGCATTTCAGGGGAGTCAGAAAACAGTGT
TTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCACTCGAACACGGTCTAGGTGGAGTTATTTTGAAAGTTGGAGATCCTGATGCTGTTTTCCAGCTA
AAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTTTGAGCTTGACTAAGGCTACTATAACTCAAATTCATGTTGCTGGAATGGGAGATCGAGTTTGTGTTGATCT
CTGTAGTCTCATGAGACCTGGCGAAGGACTTCTTGTCGGTTCTTATGCCAGAGGACTATTCCTTGTTCACTCAGAATGCTTAGAGTCAAATTACATTGCTAGCCGGCCTT
TTCGTGTCAATGCTGGACCTGTCCATGCCTACGTAGCTGTCCCGGGAGGGAAAACTAGCTACCTTTCCGAGTTACGAGCAGGCAAAGAGGTAATTGTAGTTGATCAAGAA
GGCAGACAACGAACCGCTATTGTTGGACGTGTAAAGATCGAAACTAGGCAGCTGATCCTCGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCATCCTCCT
GCAGAACGCGGAAACGGTTGCCTTAGTCTGCCCTGGTCGAGGCAATGAGAAGAAAGCCATCCCTGTTACTTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAG
GAGAAGCAAGGCATACAGGTATTGAAATCCAAGAGTTTATTGTGGAGAAATGA
Protein sequence
MLFMVIFKNSRTMAMALVCSSPASPLLSKQRINAHKRPDNLALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
NRELADEWSSGWSIEAWKVETFRDLRSCPLFSFGFLHSLSDRETTYVLALLSLLEGVSLRSGRRDACLWSPIPLGAFLAVQGDDRGVPPLFAVLGERFLWLAGVCAILWA
IALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQL
KDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQE
GRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
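The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. `translate` is a hypothetical helper written for illustration, not the method the database uses.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases (e.g. N) raise KeyError here.
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 bases and last 9 bases of the CDS above.
print(translate("ATGCTTTTTATGGTCATTTTTAAAAATTCC"))
print(translate("GAGAAATGA"))
```

The two fragments translate to the start (`MLFMVIFKNS`) and end (`EK`) of the protein sequence listed above. The final `TGA` is the stop codon and is not emitted.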