CuGenDBv2

Lcy09g018030 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy09g018030
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: 3-dehydroquinate synthase homolog
Genome location: Chr09:38380967..38389249
RNA-Seq Expression: Lcy09g018030
Synteny: Lcy09g018030
Gene Ontology terms:
GO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR002812 - 3-dehydroquinate synthase
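
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the string could be parsed as shown below to obtain the span of the gene. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr09:38380967..38389249'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("Chr09:38380967..38389249")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 8283 bp, which is longer than the mRNA sequence listed further down this page.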


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588380.1 hypothetical protein SDJN03_16945, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.2e-203; % identity: 88.6)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        A+A +  SP SPL  KQRI  HK PDNL   ALISR FG A  GECKSL+ NRL CS  SSSSSMSPIEASK VWIWSE +QVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATI  IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF
        VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALV PGRG NEKKAIPVTSLKVGDEVF
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF

Query:  LRLQGEARHTGIEIQEFIVEK
        LRLQGEARHTGIEIQEFIVEK
Subjt:  LRLQGEARHTGIEIQEFIVEK

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata] (e value: 2.5e-204; % identity: 88.6)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        A+A +  SP SPL  KQRI  HK PD+L   ALISR FG A  GECKSLE NRL CS  SSSSSMSPIEASK VWIWSE +QVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF
        VHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRG NEKKAIPVTSLKVGDEVF
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF

Query:  LRLQGEARHTGIEIQEFIVEK
        LRLQGEARHTGIEIQEFIVEK
Subjt:  LRLQGEARHTGIEIQEFIVEK

XP_022970870.1 uncharacterized protein LOC111469713 [Cucurbita maxima] (e value: 3.9e-205; % identity: 89.55)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        AMAL+  S  SP   KQRI AH+ PDNL   ALISR FG A  GECKSLE NRL CS ASSSSSMSPIEASK VWIWS  RQVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALI PLF+ EDGVFDGEGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF
        VHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRG NEKKAIPVTSLKVGDEVF
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF

Query:  LRLQGEARHTGIEIQEFIVEK
        LRLQGEARHTGIEIQEFIVEK
Subjt:  LRLQGEARHTGIEIQEFIVEK

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo] (e value: 4.6e-206; % identity: 89.76)
Query:  MALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHN
        MAL+  SP S L  KQRI  HK PDNL   ALISR FG A  GECKSLE NRL CS  SSSSSMSPIEASK VWIWSE+RQVMTAAVERGWSTFIFSPHN
Subjt:  MALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHN

Query:  RELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE
        +ELADEWSSIALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEALE
Subjt:  RELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE

Query:  HGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
        HGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
Subjt:  HGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV

Query:  HAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFL
        HAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRG NEKKAIPVTSLKVGDEVFL
Subjt:  HAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida] (e value: 4.6e-206; % identity: 89.62)
Query:  MAMALVC-SSPASPLLSKQRIN-AHKRPDNLA---LISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIF
        M MAL+C SSP SP LSKQRI+  HK P+NL    LISR FGEA AGECKS   +RL+CSYAS  S+MSP EASK VWIWSE +QVMTAAVERGWSTFIF
Subjt:  MAMALVC-SSPASPLLSKQRIN-AHKRPDNLA---LISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIF

Query:  SPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL
        SPHN ELADEWSSIALI PLF+KE+GVFDGEGRL+A+VVEVSN QQLEQLQP NASAD VVVDL+DWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIFL
Subjt:  SPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL

Query:  EALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
        EALEHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
Subjt:  EALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN

Query:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGD
        AGPVHAYVAVPGGKTSYLSEL AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG NEKK+IPVTSLKVGD
Subjt:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGD

Query:  EVFLRLQGEARHTGIEIQEFIVEK
        EVFLRLQGEARHTGIEIQEFIVEK
Subjt:  EVFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B8Q7 3-dehydroquinate synthase homolog (e value: 8.1e-201; % identity: 86.76)
Query:  MAMA-LVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFS
        M MA L  SSP SPLLSKQRI   K P+NL    LISR+FG+A AGECKS + +RL+CSY SSSS MSPIE SK VWIWSE ++VMTAAVERGWSTFIFS
Subjt:  MAMA-LVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFS

Query:  PHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHN ELA EW+SIA+I PLF+KEDGV DGE RL+A+VVE+SN QQLEQLQP  ASAD VVVDL+DWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  PHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein (e value: 8.1e-201; % identity: 86.76)
Query:  MAMA-LVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFS
        M MA L  SSP SPLLSKQRI   K P+NL    LISR+FG+A AGECKS + +RL+CSY SSSS MSPIE SK VWIWSE ++VMTAAVERGWSTFIFS
Subjt:  MAMA-LVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFS

Query:  PHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHN ELA EW+SIA+I PLF+KEDGV DGE RL+A+VVE+SN QQLEQLQP  ASAD VVVDL+DWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  PHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A6J1BVA9 uncharacterized protein LOC111005050 isoform X2 (e value: 2.1e-193; % identity: 84.93)
Query:  MALVCSSPASP-LLSKQRINAHKRPDNLALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRE
        M  +C+SPASP LLSK RI         ALIS  FG+ +AG+CKS+ A  ++CS AS S   +P EASK VW+WSE+RQV+TAAVERGW+TF+FSPHNRE
Subjt:  MALVCSSPASP-LLSKQRINAHKRPDNLALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRE

Query:  LADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHG
        LA +WSSIA I  LF+KEDG+FD EG L+ATV EVSN QQLEQLQPENAS DNVVVDL+DWQIIPAENIVAAFQGS+K VFAVSKTPIEAQIFLEALEHG
Subjt:  LADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHG

Query:  LGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA
        LGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKAT+TQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA
Subjt:  LGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA

Query:  YVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRL
        YVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSD+QTPY ILLQNAETVALVCPGRG NEKKAIPVTSLKVGD+VFLRL
Subjt:  YVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRL

Query:  QGEARHTGIEIQEFIVEK
        QGEARHTGIEIQEFIVEK
Subjt:  QGEARHTGIEIQEFIVEK

A0A6J1EKW1 uncharacterized protein LOC111435491 (e value: 1.2e-204; % identity: 88.6)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        A+A +  SP SPL  KQRI  HK PD+L   ALISR FG A  GECKSLE NRL CS  SSSSSMSPIEASK VWIWSE +QVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF
        VHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRG NEKKAIPVTSLKVGDEVF
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF

Query:  LRLQGEARHTGIEIQEFIVEK
        LRLQGEARHTGIEIQEFIVEK
Subjt:  LRLQGEARHTGIEIQEFIVEK

A0A6J1I437 uncharacterized protein LOC111469713 (e value: 1.9e-205; % identity: 89.55)
Query:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH
        AMAL+  S  SP   KQRI AH+ PDNL   ALISR FG A  GECKSLE NRL CS ASSSSSMSPIEASK VWIWS  RQVMTAAVERGWSTFIFSPH
Subjt:  AMALVCSSPASPLLSKQRINAHKRPDNL---ALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALI PLF+ EDGVFDGEGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF
        VHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRG NEKKAIPVTSLKVGDEVF
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVF

Query:  LRLQGEARHTGIEIQEFIVEK
        LRLQGEARHTGIEIQEFIVEK
Subjt:  LRLQGEARHTGIEIQEFIVEK

SwissProt top hits (e value, % identity, alignment)
A0B6K6 3-dehydroquinate synthase (e value: 1.5e-63; % identity: 39.57)
Query:  KEVWI------WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFD------------------GEGRLMATVVEVSNRQQLEQL
        KE W+      W + + ++T A+E G+   + S  + EL  E  SI +    F +E G  D                    GR +   VE+ +++     
Subjt:  KEVWI------WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFD------------------GEGRLMATVVEVSNRQQLEQL

Query:  QPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVA
               D ++V   DW++IP EN++AA QG    + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + + L  AT+  +   
Subjt:  QPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVA

Query:  GMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQ
        GMGDRVCVD CSLMR GEG+LVGS +R  FLV SE  ES Y+A+RPFRVNAG VHAY+ V G KT YLSEL++G EV +VD++G  R+A+VGRVKIE R 
Subjt:  GMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQ

Query:  LILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +ILV+A+ D +     S LLQNAET+ LV      ++   I V  LK GD+V + ++  ARH G+ I+E I+E+
Subjt:  LILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

O26680 3-dehydroquinate synthase (e value: 3.7e-65; % identity: 48.64)
Query:  GRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFD-RR
        GR +A  VE+ ++   E  +      D +++  +DW+IIP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD      
Subjt:  GRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFD-RR

Query:  NEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVV
        N  S    L  ATIT+I   G GDRVCVD CS+M  GEG+LVGSY++GLFLVHSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL  G EVI+V
Subjt:  NEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVV

Query:  DQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        D++GR R+AIVGRVKIE R L+LV+A+    E      LLQNAET+ LV      ++ + + V+ L  GD V +     ARH G+ I+E I+EK
Subjt:  DQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q2NI00 3-dehydroquinate synthase (e value: 1.4e-64; % identity: 39.02)
Query:  KEVWI-----WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALI--------------RPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENA
        K  WI     W++ ++ +  ++E G+   I    N E   +  S+ +I                + M +       G+ +A  VE++N+     +     
Subjt:  KEVWI-----WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALI--------------RPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENA

Query:  SADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDR
         AD V++  K+W++IP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D + + +L    ++ ++ S    L  AT+T++   G+GDR
Subjt:  SADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDR

Query:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQ
        VCVD CS+M  G+G+LVGS+A GLFLVHSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL AG EV+ ++ +G   T IVGRVKIE R L+L++
Subjt:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQ

Query:  AKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        AK    + +    L+QNAET+ LV      ++ + I V+ LKVGD+V       ARH G+ I+E I+EK
Subjt:  AKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q58646 3-dehydroquinate synthase (e value: 5.3e-64; % identity: 40.06)
Query:  WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIR-------PLFMKEDGV-----FDGEGRLMATVVEVSNRQQLEQLQPENAS---ADNVVVDL
        W E ++++T A+E      +  P + E   E  +I +          L  K D +         G+  A  + + +++  E+   E A     DN++++ 
Subjt:  WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIR-------PLFMKEDGV-----FDGEGRLMATVVEVSNRQQLEQLQPENAS---ADNVVVDL

Query:  KDWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSL
        +DW IIP EN++A  F    K V +V+    EA++  E LE G  GV+L   + + + +L    +  N+    ++L  AT+T++   G GDRVC+D CSL
Subjt:  KDWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSL

Query:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ
        M+ GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG +V++VD++G  R AIVGRVKIE R L+L++A+   D  
Subjt:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ

Query:  TPYSILLQNAETVALVCPGRGRNEK-KAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
             +LQNAET+ LV      NEK + I V  LK GD+V ++ +  ARH G+ I+E I+EK
Subjt:  TPYSILLQNAETVALVCPGRGRNEK-KAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q8TVI1 3-dehydroquinate synthase (e value: 4.2e-61; % identity: 45.85)
Query:  ENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVG--DPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVA
        +N   D V+   +DW+IIP EN++A  QG +  + A ++   EA+I  E LE G  GV+L     DP  + +  +  +R   A+    L    + ++   
Subjt:  ENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVG--DPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVA

Query:  GMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQ
        G GDRVCVD CSLM  GEG+LVGS +RG+FL+HSE LE+ Y+  RPFRVNAGPVHAY+ VPGGKT YL+ELR G EV++VD EGR R A+VGR+KIE R 
Subjt:  GMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQ

Query:  LILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRL---QGEARHTGIEIQEFIVEK
        L+L++A+ +  E      ++QNAET+ LV     R + + + V  LK GD+V   +   +G+ RH G+E++E IVEK
Subjt:  LILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRL---QGEARHTGIEIQEFIVEK

Arabidopsis top hits (e value, % identity, alignment)
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink). (e value: 2.0e-138; % identity: 68.04)
Query:  SSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV
        S+S+    +  +K+VWIW+  ++VMT AVERGW+TFIFS  NR+L++EWSSIAL+  LF++E  V DG G ++A+V EVS  ++L  L  EN   +N+V+
Subjt:  SSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV

Query:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS
        D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK  D  AV  LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCS
Subjt:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS

Query:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D
        LMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +
Subjt:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D

Query:  EQTPYSILLQNAETVALVCPGR-GRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        E+T YSI+LQNAETVALV P +   + + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  EQTPYSILLQNAETVALVCPGR-GRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). (e value: 2.0e-138; % identity: 68.04)
Query:  SSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV
        S+S+    +  +K+VWIW+  ++VMT AVERGW+TFIFS  NR+L++EWSSIAL+  LF++E  V DG G ++A+V EVS  ++L  L  EN   +N+V+
Subjt:  SSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV

Query:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS
        D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK  D  AV  LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCS
Subjt:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS

Query:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D
        LMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +
Subjt:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D

Query:  EQTPYSILLQNAETVALVCPGR-GRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        E+T YSI+LQNAETVALV P +   + + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  EQTPYSILLQNAETVALVCPGR-GRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
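
The alignment blocks above follow the usual BLAST pairwise layout, where the unlabeled middle line repeats a residue at identical positions, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch, the percent identity of a single block could be recomputed from its Query line and midline as below. The function name `percent_identity` is hypothetical, and the per-hit values on this page are computed over the whole alignment, so a single block will generally not reproduce them.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block.

    `query` is the aligned query segment (gaps included) and `midline`
    is the unlabeled line printed beneath it. Identical positions are
    the ones where the midline carries a letter.
    """
    if not query:
        raise ValueError("empty alignment block")
    # Trailing blanks are often stripped from the midline, so pad it.
    padded = midline.ljust(len(query))[: len(query)]
    identities = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identities / len(query)


# Final block of the hits above, where every residue matches.
print(percent_identity("LRLQGEARHTGIEIQEFIVEK", "LRLQGEARHTGIEIQEFIVEK"))
# A short fragment with one mismatch and one conservative substitution.
print(percent_identity("KEVWI", "KE W+"))
```

To approximate the value reported for a whole hit, the identity counts and block lengths would need to be summed over all blocks before dividing.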


Sequences
CDS sequence
ATGGCCATGGCCTTGGTGTGTTCGTCACCTGCTTCTCCACTTCTTTCCAAACAGCGAATCAACGCCCACAAAAGACCAGATAATTTGGCCCTAATTTCAAGGAGT
TTTGGCGAAGCCAATGCTGGTGAATGTAAATCTTTGGAGGCAAATCGTTTAGAGTGTTCTTACGCTTCCTCGTCTTCTTCAATGTCTCCGATTGAGGCGTCGAAG
GAGGTGTGGATTTGGAGTGAGCATCGGCAGGTTATGACGGCCGCGGTTGAGAGGGGCTGGAGCACCTTCATCTTCTCGCCTCACAATCGGGAGCTTGCTGATGAA
TGGTCCTCAATTGCACTAATACGTCCGCTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGTAGACTAATGGCCACAGTTGTTGAGGTTTCGAACCGCCAG
CAATTGGAGCAGCTTCAACCAGAAAATGCATCCGCAGACAATGTTGTTGTGGATCTAAAAGATTGGCAGATAATACCTGCAGAGAATATTGTTGCAGCATTTCAG
GGGAGTCAGAAAACAGTGTTTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCACTCGAACACGGTCTAGGTGGAGTTATTTTGAAAGTTGGA
GATCCTGATGCTGTTTTCCAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTTTGAGCTTGACTAAGGCTACTATAACTCAAATTCATGTTGCT
GGAATGGGAGATCGAGTTTGTGTTGATCTCTGTAGTCTCATGAGACCTGGCGAAGGACTTCTTGTCGGTTCTTATGCCAGAGGACTATTCCTTGTTCACTCAGAA
TGCTTAGAGTCAAATTACATTGCTAGCCGGCCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTACGTAGCTGTCCCGGGAGGGAAAACTAGCTACCTTTCCGAG
TTACGAGCAGGCAAAGAGGTAATTGTAGTTGATCAAGAAGGCAGACAACGAACCGCTATTGTTGGACGTGTAAAGATCGAAACTAGGCAGCTGATCCTCGTCCAG
GCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCATCCTCCTGCAGAACGCGGAAACGGTTGCCTTAGTCTGCCCTGGTCGAGGTCGCAATGAGAAGAAAGCC
ATCCCTGTTACTTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGGCATACAGGTATTGAAATCCAAGAGTTTATTGTGGAGAAATGA
mRNA sequence
AAAAATTCCCGTACGATGGCCATGGCCTTGGTGTGTTCGTCACCTGCTTCTCCACTTCTTTCCAAACAGCGAATCAACGCCCACAAAAGACCAGATAATTTGGCC
CTAATTTCAAGGAGTTTTGGCGAAGCCAATGCTGGTGAATGTAAATCTTTGGAGGCAAATCGTTTAGAGTGTTCTTACGCTTCCTCGTCTTCTTCAATGTCTCCG
ATTGAGGCGTCGAAGGAGGTGTGGATTTGGAGTGAGCATCGGCAGGTTATGACGGCCGCGGTTGAGAGGGGCTGGAGCACCTTCATCTTCTCGCCTCACAATCGG
GAGCTTGCTGATGAATGGTCCTCAATTGCACTAATACGTCCGCTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGTAGACTAATGGCCACAGTTGTTGAG
GTTTCGAACCGCCAGCAATTGGAGCAGCTTCAACCAGAAAATGCATCCGCAGACAATGTTGTTGTGGATCTAAAAGATTGGCAGATAATACCTGCAGAGAATATT
GTTGCAGCATTTCAGGGGAGTCAGAAAACAGTGTTTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCACTCGAACACGGTCTAGGTGGAGTT
ATTTTGAAAGTTGGAGATCCTGATGCTGTTTTCCAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTTTGAGCTTGACTAAGGCTACTATAACT
CAAATTCATGTTGCTGGAATGGGAGATCGAGTTTGTGTTGATCTCTGTAGTCTCATGAGACCTGGCGAAGGACTTCTTGTCGGTTCTTATGCCAGAGGACTATTC
CTTGTTCACTCAGAATGCTTAGAGTCAAATTACATTGCTAGCCGGCCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTACGTAGCTGTCCCGGGAGGGAAAACT
AGCTACCTTTCCGAGTTACGAGCAGGCAAAGAGGTAATTGTAGTTGATCAAGAAGGCAGACAACGAACCGCTATTGTTGGACGTGTAAAGATCGAAACTAGGCAG
CTGATCCTCGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCATCCTCCTGCAGAACGCGGAAACGGTTGCCTTAGTCTGCCCTGGTCGAGGTCGC
AATGAGAAGAAAGCCATCCCTGTTACTTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGGCATACAGGTATTGAAATCCAAGAGTTT
ATTGTGGAGAAATGAATTGATGGTTGATCAACTTGTCTTATTTGAATATATTGTAAATTTTATCTTTTGAGAAAACTGACTTTAATTTAAGGTTTTGAAGTTGGG
CGTATGTTTAATATGTATGAATTTGTCTGATTTAATTTTTAATAATAAGGTTCGAATTGTTCTAA
Protein sequence
MAMALVCSSPASPLLSKQRINAHKRPDNLALISRSFGEANAGECKSLEANRLECSYASSSSSMSPIEASKEVWIWSEHRQVMTAAVERGWSTFIFSPHNRELADE
WSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVG
DPDAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSE
LRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGRNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK