CuGenDBv2

Lag0027945 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0027945
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: 3-dehydroquinate synthase homolog
Genome location: chr8:8434916..8443911
RNA-Seq Expression: Lag0027945
Synteny: Lag0027945
Gene Ontology terms:
GO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR002812 - 3-dehydroquinate synthase
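
The genome location in this record follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split into its parts and the span of the locus computed. `parse_location` and `span_length` are hypothetical helper names used for illustration, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Number of bases covered by the interval.

    Assumption: coordinates are 1-based and inclusive.
    """
    return abs(end - start) + 1


chrom, start, end = parse_location("chr8:8434916..8443911")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus covers 8,996 bases on chr8.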


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588380.1 hypothetical protein SDJN03_16945, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.5e-194; % identity: 84.35)
Query:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT
        A+A +  SP SPL  KQRI  HK P               DNL   ALISR F     GECKSL+ NRL CS  SSSSSMSPIEASK VW+WSE +QVMT
Subjt:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT

Query:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA
        AAVERGWSTFIFSPHN+ELADEWSSIALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFA
Subjt:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA

Query:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE
        VSKTPIEAQIFLEALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKATI  IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE
Subjt:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE

Query:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK
        SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALV PGRGNEKK
Subjt:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK

Query:  AIPVTSLKVGDEVFLRLQGEARHTATAI
        AIPVTSLKVGDEVFLRLQGEARHT   I
Subjt:  AIPVTSLKVGDEVFLRLQGEARHTATAI

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata] (e value: 4.9e-195; % identity: 84.35)
Query:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT
        A+A +  SP SPL  KQRI  HK P               D+L   ALISR F     GECKSLE NRL CS  SSSSSMSPIEASK VW+WSE +QVMT
Subjt:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT

Query:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA
        AAVERGWSTFIFSPHN+ELADEWSSIALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFA
Subjt:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA

Query:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE
        VSKTPIEAQIFLEALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLE
Subjt:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE

Query:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK
        SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKK
Subjt:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK

Query:  AIPVTSLKVGDEVFLRLQGEARHTATAI
        AIPVTSLKVGDEVFLRLQGEARHT   I
Subjt:  AIPVTSLKVGDEVFLRLQGEARHTATAI

XP_022970870.1 uncharacterized protein LOC111469713 [Cucurbita maxima] (e value: 7.6e-196; % identity: 85.28)
Query:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT
        AMAL+  S  SP   KQRI AH+ P               DNL   ALISR F     GECKSLE NRL CS ASSSSSMSPIEASK VW+WS  RQVMT
Subjt:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT

Query:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA
        AAVERGWSTFIFSPHN+ELADEWSSIALI PLF+ EDGVFDGEGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFA
Subjt:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA

Query:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE
        VSKTPIEAQIFLEALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE
Subjt:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE

Query:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK
        SNYIASRPFRVNAGPVHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKK
Subjt:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK

Query:  AIPVTSLKVGDEVFLRLQGEARHTATAI
        AIPVTSLKVGDEVFLRLQGEARHT   I
Subjt:  AIPVTSLKVGDEVFLRLQGEARHTATAI

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo] (e value: 9.0e-197; % identity: 85.48)
Query:  MALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMTA
        MAL+  SP S L  KQRI  HK P               DNL   ALISR F     GECKSLE NRL CS  SSSSSMSPIEASK VW+WSE+RQVMTA
Subjt:  MALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMTA

Query:  AVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAV
        AVERGWSTFIFSPHN+ELADEWSSIALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFAV
Subjt:  AVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAV

Query:  SKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLES
        SKTPIEAQIFLEALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLES
Subjt:  SKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLES

Query:  NYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKA
        NYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKA
Subjt:  NYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKA

Query:  IPVTSLKVGDEVFLRLQGEARHTATAI
        IPVTSLKVGDEVFLRLQGEARHT   I
Subjt:  IPVTSLKVGDEVFLRLQGEARHTATAI

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida] (e value: 2.2e-195; % identity: 84.92)
Query:  MAMALVC-SSPASPLLSKQRIN-AHKRPVLICSFGTYFFVAFSDNLA---LISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQ
        M MAL+C SSP SP LSKQRI+  HK P               +NL    LISR F     GECKS   +RL+CSYAS  S+MSP EASK VW+WSE +Q
Subjt:  MAMALVC-SSPASPLLSKQRIN-AHKRPVLICSFGTYFFVAFSDNLA---LISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQ

Query:  VMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKT
        VMTAAVERGWSTFIFSPHN ELADEWSSIALI PLF+KE+GVFDGEGRL+A+VVEVSN QQLEQLQP NASAD VVVDL+DWQIIPAENIVAAFQGSQKT
Subjt:  VMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKT

Query:  VFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSE
        VFA+SKTPIEAQIFLEALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSE
Subjt:  VFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSE

Query:  CLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN
        CLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN
Subjt:  CLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGN

Query:  EKKAIPVTSLKVGDEVFLRLQGEARHTATAI
        EKK+IPVTSLKVGDEVFLRLQGEARHT   I
Subjt:  EKKAIPVTSLKVGDEVFLRLQGEARHTATAI

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B8Q7 3-dehydroquinate synthase homolog (e value: 8.2e-188; % identity: 82.13)
Query:  MAMA-LVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQV
        M MA L  SSP SPLLSKQRI   K P               +NL    LISR+F     GECKS + +RL+CSY SSSS MSPIE SK VW+WSE ++V
Subjt:  MAMA-LVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQV

Query:  MTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTV
        MTAAVERGWSTFIFSPHN ELA EW+SIA+I PLF+KEDGV DGE RL+A+VVE+SN QQLEQLQP  ASAD VVVDL+DWQIIPAENIVAAFQGSQKTV
Subjt:  MTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTV

Query:  FAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSEC
        FA+SKTPIEAQIF EALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSEC
Subjt:  FAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSEC

Query:  LESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-N
        LESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G N
Subjt:  LESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-N

Query:  EKKAIPVTSLKVGDEVFLRLQGEARHTATAI
        EKKAI VTSLKVGDEVFLRLQGEARHT   I
Subjt:  EKKAIPVTSLKVGDEVFLRLQGEARHTATAI

A0A5A7UEW0 3-dehydroquinate synthase-like protein (e value: 8.2e-188; % identity: 82.13)
Query:  MAMA-LVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQV
        M MA L  SSP SPLLSKQRI   K P               +NL    LISR+F     GECKS + +RL+CSY SSSS MSPIE SK VW+WSE ++V
Subjt:  MAMA-LVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQV

Query:  MTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTV
        MTAAVERGWSTFIFSPHN ELA EW+SIA+I PLF+KEDGV DGE RL+A+VVE+SN QQLEQLQP  ASAD VVVDL+DWQIIPAENIVAAFQGSQKTV
Subjt:  MTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTV

Query:  FAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSEC
        FA+SKTPIEAQIF EALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSEC
Subjt:  FAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSEC

Query:  LESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-N
        LESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G N
Subjt:  LESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-N

Query:  EKKAIPVTSLKVGDEVFLRLQGEARHTATAI
        EKKAI VTSLKVGDEVFLRLQGEARHT   I
Subjt:  EKKAIPVTSLKVGDEVFLRLQGEARHTATAI

A0A6J1BVA9 uncharacterized protein LOC111005050 isoform X2 (e value: 1.1e-184; % identity: 81.88)
Query:  MALVCSSPASP-LLSKQRINAHKRPVLICSFGTYFFVAFSDNLALISRSFGE-----CKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMTAAV
        M  +C+SPASP LLSK RI   K P             +S   ALIS  FG+     CKS+ A  ++CS AS S   +P EASK VWVWSE+RQV+TAAV
Subjt:  MALVCSSPASP-LLSKQRINAHKRPVLICSFGTYFFVAFSDNLALISRSFGE-----CKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMTAAV

Query:  ERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSK
        ERGW+TF+FSPHNRELA +WSSIA I  LF+KEDG+FD EG L+ATV EVSN QQLEQLQPENAS DNVVVDL+DWQIIPAENIVAAFQGS+K VFAVSK
Subjt:  ERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSK

Query:  TPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNY
        TPIEAQIFLEALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKAT+TQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNY
Subjt:  TPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNY

Query:  IASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIP
        IASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSD+QTPY ILLQNAETVALVCPGRGNEKKAIP
Subjt:  IASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIP

Query:  VTSLKVGDEVFLRLQGEARHTATAI
        VTSLKVGD+VFLRLQGEARHT   I
Subjt:  VTSLKVGDEVFLRLQGEARHTATAI

A0A6J1EKW1 uncharacterized protein LOC111435491 (e value: 2.4e-195; % identity: 84.35)
Query:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT
        A+A +  SP SPL  KQRI  HK P               D+L   ALISR F     GECKSLE NRL CS  SSSSSMSPIEASK VW+WSE +QVMT
Subjt:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT

Query:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA
        AAVERGWSTFIFSPHN+ELADEWSSIALIRPLF+ EDGVFD EGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFA
Subjt:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA

Query:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE
        VSKTPIEAQIFLEALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLE
Subjt:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE

Query:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK
        SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKK
Subjt:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK

Query:  AIPVTSLKVGDEVFLRLQGEARHTATAI
        AIPVTSLKVGDEVFLRLQGEARHT   I
Subjt:  AIPVTSLKVGDEVFLRLQGEARHTATAI

A0A6J1I437 uncharacterized protein LOC111469713 (e value: 3.7e-196; % identity: 85.28)
Query:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT
        AMAL+  S  SP   KQRI AH+ P               DNL   ALISR F     GECKSLE NRL CS ASSSSSMSPIEASK VW+WS  RQVMT
Subjt:  AMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNL---ALISRSF-----GECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMT

Query:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA
        AAVERGWSTFIFSPHN+ELADEWSSIALI PLF+ EDGVFDGEGRL+ATV+EVSN QQLEQLQP NAS DNV+VDL+DWQIIPAENIVAAFQGS+KTVFA
Subjt:  AAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFA

Query:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE
        VSKTPIEAQIFLEALEHGLGGVILKV DPEAVFQLKDYFDRRNEASNLLSLTKATIT IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE
Subjt:  VSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLE

Query:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK
        SNYIASRPFRVNAGPVHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKK
Subjt:  SNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKK

Query:  AIPVTSLKVGDEVFLRLQGEARHTATAI
        AIPVTSLKVGDEVFLRLQGEARHT   I
Subjt:  AIPVTSLKVGDEVFLRLQGEARHTATAI

SwissProt top hits (e value, % identity, alignment)
A0B6K6 3-dehydroquinate synthase (e value: 1.7e-60; % identity: 39.34)
Query:  KEVWV------WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFD------------------GEGRLMATVVEVSNRQQLEQL
        KE W+      W + + ++T A+E G+   + S  + EL  E  SI +    F +E G  D                    GR +   VE+ +++     
Subjt:  KEVWV------WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFD------------------GEGRLMATVVEVSNRQQLEQL

Query:  QPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVA
               D ++V   DW++IP EN++AA QG    + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + + L  AT+  +   
Subjt:  QPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVA

Query:  GMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQ
        GMGDRVCVD CSLMR GEG+LVGS +R  FLV SE  ES Y+A+RPFRVNAG VHAY+ V G KT YLSEL++G EV +VD++G  R+A+VGRVKIE R 
Subjt:  GMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQ

Query:  LILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTATAI
        +ILV+A+ D +     S LLQNAET+ LV     ++   I V  LK GD+V + ++  ARH   +I
Subjt:  LILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTATAI

A5UJ82 3-dehydroquinate synthase (e value: 2.6e-58; % identity: 44.25)
Query:  EGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRR
        EG+ +   VE++++   +      + AD +++   DW +IP ENI+A  Q +   + A       A++ +E LEHG  GVI +  D     Q+K      
Subjt:  EGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRR

Query:  NEASNL-LSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIV
         +AS +   L  AT+T +   G GDRVCVD   +M+PGEG+L+GSY++ LFLVHSE LES Y+ASRPFRVNAGPV AYV VPG KT YLSEL AG EV++
Subjt:  NEASNL-LSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIV

Query:  VDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTATAI
        V+ EG  RTA VGR KIE R LIL++A+    E      LLQNAET+ +V      +   + V  +K+GD+V + ++  ARH   AI
Subjt:  VDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTATAI

O26680 3-dehydroquinate synthase (e value: 1.8e-62; % identity: 48.6)
Query:  GRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFD-RR
        GR +A  VE+ ++   E  +      D +++  +DW+IIP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD      
Subjt:  GRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFD-RR

Query:  NEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVV
        N  S    L  ATIT+I   G GDRVCVD CS+M  GEG+LVGSY++GLFLVHSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL  G EVI+V
Subjt:  NEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVV

Query:  DQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTATAI
        D++GR R+AIVGRVKIE R L+LV+A+    E      LLQNAET+ LV     ++ + + V+ L  GD V +     ARH   AI
Subjt:  DQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTATAI

Q2NI00 3-dehydroquinate synthase (e value: 1.5e-61; % identity: 39.03)
Query:  WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALI--------------RPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLK
        W++ ++ +  ++E G+   I    N E   +  S+ +I                + M +       G+ +A  VE++N+     +      AD V++  K
Subjt:  WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALI--------------RPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLK

Query:  DWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMR
        +W++IP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D   + +L    ++ ++ S    L  AT+T++   G+GDRVCVD CS+M 
Subjt:  DWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMR

Query:  PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTP
         G+G+LVGS+A GLFLVHSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL AG EV+ ++ +G   T IVGRVKIE R L+L++AK    + + 
Subjt:  PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTP

Query:  YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTATAI
           L+QNAET+ LV     ++ + I V+ LKVGD+V       ARH   AI
Subjt:  YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTATAI

Q58646 3-dehydroquinate synthase (e value: 2.0e-61; % identity: 40.11)
Query:  WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIR-------PLFMKEDGV-----FDGEGRLMATVVEVSNRQQLEQLQPENAS---ADNVVVDL
        W E ++++T A+E      +  P + E   E  +I +          L  K D +         G+  A  + + +++  E+   E A     DN++++ 
Subjt:  WSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIR-------PLFMKEDGV-----FDGEGRLMATVVEVSNRQQLEQLQPENAS---ADNVVVDL

Query:  KDWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSL
        +DW IIP EN++A  F    K V +V+    EA++  E LE G  GV+L   + E + +L    +  N+    ++L  AT+T++   G GDRVC+D CSL
Subjt:  KDWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSL

Query:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ
        M+ GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG +V++VD++G  R AIVGRVKIE R L+L++A+   D  
Subjt:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ

Query:  TPYSILLQNAETVALVCPGRGNEK-KAIPVTSLKVGDEVFLRLQGEARHTATAI
             +LQNAET+ LV     NEK + I V  LK GD+V ++ +  ARH   AI
Subjt:  TPYSILLQNAETVALVCPGRGNEK-KAIPVTSLKVGDEVFLRLQGEARHTATAI

Arabidopsis top hits (e value, % identity, alignment)
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink). (e value: 2.0e-133; % identity: 66.67)
Query:  SSSSSMSPIEASKEVWVWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV
        S+S+    +  +K+VW+W+  ++VMT AVERGW+TFIFS  NR+L++EWSSIAL+  LF++E  V DG G ++A+V EVS  ++L  L  EN   +N+V+
Subjt:  SSSSSMSPIEASKEVWVWSEHRQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVV

Query:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS
        D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK  D +AV  LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCS
Subjt:  DLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCS

Query:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D
        LMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +
Subjt:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-D

Query:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTATAI
        E+T YSI+LQNAETVALV P + N   + A+PVTSLK GD+V +RLQG ARHT   I
Subjt:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTATAI

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). (e value: 1.5e-133; % identity: 60.24)
Query:  LSKQRINAHKRPVLICSFGTYFFVAFS---DNLALIS--RSFGECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMTAAVERGWSTFIFSPHN
        L K++       +++CS  +Y   A     D L L S  +     K   + R+    ++S+  M+ +  +K+VW+W+  ++VMT AVERGW+TFIFS  N
Subjt:  LSKQRINAHKRPVLICSFGTYFFVAFS---DNLALIS--RSFGECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMTAAVERGWSTFIFSPHN

Query:  RELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE
        R+L++EWSSIAL+  LF++E  V DG G ++A+V EVS  ++L  L  EN   +N+V+D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALE
Subjt:  RELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE

Query:  HGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
        HGLGG+ILK  D +AV  LK+YFD+RNE S+ LSLT+ATIT++ + GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPV
Subjt:  HGLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV

Query:  HAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEV
        HAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YSI+LQNAETVALV P + N   + A+PVTSLK GD+V
Subjt:  HAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEV

Query:  FLRLQGEARHTATAI
         +RLQG ARHT   I
Subjt:  FLRLQGEARHTATAI


Sequences
CDS sequence
ATGGCCATGGCCTTGGTGTGTTCGTCACCTGCTTCTCCACTTCTTTCCAAACAGCGAATCAACGCCCACAAAAGACCAGTACTGATTTGCTCCTTTGGCACCTAT
TTCTTTGTGGCGTTCTCAGATAATTTGGCCCTAATTTCAAGGAGTTTTGGCGAATGTAAATCTTTGGAGGCAAATCGTTTAGAGTGTTCTTACGCTTCCTCGTCT
TCTTCAATGTCTCCGATTGAGGCGTCGAAGGAGGTGTGGGTTTGGAGTGAGCATCGGCAGGTTATGACGGCCGCGGTTGAGAGGGGCTGGAGCACCTTCATCTTC
TCGCCTCACAATCGGGAGCTTGCTGATGAATGGTCCTCAATTGCACTAATACGTCCGCTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGTAGACTAATG
GCCACAGTTGTTGAGGTTTCGAACCGCCAGCAACTGGAGCAGCTTCAACCAGAAAATGCATCCGCAGACAATGTTGTTGTGGATCTAAAAGATTGGCAGATAATA
CCTGCAGAGAATATTGTTGCAGCGTTTCAGGGGAGTCAGAAAACAGTGTTTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCACTCGAACAC
GGTCTAGGTGGAGTTATTTTGAAAGTTGGAGATCCTGAAGCTGTTTTCCAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTTTGAGCTTGACT
AAGGCTACTATAACTCAAATTCATGTTGCTGGAATGGGAGATCGAGTTTGTGTTGATCTCTGTAGTCTCATGAGACCTGGCGAAGGACTTCTTGTCGGTTCTTAT
GCCAGAGGACTATTCCTTGTTCACTCAGAATGCTTAGAGTCAAATTACATTGCTAGCCGGCCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTACGTAGCTGTC
CCGGGAGGGAAAACTAGCTACCTTTCCGAGTTACGAGCAGGCAAAGAGGTAATTGTAGTTGATCAAGAAGGCAGACAACGAACCGCTATTGTTGGACGTGTAAAG
ATCGAAACTAGGCAGCTGATCCTCGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCATCCTCCTGCAGAATGCGGAAACGGTTGCCTTAGTTTGC
CCTGGTCGAGGCAATGAGAAGAAAGCCATCCCTGTTACTTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGGCATACAGCAACCGCC
ATTTCGGTGATTACTATAGTGCTTAGGGGTGATCATCGGCATGTTGATATCGGTTTTGAGCAAAAATCGACGTCGACCATCGACATGTTGGTTCTGATCGGTTGG
TGGTCTCGATTTTGGGAGCTATTATGGAATCGACCGATCGACTATAAAAAAAAATTGGTCGATTTTGGGAGTTCGTATGGAGAAGAGAAAAAGGTAGTGTTGGTC
TATGGGAGCTCGGGTGAAGAAAGTTTTGATGAAAGAAATTTGGAGAGAGAGAAGACGAGGAGAAAAGAAGAAAGGAGGAGGAACAAAAATGGTCCATGGCGCGTA
GATGGTGTAACATCATCCGATAATAGCTTCCTTCGAGTCTTGGAATGGTTCTTATTGAAAGAAAAGAAAGAAATTAGGAGATGGAAAGAGAAGATTAAGGGAGAA
GAGGAGAGGGAAGGGGAGAGATCGGAAGAGTGA
mRNA sequence
ATGGCCATGGCCTTGGTGTGTTCGTCACCTGCTTCTCCACTTCTTTCCAAACAGCGAATCAACGCCCACAAAAGACCAGTACTGATTTGCTCCTTTGGCACCTAT
TTCTTTGTGGCGTTCTCAGATAATTTGGCCCTAATTTCAAGGAGTTTTGGCGAATGTAAATCTTTGGAGGCAAATCGTTTAGAGTGTTCTTACGCTTCCTCGTCT
TCTTCAATGTCTCCGATTGAGGCGTCGAAGGAGGTGTGGGTTTGGAGTGAGCATCGGCAGGTTATGACGGCCGCGGTTGAGAGGGGCTGGAGCACCTTCATCTTC
TCGCCTCACAATCGGGAGCTTGCTGATGAATGGTCCTCAATTGCACTAATACGTCCGCTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGTAGACTAATG
GCCACAGTTGTTGAGGTTTCGAACCGCCAGCAACTGGAGCAGCTTCAACCAGAAAATGCATCCGCAGACAATGTTGTTGTGGATCTAAAAGATTGGCAGATAATA
CCTGCAGAGAATATTGTTGCAGCGTTTCAGGGGAGTCAGAAAACAGTGTTTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCACTCGAACAC
GGTCTAGGTGGAGTTATTTTGAAAGTTGGAGATCCTGAAGCTGTTTTCCAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTTTGAGCTTGACT
AAGGCTACTATAACTCAAATTCATGTTGCTGGAATGGGAGATCGAGTTTGTGTTGATCTCTGTAGTCTCATGAGACCTGGCGAAGGACTTCTTGTCGGTTCTTAT
GCCAGAGGACTATTCCTTGTTCACTCAGAATGCTTAGAGTCAAATTACATTGCTAGCCGGCCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTACGTAGCTGTC
CCGGGAGGGAAAACTAGCTACCTTTCCGAGTTACGAGCAGGCAAAGAGGTAATTGTAGTTGATCAAGAAGGCAGACAACGAACCGCTATTGTTGGACGTGTAAAG
ATCGAAACTAGGCAGCTGATCCTCGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCATCCTCCTGCAGAATGCGGAAACGGTTGCCTTAGTTTGC
CCTGGTCGAGGCAATGAGAAGAAAGCCATCCCTGTTACTTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGGCATACAGCAACCGCC
ATTTCGGTGATTACTATAGTGCTTAGGGGTGATCATCGGCATGTTGATATCGGTTTTGAGCAAAAATCGACGTCGACCATCGACATGTTGGTTCTGATCGGTTGG
TGGTCTCGATTTTGGGAGCTATTATGGAATCGACCGATCGACTATAAAAAAAAATTGGTCGATTTTGGGAGTTCGTATGGAGAAGAGAAAAAGGTAGTGTTGGTC
TATGGGAGCTCGGGTGAAGAAAGTTTTGATGAAAGAAATTTGGAGAGAGAGAAGACGAGGAGAAAAGAAGAAAGGAGGAGGAACAAAAATGGTCCATGGCGCGTA
GATGGTGTAACATCATCCGATAATAGCTTCCTTCGAGTCTTGGAATGGTTCTTATTGAAAGAAAAGAAAGAAATTAGGAGATGGAAAGAGAAGATTAAGGGAGAA
GAGGAGAGGGAAGGGGAGAGATCGGAAGAGTGA
Protein sequence
MAMALVCSSPASPLLSKQRINAHKRPVLICSFGTYFFVAFSDNLALISRSFGECKSLEANRLECSYASSSSSMSPIEASKEVWVWSEHRQVMTAAVERGWSTFIF
SPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLMATVVEVSNRQQLEQLQPENASADNVVVDLKDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEH
GLGGVILKVGDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAV
PGGKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTATA
ISVITIVLRGDHRHVDIGFEQKSTSTIDMLVLIGWWSRFWELLWNRPIDYKKKLVDFGSSYGEEKKVVLVYGSSGEESFDERNLEREKTRRKEERRRNKNGPWRV
DGVTSSDNSFLRVLEWFLLKEKKEIRRWKEKIKGEEEREGERSEE
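
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies and that the CDS starts in frame at its first base; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool. Line breaks in the pasted sequence are removed before translation, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace (including wrapped lines) is ignored; codons containing
    characters outside A/C/G/T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases of the CDS above, split across two lines on purpose.
print(translate("ATGGCCATGGCC\nTTGGTGTGTTCGTCA"))
```

Applied to the first 27 bases of the CDS, the sketch yields `MAMALVCSS`, which matches the start of the protein sequence in this record.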