CuGenDBv2

Sed0009974 (gene) of Chayote v1 genome

Gene ID: Sed0009974
Organism: Sechium edule (Chayote v1)
Description: 3-dehydroquinate synthase homolog
Genome location: LG06:5471831..5477875
RNA-Seq Expression: Sed0009974
Synteny: Sed0009974
Gene Ontology terms:
  GO:0008652 - cellular amino acid biosynthetic process (biological process)
  GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
  GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
  GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR002812 - 3-dehydroquinate synthase
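
The genome location above follows a SEQID:START..END pattern. The following is a minimal Python sketch of splitting such a string into its parts and computing the span. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive at both ends, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG06:5471831..5477875")
print(seqid, start, end, span_length(start, end))
# prints: LG06 5471831 5477875 6045
```

Under that assumption the gene spans 6,045 bp on LG06. If the coordinates were 0-based or half-open, the length would differ by one.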


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6588380.1 hypothetical protein SDJN03_16945, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.4e-192 | % identity: 84.99
Query:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI
        MAA+  LS SPVSP+ PKQRI T K   PD+ KLRAL SRGF     GEC+SLK  RL C    S  SMSP+E +KGVWIWSED+QVM AAVERGWSTFI
Subjt:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI

Query:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF
        FSPHN+ELADEWSSIALI PLF+ EDGVFD EGRLIATV EVSNPQQLEQLQPSNAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTP+EAQIF
Subjt:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF

Query:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
        LEALE GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKATI  IH+AGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
Subjt:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV

Query:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD
        NAGPVHAYVA+PGGKTSYLSEL+AGKEVIVVDQEGRQRT IVGRVKIETRQLVL+QAKRDSDEQT YSILLQNAETVALV PGRGNEKKA PVTSL+VGD
Subjt:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD

Query:  EVFLRLQGEARHT
        EVFLRLQGEARHT
Subjt:  EVFLRLQGEARHT

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata] | e value: 1.0e-193 | % identity: 84.99
Query:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI
        MAA+  LS SPVSP+ PKQRI T K   PDH KLRAL SRGF     GEC+SL+  RL C    S  SMSP+E +KGVWIWSED+QVM AAVERGWSTFI
Subjt:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI

Query:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF
        FSPHN+ELADEWSSIALI PLF+ EDGVFD EGRLIATV EVSNPQQLEQLQPSNAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTP+EAQIF
Subjt:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF

Query:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
        LEALE GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKATIT IH+AGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
Subjt:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV

Query:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD
        NAGPVHAYVA+PGGKTSYLSEL+AGKEVIVVDQ+GRQRT IVGRVKIETRQLVL+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKA PVTSL+VGD
Subjt:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD

Query:  EVFLRLQGEARHT
        EVFLRLQGEARHT
Subjt:  EVFLRLQGEARHT

XP_022970870.1 uncharacterized protein LOC111469713 [Cucurbita maxima] | e value: 2.5e-192 | % identity: 85.23
Query:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI
        MAAM LLS S VSP  PKQRI       PD+ KLRAL SRGF     GEC+SL+  RL C    S  SMSP+E +KGVWIWS DRQVM AAVERGWSTFI
Subjt:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI

Query:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF
        FSPHN+ELADEWSSIALI PLF+ EDGVFDGEGRLIATV EVSNPQQLEQLQPSNAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTP+EAQIF
Subjt:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF

Query:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
        LEALE GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKATIT IH+AGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
Subjt:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV

Query:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD
        NAGPVHAYVA+PG KTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLVL+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKA PVTSL+VGD
Subjt:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD

Query:  EVFLRLQGEARHT
        EVFLRLQGEARHT
Subjt:  EVFLRLQGEARHT

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo] | e value: 2.9e-193 | % identity: 85.23
Query:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI
        MA M LLS SPVS + PKQRI T K   PD+ KLRAL SRGF     GEC+SL+  RL C    S  SMSP+E +KGVWIWSE+RQVM AAVERGWSTFI
Subjt:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI

Query:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF
        FSPHN+ELADEWSSIALI PLF+ EDGVFD EGRLIATV EVSNPQQLEQLQPSNAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTP+EAQIF
Subjt:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF

Query:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
        LEALE GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKATIT IH+AGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
Subjt:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV

Query:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD
        NAGPVHAYVA+PGGKTSYLSEL+AGKEVIVVDQEGRQRT IVGRVKIETRQLVL+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKA PVTSL+VGD
Subjt:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD

Query:  EVFLRLQGEARHT
        EVFLRLQGEARHT
Subjt:  EVFLRLQGEARHT

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida] | e value: 3.3e-189 | % identity: 84.8
Query:  LLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQCSYGS----MSPVEGTKGVWIWSEDRQVMAAAVERGWSTFIFSPHN
        L SSSPVSP L KQRI +   + P++  LR L SR F     GEC+S    RLQCSY S    MSP E +KGVWIWSE +QVM AAVERGWSTFIFSPHN
Subjt:  LLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQCSYGS----MSPVEGTKGVWIWSEDRQVMAAAVERGWSTFIFSPHN

Query:  RELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALE
         ELADEWSSIALIHPLF+KE+GVFDGEGRLIA+V EVSNPQQLEQLQP+NASAD VVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTP+EAQIFLEALE
Subjt:  RELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALE

Query:  QGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
         GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKATITQIH+AGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
Subjt:  QGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV

Query:  HAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLR
        HAYVA+PGGKTSYLSEL AGKEVIVVDQEGRQRTAIVGRVKIETRQL+LVQAKRDSDEQT YSILLQNAETVALVCPGRGNEKK+ PVTSL+VGDEVFLR
Subjt:  HAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLR

Query:  LQGEARHT
        LQGEARHT
Subjt:  LQGEARHT

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3B8Q7 3-dehydroquinate synthase homolog | e value: 8.6e-183 | % identity: 82.15
Query:  LLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQCSYGS----MSPVEGTKGVWIWSEDRQVMAAAVERGWSTFIFSPHN
        L SSSPVSP+L KQRI   K   P++  LR L SR F     GEC+S    RLQCSY S    MSP+E +KGVWIWSE ++VM AAVERGWSTFIFSPHN
Subjt:  LLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQCSYGS----MSPVEGTKGVWIWSEDRQVMAAAVERGWSTFIFSPHN

Query:  RELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALE
         ELA EW+SIA+IHPLF+KEDGV DGE RLIA+V E+SNPQQLEQLQP+ ASAD VVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTP+EAQIF EALE
Subjt:  RELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALE

Query:  QGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
         GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKATITQIH+ GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPV
Subjt:  QGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV

Query:  HAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRG-NEKKATPVTSLEVGDEVFL
        HAYVA+PGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQL+LVQAKRDSDEQT YS+LLQNAETVALVCPG+G NEKKA  VTSL+VGDEVFL
Subjt:  HAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRG-NEKKATPVTSLEVGDEVFL

Query:  RLQGEARHT
        RLQGEARHT
Subjt:  RLQGEARHT

A0A5A7UEW0 3-dehydroquinate synthase-like protein | e value: 8.6e-183 | % identity: 82.15
Query:  LLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQCSYGS----MSPVEGTKGVWIWSEDRQVMAAAVERGWSTFIFSPHN
        L SSSPVSP+L KQRI   K   P++  LR L SR F     GEC+S    RLQCSY S    MSP+E +KGVWIWSE ++VM AAVERGWSTFIFSPHN
Subjt:  LLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQCSYGS----MSPVEGTKGVWIWSEDRQVMAAAVERGWSTFIFSPHN

Query:  RELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALE
         ELA EW+SIA+IHPLF+KEDGV DGE RLIA+V E+SNPQQLEQLQP+ ASAD VVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTP+EAQIF EALE
Subjt:  RELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALE

Query:  QGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
         GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKATITQIH+ GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPV
Subjt:  QGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV

Query:  HAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRG-NEKKATPVTSLEVGDEVFL
        HAYVA+PGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQL+LVQAKRDSDEQT YS+LLQNAETVALVCPG+G NEKKA  VTSL+VGDEVFL
Subjt:  HAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRG-NEKKATPVTSLEVGDEVFL

Query:  RLQGEARHT
        RLQGEARHT
Subjt:  RLQGEARHT

A0A6J1BVA9 uncharacterized protein LOC111005050 isoform X2 | e value: 2.1e-181 | % identity: 81.17
Query:  MPLLSSSPVSP-ILPKQRIATKKRRIPDHSKLRALTSRGFGE-----CRSLKAGRLQCSYGSMSPV--EGTKGVWIWSEDRQVMAAAVERGWSTFIFSPH
        M  L +SP SP +L K RI T     PD+SKL AL S  FG+     C+S+ A  +QCS  SMSP   E +KGVW+WSE+RQV+ AAVERGW+TF+FSPH
Subjt:  MPLLSSSPVSP-ILPKQRIATKKRRIPDHSKLRALTSRGFGE-----CRSLKAGRLQCSYGSMSPV--EGTKGVWIWSEDRQVMAAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEAL
        NRELA +WSSIA I  LF+KEDG+FD EG LIATVFEVSNPQQLEQLQP NAS DNVVVDLQDWQIIPAENIVAAFQGS+K VFAVSKTP+EAQIFLEAL
Subjt:  NRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEAL

Query:  EQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        E GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKAT+TQIH+AGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFL
        VHAYVA+PGGKTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQL+LVQAKRDSD+QT Y ILLQNAETVALVCPGRGNEKKA PVTSL+VGD+VFL
Subjt:  VHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFL

Query:  RLQGEARHT
        RLQGEARHT
Subjt:  RLQGEARHT

A0A6J1EKW1 uncharacterized protein LOC111435491 | e value: 4.9e-194 | % identity: 84.99
Query:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI
        MAA+  LS SPVSP+ PKQRI T K   PDH KLRAL SRGF     GEC+SL+  RL C    S  SMSP+E +KGVWIWSED+QVM AAVERGWSTFI
Subjt:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI

Query:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF
        FSPHN+ELADEWSSIALI PLF+ EDGVFD EGRLIATV EVSNPQQLEQLQPSNAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTP+EAQIF
Subjt:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF

Query:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
        LEALE GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKATIT IH+AGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
Subjt:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV

Query:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD
        NAGPVHAYVA+PGGKTSYLSEL+AGKEVIVVDQ+GRQRT IVGRVKIETRQLVL+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKA PVTSL+VGD
Subjt:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD

Query:  EVFLRLQGEARHT
        EVFLRLQGEARHT
Subjt:  EVFLRLQGEARHT

A0A6J1I437 uncharacterized protein LOC111469713 | e value: 1.2e-192 | % identity: 85.23
Query:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI
        MAAM LLS S VSP  PKQRI       PD+ KLRAL SRGF     GEC+SL+  RL C    S  SMSP+E +KGVWIWS DRQVM AAVERGWSTFI
Subjt:  MAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGF-----GECRSLKAGRLQC----SYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFI

Query:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF
        FSPHN+ELADEWSSIALI PLF+ EDGVFDGEGRLIATV EVSNPQQLEQLQPSNAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTP+EAQIF
Subjt:  FSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIF

Query:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
        LEALE GLGGVILKV +PEA+FQLKDYFDRR E+S LLSLTKATIT IH+AGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
Subjt:  LEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV

Query:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD
        NAGPVHAYVA+PG KTSYLSEL+AGKEVIVVDQEGRQRTAIVGRVKIETRQLVL+QAKRDSDEQT YSILLQNAETVALVCPGRGNEKKA PVTSL+VGD
Subjt:  NAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGD

Query:  EVFLRLQGEARHT
        EVFLRLQGEARHT
Subjt:  EVFLRLQGEARHT

SwissProt top hits (columns: e value, % identity, alignment)
A0B6K6 3-dehydroquinate synthase | e value: 2.0e-56 | % identity: 38.57
Query:  WSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALIHPLFMKEDGVFD------------------GEGRLIATVFEVSNPQQLEQLQPSNASADNVV
        W + + ++  A+E G+   + S  + EL  E  SI +    F +E G  D                    GR I    E+ + +            D ++
Subjt:  WSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALIHPLFMKEDGVFD------------------GEGRLIATVFEVSNPQQLEQLQPSNASADNVV

Query:  VDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLC
        V   DW++IP EN++AA QG    + +  ++  EA++ L  LE G  GV+L   +P  I +++   +R   S   + L  AT+  +   GMGDRVCVD C
Subjt:  VDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLC

Query:  SLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSD
        SLMR GEG+LVGS +R  FLV SE  ES Y+A+RPFRVNAG VHAY+ + G KT YLSEL++G EV +VD++G  R+A+VGRVKIE R ++LV+A+ D +
Subjt:  SLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSD

Query:  EQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLRLQGEARH
             S LLQNAET+ LV     ++     V  L+ GD+V + ++  ARH
Subjt:  EQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLRLQGEARH

A4G0J1 3-dehydroquinate synthase | e value: 4.6e-56 | % identity: 47.27
Query:  DNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYE--SSYLLSLTKATITQIHLAGMGDR
        D VV++  DW IIP ENI+A   G +  + +V     +A+   E LE+G+ GV+L    PE I ++KD F +  E  +S  L L  AT+T+I   G GDR
Subjt:  DNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYE--SSYLLSLTKATITQIHLAGMGDR

Query:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQ
        VC+D CS+M  GEG+L+GSY+RG+FLVHSE +E+ Y+A+RPFRVNAGPVHAY+  P  KT YLS+L+AG +V+VV++ G  R +I+GRVKIE R L LV+
Subjt:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQ

Query:  AKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLRLQGEARH
        A+ + +       +LQNAET+ LV    G + K   V  L+VG +V ++    ARH
Subjt:  AKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLRLQGEARH

O26680 3-dehydroquinate synthase | e value: 1.5e-59 | % identity: 47.69
Query:  GRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRY
        GR +A   E+ +    E  +      D +++  +DW+IIP ENI+A  Q     + A      EA++ LE LE G  GV++   EP  I Q+KD      
Subjt:  GRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRY

Query:  E-SSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVV
           S    L  ATIT+I   G GDRVCVD CS+M  GEG+LVGSY++GLFLVHSE LES Y+ASRPFRVNAGPV AYV +PGG+T YLSEL+ G EVI+V
Subjt:  E-SSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVV

Query:  DQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLRLQGEARH
        D++GR R+AIVGRVKIE R L+LV+A+ +  +      LLQNAET+ LV     ++ +   V+ L  GD V +     ARH
Subjt:  DQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLRLQGEARH

Q2NI00 3-dehydroquinate synthase | e value: 1.3e-58 | % identity: 37.64
Query:  KGVWI-----WSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALI--------------HPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNA
        K  WI     W++ ++ +  ++E G+   I    N E   +  S+ +I              + + M +       G+ +A   E++N      +     
Subjt:  KGVWI-----WSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALI--------------HPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNA

Query:  SADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDR
         AD V++  ++W++IP ENI+A+ Q     +        EA++ LE +E G  GV+L   +   I +L    ++  + SY   L  AT+T++   G+GDR
Subjt:  SADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDR

Query:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQ
        VCVD CS+M  G+G+LVGS+A GLFLVHSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL+AG EV+ ++ +G   T IVGRVKIE R L+L++
Subjt:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQ

Query:  AKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLRLQGEARH
        AK  +        L+QNAET+ LV     ++ +   V+ L+VGD+V       ARH
Subjt:  AKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLRLQGEARH

Q58646 3-dehydroquinate synthase | e value: 1.7e-58 | % identity: 39.54
Query:  WSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALI-HPL--------------FMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDL
        W E ++++  A+E      +  P + E   E  +I +  H L              F+KE     G+   I    E    ++           DN++++ 
Subjt:  WSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALI-HPL--------------FMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDL

Query:  QDWQIIPAENIVA-AFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSL
        +DW IIP EN++A  F    K V +V+    EA++  E LE+G  GV+L     E I +L    +   +    ++L  AT+T++   G GDRVC+D CSL
Subjt:  QDWQIIPAENIVA-AFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSL

Query:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQ
        M+ GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG +V++VD++G  R AIVGRVKIE R LVL++A+   D  
Subjt:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQ

Query:  TFYSILLQNAETVALVCPGRGNEK-KATPVTSLEVGDEVFLRLQGEARH
             +LQNAET+ LV     NEK +   V  L+ GD+V ++ +  ARH
Subjt:  TFYSILLQNAETVALVCPGRGNEK-KATPVTSLEVGDEVFLRLQGEARH

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink). | e value: 1.7e-130 | % identity: 66.86
Query:  KGVWIWSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAEN
        K VWIW+  ++VM  AVERGW+TFIFS  NR+L++EWSSIAL+  LF++E  V DG G ++A+VFEVS P++L  L   N   +N+V+D  DW+ IPAEN
Subjt:  KGVWIWSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAEN

Query:  IVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGS
        +VAA QGS+KTVFAVS TP EA++FLEALE GLGG+ILK  + +A+  LK+YFD+R E S  LSLT+ATIT++ + GMGDRVCVDLCSLMRPGEGLLVGS
Subjt:  IVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGS

Query:  YARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDS-DEQTFYSILLQNA
        +ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVA+PGGKT YLSEL+ G+EVIVVDQ+G+QRTA+VGRVKIE R L++V+AK  + +E+T YSI+LQNA
Subjt:  YARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDS-DEQTFYSILLQNA

Query:  ETVALVCPGRGNE--KKATPVTSLEVGDEVFLRLQGEARHT
        ETVALV P + N   + A PVTSL+ GD+V +RLQG ARHT
Subjt:  ETVALVCPGRGNE--KKATPVTSLEVGDEVFLRLQGEARHT

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). | e value: 1.7e-130 | % identity: 66.86
Query:  KGVWIWSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAEN
        K VWIW+  ++VM  AVERGW+TFIFS  NR+L++EWSSIAL+  LF++E  V DG G ++A+VFEVS P++L  L   N   +N+V+D  DW+ IPAEN
Subjt:  KGVWIWSEDRQVMAAAVERGWSTFIFSPHNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAEN

Query:  IVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGS
        +VAA QGS+KTVFAVS TP EA++FLEALE GLGG+ILK  + +A+  LK+YFD+R E S  LSLT+ATIT++ + GMGDRVCVDLCSLMRPGEGLLVGS
Subjt:  IVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVILKVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGS

Query:  YARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDS-DEQTFYSILLQNA
        +ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVA+PGGKT YLSEL+ G+EVIVVDQ+G+QRTA+VGRVKIE R L++V+AK  + +E+T YSI+LQNA
Subjt:  YARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDS-DEQTFYSILLQNA

Query:  ETVALVCPGRGNE--KKATPVTSLEVGDEVFLRLQGEARHT
        ETVALV P + N   + A PVTSL+ GD+V +RLQG ARHT
Subjt:  ETVALVCPGRGNE--KKATPVTSLEVGDEVFLRLQGEARHT
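
In the BLAST-style alignments above, the unlabeled middle line shows a letter where query and subject residues are identical, a `+` for a conservative substitution, and a blank for a mismatch or gap. The following is a minimal Python sketch of counting identities and positives for a single alignment block from its query and middle lines. The function name `midline_stats` is hypothetical, and the sketch assumes the sequence text has already been separated from the `Query:` label and leading padding. The % identity values listed for each hit cover the whole alignment, so a single block will generally not reproduce them.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    `query` is the aligned query row (gaps shown as '-'); `midline` is the
    row beneath it. The midline is padded with spaces in case trailing
    blanks were stripped during extraction.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": percent,
    }


# Final block of the first GenBank hit, where every residue is identical.
print(midline_stats("EVFLRLQGEARHT", "EVFLRLQGEARHT"))
```

Summing `length` and `identities` over all blocks of one hit before dividing is one way to approach the tabulated figure, assuming the page reports identities over the full alignment length including gaps.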


Sequences
CDS sequence
ATGTTTATCGTTTACGATTGGGCTTCAAACAGTGGAAGAACAACAACAATGGCGGCCATGCCCTTGCTCTCTTCCTCTCCCGTTTCTCCGATTCTTCCCAAACAGCGAAT
CGCCACCAAGAAAAGGAGAATCCCAGATCATTCGAAACTTCGAGCCCTAACTTCAAGGGGTTTTGGAGAATGTAGATCTTTGAAGGCAGGTCGTTTGCAGTGTTCTTACG
GTTCGATGTCTCCGGTTGAGGGGACGAAGGGGGTGTGGATTTGGAGTGAGGATCGGCAGGTTATGGCGGCGGCGGTTGAGAGGGGATGGAGCACTTTCATCTTCTCGCCG
CACAATCGGGAACTTGCTGATGAATGGTCCTCAATTGCACTAATACATCCACTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGAAGACTAATTGCCACAGTTTT
TGAGGTTTCTAATCCCCAGCAGTTGGAGCAGCTTCAACCGTCAAATGCATCCGCTGACAATGTTGTTGTTGATTTACAAGATTGGCAGATAATACCTGCAGAGAATATTG
TTGCAGCATTTCAGGGGAGTCAGAAAACAGTATTTGCGGTCTCGAAAACTCCTATGGAAGCTCAAATTTTCCTTGAGGCACTTGAACAAGGTCTGGGTGGAGTTATTTTG
AAAGTCGGAGAACCTGAAGCTATTTTTCAGCTAAAGGACTATTTTGACAGAAGATATGAAAGTAGTTATCTTCTGAGCTTGACTAAGGCTACTATAACTCAAATCCACCT
TGCTGGAATGGGAGATCGAGTTTGTGTCGATCTCTGTAGTCTTATGAGACCCGGTGAAGGACTTCTTGTGGGGTCCTATGCCAGAGGACTGTTCCTGGTTCACTCAGAAT
GCTTAGAGTCAAATTACATTGCTAGCCGGCCTTTTCGTGTTAATGCTGGACCTGTCCATGCCTATGTAGCTATCCCAGGAGGGAAAACTAGCTACCTTTCTGAGTTACAA
GCAGGCAAAGAGGTAATTGTAGTAGATCAAGAAGGCAGGCAACGAACCGCTATTGTCGGACGTGTAAAGATAGAGACTAGGCAGCTGGTCCTCGTCCAGGCAAAGAGAGA
TTCAGATGAGCAAACTTTCTACAGCATCCTTCTGCAGAACGCAGAAACAGTTGCCTTAGTATGCCCCGGTCGAGGAAATGAGAAGAAAGCCACCCCTGTTACCTCACTTG
AAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCTAGGCATACAGAAAAAGGAAGGAAAAAAAGGGAAAAAACCCATCCGTGTGACTGCATAATTTTGTGGCAA
AATTTTCATGGTAGAAAAAGGAAGAAAAAAAAAAGGGAGAAAAACAATCTGTGTAAGCTGCATAATTTTATTACAAAAAAGTACATAAATTAA
mRNA sequence
TAAACTTTCACCATAATTAAAAAGGAAAACAAAAAAAAAACTAGCAGCATTACCCACTAATTCGATCTCTATAATTTGGAAGAATTTATAAAATTAAACACATATTTGTT
CCAATTTCGAACCCAATTTTTCAAAACAGAATCGATACAAATTTGTGATGCTTAATTTTATAAAAACAATTGTGTTCATTTTACGTCAAATCCAATATGATCGTTGAAAT
CTTTCGCAAAGGGCCATGTTTATCGTTTACGATTGGGCTTCAAACAGTGGAAGAACAACAACAATGGCGGCCATGCCCTTGCTCTCTTCCTCTCCCGTTTCTCCGATTCT
TCCCAAACAGCGAATCGCCACCAAGAAAAGGAGAATCCCAGATCATTCGAAACTTCGAGCCCTAACTTCAAGGGGTTTTGGAGAATGTAGATCTTTGAAGGCAGGTCGTT
TGCAGTGTTCTTACGGTTCGATGTCTCCGGTTGAGGGGACGAAGGGGGTGTGGATTTGGAGTGAGGATCGGCAGGTTATGGCGGCGGCGGTTGAGAGGGGATGGAGCACT
TTCATCTTCTCGCCGCACAATCGGGAACTTGCTGATGAATGGTCCTCAATTGCACTAATACATCCACTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGAAGACT
AATTGCCACAGTTTTTGAGGTTTCTAATCCCCAGCAGTTGGAGCAGCTTCAACCGTCAAATGCATCCGCTGACAATGTTGTTGTTGATTTACAAGATTGGCAGATAATAC
CTGCAGAGAATATTGTTGCAGCATTTCAGGGGAGTCAGAAAACAGTATTTGCGGTCTCGAAAACTCCTATGGAAGCTCAAATTTTCCTTGAGGCACTTGAACAAGGTCTG
GGTGGAGTTATTTTGAAAGTCGGAGAACCTGAAGCTATTTTTCAGCTAAAGGACTATTTTGACAGAAGATATGAAAGTAGTTATCTTCTGAGCTTGACTAAGGCTACTAT
AACTCAAATCCACCTTGCTGGAATGGGAGATCGAGTTTGTGTCGATCTCTGTAGTCTTATGAGACCCGGTGAAGGACTTCTTGTGGGGTCCTATGCCAGAGGACTGTTCC
TGGTTCACTCAGAATGCTTAGAGTCAAATTACATTGCTAGCCGGCCTTTTCGTGTTAATGCTGGACCTGTCCATGCCTATGTAGCTATCCCAGGAGGGAAAACTAGCTAC
CTTTCTGAGTTACAAGCAGGCAAAGAGGTAATTGTAGTAGATCAAGAAGGCAGGCAACGAACCGCTATTGTCGGACGTGTAAAGATAGAGACTAGGCAGCTGGTCCTCGT
CCAGGCAAAGAGAGATTCAGATGAGCAAACTTTCTACAGCATCCTTCTGCAGAACGCAGAAACAGTTGCCTTAGTATGCCCCGGTCGAGGAAATGAGAAGAAAGCCACCC
CTGTTACCTCACTTGAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCTAGGCATACAGAAAAAGGAAGGAAAAAAAGGGAAAAAACCCATCCGTGTGACTGC
ATAATTTTGTGGCAAAATTTTCATGGTAGAAAAAGGAAGAAAAAAAAAAGGGAGAAAAACAATCTGTGTAAGCTGCATAATTTTATTACAAAAAAGTACATAAATTAACG
ATCTCCATTTGATAACCATTTTGTTTTTTGTTTTTTGTTTATGTAATTTAAGCCTATTTTTATTCAAA
Protein sequence
MFIVYDWASNSGRTTTMAAMPLLSSSPVSPILPKQRIATKKRRIPDHSKLRALTSRGFGECRSLKAGRLQCSYGSMSPVEGTKGVWIWSEDRQVMAAAVERGWSTFIFSP
HNRELADEWSSIALIHPLFMKEDGVFDGEGRLIATVFEVSNPQQLEQLQPSNASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPMEAQIFLEALEQGLGGVIL
KVGEPEAIFQLKDYFDRRYESSYLLSLTKATITQIHLAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAIPGGKTSYLSELQ
AGKEVIVVDQEGRQRTAIVGRVKIETRQLVLVQAKRDSDEQTFYSILLQNAETVALVCPGRGNEKKATPVTSLEVGDEVFLRLQGEARHTEKGRKKREKTHPCDCIILWQ
NFHGRKRKKKKREKNNLCKLHNFITKKYIN
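
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal Python sketch that assumes the standard genetic code (NCBI translation table 1); the `translate` function and the constant names are hypothetical. It is applied only to the first 21 and last 15 bases of the CDS listed above, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters other than A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTTATCGTTTACGATTGG"))  # first 21 bases -> MFIVYDW
print(translate("AAGTACATAAATTAA"))        # last 15 bases  -> KYIN
```

Both fragments agree with the start (MFIVYDW) and end (KYIN) of the protein sequence listed above. Passing the full CDS, with line breaks included, would give a complete comparison.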