; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G12732 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G12732
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description3-dehydroquinate synthase homolog
Genome locationctg1838:3280826..3284751
RNA-Seq ExpressionCucsat.G12732
SyntenyCucsat.G12732
Gene Ontology termsGO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR002812 - 3-dehydroquinate synthase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147467.1 uncharacterized protein LOC101203995 [Cucumis sativus]7.80e-304100Show/hide
Query:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_008443422.1 PREDICTED: 3-dehydroquinate synthase homolog [Cucumis melo]4.95e-29195.27Show/hide
Query:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MAFLRSSSPVSP LSKQRITYLKTPENL LRPL+SR+FG+AYAGECKSSD+SRLQCSYTSSSSPMSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHNTELAHEW+SIA+IHPLFIKE+GVLDGEDRLIASVVE+SNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIF E
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGS+ARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELQAG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata]5.82e-25386.3Show/hide
Query:  SSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELA
        S SPVSP   KQRI   K P++L LR L+SR FG A  GECKS +++RL CS TSSSS MSPIEASKGVWIWSE QQVMTAAVERGWSTFIFSPHN ELA
Subjt:  SSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELA

Query:  HEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLG
         EWSSIALI PLFI E+GV D E RLIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDWQIIPAENIVAAFQGS+KTVFA+SKTPIEAQIFLEALEHGLG
Subjt:  HEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLG

Query:  GVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYV
        GVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLM+PGEGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYV
Subjt:  GVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYV

Query:  AVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQG
        AVPGGKTSYLSEL+AG EVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALVCPG+GN EKKAIPVTSLKVGDEVFLRLQG
Subjt:  AVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQG

Query:  EARHTGIEIQEFIVEK
        EARHTGIEIQEFIVEK
Subjt:  EARHTGIEIQEFIVEK

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo]2.03e-25385.82Show/hide
Query:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MA   L S SPVS    KQRI   K P+NL LR L+SR FG A  GECKS +++RL CS TSSSS MSPIEASKGVWIWSE +QVMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV D E RLIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDWQIIPAENIVAAFQGS+KTVFA+SKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLMRPGEGLLVGSYARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AG EVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALVCPG+GN EKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida]3.26e-28093.63Show/hide
Query:  MAMAFLRSSSPVSPFLSKQRITYL-KTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIF
        M MA L SSSPVSPFLSKQRI+Y  KTPENL LRPL+SRDFGEAYAGECKSS+VSRLQCSY S  S MSP EASKGVWIWSECQQVMTAAVERGWSTFIF
Subjt:  MAMAFLRSSSPVSPFLSKQRITYL-KTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIF

Query:  SPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFL
        SPHNTELA EWSSIALIHPLFIKENGV DGE RLIASVVEVSNPQQLEQLQPA ASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFL
Subjt:  SPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFL

Query:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVN
        EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGSYARGLFL+HSECLESNYIASRPFRVN
Subjt:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVN

Query:  AGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGD
        AGPVHAYVAVPGGKTSYLSEL AG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+GN EKK+IPVTSLKVGD
Subjt:  AGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGD

Query:  EVFLRLQGEARHTGIEIQEFIVEK
        EVFLRLQGEARHTGIEIQEFIVEK
Subjt:  EVFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hitse value%identityAlignment
A0A0A0LHS3 Uncharacterized protein2.12e-255100Show/hide
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A0A1S3B8Q7 3-dehydroquinate synthase homolog2.40e-29195.27Show/hide
Query:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MAFLRSSSPVSP LSKQRITYLKTPENL LRPL+SR+FG+AYAGECKSSD+SRLQCSYTSSSSPMSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHNTELAHEW+SIA+IHPLFIKE+GVLDGEDRLIASVVE+SNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIF E
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGS+ARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELQAG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein2.40e-29195.27Show/hide
Query:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MAFLRSSSPVSP LSKQRITYLKTPENL LRPL+SR+FG+AYAGECKSSD+SRLQCSYTSSSSPMSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHNTELAHEW+SIA+IHPLFIKE+GVLDGEDRLIASVVE+SNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIF E
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGS+ARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELQAG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A6J1EKW1 uncharacterized protein LOC1114354912.82e-25386.3Show/hide
Query:  SSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELA
        S SPVSP   KQRI   K P++L LR L+SR FG A  GECKS +++RL CS TSSSS MSPIEASKGVWIWSE QQVMTAAVERGWSTFIFSPHN ELA
Subjt:  SSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELA

Query:  HEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLG
         EWSSIALI PLFI E+GV D E RLIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDWQIIPAENIVAAFQGS+KTVFA+SKTPIEAQIFLEALEHGLG
Subjt:  HEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLG

Query:  GVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYV
        GVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLM+PGEGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYV
Subjt:  GVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYV

Query:  AVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQG
        AVPGGKTSYLSEL+AG EVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALVCPG+GN EKKAIPVTSLKVGDEVFLRLQG
Subjt:  AVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQG

Query:  EARHTGIEIQEFIVEK
        EARHTGIEIQEFIVEK
Subjt:  EARHTGIEIQEFIVEK

A0A6J1I437 uncharacterized protein LOC1114697138.07e-25385.58Show/hide
Query:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MA   L S S VSPF  KQRI   + P+NL LR L+SR FG A  GECKS +++RL CS  SSSS MSPIEASKGVWIWS  +QVMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV DGE RLIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDWQIIPAENIVAAFQGS+KTVFA+SKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLMRPGEGLLVGSYARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPG KTSYLSEL+AG EVIVVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALVCPG+GN EKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

SwissProt top hitse value%identityAlignment
A0B6K6 3-dehydroquinate synthase1.8e-6439.89Show/hide
Query:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI--------HPLFIKENGV--------LDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVD
        W + + ++T A+E G+   + S  + EL  E  SI +           L I +  V        ++   R I   VE+ + +            D ++V 
Subjt:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI--------HPLFIKENGV--------LDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVD

Query:  LQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSL
          DW++IP EN++AA QG    + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + ++L  AT+  +  VGMGDRVCVD CSL
Subjt:  LQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSL

Query:  MRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ
        MR GEG+LVGS +R  FL+ SE  ES Y+A+RPFRVNAG VHAY+ V G KT YLSEL++G+EV +VD++G  R+A+VGRVKIE R +ILV+A+ D +  
Subjt:  MRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ

Query:  TPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
           S LLQNAET+ LV     +++   I V  LK GD+V + ++  ARH G+ I+E I+E+
Subjt:  TPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A4G0J1 3-dehydroquinate synthase3.2e-6146.82Show/hide
Query:  DIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVC
        D VV++  DW IIP ENI+A   G +  + ++     +A+   E LE G+ GV+L  ED   V       +R N  S  L L  AT+T+I  VG GDRVC
Subjt:  DIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVC

Query:  VDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAK
        +D CS+M  GEG+L+GSY+RG+FL+HSE +E+ Y+A+RPFRVNAGPVHAY+  P  KT YLS+L+AG++V+VV++ G  R +I+GRVKIE R L LV+A+
Subjt:  VDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAK

Query:  RDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
         + +       +LQNAET+ LV       + K + V  LKVG +V ++    ARH G+ I+E IVEK
Subjt:  RDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

O26680 3-dehydroquinate synthase5.7e-6647.99Show/hide
Query:  LDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYF
        L    R +A+ VE+ +    E  +      D +++  +DW+IIP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD  
Subjt:  LDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYF

Query:  D-RRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNE
            N  S    L  ATIT+I  +G GDRVCVD CS+M  GEG+LVGSY++GLFL+HSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL+ G+E
Subjt:  D-RRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNE

Query:  VIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VI+VD++GR R+AIVGRVKIE R L+LV+A+    E      LLQNAET+ LV     N++ + + V+ L  GD V +     ARH G+ I+E I+EK
Subjt:  VIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q2NI00 3-dehydroquinate synthase4.1e-6438.75Show/hide
Query:  KGVWI-----WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRL--------------IASVVEVSNPQQLEQLQPARA
        K  WI     W++ ++ +  ++E G+   I    N E   +  S+ +I      +  +L   +++              +A+ VE++N      +     
Subjt:  KGVWI-----WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRL--------------IASVVEVSNPQQLEQLQPARA

Query:  SADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR
         AD V++  ++W++IP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D   + +L    ++ ++ S   +L  AT+T++  VG+GDR
Subjt:  SADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR

Query:  VCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ
        VCVD CS+M  G+G+LVGS+A GLFL+HSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL+AG+EV+ ++ +G   T IVGRVKIE R L+L++
Subjt:  VCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ

Query:  AKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        AK    + +    L+QNAET+ LV     N++ + I V+ LKVGD+V       ARH G+ I+E I+EK
Subjt:  AKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q58646 3-dehydroquinate synthase7.0e-6438.89Show/hide
Query:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI-HPL--------------FIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDL
        W E ++++T A+E      +  P + E   E  +I +  H L              F+KE   L G++  I   +E    ++           D ++++ 
Subjt:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI-HPL--------------FIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDL

Query:  QDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLM
        +DW IIP EN++A        + A   +  EA++  E LE G  GV+L  ++ E + +L    +  N+    L++  AT+T++  +G GDRVC+D CSLM
Subjt:  QDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLM

Query:  RPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQT
        + GEG+L+GSY+R LFL+HSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG++V++VD++G  R AIVGRVKIE R L+L++A+   D   
Subjt:  RPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQT

Query:  PYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
            +LQNAET+ LV     N + + I V  LK GD+V ++ +  ARH G+ I+E I+EK
Subjt:  PYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Arabidopsis top hitse value%identityAlignment
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink).3.0e-13966.76Show/hide
Query:  RLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARA
        R+    ++S+ PM+ +  +K VWIW+ C++VMT AVERGW+TFIFS  N +L++EWSSIAL+  LFI+E  V+DG   ++ASV EVS P++L  L     
Subjt:  RLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARA

Query:  SADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR
          + +V+D  DW+ IPAEN+VAA QGS+KTVFA+S TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ L+LT+ATIT++ +VGMGDR
Subjt:  SADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR

Query:  VCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ
        VCVDLCSLMRPGEGLLVGS+ARGLFL+HSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSEL+ G EVIVVDQ+G+QRTA+VGRVKIE R LI+V+
Subjt:  VCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ

Query:  AKRDS-DEQTPYSVLLQNAETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        AK  + +E+T YS++LQNAETVALV P Q N+  + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  AKRDS-DEQTPYSVLLQNAETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).3.0e-13966.76Show/hide
Query:  RLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARA
        R+    ++S+ PM+ +  +K VWIW+ C++VMT AVERGW+TFIFS  N +L++EWSSIAL+  LFI+E  V+DG   ++ASV EVS P++L  L     
Subjt:  RLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARA

Query:  SADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR
          + +V+D  DW+ IPAEN+VAA QGS+KTVFA+S TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ L+LT+ATIT++ +VGMGDR
Subjt:  SADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR

Query:  VCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ
        VCVDLCSLMRPGEGLLVGS+ARGLFL+HSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSEL+ G EVIVVDQ+G+QRTA+VGRVKIE R LI+V+
Subjt:  VCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ

Query:  AKRDS-DEQTPYSVLLQNAETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        AK  + +E+T YS++LQNAETVALV P Q N+  + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  AKRDS-DEQTPYSVLLQNAETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATGGCCTTCCTCCGTTCCTCCTCGCCCGTTTCTCCTTTTCTTTCCAAACAGCGCATCACCTACCTGAAAACACCAGAGAATTTATACCTTCGGCCCCTAGTTTC
GAGGGATTTTGGCGAAGCCTATGCTGGAGAATGTAAGTCTTCGGATGTGAGTCGTTTACAGTGTTCTTACACTTCCTCCTCCTCTCCAATGTCTCCGATTGAGGCGTCGA
AGGGAGTGTGGATTTGGAGTGAGTGTCAGCAGGTAATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTCATGAATGG
TCTTCAATTGCACTTATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTAGATGGAGAAGATAGACTAATTGCCTCAGTTGTTGAGGTCTCTAACCCCCAGCAGTTGGA
GCAGCTTCAACCAGCACGTGCATCTGCAGACATTGTTGTTGTTGATTTACAAGATTGGCAGATAATACCTGCAGAGAATATTGTCGCAGCGTTTCAGGGAAGTCAGAAAA
CAGTGTTTGCCATCTCGAAAACTCCTATTGAGGCTCAAATCTTTCTTGAGGCGCTTGAACATGGTCTGGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTT
CAGCTAAAGGACTATTTTGACAGAAGAAACGAAGCTAGTAATCTTTTGAATTTGACTAAGGCTACCATTACTCAAATTCATGTGGTTGGAATGGGAGATCGAGTTTGTGT
CGACCTCTGTAGTCTCATGAGACCCGGCGAAGGGCTTCTTGTCGGTTCATATGCGAGAGGACTGTTCCTTATTCATTCGGAATGCTTAGAATCAAATTACATCGCTAGCC
GACCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTATGTAGCTGTTCCGGGAGGGAAAACTAGTTACCTTTCCGAGTTACAAGCAGGCAACGAGGTAATTGTAGTTGAT
CAGGAAGGCAGACAGCGAACTGCTATTGTTGGACGTGTAAAGATTGAGACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCGT
CCTTCTGCAGAATGCGGAAACAGTTGCCTTGGTGTGCCCTGGTCAAGGAAATAATGAGAAGAAAGCCATACCTGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTGA
GATTGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCATGGCCTTCCTCCGTTCCTCCTCGCCCGTTTCTCCTTTTCTTTCCAAACAGCGCATCACCTACCTGAAAACACCAGAGAATTTATACCTTCGGCCCCTAGTTTC
GAGGGATTTTGGCGAAGCCTATGCTGGAGAATGTAAGTCTTCGGATGTGAGTCGTTTACAGTGTTCTTACACTTCCTCCTCCTCTCCAATGTCTCCGATTGAGGCGTCGA
AGGGAGTGTGGATTTGGAGTGAGTGTCAGCAGGTAATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTCATGAATGG
TCTTCAATTGCACTTATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTAGATGGAGAAGATAGACTAATTGCCTCAGTTGTTGAGGTCTCTAACCCCCAGCAGTTGGA
GCAGCTTCAACCAGCACGTGCATCTGCAGACATTGTTGTTGTTGATTTACAAGATTGGCAGATAATACCTGCAGAGAATATTGTCGCAGCGTTTCAGGGAAGTCAGAAAA
CAGTGTTTGCCATCTCGAAAACTCCTATTGAGGCTCAAATCTTTCTTGAGGCGCTTGAACATGGTCTGGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTT
CAGCTAAAGGACTATTTTGACAGAAGAAACGAAGCTAGTAATCTTTTGAATTTGACTAAGGCTACCATTACTCAAATTCATGTGGTTGGAATGGGAGATCGAGTTTGTGT
CGACCTCTGTAGTCTCATGAGACCCGGCGAAGGGCTTCTTGTCGGTTCATATGCGAGAGGACTGTTCCTTATTCATTCGGAATGCTTAGAATCAAATTACATCGCTAGCC
GACCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTATGTAGCTGTTCCGGGAGGGAAAACTAGTTACCTTTCCGAGTTACAAGCAGGCAACGAGGTAATTGTAGTTGAT
CAGGAAGGCAGACAGCGAACTGCTATTGTTGGACGTGTAAAGATTGAGACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCGT
CCTTCTGCAGAATGCGGAAACAGTTGCCTTGGTGTGCCCTGGTCAAGGAAATAATGAGAAGAAAGCCATACCTGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTGA
GATTGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATAA
Protein sequenceShow/hide protein sequence
MAMAFLRSSSPVSPFLSKQRITYLKTPENLYLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEW
SSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVF
QLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVD
QEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK