CuGenDBv2

CsGy3G036710 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy3G036710
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: 3-dehydroquinate synthase homolog
Genome location: Gy14Chr3:34806340..34810382
RNA-Seq Expression: CsGy3G036710
Synteny: CsGy3G036710
Gene Ontology terms:
    GO:0008652 - cellular amino acid biosynthetic process (biological process)
    GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
    GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
    GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR002812 - 3-dehydroquinate synthase
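The genome location field above uses a `seqid:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the field can be split into its parts and the genomic span computed. The names `Locus` and `parse_location` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "Gy14Chr3:34806340..34810382".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Locus:
    """Split a 'seqid:start..end' string into its three parts."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(match["seqid"], start, end)


locus = parse_location("Gy14Chr3:34806340..34810382")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene spans 4043 bp on Gy14Chr3.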


Homology
GenBank top hits (e value, % identity, alignment)
XP_004147467.1 uncharacterized protein LOC101203995 [Cucumis sativus] | e value: 2.71e-255 | % identity: 100
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

XP_008443422.1 PREDICTED: 3-dehydroquinate synthase homolog [Cucumis melo] | e value: 2.27e-247 | % identity: 96.36
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFSPHNTELAHEW+SIA+IHPLFIKE+GVLDGEDRLIASVVE+SNPQQLEQLQPARASADIVVVDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGSQKTVFAISKTPIEAQIF EALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGS+ARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VLLQNAETVALVCPGQGNNEKKAI VTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
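The % identity column can be related to the alignment text itself. In BLAST-style output, the unlabeled middle row repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below, with the hypothetical function name `percent_identity`, counts letters in the middle rows and divides by the aligned length taken from the `Query:` rows. It assumes the `Query:` prefix and leading padding have already been stripped from each row.

```python
from typing import Sequence


def percent_identity(query_rows: Sequence[str], midline_rows: Sequence[str]) -> float:
    """Percent identity = identical positions / alignment length * 100.

    The alignment length is taken from the query rows (gap characters
    included), because trailing blanks in a middle row are easily lost
    when a page is copied as text.
    """
    aligned = sum(len(row) for row in query_rows)
    if aligned == 0:
        raise ValueError("empty alignment")
    # Only letters in the middle row mark identical residues;
    # '+' (similar) and ' ' (mismatch or gap) are not counted.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / aligned


# Last row of the Cucumis melo alignment shown above.
query = "VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK"
middle = "VLLQNAETVALVCPGQGNNEKKAI VTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK"
print(round(percent_identity([query], [middle]), 2))  # 98.25 for this row alone
```

Passing all four row pairs of a hit, rather than the single row used here, gives the identity over the full alignment, which is the quantity the % identity column reports.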

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata] | e value: 1.76e-230 | % identity: 90.76
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIEASKGVWIWSE QQVMTAAVERGWSTFIFSPHN ELA EWSSIALI PLFI E+GV D E RLIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGS+KTVFA+SKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLM+PG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AG EVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +LLQNAETVALVCPG+GN EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo] | e value: 4.34e-231 | % identity: 91.04
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIEASKGVWIWSE +QVMTAAVERGWSTFIFSPHN ELA EWSSIALI PLFI E+GV D E RLIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGS+KTVFA+SKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AG EVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +LLQNAETVALVCPG+GN EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida] | e value: 1.46e-243 | % identity: 96.08
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSP EASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELA EWSSIALIHPLFIKENGV DGE RLIASVVEVSNPQQLEQLQPA ASADIVVVDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATITQIHV GMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL AG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +LLQNAETVALVCPG+GN EKK+IPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LHS3 Uncharacterized protein | e value: 1.11e-256 | % identity: 100
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A0A1S3B8Q7 3-dehydroquinate synthase homolog | e value: 1.10e-247 | % identity: 96.36
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFSPHNTELAHEW+SIA+IHPLFIKE+GVLDGEDRLIASVVE+SNPQQLEQLQPARASADIVVVDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGSQKTVFAISKTPIEAQIF EALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGS+ARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VLLQNAETVALVCPGQGNNEKKAI VTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein | e value: 1.10e-247 | % identity: 96.36
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFSPHNTELAHEW+SIA+IHPLFIKE+GVLDGEDRLIASVVE+SNPQQLEQLQPARASADIVVVDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGSQKTVFAISKTPIEAQIF EALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGS+ARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VLLQNAETVALVCPGQGNNEKKAI VTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A0A6J1EKW1 uncharacterized protein LOC111435491 | e value: 8.54e-231 | % identity: 90.76
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIEASKGVWIWSE QQVMTAAVERGWSTFIFSPHN ELA EWSSIALI PLFI E+GV D E RLIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGS+KTVFA+SKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLM+PG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AG EVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +LLQNAETVALVCPG+GN EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A0A6J1I437 uncharacterized protein LOC111469713 | e value: 1.21e-230 | % identity: 91.04
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIEASKGVWIWS  +QVMTAAVERGWSTFIFSPHN ELA EWSSIALI PLFI E+GV DGE RLIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGS+KTVFA+SKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPG KTSYLSEL+AG EVIVVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YS
Subjt:  EGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +LLQNAETVALVCPG+GN EKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

SwissProt top hits (e value, % identity, alignment)
A0B6K6 3-dehydroquinate synthase | e value: 9.1e-65 | % identity: 39.89
Query:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI--------HPLFIKENGV--------LDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVD
        W + + ++T A+E G+   + S  + EL  E  SI +           L I +  V        ++   R I   VE+ + +            D ++V 
Subjt:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI--------HPLFIKENGV--------LDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVD

Query:  LQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSL
          DW++IP EN++AA QG    + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + ++L  AT+  +  VGMGDRVCVD CSL
Subjt:  LQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSL

Query:  MRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ
        MR GEG+LVGS +R  FL+ SE  ES Y+A+RPFRVNAG VHAY+ V G KT YLSEL++G+EV +VD++G  R+A+VGRVKIE R +ILV+A+ D +  
Subjt:  MRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ

Query:  TPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
           S LLQNAET+ LV     +++   I V  LK GD+V + ++  ARH G+ I+E I+E+
Subjt:  TPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A4G0J1 3-dehydroquinate synthase | e value: 1.6e-61 | % identity: 46.82
Query:  DIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVC
        D VV++  DW IIP ENI+A   G +  + ++     +A+   E LE G+ GV+L  ED   V       +R N  S  L L  AT+T+I  VG GDRVC
Subjt:  DIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVC

Query:  VDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAK
        +D CS+M  GEG+L+GSY+RG+FL+HSE +E+ Y+A+RPFRVNAGPVHAY+  P  KT YLS+L+AG++V+VV++ G  R +I+GRVKIE R L LV+A+
Subjt:  VDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAK

Query:  RDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
         + +       +LQNAET+ LV       + K + V  LKVG +V ++    ARH G+ I+E IVEK
Subjt:  RDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

O26680 3-dehydroquinate synthase | e value: 2.2e-66 | % identity: 47.99
Query:  LDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYF
        L    R +A+ VE+ +    E  +      D +++  +DW+IIP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD  
Subjt:  LDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYF

Query:  D-RRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNE
            N  S    L  ATIT+I  +G GDRVCVD CS+M  GEG+LVGSY++GLFL+HSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL+ G+E
Subjt:  D-RRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNE

Query:  VIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VI+VD++GR R+AIVGRVKIE R L+LV+A+    E      LLQNAET+ LV     N++ + + V+ L  GD V +     ARH G+ I+E I+EK
Subjt:  VIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q2NI00 3-dehydroquinate synthase | e value: 2.0e-64 | % identity: 38.75
Query:  KGVWI-----WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRL--------------IASVVEVSNPQQLEQLQPARA
        K  WI     W++ ++ +  ++E G+   I    N E   +  S+ +I      +  +L   +++              +A+ VE++N      +     
Subjt:  KGVWI-----WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRL--------------IASVVEVSNPQQLEQLQPARA

Query:  SADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR
         AD V++  ++W++IP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D   + +L    ++ ++ S   +L  AT+T++  VG+GDR
Subjt:  SADIVVVDLQDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR

Query:  VCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ
        VCVD CS+M  G+G+LVGS+A GLFL+HSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL+AG+EV+ ++ +G   T IVGRVKIE R L+L++
Subjt:  VCVDLCSLMRPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ

Query:  AKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        AK    + +    L+QNAET+ LV     N++ + I V+ LKVGD+V       ARH G+ I+E I+EK
Subjt:  AKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q58646 3-dehydroquinate synthase | e value: 2.7e-64 | % identity: 38.89
Query:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI-HPL--------------FIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDL
        W E ++++T A+E      +  P + E   E  +I +  H L              F+KE   L G++  I   +E    ++           D ++++ 
Subjt:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI-HPL--------------FIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDL

Query:  QDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLM
        +DW IIP EN++A        + A   +  EA++  E LE G  GV+L  ++ E + +L    +  N+    L++  AT+T++  +G GDRVC+D CSLM
Subjt:  QDWQIIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLM

Query:  RPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQT
        + GEG+L+GSY+R LFL+HSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG++V++VD++G  R AIVGRVKIE R L+L++A+   D   
Subjt:  RPGEGLLVGSYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQT

Query:  PYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
            +LQNAET+ LV     N + + I V  LK GD+V ++ +  ARH G+ I+E I+EK
Subjt:  PYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Arabidopsis top hits (e value, % identity, alignment)
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink). | e value: 4.4e-139 | % identity: 69.03
Query:  SKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAE
        +K VWIW+ C++VMT AVERGW+TFIFS  N +L++EWSSIAL+  LFI+E  V+DG   ++ASV EVS P++L  L       + +V+D  DW+ IPAE
Subjt:  SKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAE

Query:  NIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVG
        N+VAA QGS+KTVFA+S TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ L+LT+ATIT++ +VGMGDRVCVDLCSLMRPGEGLLVG
Subjt:  NIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVG

Query:  SYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSVLLQN
        S+ARGLFL+HSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSEL+ G EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YS++LQN
Subjt:  SYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSVLLQN

Query:  AETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        AETVALV P Q N+  + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  AETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). | e value: 4.4e-139 | % identity: 69.03
Query:  SKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAE
        +K VWIW+ C++VMT AVERGW+TFIFS  N +L++EWSSIAL+  LFI+E  V+DG   ++ASV EVS P++L  L       + +V+D  DW+ IPAE
Subjt:  SKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAE

Query:  NIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVG
        N+VAA QGS+KTVFA+S TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ L+LT+ATIT++ +VGMGDRVCVDLCSLMRPGEGLLVG
Subjt:  NIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVG

Query:  SYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSVLLQN
        S+ARGLFL+HSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSEL+ G EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YS++LQN
Subjt:  SYARGLFLIHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSVLLQN

Query:  AETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        AETVALV P Q N+  + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  AETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE


Sequences
CDS sequence
ATGTCTCCGATTGAGGCGTCGAAGGGAGTGTGGATTTGGAGTGAGTGTCAGCAGGTAATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAA
TACGGAGCTTGCTCATGAATGGTCTTCAATTGCACTTATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTAGATGGAGAAGATAGACTAATTGCCTCAGTTGTTGAGG
TCTCTAACCCCCAGCAGTTGGAGCAGCTTCAACCAGCACGTGCATCTGCAGACATTGTTGTTGTTGATTTACAAGATTGGCAGATAATACCTGCAGAGAATATTGTCGCA
GCGTTTCAGGGAAGTCAGAAAACAGTGTTTGCCATCTCGAAAACTCCTATTGAGGCTCAAATCTTTCTTGAGGCGCTTGAACATGGTCTGGGCGGAGTTATTTTGAAAGT
TGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGACTATTTTGACAGAAGAAACGAAGCTAGTAATCTTTTGAATTTGACTAAGGCTACCATTACTCAAATTCATGTGGTTG
GAATGGGAGATCGAGTTTGTGTCGACCTCTGTAGTCTCATGAGACCCGGCGAAGGGCTTCTTGTCGGTTCATATGCGAGAGGACTGTTCCTTATTCATTCGGAATGCTTA
GAATCAAATTACATCGCTAGCCGACCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTATGTAGCTGTTCCGGGAGGGAAAACTAGTTACCTTTCCGAGTTACAAGCAGG
CAACGAGGTAATTGTAGTTGATCAGGAAGGCAGACAGCGAACTGCTATTGTTGGACGTGTAAAGATTGAGACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAG
ATGAGCAAACTCCTTACAGCGTCCTTCTGCAGAATGCGGAAACAGTTGCCTTGGTGTGCCCTGGTCAAGGAAATAATGAGAAGAAAGCCATACCTGTTACCTCACTTAAA
GTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATAA
mRNA sequence
CTGTTTACTAAGACTTTGGTCGCCCATTCTTCTGTCATCAAGTCCACAACTTCCTTGACCCTTTGAAAATGAGCCATCGTTGTTGAGTCCCCTTTCTTTACGACTTTGTC
GAGTCCCCGACTTTGGTGTTTTGGTGTAAACAGTCGACCCAGTCACTGCGACCCCTTTTGTGGATACCCCTTTCTCAGTTATCACTGCCTTGTTCTAAACGACTTGTCGT
TTTTTTTGGTGATATAACGGCCCACTGCTTTTTTGGTCACAGGACCATTGGGCTTTCAAAGGGACGAGCGAAATAGAGTTGCTAAGACGAAAACGATGGCCATGGCCTTC
CTCCGTTCCTCCTCGCCCGTTTCTCCTTTTCTTTCCAAACAGCGCATCACCTACCTGAAAACACCAGGTTCTCCCGTCTCTTCTCTTTAACAATTCAAGATTTTGAATCC
CATTTCTCCTCTTTTCCATTGATTTAACAATGTTATGCTATTGATTTACCTGCGTGCCACTTAATCTTTGGTGGGGTTCTCAGAGAATTTATACCTTCGGCCCCTAGTTT
CGAGGGATTTTGGCGAAGCCTATGCTGGAGAATGTAAGTCTTCGGATGTGAGTCGTTTACAGTGTTCTTACACTTCCTCCTCCTCTCCAATGTCTCCGATTGAGGCGTCG
AAGGGAGTGTGGATTTGGAGTGAGTGTCAGCAGGTAATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTCATGAATG
GTCTTCAATTGCACTTATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTAGATGGAGAAGATAGACTAATTGCCTCAGTTGTTGAGGTCTCTAACCCCCAGCAGTTGG
AGCAGCTTCAACCAGCACGTGCATCTGCAGACATTGTTGTTGTTGATTTACAAGATTGGCAGATAATACCTGCAGAGAATATTGTCGCAGCGTTTCAGGGAAGTCAGAAA
ACAGTGTTTGCCATCTCGAAAACTCCTATTGAGGCTCAAATCTTTCTTGAGGCGCTTGAACATGGTCTGGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTT
TCAGCTAAAGGACTATTTTGACAGAAGAAACGAAGCTAGTAATCTTTTGAATTTGACTAAGGCTACCATTACTCAAATTCATGTGGTTGGAATGGGAGATCGAGTTTGTG
TCGACCTCTGTAGTCTCATGAGACCCGGCGAAGGGCTTCTTGTCGGTTCATATGCGAGAGGACTGTTCCTTATTCATTCGGAATGCTTAGAATCAAATTACATCGCTAGC
CGACCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTATGTAGCTGTTCCGGGAGGGAAAACTAGTTACCTTTCCGAGTTACAAGCAGGCAACGAGGTAATTGTAGTTGA
TCAGGAAGGCAGACAGCGAACTGCTATTGTTGGACGTGTAAAGATTGAGACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCG
TCCTTCTGCAGAATGCGGAAACAGTTGCCTTGGTGTGCCCTGGTCAAGGAAATAATGAGAAGAAAGCCATACCTGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTG
AGATTGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATAATGGTTGATCACCTTTTACTACTTTAATATATTTGTATATTTTATCTT
TTCTAAAATACTAATTTAAAAATTTTAGGTATTGAATTACTTTGCAAAGCATACATAACTTTAATATTCTTTCTCTCAAAGCTTATAAACTTCAATATTATCTCAAATGT
TTTATAATTCAT
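The mRNA sequence above contains the CDS sequence flanked by untranslated regions. As a rough sketch, assuming the CDS occurs as one contiguous substring of the mRNA (that is, the mRNA is the spliced transcript), the lengths of the 5' and 3' UTRs can be found by locating the CDS within it. The function name `utr_lengths` is hypothetical, and the short sequences in the example are toy data, not taken from this record.

```python
from typing import Tuple


def utr_lengths(mrna: str, cds: str) -> Tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) for a CDS embedded in an mRNA.

    Whitespace and line breaks are removed first, so sequences can be
    pasted in exactly as they are wrapped on the page.
    """
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    five_prime = start
    three_prime = len(mrna) - start - len(cds)
    return five_prime, three_prime


# Toy example: 2 nt upstream and 2 nt downstream of a 9 nt CDS.
print(utr_lengths("GGATGAAATAACC", "ATGAAATAA"))  # (2, 2)
```

To apply this to the record, paste the full mRNA and CDS sequences as the two arguments. The wrapped lines can be kept as they are because whitespace is discarded.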
Protein sequence
MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQIIPAENIVA
AFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLIHSECL
ESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLK
VGDEVFLRLQGEARHTGIEIQEFIVEK
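The CDS and protein sequences listed above should be mutually consistent: the CDS is expected to start with ATG, end with a stop codon, and have one more codon than the protein has residues. The following is a minimal sketch of that check, assuming the standard nuclear genetic code for stop codons. The name `check_cds` is hypothetical, and the example uses toy sequences rather than this record's data.

```python
from typing import List

# Assumption: standard genetic code stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds: str, protein: str) -> List[str]:
    """Return a list of consistency problems (empty if none are found)."""
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")
    # One codon per residue, plus the terminal stop codon.
    expected_residues = len(cds) // 3 - 1
    if expected_residues != len(protein):
        problems.append(
            f"CDS implies {expected_residues} residues, protein has {len(protein)}"
        )
    return problems


print(check_cds("ATGAAATAA", "MK"))  # [] for this toy pair
```

This check only compares lengths and boundary codons. It does not translate the CDS, so it would not detect an internal substitution.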