; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0014985 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0014985
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description3-dehydroquinate synthase homolog
Genome locationchr04:8800632..8804594
RNA-Seq ExpressionPI0014985
SyntenyPI0014985
Gene Ontology termsGO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR002812 - 3-dehydroquinate synthase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147467.1 uncharacterized protein LOC101203995 [Cucumis sativus]4.3e-22895.74Show/hide
Query:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS
        MAMAFL SSSPV+PFLSKQRITYLKTPE L LRPL+SRDFGEAYAGECKSSDVSRLQCSY SSSS MSPIEASKGVWIWSECQQVMTAAVERGW+TFIFS
Subjt:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPA ASA+IVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AG EVIVVDQEGRQR+AIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+GNNEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_008443422.1 PREDICTED: 3-dehydroquinate synthase homolog [Cucumis melo]5.4e-22393.38Show/hide
Query:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS
        M MAFL SSSPV+P LSKQRITYLKTPE LNLRPLISR+FG+AYAGECKSSD+SRLQCSY SSSS MSPIE SKGVWIWSECQ+VMTAAVERGW+TFIFS
Subjt:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHNTELAHEW+SIA+IHPLFIKE+GVLDGEDRLIASVVE+SNPQQLEQLQPA ASA+IVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQR+AIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+GNNEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata]7.1e-19986.05Show/hide
Query:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS
        MA     S SPV+P   KQRI   K P+ L LR LISR FG A  GECKS +++RL CS  SSSSSMSPIEASKGVWIWSE QQVMTAAVERGW+TFIFS
Subjt:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV D E RLIA+V+EVSNPQQLEQLQP++AS + V+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIT IHV GMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQR+ IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPG G NEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_022970870.1 uncharacterized protein LOC111469713 [Cucurbita maxima]5.5e-19986.52Show/hide
Query:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS
        MA   L S S V+PF  KQRI   + P+ L LR LISR FG A  GECKS +++RL CS ASSSSSMSPIEASKGVWIWS  +QVMTAAVERGW+TFIFS
Subjt:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV DGE RLIA+V+EVSNPQQLEQLQP++AS + V+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIT IHV GMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQR+AIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPG G NEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida]2.5e-22093.87Show/hide
Query:  MAMAFLCSSSPVAPFLSKQRITYL-KTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIF
        M MA LCSSSPV+PFLSKQRI+Y  KTPE L LRPLISRDFGEAYAGECKSS+VSRLQCSYAS  S+MSP EASKGVWIWSECQQVMTAAVERGW+TFIF
Subjt:  MAMAFLCSSSPVAPFLSKQRITYL-KTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIF

Query:  SPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL
        SPHNTELA EWSSIALIHPLFIKENGV DGE RLIASVVEVSNPQQLEQLQPA+ASA+IVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIFL
Subjt:  SPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL

Query:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
        EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHV GMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
Subjt:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN

Query:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGD
        AGPVHAYVAVPGGKTSYLSEL AGKEVIVVDQEGRQR+AIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPG G NEKK+IPVTSLKVGD
Subjt:  AGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGD

Query:  EVFLRLQGEARHTGIEIQEFIVEK
        EVFLRLQGEARHTGIEIQEFIVEK
Subjt:  EVFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hitse value%identityAlignment
A0A0A0LHS3 Uncharacterized protein2.3e-19596.92Show/hide
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDW
        MSPIEASKGVWIWSECQQVMTAAVERGW+TFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPA ASA+IVVVDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPG
        QIIPAENIVAAFQGSQKTVFA+SKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPG

Query:  EGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AG EVIVVDQEGRQR+AIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  ILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +LLQNAETVALVCPG+GNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  ILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A0A1S3B8Q7 3-dehydroquinate synthase homolog2.6e-22393.38Show/hide
Query:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS
        M MAFL SSSPV+P LSKQRITYLKTPE LNLRPLISR+FG+AYAGECKSSD+SRLQCSY SSSS MSPIE SKGVWIWSECQ+VMTAAVERGW+TFIFS
Subjt:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHNTELAHEW+SIA+IHPLFIKE+GVLDGEDRLIASVVE+SNPQQLEQLQPA ASA+IVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQR+AIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+GNNEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein2.6e-22393.38Show/hide
Query:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS
        M MAFL SSSPV+P LSKQRITYLKTPE LNLRPLISR+FG+AYAGECKSSD+SRLQCSY SSSS MSPIE SKGVWIWSECQ+VMTAAVERGW+TFIFS
Subjt:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHNTELAHEW+SIA+IHPLFIKE+GVLDGEDRLIASVVE+SNPQQLEQLQPA ASA+IVVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEVIVVDQEGRQR+AIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+GNNEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A6J1EKW1 uncharacterized protein LOC1114354913.5e-19986.05Show/hide
Query:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS
        MA     S SPV+P   KQRI   K P+ L LR LISR FG A  GECKS +++RL CS  SSSSSMSPIEASKGVWIWSE QQVMTAAVERGW+TFIFS
Subjt:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV D E RLIA+V+EVSNPQQLEQLQP++AS + V+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIT IHV GMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQR+ IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPG G NEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A6J1I437 uncharacterized protein LOC1114697132.6e-19986.52Show/hide
Query:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS
        MA   L S S V+PF  KQRI   + P+ L LR LISR FG A  GECKS +++RL CS ASSSSSMSPIEASKGVWIWS  +QVMTAAVERGW+TFIFS
Subjt:  MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV DGE RLIA+V+EVSNPQQLEQLQP++AS + V+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIT IHV GMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQR+AIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPG G NEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

SwissProt top hitse value%identityAlignment
A0B6K6 3-dehydroquinate synthase7.0e-6440.17Show/hide
Query:  WSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALI--------HPLFIKENGV--------LDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVD
        W + + ++T A+E G+   + S  + EL  E  SI +           L I +  V        ++   R I   VE+ + +            + ++V 
Subjt:  WSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALI--------HPLFIKENGV--------LDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVD

Query:  LQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSL
          DW++IP EN++AA QG    + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + + L  AT+  +  VGMGDRVCVD CSL
Subjt:  LQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSL

Query:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQ
        MR GEG+LVGS +R  FLV SE  ES Y+A+RPFRVNAG VHAY+ V G KT YLSEL++G EV +VD++G  RSA+VGRVKIE R +ILV+A+ D +  
Subjt:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQ

Query:  TPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
           S LLQNAET+ LV     +++   I V  LK GD+V + ++  ARH G+ I+E I+E+
Subjt:  TPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A4G0J1 3-dehydroquinate synthase9.4e-6147.55Show/hide
Query:  VVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVD
        VV++  DW IIP ENI+A   G +  + +V     +A+   E LE G+ GV+L  ED   V       +R N  S  L L  AT+T+I  VG GDRVC+D
Subjt:  VVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVD

Query:  LCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRD
         CS+M  GEG+L+GSY+RG+FLVHSE +E+ Y+A+RPFRVNAGPVHAY+  P  KT YLS+L+AG +V+VV++ G  R +I+GRVKIE R L LV+A+ +
Subjt:  LCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRD

Query:  SDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
         +       +LQNAET+ LV       + K + V  LKVG +V ++    ARH G+ I+E IVEK
Subjt:  SDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

O26680 3-dehydroquinate synthase1.3e-6548.32Show/hide
Query:  LDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYF
        L    R +A+ VE+ +    E  +      + +++  +DW+IIP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD  
Subjt:  LDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYF

Query:  D-RRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKE
            N  S    L  ATIT+I  +G GDRVCVD CS+M  GEG+LVGSY++GLFLVHSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL  G E
Subjt:  D-RRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKE

Query:  VIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VI+VD++GR RSAIVGRVKIE R L+LV+A+    E      LLQNAET+ LV     N++ + + V+ L  GD V +     ARH G+ I+E I+EK
Subjt:  VIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q2NI00 3-dehydroquinate synthase3.8e-6244.83Show/hide
Query:  IASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEAS
        +A+ VE++N      +      A+ V++  ++W++IP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D   + +L    ++ ++ S
Subjt:  IASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEAS

Query:  NLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEG
            L  AT+T++  VG+GDRVCVD CS+M  G+G+LVGS+A GLFLVHSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL AG EV+ ++ +G
Subjt:  NLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEG

Query:  RQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
           + IVGRVKIE R L+L++AK    + +    L+QNAET+ LV     N++ + I V+ LKVGD+V       ARH G+ I+E I+EK
Subjt:  RQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q58646 3-dehydroquinate synthase1.2e-6339.61Show/hide
Query:  WSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALI-HPL--------------FIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDL
        W E ++++T A+E      +  P + E   E  +I +  H L              F+KE   L G++  I   +E    ++           + ++++ 
Subjt:  WSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALI-HPL--------------FIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDL

Query:  QDWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSL
        +DW IIP EN++A  F    K V +V+    EA++  E LE G  GV+L  ++ E + +L    +  N+    ++L  AT+T++  +G GDRVC+D CSL
Subjt:  QDWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSL

Query:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQ
        M+ GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG +V++VD++G  R AIVGRVKIE R L+L++A+   D  
Subjt:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDSDEQ

Query:  TPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
             +LQNAET+ LV     N + + I V  LK GD+V ++ +  ARH G+ I+E I+EK
Subjt:  TPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Arabidopsis top hitse value%identityAlignment
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink).1.5e-13868.32Show/hide
Query:  SSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVV
        S+S+    +  +K VWIW+ C++VMT AVERGW TFIFS  N +L++EWSSIAL+  LFI+E  V+DG   ++ASV EVS P++L  L   +     +V+
Subjt:  SSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVV

Query:  DLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCS
        D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ LSLT+ATIT++ +VGMGDRVCVDLCS
Subjt:  DLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCS

Query:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDS-D
        LMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QR+A+VGRVKIE R LI+V+AK  + +
Subjt:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDS-D

Query:  EQTPYSILLQNAETVALVCPGEGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        E+T YSI+LQNAETVALV P + N+  + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  EQTPYSILLQNAETVALVCPGEGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).1.5e-13868.32Show/hide
Query:  SSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVV
        S+S+    +  +K VWIW+ C++VMT AVERGW TFIFS  N +L++EWSSIAL+  LFI+E  V+DG   ++ASV EVS P++L  L   +     +V+
Subjt:  SSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVV

Query:  DLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCS
        D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ LSLT+ATIT++ +VGMGDRVCVDLCS
Subjt:  DLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCS

Query:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDS-D
        LMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QR+A+VGRVKIE R LI+V+AK  + +
Subjt:  LMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRSAIVGRVKIETRQLILVQAKRDS-D

Query:  EQTPYSILLQNAETVALVCPGEGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        E+T YSI+LQNAETVALV P + N+  + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  EQTPYSILLQNAETVALVCPGEGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATGGCCTTCCTCTGTTCCTCCTCGCCCGTTGCTCCATTTCTTTCCAAACAGCGCATCACCTACCTGAAAACACCAGAGATTTTAAACCTTCGGCCCCTAATTTC
GAGGGATTTTGGCGAAGCCTATGCTGGTGAATGTAAGTCTTCGGATGTGAGTCGTTTACAGTGTTCTTACGCTTCCTCATCCTCTTCAATGTCTCCGATTGAGGCGTCGA
AGGGGGTATGGATTTGGAGTGAGTGTCAGCAGGTTATGACAGCTGCGGTTGAGAGGGGATGGACCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTCATGAATGG
TCTTCAATTGCACTTATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTAGATGGAGAAGATAGACTAATTGCCTCAGTTGTTGAGGTCTCTAACCCCCAGCAGTTGGA
GCAGCTTCAACCAGCAAGTGCATCTGCAAACATAGTTGTTGTTGATTTACAAGATTGGCAGATAATACCTGCAGAGAATATTGTCGCAGCGTTTCAGGGGAGTCAGAAAA
CAGTGTTTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCGCTTGAACATGGTCTGGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTT
CAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTTTGAGTTTGACTAAGGCTACCATTACTCAAATTCATGTGGTTGGAATGGGAGATCGAGTTTGTGT
CGATCTCTGTAGTCTCATGAGACCCGGCGAAGGGCTTCTTGTCGGGTCATATGCGAGAGGACTGTTCCTTGTTCACTCGGAATGCTTAGAATCAAATTACATTGCTAGCC
GACCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTATGTAGCTGTTCCGGGAGGGAAAACTAGTTACCTTTCCGAGTTACGAGCAGGGAAAGAGGTAATTGTAGTTGAT
CAGGAAGGCAGACAGCGAAGTGCTATTGTTGGACGTGTAAAGATTGAGACTAGACAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCAT
CCTTCTGCAGAATGCGGAAACGGTTGCCTTAGTCTGCCCTGGTGAAGGAAATAATGAGAAGAAAGCCATACCTGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTGA
GATTGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATAA
mRNA sequenceShow/hide mRNA sequence
CTTCTGCGTTCTCACTGCCTTGTTGTCCCAAACGACTTGTCGTTTTTTTGGTGATATAACGGCCCACCATTTTTTTGGTCACGGGACCATTGGGCTGTCAAAGGGACGAG
CGAAATAGAGTTGCTAAGACGAAAACGATGGCCATGGCCTTCCTCTGTTCCTCCTCGCCCGTTGCTCCATTTCTTTCCAAACAGCGCATCACCTACCTGAAAACACCAGA
GATTTTAAACCTTCGGCCCCTAATTTCGAGGGATTTTGGCGAAGCCTATGCTGGTGAATGTAAGTCTTCGGATGTGAGTCGTTTACAGTGTTCTTACGCTTCCTCATCCT
CTTCAATGTCTCCGATTGAGGCGTCGAAGGGGGTATGGATTTGGAGTGAGTGTCAGCAGGTTATGACAGCTGCGGTTGAGAGGGGATGGACCACCTTCATCTTCTCGCCT
CATAATACGGAGCTTGCTCATGAATGGTCTTCAATTGCACTTATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTAGATGGAGAAGATAGACTAATTGCCTCAGTTGT
TGAGGTCTCTAACCCCCAGCAGTTGGAGCAGCTTCAACCAGCAAGTGCATCTGCAAACATAGTTGTTGTTGATTTACAAGATTGGCAGATAATACCTGCAGAGAATATTG
TCGCAGCGTTTCAGGGGAGTCAGAAAACAGTGTTTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCGCTTGAACATGGTCTGGGCGGAGTTATTTTG
AAAGTTGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTTTGAGTTTGACTAAGGCTACCATTACTCAAATTCATGT
GGTTGGAATGGGAGATCGAGTTTGTGTCGATCTCTGTAGTCTCATGAGACCCGGCGAAGGGCTTCTTGTCGGGTCATATGCGAGAGGACTGTTCCTTGTTCACTCGGAAT
GCTTAGAATCAAATTACATTGCTAGCCGACCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTATGTAGCTGTTCCGGGAGGGAAAACTAGTTACCTTTCCGAGTTACGA
GCAGGGAAAGAGGTAATTGTAGTTGATCAGGAAGGCAGACAGCGAAGTGCTATTGTTGGACGTGTAAAGATTGAGACTAGACAGCTGATCCTTGTCCAGGCAAAGAGAGA
TTCAGATGAGCAAACTCCTTACAGCATCCTTCTGCAGAATGCGGAAACGGTTGCCTTAGTCTGCCCTGGTGAAGGAAATAATGAGAAGAAAGCCATACCTGTTACCTCAC
TTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATAATGGTTGATCACCTTTTACTAT
TTGAATATATTGTATATTTTATCTTTTAAAAAATACTAATTTTTAATTTTAGGTTTTGAAGTTCCTACATTTTCAGTATATGAATTTCTGCTTGTTGTTTGGTATGAAAT
ATTCATTTAAAACTGTAAATTAAACATTGGATAAAAATTATTCTCAAACATCCACTCTCTTTTTTTA
Protein sequenceShow/hide protein sequence
MAMAFLCSSSPVAPFLSKQRITYLKTPEILNLRPLISRDFGEAYAGECKSSDVSRLQCSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWTTFIFSPHNTELAHEW
SSIALIHPLFIKENGVLDGEDRLIASVVEVSNPQQLEQLQPASASANIVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVF
QLKDYFDRRNEASNLLSLTKATITQIHVVGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVD
QEGRQRSAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGEGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK