CuGenDBv2

Tan0017019 (gene) of Snake gourd v1 genome

Gene ID: Tan0017019
Organism: Trichosanthes anguina (Snake gourd v1)
Description: 3-dehydroquinate synthase homolog
Genome location: LG03:39849723..39861207
RNA-Seq Expression: Tan0017019
Synteny: Tan0017019
Gene Ontology terms:
GO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR002812 - 3-dehydroquinate synthase
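The genome location above follows a `SEQID:START..END` pattern. As a minimal sketch of how such a string might be split for downstream use, the snippet below parses it and computes the span. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG03:39849723..39861207")

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus spans 11485 bp, which is much longer than the mRNA listed further down and so would be consistent with the gene containing introns.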


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588380.1 hypothetical protein SDJN03_16945, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.3e-207, % identity: 88.81)
Query:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH
        A+  LS SPVSP   KQ+  THK PDNLKLRALISRGF  A  GE KSL+ NR  CS   SSSSMSPIEASKGVWIWSE QQVMTAAVERGWSTFIFSPH
Subjt:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALIRPLF+ EDGVFD EGRLIATV+EVSNPQQLEQLQP+NAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKATI  IHVAGMGDRVCVDLCSLMR GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPGGKTSYLSELRAGKEV+VVDQEGRQR  IVGRVKIETRQLVL+ AKRDSDEQT YSILLQNAETVALV PGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata] (e value: 1.9e-207, % identity: 88.81)
Query:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH
        A+  LS SPVSP   KQ+  THK PD+LKLRALISRGF  A  GE KSLE NR  CS   SSSSMSPIEASKGVWIWSE QQVMTAAVERGWSTFIFSPH
Subjt:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALIRPLF+ EDGVFD EGRLIATV+EVSNPQQLEQLQP+NAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKATIT IHVAGMGDRVCVDLCSLM+ GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPGGKTSYLSELRAGKEV+VVDQ+GRQR  IVGRVKIETRQLVL+ AKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

XP_022970870.1 uncharacterized protein LOC111469713 [Cucurbita maxima] (e value: 3.2e-207, % identity: 89.05)
Query:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH
        AM LLS S VSPF  KQ+   H+ PDNLKLRALISRGF  A  GE KSLE NR  CS   SSSSMSPIEASKGVWIWS  +QVMTAAVERGWSTFIFSPH
Subjt:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALI PLF+ EDGVFDGEGRLIATV+EVSNPQQLEQLQP+NAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKATIT IHVAGMGDRVCVDLCSLMR GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPG KTSYLSELRAGKEV+VVDQEGRQR AIVGRVKIETRQLVL+ AKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo] (e value: 2.9e-208, % identity: 89.5)
Query:  MTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPHN
        M LLS SPVS    KQ+  THK PDNLKLRALISRGF  A  GE KSLE NR  CS   SSSSMSPIEASKGVWIWSE++QVMTAAVERGWSTFIFSPHN
Subjt:  MTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPHN

Query:  RELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE
        +ELADEWSSIALIRPLF+ EDGVFD EGRLIATV+EVSNPQQLEQLQP+NAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEALE
Subjt:  RELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE

Query:  HGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
        HGLGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKATIT IHVAGMGDRVCVDLCSLMR GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
Subjt:  HGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV

Query:  HAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLR
        HAYVAVPGGKTSYLSELRAGKEV+VVDQEGRQR  IVGRVKIETRQLVL+ AKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLR
Subjt:  HAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLR

Query:  LQGEARHTGIEIQEFIVEK
        LQGEARHTGIEIQEFIVEK
Subjt:  LQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida] (e value: 1.4e-207, % identity: 89.13)
Query:  MAMTLL-SSSPVSPFLAKQQFN-THKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIF
        M M LL SSSPVSPFL+KQ+ +  HKTP+NL LR LISR F EA AGE KS   +R QCSY    S+MSP EASKGVWIWSE QQVMTAAVERGWSTFIF
Subjt:  MAMTLL-SSSPVSPFLAKQQFN-THKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIF

Query:  SPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL
        SPHN ELADEWSSIALI PLF+KE+GVFDGEGRLIA+VVEVSNPQQLEQLQPANASAD VVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIFL
Subjt:  SPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL

Query:  EALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
        EALEHGLGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKATITQIHVAGMGDRVCVDLCSLMR GEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
Subjt:  EALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN

Query:  AGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDE
        AGPVHAYVAVPGGKTSYLSEL AGKEV+VVDQEGRQR AIVGRVKIETRQL+LV AKRDSDEQTPYSILLQNAETVALVCPGRGNEKK+IPVTSLKVGDE
Subjt:  AGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B8Q7 3-dehydroquinate synthase homolog (e value: 4.1e-200, % identity: 86.36)
Query:  LLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPHNRE
        L SSSPVSP L+KQ+    KTP+NL LR LISR F +A AGE KS + +R QCSY  SSS MSPIE SKGVWIWSE Q+VMTAAVERGWSTFIFSPHN E
Subjt:  LLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPHNRE

Query:  LADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHG
        LA EW+SIA+I PLF+KEDGV DGE RLIA+VVE+SNPQQLEQLQPA ASAD VVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF EALEHG
Subjt:  LADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHG

Query:  LGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA
        LGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKATITQIHV GMGDRVCVDLCSLMR GEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPVHA
Subjt:  LGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA

Query:  YVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDEVFLRL
        YVAVPGGKTSYLSEL+AGKEV+VVDQEGRQR AIVGRVKIETRQL+LV AKRDSDEQTPYS+LLQNAETVALVCPG+G NEKKAI VTSLKVGDEVFLRL
Subjt:  YVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDEVFLRL

Query:  QGEARHTGIEIQEFIVEK
        QGEARHTGIEIQEFIVEK
Subjt:  QGEARHTGIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein (e value: 4.1e-200, % identity: 86.36)
Query:  LLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPHNRE
        L SSSPVSP L+KQ+    KTP+NL LR LISR F +A AGE KS + +R QCSY  SSS MSPIE SKGVWIWSE Q+VMTAAVERGWSTFIFSPHN E
Subjt:  LLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPHNRE

Query:  LADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHG
        LA EW+SIA+I PLF+KEDGV DGE RLIA+VVE+SNPQQLEQLQPA ASAD VVVDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF EALEHG
Subjt:  LADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHG

Query:  LGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA
        LGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKATITQIHV GMGDRVCVDLCSLMR GEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPVHA
Subjt:  LGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHA

Query:  YVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDEVFLRL
        YVAVPGGKTSYLSEL+AGKEV+VVDQEGRQR AIVGRVKIETRQL+LV AKRDSDEQTPYS+LLQNAETVALVCPG+G NEKKAI VTSLKVGDEVFLRL
Subjt:  YVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKKAIPVTSLKVGDEVFLRL

Query:  QGEARHTGIEIQEFIVEK
        QGEARHTGIEIQEFIVEK
Subjt:  QGEARHTGIEIQEFIVEK

A0A6J1BVA9 uncharacterized protein LOC111005050 isoform X2 (e value: 6.9e-192, % identity: 83.05)
Query:  MTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPHN
        M  L +SP SP L  +  +  KTPD  KL ALIS  F + +AG+ KS+     QCS  C+S S +P EASKGVW+WSE++QV+TAAVERGW+TF+FSPHN
Subjt:  MTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPHN

Query:  RELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE
        RELA +WSSIA I  LF+KEDG+FD EG LIATV EVSNPQQLEQLQP NAS DNVVVDLQDWQIIPAENIVAAFQGS+K VFAVSKTPIEAQIFLEALE
Subjt:  RELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALE

Query:  HGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
        HGLGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKAT+TQIHVAGMGDRVCVDLCSLMR GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV
Subjt:  HGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV

Query:  HAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLR
        HAYVAVPGGKTSYLSELRAGKEV+VVDQEGRQR AIVGRVKIETRQL+LV AKRDSD+QTPY ILLQNAETVALVCPGRGNEKKAIPVTSLKVGD+VFLR
Subjt:  HAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLR

Query:  LQGEARHTGIEIQEFIVEK
        LQGEARHTGIEIQEFIVEK
Subjt:  LQGEARHTGIEIQEFIVEK

A0A6J1EKW1 uncharacterized protein LOC111435491 (e value: 9.0e-208, % identity: 88.81)
Query:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH
        A+  LS SPVSP   KQ+  THK PD+LKLRALISRGF  A  GE KSLE NR  CS   SSSSMSPIEASKGVWIWSE QQVMTAAVERGWSTFIFSPH
Subjt:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALIRPLF+ EDGVFD EGRLIATV+EVSNPQQLEQLQP+NAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKATIT IHVAGMGDRVCVDLCSLM+ GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPGGKTSYLSELRAGKEV+VVDQ+GRQR  IVGRVKIETRQLVL+ AKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

A0A6J1I437 uncharacterized protein LOC111469713 (e value: 1.5e-207, % identity: 89.05)
Query:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH
        AM LLS S VSPF  KQ+   H+ PDNLKLRALISRGF  A  GE KSLE NR  CS   SSSSMSPIEASKGVWIWS  +QVMTAAVERGWSTFIFSPH
Subjt:  AMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPH

Query:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL
        N+ELADEWSSIALI PLF+ EDGVFDGEGRLIATV+EVSNPQQLEQLQP+NAS DNV+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEAL
Subjt:  NRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEAL

Query:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
        EHGLGGVILKV DP+AVFQLKDYFDRRNEASNLL++TKATIT IHVAGMGDRVCVDLCSLMR GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP
Subjt:  EHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP

Query:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
        VHAYVAVPG KTSYLSELRAGKEV+VVDQEGRQR AIVGRVKIETRQLVL+ AKRDSDEQT YSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL
Subjt:  VHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFL

Query:  RLQGEARHTGIEIQEFIVEK
        RLQGEARHTGIEIQEFIVEK
Subjt:  RLQGEARHTGIEIQEFIVEK

SwissProt top hits (e value, % identity, alignment)
A0B6K6 3-dehydroquinate synthase (e value: 1.3e-62, % identity: 39.78)
Query:  WSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFD------------------GEGRLIATVVEVSNPQQLEQLQPANASADNVV
        W + + ++T A+E G+   + S  + EL  E  SI +    F +E G  D                    GR I   VE+ + +            D ++
Subjt:  WSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFD------------------GEGRLIATVVEVSNPQQLEQLQPANASADNVV

Query:  VDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLC
        V   DW++IP EN++AA QG    + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + +++  AT+  +   GMGDRVCVD C
Subjt:  VDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLC

Query:  SLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSD
        SLMR GEG+LVGS +R  FLV SE  ES Y+A+RPFRVNAG VHAY+ V G KT YLSEL++G EV +VD++G  R A+VGRVKIE R ++LV A+ D +
Subjt:  SLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSD

Query:  EQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
             S LLQNAET+ LV     ++   I V  LK GD+V + ++  ARH G+ I+E I+E+
Subjt:  EQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A4G0J1 3-dehydroquinate synthase (e value: 2.7e-60, % identity: 46.99)
Query:  DNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVC
        D VV++  DW IIP ENI+A   G +  + +V     +A+   E LE G+ GV+L   D + V       +R N  S  L +  AT+T+I   G GDRVC
Subjt:  DNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVC

Query:  VDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAK
        +D CS+M +GEG+L+GSY+RG+FLVHSE +E+ Y+A+RPFRVNAGPVHAY+  P  KT YLS+L+AG +V+VV++ G  R +I+GRVKIE R L LV A+
Subjt:  VDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAK

Query:  RDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
         + +       +LQNAET+ LV    G + K + V  LKVG +V ++    ARH G+ I+E IVEK
Subjt:  RDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

O26680 3-dehydroquinate synthase (e value: 6.3e-65, % identity: 48.12)
Query:  GRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFD-RR
        GR +A  VE+ +    E  +      D +++  +DW+IIP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD      
Subjt:  GRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFD-RR

Query:  NEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVV
        N  S    +  ATIT+I   G GDRVCVD CS+M +GEG+LVGSY++GLFLVHSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL  G EV++V
Subjt:  NEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVV

Query:  DQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        D++GR R AIVGRVKIE R L+LV A+    E      LLQNAET+ LV     ++ + + V+ L  GD V +     ARH G+ I+E I+EK
Subjt:  DQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q2NI00 3-dehydroquinate synthase (e value: 2.6e-63, % identity: 38.59)
Query:  KGVWI-----WSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALI--------------RPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANA
        K  WI     W++ ++ +  ++E G+   I    N E   +  S+ +I                + M +       G+ +A  VE++N      +     
Subjt:  KGVWI-----WSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALI--------------RPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANA

Query:  SADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDR
         AD V++  ++W++IP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D + + +L    ++ ++ S   ++  AT+T++   G+GDR
Subjt:  SADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDR

Query:  VCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVH
        VCVD CS+M +G+G+LVGS+A GLFLVHSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL AG EVV ++ +G     IVGRVKIE R L+L+ 
Subjt:  VCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVH

Query:  AKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        AK    + +    L+QNAET+ LV     ++ + I V+ LKVGD+V       ARH G+ I+E I+EK
Subjt:  AKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q58646 3-dehydroquinate synthase (e value: 4.5e-63, % identity: 40)
Query:  WSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALIR-------PLFMKEDGVFD-------GEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQ
        W E ++++T A+E      +  P + E   E  +I +          L  K D +         G+   I   +E    ++           DN++++ +
Subjt:  WSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALIR-------PLFMKEDGVFD-------GEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQ

Query:  DWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLM
        DW IIP EN++A  F    K V +V+    EA++  E LE G  GV+L   + + + +L    +  N+    L++  AT+T++   G GDRVC+D CSLM
Subjt:  DWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLM

Query:  RLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQT
        ++GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG +V++VD++G  R AIVGRVKIE R LVL+ A+   D   
Subjt:  RLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDSDEQT

Query:  PYSILLQNAETVALVCPGRGNEK-KAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
            +LQNAET+ LV     NEK + I V  LK GD+V ++ +  ARH G+ I+E I+EK
Subjt:  PYSILLQNAETVALVCPGRGNEK-KAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Arabidopsis top hits (e value, % identity, alignment)
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink). (e value: 4.1e-136, % identity: 66.94)
Query:  SSSMSPIEASKG--VWIWSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVV
        S+S  P+   K   VWIW+  ++VMT AVERGW+TFIFS  NR+L++EWSSIAL+  LF++E  V DG G ++A+V EVS P++L  L   N   +N+V+
Subjt:  SSSMSPIEASKG--VWIWSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVV

Query:  DLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCS
        D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK  D  AV  LK+YFD+RNE S+ L++T+ATIT++ + GMGDRVCVDLCS
Subjt:  DLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCS

Query:  LMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDS-D
        LMR GEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EV+VVDQ+G+QR A+VGRVKIE R L++V AK  + +
Subjt:  LMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDS-D

Query:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        E+T YSI+LQNAETVALV P + N   + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). (e value: 4.1e-136, % identity: 66.94)
Query:  SSSMSPIEASKG--VWIWSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVV
        S+S  P+   K   VWIW+  ++VMT AVERGW+TFIFS  NR+L++EWSSIAL+  LF++E  V DG G ++A+V EVS P++L  L   N   +N+V+
Subjt:  SSSMSPIEASKG--VWIWSEHQQVMTAAVERGWSTFIFSPHNRELADEWSSIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVV

Query:  DLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCS
        D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK  D  AV  LK+YFD+RNE S+ L++T+ATIT++ + GMGDRVCVDLCS
Subjt:  DLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQLKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCS

Query:  LMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDS-D
        LMR GEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EV+VVDQ+G+QR A+VGRVKIE R L++V AK  + +
Subjt:  LMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQEGRQRIAIVGRVKIETRQLVLVHAKRDS-D

Query:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        E+T YSI+LQNAETVALV P + N   + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  EQTPYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
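In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The following is a minimal sketch of how identities could be tallied for a single block from that middle line. The function name `midline_stats` is hypothetical, and the counts are per block only; the % identity values reported for each hit cover the full alignment and need not match a single block.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Letters in the midline count as identities, '+' as conservative
    substitutions; positives = identities + conservative substitutions.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative, len(query)


# Final block of the first GenBank hit: every column is identical.
full_match = midline_stats("RLQGEARHTGIEIQEFIVEK", "RLQGEARHTGIEIQEFIVEK")

# 15-column excerpt from the end of the A0B6K6 (SwissProt) alignment.
partial = midline_stats("ARHTGIEIQEFIVEK", "ARH G+ I+E I+E+")

print(full_match, partial)
```

Dividing the first value by the third gives a per-block identity fraction, for example 8/15 for the excerpt.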


Sequences
CDS sequence
ATGGCCATGACCTTGCTCTCTTCGTCGCCTGTTTCTCCATTTCTTGCCAAACAGCAATTCAACACCCACAAAACACCAGATAATTTGAAACTTCGCGCCCTAATTTCCAG
GGGTTTTGCCGAAGCCAATGCGGGTGAACGTAAATCTTTGGAGACAAATCGTTTTCAGTGTTCTTACGGTTGCTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGAAGG
GGGTGTGGATTTGGAGTGAGCATCAGCAGGTTATGACGGCTGCGGTTGAGAGGGGATGGAGTACGTTCATCTTCTCGCCTCACAATCGGGAGCTTGCTGATGAATGGTCT
TCAATTGCACTCATAAGGCCTCTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGTAGACTAATTGCAACAGTTGTTGAGGTTTCGAACCCCCAGCAGTTGGAGCA
GCTTCAACCTGCAAATGCATCCGCAGACAATGTTGTTGTTGATTTACAAGATTGGCAGATAATACCTGCAGAGAATATCGTTGCAGCATTCCAGGGGAGTCAGAAAACAG
TATTTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCACTCGAACACGGTCTGGGTGGAGTTATTTTGAAAGTTGGAGATCCTGATGCTGTTTTTCAG
CTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGCAATCTTCTGAACATGACTAAGGCTACCATAACTCAAATTCATGTTGCTGGAATGGGAGATCGAGTTTGTGTTGA
TCTCTGTAGTCTCATGAGACTCGGCGAAGGACTTCTTGTCGGGTCCTATGCCAGAGGACTGTTCCTTGTTCACTCGGAATGCTTAGAGTCAAATTACATTGCTAGCCGAC
CTTTTCGTGTCAATGCTGGACCTGTACATGCCTACGTGGCTGTCCCGGGAGGGAAAACTAGCTACCTTTCTGAGTTACGGGCAGGCAAAGAGGTAGTTGTAGTTGATCAA
GAAGGTAGGCAACGAATCGCTATTGTTGGACGTGTAAAGATAGAGACTAGGCAGCTAGTCCTCGTCCACGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCATCCT
TCTGCAGAACGCCGAAACGGTTGCCTTAGTCTGCCCCGGTCGAGGAAATGAGAAGAAAGCCATCCCCGTTACCTCACTTAAAGTTGGTGATGAAGTGTTTTTGAGATTGC
AAGGAGAAGCAAGGCATACAGGCATTGAAATCCAAGAGTTTATTGTGGAGAAATGA
mRNA sequence
ATGGCCATGACCTTGCTCTCTTCGTCGCCTGTTTCTCCATTTCTTGCCAAACAGCAATTCAACACCCACAAAACACCAGATAATTTGAAACTTCGCGCCCTAATTTCCAG
GGGTTTTGCCGAAGCCAATGCGGGTGAACGTAAATCTTTGGAGACAAATCGTTTTCAGTGTTCTTACGGTTGCTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGAAGG
GGGTGTGGATTTGGAGTGAGCATCAGCAGGTTATGACGGCTGCGGTTGAGAGGGGATGGAGTACGTTCATCTTCTCGCCTCACAATCGGGAGCTTGCTGATGAATGGTCT
TCAATTGCACTCATAAGGCCTCTTTTTATGAAAGAGGATGGAGTTTTTGATGGAGAGGGTAGACTAATTGCAACAGTTGTTGAGGTTTCGAACCCCCAGCAGTTGGAGCA
GCTTCAACCTGCAAATGCATCCGCAGACAATGTTGTTGTTGATTTACAAGATTGGCAGATAATACCTGCAGAGAATATCGTTGCAGCATTCCAGGGGAGTCAGAAAACAG
TATTTGCTGTCTCGAAAACTCCTATTGAAGCTCAAATCTTCCTTGAGGCACTCGAACACGGTCTGGGTGGAGTTATTTTGAAAGTTGGAGATCCTGATGCTGTTTTTCAG
CTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGCAATCTTCTGAACATGACTAAGGCTACCATAACTCAAATTCATGTTGCTGGAATGGGAGATCGAGTTTGTGTTGA
TCTCTGTAGTCTCATGAGACTCGGCGAAGGACTTCTTGTCGGGTCCTATGCCAGAGGACTGTTCCTTGTTCACTCGGAATGCTTAGAGTCAAATTACATTGCTAGCCGAC
CTTTTCGTGTCAATGCTGGACCTGTACATGCCTACGTGGCTGTCCCGGGAGGGAAAACTAGCTACCTTTCTGAGTTACGGGCAGGCAAAGAGGTAGTTGTAGTTGATCAA
GAAGGTAGGCAACGAATCGCTATTGTTGGACGTGTAAAGATAGAGACTAGGCAGCTAGTCCTCGTCCACGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGCATCCT
TCTGCAGAACGCCGAAACGGTTGCCTTAGTCTGCCCCGGTCGAGGAAATGAGAAGAAAGCCATCCCCGTTACCTCACTTAAAGTTGGTGATGAAGTGTTTTTGAGATTGC
AAGGAGAAGCAAGGCATACAGGCATTGAAATCCAAGAGTTTATTGTGGAGAAATGAAATGAGGGTTGATCAACTATTACTATTTGAATATATTGTATATTTCATCTTTTT
TAAAAAACTGTCTTTCATAAAAATTAGGCCTAGCATGAGTTTCTTTTTTTCATATTACAAGTTTGTTAATTTTATGGGATAGAAAAATAGTGGGTACAGTATAAATTGAG
GCAAATTTCTCTGTAGAAAAAAGAAGG
Protein sequence
MAMTLLSSSPVSPFLAKQQFNTHKTPDNLKLRALISRGFAEANAGERKSLETNRFQCSYGCSSSSMSPIEASKGVWIWSEHQQVMTAAVERGWSTFIFSPHNRELADEWS
SIALIRPLFMKEDGVFDGEGRLIATVVEVSNPQQLEQLQPANASADNVVVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVGDPDAVFQ
LKDYFDRRNEASNLLNMTKATITQIHVAGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVVVVDQ
EGRQRIAIVGRVKIETRQLVLVHAKRDSDEQTPYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
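The protein sequence corresponds to the CDS read codon by codon. As an illustrative sketch (not the database's own pipeline), the snippet below translates DNA with the standard genetic code and checks two short pieces copied from the CDS above. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes the standard nuclear code with the reading frame starting at the first base.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the CDS sequence listed above.
start_peptide = translate("ATGGCCATGACCTTGCTCTCTTCGTCGCCT")
end_peptide = translate("GAGAAATGA")
print(start_peptide, end_peptide)
```

The two results match the beginning (`MAMTLLSSSP`) and the end (`EK`) of the protein sequence. Passing the full CDS, with its lines joined, should reproduce the whole protein under the same assumptions.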