CuGenDBv2

HG10019694 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10019694
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: 3-dehydroquinate synthase homolog
Genome location: Chr04:24471653..24475400
RNA-Seq Expression: HG10019694
Synteny: HG10019694
Gene Ontology terms:
GO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR002812 - 3-dehydroquinate synthase
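
The genome location in this record follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated in this record), the location string can be parsed and the span measured as below. The names `parse_location` and `span_length` are illustrative helpers, not part of any CuGenDB interface.

```python
import re

# Pattern for strings such as "Chr04:24471653..24475400".
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return match["chrom"], start, end


def span_length(text):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(text)
    return end - start + 1


print(parse_location("Chr04:24471653..24475400"))
print(span_length("Chr04:24471653..24475400"))  # 3748 under the inclusive assumption
```

Under that assumption the gene region spans 3748 bp, which is longer than the CDS listed for this gene; the difference would be consistent with introns or untranslated regions, although the record itself does not break the region down.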


Homology
GenBank top hits (e value, %identity, alignment)
XP_004147467.1 uncharacterized protein LOC101203995 [Cucumis sativus] (e value: 1.4e-215; %identity: 92.2)
Query:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MAMA L S SPVSPFLSKQRI+Y KTPENL LRPL+SRDFGE YA ECKSSDVSRLQ SY SSSS MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
Subjt:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHNTELA EWSSIALI PLFIKENGV DGE RLIA+VVEVSNP+QLEQLQPA ASADIV+VDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIFLE
Subjt:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATITQI V GMGDRVCVDL SLMRPGEGLLVGSYARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AG EV VVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEK+AIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_008443422.1 PREDICTED: 3-dehydroquinate synthase homolog [Cucumis melo] (e value: 9.0e-210; %identity: 89.6)
Query:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MA L S SPVSP LSKQRI+Y KTPENL LRPLISR+FG+ YA ECKSSD+SRLQ SY SSSS MSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHNTELA EW+SIA+I PLFIKE+GV DGE RLIA+VVE+SNP+QLEQLQPA ASADIV+VDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQI V GMGDRVCVDL SLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEV VVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEK+AI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_022970870.1 uncharacterized protein LOC111469713 [Cucurbita maxima] (e value: 4.8e-203; %identity: 88.6)
Query:  AMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSP
        AMALL S+S VSPF  KQRI  H+ P+NLKLR LISR FG     ECKS +++RL  S ASSSSSMSPIEASKGVWIWS  +QVMTAAVERGWSTFIFSP
Subjt:  AMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSP

Query:  HNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEA
        HN ELADEWSSIALI PLFI E+GVFDGEGRLIA V+EVSNP+QLEQLQP+NAS D V+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEA
Subjt:  HNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEA

Query:  LEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAG
        LEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIT I VAGMGDRVCVDL SLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAG
Subjt:  LEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAG

Query:  PVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVF
        PVHAYVAVPG KTSYLSELRAGKEV VVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEK+AIPVTSLKVGDEVF
Subjt:  PVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVF

Query:  LRLQGEARHTGIEIQEFIVEK
        LRLQGEARHTGIEIQEFIVEK
Subjt:  LRLQGEARHTGIEIQEFIVEK

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo] (e value: 4.8e-203; %identity: 87.91)
Query:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MA   L S+SPVS    KQRI  HK P+NLKLR LISR FG     ECKS +++RL  S  SSSSSMSPIEASKGVWIWSE +QVMTAAVERGWSTFIFS
Subjt:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHN ELADEWSSIALIRPLFI E+GVFD EGRLIA V+EVSNP+QLEQLQP+NAS D V+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLE
Subjt:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIT I VAGMGDRVCVDL SLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEV
        GPVHAYVAVPGGKTSYLSELRAGKEV VVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEK+AIPVTSLKVGDEV
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEV

Query:  FLRLQGEARHTGIEIQEFIVEK
        FLRLQGEARHTGIEIQEFIVEK
Subjt:  FLRLQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida] (e value: 2.5e-220; %identity: 94.33)
Query:  MAMALLSSYSPVSPFLSKQRIS-YHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIF
        M MALL S SPVSPFLSKQRIS YHKTPENL LRPLISRDFGE YA ECKSS+VSRLQ SYAS  S+MSP EASKGVWIWSECQQVMTAAVERGWSTFIF
Subjt:  MAMALLSSYSPVSPFLSKQRIS-YHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIF

Query:  SPHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL
        SPHNTELADEWSSIALI PLFIKENGVFDGEGRLIA+VVEVSNP+QLEQLQPANASADIV+VDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIFL
Subjt:  SPHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFL

Query:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
        EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQI VAGMGDRVCVDL SLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
Subjt:  EALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN

Query:  AGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDE
        AGPVHAYVAVPGGKTSYLSEL AGKEV VVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEK++IPVTSLKVGDE
Subjt:  AGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hits (e value, %identity, alignment)
A0A0A0LHS3 Uncharacterized protein (e value: 3.0e-187; %identity: 94.12)
Query:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDW
        MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELA EWSSIALI PLFIKENGV DGE RLIA+VVEVSNP+QLEQLQPA ASADIV+VDLQDW
Subjt:  MSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDW

Query:  QIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPG
        QIIPAENIVAAFQGSQKTVFA+SKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLL+LTKATITQI V GMGDRVCVDL SLMRPG
Subjt:  QIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPG

Query:  EGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSEL+AG EV VVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  ILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +LLQNAETVALVCPG+G NEK+AIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  ILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A0A1S3B8Q7 3-dehydroquinate synthase homolog (e value: 4.3e-210; %identity: 89.6)
Query:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MA L S SPVSP LSKQRI+Y KTPENL LRPLISR+FG+ YA ECKSSD+SRLQ SY SSSS MSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHNTELA EW+SIA+I PLFIKE+GV DGE RLIA+VVE+SNP+QLEQLQPA ASADIV+VDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQI V GMGDRVCVDL SLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEV VVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEK+AI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein (e value: 4.3e-210; %identity: 89.6)
Query:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        M MA L S SPVSP LSKQRI+Y KTPENL LRPLISR+FG+ YA ECKSSD+SRLQ SY SSSS MSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHNTELA EW+SIA+I PLFIKE+GV DGE RLIA+VVE+SNP+QLEQLQPA ASADIV+VDLQDWQIIPAENIVAAFQGSQKTVFA+SKTPIEAQIF E
Subjt:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATITQI V GMGDRVCVDL SLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AGKEV VVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+G NEK+AI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRG-NEKRAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A6J1EKW1 uncharacterized protein LOC111435491 (e value: 5.1e-203; %identity: 87.44)
Query:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS
        MA     S+SPVSP   KQRI  HK P++LKLR LISR FG     ECKS +++RL  S  SSSSSMSPIEASKGVWIWSE QQVMTAAVERGWSTFIFS
Subjt:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE
        PHN ELADEWSSIALIRPLFI E+GVFD EGRLIA V+EVSNP+QLEQLQP+NAS D V+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLE
Subjt:  PHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIT I VAGMGDRVCVDL SLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEV
        GPVHAYVAVPGGKTSYLSELRAGKEV VVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEK+AIPVTSLKVGDEV
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEV

Query:  FLRLQGEARHTGIEIQEFIVEK
        FLRLQGEARHTGIEIQEFIVEK
Subjt:  FLRLQGEARHTGIEIQEFIVEK

A0A6J1I437 uncharacterized protein LOC111469713 (e value: 2.3e-203; %identity: 88.6)
Query:  AMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSP
        AMALL S+S VSPF  KQRI  H+ P+NLKLR LISR FG     ECKS +++RL  S ASSSSSMSPIEASKGVWIWS  +QVMTAAVERGWSTFIFSP
Subjt:  AMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSP

Query:  HNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEA
        HN ELADEWSSIALI PLFI E+GVFDGEGRLIA V+EVSNP+QLEQLQP+NAS D V+VDLQDWQIIPAENIVAAFQGS+KTVFAVSKTPIEAQIFLEA
Subjt:  HNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEA

Query:  LEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAG
        LEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIT I VAGMGDRVCVDL SLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAG
Subjt:  LEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAG

Query:  PVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVF
        PVHAYVAVPG KTSYLSELRAGKEV VVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALVCPGRGNEK+AIPVTSLKVGDEVF
Subjt:  PVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVF

Query:  LRLQGEARHTGIEIQEFIVEK
        LRLQGEARHTGIEIQEFIVEK
Subjt:  LRLQGEARHTGIEIQEFIVEK

SwissProt top hits (e value, %identity, alignment)
A0B6K6 3-dehydroquinate synthase (e value: 3.7e-65; %identity: 41.39)
Query:  WSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALI--------RPLFI------KENGVFDGE--GRLIAAVVEVSNPEQLEQLQPANASADIVLVD
        W + + ++T A+E G+   + S  + EL  E  SI +           L I      +EN +   E  GR I   VE+ + E            D +LV 
Subjt:  WSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALI--------RPLFI------KENGVFDGE--GRLIAAVVEVSNPEQLEQLQPANASADIVLVD

Query:  LQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSL
          DW++IP EN++AA QG    + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + + L  AT+  ++  GMGDRVCVD  SL
Subjt:  LQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSL

Query:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ
        MR GEG+LVGS +R  FLV SE  ES Y+A+RPFRVNAG VHAY+ V G KT YLSEL++G EVT+VD++G  R+A+VGRVKIE R +ILV+A+ D +  
Subjt:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQ

Query:  TPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
           S LLQNAET+ LV     ++   I V  LK GD+V + ++  ARH G+ I+E I+E+
Subjt:  TPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A5UJ82 3-dehydroquinate synthase (e value: 2.5e-61; %identity: 44.9)
Query:  EGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRR
        EG+ + A VE+++    +      + AD +++   DW +IP ENI+A  Q +   + A       A++ +E LEHG  GVI +  D     Q+K      
Subjt:  EGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRR

Query:  NEASNL-LSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTV
         +AS +   L  AT+T ++  G GDRVCVD + +M+PGEG+L+GSY++ LFLVHSE LES Y+ASRPFRVNAGPV AYV VPG KT YLSEL AG EV +
Subjt:  NEASNL-LSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTV

Query:  VDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        V+ EG  RTA VGR KIE R LIL++A+    E      LLQNAET+ +V      +   + V  +K+GD+V + ++  ARH GI I E I+E+
Subjt:  VDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

O26680 3-dehydroquinate synthase (e value: 9.1e-64; %identity: 48.46)
Query:  GRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFD-RR
        GR +AA VE+ +    E  +      D +++  +DW+IIP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD      
Subjt:  GRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFD-RR

Query:  NEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVV
        N  S    L  ATIT+I   G GDRVCVD  S+M  GEG+LVGSY++GLFLVHSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL  G EV +V
Subjt:  NEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVV

Query:  DQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        D++GR R+AIVGRVKIE R L+LV+A+    E      LLQNAET+ LV     ++   + V+ L  GD V +     ARH G+ I+E I+EK
Subjt:  DQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q2NI00 3-dehydroquinate synthase (e value: 1.1e-61; %identity: 44.86)
Query:  GRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRN
        G+ +AA VE++N +    +      AD V++  ++W++IP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D   + +L    ++ +
Subjt:  GRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRN

Query:  EASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVVD
        + S    L  AT+T++   G+GDRVCVD  S+M  G+G+LVGS+A GLFLVHSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL AG EV  ++
Subjt:  EASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVVD

Query:  QEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
         +G   T IVGRVKIE R L+L++AK    + +    L+QNAET+ LV     ++   I V+ LKVGD+V       ARH G+ I+E I+EK
Subjt:  QEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q58646 3-dehydroquinate synthase (e value: 4.2e-61; %identity: 39.17)
Query:  WSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIRP------LFIKENGVFD--------GEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQ
        W E ++++T A+E      +  P + E   E  +I +         + + +N   +        G+   I   +E    E+           D ++++ +
Subjt:  WSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIRP------LFIKENGVFD--------GEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQ

Query:  DWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLM
        DW IIP EN++A  F    K V +V+    EA++  E LE G  GV+L  ++ E + +L    +  N+    ++L  AT+T++   G GDRVC+D  SLM
Subjt:  DWQIIPAENIVA-AFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLM

Query:  RPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQT
        + GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG +V +VD++G  R AIVGRVKIE R L+L++A+   D   
Subjt:  RPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQT

Query:  PYSILLQNAETVALVCPGRGNEK-RAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
            +LQNAET+ LV     NEK   I V  LK GD+V ++ +  ARH G+ I+E I+EK
Subjt:  PYSILLQNAETVALVCPGRGNEK-RAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Arabidopsis top hits (e value, %identity, alignment)
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink). (e value: 5.7e-138; %identity: 61.5)
Query:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKG--VWIWSECQQVMTAAVERGWSTFI
        MA+ L+SS S +    ++   S+    E L+L  L+     +    + + S  +  QR     S+S  P+   K   VWIW+ C++VMT AVERGW+TFI
Subjt:  MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKG--VWIWSECQQVMTAAVERGWSTFI

Query:  FSPHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIF
        FS  N +L++EWSSIAL+  LFI+E  V DG G ++A+V EVS PE+L  L   N   + +++D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++F
Subjt:  FSPHNTELADEWSSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIF

Query:  LEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV
        LEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ LSLT+ATIT++++ GMGDRVCVDL SLMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRV
Subjt:  LEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRV

Query:  NAGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KRAIPVTSLK
        NAGPVHAYVAVPGGKT YLSELR G+EV VVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YSI+LQNAETVALV P + N   + A+PVTSLK
Subjt:  NAGPVHAYVAVPGGKTSYLSELRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KRAIPVTSLK

Query:  VGDEVFLRLQGEARHTGIEIQEFIVE
         GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  VGDEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). (e value: 1.5e-138; %identity: 63.7)
Query:  SYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKG--VWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIRPL
        SY  T E L+L  L+     +    + + S  +  QR     S+S  P+   K   VWIW+ C++VMT AVERGW+TFIFS  N +L++EWSSIAL+  L
Subjt:  SYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKG--VWIWSECQQVMTAAVERGWSTFIFSPHNTELADEWSSIALIRPL

Query:  FIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAV
        FI+E  V DG G ++A+V EVS PE+L  L   N   + +++D  DW+ IPAEN+VAA QGS+KTVFAVS TP EA++FLEALEHGLGG+ILK ED +AV
Subjt:  FIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAV

Query:  FQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSE
          LK+YFD+RNE S+ LSLT+ATIT++++ GMGDRVCVDL SLMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSE
Subjt:  FQLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSE

Query:  LRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KRAIPVTSLKVGDEVFLRLQGEARHTGIEIQ
        LR G+EV VVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YSI+LQNAETVALV P + N   + A+PVTSLK GD+V +RLQG ARHTGIEIQ
Subjt:  LRAGKEVTVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSILLQNAETVALVCPGRGNE--KRAIPVTSLKVGDEVFLRLQGEARHTGIEIQ

Query:  EFIVE
        EFIVE
Subjt:  EFIVE


Sequences
CDS sequence
ATGGCCATGGCCTTGCTCTCTTCCTATTCGCCTGTTTCTCCATTTCTTTCCAAACAGCGCATCAGCTACCACAAAACACCAGAGAATTTGAAACTTCGGCCCCTAATTTC
GAGGGATTTTGGTGAATTCTATGCTGTTGAATGTAAATCTTCGGATGTGAGTCGTTTACAGCGTTCTTACGCTTCCTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGA
AGGGGGTATGGATTTGGAGTGAGTGTCAGCAGGTTATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTGATGAATGG
TCTTCAATTGCACTAATACGTCCACTTTTTATTAAAGAGAATGGAGTTTTTGATGGAGAGGGTAGACTAATTGCCGCAGTTGTTGAGGTCTCTAACCCCGAGCAGTTGGA
GCAGCTTCAACCAGCAAATGCATCTGCAGACATTGTTCTTGTTGATTTACAAGACTGGCAGATAATACCTGCAGAGAATATTGTTGCAGCGTTTCAGGGGAGTCAGAAAA
CAGTGTTTGCCGTCTCAAAAACTCCCATCGAAGCTCAAATCTTCCTTGAGGCGCTTGAACATGGTCTCGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTT
CAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTCCTGAGCTTGACTAAGGCTACTATTACTCAAATTCGTGTTGCTGGAATGGGAGATCGAGTTTGTGT
CGATCTCTCCAGTCTCATGAGACCCGGCGAAGGGCTTCTTGTCGGGTCCTATGCCAGAGGACTGTTCCTTGTTCACTCGGAATGCTTAGAATCAAATTACATTGCTAGCC
GGCCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTACGTAGCTGTTCCGGGAGGGAAAACTAGCTACCTTTCCGAGTTACGAGCAGGCAAAGAGGTAACTGTAGTTGAT
CAAGAAGGCAGACAGCGAACTGCTATTGTTGGACGTGTAAAGATCGAGACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTATAGCAT
CCTTCTACAGAATGCAGAAACGGTTGCCTTAGTCTGCCCTGGTCGAGGAAATGAGAAGAGAGCCATCCCTGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGAT
TGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATGA
mRNA sequence
ATGGCCATGGCCTTGCTCTCTTCCTATTCGCCTGTTTCTCCATTTCTTTCCAAACAGCGCATCAGCTACCACAAAACACCAGAGAATTTGAAACTTCGGCCCCTAATTTC
GAGGGATTTTGGTGAATTCTATGCTGTTGAATGTAAATCTTCGGATGTGAGTCGTTTACAGCGTTCTTACGCTTCCTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGA
AGGGGGTATGGATTTGGAGTGAGTGTCAGCAGGTTATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAATACGGAGCTTGCTGATGAATGG
TCTTCAATTGCACTAATACGTCCACTTTTTATTAAAGAGAATGGAGTTTTTGATGGAGAGGGTAGACTAATTGCCGCAGTTGTTGAGGTCTCTAACCCCGAGCAGTTGGA
GCAGCTTCAACCAGCAAATGCATCTGCAGACATTGTTCTTGTTGATTTACAAGACTGGCAGATAATACCTGCAGAGAATATTGTTGCAGCGTTTCAGGGGAGTCAGAAAA
CAGTGTTTGCCGTCTCAAAAACTCCCATCGAAGCTCAAATCTTCCTTGAGGCGCTTGAACATGGTCTCGGCGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTT
CAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTCCTGAGCTTGACTAAGGCTACTATTACTCAAATTCGTGTTGCTGGAATGGGAGATCGAGTTTGTGT
CGATCTCTCCAGTCTCATGAGACCCGGCGAAGGGCTTCTTGTCGGGTCCTATGCCAGAGGACTGTTCCTTGTTCACTCGGAATGCTTAGAATCAAATTACATTGCTAGCC
GGCCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTACGTAGCTGTTCCGGGAGGGAAAACTAGCTACCTTTCCGAGTTACGAGCAGGCAAAGAGGTAACTGTAGTTGAT
CAAGAAGGCAGACAGCGAACTGCTATTGTTGGACGTGTAAAGATCGAGACTAGGCAGCTGATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTATAGCAT
CCTTCTACAGAATGCAGAAACGGTTGCCTTAGTCTGCCCTGGTCGAGGAAATGAGAAGAGAGCCATCCCTGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGAT
TGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATTGTGGAGAAATGA
Protein sequence
MAMALLSSYSPVSPFLSKQRISYHKTPENLKLRPLISRDFGEFYAVECKSSDVSRLQRSYASSSSSMSPIEASKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELADEW
SSIALIRPLFIKENGVFDGEGRLIAAVVEVSNPEQLEQLQPANASADIVLVDLQDWQIIPAENIVAAFQGSQKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVF
QLKDYFDRRNEASNLLSLTKATITQIRVAGMGDRVCVDLSSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVTVVD
QEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSILLQNAETVALVCPGRGNEKRAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
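
As a quick consistency check between the CDS and protein sequences listed in this record, the sketch below translates the first 30 nucleotides and the last 12 nucleotides of the CDS using the standard genetic code (an assumption; the record does not name a translation table). The helper `translate` and the constant names are illustrative only.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 12 nt of the CDS sequence shown in this record.
CDS_START = "ATGGCCATGGCCTTGCTCTCTTCCTATTCG"
CDS_END = "GTGGAGAAATGA"

print(translate(CDS_START))  # MAMALLSSYS
print(translate(CDS_END))    # VEK*
```

Both fragments agree with the protein sequence in this record, which begins MAMALLSSYS and ends VEK; the final TGA codon is the stop.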