; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg22900 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg22900
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description3-dehydroquinate synthase homolog
Genome locationCarg_Chr11:5391354..5394957
RNA-Seq ExpressionCarg22900
SyntenyCarg22900
Gene Ontology termsGO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR002812 - 3-dehydroquinate synthase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588380.1 hypothetical protein SDJN03_16945, partial [Cucurbita argyrosperma subsp. sororia]1.1e-236100Show/hide
Query:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS
        MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS
Subjt:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS

Query:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
        PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
Subjt:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV
        GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV

Query:  FLRLQGEARHTGIEIQEFIVEK
        FLRLQGEARHTGIEIQEFIVEK
Subjt:  FLRLQGEARHTGIEIQEFIVEK

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata]2.6e-23398.34Show/hide
Query:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS
        MAAIASLSWSPVSPLFPKQRIITHKAPD+LKLRALISRGFGGAIGGECKSL+INRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS
Subjt:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS

Query:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
        PHNKELADEWSSIALIRPLFINEDGVFD EGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
Subjt:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATI HIHVAGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV
        GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALV PGRGNEKKAIPVTSLKVGDEV
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV

Query:  FLRLQGEARHTGIEIQEFIVEK
        FLRLQGEARHTGIEIQEFIVEK
Subjt:  FLRLQGEARHTGIEIQEFIVEK

XP_022970870.1 uncharacterized protein LOC111469713 [Cucurbita maxima]4.7e-22796.21Show/hide
Query:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS
        MAA+A LSWS VSP FPKQRII H+APDNLKLRALISRGFGGAIGGECKSL+INRLLCSC SSSSSMSPIEASKGVWIWS D+QVMTAAVERGWSTFIFS
Subjt:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS

Query:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
        PHNKELADEWSSIALI PLFINEDGVFD EGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
Subjt:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATI HIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV
        GPVHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQRT IVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALV PGRGNEKKAIPVTSLKVGDEV
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV

Query:  FLRLQGEARHTGIEIQEFIVEK
        FLRLQGEARHTGIEIQEFIVEK
Subjt:  FLRLQGEARHTGIEIQEFIVEK

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo]1.9e-23197.87Show/hide
Query:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS
        MA +A LSWSPVS LFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSL+INRLLCSCTSSSSSMSPIEASKGVWIWSE++QVMTAAVERGWSTFIFS
Subjt:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS

Query:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
        PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
Subjt:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATI HIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV
        GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALV PGRGNEKKAIPVTSLKVGDEV
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV

Query:  FLRLQGEARHTGIEIQEFIVEK
        FLRLQGEARHTGIEIQEFIVEK
Subjt:  FLRLQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida]2.8e-20388.22Show/hide
Query:  SWSPVSPLFPKQRI-ITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKEL
        S SPVSP   KQRI   HK P+NL LR LISR FG A  GECKS  ++RL CS  S  S+MSP EASKGVWIWSE QQVMTAAVERGWSTFIFSPHN EL
Subjt:  SWSPVSPLFPKQRI-ITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKEL

Query:  ADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGL
        ADEWSSIALI PLFI E+GVFD EGRLIA+V+EVSNPQQLEQLQP+NAS D V+VDLQDWQIIPAENIVAAFQGS+KTVFA+SKTPIEAQIFLEALEHGL
Subjt:  ADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGL

Query:  GGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAY
        GGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATI  IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAY
Subjt:  GGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAY

Query:  VAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQG
        VAVPGGKTSYLSEL AGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YSILLQNAETVALV PGRGNEKK+IPVTSLKVGDEVFLRLQG
Subjt:  VAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQG

Query:  EARHTGIEIQEFIVEK
        EARHTGIEIQEFIVEK
Subjt:  EARHTGIEIQEFIVEK

TrEMBL top hitse value%identityAlignment
A0A1S3B8Q7 3-dehydroquinate synthase homolog6.5e-19885.82Show/hide
Query:  SWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELA
        S SPVSPL  KQRI   K P+NL LR LISR FG A  GECKS  ++RL CS TSSSS MSPIE SKGVWIWSE Q+VMTAAVERGWSTFIFSPHN ELA
Subjt:  SWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELA

Query:  DEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLG
         EW+SIA+I PLFI EDGV D E RLIA+V+E+SNPQQLEQLQP+ AS D V+VDLQDWQIIPAENIVAAFQGS+KTVFA+SKTPIEAQIF EALEHGLG
Subjt:  DEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLG

Query:  GVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYV
        GVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATI  IHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYV
Subjt:  GVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYV

Query:  AVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRG-NEKKAIPVTSLKVGDEVFLRLQG
        AVPGGKTSYLSEL+AGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALV PG+G NEKKAI VTSLKVGDEVFLRLQG
Subjt:  AVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRG-NEKKAIPVTSLKVGDEVFLRLQG

Query:  EARHTGIEIQEFIVEK
        EARHTGIEIQEFIVEK
Subjt:  EARHTGIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein6.5e-19885.82Show/hide
Query:  SWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELA
        S SPVSPL  KQRI   K P+NL LR LISR FG A  GECKS  ++RL CS TSSSS MSPIE SKGVWIWSE Q+VMTAAVERGWSTFIFSPHN ELA
Subjt:  SWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELA

Query:  DEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLG
         EW+SIA+I PLFI EDGV D E RLIA+V+E+SNPQQLEQLQP+ AS D V+VDLQDWQIIPAENIVAAFQGS+KTVFA+SKTPIEAQIF EALEHGLG
Subjt:  DEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLG

Query:  GVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYV
        GVILKV+DPEAVFQLKDYFDRRNEASNLLSLTKATI  IHV GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNAGPVHAYV
Subjt:  GVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYV

Query:  AVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRG-NEKKAIPVTSLKVGDEVFLRLQG
        AVPGGKTSYLSEL+AGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALV PG+G NEKKAI VTSLKVGDEVFLRLQG
Subjt:  AVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRG-NEKKAIPVTSLKVGDEVFLRLQG

Query:  EARHTGIEIQEFIVEK
        EARHTGIEIQEFIVEK
Subjt:  EARHTGIEIQEFIVEK

A0A6J1BVA9 uncharacterized protein LOC111005050 isoform X22.6e-19184.3Show/hide
Query:  SPVSP-LFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELAD
        SP SP L  K RI   K PD  KL ALIS  FG    G+CKS+    + CSC  +S S +P EASKGVW+WSE++QV+TAAVERGW+TF+FSPHN+ELA 
Subjt:  SPVSP-LFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELAD

Query:  EWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGG
        +WSSIA I  LFI EDG+FD EG LIATV EVSNPQQLEQLQP NAS DNV+VDLQDWQIIPAENIVAAFQGSRK VFAVSKTPIEAQIFLEALEHGLGG
Subjt:  EWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGG

Query:  VILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVA
        VILKVEDPEAVFQLKDYFDRRNEASNLLSLTKAT+  IHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVA
Subjt:  VILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVA

Query:  VPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEA
        VPGGKTSYLSELRAGKEVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSD+QT Y ILLQNAETVALV PGRGNEKKAIPVTSLKVGD+VFLRLQGEA
Subjt:  VPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEA

Query:  RHTGIEIQEFIVEK
        RHTGIEIQEFIVEK
Subjt:  RHTGIEIQEFIVEK

A0A6J1EKW1 uncharacterized protein LOC1114354911.3e-23398.34Show/hide
Query:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS
        MAAIASLSWSPVSPLFPKQRIITHKAPD+LKLRALISRGFGGAIGGECKSL+INRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS
Subjt:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS

Query:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
        PHNKELADEWSSIALIRPLFINEDGVFD EGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
Subjt:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATI HIHVAGMGDRVCVDLCSLM+PGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV
        GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQ+GRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALV PGRGNEKKAIPVTSLKVGDEV
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV

Query:  FLRLQGEARHTGIEIQEFIVEK
        FLRLQGEARHTGIEIQEFIVEK
Subjt:  FLRLQGEARHTGIEIQEFIVEK

A0A6J1I437 uncharacterized protein LOC1114697132.3e-22796.21Show/hide
Query:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS
        MAA+A LSWS VSP FPKQRII H+APDNLKLRALISRGFGGAIGGECKSL+INRLLCSC SSSSSMSPIEASKGVWIWS D+QVMTAAVERGWSTFIFS
Subjt:  MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFS

Query:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
        PHNKELADEWSSIALI PLFINEDGVFD EGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE
Subjt:  PHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATI HIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV
        GPVHAYVAVPG KTSYLSELRAGKEVIVVDQEGRQRT IVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALV PGRGNEKKAIPVTSLKVGDEV
Subjt:  GPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEV

Query:  FLRLQGEARHTGIEIQEFIVEK
        FLRLQGEARHTGIEIQEFIVEK
Subjt:  FLRLQGEARHTGIEIQEFIVEK

SwissProt top hitse value%identityAlignment
A0B6K6 3-dehydroquinate synthase2.6e-6339.72Show/hide
Query:  WSEDQQVMTAAVERGWSTFIFSPHNKELADEWSSIALI--------RPLFINEDGV--------FDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVD
        W + + ++T A+E G+   + S  + EL  E  SI +           L I +  V         +  GR I   +E+ + +           VD ++V 
Subjt:  WSEDQQVMTAAVERGWSTFIFSPHNKELADEWSSIALI--------RPLFINEDGV--------FDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVD

Query:  LQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSL
          DW++IP EN++AA QG    + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + + L  AT+  +   GMGDRVCVD CSL
Subjt:  LQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSL

Query:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQ
        MR GEG+LVGS +R  FLV SE  ES Y+A+RPFRVNAG VHAY+ V G KT YLSEL++G EV +VD++G  R+ +VGRVKIE R ++L++A+ D +  
Subjt:  MRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQ

Query:  TLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
           S LLQNAET+ LVS    ++   I V  LK GD+V + ++  ARH G+ I+E I+E+
Subjt:  TLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A4G0J1 3-dehydroquinate synthase2.1e-6046.82Show/hide
Query:  VDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRV
        VD V+++  DW IIP ENI+A   G    + +V     +A+   E LE G+ GV+L  ED   V       +R N  S  L L  AT+  I   G GDRV
Subjt:  VDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRV

Query:  CVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQA
        C+D CS+M  GEG+L+GSY+RG+FLVHSE +E+ Y+A+RPFRVNAGPVHAY+  P  KT YLS+L+AG +V+VV++ G  R +I+GRVKIE R L L++A
Subjt:  CVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQA

Query:  KRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        + + +       +LQNAET+ LV    G + K + V  LKVG +V ++    ARH G+ I+E IVEK
Subjt:  KRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

O26680 3-dehydroquinate synthase4.1e-6448.12Show/hide
Query:  GRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFD-RR
        GR +A  +E+ +    E  +     VD +I+  +DW+IIP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD      
Subjt:  GRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFD-RR

Query:  NEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVV
        N  S    L  ATI  I   G GDRVCVD CS+M  GEG+LVGSY++GLFLVHSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL  G EVI+V
Subjt:  NEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVV

Query:  DQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        D++GR R+ IVGRVKIE R L+L++A+    E      LLQNAET+ LV+    ++ + + V+ L  GD V +     ARH G+ I+E I+EK
Subjt:  DQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q2NI00 3-dehydroquinate synthase2.5e-6144.52Show/hide
Query:  GRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRN
        G+ +A  +E++N      +       D VI+  ++W++IP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D   + +L    ++ +
Subjt:  GRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRN

Query:  EASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVD
        + S    L  AT+  +   G+GDRVCVD CS+M  G+G+LVGS+A GLFLVHSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL AG EV+ ++
Subjt:  EASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVD

Query:  QEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
         +G   T IVGRVKIE R L+LI+AK    + +    L+QNAET+ LV+    ++ + I V+ LKVGD+V       ARH G+ I+E I+EK
Subjt:  QEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q58646 3-dehydroquinate synthase3.5e-6340Show/hide
Query:  WSEDQQVMTAAVERGWSTFIFSPHNKELADEWSSIALIRP------LFINEDGVFD--------AEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQ
        W E ++++T A+E      +  P + E   E  +I +         + +N++   +         +   I   IE    ++          VDN+I++ +
Subjt:  WSEDQQVMTAAVERGWSTFIFSPHNKELADEWSSIALIRP------LFINEDGVFD--------AEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQ

Query:  DWQIIPAENIVA-AFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLM
        DW IIP EN++A  F    K V +V+    EA++  E LE G  GV+L  ++ E + +L    +  N+    ++L  AT+  +   G GDRVC+D CSLM
Subjt:  DWQIIPAENIVA-AFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLM

Query:  RPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQT
        + GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG +V++VD++G  R  IVGRVKIE R LVLI+A+   D   
Subjt:  RPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQT

Query:  LYSILLQNAETVALVSPGRGNEK-KAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        +   +LQNAET+ LV     NEK + I V  LK GD+V ++ +  ARH G+ I+E I+EK
Subjt:  LYSILLQNAETVALVSPGRGNEK-KAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Arabidopsis top hitse value%identityAlignment
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink).2.0e-13865.95Show/hide
Query:  RLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNA
        R++   ++S+  M+ +  +K VWIW+  ++VMT AVERGW+TFIFS  N++L++EWSSIAL+  LFI E  V D  G ++A+V EVS P++L  L   N 
Subjt:  RLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNA

Query:  SVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDR
         ++N+++D  DW+ IPAEN+VAA QGS KTVFAVS TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ LSLT+ATI  + + GMGDR
Subjt:  SVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDR

Query:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQ
        VCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRT +VGRVKIE R L++++
Subjt:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQ

Query:  AKRDS-DEQTLYSILLQNAETVALVSPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        AK  + +E+T+YSI+LQNAETVALV+P + N   + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  AKRDS-DEQTLYSILLQNAETVALVSPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).2.0e-13865.95Show/hide
Query:  RLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNA
        R++   ++S+  M+ +  +K VWIW+  ++VMT AVERGW+TFIFS  N++L++EWSSIAL+  LFI E  V D  G ++A+V EVS P++L  L   N 
Subjt:  RLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELADEWSSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNA

Query:  SVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDR
         ++N+++D  DW+ IPAEN+VAA QGS KTVFAVS TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+ LSLT+ATI  + + GMGDR
Subjt:  SVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDR

Query:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQ
        VCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSELR G+EVIVVDQ+G+QRT +VGRVKIE R L++++
Subjt:  VCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVDQEGRQRTTIVGRVKIETRQLVLIQ

Query:  AKRDS-DEQTLYSILLQNAETVALVSPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        AK  + +E+T+YSI+LQNAETVALV+P + N   + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  AKRDS-DEQTLYSILLQNAETVALVSPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCCATCGCCTCGCTCTCTTGGTCGCCTGTTTCTCCACTGTTTCCCAAACAGCGGATCATCACCCACAAAGCACCAGATAATTTGAAACTTCGCGCTCTGATTTC
AAGGGGTTTTGGCGGAGCCATTGGTGGTGAATGTAAATCTTTGAAGATAAATCGTTTACTGTGTTCTTGCACTTCGTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGA
AGGGGGTGTGGATTTGGAGTGAGGATCAGCAGGTTATGACGGCGGCGGTTGAGAGGGGATGGAGTACCTTCATCTTCTCGCCTCATAATAAGGAGCTTGCTGATGAATGG
TCCTCAATTGCACTTATACGCCCACTTTTTATCAACGAGGACGGAGTTTTCGATGCAGAGGGTAGACTAATTGCCACAGTTATCGAGGTTTCTAACCCGCAGCAGTTGGA
GCAGCTTCAGCCATCAAATGCATCAGTAGACAATGTTATTGTGGATTTACAAGATTGGCAGATAATACCTGCGGAGAATATTGTTGCAGCGTTTCAGGGGAGTCGAAAAA
CAGTATTTGCAGTCTCGAAAACTCCTATCGAAGCTCAAATCTTCCTCGAGGCGCTTGAACACGGTCTGGGTGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTT
CAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTCTGAGCTTGACTAAAGCTACTATAGCTCATATTCATGTCGCTGGAATGGGAGATCGAGTCTGTGT
CGATCTCTGCAGTCTCATGAGACCCGGTGAAGGACTTCTAGTCGGGTCCTACGCCAGAGGACTATTTCTAGTTCACTCGGAATGCTTAGAGTCAAATTACATTGCAAGCC
GACCTTTTCGAGTCAATGCTGGACCAGTCCATGCCTATGTAGCCGTCCCAGGAGGGAAAACGAGCTACCTATCCGAGTTACGAGCAGGCAAAGAGGTAATTGTAGTCGAT
CAAGAAGGCAGGCAACGAACCACTATTGTTGGACGTGTAAAGATAGAGACTAGGCAGCTGGTACTCATCCAGGCAAAGAGAGATTCAGATGAGCAAACTCTGTACAGCAT
CCTCCTGCAGAACGCAGAAACGGTCGCCTTAGTGTCCCCCGGTCGAGGAAATGAGAAGAAAGCCATCCCTGTTACCTCACTTAAAGTTGGCGATGAAGTGTTCTTGAGAC
TGCAAGGAGAAGCAAGGCATACAGGTATTGAAATCCAAGAGTTTATTGTAGAGAAATGA
mRNA sequenceShow/hide mRNA sequence
CAGATTATAGCGGCCCACCTTCTTCATCACCGGGCCTTTGGGCTTCTCAGCGGCGAGCGAAAGAGAGTGGAAGAAAACGATGGCGGCCATCGCCTCGCTCTCTTGGTCGC
CTGTTTCTCCACTGTTTCCCAAACAGCGGATCATCACCCACAAAGCACCAGATAATTTGAAACTTCGCGCTCTGATTTCAAGGGGTTTTGGCGGAGCCATTGGTGGTGAA
TGTAAATCTTTGAAGATAAATCGTTTACTGTGTTCTTGCACTTCGTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGAAGGGGGTGTGGATTTGGAGTGAGGATCAGCA
GGTTATGACGGCGGCGGTTGAGAGGGGATGGAGTACCTTCATCTTCTCGCCTCATAATAAGGAGCTTGCTGATGAATGGTCCTCAATTGCACTTATACGCCCACTTTTTA
TCAACGAGGACGGAGTTTTCGATGCAGAGGGTAGACTAATTGCCACAGTTATCGAGGTTTCTAACCCGCAGCAGTTGGAGCAGCTTCAGCCATCAAATGCATCAGTAGAC
AATGTTATTGTGGATTTACAAGATTGGCAGATAATACCTGCGGAGAATATTGTTGCAGCGTTTCAGGGGAGTCGAAAAACAGTATTTGCAGTCTCGAAAACTCCTATCGA
AGCTCAAATCTTCCTCGAGGCGCTTGAACACGGTCTGGGTGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGACTATTTTGACAGAAGAAATG
AAGCTAGTAATCTTCTGAGCTTGACTAAAGCTACTATAGCTCATATTCATGTCGCTGGAATGGGAGATCGAGTCTGTGTCGATCTCTGCAGTCTCATGAGACCCGGTGAA
GGACTTCTAGTCGGGTCCTACGCCAGAGGACTATTTCTAGTTCACTCGGAATGCTTAGAGTCAAATTACATTGCAAGCCGACCTTTTCGAGTCAATGCTGGACCAGTCCA
TGCCTATGTAGCCGTCCCAGGAGGGAAAACGAGCTACCTATCCGAGTTACGAGCAGGCAAAGAGGTAATTGTAGTCGATCAAGAAGGCAGGCAACGAACCACTATTGTTG
GACGTGTAAAGATAGAGACTAGGCAGCTGGTACTCATCCAGGCAAAGAGAGATTCAGATGAGCAAACTCTGTACAGCATCCTCCTGCAGAACGCAGAAACGGTCGCCTTA
GTGTCCCCCGGTCGAGGAAATGAGAAGAAAGCCATCCCTGTTACCTCACTTAAAGTTGGCGATGAAGTGTTCTTGAGACTGCAAGGAGAAGCAAGGCATACAGGTATTGA
AATCCAAGAGTTTATTGTAGAGAAATGATTGTTAAGCTATTTTGAGTGAAATACTTCAAACC
Protein sequenceShow/hide protein sequence
MAAIASLSWSPVSPLFPKQRIITHKAPDNLKLRALISRGFGGAIGGECKSLKINRLLCSCTSSSSSMSPIEASKGVWIWSEDQQVMTAAVERGWSTFIFSPHNKELADEW
SSIALIRPLFINEDGVFDAEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVF
QLKDYFDRRNEASNLLSLTKATIAHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELRAGKEVIVVD
QEGRQRTTIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVSPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK