CuGenDBv2

Chy4G082460 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy4G082460
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: 3-dehydroquinate synthase homolog
Genome location: chrH04:21826649..21830226
RNA-Seq Expression: Chy4G082460
Synteny: Chy4G082460
Gene Ontology terms:
GO:0008652 - cellular amino acid biosynthetic process (biological process)
GO:0009073 - aromatic amino acid family biosynthetic process (biological process)
GO:0003856 - 3-dehydroquinate synthase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR002812 - 3-dehydroquinate synthase
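The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the string can be split into its parts and the genomic span computed. The helper name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re
from typing import Tuple

def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chrH04:21826649..21830226")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3578 bp on chrH04. If the coordinates were 0-based or half-open, the length formula would need adjusting.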


Homology
GenBank top hits (e value, % identity, alignment)
XP_004147467.1 uncharacterized protein LOC101203995 [Cucumis sativus] (e value: 1.68e-296; % identity: 98.11)
Query:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS
        MAMAFLRSSSPVSP LSKQRITYLKTPENL LRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIE SKGVWIWSECQQVMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHNTELAHEWSSIALIHPLFIKENGVLDGED LIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQ+IPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAV QLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMR GEGLLVGSYARGLFL+HSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_008443422.1 PREDICTED: 3-dehydroquinate synthase homolog [Cucumis melo] (e value: 5.75e-288; % identity: 94.8)
Query:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS
        M MAFLRSSSPVSP LSKQRITYLKTPENLNLRPL+SR+FG+AYAGECKSSD+SRLQCSYTSSSSPMSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHNTELAHEW+SIA+IHPLFIKE+GVLDGED LIASVVE+SNPQQLEQLQPARASADIVVVDLQDWQ+IPAENIVAAFQGSQKTVFAISKTPIEAQIF E
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAV QLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMR GEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELQAG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_022928646.1 uncharacterized protein LOC111435491 [Cucurbita moschata] (e value: 1.58e-248; % identity: 84.4)
Query:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS
        MA     S SPVSP   KQRI   K P++L LR L+SR FG A  GECKS +++RL CS TSSSS MSPIE SKGVWIWSE QQVMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV D E  LIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDWQ+IPAENIVAAFQGS+KTVFA+SKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAV QLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLM+ GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AG EVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALVCPG+GN EKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_023529491.1 uncharacterized protein LOC111792332 [Cucurbita pepo subsp. pepo] (e value: 5.51e-249; % identity: 84.87)
Query:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS
        MA   L S SPVS    KQRI   K P+NL LR L+SR FG A  GECKS +++RL CS TSSSS MSPIE SKGVWIWSE +QVMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV D E  LIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDWQ+IPAENIVAAFQGS+KTVFA+SKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAV QLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLMR GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AG EVIVVDQEGRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALVCPG+GN EKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

XP_038903473.1 3-dehydroquinate synthase homolog [Benincasa hispida] (e value: 7.31e-275; % identity: 92.45)
Query:  MAMAFLRSSSPVSPSLSKQRITYL-KTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIF
        M MA L SSSPVSP LSKQRI+Y  KTPENL LRPL+SRDFGEAYAGECKSS+VSRLQCSY S  S MSP E SKGVWIWSECQQVMTAAVERGWSTFIF
Subjt:  MAMAFLRSSSPVSPSLSKQRITYL-KTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIF

Query:  SPHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFL
        SPHNTELA EWSSIALIHPLFIKENGV DGE  LIASVVEVSNPQQLEQLQPA ASADIVVVDLQDWQ+IPAENIVAAFQGSQKTVFAISKTPIEAQIFL
Subjt:  SPHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFL

Query:  EALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
        EALEHGLGGVILKVEDPEAV QLKDYFDRRNEASNLL+LTKATITQIHV GMGDRVCVDLCSLMR GEGLLVGSYARGLFLVHSECLESNYIASRPFRVN
Subjt:  EALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVN

Query:  AGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGD
        AGPVHAYVAVPGGKTSYLSEL AG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS+LLQNAETVALVCPG+GN EKK+IPVTSLKVGD
Subjt:  AGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGD

Query:  EVFLRLQGEARHTGIEIQEFIVEK
        EVFLRLQGEARHTGIEIQEFIVEK
Subjt:  EVFLRLQGEARHTGIEIQEFIVEK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LHS3 Uncharacterized protein (e value: 1.6e-196; % identity: 98.32)
Query:  MSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
        MSPIE SKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGED LIASVVEVSNPQQLEQLQPARASADIVVVDLQDW
Subjt:  MSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDW

Query:  QVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLG
        Q+IPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAV QLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMR G
Subjt:  QVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLG

Query:  EGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
        EGLLVGSYARGLFL+HSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS
Subjt:  EGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYS

Query:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Subjt:  VLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A0A1S3B8Q7 3-dehydroquinate synthase homolog (e value: 4.3e-226; % identity: 94.8)
Query:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS
        M MAFLRSSSPVSP LSKQRITYLKTPENLNLRPL+SR+FG+AYAGECKSSD+SRLQCSYTSSSSPMSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHNTELAHEW+SIA+IHPLFIKE+GVLDGED LIASVVE+SNPQQLEQLQPARASADIVVVDLQDWQ+IPAENIVAAFQGSQKTVFAISKTPIEAQIF E
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAV QLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMR GEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELQAG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A5A7UEW0 3-dehydroquinate synthase-like protein (e value: 4.3e-226; % identity: 94.8)
Query:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS
        M MAFLRSSSPVSP LSKQRITYLKTPENLNLRPL+SR+FG+AYAGECKSSD+SRLQCSYTSSSSPMSPIE SKGVWIWSECQ+VMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHNTELAHEW+SIA+IHPLFIKE+GVLDGED LIASVVE+SNPQQLEQLQPARASADIVVVDLQDWQ+IPAENIVAAFQGSQKTVFAISKTPIEAQIF E
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKV+DPEAV QLKDYFDRRNEASNLL+LTKATITQIHVVGMGDRVCVDLCSLMR GEGLLVGS+ARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSELQAG EVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAI VTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A6J1EKW1 uncharacterized protein LOC111435491 (e value: 4.7e-196; % identity: 84.4)
Query:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS
        MA     S SPVSP   KQRI   K P++L LR L+SR FG A  GECKS +++RL CS TSSSS MSPIE SKGVWIWSE QQVMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV D E  LIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDWQ+IPAENIVAAFQGS+KTVFA+SKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAV QLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLM+ GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPGGKTSYLSEL+AG EVIVVDQ+GRQRT IVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALVCPG+G NEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

A0A6J1I437 uncharacterized protein LOC111469713 (e value: 5.2e-195; % identity: 84.4)
Query:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS
        MA   L S S VSP   KQRI   + P+NL LR L+SR FG A  GECKS +++RL CS  SSSS MSPIE SKGVWIWS  +QVMTAAVERGWSTFIFS
Subjt:  MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFS

Query:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE
        PHN ELA EWSSIALI PLFI E+GV DGE  LIA+V+EVSNPQQLEQLQP+ AS D V+VDLQDWQ+IPAENIVAAFQGS+KTVFA+SKTPIEAQIFLE
Subjt:  PHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLE

Query:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
        ALEHGLGGVILKVEDPEAV QLKDYFDRRNEASNLL+LTKATIT IHV GMGDRVCVDLCSLMR GEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA
Subjt:  ALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNA

Query:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE
        GPVHAYVAVPG KTSYLSEL+AG EVIVVDQEGRQRTAIVGRVKIETRQL+L+QAKRDSDEQT YS+LLQNAETVALVCPG+G NEKKAIPVTSLKVGDE
Subjt:  GPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDE

Query:  VFLRLQGEARHTGIEIQEFIVEK
        VFLRLQGEARHTGIEIQEFIVEK
Subjt:  VFLRLQGEARHTGIEIQEFIVEK

SwissProt top hits (e value, % identity, alignment)
A0B6K6 3-dehydroquinate synthase (e value: 1.8e-64; % identity: 40.77)
Query:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLD--------GEDSLIASVVEVSNP--------QQLEQLQPARAS--ADIVV
        W + + ++T A+E G+   + S  + EL  E  SI +    F +E G  D          ++ I SV ++  P         + ++L         D ++
Subjt:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLD--------GEDSLIASVVEVSNP--------QQLEQLQPARAS--ADIVV

Query:  VDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLC
        V   DW+VIP EN++AA QG    + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + ++L  AT+  +  VGMGDRVCVD C
Subjt:  VDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLC

Query:  SLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSD
        SLMR GEG+LVGS +R  FLV SE  ES Y+A+RPFRVNAG VHAY+ V G KT YLSEL++G+EV +VD++G  R+A+VGRVKIE R +ILV+A+ D +
Subjt:  SLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSD

Query:  EQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
             S LLQNAET+ LV     +++   I V  LK GD+V + ++  ARH G+ I+E I+E+
Subjt:  EQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

A4G0J1 3-dehydroquinate synthase (e value: 1.5e-61; % identity: 46.82)
Query:  DIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVC
        D VV++  DW +IP ENI+A   G +  + ++     +A+   E LE G+ GV+L  ED   V       +R N  S  L L  AT+T+I  VG GDRVC
Subjt:  DIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVC

Query:  VDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAK
        +D CS+M +GEG+L+GSY+RG+FLVHSE +E+ Y+A+RPFRVNAGPVHAY+  P  KT YLS+L+AG++V+VV++ G  R +I+GRVKIE R L LV+A+
Subjt:  VDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAK

Query:  RDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
         + +       +LQNAET+ LV       + K + V  LKVG +V ++    ARH G+ I+E IVEK
Subjt:  RDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

O26680 3-dehydroquinate synthase (e value: 7.5e-66; % identity: 48.45)
Query:  IASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFD-RRNEA
        +A+ VE+ +    E  +      D +++  +DW++IP ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD      N  
Subjt:  IASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFD-RRNEA

Query:  SNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQE
        S    L  ATIT+I  +G GDRVCVD CS+M +GEG+LVGSY++GLFLVHSE LES Y+ASRPFRVNAGPV AYV VPGG+T YLSEL+ G+EVI+VD++
Subjt:  SNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQE

Query:  GRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
        GR R+AIVGRVKIE R L+LV+A+    E      LLQNAET+ LV     N++ + + V+ L  GD V +     ARH G+ I+E I+EK
Subjt:  GRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q2NI00 3-dehydroquinate synthase (e value: 5.4e-64; % identity: 45.86)
Query:  IASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEAS
        +A+ VE++N      +      AD V++  ++W+VIP ENI+A+ Q     +        EA++ LE +EHG  GV+L   D   + +L    ++ ++ S
Subjt:  IASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEAS

Query:  NLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEG
           +L  AT+T++  VG+GDRVCVD CS+M +G+G+LVGS+A GLFLVHSE LES Y+ASRPFRVNAGPVHAYV  P  KT YLSEL+AG+EV+ ++ +G
Subjt:  NLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEG

Query:  RQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
           T IVGRVKIE R L+L++AK    + +    L+QNAET+ LV     N++ + I V+ LKVGD+V       ARH G+ I+E I+EK
Subjt:  RQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Q58646 3-dehydroquinate synthase (e value: 1.4e-64; % identity: 38.89)
Query:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI-HPL--------------FIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDL
        W E ++++T A+E      +  P + E   E  +I +  H L              F+KE   L G+++ I   +E    ++           D ++++ 
Subjt:  WSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALI-HPL--------------FIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDL

Query:  QDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLM
        +DW +IP EN++A        + A   +  EA++  E LE G  GV+L  ++ E + +L    +  N+    L++  AT+T++  +G GDRVC+D CSLM
Subjt:  QDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLM

Query:  RLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQT
        ++GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPVHAY+  PG KT YLSEL+AG++V++VD++G  R AIVGRVKIE R L+L++A+   D   
Subjt:  RLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQT

Query:  PYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
            +LQNAET+ LV     N + + I V  LK GD+V ++ +  ARH G+ I+E I+EK
Subjt:  PYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK

Arabidopsis top hits (e value, % identity, alignment)
AT3G28760.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink). (e value: 4.4e-138; % identity: 66.76)
Query:  RLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARA
        R+    ++S+ PM+ +  +K VWIW+ C++VMT AVERGW+TFIFS  N +L++EWSSIAL+  LFI+E  V+DG  +++ASV EVS P++L  L     
Subjt:  RLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARA

Query:  SADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR
          + +V+D  DW+ IPAEN+VAA QGS+KTVFA+S TP EA++FLEALEHGLGG+ILK ED +AV+ LK+YFD+RNE S+ L+LT+ATIT++ +VGMGDR
Subjt:  SADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDR

Query:  VCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ
        VCVDLCSLMR GEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSEL+ G EVIVVDQ+G+QRTA+VGRVKIE R LI+V+
Subjt:  VCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQ

Query:  AKRDS-DEQTPYSVLLQNAETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE
        AK  + +E+T YS++LQNAETVALV P Q N+  + A+PVTSLK GD+V +RLQG ARHTGIEIQEFIVE
Subjt:  AKRDS-DEQTPYSVLLQNAETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVE

AT3G28760.2 CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). (e value: 2.6e-138; % identity: 63.03)
Query:  TYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFI
        +Y+ T E L L  L+     +      K +   R+    ++S+ PM+ +  +K VWIW+ C++VMT AVERGW+TFIFS  N +L++EWSSIAL+  LFI
Subjt:  TYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFSPHNTELAHEWSSIALIHPLFI

Query:  KENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQ
        +E  V+DG  +++ASV EVS P++L  L       + +V+D  DW+ IPAEN+VAA QGS+KTVFA+S TP EA++FLEALEHGLGG+ILK ED +AV+ 
Subjt:  KENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVILKVEDPEAVVQ

Query:  LKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQ
        LK+YFD+RNE S+ L+LT+ATIT++ +VGMGDRVCVDLCSLMR GEGLLVGS+ARGLFLVHSECLESNYI SRPFRVNAGPVHAYVAVPGGKT YLSEL+
Subjt:  LKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTSYLSELQ

Query:  AGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSVLLQNAETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEF
         G EVIVVDQ+G+QRTA+VGRVKIE R LI+V+AK  + +E+T YS++LQNAETVALV P Q N+  + A+PVTSLK GD+V +RLQG ARHTGIEIQEF
Subjt:  AGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDS-DEQTPYSVLLQNAETVALVCPGQGNNE-KKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEF

Query:  IVE
        IVE
Subjt:  IVE
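
The alignments above use the usual BLAST-style layout: the unlabeled row between Query and Subjt repeats a letter where the two residues are identical, shows `+` for a conservative substitution, and leaves a space otherwise. In this export the Query and Subjt rows carry the same text, so the sketch below reads the match row instead. It is a minimal illustration, not the database's own calculation; the helper name `midline_stats` is hypothetical, and the input is only the first 20 columns of the Cucumis sativus hit.

```python
def midline_stats(midline: str) -> tuple:
    """Count identities and positives in a BLAST-style match row.

    Letters mark identical residues, '+' marks a conservative
    substitution, and a space marks a mismatch or gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# First 20 columns of the match row from the Cucumis sativus hit.
midline = "MAMAFLRSSSPVSP LSKQR"
identities, positives, columns = midline_stats(midline)
print(f"{identities}/{columns} identical ({100 * identities / columns:.1f}%)")
```

The reported % identity values cover each whole alignment, so a per-segment figure like this one will generally differ from them. Trailing spaces in a match row may also be lost in a text export, which would shorten the column count.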


Sequences
CDS sequence
ATGGCCATGGCCTTCCTCCGTTCCTCCTCGCCCGTTTCTCCTTCTCTTTCCAAACAGCGCATCACCTACCTGAAAACACCAGAGAATTTAAACCTTCGGCCCCTA
GTTTCAAGGGATTTTGGCGAAGCCTATGCTGGTGAATGTAAGTCTTCGGATGTGAGTCGTTTGCAGTGTTCTTACACTTCCTCCTCCTCTCCAATGTCTCCGATT
GAGGGGTCGAAGGGGGTGTGGATTTGGAGTGAGTGTCAGCAGGTAATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAATACGGAG
CTTGCTCATGAATGGTCTTCAATTGCACTTATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTAGATGGAGAAGATAGTCTAATTGCCTCAGTTGTTGAGGTC
TCTAACCCCCAGCAGTTGGAGCAGCTTCAACCAGCACGTGCATCTGCAGACATTGTTGTTGTTGATTTACAAGATTGGCAGGTAATACCTGCAGAGAATATTGTC
GCAGCGTTTCAGGGGAGTCAGAAAACAGTGTTTGCCATCTCAAAAACACCTATTGAGGCTCAAATCTTTCTTGAGGCGCTTGAACATGGTCTGGGCGGAGTTATT
TTGAAAGTTGAAGATCCTGAAGCTGTTGTTCAGCTAAAGGACTATTTTGACAGAAGAAACGAAGCTAGTAATCTTTTGAATTTGACTAAGGCTACCATTACTCAA
ATTCATGTGGTTGGAATGGGAGATCGAGTTTGTGTCGACCTTTGTAGTCTCATGAGACTCGGCGAAGGGCTTCTTGTCGGTTCGTATGCGAGAGGACTGTTCCTT
GTTCATTCAGAATGCTTAGAATCGAATTACATCGCTAGCCGACCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTATGTAGCTGTTCCGGGAGGGAAAACTAGT
TACCTTTCCGAGTTACAAGCAGGCAACGAGGTAATTGTAGTTGATCAGGAAGGCCGACAGCGAACCGCTATTGTTGGACGTGTAAAGATTGAGACTAGGCAGCTG
ATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGTGTCCTTCTGCAGAATGCGGAAACAGTTGCCTTAGTGTGCCCTGGTCAAGGAAATAAT
GAGAAGAAAGCCATACCTGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATT
GTGGAGAAATAA
mRNA sequence
ATGGCCATGGCCTTCCTCCGTTCCTCCTCGCCCGTTTCTCCTTCTCTTTCCAAACAGCGCATCACCTACCTGAAAACACCAGAGAATTTAAACCTTCGGCCCCTA
GTTTCAAGGGATTTTGGCGAAGCCTATGCTGGTGAATGTAAGTCTTCGGATGTGAGTCGTTTGCAGTGTTCTTACACTTCCTCCTCCTCTCCAATGTCTCCGATT
GAGGGGTCGAAGGGGGTGTGGATTTGGAGTGAGTGTCAGCAGGTAATGACGGCTGCGGTTGAGAGGGGATGGAGCACCTTCATCTTCTCGCCTCATAATACGGAG
CTTGCTCATGAATGGTCTTCAATTGCACTTATACATCCTCTTTTTATTAAAGAGAATGGAGTTTTAGATGGAGAAGATAGTCTAATTGCCTCAGTTGTTGAGGTC
TCTAACCCCCAGCAGTTGGAGCAGCTTCAACCAGCACGTGCATCTGCAGACATTGTTGTTGTTGATTTACAAGATTGGCAGGTAATACCTGCAGAGAATATTGTC
GCAGCGTTTCAGGGGAGTCAGAAAACAGTGTTTGCCATCTCAAAAACACCTATTGAGGCTCAAATCTTTCTTGAGGCGCTTGAACATGGTCTGGGCGGAGTTATT
TTGAAAGTTGAAGATCCTGAAGCTGTTGTTCAGCTAAAGGACTATTTTGACAGAAGAAACGAAGCTAGTAATCTTTTGAATTTGACTAAGGCTACCATTACTCAA
ATTCATGTGGTTGGAATGGGAGATCGAGTTTGTGTCGACCTTTGTAGTCTCATGAGACTCGGCGAAGGGCTTCTTGTCGGTTCGTATGCGAGAGGACTGTTCCTT
GTTCATTCAGAATGCTTAGAATCGAATTACATCGCTAGCCGACCTTTTCGTGTCAATGCTGGACCTGTCCATGCCTATGTAGCTGTTCCGGGAGGGAAAACTAGT
TACCTTTCCGAGTTACAAGCAGGCAACGAGGTAATTGTAGTTGATCAGGAAGGCCGACAGCGAACCGCTATTGTTGGACGTGTAAAGATTGAGACTAGGCAGCTG
ATCCTTGTCCAGGCAAAGAGAGATTCAGATGAGCAAACTCCTTACAGTGTCCTTCTGCAGAATGCGGAAACAGTTGCCTTAGTGTGCCCTGGTCAAGGAAATAAT
GAGAAGAAAGCCATACCTGTTACCTCACTTAAAGTTGGTGATGAAGTGTTCTTGAGATTGCAAGGAGAAGCAAGACATACAGGAATTGAAATCCAAGAGTTTATT
GTGGAGAAATAA
Protein sequence
MAMAFLRSSSPVSPSLSKQRITYLKTPENLNLRPLVSRDFGEAYAGECKSSDVSRLQCSYTSSSSPMSPIEGSKGVWIWSECQQVMTAAVERGWSTFIFSPHNTE
LAHEWSSIALIHPLFIKENGVLDGEDSLIASVVEVSNPQQLEQLQPARASADIVVVDLQDWQVIPAENIVAAFQGSQKTVFAISKTPIEAQIFLEALEHGLGGVI
LKVEDPEAVVQLKDYFDRRNEASNLLNLTKATITQIHVVGMGDRVCVDLCSLMRLGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGGKTS
YLSELQAGNEVIVVDQEGRQRTAIVGRVKIETRQLILVQAKRDSDEQTPYSVLLQNAETVALVCPGQGNNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFI
VEK
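
The CDS and protein sequences above can be checked against each other by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative only, and just the first 24 and last 12 nucleotides of the CDS are used as input.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGCCATGGCCTTCCTCCGTTCC"))  # first 8 codons of the CDS
print(translate("GTGGAGAAATAA"))              # last line of the CDS
```

The two calls print `MAMAFLRS` and `VEK`, which match the start and end of the protein sequence listed above. Codons containing ambiguous bases such as `N` are not handled and would raise a `KeyError`.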