; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0016066 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0016066
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionSerine O-acetyltransferase
Genome locationchr11:30145598..30158401
RNA-Seq ExpressionPI0016066
SyntenyPI0016066
Gene Ontology termsGO:0006535 - cysteine biosynthetic process from serine (biological process)
GO:0016102 - diterpenoid biosynthetic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0010333 - terpene synthase activity (molecular function)
GO:0009001 - serine O-acetyltransferase activity (molecular function)
GO:0000287 - magnesium ion binding (molecular function)
InterPro domainsIPR042122 - Serine acetyltransferase, N-terminal domain superfamily
IPR036965 - Terpene synthase, N-terminal domain superfamily
IPR018357 - Hexapeptide transferase, conserved site
IPR011004 - Trimeric LpxA-like superfamily
IPR010493 - Serine acetyltransferase, N-terminal
IPR008949 - Isoprenoid synthase domain superfamily
IPR008930 - Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid
IPR005881 - Serine O-acetyltransferase
IPR005630 - Terpene synthase, metal-binding domain
IPR001906 - Terpene synthase, N-terminal domain
IPR001451 - Hexapeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045530.1 terpene synthase 10-like [Cucumis melo var. makuwa]1.5e-16189.06Show/hide
Query:  MCIKASI--QSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKEERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFY
        MCIKASI  QSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLK QVQTLLKEE DSLEQLELIDALQKLGISYHFESEIK++L+RI NK  
Subjt:  MCIKASI--QSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKEERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFY

Query:  KEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNAL
        KE  KKNSLYATSLEFRLLRQ Q DI E VFNAFKDEMGN KTC YEDIN MLSLYEASFLSTKGETILEEAK FA+KYLNE+IKSSKDELKVE+V++AL
Subjt:  KEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNAL

Query:  KLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVA
        KLPLHWRIE+LEARWSIDIYERIGTLNPILLE AKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGF  ELSYFRRMGTKIV 
Subjt:  KLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVA

Query:  LITMIDDVYDVYGTLNELKLFTNAIESFN
        LITMIDDVYDVYGTL+ELKLFT AI+ ++
Subjt:  LITMIDDVYDVYGTLNELKLFTNAIESFN

XP_004148472.3 terpene synthase 10 isoform X1 [Cucumis sativus]5.4e-18388.59Show/hide
Query:  LSMALLHLPLSSTFSFHGAALPSTNYQPSSTMLKCVVIERGMCIKASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKE
        L +ALLHLPLSS+FSFHGA LPSTNY+PS TMLK VVIERGMCIKASIQSGDVIVRQCANY+PPLWKDDFIQSLH++F+GE YRRRFSQLKGQVQ LLKE
Subjt:  LSMALLHLPLSSTFSFHGAALPSTNYQPSSTMLKCVVIERGMCIKASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKE

Query:  ERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFL
        ERDSLEQLELIDALQKLGISYHFESEIK +LERI NKF K+  +KNS YATSL+FRLLRQ Q DISE VFNAFKDEMGNFKTCF EDINGMLSLYEASFL
Subjt:  ERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFL

Query:  STKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTEL
        STKGET+LEEAK FA+KYLNE+IKSSKDELKVEIV++ALKLPLHWRIE+LEARWSIDIYERIGTL PILLE AKLDFNMVQSIYQEDLKYASSWWRDTEL
Subjt:  STKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTEL

Query:  GEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFN
        GEKMSFARDQLMENFYWTVGIGF  ELSYFRRMGTKIVALITMIDDVYDVYGTL+ELKLFTNAIE ++
Subjt:  GEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFN

XP_011649387.1 serine acetyltransferase 5 isoform X1 [Cucumis sativus]1.5e-15698.62Show/hide
Query:  QARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL
        + RFSSQSPTAVVDST NNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL
Subjt:  QARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL

Query:  RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT
        RAARERDPACVS+SHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT
Subjt:  RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT

Query:  GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
        GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
Subjt:  GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII

XP_031736437.1 terpene synthase 10 isoform X2 [Cucumis sativus]9.0e-16281.25Show/hide
Query:  LSMALLHLPLSSTFSFHGAALPSTNYQPSSTMLKCVVIERGMCIKASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKE
        L +ALLHLPLSS+FSFHGA LPSTNY+PS TMLK VVIERGMCIKASIQSGDVIVRQCANY+PPLWKDDFIQSLH++F+GE YRRRFSQLKGQVQ LLKE
Subjt:  LSMALLHLPLSSTFSFHGAALPSTNYQPSSTMLKCVVIERGMCIKASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKE

Query:  ERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFL
        ERDSLEQLELIDALQKLGISYHFESEIK +LERI NKF K+  +KNS YATSL+FRLLRQ Q DIS                             EASFL
Subjt:  ERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFL

Query:  STKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTEL
        STKGET+LEEAK FA+KYLNE+IKSSKDELKVEIV++ALKLPLHWRIE+LEARWSIDIYERIGTL PILLE AKLDFNMVQSIYQEDLKYASSWWRDTEL
Subjt:  STKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTEL

Query:  GEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFN
        GEKMSFARDQLMENFYWTVGIGF  ELSYFRRMGTKIVALITMIDDVYDVYGTL+ELKLFTNAIE ++
Subjt:  GEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFN

XP_038901091.1 terpene synthase 10-like isoform X1 [Benincasa hispida]3.0e-16580.05Show/hide
Query:  MALLHLPLSSTFSF-----HGAALPSTNYQPSSTMLKCVVIERGMCIKASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTL
        MA+LHL L STFS      HGA   S NYQPSS MLKCVV+ERGMC KASIQSG VI RQCANY P +WKD+FIQSLHN+F GE Y+RRF+QLKGQV+ L
Subjt:  MALLHLPLSSTFSF-----HGAALPSTNYQPSSTMLKCVVIERGMCIKASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTL

Query:  LKEERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEA
        L+E RDSLEQLELID LQ+LGISYHFESEIKD+LERI NKFYKEG KKNSLYATSLEFRLLRQ Q DISE+VFNAFKDE G+FKTCFY D NGMLSLYEA
Subjt:  LKEERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEA

Query:  SFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRD
        SFLSTKGETILE AK FAI +L EYIKS+KD+L+VEIVK+ALKLPLHWR ++LEARW I+IYER  TLNPILLE AKLDFNMVQSIYQEDLKYASSWWR+
Subjt:  SFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRD

Query:  TELGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFN
        TELG+K+SFARDQLMENF+WT+GIGF  EL YFRRMGTKIV LITMIDDVYDVYGTL+ELKLFTN IE ++
Subjt:  TELGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFN

TrEMBL top hitse value%identityAlignment
A0A0A0LM54 Serine O-acetyltransferase0.0e+0090.67Show/hide
Query:  MLSMALLHLPLSSTFSFHGAALPSTNYQPSSTMLKCVVIERGMCIKASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLK
        M+SMALLHLPLSS+FSFHGA LPSTNY+PS TMLK VVIERGMCIKASIQSGDVIVRQCANY+PPLWKDDFIQSLH++F+GE YRRRFSQLKGQVQ LLK
Subjt:  MLSMALLHLPLSSTFSFHGAALPSTNYQPSSTMLKCVVIERGMCIKASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLK

Query:  EERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASF
        EERDSLEQLELIDALQKLGISYHFESEIK +LERI NKF K+  +KNS YATSL+FRLLRQ Q DISE VFNAFKDEMGNFKTCF EDINGMLSLYEASF
Subjt:  EERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASF

Query:  LSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTE
        LSTK ET+LEEAK F +KYLNE+IKSSKDELKVEIV++ALKLPLHWRIE+LEARWSIDIYERIGTL PILLE AKLDFNMVQSIYQEDLKYASSWWRDTE
Subjt:  LSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTE

Query:  LGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFNHERRRQQQRRLQAASGGGCTAASSAIATFEV
        LGEKMSFARDQLMENFYWTVGIGF  ELSYFRRMGTKIVALITMIDDVYDVYGTL+ELKLFTNAIESFN          LQ ASGG CTAAS  IATF+V
Subjt:  LGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFNHERRRQQQRRLQAASGGGCTAASSAIATFEV

Query:  PTPLVSREATAFDDFSDLVSATQA------RFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSST
         TPL SRE TAFDDFSDLVS TQA      RFSSQSPTAVVDST NNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSST
Subjt:  PTPLVSREATAFDDFSDLVSATQA------RFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSST

Query:  LLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFD
        LLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVS+SHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFD
Subjt:  LLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFD

Query:  HATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIP
        HATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIP
Subjt:  HATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIP

Query:  GESMDHTSFISEWSDYII
        GESMDHTSFISEWSDYII
Subjt:  GESMDHTSFISEWSDYII

A0A1S3CQN4 Serine O-acetyltransferase3.6e-15698.28Show/hide
Query:  QARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL
        + RFSSQSP AVVDST NNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL
Subjt:  QARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL

Query:  RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT
        RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWN+SRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT
Subjt:  RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT

Query:  GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
        GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
Subjt:  GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII

A0A5D3BVK8 Terpene synthase 10-like7.4e-16289.06Show/hide
Query:  MCIKASI--QSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKEERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFY
        MCIKASI  QSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLK QVQTLLKEE DSLEQLELIDALQKLGISYHFESEIK++L+RI NK  
Subjt:  MCIKASI--QSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKEERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFY

Query:  KEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNAL
        KE  KKNSLYATSLEFRLLRQ Q DI E VFNAFKDEMGN KTC YEDIN MLSLYEASFLSTKGETILEEAK FA+KYLNE+IKSSKDELKVE+V++AL
Subjt:  KEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNAL

Query:  KLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVA
        KLPLHWRIE+LEARWSIDIYERIGTLNPILLE AKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGF  ELSYFRRMGTKIV 
Subjt:  KLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVA

Query:  LITMIDDVYDVYGTLNELKLFTNAIESFN
        LITMIDDVYDVYGTL+ELKLFT AI+ ++
Subjt:  LITMIDDVYDVYGTLNELKLFTNAIESFN

A0A5D3E6Q4 Serine O-acetyltransferase3.6e-15698.28Show/hide
Query:  QARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL
        + RFSSQSP AVVDST NNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL
Subjt:  QARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL

Query:  RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT
        RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWN+SRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT
Subjt:  RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT

Query:  GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
        GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
Subjt:  GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII

Q39533 Serine O-acetyltransferase1.3e-15396.9Show/hide
Query:  QARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL
        + RFSSQS T VV+STTNNDETWLWGQIKAEAR+DAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDY LRSA VADL
Subjt:  QARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADL

Query:  RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT
        +AARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT
Subjt:  RAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGT

Query:  GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
        GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
Subjt:  GKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII

SwissProt top hitse value%identityAlignment
B9T536 Terpene synthase 103.9e-9954.35Show/hide
Query:  IQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKEERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGN-KKN
        I S + I+R+ ANY PP+W  DF+QSL +EF GE   +R  +LK  V+ +L +     +Q ELID LQ+LG++YHF  EIK +++ I N    +    K 
Subjt:  IQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKEERDSLEQLELIDALQKLGISYHFESEIKDVLERIGNKFYKEGN-KKN

Query:  SLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWR
         L+  +L+FRLLRQ   +IS+++F+ F+DE+GNFK C +EDI GMLSLYEAS+L  +GE ILE A+ FA   L +YI+ +KD+L   IV ++L++PLHWR
Subjt:  SLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKVEIVKNALKLPLHWR

Query:  IEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDD
        + +LE RW IDIYE+   +NP+LLE AKLDFN VQ+ Y EDLKY +SWWR+T LGEK+SFARD+LMENF WTVG+ F  +  YFRR+ TK+ +LIT+IDD
Subjt:  IEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGFHSELSYFRRMGTKIVALITMIDD

Query:  VYDVYGTLNELKLFTNAIESFN
        +YDVYGTL+EL+LFTNA+E ++
Subjt:  VYDVYGTLNELKLFTNAIESFN

Q0DGG8 Probable serine acetyltransferase 53.1e-11777.82Show/hide
Query:  TNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHC
        + ++E+W+W QIKAEAR+DA++EPALAS+LY+T+LSH SL RS+SFHL NKLCSSTLLSTLLYDLFL +F+    LR+A VADL AAR RDPACV FS C
Subjt:  TNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHC

Query:  LLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVL
        LLN+KGFLA QAHRV+H LW Q RRPLALALQSR+ADVFAVDIHPAA +GKGIL DHATGVV+GETAV+G+NVSILHHVTLGGTGK  GDRHPKIGDGVL
Subjt:  LLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVL

Query:  IGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKE-KPSQLEDIPGESMDHTSFISEWSDYII
        IGAGATILGNVKIG GAKIGAGSVVLIDVP R TAVGNPARL+G K  +  + ED+PGESMDHTSFI +WSDY I
Subjt:  IGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKE-KPSQLEDIPGESMDHTSFISEWSDYII

Q42538 Serine acetyltransferase 51.6e-12981.42Show/hide
Query:  LVSATQARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSA
        L S TQ+  +  +  A+  +  + +   LW QIKAEAR+DAE+EPALASYLYSTILSHSSLERS+SFHLGNKLCSSTLLSTLLYDLFLN FS+D  LR+A
Subjt:  LVSATQARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSA

Query:  AVADLRAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHV
         VADLRAAR RDPAC+SFSHCLLNYKGFLA QAHRV+HKLW QSR+PLALAL SRI+DVFAVDIHPAA+IGKGIL DHATGVVVGETAVIGNNVSILHHV
Subjt:  AVADLRAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHV

Query:  TLGGTGKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPS-QLEDIPGESMDHTSFISEWSDYII
        TLGGTGK CGDRHPKIGDG LIGAGATILGNVKIG GAK+GAGSVVLIDVP R TAVGNPARLVGGKEKP+   E+ PGESMDHTSFISEWSDYII
Subjt:  TLGGTGKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPS-QLEDIPGESMDHTSFISEWSDYII

Q6PWU2 (-)-alpha-terpineol synthase8.6e-9952.68Show/hide
Query:  CVVIERGMCIK-ASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKEERDSLEQLELIDALQKLGISYHFESEIKDVLER
        C    RG+ +K  +    ++IVR+ ANY+P +W  D++QSL +++ GETY RR  +LK  V+ +L + +  L+QLELID LQ+LGI YHF+ EIK +L  
Subjt:  CVVIERGMCIK-ASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKEERDSLEQLELIDALQKLGISYHFESEIKDVLER

Query:  IGNKFYK-EGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKV
        I N++ + E  +K+ LYAT+LEFRLLRQ   D+ +DVF+ FKD+ G+FK C  ED+ GML LYEAS+L  +GE+ +E+A+ FA ++L + ++ + D+   
Subjt:  IGNKFYK-EGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFLSTKGETILEEAKYFAIKYLNEYIKSSKDELKV

Query:  EIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGFHSELSYFRR
          VK+AL+LPLHWR+ +LEARW ID+YE+   +NPILLEFAKLDFNMVQ+ +QEDL++ SSWW  T LGEK++FARD+LMENF WTVG+ F  +  Y RR
Subjt:  EIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGFHSELSYFRR

Query:  MGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFN
        M TK+  LIT+IDDVYDVYGT++EL+LFT+ ++ ++
Subjt:  MGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFN

Q8W0E4 Probable serine acetyltransferase 11.8e-11778.97Show/hide
Query:  DETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHCLLN
        DE+W+W QIKAEAR+DA++EPALAS+LY+T+LSH SL+RSL+FHL NKLCSSTLLSTLLYDLF+ + +    LR+A VADL AAR RDPACV FSHCLLN
Subjt:  DETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHCLLN

Query:  YKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGA
        YKGFLA QA RVAH LW Q RR LALALQSR+A+VFAVDIHPAA IGKG+L DHATGVV+GETAVIG+NVSILHHVTLGGTGK  GDRHPKIGDGVLIGA
Subjt:  YKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGA

Query:  GATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
        GATILGNV+IG GAKIGAGS+VLIDVPPRTTAVGNPARL+GGK+     +D+PGESMDHTSFI +WSDY I
Subjt:  GATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII

Arabidopsis top hitse value%identityAlignment
AT1G55920.1 serine acetyltransferase 2;12.0e-9058.63Show/hide
Query:  VDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVS
        +  T   D+  +W ++  EA+ D + EP L++Y Y++I SH SLE +L+  L  KL +  L S  L++LF++       +  +   DL A +ERDPAC+S
Subjt:  VDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVS

Query:  FSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIG
        + HC L +KGFLACQAHR+AH LW Q+R+ +AL +Q+R+++ FAVDIHP A+IGKGIL DHATGVV+GETAV+G+NVSILH VTLGGTGK  GDRHPKIG
Subjt:  FSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIG

Query:  DGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
        DGVLIGAG+ ILGN+ IGEGAKIG+GSVV+ DVP RTTAVGNPARL+GGKE P + + IP  +MD TS+++EWSDY+I
Subjt:  DGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII

AT2G17640.1 Trimeric LpxA-like enzymes superfamily protein8.0e-7657.25Show/hide
Query:  LWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHCLLNYKGF
        +W  I+ EA+ +AE EP L+S+LY+ IL+H  LE++L F L N+L + TLL+T L D+F      D G++S+   DL+A ++RDPAC+S+S  +L+ KG+
Subjt:  LWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHCLLNYKGF

Query:  LACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGAGATI
         A QA+RVAHKLWN+ R+ LALALQSRI++VF +DIHPAARIG+GIL DH TGVV+GETAVIGN VSILH VTLGGTGK  GDRHPKIG+G L+GA  TI
Subjt:  LACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGAGATI

Query:  LGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDH
        LGN+ IG GA + AGS+VL DVP  +   GNPA+L+   E     E  P  +M H
Subjt:  LGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDH

AT3G13110.1 serine acetyltransferase 2;22.1e-9261.42Show/hide
Query:  LWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHCLLNYKGF
        +W +I+ EA+ D   EP +++Y +++I+S  SLE +L+  L  KL +  L S  L+DLF      +  +  +   DL A +ERDPAC+S+ HC L++KGF
Subjt:  LWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHCLLNYKGF

Query:  LACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGAGATI
        LACQAHR+AH+LW Q R+ LAL +Q+R+++ FAVD HP A+IG GIL DHAT +V+GETAV+GNNVSILH+VTLGGTGK CGDRHPKIGDGVLIGAG  I
Subjt:  LACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGAGATI

Query:  LGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII
        LGN+ IGEGAKIGAGSVVL DVPPRTTAVGNPARL+GGK+ P   + IPG +MD TS ISEWSDY+I
Subjt:  LGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII

AT4G35640.1 serine acetyltransferase 3;22.7e-7957.63Show/hide
Query:  TNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHC
        TN+    +W  I+ EA+ +AE EP L+S+LY++ILSH  LE++LSF L N+L + TLL+T L D+F N    D G++S+   D++A ++RDPAC+S+S  
Subjt:  TNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHC

Query:  LLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVL
        +L+ KG+LA QA+RVAHKLW Q R+ LALALQSR+++VF +DIHPAARIGKGIL DH TGVV+GETAVIG+ VSILH VTLGGTGK  GDRHP IGDG L
Subjt:  LLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVL

Query:  IGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDH
        +GA  TILGN+KIG GA + AGS+VL DVP  +   GNPA+L+G  +     E  P  +M+H
Subjt:  IGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDH

AT5G56760.1 serine acetyltransferase 1;11.1e-13081.42Show/hide
Query:  LVSATQARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSA
        L S TQ+  +  +  A+  +  + +   LW QIKAEAR+DAE+EPALASYLYSTILSHSSLERS+SFHLGNKLCSSTLLSTLLYDLFLN FS+D  LR+A
Subjt:  LVSATQARFSSQSPTAVVDSTTNNDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSA

Query:  AVADLRAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHV
         VADLRAAR RDPAC+SFSHCLLNYKGFLA QAHRV+HKLW QSR+PLALAL SRI+DVFAVDIHPAA+IGKGIL DHATGVVVGETAVIGNNVSILHHV
Subjt:  AVADLRAARERDPACVSFSHCLLNYKGFLACQAHRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHV

Query:  TLGGTGKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPS-QLEDIPGESMDHTSFISEWSDYII
        TLGGTGK CGDRHPKIGDG LIGAGATILGNVKIG GAK+GAGSVVLIDVP R TAVGNPARLVGGKEKP+   E+ PGESMDHTSFISEWSDYII
Subjt:  TLGGTGKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAGSVVLIDVPPRTTAVGNPARLVGGKEKPS-QLEDIPGESMDHTSFISEWSDYII


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCAATGGCTCTTCTCCACCTCCCTCTTTCTTCTACATTCTCTTTTCATGGAGCTGCCCTACCATCTACAAACTATCAACCTTCTTCTACTATGTTAAAATGTGT
TGTGATTGAACGTGGAATGTGCATAAAAGCATCGATACAAAGTGGTGATGTTATTGTGAGGCAATGTGCAAATTACAATCCTCCTTTGTGGAAAGATGATTTTATTCAAT
CATTACATAATGAATTCAGGGGAGAAACATATCGAAGACGATTTAGTCAATTAAAAGGACAAGTTCAAACATTGCTTAAAGAAGAAAGAGACTCTTTGGAGCAACTCGAA
CTCATTGATGCCTTACAAAAGCTTGGAATATCATACCACTTTGAGAGTGAAATTAAAGATGTACTAGAAAGAATAGGCAACAAGTTTTATAAAGAGGGGAATAAGAAGAA
TAGTCTCTATGCAACATCTCTTGAATTTCGACTTCTAAGACAACGTCAACTTGATATTTCTGAAGATGTTTTCAATGCCTTCAAAGATGAGATGGGGAATTTCAAAACAT
GCTTTTATGAAGATATAAATGGAATGCTATCTTTATATGAAGCTTCATTCTTATCAACTAAAGGGGAGACTATTTTAGAGGAAGCAAAATACTTTGCAATAAAATATCTA
AATGAATACATCAAATCAAGCAAAGATGAACTCAAAGTAGAGATTGTAAAGAATGCCTTGAAGCTTCCTTTACATTGGAGAATAGAAAAATTGGAGGCAAGATGGAGTAT
TGATATATATGAGAGAATAGGAACCCTAAATCCTATTCTTCTTGAATTTGCTAAGCTTGATTTCAACATGGTGCAATCTATTTACCAAGAAGATCTTAAATATGCATCAA
GTTGGTGGAGAGACACAGAGCTCGGAGAAAAGATGAGCTTTGCAAGAGACCAACTGATGGAAAACTTCTATTGGACAGTAGGCATTGGATTTCATTCTGAGCTTTCATAT
TTTAGAAGAATGGGCACAAAGATTGTGGCATTGATTACAATGATTGATGATGTTTATGATGTCTATGGCACATTGAATGAACTCAAACTCTTTACAAATGCAATAGAGAG
CTTTAACCACGAACGGCGCCGGCAACAGCAGCGACGGCTACAAGCAGCTTCCGGCGGCGGATGCACAGCAGCCTCCTCGGCGATTGCCACCTTCGAGGTACCCACGCCAC
TGGTTTCCAGAGAAGCCACGGCGTTTGACGATTTCAGCGACTTGGTATCAGCAACCCAAGCTCGATTTTCATCTCAGTCTCCGACCGCAGTGGTGGATTCTACTACGAAT
AACGATGAGACATGGCTCTGGGGGCAGATCAAAGCGGAGGCACGGCAAGATGCCGAGTCAGAGCCAGCACTGGCTAGCTATCTTTACTCGACGATTTTGTCGCATTCATC
GCTCGAGAGATCACTTTCGTTTCATTTGGGAAACAAACTTTGCTCTTCCACGCTTCTTTCCACTCTCCTTTACGATCTTTTCCTCAACGCTTTCTCCACTGATTATGGTC
TACGATCAGCCGCTGTCGCTGATTTGCGAGCGGCTCGTGAACGGGACCCAGCCTGTGTTTCATTTTCACATTGCCTCCTCAATTACAAAGGATTCTTAGCCTGCCAGGCT
CATCGTGTGGCTCACAAGCTGTGGAATCAATCACGTAGGCCGCTAGCACTAGCACTTCAATCACGCATTGCTGATGTCTTCGCCGTTGACATTCATCCTGCAGCACGAAT
TGGGAAAGGTATTCTGTTTGATCATGCTACTGGTGTAGTGGTTGGTGAGACGGCAGTGATAGGCAACAATGTCTCAATTCTTCATCATGTCACTCTTGGAGGGACAGGAA
AGATGTGTGGAGACAGGCATCCAAAGATTGGGGATGGTGTCTTAATTGGCGCTGGAGCAACCATTCTCGGCAATGTGAAGATTGGAGAAGGAGCTAAAATTGGGGCAGGA
TCTGTGGTGCTCATTGATGTGCCACCACGAACAACTGCCGTGGGAAATCCCGCAAGGCTGGTGGGGGGGAAGGAGAAACCATCGCAGCTCGAGGATATTCCTGGAGAATC
CATGGATCATACTTCTTTCATATCCGAATGGTCAGATTACATAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTCAATGGCTCTTCTCCACCTCCCTCTTTCTTCTACATTCTCTTTTCATGGAGCTGCCCTACCATCTACAAACTATCAACCTTCTTCTACTATGTTAAAATGTGT
TGTGATTGAACGTGGAATGTGCATAAAAGCATCGATACAAAGTGGTGATGTTATTGTGAGGCAATGTGCAAATTACAATCCTCCTTTGTGGAAAGATGATTTTATTCAAT
CATTACATAATGAATTCAGGGGAGAAACATATCGAAGACGATTTAGTCAATTAAAAGGACAAGTTCAAACATTGCTTAAAGAAGAAAGAGACTCTTTGGAGCAACTCGAA
CTCATTGATGCCTTACAAAAGCTTGGAATATCATACCACTTTGAGAGTGAAATTAAAGATGTACTAGAAAGAATAGGCAACAAGTTTTATAAAGAGGGGAATAAGAAGAA
TAGTCTCTATGCAACATCTCTTGAATTTCGACTTCTAAGACAACGTCAACTTGATATTTCTGAAGATGTTTTCAATGCCTTCAAAGATGAGATGGGGAATTTCAAAACAT
GCTTTTATGAAGATATAAATGGAATGCTATCTTTATATGAAGCTTCATTCTTATCAACTAAAGGGGAGACTATTTTAGAGGAAGCAAAATACTTTGCAATAAAATATCTA
AATGAATACATCAAATCAAGCAAAGATGAACTCAAAGTAGAGATTGTAAAGAATGCCTTGAAGCTTCCTTTACATTGGAGAATAGAAAAATTGGAGGCAAGATGGAGTAT
TGATATATATGAGAGAATAGGAACCCTAAATCCTATTCTTCTTGAATTTGCTAAGCTTGATTTCAACATGGTGCAATCTATTTACCAAGAAGATCTTAAATATGCATCAA
GTTGGTGGAGAGACACAGAGCTCGGAGAAAAGATGAGCTTTGCAAGAGACCAACTGATGGAAAACTTCTATTGGACAGTAGGCATTGGATTTCATTCTGAGCTTTCATAT
TTTAGAAGAATGGGCACAAAGATTGTGGCATTGATTACAATGATTGATGATGTTTATGATGTCTATGGCACATTGAATGAACTCAAACTCTTTACAAATGCAATAGAGAG
CTTTAACCACGAACGGCGCCGGCAACAGCAGCGACGGCTACAAGCAGCTTCCGGCGGCGGATGCACAGCAGCCTCCTCGGCGATTGCCACCTTCGAGGTACCCACGCCAC
TGGTTTCCAGAGAAGCCACGGCGTTTGACGATTTCAGCGACTTGGTATCAGCAACCCAAGCTCGATTTTCATCTCAGTCTCCGACCGCAGTGGTGGATTCTACTACGAAT
AACGATGAGACATGGCTCTGGGGGCAGATCAAAGCGGAGGCACGGCAAGATGCCGAGTCAGAGCCAGCACTGGCTAGCTATCTTTACTCGACGATTTTGTCGCATTCATC
GCTCGAGAGATCACTTTCGTTTCATTTGGGAAACAAACTTTGCTCTTCCACGCTTCTTTCCACTCTCCTTTACGATCTTTTCCTCAACGCTTTCTCCACTGATTATGGTC
TACGATCAGCCGCTGTCGCTGATTTGCGAGCGGCTCGTGAACGGGACCCAGCCTGTGTTTCATTTTCACATTGCCTCCTCAATTACAAAGGATTCTTAGCCTGCCAGGCT
CATCGTGTGGCTCACAAGCTGTGGAATCAATCACGTAGGCCGCTAGCACTAGCACTTCAATCACGCATTGCTGATGTCTTCGCCGTTGACATTCATCCTGCAGCACGAAT
TGGGAAAGGTATTCTGTTTGATCATGCTACTGGTGTAGTGGTTGGTGAGACGGCAGTGATAGGCAACAATGTCTCAATTCTTCATCATGTCACTCTTGGAGGGACAGGAA
AGATGTGTGGAGACAGGCATCCAAAGATTGGGGATGGTGTCTTAATTGGCGCTGGAGCAACCATTCTCGGCAATGTGAAGATTGGAGAAGGAGCTAAAATTGGGGCAGGA
TCTGTGGTGCTCATTGATGTGCCACCACGAACAACTGCCGTGGGAAATCCCGCAAGGCTGGTGGGGGGGAAGGAGAAACCATCGCAGCTCGAGGATATTCCTGGAGAATC
CATGGATCATACTTCTTTCATATCCGAATGGTCAGATTACATAATTTGA
Protein sequenceShow/hide protein sequence
MLSMALLHLPLSSTFSFHGAALPSTNYQPSSTMLKCVVIERGMCIKASIQSGDVIVRQCANYNPPLWKDDFIQSLHNEFRGETYRRRFSQLKGQVQTLLKEERDSLEQLE
LIDALQKLGISYHFESEIKDVLERIGNKFYKEGNKKNSLYATSLEFRLLRQRQLDISEDVFNAFKDEMGNFKTCFYEDINGMLSLYEASFLSTKGETILEEAKYFAIKYL
NEYIKSSKDELKVEIVKNALKLPLHWRIEKLEARWSIDIYERIGTLNPILLEFAKLDFNMVQSIYQEDLKYASSWWRDTELGEKMSFARDQLMENFYWTVGIGFHSELSY
FRRMGTKIVALITMIDDVYDVYGTLNELKLFTNAIESFNHERRRQQQRRLQAASGGGCTAASSAIATFEVPTPLVSREATAFDDFSDLVSATQARFSSQSPTAVVDSTTN
NDETWLWGQIKAEARQDAESEPALASYLYSTILSHSSLERSLSFHLGNKLCSSTLLSTLLYDLFLNAFSTDYGLRSAAVADLRAARERDPACVSFSHCLLNYKGFLACQA
HRVAHKLWNQSRRPLALALQSRIADVFAVDIHPAARIGKGILFDHATGVVVGETAVIGNNVSILHHVTLGGTGKMCGDRHPKIGDGVLIGAGATILGNVKIGEGAKIGAG
SVVLIDVPPRTTAVGNPARLVGGKEKPSQLEDIPGESMDHTSFISEWSDYII