; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002042 (gene) of Chayote v1 genome

Gene IDSed0002042
OrganismSechium edule (Chayote v1)
DescriptionNAC domain-containing protein
Genome locationLG11:25227432..25229985
RNA-Seq ExpressionSed0002042
SyntenySed0002042
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003441 - NAC domain
IPR036093 - NAC domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065255.1 NAC domain-containing protein 13-like [Cucumis melo var. makuwa]1.3e-4554.4Show/hide
Query:  LDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTA
        LD   +D E  LAL R+++G S P NVLD +NP+Q  P+ LP GVW++ + +GN +   S   WKPKGG  KL SD  F+  +TTYE+YEGQAP + KTA
Subjt:  LDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTA

Query:  WVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKM
        WVM EYW+ ++D+ AN+  ++TSSLCK+F+GDEQFQ+HEK  KI  S   NSEPN   HQLV      SNN ANGSTSK++M
Subjt:  WVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKM

XP_008444686.1 PREDICTED: NAC domain-containing protein 13-like [Cucumis melo]4.8e-4553.85Show/hide
Query:  LDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTA
        LD   +D E  LAL R+++G   P NVLD +NP+Q  P+ LP GVW++ + +GN +   S   WKPKGG  KL SD  F+  +TTYE+YEGQAP + KTA
Subjt:  LDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTA

Query:  WVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKM
        WVM EYW+ ++D+ AN+  ++TSSLCK+F+GDEQFQ+HEK  KI  S   NSEPN   HQLV      SNN ANGSTSK++M
Subjt:  WVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKM

XP_022996849.1 uncharacterized protein LOC111491972 isoform X2 [Cucurbita maxima]1.1e-4150.25Show/hide
Query:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP
        + SS  LD   TD E  LALWR+  G S P NV D +NP+Q  P+ LP GVWY+V+GN +     S G WK K GDCKL ++  FT  +T+YE+YEGQAP
Subjt:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP

Query:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKMVSNGEG-TAADG
         + KTAWVM +YW+ + D+ A    ++TSSLCKVF+GDEQFQ++E  +K  +S + NSEP    HQLV      SNN ANG TSK+KM  + +    AD 
Subjt:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKMVSNGEG-TAADG

Query:  GKK
        GK+
Subjt:  GKK

XP_038884713.1 NAC domain-containing protein 83-like isoform X1 [Benincasa hispida]1.1e-4454.36Show/hide
Query:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP
        + SS  LD   TD E  LAL R++ G   P NV D +NP    P+ LP  VWYH   NG  N D S G WKPKGGDCKL SD  FT  +TTYE+YEGQAP
Subjt:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP

Query:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDG--SNNKANGSTSKAKMVSNGE
        ++ KTAWVM +YW+ ++D+  N+  ++TSSLCKVF+GD QFQ+HEK +KI  SI+ NS+PN   HQLV   PDG  SN+ ANGSTSK++++S  +
Subjt:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDG--SNNKANGSTSKAKMVSNGE

XP_038884714.1 NAC domain-containing protein 83-like isoform X2 [Benincasa hispida]8.3e-4555.79Show/hide
Query:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP
        + SS  LD   TD E  LAL R++ G   P NV D +NP    P+ LP  VWYH   NG  N D S G WKPKGGDCKL SD  FT  +TTYE+YEGQAP
Subjt:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP

Query:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDG--SNNKANGSTSKAKM
        ++ KTAWVM +YW+ ++D+  N+  ++TSSLCKVF+GD QFQ+HEK +KI  SI+ NS+PN   HQLV   PDG  SN+ ANGSTSK++M
Subjt:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDG--SNNKANGSTSKAKM

TrEMBL top hitse value%identityAlignment
A0A1S3BBN6 NAC domain-containing protein 13-like2.3e-4553.85Show/hide
Query:  LDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTA
        LD   +D E  LAL R+++G   P NVLD +NP+Q  P+ LP GVW++ + +GN +   S   WKPKGG  KL SD  F+  +TTYE+YEGQAP + KTA
Subjt:  LDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTA

Query:  WVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKM
        WVM EYW+ ++D+ AN+  ++TSSLCK+F+GDEQFQ+HEK  KI  S   NSEPN   HQLV      SNN ANGSTSK++M
Subjt:  WVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKM

A0A5A7VHD5 NAC domain-containing protein 13-like6.2e-4654.4Show/hide
Query:  LDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTA
        LD   +D E  LAL R+++G S P NVLD +NP+Q  P+ LP GVW++ + +GN +   S   WKPKGG  KL SD  F+  +TTYE+YEGQAP + KTA
Subjt:  LDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTA

Query:  WVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKM
        WVM EYW+ ++D+ AN+  ++TSSLCK+F+GDEQFQ+HEK  KI  S   NSEPN   HQLV      SNN ANGSTSK++M
Subjt:  WVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKM

A0A6J1HGH3 NAC domain-containing protein 30-like7.1e-4250.25Show/hide
Query:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP
        + SS  LD   TD E  LALWR+  G S P NV D +NP+Q  P+ LP GVWY+V+GN +     S G WK K GDCKL S   FT  + +YE+YEGQAP
Subjt:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP

Query:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKMVSNGEGT-AADG
         + KTAWVM +YW+ + D+ A    ++TSSLCKVF+GDEQFQ++E  +K  +S + NSEP    HQLV      SNN A+GSTSK+KM  + +    AD 
Subjt:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKMVSNGEGT-AADG

Query:  GKK
        GK+
Subjt:  GKK

A0A6J1K9U4 uncharacterized protein LOC111491972 isoform X25.4e-4250.25Show/hide
Query:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP
        + SS  LD   TD E  LALWR+  G S P NV D +NP+Q  P+ LP GVWY+V+GN +     S G WK K GDCKL ++  FT  +T+YE+YEGQAP
Subjt:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP

Query:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKMVSNGEG-TAADG
         + KTAWVM +YW+ + D+ A    ++TSSLCKVF+GDEQFQ++E  +K  +S + NSEP    HQLV      SNN ANG TSK+KM  + +    AD 
Subjt:  NDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKMVSNGEG-TAADG

Query:  GKK
        GK+
Subjt:  GKK

A0A6J1KC65 uncharacterized protein LOC111491972 isoform X11.7e-4048.15Show/hide
Query:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP
        + SS  LD   TD E  LALWR+  G S P NV D +NP+Q  P+ LP GVWY+V+GN +     S G WK K GDCKL ++  FT  +T+YE+YEGQAP
Subjt:  MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAP

Query:  NDHKTAWVMLEYWICRSDMPAN-------------NNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAK
         + KTAWVM +YW+ + D+ A               N Q+TSSLCKVF+GDEQFQ++E  +K  +S + NSEP    HQLV      SNN ANG TSK+K
Subjt:  NDHKTAWVMLEYWICRSDMPAN-------------NNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAK

Query:  MVSNGEG-TAADGGKK
        M  + +    AD GK+
Subjt:  MVSNGEG-TAADGGKK

SwissProt top hitse value%identityAlignment
I1LPE9 NAC domain-containing protein 61.6e-0634.59Show/hide
Query:  LWRMMKGFSPPENVLDSVNPFQLHPQYLP----VG--VWY--------HVKGNGNENEDLSSGFWKPKGGDCKLPS-DFPFTII--KTTYEFYEGQAPND
        L  M+ G     +V+  +N +Q  P  LP    VG   WY        H  G G  N     GFWK  G D K+ +   P  II  + T  FY+G+AP  
Subjt:  LWRMMKGFSPPENVLDSVNPFQLHPQYLP----VG--VWY--------HVKGNGNENEDLSSGFWKPKGGDCKLPS-DFPFTII--KTTYEFYEGQAPND

Query:  HKTAWVMLEYWICRSDMPANNNGQKTSSLCKVF
         KT WVM EY      +P N    K   LCK++
Subjt:  HKTAWVMLEYWICRSDMPANNNGQKTSSLCKVF

O22798 NAC domain-containing protein 416.0e-0627.27Show/hide
Query:  NGN-ENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFI-------------GDEQFQD
        NGN  N    SG+WK  G D ++        +K T  FY+G+ PN  +T WV+ EY +  S   +         LC+VF+               E+ ++
Subjt:  NGN-ENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFI-------------GDEQFQD

Query:  HEKFKKIRYSIVRNSEPNCPI
         ++ +  R     N +  CPI
Subjt:  HEKFKKIRYSIVRNSEPNCPI

Q9FY93 NAC domain-containing protein 831.0e-0528.95Show/hide
Query:  NGN-ENEDLSSGFWKPKGGDCKLPSDFPFTII--KTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFI----GDEQFQDHEKFKKI
        NGN  N    SG+WK  G D ++ +     I+  K T  FY+G+ P+  +T W+M EY +  S  P++    +   LC++F+    G++   D    + +
Subjt:  NGN-ENEDLSSGFWKPKGGDCKLPSDFPFTII--KTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFI----GDEQFQDHEKFKKI

Query:  RYSIVRNSEPNCPI
        R++   NS     I
Subjt:  RYSIVRNSEPNCPI

Q9SQL0 NAC domain-containing protein JA21.6e-0628Show/hide
Query:  SLPLDYSCTDTEASLA---LWRMMKGFSPPENVLDSVNPFQLHPQYLPVGV------WYHVK------GNGNE-NEDLSSGFWKPKGGDCKLPSDFPFTI
        SLP  +    T+  L    L + + G   P  ++  ++ ++  P  LP         WY          NG+  N    SG+WK  G D  + S      
Subjt:  SLPLDYSCTDTEASLA---LWRMMKGFSPPENVLDSVNPFQLHPQYLPVGV------WYHVK------GNGNE-NEDLSSGFWKPKGGDCKLPSDFPFTI

Query:  IKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSS--LCKVF
        IK    FY G+AP   KT W+M EY +  S    NN   K     LC+++
Subjt:  IKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSS--LCKVF

Q9SQY0 NAC domain containing protein 527.9e-0635.8Show/hide
Query:  GNG-NENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNN-GQKTSSLCKVF
        GNG   N   + G+WK  G D ++  D     +K T  F+ G+AP+  +T WVM EY +   +   N N  Q    LC+VF
Subjt:  GNG-NENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNN-GQKTSSLCKVF

Arabidopsis top hitse value%identityAlignment
AT2G33480.1 NAC domain containing protein 414.3e-0727.27Show/hide
Query:  NGN-ENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFI-------------GDEQFQD
        NGN  N    SG+WK  G D ++        +K T  FY+G+ PN  +T WV+ EY +  S   +         LC+VF+               E+ ++
Subjt:  NGN-ENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVFI-------------GDEQFQD

Query:  HEKFKKIRYSIVRNSEPNCPI
         ++ +  R     N +  CPI
Subjt:  HEKFKKIRYSIVRNSEPNCPI

AT3G10480.2 NAC domain containing protein 502.5e-0733.75Show/hide
Query:  GNG-NENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVF
        GNG   N   + G+WK  G D ++  D     +K T  F+ G+AP+  +T WVM EY +   +   N +      LC+VF
Subjt:  GNG-NENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVF

AT3G10490.1 NAC domain containing protein 525.6e-0735.8Show/hide
Query:  GNG-NENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNN-GQKTSSLCKVF
        GNG   N   + G+WK  G D ++  D     +K T  F+ G+AP+  +T WVM EY +   +   N N  Q    LC+VF
Subjt:  GNG-NENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNN-GQKTSSLCKVF

AT3G10490.2 NAC domain containing protein 525.6e-0735.8Show/hide
Query:  GNG-NENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNN-GQKTSSLCKVF
        GNG   N   + G+WK  G D ++  D     +K T  F+ G+AP+  +T WVM EY +   +   N N  Q    LC+VF
Subjt:  GNG-NENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNN-GQKTSSLCKVF

AT3G17730.1 NAC domain containing protein 577.3e-0733.78Show/hide
Query:  NEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVF
        N     G+WK  G D ++ S      +K T  +Y+G+AP   +T WVM EY +   D    ++ Q + +LC+VF
Subjt:  NEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVMLEYWICRSDMPANNNGQKTSSLCKVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCTTCCCTTCCACTTGATTACTCTTGTACGGACACCGAGGCTTCGTTAGCATTATGGAGAATGATGAAGGGATTCTCTCCTCCCGAAAATGTTTTGGATTCTGT
AAATCCTTTCCAGCTTCACCCTCAATATTTACCAGTTGGAGTTTGGTATCATGTTAAGGGCAATGGGAATGAGAATGAAGACTTGAGTTCTGGTTTCTGGAAGCCTAAAG
GTGGAGACTGTAAGCTGCCTTCAGACTTTCCCTTCACTATAATCAAAACCACTTATGAATTTTATGAAGGTCAAGCACCTAATGACCATAAGACAGCCTGGGTTATGCTA
GAGTATTGGATATGTCGCTCGGACATGCCCGCAAACAACAACGGGCAGAAAACTAGCTCACTTTGCAAAGTGTTTATTGGAGATGAGCAATTTCAAGATCACGAAAAGTT
TAAGAAAATCAGATATTCTATTGTTCGCAATTCAGAACCCAACTGCCCAATTCATCAGTTGGTTGGTCTCGGTCCAGATGGTTCGAATAACAAGGCAAATGGGTCGACGA
GCAAGGCAAAGATGGTGTCAAATGGTGAAGGAACAGCTGCGGATGGAGGGAAGAAGAAGAAGAAGAAGAAAAAGAGTGTTGGCAAAATGAAGAAGATTCAGAAAAAGTAC
TTTTGTTCCTTCTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCTTCCCTTCCACTTGATTACTCTTGTACGGACACCGAGGCTTCGTTAGCATTATGGAGAATGATGAAGGGATTCTCTCCTCCCGAAAATGTTTTGGATTCTGT
AAATCCTTTCCAGCTTCACCCTCAATATTTACCAGTTGGAGTTTGGTATCATGTTAAGGGCAATGGGAATGAGAATGAAGACTTGAGTTCTGGTTTCTGGAAGCCTAAAG
GTGGAGACTGTAAGCTGCCTTCAGACTTTCCCTTCACTATAATCAAAACCACTTATGAATTTTATGAAGGTCAAGCACCTAATGACCATAAGACAGCCTGGGTTATGCTA
GAGTATTGGATATGTCGCTCGGACATGCCCGCAAACAACAACGGGCAGAAAACTAGCTCACTTTGCAAAGTGTTTATTGGAGATGAGCAATTTCAAGATCACGAAAAGTT
TAAGAAAATCAGATATTCTATTGTTCGCAATTCAGAACCCAACTGCCCAATTCATCAGTTGGTTGGTCTCGGTCCAGATGGTTCGAATAACAAGGCAAATGGGTCGACGA
GCAAGGCAAAGATGGTGTCAAATGGTGAAGGAACAGCTGCGGATGGAGGGAAGAAGAAGAAGAAGAAGAAAAAGAGTGTTGGCAAAATGAAGAAGATTCAGAAAAAGTAC
TTTTGTTCCTTCTTTTAA
Protein sequenceShow/hide protein sequence
MASSLPLDYSCTDTEASLALWRMMKGFSPPENVLDSVNPFQLHPQYLPVGVWYHVKGNGNENEDLSSGFWKPKGGDCKLPSDFPFTIIKTTYEFYEGQAPNDHKTAWVML
EYWICRSDMPANNNGQKTSSLCKVFIGDEQFQDHEKFKKIRYSIVRNSEPNCPIHQLVGLGPDGSNNKANGSTSKAKMVSNGEGTAADGGKKKKKKKKSVGKMKKIQKKY
FCSFF