; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0023542 (gene) of Chayote v1 genome

Gene IDSed0023542
OrganismSechium edule (Chayote v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationLG10:34471123..34476941
RNA-Seq ExpressionSed0023542
SyntenySed0023542
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592197.1 hypothetical protein SDJN03_14543, partial [Cucurbita argyrosperma subsp. sororia]1.5e-10979.2Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED
        MVHLLPNPLR+  HL  +PPRTA  ARA        I RNGGI+ R A  VRCS +Y SPIAA A  T+EE I IS E  N YLA EFGWKVRKLIEEED
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED

Query:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS
        DLR VA IQA+AFHEP+ LFNHFF+ FFQAEVLSALIYRLK+YPPDRYACLVAE ESESC  EYN VGVVDVTVAGDLKVRRLLPAG KEYLFVTGIAV 
Subjt:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS

Query:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
         TARRRKVATALLKGCDML K+WGFK+LALSAYEDDYGARNLYSKAGY+VL +DPLWKSSWIGRKRCVTMVKQL
Subjt:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

XP_008461923.1 PREDICTED: uncharacterized protein LOC103500412 isoform X1 [Cucumis melo]1.5e-10977.94Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPA----RAIWRNGG-IRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESG-NEYLAKEFGWKVRKLIEEEDDL
        MVHLLPNPLR+  HL SEPPRTA P     RA+WR GG I+ RSA  VRCS +Y SPI A A AT EELI +S+E G NEYLA+EFGWKVRKLIEEEDDL
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPA----RAIWRNGG-IRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESG-NEYLAKEFGWKVRKLIEEEDDL

Query:  RMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESE--SCEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLT
        R VA IQA+AFHEP+ LFNHFF+ FFQAEVLSALIYRLKNYP +RYACLVAE ESE    EYN VGVVDVTVAGDLKV+RLLP G KEYLFVTGIAV+  
Subjt:  RMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESE--SCEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLT

Query:  ARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
        ARRRKVATALLKGCDMLGK+WGFK+LALSAYEDDYGARNLYSKAGY+V  +DPLWKS+WIGRKRCVTM+K+L
Subjt:  ARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

XP_022936794.1 uncharacterized protein LOC111443273 [Cucurbita moschata]3.3e-10978.83Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED
        MVHLLPNPLR+  HL  +PPRTA  ARA        I RNGGI+ R A  VRCS +Y SPIAA A  T+EE I IS E  N YLA EFGWKVRKLIEEED
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED

Query:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS
        DLR VA IQA+AFHEP+ LFNHFF+ FFQAEVLSALIYRLK+YPPDRYACLVAE ESE+C  EYN VGVVDVTVAGDLKVRRLLPAG KEYLFVTGIAV 
Subjt:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS

Query:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
         TARRRKVATALLKGCDML K+WGFK+LALSAYEDDYGARNLYSKAGY+VL +DPLWKSSWIGRKRCVTMVKQL
Subjt:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

XP_022976620.1 uncharacterized protein LOC111476967 [Cucurbita maxima]2.5e-10978.83Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED
        MVHLLPNPLR+  HL S+PPRTA  ARA        I RNGGI+ R A  VRCS +Y SPI A A  T+EE I IS E  N YLA EFGWKVRKLIEEED
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED

Query:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS
        DLR VA IQA+AFHEP+ LFNHFF+ FFQAEVLSALIYRLKNYPPDRYACLVAE ESE C  EYN VGVVDVTVAGDLKVRRLLPAG KEYLFVTGIAV 
Subjt:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS

Query:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
         TARRRKVATALLKGCDML ++WGFK+LALSAYEDDYGARNLYSKAGY+VL +DPLWKSSWIGRKRCVTMVKQL
Subjt:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida]7.0e-11277.45Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPAR--------AIWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKE-SGNEYLAKEFGWKVRKLIEEE
        MVHLLPNPLR+ PHL SEPPRTA  AR        AIWRNGGI+ RSA  +RCS +Y SP       TAEE IG+ +E + NEYLA+EFGW VRKLIEEE
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPAR--------AIWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKE-SGNEYLAKEFGWKVRKLIEEE

Query:  DDLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAV
        DDLR VA IQA+AFHEP+ LFN FF+ FFQAEVLSALIYRLKNYPPDRYACLVAE ESESC  EYN VGVVDVTVAGDLKV+RLLPAG+KEYLFVTGIAV
Subjt:  DDLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAV

Query:  SLTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
        + TARRRKVATALLKGCDMLGK+WGFK+LALSAYEDDYGARNLYSKAGY+V  +DPLWKS+WIGRKRCVTM+K+L
Subjt:  SLTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

TrEMBL top hitse value%identityAlignment
A0A0A0K826 N-acetyltransferase domain-containing protein4.6e-10976.47Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPARA----IWR-NGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKE-SGNEYLAKEFGWKVRKLIEEEDDL
        MVHLLPNPLR+P HL SEPP TA P R+    +WR  GGI+ RSA  VRCS +Y SPI A A  T EEL+G+S+E   NEYLA EFGWKVRKLIEEEDDL
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPARA----IWR-NGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKE-SGNEYLAKEFGWKVRKLIEEEDDL

Query:  RMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESE--SCEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLT
        + VA IQA+AFHEP+ LFNHFF+ FFQAEVLSALIYRLKNYP DRYACLVAE ESE    EYN VGVVDVTVAGDLK++RLLP G KEYLFVTGIAV+  
Subjt:  RMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESE--SCEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLT

Query:  ARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
        ARRRKVATALLKGCDMLGK+WGFK+LALSAYEDDYGARNLYSKAGY+V  +DPLWKS+WIGRKRCVTM+K+L
Subjt:  ARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X17.1e-11077.94Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPA----RAIWRNGG-IRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESG-NEYLAKEFGWKVRKLIEEEDDL
        MVHLLPNPLR+  HL SEPPRTA P     RA+WR GG I+ RSA  VRCS +Y SPI A A AT EELI +S+E G NEYLA+EFGWKVRKLIEEEDDL
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPA----RAIWRNGG-IRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESG-NEYLAKEFGWKVRKLIEEEDDL

Query:  RMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESE--SCEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLT
        R VA IQA+AFHEP+ LFNHFF+ FFQAEVLSALIYRLKNYP +RYACLVAE ESE    EYN VGVVDVTVAGDLKV+RLLP G KEYLFVTGIAV+  
Subjt:  RMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESE--SCEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLT

Query:  ARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
        ARRRKVATALLKGCDMLGK+WGFK+LALSAYEDDYGARNLYSKAGY+V  +DPLWKS+WIGRKRCVTM+K+L
Subjt:  ARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

A0A1S4E3H1 uncharacterized protein LOC103500412 isoform X26.7e-10877.57Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPA----RAIWRNGG-IRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESG-NEYLAKEFGWKVRKLIEEEDDL
        MVHLLPNPLR+  HL SEPPRTA P     RA+WR GG I+ RSA  VRCS +Y SPI A A AT EELI +S+E G NEYLA+EFGWKVRKLIEEEDDL
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPA----RAIWRNGG-IRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESG-NEYLAKEFGWKVRKLIEEEDDL

Query:  RMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESE--SCEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLT
        R VA IQA+AFHEP+ LFNHFF+ FFQAEVLSALIYRLKNYP +RYACLVAE ESE    EYN VGVVDVTVAGDLKV+RLLP G KEYLFVTGIAV+  
Subjt:  RMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESE--SCEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLT

Query:  ARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
        A RRKVATALLKGCDMLGK+WGFK+LALSAYEDDYGARNLYSKAGY+V  +DPLWKS+WIGRKRCVTM+K+L
Subjt:  ARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

A0A6J1F9A7 uncharacterized protein LOC1114432731.6e-10978.83Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED
        MVHLLPNPLR+  HL  +PPRTA  ARA        I RNGGI+ R A  VRCS +Y SPIAA A  T+EE I IS E  N YLA EFGWKVRKLIEEED
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED

Query:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS
        DLR VA IQA+AFHEP+ LFNHFF+ FFQAEVLSALIYRLK+YPPDRYACLVAE ESE+C  EYN VGVVDVTVAGDLKVRRLLPAG KEYLFVTGIAV 
Subjt:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS

Query:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
         TARRRKVATALLKGCDML K+WGFK+LALSAYEDDYGARNLYSKAGY+VL +DPLWKSSWIGRKRCVTMVKQL
Subjt:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

A0A6J1IJZ4 uncharacterized protein LOC1114769671.2e-10978.83Show/hide
Query:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED
        MVHLLPNPLR+  HL S+PPRTA  ARA        I RNGGI+ R A  VRCS +Y SPI A A  T+EE I IS E  N YLA EFGWKVRKLIEEED
Subjt:  MVHLLPNPLRIPPHLCSEPPRTAAPARA--------IWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEED

Query:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS
        DLR VA IQA+AFHEP+ LFNHFF+ FFQAEVLSALIYRLKNYPPDRYACLVAE ESE C  EYN VGVVDVTVAGDLKVRRLLPAG KEYLFVTGIAV 
Subjt:  DLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESC--EYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVS

Query:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL
         TARRRKVATALLKGCDML ++WGFK+LALSAYEDDYGARNLYSKAGY+VL +DPLWKSSWIGRKRCVTMVKQL
Subjt:  LTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein5.7e-5955.4Show/hide
Query:  ATAEELIGISKESGNE----YLAKEFGWKVRKL-IEEEDDLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESES--
        A+ E L  +++ +  E    YL  + GW VR+L  ++ED++R V+ +QA+AFH PL LF+ FF+ FFQAEVLSAL+Y+LKN PPDRYACLVAE  SE+  
Subjt:  ATAEELIGISKESGNE----YLAKEFGWKVRKL-IEEEDDLRMVATIQAQAFHEPLPLFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESES--

Query:  -CEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSS
            ++VGVVDVT   +  V R  P G +EYL+V+G+AVS + RR+K+A+ LLK CD+L  +WGFK LAL AYEDD  ARNLYS AGY V+  DPLW S+
Subjt:  -CEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLTARRRKVATALLKGCDMLGKIWGFKYLALSAYEDDYGARNLYSKAGYEVLSIDPLWKSS

Query:  WIGRKRCVTMVKQ
        WIGRKR V M K+
Subjt:  WIGRKRCVTMVKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCATTTACTTCCGAATCCCCTACGAATTCCGCCGCATCTCTGCTCCGAGCCGCCGCGGACGGCGGCGCCGGCCAGAGCGATTTGGAGAAATGGAGGAATTAGGGG
TAGATCGGCGGCGGGTGTGCGGTGTAGTATGGAGTATTTGAGTCCGATCGCGGCGGAGGCGACGGCGACGGCGGAGGAGTTGATCGGAATATCGAAAGAAAGTGGAAATG
AGTATTTAGCAAAGGAATTTGGATGGAAGGTGAGGAAATTGATTGAAGAAGAAGATGATTTGAGAATGGTGGCCACAATTCAAGCTCAAGCTTTTCATGAACCTCTTCCC
CTTTTCAACCATTTTTTCTACCACTTCTTTCAGGCAGAAGTGCTGTCAGCGTTGATTTACAGATTAAAAAATTACCCTCCGGACAGGTATGCTTGTTTGGTTGCGGAGGC
GGAAAGTGAAAGTTGTGAATACAATATAGTCGGAGTGGTGGACGTGACGGTGGCCGGAGATTTGAAAGTAAGGCGTCTCCTTCCCGCCGGCCAAAAAGAATATCTCTTTG
TAACTGGAATCGCCGTTTCACTAACTGCCAGAAGGCGCAAAGTAGCAACGGCACTATTGAAGGGTTGTGACATGCTTGGGAAGATTTGGGGGTTCAAATATTTGGCACTA
AGTGCATATGAAGATGATTATGGGGCTCGAAATTTGTATAGTAAAGCAGGTTATGAGGTTTTATCCATTGATCCTCTTTGGAAATCTTCTTGGATTGGGAGAAAACGTTG
TGTTACAATGGTTAAACAGCTTTAG
mRNA sequenceShow/hide mRNA sequence
AAAATTTTCAATTAAAGGGGTTTTAAATTATCAGAAGCCGAATTTCAACAGCGACCATGGTTCATTTACTTCCGAATCCCCTACGAATTCCGCCGCATCTCTGCTCCGAG
CCGCCGCGGACGGCGGCGCCGGCCAGAGCGATTTGGAGAAATGGAGGAATTAGGGGTAGATCGGCGGCGGGTGTGCGGTGTAGTATGGAGTATTTGAGTCCGATCGCGGC
GGAGGCGACGGCGACGGCGGAGGAGTTGATCGGAATATCGAAAGAAAGTGGAAATGAGTATTTAGCAAAGGAATTTGGATGGAAGGTGAGGAAATTGATTGAAGAAGAAG
ATGATTTGAGAATGGTGGCCACAATTCAAGCTCAAGCTTTTCATGAACCTCTTCCCCTTTTCAACCATTTTTTCTACCACTTCTTTCAGGCAGAAGTGCTGTCAGCGTTG
ATTTACAGATTAAAAAATTACCCTCCGGACAGGTATGCTTGTTTGGTTGCGGAGGCGGAAAGTGAAAGTTGTGAATACAATATAGTCGGAGTGGTGGACGTGACGGTGGC
CGGAGATTTGAAAGTAAGGCGTCTCCTTCCCGCCGGCCAAAAAGAATATCTCTTTGTAACTGGAATCGCCGTTTCACTAACTGCCAGAAGGCGCAAAGTAGCAACGGCAC
TATTGAAGGGTTGTGACATGCTTGGGAAGATTTGGGGGTTCAAATATTTGGCACTAAGTGCATATGAAGATGATTATGGGGCTCGAAATTTGTATAGTAAAGCAGGTTAT
GAGGTTTTATCCATTGATCCTCTTTGGAAATCTTCTTGGATTGGGAGAAAACGTTGTGTTACAATGGTTAAACAGCTTTAGTTCCATTTCTTAACTTTGGTTTTGTTTTT
AAACTTCTCTACTTCCACGAACACATCC
Protein sequenceShow/hide protein sequence
MVHLLPNPLRIPPHLCSEPPRTAAPARAIWRNGGIRGRSAAGVRCSMEYLSPIAAEATATAEELIGISKESGNEYLAKEFGWKVRKLIEEEDDLRMVATIQAQAFHEPLP
LFNHFFYHFFQAEVLSALIYRLKNYPPDRYACLVAEAESESCEYNIVGVVDVTVAGDLKVRRLLPAGQKEYLFVTGIAVSLTARRRKVATALLKGCDMLGKIWGFKYLAL
SAYEDDYGARNLYSKAGYEVLSIDPLWKSSWIGRKRCVTMVKQL