; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0008972 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0008972
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionN-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X1
Genome locationchr02:13121021..13124781
RNA-Seq ExpressionIVF0008972
SyntenyIVF0008972
Gene Ontology termsGO:0017196 - N-terminal peptidyl-methionine acetylation (biological process)
GO:0031417 - NatC complex (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR007244 - -alpha-acetyltransferase 35, NatC auxiliary subunit


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050112.1 N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X1 [Cucumis melo var. makuwa]4.28e-11878.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS                       NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

XP_008443937.1 PREDICTED: N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X1 [Cucumis melo]1.99e-11778.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS                       NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

XP_008443938.1 PREDICTED: N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X2 [Cucumis melo]4.06e-11878.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS                       NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

XP_008443940.1 PREDICTED: N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X4 [Cucumis melo]1.04e-11878.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS                       NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

XP_016899813.1 PREDICTED: N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X3 [Cucumis melo]3.45e-11878.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS                       NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARIS---------------------LFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

TrEMBL top hitse value%identityAlignment
A0A1S3B966 N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X11.6e-9778.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI                     S  NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

A0A1S3B9A1 N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X21.6e-9778.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI                     S  NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

A0A1S4DV16 N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X31.6e-9778.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI                     S  NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

A0A5A7U2P9 N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X11.6e-9778.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI                     S  NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

A0A5D3C7R5 N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X11.6e-9778.69Show/hide
Query:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ
        S   NAWMKIFQHILIWVEEQT           F +    PGDYCMVYWYLSVVLIKLVEKIHLRALMSNET      +KGASKDIGKDFRIPPAVSFLQ
Subjt:  SKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNET------EKGASKDIGKDFRIPPAVSFLQ

Query:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR
        CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI                     S  NDPEKLVELR
Subjt:  CQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARI---------------------SLFNDPEKLVELR

Query:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK
        RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA +K
Subjt:  RIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLK

SwissProt top hitse value%identityAlignment
Q3E9A4 Probable glycosyltransferase At5g202601.4e-0556Show/hide
Query:  LAEINLPPTFHLNLPR-SGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK
        + EIN+P   HL  PR S     +R ILAFFAGG+HG+IR IL+QH K K
Subjt:  LAEINLPPTFHLNLPR-SGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK

Q3EAR7 Probable glycosyltransferase At3g421804.7e-0653.06Show/hide
Query:  LAEINLPPTFHLNLPRSGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK
        + EIN+P    L  P  GQ P+NR+ILAFFAG  HG+IR +L  H K K
Subjt:  LAEINLPPTFHLNLPRSGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK

Q9LFP3 Probable glycosyltransferase At5g111305.0e-0857.14Show/hide
Query:  LAEINLPPTFHLNLPRSGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK
        L EIN+P +  L    +G+PPQNR +LAFFAGG+HG +R IL QH K K
Subjt:  LAEINLPPTFHLNLPRSGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK

Arabidopsis top hitse value%identityAlignment
AT2G11000.1 MAK10 homologue8.4e-4340.71Show/hide
Query:  LMQHGKTKTMKSKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNETE-------KGASKDIGK
        +MQ   +++  SK+ +  + I  HI   +EEQ            F +    P +YCMVYWY+ ++L KL E+   R L+   TE       K  S+D+ +
Subjt:  LMQHGKTKTMKSKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNETE-------KGASKDIGK

Query:  DFRIPPAVSFLQCQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARISLF-------------------
        + RI   V FL+CQ CLA+GL +M+AALRNE M  +S  PFN+E E+F QHFELLQKA +P+   Y+S+ +ST  AR+                      
Subjt:  DFRIPPAVSFLQCQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARISLF-------------------

Query:  --NDPEKLVELRRIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA
          NDP+KL E+  +E+VAE N VA+NL  +    D SLK+SFEF HHPYF TA
Subjt:  --NDPEKLVELRRIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA

AT2G11000.2 MAK10 homologue8.4e-4340.71Show/hide
Query:  LMQHGKTKTMKSKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNETE-------KGASKDIGK
        +MQ   +++  SK+ +  + I  HI   +EEQ            F +    P +YCMVYWY+ ++L KL E+   R L+   TE       K  S+D+ +
Subjt:  LMQHGKTKTMKSKSTNAWMKIFQHILIWVEEQT-----------FRIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNETE-------KGASKDIGK

Query:  DFRIPPAVSFLQCQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARISLF-------------------
        + RI   V FL+CQ CLA+GL +M+AALRNE M  +S  PFN+E E+F QHFELLQKA +P+   Y+S+ +ST  AR+                      
Subjt:  DFRIPPAVSFLQCQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPDNITYDSYEQSTRLARISLF-------------------

Query:  --NDPEKLVELRRIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA
          NDP+KL E+  +E+VAE N VA+NL  +    D SLK+SFEF HHPYF TA
Subjt:  --NDPEKLVELRRIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTA

AT3G42180.1 Exostosin family protein3.3e-0753.06Show/hide
Query:  LAEINLPPTFHLNLPRSGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK
        + EIN+P    L  P  GQ P+NR+ILAFFAG  HG+IR +L  H K K
Subjt:  LAEINLPPTFHLNLPRSGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK

AT5G11130.1 Exostosin family protein3.5e-0957.14Show/hide
Query:  LAEINLPPTFHLNLPRSGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK
        L EIN+P +  L    +G+PPQNR +LAFFAGG+HG +R IL QH K K
Subjt:  LAEINLPPTFHLNLPRSGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK

AT5G20260.1 Exostosin family protein9.7e-0756Show/hide
Query:  LAEINLPPTFHLNLPR-SGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK
        + EIN+P   HL  PR S     +R ILAFFAGG+HG+IR IL+QH K K
Subjt:  LAEINLPPTFHLNLPR-SGQPPQNRSILAFFAGGTHGFIRHILMQHGKTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGTGGAAAAAGCCCATTTTCGGCCCATGATCCAGAAGAGGCCCATAGTTCTTTGCAATGCCAACACATCCGAAGGCTTCAATCCAATGCGAGATGCATCCTTGC
CGAGATTAACTTACCTCCAACTTTCCACCTCAATCTTCCTCGATCAGGCCAACCGCCTCAGAACCGCTCCATTCTAGCCTTCTTCGCCGGCGGAACACACGGATTCATCC
GCCACATCTTAATGCAGCATGGAAAGACAAAGACGATGAAATCCAAGTCCACGAATGCATGGATGAAGATTTTCCAACATATCCTTATTTGGGTAGAGGAGCAAACGTTT
CGAATTAGAGCTGTACTCCCCGGTGATTATTGCATGGTATATTGGTACCTTTCTGTTGTTTTAATCAAGCTCGTGGAGAAAATACATCTGAGAGCACTGATGAGCAATGA
AACTGAAAAAGGAGCCTCAAAAGATATCGGGAAAGATTTCCGAATTCCACCAGCAGTGTCATTTCTTCAGTGCCAAATATGTCTCGCTGAAGGGCTAGTAATGATGCTTG
CTGCCTTGAGGAACGAACATATGATCGCACAGAGTCCAAGCCCCTTCAATAGCGAGTACGAGAGATTCTTTCAACATTTTGAGCTTCTACAAAAGGCTTGCATTCCCGAC
AACATTACATACGATTCATACGAGCAATCGACTCGCCTGGCTCGCATTTCTCTTTTCAACGACCCCGAAAAACTTGTCGAACTCCGAAGGATCGAGCAAGTTGCAGAGCA
CAACAGTGTTGCCTTGAACCTGATCCACAAGGTAGGGGGCCTTGACCCGTCCTTAAAGATTTCATTCGAGTTCAATCACCACCCATATTTCGGGACTGCCTGGTTAAAAG
ATCTTGAAGCTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAGTGGAAAAAGCCCATTTTCGGCCCATGATCCAGAAGAGGCCCATAGTTCTTTGCAATGCCAACACATCCGAAGGCTTCAATCCAATGCGAGATGCATCCTTGC
CGAGATTAACTTACCTCCAACTTTCCACCTCAATCTTCCTCGATCAGGCCAACCGCCTCAGAACCGCTCCATTCTAGCCTTCTTCGCCGGCGGAACACACGGATTCATCC
GCCACATCTTAATGCAGCATGGAAAGACAAAGACGATGAAATCCAAGTCCACGAATGCATGGATGAAGATTTTCCAACATATCCTTATTTGGGTAGAGGAGCAAACGTTT
CGAATTAGAGCTGTACTCCCCGGTGATTATTGCATGGTATATTGGTACCTTTCTGTTGTTTTAATCAAGCTCGTGGAGAAAATACATCTGAGAGCACTGATGAGCAATGA
AACTGAAAAAGGAGCCTCAAAAGATATCGGGAAAGATTTCCGAATTCCACCAGCAGTGTCATTTCTTCAGTGCCAAATATGTCTCGCTGAAGGGCTAGTAATGATGCTTG
CTGCCTTGAGGAACGAACATATGATCGCACAGAGTCCAAGCCCCTTCAATAGCGAGTACGAGAGATTCTTTCAACATTTTGAGCTTCTACAAAAGGCTTGCATTCCCGAC
AACATTACATACGATTCATACGAGCAATCGACTCGCCTGGCTCGCATTTCTCTTTTCAACGACCCCGAAAAACTTGTCGAACTCCGAAGGATCGAGCAAGTTGCAGAGCA
CAACAGTGTTGCCTTGAACCTGATCCACAAGGTAGGGGGCCTTGACCCGTCCTTAAAGATTTCATTCGAGTTCAATCACCACCCATATTTCGGGACTGCCTGGTTAAAAG
ATCTTGAAGCTTTTTGA
Protein sequenceShow/hide protein sequence
MDSGKSPFSAHDPEEAHSSLQCQHIRRLQSNARCILAEINLPPTFHLNLPRSGQPPQNRSILAFFAGGTHGFIRHILMQHGKTKTMKSKSTNAWMKIFQHILIWVEEQTF
RIRAVLPGDYCMVYWYLSVVLIKLVEKIHLRALMSNETEKGASKDIGKDFRIPPAVSFLQCQICLAEGLVMMLAALRNEHMIAQSPSPFNSEYERFFQHFELLQKACIPD
NITYDSYEQSTRLARISLFNDPEKLVELRRIEQVAEHNSVALNLIHKVGGLDPSLKISFEFNHHPYFGTAWLKDLEAF