; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030193 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030193
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNAC domain-containing protein
Genome locationtig00153574:1196324..1205518
RNA-Seq ExpressionSgr030193
SyntenySgr030193
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016020 - membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003441 - NAC domain
IPR036093 - NAC domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573635.1 NAC domain-containing protein 89, partial [Cucurbita argyrosperma subsp. sororia]4.3e-6253.04Show/hide
Query:  AAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCC-------------------------------
        AAE G+SRE+QLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVI EIEIYKYEPWDLP +                                 
Subjt:  AAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCC-------------------------------

Query:  ------------------------------------------FNG------FSSCLPPRRNNEFRQHGCSNRASSS---------------GFSSESYQK
                                                  FN       F      RRNNEFRQH CSNRA+SS                F+SES QK
Subjt:  ------------------------------------------FNG------FSSCLPPRRNNEFRQHGCSNRASSS---------------GFSSESYQK

Query:  DAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL
        D AEAA VESSGDQKDYGSDDFY++ILKDDI+NLD SAPY AAS+LFPLVFHRSD+E+K + D    LEWLPN G A+RRIRLK RE    GAKKL
Subjt:  DAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL

KAG7012716.1 NAC domain-containing protein 89, partial [Cucurbita argyrosperma subsp. argyrosperma]4.3e-6253.04Show/hide
Query:  AAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCC-------------------------------
        AAE G+SRE+QLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVI EIEIYKYEPWDLP +                                 
Subjt:  AAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCC-------------------------------

Query:  ------------------------------------------FNG------FSSCLPPRRNNEFRQHGCSNRASSS---------------GFSSESYQK
                                                  FN       F      RRNNEFRQH CSNRA+SS                F+SES QK
Subjt:  ------------------------------------------FNG------FSSCLPPRRNNEFRQHGCSNRASSS---------------GFSSESYQK

Query:  DAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL
        D AEAA VESSGDQKDYGSDDFY++ILKDDI+NLD SAPY AAS+LFPLVFHRSD+E+K + D    LEWLPN G A+RRIRLK RE    GAKKL
Subjt:  DAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL

XP_022142529.1 NAC domain-containing protein 40-like [Momordica charantia]2.7e-7255.31Show/hide
Query:  ISDLSPFPFRLLLLFRYGKTGFLVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLP----------
        IS  S FP   LL      +  ++ AAEKG+SREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEG EKSVEVI+E+EIYKYEPWDLP          
Subjt:  ISDLSPFPFRLLLLFRYGKTGFLVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLP----------

Query:  -------GQCCFNG-----------------------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASS
               G+   NG                                   F +   P                           RRNNEFRQH CSNRASS
Subjt:  -------GQCCFNG-----------------------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASS

Query:  SGF---------------SSESYQKDAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPYAASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGS
        S                 +SES QKDAAEAA+VESSGDQKDYGSDDFYTEILKDDIINLDES P+A SNL+PLVF RSD EK+S+HDAYGVLEW PN GS
Subjt:  SGF---------------SSESYQKDAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPYAASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGS

Query:  ASRRIRLKKREVGRCGAKKL
        A+RRIRLKKREVGRCG KKL
Subjt:  ASRRIRLKKREVGRCGAKKL

XP_023541345.1 LOW QUALITY PROTEIN: NAC domain-containing protein 89-like [Cucurbita pepo subsp. pepo]3.3e-6253.04Show/hide
Query:  AAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCC-------------------------------
        AAE G+SREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVI EIEIYKYEPWDLP +                                 
Subjt:  AAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCC-------------------------------

Query:  ------------------------------------------FNG------FSSCLPPRRNNEFRQHGCSNRASSS---------------GFSSESYQK
                                                  FN       F +    RRNNEFRQH CSNRA+SS                F+SES QK
Subjt:  ------------------------------------------FNG------FSSCLPPRRNNEFRQHGCSNRASSS---------------GFSSESYQK

Query:  DAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL
        D AEA  VESSGDQKDYGSDDFY++ILKDDI+NLD SAPY AAS+LFPLVFHRSD+E+K + D    LEWLPN G A+RRIRLK RE    GAKKL
Subjt:  DAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL

XP_038895559.1 NAC domain-containing protein 89-like isoform X1 [Benincasa hispida]5.5e-5749.33Show/hide
Query:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF-----------------NG--------
        +  A+E G+SREVQLSMAVSSMFPGFRFSPTDEEL+SFYLKKKLEGYEKSVE+I E+EIYKYEPWDLP +                    NG        
Subjt:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF-----------------NG--------

Query:  ---------------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASSSGF---------------SSES
                                   F +   P                           RRNNEFRQH CSNRASSS                 +SES
Subjt:  ---------------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASSSGF---------------SSES

Query:  YQKDAAEAALVESSG--DQKDYGSDDFYTEILKDDIINLDESAPYAASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL
         QKDAAE A VES G  DQKDYGSDDFY++ILKDDIINLD     AAS+  PL+FHRSD+E++S+H+A  V EW PN GSA+RRIRLK+RE    G KKL
Subjt:  YQKDAAEAALVESSG--DQKDYGSDDFYTEILKDDIINLDESAPYAASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL

TrEMBL top hitse value%identityAlignment
A0A1S3CJQ8 NAC domain-containing protein 89-like isoform X27.9e-5448.32Show/hide
Query:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF-----------------NG--------
        +   AE  +SREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEG+EKSVEVIAE+EIYKYEPWDLP +                    NG        
Subjt:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF-----------------NG--------

Query:  ---------------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASSSGF--------------SSESY
                                   F +   P                           RRNNEFRQH CSNRASSS                S+ES 
Subjt:  ---------------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASSSGF--------------SSESY

Query:  QKDAAEAALVESSG-DQKDYGSDDFYTEILKDDIINLDESAPYAASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL
        QKDAAE   VES   DQKD+GSDDFY++ILKDDIINLD +    AS+  PL+F RSD+E++S+H+   V EWL N GSA+RRIRLKKRE    G KKL
Subjt:  QKDAAEAALVESSG-DQKDYGSDDFYTEILKDDIINLDESAPYAASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL

A0A6J1CLS3 NAC domain-containing protein 40-like1.3e-7255.31Show/hide
Query:  ISDLSPFPFRLLLLFRYGKTGFLVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLP----------
        IS  S FP   LL      +  ++ AAEKG+SREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEG EKSVEVI+E+EIYKYEPWDLP          
Subjt:  ISDLSPFPFRLLLLFRYGKTGFLVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLP----------

Query:  -------GQCCFNG-----------------------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASS
               G+   NG                                   F +   P                           RRNNEFRQH CSNRASS
Subjt:  -------GQCCFNG-----------------------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASS

Query:  SGF---------------SSESYQKDAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPYAASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGS
        S                 +SES QKDAAEAA+VESSGDQKDYGSDDFYTEILKDDIINLDES P+A SNL+PLVF RSD EK+S+HDAYGVLEW PN GS
Subjt:  SGF---------------SSESYQKDAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPYAASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGS

Query:  ASRRIRLKKREVGRCGAKKL
        A+RRIRLKKREVGRCG KKL
Subjt:  ASRRIRLKKREVGRCGAKKL

A0A6J1ELC0 NAC domain-containing protein 40-like isoform X24.7e-5450.18Show/hide
Query:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF----------NG---------------
        +   AE G+S+E+QLSMAVSSMFPGFRFSPTDEELISFYLKKKL+GYEKSVE+IAE+EIYKYEPWDLP    F          NG               
Subjt:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF----------NG---------------

Query:  --------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASSSGF---------------SSESYQKDAAE
                            F +   P                           RRNNEFRQH  SNRAS S                 +SES QKD AE
Subjt:  --------------------FSSCLPP---------------------------RRNNEFRQHGCSNRASSSGF---------------SSESYQKDAAE

Query:  AALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKRE
        AALVE S  QKDYGSDDFY++ILKDDIINLD +APY AAS+L PL FHRSD+ ++S+ +A   LEWLP+ G A+RRIRLK+RE
Subjt:  AALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKRE

A0A6J1I6Q6 NAC domain-containing protein 40-like isoform X23.2e-5550.52Show/hide
Query:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF----------NG---------------
        +  A+E G+S+E+QLSMAVSSMFPGFRFSPTDEELISFYLKKKL+GYEKSVE+IAE+EIYKYEPWDLP    F          NG               
Subjt:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF----------NG---------------

Query:  --------------------FSSCLPP--------------------------RRNNEFRQHGCSNRASSSGF---------------SSESYQKDAAEA
                            F +   P                          RRNNEFRQH  SNRAS S                 +SES QKDAAEA
Subjt:  --------------------FSSCLPP--------------------------RRNNEFRQHGCSNRASSSGF---------------SSESYQKDAAEA

Query:  ALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL
        ALVE S  QKDYGSDDFY++ILKDDIINLD +APY AAS+L PL FHRSD+ ++S+ +A   LEWLP+ G A+RRIRLK+RE  +  AKKL
Subjt:  ALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL

A0A6J1I7V5 NAC domain-containing protein 89-like isoform X12.1e-5448.99Show/hide
Query:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF-----------------NG--------
        +  A+E G+S+E+QLSMAVSSMFPGFRFSPTDEELISFYLKKKL+GYEKSVE+IAE+EIYKYEPWDLP +                    NG        
Subjt:  LVMAAEKGMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCF-----------------NG--------

Query:  ---------------------------FSSCLPP--------------------------RRNNEFRQHGCSNRASSSGF---------------SSESY
                                   F +   P                          RRNNEFRQH  SNRAS S                 +SES 
Subjt:  ---------------------------FSSCLPP--------------------------RRNNEFRQHGCSNRASSSGF---------------SSESY

Query:  QKDAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL
        QKDAAEAALVE S  QKDYGSDDFY++ILKDDIINLD +APY AAS+L PL FHRSD+ ++S+ +A   LEWLP+ G A+RRIRLK+RE  +  AKKL
Subjt:  QKDAAEAALVESSGDQKDYGSDDFYTEILKDDIINLDESAPY-AASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKL

SwissProt top hitse value%identityAlignment
A4VCM0 NAC domain-containing protein 453.5e-1445.26Show/hide
Query:  MAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPR---------RNNEFRQHGCSNRASSSGF
        MA  S+ PGFRF PTDEELI++YLK+K+ G E  +EVIAE+++YK EPWDLPG+       S LP +         R+ ++     +NRA+  G+
Subjt:  MAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPR---------RNNEFRQHGCSNRASSSGF

Q94F58 NAC domain-containing protein 892.6e-1746.74Show/hide
Query:  GMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPRRNNEFRQHGCSNRASS
        G+S++   SM  S++FPGF+FSPTD ELIS+YLK+K++G E+SVEVI ++EIY +EPWDLP +      S         +   HG  NR ++
Subjt:  GMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPRRNNEFRQHGCSNRASS

Q9FFI5 NAC domain-containing protein 862.9e-1342.11Show/hide
Query:  MAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPR---------RNNEFRQHGCSNRASSSGF
        MA  S+ PGFRF PTDEELI++YLK+K+ G E  +E+I E+++YK EPWDLPG+       S +P +         R+ ++     +NRA+  G+
Subjt:  MAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPR---------RNNEFRQHGCSNRASSSGF

Q9LXL9 NAC domain-containing protein 601.4e-1545.05Show/hide
Query:  MSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPRRNNEFRQHGCSNRASS
        M+    +  AV++ FPGF+FSPTD ELIS+YLK+K++G E+SVE+I E+EIY +EPWDLP +      S         +   HG  NR ++
Subjt:  MSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPRRNNEFRQHGCSNRASS

Q9XIN7 NAC domain-containing protein 404.7e-1970Show/hide
Query:  MSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLP
        MS+E ++S+AVS++FPGFRFSPTD ELIS+YL++K++G E SV VIAE+EIYK+EPWDLP
Subjt:  MSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLP

Arabidopsis top hitse value%identityAlignment
AT2G27300.1 NTM1-like 83.3e-2070Show/hide
Query:  MSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLP
        MS+E ++S+AVS++FPGFRFSPTD ELIS+YL++K++G E SV VIAE+EIYK+EPWDLP
Subjt:  MSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLP

AT3G03200.1 NAC domain containing protein 452.5e-1545.26Show/hide
Query:  MAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPR---------RNNEFRQHGCSNRASSSGF
        MA  S+ PGFRF PTDEELI++YLK+K+ G E  +EVIAE+++YK EPWDLPG+       S LP +         R+ ++     +NRA+  G+
Subjt:  MAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPR---------RNNEFRQHGCSNRASSSGF

AT3G44290.1 NAC domain containing protein 601.0e-1645.05Show/hide
Query:  MSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPRRNNEFRQHGCSNRASS
        M+    +  AV++ FPGF+FSPTD ELIS+YLK+K++G E+SVE+I E+EIY +EPWDLP +      S         +   HG  NR ++
Subjt:  MSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPRRNNEFRQHGCSNRASS

AT5G17260.1 NAC domain containing protein 862.1e-1442.11Show/hide
Query:  MAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPR---------RNNEFRQHGCSNRASSSGF
        MA  S+ PGFRF PTDEELI++YLK+K+ G E  +E+I E+++YK EPWDLPG+       S +P +         R+ ++     +NRA+  G+
Subjt:  MAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPR---------RNNEFRQHGCSNRASSSGF

AT5G22290.1 NAC domain containing protein 891.8e-1846.74Show/hide
Query:  GMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPRRNNEFRQHGCSNRASS
        G+S++   SM  S++FPGF+FSPTD ELIS+YLK+K++G E+SVEVI ++EIY +EPWDLP +      S         +   HG  NR ++
Subjt:  GMSREVQLSMAVSSMFPGFRFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPRRNNEFRQHGCSNRASS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAGGCCTGGACGATGGCGGCTCGAACAGTGGGTACAGATGAATCGGTGCCCATGTCCACCTCCGCTATCAGTTGCGCATCGTTAGCCGCCGTGGTCGGCACCAG
CGCCATGATTCTCCTCAAAGCAAGGAATTCCCCCAAGGCTTTTTCCCACCCCACCGCAAACCCACGAAGATACCAAATCTCTGATCTCTCCCCTTTCCCCTTTCGTCTGC
TTCTACTTTTCCGATACGGGAAGACGGGGTTTTTGGTAATGGCTGCTGAGAAGGGGATGTCCCGGGAAGTGCAGCTGTCAATGGCGGTTTCGTCTATGTTTCCCGGTTTT
CGCTTCTCGCCGACCGATGAGGAGCTGATTTCGTTTTATCTCAAGAAGAAATTGGAGGGCTATGAGAAGAGCGTTGAAGTCATTGCGGAGATTGAAATTTACAAGTACGA
ACCCTGGGACTTACCCGGTCAGTGTTGCTTTAATGGATTTTCTAGTTGTTTGCCGCCTCGGAGGAATAATGAATTTCGTCAACATGGTTGCTCAAACCGAGCTTCTTCGA
GTGGATTCTCATCTGAATCCTACCAAAAAGATGCAGCAGAAGCTGCACTTGTGGAATCTTCTGGTGATCAGAAGGATTATGGCTCTGATGATTTCTACACCGAGATACTA
AAAGATGACATTATAAACCTAGACGAGTCCGCACCTTACGCAGCTTCCAATCTATTCCCACTGGTTTTCCATAGATCAGACTCAGAGAAAAAATCTGAGCACGACGCATA
TGGTGTTCTGGAATGGCTACCGAACCATGGCTCTGCGAGTCGAAGAATCAGATTGAAGAAGAGAGAAGTAGGTCGCTGTGGAGCAAAGAAGTTGATGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGAGGCCTGGACGATGGCGGCTCGAACAGTGGGTACAGATGAATCGGTGCCCATGTCCACCTCCGCTATCAGTTGCGCATCGTTAGCCGCCGTGGTCGGCACCAG
CGCCATGATTCTCCTCAAAGCAAGGAATTCCCCCAAGGCTTTTTCCCACCCCACCGCAAACCCACGAAGATACCAAATCTCTGATCTCTCCCCTTTCCCCTTTCGTCTGC
TTCTACTTTTCCGATACGGGAAGACGGGGTTTTTGGTAATGGCTGCTGAGAAGGGGATGTCCCGGGAAGTGCAGCTGTCAATGGCGGTTTCGTCTATGTTTCCCGGTTTT
CGCTTCTCGCCGACCGATGAGGAGCTGATTTCGTTTTATCTCAAGAAGAAATTGGAGGGCTATGAGAAGAGCGTTGAAGTCATTGCGGAGATTGAAATTTACAAGTACGA
ACCCTGGGACTTACCCGGTCAGTGTTGCTTTAATGGATTTTCTAGTTGTTTGCCGCCTCGGAGGAATAATGAATTTCGTCAACATGGTTGCTCAAACCGAGCTTCTTCGA
GTGGATTCTCATCTGAATCCTACCAAAAAGATGCAGCAGAAGCTGCACTTGTGGAATCTTCTGGTGATCAGAAGGATTATGGCTCTGATGATTTCTACACCGAGATACTA
AAAGATGACATTATAAACCTAGACGAGTCCGCACCTTACGCAGCTTCCAATCTATTCCCACTGGTTTTCCATAGATCAGACTCAGAGAAAAAATCTGAGCACGACGCATA
TGGTGTTCTGGAATGGCTACCGAACCATGGCTCTGCGAGTCGAAGAATCAGATTGAAGAAGAGAGAAGTAGGTCGCTGTGGAGCAAAGAAGTTGATGCAATGA
Protein sequenceShow/hide protein sequence
MVEAWTMAARTVGTDESVPMSTSAISCASLAAVVGTSAMILLKARNSPKAFSHPTANPRRYQISDLSPFPFRLLLLFRYGKTGFLVMAAEKGMSREVQLSMAVSSMFPGF
RFSPTDEELISFYLKKKLEGYEKSVEVIAEIEIYKYEPWDLPGQCCFNGFSSCLPPRRNNEFRQHGCSNRASSSGFSSESYQKDAAEAALVESSGDQKDYGSDDFYTEIL
KDDIINLDESAPYAASNLFPLVFHRSDSEKKSEHDAYGVLEWLPNHGSASRRIRLKKREVGRCGAKKLMQ