CuGenDBv2

CmaCh20G010680 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh20G010680
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: VQ domain-containing protein
Genome location: Cma_Chr20:8056516..8057151
RNA-Seq Expression: CmaCh20G010680
Synteny: CmaCh20G010680
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR008889 - VQ
                  IPR039607 - VQ motif-containing protein 8/17/18/20/21/25
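The genome location field can be checked with a few lines of code. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for the `chrom:start..end` notation, but not stated on this page); `parse_location` is a hypothetical helper name introduced only for illustration.

```python
import re


def parse_location(location: str) -> tuple:
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive,
    so length = end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("Cma_Chr20:8056516..8057151")
print(chrom, start, end, length)  # Cma_Chr20 8056516 8057151 636
```

Under that assumption the gene spans 636 bp, which corresponds to 212 codons and is consistent with a 211-residue protein followed by a stop codon.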


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAG6571354.1 Protein MKS1, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.7e-100 | identity: 92.17%
Query:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---
        MNPPGYPAGGSPRKKEIQLQGPRPPQLRV+QDSRKIKKPPPHPQPIPP GRPPLPP    TQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS   
Subjt:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---

Query:  ---GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAP
           GDLSPAARLAAIEKASPRP  EREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAP
Subjt:  ---GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAP

Query:  LVSPISSPNLFSTLFNF
        LVSPISSPNLF+ LF+F
Subjt:  LVSPISSPNLFSTLFNF

XP_022932014.1 protein MKS1-like [Cucurbita moschata] | e-value: 4.5e-101 | identity: 92.17%
Query:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---
        MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPP  RPPLPP AGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS   
Subjt:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---

Query:  ---GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAP
           GDLSPAARLAAIEKASPRP  EREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATEL N  YSLIQELSPHWPSPSALFSAP
Subjt:  ---GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAP

Query:  LVSPISSPNLFSTLFNF
        LVSPISSPNLF+ LF+F
Subjt:  LVSPISSPNLFSTLFNF

XP_022971712.1 protein MKS1-like [Cucurbita maxima] | e-value: 5.6e-112 | identity: 100%
Query:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGDL
        MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGDL
Subjt:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGDL

Query:  SPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAPLVSPIS
        SPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAPLVSPIS
Subjt:  SPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAPLVSPIS

Query:  SPNLFSTLFNF
        SPNLFSTLFNF
Subjt:  SPNLFSTLFNF

XP_023512395.1 protein MKS1-like [Cucurbita pepo subsp. pepo] | e-value: 2.1e-98 | identity: 90.78%
Query:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---
        MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPP GRPPLPP AGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS   
Subjt:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---

Query:  ---GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAP
           GDLSPAARLAAIEKASPRP    EREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALF   
Subjt:  ---GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAP

Query:  LVSPISSPNLFSTLFNF
          SPISSPNLF+ LF+F
Subjt:  LVSPISSPNLFSTLFNF

XP_038901805.1 protein MKS1-like [Benincasa hispida] | e-value: 3.4e-85 | identity: 80.09%
Query:  MNPPGYPAGGS---PRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS
        MNPPGY A GS   PRKKEIQLQGPRPPQLRV+Q+SRKIKKPPPHPQP+PP GRPPLPP  G  QWPQPLIIYDISPKVIHV ENNFMSVVQRLTGLSSS
Subjt:  MNPPGYPAGGS---PRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS

Query:  -------GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSAL
               GDLSPAARLA IEKASPR     EREREI+VS+MMDL E+SVEL Q PGILSPAPA+LAPIPTG+FSPA E Q+L+YSLI ELSPHWPSPSAL
Subjt:  -------GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSAL

Query:  FSAPLVSPISSPNLFSTLFNF
        FSAPLVSPISSPN+F+ LF+F
Subjt:  FSAPLVSPISSPNLFSTLFNF

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A1S3CD17 protein MKS1-like | e-value: 9.1e-84 | identity: 78.9%
Query:  MNPPGYPAGGS---PRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS
        MNPPGY A GS   PRKKEIQLQGPRPPQLRVSQ+SRKIKKPPPHPQP+PP GRPPLPP  G +QWPQPLIIYDISPKVIHV ENNFMSVVQRLTG SS+
Subjt:  MNPPGYPAGGS---PRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS

Query:  ----GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSA
            GDLSPAARLA IEKASPR     EREREI+VS+MMDL E+SVEL Q PGILSPAP +LAPIPTG+FSPA E Q+ SYSL  ELSPHWPSPSALFS 
Subjt:  ----GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSA

Query:  PLVSPISSPNLFSTLFNF
        PL+SPISSPN+F+ LF+F
Subjt:  PLVSPISSPNLFSTLFNF

A0A5A7TBE4 Protein MKS1-like | e-value: 9.1e-84 | identity: 78.9%
Query:  MNPPGYPAGGS---PRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS
        MNPPGY A GS   PRKKEIQLQGPRPPQLRVSQ+SRKIKKPPPHPQP+PP GRPPLPP  G +QWPQPLIIYDISPKVIHV ENNFMSVVQRLTG SS+
Subjt:  MNPPGYPAGGS---PRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS

Query:  ----GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSA
            GDLSPAARLA IEKASPR     EREREI+VS+MMDL E+SVEL Q PGILSPAP +LAPIPTG+FSPA E Q+ SYSL  ELSPHWPSPSALFS 
Subjt:  ----GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSA

Query:  PLVSPISSPNLFSTLFNF
        PL+SPISSPN+F+ LF+F
Subjt:  PLVSPISSPNLFSTLFNF

A0A6J1EVV6 protein MKS1-like | e-value: 2.2e-101 | identity: 92.17%
Query:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---
        MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPP  RPPLPP AGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS   
Subjt:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---

Query:  ---GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAP
           GDLSPAARLAAIEKASPRP  EREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATEL N  YSLIQELSPHWPSPSALFSAP
Subjt:  ---GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAP

Query:  LVSPISSPNLFSTLFNF
        LVSPISSPNLF+ LF+F
Subjt:  LVSPISSPNLFSTLFNF

A0A6J1G715 protein MKS1-like | e-value: 1.0e-82 | identity: 79%
Query:  MNPPGYPAGGS---PRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS
        MNPPG  AGGS   PRKKEIQLQGPRPPQLRV+Q+SRKIKKPPPHPQPIPP+GRPPLPP   A+QWPQPLIIYDISPKV+HV ENNFMSVVQRLTGLSS+
Subjt:  MNPPGYPAGGS---PRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS

Query:  -----GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFS
             GDLSPAAR A IEKASPR     EREREIDVS+MMDL E+ VEL Q PGILSPAPASLAPI +GFFSPA E Q+ +YSLI ELSPHWPSPSALF 
Subjt:  -----GDLSPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFS

Query:  APLVSPISSPNLFSTLFNF
        APLVSPISSPN+F+ LF+F
Subjt:  APLVSPISSPNLFSTLFNF

A0A6J1I2Q6 protein MKS1-like | e-value: 2.7e-112 | identity: 100%
Query:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGDL
        MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGDL
Subjt:  MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGDL

Query:  SPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAPLVSPIS
        SPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAPLVSPIS
Subjt:  SPAARLAAIEKASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAPLVSPIS

Query:  SPNLFSTLFNF
        SPNLFSTLFNF
Subjt:  SPNLFSTLFNF

SwissProt top hits (columns: e-value, % identity, alignment)
F4HWF9 Nuclear speckle RNA-binding protein B | e-value: 4.2e-09 | identity: 33.88%
Query:  GPRPPQLRVSQDSRK-IKKP---PPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS-------------------
        GP+P  L+V  DS K IKKP   PPHPQP PP      P  + +   P P+ IY ++P++IH   NNFM++VQRLTG +S+                   
Subjt:  GPRPPQLRVSQDSRK-IKKP---PPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS-------------------

Query:  ------GDLSPAARLAAIEKASPRPE----------REREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFS
              G +SPAAR A  EKA+   E           ++            + G      +   GILSP P SL  +   FFS
Subjt:  ------GDLSPAARLAAIEKASPRPE----------REREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFS

Q8LGD5 Protein MKS1 | e-value: 6.4e-26 | identity: 41.26%
Query:  MNPPGYPAGGSP-----RKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPP--LPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTG
        M+P  Y AGG+P     +K+++Q+ GPRP  L V +DS KIKKPP HP P P   +PP  +P         +P++IY +SPKV+H   + FM+VVQRLTG
Subjt:  MNPPGYPAGGSP-----RKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPP--LPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTG

Query:  LSS--------SGDLSPAARLAAIEKASPRPERE-REREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHW
        +SS         GD+SPAARLA+ E ASPR  +E   R+  ++++  M+  E +      PGILSP+PA L    TG FSP      +    I       
Subjt:  LSS--------SGDLSPAARLAAIEKASPRPERE-REREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHW

Query:  PSPSALFS-APLVSPISSPNLFS
          P  LFS A  +SP  SP   S
Subjt:  PSPSALFS-APLVSPISSPNLFS

Q9LS54 VQ motif-containing protein 20 | e-value: 2.4e-04 | identity: 37.18%
Query:  PRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGD
        P PP L+V++DS  IKK PP P       +P             P+IIY  +P++IH    +FM++VQ+LTG++ S +
Subjt:  PRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGD

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G21320.1 nucleotide binding; nucleic acid binding | e-value: 3.0e-10 | identity: 33.88%
Query:  GPRPPQLRVSQDSRK-IKKP---PPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS-------------------
        GP+P  L+V  DS K IKKP   PPHPQP PP      P  + +   P P+ IY ++P++IH   NNFM++VQRLTG +S+                   
Subjt:  GPRPPQLRVSQDSRK-IKKP---PPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS-------------------

Query:  ------GDLSPAARLAAIEKASPRPE----------REREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFS
              G +SPAAR A  EKA+   E           ++            + G      +   GILSP P SL  +   FFS
Subjt:  ------GDLSPAARLAAIEKASPRPE----------REREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFS

AT1G21326.1 VQ motif-containing protein | e-value: 2.7e-11 | identity: 33.2%
Query:  SPRKKEIQLQGPRPPQLRVSQDSRK-IKKP---PPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---------
        SPR + I   GPRP  L+V  DS K IKKP   PPHPQP PP      P  + +   P P+IIY +SP++IH   NNFM++VQRLTG +S+         
Subjt:  SPRKKEIQLQGPRPPQLRVSQDSRK-IKKP---PPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSS---------

Query:  ----------------GDLSPAARLAAIEKASPRPE---------------------REREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPT
                        G +SPAAR A  EKA+   E                       +E++ +    E           +   GILSP P SL  +  
Subjt:  ----------------GDLSPAARLAAIEKASPRPE---------------------REREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPT

Query:  GFFSP--ATELQNLSYSLIQELSPHWPSPSALFSAPLVSPISSPNLFSTLFNF
         FFS    T+ Q  S S   + +      S L S+P  S I SP+      NF
Subjt:  GFFSP--ATELQNLSYSLIQELSPHWPSPSALFSAPLVSPISSPNLFSTLFNF

AT3G18360.1 VQ motif-containing protein | e-value: 1.7e-05 | identity: 37.18%
Query:  PRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGD
        P PP L+V++DS  IKK PP P       +P             P+IIY  +P++IH    +FM++VQ+LTG++ S +
Subjt:  PRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGD

AT3G18690.1 MAP kinase substrate 1 | e-value: 4.6e-27 | identity: 41.26%
Query:  MNPPGYPAGGSP-----RKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPP--LPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTG
        M+P  Y AGG+P     +K+++Q+ GPRP  L V +DS KIKKPP HP P P   +PP  +P         +P++IY +SPKV+H   + FM+VVQRLTG
Subjt:  MNPPGYPAGGSP-----RKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPP--LPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTG

Query:  LSS--------SGDLSPAARLAAIEKASPRPERE-REREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHW
        +SS         GD+SPAARLA+ E ASPR  +E   R+  ++++  M+  E +      PGILSP+PA L    TG FSP      +    I       
Subjt:  LSS--------SGDLSPAARLAAIEKASPRPERE-REREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHW

Query:  PSPSALFS-APLVSPISSPNLFS
          P  LFS A  +SP  SP   S
Subjt:  PSPSALFS-APLVSPISSPNLFS
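
The % identity values reported for the hits above can be reproduced from the middle line of each alignment. This is a minimal sketch, assuming the standard BLAST convention in which the midline shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap, and assuming identity is the number of identical positions divided by the alignment length (gap columns included). `percent_identity` is a hypothetical helper. The alignment length can be passed explicitly, because trailing spaces in a midline are easily lost in plain text.

```python
from typing import Optional


def percent_identity(midline: str, alignment_length: Optional[int] = None) -> float:
    """Percent identity from a BLAST-style midline.

    Letters in the midline mark identical residues; '+' and spaces do not count.
    If alignment_length is omitted, the midline length is used.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    length = len(midline) if alignment_length is None else alignment_length
    if length <= 0 or identities > length:
        raise ValueError("alignment length must be positive and >= identities")
    return round(100.0 * identities / length, 2)


# Toy midline: 6 identical residues over 8 aligned columns.
print(percent_identity("MNPP Y+A"))  # 75.0
```

For the Q9LS54 hit, counting 29 identical positions over 78 aligned columns gives 37.18, in line with the reported value.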


Sequences
CDS sequence
ATGAACCCGCCGGGATATCCCGCCGGCGGAAGCCCCAGAAAGAAGGAGATCCAGCTACAGGGTCCTCGCCCGCCGCAACTCCGAGTCAGTCAAGATTCCCGCAAAATCAA
GAAGCCGCCGCCACACCCTCAGCCGATTCCCCCCACCGGACGCCCACCTCTCCCACCATGCGCCGGCGCCACCCAATGGCCTCAGCCACTCATCATCTACGACATCTCCC
CCAAAGTCATCCACGTCCCGGAGAACAACTTCATGTCCGTCGTGCAACGACTCACTGGACTCTCCTCCTCCGGCGACCTCTCCCCAGCGGCGAGGCTGGCCGCAATCGAG
AAGGCCAGTCCCCGACCGGAAAGAGAAAGAGAAAGAGAAAGAGAAATCGACGTTAGCGAGATGATGGATTTGGGGGAGATTTCGGTGGAGTTATGGCAAACCCCCGGAAT
CCTATCTCCGGCTCCGGCAAGTCTTGCTCCGATTCCGACAGGGTTTTTCTCGCCGGCGACTGAGCTTCAAAATTTGTCGTATTCGTTAATTCAGGAGCTGAGCCCTCATT
GGCCAAGCCCTTCTGCTCTGTTTTCAGCTCCTCTGGTTTCCCCAATTTCTTCACCAAATCTTTTCTCCACTCTCTTTAATTTCTAG
mRNA sequence
ATGAACCCGCCGGGATATCCCGCCGGCGGAAGCCCCAGAAAGAAGGAGATCCAGCTACAGGGTCCTCGCCCGCCGCAACTCCGAGTCAGTCAAGATTCCCGCAAAATCAA
GAAGCCGCCGCCACACCCTCAGCCGATTCCCCCCACCGGACGCCCACCTCTCCCACCATGCGCCGGCGCCACCCAATGGCCTCAGCCACTCATCATCTACGACATCTCCC
CCAAAGTCATCCACGTCCCGGAGAACAACTTCATGTCCGTCGTGCAACGACTCACTGGACTCTCCTCCTCCGGCGACCTCTCCCCAGCGGCGAGGCTGGCCGCAATCGAG
AAGGCCAGTCCCCGACCGGAAAGAGAAAGAGAAAGAGAAAGAGAAATCGACGTTAGCGAGATGATGGATTTGGGGGAGATTTCGGTGGAGTTATGGCAAACCCCCGGAAT
CCTATCTCCGGCTCCGGCAAGTCTTGCTCCGATTCCGACAGGGTTTTTCTCGCCGGCGACTGAGCTTCAAAATTTGTCGTATTCGTTAATTCAGGAGCTGAGCCCTCATT
GGCCAAGCCCTTCTGCTCTGTTTTCAGCTCCTCTGGTTTCCCCAATTTCTTCACCAAATCTTTTCTCCACTCTCTTTAATTTCTAG
Protein sequence
MNPPGYPAGGSPRKKEIQLQGPRPPQLRVSQDSRKIKKPPPHPQPIPPTGRPPLPPCAGATQWPQPLIIYDISPKVIHVPENNFMSVVQRLTGLSSSGDLSPAARLAAIE
KASPRPEREREREREIDVSEMMDLGEISVELWQTPGILSPAPASLAPIPTGFFSPATELQNLSYSLIQELSPHWPSPSALFSAPLVSPISSPNLFSTLFNF
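
The protein sequence follows from the CDS by ordinary translation. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and stops at the first stop codon. `translate` is an illustrative helper, not part of any CuGenDB tooling, and only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAACCCGCCGGGATATCCC"))  # first seven codons -> MNPPGYP
print(translate("AATTTCTAG"))              # last three codons  -> NF
```

Passing the full CDS listed above (with its line breaks) to `translate` should give the protein sequence shown here, assuming the wrapped lines are concatenated in order.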