; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G006260 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G006260
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionVQ domain-containing protein
Genome locationCmo_Chr02:3889510..3890151
RNA-Seq ExpressionCmoCh02G006260
SyntenyCmoCh02G006260
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR008889 - VQ
IPR039607 - VQ motif-containing protein 8/17/18/20/21/25


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605314.1 Protein MKS1, partial [Cucurbita argyrosperma subsp. sororia]4.1e-11099.07Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
        MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS

Query:  STDGDLSPAARFATIEKASPRS--EREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLV
        STDGDLSPAARFATIEKASPRS  EREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLV
Subjt:  STDGDLSPAARFATIEKASPRS--EREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLV

Query:  SPISSPNIFNHLFDF
        SPISSPNIFNHLFDF
Subjt:  SPISSPNIFNHLFDF

XP_022947651.1 protein MKS1-like [Cucurbita moschata]1.3e-111100Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
        MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS

Query:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
        STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
Subjt:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP

Query:  ISSPNIFNHLFDF
        ISSPNIFNHLFDF
Subjt:  ISSPNIFNHLFDF

XP_023007245.1 protein MKS1-like [Cucurbita maxima]5.5e-10797.18Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
        MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSA+S
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS

Query:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
        S DGDLSPAARFATIEKASPRS  EREIDVSDMMDLTEVPVELGQ PGILSPAPASLAPISSGFFSP IEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
Subjt:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP

Query:  ISSPNIFNHLFDF
        ISSPNIFNHLFDF
Subjt:  ISSPNIFNHLFDF

XP_023533977.1 protein MKS1-like [Cucurbita pepo subsp. pepo]7.0e-11098.59Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
        MNP GSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSS +S
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS

Query:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
        STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
Subjt:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP

Query:  ISSPNIFNHLFDF
        ISSPNIFNHLFDF
Subjt:  ISSPNIFNHLFDF

XP_038901805.1 protein MKS1-like [Benincasa hispida]1.3e-9587.44Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGL--SSA
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQP+PP GRPPLPP  +QWPQPLIIYDISPKV+HVAENNFMSVVQRLTGL  SSA
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGL--SSA

Query:  ASSTDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLV
        A++TDGDLSPAAR ATIEKASPRSEREREI+VSDMMDL EV VELGQ PGILSPAPA+LAPI +G+FSPAIE QS  YSLIHELSPHWPSPSALF APLV
Subjt:  ASSTDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLV

Query:  SPISSPNIFNHLFDF
        SPISSPNIFN+LFDF
Subjt:  SPISSPNIFNHLFDF

TrEMBL top hitse value%identityAlignment
A0A0A0LK21 VQ domain-containing protein4.1e-9285.38Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+P  GRPPLPP  SQWPQPLIIYDISPKV+HVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS

Query:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
         TDGDLSPAAR ATIEKASPRSEREREI+VSDMMDL EV VELGQ PGILSPAP +LAPI +G+FSPAIEPQSF+YSL HELSPHW SPSALF  PL+SP
Subjt:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP

Query:  ISSPNIFNHLFD
        ISSPNIFN+LFD
Subjt:  ISSPNIFNHLFD

A0A1S3CD17 protein MKS1-like1.5e-9486.38Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  SQWPQPLIIYDISPKV+HVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS

Query:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
         TDGDLSPAAR ATIEKASPRSEREREI+VSDMMDL EV VELGQ PGILSPAP +LAPI +G+FSPAIEPQSF+YSL HELSPHWPSPSALF  PL+SP
Subjt:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP

Query:  ISSPNIFNHLFDF
        ISSPNIFN+LFDF
Subjt:  ISSPNIFNHLFDF

A0A5A7TBE4 Protein MKS1-like1.5e-9486.38Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  SQWPQPLIIYDISPKV+HVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS

Query:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
         TDGDLSPAAR ATIEKASPRSEREREI+VSDMMDL EV VELGQ PGILSPAP +LAPI +G+FSPAIEPQSF+YSL HELSPHWPSPSALF  PL+SP
Subjt:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP

Query:  ISSPNIFNHLFDF
        ISSPNIFN+LFDF
Subjt:  ISSPNIFNHLFDF

A0A6J1G715 protein MKS1-like6.1e-112100Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
        MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS

Query:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
        STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
Subjt:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP

Query:  ISSPNIFNHLFDF
        ISSPNIFNHLFDF
Subjt:  ISSPNIFNHLFDF

A0A6J1L2G0 protein MKS1-like2.7e-10797.18Show/hide
Query:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS
        MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSA+S
Subjt:  MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAAS

Query:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
        S DGDLSPAARFATIEKASPRS  EREIDVSDMMDLTEVPVELGQ PGILSPAPASLAPISSGFFSP IEPQSFTYSLIHELSPHWPSPSALFPAPLVSP
Subjt:  STDGDLSPAARFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSP

Query:  ISSPNIFNHLFDF
        ISSPNIFNHLFDF
Subjt:  ISSPNIFNHLFDF

SwissProt top hitse value%identityAlignment
F4HWF9 Nuclear speckle RNA-binding protein B1.7e-1337.95Show/hide
Query:  GPRPPQLRVNQESRK-IKKP---PPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTG-------LSSAASSTD---------
        GP+P  L+V  +S K IKKP   PPHPQP PP      P  +   P P+ IY ++P+++H   NNFM++VQRLTG        SS++SST          
Subjt:  GPRPPQLRVNQESRK-IKKP---PPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTG-------LSSAASSTD---------

Query:  ----GDLSPAARFATIEKASPRSEREREIDVSDMMDL------TEVPVE-------LGQNPGILSPAPASLAPISSGFFSP--AIEPQSFTYSLI
            G +SPAARFA  EKA+  +E    +     MD        E P +          + GILSP P SL  +S  FFS     +PQ  T  LI
Subjt:  ----GDLSPAARFATIEKASPRSEREREIDVSDMMDL------TEVPVE-------LGQNPGILSPAPASLAPISSGFFSP--AIEPQSFTYSLI

Q8LGD5 Protein MKS16.5e-2640.17Show/hide
Query:  MNPPGSSAGGSPNTP--RKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSA
        M+P    AGG+P+    +K+++Q+ GPRP  L V+++S KIKKPP HP P P   +P  PP   +  +P++IY +SPKV+H   + FM+VVQRLTG+SS 
Subjt:  MNPPGSSAGGSPNTP--RKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSA

Query:  A---SSTDGDLSPAARFATIEKASPRSERE---REIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPS-PSA
            S   GD+SPAAR A+ E ASPR  +E   R+  V     + E     G  PGILSP+PA L   S+G FSP          + H+     P+ P  
Subjt:  A---SSTDGDLSPAARFATIEKASPRSERE---REIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPS-PSA

Query:  LF-PAPLVSPISSP------------NIFNHLFD
        LF PA  +SP  SP            + F+H++D
Subjt:  LF-PAPLVSPISSP------------NIFNHLFD

Arabidopsis top hitse value%identityAlignment
AT1G21320.1 nucleotide binding;nucleic acid binding1.2e-1437.95Show/hide
Query:  GPRPPQLRVNQESRK-IKKP---PPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTG-------LSSAASSTD---------
        GP+P  L+V  +S K IKKP   PPHPQP PP      P  +   P P+ IY ++P+++H   NNFM++VQRLTG        SS++SST          
Subjt:  GPRPPQLRVNQESRK-IKKP---PPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTG-------LSSAASSTD---------

Query:  ----GDLSPAARFATIEKASPRSEREREIDVSDMMDL------TEVPVE-------LGQNPGILSPAPASLAPISSGFFSP--AIEPQSFTYSLI
            G +SPAARFA  EKA+  +E    +     MD        E P +          + GILSP P SL  +S  FFS     +PQ  T  LI
Subjt:  ----GDLSPAARFATIEKASPRSEREREIDVSDMMDL------TEVPVE-------LGQNPGILSPAPASLAPISSGFFSP--AIEPQSFTYSLI

AT1G21326.1 VQ motif-containing protein1.6e-1636.78Show/hide
Query:  TPRKKEIQLQGPRPPQLRVNQESRK-IKKP---PPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTG-------LSSAASST
        +PR + I   GPRP  L+V  +S K IKKP   PPHPQP PP      P  +   P P+IIY +SP+++H   NNFM++VQRLTG        SS +SST
Subjt:  TPRKKEIQLQGPRPPQLRVNQESRK-IKKP---PPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTG-------LSSAASST

Query:  D-------------GDLSPAARFATIEKASPRSEREREIDVSDMMDL-------TEVPVELGQN----------PGILSPAPASLAPISSGFFSP--AIE
                      G +SPAARFA  EKA+  +E    +     MD             +  QN           GILSP P SL  +S  FFS     +
Subjt:  D-------------GDLSPAARFATIEKASPRSEREREIDVSDMMDL-------TEVPVELGQN----------PGILSPAPASLAPISSGFFSP--AIE

Query:  PQSFTYSLIHELSPHWPSPSALFPAPLVSPISSPNIFNHLFD
        PQ F+ S  ++ +    S     P+ + SP SS ++FN+ FD
Subjt:  PQSFTYSLIHELSPHWPSPSALFPAPLVSPISSPNIFNHLFD

AT3G18360.1 VQ motif-containing protein1.1e-0435Show/hide
Query:  PRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAASSTDG
        P PP L+VN++S  IKK PP P     + +P           P+IIY  +P+++H    +FM++VQ+LTG++ +     G
Subjt:  PRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAASSTDG

AT3G18690.1 MAP kinase substrate 14.6e-2740.17Show/hide
Query:  MNPPGSSAGGSPNTP--RKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSA
        M+P    AGG+P+    +K+++Q+ GPRP  L V+++S KIKKPP HP P P   +P  PP   +  +P++IY +SPKV+H   + FM+VVQRLTG+SS 
Subjt:  MNPPGSSAGGSPNTP--RKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSA

Query:  A---SSTDGDLSPAARFATIEKASPRSERE---REIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPS-PSA
            S   GD+SPAAR A+ E ASPR  +E   R+  V     + E     G  PGILSP+PA L   S+G FSP          + H+     P+ P  
Subjt:  A---SSTDGDLSPAARFATIEKASPRSERE---REIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPS-PSA

Query:  LF-PAPLVSPISSP------------NIFNHLFD
        LF PA  +SP  SP            + F+H++D
Subjt:  LF-PAPLVSPISSP------------NIFNHLFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCCGCCCGGCTCCTCCGCCGGCGGCAGCCCCAACACCCCAAGAAAGAAGGAGATCCAGCTACAGGGCCCCCGCCCGCCTCAACTCCGTGTCAACCAAGAATCCCG
CAAAATCAAGAAGCCGCCGCCGCACCCTCAGCCGATTCCCCCTTCTGGCCGCCCTCCTCTCCCTCCAGCCGCCTCCCAATGGCCTCAACCACTCATCATCTACGACATCT
CCCCCAAAGTCCTCCACGTCGCCGAGAACAACTTCATGTCCGTCGTGCAGCGCCTCACCGGCCTCTCCTCCGCCGCTTCCTCCACCGACGGCGACCTCTCCCCAGCAGCG
AGATTCGCCACAATCGAAAAAGCCAGCCCCCGATCCGAAAGGGAAAGGGAAATCGACGTTAGCGACATGATGGATTTGACGGAGGTTCCCGTGGAATTAGGGCAAAACCC
CGGAATTTTATCTCCAGCACCGGCGAGTCTAGCTCCGATTTCGTCAGGATTCTTCTCGCCGGCGATTGAGCCTCAAAGCTTTACATATTCGTTGATTCACGAGCTGAGCC
CCCATTGGCCAAGCCCTTCGGCTCTGTTCCCAGCTCCCCTGGTTTCCCCAATTTCTTCACCAAATATCTTCAACCATCTCTTTGACTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACCCGCCCGGCTCCTCCGCCGGCGGCAGCCCCAACACCCCAAGAAAGAAGGAGATCCAGCTACAGGGCCCCCGCCCGCCTCAACTCCGTGTCAACCAAGAATCCCG
CAAAATCAAGAAGCCGCCGCCGCACCCTCAGCCGATTCCCCCTTCTGGCCGCCCTCCTCTCCCTCCAGCCGCCTCCCAATGGCCTCAACCACTCATCATCTACGACATCT
CCCCCAAAGTCCTCCACGTCGCCGAGAACAACTTCATGTCCGTCGTGCAGCGCCTCACCGGCCTCTCCTCCGCCGCTTCCTCCACCGACGGCGACCTCTCCCCAGCAGCG
AGATTCGCCACAATCGAAAAAGCCAGCCCCCGATCCGAAAGGGAAAGGGAAATCGACGTTAGCGACATGATGGATTTGACGGAGGTTCCCGTGGAATTAGGGCAAAACCC
CGGAATTTTATCTCCAGCACCGGCGAGTCTAGCTCCGATTTCGTCAGGATTCTTCTCGCCGGCGATTGAGCCTCAAAGCTTTACATATTCGTTGATTCACGAGCTGAGCC
CCCATTGGCCAAGCCCTTCGGCTCTGTTCCCAGCTCCCCTGGTTTCCCCAATTTCTTCACCAAATATCTTCAACCATCTCTTTGACTTTTAG
Protein sequenceShow/hide protein sequence
MNPPGSSAGGSPNTPRKKEIQLQGPRPPQLRVNQESRKIKKPPPHPQPIPPSGRPPLPPAASQWPQPLIIYDISPKVLHVAENNFMSVVQRLTGLSSAASSTDGDLSPAA
RFATIEKASPRSEREREIDVSDMMDLTEVPVELGQNPGILSPAPASLAPISSGFFSPAIEPQSFTYSLIHELSPHWPSPSALFPAPLVSPISSPNIFNHLFDF