; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G017770 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G017770
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionVQ domain-containing protein
Genome locationCicolChr02:614344..614982
RNA-Seq ExpressionCcUC02G017770
SyntenyCcUC02G017770
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR008889 - VQ
IPR039607 - VQ motif-containing protein 8/17/18/20/21/25


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605314.1 Protein MKS1, partial [Cucurbita argyrosperma subsp. sororia]1.4e-9487.44Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  +QWPQPLIIYDISPKV+HVAENNFMSVVQRLTGLSSAA 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-

Query:  ATDGDLSPAARLATIEKASPRS--EREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV
        +TDGDLSPAAR ATIEKASPRS  EREREI+VSDMMDL EVPVELGQ PGILSPAPA+LAPI +G+FSPAIE QS  YSLIHELSPHWPSPSALF APLV
Subjt:  ATDGDLSPAARLATIEKASPRS--EREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV

Query:  SPISSPNIFNNLFDF
        SPISSPNIFN+LFDF
Subjt:  SPISSPNIFNNLFDF

XP_008460908.1 PREDICTED: protein MKS1-like [Cucumis melo]7.4e-10493.87Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPP GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHWPSPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFDF
        SSPNIFNNLFDF
Subjt:  SSPNIFNNLFDF

XP_011649396.1 protein MKS1 [Cucumis sativus]2.6e-10192.89Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVP  GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHW SPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFD
        SSPNIFNNLFD
Subjt:  SSPNIFNNLFD

XP_022947651.1 protein MKS1-like [Cucurbita moschata]4.4e-9688.26Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  +QWPQPLIIYDISPKV+HVAENNFMSVVQRLTGLSSAA 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-

Query:  ATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP
        +TDGDLSPAAR ATIEKASPRSEREREI+VSDMMDL EVPVELGQ PGILSPAPA+LAPI +G+FSPAIE QS  YSLIHELSPHWPSPSALF APLVSP
Subjt:  ATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP

Query:  ISSPNIFNNLFDF
        ISSPNIFN+LFDF
Subjt:  ISSPNIFNNLFDF

XP_038901805.1 protein MKS1-like [Benincasa hispida]2.9e-10897.67Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSS---
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSS   
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSS---

Query:  AAATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV
        AAATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV VELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV
Subjt:  AAATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV

Query:  SPISSPNIFNNLFDF
        SPISSPNIFNNLFDF
Subjt:  SPISSPNIFNNLFDF

TrEMBL top hitse value%identityAlignment
A0A0A0LK21 VQ domain-containing protein1.3e-10192.89Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVP  GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHW SPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFD
        SSPNIFNNLFD
Subjt:  SSPNIFNNLFD

A0A1S3CD17 protein MKS1-like3.6e-10493.87Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPP GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHWPSPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFDF
        SSPNIFNNLFDF
Subjt:  SSPNIFNNLFDF

A0A5A7TBE4 Protein MKS1-like3.6e-10493.87Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPP GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHWPSPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFDF
        SSPNIFNNLFDF
Subjt:  SSPNIFNNLFDF

A0A6J1G715 protein MKS1-like2.1e-9688.26Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  +QWPQPLIIYDISPKV+HVAENNFMSVVQRLTGLSSAA 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-

Query:  ATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP
        +TDGDLSPAAR ATIEKASPRSEREREI+VSDMMDL EVPVELGQ PGILSPAPA+LAPI +G+FSPAIE QS  YSLIHELSPHWPSPSALF APLVSP
Subjt:  ATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP

Query:  ISSPNIFNNLFDF
        ISSPNIFN+LFDF
Subjt:  ISSPNIFNNLFDF

A0A6J1L2G0 protein MKS1-like1.1e-9285.92Show/hide
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  +QWPQPLIIYDISPKV+HVAENNFMSVVQRLTGLSSA++
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  T-DGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP
        + DGDLSPAAR ATIEKASPRS  EREI+VSDMMDL EVPVELGQ PGILSPAPA+LAPI +G+FSP IE QS  YSLIHELSPHWPSPSALF APLVSP
Subjt:  T-DGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP

Query:  ISSPNIFNNLFDF
        ISSPNIFN+LFDF
Subjt:  ISSPNIFNNLFDF

SwissProt top hitse value%identityAlignment
F4HWF9 Nuclear speckle RNA-binding protein B3.8e-1035Show/hide
Query:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-----------------
        GP+P  L+V  +S K IKKP   PPHPQP PP      P      P P+ IY ++P++IH   NNFM++VQRLTG +S + T                  
Subjt:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-----------------

Query:  ----GDLSPAARLATIEKASPRSEREREINVSDMMDL------AEVPVELGQI-------PGILSPAPATLAPIPTGYFS
            G +SPAAR A  EKA+  +E    +     MD        E P +            GILSP P +L  +   +FS
Subjt:  ----GDLSPAARLATIEKASPRSEREREINVSDMMDL------AEVPVELGQI-------PGILSPAPATLAPIPTGYFS

Q8LGD5 Protein MKS11.3e-2643.46Show/hide
Query:  MNPPGYSAAGSPTTP--RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSA
        M+P  Y A G+P+    +K+++Q+ GPRP  L V ++S KIKKPP HP P P   +PP    P    +P++IY +SPKV+H   + FM+VVQRLTG+SS 
Subjt:  MNPPGYSAAGSPTTP--RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSA

Query:  AATD----GDLSPAARLATIEKASPRSERE---REINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPS-PSA
           +    GD+SPAARLA+ E ASPR  +E   R+  V     + E     G  PGILSP+PA L    TG FSP + HQ   +S         P+ P  
Subjt:  AATD----GDLSPAARLATIEKASPRSERE---REINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPS-PSA

Query:  LFS-APLVSPISSP
        LFS A  +SP  SP
Subjt:  LFS-APLVSPISSP

Arabidopsis top hitse value%identityAlignment
AT1G21320.1 nucleotide binding;nucleic acid binding2.7e-1135Show/hide
Query:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-----------------
        GP+P  L+V  +S K IKKP   PPHPQP PP      P      P P+ IY ++P++IH   NNFM++VQRLTG +S + T                  
Subjt:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-----------------

Query:  ----GDLSPAARLATIEKASPRSEREREINVSDMMDL------AEVPVELGQI-------PGILSPAPATLAPIPTGYFS
            G +SPAAR A  EKA+  +E    +     MD        E P +            GILSP P +L  +   +FS
Subjt:  ----GDLSPAARLATIEKASPRSEREREINVSDMMDL------AEVPVELGQI-------PGILSPAPATLAPIPTGYFS

AT1G21326.1 VQ motif-containing protein2.4e-1233.88Show/hide
Query:  TPRKKEIQLQGPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-------
        +PR + I   GPRP  L+V  +S K IKKP   PPHPQP PP      P      P P+IIY +SP++IH   NNFM++VQRLTG +S + T        
Subjt:  TPRKKEIQLQGPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-------

Query:  --------------GDLSPAARLATIEKASPRSEREREINVSDMMD-----------LAEVPVELG------QIPGILSPAPATLAPIPTGYFSPAIEHQ
                      G +SPAAR A  EKA+  +E    +     MD             +     G         GILSP P +L  +   +FS      
Subjt:  --------------GDLSPAARLATIEKASPRSEREREINVSDMMD-----------LAEVPVELG------QIPGILSPAPATLAPIPTGYFSPAIEHQ

Query:  SLAYSLIHELSPHWPSPSALFSAP--LVSPISSPNIFNNLFD
           +S            S L S+P  + SP SS ++FNN FD
Subjt:  SLAYSLIHELSPHWPSPSALFSAP--LVSPISSPNIFNNLFD

AT3G18360.1 VQ motif-containing protein3.2e-0437.5Show/hide
Query:  PRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLS
        P PP L+V+++S  IKK PP P       +P           P+IIY  +P++IH    +FM++VQ+LTG++
Subjt:  PRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLS

AT3G18690.1 MAP kinase substrate 19.2e-2843.46Show/hide
Query:  MNPPGYSAAGSPTTP--RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSA
        M+P  Y A G+P+    +K+++Q+ GPRP  L V ++S KIKKPP HP P P   +PP    P    +P++IY +SPKV+H   + FM+VVQRLTG+SS 
Subjt:  MNPPGYSAAGSPTTP--RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSA

Query:  AATD----GDLSPAARLATIEKASPRSERE---REINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPS-PSA
           +    GD+SPAARLA+ E ASPR  +E   R+  V     + E     G  PGILSP+PA L    TG FSP + HQ   +S         P+ P  
Subjt:  AATD----GDLSPAARLATIEKASPRSERE---REINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPS-PSA

Query:  LFS-APLVSPISSP
        LFS A  +SP  SP
Subjt:  LFS-APLVSPISSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCGCCGGGATATTCTGCCGCCGGCAGCCCCACCACCCCAAGAAAGAAGGAGATCCAGCTACAAGGCCCCCGCCCGCCGCAACTCCGCGTCAGTCAAGAATCCCG
CAAAATCAAGAAGCCGCCGCCGCACCCTCAGCCGGTTCCCCCCGGCGGCCGGCCTCCTCTCCCGCCGGGCCCCGCCCAATGGCCTCAGCCCCTCATCATCTACGACATCT
CCCCCAAAGTCATCCACGTCGCCGAGAACAACTTCATGTCCGTCGTCCAGCGCCTCACTGGGCTGTCCTCCGCCGCCGCCACCGACGGCGACCTGTCCCCGGCAGCAAGG
CTGGCCACAATCGAGAAAGCCAGTCCCCGATCCGAAAGAGAAAGAGAAATCAACGTTAGCGACATGATGGATTTGGCGGAGGTTCCGGTGGAATTAGGGCAAATTCCCGG
AATCCTATCACCAGCTCCGGCAACTCTGGCTCCGATTCCGACAGGGTATTTCTCGCCGGCGATTGAACATCAAAGCTTGGCGTATTCTTTGATTCACGAGCTGAGTCCTC
ATTGGCCAAGCCCTTCTGCTCTGTTTTCAGCTCCTCTTGTTTCCCCAATTTCTTCACCAAATATCTTCAACAATCTCTTTGACTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCCGCCGGGATATTCTGCCGCCGGCAGCCCCACCACCCCAAGAAAGAAGGAGATCCAGCTACAAGGCCCCCGCCCGCCGCAACTCCGCGTCAGTCAAGAATCCCG
CAAAATCAAGAAGCCGCCGCCGCACCCTCAGCCGGTTCCCCCCGGCGGCCGGCCTCCTCTCCCGCCGGGCCCCGCCCAATGGCCTCAGCCCCTCATCATCTACGACATCT
CCCCCAAAGTCATCCACGTCGCCGAGAACAACTTCATGTCCGTCGTCCAGCGCCTCACTGGGCTGTCCTCCGCCGCCGCCACCGACGGCGACCTGTCCCCGGCAGCAAGG
CTGGCCACAATCGAGAAAGCCAGTCCCCGATCCGAAAGAGAAAGAGAAATCAACGTTAGCGACATGATGGATTTGGCGGAGGTTCCGGTGGAATTAGGGCAAATTCCCGG
AATCCTATCACCAGCTCCGGCAACTCTGGCTCCGATTCCGACAGGGTATTTCTCGCCGGCGATTGAACATCAAAGCTTGGCGTATTCTTTGATTCACGAGCTGAGTCCTC
ATTGGCCAAGCCCTTCTGCTCTGTTTTCAGCTCCTCTTGTTTCCCCAATTTCTTCACCAAATATCTTCAACAATCTCTTTGACTTTTAG
Protein sequenceShow/hide protein sequence
MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATDGDLSPAAR
LATIEKASPRSEREREINVSDMMDLAEVPVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPISSPNIFNNLFDF