CuGenDBv2

CaUC02G026370 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC02G026370
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: VQ domain-containing protein
Genome location: Ciama_Chr02:700631..701269
RNA-Seq Expression: CaUC02G026370
Synteny: CaUC02G026370
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR008889 - VQ; IPR039607 - VQ motif-containing protein 8/17/18/20/21/25


Homology
GenBank top hits (e value, % identity, alignment)
KAG6605314.1 Protein MKS1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.2e-93 | % identity: 86.98
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  +QWPQPLIIYDISPKV+HVAENNFMSVVQRLTGLSSAA 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-

Query:  ATDGDLSPAARLATIEKASPRS--EREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV
        +TDGDLSPAAR ATIEKASPRS  EREREI+VSDMMDL EV VELGQ PGILSPAPA+LAPI +G+FSPAIE QS  YSLIHELSPHWPSPSALF APLV
Subjt:  ATDGDLSPAARLATIEKASPRS--EREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV

Query:  SPISSPNIFNNLFDF
        SPISSPNIFN+LFDF
Subjt:  SPISSPNIFNNLFDF

XP_008460908.1 PREDICTED: protein MKS1-like [Cucumis melo] | e value: 4.3e-104 | % identity: 93.87
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPP GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV+VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHWPSPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFDF
        SSPNIFNNLFDF
Subjt:  SSPNIFNNLFDF

XP_011649396.1 protein MKS1 [Cucumis sativus] | e value: 1.2e-101 | % identity: 92.89
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVP  GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV+VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHW SPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFD
        SSPNIFNNLFD
Subjt:  SSPNIFNNLFD

XP_022947651.1 protein MKS1-like [Cucurbita moschata] | e value: 3.7e-95 | % identity: 87.79
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  +QWPQPLIIYDISPKV+HVAENNFMSVVQRLTGLSSAA 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-

Query:  ATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP
        +TDGDLSPAAR ATIEKASPRSEREREI+VSDMMDL EV VELGQ PGILSPAPA+LAPI +G+FSPAIE QS  YSLIHELSPHWPSPSALF APLVSP
Subjt:  ATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP

Query:  ISSPNIFNNLFDF
        ISSPNIFN+LFDF
Subjt:  ISSPNIFNNLFDF

XP_038901805.1 protein MKS1-like [Benincasa hispida] | e value: 1.7e-108 | % identity: 97.67
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSS---
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSS   
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSS---

Query:  AAATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV
        AAATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV+VELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV
Subjt:  AAATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLV

Query:  SPISSPNIFNNLFDF
        SPISSPNIFNNLFDF
Subjt:  SPISSPNIFNNLFDF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LK21 VQ domain-containing protein | e value: 5.7e-102 | % identity: 92.89
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVP  GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV+VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHW SPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFD
        SSPNIFNNLFD
Subjt:  SSPNIFNNLFD

A0A1S3CD17 protein MKS1-like | e value: 2.1e-104 | % identity: 93.87
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPP GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV+VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHWPSPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFDF
        SSPNIFNNLFDF
Subjt:  SSPNIFNNLFDF

A0A5A7TBE4 Protein MKS1-like | e value: 2.1e-104 | % identity: 93.87
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPP GRPPLPPGP+QWPQPLIIYDISPKVIHVAENNFMSVVQRLTG SS A 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI
        TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEV+VELGQIPGILSPAP TLAPIPTGYFSPAIE QS +YSL HELSPHWPSPSALFS PL+SPI
Subjt:  TDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPI

Query:  SSPNIFNNLFDF
        SSPNIFNNLFDF
Subjt:  SSPNIFNNLFDF

A0A6J1G715 protein MKS1-like | e value: 1.8e-95 | % identity: 87.79
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  +QWPQPLIIYDISPKV+HVAENNFMSVVQRLTGLSSAA 
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAA-

Query:  ATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP
        +TDGDLSPAAR ATIEKASPRSEREREI+VSDMMDL EV VELGQ PGILSPAPA+LAPI +G+FSPAIE QS  YSLIHELSPHWPSPSALF APLVSP
Subjt:  ATDGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP

Query:  ISSPNIFNNLFDF
        ISSPNIFN+LFDF
Subjt:  ISSPNIFNNLFDF

A0A6J1L2G0 protein MKS1-like | e value: 7.0e-92 | % identity: 85.45
Query:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA
        MNPPG SA GSP TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPP  +QWPQPLIIYDISPKV+HVAENNFMSVVQRLTGLSSA++
Subjt:  MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAA

Query:  T-DGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP
        + DGDLSPAAR ATIEKASPRS  EREI+VSDMMDL EV VELGQ PGILSPAPA+LAPI +G+FSP IE QS  YSLIHELSPHWPSPSALF APLVSP
Subjt:  T-DGDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSP

Query:  ISSPNIFNNLFDF
        ISSPNIFN+LFDF
Subjt:  ISSPNIFNNLFDF

SwissProt top hits (e value, % identity, alignment)
F4HWF9 Nuclear speckle RNA-binding protein B | e value: 1.1e-09 | % identity: 34.44
Query:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-----------------
        GP+P  L+V  +S K IKKP   PPHPQP PP      P      P P+ IY ++P++IH   NNFM++VQRLTG +S + T                  
Subjt:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-----------------

Query:  ----GDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQ-------------IPGILSPAPATLAPIPTGYFS
            G +SPAAR A  EKA+  +E    +     MD         Q               GILSP P +L  +   +FS
Subjt:  ----GDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQ-------------IPGILSPAPATLAPIPTGYFS

Q8LGD5 Protein MKS1 | e value: 3.4e-27 | % identity: 43.93
Query:  MNPPGYSAAGSPTTP--RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSA
        M+P  Y A G+P+    +K+++Q+ GPRP  L V ++S KIKKPP HP P P   +PP    P    +P++IY +SPKV+H   + FM+VVQRLTG+SS 
Subjt:  MNPPGYSAAGSPTTP--RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSA

Query:  AATD----GDLSPAARLATIEKASPRSERE---REINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPS-PSA
           +    GD+SPAARLA+ E ASPR  +E   R+  V     + E A   G  PGILSP+PA L    TG FSP + HQ   +S         P+ P  
Subjt:  AATD----GDLSPAARLATIEKASPRSERE---REINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPS-PSA

Query:  LFS-APLVSPISSP
        LFS A  +SP  SP
Subjt:  LFS-APLVSPISSP

Arabidopsis top hits (e value, % identity, alignment)
AT1G21320.1 nucleotide binding;nucleic acid binding | e value: 7.8e-11 | % identity: 34.44
Query:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-----------------
        GP+P  L+V  +S K IKKP   PPHPQP PP      P      P P+ IY ++P++IH   NNFM++VQRLTG +S + T                  
Subjt:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-----------------

Query:  ----GDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQ-------------IPGILSPAPATLAPIPTGYFS
            G +SPAAR A  EKA+  +E    +     MD         Q               GILSP P +L  +   +FS
Subjt:  ----GDLSPAARLATIEKASPRSEREREINVSDMMDLAEVAVELGQ-------------IPGILSPAPATLAPIPTGYFS

AT1G21326.1 VQ motif-containing protein | e value: 1.9e-12 | % identity: 33.88
Query:  TPRKKEIQLQGPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-------
        +PR + I   GPRP  L+V  +S K IKKP   PPHPQP PP      P      P P+IIY +SP++IH   NNFM++VQRLTG +S + T        
Subjt:  TPRKKEIQLQGPRPPQLRVSQESRK-IKKP---PPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATD-------

Query:  --------------GDLSPAARLATIEKASPRSEREREINVSDMMDL---------------AEVAVELGQI--PGILSPAPATLAPIPTGYFSPAIEHQ
                      G +SPAAR A  EKA+  +E    +     MD                     E       GILSP P +L  +   +FS      
Subjt:  --------------GDLSPAARLATIEKASPRSEREREINVSDMMDL---------------AEVAVELGQI--PGILSPAPATLAPIPTGYFSPAIEHQ

Query:  SLAYSLIHELSPHWPSPSALFSAP--LVSPISSPNIFNNLFD
           +S            S L S+P  + SP SS ++FNN FD
Subjt:  SLAYSLIHELSPHWPSPSALFSAP--LVSPISSPNIFNNLFD

AT3G18360.1 VQ motif-containing protein | e value: 3.2e-04 | % identity: 37.5
Query:  PRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLS
        P PP L+V+++S  IKK PP P       +P           P+IIY  +P++IH    +FM++VQ+LTG++
Subjt:  PRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLS

AT3G18690.1 MAP kinase substrate 1 | e value: 2.4e-28 | % identity: 43.93
Query:  MNPPGYSAAGSPTTP--RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSA
        M+P  Y A G+P+    +K+++Q+ GPRP  L V ++S KIKKPP HP P P   +PP    P    +P++IY +SPKV+H   + FM+VVQRLTG+SS 
Subjt:  MNPPGYSAAGSPTTP--RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSA

Query:  AATD----GDLSPAARLATIEKASPRSERE---REINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPS-PSA
           +    GD+SPAARLA+ E ASPR  +E   R+  V     + E A   G  PGILSP+PA L    TG FSP + HQ   +S         P+ P  
Subjt:  AATD----GDLSPAARLATIEKASPRSERE---REINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPS-PSA

Query:  LFS-APLVSPISSP
        LFS A  +SP  SP
Subjt:  LFS-APLVSPISSP


Sequences
CDS sequence
ATGAATCCGCCGGGATATTCCGCCGCCGGCAGCCCCACCACCCCAAGAAAGAAGGAGATCCAGCTACAAGGCCCCCGCCCGCCGCAACTCCGCGTCAGTCAAGAATCCCG
CAAAATCAAGAAGCCGCCGCCGCACCCTCAGCCGGTTCCCCCCGGCGGCCGGCCTCCTCTCCCGCCGGGCCCCGCCCAATGGCCTCAGCCCCTCATCATCTACGACATCT
CCCCCAAAGTCATCCACGTCGCCGAGAACAATTTCATGTCCGTCGTCCAGCGCCTCACTGGGCTGTCCTCCGCCGCCGCCACCGACGGTGACCTGTCCCCGGCAGCAAGG
CTGGCCACAATCGAGAAAGCCAGTCCCCGATCCGAAAGAGAAAGAGAAATCAACGTTAGCGACATGATGGATTTGGCGGAGGTTGCGGTGGAATTAGGGCAAATTCCCGG
AATCCTGTCACCAGCTCCGGCAACTCTGGCTCCGATTCCGACAGGGTATTTCTCGCCGGCGATTGAACATCAAAGCTTGGCGTATTCTTTGATTCACGAGTTGAGTCCTC
ATTGGCCAAGCCCTTCTGCTCTGTTTTCAGCTCCTCTTGTTTCCCCAATTTCTTCACCAAATATCTTCAACAATCTCTTTGACTTTTAG
mRNA sequence
ATGAATCCGCCGGGATATTCCGCCGCCGGCAGCCCCACCACCCCAAGAAAGAAGGAGATCCAGCTACAAGGCCCCCGCCCGCCGCAACTCCGCGTCAGTCAAGAATCCCG
CAAAATCAAGAAGCCGCCGCCGCACCCTCAGCCGGTTCCCCCCGGCGGCCGGCCTCCTCTCCCGCCGGGCCCCGCCCAATGGCCTCAGCCCCTCATCATCTACGACATCT
CCCCCAAAGTCATCCACGTCGCCGAGAACAATTTCATGTCCGTCGTCCAGCGCCTCACTGGGCTGTCCTCCGCCGCCGCCACCGACGGTGACCTGTCCCCGGCAGCAAGG
CTGGCCACAATCGAGAAAGCCAGTCCCCGATCCGAAAGAGAAAGAGAAATCAACGTTAGCGACATGATGGATTTGGCGGAGGTTGCGGTGGAATTAGGGCAAATTCCCGG
AATCCTGTCACCAGCTCCGGCAACTCTGGCTCCGATTCCGACAGGGTATTTCTCGCCGGCGATTGAACATCAAAGCTTGGCGTATTCTTTGATTCACGAGTTGAGTCCTC
ATTGGCCAAGCCCTTCTGCTCTGTTTTCAGCTCCTCTTGTTTCCCCAATTTCTTCACCAAATATCTTCAACAATCTCTTTGACTTTTAG
Protein sequence
MNPPGYSAAGSPTTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPGGRPPLPPGPAQWPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSAAATDGDLSPAAR
LATIEKASPRSEREREINVSDMMDLAEVAVELGQIPGILSPAPATLAPIPTGYFSPAIEHQSLAYSLIHELSPHWPSPSALFSAPLVSPISSPNIFNNLFDF