; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007872 (gene) of Chayote v1 genome

Gene IDSed0007872
OrganismSechium edule (Chayote v1)
DescriptionVQ domain-containing protein
Genome locationLG05:46225301..46227399
RNA-Seq ExpressionSed0007872
SyntenySed0007872
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR008889 - VQ
IPR039607 - VQ motif-containing protein 8/17/18/20/21/25


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605314.1 Protein MKS1, partial [Cucurbita argyrosperma subsp. sororia]1.1e-8081.02Show/hide
Query:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--
        MNP G   GG  NTPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPPA ASQWPQPLIIYDISPKV HVAENNFMSVVQRLTGL++  
Subjt:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--

Query:  ---EPDLSPAARFATIGKASPRS--DRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPL
           + DLSPAARFATI KASPRS  +RER+IDVSDMMDLTEV VE GQ PGILSPAPA+LAPI +GFFSPA+E Q F YS    LSPHW SPSALF APL
Subjt:  ---EPDLSPAARFATIGKASPRS--DRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPL

Query:  VSPISSPNIFNHLFDF
        VSPISSPNIFNHLFDF
Subjt:  VSPISSPNIFNHLFDF

XP_008460908.1 PREDICTED: protein MKS1-like [Cucumis melo]8.3e-8181.86Show/hide
Query:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA----EPDLSPA
        G   TPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPP   SQWPQPLIIYDISPKV HVAENNFMSVVQRLTG ++    + DLSPA
Subjt:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA----EPDLSPA

Query:  ARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNIFNH
        AR ATI KASPRS+RER+I+VSDMMDL EVSVE GQIPGILSPAP TLAPIPTG+FSPA+E Q F YS    LSPHW SPSALFS PL+SPISSPNIFN+
Subjt:  ARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNIFNH

Query:  LFDF
        LFDF
Subjt:  LFDF

XP_022947651.1 protein MKS1-like [Cucurbita moschata]3.4e-8281.78Show/hide
Query:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--
        MNP G   GG  NTPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPPA ASQWPQPLIIYDISPKV HVAENNFMSVVQRLTGL++  
Subjt:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--

Query:  ---EPDLSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVS
           + DLSPAARFATI KASPRS+RER+IDVSDMMDLTEV VE GQ PGILSPAPA+LAPI +GFFSPA+E Q F YS    LSPHW SPSALF APLVS
Subjt:  ---EPDLSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVS

Query:  PISSPNIFNHLFDF
        PISSPNIFNHLFDF
Subjt:  PISSPNIFNHLFDF

XP_023533977.1 protein MKS1-like [Cucurbita pepo subsp. pepo]3.4e-8281.78Show/hide
Query:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--
        MNP G   GG  NTPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPPA ASQWPQPLIIYDISPKV HVAENNFMSVVQRLTGL++  
Subjt:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--

Query:  ---EPDLSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVS
           + DLSPAARFATI KASPRS+RER+IDVSDMMDLTEV VE GQ PGILSPAPA+LAPI +GFFSPA+E Q F YS    LSPHW SPSALF APLVS
Subjt:  ---EPDLSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVS

Query:  PISSPNIFNHLFDF
        PISSPNIFNHLFDF
Subjt:  PISSPNIFNHLFDF

XP_038901805.1 protein MKS1-like [Benincasa hispida]2.4e-8081.64Show/hide
Query:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGL-------AAEPDL
        G   TPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQPVPP GRPPLPP  A QWPQPLIIYDISPKV HVAENNFMSVVQRLTGL       A + DL
Subjt:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGL-------AAEPDL

Query:  SPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNI
        SPAAR ATI KASPRS+RER+I+VSDMMDL EVSVE GQIPGILSPAPATLAPIPTG+FSPA+E Q   YS    LSPHW SPSALFSAPLVSPISSPNI
Subjt:  SPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNI

Query:  FNHLFDF
        FN+LFDF
Subjt:  FNHLFDF

TrEMBL top hitse value%identityAlignment
A0A0A0LK21 VQ domain-containing protein7.6e-8081.28Show/hide
Query:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA----EPDLSPA
        G   TPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVP PGRPPLPP   SQWPQPLIIYDISPKV HVAENNFMSVVQRLTG ++    + DLSPA
Subjt:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA----EPDLSPA

Query:  ARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNIFNH
        AR ATI KASPRS+RER+I+VSDMMDL EVSVE GQIPGILSPAP TLAPIPTG+FSPA+E Q F YS    LSPHW+SPSALFS PL+SPISSPNIFN+
Subjt:  ARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNIFNH

Query:  LFD
        LFD
Subjt:  LFD

A0A1S3CD17 protein MKS1-like4.0e-8181.86Show/hide
Query:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA----EPDLSPA
        G   TPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPP   SQWPQPLIIYDISPKV HVAENNFMSVVQRLTG ++    + DLSPA
Subjt:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA----EPDLSPA

Query:  ARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNIFNH
        AR ATI KASPRS+RER+I+VSDMMDL EVSVE GQIPGILSPAP TLAPIPTG+FSPA+E Q F YS    LSPHW SPSALFS PL+SPISSPNIFN+
Subjt:  ARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNIFNH

Query:  LFDF
        LFDF
Subjt:  LFDF

A0A5A7TBE4 Protein MKS1-like4.0e-8181.86Show/hide
Query:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA----EPDLSPA
        G   TPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPP   SQWPQPLIIYDISPKV HVAENNFMSVVQRLTG ++    + DLSPA
Subjt:  GGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA----EPDLSPA

Query:  ARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNIFNH
        AR ATI KASPRS+RER+I+VSDMMDL EVSVE GQIPGILSPAP TLAPIPTG+FSPA+E Q F YS    LSPHW SPSALFS PL+SPISSPNIFN+
Subjt:  ARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVSPISSPNIFNH

Query:  LFDF
        LFDF
Subjt:  LFDF

A0A6J1G715 protein MKS1-like1.6e-8281.78Show/hide
Query:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--
        MNP G   GG  NTPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPPA ASQWPQPLIIYDISPKV HVAENNFMSVVQRLTGL++  
Subjt:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--

Query:  ---EPDLSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVS
           + DLSPAARFATI KASPRS+RER+IDVSDMMDLTEV VE GQ PGILSPAPA+LAPI +GFFSPA+E Q F YS    LSPHW SPSALF APLVS
Subjt:  ---EPDLSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVS

Query:  PISSPNIFNHLFDF
        PISSPNIFNHLFDF
Subjt:  PISSPNIFNHLFDF

A0A6J1L2G0 protein MKS1-like2.0e-8081.31Show/hide
Query:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGL----
        MNP G   GG  NTPRKKEIQLQGPRPPQLRV+QESRKIKKPPPHPQP+PP GRPPLPPA ASQWPQPLIIYDISPKV HVAENNFMSVVQRLTGL    
Subjt:  MNPHG---GG--NTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGL----

Query:  -AAEPDLSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVS
         +A+ DLSPAARFATI KASPRS  ER+IDVSDMMDLTEV VE GQ PGILSPAPA+LAPI +GFFSP +E Q F YS    LSPHW SPSALF APLVS
Subjt:  -AAEPDLSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYS----LSPHWSSPSALFSAPLVS

Query:  PISSPNIFNHLFDF
        PISSPNIFNHLFDF
Subjt:  PISSPNIFNHLFDF

SwissProt top hitse value%identityAlignment
F4HWF9 Nuclear speckle RNA-binding protein B1.7e-0733.7Show/hide
Query:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTG-------------LAAEPD-----
        GP+P  L+V  +S K IKKP   PPHPQP PP      P  S    P P+ IY ++P++ H   NNFM++VQRLTG               +EP      
Subjt:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTG-------------LAAEPD-----

Query:  -------LSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQ-------------IPGILSPAPATLAPIPTGFFS
               +SPAARFA   KA+  ++    +     MD         Q               GILSP P +L  +   FFS
Subjt:  -------LSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQ-------------IPGILSPAPATLAPIPTGFFS

Q8LGD5 Protein MKS12.6e-2440.99Show/hide
Query:  GGNTP----RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--------
        GGN      +K+++Q+ GPRP  L V ++S KIKKPP H  P PPP R   PP       +P++IY +SPKV H   + FM+VVQRLTG+++        
Subjt:  GGNTP----RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--------

Query:  EPDLSPAARFATIGKASPRSDRE---RDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYSLSPHWSSPSALFS-APLVSPIS
          D+SPAAR A+   ASPR  +E   RD  V     + E +   G  PGILSP+PA L    TG FSP M  QG M+S     + P  LFS A  +SP  
Subjt:  EPDLSPAARFATIGKASPRSDRE---RDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYSLSPHWSSPSALFS-APLVSPIS

Query:  SP------------NIFNHLFD
        SP            + F+H++D
Subjt:  SP------------NIFNHLFD

Arabidopsis top hitse value%identityAlignment
AT1G21320.1 nucleotide binding;nucleic acid binding1.2e-0833.7Show/hide
Query:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTG-------------LAAEPD-----
        GP+P  L+V  +S K IKKP   PPHPQP PP      P  S    P P+ IY ++P++ H   NNFM++VQRLTG               +EP      
Subjt:  GPRPPQLRVSQESRK-IKKP---PPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTG-------------LAAEPD-----

Query:  -------LSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQ-------------IPGILSPAPATLAPIPTGFFS
               +SPAARFA   KA+  ++    +     MD         Q               GILSP P +L  +   FFS
Subjt:  -------LSPAARFATIGKASPRSDRERDIDVSDMMDLTEVSVEFGQ-------------IPGILSPAPATLAPIPTGFFS

AT1G21326.1 VQ motif-containing protein1.6e-1033.47Show/hide
Query:  TPRKKEIQLQGPRPPQLRVSQESRK-IKKP---PPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTG--------------
        +PR + I   GPRP  L+V  +S K IKKP   PPHPQP PP      P  S    P P+IIY +SP++ H   NNFM++VQRLTG              
Subjt:  TPRKKEIQLQGPRPPQLRVSQESRK-IKKP---PPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTG--------------

Query:  LAAEPD-----------LSPAARFATIGKASPRSDRERDIDVSDMMDL------------TEVSVEFGQIP-----GILSPAPATLAPIPTGFFSP--AM
         +A  D           +SPAARFA   KA+  ++    +     MD              +   +  + P     GILSP P +L  +   FFS     
Subjt:  LAAEPD-----------LSPAARFATIGKASPRSDRERDIDVSDMMDL------------TEVSVEFGQIP-----GILSPAPATLAPIPTGFFSP--AM

Query:  EGQGFMYSLSPHWSSPSALFSAPLVSPISSPNIFNHLFD
        + QGF  S    ++S      + + SP SS ++FN+ FD
Subjt:  EGQGFMYSLSPHWSSPSALFSAPLVSPISSPNIFNHLFD

AT3G18360.1 VQ motif-containing protein1.8e-0437.5Show/hide
Query:  PRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGL
        P PP L+V+++S  IKKPP            P   +SA++   P+IIY  +P++ H    +FM++VQ+LTG+
Subjt:  PRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGL

AT3G18690.1 MAP kinase substrate 11.8e-2540.99Show/hide
Query:  GGNTP----RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--------
        GGN      +K+++Q+ GPRP  L V ++S KIKKPP H  P PPP R   PP       +P++IY +SPKV H   + FM+VVQRLTG+++        
Subjt:  GGNTP----RKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAA--------

Query:  EPDLSPAARFATIGKASPRSDRE---RDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYSLSPHWSSPSALFS-APLVSPIS
          D+SPAAR A+   ASPR  +E   RD  V     + E +   G  PGILSP+PA L    TG FSP M  QG M+S     + P  LFS A  +SP  
Subjt:  EPDLSPAARFATIGKASPRSDRE---RDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYSLSPHWSSPSALFS-APLVSPIS

Query:  SP------------NIFNHLFD
        SP            + F+H++D
Subjt:  SP------------NIFNHLFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCCGCACGGCGGCGGCAACACCCCAAGAAAGAAGGAGATCCAGCTGCAGGGGCCCCGCCCGCCGCAGCTCCGGGTCAGCCAAGAATCCCGCAAAATAAAGAAGCC
GCCGCCGCACCCTCAGCCGGTTCCCCCGCCCGGCCGCCCTCCTCTCCCGCCGGCCTCCGCCTCCCAATGGCCTCAGCCACTCATCATCTACGACATCTCCCCCAAAGTCC
ACCACGTGGCCGAGAACAATTTCATGTCCGTCGTCCAGCGCCTCACCGGCCTCGCCGCCGAACCCGACCTCTCTCCGGCCGCCAGATTCGCCACAATCGGAAAGGCCAGC
CCTCGATCCGACAGGGAAAGAGACATCGACGTTAGCGACATGATGGACTTGACGGAGGTTTCGGTGGAATTTGGGCAAATACCCGGAATTCTATCTCCGGCGCCGGCGAC
TCTGGCCCCGATACCGACGGGATTCTTCTCGCCGGCGATGGAGGGTCAGGGCTTCATGTATTCGTTGAGCCCTCATTGGTCAAGTCCTTCTGCTCTGTTTTCTGCCCCTC
TGGTTTCCCCAATTTCTTCACCAAATATTTTCAACCATCTTTTTGACTTTTAG
mRNA sequenceShow/hide mRNA sequence
CAGTCTTGTTCGAGGAACAAACTAGTTTGTATTGAATTGTTGGAGACCTGGTAAACCAAACCAGGGATTTGGTTCTCGATTTGTTTATGAACCAAAATCTTTTTGAATTC
TTCTCAAACCCATAACATGACATCTACAGACCTCCTCAAAGAGTGTTTTGCACTGTCTTTACATACAACACATTTTACATTGTAATAGAGGCAGTCAATCAATTTCGTTG
ACCGCCTTGAGCTTCGCTTCTTCCGTCAATAACAGAGTAATTTTCATTCTTTACTCTAATCTTCCACTTTGTTGTCAGTTTCTTTGAATCTCACCTCAACCCCAAATCTT
CAAATTCATCCTTGCAAAAAAATTCTCAACACATGTCAACATGTAAGCTCTCACTTTTGTCAACATGGAAGCTTTCAATTCATCCCCAAATTTGCTCTAATCTTGTATGT
CAACATGTAAACTTTCACTTTGTTGCCTACCGAACAAAATTCTTCATCACGTTCCCTCGATCGAACGGAGCACATATGCGACGCATACCCTTACTCTTTCTCGTTCTCTT
TCCTTCTTCTTTCTCTAAATCCACCAATACCATTCATGGTTGAAAGCTACCTTTGAAGCCCGAGGAAGTTACCTCTTTGTTTAGATATCACCTTAACGTTTCTCTTCTTT
TGGGTTCTTCATTGCCCTCCCTCCATCAACTCCCCTCGAGTTCCATTTCTTTGGCTCACTAAAGAGCCTTCTATATGCCCACATTCGAATCTTCTTAAGAAACCCATCAA
AGCCACAGATCACACTAATTATTAGTATCCACACATGCCACAAGTAAAACCTTCGTCCAAGTTTTCGCAGAGCCCAAAAAGTGTGTTTCTCTCACTAAAACTGAGAGAAA
CACTCTCTATTCAACTCCATTAAACTAACTTCATAATATTTTTACTCACTCAATCACACGTTATTGAATCAAAATGTAATATAAATTAATAAAGTTTTTGTTTTTAATGA
AATATTTAATTATGACCAGCAAATACAAATCCATGAAAGAAAAAAGAGCAAAGTATTCAAAATAAATAATTTTGGTGGGGAGTTTCCACCTTCCACAGTTTCCTTTCCAT
CGTATTTATGAAGTCAACTAACAGAAAGTCAAAGAAGCAATACTTTTGTCACAAAACTAAACAAACAAACAAACACAGCCAAATAAATAAAAATGTTATTATTTAATTTT
CTCACCATATTTTTGAAGTGTGGCCCAATCATCAACCTCTAATTTAACCCAGATTTCCAATTCTCTCCGTTTCCCCCCGATCCGATCCAGCAGCGATGAACCCGCACGGC
GGCGGCAACACCCCAAGAAAGAAGGAGATCCAGCTGCAGGGGCCCCGCCCGCCGCAGCTCCGGGTCAGCCAAGAATCCCGCAAAATAAAGAAGCCGCCGCCGCACCCTCA
GCCGGTTCCCCCGCCCGGCCGCCCTCCTCTCCCGCCGGCCTCCGCCTCCCAATGGCCTCAGCCACTCATCATCTACGACATCTCCCCCAAAGTCCACCACGTGGCCGAGA
ACAATTTCATGTCCGTCGTCCAGCGCCTCACCGGCCTCGCCGCCGAACCCGACCTCTCTCCGGCCGCCAGATTCGCCACAATCGGAAAGGCCAGCCCTCGATCCGACAGG
GAAAGAGACATCGACGTTAGCGACATGATGGACTTGACGGAGGTTTCGGTGGAATTTGGGCAAATACCCGGAATTCTATCTCCGGCGCCGGCGACTCTGGCCCCGATACC
GACGGGATTCTTCTCGCCGGCGATGGAGGGTCAGGGCTTCATGTATTCGTTGAGCCCTCATTGGTCAAGTCCTTCTGCTCTGTTTTCTGCCCCTCTGGTTTCCCCAATTT
CTTCACCAAATATTTTCAACCATCTTTTTGACTTTTAGCTAACTTCTCTCTCTCTCAATTTTTAGTCTGCATTGCCAATGCCCTTCAAAGAACAAATTTGGGGTTTTTTG
TTTTTTGTAAAGTTGGTAATTGTAATTTTTAAGACATTTACATTTTTAGGAATTCGTTAATTATCCACAAAAAAATGTAGTCTTGTTACCTTTTGCAATAAATAAATATC
ATTTCAAAG
Protein sequenceShow/hide protein sequence
MNPHGGGNTPRKKEIQLQGPRPPQLRVSQESRKIKKPPPHPQPVPPPGRPPLPPASASQWPQPLIIYDISPKVHHVAENNFMSVVQRLTGLAAEPDLSPAARFATIGKAS
PRSDRERDIDVSDMMDLTEVSVEFGQIPGILSPAPATLAPIPTGFFSPAMEGQGFMYSLSPHWSSPSALFSAPLVSPISSPNIFNHLFDF