; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G34370 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G34370
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionSerine/arginine repetitive matrix protein 1
Genome locationChr6:28253607..28255001
RNA-Seq ExpressionCSPI06G34370
SyntenyCSPI06G34370
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-7561.41Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSV---------NVSRDPPSSMEEETVKEVLSETPALKP----PQKNNSPPEQDEFRKPLGD----EIEKKLSEIPINGIP
        MGCC+SS +S SS +KF + +          N SR+PPSSMEEETVKEVLSET ALKP    P   + PPE+DE +KP+GD    EIEKKL EIPINGI 
Subjt:  MGCCLSSTQSFSSPNKFHSNSV---------NVSRDPPSSMEEETVKEVLSETPALKP----PQKNNSPPEQDEFRKPLGD----EIEKKLSEIPINGIP

Query:  EQPSEFYEISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR
        +Q SEF EIS+ +K    + A FTD  DGG EVHQ VLK+ P     NQS+  +V LKR++  ++TL RRSDQSPVRRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFYEISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENSSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSP------ITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDEN  RRS SP+TAR DS G RSAL RTPS RKSGKSSP      IT   ATS+KVVEENNI +G   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENSSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSP------ITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_008441084.1 PREDICTED: uncharacterized protein LOC103485312 [Cucumis melo]4.2e-13192.36Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSS+QSF+SPNKFH +SVN +RDPPSSMEEETVKEVLSETPALKPP   N PPE+DEF KPLGDE EKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQ DGGGEVHQT LKSSPVKLTKNQSV SDVELKREI QSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RS SPSTA SDSAGYRSALSRTPS RKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_011658203.1 uncharacterized protein LOC105435961 [Cucumis sativus]1.5e-14198.91Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNS PEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSV SDVELKREIQQSRTLTRRSDQSPVRRNGAVGS+RMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata]3.5e-7762.42Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSV---------NVSRDPPSSMEEETVKEVLSETPALKP----PQKNNSPPEQDEFRKPLGD----EIEKKLSEIPINGIP
        MGCC+SS +S SS +KF + +          N SR+PPSSMEEETVKEVLSET ALKP    P   N PPE+DE +KP+GD    EIEKKL EIPINGI 
Subjt:  MGCCLSSTQSFSSPNKFHSNSV---------NVSRDPPSSMEEETVKEVLSETPALKP----PQKNNSPPEQDEFRKPLGD----EIEKKLSEIPINGIP

Query:  EQPSEFYEISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR
        +QPSEF EIS+ +K    + A FTD  DGG EVHQ VLK+ P     NQS+  +V LKR++  ++TL RRSDQSPVRRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFYEISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENSSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSP------ITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDEN  RRS SP+TAR DS G RSAL RTPS RKSGKSSP      IT   ATS+KVVEENNI DG   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENSSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSP------ITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida]1.6e-11180.22Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKP---PQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNK
        MGCCLSS +SF+SPNKFH NS N SRDPPSSMEEETVKEVLSETP+LKP   P K NSPPE+D+  KP+G+EIEKKL EI INGI E PSEFYEISH N+
Subjt:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKP---PQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNK

Query:  CISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
        CISVS A  T+Q DGGGE+HQ VLKSSPVKL K+QS+  D E+KREI Q+RTLTRRSDQSPVRRNGA+GSMRMVHNRDM+PAMARR LRAEPPRRDPDEN
Subjt:  CISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  SSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        S RRS SP+TARSD  G RSALSRTPS RKSGKSSP  A TATSQKVVEENNI+DGKFN+QIESLENPLVSLECFIFL
Subjt:  SSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KLE9 Uncharacterized protein7.4e-14298.91Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNS PEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSV SDVELKREIQQSRTLTRRSDQSPVRRNGAVGS+RMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC1034853122.0e-13192.36Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSS+QSF+SPNKFH +SVN +RDPPSSMEEETVKEVLSETPALKPP   N PPE+DEF KPLGDE EKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQ DGGGEVHQT LKSSPVKLTKNQSV SDVELKREI QSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RS SPSTA SDSAGYRSALSRTPS RKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein2.0e-13192.36Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSS+QSF+SPNKFH +SVN +RDPPSSMEEETVKEVLSETPALKPP   N PPE+DEF KPLGDE EKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQ DGGGEVHQT LKSSPVKLTKNQSV SDVELKREI QSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RS SPSTA SDSAGYRSALSRTPS RKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC1110124333.1e-3956.54Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNS------VNVSRDPPSSMEEETVKEVLSETPALK-PPQKNNSPPEQDEFRKPLGD-------EIEKKLSEIPINGIPEQP
        MGCC+SS    +S +KF  NS      +  SR+PPSSMEEETVKEVL+ETPALK PP   N PP++DE  KP+ D       EIEKK+  IP N + E  
Subjt:  MGCCLSSTQSFSSPNKFHSNS------VNVSRDPPSSMEEETVKEVLSETPALK-PPQKNNSPPEQDEFRKPLGD-------EIEKKLSEIPINGIPEQP

Query:  SEFYEISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMS
         EF EIS  ++C+  SAATFTD+ D G EVHQ V ++SPVKL KNQS   DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+
Subjt:  SEFYEISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMS

A0A6J1EF08 uncharacterized protein LOC1114335671.7e-7762.42Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSV---------NVSRDPPSSMEEETVKEVLSETPALKP----PQKNNSPPEQDEFRKPLGD----EIEKKLSEIPINGIP
        MGCC+SS +S SS +KF + +          N SR+PPSSMEEETVKEVLSET ALKP    P   N PPE+DE +KP+GD    EIEKKL EIPINGI 
Subjt:  MGCCLSSTQSFSSPNKFHSNSV---------NVSRDPPSSMEEETVKEVLSETPALKP----PQKNNSPPEQDEFRKPLGD----EIEKKLSEIPINGIP

Query:  EQPSEFYEISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR
        +QPSEF EIS+ +K    + A FTD  DGG EVHQ VLK+ P     NQS+  +V LKR++  ++TL RRSDQSPVRRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFYEISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENSSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSP------ITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDEN  RRS SP+TAR DS G RSAL RTPS RKSGKSSP      IT   ATS+KVVEENNI DG   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENSSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSP------ITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11125.1 unknown protein9.0e-0729.73Show/hide
Query:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEET-VKEVLSETPALKPPQKNNSPP-------EQDEFRKP--LGDEIEKKLSEIPINGIPEQPSEFY
        MGCCLSS  +    +    +  N +R PPS ++EET VKEVLSET  L     N++         +++E +KP  + D  ++ +   P    PE+ SE  
Subjt:  MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEET-VKEVLSETPALKPPQKNNSPP-------EQDEFRKP--LGDEIEKKLSEIPINGIPEQPSEFY

Query:  EISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNG--AVGSMRMVHNRDMSPAMARRGLRAE
        E   +++ + +S     D+ +        V + SP K ++N+ + S               RR+D SP +RN     GS+R+V +   +           
Subjt:  EISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNG--AVGSMRMVHNRDMSPAMARRGLRAE

Query:  PPRRDPDENSSRRSHSPSTARSDSAGYRSALSRTPSA---RKSGKSSPITAMTATSQKVVEENNIVD-----GKFNT-QIESLENPLVSLECFIFL
           RD  E S RRS SP+  RS   G   +   T S    R+  +S     +        +E N        G +N+   +S ENPLVSLECFIFL
Subjt:  PPRRDPDENSSRRSHSPSTARSDSAGYRSALSRTPSA---RKSGKSSPITAMTATSQKVVEENNIVD-----GKFNT-QIESLENPLVSLECFIFL

AT1G61170.1 unknown protein3.9e-1030.28Show/hide
Query:  CCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEET-VKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEI----------PINGIPEQPSEFYE
        CC+SS  +          + NVS    + +EEET VKEVLSET    P     +   +D  +  + ++ EKK   +          P +  PE+ SE  E
Subjt:  CCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEET-VKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEI----------PINGIPEQPSEFYE

Query:  ISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPR
        I  ++   SVS+   T   +G  E H        V + + +S  S  + + ++  +   TRR+DQSP +RN    +                G R     
Subjt:  ISHMNKCISVSAATFTDQTDGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPR

Query:  RDPDENSSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RDP E S RRS SP+T RS     +S+       RK+ + SP       ++  +++    +  + T+ E LENPLVSLECFIFL
Subjt:  RDPDENSSRRSHSPSTARSDSAGYRSALSRTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGCTGTCTTAGTTCCACCCAATCCTTCAGTTCACCTAACAAATTCCATTCCAATTCTGTCAATGTAAGCAGAGACCCTCCGTCTTCCATGGAGGAAGAGACCGT
CAAAGAAGTGCTCTCTGAAACACCTGCTCTCAAACCGCCGCAGAAGAATAATTCTCCACCTGAACAAGATGAATTTCGCAAACCACTCGGTGATGAGATCGAGAAGAAGC
TTTCTGAAATTCCCATTAACGGAATTCCAGAACAGCCTTCTGAATTCTATGAAATTTCCCATATGAACAAGTGTATCTCAGTCTCCGCCGCTACTTTCACTGATCAAACA
GACGGGGGCGGTGAAGTTCATCAAACGGTTTTGAAATCATCGCCAGTGAAGTTGACGAAAAATCAATCAGTTTTCAGTGACGTTGAGTTAAAAAGAGAAATTCAGCAGAG
CAGGACACTGACCCGGAGATCCGACCAGTCACCAGTTCGACGAAATGGCGCCGTTGGGTCGATGAGAATGGTTCATAACAGAGACATGAGCCCGGCAATGGCGCGGCGAG
GATTGAGAGCAGAGCCTCCCCGAAGAGACCCAGATGAGAACTCCAGCCGGAGATCCCACTCACCGTCTACCGCTCGTTCTGACAGCGCAGGGTATAGATCTGCACTCAGT
CGGACACCGTCAGCGAGAAAGTCCGGTAAATCATCGCCCATTACGGCGATGACAGCGACAAGTCAAAAAGTAGTAGAAGAAAACAATATTGTAGATGGAAAATTCAACAC
CCAGATTGAATCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGA
mRNA sequenceShow/hide mRNA sequence
TAAGTTTCCCAGATCATTAAATTAAAATAAGAATAATAAAAGGTATTAAAGGGTAAAGTTTTTAATTTTTGTTTCCCCTCCCGATTAATTAATTCGTACTATCCGCCGTA
ACTGCTTCTGTCTTTAAAATTCAATTCATTGATATCTCACACTCTCAGTTCCCAAATCTTCCACCTTTCCCTCAAAATTACAATGCTCTGAAGCCTGCTCTCACAAATGG
GTTGCTGTCTTAGTTCCACCCAATCCTTCAGTTCACCTAACAAATTCCATTCCAATTCTGTCAATGTAAGCAGAGACCCTCCGTCTTCCATGGAGGAAGAGACCGTCAAA
GAAGTGCTCTCTGAAACACCTGCTCTCAAACCGCCGCAGAAGAATAATTCTCCACCTGAACAAGATGAATTTCGCAAACCACTCGGTGATGAGATCGAGAAGAAGCTTTC
TGAAATTCCCATTAACGGAATTCCAGAACAGCCTTCTGAATTCTATGAAATTTCCCATATGAACAAGTGTATCTCAGTCTCCGCCGCTACTTTCACTGATCAAACAGACG
GGGGCGGTGAAGTTCATCAAACGGTTTTGAAATCATCGCCAGTGAAGTTGACGAAAAATCAATCAGTTTTCAGTGACGTTGAGTTAAAAAGAGAAATTCAGCAGAGCAGG
ACACTGACCCGGAGATCCGACCAGTCACCAGTTCGACGAAATGGCGCCGTTGGGTCGATGAGAATGGTTCATAACAGAGACATGAGCCCGGCAATGGCGCGGCGAGGATT
GAGAGCAGAGCCTCCCCGAAGAGACCCAGATGAGAACTCCAGCCGGAGATCCCACTCACCGTCTACCGCTCGTTCTGACAGCGCAGGGTATAGATCTGCACTCAGTCGGA
CACCGTCAGCGAGAAAGTCCGGTAAATCATCGCCCATTACGGCGATGACAGCGACAAGTCAAAAAGTAGTAGAAGAAAACAATATTGTAGATGGAAAATTCAACACCCAG
ATTGAATCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGATTTTGTGCGTGGATTTCACTGATCTTTCATTTTTCTTGTTTGATTGTATTAAAGTT
TTTTGAGTGAACATTGAAGTAAATTTAACAACAACGGCGGTGGATCGCCGGAGAATTAATTGAAGGTCACTGGAAAGGTTGGTATGTTGGATATTAAATTTCAAACTAGA
AGTAAATTAATGGAATTAGATTTATTTGTTAATGGTTGTGATTTGAATGAGGTAACCGACGTAAAAGAGAAGACAAACTCCGCCGCATAAATCACTGTTGATCACATGAT
TTTAGAGGGAATGGGCTGTATATTGTTAATCACATTGTTAATCCAAAACCACAACCAAAAATGGTAGAGTAGCTC
Protein sequenceShow/hide protein sequence
MGCCLSSTQSFSSPNKFHSNSVNVSRDPPSSMEEETVKEVLSETPALKPPQKNNSPPEQDEFRKPLGDEIEKKLSEIPINGIPEQPSEFYEISHMNKCISVSAATFTDQT
DGGGEVHQTVLKSSPVKLTKNQSVFSDVELKREIQQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSRRSHSPSTARSDSAGYRSALS
RTPSARKSGKSSPITAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL