CuGenDBv2

Sed0024673 (gene) of Chayote v1 genome

Gene ID: Sed0024673
Organism: Sechium edule (Chayote v1)
Description: Serine/arginine repetitive matrix protein 1
Genome location: LG14:22872946..22873993
RNA-Seq Expression: Sed0024673
Synteny: Sed0024673
Gene Ontology terms: NA
InterPro domains: NA
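
As a minimal sketch (not part of the database record), the genome location string can be split into its linkage group and coordinates to get the span of the locus. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption about the database's convention.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG14:22872946..22873993")
span = end - start + 1  # assumes 1-based, inclusive coordinates
print(seqid, start, end, span)
```

Under that assumption the locus spans 1,048 bp on LG14.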


Homology
GenBank top hits (e value, % identity, alignment)
KAG6594621.1 hypothetical protein SDJN03_11174, partial [Cucurbita argyrosperma subsp. sororia]; e value: 3.6e-69; % identity: 60.07
Query:  MGCCVSSGKSTNSTHKLDRNSTA---AAVKNNENREPPSVTEEETVKEVLTETTALKPPP--PPPPPIPPEKDAAVKSEGN----DIEKKFNEIPIDGIV
        MGCCVSSGKS++S HK D  +       + +N +REPPS  EEETVKEVL+ET ALKP P  PP    PPE+D A K  G+    +IEKK  EIPI+GIV
Subjt:  MGCCVSSGKSTNSTHKLDRNSTA---AAVKNNENREPPSVTEEETVKEVLTETTALKPPP--PPPPPIPPEKDAAVKSEGN----DIEKKFNEIPIDGIV

Query:  QQPSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMA
        QQ SEF EIS  ++F ATANF DI+D    GG EVHQ  LK+    LP  Q I G        SPNKTLNRRS+ SPVRRN  VG+A  VQ +D SPAM 
Subjt:  QQPSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMA

Query:  RRGLRPEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATA--TAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECF
         RGLR E   +DPDEN GRRSRSPA AR +   SRSAL      PSVRK GKSSPLR  TA   APA  +K VEENNI D N  TQIESLENPLVSLECF
Subjt:  RRGLRPEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATA--TAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 2.1e-69; % identity: 60.07
Query:  MGCCVSSGKSTNSTHKLDRNSTA---AAVKNNENREPPSVTEEETVKEVLTETTALKPPP--PPPPPIPPEKDAAVKSEGN----DIEKKFNEIPIDGIV
        MGCCVSSGKS++S HK D  +       + +N +REPPS  EEETVKEVL+ET ALKP P  PP    PPE+D A K  G+    +IEKK  EIPI+GIV
Subjt:  MGCCVSSGKSTNSTHKLDRNSTA---AAVKNNENREPPSVTEEETVKEVLTETTALKPPP--PPPPPIPPEKDAAVKSEGN----DIEKKFNEIPIDGIV

Query:  QQPSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMA
        QQ SEF EIS  ++F ATANF DI+D    GG EVHQ  LK+    LP  Q I G        SPNKTLNRRS+ SPVRRN  VG+A  VQ RD SPAM 
Subjt:  QQPSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMA

Query:  RRGLRPEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATA--TAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECF
         RGLR E  ++DPDEN GRRSRSPA AR +   SRSAL      PSVRK GKSSPLR  TA   APA  +K VEENNI + N  TQIESLENPLVSLECF
Subjt:  RRGLRPEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATA--TAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

XP_008441084.1 PREDICTED: uncharacterized protein LOC103485312 [Cucumis melo]; e value: 1.7e-66; % identity: 57.82
Query:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYEI
        MGCC+SS +S NS +K   +S       N NR+PPS  EEETVKEVL+ET ALK  PPP    PPE+D   K  G++ EKK +EIPI+GI +QPSEFYEI
Subjt:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYEI

Query:  SKSNEFL--ATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGS--------PNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPE
        S  N+ +  + A F D  D    GGGEVHQ  LKSSPVKL K Q +            ++TL RRS+ SPVRRN AVG+   V NRD+SPAMARRGLR E
Subjt:  SKSNEFL--ATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGS--------PNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPE

Query:  FPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL
         PRRDPDEN  RRS+SP+ A  +    RSAL   +  PS RK GKSSP+RA TAT+    QK VEENNI+D   NTQIESLENPLVSLECFIFL
Subjt:  FPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata]; e value: 5.5e-70; % identity: 61.11
Query:  MGCCVSSGKSTNSTHKLDRNSTAAAVK------NNENREPPSVTEEETVKEVLTETTALKP--PPPPPPPIPPEKDAAVKSEGN----DIEKKFNEIPID
        MGCCVSSGKS++S HK D    AAA K      +N +REPPS  EEETVKEVL+ET ALKP    PP    PPE+D A K  G+    +IEKK  EIPI+
Subjt:  MGCCVSSGKSTNSTHKLDRNSTAAAVK------NNENREPPSVTEEETVKEVLTETTALKP--PPPPPPPIPPEKDAAVKSEGN----DIEKKFNEIPID

Query:  GIVQQPSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSP
        GIVQQPSEF EIS  ++F ATANF DI+D    GG EVHQ  LK+    LP  Q I G        SPNKTLNRRS+ SPVRRN  VG+A  VQ RD SP
Subjt:  GIVQQPSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSP

Query:  AMARRGLRPEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATA--TAPAAGQKRVEENNIIDENCNTQIESLENPLVSL
        AM  RGLR E  ++DPDEN GRRSRSPA AR +   SRSAL      PSVRK GKSSPLR  TA   APA  +K VEENNI D N  TQIESLENPLVSL
Subjt:  AMARRGLRPEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATA--TAPAAGQKRVEENNIIDENCNTQIESLENPLVSL

Query:  ECFIFL
        ECFIFL
Subjt:  ECFIFL

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida]; e value: 2.0e-72; % identity: 61.43
Query:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPP-IPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYE
        MGCC+SSGKS NS +K  RNS      +N +R+PPS  EEETVKEVL+ET +LKPPP PP    PPE+D  +K  GN+IEKK  EI I+GI + PSEFYE
Subjt:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPP-IPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYE

Query:  ISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPEF
        IS  NE ++ +  I        GGGE+HQ  LKSSPVKLPK Q I G        S N+TL RRS+ SPVRRN A+G+   V NRD++PAMARR LR E 
Subjt:  ISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPEF

Query:  PRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL
        PRRDPDEN  RRSRSPA AR +G  SRSAL   +  PSVRK GKSSP RAATAT+    QK VEENNIID   N+QIESLENPLVSLECFIFL
Subjt:  PRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLE9 Uncharacterized protein; e value: 2.2e-64; % identity: 57.14
Query:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYEI
        MGCC+SS +S +S +K   NS       N +R+PPS  EEETVKEVL+ET ALKPP        PE+D   K  G++IEKK +EIPI+GI +QPSEFYEI
Subjt:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYEI

Query:  SKSNEFL--ATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPE
        S  N+ +  + A F D  D    GGGEVHQ  LKSSPVKL K Q +            ++TL RRS+ SPVRRN AVG+   V NRD+SPAMARRGLR E
Subjt:  SKSNEFL--ATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPE

Query:  FPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL
         PRRDPDEN  RRS SP+ AR +    RSAL   +  PS RK GKSSP+ A TAT+    QK VEENNI+D   NTQIESLENPLVSLECFIFL
Subjt:  FPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC103485312; e value: 8.0e-67; % identity: 57.82
Query:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYEI
        MGCC+SS +S NS +K   +S       N NR+PPS  EEETVKEVL+ET ALK  PPP    PPE+D   K  G++ EKK +EIPI+GI +QPSEFYEI
Subjt:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYEI

Query:  SKSNEFL--ATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGS--------PNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPE
        S  N+ +  + A F D  D    GGGEVHQ  LKSSPVKL K Q +            ++TL RRS+ SPVRRN AVG+   V NRD+SPAMARRGLR E
Subjt:  SKSNEFL--ATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGS--------PNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPE

Query:  FPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL
         PRRDPDEN  RRS+SP+ A  +    RSAL   +  PS RK GKSSP+RA TAT+    QK VEENNI+D   NTQIESLENPLVSLECFIFL
Subjt:  FPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein; e value: 8.0e-67; % identity: 57.82
Query:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYEI
        MGCC+SS +S NS +K   +S       N NR+PPS  EEETVKEVL+ET ALK  PPP    PPE+D   K  G++ EKK +EIPI+GI +QPSEFYEI
Subjt:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYEI

Query:  SKSNEFL--ATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGS--------PNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPE
        S  N+ +  + A F D  D    GGGEVHQ  LKSSPVKL K Q +            ++TL RRS+ SPVRRN AVG+   V NRD+SPAMARRGLR E
Subjt:  SKSNEFL--ATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGS--------PNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPE

Query:  FPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL
         PRRDPDEN  RRS+SP+ A  +    RSAL   +  PS RK GKSSP+RA TAT+    QK VEENNI+D   NTQIESLENPLVSLECFIFL
Subjt:  FPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC111012433; e value: 3.0e-37; % identity: 54.17
Query:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAA-------VKSEGNDIEKKFNEIPIDGIVQQ
        MGCCVSSG   NS HK DRNS AA  K  E+REPPS  EEETVKEVLTET ALKPPPPP    PP++D A       VK E N+IEKK   IP + + + 
Subjt:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAA-------VKSEGNDIEKKFNEIPIDGIVQQ

Query:  PSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGS------PNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVS
          EF EIS  +E L+ A F D +D     G EVHQ+  ++SPVKLPK Q   G       PN+ LNRRS+ SPVRRN  VG+A   QNRD++
Subjt:  PSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGS------PNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVS

A0A6J1EF08 uncharacterized protein LOC111433567; e value: 2.7e-70; % identity: 61.11
Query:  MGCCVSSGKSTNSTHKLDRNSTAAAVK------NNENREPPSVTEEETVKEVLTETTALKP--PPPPPPPIPPEKDAAVKSEGN----DIEKKFNEIPID
        MGCCVSSGKS++S HK D    AAA K      +N +REPPS  EEETVKEVL+ET ALKP    PP    PPE+D A K  G+    +IEKK  EIPI+
Subjt:  MGCCVSSGKSTNSTHKLDRNSTAAAVK------NNENREPPSVTEEETVKEVLTETTALKP--PPPPPPPIPPEKDAAVKSEGN----DIEKKFNEIPID

Query:  GIVQQPSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSP
        GIVQQPSEF EIS  ++F ATANF DI+D    GG EVHQ  LK+    LP  Q I G        SPNKTLNRRS+ SPVRRN  VG+A  VQ RD SP
Subjt:  GIVQQPSEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVG--------SPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSP

Query:  AMARRGLRPEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATA--TAPAAGQKRVEENNIIDENCNTQIESLENPLVSL
        AM  RGLR E  ++DPDEN GRRSRSPA AR +   SRSAL      PSVRK GKSSPLR  TA   APA  +K VEENNI D N  TQIESLENPLVSL
Subjt:  AMARRGLRPEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATA--TAPAAGQKRVEENNIIDENCNTQIESLENPLVSL

Query:  ECFIFL
        ECFIFL
Subjt:  ECFIFL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G11125.1 unknown protein; e value: 4.3e-04; % identity: 29.77
Query:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEET-VKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQP----S
        MGCC+SS  +      +   +TA          PPSV +EET VKEVL+ETT L            EK    K +  + E+K   I +D + Q+P     
Subjt:  MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEET-VKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQP----S

Query:  EFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKS------SPVKLPKTQLIVGSPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRP
           E  K +E     +  + +        E   K +KS      SP K  + +++V  P    NRR++ SP +RN    A      R V  A        
Subjt:  EFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKS------SPVKLPKTQLIVGSPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRP

Query:  EFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEEN-NIIDENCN-------------TQIESLENPL
            RD  E   RRSRSPA  R         + P     S    G      A    + + G+ R+  N N  D+ CN                +S ENPL
Subjt:  EFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEEN-NIIDENCN-------------TQIESLENPL

Query:  VSLECFIFL
        VSLECFIFL
Subjt:  VSLECFIFL

AT1G61170.1 unknown protein; e value: 7.3e-04; % identity: 29.39
Query:  CCVSSGKSTNSTHKL-DRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKK--FNEIPIDGIVQQP-----
        CCVSSG +      + D+N+T              V EE  VKEVL+ETT     P         KD        D EKK  F ++  D ++ +P     
Subjt:  CCVSSGKSTNSTHKL-DRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKK--FNEIPIDGIVQQP-----

Query:  ------SEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGSPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLR
              SE   +S S    +T     +++G       + Q+  + SP K  +TQ+   + N    RR++ SP +RN                     G R
Subjt:  ------SEFYEISKSNEFLATANFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGSPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLR

Query:  PEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL
             RDP E  GRRSRSPA  R   + ++S+    A      +    SP R     A     ++  +N        T  E LENPLVSLECFIFL
Subjt:  PEFPRRDPDENPGRRSRSPAAARLEGERSRSALRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL


Sequences
CDS sequence
ATGGGTTGTTGCGTTAGTTCAGGGAAATCCACAAATTCGACGCACAAACTTGATCGGAATTCTACAGCAGCCGCCGTTAAGAACAATGAAAACAGAGAGCCGCCGTCTGT
CACAGAGGAAGAAACGGTCAAAGAAGTTCTCACTGAAACGACTGCACTGAAACCACCGCCGCCGCCTCCGCCGCCAATACCACCGGAAAAAGACGCCGCCGTCAAAAGCG
AGGGAAATGACATCGAGAAGAAGTTTAATGAAATTCCCATTGATGGAATTGTACAACAACCTTCGGAATTCTATGAAATTTCCAAATCGAACGAGTTTCTCGCCACCGCT
AATTTCATCGATATAATCGACGGCGGCGGCGGCGGCGGCGGAGAGGTTCATCAGAAAGCTTTGAAATCATCGCCAGTGAAATTGCCGAAAACCCAATTAATTGTTGGATC
GCCGAACAAGACGCTGAACCGGAGATCCGAACCGTCGCCGGTCCGACGAAACAGAGCTGTCGGGGCGGCGGGTTTTGTTCAGAACAGAGACGTAAGTCCGGCAATGGCGC
GGCGGGGATTGAGGCCGGAGTTTCCCCGGAGAGACCCAGATGAGAATCCCGGCAGGAGATCTCGGTCGCCGGCCGCCGCACGTCTCGAAGGCGAAAGATCTAGATCTGCC
CTTAGACCGGCGGCACCGGCACCGTCGGTGAGAAAGCTAGGTAAGTCGTCGCCGCTCAGGGCGGCGACGGCGACGGCACCGGCGGCCGGTCAAAAGAGAGTAGAAGAAAA
CAATATAATCGATGAAAATTGCAATACTCAGATTGAATCACTGGAAAACCCTCTGGTTTCATTGGAGTGCTTCATCTTCCTCTGA
mRNA sequence
GTAACATCCTCCATAACAGACCGAAAATTCTGCCTTTAGAATTCAATTCAGTGAGCTTTCGCTCAAAATCCCAAATCTCCAAACTCGCCGGGAAATCACAATGTCCGCCG
CCGGCAACTAACTTGATCGGAAAATCACAATGGGTTGTTGCGTTAGTTCAGGGAAATCCACAAATTCGACGCACAAACTTGATCGGAATTCTACAGCAGCCGCCGTTAAG
AACAATGAAAACAGAGAGCCGCCGTCTGTCACAGAGGAAGAAACGGTCAAAGAAGTTCTCACTGAAACGACTGCACTGAAACCACCGCCGCCGCCTCCGCCGCCAATACC
ACCGGAAAAAGACGCCGCCGTCAAAAGCGAGGGAAATGACATCGAGAAGAAGTTTAATGAAATTCCCATTGATGGAATTGTACAACAACCTTCGGAATTCTATGAAATTT
CCAAATCGAACGAGTTTCTCGCCACCGCTAATTTCATCGATATAATCGACGGCGGCGGCGGCGGCGGCGGAGAGGTTCATCAGAAAGCTTTGAAATCATCGCCAGTGAAA
TTGCCGAAAACCCAATTAATTGTTGGATCGCCGAACAAGACGCTGAACCGGAGATCCGAACCGTCGCCGGTCCGACGAAACAGAGCTGTCGGGGCGGCGGGTTTTGTTCA
GAACAGAGACGTAAGTCCGGCAATGGCGCGGCGGGGATTGAGGCCGGAGTTTCCCCGGAGAGACCCAGATGAGAATCCCGGCAGGAGATCTCGGTCGCCGGCCGCCGCAC
GTCTCGAAGGCGAAAGATCTAGATCTGCCCTTAGACCGGCGGCACCGGCACCGTCGGTGAGAAAGCTAGGTAAGTCGTCGCCGCTCAGGGCGGCGACGGCGACGGCACCG
GCGGCCGGTCAAAAGAGAGTAGAAGAAAACAATATAATCGATGAAAATTGCAATACTCAGATTGAATCACTGGAAAACCCTCTGGTTTCATTGGAGTGCTTCATCTTCCT
CTGAAATTGAAAAAAAAAACTCACAATTCTCAATTGCTCACTTTTTTGCTTTTATTTT
Protein sequence
MGCCVSSGKSTNSTHKLDRNSTAAAVKNNENREPPSVTEEETVKEVLTETTALKPPPPPPPPIPPEKDAAVKSEGNDIEKKFNEIPIDGIVQQPSEFYEISKSNEFLATA
NFIDIIDGGGGGGGEVHQKALKSSPVKLPKTQLIVGSPNKTLNRRSEPSPVRRNRAVGAAGFVQNRDVSPAMARRGLRPEFPRRDPDENPGRRSRSPAAARLEGERSRSA
LRPAAPAPSVRKLGKSSPLRAATATAPAAGQKRVEENNIIDENCNTQIESLENPLVSLECFIFL
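
As a quick consistency check between the CDS and protein sequences listed above, the sketch below translates the first 30 and last 21 nucleotides of the CDS with the standard genetic code. The helper `translate` and the variable names are illustrative and not part of the database. The sketch assumes the standard (nuclear) codon table applies.

```python
from itertools import product

# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGGTTGTTGCGTTAGTTCAGGGAAATCC"  # first 30 nt of the CDS above
cds_end = "GAGTGCTTCATCTTCCTCTGA"             # last 21 nt of the CDS above

print(translate(cds_start))
print(translate(cds_end))
```

The two translated fragments can be compared against the first ten and last six residues of the protein sequence; the trailing `*` corresponds to the terminal TGA stop codon.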