CuGenDBv2

Sed0015303 (gene) of Chayote v1 genome

Gene ID: Sed0015303
Organism: Sechium edule (Chayote v1)
Description: serine/arginine repetitive matrix protein 1-like
Genome location: LG01:10963773..10965509
RNA-Seq Expression: Sed0015303
Synteny: Sed0015303
Gene Ontology terms: NA
InterPro domains: NA
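
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB, and 1-based inclusive coordinates are an assumption.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


seqid, start, end = parse_location("LG01:10963773..10965509")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 1737 bp.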


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573173.1 hypothetical protein SDJN03_27060, partial [Cucurbita argyrosperma subsp. sororia]; e value 1.4e-40; identity 39.8%
Query:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----
        M RG+ISPPRSR SP+   P                    RP+  I PNE  +HRK+ QP+ + R TK TD +S+      S  N K   S  + P    
Subjt:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----

Query:  -----KSAAKTNTKLLSSSPINHQPTPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQLS
             K+A KT T+  S  P      P SKSN K A GSGSRSD   +KP            +G  + QQ  +I      G+    +LS      LHQLS
Subjt:  -----KSAAKTNTKLLSSSPINHQPTPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQLS

Query:  IEGK---------------------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----------SYRKRKSCAI
        ++ K                     +EECSSQ   NN +R+F+IYKEIASH QGNS ITSY TKLKALWDEL  +ID P+             +R+    
Subjt:  IEGK---------------------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----------SYRKRKSCAI

Query:  FLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL
        FL+GL+DSYS+ C+QIL M PFPT+E++   I+REE RREL+ SLE +AAKV  NN    N H+   NGDN+ ++ +  N+ + + D+NE++
Subjt:  FLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL

KAG7012356.1 hypothetical protein SDJN02_25108, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 1.4e-40; identity 40.31%
Query:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----
        M RG+ISPPRSR SP+   P                    RP+  I PNE  +HRK+ QP+ + R TK TD +S+      S  N K   S  + P    
Subjt:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----

Query:  -----KSAAKTNTKLLSSSPINHQPTPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQLS
             K+A KT T+  S  P      P SKSN K A GSGSRSD   +KP            +G  + QQ  +I      G+    +LS      LHQLS
Subjt:  -----KSAAKTNTKLLSSSPINHQPTPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQLS

Query:  IEGK---------------------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----SYRK------RKSCAI
        ++ K                     +EECSSQ   NN +R+F+IYKEIASH QGNS ITSY TKLKALWDEL  +ID P+    S +K      R+    
Subjt:  IEGK---------------------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----SYRK------RKSCAI

Query:  FLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL
        FL+GL+DSYS+ C+QIL M PFPT+E++   I+REE RREL+ SLE +AAKV  NN    N H+   NGDN+ ++ +  N+ + + D+NE++
Subjt:  FLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL

XP_022954810.1 serine/arginine repetitive matrix protein 1-like [Cucurbita moschata]; e value 1.4e-40; identity 40.71%
Query:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----
        M RG+ISPPRSR SP+   P                    RP+  I PNE  +HRK+ QP+ + R TK TD +S+      S  N K   S  + P    
Subjt:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----

Query:  -----KSAAKTNTKLLSSSPINHQP-TPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQL
             K+A KT T+   SSP   +P TP SKSN K A GSGSRSD   +KP            +G  + QQ  +I      G+    +LS      LHQL
Subjt:  -----KSAAKTNTKLLSSSPINHQP-TPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQL

Query:  SI---------------------EGKKEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----------SYRKRKSCA
        S+                     E K+EECSSQ   NN +R+F+IYKEIASH QGNS ITSY TKLKALWDEL  +ID P+             +R+   
Subjt:  SI---------------------EGKKEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----------SYRKRKSCA

Query:  IFLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL
         FL+GL+DSYS+ C+QIL M PFPT+E++   I+REE RREL+ SLE +AAKV  NN    N H+   NGDN+ ++ +  N+ + + D+NE++
Subjt:  IFLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL

XP_023542694.1 uncharacterized protein LOC111802521 [Cucurbita pepo subsp. pepo]; e value 1.8e-40; identity 40.56%
Query:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----
        M RG+ISPPRSR SP+   P                    RP+  I  NE  +HRK+ QP+ + R TK TD +S+      S  N K + S  + P    
Subjt:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----

Query:  -----KSAAKTNTKLLSSSPINHQPTPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQLS
             K+A KT T+  S  P      P SKSN K A GSGSRSD   +KP            +G  + QQ  +I      G+    +LS      LHQLS
Subjt:  -----KSAAKTNTKLLSSSPINHQPTPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQLS

Query:  I---------------------EGKKEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----SYRK------RKSCAI
        +                     E K+EECSSQ   NN +R+F+IYKEIASH QGNS ITSY TKLKALWDEL  +ID+P+    S +K      R+    
Subjt:  I---------------------EGKKEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----SYRK------RKSCAI

Query:  FLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL
        FL+GL DSYS+ C+QIL M PFPT+E++   I+REE RREL+ SLE +AAKV  NN    N H+   NGDN+ ++ +  N+ E + D+NE++
Subjt:  FLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL

XP_038895286.1 hybrid signal transduction histidine kinase L-like isoform X2 [Benincasa hispida]; e value 2.6e-36; identity 39.84%
Query:  RGLISPPRSRFSPKPLPPRPSARIKPNEHLSH-RKDSQPSALRGTKKTDTTSSSSSSSSSDHNKKKLESN--NSTPKSAA-----------KTNTKLLSS
        RG++SPPR++FS        +A  KPNE   + R++++P+ +R TK  +    SS++++S  N  K  S   +S PK+A             + TK++S 
Subjt:  RGLISPPRSRFSPKPLPPRPSARIKPNEHLSH-RKDSQPSALRGTKKTDTTSSSSSSSSSDHNKKKLESN--NSTPKSAA-----------KTNTKLLSS

Query:  -------SPINHQ--PTPSSKSNTKEAIGSGSRSDSKPNGVGHQQQ-HVKIG--NDL--NHSLSAEAQSLLH---------QLSIEGK------------
                P+ HQ  PTP+SK N K  I S S S S+ +  G  +  H + G  NDL  NH  SA   + LH         +LSI+GK            
Subjt:  -------SPINHQ--PTPSSKSNTKEAIGSGSRSDSKPNGVGHQQQ-HVKIG--NDL--NHSLSAEAQSLLH---------QLSIEGK------------

Query:  --------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATF-IDMPQ-----------SYRKRKSCAIFLVGLHDSYSSTCSQ
                KE  SSQS   N +R+F+IYKEIA HRQ NS ITSYFTKL+ALWDELATF  D+ Q            Y +R+    FLVGL+DSYS  C+Q
Subjt:  --------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATF-IDMPQ-----------SYRKRKSCAIFLVGLHDSYSSTCSQ

Query:  ILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFP---HNAHNSNNNGDNDIIEQI---DNNVNEQQVDE
        IL+ +PFPT+E+++  +IREE  REL+  LES+A KV  NN       NAH+SNN  +ND ++Q+    N+++EQQ D+
Subjt:  ILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFP---HNAHNSNNNGDNDIIEQI---DNNVNEQQVDE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C5Z8 uncharacterized protein LOC111008588; e value 1.0e-25; identity 34.75%
Query:  RGLISPPRSRFSPK---------PLPP-RP---SARIKP----NEHLSHRKDSQPSALRGTKKTDTTSSSSSSSSSD------------HNKKKLESNNS
        RGLISPPRSR SP+         P PP RP   S R +P     +H    +    +A R TKK+      S   ++             HN+KKL++  +
Subjt:  RGLISPPRSRFSPK---------PLPP-RP---SARIKP----NEHLSHRKDSQPSALRGTKKTDTTSSSSSSSSSD------------HNKKKLESNNS

Query:  TPKSAAKTNT------KLLSSSPINHQPTPSSKSNTKE-----AIGSGSRSDSKPNGVGHQQQHVKIGNDL------NHS----------LSAEAQSLLH
             AK ++      +L +       PTP SK+ T +     AI S SRSDS         +H+            NHS             +  + L 
Subjt:  TPKSAAKTNT------KLLSSSPINHQPTPSSKSNTKE-----AIGSGSRSDSKPNGVGHQQQHVKIGNDL------NHS----------LSAEAQSLLH

Query:  QLSIEGKK----------------EECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFI-DMPQ-----------SYRKRKSCAI
        +LSI+GK                  +   +S  +N  RIF+IYK+IASHRQ NS +TSYFTKLK LWDEL T+  D+PQ            + +R+    
Subjt:  QLSIEGKK----------------EECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFI-DMPQ-----------SYRKRKSCAI

Query:  FLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV-NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVD----ENESLDEELG
        FL+GL++SYS+ C QIL++ PFPT+E+++ +IIREE R EL++SLE +AAKV  N +      S+N  D+ I E+++ N  E  V+     NESL  +LG
Subjt:  FLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV-NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVD----ENESLDEELG

A0A6J1C6T8 uncharacterized protein LOC111008934 isoform X2; e value 1.8e-22; identity 34.24%
Query:  RGLISPPRSRFSPK------PL--------PPRPSARIKPNEHLSH----RKDSQPSALRGTKKTDTTSSSSSSSSSDHNKKKLESNNSTPKSAAKTNTK
        RGLISPPR+   P       P+        PPRP+  + P+E + +    +  S  +A+R T       +  +     +++KKL+ NN+       ++TK
Subjt:  RGLISPPRSRFSPK------PL--------PPRPSARIKPNEHLSH----RKDSQPSALRGTKKTDTTSSSSSSSSSDHNKKKLESNNSTPKSAAKTNTK

Query:  LLSSSPINHQPTPSS----KSNT----KEAIGSGSRSD-----------------SKPN-----GVGHQQQHVKIGNDLNHSLSA----EAQSLLHQLSI
          +  P  HQ   +      +NT    K    SGSRSD                 S PN     G G +Q    +GN+   + ++    +  + LH+LS 
Subjt:  LLSSSPINHQPTPSS----KSNT----KEAIGSGSRSD-----------------SKPN-----GVGHQQQHVKIGNDLNHSLSA----EAQSLLHQLSI

Query:  EGK-------------------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ-----SYRKRKSCAIFLVGLHDS
         G                      +CSSQSN     RIFEIYK+IASHRQGNS ITSYFT+LK LWDEL T+ D+ Q      + +R+    FLVGL+D 
Subjt:  EGK-------------------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ-----SYRKRKSCAIFLVGLHDS

Query:  YSSTCSQILVMSPFPTLEESFVVIIREETR
        YS+ C QIL++ PFPT+E+++ ++IREE R
Subjt:  YSSTCSQILVMSPFPTLEESFVVIIREETR

A0A6J1C6U3 uncharacterized protein LOC111008934 isoform X1; e value 2.0e-21; identity 33.33%
Query:  RGLISPPRSRFSPK------PL--------PPRPSARIKPNEHLSH----RKDSQPSALRGTKKTDTTSSSSSSSSSDHNKKKLESNNSTPKSAAKTNTK
        RGLISPPR+   P       P+        PPRP+  + P+E + +    +  S  +A+R T       +  +     +++KKL+ NN+       ++TK
Subjt:  RGLISPPRSRFSPK------PL--------PPRPSARIKPNEHLSH----RKDSQPSALRGTKKTDTTSSSSSSSSSDHNKKKLESNNSTPKSAAKTNTK

Query:  LLSSSPINHQPTPSS----KSNT----KEAIGSGSRSD-----------------SKPN-----GVGHQQQHVKIGNDLNHSLSA----EAQSLLHQLSI
          +  P  HQ   +      +NT    K    SGSRSD                 S PN     G G +Q    +GN+   + ++    +  + LH+LS 
Subjt:  LLSSSPINHQPTPSS----KSNT----KEAIGSGSRSD-----------------SKPN-----GVGHQQQHVKIGNDLNHSLSA----EAQSLLHQLSI

Query:  EGK----------------------------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ-----SYRKRKSCA
         G                               +CSSQSN     RIFEIYK+IASHRQGNS ITSYFT+LK LWDEL T+ D+ Q      + +R+   
Subjt:  EGK----------------------------KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ-----SYRKRKSCA

Query:  IFLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETR
         FLVGL+D YS+ C QIL++ PFPT+E+++ ++IREE R
Subjt:  IFLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETR

A0A6J1C7L7 uncharacterized protein LOC111008986; e value 6.8e-22; identity 34.5%
Query:  GRGLISPPRSRFSPKPLPPRPSARIKP---NEHLSHRKDSQPSALRGTKKTDTTSSSSSSSSSDHNKKKLESNNSTPKSAAKTNTKLLSSSPINHQPTPS
        GRGLISPP+SRFS      + +A   P     ++S  + S  + +   K+  TT +  +  S+     K  S   TP   ++       ++P  H    +
Subjt:  GRGLISPPRSRFSPKPLPPRPSARIKP---NEHLSHRKDSQPSALRGTKKTDTTSSSSSSSSSDHNKKKLESNNSTPKSAAKTNTKLLSSSPINHQPTPS

Query:  SKSNTKEAIGS-------------GSRSDSKPNGVGHQQQHVKIGNDLNHSLSAEAQ------------SLLHQLSIEGK--------------------
        +K    +   S             G    S  +G  H   H +  N  N+++  E +            + L QLSI+GK                    
Subjt:  SKSNTKEAIGS-------------GSRSDSKPNGVGHQQQHVKIGNDLNHSLSAEAQ------------SLLHQLSIEGK--------------------

Query:  KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ--SYR----------KRKSCAIFLVGLHDSYSSTCSQILVMSPFP
        KEECS QSNA    RI EIYK+IASHRQGNS ITSYFTKL+ LW+EL T+ D+PQ  SY           +R+    FLVGL+DSYS+ CSQIL++ PFP
Subjt:  KEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ--SYR----------KRKSCAIFLVGLHDSYSSTCSQILVMSPFP

Query:  TLEESFVVIIREE
        T+E+++ +II +E
Subjt:  TLEESFVVIIREE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like; e value 6.6e-41; identity 40.71%
Query:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----
        M RG+ISPPRSR SP+   P                    RP+  I PNE  +HRK+ QP+ + R TK TD +S+      S  N K   S  + P    
Subjt:  MGRGLISPPRSRFSPKPLPP--------------------RPSARIKPNEHLSHRKDSQPSAL-RGTKKTDTTSSSSSSSSSDHNKKKLESNNSTP----

Query:  -----KSAAKTNTKLLSSSPINHQP-TPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQL
             K+A KT T+   SSP   +P TP SKSN K A GSGSRSD   +KP            +G  + QQ  +I      G+    +LS      LHQL
Subjt:  -----KSAAKTNTKLLSSSPINHQP-TPSSKSNTKEAIGSGSRSD---SKP------------NGVGHQQQHVKI------GNDLNHSLSAEAQSLLHQL

Query:  SI---------------------EGKKEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----------SYRKRKSCA
        S+                     E K+EECSSQ   NN +R+F+IYKEIASH QGNS ITSY TKLKALWDEL  +ID P+             +R+   
Subjt:  SI---------------------EGKKEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQ----------SYRKRKSCA

Query:  IFLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL
         FL+GL+DSYS+ C+QIL M PFPT+E++   I+REE RREL+ SLE +AAKV  NN    N H+   NGDN+ ++ +  N+ + + D+NE++
Subjt:  IFLVGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKV--NNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink); e value 1.3e-04; identity 24%
Query:  RIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMP---------------QSYRKRKSCAIFLVG--LHDSYSSTCSQILVMSPFPTLEESFVVI
        +I+++ + +A+ RQG   +  YF KL  +W EL+ +  +P               +  R+++    FL+G  L+  + +  ++I+   P P+L E+F ++
Subjt:  RIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMP---------------QSYRKRKSCAIFLVG--LHDSYSSTCSQILVMSPFPTLEESFVVI
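
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how a percent identity relates to that line, the share of letter columns can be counted. The function name `percent_identity` is hypothetical, and the input here is only the first 20 columns of the first GenBank hit, so the result is not expected to match the whole-alignment values reported by the database.

```python
def percent_identity(midline: str) -> float:
    """Percentage of alignment columns whose midline character is a letter."""
    if not midline:
        raise ValueError("empty midline")
    # Letters mark identical residues; '+' and spaces are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# First 20 midline columns of the KAG6573173.1 alignment (illustrative segment only).
segment = "M RG+ISPPRSR SP+   P"
print(round(percent_identity(segment), 1))
```

The midline must keep its full width for the count to be meaningful, so trailing spaces should not be stripped before calling the function.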


Sequences
CDS sequence
ATGGGAAGAGGGCTTATCAGTCCCCCGAGATCCAGATTTTCTCCGAAGCCCTTACCTCCTCGGCCATCGGCACGGATTAAACCCAACGAGCACCTGAGTCATAGAAAAGA
CTCTCAACCCAGCGCTTTGCGAGGAACCAAAAAGACCGACACAACATCGTCATCATCATCATCGTCGTCGTCAGATCATAACAAGAAAAAATTAGAGAGCAACAATTCCA
CTCCCAAAAGCGCTGCTAAAACAAATACTAAATTATTATCATCTTCCCCTATAAATCATCAACCTACGCCTTCATCAAAGAGCAACACAAAAGAAGCGATTGGTAGTGGT
TCGAGATCCGATAGTAAGCCGAATGGGGTGGGCCATCAGCAACAACATGTAAAAATTGGGAATGATCTGAACCATTCTCTATCAGCTGAAGCTCAATCTCTATTGCATCA
GCTTTCTATTGAAGGTAAGAAAGAAGAATGTTCTTCTCAAAGCAATGCCAATAATGGTGCAAGAATATTTGAAATTTACAAGGAAATTGCATCTCATCGTCAGGGAAACT
CCCCAATTACATCTTACTTTACAAAGCTGAAGGCATTATGGGATGAACTTGCAACCTTCATTGATATGCCTCAATCATATCGAAAGAGAAAAAGTTGTGCAATTTTTCTT
GTGGGACTGCACGATTCTTACTCCTCAACTTGCTCCCAAATTCTTGTTATGAGCCCATTTCCAACGCTGGAGGAATCTTTTGTGGTAATAATTCGAGAAGAAACGCGAAG
GGAATTGCTTTCGTCATTGGAAAGTATTGCAGCAAAAGTTAACAATTGCTTTCCTCATAATGCTCATAATTCGAACAACAATGGTGATAATGATATTATTGAACAAATTG
ATAATAATGTTAACGAGCAACAAGTTGATGAGAATGAATCATTAGACGAGGAATTAGGCCTAGTGCTCAACGAGGTCTTGGAACACTAA
mRNA sequence
ATGGGAAGAGGGCTTATCAGTCCCCCGAGATCCAGATTTTCTCCGAAGCCCTTACCTCCTCGGCCATCGGCACGGATTAAACCCAACGAGCACCTGAGTCATAGAAAAGA
CTCTCAACCCAGCGCTTTGCGAGGAACCAAAAAGACCGACACAACATCGTCATCATCATCATCGTCGTCGTCAGATCATAACAAGAAAAAATTAGAGAGCAACAATTCCA
CTCCCAAAAGCGCTGCTAAAACAAATACTAAATTATTATCATCTTCCCCTATAAATCATCAACCTACGCCTTCATCAAAGAGCAACACAAAAGAAGCGATTGGTAGTGGT
TCGAGATCCGATAGTAAGCCGAATGGGGTGGGCCATCAGCAACAACATGTAAAAATTGGGAATGATCTGAACCATTCTCTATCAGCTGAAGCTCAATCTCTATTGCATCA
GCTTTCTATTGAAGGTAAGAAAGAAGAATGTTCTTCTCAAAGCAATGCCAATAATGGTGCAAGAATATTTGAAATTTACAAGGAAATTGCATCTCATCGTCAGGGAAACT
CCCCAATTACATCTTACTTTACAAAGCTGAAGGCATTATGGGATGAACTTGCAACCTTCATTGATATGCCTCAATCATATCGAAAGAGAAAAAGTTGTGCAATTTTTCTT
GTGGGACTGCACGATTCTTACTCCTCAACTTGCTCCCAAATTCTTGTTATGAGCCCATTTCCAACGCTGGAGGAATCTTTTGTGGTAATAATTCGAGAAGAAACGCGAAG
GGAATTGCTTTCGTCATTGGAAAGTATTGCAGCAAAAGTTAACAATTGCTTTCCTCATAATGCTCATAATTCGAACAACAATGGTGATAATGATATTATTGAACAAATTG
ATAATAATGTTAACGAGCAACAAGTTGATGAGAATGAATCATTAGACGAGGAATTAGGCCTAGTGCTCAACGAGGTCTTGGAACACTAA
Protein sequence
MGRGLISPPRSRFSPKPLPPRPSARIKPNEHLSHRKDSQPSALRGTKKTDTTSSSSSSSSSDHNKKKLESNNSTPKSAAKTNTKLLSSSPINHQPTPSSKSNTKEAIGSG
SRSDSKPNGVGHQQQHVKIGNDLNHSLSAEAQSLLHQLSIEGKKEECSSQSNANNGARIFEIYKEIASHRQGNSPITSYFTKLKALWDELATFIDMPQSYRKRKSCAIFL
VGLHDSYSSTCSQILVMSPFPTLEESFVVIIREETRRELLSSLESIAAKVNNCFPHNAHNSNNNGDNDIIEQIDNNVNEQQVDENESLDEELGLVLNEVLEH
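
The CDS and protein sequences above can be cross-checked by translating codons. The following is a minimal sketch that assumes the standard nuclear genetic code; the `translate` helper is hypothetical and not a CuGenDB tool. It is applied only to the first 24 and last 9 bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGGGAAGAGGGCTTATCAGTCCC"))  # first 8 codons of the CDS
print(translate("GAACACTAA"))                 # last 3 codons of the CDS
```

The two fragments translate to `MGRGLISP` and `EH*`, which agree with the start and end of the protein sequence listed above.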