CuGenDBv2

Spg015889 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015889
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: BEST Arabidopsis thaliana protein match is: embryo defective 1303.
Genome location: scaffold6:39762699..39771104
RNA-Seq Expression: Spg015889
Synteny: Spg015889
Gene Ontology terms: GO:0009658 - chloroplast organization (biological process)
GO:0010027 - thylakoid membrane organization (biological process)
GO:0009507 - chloroplast (cellular component)
InterPro domains: IPR040299 - Protein FERTILITY RESTORER RF2-like


Homology
GenBank top hits | e value | % identity | Alignment
KAG6577548.1 Protein YELLOW LEAF 1, choloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 6.4e-77 | 83.7
Query:  RLEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRK-NNAFMCFAALNARC
        ++EMLTT+SGA+L LLPP        +GQHWG+MK KDF LRRD+ QLQTPN RRQ VIAKA P+FLKPIPSTGT+GGVLYS+RK NNAF+CFAALNARC
Subjt:  RLEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRK-NNAFMCFAALNARC

Query:  AAEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        AAEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  AAEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

KAG6596785.1 Protein YELLOW LEAF 1, choloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 2.9e-77 | 84.62
Query:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA
        +EMLTT SGA+L LLPP        +GQHWGRMKA +FHL RD+FQ++T N RRQPVIAKA PL++KPI STG RGGVLYSSRKNNAF+CFAALNARCAA
Subjt:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA

Query:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_023005574.1 uncharacterized protein LOC111498519 isoform X1 [Cucurbita maxima] | 8.9e-79 | 85.71
Query:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA
        +EMLTTNSGA+LPLLPP        +GQHWGRMKA +FHL RD+FQ++T N RRQPVIAKA PL++KPI STGT GGVLYSSRKNNAF+CFAALNARCAA
Subjt:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA

Query:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_023540533.1 uncharacterized protein LOC111800870 [Cucurbita pepo subsp. pepo] | 8.1e-80 | 86.26
Query:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA
        +EMLTTNSGA+LPLLPP        +GQHWGRM+A +FHL RD+FQ+QT N RRQPVIAKA PL++KPI STGTRGGVLYSSRKNNAF+CFAALNARCAA
Subjt:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA

Query:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_023553392.1 uncharacterized protein LOC111810820 isoform X1 [Cucurbita pepo subsp. pepo] | 3.8e-77 | 84.7
Query:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRK-NNAFMCFAALNARCA
        +EMLTT+SGA+L LLPP        +GQHWG+MKAKDF LRRD+ QLQTPN RRQ VIAKA P+FLKPIPSTGT+GGVLYS+RK NNAF+CFAALNARCA
Subjt:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRK-NNAFMCFAALNARCA

Query:  AEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        AEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  AEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CP11 uncharacterized protein LOC111013392 | 1.5e-76 | 84.07
Query:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA
        +EMLTTNSGA+LPLLPP        +G+HWGR KAK+FHL RD+FQLQ  N R Q V+AK  PLFLKPIPS GTR  VLYSSRK NAF CFAALNARCAA
Subjt:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA

Query:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
Subjt:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1F258 uncharacterized protein LOC111438781 | 5.3e-77 | 84.15
Query:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRK-NNAFMCFAALNARCA
        +EMLTT+SGA+L LLPP        +GQHWG+MK KDF LRRD+ QLQTPN RRQ VIAKA P+FLKPIPSTGT+GGVLYS+RK NNAF+CFAALNARCA
Subjt:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRK-NNAFMCFAALNARCA

Query:  AEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        AEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  AEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1GD79 uncharacterized protein LOC111453117 | 9.0e-77 | 84.07
Query:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA
        +EMLTT SGA+L LLPP        +GQHWGRMKA +FHL  D+FQ++T N RRQPVIAKA PL++KPI STG RGGVLYSSRKNNAF+CFAALNARCAA
Subjt:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA

Query:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1HQC3 uncharacterized protein LOC111465127 | 5.3e-77 | 84.15
Query:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRK-NNAFMCFAALNARCA
        +EMLTTNSGA+L LLPP        +GQHWG+MK KDF LRRD+ QLQTPN RRQ VIAKA P+FLKPIPSTGT+GG LYS+RK NNAF+CFAALNARCA
Subjt:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRK-NNAFMCFAALNARCA

Query:  AEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        AEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  AEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1KZM4 uncharacterized protein LOC111498519 isoform X1 | 4.3e-79 | 85.71
Query:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA
        +EMLTTNSGA+LPLLPP        +GQHWGRMKA +FHL RD+FQ++T N RRQPVIAKA PL++KPI STGT GGVLYSSRKNNAF+CFAALNARCAA
Subjt:  LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAA

Query:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  EQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

SwissProt top hits | e value | % identity | Alignment
F1SZ41 Protein FERTILITY RESTORER RF2, mitochondrial | 2.5e-07 | 49.4
Query:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE
        + RC A QTQ+  R++ T TV      GK + P+LDDG  GFPP   G GGGGGGGGGG  N+ GGF  F  +  L +LK+ E
Subjt:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE

F1SZ42 Protein FERTILITY RESTORER RF2, mitochondrial | 2.5e-07 | 49.4
Query:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE
        + RC A QTQ+  R++ T TV      GK + P+LDDG  GFPP   G GGGGGGGGGG  N+ GGF  F  +  L +LK+ E
Subjt:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE

F1SZ44 Protein FERTILITY RESTORER RF2, mitochondrial | 1.5e-07 | 49.4
Query:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE
        + RC A QTQ+  R++ T TV      GK + P+LDDG  GFPP   G GGGGGGGGGG  N+ GGF  F  +  L +LK+ E
Subjt:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE

Q0E3V2 Protein YELLOW LEAF 1, choloroplastic | 5.0e-08 | 44.44
Query:  CFAALNARCAAEQTQTVTRE------APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        C A++   C A QTQT  R+      +P    +  K +SP+LDDG +GFPP   G GGGGGGGGG N +GGF  F  +  L +L  +E E   QN  RR
Subjt:  CFAALNARCAAEQTQTVTRE------APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

Arabidopsis top hits | e value | % identity | Alignment
AT1G30475.1 BEST Arabidopsis thaliana protein match is: embryo defective 1303 (TAIR:AT1G56200.1) | 1.6e-25 | 52.9
Query:  RRDTFQLQTPNERRQP--VIAKAEPLFLKPIPS-TGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPR
        R  +F     ++RR P  V  + +P      PS       V+ S  K   F C +ALN++C+  QTQTVTR++PTIT  P  GK KSP+LDDG +GFPPR
Subjt:  RRDTFQLQTPNERRQP--VIAKAEPLFLKPIPS-TGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPR

Query:  DDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE
        DDG GGGGGGGGGG+ SGGFF FGFL F+G+LKD E E
Subjt:  DDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE

AT1G30475.2 FUNCTIONS IN: molecular_function unknown | 1.6e-25 | 52.9
Query:  RRDTFQLQTPNERRQP--VIAKAEPLFLKPIPS-TGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPR
        R  +F     ++RR P  V  + +P      PS       V+ S  K   F C +ALN++C+  QTQTVTR++PTIT  P  GK KSP+LDDG +GFPPR
Subjt:  RRDTFQLQTPNERRQP--VIAKAEPLFLKPIPS-TGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPR

Query:  DDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE
        DDG GGGGGGGGGG+ SGGFF FGFL F+G+LKD E E
Subjt:  DDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE

AT1G30475.3 FUNCTIONS IN: molecular_function unknown | 1.6e-25 | 52.9
Query:  RRDTFQLQTPNERRQP--VIAKAEPLFLKPIPS-TGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPR
        R  +F     ++RR P  V  + +P      PS       V+ S  K   F C +ALN++C+  QTQTVTR++PTIT  P  GK KSP+LDDG +GFPPR
Subjt:  RRDTFQLQTPNERRQP--VIAKAEPLFLKPIPS-TGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPR

Query:  DDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE
        DDG GGGGGGGGGG+ SGGFF FGFL F+G+LKD E E
Subjt:  DDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE

AT1G56200.1 embryo defective 1303 | 6.1e-33 | 63.87
Query:  LKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF
        + P+ + G   G+   SR+ +  +C +A+NA+C+  QTQTVTRE+PTIT  P   KEKSP LDDG  GFPPRDDGD GGGGGGGGGNWSGGFFFFGFLAF
Subjt:  LKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF

Query:  LGFLKDKESEGPYQNDRRR
        LG LKDKE E  Y+  RRR
Subjt:  LGFLKDKESEGPYQNDRRR

AT1G56200.2 embryo defective 1303 | 6.1e-33 | 63.87
Query:  LKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF
        + P+ + G   G+   SR+ +  +C +A+NA+C+  QTQTVTRE+PTIT  P   KEKSP LDDG  GFPPRDDGD GGGGGGGGGNWSGGFFFFGFLAF
Subjt:  LKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF

Query:  LGFLKDKESEGPYQNDRRR
        LG LKDKE E  Y+  RRR
Subjt:  LGFLKDKESEGPYQNDRRR


Sequences
CDS sequence
ATGCCATTTGACGATTCCATTAGGGTTCTTGAGTTTTCAAAGTCTCTAAGTGTTAAAGAAGTGAAAGATTGTTTTATTGCAGCCTATCCACTTTCTTATACACAAAGAAA
AGTGCCACTTCATTTTGTTCTTCCAATCCTCGAGTCTGATACTGTGAACCCAAATATAGTTGAAAATATCTCAAGTCTGTTGGTTTTCTCCGTCAAGCAAGCAACCACTT
CAAATTTTTCATCAACTGATCCTTCTTGTCCTCTCTCCTCTCCAACAACATCTGAGAGTTGTCTAACCAAAGGCATCATTCTTTTGGAAAAAGACCCCACATCTTCTCGA
ACTCCTTGTGTTGATGATACTAATGTTGAATCTGAAGTCAGTCTAAGTAGCTTGGATTCCATCCTTTCGCCCATGGAAAATAAAGTTGATGATATTGAGATTGAAGATTC
ATTACTAGAAGACCTAGGTTCTTTATTTAAAAAATGTGACAAGGAGACTCTTCATCCTCCATCCTTAGTGCCTAGCAAATTTGCTTCTTTTATTGAAGTTTGTGGTCTTG
AGTTACATGCGTTTGCTAGGACTATCTATAAATATTGTGGAGCTACATATGCATATGACTCTAAGAGAAACGAGACAACAAATATGAAAAGACACTTAGAGAAATGTAAG
GAGTATATAAGTGAGAGGATTATGTTGAAGGAGAGAGGGATTCTGAATTTGCCGCTGTTGCGCCACCATTGTTGGTGTTGTTGGTTGCCGCCACACTTCGCCTCTAGTCG
TCTCTCAGTATTTCTAGCTGTAAGCGTGCACTCAAGGACTATTGCACACTTGCATTCATTTGCAAAATATGGACTACTTCAGAATCACATTTCATTTGAAGCTCATAACT
CAACAAGAAGGAGGCTATTTGGCCTTTGGAGGTCCTTGTGGATTGGAATTTTAAAGGCAAGGCTAAGGTTTTTAGGAATTGTGCGGTGTCAGGTTGTCATCCACCCTCGC
CTGGAGATGCTTACTACAAATTCTGGTGCCCTGCTACCTCTTCTACCCCCAGGTAGACTAGAGCAGACTTGCAAGATGGGTCAGCATTGGGGAAGAATGAAGGCCAAAGA
TTTCCATCTTAGAAGGGATACTTTCCAACTTCAAACTCCTAATGAGAGGAGACAACCAGTGATTGCTAAAGCAGAACCACTTTTTCTCAAACCAATTCCATCTACTGGGA
CGAGAGGAGGCGTTCTATATTCTAGTAGAAAGAACAATGCTTTCATGTGTTTTGCTGCTTTGAATGCTAGATGTGCAGCGGAGCAGACCCAGACTGTTACGAGAGAGGCT
CCTACAATCACTGTTCTTCCTGGCAAGGAAAAGTCACCGCAACTGGATGATGGTGATTCTGGATTCCCACCTCGTGATGATGGTGATGGTGGCGGTGGCGGCGGAGGCGG
CGGAGGCAACTGGTCTGGTGGATTCTTCTTCTTTGGCTTCCTTGCCTTCCTTGGGTTCTTAAAAGATAAAGAAAGTGAAGGGCCTTATCAGAATGATCGGAGAAGATAA
mRNA sequence
ATGCCATTTGACGATTCCATTAGGGTTCTTGAGTTTTCAAAGTCTCTAAGTGTTAAAGAAGTGAAAGATTGTTTTATTGCAGCCTATCCACTTTCTTATACACAAAGAAA
AGTGCCACTTCATTTTGTTCTTCCAATCCTCGAGTCTGATACTGTGAACCCAAATATAGTTGAAAATATCTCAAGTCTGTTGGTTTTCTCCGTCAAGCAAGCAACCACTT
CAAATTTTTCATCAACTGATCCTTCTTGTCCTCTCTCCTCTCCAACAACATCTGAGAGTTGTCTAACCAAAGGCATCATTCTTTTGGAAAAAGACCCCACATCTTCTCGA
ACTCCTTGTGTTGATGATACTAATGTTGAATCTGAAGTCAGTCTAAGTAGCTTGGATTCCATCCTTTCGCCCATGGAAAATAAAGTTGATGATATTGAGATTGAAGATTC
ATTACTAGAAGACCTAGGTTCTTTATTTAAAAAATGTGACAAGGAGACTCTTCATCCTCCATCCTTAGTGCCTAGCAAATTTGCTTCTTTTATTGAAGTTTGTGGTCTTG
AGTTACATGCGTTTGCTAGGACTATCTATAAATATTGTGGAGCTACATATGCATATGACTCTAAGAGAAACGAGACAACAAATATGAAAAGACACTTAGAGAAATGTAAG
GAGTATATAAGTGAGAGGATTATGTTGAAGGAGAGAGGGATTCTGAATTTGCCGCTGTTGCGCCACCATTGTTGGTGTTGTTGGTTGCCGCCACACTTCGCCTCTAGTCG
TCTCTCAGTATTTCTAGCTGTAAGCGTGCACTCAAGGACTATTGCACACTTGCATTCATTTGCAAAATATGGACTACTTCAGAATCACATTTCATTTGAAGCTCATAACT
CAACAAGAAGGAGGCTATTTGGCCTTTGGAGGTCCTTGTGGATTGGAATTTTAAAGGCAAGGCTAAGGTTTTTAGGAATTGTGCGGTGTCAGGTTGTCATCCACCCTCGC
CTGGAGATGCTTACTACAAATTCTGGTGCCCTGCTACCTCTTCTACCCCCAGGTAGACTAGAGCAGACTTGCAAGATGGGTCAGCATTGGGGAAGAATGAAGGCCAAAGA
TTTCCATCTTAGAAGGGATACTTTCCAACTTCAAACTCCTAATGAGAGGAGACAACCAGTGATTGCTAAAGCAGAACCACTTTTTCTCAAACCAATTCCATCTACTGGGA
CGAGAGGAGGCGTTCTATATTCTAGTAGAAAGAACAATGCTTTCATGTGTTTTGCTGCTTTGAATGCTAGATGTGCAGCGGAGCAGACCCAGACTGTTACGAGAGAGGCT
CCTACAATCACTGTTCTTCCTGGCAAGGAAAAGTCACCGCAACTGGATGATGGTGATTCTGGATTCCCACCTCGTGATGATGGTGATGGTGGCGGTGGCGGCGGAGGCGG
CGGAGGCAACTGGTCTGGTGGATTCTTCTTCTTTGGCTTCCTTGCCTTCCTTGGGTTCTTAAAAGATAAAGAAAGTGAAGGGCCTTATCAGAATGATCGGAGAAGATAA
Protein sequence
MPFDDSIRVLEFSKSLSVKEVKDCFIAAYPLSYTQRKVPLHFVLPILESDTVNPNIVENISSLLVFSVKQATTSNFSSTDPSCPLSSPTTSESCLTKGIILLEKDPTSSR
TPCVDDTNVESEVSLSSLDSILSPMENKVDDIEIEDSLLEDLGSLFKKCDKETLHPPSLVPSKFASFIEVCGLELHAFARTIYKYCGATYAYDSKRNETTNMKRHLEKCK
EYISERIMLKERGILNLPLLRHHCWCCWLPPHFASSRLSVFLAVSVHSRTIAHLHSFAKYGLLQNHISFEAHNSTRRRLFGLWRSLWIGILKARLRFLGIVRCQVVIHPR
LEMLTTNSGALLPLLPPGRLEQTCKMGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAEPLFLKPIPSTGTRGGVLYSSRKNNAFMCFAALNARCAAEQTQTVTREA
PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
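
The protein sequence above should correspond to the CDS read codon by codon. The following is a minimal Python sketch for checking that correspondence on any stretch of the CDS. It assumes the standard nuclear genetic code (NCBI translation table 1) and an input containing only the bases A, C, G and T. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order
# for the first, second and third codon positions. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in directly. Ambiguous bases such
    as N are not handled and raise a KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the CDS listed above.
print(translate("ATGCCATTTGACGATTCC"))
```

Passing the full CDS, with its line breaks left in, gives a string that can be compared directly against the listed protein sequence.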