; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G04280 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G04280
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionlight-harvesting complex-like protein 3 isotype 1, chloroplastic
Genome locationChr4:2793146..2794687
RNA-Seq ExpressionCSPI04G04280
SyntenyCSPI04G04280
Gene Ontology termsGO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146894.2 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucumis sativus]1.4e-13099.58Show/hide
Query:  MASIAISASLPRASNSNHVSMKKK-RVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
        MASIAISASLPRASNSNHVSMKKK RVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
Subjt:  MASIAISASLPRASNSNHVSMKKK-RVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD

Query:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
        LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
Subjt:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF

Query:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK
        LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK
Subjt:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK

XP_008453842.1 PREDICTED: uncharacterized protein LOC103494445 [Cucumis melo]7.3e-11688.38Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKR--VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG
        MASIAISASLPRA+N NHVSMKKK+   HLAKP YSS TKNPTPKVISTLDV NRDDGAAQQ  + RGN AS+ V + E++NDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRASNSNHVSMKKKR--VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT
        TWDLNMFV+NGKMDWEGVIVEEAKRRKFLE+HPEAATN+EPV+FRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY VDALTG+GIVGQSGNFICKT
Subjt:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK
        ALFLTVIGVLLFRQSED+ENLRNIAEEATFYDKQWQSSWQK
Subjt:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK

XP_022984403.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita maxima]9.6e-9274.58Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL
        MASIAISAS  R+  SNHVS K++    A+P YSS TKNPTP++I T DV NRD       R GN ASA  K EED +D   N+G+ FTD+RWKNGTWDL
Subjt:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL

Query:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL
        NMFV+NGKMDWEGVIV EAKRRKFLE++PE ATNQEPV+FRSSIIPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFI K ALF+
Subjt:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL

Query:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
        TVIGVLLFRQ++DIE LR +AEEATFYDKQWQ+SWQ
Subjt:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

XP_023551912.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita pepo subsp. pepo]2.0e-8973.31Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL
        MASIAISAS PR+  SNHVS  ++    A+P +SS TKNPTP++I T DV +RD   A + R GN ASA  K EED +D   N+G+ FTD+RWKNGTWDL
Subjt:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL

Query:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL
        NMFV+NGKMDWEGVIV EAKRRKFLE++PE ATN EPV+FRSSIIPWW WLT+SYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFI K+ALF+
Subjt:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL

Query:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
        TVIGVLLFRQ++DIE LR +AEEATFYDKQWQ+SWQ
Subjt:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

XP_038875279.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Benincasa hispida]5.1e-9377.97Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL
        MASIAISASLPRAS SNHV +K               KNPT KVISTLDV +R DG     R GN ASA  K EED NDDW N+GE+FTDKRWKNGTWDL
Subjt:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL

Query:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL
        NMFV+NGKMDWEGVIV EAKRRKFLE++PEAATNQEPV+FRSS+IPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFICK ALF 
Subjt:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL

Query:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
        TVIGVLLFRQ+EDIE+LRNIAEEAT YDKQWQ+SWQ
Subjt:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

TrEMBL top hitse value%identityAlignment
A0A0A0KUF5 Uncharacterized protein6.6e-13199.58Show/hide
Query:  MASIAISASLPRASNSNHVSMKKK-RVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
        MASIAISASLPRASNSNHVSMKKK RVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
Subjt:  MASIAISASLPRASNSNHVSMKKK-RVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD

Query:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
        LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
Subjt:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF

Query:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK
        LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK
Subjt:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK

A0A1S3BXB8 uncharacterized protein LOC1034944453.5e-11688.38Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKR--VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG
        MASIAISASLPRA+N NHVSMKKK+   HLAKP YSS TKNPTPKVISTLDV NRDDGAAQQ  + RGN AS+ V + E++NDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRASNSNHVSMKKKR--VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT
        TWDLNMFV+NGKMDWEGVIVEEAKRRKFLE+HPEAATN+EPV+FRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY VDALTG+GIVGQSGNFICKT
Subjt:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK
        ALFLTVIGVLLFRQSED+ENLRNIAEEATFYDKQWQSSWQK
Subjt:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK

A0A5D3CYT8 Chlorophyll A-B binding protein3.5e-11688.38Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKR--VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG
        MASIAISASLPRA+N NHVSMKKK+   HLAKP YSS TKNPTPKVISTLDV NRDDGAAQQ  + RGN AS+ V + E++NDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRASNSNHVSMKKKR--VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT
        TWDLNMFV+NGKMDWEGVIVEEAKRRKFLE+HPEAATN+EPV+FRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY VDALTG+GIVGQSGNFICKT
Subjt:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK
        ALFLTVIGVLLFRQSED+ENLRNIAEEATFYDKQWQSSWQK
Subjt:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK

A0A6J1E3P1 light-harvesting complex-like protein 3 isotype 1, chloroplastic3.7e-8974.58Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL
        MASIAISAS PR+  SNHVS K++R   A+P YSS TKNPTP++I T  V +RD   A + R GN ASA  K EED +D   N+G+ FTD+RWKNGTWDL
Subjt:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL

Query:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL
        NMFV+NGKMDWEGVIV EAKRRKFLE++PE ATN EPV+FRSSIIPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFI K ALF+
Subjt:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL

Query:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
        TVIGVLLFRQ++DIE LR +AEEATFYDKQWQ+SWQ
Subjt:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

A0A6J1JAE2 light-harvesting complex-like protein 3 isotype 1, chloroplastic4.7e-9274.58Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL
        MASIAISAS  R+  SNHVS K++    A+P YSS TKNPTP++I T DV NRD       R GN ASA  K EED +D   N+G+ FTD+RWKNGTWDL
Subjt:  MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDL

Query:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL
        NMFV+NGKMDWEGVIV EAKRRKFLE++PE ATNQEPV+FRSSIIPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFI K ALF+
Subjt:  NMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFL

Query:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
        TVIGVLLFRQ++DIE LR +AEEATFYDKQWQ+SWQ
Subjt:  TVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

SwissProt top hitse value%identityAlignment
Q6NKS4 Light-harvesting complex-like protein 3 isotype 2, chloroplastic1.2e-5259.21Show/hide
Query:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV
        ++ + +W NGTWDL  F ++GK DW+ VIV EAKRRK+LE +PE  +N E VVF +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTGVG+V
Subjt:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKP
         Q GNF CKT LF+ V GVL  R++ED++ L+++ +E T YDKQWQ++W++P
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKP

Q9SYX1 Light-harvesting complex-like protein 3 isotype 1, chloroplastic2.8e-5462Show/hide
Query:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV
        +F D RW NGTWDL  F ++GK DW+ VIV EAKRRK+LE +PE  +N EPV+F +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTGVG+V
Subjt:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
         Q GNF CKT LF+ V GVL  R++ED++ L+N+ +E T YDKQWQ++W+
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

Arabidopsis top hitse value%identityAlignment
AT4G17600.1 Chlorophyll A-B binding family protein2.0e-5562Show/hide
Query:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV
        +F D RW NGTWDL  F ++GK DW+ VIV EAKRRK+LE +PE  +N EPV+F +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTGVG+V
Subjt:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
         Q GNF CKT LF+ V GVL  R++ED++ L+N+ +E T YDKQWQ++W+
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

AT5G47110.1 Chlorophyll A-B binding family protein8.5e-5459.21Show/hide
Query:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV
        ++ + +W NGTWDL  F ++GK DW+ VIV EAKRRK+LE +PE  +N E VVF +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTGVG+V
Subjt:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKP
         Q GNF CKT LF+ V GVL  R++ED++ L+++ +E T YDKQWQ++W++P
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAATTGCTATCTCTGCCTCATTGCCAAGAGCATCCAATTCAAATCATGTCTCCATGAAGAAAAAACGTGTTCATCTAGCTAAACCCGGCTATTCCTCAACGAC
GAAAAACCCAACACCAAAGGTAATTAGTACTCTTGATGTCGTGAACCGAGACGATGGCGCTGCGCAGCAGTATCGTCGTGGAAATGCTGCATCAGCTGTGGTTAAGGAAG
AGGAAGATAATAATGATGATTGGAATAACAATGGTGAAAGATTTACTGATAAGAGATGGAAAAATGGAACTTGGGATCTCAATATGTTTGTACAGAATGGTAAGATGGAT
TGGGAAGGTGTCATTGTTGAAGAAGCAAAGAGGAGGAAGTTTCTTGAAATACATCCAGAAGCAGCTACAAATCAAGAACCTGTGGTCTTCAGAAGCTCCATTATACCTTG
GTGGGTATGGCTCACCAAGTCCTACCTTCCACAAGCCGAACTACTTAATGGTAGGGCAGCGATGATAGGGTTCTTCATGGGGTATGCAGTGGATGCATTAACAGGAGTTG
GAATAGTTGGGCAAAGCGGGAATTTCATATGCAAAACAGCTCTTTTTTTAACTGTCATTGGCGTGTTGCTGTTTAGGCAAAGTGAAGATATTGAGAACTTAAGGAATATA
GCTGAAGAAGCCACCTTCTATGACAAGCAATGGCAATCTTCATGGCAAAAACCAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAATTGCTATCTCTGCCTCATTGCCAAGAGCATCCAATTCAAATCATGTCTCCATGAAGAAAAAACGTGTTCATCTAGCTAAACCCGGCTATTCCTCAACGAC
GAAAAACCCAACACCAAAGGTAATTAGTACTCTTGATGTCGTGAACCGAGACGATGGCGCTGCGCAGCAGTATCGTCGTGGAAATGCTGCATCAGCTGTGGTTAAGGAAG
AGGAAGATAATAATGATGATTGGAATAACAATGGTGAAAGATTTACTGATAAGAGATGGAAAAATGGAACTTGGGATCTCAATATGTTTGTACAGAATGGTAAGATGGAT
TGGGAAGGTGTCATTGTTGAAGAAGCAAAGAGGAGGAAGTTTCTTGAAATACATCCAGAAGCAGCTACAAATCAAGAACCTGTGGTCTTCAGAAGCTCCATTATACCTTG
GTGGGTATGGCTCACCAAGTCCTACCTTCCACAAGCCGAACTACTTAATGGTAGGGCAGCGATGATAGGGTTCTTCATGGGGTATGCAGTGGATGCATTAACAGGAGTTG
GAATAGTTGGGCAAAGCGGGAATTTCATATGCAAAACAGCTCTTTTTTTAACTGTCATTGGCGTGTTGCTGTTTAGGCAAAGTGAAGATATTGAGAACTTAAGGAATATA
GCTGAAGAAGCCACCTTCTATGACAAGCAATGGCAATCTTCATGGCAAAAACCAAAATGATATCATAATTAATACCCCTAATCAACAACATAAGCAAAAAGGCATTTAAT
TAAGTATCTCTCTCTATGTATATTTTGTTTCCAAAAATTAGACTTGTTTTTAACTCTCTTTATATTTTCATCTCCCCTTTCTGCCTTAATTAACTAATTTAGAAAGAATG
AAAATATTTGAGTCCTTT
Protein sequenceShow/hide protein sequence
MASIAISASLPRASNSNHVSMKKKRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDLNMFVQNGKMD
WEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFLTVIGVLLFRQSEDIENLRNI
AEEATFYDKQWQSSWQKPK