; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G004190 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G004190
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionlight-harvesting complex-like protein 3 isotype 1, chloroplastic
Genome locationGy14Chr4:2748973..2750366
RNA-Seq ExpressionCsGy4G004190
SyntenyCsGy4G004190
Gene Ontology termsGO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146894.2 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucumis sativus]8.31e-173100Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
        MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
Subjt:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD

Query:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
        LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
Subjt:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF

Query:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK
        LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK
Subjt:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK

XP_008453842.1 PREDICTED: uncharacterized protein LOC103494445 [Cucumis melo]1.05e-15188.8Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQR-VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG
        MASIAISASLPRA+N NHVSMKKKQ+  HLAKP YSS TKNPTPKVISTLDV NRDDGAAQQ  + RGN AS+ V + E++NDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRASNSNHVSMKKKQR-VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT
        TWDLNMFV+NGKMDWEGVIVEEAKRRKFLE+HPEAATN+EPV+FRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY VDALTG+GIVGQSGNFICKT
Subjt:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK
        ALFLTVIGVLLFRQSED+ENLRNIAEEATFYDKQWQSSWQK
Subjt:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK

XP_022984403.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita maxima]3.70e-11875.11Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
        MASIAISAS  R+  SNHVS KK+Q    A+P YSS TKNPTP++I T DV NRD       R GN ASA  K EED +D   N+G+ FTD+RWKNGTWD
Subjt:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD

Query:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
        LNMFV+NGKMDWEGVIV EAKRRKFLE++PE ATNQEPV+FRSSIIPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFI K ALF
Subjt:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF

Query:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
        +TVIGVLLFRQ++DIE LR +AEEATFYDKQWQ+SWQ
Subjt:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

XP_023551912.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita pepo subsp. pepo]1.13e-11573.84Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
        MASIAISAS PR+  SNHVS K +Q    A+P +SS TKNPTP++I T DV +RD   A + R GN ASA  K EED +D   N+G+ FTD+RWKNGTWD
Subjt:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD

Query:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
        LNMFV+NGKMDWEGVIV EAKRRKFLE++PE ATN EPV+FRSSIIPWW WLT+SYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFI K+ALF
Subjt:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF

Query:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
        +TVIGVLLFRQ++DIE LR +AEEATFYDKQWQ+SWQ
Subjt:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

XP_038875279.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Benincasa hispida]8.40e-12177.64Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
        MASIAISASLPRAS SNHV +K                KNPT KVISTLDV +RD G     R GN ASA  K EEDN DDWN+ GE+FTDKRWKNGTWD
Subjt:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD

Query:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
        LNMFV+NGKMDWEGVIV EAKRRKFLE++PEAATNQEPV+FRSS+IPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFICK ALF
Subjt:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF

Query:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
         TVIGVLLFRQ+EDIE+LRNIAEEAT YDKQWQ+SWQ
Subjt:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

TrEMBL top hitse value%identityAlignment
A0A0A0KUF5 Uncharacterized protein4.02e-173100Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
        MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
Subjt:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD

Query:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
        LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
Subjt:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF

Query:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK
        LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK
Subjt:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKPK

A0A1S3BXB8 uncharacterized protein LOC1034944455.07e-15288.8Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQR-VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG
        MASIAISASLPRA+N NHVSMKKKQ+  HLAKP YSS TKNPTPKVISTLDV NRDDGAAQQ  + RGN AS+ V + E++NDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRASNSNHVSMKKKQR-VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT
        TWDLNMFV+NGKMDWEGVIVEEAKRRKFLE+HPEAATN+EPV+FRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY VDALTG+GIVGQSGNFICKT
Subjt:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK
        ALFLTVIGVLLFRQSED+ENLRNIAEEATFYDKQWQSSWQK
Subjt:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK

A0A5D3CYT8 Chlorophyll A-B binding protein5.07e-15288.8Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQR-VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG
        MASIAISASLPRA+N NHVSMKKKQ+  HLAKP YSS TKNPTPKVISTLDV NRDDGAAQQ  + RGN AS+ V + E++NDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRASNSNHVSMKKKQR-VHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQ--YRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT
        TWDLNMFV+NGKMDWEGVIVEEAKRRKFLE+HPEAATN+EPV+FRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY VDALTG+GIVGQSGNFICKT
Subjt:  TWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK
        ALFLTVIGVLLFRQSED+ENLRNIAEEATFYDKQWQSSWQK
Subjt:  ALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQK

A0A6J1E3P1 light-harvesting complex-like protein 3 isotype 1, chloroplastic1.07e-11574.26Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
        MASIAISAS PR+  SNHVS K+++R   A+P YSS TKNPTP++I T  V +RD   A + R GN ASA  K EED +D   N+G+ FTD+RWKNGTWD
Subjt:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD

Query:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
        LNMFV+NGKMDWEGVIV EAKRRKFLE++PE ATN EPV+FRSSIIPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFI K ALF
Subjt:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF

Query:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
        +TVIGVLLFRQ++DIE LR +AEEATFYDKQWQ+SWQ
Subjt:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

A0A6J1JAE2 light-harvesting complex-like protein 3 isotype 1, chloroplastic1.79e-11875.11Show/hide
Query:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD
        MASIAISAS  R+  SNHVS KK+Q    A+P YSS TKNPTP++I T DV NRD       R GN ASA  K EED +D   N+G+ FTD+RWKNGTWD
Subjt:  MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWD

Query:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF
        LNMFV+NGKMDWEGVIV EAKRRKFLE++PE ATNQEPV+FRSSIIPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTG+GIVGQSGNFI K ALF
Subjt:  LNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALF

Query:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
        +TVIGVLLFRQ++DIE LR +AEEATFYDKQWQ+SWQ
Subjt:  LTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

SwissProt top hitse value%identityAlignment
Q6NKS4 Light-harvesting complex-like protein 3 isotype 2, chloroplastic1.2e-5259.21Show/hide
Query:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV
        ++ + +W NGTWDL  F ++GK DW+ VIV EAKRRK+LE +PE  +N E VVF +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTGVG+V
Subjt:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKP
         Q GNF CKT LF+ V GVL  R++ED++ L+++ +E T YDKQWQ++W++P
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKP

Q9SYX1 Light-harvesting complex-like protein 3 isotype 1, chloroplastic2.9e-5462Show/hide
Query:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV
        +F D RW NGTWDL  F ++GK DW+ VIV EAKRRK+LE +PE  +N EPV+F +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTGVG+V
Subjt:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
         Q GNF CKT LF+ V GVL  R++ED++ L+N+ +E T YDKQWQ++W+
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

Arabidopsis top hitse value%identityAlignment
AT4G17600.1 Chlorophyll A-B binding family protein2.0e-5562Show/hide
Query:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV
        +F D RW NGTWDL  F ++GK DW+ VIV EAKRRK+LE +PE  +N EPV+F +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTGVG+V
Subjt:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ
         Q GNF CKT LF+ V GVL  R++ED++ L+N+ +E T YDKQWQ++W+
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ

AT5G47110.1 Chlorophyll A-B binding family protein8.5e-5459.21Show/hide
Query:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV
        ++ + +W NGTWDL  F ++GK DW+ VIV EAKRRK+LE +PE  +N E VVF +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTGVG+V
Subjt:  RFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKP
         Q GNF CKT LF+ V GVL  R++ED++ L+++ +E T YDKQWQ++W++P
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAATTGCTATCTCTGCCTCATTGCCAAGAGCATCCAATTCAAATCATGTCTCCATGAAGAAAAAACAACGTGTTCATCTAGCTAAACCCGGCTATTCCTCAAC
GACGAAAAACCCAACACCAAAGGTAATTAGTACTCTTGATGTCGTGAACCGAGACGATGGTGCTGCACAGCAGTATCGTCGTGGAAATGCTGCATCAGCTGTGGTTAAGG
AAGAGGAAGATAATAATGATGATTGGAATAACAATGGTGAAAGATTTACTGATAAGAGATGGAAAAATGGAACTTGGGATCTCAATATGTTTGTACAGAATGGTAAGATG
GATTGGGAAGGTGTCATTGTTGAAGAAGCAAAGAGGAGGAAGTTTCTTGAAATACATCCAGAAGCAGCTACAAATCAAGAACCTGTGGTCTTCAGAAGCTCCATTATACC
TTGGTGGGTATGGCTCACCAAGTCCTACCTTCCACAAGCCGAACTACTTAATGGTAGGGCAGCGATGATAGGGTTCTTCATGGGGTATGCAGTGGATGCATTAACAGGAG
TTGGAATAGTTGGGCAAAGCGGGAATTTCATATGCAAAACAGCTCTTTTTTTAACTGTCATTGGCGTGTTGCTGTTTAGGCAAAGTGAAGATATTGAGAACTTAAGGAAT
ATAGCTGAAGAAGCCACCTTCTATGACAAGCAATGGCAATCTTCATGGCAAAAACCAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAATTGCTATCTCTGCCTCATTGCCAAGAGCATCCAATTCAAATCATGTCTCCATGAAGAAAAAACAACGTGTTCATCTAGCTAAACCCGGCTATTCCTCAAC
GACGAAAAACCCAACACCAAAGGTAATTAGTACTCTTGATGTCGTGAACCGAGACGATGGTGCTGCACAGCAGTATCGTCGTGGAAATGCTGCATCAGCTGTGGTTAAGG
AAGAGGAAGATAATAATGATGATTGGAATAACAATGGTGAAAGATTTACTGATAAGAGATGGAAAAATGGAACTTGGGATCTCAATATGTTTGTACAGAATGGTAAGATG
GATTGGGAAGGTGTCATTGTTGAAGAAGCAAAGAGGAGGAAGTTTCTTGAAATACATCCAGAAGCAGCTACAAATCAAGAACCTGTGGTCTTCAGAAGCTCCATTATACC
TTGGTGGGTATGGCTCACCAAGTCCTACCTTCCACAAGCCGAACTACTTAATGGTAGGGCAGCGATGATAGGGTTCTTCATGGGGTATGCAGTGGATGCATTAACAGGAG
TTGGAATAGTTGGGCAAAGCGGGAATTTCATATGCAAAACAGCTCTTTTTTTAACTGTCATTGGCGTGTTGCTGTTTAGGCAAAGTGAAGATATTGAGAACTTAAGGAAT
ATAGCTGAAGAAGCCACCTTCTATGACAAGCAATGGCAATCTTCATGGCAAAAACCAAAATGA
Protein sequenceShow/hide protein sequence
MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRDDGAAQQYRRGNAASAVVKEEEDNNDDWNNNGERFTDKRWKNGTWDLNMFVQNGKM
DWEGVIVEEAKRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDALTGVGIVGQSGNFICKTALFLTVIGVLLFRQSEDIENLRN
IAEEATFYDKQWQSSWQKPK