CuGenDBv2

IVF0007234 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0007234
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: light-harvesting complex-like protein 3 isotype 1, chloroplastic
Genome location: chr07:21548541..21549674
RNA-Seq Expression: IVF0007234
Synteny: IVF0007234
Gene Ontology terms: GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domains: NA
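As a small illustration of the genome location field, the following Python sketch splits a `chrom:start..end` string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr07:21548541..21549674")
# Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 1134 bp on chr07.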


Homology
GenBank top hits
XP_004146894.2 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucumis sativus] (e value: 1.45e-151; % identity: 88.8)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISASLPRA+N NHVSMKKKQ+  HLAKP YSS TKNPTPKVISTLDV NRDDGAAQQ  + RGN AS+ V + E++NDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFV+NGKMDWEGVIVEEAKRRKFLE+HPEAATN+EPV+FRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY VDALTG+GIVGQSGNFICKT
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQK
        ALFLTVIGVLLFRQSED+ENLRNIAEEATFYDKQWQSSWQK
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQK

XP_008453842.1 PREDICTED: uncharacterized protein LOC103494445 [Cucumis melo] (e value: 4.12e-177; % identity: 100)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQPK
        ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQPK
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQPK

XP_022984403.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita maxima] (e value: 1.14e-122; % identity: 77.27)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISAS  R+   NHVS  KKQQHA  A+P YSSMTKNPTP++I T DVGNRD      + H  GNVAS A  K+EED DD N+ G+ FTD+RWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PE ATN+EPVLFRSSIIPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTGIGIVGQSGNFI K 
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ
        ALF+TVIGVLLFRQ++D+E LR +AEEATFYDKQWQ+SWQ Q
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ

XP_023551912.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita pepo subsp. pepo] (e value: 3.64e-122; % identity: 76.45)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISAS PR+   NHVS  K QQHA  A+P +SSMTKNPTP++I T DVG+RD   A ++H   GNVAS A  K+EED DD N+ G+ FTD+RWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PE ATN EPVLFRSSIIPWW WLT+SYLPQAELLNGRAAMIGFFM Y VDALTGIGIVGQSGNFI K+
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ
        ALF+TVIGVLLFRQ++D+E LR +AEEATFYDKQWQ+SWQ Q
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ

XP_038875279.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Benincasa hispida] (e value: 3.46e-123; % identity: 77.69)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISASLPRA+  NHV +KK                 NPT KVISTLDVG+RD   A ++H   GNVAS A  KVEEDNDDWN+ GE+FTDKRWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PEAATN+EPVLFRSS+IPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTGIGIVGQSGNFICK 
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ
        ALF TVIGVLLFRQ+ED+E+LRNIAEEAT YDKQWQ+SWQ Q
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ

TrEMBL top hits
A0A0A0KUF5 Uncharacterized protein (e value: 4.3e-117; % identity: 88.8)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISASLPRA+N NHVSMKKKQ+  HLAKP YSS TKNPTPKVISTLDV NRDDGAAQQ  + RGN AS+ V + E++NDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFV+NGKMDWEGVIVEEAKRRKFLE+HPEAATN+EPV+FRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY VDALTG+GIVGQSGNFICKT
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQK
        ALFLTVIGVLLFRQSED+ENLRNIAEEATFYDKQWQSSWQK
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQK

A0A1S3BXB8 uncharacterized protein LOC103494445 (e value: 1.4e-136; % identity: 100)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQPK
        ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQPK
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQPK

A0A5D3CYT8 Chlorophyll A-B binding protein (e value: 1.4e-136; % identity: 100)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQPK
        ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQPK
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQPK

A0A6J1E3P1 light-harvesting complex-like protein 3 isotype 1, chloroplastic (e value: 2.1e-92; % identity: 75.21)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISAS PR+   NHVS K++++    A+P YSSMTKNPTP++I T  VG+RD   A ++H   GNVA SA  K+EED DD  N+G+ FTD+RWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PE ATN EPVLFRSSIIPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTGIGIVGQSGNFI K 
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ
        ALF+TVIGVLLFRQ++D+E LR +AEEATFYDKQWQ+SWQ Q
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ

A0A6J1JAE2 light-harvesting complex-like protein 3 isotype 1, chloroplastic (e value: 1.2e-95; % identity: 77.69)
Query:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG
        MASIAISAS  R+   NHVS  KKQQHA  A+P YSSMTKNPTP++I T DVGNRD  AA       GNVA SA  K+EED DD  N+G+ FTD+RWKNG
Subjt:  MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNG

Query:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT
        TWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PE ATN+EPVLFRSSIIPWW WLTKSYLPQAELLNGRAAMIGFFM Y VDALTGIGIVGQSGNFI K 
Subjt:  TWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKT

Query:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ
        ALF+TVIGVLLFRQ++D+E LR +AEEATFYDKQWQ+SWQ Q
Subjt:  ALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQKQ

SwissProt top hits
Q6NKS4 Light-harvesting complex-like protein 3 isotype 2, chloroplastic (e value: 3.0e-51; % identity: 58.28)
Query:  RFTDKRWKNGTWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIV
        ++ + +W NGTWDL  F K+GK DW+ VIV EAKRRK+LE +PE  +N+E V+F +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTG+G+V
Subjt:  RFTDKRWKNGTWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQK
         Q GNF CKT LF+ V GVL  R++ED++ L+++ +E T YDKQWQ++W++
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQK

Q9SYX1 Light-harvesting complex-like protein 3 isotype 1, chloroplastic (e value: 9.9e-55; % identity: 58.24)
Query:  ASSAVVKVEEDNDDWNNNGERFTDKRWKNGTWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRA
        A++  V VE +      +  +F D RW NGTWDL  F K+GK DW+ VIV EAKRRK+LE +PE  +N+EPVLF +SIIPWW W+ + +LP+AELLNGRA
Subjt:  ASSAVVKVEEDNDDWNNNGERFTDKRWKNGTWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRA

Query:  AMIGFFMGYGVDALTGIGIVGQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQ
        AMIGFFM Y VD+LTG+G+V Q GNF CKT LF+ V GVL  R++EDV+ L+N+ +E T YDKQWQ++W+
Subjt:  AMIGFFMGYGVDALTGIGIVGQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQ

Arabidopsis top hits
AT4G17600.1 Chlorophyll A-B binding family protein (e value: 7.1e-56; % identity: 58.24)
Query:  ASSAVVKVEEDNDDWNNNGERFTDKRWKNGTWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRA
        A++  V VE +      +  +F D RW NGTWDL  F K+GK DW+ VIV EAKRRK+LE +PE  +N+EPVLF +SIIPWW W+ + +LP+AELLNGRA
Subjt:  ASSAVVKVEEDNDDWNNNGERFTDKRWKNGTWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRA

Query:  AMIGFFMGYGVDALTGIGIVGQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQ
        AMIGFFM Y VD+LTG+G+V Q GNF CKT LF+ V GVL  R++EDV+ L+N+ +E T YDKQWQ++W+
Subjt:  AMIGFFMGYGVDALTGIGIVGQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQ

AT5G47110.1 Chlorophyll A-B binding family protein (e value: 2.1e-52; % identity: 58.28)
Query:  RFTDKRWKNGTWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIV
        ++ + +W NGTWDL  F K+GK DW+ VIV EAKRRK+LE +PE  +N+E V+F +SIIPWW W+ + +LP+AELLNGRAAMIGFFM Y VD+LTG+G+V
Subjt:  RFTDKRWKNGTWDLNMFVKNGKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIV

Query:  GQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQK
         Q GNF CKT LF+ V GVL  R++ED++ L+++ +E T YDKQWQ++W++
Subjt:  GQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSWQK
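
The % identity values in the hit lists can be related to the unlabeled match line that sits between each Query and Subjt row. In BLAST-style output a letter on that line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below counts these for a single alignment block; `midline_stats` is a hypothetical helper, and the exact way the page derives its reported percentages (for example how gaps enter the denominator) is an assumption here.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one match line."""
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    conservative = midline.count("+")                 # '+' = conservative substitution
    return identities, identities + conservative, len(midline)

# Match line of the last alignment block of the XP_022984403.1 hit, copied from above.
midline = "ALF+TVIGVLLFRQ++D+E LR +AEEATFYDKQWQ+SWQ Q"
identities, positives, length = midline_stats(midline)
print(f"{identities}/{length} identical ({100 * identities / length:.1f}%), "
      f"{positives}/{length} positive")
```

The reported % identity covers the whole alignment, so a per-block figure like this one will generally differ from it; summing the counts over all blocks of a hit gives the comparable number. Match lines must keep their leading and trailing spaces, since those are aligned columns.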


Sequences
CDS sequence
ATGGCTTCAATTGCTATCTCTGCCTCATTGCCAAGAGCAGCCAATCCAAATCATGTCTCCATGAAGAAAAAACAACAACATGCTCATCTAGCTAAACCTGCCTATTCCTC
AATGACGAAAAACCCAACACCAAAGGTAATTAGTACTCTTGATGTCGGGAACCGAGACGATGGCGCTGCGCAGCAGCAGCATCATCATCGTGGAAATGTTGCATCATCAG
CTGTGGTTAAGGTAGAGGAAGATAATGATGATTGGAATAACAATGGTGAAAGATTCACTGATAAGAGATGGAAAAATGGAACTTGGGATCTCAATATGTTTGTAAAGAAT
GGTAAGATGGACTGGGAAGGTGTCATCGTTGAAGAAGCAAAGAGGAGGAAGTTTCTTGAATTACATCCAGAAGCAGCTACAAATGAAGAACCTGTGCTCTTTAGAAGCTC
CATTATACCTTGGTGGGTATGGCTCACCAAGTCCTACCTTCCACAAGCCGAACTACTTAATGGTAGGGCAGCCATGATAGGGTTCTTCATGGGGTATGGAGTGGATGCAT
TAACAGGAATTGGAATAGTTGGACAAAGTGGGAATTTCATATGCAAAACAGCTCTTTTCTTAACTGTCATTGGTGTGTTGCTGTTTAGGCAAAGTGAAGATGTTGAGAAC
TTAAGAAATATAGCTGAAGAAGCCACCTTCTATGACAAGCAATGGCAATCTTCATGGCAAAAACAACCAAAATGA
mRNA sequence
ATGGCTTCAATTGCTATCTCTGCCTCATTGCCAAGAGCAGCCAATCCAAATCATGTCTCCATGAAGAAAAAACAACAACATGCTCATCTAGCTAAACCTGCCTATTCCTC
AATGACGAAAAACCCAACACCAAAGGTAATTAGTACTCTTGATGTCGGGAACCGAGACGATGGCGCTGCGCAGCAGCAGCATCATCATCGTGGAAATGTTGCATCATCAG
CTGTGGTTAAGGTAGAGGAAGATAATGATGATTGGAATAACAATGGTGAAAGATTCACTGATAAGAGATGGAAAAATGGAACTTGGGATCTCAATATGTTTGTAAAGAAT
GGTAAGATGGACTGGGAAGGTGTCATCGTTGAAGAAGCAAAGAGGAGGAAGTTTCTTGAATTACATCCAGAAGCAGCTACAAATGAAGAACCTGTGCTCTTTAGAAGCTC
CATTATACCTTGGTGGGTATGGCTCACCAAGTCCTACCTTCCACAAGCCGAACTACTTAATGGTAGGGCAGCCATGATAGGGTTCTTCATGGGGTATGGAGTGGATGCAT
TAACAGGAATTGGAATAGTTGGACAAAGTGGGAATTTCATATGCAAAACAGCTCTTTTCTTAACTGTCATTGGTGTGTTGCTGTTTAGGCAAAGTGAAGATGTTGAGAAC
TTAAGAAATATAGCTGAAGAAGCCACCTTCTATGACAAGCAATGGCAATCTTCATGGCAAAAACAACCAAAATGA
Protein sequence
MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNGTWDLNMFVKN
GKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKTALFLTVIGVLLFRQSEDVEN
LRNIAEEATFYDKQWQSSWQKQPK
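
The CDS and protein sequences above can be checked against each other by translating the CDS. The following Python sketch does this with the standard genetic code, which is assumed here because the page does not name a translation table; `translate` is a hypothetical helper, and the sequences are copied from the listings above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds: str) -> str:
    """Translate a nucleotide sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

CDS = (
    "ATGGCTTCAATTGCTATCTCTGCCTCATTGCCAAGAGCAGCCAATCCAAATCATGTCTCCATGAAGAAAAAACAACAACATGCTCATCTAGCTAAACCTGCCTATTCCTC"
    "AATGACGAAAAACCCAACACCAAAGGTAATTAGTACTCTTGATGTCGGGAACCGAGACGATGGCGCTGCGCAGCAGCAGCATCATCATCGTGGAAATGTTGCATCATCAG"
    "CTGTGGTTAAGGTAGAGGAAGATAATGATGATTGGAATAACAATGGTGAAAGATTCACTGATAAGAGATGGAAAAATGGAACTTGGGATCTCAATATGTTTGTAAAGAAT"
    "GGTAAGATGGACTGGGAAGGTGTCATCGTTGAAGAAGCAAAGAGGAGGAAGTTTCTTGAATTACATCCAGAAGCAGCTACAAATGAAGAACCTGTGCTCTTTAGAAGCTC"
    "CATTATACCTTGGTGGGTATGGCTCACCAAGTCCTACCTTCCACAAGCCGAACTACTTAATGGTAGGGCAGCCATGATAGGGTTCTTCATGGGGTATGGAGTGGATGCAT"
    "TAACAGGAATTGGAATAGTTGGACAAAGTGGGAATTTCATATGCAAAACAGCTCTTTTCTTAACTGTCATTGGTGTGTTGCTGTTTAGGCAAAGTGAAGATGTTGAGAAC"
    "TTAAGAAATATAGCTGAAGAAGCCACCTTCTATGACAAGCAATGGCAATCTTCATGGCAAAAACAACCAAAATGA"
)

PROTEIN = (
    "MASIAISASLPRAANPNHVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDDGAAQQQHHHRGNVASSAVVKVEEDNDDWNNNGERFTDKRWKNGTWDLNMFVKN"
    "GKMDWEGVIVEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYGVDALTGIGIVGQSGNFICKTALFLTVIGVLLFRQSEDVEN"
    "LRNIAEEATFYDKQWQSSWQKQPK"
)

translated = translate(CDS)
# Drop the terminal stop symbol before comparing with the listed protein.
print("CDS matches protein:", translated.rstrip("*") == PROTEIN)
```

The same check applies to the mRNA listing, which on this page carries the same nucleotide sequence as the CDS.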