; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029984 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029984
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionlight-harvesting complex-like protein 3 isotype 1, chloroplastic
Genome locationtig00153554:1778702..1780091
RNA-Seq ExpressionSgr029984
SyntenySgr029984
Gene Ontology termsGO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453842.1 PREDICTED: uncharacterized protein LOC103494445 [Cucumis melo]1.7e-8871.77Show/hide
Query:  MAAIAISASLPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAELRRK-----LALGKDKLEEATDVWNHGGCGSHSERFKD
        MA+IAISASLPRA N  H  + KK+QHAH  KP YSSMTKNPTP+V  T++V +RD  G A+ +        +    K+EE  D WN+ G     ERF D
Subjt:  MAAIAISASLPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAELRRK-----LALGKDKLEEATDVWNHGGCGSHSERFKD

Query:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG
        +RWKNGTWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PEAATN+EPVLFRSSIIPWW WL KSYLPQAELLNGRAAM+GFFM Y VDALT IGIVGQSG
Subjt:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG

Query:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ
        NFI K ALF+TVIGVLLFRQ+EDVE LR +A+EATFYDKQWQ+SWQ Q
Subjt:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ

XP_022141114.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Momordica charantia]1.7e-8874.9Show/hide
Query:  MAAI--AISASLPRACNSQHHVPKKKQHAH-QTKPVYSSMTKNPTPEVRT---VEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDE
        MA+I  AISASLP  C SQHHV KKKQHAH Q +P YS     P PEV +   V+   RDSF  AE       G+DKL E TD WNHG      ERF DE
Subjt:  MAAI--AISASLPRACNSQHHVPKKKQHAH-QTKPVYSSMTKNPTPEVRT---VEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDE

Query:  RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGN
        RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRK LE+YPEAATNQ PVLFRSSIIPWWAWL  SYLPQAELLNGRAAMLGFF+AY+VDALT IGIV QSGN
Subjt:  RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGN

Query:  FIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQAS
        F+ K+ALFVTVI VLLFRQTED EGLRKLA+EATFYDKQWQAS
Subjt:  FIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQAS

XP_022922577.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita moschata]1.7e-9676.64Show/hide
Query:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDERWKNG
        MA+IAISAS PR+C S H     KQ   Q +P YSSMTKNPTPE ++T  V DRD+    +    +A   DKLEE TD  NHG      + F DERWKNG
Subjt:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDERWKNG

Query:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA
        TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATN EPVLFRSSIIPWWAWL KSYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI KA
Subjt:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA

Query:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        ALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

XP_022984403.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita maxima]1.3e-9979.18Show/hide
Query:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRR-KLALGKDKLEEATDVWNHGGCGSHSERFKDERWKN
        MA+IAISAS  R+C S +HV KK+QHA Q +P YSSMTKNPTPE +RT +V +RD+  DA  R   +A   DKLEE TD  NHG      + F DERWKN
Subjt:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRR-KLALGKDKLEEATDVWNHGGCGSHSERFKDERWKN

Query:  GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGK
        GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATNQEPVLFRSSIIPWWAWL KSYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI K
Subjt:  GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGK

Query:  AALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        AALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  AALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

XP_023551912.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita pepo subsp. pepo]4.9e-9977.46Show/hide
Query:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDERWKNG
        MA+IAISAS PR+C S +HV K +QHA Q +P +SSMTKNPTPE +RT +V DRD+F   +    +A   DKLEE TD  NHG      + F DERWKNG
Subjt:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDERWKNG

Query:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA
        TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATN EPVLFRSSIIPWWAWL +SYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI K+
Subjt:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA

Query:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        ALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

TrEMBL top hitse value%identityAlignment
A0A1S3BXB8 uncharacterized protein LOC1034944458.4e-8971.77Show/hide
Query:  MAAIAISASLPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAELRRK-----LALGKDKLEEATDVWNHGGCGSHSERFKD
        MA+IAISASLPRA N  H  + KK+QHAH  KP YSSMTKNPTP+V  T++V +RD  G A+ +        +    K+EE  D WN+ G     ERF D
Subjt:  MAAIAISASLPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAELRRK-----LALGKDKLEEATDVWNHGGCGSHSERFKD

Query:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG
        +RWKNGTWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PEAATN+EPVLFRSSIIPWW WL KSYLPQAELLNGRAAM+GFFM Y VDALT IGIVGQSG
Subjt:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG

Query:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ
        NFI K ALF+TVIGVLLFRQ+EDVE LR +A+EATFYDKQWQ+SWQ Q
Subjt:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ

A0A5D3CYT8 Chlorophyll A-B binding protein8.4e-8971.77Show/hide
Query:  MAAIAISASLPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAELRRK-----LALGKDKLEEATDVWNHGGCGSHSERFKD
        MA+IAISASLPRA N  H  + KK+QHAH  KP YSSMTKNPTP+V  T++V +RD  G A+ +        +    K+EE  D WN+ G     ERF D
Subjt:  MAAIAISASLPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAELRRK-----LALGKDKLEEATDVWNHGGCGSHSERFKD

Query:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG
        +RWKNGTWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PEAATN+EPVLFRSSIIPWW WL KSYLPQAELLNGRAAM+GFFM Y VDALT IGIVGQSG
Subjt:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG

Query:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ
        NFI K ALF+TVIGVLLFRQ+EDVE LR +A+EATFYDKQWQ+SWQ Q
Subjt:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ

A0A6J1CHN0 light-harvesting complex-like protein 3 isotype 1, chloroplastic8.4e-8974.9Show/hide
Query:  MAAI--AISASLPRACNSQHHVPKKKQHAH-QTKPVYSSMTKNPTPEVRT---VEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDE
        MA+I  AISASLP  C SQHHV KKKQHAH Q +P YS     P PEV +   V+   RDSF  AE       G+DKL E TD WNHG      ERF DE
Subjt:  MAAI--AISASLPRACNSQHHVPKKKQHAH-QTKPVYSSMTKNPTPEVRT---VEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDE

Query:  RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGN
        RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRK LE+YPEAATNQ PVLFRSSIIPWWAWL  SYLPQAELLNGRAAMLGFF+AY+VDALT IGIV QSGN
Subjt:  RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGN

Query:  FIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQAS
        F+ K+ALFVTVI VLLFRQTED EGLRKLA+EATFYDKQWQAS
Subjt:  FIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQAS

A0A6J1E3P1 light-harvesting complex-like protein 3 isotype 1, chloroplastic8.4e-9776.64Show/hide
Query:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDERWKNG
        MA+IAISAS PR+C S H     KQ   Q +P YSSMTKNPTPE ++T  V DRD+    +    +A   DKLEE TD  NHG      + F DERWKNG
Subjt:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDERWKNG

Query:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA
        TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATN EPVLFRSSIIPWWAWL KSYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI KA
Subjt:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA

Query:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        ALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

A0A6J1JAE2 light-harvesting complex-like protein 3 isotype 1, chloroplastic6.2e-10079.18Show/hide
Query:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRR-KLALGKDKLEEATDVWNHGGCGSHSERFKDERWKN
        MA+IAISAS  R+C S +HV KK+QHA Q +P YSSMTKNPTPE +RT +V +RD+  DA  R   +A   DKLEE TD  NHG      + F DERWKN
Subjt:  MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAELRR-KLALGKDKLEEATDVWNHGGCGSHSERFKDERWKN

Query:  GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGK
        GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATNQEPVLFRSSIIPWWAWL KSYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI K
Subjt:  GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGK

Query:  AALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        AALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  AALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

SwissProt top hitse value%identityAlignment
Q6NKS4 Light-harvesting complex-like protein 3 isotype 2, chloroplastic2.8e-4959.33Show/hide
Query:  RFKDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV
        ++++ +W NGTWDL  F K+GK DW+ VIV+EAKRRK+LE  PE  +N E V+F +SIIPWWAW+ + +LP+AELLNGRAAM+GFFMAY VD+LT +G+V
Subjt:  RFKDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV

Query:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQ
         Q GNF  K  LFV V GVL  R+ ED++ L+ L  E T YDKQWQA+W+
Subjt:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQ

Q9SYX1 Light-harvesting complex-like protein 3 isotype 1, chloroplastic8.4e-5462.58Show/hide
Query:  RFKDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV
        +F+D RW NGTWDL  F K+GK DW+ VIVAEAKRRK+LE  PE  +N EPVLF +SIIPWWAW+ + +LP+AELLNGRAAM+GFFMAY VD+LT +G+V
Subjt:  RFKDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV

Query:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEK
         Q GNF  K  LFV V GVL  R+ EDV+ L+ L  E T YDKQWQA+W+N +++
Subjt:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEK

Arabidopsis top hitse value%identityAlignment
AT4G17600.1 Chlorophyll A-B binding family protein6.0e-5562.58Show/hide
Query:  RFKDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV
        +F+D RW NGTWDL  F K+GK DW+ VIVAEAKRRK+LE  PE  +N EPVLF +SIIPWWAW+ + +LP+AELLNGRAAM+GFFMAY VD+LT +G+V
Subjt:  RFKDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV

Query:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEK
         Q GNF  K  LFV V GVL  R+ EDV+ L+ L  E T YDKQWQA+W+N +++
Subjt:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEK

AT5G47110.1 Chlorophyll A-B binding family protein2.0e-5059.33Show/hide
Query:  RFKDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV
        ++++ +W NGTWDL  F K+GK DW+ VIV+EAKRRK+LE  PE  +N E V+F +SIIPWWAW+ + +LP+AELLNGRAAM+GFFMAY VD+LT +G+V
Subjt:  RFKDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV

Query:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQ
         Q GNF  K  LFV V GVL  R+ ED++ L+ L  E T YDKQWQA+W+
Subjt:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCAATTGCTATCTCAGCCTCATTGCCAAGAGCATGCAATTCACAACATCATGTCCCCAAGAAGAAACAACATGCTCATCAAACCAAACCTGTCTATTCCTCGAT
GACGAAAAACCCCACCCCGGAGGTTCGTACTGTCGAGGTCAGGGATCGGGACAGTTTTGGCGATGCTGAGCTACGTAGAAAACTTGCATTGGGTAAAGACAAGCTAGAGG
AAGCTACTGATGTTTGGAACCATGGTGGCTGTGGCTCACATTCAGAAAGGTTCAAAGATGAGAGATGGAAAAATGGGACATGGGATCTGAATATGTTTGTGAAGAATGGC
AAGATGGATTGGGAAGGTGTCATTGTTGCAGAAGCAAAGAGGAGGAAGTTTCTTGAATTGTATCCAGAAGCAGCAACAAATCAAGAACCAGTCCTCTTCAGAAGCTCCAT
CATACCTTGGTGGGCATGGCTCATAAAGTCATACCTCCCACAGGCAGAACTACTTAATGGTAGGGCAGCAATGCTAGGGTTCTTCATGGCCTATCTAGTAGATGCATTAA
CAGAAATTGGAATTGTTGGGCAAAGTGGGAATTTCATAGGCAAAGCAGCTCTCTTTGTGACTGTCATAGGTGTGTTGCTGTTTAGGCAAACTGAAGATGTTGAGGGCTTG
AGAAAGCTAGCTCAAGAGGCCACCTTCTATGACAAGCAATGGCAAGCTTCATGGCAAAATCAAAATGAAAAACCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCAATTGCTATCTCAGCCTCATTGCCAAGAGCATGCAATTCACAACATCATGTCCCCAAGAAGAAACAACATGCTCATCAAACCAAACCTGTCTATTCCTCGAT
GACGAAAAACCCCACCCCGGAGGTTCGTACTGTCGAGGTCAGGGATCGGGACAGTTTTGGCGATGCTGAGCTACGTAGAAAACTTGCATTGGGTAAAGACAAGCTAGAGG
AAGCTACTGATGTTTGGAACCATGGTGGCTGTGGCTCACATTCAGAAAGGTTCAAAGATGAGAGATGGAAAAATGGGACATGGGATCTGAATATGTTTGTGAAGAATGGC
AAGATGGATTGGGAAGGTGTCATTGTTGCAGAAGCAAAGAGGAGGAAGTTTCTTGAATTGTATCCAGAAGCAGCAACAAATCAAGAACCAGTCCTCTTCAGAAGCTCCAT
CATACCTTGGTGGGCATGGCTCATAAAGTCATACCTCCCACAGGCAGAACTACTTAATGGTAGGGCAGCAATGCTAGGGTTCTTCATGGCCTATCTAGTAGATGCATTAA
CAGAAATTGGAATTGTTGGGCAAAGTGGGAATTTCATAGGCAAAGCAGCTCTCTTTGTGACTGTCATAGGTGTGTTGCTGTTTAGGCAAACTGAAGATGTTGAGGGCTTG
AGAAAGCTAGCTCAAGAGGCCACCTTCTATGACAAGCAATGGCAAGCTTCATGGCAAAATCAAAATGAAAAACCCTGA
Protein sequenceShow/hide protein sequence
MAAIAISASLPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPEVRTVEVRDRDSFGDAELRRKLALGKDKLEEATDVWNHGGCGSHSERFKDERWKNGTWDLNMFVKNG
KMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKAALFVTVIGVLLFRQTEDVEGL
RKLAQEATFYDKQWQASWQNQNEKP