CuGenDBv2

Sgr008391 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr008391
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: light-harvesting complex-like protein 3 isotype 1, chloroplastic
Genome location: tig00006714:22838..24413
RNA-Seq Expression: Sgr008391
Synteny: Sgr008391
Gene Ontology terms: GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domains: NA


Homology

GenBank top hits
XP_008453842.1 PREDICTED: uncharacterized protein LOC103494445 [Cucumis melo] (e value: 5.2e-89; % identity: 72.18)
Query:  MAAIAISASMPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAE-----RRRKLALGKDKLEEATDVWNHGGCGSHSERFTD
        MA+IAISAS+PRA N  H  + KK+QHAH  KP YSSMTKNPTP+V  T++V +RD  G A+     R    +    K+EE  D WN+ G     ERFTD
Subjt:  MAAIAISASMPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAE-----RRRKLALGKDKLEEATDVWNHGGCGSHSERFTD

Query:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG
        +RWKNGTWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PEAATN+EPVLFRSSIIPWW WL KSYLPQAELLNGRAAM+GFFM Y VDALT IGIVGQSG
Subjt:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG

Query:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ
        NFI K ALF+TVIGVLLFRQ+EDVE LR +A+EATFYDKQWQ+SWQ Q
Subjt:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ

XP_022141114.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Momordica charantia] (e value: 1.8e-89; % identity: 75.31)
Query:  MAAI--AISASMPRACNSQHHVPKKKQHAH-QTKPVYSSMTKNPTPEVRT---VEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDE
        MA+I  AISAS+P  C SQHHV KKKQHAH Q +P YS     P PEV +   V+   RDSF  AER      G+DKL E TD WNHG      ERFTDE
Subjt:  MAAI--AISASMPRACNSQHHVPKKKQHAH-QTKPVYSSMTKNPTPEVRT---VEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDE

Query:  RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGN
        RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRK LE+YPEAATNQ PVLFRSSIIPWWAWL  SYLPQAELLNGRAAMLGFF+AY+VDALT IGIV QSGN
Subjt:  RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGN

Query:  FIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQAS
        F+ K+ALFVTVI VLLFRQTED EGLRKLA+EATFYDKQWQAS
Subjt:  FIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQAS

XP_022922577.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita moschata] (e value: 8.0e-98; % identity: 77.46)
Query:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDERWKNG
        MA+IAISAS PR+C S H     KQ   Q +P YSSMTKNPTPE ++T  V DRD+    +R   +A   DKLEE TD  NHG      + FTDERWKNG
Subjt:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDERWKNG

Query:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA
        TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATN EPVLFRSSIIPWWAWL KSYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI KA
Subjt:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA

Query:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        ALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

XP_022984403.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita maxima] (e value: 1.3e-100; % identity: 79.59)
Query:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRR-KLALGKDKLEEATDVWNHGGCGSHSERFTDERWKN
        MA+IAISAS  R+C S +HV KK+QHA Q +P YSSMTKNPTPE +RT +V +RD+  DA +R   +A   DKLEE TD  NHG      + FTDERWKN
Subjt:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRR-KLALGKDKLEEATDVWNHGGCGSHSERFTDERWKN

Query:  GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGK
        GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATNQEPVLFRSSIIPWWAWL KSYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI K
Subjt:  GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGK

Query:  AALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        AALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  AALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

XP_023551912.1 light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita pepo subsp. pepo] (e value: 2.3e-100; % identity: 78.28)
Query:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDERWKNG
        MA+IAISAS PR+C S +HV K +QHA Q +P +SSMTKNPTPE +RT +V DRD+F   +R   +A   DKLEE TD  NHG      + FTDERWKNG
Subjt:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDERWKNG

Query:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA
        TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATN EPVLFRSSIIPWWAWL +SYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI K+
Subjt:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA

Query:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        ALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

TrEMBL top hits
A0A1S3BXB8 uncharacterized protein LOC103494445 (e value: 2.5e-89; % identity: 72.18)
Query:  MAAIAISASMPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAE-----RRRKLALGKDKLEEATDVWNHGGCGSHSERFTD
        MA+IAISAS+PRA N  H  + KK+QHAH  KP YSSMTKNPTP+V  T++V +RD  G A+     R    +    K+EE  D WN+ G     ERFTD
Subjt:  MAAIAISASMPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAE-----RRRKLALGKDKLEEATDVWNHGGCGSHSERFTD

Query:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG
        +RWKNGTWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PEAATN+EPVLFRSSIIPWW WL KSYLPQAELLNGRAAM+GFFM Y VDALT IGIVGQSG
Subjt:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG

Query:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ
        NFI K ALF+TVIGVLLFRQ+EDVE LR +A+EATFYDKQWQ+SWQ Q
Subjt:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ

A0A5D3CYT8 Chlorophyll A-B binding protein (e value: 2.5e-89; % identity: 72.18)
Query:  MAAIAISASMPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAE-----RRRKLALGKDKLEEATDVWNHGGCGSHSERFTD
        MA+IAISAS+PRA N  H  + KK+QHAH  KP YSSMTKNPTP+V  T++V +RD  G A+     R    +    K+EE  D WN+ G     ERFTD
Subjt:  MAAIAISASMPRACNSQH-HVPKKKQHAHQTKPVYSSMTKNPTPEV-RTVEVRDRDSFGDAE-----RRRKLALGKDKLEEATDVWNHGGCGSHSERFTD

Query:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG
        +RWKNGTWDLNMFVKNGKMDWEGVIV EAKRRKFLEL+PEAATN+EPVLFRSSIIPWW WL KSYLPQAELLNGRAAM+GFFM Y VDALT IGIVGQSG
Subjt:  ERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSG

Query:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ
        NFI K ALF+TVIGVLLFRQ+EDVE LR +A+EATFYDKQWQ+SWQ Q
Subjt:  NFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQ

A0A6J1CHN0 light-harvesting complex-like protein 3 isotype 1, chloroplastic (e value: 8.7e-90; % identity: 75.31)
Query:  MAAI--AISASMPRACNSQHHVPKKKQHAH-QTKPVYSSMTKNPTPEVRT---VEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDE
        MA+I  AISAS+P  C SQHHV KKKQHAH Q +P YS     P PEV +   V+   RDSF  AER      G+DKL E TD WNHG      ERFTDE
Subjt:  MAAI--AISASMPRACNSQHHVPKKKQHAH-QTKPVYSSMTKNPTPEVRT---VEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDE

Query:  RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGN
        RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRK LE+YPEAATNQ PVLFRSSIIPWWAWL  SYLPQAELLNGRAAMLGFF+AY+VDALT IGIV QSGN
Subjt:  RWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGN

Query:  FIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQAS
        F+ K+ALFVTVI VLLFRQTED EGLRKLA+EATFYDKQWQAS
Subjt:  FIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQAS

A0A6J1E3P1 light-harvesting complex-like protein 3 isotype 1, chloroplastic (e value: 3.9e-98; % identity: 77.46)
Query:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDERWKNG
        MA+IAISAS PR+C S H     KQ   Q +P YSSMTKNPTPE ++T  V DRD+    +R   +A   DKLEE TD  NHG      + FTDERWKNG
Subjt:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDERWKNG

Query:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA
        TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATN EPVLFRSSIIPWWAWL KSYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI KA
Subjt:  TWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKA

Query:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        ALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  ALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

A0A6J1JAE2 light-harvesting complex-like protein 3 isotype 1, chloroplastic (e value: 6.4e-101; % identity: 79.59)
Query:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRR-KLALGKDKLEEATDVWNHGGCGSHSERFTDERWKN
        MA+IAISAS  R+C S +HV KK+QHA Q +P YSSMTKNPTPE +RT +V +RD+  DA +R   +A   DKLEE TD  NHG      + FTDERWKN
Subjt:  MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPE-VRTVEVRDRDSFGDAERRR-KLALGKDKLEEATDVWNHGGCGSHSERFTDERWKN

Query:  GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGK
        GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPE ATNQEPVLFRSSIIPWWAWL KSYLPQAELLNGRAAM+GFFMAYLVDALT IGIVGQSGNFI K
Subjt:  GTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGK

Query:  AALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE
        AALFVTVIGVLLFRQT+D+EGLRKLA+EATFYDKQWQASWQNQNE
Subjt:  AALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNE

SwissProt top hits
Q6NKS4 Light-harvesting complex-like protein 3 isotype 2, chloroplastic (e value: 2.5e-49; % identity: 55.9)
Query:  RFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV
        ++ + +W NGTWDL  F K+GK DW+ VIV+EAKRRK+LE  PE  +N E V+F +SIIPWWAW+ + +LP+AELLNGRAAM+GFFMAY VD+LT +G+V
Subjt:  RFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV

Query:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEKPERKRK
         Q GNF  K  LFV V GVL  R+ ED++ L+ L  E T YDKQWQA+W+  +      +K
Subjt:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEKPERKRK

Q9SYX1 Light-harvesting complex-like protein 3 isotype 1, chloroplastic (e value: 7.4e-54; % identity: 60.87)
Query:  RFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV
        +F D RW NGTWDL  F K+GK DW+ VIVAEAKRRK+LE  PE  +N EPVLF +SIIPWWAW+ + +LP+AELLNGRAAM+GFFMAY VD+LT +G+V
Subjt:  RFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV

Query:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEKPERKRK
         Q GNF  K  LFV V GVL  R+ EDV+ L+ L  E T YDKQWQA+W+N +++    +K
Subjt:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEKPERKRK

Arabidopsis top hits
AT4G17600.1 Chlorophyll A-B binding family protein (e value: 5.3e-55; % identity: 60.87)
Query:  RFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV
        +F D RW NGTWDL  F K+GK DW+ VIVAEAKRRK+LE  PE  +N EPVLF +SIIPWWAW+ + +LP+AELLNGRAAM+GFFMAY VD+LT +G+V
Subjt:  RFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV

Query:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEKPERKRK
         Q GNF  K  LFV V GVL  R+ EDV+ L+ L  E T YDKQWQA+W+N +++    +K
Subjt:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEKPERKRK

AT5G47110.1 Chlorophyll A-B binding family protein (e value: 1.7e-50; % identity: 55.9)
Query:  RFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV
        ++ + +W NGTWDL  F K+GK DW+ VIV+EAKRRK+LE  PE  +N E V+F +SIIPWWAW+ + +LP+AELLNGRAAM+GFFMAY VD+LT +G+V
Subjt:  RFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIV

Query:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEKPERKRK
         Q GNF  K  LFV V GVL  R+ ED++ L+ L  E T YDKQWQA+W+  +      +K
Subjt:  GQSGNFIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQASWQNQNEKPERKRK
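
The % identity reported for each hit can be roughly cross-checked against the alignment blocks: in the middle line of each block, a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The Python sketch below counts letters in the middle line and divides by the number of aligned columns. The function names are hypothetical, and the assumption that the database computes identity in exactly this way (over all aligned columns, gaps included) is not confirmed by this page, so small differences from the listed values are possible.

```python
def count_identities(query_line: str, midline: str) -> tuple[int, int]:
    """Return (identical_columns, aligned_columns) for one alignment block."""
    # Scraped midlines may lose trailing spaces, so pad to the query width.
    midline = midline.ljust(len(query_line))
    # Letters in the midline mark identical residues; '+' and ' ' do not count.
    identical = sum(ch.isalpha() for ch in midline)
    return identical, len(query_line)


def percent_identity(blocks: list[tuple[str, str]]) -> float:
    """Combine several (query_line, midline) blocks into one percentage."""
    identical = aligned = 0
    for query_line, midline in blocks:
        i, n = count_identities(query_line, midline)
        identical += i
        aligned += n
    if aligned == 0:
        raise ValueError("no aligned columns supplied")
    return round(100.0 * identical / aligned, 2)


# Last block of the Momordica charantia hit (sequence text only,
# without the "Query:" prefix or its indentation).
query = "FIGKAALFVTVIGVLLFRQTEDVEGLRKLAQEATFYDKQWQAS"
midline = "F+ K+ALFVTVI VLLFRQTED EGLRKLA+EATFYDKQWQAS"
print(percent_identity([(query, midline)]))
```

To check a whole hit, pass every block of that alignment in one list so the percentage is taken over the full aligned length rather than a single block.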


Sequences
CDS sequence
ATGGCTGCAATTGCTATCTCAGCCTCAATGCCAAGAGCATGCAATTCACAACATCATGTCCCCAAGAAGAAACAACATGCTCATCAAACCAAACCTGTCTATTCCTCGAT
GACGAAAAACCCGACCCCGGAGGTTCGTACTGTCGAAGTCAGGGATCGGGACAGTTTTGGCGATGCTGAGCGACGTAGAAAACTTGCGTTGGGTAAAGACAAGCTAGAGG
AAGCTACTGATGTTTGGAACCATGGTGGCTGTGGCTCACATTCAGAAAGGTTCACAGATGAGAGATGGAAAAATGGGACATGGGATCTGAATATGTTTGTGAAGAATGGC
AAGATGGATTGGGAAGGTGTCATTGTTGCAGAAGCAAAGAGGAGGAAGTTTCTTGAATTGTATCCAGAAGCAGCAACAAATCAAGAACCAGTCCTCTTCAGAAGCTCCAT
CATACCTTGGTGGGCATGGCTCATAAAGTCATACCTCCCACAGGCAGAACTACTTAATGGTAGGGCAGCAATGCTAGGGTTCTTCATGGCCTATCTAGTAGATGCATTAA
CAGAAATTGGAATTGTTGGGCAAAGTGGGAATTTCATAGGCAAAGCAGCTCTCTTTGTGACTGTCATAGGTGTGTTGCTCTTTAGGCAAACTGAAGATGTTGAGGGCTTG
AGAAAGCTAGCTCAAGAAGCCACCTTCTATGACAAGCAATGGCAAGCTTCATGGCAAAACCAAAATGAAAAACCTGAACGAAAAAGAAAATTTAGGTCTCGTTTGATAAC
GATATTAGTTGATTTTTTATTCTTTCTAGCTGGGTGTTATTATATTTTTGTGAGTGTTGGTTTTTGTCGCTAA
mRNA sequence
ATGGCTGCAATTGCTATCTCAGCCTCAATGCCAAGAGCATGCAATTCACAACATCATGTCCCCAAGAAGAAACAACATGCTCATCAAACCAAACCTGTCTATTCCTCGAT
GACGAAAAACCCGACCCCGGAGGTTCGTACTGTCGAAGTCAGGGATCGGGACAGTTTTGGCGATGCTGAGCGACGTAGAAAACTTGCGTTGGGTAAAGACAAGCTAGAGG
AAGCTACTGATGTTTGGAACCATGGTGGCTGTGGCTCACATTCAGAAAGGTTCACAGATGAGAGATGGAAAAATGGGACATGGGATCTGAATATGTTTGTGAAGAATGGC
AAGATGGATTGGGAAGGTGTCATTGTTGCAGAAGCAAAGAGGAGGAAGTTTCTTGAATTGTATCCAGAAGCAGCAACAAATCAAGAACCAGTCCTCTTCAGAAGCTCCAT
CATACCTTGGTGGGCATGGCTCATAAAGTCATACCTCCCACAGGCAGAACTACTTAATGGTAGGGCAGCAATGCTAGGGTTCTTCATGGCCTATCTAGTAGATGCATTAA
CAGAAATTGGAATTGTTGGGCAAAGTGGGAATTTCATAGGCAAAGCAGCTCTCTTTGTGACTGTCATAGGTGTGTTGCTCTTTAGGCAAACTGAAGATGTTGAGGGCTTG
AGAAAGCTAGCTCAAGAAGCCACCTTCTATGACAAGCAATGGCAAGCTTCATGGCAAAACCAAAATGAAAAACCTGAACGAAAAAGAAAATTTAGGTCTCGTTTGATAAC
GATATTAGTTGATTTTTTATTCTTTCTAGCTGGGTGTTATTATATTTTTGTGAGTGTTGGTTTTTGTCGCTAA
Protein sequence
MAAIAISASMPRACNSQHHVPKKKQHAHQTKPVYSSMTKNPTPEVRTVEVRDRDSFGDAERRRKLALGKDKLEEATDVWNHGGCGSHSERFTDERWKNGTWDLNMFVKNG
KMDWEGVIVAEAKRRKFLELYPEAATNQEPVLFRSSIIPWWAWLIKSYLPQAELLNGRAAMLGFFMAYLVDALTEIGIVGQSGNFIGKAALFVTVIGVLLFRQTEDVEGL
RKLAQEATFYDKQWQASWQNQNEKPERKRKFRSRLITILVDFLFFLAGCYYIFVSVGFCR
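
The protein sequence should be recoverable by translating the CDS listed above. The Python sketch below does this with the standard genetic code (NCBI translation table 1), which is assumed here because the page does not say which table was used. `translate` is a hypothetical helper, not part of CuGenDB; it expects the CDS as a single string and ignores line breaks.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nucleotides of the CDS listed on this page.
print(translate("ATGGCTGCAATTGCTATCTCAGCCTCAATGCCAAGA"))  # MAAIAISASMPR
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on the record.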