; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G20900 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G20900
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPsbP domain-containing protein
Genome locationChr6:18978031..18981828
RNA-Seq ExpressionCSPI06G20900
SyntenyCSPI06G20900
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009654 - photosystem II oxygen evolving complex (cellular component)
GO:0019898 - extrinsic component of membrane (cellular component)
GO:0005509 - calcium ion binding (molecular function)
InterPro domainsIPR002683 - PsbP, C-terminal
IPR016123 - Mog1/PsbP, alpha/beta/alpha sandwich


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647284.1 hypothetical protein Csa_002893 [Cucumis sativus]2.2e-14898.53Show/hide
Query:  IVQKNPSLIFEWTTSMAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGS
        + ++NPSLIFEWTTSMAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGS
Subjt:  IVQKNPSLIFEWTTSMAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGS

Query:  NALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAA
        NALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAA
Subjt:  NALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAA

Query:  KLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        KLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
Subjt:  KLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

XP_004146765.1 psbP domain-containing protein 3, chloroplastic isoform X1 [Cucumis sativus]5.9e-141100Show/hide
Query:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT
        MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT
Subjt:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT

Query:  YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE
        YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE
Subjt:  YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE

Query:  YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
Subjt:  YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

XP_008464804.1 PREDICTED: psbP domain-containing protein 3, chloroplastic [Cucumis melo]2.0e-12591.05Show/hide
Query:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT
        MAM SLLSPSA+I RPHS RFSQSSLSNGFSI PIRSTLRVFCSA        NKKPSYLAS VNRREIMLGIGFTAFS QEV SNALAESVVVAEDYRT
Subjt:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT

Query:  YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE
        YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLI+CRSSKGIYYIE
Subjt:  YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE

Query:  YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        YTLQNPGESR+HLYSAIGM+SNGWYNRLYTITGQYADEES +YSSKIEKVVNSF+FI
Subjt:  YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

XP_031742571.1 psbP domain-containing protein 3, chloroplastic isoform X2 [Cucumis sativus]4.6e-109100Show/hide
Query:  SYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRME
        SYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRME
Subjt:  SYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRME

Query:  SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
Subjt:  SFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

XP_038885576.1 psbP domain-containing protein 3, chloroplastic [Benincasa hispida]1.1e-11888.24Show/hide
Query:  MASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYT
        MASL SPSAVI RP S RFSQSS SNG   IPIRS LRVFCS  GN+I+ SN++  Y ASGVNRREIMLGIG TAFSFQEV SNALAESV+VAEDYRTYT
Subjt:  MASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYT

Query:  DEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYT
        DEANKF L IPQDWQVGNGEPNGFKSVTAFFPQETS+SNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLI+CRSSKGIYYIEYT
Subjt:  DEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYT

Query:  LQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        LQNPGESRKHLYSAIGM+SNGWYNRLYTITGQYADEESE+YSSKIEKVVNSF FI
Subjt:  LQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

TrEMBL top hitse value%identityAlignment
A0A0A0KDI5 PsbP domain-containing protein2.9e-141100Show/hide
Query:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT
        MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT
Subjt:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT

Query:  YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE
        YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE
Subjt:  YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE

Query:  YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
Subjt:  YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

A0A1S3CMG5 psbP domain-containing protein 3, chloroplastic9.8e-12691.05Show/hide
Query:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT
        MAM SLLSPSA+I RPHS RFSQSSLSNGFSI PIRSTLRVFCSA        NKKPSYLAS VNRREIMLGIGFTAFS QEV SNALAESVVVAEDYRT
Subjt:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRT

Query:  YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE
        YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLI+CRSSKGIYYIE
Subjt:  YTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIE

Query:  YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        YTLQNPGESR+HLYSAIGM+SNGWYNRLYTITGQYADEES +YSSKIEKVVNSF+FI
Subjt:  YTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

A0A6J1DNN6 psbP domain-containing protein 3, chloroplastic isoform X22.4e-10881.01Show/hide
Query:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSI-IPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYR
        MA     SPSAVI RP   RF +SSLSNG +I I  +S   V CS   N+I  S+ +  Y ASGVNRREIMLGI  + FSFQ V SN+LAESVVVAED+R
Subjt:  MAMASLLSPSAVILRPHSLRFSQSSLSNGFSI-IPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYR

Query:  TYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYI
        TYTDEANKF LVIPQDW VGNGEPNGFKSVTAF+PQETS+SNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYI
Subjt:  TYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYI

Query:  EYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        EYTLQNPGESRKHLYSAIGM+SNGWYNRLYTITGQYADEESE+YSSKIEKVVNSF+FI
Subjt:  EYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

A0A6J1G3W9 psbP domain-containing protein 3, chloroplastic8.4e-10981.57Show/hide
Query:  MASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYT
        MASL SPSAVI RP S RF+ SSLSNG + IPIR+ LRVFCS  G +I   ++KP    SGVNRREI+LG+G TAFSFQEV S ALAES VVAEDYRTYT
Subjt:  MASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYT

Query:  DEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYT
        DEANKF LVIPQDWQVGNGEPNGFK VTAFFP+ET +SNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYT
Subjt:  DEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYT

Query:  LQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        LQNPGE R HLYSAIGM+SNGWYNRLYT+TGQY DE+SE +SS+I+KVVNSF FI
Subjt:  LQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

A0A6J1KI01 psbP domain-containing protein 3, chloroplastic5.4e-10881.18Show/hide
Query:  MASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYT
        MASL SPSAVI RP S RF+ SSLSNG + IPIR+ LRVFCS  G +I   ++KP    SGVNRREI LG+G TAFSFQEV S ALAE+ VVAEDYRTYT
Subjt:  MASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYT

Query:  DEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYT
        DEANKF LVIPQDWQVGNGEPNGFK VTAFFP+ET +SNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYT
Subjt:  DEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYT

Query:  LQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI
        LQNPGE R HLYSAIGM+SNGWYNRLYT+TGQY DE+SE +SS+I+KVVNSF FI
Subjt:  LQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI

SwissProt top hitse value%identityAlignment
Q9S720 PsbP domain-containing protein 3, chloroplastic9.0e-7664.68Show/hide
Query:  SANGNSIHTSNKKPSYLAS----GVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETST
        SA  + + +SN++   ++S    G+ RR++ML I  + F      S A AE+   +E +R YTDE NKF + IPQDWQVG  EPNGFKS+TAF+PQETST
Subjt:  SANGNSIHTSNKKPSYLAS----GVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETST

Query:  SNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEE
        SNVS+ I+GLGPD+TRMESFGKVE FA+TLVSGLDRSW++P GV AKLID R+SKG YYIEYTLQNPGE+RKHLYSAIGM++NGWYNRLYT+TGQ+ DEE
Subjt:  SNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEE

Query:  SESYSSKIEKVVNSFAFI
        S   SSKI+K V SF FI
Subjt:  SESYSSKIEKVVNSFAFI

Arabidopsis top hitse value%identityAlignment
AT1G76450.1 Photosystem II reaction center PsbP family protein6.4e-7764.68Show/hide
Query:  SANGNSIHTSNKKPSYLAS----GVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETST
        SA  + + +SN++   ++S    G+ RR++ML I  + F      S A AE+   +E +R YTDE NKF + IPQDWQVG  EPNGFKS+TAF+PQETST
Subjt:  SANGNSIHTSNKKPSYLAS----GVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETST

Query:  SNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEE
        SNVS+ I+GLGPD+TRMESFGKVE FA+TLVSGLDRSW++P GV AKLID R+SKG YYIEYTLQNPGE+RKHLYSAIGM++NGWYNRLYT+TGQ+ DEE
Subjt:  SNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEE

Query:  SESYSSKIEKVVNSFAFI
        S   SSKI+K V SF FI
Subjt:  SESYSSKIEKVVNSFAFI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGAGGACAAATGGAGCGCATTGTACAAAAAAATCCATCTCTAATCTTTGAGTGGACTACAAGCATGGCGATGGCGTCCCTTCTTTCACCCAGCGCTGTAATCCT
ACGCCCTCACTCATTGCGCTTCTCACAATCATCACTCTCCAATGGATTCTCCATTATTCCTATCCGCTCAACACTTCGTGTTTTCTGCTCTGCCAATGGCAACAGCATCC
ACACTTCTAACAAAAAACCCAGTTATTTGGCGAGCGGAGTCAACAGACGAGAAATTATGCTAGGGATTGGATTCACCGCATTTTCATTTCAAGAAGTTGGTTCTAATGCC
CTAGCTGAGAGTGTTGTGGTTGCTGAAGATTATCGGACGTATACAGACGAAGCGAATAAGTTCAGCTTGGTGATTCCTCAAGATTGGCAAGTGGGTAATGGTGAACCGAA
TGGATTCAAGTCGGTTACGGCATTTTTTCCTCAAGAAACTTCAACTTCCAATGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTACACGAGGATGGAATCCTTTGGCA
AGGTTGAGGAATTTGCTGATACATTGGTGAGTGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCGAAACTTATCGACTGTAGATCATCTAAAGGGATATAT
TACATAGAGTACACACTGCAGAATCCAGGTGAAAGCCGCAAACATTTATACTCAGCAATTGGGATGTCATCCAATGGCTGGTACAATAGACTTTACACCATAACAGGACA
GTATGCAGATGAAGAATCGGAGAGCTATAGCTCCAAAATCGAGAAGGTTGTCAATTCCTTCGCTTTCATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGAGGACAAATGGAGCGCATTGTACAAAAAAATCCATCTCTAATCTTTGAGTGGACTACAAGCATGGCGATGGCGTCCCTTCTTTCACCCAGCGCTGTAATCCT
ACGCCCTCACTCATTGCGCTTCTCACAATCATCACTCTCCAATGGATTCTCCATTATTCCTATCCGCTCAACACTTCGTGTTTTCTGCTCTGCCAATGGCAACAGCATCC
ACACTTCTAACAAAAAACCCAGTTATTTGGCGAGCGGAGTCAACAGACGAGAAATTATGCTAGGGATTGGATTCACCGCATTTTCATTTCAAGAAGTTGGTTCTAATGCC
CTAGCTGAGAGTGTTGTGGTTGCTGAAGATTATCGGACGTATACAGACGAAGCGAATAAGTTCAGCTTGGTGATTCCTCAAGATTGGCAAGTGGGTAATGGTGAACCGAA
TGGATTCAAGTCGGTTACGGCATTTTTTCCTCAAGAAACTTCAACTTCCAATGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTACACGAGGATGGAATCCTTTGGCA
AGGTTGAGGAATTTGCTGATACATTGGTGAGTGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCGAAACTTATCGACTGTAGATCATCTAAAGGGATATAT
TACATAGAGTACACACTGCAGAATCCAGGTGAAAGCCGCAAACATTTATACTCAGCAATTGGGATGTCATCCAATGGCTGGTACAATAGACTTTACACCATAACAGGACA
GTATGCAGATGAAGAATCGGAGAGCTATAGCTCCAAAATCGAGAAGGTTGTCAATTCCTTCGCTTTCATTTGATGATTGCCACAGAATTGGCCTCCACCACACTATCATT
ATGGTTAAATATTTTCCACATCTCTCTCTAATTATAGTTCTCTTTTGTTATTATTATTATTATTTTTTGTAATGAGTTCTAAACATAATATTGAATTGTCTTTCATGCAT
CTATATTTTTACATTTTCGTGAGGATGAATTCACATTTCTATTAATT
Protein sequenceShow/hide protein sequence
MGRGQMERIVQKNPSLIFEWTTSMAMASLLSPSAVILRPHSLRFSQSSLSNGFSIIPIRSTLRVFCSANGNSIHTSNKKPSYLASGVNRREIMLGIGFTAFSFQEVGSNA
LAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIY
YIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEKVVNSFAFI