; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004043 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004043
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPsbP domain-containing protein
Genome locationchr6:670010..675310
RNA-Seq ExpressionLag0004043
SyntenyLag0004043
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009654 - photosystem II oxygen evolving complex (cellular component)
GO:0019898 - extrinsic component of membrane (cellular component)
GO:0005509 - calcium ion binding (molecular function)
InterPro domainsIPR002683 - PsbP, C-terminal
IPR016123 - Mog1/PsbP, alpha/beta/alpha sandwich


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030019.1 PsbP domain-containing protein 3, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-10677.31Show/hide
Query:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA
        MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPIR+RLRVFCS KN DI +++PC W +GV+RRE++LGMGL A
Subjt:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA

Query:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
        FSFQEVV  ALAES VVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
Subjt:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW

Query:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD
        KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAIGMA+NGWYNRLYT+TGQ  D
Subjt:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD

XP_022155884.1 psbP domain-containing protein 3, chloroplastic isoform X1 [Momordica charantia]3.2e-11173.09Show/hide
Query:  MASL--PSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLR--VFCSCKNTDISNRQPCYWANGVSRREVMLGM
        MASL  PSPSAVIQ PR WRF ESSL N                            GIAI IRS+ +  V CSC N DIS+ Q CYWA+GV+RRE+MLG+
Subjt:  MASL--PSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLR--VFCSCKNTDISNRQPCYWANGVSRREVMLGM

Query:  GLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGL
         L+ FSFQ VV N+LAESVVVAED+RTYTDEANKFRLVIPQDW VGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGL
Subjt:  GLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGL

Query:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRNGLLRPRRKMKNRRTIVRKFRRLSIPSLSFND
        DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMA+NGWYNRLYTITGQ                MKNRRTIV K RR SIPSLSF++
Subjt:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRNGLLRPRRKMKNRRTIVRKFRRLSIPSLSFND

Query:  V
        +
Subjt:  V

XP_022946502.1 psbP domain-containing protein 3, chloroplastic [Cucurbita moschata]3.1e-10677.31Show/hide
Query:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA
        MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPIR+RLRVFCS KN DI +++PC W +GV+RRE++LGMGL A
Subjt:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA

Query:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
        FSFQEVV  ALAES VVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
Subjt:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW

Query:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD
        KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAIGMA+NGWYNRLYT+TGQ  D
Subjt:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD

XP_023546046.1 psbP domain-containing protein 3, chloroplastic [Cucurbita pepo subsp. pepo]1.5e-10576.92Show/hide
Query:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA
        MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPIR+RLRVFCS KN DI +++PC W +GV+RRE++LGMGL A
Subjt:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA

Query:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
        FSFQEVV  ALAES VVAEDYRTY DEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
Subjt:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW

Query:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD
        KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAIGMA+NGWYNRLYT+TGQ  D
Subjt:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD

XP_038885576.1 psbP domain-containing protein 3, chloroplastic [Benincasa hispida]3.2e-11178.49Show/hide
Query:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA
        MASLPSPSAVIQ PRSWRFS+SS  NG P                            IPIRS+LRVFCS  N +ISN+Q CYWA+GV+RRE+MLG+GL A
Subjt:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA

Query:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
        FSFQEVV NALAESV+VAEDYRTYTDEANKFRL IPQDWQVGNGEPNGFKSVTAF+PQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
Subjt:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW

Query:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRN
        KRPPGVAAKLI+CRSSKGIYYIEYTLQNPGESRKHLYSAIGMA+NGWYNRLYTITGQ  D    N
Subjt:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRN

TrEMBL top hitse value%identityAlignment
A0A0A0KDI5 PsbP domain-containing protein1.5e-10175.19Show/hide
Query:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDI--SNRQPCYWANGVSRREVMLGMGL
        MASL SPSAVI  P S RFS+SSL NG                              IPIRS LRVFCS     I  SN++P Y A+GV+RRE+MLG+G 
Subjt:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDI--SNRQPCYWANGVSRREVMLGMGL

Query:  AAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDR
         AFSFQEV  NALAESVVVAEDYRTYTDEANKF LVIPQDWQVGNGEPNGFKSVTAF+PQETS+SNVSVVISGLGPD+TRMESFGKVEEFADTLVSGLDR
Subjt:  AAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDR

Query:  SWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD
        SWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGM++NGWYNRLYTITGQ  D
Subjt:  SWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD

A0A6J1DNN6 psbP domain-containing protein 3, chloroplastic isoform X21.7e-10575.84Show/hide
Query:  MASL--PSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLR--VFCSCKNTDISNRQPCYWANGVSRREVMLGM
        MASL  PSPSAVIQ PR WRF ESSL N                            GIAI IRS+ +  V CSC N DIS+ Q CYWA+GV+RRE+MLG+
Subjt:  MASL--PSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLR--VFCSCKNTDISNRQPCYWANGVSRREVMLGM

Query:  GLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGL
         L+ FSFQ VV N+LAESVVVAED+RTYTDEANKFRLVIPQDW VGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGL
Subjt:  GLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGL

Query:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRN
        DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMA+NGWYNRLYTITGQ  D    N
Subjt:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRN

A0A6J1DRN2 psbP domain-containing protein 3, chloroplastic isoform X11.5e-11173.09Show/hide
Query:  MASL--PSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLR--VFCSCKNTDISNRQPCYWANGVSRREVMLGM
        MASL  PSPSAVIQ PR WRF ESSL N                            GIAI IRS+ +  V CSC N DIS+ Q CYWA+GV+RRE+MLG+
Subjt:  MASL--PSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLR--VFCSCKNTDISNRQPCYWANGVSRREVMLGM

Query:  GLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGL
         L+ FSFQ VV N+LAESVVVAED+RTYTDEANKFRLVIPQDW VGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGL
Subjt:  GLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGL

Query:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRNGLLRPRRKMKNRRTIVRKFRRLSIPSLSFND
        DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMA+NGWYNRLYTITGQ                MKNRRTIV K RR SIPSLSF++
Subjt:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRNGLLRPRRKMKNRRTIVRKFRRLSIPSLSFND

Query:  V
        +
Subjt:  V

A0A6J1G3W9 psbP domain-containing protein 3, chloroplastic1.5e-10677.31Show/hide
Query:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA
        MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPIR+RLRVFCS KN DI +++PC W +GV+RRE++LGMGL A
Subjt:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA

Query:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
        FSFQEVV  ALAES VVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
Subjt:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW

Query:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD
        KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAIGMA+NGWYNRLYT+TGQ  D
Subjt:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD

A0A6J1KI01 psbP domain-containing protein 3, chloroplastic9.7e-10676.92Show/hide
Query:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA
        MASLPSPSAVIQ PRSWRF+ SSL N                            GIAIPIR+RLRVFCS KN DI +++PC W +GV+RRE+ LGMGL A
Subjt:  MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAA

Query:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
        FSFQEVV  ALAE+ VVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFK VTAF+P+ET SSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW
Subjt:  FSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSW

Query:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD
        KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAIGMA+NGWYNRLYT+TGQ  D
Subjt:  KRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD

SwissProt top hitse value%identityAlignment
Q9S720 PsbP domain-containing protein 3, chloroplastic7.0e-6970.86Show/hide
Query:  GVSRREVMLGMGLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKV
        G+ RR+VML +  + F     +  A AE+   +E +R YTDE NKF + IPQDWQVG  EPNGFKS+TAFYPQETS+SNVS+ I+GLGPDFTRMESFGKV
Subjt:  GVSRREVMLGMGLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKV

Query:  EEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD
        E FA+TLVSGLDRSW++P GV AKLID R+SKG YYIEYTLQNPGE+RKHLYSAIGMA NGWYNRLYT+TGQ  D
Subjt:  EEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD

Arabidopsis top hitse value%identityAlignment
AT1G76450.1 Photosystem II reaction center PsbP family protein5.0e-7070.86Show/hide
Query:  GVSRREVMLGMGLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKV
        G+ RR+VML +  + F     +  A AE+   +E +R YTDE NKF + IPQDWQVG  EPNGFKS+TAFYPQETS+SNVS+ I+GLGPDFTRMESFGKV
Subjt:  GVSRREVMLGMGLAAFSFQEVVPNALAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKV

Query:  EEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD
        E FA+TLVSGLDRSW++P GV AKLID R+SKG YYIEYTLQNPGE+RKHLYSAIGMA NGWYNRLYT+TGQ  D
Subjt:  EEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCCTTCCTTCACCGAGCGCTGTAATCCAACCTCCTCGCTCATGGCGCTTCTCAGAATCATCACTCTGCAATGGTAATCCTTCTACTCTACTCATCTTCCTCGC
CTTTCACTTCATGCGCCATTTTAAATCCTATCTTCTTCCTTCTTCAATTTCAGGAATCGCCATTCCCATCCGCTCCAGACTCCGTGTCTTCTGCTCTTGCAAGAACACCG
ACATTTCCAATCGACAGCCCTGTTATTGGGCGAACGGAGTTAGTAGACGAGAAGTTATGCTAGGCATGGGATTGGCCGCGTTTTCTTTTCAAGAAGTTGTTCCTAATGCC
CTAGCTGAGAGTGTTGTTGTTGCTGAGGATTATCGGACATACACAGACGAAGCAAATAAGTTCAGATTGGTGATTCCTCAAGATTGGCAAGTGGGCAATGGTGAACCGAA
TGGATTCAAGTCGGTTACAGCCTTTTATCCTCAAGAAACTTCAAGTTCCAATGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGAATGGAATCCTTTGGCA
AGGTTGAAGAATTTGCCGATACCCTGGTAAGCGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCGAAACTTATAGACTGTAGATCATCTAAAGGAATATAT
TACATTGAGTACACGCTGCAGAATCCAGGTGAAAGCCGCAAACATTTATACTCTGCAATTGGGATGGCAAACAATGGCTGGTACAACAGACTTTACACCATAACAGGACA
GGCTCCAGACGCACTAAGGCGCAATGGCCTCCTGAGGCCTAGGCGCAAGATGAAGAATCGGAGAACTATAGTTCGAAAGTTCAGAAGGTTGTCAATTCCTTCACTTTCAT
TTAATGATGTCACAGAACTGGCTTCCACTACATTTGCTCATGGGGTTAAAGTTTTTCCACTTCAGCTTCCAATTATGGCAATTTTGGACCACCCCGATATACAAGGAGCT
GACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTAC
TCGGCCGAGGCCCATGGCCGAGGCCGAGCATATGGTCGGCCGAGGCCGACCCTCGGTCCGCTCGTGCGGGCTGAGTCCGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGG
TTGCCCCGGTTTTGCCTGGTTTGACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCCTTCCTTCACCGAGCGCTGTAATCCAACCTCCTCGCTCATGGCGCTTCTCAGAATCATCACTCTGCAATGGTAATCCTTCTACTCTACTCATCTTCCTCGC
CTTTCACTTCATGCGCCATTTTAAATCCTATCTTCTTCCTTCTTCAATTTCAGGAATCGCCATTCCCATCCGCTCCAGACTCCGTGTCTTCTGCTCTTGCAAGAACACCG
ACATTTCCAATCGACAGCCCTGTTATTGGGCGAACGGAGTTAGTAGACGAGAAGTTATGCTAGGCATGGGATTGGCCGCGTTTTCTTTTCAAGAAGTTGTTCCTAATGCC
CTAGCTGAGAGTGTTGTTGTTGCTGAGGATTATCGGACATACACAGACGAAGCAAATAAGTTCAGATTGGTGATTCCTCAAGATTGGCAAGTGGGCAATGGTGAACCGAA
TGGATTCAAGTCGGTTACAGCCTTTTATCCTCAAGAAACTTCAAGTTCCAATGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGAATGGAATCCTTTGGCA
AGGTTGAAGAATTTGCCGATACCCTGGTAAGCGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCGAAACTTATAGACTGTAGATCATCTAAAGGAATATAT
TACATTGAGTACACGCTGCAGAATCCAGGTGAAAGCCGCAAACATTTATACTCTGCAATTGGGATGGCAAACAATGGCTGGTACAACAGACTTTACACCATAACAGGACA
GGCTCCAGACGCACTAAGGCGCAATGGCCTCCTGAGGCCTAGGCGCAAGATGAAGAATCGGAGAACTATAGTTCGAAAGTTCAGAAGGTTGTCAATTCCTTCACTTTCAT
TTAATGATGTCACAGAACTGGCTTCCACTACATTTGCTCATGGGGTTAAAGTTTTTCCACTTCAGCTTCCAATTATGGCAATTTTGGACCACCCCGATATACAAGGAGCT
GACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTAC
TCGGCCGAGGCCCATGGCCGAGGCCGAGCATATGGTCGGCCGAGGCCGACCCTCGGTCCGCTCGTGCGGGCTGAGTCCGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGG
TTGCCCCGGTTTTGCCTGGTTTGACCTAA
Protein sequenceShow/hide protein sequence
MASLPSPSAVIQPPRSWRFSESSLCNGNPSTLLIFLAFHFMRHFKSYLLPSSISGIAIPIRSRLRVFCSCKNTDISNRQPCYWANGVSRREVMLGMGLAAFSFQEVVPNA
LAESVVVAEDYRTYTDEANKFRLVIPQDWQVGNGEPNGFKSVTAFYPQETSSSNVSVVISGLGPDFTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIY
YIEYTLQNPGESRKHLYSAIGMANNGWYNRLYTITGQAPDALRRNGLLRPRRKMKNRRTIVRKFRRLSIPSLSFNDVTELASTTFAHGVKVFPLQLPIMAILDHPDIQGA
DEDNRGEIGLKDGPRRQNRQMGRAKTEGVGFSARPPTRPRPMAEAEHMVGRGRPSVRSCGLSPFGLVWSPPPLVAPVLPGLT