CuGenDBv2

Moc11g26330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g26330
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Chlorophyll a-b binding protein, chloroplastic
Genome location: chr11:19397699..19412767
RNA-Seq Expression: Moc11g26330
Synteny: Moc11g26330
Gene Ontology terms:
GO:0009416 - response to light stimulus (biological process)
GO:0009768 - photosynthesis, light harvesting in photosystem I (biological process)
GO:0018298 - protein-chromophore linkage (biological process)
GO:0009522 - photosystem I (cellular component)
GO:0009523 - photosystem II (cellular component)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016168 - chlorophyll binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001344 - Chlorophyll A-B binding protein, plant and chromista
IPR022796 - Chlorophyll A-B binding protein
IPR023329 - Chlorophyll a/b binding domain superfamily
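The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the Python snippet below splits such a string into its parts and computes the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumption (not stated on the page): coordinates are 1-based and
    inclusive, so the span length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr11:19397699..19412767"))
```

Under that assumption the gene region listed here spans 15,069 bp.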


Homology
GenBank top hits | e value | %identity | Alignment
KAG6431911.1 hypothetical protein SASPL_103483 [Salvia splendens] | 1.7e-173 | 75.84
Query:  DGGED--ENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQFGFGLGAGCGVGLGFGYGVGRGIAQDDRRRYSNVGDLLHGQG
        +GG D  E GLLWKLP LKS +LGKLGPAFG+G GCG GF  GL+GGAGFGPGIPGLQ GFG GAGCGVG+GFGYGVGRGIA DD R+YSNV    H   
Subjt:  DGGED--ENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQFGFGLGAGCGVGLGFGYGVGRGIAQDDRRRYSNVGDLLHGQG

Query:  HQSIFSHLQFPPAMASLAAST---------------AAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKW
         +++ +  +    +  L  +T                A+SLGVSEML N LS  GGS R+APSASS  T KTVALF KK AA    S AV+P ++ELAKW
Subjt:  HQSIFSHLQFPPAMASLAAST---------------AAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKW

Query:  YGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNY
        YGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGL KKPEDF+KYQA+ELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNY
Subjt:  YGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNY

Query:  FGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF
        FG NIPINL+VAV+AEVVL+GGAEYYRIINGL  EDKLHPGGPFDPLGLA DPDQAAILKVKEIKNGRLAMF+MLGF+ QAYVTG+GPVENLA HLSDPF
Subjt:  FGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF

Query:  GNNLLTVISGNVERVPTL
        GNNLLTVI+G+ ERVPTL
Subjt:  GNNLLTVISGNVERVPTL

KAG6435208.1 hypothetical protein SASPL_100078 [Salvia splendens] | 3.8e-173 | 76.46
Query:  DENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQFGFGLGAGCGVGLGFGYGVGRGIAQDDRRRYSNVGDLLHGQGHQSIFS
        +E GLLWKLP LKS +LGKLGPAFG+G GCG GF  GLVGGAGFGPGIPGLQ GFGLGAGCGVG+GFGYGVGRGIA DD R+YSNV    H    +++ +
Subjt:  DENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQFGFGLGAGCGVGLGFGYGVGRGIAQDDRRRYSNVGDLLHGQGHQSIFS

Query:  HLQFPPAMASLAAST---------------AAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRR
          +    +  L  +T                A+SLGVS+ML N LS  GGS R+APSASS  T KTVALF KK AA    +AAV+P ++ELAKWYGPDRR
Subjt:  HLQFPPAMASLAAST---------------AAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRR

Query:  IFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIP
        IFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGL KKPE+F+KYQA+ELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFG NIP
Subjt:  IFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIP

Query:  INLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLT
        INL+VAVIAEVVL+GGAEYYRIINGL  EDKLHPGGPFDPLGLA DPDQAAILKVKEIKNGRLAMF+MLGF+ QAYVTG+GPVENLA HLSDPFGNNLLT
Subjt:  INLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLT

Query:  VISGNVERVPTL
        VI+G  ERVPTL
Subjt:  VISGNVERVPTL

KZM98589.1 hypothetical protein DCAR_014049 [Daucus carota subsp. sativus] | 6.5e-165 | 67.9
Query:  RDGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGVGLVGG------------------------------AGFG-PGIPGLQFGFGLGAGCGVG
        RD  E E GLLWKLP + S  LGKLGPAFG+GVGCGVGFGVGL+GG                               GFG   +     GFGLGAGCG+G
Subjt:  RDGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGVGLVGG------------------------------AGFG-PGIPGLQFGFGLGAGCGVG

Query:  LGFGYGVGRGIAQDDRRRYSNVGDLLHGQGHQSIFSH-----------------------LQFP-----PAMASLAASTAAASLGVSEMLRNPLSFGGGS
        +GFGYGVGRG+A D+ R+Y+NVG + H  GH  I                          L  P       MASLAASTAAAS+GVSEML N L+F   S
Subjt:  LGFGYGVGRGIAQDDRRRYSNVGDLLHGQGHQSIFSH-----------------------LQFP-----PAMASLAASTAAASLGVSEMLRNPLSFGGGS

Query:  PRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHAR
         R+APSASS  T KTVALFGKK A   K +   +P +DELAKWYGP+RRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGL KKPEDF+KYQA+ELIHAR
Subjt:  PRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHAR

Query:  WAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAA
        WAMLGAAGFIIPEAFNK+GANCGPEAVWFKTGALLLDGNTLNYFG NIPINL++AV+AEVVL+GGAEYYRI NGL+ EDKLHPGGPFDPLGLADDPDQAA
Subjt:  WAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAA

Query:  ILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        +LKVKEIKNGRLAMFAMLGF+ QAYVTGEGPVENL+ HLSDPFGNNLLTVI G  ER PTL
Subjt:  ILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL

XP_022156006.1 chlorophyll a-b binding protein CP26, chloroplastic [Momordica charantia] | 2.1e-163 | 100
Query:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVP
        MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVP
Subjt:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVP

Query:  GDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRI
        GDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRI
Subjt:  GDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRI

Query:  INGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        INGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
Subjt:  INGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL

XP_038889943.1 chlorophyll a-b binding protein CP26, chloroplastic [Benincasa hispida] | 9.4e-156 | 95.88
Query:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPS-AAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV
        MASLAASTAAASLGVSEMLRNPLSF G S RSAPSASS  T KTVALFGKKPAA +KP  +AVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV
Subjt:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPS-AAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV

Query:  PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR
        PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR
Subjt:  PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR

Query:  IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGN ERVPTL
Subjt:  IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL

TrEMBL top hits | e value | %identity | Alignment
A0A0A0KXR5 Chlorophyll a-b binding protein, chloroplastic | 1.4e-152 | 94.5
Query:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPS-AAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV
        MASLAASTAAASLGVSEMLRNPLSF   S RSAPS S+  T KTVALFGKKPAA AKP  +A SPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV
Subjt:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPS-AAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV

Query:  PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR
        PGDYGYDPFGLSKKPEDF+KYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR
Subjt:  PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR

Query:  IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGN ERVPTL
Subjt:  IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL

A0A165XW56 Chlorophyll a-b binding protein, chloroplastic | 3.1e-165 | 67.9
Query:  RDGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGVGLVGG------------------------------AGFG-PGIPGLQFGFGLGAGCGVG
        RD  E E GLLWKLP + S  LGKLGPAFG+GVGCGVGFGVGL+GG                               GFG   +     GFGLGAGCG+G
Subjt:  RDGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGVGLVGG------------------------------AGFG-PGIPGLQFGFGLGAGCGVG

Query:  LGFGYGVGRGIAQDDRRRYSNVGDLLHGQGHQSIFSH-----------------------LQFP-----PAMASLAASTAAASLGVSEMLRNPLSFGGGS
        +GFGYGVGRG+A D+ R+Y+NVG + H  GH  I                          L  P       MASLAASTAAAS+GVSEML N L+F   S
Subjt:  LGFGYGVGRGIAQDDRRRYSNVGDLLHGQGHQSIFSH-----------------------LQFP-----PAMASLAASTAAASLGVSEMLRNPLSFGGGS

Query:  PRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHAR
         R+APSASS  T KTVALFGKK A   K +   +P +DELAKWYGP+RRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGL KKPEDF+KYQA+ELIHAR
Subjt:  PRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHAR

Query:  WAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAA
        WAMLGAAGFIIPEAFNK+GANCGPEAVWFKTGALLLDGNTLNYFG NIPINL++AV+AEVVL+GGAEYYRI NGL+ EDKLHPGGPFDPLGLADDPDQAA
Subjt:  WAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAA

Query:  ILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        +LKVKEIKNGRLAMFAMLGF+ QAYVTGEGPVENL+ HLSDPFGNNLLTVI G  ER PTL
Subjt:  ILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL

A0A6J1DTI7 Chlorophyll a-b binding protein, chloroplastic | 1.0e-163 | 100
Query:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVP
        MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVP
Subjt:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVP

Query:  GDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRI
        GDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRI
Subjt:  GDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRI

Query:  INGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        INGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
Subjt:  INGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL

A0A6J1EWI5 Chlorophyll a-b binding protein, chloroplastic | 3.0e-152 | 94.16
Query:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPS-AAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV
        MASLAASTAAASLGVSEMLRNPLSF G S RSA SASS VT K VALFGKKPA   KP  +AVSP NDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV
Subjt:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPS-AAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV

Query:  PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR
        PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAV AEVVLVGGAEYYR
Subjt:  PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR

Query:  IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF NNLLTVISGN ERVPTL
Subjt:  IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL

A0A6J1ID20 Chlorophyll a-b binding protein, chloroplastic | 8.0e-153 | 94.16
Query:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPS-AAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV
        MASLAASTAAASLGVSEMLRNPLS  G S RSA SASS VT K VA+FGKKPAA  KP  +AVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV
Subjt:  MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPS-AAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV

Query:  PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR
        PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAV+AEVVLVGGAEYYR
Subjt:  PGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYR

Query:  IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF NNLLTVISGN ERVPTL
Subjt:  IINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL

SwissProt top hits | e value | %identity | Alignment
P07369 Chlorophyll a-b binding protein 3C, chloroplastic | 1.6e-57 | 48.88
Query:  LGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSK
        +  S M  +  +F G + + +PS+S       V +  +K A KAKP+++ SP       WYGPDR  +L        E P YL GE PGDYG+D  GLS 
Subjt:  LGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSK

Query:  KPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRIINGLNFE--
         PE F+K +  E+IH RWAMLGA G + PE   + G   G EAVWFK G+ +     L+Y GN   +    I+A+ A +VVL+G  E YRI  G   E  
Subjt:  KPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRIINGLNFE--

Query:  DKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN
        D L+PGG FDPLGLADDP+  A LKVKEIKNGRLAMF+M GF+ QA VTG+GP+ENLA HL+DP  NN
Subjt:  DKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN

P12330 Chlorophyll a-b binding protein 1, chloroplastic | 9.4e-58 | 52
Query:  RSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARW
        R+APS SS       ALFG+      K +A   P     + WYG DR ++L  G L   E P YL GE PGDYG+D  GLS  PE F+K +  E+IH+RW
Subjt:  RSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARW

Query:  AMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRIINGLNFE--DKLHPGGPFDPLGLADDP
        AMLGA G + PE   + G   G EAVWFK G+ +     L+Y GN   I    I+A+ A +VVL+G  E YRI  G   E  D L+PGG FDPLGLADDP
Subjt:  AMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRIINGLNFE--DKLHPGGPFDPLGLADDP

Query:  DQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN
        +  A LKVKEIKNGRLAMF+M GF+ QA VTG+GP+ENLA HL+DP  NN
Subjt:  DQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN

P12331 Chlorophyll a-b binding protein 2, chloroplastic | 4.7e-57 | 54.55
Query:  KKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGA
        +K AAK KP+A+ SP       WYG DR ++L  G L   E P YL GE PGDYG+D  GLS  PE F+K +  E+IH+RWAMLGA G + PE   + G 
Subjt:  KKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGA

Query:  NCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRIINGLNFE--DKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMF
          G EAVWFK G+ +     L+Y GN   I    I+A+ A +VVL+G  E YRI  G   E  D L+PGG FDPLGLADDP+  A LKVKEIKNGRLAMF
Subjt:  NCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRIINGLNFE--DKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMF

Query:  AMLGFYFQAYVTGEGPVENLAKHLSDPFGNN
        +M GF+ QA VTG+GP+ENLA HL+DP  NN
Subjt:  AMLGFYFQAYVTGEGPVENLAKHLSDPFGNN

P27517 Chlorophyll a-b binding protein of LHCII type I, chloroplastic | 7.9e-57 | 54.03
Query:  KWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTL
        ++YGPDR  FL  G    ++ PEYL GE PGDYG+D  GLS  P+ F++Y+  ELIHARWA+LGA G + PE  +++      E VWFK GA +     L
Subjt:  KWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTL

Query:  NYFGNNIPI---NLIVAVIAEVVLVGGAEYYRIING----LNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVEN
        NY GN   I   ++I  +  +VVL+G AE YR   G    L+  D L+PGGPFDPLGLADDPD  A LKVKEIKNGRLAMF+ LGF+ QA VTG+GPV+N
Subjt:  NYFGNNIPI---NLIVAVIAEVVLVGGAEYYRIING----LNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVEN

Query:  LAKHLSDPFGN
        L  HL+DP  N
Subjt:  LAKHLSDPFGN

Q9XF89 Chlorophyll a-b binding protein CP26, chloroplastic | 3.6e-134 | 84.29
Query:  ASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGL
        ASLGVSEML  PL+F   S  SAP ASS  T KTVALF KK  A AK S AVS  +DELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV GDYGYDPFGL
Subjt:  ASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGL

Query:  SKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKL
         KKPE+F+KYQAFELIHARWAMLGAAGFIIPEA NK+GANCGPEAVWFKTGALLLDGNTLNYFG NIPINL++AV+AEVVL+GGAEYYRI NGL+FEDKL
Subjt:  SKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKL

Query:  HPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        HPGGPFDPLGLA DP+Q A+LKVKEIKNGRLAMFAMLGF+ QAYVTGEGPVENLAKHLSDPFGNNLLTVI+G  ER PTL
Subjt:  HPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL

Arabidopsis top hits | e value | %identity | Alignment
AT1G29910.1 chlorophyll A/B binding protein 3 | 4.2e-53 | 45.35
Query:  LGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSK
        +  S M  +  +F G +   +P+AS  +    V +  +K  AK K  +         + WYG DR  +L        E P YL GE PGDYG+D  GLS 
Subjt:  LGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSK

Query:  KPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRII-NGL--NF
         PE F++ +  E+IH+RWAMLGA G + PE   + G   G EAVWFK G+ +     L+Y GN   +    I+A+ A +V+L+G  E YR+  NG     
Subjt:  KPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRII-NGL--NF

Query:  EDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN
        ED L+PGG FDPLGLA DP+  A LKVKE+KNGRLAMF+M GF+ QA VTG+GP+ENLA HL+DP  NN
Subjt:  EDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN

AT1G29920.1 chlorophyll A/B-binding protein 2 | 4.2e-53 | 45.35
Query:  LGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSK
        +  S M  +  +F G +   +P+AS  +    V +  +K  AK K  +         + WYG DR  +L        E P YL GE PGDYG+D  GLS 
Subjt:  LGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSK

Query:  KPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRII-NGL--NF
         PE F++ +  E+IH+RWAMLGA G + PE   + G   G EAVWFK G+ +     L+Y GN   +    I+A+ A +V+L+G  E YR+  NG     
Subjt:  KPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRII-NGL--NF

Query:  EDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN
        ED L+PGG FDPLGLA DP+  A LKVKE+KNGRLAMF+M GF+ QA VTG+GP+ENLA HL+DP  NN
Subjt:  EDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN

AT1G29930.1 chlorophyll A/B binding protein 1 | 2.5e-53 | 45.35
Query:  LGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSK
        +  S M  +  +F G + + +P+AS  +    V +  +K  AK K  +         + WYG DR  +L        E P YL GE PGDYG+D  GLS 
Subjt:  LGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSK

Query:  KPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRII-NGL--NF
         PE F++ +  E+IH+RWAMLGA G + PE   + G   G EAVWFK G+ +     L+Y GN   +    I+A+ A +V+L+G  E YR+  NG     
Subjt:  KPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRII-NGL--NF

Query:  EDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN
        ED L+PGG FDPLGLA DP+  A LKVKE+KNGRLAMF+M GF+ QA VTG+GP+ENLA HL+DP  NN
Subjt:  EDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN

AT2G34430.1 light-harvesting chlorophyll-protein complex II subunit B1 | 1.4e-53 | 46.33
Query:  ASSSVTVKTVALFGKKPAAKAKPSA-------------AVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQA
        A+S++ + + AL GK  A K  P+A             A  P     + WYG DR  +L        E P YL GE PGDYG+D  GLS  PE F++ + 
Subjt:  ASSSVTVKTVALFGKKPAAKAKPSA-------------AVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQA

Query:  FELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRIING---LNFEDKLHPGGPF
         E+IH+RWAMLGA G + PE   + G   G EAVWFK G+ +     L+Y GN   +    I+A+ A +V+L+G  E YR+         ED L+PGG F
Subjt:  FELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRIING---LNFEDKLHPGGPF

Query:  DPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN
        DPLGLA DP+  A LKVKE+KNGRLAMF+M GF+ QA VTG+GP+ENLA HL+DP  NN
Subjt:  DPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN

AT4G10340.1 light harvesting complex of photosystem II 5 | 2.6e-135 | 84.29
Query:  ASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGL
        ASLGVSEML  PL+F   S  SAP ASS  T KTVALF KK  A AK S AVS  +DELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEV GDYGYDPFGL
Subjt:  ASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGL

Query:  SKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKL
         KKPE+F+KYQAFELIHARWAMLGAAGFIIPEA NK+GANCGPEAVWFKTGALLLDGNTLNYFG NIPINL++AV+AEVVL+GGAEYYRI NGL+FEDKL
Subjt:  SKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKL

Query:  HPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
        HPGGPFDPLGLA DP+Q A+LKVKEIKNGRLAMFAMLGF+ QAYVTGEGPVENLAKHLSDPFGNNLLTVI+G  ER PTL
Subjt:  HPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
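In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the Python function below estimates percent identity for one alignment segment from its Query line and that middle line. The name `percent_identity` is hypothetical, and the result for a single segment will not necessarily reproduce the %identity column, which is presumably computed over the full alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character repeats the query residue.

    Assumes BLAST-style midlines: letter = identical, '+' = conservative,
    space = mismatch or gap. Trailing spaces lost in copying are restored.
    """
    if not query:
        raise ValueError("empty query segment")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query segment")
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if q == m and q != "-"
    )
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    # Last segment of the first GenBank hit (KAG6431911.1) shown above.
    print(round(percent_identity("GNNLLTVISGNVERVPTL", "GNNLLTVI+G+ ERVPTL"), 2))
```

For that 18-column segment, 15 columns are identical, which gives about 83.33 percent.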


Sequences
CDS sequence
ATGAACCGAAACAGACGGGACGGCGGAGAAGATGAGAACGGTTTACTGTGGAAGCTTCCAGTTCTGAAATCTGCCCGACTCGGAAAGTTAGGCCCCGCCTTCGGT
TTGGGCGTGGGCTGCGGCGTCGGCTTCGGCGTCGGCCTCGTCGGAGGTGCTGGATTTGGTCCGGGAATTCCAGGATTACAATTTGGCTTTGGTCTTGGTGCTGGA
TGTGGAGTTGGCTTAGGATTTGGCTATGGTGTTGGCAGGGGCATTGCCCAAGATGACAGAAGGAGATACTCTAACGTTGGGGATCTATTACACGGTCAAGGTCAT
CAAAGTATTTTTTCTCACCTTCAGTTTCCTCCGGCGATGGCTTCCTTAGCAGCCTCCACCGCGGCCGCCTCCCTCGGCGTCTCCGAGATGCTCAGAAATCCCCTC
AGCTTTGGCGGTGGCTCCCCCAGGTCGGCGCCTTCTGCTTCTAGCTCTGTCACCGTCAAGACTGTCGCGCTTTTCGGGAAGAAACCGGCCGCCAAGGCAAAGCCT
TCCGCCGCTGTCTCTCCGGTCAACGACGAGCTCGCCAAGTGGTACGGTCCTGACAGAAGGATTTTCTTGCCGGATGGGCTGTTGGACCGATCTGAGATCCCTGAG
TACTTGAACGGAGAAGTCCCCGGAGACTACGGCTACGATCCCTTTGGACTCAGCAAGAAACCAGAAGACTTCAGCAAATATCAGGCATTTGAATTGATCCACGCA
AGATGGGCCATGCTTGGAGCTGCTGGTTTCATCATCCCTGAGGCCTTCAACAAATTCGGAGCCAACTGTGGCCCCGAGGCCGTTTGGTTCAAGACTGGAGCTTTG
CTTTTGGATGGAAACACATTGAACTACTTTGGAAACAACATTCCCATCAACCTGATCGTTGCCGTGATTGCCGAGGTCGTCCTCGTCGGTGGCGCAGAATATTAC
AGAATCATCAACGGCTTGAATTTTGAAGATAAGCTTCACCCGGGTGGGCCGTTCGACCCATTGGGGCTGGCGGATGACCCCGACCAGGCAGCGATCTTGAAGGTG
AAGGAGATAAAGAATGGAAGGCTGGCGATGTTTGCAATGCTCGGGTTTTACTTCCAGGCTTACGTGACGGGCGAAGGCCCTGTGGAGAACTTGGCCAAGCATTTG
AGTGATCCCTTCGGAAACAATTTGCTCACTGTCATCTCAGGGAATGTCGAAAGAGTTCCAACTCTTTAA
mRNA sequence
ATGAACCGAAACAGACGGGACGGCGGAGAAGATGAGAACGGTTTACTGTGGAAGCTTCCAGTTCTGAAATCTGCCCGACTCGGAAAGTTAGGCCCCGCCTTCGGT
TTGGGCGTGGGCTGCGGCGTCGGCTTCGGCGTCGGCCTCGTCGGAGGTGCTGGATTTGGTCCGGGAATTCCAGGATTACAATTTGGCTTTGGTCTTGGTGCTGGA
TGTGGAGTTGGCTTAGGATTTGGCTATGGTGTTGGCAGGGGCATTGCCCAAGATGACAGAAGGAGATACTCTAACGTTGGGGATCTATTACACGGTCAAGGTCAT
CAAAGTATTTTTTCTCACCTTCAGTTTCCTCCGGCGATGGCTTCCTTAGCAGCCTCCACCGCGGCCGCCTCCCTCGGCGTCTCCGAGATGCTCAGAAATCCCCTC
AGCTTTGGCGGTGGCTCCCCCAGGTCGGCGCCTTCTGCTTCTAGCTCTGTCACCGTCAAGACTGTCGCGCTTTTCGGGAAGAAACCGGCCGCCAAGGCAAAGCCT
TCCGCCGCTGTCTCTCCGGTCAACGACGAGCTCGCCAAGTGGTACGGTCCTGACAGAAGGATTTTCTTGCCGGATGGGCTGTTGGACCGATCTGAGATCCCTGAG
TACTTGAACGGAGAAGTCCCCGGAGACTACGGCTACGATCCCTTTGGACTCAGCAAGAAACCAGAAGACTTCAGCAAATATCAGGCATTTGAATTGATCCACGCA
AGATGGGCCATGCTTGGAGCTGCTGGTTTCATCATCCCTGAGGCCTTCAACAAATTCGGAGCCAACTGTGGCCCCGAGGCCGTTTGGTTCAAGACTGGAGCTTTG
CTTTTGGATGGAAACACATTGAACTACTTTGGAAACAACATTCCCATCAACCTGATCGTTGCCGTGATTGCCGAGGTCGTCCTCGTCGGTGGCGCAGAATATTAC
AGAATCATCAACGGCTTGAATTTTGAAGATAAGCTTCACCCGGGTGGGCCGTTCGACCCATTGGGGCTGGCGGATGACCCCGACCAGGCAGCGATCTTGAAGGTG
AAGGAGATAAAGAATGGAAGGCTGGCGATGTTTGCAATGCTCGGGTTTTACTTCCAGGCTTACGTGACGGGCGAAGGCCCTGTGGAGAACTTGGCCAAGCATTTG
AGTGATCCCTTCGGAAACAATTTGCTCACTGTCATCTCAGGGAATGTCGAAAGAGTTCCAACTCTTTAA
Protein sequence
MNRNRRDGGEDENGLLWKLPVLKSARLGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQFGFGLGAGCGVGLGFGYGVGRGIAQDDRRRYSNVGDLLHGQGH
QSIFSHLQFPPAMASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSAAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPE
YLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYY
RIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL
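The CDS and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, assuming the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 12 codons of the CDS listed above.
    print(translate("ATGAACCGAAACAGACGGGACGGCGGAGAAGATGAG"))
```

The first twelve codons translate to MNRNRRDGGEDE, which matches the start of the protein sequence given here. Passing the full CDS (with its line breaks) allows the same comparison over the whole sequence.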