; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh11G007560 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh11G007560
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionChlorophyll a-b binding protein, chloroplastic
Genome locationCmo_Chr11:3676214..3683238
RNA-Seq ExpressionCmoCh11G007560
SyntenyCmoCh11G007560
Gene Ontology termsGO:0009416 - response to light stimulus (biological process)
GO:0009768 - photosynthesis, light harvesting in photosystem I (biological process)
GO:0018298 - protein-chromophore linkage (biological process)
GO:0009522 - photosystem I (cellular component)
GO:0009523 - photosystem II (cellular component)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016168 - chlorophyll binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001344 - Chlorophyll A-B binding protein, plant and chromista
IPR009327 - Cupin domain of unknown function DUF985
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold
IPR022796 - Chlorophyll A-B binding protein
IPR023329 - Chlorophyll a/b binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588096.1 Photosystem I chlorophyll a/b-binding protein 6, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]9.0e-15498.86Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
        MALSISSTALSTLPI RESSHSHH+ALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW

Query:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
        FAQAELMHARWAMLAVAGILLPEWFESLGLI NFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
Subjt:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF

Query:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
        DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
Subjt:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS

KAG7021985.1 Photosystem I chlorophyll a/b-binding protein 6, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]3.3e-15699.62Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
        MALSISSTALSTLPISRESSHSHH+ALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW

Query:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
        FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
Subjt:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF

Query:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
        DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
Subjt:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS

PWA76319.1 photosystem I light harvesting complex protein [Artemisia annua]2.9e-17661.43Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTH---LNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPEL
        MA +I+ST+ S+L    E +     A        +Y LR  S++   + AAK GVSSVCEPLP DRP+WFPG +PPEWLDGSLPGDFGFDPLGLGSDPE 
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTH---LNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPEL

Query:  LKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGG
        LKWFAQAELMH+RWAMLAVAGIL+PEW ESLG I N+SW+DAG+REYFAD TTL V QL LMGWVEGRRWAD+VNPGSVD++  LP++K+P PDVGYPGG
Subjt:  LKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGG

Query:  FWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSV----------TTVLAMNLGHSAPV-EFELRFRD
         WFDP MWGRGSPEPVMVLRTKEIKNGRLAMLAF G  FQAIYTGQGPLENL+AH+ADPGH NIFSV          T V     G +  +  F L   D
Subjt:  FWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSV----------TTVLAMNLGHSAPV-EFELRFRD

Query:  WLHPTPDTTVLLL-----VERVERRIKMGTASEIVAKLNLKPHPEGGFYSETFRDKSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCISALHRIPCAETW
         L+      + LL     +          T SEI+AKLNL P+ EGGF+ ETFRD S++L+ S LP  +KVDR +ST IYF +P+G +S LHRIP AETW
Subjt:  WLHPTPDTTVLLL-----VERVERRIKMGTASEIVAKLNLKPHPEGGFYSETFRDKSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCISALHRIPCAETW

Query:  HFYSGEPLTILELNEEDGRVKFTCLGSDFIGSNHLLQYTVPPNVWFGSFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVS
        HFY GEP+TILE+++++G VKFT +G D + SN +LQYTV P+VWFG+FP++DF+I  D  + + APR++E H+SLVG +  PAF+ +DF LAKRS+L+S
Subjt:  HFYSGEPLTILELNEEDGRVKFTCLGSDFIGSNHLLQYTVPPNVWFGSFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVS

Query:  RFP
        RFP
Subjt:  RFP

XP_022929235.1 photosystem I chlorophyll a/b-binding protein 6, chloroplastic isoform X1 [Cucurbita moschata]1.5e-156100Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
        MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW

Query:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
        FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
Subjt:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF

Query:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
        DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
Subjt:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS

XP_022929243.1 photosystem I chlorophyll a/b-binding protein 6, chloroplastic isoform X2 [Cucurbita moschata]1.1e-15499.62Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
        MALSISSTALSTLPI RESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW

Query:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
        FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
Subjt:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF

Query:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
        DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
Subjt:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS

TrEMBL top hitse value%identityAlignment
A0A2U1NS20 Chlorophyll a-b binding protein, chloroplastic1.4e-17661.43Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTH---LNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPEL
        MA +I+ST+ S+L    E +     A        +Y LR  S++   + AAK GVSSVCEPLP DRP+WFPG +PPEWLDGSLPGDFGFDPLGLGSDPE 
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTH---LNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPEL

Query:  LKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGG
        LKWFAQAELMH+RWAMLAVAGIL+PEW ESLG I N+SW+DAG+REYFAD TTL V QL LMGWVEGRRWAD+VNPGSVD++  LP++K+P PDVGYPGG
Subjt:  LKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGG

Query:  FWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSV----------TTVLAMNLGHSAPV-EFELRFRD
         WFDP MWGRGSPEPVMVLRTKEIKNGRLAMLAF G  FQAIYTGQGPLENL+AH+ADPGH NIFSV          T V     G +  +  F L   D
Subjt:  FWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSV----------TTVLAMNLGHSAPV-EFELRFRD

Query:  WLHPTPDTTVLLL-----VERVERRIKMGTASEIVAKLNLKPHPEGGFYSETFRDKSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCISALHRIPCAETW
         L+      + LL     +          T SEI+AKLNL P+ EGGF+ ETFRD S++L+ S LP  +KVDR +ST IYF +P+G +S LHRIP AETW
Subjt:  WLHPTPDTTVLLL-----VERVERRIKMGTASEIVAKLNLKPHPEGGFYSETFRDKSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCISALHRIPCAETW

Query:  HFYSGEPLTILELNEEDGRVKFTCLGSDFIGSNHLLQYTVPPNVWFGSFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVS
        HFY GEP+TILE+++++G VKFT +G D + SN +LQYTV P+VWFG+FP++DF+I  D  + + APR++E H+SLVG +  PAF+ +DF LAKRS+L+S
Subjt:  HFYSGEPLTILELNEEDGRVKFTCLGSDFIGSNHLLQYTVPPNVWFGSFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVS

Query:  RFP
        RFP
Subjt:  RFP

A0A6J1ERI7 Chlorophyll a-b binding protein, chloroplastic5.2e-15599.62Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
        MALSISSTALSTLPI RESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW

Query:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
        FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
Subjt:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF

Query:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
        DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
Subjt:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS

A0A6J1ETS8 Chlorophyll a-b binding protein, chloroplastic7.2e-157100Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
        MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW

Query:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
        FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
Subjt:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF

Query:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
        DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
Subjt:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS

A0A6J1HS79 Chlorophyll a-b binding protein, chloroplastic4.8e-15398.48Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
        MALSISSTALS+LPISRESSHS HRALNFPGKFPKYNLRRGS+HLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW

Query:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
        FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPG VDVDLKLPHKKKPTPDVGYPGGFWF
Subjt:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF

Query:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
        DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
Subjt:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS

A0A6J1HWP6 Chlorophyll a-b binding protein, chloroplastic2.7e-15198.1Show/hide
Query:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
        MALSISSTALS+LPI RESSHS HRALNFPGKFPKYNLRRGS+HLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW
Subjt:  MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKW

Query:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF
        FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPG VDVDLKLPHKKKPTPDVGYPGGFWF
Subjt:  FAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWF

Query:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
        DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS
Subjt:  DPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFS

SwissProt top hitse value%identityAlignment
P10708 Chlorophyll a-b binding protein 7, chloroplastic3.9e-8363.98Show/hide
Query:  SSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLL
        ++VC    PDRPLWFPGSTPP WLDGSLPGDFGFDPLGL SDPE L+W  QAEL+H RWAML  AGI +PE    +G++   SWY AG +EYF D TTL 
Subjt:  SSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLL

Query:  VAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAH
        + +L L+GW EGRRWAD++ PG V+ D   P+ K    DVGYPGG WFDP+ WG GSP  +  LRTKEIKNGRLAMLA +G WFQ IYTG GP++NL AH
Subjt:  VAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAH

Query:  VADPGHCNIFS
        +ADPGH  IF+
Subjt:  VADPGHCNIFS

P13869 Chlorophyll a-b binding protein, chloroplastic1.2e-8467.98Show/hide
Query:  PDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMG
        PDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPE LKW AQAEL+H+RWAML  AGI +PE+   +G++   SWY AG +EYF D TTL V +L L+G
Subjt:  PDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMG

Query:  WVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN
        W EGRRWAD++ PG V+ D   P+ K    DVGYPGG WFDP+ WG GSP  +  LRTKEIKNGRLAMLA +G WFQ IYTG GP++NL AH+ADPGH  
Subjt:  WVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN

Query:  IFS
        IF+
Subjt:  IFS

Q8LCQ4 Photosystem I chlorophyll a/b-binding protein 6, chloroplastic2.3e-10769.03Show/hide
Query:  MALSISSTALSTLPI--SRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELL
        MA +I+S   STL +  SR  + +  R            L R    +  A   VSSVCEPLPPDRPLWFPGS+PPEWLDGSLPGDFGFDPLGLGSDP+ L
Subjt:  MALSISSTALSTLPI--SRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELL

Query:  KWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGF
        KWFAQAEL+H+RWAMLAV GI++PE  E LG I+NFSWYDAG+REYFAD TTL VAQ+ LMGW EGRRWAD++ PGSVD++ K PHK  P PDVGYPGG 
Subjt:  KWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGF

Query:  WFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSVTT
        WFD MMWGRGSPEPVMVLRTKEIKNGRLAMLAF+G  FQA YT Q P+ENL AH+ADPGHCN+FS  T
Subjt:  WFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSVTT

Q9SQL2 Chlorophyll a-b binding protein P4, chloroplastic4.6e-5252.55Show/hide
Query:  WFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGR
        W PG   P +L GSLPGD GFDPLGL  DPE L+WF QAEL++ RWAML VAG+LLPE F S+G+I    WY AG  EYFA  +TL V +  L  +VE R
Subjt:  WFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGR

Query:  RWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNI
        RW D+ NPGSV+ D        P  +VGYPGG  F+P+ +      P +  + KEI NGRLAMLAF+G   Q   TG+GP +NL  H++DP H  I
Subjt:  RWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNI

Q9SYW8 Photosystem I chlorophyll a/b-binding protein 2, chloroplastic1.7e-8365.53Show/hide
Query:  PDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMG
        PDRP+WFPGSTPPEWLDGSLPGDFGFDPLGL SDP+ LKW  QAE++H RWAML  AGI +PE+   +G++   SWY AG +EYF D TTL V +L L+G
Subjt:  PDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMG

Query:  WVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN
        W EGRRWAD++ PGSV+ D   P+ K    DVGYPGG WFDP+ WG GSP  +  LRTKEIKNGRLAMLA +G WFQ IYTG GP++NL AH+ADPGH  
Subjt:  WVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN

Query:  IFSVTT
        IF+  T
Subjt:  IFSVTT

Arabidopsis top hitse value%identityAlignment
AT1G19130.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF985 (InterPro:IPR009327), RmlC-like jelly roll fold (InterPro:IPR014710); Has 1465 Blast hits to 1465 proteins in 584 species: Archae - 10; Bacteria - 1038; Metazoa - 19; Fungi - 43; Plants - 51; Viruses - 0; Other Eukaryotes - 304 (source: NCBI BLink).5.4e-7271.19Show/hide
Query:  KMGTASEIVAKLNLKPHPEGGFYSETFRDKSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCISALHRIPCAETWHFYSGEPLTILELNEEDGRVKFTCLG
        KM  +SEIV KLNL+ H EGGF++ETFRD SV LS S LPP +KVDR VST IYFL+PSG +S LHRIP AETWHFY GEPLT++EL  +DG++KFTCLG
Subjt:  KMGTASEIVAKLNLKPHPEGGFYSETFRDKSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCISALHRIPCAETWHFYSGEPLTILELNEEDGRVKFTCLG

Query:  SDFIGSNHLLQYTVPPNVWFGSFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFP
         D    +   QYTVPPNVWFGSFPTKD + S DG + KA  RDSENH+SLVGC+CAPAFQFEDFELAKRSDL+SRFP
Subjt:  SDFIGSNHLLQYTVPPNVWFGSFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFP

AT1G19150.1 photosystem I light harvesting complex gene 61.6e-10869.03Show/hide
Query:  MALSISSTALSTLPI--SRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELL
        MA +I+S   STL +  SR  + +  R            L R    +  A   VSSVCEPLPPDRPLWFPGS+PPEWLDGSLPGDFGFDPLGLGSDP+ L
Subjt:  MALSISSTALSTLPI--SRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELL

Query:  KWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGF
        KWFAQAEL+H+RWAMLAV GI++PE  E LG I+NFSWYDAG+REYFAD TTL VAQ+ LMGW EGRRWAD++ PGSVD++ K PHK  P PDVGYPGG 
Subjt:  KWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGF

Query:  WFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSVTT
        WFD MMWGRGSPEPVMVLRTKEIKNGRLAMLAF+G  FQA YT Q P+ENL AH+ADPGHCN+FS  T
Subjt:  WFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSVTT

AT1G45474.1 photosystem I light harvesting complex gene 58.1e-4441.18Show/hide
Query:  LNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREY
        +++ K+    +   +  +R  W PG  PP +LDG+L GD+GFDPLGLG DPE LKW+ QAEL+H+R+AML VAGIL  +   + G+     WY+AG  ++
Subjt:  LNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREY

Query:  -FADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPT---PDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIY
         FA   TL+V Q  LMG+ E +R+ D V+PGS   +       +      + GYPGG   +P+   +   +     + KEIKNGRLAM+A +G + QA  
Subjt:  -FADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPT---PDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIY

Query:  TGQGPLENLAAHVADPGHCNI
        T  GP++NL  H+++P H  I
Subjt:  TGQGPLENLAAHVADPGHCNI

AT3G47470.1 light-harvesting chlorophyll-protein complex I subunit A49.6e-5352.04Show/hide
Query:  WFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGR
        W PG   P++L GSL GD GFDPLGL  DPE LKWF QAEL++ RWAML VAG+LLPE F  +G+I    WYDAG  +YFA  +TL V +  L  +VE R
Subjt:  WFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGR

Query:  RWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNI
        RW D+ NPGSV+ D        P  +VGYPGG  F+P+ +      P    + KE+ NGRLAMLAF+G   Q   TG+GP ENL  H++DP H  I
Subjt:  RWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNI

AT3G61470.1 photosystem I light harvesting complex gene 21.2e-8465.53Show/hide
Query:  PDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMG
        PDRP+WFPGSTPPEWLDGSLPGDFGFDPLGL SDP+ LKW  QAE++H RWAML  AGI +PE+   +G++   SWY AG +EYF D TTL V +L L+G
Subjt:  PDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHARWAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMG

Query:  WVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN
        W EGRRWAD++ PGSV+ D   P+ K    DVGYPGG WFDP+ WG GSP  +  LRTKEIKNGRLAMLA +G WFQ IYTG GP++NL AH+ADPGH  
Subjt:  WVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN

Query:  IFSVTT
        IF+  T
Subjt:  IFSVTT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCTCTCCATTTCTTCCACTGCGCTCTCAACTCTCCCAATCAGCAGAGAATCATCCCACAGCCACCACAGAGCACTCAATTTCCCCGGAAAATTCCCAAAATACAA
TCTCCGGCGAGGTTCTACTCATCTGAACGCTGCAAAATCCGGCGTATCTAGCGTCTGTGAACCGCTCCCTCCAGATCGACCCCTCTGGTTCCCCGGCAGCACCCCCCCTG
AGTGGCTCGACGGCAGCCTTCCCGGCGATTTCGGGTTCGACCCACTTGGATTGGGATCCGACCCAGAGCTCCTAAAATGGTTCGCACAAGCGGAGCTAATGCACGCGAGA
TGGGCAATGCTCGCAGTGGCCGGAATTCTCCTCCCCGAATGGTTCGAGAGCTTAGGACTGATTCAAAATTTCTCCTGGTACGACGCCGGAACGCGAGAATACTTTGCAGA
CCCGACGACATTGCTGGTGGCTCAATTGGGTCTAATGGGGTGGGTAGAGGGGCGGCGGTGGGCGGACATGGTTAACCCAGGCAGCGTCGACGTGGACCTCAAACTACCTC
ACAAGAAAAAGCCAACACCGGACGTCGGCTACCCCGGCGGGTTCTGGTTCGACCCGATGATGTGGGGCAGGGGATCGCCGGAGCCGGTCATGGTGCTGCGGACTAAGGAG
ATCAAGAACGGGCGGCTGGCGATGCTGGCGTTTGTCGGACTGTGGTTTCAGGCTATTTATACAGGGCAGGGTCCGCTGGAGAATCTCGCCGCCCACGTTGCCGATCCCGG
CCACTGCAACATCTTCTCGGTAACGACAGTACTTGCCATGAATCTTGGGCATTCAGCTCCGGTGGAATTTGAGCTCAGATTCAGAGATTGGTTACATCCGACACCCGACA
CAACTGTGTTGCTTTTGGTTGAAAGAGTTGAGAGAAGAATCAAAATGGGTACTGCATCAGAAATTGTGGCGAAATTGAATTTGAAACCTCATCCAGAAGGCGGTTTTTAC
TCTGAAACTTTCAGAGACAAGTCCGTTCATCTCTCCAAATCTCATCTCCCACCGGAGTACAAGGTTGATCGAGAGGTCAGCACGTGCATATACTTTTTGGTGCCGTCTGG
ATGCATTTCTGCTCTTCATCGGATTCCATGTGCAGAAACTTGGCATTTTTACTCGGGAGAACCTCTTACGATACTGGAGTTGAATGAAGAGGACGGTCGAGTCAAATTTA
CTTGTCTTGGGTCTGATTTCATTGGAAGCAATCATTTACTTCAGTATACAGTGCCTCCTAATGTCTGGTTTGGTTCTTTTCCAACCAAGGACTTCAATATTTCTGCAGAT
GGGACTGTGACTAAAGCTGCTCCAAGGGACTCTGAGAATCACTACTCACTTGTGGGCTGCAGCTGTGCACCTGCTTTCCAGTTTGAGGACTTCGAATTGGCTAAACGTTC
TGACCTTGTTTCTCGTTTTCCCGATAGTTCTGTAGATGGTCCCCGCTGTACAGAAACAGAAACGAACCCATTGCAAAAGGAAAAACTTAGAAAAAGACCTGACCTTTCTC
TTTTTGTGGGTTCAAGAGTAAAGAATATTGGTAACTGCAAAAACATAGAGAGAGAGATCTTACCTCTCTTTTTGTGGGTTCAGGAACAAATTCTGATCCTTTTCGAAAGG
AAGGTAGCTTCTCAGATTTCTAAAACGATATCTGCACACGAAAATGGAAACACACAATCAGGTACTTCTTCATGGATGATGTTTGGACATTGGAGCTTTGGGGTTCGGGT
TCTCAGAGGAGGAGCAACTGCAGCTGGTGCAATACCGCCTCCACAGACGAAAATTTTCTTGAGAAGGAGAGATAAAGTTCTTTTGCCAATAAGTGTAGTCTCATTAGCTT
CCATACCCATATTTTTGCCTCTACTGAGTATGACATGCTTGAAACTCTCGCTCTCGTCAATTAACTCGTCCGAATATGGTTCATTTTTTGTTATGTCTGATTCGAAGCTT
AATTGTCTGGTAGGAAATATGTTTGATGAACAATGCGTGCTGACTTCAAATGCCAAGGATGGGTCCATTTGCTTAAGATTATCATCTCCAAAGGTTCCGATCGCCAGCAA
CACACGAGGCCAGCTGAAAATCCGGGAGGCTGCTGCTGATGCGAGTGCATGA
mRNA sequenceShow/hide mRNA sequence
CATCATATACATATCAAAGCTGTTCACCAATCTTGTTTTATATCACTGAAGCTCTCCGGCCATGGCCCTCTCCATTTCTTCCACTGCGCTCTCAACTCTCCCAATCAGCA
GAGAATCATCCCACAGCCACCACAGAGCACTCAATTTCCCCGGAAAATTCCCAAAATACAATCTCCGGCGAGGTTCTACTCATCTGAACGCTGCAAAATCCGGCGTATCT
AGCGTCTGTGAACCGCTCCCTCCAGATCGACCCCTCTGGTTCCCCGGCAGCACCCCCCCTGAGTGGCTCGACGGCAGCCTTCCCGGCGATTTCGGGTTCGACCCACTTGG
ATTGGGATCCGACCCAGAGCTCCTAAAATGGTTCGCACAAGCGGAGCTAATGCACGCGAGATGGGCAATGCTCGCAGTGGCCGGAATTCTCCTCCCCGAATGGTTCGAGA
GCTTAGGACTGATTCAAAATTTCTCCTGGTACGACGCCGGAACGCGAGAATACTTTGCAGACCCGACGACATTGCTGGTGGCTCAATTGGGTCTAATGGGGTGGGTAGAG
GGGCGGCGGTGGGCGGACATGGTTAACCCAGGCAGCGTCGACGTGGACCTCAAACTACCTCACAAGAAAAAGCCAACACCGGACGTCGGCTACCCCGGCGGGTTCTGGTT
CGACCCGATGATGTGGGGCAGGGGATCGCCGGAGCCGGTCATGGTGCTGCGGACTAAGGAGATCAAGAACGGGCGGCTGGCGATGCTGGCGTTTGTCGGACTGTGGTTTC
AGGCTATTTATACAGGGCAGGGTCCGCTGGAGAATCTCGCCGCCCACGTTGCCGATCCCGGCCACTGCAACATCTTCTCGGTAACGACAGTACTTGCCATGAATCTTGGG
CATTCAGCTCCGGTGGAATTTGAGCTCAGATTCAGAGATTGGTTACATCCGACACCCGACACAACTGTGTTGCTTTTGGTTGAAAGAGTTGAGAGAAGAATCAAAATGGG
TACTGCATCAGAAATTGTGGCGAAATTGAATTTGAAACCTCATCCAGAAGGCGGTTTTTACTCTGAAACTTTCAGAGACAAGTCCGTTCATCTCTCCAAATCTCATCTCC
CACCGGAGTACAAGGTTGATCGAGAGGTCAGCACGTGCATATACTTTTTGGTGCCGTCTGGATGCATTTCTGCTCTTCATCGGATTCCATGTGCAGAAACTTGGCATTTT
TACTCGGGAGAACCTCTTACGATACTGGAGTTGAATGAAGAGGACGGTCGAGTCAAATTTACTTGTCTTGGGTCTGATTTCATTGGAAGCAATCATTTACTTCAGTATAC
AGTGCCTCCTAATGTCTGGTTTGGTTCTTTTCCAACCAAGGACTTCAATATTTCTGCAGATGGGACTGTGACTAAAGCTGCTCCAAGGGACTCTGAGAATCACTACTCAC
TTGTGGGCTGCAGCTGTGCACCTGCTTTCCAGTTTGAGGACTTCGAATTGGCTAAACGTTCTGACCTTGTTTCTCGTTTTCCCGATAGTTCTGTAGATGGTCCCCGCTGT
ACAGAAACAGAAACGAACCCATTGCAAAAGGAAAAACTTAGAAAAAGACCTGACCTTTCTCTTTTTGTGGGTTCAAGAGTAAAGAATATTGGTAACTGCAAAAACATAGA
GAGAGAGATCTTACCTCTCTTTTTGTGGGTTCAGGAACAAATTCTGATCCTTTTCGAAAGGAAGGTAGCTTCTCAGATTTCTAAAACGATATCTGCACACGAAAATGGAA
ACACACAATCAGGTACTTCTTCATGGATGATGTTTGGACATTGGAGCTTTGGGGTTCGGGTTCTCAGAGGAGGAGCAACTGCAGCTGGTGCAATACCGCCTCCACAGACG
AAAATTTTCTTGAGAAGGAGAGATAAAGTTCTTTTGCCAATAAGTGTAGTCTCATTAGCTTCCATACCCATATTTTTGCCTCTACTGAGTATGACATGCTTGAAACTCTC
GCTCTCGTCAATTAACTCGTCCGAATATGGTTCATTTTTTGTTATGTCTGATTCGAAGCTTAATTGTCTGGTAGGAAATATGTTTGATGAACAATGCGTGCTGACTTCAA
ATGCCAAGGATGGGTCCATTTGCTTAAGATTATCATCTCCAAAGGTTCCGATCGCCAGCAACACACGAGGCCAGCTGAAAATCCGGGAGGCTGCTGCTGATGCGAGTGCA
TGA
Protein sequenceShow/hide protein sequence
MALSISSTALSTLPISRESSHSHHRALNFPGKFPKYNLRRGSTHLNAAKSGVSSVCEPLPPDRPLWFPGSTPPEWLDGSLPGDFGFDPLGLGSDPELLKWFAQAELMHAR
WAMLAVAGILLPEWFESLGLIQNFSWYDAGTREYFADPTTLLVAQLGLMGWVEGRRWADMVNPGSVDVDLKLPHKKKPTPDVGYPGGFWFDPMMWGRGSPEPVMVLRTKE
IKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSVTTVLAMNLGHSAPVEFELRFRDWLHPTPDTTVLLLVERVERRIKMGTASEIVAKLNLKPHPEGGFY
SETFRDKSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCISALHRIPCAETWHFYSGEPLTILELNEEDGRVKFTCLGSDFIGSNHLLQYTVPPNVWFGSFPTKDFNISAD
GTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPDSSVDGPRCTETETNPLQKEKLRKRPDLSLFVGSRVKNIGNCKNIEREILPLFLWVQEQILILFER
KVASQISKTISAHENGNTQSGTSSWMMFGHWSFGVRVLRGGATAAGAIPPPQTKIFLRRRDKVLLPISVVSLASIPIFLPLLSMTCLKLSLSSINSSEYGSFFVMSDSKL
NCLVGNMFDEQCVLTSNAKDGSICLRLSSPKVPIASNTRGQLKIREAAADASA