; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh12G007570 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh12G007570
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPhotosystem II CP43 reaction center protein
Genome locationCmo_Chr12:6069077..6070245
RNA-Seq ExpressionCmoCh12G007570
SyntenyCmoCh12G007570
Gene Ontology termsGO:0009767 - photosynthetic electron transport chain (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0009521 - photosystem (cellular component)
GO:0043229 - intracellular organelle (cellular component)
GO:0016168 - chlorophyll binding (molecular function)
InterPro domainsIPR000932 - Photosystem antenna protein-like
IPR036001 - Photosystem antenna protein-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CDY29457.1 BnaC05g32130D [Brassica napus]1.5e-2155.81Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV
        AN+GS     TGLGKYLMRSPT E               V+FG      + ++L    L  +  P       LRG  VATEINAVNY+SPRSWL TSHFV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV

Query:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR
        LGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR

KAF3499501.1 hypothetical protein F2Q69_00044700 [Brassica cretica]1.5e-2155.81Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV
        AN+GS     TGLGKYLMRSPT E               V+FG      + ++L    L  +  P       LRG  VATEINAVNY+SPRSWL TSHFV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV

Query:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR
        LGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR

KAF4402748.1 hypothetical protein G4B88_010200 [Cannabis sativa]1.5e-2154.62Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFVL
        AN+GS     T LGKYLM SPT         G  +ER   L  +            T   IYD CSFRFF + G  VA EIN +NY+SPRSWL TSHFVL
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFVL

Query:  GFPPFLGHLWHAGRARAAAAEFEKGIDRPL
        GF  F+GHLWHAG  RAAAA FEKGIDR L
Subjt:  GFPPFLGHLWHAGRARAAAAEFEKGIDRPL

VDD26495.1 unnamed protein product [Brassica rapa]1.5e-2155.81Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV
        AN+GS     TGLGKYLMRSPT E               V+FG      + ++L    L  +  P       LRG  VATEINAVNY+SPRSWL TSHFV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV

Query:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR
        LGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR

YP_247595.1 photosystem II 44 kDa protein [Cucumis sativus]4.7e-2357.25Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAK--IYNLGK-TLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPR
        ANIGS     TGLGKYLMRSPT E        R ++ R    E      G   S  K  I  L + T R IYDP         G  VATEINAVNY+SPR
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAK--IYNLGK-TLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPR

Query:  SWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDR
        SWL TSHFVLGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  SWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDR

TrEMBL top hitse value%identityAlignment
A0A078EKT7 BnaC05g32120D protein7.4e-2255.81Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV
        AN+GS     TGLGKYLMRSPT E               V+FG      + ++L    L  +  P       LRG  VATEINAVNY+SPRSWL TSHFV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV

Query:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR
        LGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR

A0A078HPI2 BnaC03g60960D protein7.4e-2255.81Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV
        AN+GS     TGLGKYLMRSPT E               V+FG      + ++L    L  +  P       LRG  VATEINAVNY+SPRSWL TSHFV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV

Query:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR
        LGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR

A0A0D3CFB6 Uncharacterized protein7.4e-2255.81Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV
        AN+GS     TGLGKYLMRSPT E               V+FG      + ++L    L  +  P       LRG  VATEINAVNY+SPRSWL TSHFV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV

Query:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR
        LGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR

A0A3P6DR44 Uncharacterized protein7.4e-2255.81Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV
        AN+GS     TGLGKYLMRSPT E               V+FG      + ++L    L  +  P       LRG  VATEINAVNY+SPRSWL TSHFV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFV

Query:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR
        LGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  LGFPPFLGHLWHAGRARAAAAEFEKGIDR

A0A7J6I6V3 Uncharacterized protein7.4e-2254.62Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFVL
        AN+GS     T LGKYLM SPT         G  +ER   L  +            T   IYD CSFRFF + G  VA EIN +NY+SPRSWL TSHFVL
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLSPRSWLVTSHFVL

Query:  GFPPFLGHLWHAGRARAAAAEFEKGIDRPL
        GF  F+GHLWHAG  RAAAA FEKGIDR L
Subjt:  GFPPFLGHLWHAGRARAAAAEFEKGIDRPL

SwissProt top hitse value%identityAlignment
A6MMK3 Photosystem II CP43 reaction center protein7.1e-2250.68Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYN---LGKTLRRIYDPCSFR-------------FFKLRGWRVATEINAV
        AN+GS     TGLGKYLMRSPT E       G +  R + L        +  N   LG+ L++   P   R                + G  VATEINAV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYN---LGKTLRRIYDPCSFR-------------FFKLRGWRVATEINAV

Query:  NYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDRPL
        NY+SPRSWL TSHFVLGF  F+GHLWHAGRARAAAA FEKGIDR L
Subjt:  NYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDRPL

B1A931 Photosystem II CP43 reaction center protein1.2e-2150Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFF---------KLRGWRVATEINAV
        AN+GS     TGLGKYLMRSPT E        R ++ R    E    L G   +   +  L K ++   +  S  +           + G  VATEINAV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFF---------KLRGWRVATEINAV

Query:  NYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDRPL
        NY+SPRSWL TSHFVLGF  F+GHLWHAGRARAAAA FEKGIDR L
Subjt:  NYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDRPL

P10804 Photosystem II CP43 reaction center protein1.4e-2250.68Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFF---------KLRGWRVATEINAV
        AN+GS     TGLGKYLMRSPT E        R ++ R    E    L G   +   +  L K ++   +  S  +           + G  VATEINAV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFF---------KLRGWRVATEINAV

Query:  NYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDRPL
        NY+SPRSWL TSHFVLGF PF+GHLWHAGRARAAAA FEKGIDR L
Subjt:  NYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDRPL

P48187 Photosystem II CP43 reaction center protein4.2e-2249.65Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFFKLRGWR----VATEINAVNYLSP
        AN+GS     TGLGKYLMRSPT E        R ++ R    E   +   +   L+++   G+  +            L        VATEINAVNY+SP
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFFKLRGWR----VATEINAVNYLSP

Query:  RSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDRPL
        RSWL TSHFVLGF  F+GHLWHAGRARAAAA FEKGIDR L
Subjt:  RSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDRPL

Q85FM3 Photosystem II CP43 reaction center protein2.4e-2246.75Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLR----GWR-------------------
        ANIGS     TGLGKYLMRSPT E               ++FG      + ++L    L  +  P      KL+     W+                   
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNL-GKTLRRIYDPCSFRFFKLR----GWR-------------------

Query:  --VATEINAVNYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDR
          VATEINAVNY+SPRSWL TSHFVLGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  --VATEINAVNYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDR

Arabidopsis top hitse value%identityAlignment
ATCG00280.1 photosystem II reaction center protein C4.3e-2250Show/hide
Query:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFF---------KLRGWRVATEINAV
        AN+GS     TGLGKYLMRSPT E        R ++ R    E    L G   +   +  L K ++   +  S  +           + G  VATEINAV
Subjt:  ANIGSRTYSTTGLGKYLMRSPTIE-------KRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFF---------KLRGWRVATEINAV

Query:  NYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDR
        NY+SPRSWL TSHFVLGF  F+GHLWHAGRARAAAA FEKGIDR
Subjt:  NYLSPRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAAGATCCATTTTCCTTCCCCGCCCTCCTAGCTACTACTCTAAGTTGGGGCGTAGGCACTGCTAACATTGGATCAAGGACCTACTCTACTACTGGTTTAGGTAA
ATATTTAATGCGTTCTCCGACCATAGAGAAAAGGCATTTTGAGGGGAGAGGTAGAAAGAAAGAAAGAGACTATGTGCTTTTTGGATCTGCGTGCTCCTTGGCAAAGATAT
ACAATCTTGGCAAGACCTTACGCAGAATATATGACCCATGCTCCTTTAGGTTCTTTAAATTACGGGGGTGGCGTGTAGCTACCGAAATTAATGCAGTCAATTATCTCTCT
CCTAGAAGTTGGTTAGTTACCTCTCATTTTGTTCTAGGATTCCCCCCATTTCTAGGTCATTTATGGCATGCAGGAAGGGCTCGGGCAGCTGCCGCAGAATTTGAAAAAGG
AATTGATCGACCTCTTCATGACTCTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACAAAGATCCATTTTCCTTCCCCGCCCTCCTAGCTACTACTCTAAGTTGGGGCGTAGGCACTGCTAACATTGGATCAAGGACCTACTCTACTACTGGTTTAGGTAA
ATATTTAATGCGTTCTCCGACCATAGAGAAAAGGCATTTTGAGGGGAGAGGTAGAAAGAAAGAAAGAGACTATGTGCTTTTTGGATCTGCGTGCTCCTTGGCAAAGATAT
ACAATCTTGGCAAGACCTTACGCAGAATATATGACCCATGCTCCTTTAGGTTCTTTAAATTACGGGGGTGGCGTGTAGCTACCGAAATTAATGCAGTCAATTATCTCTCT
CCTAGAAGTTGGTTAGTTACCTCTCATTTTGTTCTAGGATTCCCCCCATTTCTAGGTCATTTATGGCATGCAGGAAGGGCTCGGGCAGCTGCCGCAGAATTTGAAAAAGG
AATTGATCGACCTCTTCATGACTCTTCTTAA
Protein sequenceShow/hide protein sequence
MNKDPFSFPALLATTLSWGVGTANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRGRKKERDYVLFGSACSLAKIYNLGKTLRRIYDPCSFRFFKLRGWRVATEINAVNYLS
PRSWLVTSHFVLGFPPFLGHLWHAGRARAAAAEFEKGIDRPLHDSS