; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G05970 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G05970
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionChlorophyll A-B binding protein
Genome locationChr2:4483949..4489785
RNA-Seq ExpressionCSPI02G05970
SyntenyCSPI02G05970
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR023329 - Chlorophyll a/b binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055599.1 uncharacterized protein E6C27_scaffold222G00820 [Cucumis melo var. makuwa]3.7e-5889.78Show/hide
Query:  MAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKR
        MAAAS +LILPINGGNLPPSQYLSFRH+ PSATFSRLGWSR  D GRST RTRGQAFRISNVSP +DGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKR
Subjt:  MAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKR

Query:  RRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADY
        RRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQ+  Y
Subjt:  RRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADY

XP_008451793.1 PREDICTED: uncharacterized protein LOC103492970 isoform X1 [Cucumis melo]1.0e-7690.48Show/hide
Query:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPL
        MWAS+IYKWISIHA + CTL+MAAAS +LILPINGGNLPPSQYLSFRH+ PSATFSRLGWSR  D GRST RTRGQAFRISNVSP +DGLIKQVIMVDPL
Subjt:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQLADYFSILINFFIR
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR

XP_011648927.1 uncharacterized protein LOC101212671 [Cucumis sativus]9.4e-86100Show/hide
Query:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSRDAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEA
        MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSRDAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEA
Subjt:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSRDAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEA

Query:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR
        KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR
Subjt:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR

XP_016902490.1 PREDICTED: uncharacterized protein LOC103492970 isoform X2 [Cucumis melo]1.0e-6885.12Show/hide
Query:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPL
        MWAS+IYKWISIHA + CTL+MAAAS +LILPINGGNLPPSQYLSFRH+ PSATFSRLGWSR  D GRST RTRGQAFRISN           VIMVDPL
Subjt:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQLADYFSILINFFIR
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR

XP_038889814.1 uncharacterized protein LOC120079625 [Benincasa hispida]3.4e-5987.41Show/hide
Query:  ASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQ
        AST+LILPI GGNLPPSQYLSFRHT PSATFSR GWSR  D GRST RTRGQAF+ISNVSPG+D LIKQVIMVDPLEAKR+AAKEMEKIKAKEKFKR+RQ
Subjt:  ASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQ

Query:  IEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFI
        IEAINGAWAMIGLTAGLVIEG+TGKGILAQL  YFS +INFFI
Subjt:  IEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFI

TrEMBL top hitse value%identityAlignment
A0A0A0LH95 Uncharacterized protein4.5e-86100Show/hide
Query:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSRDAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEA
        MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSRDAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEA
Subjt:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSRDAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEA

Query:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR
        KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR
Subjt:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR

A0A1S3BSE0 uncharacterized protein LOC103492970 isoform X15.0e-7790.48Show/hide
Query:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPL
        MWAS+IYKWISIHA + CTL+MAAAS +LILPINGGNLPPSQYLSFRH+ PSATFSRLGWSR  D GRST RTRGQAFRISNVSP +DGLIKQVIMVDPL
Subjt:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQLADYFSILINFFIR
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR

A0A1S4E3D1 uncharacterized protein LOC103492970 isoform X25.0e-6985.12Show/hide
Query:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPL
        MWAS+IYKWISIHA + CTL+MAAAS +LILPINGGNLPPSQYLSFRH+ PSATFSRLGWSR  D GRST RTRGQAFRISN           VIMVDPL
Subjt:  MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQLADYFSILINFFIR
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR

A0A5A7UQ54 Uncharacterized protein1.8e-5889.78Show/hide
Query:  MAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKR
        MAAAS +LILPINGGNLPPSQYLSFRH+ PSATFSRLGWSR  D GRST RTRGQAFRISNVSP +DGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKR
Subjt:  MAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKR

Query:  RRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADY
        RRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQ+  Y
Subjt:  RRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADY

A0A6J1I0N1 uncharacterized protein LOC111468645 isoform X18.6e-5378.91Show/hide
Query:  ASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWS--RDAGRSTRRTRGQAFRI---SNVSPGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKR
        ASTALILPI+GGN   SQ LSFRHT  SATFSR GWS  RD G ST RTRGQAFRI    NVSPG+D LIK+VIMVDPLEAKR+AAKEMEKIKAKEKFKR
Subjt:  ASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWS--RDAGRSTRRTRGQAFRI---SNVSPGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKR

Query:  RRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR
        RRQIEAINGAWAMIGLTAGL++EG+TGKGILAQLA Y + ++NFF+R
Subjt:  RRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G28025.1 unknown protein1.7e-2453.17Show/hide
Query:  RHTLPSATFSR-LGWSRDAGRSTRRTRGQAFRI---SNVS----PGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAG
        R  +PS++  + L   +DA     R R    R+    NVS    PG+  + K+VIMVDPLEAKR+A+K+ME+IK +EK +RRR+IEAINGAWA+IGL  G
Subjt:  RHTLPSATFSR-LGWSRDAGRSTRRTRGQAFRI---SNVS----PGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAG

Query:  LVIEGRTGKGILAQLADYFSILINFF
        LVIE +TGKGILAQLA Y+S +++ F
Subjt:  LVIEGRTGKGILAQLADYFSILINFF

AT4G28025.2 unknown protein.1.7e-2451.85Show/hide
Query:  PPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRI---SNVS----PGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGA
        PP  +L       S++  R G  R  DA     R R    R+    NVS    PG+  + K+VIMVDPLEAKR+A+K+ME+IK +EK +RRR+IEAINGA
Subjt:  PPSQYLSFRHTLPSATFSRLGWSR--DAGRSTRRTRGQAFRI---SNVS----PGRDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGA

Query:  WAMIGLTAGLVIEGRTGKGILAQLADYFSILINFF
        WA+IGL  GLVIE +TGKGILAQLA Y+S +++ F
Subjt:  WAMIGLTAGLVIEGRTGKGILAQLADYFSILINFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGCATCTCAAATCTATAAATGGATAAGCATTCATGCAGGAAGGCGTTGCACTTTGGACATGGCTGCTGCTTCCACTGCGCTGATTCTCCCCATCAATGGAGGAAA
CCTTCCGCCTTCCCAATACCTCTCTTTTCGCCATACTCTCCCTTCTGCTACTTTCTCCAGGTTGGGTTGGAGTAGGGATGCTGGCAGAAGTACTCGTAGAACGAGGGGTC
AAGCATTTCGAATCTCTAATGTCTCGCCTGGTAGGGATGGATTAATCAAGCAGGTGATTATGGTTGACCCTTTGGAAGCCAAACGTATGGCAGCGAAAGAAATGGAAAAG
ATCAAAGCAAAAGAGAAGTTCAAGAGAAGACGTCAAATAGAAGCAATTAATGGAGCATGGGCAATGATCGGTCTGACGGCTGGGCTTGTCATTGAAGGTCGAACCGGAAA
AGGAATTCTAGCCCAGTTGGCTGACTACTTCAGCATCCTTATCAACTTCTTCATACGGTAA
mRNA sequenceShow/hide mRNA sequence
CGAAGAGCAGAAAAGTAGTTTTGTTTGGAAAGGGCGTAAGGATAAAAAATGCATCAAATCTAAATGAGTAAAGCGACTGAAAATTTAGGGGAAAGCCCAATTTCGAGTCA
ACCAATTCAATTTCATTTCATTCCAATATGGTGGAGGCAAGCAATATTAGATATATCCGTTTGATTGGATTAGAACAAGCTCTGTACAGCCATCAGTGAGCGTCATTGCA
CAAATGTCATTCTCTATGAGATAATGTGTTGATGTGGGCATCTCAAATCTATAAATGGATAAGCATTCATGCAGGAAGGCGTTGCACTTTGGACATGGCTGCTGCTTCCA
CTGCGCTGATTCTCCCCATCAATGGAGGAAACCTTCCGCCTTCCCAATACCTCTCTTTTCGCCATACTCTCCCTTCTGCTACTTTCTCCAGGTTGGGTTGGAGTAGGGAT
GCTGGCAGAAGTACTCGTAGAACGAGGGGTCAAGCATTTCGAATCTCTAATGTCTCGCCTGGTAGGGATGGATTAATCAAGCAGGTGATTATGGTTGACCCTTTGGAAGC
CAAACGTATGGCAGCGAAAGAAATGGAAAAGATCAAAGCAAAAGAGAAGTTCAAGAGAAGACGTCAAATAGAAGCAATTAATGGAGCATGGGCAATGATCGGTCTGACGG
CTGGGCTTGTCATTGAAGGTCGAACCGGAAAAGGAATTCTAGCCCAGTTGGCTGACTACTTCAGCATCCTTATCAACTTCTTCATACGGTAAGAGGCATCTTCGAGGGGA
AAAGGAGATCTTTGAATGAAAGAGGCGGTGAGATGAAGCAGCAGGTTCATGGTACTTTGTTTTGAAGATTCTATTTGCAATACAATTATATGTACTACCATTTGTTTTTT
TAAAAATCAGTACAAGATTTCATTTCCAAGGAGAGGAAAAACCACCATAGTTCTGTACAATTATTTGGTGAAAAGAAGCTTGTCTCTTTTTTTAAAAGAAAAAGAAATAG
ACTGTCGTTGTCTATAAGCTGATTTATCTTCCAATATTTGCAAACAACAGCGTTGTAATGTAAGCATAGATTAGAATTCAGATGAAAATTCAGATGAATTAGCTTTGATG
ATTCTTCTATTGAACTCCTGTT
Protein sequenceShow/hide protein sequence
MWASQIYKWISIHAGRRCTLDMAAASTALILPINGGNLPPSQYLSFRHTLPSATFSRLGWSRDAGRSTRRTRGQAFRISNVSPGRDGLIKQVIMVDPLEAKRMAAKEMEK
IKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGRTGKGILAQLADYFSILINFFIR