; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10006222 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10006222
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionChlorophyll A-B binding protein
Genome locationChr07:15998815..16004793
RNA-Seq ExpressionHG10006222
SyntenyHG10006222
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055599.1 uncharacterized protein E6C27_scaffold222G00820 [Cucumis melo var. makuwa]8.8e-5790Show/hide
Query:  ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQ
        AS +LILPI G NLPPSQYLSFRH+HPSATFSR GWSRDQ+ G+ THRTRGQAF+ISN SP KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQ
Subjt:  ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQ

Query:  IEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        IEAINGAWAMIGLTAGLVIEGQTGKGILAQ
Subjt:  IEAINGAWAMIGLTAGLVIEGQTGKGILAQ

XP_008451793.1 PREDICTED: uncharacterized protein LOC103492970 isoform X1 [Cucumis melo]2.3e-6586.36Show/hide
Query:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL
        MWA EIYKWIS H R   TLEM  AS +LILPI G NLPPSQYLSFRH+HPSATFSR GWSRDQ+ G+ THRTRGQAF+ISN SP KDGLIKQVIMVDPL
Subjt:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ

XP_011648927.1 uncharacterized protein LOC101212671 [Cucumis sativus]5.3e-6285.06Show/hide
Query:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL
        MWA +IYKWIS H   R TL+M  ASTALILPI G NLPPSQYLSFRHT PSATFSR GWSRD  AG+ T RTRGQAF+ISN SPG+DGLIKQVIMVDPL
Subjt:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQ
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ

XP_016902490.1 PREDICTED: uncharacterized protein LOC103492970 isoform X2 [Cucumis melo]1.8e-5780.52Show/hide
Query:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL
        MWA EIYKWIS H R   TLEM  AS +LILPI G NLPPSQYLSFRH+HPSATFSR GWSRDQ+ G+ THRTRGQAF+ISN           VIMVDPL
Subjt:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ

XP_038889814.1 uncharacterized protein LOC120079625 [Benincasa hispida]2.1e-5891.6Show/hide
Query:  MASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRR
        MAST+LILPIKG NLPPSQYLSFRHTHPSATFSR GWSRDQ+ G+ THRTRGQAFQISN SPGKD LIKQVIMVDPLEAKR+AAKEMEKIKAKEKFKR+R
Subjt:  MASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRR

Query:  QIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        QIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
Subjt:  QIEAINGAWAMIGLTAGLVIEGQTGKGILAQ

TrEMBL top hitse value%identityAlignment
A0A0A0LH95 Uncharacterized protein2.6e-6285.06Show/hide
Query:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL
        MWA +IYKWIS H   R TL+M  ASTALILPI G NLPPSQYLSFRHT PSATFSR GWSRD  AG+ T RTRGQAF+ISN SPG+DGLIKQVIMVDPL
Subjt:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQ
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ

A0A1S3BSE0 uncharacterized protein LOC103492970 isoform X11.1e-6586.36Show/hide
Query:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL
        MWA EIYKWIS H R   TLEM  AS +LILPI G NLPPSQYLSFRH+HPSATFSR GWSRDQ+ G+ THRTRGQAF+ISN SP KDGLIKQVIMVDPL
Subjt:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ

A0A1S4E3D1 uncharacterized protein LOC103492970 isoform X28.6e-5880.52Show/hide
Query:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL
        MWA EIYKWIS H R   TLEM  AS +LILPI G NLPPSQYLSFRH+HPSATFSR GWSRDQ+ G+ THRTRGQAF+ISN           VIMVDPL
Subjt:  MWAPEIYKWISFHVRTRFTLEM--ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPL

Query:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
Subjt:  EAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ

A0A5A7UQ54 Uncharacterized protein4.3e-5790Show/hide
Query:  ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQ
        AS +LILPI G NLPPSQYLSFRH+HPSATFSR GWSRDQ+ G+ THRTRGQAF+ISN SP KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQ
Subjt:  ASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQ

Query:  IEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        IEAINGAWAMIGLTAGLVIEGQTGKGILAQ
Subjt:  IEAINGAWAMIGLTAGLVIEGQTGKGILAQ

A0A6J1I0N1 uncharacterized protein LOC111468645 isoform X17.0e-5282.84Show/hide
Query:  MASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQI---SNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFK
        MASTALILPI G N   SQ LSFRHTH SATFSRWGWSRD++ G  THRTRGQAF+I    N SPGKD LIK+VIMVDPLEAKR+AAKEMEKIKAKEKFK
Subjt:  MASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQI---SNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFK

Query:  RRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ
        RRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQ
Subjt:  RRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G28025.1 unknown protein1.3e-2154.78Show/hide
Query:  RHTHPSATFSRWGWSRDQNAGKCTHRTR-GQAFQISNAS------PGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTA
        R   PS++  + G  R Q+A    +R R G    ++N +      PGK  + K+VIMVDPLEAKR+A+K+ME+IK +EK +RRR+IEAINGAWA+IGL  
Subjt:  RHTHPSATFSRWGWSRDQNAGKCTHRTR-GQAFQISNAS------PGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTA

Query:  GLVIEGQTGKGILAQ
        GLVIE QTGKGILAQ
Subjt:  GLVIEGQTGKGILAQ

AT4G28025.2 unknown protein.1.2e-2252.85Show/hide
Query:  PPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTR-GQAFQISNAS------PGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGA
        PP  +L       S++  R G  R Q+A    +R R G    ++N +      PGK  + K+VIMVDPLEAKR+A+K+ME+IK +EK +RRR+IEAINGA
Subjt:  PPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTR-GQAFQISNAS------PGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGA

Query:  WAMIGLTAGLVIEGQTGKGILAQ
        WA+IGL  GLVIE QTGKGILAQ
Subjt:  WAMIGLTAGLVIEGQTGKGILAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGCACCTGAAATCTATAAATGGATAAGTTTTCATGTAAGAACCCGCTTCACTTTGGAGATGGCTTCCACTGCGCTGATTCTCCCCATCAAAGGACGGAACCTTCC
GCCTTCTCAGTACCTGTCTTTCCGCCATACCCATCCTTCTGCTACTTTCTCCAGGTGGGGTTGGAGTAGGGACCAAAATGCTGGCAAATGTACTCACAGAACGAGGGGTC
AAGCATTTCAAATCTCTAATGCCTCTCCTGGTAAAGATGGCTTAATCAAGCAGGTGATTATGGTTGACCCTTTGGAAGCCAAACGTATGGCAGCAAAAGAAATGGAAAAG
ATCAAAGCAAAAGAGAAGTTCAAGAGAAGACGTCAAATAGAAGCGATTAATGGAGCATGGGCAATGATTGGTCTGACAGCAGGGCTTGTTATCGAAGGTCAAACCGGAAA
AGGCATTCTAGCACAGCATTCATGCAAGAGAGACGAATTAAGTGGAAAAGCTTCTAGTAGATCAAAATTAAGAGGTTGGAAAATAAAAAATGTAGGAGCTCATGAGAGCA
GTATGATCGTGGCAAGAGATCACGTTGAAGATCGCGATCTTGTAGCACTCGTGCAAGAGAGAAACAAAGTATCTACAACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGGCACCTGAAATCTATAAATGGATAAGTTTTCATGTAAGAACCCGCTTCACTTTGGAGATGGCTTCCACTGCGCTGATTCTCCCCATCAAAGGACGGAACCTTCC
GCCTTCTCAGTACCTGTCTTTCCGCCATACCCATCCTTCTGCTACTTTCTCCAGGTGGGGTTGGAGTAGGGACCAAAATGCTGGCAAATGTACTCACAGAACGAGGGGTC
AAGCATTTCAAATCTCTAATGCCTCTCCTGGTAAAGATGGCTTAATCAAGCAGGTGATTATGGTTGACCCTTTGGAAGCCAAACGTATGGCAGCAAAAGAAATGGAAAAG
ATCAAAGCAAAAGAGAAGTTCAAGAGAAGACGTCAAATAGAAGCGATTAATGGAGCATGGGCAATGATTGGTCTGACAGCAGGGCTTGTTATCGAAGGTCAAACCGGAAA
AGGCATTCTAGCACAGCATTCATGCAAGAGAGACGAATTAAGTGGAAAAGCTTCTAGTAGATCAAAATTAAGAGGTTGGAAAATAAAAAATGTAGGAGCTCATGAGAGCA
GTATGATCGTGGCAAGAGATCACGTTGAAGATCGCGATCTTGTAGCACTCGTGCAAGAGAGAAACAAAGTATCTACAACGTGA
Protein sequenceShow/hide protein sequence
MWAPEIYKWISFHVRTRFTLEMASTALILPIKGRNLPPSQYLSFRHTHPSATFSRWGWSRDQNAGKCTHRTRGQAFQISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEK
IKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQHSCKRDELSGKASSRSKLRGWKIKNVGAHESSMIVARDHVEDRDLVALVQERNKVSTT