; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg27984 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg27984
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionChlorophyll A-B binding protein
Genome locationCarg_Chr12:8081139..8085644
RNA-Seq ExpressionCarg27984
SyntenyCarg27984
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR022796 - Chlorophyll A-B binding protein
IPR023329 - Chlorophyll a/b binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022937783.1 uncharacterized protein LOC111444076 isoform X1 [Cucurbita moschata]1.1e-71100Show/hide
Query:  MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIE
        MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIE
Subjt:  MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIE

Query:  AINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
        AINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
Subjt:  AINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR

XP_022937785.1 uncharacterized protein LOC111444076 isoform X2 [Cucurbita moschata]1.1e-55100Show/hide
Query:  SRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGIL
        SRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGIL
Subjt:  SRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGIL

Query:  AQLAGYLAAVVNFFVR
        AQLAGYLAAVVNFFVR
Subjt:  AQLAGYLAAVVNFFVR

XP_022969695.1 uncharacterized protein LOC111468645 isoform X1 [Cucurbita maxima]3.7e-6793.24Show/hide
Query:  MASTALILPIHGGN------LSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFK
        MASTALILPIHGGN      LSFRHTH SATFSRWGW+RD+DVG STHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFK
Subjt:  MASTALILPIHGGN------LSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFK

Query:  RRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
        RRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
Subjt:  RRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR

XP_023538319.1 uncharacterized protein LOC111799137 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-7098.59Show/hide
Query:  MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIE
        MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKD+LIK+VIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIE
Subjt:  MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIE

Query:  AINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
        AINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
Subjt:  AINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR

XP_038889814.1 uncharacterized protein LOC120079625 [Benincasa hispida]3.5e-5781.63Show/hide
Query:  MASTALILPIHGGN------LSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFK
        MAST+LILPI GGN      LSFRHTHPSATFSR GW+RDQDVG+STHRTRGQAF+I    NVSPGKDDLIK+VIMVDPLEAKR+AAKEMEKIKAKEKFK
Subjt:  MASTALILPIHGGN------LSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFK

Query:  RRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFV
        R+RQIEAINGAWAMIGLTAGL++EGQTGKGILAQL GY + V+NFF+
Subjt:  RRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFV

TrEMBL top hitse value%identityAlignment
A0A1S3BSE0 uncharacterized protein LOC103492970 isoform X12.1e-5579.59Show/hide
Query:  ASTALILPIHGGN------LSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKR
        AS +LILPI+GGN      LSFRH+HPSATFSR GW+RDQDVG+STHRTRGQAFRI    NVSP KD LIK+VIMVDPLEAKR+AAKEMEKIKAKEKFKR
Subjt:  ASTALILPIHGGN------LSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKR

Query:  RRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
        RRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA Y + ++NFF+R
Subjt:  RRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR

A0A6J1FC77 uncharacterized protein LOC111444076 isoform X15.4e-72100Show/hide
Query:  MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIE
        MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIE
Subjt:  MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIE

Query:  AINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
        AINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
Subjt:  AINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR

A0A6J1FHS7 uncharacterized protein LOC111444076 isoform X25.4e-56100Show/hide
Query:  SRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGIL
        SRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGIL
Subjt:  SRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGIL

Query:  AQLAGYLAAVVNFFVR
        AQLAGYLAAVVNFFVR
Subjt:  AQLAGYLAAVVNFFVR

A0A6J1HYI7 uncharacterized protein LOC111468645 isoform X22.3e-5497.41Show/hide
Query:  SRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGIL
        SRWGW+RD+DVG STHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGIL
Subjt:  SRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGIL

Query:  AQLAGYLAAVVNFFVR
        AQLAGYLAAVVNFFVR
Subjt:  AQLAGYLAAVVNFFVR

A0A6J1I0N1 uncharacterized protein LOC111468645 isoform X11.8e-6793.24Show/hide
Query:  MASTALILPIHGGN------LSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFK
        MASTALILPIHGGN      LSFRHTH SATFSRWGW+RD+DVG STHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFK
Subjt:  MASTALILPIHGGN------LSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFK

Query:  RRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
        RRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR
Subjt:  RRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G28025.1 unknown protein2.1e-3159.84Show/hide
Query:  RHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVS----PGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTA
        R   PS++  + G  R QD     +R R    R+LANPNVS    PGK  + K+VIMVDPLEAKRLA+K+ME+IK +EK +RRR+IEAINGAWA+IGL  
Subjt:  RHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVS----PGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTA

Query:  GLIVEGQTGKGILAQLAGYLAAVVNFF
        GL++E QTGKGILAQLAGY +AVV+ F
Subjt:  GLIVEGQTGKGILAQLAGYLAAVVNFF

AT4G28025.2 unknown protein.1.2e-3161.48Show/hide
Query:  SATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVS----PGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVE
        S++  R G  R QD     +R R    R+LANPNVS    PGK  + K+VIMVDPLEAKRLA+K+ME+IK +EK +RRR+IEAINGAWA+IGL  GL++E
Subjt:  SATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVS----PGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVE

Query:  GQTGKGILAQLAGYLAAVVNFF
         QTGKGILAQLAGY +AVV+ F
Subjt:  GQTGKGILAQLAGYLAAVVNFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTGCGCTGATTCTCCCCATCCATGGAGGAAACCTCTCTTTCCGCCATACCCACCCTTCTGCAACATTTTCCAGGTGGGGTTGGAATAGGGATCAAGACGT
CGGAAAGAGTACACACAGAACGAGGGGTCAAGCATTCCGAATTTTGGCTAACCCTAATGTCTCTCCCGGGAAAGATGACTTAATTAAGAAGGTGATTATGGTTGATCCTT
TGGAAGCCAAACGTTTGGCTGCAAAAGAAATGGAAAAAATCAAAGCAAAAGAGAAGTTCAAGAGACGACGTCAAATAGAAGCGATAAATGGAGCGTGGGCAATGATTGGT
TTGACGGCAGGGCTCATCGTTGAAGGTCAAACTGGAAAAGGCATTCTAGCACAGTTGGCCGGCTACTTGGCCGCGGTTGTGAACTTCTTTGTACGGTAG
mRNA sequenceShow/hide mRNA sequence
AAAACCGTCCCTCTCTGCAAGATAAGTGTGGATGTGGCCAACGGAAATCTGTTACTAATTCCATATAACAATGCTTTACTTCACCAATAGATAAGGGTTCTTGTGAAGAA
GCGTGGTTTTTGAAGATGGCTTCCACTGCGCTGATTCTCCCCATCCATGGAGGAAACCTCTCTTTCCGCCATACCCACCCTTCTGCAACATTTTCCAGGTGGGGTTGGAA
TAGGGATCAAGACGTCGGAAAGAGTACACACAGAACGAGGGGTCAAGCATTCCGAATTTTGGCTAACCCTAATGTCTCTCCCGGGAAAGATGACTTAATTAAGAAGGTGA
TTATGGTTGATCCTTTGGAAGCCAAACGTTTGGCTGCAAAAGAAATGGAAAAAATCAAAGCAAAAGAGAAGTTCAAGAGACGACGTCAAATAGAAGCGATAAATGGAGCG
TGGGCAATGATTGGTTTGACGGCAGGGCTCATCGTTGAAGGTCAAACTGGAAAAGGCATTCTAGCACAGTTGGCCGGCTACTTGGCCGCGGTTGTGAACTTCTTTGTACG
GTAGACATCTTCAAGGGCAAAAGGACTTCTCCTTTGAATGGAAGAGTTATTCTCGTCTCGACCATCGAAAGTTCGGTATGTTAAATATTTTCGAGCTAAGTCCTATTGTT
TGTTTATCGTTTTATACCTAAAGAAAAGGATGTTTCCATCCTAAGAAACAGAAGTCTGATTCTTCCTTCTTTTCAAGACTGATTTCTTCACATTTCATGTACAAATTTGA
CATTTAATTGTGAAAGTGTATTGAGATGGTTTGTTGC
Protein sequenceShow/hide protein sequence
MASTALILPIHGGNLSFRHTHPSATFSRWGWNRDQDVGKSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIG
LTAGLIVEGQTGKGILAQLAGYLAAVVNFFVR