CuGenDBv2

Sgr029813 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029813
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Chlorophyll A-B binding protein
Genome location: tig00153533:1412315..1416681
RNA-Seq Expression: Sgr029813
Synteny: Sgr029813
Gene Ontology terms:
GO:0009507 - chloroplast (cellular component)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR022796 - Chlorophyll A-B binding protein
IPR023329 - Chlorophyll a/b binding domain superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008451793.1 PREDICTED: uncharacterized protein LOC103492970 isoform X1 [Cucumis melo] (e value: 2.6e-50, % identity: 74.68)
Query:  ATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIK
        A+ +L+LPI GGNL    SQYLS RHSHPSATFS     ++G  +     ++ HRTRGQAFRI    NVSP KDGLIKQVIMVDPLEAKR+AAK+MEKIK
Subjt:  ATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIK

Query:  AKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
        AKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFS ++NFF+R
Subjt:  AKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR

XP_022937783.1 uncharacterized protein LOC111444076 isoform X1 [Cucurbita moschata] (e value: 4.1e-51, % identity: 74.84)
Query:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI
        MA+TAL+LPI GGN        LS RH+HPSATFS     + G  +     ++ HRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEAKRLAAK+MEKI
Subjt:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI

Query:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
        KAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA Y +AVVNFFVR
Subjt:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR

XP_022969695.1 uncharacterized protein LOC111468645 isoform X1 [Cucurbita maxima] (e value: 1.4e-51, % identity: 76.13)
Query:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI
        MA+TAL+LPI GGN  SSQ   LS RH+H SATFS     + G  +      + HRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEAKRLAAK+MEKI
Subjt:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI

Query:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
        KAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA Y +AVVNFFVR
Subjt:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR

XP_023538319.1 uncharacterized protein LOC111799137 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.1e-51, % identity: 75.48)
Query:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI
        MA+TAL+LPI GGN        LS RH+HPSATFS     + G  +     ++ HRTRGQAFRILANPNVSPGKD LIKQVIMVDPLEAKRLAAK+MEKI
Subjt:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI

Query:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
        KAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA Y +AVVNFFVR
Subjt:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR

XP_038889814.1 uncharacterized protein LOC120079625 [Benincasa hispida] (e value: 1.4e-48, % identity: 73.38)
Query:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI
        MA+T+L+LPI GGNL    SQYLS RH+HPSATFS S     G  +     ++ HRTRGQAF+I    NVSPGKD LIKQVIMVDPLEAKR+AAK+MEKI
Subjt:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI

Query:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFV
        KAKEKFKR+RQIEAINGAWAMIGLTAGLVIEGQTGKGILAQL  YFS V+NFF+
Subjt:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFV

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LH95 Uncharacterized protein (e value: 1.0e-47, % identity: 73.38)
Query:  ATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIK
        A+TAL+LPI GGNL    SQYLS RH+ PSATFS        L       ++  RTRGQAFRI    NVSPG+DGLIKQVIMVDPLEAKR+AAK+MEKIK
Subjt:  ATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIK

Query:  AKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
        AKEKFKRRRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQLADYFS ++NFF+R
Subjt:  AKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR

A0A1S3BSE0 uncharacterized protein LOC103492970 isoform X1 (e value: 1.3e-50, % identity: 74.68)
Query:  ATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIK
        A+ +L+LPI GGNL    SQYLS RHSHPSATFS     ++G  +     ++ HRTRGQAFRI    NVSP KDGLIKQVIMVDPLEAKR+AAK+MEKIK
Subjt:  ATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIK

Query:  AKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
        AKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFS ++NFF+R
Subjt:  AKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR

A0A6J1DIT0 uncharacterized protein LOC111020859 (e value: 5.0e-47, % identity: 77.21)
Query:  SQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIKAKEKFKRRRQIEAINGAW
        SQYLS RH+HP ATFS     + G  +     +  HRTRGQAFRILANPNVSPGKDG +K+VIMVDPLEAKR+AAKQMEKIKAKEK KRRRQIEAINGAW
Subjt:  SQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIKAKEKFKRRRQIEAINGAW

Query:  AMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
        AMIGLTAGLVIEGQTGKGILAQL DYF+ VV+ FVR
Subjt:  AMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR

A0A6J1FC77 uncharacterized protein LOC111444076 isoform X1 (e value: 2.0e-51, % identity: 74.84)
Query:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI
        MA+TAL+LPI GGN        LS RH+HPSATFS     + G  +     ++ HRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEAKRLAAK+MEKI
Subjt:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI

Query:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
        KAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA Y +AVVNFFVR
Subjt:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR

A0A6J1I0N1 uncharacterized protein LOC111468645 isoform X1 (e value: 6.8e-52, % identity: 76.13)
Query:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI
        MA+TAL+LPI GGN  SSQ   LS RH+H SATFS     + G  +      + HRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEAKRLAAK+MEKI
Subjt:  MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKI

Query:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
        KAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA Y +AVVNFFVR
Subjt:  KAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G28025.1 unknown protein (e value: 8.5e-31, % identity: 69.9)
Query:  HRTRGQAFRILANPNVS----PGKDGLIKQVIMVDPLEAKRLAAKQMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVV
        +R R    R+LANPNVS    PGK  + K+VIMVDPLEAKRLA+KQME+IK +EK +RRR+IEAINGAWA+IGL  GLVIE QTGKGILAQLA Y+SAVV
Subjt:  HRTRGQAFRILANPNVS----PGKDGLIKQVIMVDPLEAKRLAAKQMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVV

Query:  NFF
        + F
Subjt:  NFF

AT4G28025.2 unknown protein (e value: 5.0e-31, % identity: 62.7)
Query:  ATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVS----PGKDGLIKQVIMVDPLEAKRLAAKQMEKIKAKEKFKRRRQIEAINGAWAMIGLTAG
        A FSSS   + GL +    +   +R R    R+LANPNVS    PGK  + K+VIMVDPLEAKRLA+KQME+IK +EK +RRR+IEAINGAWA+IGL  G
Subjt:  ATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVS----PGKDGLIKQVIMVDPLEAKRLAAKQMEKIKAKEKFKRRRQIEAINGAWAMIGLTAG

Query:  LVIEGQTGKGILAQLADYFSAVVNFF
        LVIE QTGKGILAQLA Y+SAVV+ F
Subjt:  LVIEGQTGKGILAQLADYFSAVVNFF


Sequences
CDS sequence
ATGGCTACCACTGCGCTGTTACTCCCAATCACAGGAGGAAACCTTGTGTCTTCCCAATCCCAATACCTCTCTTGCCGCCATAGCCATCCTTCTGCAACTTTCTCCAGTTC
CTTTCCCTTCCAGGTGGGGTTGGAGCAGGGATCAAGATGCAGGCAAAATTTGCATAGAACGAGGGGTCAAGCATTTCGAATCTTGGCCAACCCTAATGTCTCTCCTGGGA
AAGATGGCTTAATTAAGCAGGTGATTATGGTTGATCCTTTGGAAGCGAAACGATTGGCTGCGAAACAGATGGAAAAGATCAAAGCAAAAGAGAAGTTCAAGAGAAGACGT
CAAATAGAAGCAATTAATGGAGCATGGGCAATGATTGGTCTCACAGCGGGGCTCGTAATCGAAGGTCAAACTGGAAAAGGCATACTAGCACAGTTGGCCGACTACTTCAG
TGCCGTTGTCAACTTCTTTGTACGATAA
mRNA sequence
ATGGCTACCACTGCGCTGTTACTCCCAATCACAGGAGGAAACCTTGTGTCTTCCCAATCCCAATACCTCTCTTGCCGCCATAGCCATCCTTCTGCAACTTTCTCCAGTTC
CTTTCCCTTCCAGGTGGGGTTGGAGCAGGGATCAAGATGCAGGCAAAATTTGCATAGAACGAGGGGTCAAGCATTTCGAATCTTGGCCAACCCTAATGTCTCTCCTGGGA
AAGATGGCTTAATTAAGCAGGTGATTATGGTTGATCCTTTGGAAGCGAAACGATTGGCTGCGAAACAGATGGAAAAGATCAAAGCAAAAGAGAAGTTCAAGAGAAGACGT
CAAATAGAAGCAATTAATGGAGCATGGGCAATGATTGGTCTCACAGCGGGGCTCGTAATCGAAGGTCAAACTGGAAAAGGCATACTAGCACAGTTGGCCGACTACTTCAG
TGCCGTTGTCAACTTCTTTGTACGATAA
Protein sequence
MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIKAKEKFKRRR
QIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR
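
As a consistency check on the sequences listed in this record, the following minimal Python sketch translates the CDS and compares the result with the listed protein sequence. It assumes the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The names `translate`, `CODON_TABLE`, `CDS`, and `PROTEIN` are illustrative only and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumption: this table applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein sequences copied from this record.
CDS = (
    "ATGGCTACCACTGCGCTGTTACTCCCAATCACAGGAGGAAACCTTGTGTCTTCCCAATCCCAATACCTCTCTTGCCGCCATAGCCATCCTTCTGCAACTTTCTCCAGTTC"
    "CTTTCCCTTCCAGGTGGGGTTGGAGCAGGGATCAAGATGCAGGCAAAATTTGCATAGAACGAGGGGTCAAGCATTTCGAATCTTGGCCAACCCTAATGTCTCTCCTGGGA"
    "AAGATGGCTTAATTAAGCAGGTGATTATGGTTGATCCTTTGGAAGCGAAACGATTGGCTGCGAAACAGATGGAAAAGATCAAAGCAAAAGAGAAGTTCAAGAGAAGACGT"
    "CAAATAGAAGCAATTAATGGAGCATGGGCAATGATTGGTCTCACAGCGGGGCTCGTAATCGAAGGTCAAACTGGAAAAGGCATACTAGCACAGTTGGCCGACTACTTCAG"
    "TGCCGTTGTCAACTTCTTTGTACGATAA"
)
PROTEIN = (
    "MATTALLLPITGGNLVSSQSQYLSCRHSHPSATFSSSFPFQVGLEQGSRCRQNLHRTRGQAFRILANPNVSPGKDGLIKQVIMVDPLEAKRLAAKQMEKIKAKEKFKRRR"
    "QIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSAVVNFFVR"
)

translated = translate(CDS)
# The CDS ends with a stop codon, so remove the trailing '*' before comparing.
cds_matches_protein = translated.rstrip("*") == PROTEIN
print("CDS length (nt):", len(CDS))
print("Protein length (aa):", len(PROTEIN))
print("Translation matches listed protein:", cds_matches_protein)
```

Because the mRNA sequence listed for this gene is identical to the CDS, the same check applies to it unchanged.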