CuGenDBv2

CmUC11G215860 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC11G215860
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Chlorophyll A-B binding protein
Genome location: CmU531Chr11:23789823..23803919
RNA-Seq Expression: CmUC11G215860
Synteny: CmUC11G215860
Gene Ontology terms:
    GO:0009507 - chloroplast (cellular component)
    GO:0009579 - thylakoid (cellular component)
    GO:0016021 - integral component of membrane (cellular component)
    GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
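
The genome location in this record is written as `seqid:start..end`. A minimal Python sketch for splitting such a string and computing the span length follows. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("CmU531Chr11:23789823..23803919")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)  # CmU531Chr11 23789823 23803919 14097
```

Under that assumption the gene spans 14097 bp on the chromosome, far longer than the mRNA listed further down, which would be consistent with introns in the gene model.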


Homology
GenBank top hits | e value | % identity | Alignment
KAA0055599.1 uncharacterized protein E6C27_scaffold222G00820 [Cucumis melo var. makuwa] | 1.9e-47 | 60.82
Query:  ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSRDQDAGKCTHRMRGQAFRISNASPG
        AS +LILPI    LP +QYLSFRHSHPSATFSR+                                      GWSRDQD G+ THR RGQAFRISN SP 
Subjt:  ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSRDQDAGKCTHRMRGQAFRISNASPG

Query:  KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADY--FSFIINFF
        KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFK            RRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ+  Y   +F++  F
Subjt:  KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADY--FSFIINFF

XP_008451793.1 PREDICTED: uncharacterized protein LOC103492970 isoform X1 [Cucumis melo] | 2.1e-62 | 65.14
Query:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR
        MWA+EIYKWISI     CTLEM  AS +LILPI    LP +QYLSFRHSHPSATFSR+                                      GWSR
Subjt:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR

Query:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG
        DQD G+ THR RGQAFRISN SP KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFK            RRRQIEAINGAWAMIGLTAGLVIEGQTGKG
Subjt:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG

Query:  ILAQLADYFSFIINFFVR
        ILAQLADYFS +INFF+R
Subjt:  ILAQLADYFSFIINFFVR

XP_011648927.1 uncharacterized protein LOC101212671 [Cucumis sativus] | 1.2e-57 | 62.84
Query:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR
        MWA++IYKWISI     CTL+M  ASTALILPI    LP +QYLSFRH+ PSATFSR+                                      GWSR
Subjt:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR

Query:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG
          DAG+ T R RGQAFRISN SPG+DGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFK            RRRQIEAINGAWAMIGLTAGLVIEG+TGKG
Subjt:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG

Query:  ILAQLADYFSFIINFFVR
        ILAQLADYFS +INFF+R
Subjt:  ILAQLADYFSFIINFFVR

XP_016902490.1 PREDICTED: uncharacterized protein LOC103492970 isoform X2 [Cucumis melo] | 1.6e-54 | 61.01
Query:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR
        MWA+EIYKWISI     CTLEM  AS +LILPI    LP +QYLSFRHSHPSATFSR+                                      GWSR
Subjt:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR

Query:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG
        DQD G+ THR RGQAFRISN           VIMVDPLEAKRMAAKEMEKIKAKEKFK            RRRQIEAINGAWAMIGLTAGLVIEGQTGKG
Subjt:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG

Query:  ILAQLADYFSFIINFFVR
        ILAQLADYFS +INFF+R
Subjt:  ILAQLADYFSFIINFFVR

XP_038889814.1 uncharacterized protein LOC120079625 [Benincasa hispida] | 4.8e-51 | 62.89
Query:  MASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSRDQDAGKCTHRMRGQAFRISNASP
        MAST+LILPIK   LP +QYLSFRH+HPSATFSR                                       GWSRDQD G+ THR RGQAF+ISN SP
Subjt:  MASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSRDQDAGKCTHRMRGQAFRISNASP

Query:  GKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSFIINFFV
        GKD LIKQVIMVDPLEAKR+AAKEMEKIKAKEKFK            R+RQIEAINGAWAMIGLTAGLVIEGQTGKGILAQL  YFS +INFF+
Subjt:  GKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSFIINFFV

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LH95 Uncharacterized protein | 5.7e-58 | 62.84
Query:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR
        MWA++IYKWISI     CTL+M  ASTALILPI    LP +QYLSFRH+ PSATFSR+                                      GWSR
Subjt:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR

Query:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG
          DAG+ T R RGQAFRISN SPG+DGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFK            RRRQIEAINGAWAMIGLTAGLVIEG+TGKG
Subjt:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG

Query:  ILAQLADYFSFIINFFVR
        ILAQLADYFS +INFF+R
Subjt:  ILAQLADYFSFIINFFVR

A0A1S3BSE0 uncharacterized protein LOC103492970 isoform X1 | 1.0e-62 | 65.14
Query:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR
        MWA+EIYKWISI     CTLEM  AS +LILPI    LP +QYLSFRHSHPSATFSR+                                      GWSR
Subjt:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR

Query:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG
        DQD G+ THR RGQAFRISN SP KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFK            RRRQIEAINGAWAMIGLTAGLVIEGQTGKG
Subjt:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG

Query:  ILAQLADYFSFIINFFVR
        ILAQLADYFS +INFF+R
Subjt:  ILAQLADYFSFIINFFVR

A0A1S4E3D1 uncharacterized protein LOC103492970 isoform X2 | 7.8e-55 | 61.01
Query:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR
        MWA+EIYKWISI     CTLEM  AS +LILPI    LP +QYLSFRHSHPSATFSR+                                      GWSR
Subjt:  MWATEIYKWISIRVTNHCTLEM--ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSR

Query:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG
        DQD G+ THR RGQAFRISN           VIMVDPLEAKRMAAKEMEKIKAKEKFK            RRRQIEAINGAWAMIGLTAGLVIEGQTGKG
Subjt:  DQDAGKCTHRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKG

Query:  ILAQLADYFSFIINFFVR
        ILAQLADYFS +INFF+R
Subjt:  ILAQLADYFSFIINFFVR

A0A5A7UQ54 Uncharacterized protein | 9.2e-48 | 60.82
Query:  ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSRDQDAGKCTHRMRGQAFRISNASPG
        AS +LILPI    LP +QYLSFRHSHPSATFSR+                                      GWSRDQD G+ THR RGQAFRISN SP 
Subjt:  ASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSRDQDAGKCTHRMRGQAFRISNASPG

Query:  KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADY--FSFIINFF
        KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFK            RRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQ+  Y   +F++  F
Subjt:  KDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADY--FSFIINFF

A0A6J1FC77 uncharacterized protein LOC111444076 isoform X1 | 1.7e-46 | 59.09
Query:  MASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSRDQDAGKCTHRMRGQAFRI---SN
        MASTALILPI          LSFRH+HPSATFSR                                      WGW+RDQD GK THR RGQAFRI    N
Subjt:  MASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSRDQDAGKCTHRMRGQAFRI---SN

Query:  ASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSFIINFFVR
         SPGKD LIK+VIMVDPLEAKR+AAKEMEKIKAKEKFK            RRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA Y + ++NFFVR
Subjt:  ASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSFIINFFVR

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT4G28025.1 unknown protein | 3.1e-24 | 52.67
Query:  STVWGWSRDQDAGKCTHRMRGQAFRI---SNAS----PGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMI
        S V G  R QDA    +R R    R+    N S    PGK  + K+VIMVDPLEAKR+A+K+ME+IK +EK              RRR+IEAINGAWA+I
Subjt:  STVWGWSRDQDAGKCTHRMRGQAFRI---SNAS----PGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMI

Query:  GLTAGLVIEGQTGKGILAQLADYFSFIINFF
        GL  GLVIE QTGKGILAQLA Y+S +++ F
Subjt:  GLTAGLVIEGQTGKGILAQLADYFSFIINFF

AT4G28025.2 unknown protein | 9.2e-24 | 52.76
Query:  GWSRDQDAGKCTHRMRGQAFRI---SNAS----PGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTA
        G  R QDA    +R R    R+    N S    PGK  + K+VIMVDPLEAKR+A+K+ME+IK +EK              RRR+IEAINGAWA+IGL  
Subjt:  GWSRDQDAGKCTHRMRGQAFRI---SNAS----PGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTA

Query:  GLVIEGQTGKGILAQLADYFSFIINFF
        GLVIE QTGKGILAQLA Y+S +++ F
Subjt:  GLVIEGQTGKGILAQLADYFSFIINFF


Sequences
CDS sequence
ATGGATAAACTGTTGATGTGGGCAACTGAAATCTATAAATGGATAAGCATTCGTGTAACAAACCACTGCACTTTGGAGATGGCTTCCACTGCGCTGATTCTCCCCATCAA
AGTACAAACCCTTCCGCTTGCTCAATACCTCTCTTTCCGCCATTCCCATCCTTCTGCAACTTTCTCCAGAGTCAACTACTCCCCAGCTGAATTATATATCCTCACTAATA
GAGTCAGGAAAGCACCATTGCAAGCCAAAAGTCTGGAAAAGATTAGCCCAATGGCTACTGGCGACAGCACAGTGTGGGGCTGGAGTAGGGACCAAGATGCTGGCAAATGT
ACTCACAGAATGAGGGGTCAAGCATTTCGAATCTCTAATGCCTCTCCTGGTAAAGATGGCTTAATCAAGCAGGTGATTATGGTTGACCCTTTGGAAGCCAAACGTATGGC
AGCGAAAGAAATGGAAAAGATAAAAGCAAAAGAGAAGTTCAAGGTAAAGGAAGTGATCATCATCAATGATGCAAACCATAGAAGACGTCAAATAGAAGCGATCAATGGAG
CATGGGCAATGATCGGTCTGACAGCAGGGCTTGTTATCGAAGGTCAAACCGGAAAAGGCATTCTAGCACAGTTGGCCGACTACTTCAGCTTCATTATCAACTTCTTCGTA
CGGCAGTGGGATGAAGCAGCAGATTCAGATGGTCTTGTTTTGGAAGATTCTATTGGCAAATAG
mRNA sequence
ATGGATAAACTGTTGATGTGGGCAACTGAAATCTATAAATGGATAAGCATTCGTGTAACAAACCACTGCACTTTGGAGATGGCTTCCACTGCGCTGATTCTCCCCATCAA
AGTACAAACCCTTCCGCTTGCTCAATACCTCTCTTTCCGCCATTCCCATCCTTCTGCAACTTTCTCCAGAGTCAACTACTCCCCAGCTGAATTATATATCCTCACTAATA
GAGTCAGGAAAGCACCATTGCAAGCCAAAAGTCTGGAAAAGATTAGCCCAATGGCTACTGGCGACAGCACAGTGTGGGGCTGGAGTAGGGACCAAGATGCTGGCAAATGT
ACTCACAGAATGAGGGGTCAAGCATTTCGAATCTCTAATGCCTCTCCTGGTAAAGATGGCTTAATCAAGCAGGTGATTATGGTTGACCCTTTGGAAGCCAAACGTATGGC
AGCGAAAGAAATGGAAAAGATAAAAGCAAAAGAGAAGTTCAAGGTAAAGGAAGTGATCATCATCAATGATGCAAACCATAGAAGACGTCAAATAGAAGCGATCAATGGAG
CATGGGCAATGATCGGTCTGACAGCAGGGCTTGTTATCGAAGGTCAAACCGGAAAAGGCATTCTAGCACAGTTGGCCGACTACTTCAGCTTCATTATCAACTTCTTCGTA
CGGCAGTGGGATGAAGCAGCAGATTCAGATGGTCTTGTTTTGGAAGATTCTATTGGCAAATAGCAATGCAATCAAATGTATTACCATCTACCTTTTCTTTTTGAATTATT
ATTGTTATTATTATTTTTTTAATTTTTAATATCAAGCTTTCATTTTGAGCTAGAGAAGAAAAACCATAGTTCAGTATCATTGTGTTATGGAGAAGTATATATGTGTGTGT
ATGTGGAGTTGGTGAAATAATGCTAGTTGGATATTTTGGTTTGATATTGCTTCACATTATGTTGGGATATTGAAAATCTCAAATACTCATGCACTTTAAAATTGATAATA
CCCAACGTCACAAACTAAATCAAATCTAAAAAGAATTAAACCAACACAAATACACTTTTTTTTTTTATTTTTTATTTTTTTAATTATTATTATTATTATTTTTTTAATAA
TGTATTTAAATCACCCATAAGGGGGAAAAAAGAAATCCAAATACATTTTATTTAAGTATTTTGGTCACCCAAAACAAATCTGATTGTGCAAGGCGGGTTTTGAGTAAAAG
TAAATTAAAATTTTTTAATTCACTTCTATATACATATATATTTCCAAAT
Protein sequence
MDKLLMWATEIYKWISIRVTNHCTLEMASTALILPIKVQTLPLAQYLSFRHSHPSATFSRVNYSPAELYILTNRVRKAPLQAKSLEKISPMATGDSTVWGWSRDQDAGKC
THRMRGQAFRISNASPGKDGLIKQVIMVDPLEAKRMAAKEMEKIKAKEKFKVKEVIIINDANHRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYFSFIINFFV
RQWDEAADSDGLVLEDSIGK
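
The CDS and protein sequences listed above can be checked against each other by translating the CDS with the standard genetic code. The sketch below is one way to do that in plain Python. The `translate` helper and the `CDS_START` / `CDS_END` constants are illustrative names, not part of the database, and the two fragments are simply the first 42 nucleotides and the final line of the CDS listing.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position so they line up with AMINO_ACIDS.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence, reading from its first base.

    Any incomplete trailing codon is ignored. With to_stop=True the
    translation ends at the first stop codon, which is not emitted.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS listing (illustrative constants).
CDS_START = "ATGGATAAACTGTTGATGTGGGCAACTGAAATCTATAAATGG"
CDS_END = "CGGCAGTGGGATGAAGCAGCAGATTCAGATGGTCTTGTTTTGGAAGATTCTATTGGCAAATAG"

print(translate(CDS_START))  # MDKLLMWATEIYKW
print(translate(CDS_END))    # RQWDEAADSDGLVLEDSIGK
```

Both outputs agree with the beginning and end of the protein sequence in this record. Translating the full CDS the same way only requires joining its lines into one string first.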