CuGenDBv2

Clc09G20860 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G20860
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Transmembrane protein
Genome location: ClcChr09:34403820..34404308
RNA-Seq Expression: Clc09G20860
Synteny: Clc09G20860
Gene Ontology terms:
GO:0010190 - cytochrome b6f complex assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domains: NA
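
The genome location above follows a `sequence:start..end` pattern. The following is a minimal sketch of parsing it, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state). `Location` and `parse_location` are hypothetical names used for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("ClcChr09:34403820..34404308")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span is 489 bp, which equals the length of the CDS listed under Sequences.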


Homology
GenBank top hits (e value, % identity, alignment)
KGN51853.2 hypothetical protein Csa_008870 [Cucumis sativus] | e value: 4.2e-75 | identity: 88.89%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR
        M+ESPS+ATR RRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+ RRGECDG+TAAE+  
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR

Query:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
            E+GLPE+SPGSGEE+ET GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

XP_008458773.1 PREDICTED: uncharacterized protein LOC103498078 [Cucumis melo] | e value: 1.3e-76 | identity: 90.74%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR
        M+ESPSRATR RRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+RRRGECDG+TAAE+  
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR

Query:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
            E+GLPEISPGSGEE+ET GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

XP_011655414.1 uncharacterized protein LOC105435525 [Cucumis sativus] | e value: 4.2e-75 | identity: 88.89%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR
        M+ESPS+ATR RRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+ RRGECDG+TAAE+  
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR

Query:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
            E+GLPE+SPGSGEE+ET GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

XP_023536558.1 uncharacterized protein LOC111797691 [Cucurbita pepo subsp. pepo] | e value: 2.7e-66 | identity: 83.13%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAA-ESD
        MEESPS  TR RRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWM+GRRCLQ+A   RKKRKLIRR+ ECDG+ AA E+ 
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAA-ESD

Query:  RGAAREEGLPEISPGSG---EEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
         G AREE LPEI PGSG   EEEE +GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSN WPNSN
Subjt:  RGAAREEGLPEISPGSG---EEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

XP_038891069.1 uncharacterized protein LOC120080480 [Benincasa hispida] | e value: 2.0e-77 | identity: 91.98%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR
        M+ESPSR TR RRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWMVGRRCLQ+ARKKRKKRKLIRRRGECDG+TAAE+  
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR

Query:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
         AA EEGLPEISPGSGEEEET+GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KR58 Uncharacterized protein | e value: 2.0e-75 | identity: 88.89%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR
        M+ESPS+ATR RRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+ RRGECDG+TAAE+  
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR

Query:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
            E+GLPE+SPGSGEE+ET GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

A0A1S3C8M3 uncharacterized protein LOC103498078 | e value: 6.4e-77 | identity: 90.74%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR
        M+ESPSRATR RRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+RRRGECDG+TAAE+  
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR

Query:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
            E+GLPEISPGSGEE+ET GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

A0A5A7T2E5 Uncharacterized protein | e value: 6.4e-77 | identity: 90.74%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR
        M+ESPSRATR RRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+RRRGECDG+TAAE+  
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDR

Query:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
            E+GLPEISPGSGEE+ET GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

A0A6J1HEE2 uncharacterized protein LOC111463206 | e value: 3.9e-66 | identity: 82.63%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAA-ESD
        MEESPS  TR RRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWM+GRRCLQ+A   RKKRKLIRR+ ECDG+ AA E+ 
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAA-ESD

Query:  RGAAREEGLPEISPGS----GEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
         G AREEGLPEI PGS     EEEE +GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSN WPNSN
Subjt:  RGAAREEGLPEISPGS----GEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

A0A6J1JH40 uncharacterized protein LOC111486958 | e value: 1.1e-65 | identity: 80.23%
Query:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAA-ESD
        MEESPS  TR RRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWM+GRRCLQ+A   RKKRKLIRR+ ECDG+ AA E+ 
Subjt:  MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAA-ESD

Query:  RGAAREEGLPEISPGS---------GEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
         G AREEGLPEI PGS          EEEE +GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSN WPNSN
Subjt:  RGAAREEGLPEISPGS---------GEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G01516.1 unknown protein | e value: 5.6e-25 | identity: 43.42%
Query:  CSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRR-----------------GEC----DGSTAAESDRGAA
        CSGK CRS  A  +ADCVA+CCCPC+VV+   LA VK+PWM+GR+C+ +    +K+ K I R                  G C    DG    +  R   
Subjt:  CSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRR-----------------GEC----DGSTAAESDRGAA

Query:  REEG--LPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTG
          +G    E +  +  +EE     SAR EAER+WL+LYQ+G LGFGRVSFTG
Subjt:  REEG--LPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTG

AT3G11690.1 unknown protein | e value: 8.4e-05 | identity: 35.44%
Query:  SPSRATR---LRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAV-CCCPCSVVSFLALALVKLPWMVGRRCLQQARKKR
        SPSR+ R   L + ++   +    C G        G  A C AV CCCPC +V+ L LA+ K+P  + RR ++  R+K+
Subjt:  SPSRATR---LRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAV-CCCPCSVVSFLALALVKLPWMVGRRCLQQARKKR

AT5G06380.1 unknown protein | e value: 3.2e-04 | identity: 29.91%
Query:  GKHCRSCTAGLVADCVAVC-CCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDRGAAREEGLPEISPGSGEEEETMGNF
        G     C  G  A C A+C C PCSVV+ + LA+ KLP  + RR +++ R+KR  +K     G        E  RG + +  +  +     EEEE   + 
Subjt:  GKHCRSCTAGLVADCVAVC-CCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDRGAAREEGLPEISPGSGEEEETMGNF

Query:  SARFEAERIWLQLYQMG
        +     + +W + Y  G
Subjt:  SARFEAERIWLQLYQMG

AT5G14690.1 unknown protein | e value: 5.1e-26 | identity: 38.3%
Query:  RLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRR----------------------
        R RR       D + CS K CRS  A  +ADCVA+CCCPC++++ L L LVK+PWM+GRRCL    + +KKR++I RR                      
Subjt:  RLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRR----------------------

Query:  ---------------GEC-------DGSTAAESDRGAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTG
                       G C       D     E D    +EE   E +     E+      SAR EAER+WL+LYQ+G LGFGRVSFTG
Subjt:  ---------------GEC-------DGSTAAESDRGAAREEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTG
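
In the alignment blocks above, the unlabeled middle line repeats a residue letter where query and subject are identical, shows `+` for a conservative substitution, and a space otherwise. The following is a minimal sketch of recomputing a percent identity from a query line and its middle line, assuming identity is the number of identical columns divided by the alignment length including gap columns. `percent_identity` is a hypothetical helper, and the `Query:` prefix is assumed to have been stripped from both strings.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline letter equals the query residue."""
    if not query:
        raise ValueError("empty alignment")
    # Trailing spaces of the midline are often trimmed in text dumps; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# First ten columns of the first GenBank hit shown above.
query = "MEESPSRATR"
midline = "M+ESPS+ATR"
print(f"{percent_identity(query, midline):.2f}")
```

For a multi-block alignment, the query lines and the middle lines would each be concatenated across blocks before calling the function.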


Sequences
CDS sequence
ATGGAAGAAAGCCCATCTCGAGCCACACGGCTGCGTAGATTCGCCGTGGACGACGGCGCCGATCTAATCGACTGTTCCGGCAAGCATTGCCGGTCGTGCACCGCCGGGTT
GGTTGCGGATTGTGTCGCCGTCTGCTGCTGCCCGTGCTCGGTGGTCAGCTTCTTGGCTCTGGCGCTGGTCAAACTCCCGTGGATGGTCGGGCGGCGGTGTCTGCAGCAGG
CGAGGAAGAAGAGGAAGAAGAGGAAATTGATTCGCCGGAGAGGGGAATGCGACGGCTCTACGGCGGCGGAGAGTGATCGGGGTGCGGCGAGGGAGGAGGGGTTGCCGGAA
ATTTCGCCGGGATCCGGCGAGGAAGAAGAGACGATGGGGAATTTCAGTGCAAGATTTGAAGCAGAGAGAATTTGGCTGCAATTGTATCAAATGGGTCAGTTGGGATTTGG
AAGAGTTTCTTTCACTGGGAATTCTAATCTTTGGCCTAATTCCAATTAG
mRNA sequence
ATGGAAGAAAGCCCATCTCGAGCCACACGGCTGCGTAGATTCGCCGTGGACGACGGCGCCGATCTAATCGACTGTTCCGGCAAGCATTGCCGGTCGTGCACCGCCGGGTT
GGTTGCGGATTGTGTCGCCGTCTGCTGCTGCCCGTGCTCGGTGGTCAGCTTCTTGGCTCTGGCGCTGGTCAAACTCCCGTGGATGGTCGGGCGGCGGTGTCTGCAGCAGG
CGAGGAAGAAGAGGAAGAAGAGGAAATTGATTCGCCGGAGAGGGGAATGCGACGGCTCTACGGCGGCGGAGAGTGATCGGGGTGCGGCGAGGGAGGAGGGGTTGCCGGAA
ATTTCGCCGGGATCCGGCGAGGAAGAAGAGACGATGGGGAATTTCAGTGCAAGATTTGAAGCAGAGAGAATTTGGCTGCAATTGTATCAAATGGGTCAGTTGGGATTTGG
AAGAGTTTCTTTCACTGGGAATTCTAATCTTTGGCCTAATTCCAATTAG
Protein sequence
MEESPSRATRLRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLIRRRGECDGSTAAESDRGAAREEGLPE
ISPGSGEEEETMGNFSARFEAERIWLQLYQMGQLGFGRVSFTGNSNLWPNSN
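
The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (the record does not name a translation table). `translate` is a hypothetical helper, and only the first 30 nucleotides of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGAAGAAAGCCCATCTCGAGCCACACGG"))
```

Passing the full 489-nucleotide CDS, with its line breaks, should give a string that can be compared directly with the listed protein sequence.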