; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022157 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022157
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionTransmembrane protein
Genome locationChr05:21432218..21432706
RNA-Seq ExpressionHG10022157
SyntenyHG10022157
Gene Ontology termsGO:0010190 - cytochrome b6f complex assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN51853.2 hypothetical protein Csa_008870 [Cucumis sativus]5.0e-7690.12Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR
        M+ESPS+ATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+ RRGECDG+TA ET  
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR

Query:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
            E+GLPE+SPGSGEE+ET GNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

XP_008458773.1 PREDICTED: uncharacterized protein LOC103498078 [Cucumis melo]1.6e-7791.98Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR
        M+ESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+RRRGECDG+TA ET  
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR

Query:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
            E+GLPEISPGSGEE+ET GNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

XP_011655414.1 uncharacterized protein LOC105435525 [Cucumis sativus]5.0e-7690.12Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR
        M+ESPS+ATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+ RRGECDG+TA ET  
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR

Query:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
            E+GLPE+SPGSGEE+ET GNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

XP_023536558.1 uncharacterized protein LOC111797691 [Cucurbita pepo subsp. pepo]1.6e-6682.53Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDG-STATETD
        MEESPS  TRRRRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWM+GRRCLQ+A   RKKRKL+RR+ ECDG + A+ET 
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDG-STATETD

Query:  RGAAKEEGLPEISPGSG---EEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
         G A+EE LPEI PGSG   EEEE +GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSN WPNSN
Subjt:  RGAAKEEGLPEISPGSG---EEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

XP_038891069.1 uncharacterized protein LOC120080480 [Benincasa hispida]1.8e-7892.59Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR
        M+ESPSR TRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWMVGRRCLQ+ARKKRKKRKL+RRRGECDG+TA ET  
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR

Query:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
         AA EEGLPEISPGSGEEEET+GNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

TrEMBL top hitse value%identityAlignment
A0A0A0KR58 Uncharacterized protein2.4e-7690.12Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR
        M+ESPS+ATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+ RRGECDG+TA ET  
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR

Query:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
            E+GLPE+SPGSGEE+ET GNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

A0A1S3C8M3 uncharacterized protein LOC1034980787.5e-7891.98Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR
        M+ESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+RRRGECDG+TA ET  
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR

Query:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
            E+GLPEISPGSGEE+ET GNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

A0A5A7T2E5 Uncharacterized protein7.5e-7891.98Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR
        M+ESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKL+RRRGECDG+TA ET  
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDR

Query:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
            E+GLPEISPGSGEE+ET GNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
Subjt:  GAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

A0A6J1HEE2 uncharacterized protein LOC1114632062.3e-6682.04Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDG-STATETD
        MEESPS  TRRRRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWM+GRRCLQ+A   RKKRKL+RR+ ECDG + A+ET 
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDG-STATETD

Query:  RGAAKEEGLPEISPGS----GEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
         G A+EEGLPEI PGS     EEEE +GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSN WPNSN
Subjt:  RGAAKEEGLPEISPGS----GEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

A0A6J1JH40 uncharacterized protein LOC1114869586.6e-6679.65Show/hide
Query:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDG-STATETD
        MEESPS  TRRRRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWM+GRRCLQ+A   RKKRKL+RR+ ECDG + A+ET 
Subjt:  MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDG-STATETD

Query:  RGAAKEEGLPEISPGS---------GEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN
         G A+EEGLPEI PGS          EEEE +GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGNSN WPNSN
Subjt:  RGAAKEEGLPEISPGS---------GEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01516.1 unknown protein2.5e-2543.95Show/hide
Query:  CSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRR-----------------------GECDGS-----TAT
        CSGK CRS  A  +ADCVA+CCCPC+VV+   LA VK+PWM+GR+C+ +    +K+ K + R                        G+ DG         
Subjt:  CSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRR-----------------------GECDGS-----TAT

Query:  ETDRGAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTG
        E D    KEE     +    EEEET    SAR EAER+WL+LYQ+G LGFGRVSFTG
Subjt:  ETDRGAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTG

AT3G11690.1 unknown protein6.4e-0535.44Show/hide
Query:  SPSRATRRR---RFAVDDGADLIDCSGKHCRSCTAGLVADCVAV-CCCPCSVVSFLALALVKLPWMVGRRCLQQARKKR
        SPSR+ RR+   + ++   +    C G        G  A C AV CCCPC +V+ L LA+ K+P  + RR ++  R+K+
Subjt:  SPSRATRRR---RFAVDDGADLIDCSGKHCRSCTAGLVADCVAV-CCCPCSVVSFLALALVKLPWMVGRRCLQQARKKR

AT5G06380.1 unknown protein1.9e-0429.91Show/hide
Query:  GKHCRSCTAGLVADCVAVC-CCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDRGAAKEEGLPEISPGSGEEEETMGNF
        G     C  G  A C A+C C PCSVV+ + LA+ KLP  + RR +++ R+KR  +K     G        E  RG + +  +  +     EEEE   + 
Subjt:  GKHCRSCTAGLVADCVAVC-CCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDRGAAKEEGLPEISPGSGEEEETMGNF

Query:  SARFEAERIWLQLYQVG
        +     + +W + Y  G
Subjt:  SARFEAERIWLQLYQVG

AT5G14690.1 unknown protein3.5e-2738.83Show/hide
Query:  RRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRR----------------------
        RRRR       D + CS K CRS  A  +ADCVA+CCCPC++++ L L LVK+PWM+GRRCL    + +KKR+++ RR                      
Subjt:  RRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRR----------------------

Query:  ---------------GEC-------DGSTATETDRGAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTG
                       G C       D     E D    KEE   E +     E+      SAR EAER+WL+LYQ+G LGFGRVSFTG
Subjt:  ---------------GEC-------DGSTATETDRGAAKEEGLPEISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAAGCCCATCTCGAGCCACACGGCGGCGTAGATTCGCCGTCGACGACGGCGCCGATCTAATCGACTGCTCCGGCAAGCATTGCCGGTCGTGCACCGCCGGGTT
GGTGGCGGACTGTGTCGCCGTCTGTTGCTGCCCGTGCTCGGTGGTCAGCTTCTTGGCTCTGGCCCTGGTCAAACTGCCGTGGATGGTCGGGCGGCGGTGTCTGCAGCAGG
CCAGAAAGAAGAGGAAGAAGAGGAAATTGCTTCGCCGGAGAGGGGAATGCGACGGCTCTACGGCGACGGAGACTGATCGGGGTGCCGCGAAGGAGGAGGGGTTGCCGGAA
ATTTCGCCGGGATCCGGCGAGGAAGAAGAGACGATGGGGAATTTCAGTGCAAGATTTGAAGCAGAGAGAATTTGGCTGCAGTTGTATCAGGTGGGTCAGTTGGGATTTGG
AAGAGTTTCTTTCACTGGGAATTCTAATCTTTGGCCTAATTCCAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAAAGCCCATCTCGAGCCACACGGCGGCGTAGATTCGCCGTCGACGACGGCGCCGATCTAATCGACTGCTCCGGCAAGCATTGCCGGTCGTGCACCGCCGGGTT
GGTGGCGGACTGTGTCGCCGTCTGTTGCTGCCCGTGCTCGGTGGTCAGCTTCTTGGCTCTGGCCCTGGTCAAACTGCCGTGGATGGTCGGGCGGCGGTGTCTGCAGCAGG
CCAGAAAGAAGAGGAAGAAGAGGAAATTGCTTCGCCGGAGAGGGGAATGCGACGGCTCTACGGCGACGGAGACTGATCGGGGTGCCGCGAAGGAGGAGGGGTTGCCGGAA
ATTTCGCCGGGATCCGGCGAGGAAGAAGAGACGATGGGGAATTTCAGTGCAAGATTTGAAGCAGAGAGAATTTGGCTGCAGTTGTATCAGGTGGGTCAGTTGGGATTTGG
AAGAGTTTCTTTCACTGGGAATTCTAATCTTTGGCCTAATTCCAATTAG
Protein sequenceShow/hide protein sequence
MEESPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKLPWMVGRRCLQQARKKRKKRKLLRRRGECDGSTATETDRGAAKEEGLPE
ISPGSGEEEETMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNSNLWPNSN