; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g35380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g35380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTransmembrane protein
Genome locationchr4:26601463..26601954
RNA-Seq ExpressionMoc04g35380
SyntenyMoc04g35380
Gene Ontology termsGO:0010190 - cytochrome b6f complex assembly (biological process)
GO:0009507 - chloroplast (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN51853.2 hypothetical protein Csa_008870 [Cucumis sativus]6.2e-5875.93Show/hide
Query:  EQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAA
        E +  + SP++ TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L  RRGE DG  AA
Subjt:  EQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAA

Query:  NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
         TGG   N ++ +PE+SP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

XP_008458773.1 PREDICTED: uncharacterized protein LOC103498078 [Cucumis melo]8.6e-6080.65Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP+R TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L RRRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
        N ++ +PEISP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

XP_011655414.1 uncharacterized protein LOC105435525 [Cucumis sativus]2.8e-5878.71Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP++ TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L  RRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
        N ++ +PE+SP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

XP_022134219.1 uncharacterized protein LOC111006530 [Momordica charantia]6.3e-87100Show/hide
Query:  MEQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAA
        MEQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAA
Subjt:  MEQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAA

Query:  NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN
        NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN
Subjt:  NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN

XP_038891069.1 uncharacterized protein LOC120080480 [Benincasa hispida]1.0e-6080.65Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP+RPTRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALAL+KLPWMVGRR LQ ARKK K + L RRRGE DG  AA TGG  A
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
        + +E +PEISP  G E+   GN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

TrEMBL top hitse value%identityAlignment
A0A0A0KR58 Uncharacterized protein3.0e-5875.93Show/hide
Query:  EQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAA
        E +  + SP++ TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L  RRGE DG  AA
Subjt:  EQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAA

Query:  NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
         TGG   N ++ +PE+SP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

A0A1S3C8M3 uncharacterized protein LOC1034980784.2e-6080.65Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP+R TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L RRRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
        N ++ +PEISP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

A0A5A7T2E5 Uncharacterized protein4.2e-6080.65Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP+R TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L RRRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
        N ++ +PEISP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  NRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

A0A6J1BXL5 uncharacterized protein LOC1110065303.1e-87100Show/hide
Query:  MEQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAA
        MEQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAA
Subjt:  MEQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAA

Query:  NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN
        NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN
Subjt:  NTGGTPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN

A0A6J1JH40 uncharacterized protein LOC1114869588.4e-5370.37Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGTPAN
        SP+ PTRRRRF VDDG DLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWM+GRR LQ ARKK K+  +RR+ E DG  AA+  G    
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGTPAN

Query:  RDEKMPEISPAFGGEQAE---------TGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSS
        R+E +PEI P    E+ E          GN SARFEAERIWLQLYQ+GQLGFGRVSFTGNS+
Subjt:  RDEKMPEISPAFGGEQAE---------TGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01516.1 unknown protein1.3e-2444.08Show/hide
Query:  CSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTL-QHARKKWKMKSL------------RRRGEVDGGPAANTGGTPANRDEK--MP
        CSGK CRS  A  IADCVA+CCCPC+VV+   LA VK+PWM+GR+ + +    K +MK +            RR  E+  G     G      D+   + 
Subjt:  CSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTL-QHARKKWKMKSL------------RRRGEVDGGPAANTGGTPANRDEK--MP

Query:  EISPAFGGEQAETGNL--------SARFEAERIWLQLYQVGQLGFGRVSFTG
        E   +   E+A+T +L        SAR EAER+WL+LYQ+G LGFGRVSFTG
Subjt:  EISPAFGGEQAETGNL--------SARFEAERIWLQLYQVGQLGFGRVSFTG

AT3G11690.1 unknown protein1.9e-0432.94Show/hide
Query:  NPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAV-CCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKS
        +PSR +  +P  +R  ++   +    C G        G  A C AV CCCPC +V+ L LA+ K+P  + RR ++  R+K  +K+
Subjt:  NPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAV-CCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKS

AT5G14690.1 unknown protein3.9e-2635.35Show/hide
Query:  MEQNPSRPSPA-------------RPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTL----QHARKKW
        ME+NP R S                  RRRR       D + CS K CRS  A  IADCVA+CCCPC++++ L L LVK+PWM+GRR L    ++ +K+ 
Subjt:  MEQNPSRPSPA-------------RPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTL----QHARKKW

Query:  KMKSLRRRGEVDG-------------------GPAANTGG---------------------TPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQL
         +   +RRG ++G                   G     GG                         ++E+  E + +  GE  +   +SAR EAER+WL+L
Subjt:  KMKSLRRRGEVDG-------------------GPAANTGG---------------------TPANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQL

Query:  YQVGQLGFGRVSFTG
        YQ+G LGFGRVSFTG
Subjt:  YQVGQLGFGRVSFTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAAAACCCATCTCGCCCTTCGCCGGCGAGACCCACGCGGCGGCGCAGATTCGCCGTGGACGACGGCGCCGATCTAATCGACTGCTCCGGCAAGCATTGC
CGGTCGTGCACCGCCGGCTTGATCGCCGACTGCGTCGCCGTCTGCTGCTGCCCCTGCTCTGTGGTCAGCTTCCTGGCTCTGGCTCTGGTCAAACTGCCGTGGATG
GTCGGCCGGAGGACCCTGCAGCACGCCAGGAAGAAGTGGAAGATGAAATCCCTCCGCCGGAGAGGGGAAGTCGACGGCGGCCCTGCGGCAAACACAGGTGGCACT
CCGGCGAACAGGGATGAAAAAATGCCGGAAATTTCGCCGGCGTTCGGCGGCGAACAAGCAGAGACAGGGAATTTGAGTGCGAGATTTGAAGCAGAGAGAATATGG
TTGCAATTGTATCAGGTTGGTCAATTGGGTTTTGGAAGAGTTTCCTTCACTGGGAATTCAAGTCTCAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACAAAACCCATCTCGCCCTTCGCCGGCGAGACCCACGCGGCGGCGCAGATTCGCCGTGGACGACGGCGCCGATCTAATCGACTGCTCCGGCAAGCATTGC
CGGTCGTGCACCGCCGGCTTGATCGCCGACTGCGTCGCCGTCTGCTGCTGCCCCTGCTCTGTGGTCAGCTTCCTGGCTCTGGCTCTGGTCAAACTGCCGTGGATG
GTCGGCCGGAGGACCCTGCAGCACGCCAGGAAGAAGTGGAAGATGAAATCCCTCCGCCGGAGAGGGGAAGTCGACGGCGGCCCTGCGGCAAACACAGGTGGCACT
CCGGCGAACAGGGATGAAAAAATGCCGGAAATTTCGCCGGCGTTCGGCGGCGAACAAGCAGAGACAGGGAATTTGAGTGCGAGATTTGAAGCAGAGAGAATATGG
TTGCAATTGTATCAGGTTGGTCAATTGGGTTTTGGAAGAGTTTCCTTCACTGGGAATTCAAGTCTCAATTAG
Protein sequenceShow/hide protein sequence
MEQNPSRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGT
PANRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN