; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003136 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003136
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTransmembrane protein
Genome locationscaffold234:642174..642647
RNA-Seq ExpressionMS003136
SyntenyMS003136
Gene Ontology termsGO:0010190 - cytochrome b6f complex assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN51853.2 hypothetical protein Csa_008870 [Cucumis sativus]1.3e-5778.06Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP++ TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L  RRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
          ++ +PE+SP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

XP_008458773.1 PREDICTED: uncharacterized protein LOC103498078 [Cucumis melo]4.1e-5980Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP+R TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L RRRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
          ++ +PEISP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

XP_011655414.1 uncharacterized protein LOC105435525 [Cucumis sativus]1.3e-5778.06Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP++ TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L  RRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
          ++ +PE+SP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

XP_022134219.1 uncharacterized protein LOC111006530 [Momordica charantia]4.1e-8399.37Show/hide
Query:  SRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGT
        SRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGT
Subjt:  SRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGT

Query:  PAKRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN
        PA RDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN
Subjt:  PAKRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN

XP_038891069.1 uncharacterized protein LOC120080480 [Benincasa hispida]1.3e-6080.65Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP+RPTRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALAL+KLPWMVGRR LQ ARKK K + L RRRGE DG  AA TGG  A
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
          +E +PEISP  G E+   GN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

TrEMBL top hitse value%identityAlignment
A0A0A0KR58 Uncharacterized protein6.5e-5878.06Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP++ TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L  RRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
          ++ +PE+SP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

A0A1S3C8M3 uncharacterized protein LOC1034980782.0e-5980Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP+R TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L RRRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
          ++ +PEISP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

A0A5A7T2E5 Uncharacterized protein2.0e-5980Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA
        SP+R TRRRRFAVDDGADLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWMVGRR LQ ARKK K + L RRRGE DG  AA TGG   
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSL-RRRGEVDGGPAANTGGTPA

Query:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL
          ++ +PEISP  G E   TGN SARFEAERIWLQLYQVGQLGFGRVSFTGNS+L
Subjt:  KRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSL

A0A6J1BXL5 uncharacterized protein LOC1110065302.0e-8399.37Show/hide
Query:  SRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGT
        SRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGT
Subjt:  SRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGT

Query:  PAKRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN
        PA RDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN
Subjt:  PAKRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN

A0A6J1JH40 uncharacterized protein LOC1114869584.8e-5370.37Show/hide
Query:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGTPAK
        SP+ PTRRRRF VDDG DLIDCSGKHCRSCTAGL+ADCVAVCCCPCSVVSFLALALVKLPWM+GRR LQ ARKK K+  +RR+ E DG  AA+  G    
Subjt:  SPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGTPAK

Query:  RDEKMPEISPAFGGEQAE---------TGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSS
        R+E +PEI P    E+ E          GN SARFEAERIWLQLYQ+GQLGFGRVSFTGNS+
Subjt:  RDEKMPEISPAFGGEQAE---------TGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01516.1 unknown protein9.3e-2544.08Show/hide
Query:  CSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTL-QHARKKWKMKSL------------RRRGEVDGGPAANTGGTPAKRDEK--MP
        CSGK CRS  A  IADCVA+CCCPC+VV+   LA VK+PWM+GR+ + +    K +MK +            RR  E+  G     G    + D+   + 
Subjt:  CSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTL-QHARKKWKMKSL------------RRRGEVDGGPAANTGGTPAKRDEK--MP

Query:  EISPAFGGEQAETGNL--------SARFEAERIWLQLYQVGQLGFGRVSFTG
        E   +   E+A+T +L        SAR EAER+WL+LYQ+G LGFGRVSFTG
Subjt:  EISPAFGGEQAETGNL--------SARFEAERIWLQLYQVGQLGFGRVSFTG

AT3G11690.1 unknown protein1.1e-0433.73Show/hide
Query:  SPARPTRRR---RFAVDDGADLIDCSGKHCRSCTAGLIADCVAV-CCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKS
        SP+R  RR+   + ++   +    C G        G  A C AV CCCPC +V+ L LA+ K+P  + RR ++  R+K  +K+
Subjt:  SPARPTRRR---RFAVDDGADLIDCSGKHCRSCTAGLIADCVAV-CCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKS

AT5G14690.1 unknown protein3.8e-2637.23Show/hide
Query:  RRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTL----QHARKKWKMKSLRRRGEVDG--------------
        RRRR       D + CS K CRS  A  IADCVA+CCCPC++++ L L LVK+PWM+GRR L    ++ +K+  +   +RRG ++G              
Subjt:  RRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTL----QHARKKWKMKSLRRRGEVDG--------------

Query:  -----GPAANTGG---------------------TPAKRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTG
             G     GG                         ++E+  E + +  GE  +   +SAR EAER+WL+LYQ+G LGFGRVSFTG
Subjt:  -----GPAANTGG---------------------TPAKRDEKMPEISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCTCGCCCTTCGCCGGCGAGACCCACGCGGCGGCGCAGATTCGCCGTGGACGACGGCGCCGATCTAATCGACTGCTCCGGCAAGCATTGCCGGTCGTGCACCGCCGGCTT
GATCGCCGACTGCGTCGCCGTCTGCTGCTGCCCCTGCTCTGTGGTCAGCTTCCTGGCTCTGGCTCTGGTCAAACTGCCGTGGATGGTCGGCCGGAGGACCCTGCAGCACG
CCAGGAAGAAGTGGAAGATGAAATCCCTCCGCCGGAGAGGGGAAGTCGACGGCGGCCCTGCGGCAAACACAGGTGGCACTCCGGCGAAAAGGGATGAAAAAATGCCGGAA
ATTTCGCCGGCGTTCGGCGGCGAACAAGCAGAGACAGGGAATTTGAGTGCGAGATTTGAAGCAGAGAGAATATGGTTGCAATTGTATCAGGTTGGTCAATTGGGTTTTGG
AAGAGTTTCCTTCACTGGGAATTCAAGTCTCAAT
mRNA sequenceShow/hide mRNA sequence
TCTCGCCCTTCGCCGGCGAGACCCACGCGGCGGCGCAGATTCGCCGTGGACGACGGCGCCGATCTAATCGACTGCTCCGGCAAGCATTGCCGGTCGTGCACCGCCGGCTT
GATCGCCGACTGCGTCGCCGTCTGCTGCTGCCCCTGCTCTGTGGTCAGCTTCCTGGCTCTGGCTCTGGTCAAACTGCCGTGGATGGTCGGCCGGAGGACCCTGCAGCACG
CCAGGAAGAAGTGGAAGATGAAATCCCTCCGCCGGAGAGGGGAAGTCGACGGCGGCCCTGCGGCAAACACAGGTGGCACTCCGGCGAAAAGGGATGAAAAAATGCCGGAA
ATTTCGCCGGCGTTCGGCGGCGAACAAGCAGAGACAGGGAATTTGAGTGCGAGATTTGAAGCAGAGAGAATATGGTTGCAATTGTATCAGGTTGGTCAATTGGGTTTTGG
AAGAGTTTCCTTCACTGGGAATTCAAGTCTCAAT
Protein sequenceShow/hide protein sequence
SRPSPARPTRRRRFAVDDGADLIDCSGKHCRSCTAGLIADCVAVCCCPCSVVSFLALALVKLPWMVGRRTLQHARKKWKMKSLRRRGEVDGGPAANTGGTPAKRDEKMPE
ISPAFGGEQAETGNLSARFEAERIWLQLYQVGQLGFGRVSFTGNSSLN