; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS025591 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS025591
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold4:1206834..1207136
RNA-Seq ExpressionMS025591
SyntenyMS025591
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB2629574.1 pentatricopeptide repeat-containing protein [Pyrus ussuriensis x Pyrus communis]8.0e-1651.43Show/hide
Query:  MGIK-WRNG-SQSIRLGRS-SVSIERDSKRPGWQILWQKIVKEKKKFFGSSSVM-ELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPS-LVSGN
        MGI+ WRN  SQSIRLG +  +    +  +PGWQ  W++   E+KK FGSS+VM +  +SY+ + Y  NFD G    EPDN+ RSFSARFADPS ++ GN
Subjt:  MGIK-WRNG-SQSIRLGRS-SVSIERDSKRPGWQILWQKIVKEKKKFFGSSSVM-ELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPS-LVSGN

Query:  LRLLD
          LLD
Subjt:  LRLLD

KAG6591653.1 hypothetical protein SDJN03_13999, partial [Cucurbita argyrosperma subsp. sororia]1.2e-2264.08Show/hide
Query:  MGIKWRNG-SQSIRLGRSSVS-IERDSKRP-GWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLR
        MGIKWRN  SQSIRLG+S  S  E++SKR  GWQILWQK  KEK++ F  SSV EL SSYN NAY++NF+      +PD++CRSFSARFADPS+VS + R
Subjt:  MGIKWRNG-SQSIRLGRSSVS-IERDSKRP-GWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLR

Query:  LLD
        LLD
Subjt:  LLD

KGN60340.2 hypothetical protein Csa_000895 [Cucumis sativus]6.5e-2668.63Show/hide
Query:  MGIKWRNG-SQSIRLGRSSVSI-ERDSKRPGWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLRL
        MG+KWRN  SQSIRLG+S VS  E++SKR GWQILW+K+ KEK+K F  SSV EL SSYN NAY +NFD      EPDN+ RSFSARFADPS+VS NLRL
Subjt:  MGIKWRNG-SQSIRLGRSSVSI-ERDSKRPGWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLRL

Query:  LD
        LD
Subjt:  LD

XP_022936244.1 uncharacterized protein LOC111442916 [Cucurbita moschata]6.7e-2364.08Show/hide
Query:  MGIKWRNG-SQSIRLGRS-SVSIERDSKRP-GWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLR
        MGIKWRN  SQSIRLG+S   S E++SKR  GWQILWQK  KEK++ F  SSV EL SSYN NAY++NF+      +PD++CRSFSARFADPS+VS + R
Subjt:  MGIKWRNG-SQSIRLGRS-SVSIERDSKRP-GWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLR

Query:  LLD
        LLD
Subjt:  LLD

XP_023536137.1 uncharacterized protein LOC111797385 [Cucurbita pepo subsp. pepo]6.7e-2364.08Show/hide
Query:  MGIKWRNG-SQSIRLGRS-SVSIERDSKR-PGWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLR
        MGIKWRN  SQSIRLG+S   S E++SKR  GWQILWQK  KEK++ F  SSV EL SSYN NAY++NF+      +PD++CRSFSARFADPS+VS + R
Subjt:  MGIKWRNG-SQSIRLGRS-SVSIERDSKR-PGWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLR

Query:  LLD
        LLD
Subjt:  LLD

TrEMBL top hitse value%identityAlignment
A0A0A0LJH9 Uncharacterized protein3.2e-2668.63Show/hide
Query:  MGIKWRNG-SQSIRLGRSSVSI-ERDSKRPGWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLRL
        MG+KWRN  SQSIRLG+S VS  E++SKR GWQILW+K+ KEK+K F  SSV EL SSYN NAY +NFD      EPDN+ RSFSARFADPS+VS NLRL
Subjt:  MGIKWRNG-SQSIRLGRSSVSI-ERDSKRPGWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLRL

Query:  LD
        LD
Subjt:  LD

A0A251RD41 Uncharacterized protein5.6e-1552.38Show/hide
Query:  MGIK-WRNG-SQSIRLGRSSV-SIERDSKRPGWQILWQKI-VKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNL
        MGIK W N  SQS+RLG+  + S      RPGWQ  W+K  + +KKK F SS+V    +SY+   Y  NFD G+G  EPDN+ RSFSARFADPS++  N 
Subjt:  MGIK-WRNG-SQSIRLGRSSV-SIERDSKRPGWQILWQKI-VKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNL

Query:  R-LLD
        R LLD
Subjt:  R-LLD

A0A5N5HP14 Pentatricopeptide repeat-containing protein6.6e-1651.43Show/hide
Query:  MGIK-WRNG-SQSIRLGRS-SVSIERDSKRPGWQILWQKIVKEKKKFFGSSSVM-ELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPS-LVSGN
        MGI+ WRN  SQSIRLG +  +    +  +PGWQ  W++   E+KK FGSS+V  +  +SY+   Y  NFD G G  EPDN+ RSFSARFADPS ++ GN
Subjt:  MGIK-WRNG-SQSIRLGRS-SVSIERDSKRPGWQILWQKIVKEKKKFFGSSSVM-ELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPS-LVSGN

Query:  LRLLD
          LLD
Subjt:  LRLLD

A0A5N5I1S5 Pentatricopeptide repeat-containing protein3.9e-1651.43Show/hide
Query:  MGIK-WRNG-SQSIRLGRS-SVSIERDSKRPGWQILWQKIVKEKKKFFGSSSVM-ELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPS-LVSGN
        MGI+ WRN  SQSIRLG +  +    +  +PGWQ  W++   E+KK FGSS+VM +  +SY+ + Y  NFD G    EPDN+ RSFSARFADPS ++ GN
Subjt:  MGIK-WRNG-SQSIRLGRS-SVSIERDSKRPGWQILWQKIVKEKKKFFGSSSVM-ELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPS-LVSGN

Query:  LRLLD
          LLD
Subjt:  LRLLD

A0A6J1F7R9 uncharacterized protein LOC1114429163.3e-2364.08Show/hide
Query:  MGIKWRNG-SQSIRLGRS-SVSIERDSKRP-GWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLR
        MGIKWRN  SQSIRLG+S   S E++SKR  GWQILWQK  KEK++ F  SSV EL SSYN NAY++NF+      +PD++CRSFSARFADPS+VS + R
Subjt:  MGIKWRNG-SQSIRLGRS-SVSIERDSKRP-GWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLR

Query:  LLD
        LLD
Subjt:  LLD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G25735.1 unknown protein4.0e-0534.41Show/hide
Query:  QSIRLGRSSVSI-------ERDSKRPGWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCF---EPDNVCRSFSARFADPSLV
        ++I LGR S S         R+   P W++L  K+     K   S +      +Y    Y +NFD G G     EP+N+ RSFS RFADP+ +
Subjt:  QSIRLGRSSVSI-------ERDSKRPGWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCF---EPDNVCRSFSARFADPSLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGATCAAATGGCGCAACGGCAGCCAAAGCATCAGGCTGGGGCGGTCCTCCGTTTCAATCGAACGAGATAGCAAACGGCCCGGATGGCAGATTTTGTGGCAGAAAAT
CGTCAAGGAGAAGAAGAAATTCTTCGGTTCTTCTTCTGTTATGGAATTGAATTCTTCTTATAATTCGAACGCTTACCAGATGAATTTTGATCACGGCATCGGATGCTTCG
AACCTGATAATGTCTGCAGATCCTTCTCTGCTCGATTTGCTGATCCCTCCCTTGTCTCCGGAAACTTGAGATTGTTGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGATCAAATGGCGCAACGGCAGCCAAAGCATCAGGCTGGGGCGGTCCTCCGTTTCAATCGAACGAGATAGCAAACGGCCCGGATGGCAGATTTTGTGGCAGAAAAT
CGTCAAGGAGAAGAAGAAATTCTTCGGTTCTTCTTCTGTTATGGAATTGAATTCTTCTTATAATTCGAACGCTTACCAGATGAATTTTGATCACGGCATCGGATGCTTCG
AACCTGATAATGTCTGCAGATCCTTCTCTGCTCGATTTGCTGATCCCTCCCTTGTCTCCGGAAACTTGAGATTGTTGGATTGA
Protein sequenceShow/hide protein sequence
MGIKWRNGSQSIRLGRSSVSIERDSKRPGWQILWQKIVKEKKKFFGSSSVMELNSSYNSNAYQMNFDHGIGCFEPDNVCRSFSARFADPSLVSGNLRLLD