; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g0716 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g0716
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationMC08:5814071..5814495
RNA-Seq ExpressionMC08g0716
SyntenyMC08g0716
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585558.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]2.09e-6682.71Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGAR IFYEMPVKDIV+WNAILSGYVNA RM+E KSFFA+MP K LLTWTVMISGLA+NGFGE+GL LFN+MRLD Y PCDYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLG+LEN RQLHAQL+HLGH S+LS GNAMIS
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

KAG7020472.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.09e-6682.71Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGAR IFYEMPVKDIV+WNAILSGYVNA RM+E KSFFA+MP K LLTWTVMISGLA+NGFGE+GL LFN+MRLD Y PCDYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLG+LEN RQLHAQL+HLGH S+LS GNAMIS
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

XP_022144208.1 pentatricopeptide repeat-containing protein At1g25360-like [Momordica charantia]1.03e-6884.96Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGARKIFYEMP+KDIVTWNAILSGYVN+ RM+E KSFF EMP K LLTWTVMISGLA+NGFGE+GLKLFNQMRLD Y PCDYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLGALEN RQLHAQL+HLGH S+LS GNAMIS
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

XP_023002238.1 pentatricopeptide repeat-containing protein At1g25360-like [Cucurbita maxima]1.08e-6683.46Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGAR IFYEMPVKDIV+WNAILSGYVNA RM+E KSFFA+MP K LLTWTVMISGLA+NGFGE+GL LFN+MRLD Y PCDYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLG+LEN RQLHAQLVHLGH S+LS GNAMIS
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

XP_038886633.1 pentatricopeptide repeat-containing protein At1g25360-like [Benincasa hispida]5.55e-6783.46Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGARKIFYEMPVKDIV+WNAILSGYVNA RM++ KSFFA+MP K LLTWTV+ISGLA+NGFGE+ LKLFNQMRLD Y PCDYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLGALEN RQLHAQ+VHLGH+S+LS GNAMIS
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

TrEMBL top hitse value%identityAlignment
A0A0A0LL72 DYW_deaminase domain-containing protein1.43e-6581.95Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGARKIFYEMPVKDI+TWN +LSGYVNA RM+E KSFFA+MP K LLTWTVMISGLA+NGFGE  LKLFNQM+LD Y P DYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLGALEN RQLHAQ+VHLGH STLS GNAMI+
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

A0A1S4DVG9 pentatricopeptide repeat-containing protein At1g25360-like1.33e-6581.2Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGARKIFYEMPVKD+++WN +LSGYVNA RM+E KSFFA+MP K LLTWTVMISGLA+NGFGE  LKLFNQMRLD Y P DYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLGALEN RQLHAQ+VHLGH STLS GNAMI+
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

A0A6J1CRF6 pentatricopeptide repeat-containing protein At1g25360-like4.99e-6984.96Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGARKIFYEMP+KDIVTWNAILSGYVN+ RM+E KSFF EMP K LLTWTVMISGLA+NGFGE+GLKLFNQMRLD Y PCDYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLGALEN RQLHAQL+HLGH S+LS GNAMIS
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

A0A6J1GGL5 pentatricopeptide repeat-containing protein At1g25360-like1.01e-6682.71Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGAR IFYEMPVKDIV+WNAILSGYVNA RM+E KSFFA+MP K LLTWTVMISGLA+NGFGE+GL LFN+MRLD Y PCDYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLG+LEN RQLHAQL+HLGH S+LS GNAMIS
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

A0A6J1KSZ4 pentatricopeptide repeat-containing protein At1g25360-like5.21e-6783.46Show/hide
Query:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC
        TLY K GKVDGAR IFYEMPVKDIV+WNAILSGYVNA RM+E KSFFA+MP K LLTWTVMISGLA+NGFGE+GL LFN+MRLD Y PCDYA AGAITAC
Subjt:  TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITAC

Query:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
        SVLG+LEN RQLHAQLVHLGH S+LS GNAMIS
Subjt:  SVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

SwissProt top hitse value%identityAlignment
Q9FJY7 Pentatricopeptide repeat-containing protein At5g665201.3e-1736.84Show/hide
Query:  YCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACSV
        Y   G    A  +F  +P  D V+WN+++ GYV A +MD   + F +M  K  ++WT MISG  +    ++ L+LF++M+  D  P + +LA A++AC+ 
Subjt:  YCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACSV

Query:  LGALE-NRQLHAQL
        LGALE  + +H+ L
Subjt:  LGALE-NRQLHAQL

Q9FRI5 Pentatricopeptide repeat-containing protein At1g253608.9e-4054.42Show/hide
Query:  RLDDFTFTF-----TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDY
        R +DF+F F     +LY K GK D AR IF +MP KD+V+WNA+LSGYV++  + E K  F EM  K +L+W +MISGLA+NGFGE+GLKLF+ M+ + +
Subjt:  RLDDFTFTF-----TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDY

Query:  XPCDYALAGAITACSVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
         PCDYA +GAI +C+VLGA  N +Q HAQL+ +G  S+LS GNA+I+
Subjt:  XPCDYALAGAITACSVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic5.6e-1830.88Show/hide
Query:  TFTFTLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGA
        T   ++Y +NG+++ A K+F + P +D+V++ A++ GY +   ++  +  F E+P K +++W  MISG A+ G  ++ L+LF  M   +  P +  +   
Subjt:  TFTFTLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGA

Query:  ITACSVLGALE-NRQLHAQLVHLGHSSTLSDGNAMI
        ++AC+  G++E  RQ+H  +   G  S L   NA+I
Subjt:  ITACSVLGALE-NRQLHAQLVHLGHSSTLSDGNAMI

Q9LXF2 Pentatricopeptide repeat-containing protein At5g153005.6e-1834.43Show/hide
Query:  YCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACSV
        Y K GK+D A ++F EMP KD V WN +++G +  + MD  +  F     K ++TW  MISG    G+ ++ L +F +MR     P    +   ++AC+V
Subjt:  YCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACSV

Query:  LGALE-NRQLHAQLVHLGHSST
        LG LE  ++LH  ++     S+
Subjt:  LGALE-NRQLHAQLVHLGHSST

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic1.3e-1737.14Show/hide
Query:  LYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACS
        +Y K G+   AR +F  M  K+ VTWN ++ GY+ + ++D     F +MP + L++WT MI+G  K G+ E+ L  F +M++    P   A+  A+ AC+
Subjt:  LYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACS

Query:  VLGAL
         LGAL
Subjt:  VLGAL

Arabidopsis top hitse value%identityAlignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.9e-1937.14Show/hide
Query:  LYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACS
        +Y K G+   AR +F  M  K+ VTWN ++ GY+ + ++D     F +MP + L++WT MI+G  K G+ E+ L  F +M++    P   A+  A+ AC+
Subjt:  LYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACS

Query:  VLGAL
         LGAL
Subjt:  VLGAL

AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.0e-1930.88Show/hide
Query:  TFTFTLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGA
        T   ++Y +NG+++ A K+F + P +D+V++ A++ GY +   ++  +  F E+P K +++W  MISG A+ G  ++ L+LF  M   +  P +  +   
Subjt:  TFTFTLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGA

Query:  ITACSVLGALE-NRQLHAQLVHLGHSSTLSDGNAMI
        ++AC+  G++E  RQ+H  +   G  S L   NA+I
Subjt:  ITACSVLGALE-NRQLHAQLVHLGHSSTLSDGNAMI

AT1G25360.1 Pentatricopeptide repeat (PPR) superfamily protein6.3e-4154.42Show/hide
Query:  RLDDFTFTF-----TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDY
        R +DF+F F     +LY K GK D AR IF +MP KD+V+WNA+LSGYV++  + E K  F EM  K +L+W +MISGLA+NGFGE+GLKLF+ M+ + +
Subjt:  RLDDFTFTF-----TLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDY

Query:  XPCDYALAGAITACSVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS
         PCDYA +GAI +C+VLGA  N +Q HAQL+ +G  S+LS GNA+I+
Subjt:  XPCDYALAGAITACSVLGALEN-RQLHAQLVHLGHSSTLSDGNAMIS

AT5G15300.1 Pentatricopeptide repeat (PPR) superfamily protein4.0e-1934.43Show/hide
Query:  YCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACSV
        Y K GK+D A ++F EMP KD V WN +++G +  + MD  +  F     K ++TW  MISG    G+ ++ L +F +MR     P    +   ++AC+V
Subjt:  YCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACSV

Query:  LGALE-NRQLHAQLVHLGHSST
        LG LE  ++LH  ++     S+
Subjt:  LGALE-NRQLHAQLVHLGHSST

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.9e-1936.84Show/hide
Query:  YCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACSV
        Y   G    A  +F  +P  D V+WN+++ GYV A +MD   + F +M  K  ++WT MISG  +    ++ L+LF++M+  D  P + +LA A++AC+ 
Subjt:  YCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYXPCDYALAGAITACSV

Query:  LGALE-NRQLHAQL
        LGALE  + +H+ L
Subjt:  LGALE-NRQLHAQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CGGCTTGATGACTTTACATTTACATTTACTTTGTACTGCAAGAATGGTAAAGTTGATGGGGCACGGAAGATTTTTTATGAGATGCCAGTTAAAGATATCGTTACTTGGAA
TGCAATCTTATCAGGGTATGTGAATGCAAGGCGTATGGACGAGCCAAAATCTTTCTTCGCAGAAATGCCAGGGAAAACCCTTCTTACTTGGACTGTGATGATTTCAGGAT
TAGCAAAAAATGGATTTGGGGAAGATGGTTTGAAGCTGTTTAACCAAATGAGGTTAGACGATTATTAACCCTGTGATTATGCACTTGCAGGAGCCATTACAGCTTGTTCT
GTGCTTGGAGCATTGGAGAATCGCCAGCTCCATGCTCAGCTTGTTCATCTTGGCCACAGTTCAACACTCTCAGATGGCAATGCAATGATCTCA
mRNA sequenceShow/hide mRNA sequence
CGGCTTGATGACTTTACATTTACATTTACTTTGTACTGCAAGAATGGTAAAGTTGATGGGGCACGGAAGATTTTTTATGAGATGCCAGTTAAAGATATCGTTACTTGGAA
TGCAATCTTATCAGGGTATGTGAATGCAAGGCGTATGGACGAGCCAAAATCTTTCTTCGCAGAAATGCCAGGGAAAACCCTTCTTACTTGGACTGTGATGATTTCAGGAT
TAGCAAAAAATGGATTTGGGGAAGATGGTTTGAAGCTGTTTAACCAAATGAGGTTAGACGATTATTAACCCTGTGATTATGCACTTGCAGGAGCCATTACAGCTTGTTCT
GTGCTTGGAGCATTGGAGAATCGCCAGCTCCATGCTCAGCTTGTTCATCTTGGCCACAGTTCAACACTCTCAGATGGCAATGCAATGATCTCA
Protein sequenceShow/hide protein sequence
RLDDFTFTFTLYCKNGKVDGARKIFYEMPVKDIVTWNAILSGYVNARRMDEPKSFFAEMPGKTLLTWTVMISGLAKNGFGEDGLKLFNQMRLDDYUPCDYALAGAITACS
VLGALENRQLHAQLVHLGHSSTLSDGNAMIS