; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g00190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g00190
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionpentatricopeptide repeat-containing protein At2g15820, chloroplastic-like
Genome locationchr3:136418..139586
RNA-Seq ExpressionMoc03g00190
SyntenyMoc03g00190
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0006388 - tRNA splicing, via endonucleolytic cleavage and ligation (biological process)
GO:0010239 - chloroplast mRNA processing (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048564 - photosystem I assembly (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607381.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]4.4e-5972.32Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS   VE L+YD+DSP++SEE  CSPYS  AEGF      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTL+RLLNAQRKWM+QDDAAY+IVHCLRIRENETAFR+
Subjt:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

XP_008465080.1 PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis melo]2.6e-5970.62Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAF+T+TL  SLT SLS  H +F             ++S K R +LPRI AFAS  FV+QL+YD+DSPS+SEEH  SPYSN  +GFHFEN +AS 
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGT++RLLNAQRKWM QDDA YL VHCLRIRENETAFR+
Subjt:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

XP_022158727.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Momordica charantia]1.7e-7184.34Show/hide
Query:  TLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALE
        TLF SLTHSL   HRHFR            TFSAKRRPKLPRI AFAS   V QLLYD+DSPSDSEEHSCSPYSN A+GFHFENS+ASADLKHLGNPALE
Subjt:  TLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALE

Query:  VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFR+
Subjt:  VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata]1.5e-5972.88Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS   VE L+YD+DSP++SEE  CSPYS  AEGF      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTL+RLLNAQRKWM+QDDAAYLIVHCLRIRENETAFR+
Subjt:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

XP_022998786.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima]3.1e-6073.45Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS   VE L+YD+DSP++SEE  CSPYSN AE F      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTL+RLLNAQRKWM+QDDAAYLIVHCLRIRENETAFR+
Subjt:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

TrEMBL top hitse value%identityAlignment
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein6.9e-5867.8Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAF+T+T   SLT SLS  H +F              +S K R +LPRI AFAS  FV+QL+YD DSPS+SEEH  S +SN  +GFHFEN +AS 
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        DLKHLG P LEVKELDELPEQWRRSK+AWLCKELPA KPGT++RLLNAQ+KWM QDDA YLIVHCLRIRENETAFR+
Subjt:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic1.3e-5970.62Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAF+T+TL  SLT SLS  H +F             ++S K R +LPRI AFAS  FV+QL+YD+DSPS+SEEH  SPYSN  +GFHFEN +AS 
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGT++RLLNAQRKWM QDDA YL VHCLRIRENETAFR+
Subjt:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

A0A6J1DXY9 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like8.4e-7284.34Show/hide
Query:  TLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALE
        TLF SLTHSL   HRHFR            TFSAKRRPKLPRI AFAS   V QLLYD+DSPSDSEEHSCSPYSN A+GFHFENS+ASADLKHLGNPALE
Subjt:  TLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALE

Query:  VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFR+
Subjt:  VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic7.3e-6072.88Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS   VE L+YD+DSP++SEE  CSPYS  AEGF      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTL+RLLNAQRKWM+QDDAAYLIVHCLRIRENETAFR+
Subjt:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic1.5e-6073.45Show/hide
Query:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA
        MSI TSAFAT+TL  SLT   SQ H HFR            T+SAK R +LPRIPAFAS   VE L+YD+DSP++SEE  CSPYSN AE F      ASA
Subjt:  MSICTSAFATLTLFHSLTHSLSQRHRHFR------------TFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASA

Query:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTL+RLLNAQRKWM+QDDAAYLIVHCLRIRENETAFR+
Subjt:  DLKHLGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

SwissProt top hitse value%identityAlignment
Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic7.9e-2750.39Show/hide
Query:  PRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKH-LGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQR
        P IPA AS   +E L+ D D   + E+           G     ++A+AD +  + +P L V EL+ELPEQWRRS++AWLCKELPA+K  T  R+LNAQR
Subjt:  PRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKH-LGNPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQR

Query:  KWMRQDDAAYLIVHCLRIRENETAFRL
        KW+ QDDA Y+ VHCLRIR N+ AFR+
Subjt:  KWMRQDDAAYLIVHCLRIRENETAFRL

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic3.3e-2544.58Show/hide
Query:  SAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKL--PRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPAL----E
        S+++  +L     H++      F + S+ R P L      A  S  FVE L       ++SEE       + A GF    S A  D++++    +    E
Subjt:  SAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKL--PRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPAL----E

Query:  VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        V+EL+ELPE+WRRSKLAWLCKE+P HK  TLVRLLNAQ+KW+RQ+DA Y+ VHC+RIRENET FR+
Subjt:  VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL

Arabidopsis top hitse value%identityAlignment
AT2G15820.1 endonucleases2.4e-2644.58Show/hide
Query:  SAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKL--PRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPAL----E
        S+++  +L     H++      F + S+ R P L      A  S  FVE L       ++SEE       + A GF    S A  D++++    +    E
Subjt:  SAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKL--PRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPAL----E

Query:  VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL
        V+EL+ELPE+WRRSKLAWLCKE+P HK  TLVRLLNAQ+KW+RQ+DA Y+ VHC+RIRENET FR+
Subjt:  VKELDELPEQWRRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCATTTGCACCTCTGCCTTTGCCACTCTCACTCTTTTCCATTCTCTCACTCATTCCCTCTCTCAACGCCATCGCCACTTTCGAACATTTTCCGCAAAACGACGACC
GAAACTTCCGCGAATTCCTGCCTTCGCTTCATGTTTCTTCGTCGAACAGTTGTTATACGACCAGGATTCCCCGTCCGACTCTGAGGAGCACTCGTGTTCTCCATACAGTA
ACAGGGCTGAGGGTTTTCATTTTGAAAATAGTTATGCGTCGGCAGATTTGAAACACTTGGGAAATCCTGCGCTTGAAGTCAAAGAGCTGGACGAGTTGCCGGAGCAGTGG
CGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCGCATAAGCCGGGAACCTTAGTTCGGCTGCTTAATGCTCAGCGGAAATGGATGAGGCAGGATGATGCGGC
CTATCTCATCGTGCATTGTTTGCGTATTCGTGAGAACGAGACTGCGTTTAGGCTGCTCCAAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCATTTGCACCTCTGCCTTTGCCACTCTCACTCTTTTCCATTCTCTCACTCATTCCCTCTCTCAACGCCATCGCCACTTTCGAACATTTTCCGCAAAACGACGACC
GAAACTTCCGCGAATTCCTGCCTTCGCTTCATGTTTCTTCGTCGAACAGTTGTTATACGACCAGGATTCCCCGTCCGACTCTGAGGAGCACTCGTGTTCTCCATACAGTA
ACAGGGCTGAGGGTTTTCATTTTGAAAATAGTTATGCGTCGGCAGATTTGAAACACTTGGGAAATCCTGCGCTTGAAGTCAAAGAGCTGGACGAGTTGCCGGAGCAGTGG
CGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCGCATAAGCCGGGAACCTTAGTTCGGCTGCTTAATGCTCAGCGGAAATGGATGAGGCAGGATGATGCGGC
CTATCTCATCGTGCATTGTTTGCGTATTCGTGAGAACGAGACTGCGTTTAGGCTGCTCCAAAATTAA
Protein sequenceShow/hide protein sequence
MSICTSAFATLTLFHSLTHSLSQRHRHFRTFSAKRRPKLPRIPAFASCFFVEQLLYDQDSPSDSEEHSCSPYSNRAEGFHFENSYASADLKHLGNPALEVKELDELPEQW
RRSKLAWLCKELPAHKPGTLVRLLNAQRKWMRQDDAAYLIVHCLRIRENETAFRLLQN