; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh11G005630 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh11G005630
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr11:2736948..2737303
RNA-Seq ExpressionCmoCh11G005630
SyntenyCmoCh11G005630
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587915.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.5e-3887.5Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGYGQTSRSKRD
        MIHSGVKPDSYTFPFLFKS AKLASAHGGKQIRA          DAISFTAPIAGYAMWGNTDRARKVF EMPVRDVVSWNAMVAGYGQTSRSKRD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGYGQTSRSKRD

KAG6589865.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.5e-3059.68Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD
        MIH+GV+P+SYTFPFL KSCAKLASA  GKQI A+VLKLGF                                 DAISFTA IAGY +WG  DRARK+FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        EMPVRDVVSWNAM+AGY QT RSK
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

XP_022961045.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita moschata]1.5e-3059.68Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD
        MIH+GV+P+SYTFPFL KSCAKLASA  GKQI A+VLKLGF                                 DAISFTA IAGY +WG  DRARK+FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        EMPVRDVVSWNAM+AGY QT RSK
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

XP_023515625.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita pepo subsp. pepo]1.5e-3059.68Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD
        MIH+GV+P+SYTFPFL KSCAKLASA  GKQI A+VLKLGF                                 DAISFTA IAGY +WG  DRARK+FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        EMPVRDVVSWNAM+AGY QT RSK
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

XP_038878535.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Benincasa hispida]1.1e-3059.68Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD
        MI+SGV+P+SYTFPFL KSCAKLASAH GKQI A++LKLGF                                 DAISFTA IAGYA+WG  D ARK+FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        EMPVRDVVSWNAM+AGY QT RSK
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

TrEMBL top hitse value%identityAlignment
A0A0A0LU28 DYW_deaminase domain-containing protein2.0e-3058.87Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD
        MI+SGV+P+SYTFPFL KSCAKLASAH GKQI A+VLKLGF                                 DAISFTA IAGYA+WG  DRAR++FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        EMPV+DVVSWNAM+AGY Q  RSK
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

A0A1S3CMX0 pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.7e-2958.06Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD
        MI+SGV+P+SYTFPFL KSCAKLASA  GKQI A+VLKLGF                                 DAISFTA IAGYA+WG  DRAR++FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        EMPV+DVVSWNAM+AGY Q  RSK
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

A0A5A7TTJ1 Pentatricopeptide repeat-containing protein1.7e-2958.06Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD
        MI+SGV+P+SYTFPFL KSCAKLASA  GKQI A+VLKLGF                                 DAISFTA IAGYA+WG  DRAR++FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        EMPV+DVVSWNAM+AGY Q  RSK
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

A0A6J1HAU9 pentatricopeptide repeat-containing protein At1g08070, chloroplastic7.0e-3159.68Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD
        MIH+GV+P+SYTFPFL KSCAKLASA  GKQI A+VLKLGF                                 DAISFTA IAGY +WG  DRARK+FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        EMPVRDVVSWNAM+AGY QT RSK
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

A0A6J1JHE6 pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.6e-3058.87Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD
        MIH+GV+P+SYTFPFL KSCA+LASA  GKQI A+VLKLGF                                 DAISFTA IAGY +WG  DRARK+FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGF------------------------------GLMDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        EMPVRDVVSWNAM+AGY QT RSK
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

SwissProt top hitse value%identityAlignment
O49619 Pentatricopeptide repeat-containing protein At4g35130, chloroplastic3.6e-1647.13Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY
        M+ +GVK D++T+PF+ KS A ++S   GK+I A V+KLGF + D     + I+ Y   G    A KVF+EMP RD+VSWN+M++GY
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic9.2e-2041.94Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGL------------------------------MDAISFTAPIAGYAMWGNTDRARKVFD
        MI  G+ P+SYTFPF+ KSCAK  +   G+QI  +VLKLG  L                               D +S+TA I GYA  G  + A+K+FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGL------------------------------MDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        E+PV+DVVSWNAM++GY +T   K
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226901.1e-1542.55Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGYGQTSRSK
        M++SG+ PD YTFPF   +CAK  +   G QI   ++K+G+   D     + +  YA  G  D ARKVFDEM  R+VVSW +M+ GY +   +K
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGYGQTSRSK

Q9LUS3 Pentatricopeptide repeat-containing protein At3g166103.1e-1545.98Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY
        M++SGV+P  YT+PF+ K+CA L +   GK I ++V    F   D    TA +  YA  G  + A KVFDEMP RD+V+WNAM++G+
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.6e-1647.13Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY
        M+ SGV+ DSYTF  + KS + L S HGG+Q+  ++LK GFG  +++   + +A Y      D ARKVFDEM  RDV+SWN+++ GY
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.5e-2141.94Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGL------------------------------MDAISFTAPIAGYAMWGNTDRARKVFD
        MI  G+ P+SYTFPF+ KSCAK  +   G+QI  +VLKLG  L                               D +S+TA I GYA  G  + A+K+FD
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGL------------------------------MDAISFTAPIAGYAMWGNTDRARKVFD

Query:  EMPVRDVVSWNAMVAGYGQTSRSK
        E+PV+DVVSWNAM++GY +T   K
Subjt:  EMPVRDVVSWNAMVAGYGQTSRSK

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)7.5e-1742.55Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGYGQTSRSK
        M++SG+ PD YTFPF   +CAK  +   G QI   ++K+G+   D     + +  YA  G  D ARKVFDEM  R+VVSW +M+ GY +   +K
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGYGQTSRSK

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification7.5e-1742.55Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGYGQTSRSK
        M++SG+ PD YTFPF   +CAK  +   G QI   ++K+G+   D     + +  YA  G  D ARKVFDEM  R+VVSW +M+ GY +   +K
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGYGQTSRSK

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-1747.13Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY
        M+ SGV+ DSYTF  + KS + L S HGG+Q+  ++LK GFG  +++   + +A Y      D ARKVFDEM  RDV+SWN+++ GY
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY

AT4G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-1747.13Show/hide
Query:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY
        M+ +GVK D++T+PF+ KS A ++S   GK+I A V+KLGF + D     + I+ Y   G    A KVF+EMP RD+VSWN+M++GY
Subjt:  MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCATTCGGGAGTCAAGCCGGATTCTTATACTTTTCCTTTTCTATTCAAGTCTTGCGCGAAGCTAGCCTCTGCCCATGGAGGGAAACAGATTCGTGCGTATGTCTT
GAAGCTTGGGTTTGGTTTGATGGATGCAATTTCTTTCACTGCACCAATTGCGGGTTACGCTATGTGGGGTAATACGGATCGCGCACGGAAAGTGTTTGATGAAATGCCTG
TTAGAGACGTGGTGTCTTGGAATGCTATGGTTGCTGGCTATGGGCAAACTAGTCGATCCAAGAGGGATCGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTCATTCGGGAGTCAAGCCGGATTCTTATACTTTTCCTTTTCTATTCAAGTCTTGCGCGAAGCTAGCCTCTGCCCATGGAGGGAAACAGATTCGTGCGTATGTCTT
GAAGCTTGGGTTTGGTTTGATGGATGCAATTTCTTTCACTGCACCAATTGCGGGTTACGCTATGTGGGGTAATACGGATCGCGCACGGAAAGTGTTTGATGAAATGCCTG
TTAGAGACGTGGTGTCTTGGAATGCTATGGTTGCTGGCTATGGGCAAACTAGTCGATCCAAGAGGGATCGTTAA
Protein sequenceShow/hide protein sequence
MIHSGVKPDSYTFPFLFKSCAKLASAHGGKQIRAYVLKLGFGLMDAISFTAPIAGYAMWGNTDRARKVFDEMPVRDVVSWNAMVAGYGQTSRSKRDR