; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G03980 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G03980
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr7:3104579..3105025
RNA-Seq ExpressionCSPI07G03980
SyntenyCSPI07G03980
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022135919.1 pentatricopeptide repeat-containing protein At3g12770-like isoform X1 [Momordica charantia]1.0e-5477.94Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS+GTNAYKALA+YRKM+  GF ADN+T  FVLKACG+LGL  MG +IH RVEVCGW+SDIYVNNSL+A+Y+KFGN+  ARK+FDKMP RDLTSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NTMISGYVRNNNA +AL IFY MGK+G KADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

XP_022135920.1 pentatricopeptide repeat-containing protein At3g12770-like isoform X2 [Momordica charantia]1.0e-5477.94Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS+GTNAYKALA+YRKM+  GF ADN+T  FVLKACG+LGL  MG +IH RVEVCGW+SDIYVNNSL+A+Y+KFGN+  ARK+FDKMP RDLTSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NTMISGYVRNNNA +AL IFY MGK+G KADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

XP_022135922.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X3 [Momordica charantia]1.0e-5477.94Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS+GTNAYKALA+YRKM+  GF ADN+T  FVLKACG+LGL  MG +IH RVEVCGW+SDIYVNNSL+A+Y+KFGN+  ARK+FDKMP RDLTSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NTMISGYVRNNNA +AL IFY MGK+G KADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

XP_022968999.1 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucurbita maxima]1.1e-5380.15Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS G  AYKALAMYRKML  G   DN+T  FVLKACG+LGL  MG EIH +VEVCGWDSDIYVNNSL+A+YLKFGN+DVARK+FDKMP RD TSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NT+ISGYVRNNNA EAL IFYLMGK+GLKADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

XP_038887372.1 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benincasa hispida]3.5e-6084.56Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS+GTNAYKALAMYRKMLAFG+NADN+T  FVLKACG+LGL  +GMEIH +VE+CGW+SDIYVNNSL+A+YLKFGNVDVARK+FDKMP RDLTSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NTMISGYVRNNN+ EALTIFYLM K GLKADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

TrEMBL top hitse value%identityAlignment
A0A6J1C2E2 pentatricopeptide repeat-containing protein At3g12770-like isoform X24.8e-5577.94Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS+GTNAYKALA+YRKM+  GF ADN+T  FVLKACG+LGL  MG +IH RVEVCGW+SDIYVNNSL+A+Y+KFGN+  ARK+FDKMP RDLTSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NTMISGYVRNNNA +AL IFY MGK+G KADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

A0A6J1C2U5 pentatricopeptide repeat-containing protein At3g12770-like isoform X14.8e-5577.94Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS+GTNAYKALA+YRKM+  GF ADN+T  FVLKACG+LGL  MG +IH RVEVCGW+SDIYVNNSL+A+Y+KFGN+  ARK+FDKMP RDLTSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NTMISGYVRNNNA +AL IFY MGK+G KADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

A0A6J1C448 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X34.8e-5577.94Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS+GTNAYKALA+YRKM+  GF ADN+T  FVLKACG+LGL  MG +IH RVEVCGW+SDIYVNNSL+A+Y+KFGN+  ARK+FDKMP RDLTSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NTMISGYVRNNNA +AL IFY MGK+G KADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

A0A6J1GMH0 putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial7.0e-5479.41Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS G  AYKALAMYRKML  G   DN+T  FVLKACG+LGL  MG EIH +VEVCGWDSD+YVNNSL+A+YLKFGN+DVARK+FDKMP RD TSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NT+ISGYVRNNNA EAL IFYLMGK+GLKADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

A0A6J1I1A3 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like5.3e-5480.15Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRGYAS G  AYKALAMYRKML  G   DN+T  FVLKACG+LGL  MG EIH +VEVCGWDSDIYVNNSL+A+YLKFGN+DVARK+FDKMP RD TSW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        NT+ISGYVRNNNA EAL IFYLMGK+GLKADGTTLL
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210655.7e-2140.15Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGF-NADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS
        +IRGYA  G N+  A ++YR+M   G    D +T  F++KA   +    +G  IH  V   G+ S IYV NSL+ +Y   G+V  A K+FDKMP +DL +
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGF-NADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS

Query:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        WN++I+G+  N    EAL ++  M   G+K DG T++
Subjt:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

O64705 Pentatricopeptide repeat-containing protein At2g344001.0e-2238.97Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRG  +   +   AL++YR+M   G   D +T +FV  AC  L   G+G  +H  +   G + D+++N+SL+ +Y K G V  ARKLFD++  RD  SW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        N+MISGY     A++A+ +F  M + G + D  TL+
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

P93011 Pentatricopeptide repeat-containing protein At2g337605.2e-2238.02Show/hide
Query:  LAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSWNTMISGYVRNNNARE
        +A YR+ML+   +  NYT + V+K+C +L    +G  +HC   V G+  D YV  +LV  Y K G+++ AR++FD+MP + + +WN+++SG+ +N  A E
Subjt:  LAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSWNTMISGYVRNNNARE

Query:  ALTIFYLMGKSGLKADGTTLL
        A+ +FY M +SG + D  T +
Subjt:  ALTIFYLMGKSGLKADGTTLL

Q0WQW5 Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial2.2e-2040.44Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNA-DNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS
        +IR  A + +   +A  +YRKML  G ++ D +T  FVLKAC  +     G ++HC++   G+  D+YVNN L+ +Y   G +D+ARK+FD+MP R L S
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNA-DNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS

Query:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTL
        WN+MI   VR      AL +F  M +S  + DG T+
Subjt:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTL

Q9SB36 Pentatricopeptide repeat-containing protein At4g25270, chloroplastic2.8e-2040.46Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        +I GYA  G     A+A+Y +M   G   D +T   VLKACG +G   +G  IH  +   G+  D+YV N+LV +Y K G++  AR +FD +P +D  SW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKAD
        N+M++GY+ +    EAL IF LM ++G++ D
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKAD

Arabidopsis top hitse value%identityAlignment
AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-2140.44Show/hide
Query:  MIRGYASEGTNAYKALAMYRKML-AFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS
        ++ GYA +G    +A+ +Y +ML   G   D YT   VL+ CG +     G E+H  V   G++ DI V N+L+ +Y+K G+V  AR LFD+MP RD+ S
Subjt:  MIRGYASEGTNAYKALAMYRKML-AFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS

Query:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTL
        WN MISGY  N    E L +F+ M    +  D  TL
Subjt:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTL

AT1G59720.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-2140.44Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNA-DNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS
        +IR  A + +   +A  +YRKML  G ++ D +T  FVLKAC  +     G ++HC++   G+  D+YVNN L+ +Y   G +D+ARK+FD+MP R L S
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNA-DNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS

Query:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTL
        WN+MI   VR      AL +F  M +S  + DG T+
Subjt:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTL

AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein3.7e-2338.02Show/hide
Query:  LAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSWNTMISGYVRNNNARE
        +A YR+ML+   +  NYT + V+K+C +L    +G  +HC   V G+  D YV  +LV  Y K G+++ AR++FD+MP + + +WN+++SG+ +N  A E
Subjt:  LAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSWNTMISGYVRNNNARE

Query:  ALTIFYLMGKSGLKADGTTLL
        A+ +FY M +SG + D  T +
Subjt:  ALTIFYLMGKSGLKADGTTLL

AT2G34400.1 Pentatricopeptide repeat (PPR-like) superfamily protein7.4e-2438.97Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW
        MIRG  +   +   AL++YR+M   G   D +T +FV  AC  L   G+G  +H  +   G + D+++N+SL+ +Y K G V  ARKLFD++  RD  SW
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSW

Query:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        N+MISGY     A++A+ +F  M + G + D  TL+
Subjt:  NTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.1e-2240.15Show/hide
Query:  MIRGYASEGTNAYKALAMYRKMLAFGF-NADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS
        +IRGYA  G N+  A ++YR+M   G    D +T  F++KA   +    +G  IH  V   G+ S IYV NSL+ +Y   G+V  A K+FDKMP +DL +
Subjt:  MIRGYASEGTNAYKALAMYRKMLAFGF-NADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTS

Query:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL
        WN++I+G+  N    EAL ++  M   G+K DG T++
Subjt:  WNTMISGYVRNNNAREALTIFYLMGKSGLKADGTTLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTAGAGGTTATGCTTCTGAAGGTACTAATGCTTATAAGGCCCTTGCTATGTATCGTAAAATGCTAGCATTTGGATTCAATGCTGACAATTATACATCCTCTTTTGT
ACTAAAGGCGTGTGGTAATCTGGGTCTTTGTGGAATGGGAATGGAAATTCATTGTCGGGTGGAGGTTTGTGGGTGGGACTCAGATATTTATGTGAACAATTCTCTTGTAG
CAGTGTATTTGAAATTTGGGAATGTGGATGTTGCAAGGAAGCTGTTTGATAAAATGCCTGTGAGGGATTTAACTTCTTGGAATACTATGATTTCAGGTTACGTGAGGAAT
AATAACGCCAGGGAAGCTTTAACGATTTTTTATCTAATGGGAAAGTCTGGGTTGAAAGCAGATGGGACGACTTTGCTATTTTACTTTGAATGGCTTCTTGATGAACTCAC
TGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTAGAGGTTATGCTTCTGAAGGTACTAATGCTTATAAGGCCCTTGCTATGTATCGTAAAATGCTAGCATTTGGATTCAATGCTGACAATTATACATCCTCTTTTGT
ACTAAAGGCGTGTGGTAATCTGGGTCTTTGTGGAATGGGAATGGAAATTCATTGTCGGGTGGAGGTTTGTGGGTGGGACTCAGATATTTATGTGAACAATTCTCTTGTAG
CAGTGTATTTGAAATTTGGGAATGTGGATGTTGCAAGGAAGCTGTTTGATAAAATGCCTGTGAGGGATTTAACTTCTTGGAATACTATGATTTCAGGTTACGTGAGGAAT
AATAACGCCAGGGAAGCTTTAACGATTTTTTATCTAATGGGAAAGTCTGGGTTGAAAGCAGATGGGACGACTTTGCTATTTTACTTTGAATGGCTTCTTGATGAACTCAC
TGATTGA
Protein sequenceShow/hide protein sequence
MIRGYASEGTNAYKALAMYRKMLAFGFNADNYTSSFVLKACGNLGLCGMGMEIHCRVEVCGWDSDIYVNNSLVAVYLKFGNVDVARKLFDKMPVRDLTSWNTMISGYVRN
NNAREALTIFYLMGKSGLKADGTTLLFYFEWLLDELTD