; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G34320 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G34320
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr1:29221957..29224927
RNA-Seq ExpressionCSPI01G34320
SyntenyCSPI01G34320
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005839 - proteasome core complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD1820551.1 unnamed protein product [Ananas comosus var. bracteatus]1.6e-1448.39Show/hide
Query:  KCQPSTDIYTMLINVYGKVSL-----------------------------NSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHE
        +C PS + YT++IN+YGK  L                              S AGFP+G++EIFSLMQ+MGC PDRASYNI+VDA+GRAGLHE
Subjt:  KCQPSTDIYTMLINVYGKVSL-----------------------------NSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHE

KAB5532329.1 hypothetical protein DKX38_018999 [Salix brachista]7.9e-1449.48Show/hide
Query:  CQPSTDIYTMLINVYGKVS----------------------------------LNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHE
        CQP TD YT+LIN++GKVS                                  L     FP GA EIFSLM++MGC PDRASYNIMVDAYGRAGLHE
Subjt:  CQPSTDIYTMLINVYGKVS----------------------------------LNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHE

KAE8653657.1 hypothetical protein Csa_007366 [Cucumis sativus]1.4e-47100Show/hide
Query:  MAAKCQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHEGRFSHQRFKDVPENADNHGRSVIFRL
        MAAKCQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHEGRFSHQRFKDVPENADNHGRSVIFRL
Subjt:  MAAKCQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHEGRFSHQRFKDVPENADNHGRSVIFRL

XP_022142649.1 pentatricopeptide repeat-containing protein At2g35130 [Momordica charantia]7.9e-1442.52Show/hide
Query:  CQPSTDIYTMLINVYGKVSLN----------------------------------------------------------------SCAGFPFGATEIFSL
        CQPSTD YTMLINVYGK S +                                                                S AGFP+GA EIFSL
Subjt:  CQPSTDIYTMLINVYGKVSLN----------------------------------------------------------------SCAGFPFGATEIFSL

Query:  MQYMGCYPDRASYNIMVDAYGRAGLHE
        MQ+MGC PDRASYNIMVDAYGRAGLHE
Subjt:  MQYMGCYPDRASYNIMVDAYGRAGLHE

XP_024925143.1 pentatricopeptide repeat-containing protein At2g35130 isoform X2 [Ziziphus jujuba]1.6e-1445.69Show/hide
Query:  CQPSTDIYTMLINVYGKVSL------------------NSC-----------------------------------AGFPFGATEIFSLMQYMGCYPDRA
        CQPSTD YTMLIN+YGK S                   N C                                   AGFP+GA EIFSLMQ+MGC PDRA
Subjt:  CQPSTDIYTMLINVYGKVSL------------------NSC-----------------------------------AGFPFGATEIFSLMQYMGCYPDRA

Query:  SYNIMVDAYGRAGLHE
        SYNI+VDAYGRAGLHE
Subjt:  SYNIMVDAYGRAGLHE

TrEMBL top hitse value%identityAlignment
A0A0A0LYJ9 Uncharacterized protein4.1e-32100Show/hide
Query:  MAAKCQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHEG
        MAAKCQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHEG
Subjt:  MAAKCQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHEG

A0A2P2L2Y1 Uncharacterized protein2.9e-1456.94Show/hide
Query:  QPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHEGRFSHQRFKD
        +P    Y  L+  YG+      AGFP+G+ EIFSLMQ+MGC PDRASYNIMVDAYGRAGLH+G  + +R+++
Subjt:  QPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHEGRFSHQRFKD

A0A6J1CM30 pentatricopeptide repeat-containing protein At2g351303.8e-1442.52Show/hide
Query:  CQPSTDIYTMLINVYGKVSLN----------------------------------------------------------------SCAGFPFGATEIFSL
        CQPSTD YTMLINVYGK S +                                                                S AGFP+GA EIFSL
Subjt:  CQPSTDIYTMLINVYGKVSLN----------------------------------------------------------------SCAGFPFGATEIFSL

Query:  MQYMGCYPDRASYNIMVDAYGRAGLHE
        MQ+MGC PDRASYNIMVDAYGRAGLHE
Subjt:  MQYMGCYPDRASYNIMVDAYGRAGLHE

A0A6P6FTH9 pentatricopeptide repeat-containing protein At2g35130 isoform X27.7e-1545.69Show/hide
Query:  CQPSTDIYTMLINVYGKVSL------------------NSC-----------------------------------AGFPFGATEIFSLMQYMGCYPDRA
        CQPSTD YTMLIN+YGK S                   N C                                   AGFP+GA EIFSLMQ+MGC PDRA
Subjt:  CQPSTDIYTMLINVYGKVSL------------------NSC-----------------------------------AGFPFGATEIFSLMQYMGCYPDRA

Query:  SYNIMVDAYGRAGLHE
        SYNI+VDAYGRAGLHE
Subjt:  SYNIMVDAYGRAGLHE

A0A6V7NPK6 PPR_long domain-containing protein7.7e-1548.39Show/hide
Query:  KCQPSTDIYTMLINVYGKVSL-----------------------------NSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHE
        +C PS + YT++IN+YGK  L                              S AGFP+G++EIFSLMQ+MGC PDRASYNI+VDA+GRAGLHE
Subjt:  KCQPSTDIYTMLINVYGKVSL-----------------------------NSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHE

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351303.3e-1562.3Show/hide
Query:  QPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLH
        +P   +Y  L+  Y +      AG+P+GA EIFSLMQ+MGC PDRASYNIMVDAYGRAGLH
Subjt:  QPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLH

Q8GYP6 Pentatricopeptide repeat-containing protein At1g189005.3e-0538.33Show/hide
Query:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG
        CQP+T  Y  LI+ YG+      A +   A  +F+ MQ  GC PDR +Y  ++D + +AG
Subjt:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG

Q9SSF9 Pentatricopeptide repeat-containing protein At1g747501.4e-0531.91Show/hide
Query:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG-LHEGRFSHQRFKDVPENADNHGRSVIFRLGSKS
        C+P+T  Y  LI+ YG+      A +   A  +F+ MQ  GC PDR +Y  ++D + +AG L      +QR ++   + D    SVI     K+
Subjt:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG-LHEGRFSHQRFKDVPENADNHGRSVIFRLGSKS

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein3.8e-0638.33Show/hide
Query:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG
        CQP+T  Y  LI+ YG+      A +   A  +F+ MQ  GC PDR +Y  ++D + +AG
Subjt:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein3.8e-0638.33Show/hide
Query:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG
        CQP+T  Y  LI+ YG+      A +   A  +F+ MQ  GC PDR +Y  ++D + +AG
Subjt:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG

AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-0631.91Show/hide
Query:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG-LHEGRFSHQRFKDVPENADNHGRSVIFRLGSKS
        C+P+T  Y  LI+ YG+      A +   A  +F+ MQ  GC PDR +Y  ++D + +AG L      +QR ++   + D    SVI     K+
Subjt:  CQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAG-LHEGRFSHQRFKDVPENADNHGRSVIFRLGSKS

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-1662.3Show/hide
Query:  QPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLH
        +P   +Y  L+  Y +      AG+P+GA EIFSLMQ+MGC PDRASYNIMVDAYGRAGLH
Subjt:  QPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLH

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-1662.3Show/hide
Query:  QPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLH
        +P   +Y  L+  Y +      AG+P+GA EIFSLMQ+MGC PDRASYNIMVDAYGRAGLH
Subjt:  QPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCAAGTGCCAACCTTCTACAGATATTTACACAATGTTAATAAACGTGTATGGAAAAGTGAGCCTAAACAGTTGTGCTGGTTTTCCATTTGGAGCTACAGAAAT
ATTTTCACTCATGCAATATATGGGATGTTATCCAGATAGAGCTTCATACAACATCATGGTGGATGCATATGGAAGAGCTGGCCTTCATGAAGGAAGGTTCTCCCATCAGC
GCTTTAAGGATGTCCCTGAGAATGCTGACAACCATGGAAGGAGTGTAATTTTCAGATTGGGCTCAAAATCAACAGGAACAGATGTACAATTTTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCCAAGTGCCAACCTTCTACAGATATTTACACAATGTTAATAAACGTGTATGGAAAAGTGAGCCTAAACAGTTGTGCTGGTTTTCCATTTGGAGCTACAGAAAT
ATTTTCACTCATGCAATATATGGGATGTTATCCAGATAGAGCTTCATACAACATCATGGTGGATGCATATGGAAGAGCTGGCCTTCATGAAGGAAGGTTCTCCCATCAGC
GCTTTAAGGATGTCCCTGAGAATGCTGACAACCATGGAAGGAGTGTAATTTTCAGATTGGGCTCAAAATCAACAGGAACAGATGTACAATTTTGGTAA
Protein sequenceShow/hide protein sequence
MAAKCQPSTDIYTMLINVYGKVSLNSCAGFPFGATEIFSLMQYMGCYPDRASYNIMVDAYGRAGLHEGRFSHQRFKDVPENADNHGRSVIFRLGSKSTGTDVQFW