CuGenDBv2

CSPI03G28420 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G28420
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr3:26014879..26015894
RNA-Seq Expression: CSPI03G28420
Synteny: CSPI03G28420
Gene Ontology terms:
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits
KAA0039298.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e-value 7.7e-43; identity 89.62%)
Query:  MLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMS
        MLPF SR+SIESFATL SVSLSSSQV PAYSS FLLESVDYVKLVQSATKTG LN GKLVHSHMI+TSFR CLFLQNNLLN+YCKCGD RSADKLFDKMS
Subjt:  MLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMS

Query:  KSNIVT
        KSNIVT
Subjt:  KSNIVT

KAE8648993.1 hypothetical protein Csa_009190 [Cucumis sativus] (e-value 5.2e-47; identity 95.28%)
Query:  MLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMS
        MLPFSSRQSIESFATLGSVSLSSSQV PAYSS FLLESVDYVKLVQSATKTGKLNHGKLVHSHMI+TSFR CLFLQNNLLN+YCKCGDTRSADKLFDKMS
Subjt:  MLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMS

Query:  KSNIVT
        KSNIVT
Subjt:  KSNIVT

XP_008459568.1 PREDICTED: pentatricopeptide repeat-containing protein At3g13880 [Cucumis melo] (e-value 7.5e-54; identity 88.19%)
Query:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL
        MLP K+FLWRF P STPLMFHMLPF SR+SIESFATL SVSLSSSQV PAYSS FLLESVDYVKLVQSATKTG LN GKLVHSHMI+TSFR CLFLQNNL
Subjt:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL

Query:  LNVYCKCGDTRSADKLFDKMSKSNIVT
        LN+YCKCGD RSADKLFDKMSKSNIVT
Subjt:  LNVYCKCGDTRSADKLFDKMSKSNIVT

XP_031741772.1 pentatricopeptide repeat-containing protein At3g13880 [Cucumis sativus] (e-value 7.0e-60; identity 95.28%)
Query:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL
        MLP KQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQV PAYSS FLLESVDYVKLVQSATKTGKLNHGKLVHSHMI+TSFR CLFLQNNL
Subjt:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL

Query:  LNVYCKCGDTRSADKLFDKMSKSNIVT
        LN+YCKCGDTRSADKLFDKMSKSNIVT
Subjt:  LNVYCKCGDTRSADKLFDKMSKSNIVT

XP_038889992.1 pentatricopeptide repeat-containing protein At3g13880 [Benincasa hispida] (e-value 4.0e-47; identity 79.53%)
Query:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL
        MLP K FLWRF P STP +F MLP SSRQ IESFATL  +SLSSSQV P++S  FLLES DYVKLVQSA KTG LNHGKLVHSHMI+TSFR CLFLQNNL
Subjt:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL

Query:  LNVYCKCGDTRSADKLFDKMSKSNIVT
        LN+YCKCGDT SADKLFDKMSK NI+T
Subjt:  LNVYCKCGDTRSADKLFDKMSKSNIVT

TrEMBL top hits
A0A0A0KSU3 DYW_deaminase domain-containing protein (e-value 3.4e-60; identity 95.28%)
Query:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL
        MLP KQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQV PAYSS FLLESVDYVKLVQSATKTGKLNHGKLVHSHMI+TSFR CLFLQNNL
Subjt:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL

Query:  LNVYCKCGDTRSADKLFDKMSKSNIVT
        LN+YCKCGDTRSADKLFDKMSKSNIVT
Subjt:  LNVYCKCGDTRSADKLFDKMSKSNIVT

A0A0A0LBY6 Uncharacterized protein (e-value 2.0e-60; identity 78.53%)
Query:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL
        MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKL                                NNL
Subjt:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL

Query:  LNVYCKCGDTRSADKLFDKMSKSNIVTQLGEDSYWGAICETENSYFFHFSRNSIDLCTSKVLQ
        LNVYCKCGDTRSA KLFD MSKSNIVTQLG DSYWGAICETENSYFFHFSRNSIDLCTSKVLQ
Subjt:  LNVYCKCGDTRSADKLFDKMSKSNIVTQLGEDSYWGAICETENSYFFHFSRNSIDLCTSKVLQ

A0A1S3CAH5 pentatricopeptide repeat-containing protein At3g13880 (e-value 3.6e-54; identity 88.19%)
Query:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL
        MLP K+FLWRF P STPLMFHMLPF SR+SIESFATL SVSLSSSQV PAYSS FLLESVDYVKLVQSATKTG LN GKLVHSHMI+TSFR CLFLQNNL
Subjt:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL

Query:  LNVYCKCGDTRSADKLFDKMSKSNIVT
        LN+YCKCGD RSADKLFDKMSKSNIVT
Subjt:  LNVYCKCGDTRSADKLFDKMSKSNIVT

A0A5D3BP01 Pentatricopeptide repeat-containing protein (e-value 3.7e-43; identity 89.62%)
Query:  MLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMS
        MLPF SR+SIESFATL SVSLSSSQV PAYSS FLLESVDYVKLVQSATKTG LN GKLVHSHMI+TSFR CLFLQNNLLN+YCKCGD RSADKLFDKMS
Subjt:  MLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMS

Query:  KSNIVT
        KSNIVT
Subjt:  KSNIVT

A0A6J1I7D1 pentatricopeptide repeat-containing protein At3g13880 (e-value 1.5e-39; identity 70.87%)
Query:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL
        MLP K F+WRF   S   MF MLP  SRQ IESFAT    SL  SQV P YS  FLLES DYVKLVQSATKTG LNHGKLVH+HMI+T F+ CLFLQNNL
Subjt:  MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNL

Query:  LNVYCKCGDTRSADKLFDKMSKSNIVT
        LN+YCKCGD  SADKLF+KM K NI+T
Subjt:  LNVYCKCGDTRSADKLFDKMSKSNIVT

SwissProt top hits
A8MQA3 Pentatricopeptide repeat-containing protein At4g21065 (e-value 7.1e-07; identity 32.65%)
Query:  IESFATLGSVSLSSSQVLPAYSSNFLLE--SVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIV
        I  +A +G+ S+S+  +      + L+E  +  Y  L+++ T    +  G+ +HS +IR+ F   +++QN+LL++Y  CGD  SA K+FDKM + ++V
Subjt:  IESFATLGSVSLSSSQVLPAYSSNFLLE--SVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIV

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial (e-value 5.4e-07; identity 37.88%)
Query:  YVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVT
        Y  L  + + TG L  GK VH++MI++  +L  F  N LL++Y K G    A K+FD+++K ++V+
Subjt:  YVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVT

Q9LRV9 Pentatricopeptide repeat-containing protein At3g13880 (e-value 2.5e-12; identity 37.72%)
Query:  MLPFSSRQSIESFATLGSVSLSSSQVLPAY--------SSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSA
        +L F ++    + A    V+L + +V   Y          N  L+S  Y  L Q+A K+G +  GKL H HMI++S   CL+L NNLLN+YCKC +   A
Subjt:  MLPFSSRQSIESFATLGSVSLSSSQVLPAY--------SSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSA

Query:  DKLFDKMSKSNIVT
         +LFD+M + NI++
Subjt:  DKLFDKMSKSNIVT

Q9LYU9 Pentatricopeptide repeat-containing protein At5g13270, chloroplastic (e-value 3.5e-06; identity 36.11%)
Query:  LESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVTQ
        + S  Y  L ++  +   L+HG+L+H  M        + LQN +L +YC+C     ADKLFD+MS+ N V++
Subjt:  LESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVTQ

Q9ZUT4 Pentatricopeptide repeat-containing protein At2g37320 (e-value 1.2e-06; identity 31.82%)
Query:  YVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVT
        +  L+ + T +G L  G+ VH   +    +  L + N+L+++YCKCGD + A ++FD+ S  ++V+
Subjt:  YVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVT

Arabidopsis top hits
AT2G37320.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value 8.6e-08; identity 31.82%)
Query:  YVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVT
        +  L+ + T +G L  G+ VH   +    +  L + N+L+++YCKCGD + A ++FD+ S  ++V+
Subjt:  YVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVT

AT3G13880.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value 1.8e-13; identity 37.72%)
Query:  MLPFSSRQSIESFATLGSVSLSSSQVLPAY--------SSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSA
        +L F ++    + A    V+L + +V   Y          N  L+S  Y  L Q+A K+G +  GKL H HMI++S   CL+L NNLLN+YCKC +   A
Subjt:  MLPFSSRQSIESFATLGSVSLSSSQVLPAY--------SSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSA

Query:  DKLFDKMSKSNIVT
         +LFD+M + NI++
Subjt:  DKLFDKMSKSNIVT

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value 3.9e-08; identity 37.88%)
Query:  YVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVT
        Y  L  + + TG L  GK VH++MI++  +L  F  N LL++Y K G    A K+FD+++K ++V+
Subjt:  YVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVT

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value 5.0e-08; identity 32.65%)
Query:  IESFATLGSVSLSSSQVLPAYSSNFLLE--SVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIV
        I  +A +G+ S+S+  +      + L+E  +  Y  L+++ T    +  G+ +HS +IR+ F   +++QN+LL++Y  CGD  SA K+FDKM + ++V
Subjt:  IESFATLGSVSLSSSQVLPAYSSNFLLE--SVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIV

AT5G13270.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value 2.5e-07; identity 36.11%)
Query:  LESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVTQ
        + S  Y  L ++  +   L+HG+L+H  M        + LQN +L +YC+C     ADKLFD+MS+ N V++
Subjt:  LESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDTRSADKLFDKMSKSNIVTQ


Sequences
CDS sequence
ATGTTGCCTCATAAACAATTCCTTTGGAGATTCTTTCCTTTCTCCACTCCCTTAATGTTTCACATGCTTCCTTTTTCCAGTAGACAATCCATAGAGTCTTTTGCCACTTT
GGGATCGGTCTCATTGAGCTCATCACAAGTTTTGCCAGCATATTCTTCGAATTTTCTTTTAGAATCTGTAGACTATGTTAAACTGGTTCAATCAGCTACTAAAACTGGGA
AGTTGAACCATGGCAAACTCGTTCATTCCCATATGATTAGAACTTCTTTCAGGCTCTGTCTATTTTTGCAGAACAATCTTCTTAACGTGTACTGCAAATGTGGGGATACA
CGTTCTGCTGACAAATTGTTTGATAAAATGTCAAAATCAAACATTGTTACACAGCTTGGTGAAGACTCATATTGGGGAGCCATTTGTGAAACTGAGAATTCCTATTTCTT
TCACTTCTCTCGGAATTCCATTGATTTGTGCACCAGTAAAGTCCTCCAGATCAGTCTCGCCACTATATTAGGCTAA
mRNA sequence
ATGTTGCCTCATAAACAATTCCTTTGGAGATTCTTTCCTTTCTCCACTCCCTTAATGTTTCACATGCTTCCTTTTTCCAGTAGACAATCCATAGAGTCTTTTGCCACTTT
GGGATCGGTCTCATTGAGCTCATCACAAGTTTTGCCAGCATATTCTTCGAATTTTCTTTTAGAATCTGTAGACTATGTTAAACTGGTTCAATCAGCTACTAAAACTGGGA
AGTTGAACCATGGCAAACTCGTTCATTCCCATATGATTAGAACTTCTTTCAGGCTCTGTCTATTTTTGCAGAACAATCTTCTTAACGTGTACTGCAAATGTGGGGATACA
CGTTCTGCTGACAAATTGTTTGATAAAATGTCAAAATCAAACATTGTTACACAGCTTGGTGAAGACTCATATTGGGGAGCCATTTGTGAAACTGAGAATTCCTATTTCTT
TCACTTCTCTCGGAATTCCATTGATTTGTGCACCAGTAAAGTCCTCCAGATCAGTCTCGCCACTATATTAGGCTAA
Protein sequence
MLPHKQFLWRFFPFSTPLMFHMLPFSSRQSIESFATLGSVSLSSSQVLPAYSSNFLLESVDYVKLVQSATKTGKLNHGKLVHSHMIRTSFRLCLFLQNNLLNVYCKCGDT
RSADKLFDKMSKSNIVTQLGEDSYWGAICETENSYFFHFSRNSIDLCTSKVLQISLATILG