CuGenDBv2

Cucsat.G4879 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G4879
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Pentatricopeptide repeat-containing protein
Genome location: ctg1227:3951656..3952298
RNA-Seq Expression: Cucsat.G4879
Synteny: Cucsat.G4879
Gene Ontology terms:
GO:0008380 - RNA splicing (biological process)
GO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
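
As an illustration of how the Genome location field can be read, the following minimal Python sketch splits a contig:start..end string into its parts and reports the span length. The function names parse_location and span_length are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location):
    """Split 'contig:start..end' into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

def span_length(start, end):
    # Assumes 1-based inclusive coordinates (an assumption, not stated on the page).
    return end - start + 1

contig, start, end = parse_location("ctg1227:3951656..3952298")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene spans 643 bp on ctg1227.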


Homology
GenBank top hits (e-value, % identity, alignment)
TYK06655.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | e-value: 9.13e-79 | % identity: 96.95
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLS YLIGVWLRSSRSVKKLRAVHAFILR+FTSF IYVGNNLLSSYLR+GMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
        ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

XP_004138810.1 pentatricopeptide repeat-containing protein At4g18520, chloroplastic [Cucumis sativus] | e-value: 3.12e-82 | % identity: 100
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
        ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

XP_008441245.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18520 [Cucumis melo] | e-value: 9.13e-79 | % identity: 96.95
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLS YLIGVWLRSSRSVKKLRAVHAFILR+FTSF IYVGNNLLSSYLR+GMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
        ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

XP_023549983.1 pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like [Cucurbita pepo subsp. pepo] | e-value: 7.34e-69 | % identity: 83.21
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLSPYLI  WLRSSR VK+LRA+HAFILR+F+S   YVGNNL+SSYLR GML+DAR+VFDEMPMRSVVTWTAIINGYID DLT+EAL LF DSV+SGV 
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
        ANG+MFVCILNLCAKRLDFELGRQIHG IVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

XP_038884364.1 pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like [Benincasa hispida] | e-value: 9.57e-70 | % identity: 85.5
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLSPYLI VWLRSSRS K+LRA+HAFILR+ TSF IYVGNNL+SSYLR GMLVDARK FDEMP+R+VVTWT IINGYI LD TEEAL LFSDSVK+GV 
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
        ANG+MFVCILNLCAKRLDFELGRQIHGVIVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LSX2 Uncharacterized protein | e-value: 1.51e-82 | % identity: 100
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
        ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

A0A1S3B3M7 pentatricopeptide repeat-containing protein At4g18520 | e-value: 4.42e-79 | % identity: 96.95
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLS YLIGVWLRSSRSVKKLRAVHAFILR+FTSF IYVGNNLLSSYLR+GMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
        ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

A0A5D3C5K1 Pentatricopeptide repeat-containing protein | e-value: 4.42e-79 | % identity: 96.95
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLS YLIGVWLRSSRSVKKLRAVHAFILR+FTSF IYVGNNLLSSYLR+GMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
        ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

A0A6J1FJ52 pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like | e-value: 3.55e-69 | % identity: 83.21
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLSPYLI  WLRSSR VK+LRA+HAFILR+F+S   YVGNNL+SSYLR GML+DAR+VFDEMPMRSVVTWTAIINGYID DLT+EAL LF DSV+SGV 
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
        ANG+MFVCILNLCAKRLDFELGRQIHG IVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

A0A6J1JUB8 pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like | e-value: 2.72e-68 | % identity: 82.44
Query:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL
        GCLSPYLI  WLRSSR VK+LRA+HAFI R+FTS   YVGNNL+SSYLR GML+DAR+VFDEMPMRSVVTWTAIINGYID DLT+EAL LF DSV+SGV 
Subjt:  GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVL

Query:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK
         NG+MFVCILNLCAKRLDFELGRQIHG IVK
Subjt:  ANGQMFVCILNLCAKRLDFELGRQIHGVIVK

SwissProt top hits (e-value, % identity, alignment)
A8MQA3 Pentatricopeptide repeat-containing protein At4g21065 | e-value: 1.5e-15 | % identity: 34.38
Query:  PYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQ
        P+LI   + +   V+    +H+ ++R+     IYV N+LL  Y   G +  A KVFD+MP + +V W ++ING+ +    EEALAL+++    G+  +G 
Subjt:  PYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQ

Query:  MFVCILNLCAKRLDFELGRQIHGVIVKV
          V +L+ CAK     LG+++H  ++KV
Subjt:  MFVCILNLCAKRLDFELGRQIHGVIVKV

Q0WNP3 Pentatricopeptide repeat-containing protein At4g18520, chloroplastic | e-value: 4.6e-33 | % identity: 54.33
Query:  LIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGV-LANGQM
        L+  WL+SS  ++ ++ +HA  L+ F    IY GNNL+SS +RLG LV ARKVFD MP ++ VTWTA+I+GY+   L +EA ALF D VK G+   N +M
Subjt:  LIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGV-LANGQM

Query:  FVCILNLCAKRLDFELGRQIHGVIVKV
        FVC+LNLC++R +FELGRQ+HG +VKV
Subjt:  FVCILNLCAKRLDFELGRQIHGVIVKV

Q9FIB2 Putative pentatricopeptide repeat-containing protein At5g09950 | e-value: 6.0e-17 | % identity: 39.82
Query:  RAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAK--RLDF
        R  H+ + +N     +Y+ NNL+++YL  G  V ARKVFDEMP+R+ V+W  I++GY      +EAL    D VK G+ +N   FV +L  C +   +  
Subjt:  RAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAK--RLDF

Query:  ELGRQIHGVIVKV
          GRQIHG++ K+
Subjt:  ELGRQIHGVIVKV

Q9LTF4 Putative pentatricopeptide repeat-containing protein At5g52630 | e-value: 2.4e-18 | % identity: 39.09
Query:  RAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFEL
        R+VH   ++      ++VG++L+  Y + G +V ARK+FDEMP R+VVTW+ ++ GY  +   EEAL LF +++   +  N   F  ++++CA     EL
Subjt:  RAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFEL

Query:  GRQIHGVIVK
        GRQIHG+ +K
Subjt:  GRQIHGVIVK

Q9SKQ4 Pentatricopeptide repeat-containing protein At2g21090 | e-value: 6.7e-16 | % identity: 35.16
Query:  LSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLAN
        LS +LIG++++  + +   +      LRN     +Y  NN++S Y++ GMLV AR VFD MP R VV+W  ++ GY       EAL  + +  +SG+  N
Subjt:  LSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLAN

Query:  GQMFVCILNLCAKRLDFELGRQIHGVIV
           F  +L  C K    +L RQ HG ++
Subjt:  GQMFVCILNLCAKRLDFELGRQIHGVIV

Arabidopsis top hits (e-value, % identity, alignment)
AT2G21090.1 Pentatricopeptide repeat (PPR-like) superfamily protein | e-value: 4.7e-17 | % identity: 35.16
Query:  LSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLAN
        LS +LIG++++  + +   +      LRN     +Y  NN++S Y++ GMLV AR VFD MP R VV+W  ++ GY       EAL  + +  +SG+  N
Subjt:  LSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLAN

Query:  GQMFVCILNLCAKRLDFELGRQIHGVIV
           F  +L  C K    +L RQ HG ++
Subjt:  GQMFVCILNLCAKRLDFELGRQIHGVIV

AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 3.3e-34 | % identity: 54.33
Query:  LIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGV-LANGQM
        L+  WL+SS  ++ ++ +HA  L+ F    IY GNNL+SS +RLG LV ARKVFD MP ++ VTWTA+I+GY+   L +EA ALF D VK G+   N +M
Subjt:  LIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGV-LANGQM

Query:  FVCILNLCAKRLDFELGRQIHGVIVKV
        FVC+LNLC++R +FELGRQ+HG +VKV
Subjt:  FVCILNLCAKRLDFELGRQIHGVIVKV

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 1.1e-16 | % identity: 34.38
Query:  PYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQ
        P+LI   + +   V+    +H+ ++R+     IYV N+LL  Y   G +  A KVFD+MP + +V W ++ING+ +    EEALAL+++    G+  +G 
Subjt:  PYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQ

Query:  MFVCILNLCAKRLDFELGRQIHGVIVKV
          V +L+ CAK     LG+++H  ++KV
Subjt:  MFVCILNLCAKRLDFELGRQIHGVIVKV

AT5G09950.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 4.3e-18 | % identity: 39.82
Query:  RAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAK--RLDF
        R  H+ + +N     +Y+ NNL+++YL  G  V ARKVFDEMP+R+ V+W  I++GY      +EAL    D VK G+ +N   FV +L  C +   +  
Subjt:  RAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAK--RLDF

Query:  ELGRQIHGVIVKV
          GRQIHG++ K+
Subjt:  ELGRQIHGVIVKV

AT5G52630.1 mitochondrial RNAediting factor 1 | e-value: 1.7e-19 | % identity: 39.09
Query:  RAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFEL
        R+VH   ++      ++VG++L+  Y + G +V ARK+FDEMP R+VVTW+ ++ GY  +   EEAL LF +++   +  N   F  ++++CA     EL
Subjt:  RAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFEL

Query:  GRQIHGVIVK
        GRQIHG+ +K
Subjt:  GRQIHGVIVK
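
The % identity values can be related to the alignment rows above. In BLAST-style output the middle row repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and leaves a blank for other mismatches. A minimal Python sketch, assuming that convention and using the hypothetical helper names count_identities and percent_identity, is:

```python
def count_identities(query, midline):
    """Count positions where the midline repeats the query residue."""
    # Pad the midline in case trailing blanks were stripped from the text.
    midline = midline.ljust(len(query))
    return sum(1 for q, m in zip(query, midline) if q == m and q.isalpha())

def percent_identity(rows):
    """rows: list of (query_row, midline_row) pairs for one hit."""
    identical = sum(count_identities(q, m) for q, m in rows)
    length = sum(len(q) for q, _ in rows)
    return 100.0 * identical / length

# Final row of the XP_023549983.1 alignment listed under GenBank top hits.
rows = [("ANGQMFVCILNLCAKRLDFELGRQIHGVIVK",
         "ANG+MFVCILNLCAKRLDFELGRQIHG IVK")]
print(count_identities(*rows[0]), "of", len(rows[0][0]))
```

Summed over all rows of a hit, the same ratio gives the reported figure; for example, 127 identical positions out of 131 aligned positions corresponds to 96.95 percent.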


Sequences
CDS sequence
GGATGTTTGAGTCCCTATTTGATTGGGGTTTGGCTTCGCTCTAGTCGTAGCGTCAAGAAATTAAGGGCGGTACATGCTTTCATTTTGCGAAATTTTACAAGTTTTGGGAT
CTATGTTGGAAACAATTTGCTTAGTTCTTACTTAAGATTGGGAATGTTGGTTGATGCTAGAAAGGTGTTCGATGAAATGCCAATGAGGAGTGTTGTGACCTGGACGGCTA
TTATTAATGGATATATTGATTTGGATTTGACTGAAGAAGCTTTAGCGTTGTTCAGTGATTCGGTCAAGAGCGGGGTGCTAGCAAATGGGCAGATGTTTGTTTGCATCTTG
AACTTGTGTGCTAAGAGGTTGGATTTTGAGCTTGGGAGGCAAATTCATGGCGTTATTGTGAAAGTGTTCTCAAGGCTTGTG
mRNA sequence
GGATGTTTGAGTCCCTATTTGATTGGGGTTTGGCTTCGCTCTAGTCGTAGCGTCAAGAAATTAAGGGCGGTACATGCTTTCATTTTGCGAAATTTTACAAGTTTTGGGAT
CTATGTTGGAAACAATTTGCTTAGTTCTTACTTAAGATTGGGAATGTTGGTTGATGCTAGAAAGGTGTTCGATGAAATGCCAATGAGGAGTGTTGTGACCTGGACGGCTA
TTATTAATGGATATATTGATTTGGATTTGACTGAAGAAGCTTTAGCGTTGTTCAGTGATTCGGTCAAGAGCGGGGTGCTAGCAAATGGGCAGATGTTTGTTTGCATCTTG
AACTTGTGTGCTAAGAGGTTGGATTTTGAGCTTGGGAGGCAAATTCATGGCGTTATTGTGAAAGTGTTCTCAAGGCTTGTG
Protein sequence
GCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCIL
NLCAKRLDFELGRQIHGVIVKVFSRLV
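
The protein sequence corresponds to the translation of the CDS sequence listed above. A minimal Python sketch that checks the first ten codons, assuming the standard genetic code and using a hypothetical helper name translate, is:

```python
# Standard genetic code, built in TCAG order (an assumption: the page
# does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}

def translate(cds):
    """Translate a nucleotide string codon by codon; unknown codons become 'X'."""
    cds = cds.upper().replace("\n", "")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 30 nucleotides of the CDS sequence on this page.
print(translate("GGATGTTTGAGTCCCTATTTGATTGGGGTT"))
```

The output, GCLSPYLIGV, matches the first ten residues of the protein sequence.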