; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G01165 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G01165
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr05:805307..806683
RNA-Seq ExpressionClc05G01165
SyntenyClc05G01165
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013843.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-2646.39Show/hide
Query:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK
        N++ +AL KSDMR EFKKVF KLR IRSFEF                                                          NDALIVWEELK
Subjt:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK

Query:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA
        GSGHEPDAFTY  IIQG                         TIVYNSLLDGLFK ++V++AC+ FDKMVQ GVRAS W  NILIDGLFRNGRA
Subjt:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA

XP_022134897.1 pentatricopeptide repeat-containing protein At4g01570 [Momordica charantia]3.4e-2645.36Show/hide
Query:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEFN----------------------------------------------------------DALIVWEELK
        N++ +AL ++DMR+EFKKVF KLR IR FEFN                                                          DALIVWEELK
Subjt:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEFN----------------------------------------------------------DALIVWEELK

Query:  GSGHEPDAFTYSDIIQG-----------------------YYTIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA
        GSGHEPDAFTY  IIQG                         TIVYNSLLDGLFK +KV++ C+LFDKMVQ GVRAS W  NILIDGLFRNGRA
Subjt:  GSGHEPDAFTYSDIIQG-----------------------YYTIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA

XP_022929794.1 pentatricopeptide repeat-containing protein At4g01570 [Cucurbita moschata]3.1e-2746.91Show/hide
Query:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK
        N++ +AL KSDMRVEFKKVF KLR IRSFEF                                                          NDALIVWEELK
Subjt:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK

Query:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA
        GSGHEPDAFTY  IIQG                         TIVYNSLLDGLFK ++V++AC+ FDKMVQ GVRAS W  NILIDGLFRNGRA
Subjt:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA

XP_022992119.1 pentatricopeptide repeat-containing protein At4g01570 [Cucurbita maxima]1.5e-2646.39Show/hide
Query:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK
        N++ +AL KSDMRVEFK VF KLR IRSFEF                                                          NDALIVWEELK
Subjt:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK

Query:  GSGHEPDAFTYSDIIQGY-----------------------YTIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA
        GSGHEPDAFTY  IIQG                         TIVYNSLLDGLFK ++V++AC+ FDKMVQ GVRAS W  NILIDGLFRNGRA
Subjt:  GSGHEPDAFTYSDIIQGY-----------------------YTIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA

XP_023549441.1 pentatricopeptide repeat-containing protein At4g01570 [Cucurbita pepo subsp. pepo]3.1e-2746.91Show/hide
Query:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK
        N++ +AL KSDMRVEFKKVF KLR IRSFEF                                                          NDALIVWEELK
Subjt:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK

Query:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA
        GSGHEPDAFTY  IIQG                         TIVYNSLLDGLFK ++V++AC+ FDKMVQ GVRAS W  NILIDGLFRNGRA
Subjt:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA

TrEMBL top hitse value%identityAlignment
A0A0A0KFG9 Uncharacterized protein1.8e-2545.88Show/hide
Query:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEFN----------------------------------------------------------DALIVWEELK
        N++ +AL K DMRVEFKKVF KLRAI SFEF+                                                          DALIVWEELK
Subjt:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEFN----------------------------------------------------------DALIVWEELK

Query:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA
        GSGHEPDAFTY  IIQG                         TIVYNSLL+GLFK +KV +AC+LFDKMVQ  VRAS W  NILIDGLFRNGRA
Subjt:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA

A0A5A7TE47 Pentatricopeptide repeat-containing protein3.1e-2543.4Show/hide
Query:  QDKEPASKAFYKPN--IENQVGLAL-KSDMRVEFKKVF-KLRAIRSFEFN--------------------------------------------------
        QD   A+   + PN    N++ +AL K DMRVEF+KVF KLRAI +FEFN                                                  
Subjt:  QDKEPASKAFYKPN--IENQVGLAL-KSDMRVEFKKVF-KLRAIRSFEFN--------------------------------------------------

Query:  --------DALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANN
                DALIVWEELKGSGHEPDAFTY  IIQG                          IVYNSLL+GLFK +KV +AC+LFDKMVQ  VRAS W  N
Subjt:  --------DALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANN

Query:  ILIDGLFRNGRA
        ILIDGLFRNGRA
Subjt:  ILIDGLFRNGRA

A0A6J1C0Z4 pentatricopeptide repeat-containing protein At4g015701.6e-2645.36Show/hide
Query:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEFN----------------------------------------------------------DALIVWEELK
        N++ +AL ++DMR+EFKKVF KLR IR FEFN                                                          DALIVWEELK
Subjt:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEFN----------------------------------------------------------DALIVWEELK

Query:  GSGHEPDAFTYSDIIQG-----------------------YYTIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA
        GSGHEPDAFTY  IIQG                         TIVYNSLLDGLFK +KV++ C+LFDKMVQ GVRAS W  NILIDGLFRNGRA
Subjt:  GSGHEPDAFTYSDIIQG-----------------------YYTIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA

A0A6J1EPT7 pentatricopeptide repeat-containing protein At4g015701.5e-2746.91Show/hide
Query:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK
        N++ +AL KSDMRVEFKKVF KLR IRSFEF                                                          NDALIVWEELK
Subjt:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK

Query:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA
        GSGHEPDAFTY  IIQG                         TIVYNSLLDGLFK ++V++AC+ FDKMVQ GVRAS W  NILIDGLFRNGRA
Subjt:  GSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA

A0A6J1JWP1 pentatricopeptide repeat-containing protein At4g015707.4e-2746.39Show/hide
Query:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK
        N++ +AL KSDMRVEFK VF KLR IRSFEF                                                          NDALIVWEELK
Subjt:  NQVGLAL-KSDMRVEFKKVF-KLRAIRSFEF----------------------------------------------------------NDALIVWEELK

Query:  GSGHEPDAFTYSDIIQGY-----------------------YTIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA
        GSGHEPDAFTY  IIQG                         TIVYNSLLDGLFK ++V++AC+ FDKMVQ GVRAS W  NILIDGLFRNGRA
Subjt:  GSGHEPDAFTYSDIIQGY-----------------------YTIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGRA

SwissProt top hitse value%identityAlignment
Q8VZE4 Pentatricopeptide repeat-containing protein At4g015701.2e-2156.73Show/hide
Query:  DALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFR
        DALIVW+ELK SGHEPD  TY  +IQG                         TIVYN LLDG  K +KV +AC+LF+KMVQ GVRAS W  NILIDGLFR
Subjt:  DALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFR

Query:  NGRA
        NGRA
Subjt:  NGRA

Q9FLJ4 Pentatricopeptide repeat-containing protein At5g614001.3e-0423.61Show/hide
Query:  IENQVGLALKSDMRVEFKKVFKLRA---------IRSFEFNDALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSL
        IE+QV  A +   +++ +++F   A          + +    AL +  E+  SG EP+  T+S +I GY                         + Y +L
Subjt:  IENQVGLALKSDMRVEFKKVFKLRA---------IRSFEFNDALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSL

Query:  LDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGR
        +D  FK   + +A RL+  M++ G+  +      L+DG ++ GR
Subjt:  LDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGR

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic2.4e-0633.33Show/hide
Query:  ALIVWEELKGSGHEPDAFTYSDIIQGYYT-----------------------IVYNSLLDGLFKVQKVMKACRLFDKM-VQGVRASSWANNILIDGLFRN
        A+ ++EE++  G EPD FTY+ +I    +                       I YN+L+DG  K  K  +A  +FD+M V GV  +S   N LIDGL ++
Subjt:  ALIVWEELKGSGHEPDAFTYSDIIQGYYT-----------------------IVYNSLLDGLFKVQKVMKACRLFDKM-VQGVRASSWANNILIDGLFRN

Query:  GR
         R
Subjt:  GR

Q9SI78 Pentatricopeptide repeat-containing protein At1g627201.0e-0431.19Show/hide
Query:  RSFEFNDALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNIL
        R   F  AL V  ++   G+EPD  T S +I G+                         ++YN+++DG  K+  V  A  LFD+M + GVRA +   N L
Subjt:  RSFEFNDALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNIL

Query:  IDGLFRNGR
        + GL  +GR
Subjt:  IDGLFRNGR

Arabidopsis top hitse value%identityAlignment
AT1G62720.1 Pentatricopeptide repeat (PPR-like) superfamily protein7.1e-0631.19Show/hide
Query:  RSFEFNDALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNIL
        R   F  AL V  ++   G+EPD  T S +I G+                         ++YN+++DG  K+  V  A  LFD+M + GVRA +   N L
Subjt:  RSFEFNDALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNIL

Query:  IDGLFRNGR
        + GL  +GR
Subjt:  IDGLFRNGR

AT1G63080.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-0428.85Show/hide
Query:  NDALIVWEELKGSGHEPDAFTYSDIIQ-----GYYT------------------IVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLF
        +DAL ++ E+   G  PD FTYS +I      G ++                  + +NSL+D   K  K+++A +LFD+M+Q  +  +    N LI+G  
Subjt:  NDALIVWEELKGSGHEPDAFTYSDIIQ-----GYYT------------------IVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLF

Query:  RNGR
         + R
Subjt:  RNGR

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-0733.33Show/hide
Query:  ALIVWEELKGSGHEPDAFTYSDIIQGYYT-----------------------IVYNSLLDGLFKVQKVMKACRLFDKM-VQGVRASSWANNILIDGLFRN
        A+ ++EE++  G EPD FTY+ +I    +                       I YN+L+DG  K  K  +A  +FD+M V GV  +S   N LIDGL ++
Subjt:  ALIVWEELKGSGHEPDAFTYSDIIQGYYT-----------------------IVYNSLLDGLFKVQKVMKACRLFDKM-VQGVRASSWANNILIDGLFRN

Query:  GR
         R
Subjt:  GR

AT4G01570.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.3e-2356.73Show/hide
Query:  DALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFR
        DALIVW+ELK SGHEPD  TY  +IQG                         TIVYN LLDG  K +KV +AC+LF+KMVQ GVRAS W  NILIDGLFR
Subjt:  DALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSLLDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFR

Query:  NGRA
        NGRA
Subjt:  NGRA

AT5G61400.1 Pentatricopeptide repeat (PPR) superfamily protein9.2e-0623.61Show/hide
Query:  IENQVGLALKSDMRVEFKKVFKLRA---------IRSFEFNDALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSL
        IE+QV  A +   +++ +++F   A          + +    AL +  E+  SG EP+  T+S +I GY                         + Y +L
Subjt:  IENQVGLALKSDMRVEFKKVFKLRA---------IRSFEFNDALIVWEELKGSGHEPDAFTYSDIIQGYY-----------------------TIVYNSL

Query:  LDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGR
        +D  FK   + +A RL+  M++ G+  +      L+DG ++ GR
Subjt:  LDGLFKVQKVMKACRLFDKMVQ-GVRASSWANNILIDGLFRNGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTAAAAGGGGAAGGGGAGGAGGGAAGGTGAATGAAATAGCTGAAATAAACAAAAAATCCTTAGCTAAGATACAGGACAAAGAACCTGCTTCAAAAGCTTTCTACAA
ACCTAATATAGAAAACCAGGTGGGTTTGGCATTGAAATCGGACATGAGGGTTGAGTTCAAAAAAGTTTTTAAGCTTAGAGCAATTAGAAGCTTTGAGTTTAATGATGCAC
TTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCTGATGCCTTCACCTACAGTGACATAATTCAAGGTTACTATACCATTGTATATAATTCTCTCCTCGACGGG
CTATTTAAGGTTCAGAAAGTTATGAAAGCATGTCGACTTTTTGATAAAATGGTACAAGGTGTAAGAGCTTCTTCTTGGGCAAACAATATTCTAATTGATGGATTGTTTAG
GAATGGAAGAGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTAAAAGGGGAAGGGGAGGAGGGAAGGTGAATGAAATAGCTGAAATAAACAAAAAATCCTTAGCTAAGATACAGGACAAAGAACCTGCTTCAAAAGCTTTCTACAA
ACCTAATATAGAAAACCAGGTGGGTTTGGCATTGAAATCGGACATGAGGGTTGAGTTCAAAAAAGTTTTTAAGCTTAGAGCAATTAGAAGCTTTGAGTTTAATGATGCAC
TTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCTGATGCCTTCACCTACAGTGACATAATTCAAGGTTACTATACCATTGTATATAATTCTCTCCTCGACGGG
CTATTTAAGGTTCAGAAAGTTATGAAAGCATGTCGACTTTTTGATAAAATGGTACAAGGTGTAAGAGCTTCTTCTTGGGCAAACAATATTCTAATTGATGGATTGTTTAG
GAATGGAAGAGCCTGA
Protein sequenceShow/hide protein sequence
MCKRGRGGGKVNEIAEINKKSLAKIQDKEPASKAFYKPNIENQVGLALKSDMRVEFKKVFKLRAIRSFEFNDALIVWEELKGSGHEPDAFTYSDIIQGYYTIVYNSLLDG
LFKVQKVMKACRLFDKMVQGVRASSWANNILIDGLFRNGRA