; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC01G008080 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC01G008080
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmU531Chr01:8820283..8822792
RNA-Seq ExpressionCmUC01G008080
SyntenyCmUC01G008080
Gene Ontology termsGO:0050794 - regulation of cellular process (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059628.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]2.3e-2352.46Show/hide
Query:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK+G+             +G+ PNV     M+     EGQVD AN+L Q+ + NGC PNIITYN L+RG  +SNKLDEVV+LLH MV KDV PDA T
Subjt:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Query:  CT-------KAKKYKECLDLLP
        C+       K +KY+ECLDLLP
Subjt:  CT-------KAKKYKECLDLLP

XP_008451225.1 PREDICTED: pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like [Cucumis melo]2.3e-2352.46Show/hide
Query:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK+G+             +G+ PNV     M+     EGQVD AN+L Q+ + NGC PNIITYN L+RG  +SNKLDEVV+LLH MV KDV PDA T
Subjt:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Query:  CT-------KAKKYKECLDLLP
        C+       K +KY+ECLDLLP
Subjt:  CT-------KAKKYKECLDLLP

XP_011659251.1 pentatricopeptide repeat-containing protein At3g22470, mitochondrial isoform X2 [Cucumis sativus]6.6e-2352.46Show/hide
Query:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK G+            ++G  PNV     M+     EGQVD AN+L Q+ + NGCTP+IITYN L+RG  +SNKL+EVV+LLHRM  KDVSPDAIT
Subjt:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Query:  C-------TKAKKYKECLDLLP
        C       +K +KY+ECL LLP
Subjt:  C-------TKAKKYKECLDLLP

XP_038896203.1 pentatricopeptide repeat-containing protein At1g63330-like [Benincasa hispida]5.1e-2353.28Show/hide
Query:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK G+             +G+ PNV     M+     +GQVDNANILFQ  ++N CTPN+IT N LLRG C+SNK  EVVELLHRMV +DV PD  T
Subjt:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Query:  CT-------KAKKYKECLDLLP
        CT       K +KY+ECLDLLP
Subjt:  CT-------KAKKYKECLDLLP

XP_038897332.1 pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like [Benincasa hispida]1.7e-2353.28Show/hide
Query:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK G              +G+ PNV     M+     +GQVD ANILFQ  ++NGCTPN+IT N LLRG C+SNK  EVVELLHRMV +DVSPD  T
Subjt:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Query:  CT-------KAKKYKECLDLLP
        CT       K +KY+EC+DLLP
Subjt:  CT-------KAKKYKECLDLLP

TrEMBL top hitse value%identityAlignment
A0A0A0K5M0 Uncharacterized protein5.5e-2353.27Show/hide
Query:  LCKDGQESQGMIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAITCT-------KAKKYKECL
        L ++G +S  +  N+M+      GQVD ANILF++ ++NGCTP+IITYN LL G CQSNK DEVV+LLH+M+ +D+SPDAI+C        K +KY+ECL
Subjt:  LCKDGQESQGMIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAITCT-------KAKKYKECL

Query:  DLLPSLM
        DLLP  +
Subjt:  DLLPSLM

A0A0A0K730 Uncharacterized protein3.2e-2352.46Show/hide
Query:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK G+            ++G  PNV     M+     EGQVD AN+L Q+ + NGCTP+IITYN L+RG  +SNKL+EVV+LLHRM  KDVSPDAIT
Subjt:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Query:  C-------TKAKKYKECLDLLP
        C       +K +KY+ECL LLP
Subjt:  C-------TKAKKYKECLDLLP

A0A1S4DYL6 pentatricopeptide repeat-containing protein At3g22470, mitochondrial-like1.1e-2352.46Show/hide
Query:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK+G+             +G+ PNV     M+     EGQVD AN+L Q+ + NGC PNIITYN L+RG  +SNKLDEVV+LLH MV KDV PDA T
Subjt:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Query:  CT-------KAKKYKECLDLLP
        C+       K +KY+ECLDLLP
Subjt:  CT-------KAKKYKECLDLLP

A0A5A7UUW8 Pentatricopeptide repeat-containing protein2.1e-2251.64Show/hide
Query:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK G+             +G+ PNV     M+      G VD ANILF++ ++NGCTP+IITY+ILLR  CQSNK +EVV LLH+MV +DVSPD   
Subjt:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Query:  CT-------KAKKYKECLDLLP
        CT       K +KYKECLDLLP
Subjt:  CT-------KAKKYKECLDLLP

A0A5D3C8J0 Pentatricopeptide repeat-containing protein1.1e-2352.46Show/hide
Query:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK+G+             +G+ PNV     M+     EGQVD AN+L Q+ + NGC PNIITYN L+RG  +SNKLDEVV+LLH MV KDV PDA T
Subjt:  DGLCKDGQ-----------ESQGMIPNV-----MVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Query:  CT-------KAKKYKECLDLLP
        C+       K +KY+ECLDLLP
Subjt:  CT-------KAKKYKECLDLLP

SwissProt top hitse value%identityAlignment
O49436 Pentatricopeptide repeat-containing protein At4g200902.1e-0835Show/hide
Query:  DGLCK-----------DGQESQGMIP-----NVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK           D  +S+G  P     NV++     +G +     L       GC PN +TYN L+ GLC   KLD+ V LL RMV     P+ +T
Subjt:  DGLCK-----------DGQESQGMIP-----NVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

P0C7Q7 Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial3.3e-0935.24Show/hide
Query:  GLCKDGQESQG-----------MIPNVMVLPK-----SHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAITC
        GLCK G+ + G           ++PNV+           EG++  AN L++     G +PNIITYN L+ G C  N+L E   +L  MV    SPD +T 
Subjt:  GLCKDGQESQG-----------MIPNVMVLPK-----SHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAITC

Query:  TKAKK
        T   K
Subjt:  TKAKK

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745802.4e-0753.33Show/hide
Query:  KNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        + GC PN+ T+NILL  LC+  KLDE + LL  M +K V+PDA+T
Subjt:  KNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

Q9FNL2 Pentatricopeptide repeat-containing protein At5g461001.3e-0839.02Show/hide
Query:  DGLCKDGQESQGMIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDA
        DGLCKDG+  Q M                    LF+     GC PN++TY  L+ GLC+  K+ E VELL RM  + + PDA
Subjt:  DGLCKDGQESQGMIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDA

Q9SV46 Pentatricopeptide repeat-containing protein At3g54980, mitochondrial1.8e-0731.63Show/hide
Query:  DGLCKDGQESQG-----------------MIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPD
        +GLCK GQ S+                  M  N ++     EG++D+A   ++    NG +PN+ITY  L+ GLC++N++D+ +E+   M +K V  D
Subjt:  DGLCKDGQESQG-----------------MIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPD

Arabidopsis top hitse value%identityAlignment
AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-0830.61Show/hide
Query:  FHKDGLCKDGQE------SQGMIPNVMVLPK-----SHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        F K+G  ++ +E       +G+ P+ +           E Q+D AN +       GC PNI T+NIL+ G C++N +D+ +EL  +M  + V  D +T
Subjt:  FHKDGLCKDGQE------SQGMIPNVMVLPK-----SHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

AT1G12700.1 ATP binding;nucleic acid binding;helicases2.4e-1035.24Show/hide
Query:  GLCKDGQESQG-----------MIPNVMVLPK-----SHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAITC
        GLCK G+ + G           ++PNV+           EG++  AN L++     G +PNIITYN L+ G C  N+L E   +L  MV    SPD +T 
Subjt:  GLCKDGQESQG-----------MIPNVMVLPK-----SHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAITC

Query:  TKAKK
        T   K
Subjt:  TKAKK

AT3G54980.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-0831.63Show/hide
Query:  DGLCKDGQESQG-----------------MIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPD
        +GLCK GQ S+                  M  N ++     EG++D+A   ++    NG +PN+ITY  L+ GLC++N++D+ +E+   M +K V  D
Subjt:  DGLCKDGQESQG-----------------MIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPD

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-0935Show/hide
Query:  DGLCK-----------DGQESQGMIP-----NVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT
        DGLCK           D  +S+G  P     NV++     +G +     L       GC PN +TYN L+ GLC   KLD+ V LL RMV     P+ +T
Subjt:  DGLCK-----------DGQESQGMIP-----NVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAIT

AT5G46100.1 Pentatricopeptide repeat (PPR) superfamily protein8.9e-1039.02Show/hide
Query:  DGLCKDGQESQGMIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDA
        DGLCKDG+  Q M                    LF+     GC PN++TY  L+ GLC+  K+ E VELL RM  + + PDA
Subjt:  DGLCKDGQESQGMIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATATTTGCATTATCTCAAATTTGCCATATTTCATTAGTTTTCATAAAGATGGGCTCTGTAAGGATGGACAGGAATCTCAAGGAATGATTCCCAATGTTATGGTATT
GCCAAAGAGTCATGAGGGACAAGTAGATAACGCAAATATTTTGTTTCAAAGATGGAAAAAAAATGGTTGTACCCCCAACATAATTACTTATAATATACTTTTGCGTGGTT
TATGCCAAAGTAATAAATTAGATGAGGTGGTTGAACTTCTTCATAGGATGGTTCATAAGGATGTGTCACCAGATGCCATCACTTGCACCAAAGCTAAAAAATATAAAGAA
TGTTTGGACTTACTTCCAAGTTTGATGAAGAGTTATTCACTGCGTGCAATGTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATATTTGCATTATCTCAAATTTGCCATATTTCATTAGTTTTCATAAAGATGGGCTCTGTAAGGATGGACAGGAATCTCAAGGAATGATTCCCAATGTTATGGTATT
GCCAAAGAGTCATGAGGGACAAGTAGATAACGCAAATATTTTGTTTCAAAGATGGAAAAAAAATGGTTGTACCCCCAACATAATTACTTATAATATACTTTTGCGTGGTT
TATGCCAAAGTAATAAATTAGATGAGGTGGTTGAACTTCTTCATAGGATGGTTCATAAGGATGTGTCACCAGATGCCATCACTTGCACCAAAGCTAAAAAATATAAAGAA
TGTTTGGACTTACTTCCAAGTTTGATGAAGAGTTATTCACTGCGTGCAATGTCTTGA
Protein sequenceShow/hide protein sequence
MNICIISNLPYFISFHKDGLCKDGQESQGMIPNVMVLPKSHEGQVDNANILFQRWKKNGCTPNIITYNILLRGLCQSNKLDEVVELLHRMVHKDVSPDAITCTKAKKYKE
CLDLLPSLMKSYSLRAMS