CuGenDBv2

Lsi04G004630 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G004630
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr04:4393389..4394127
RNA-Seq Expression: Lsi04G004630
Synteny: Lsi04G004630
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits
KAG6574109.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 7.0e-32; identity: 64.6%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRT AQLS CEAG TVHGAL KHGFEFDP VQ                          +++CQTA+VSACAKCGDTD+AR+LFD M QRDSV
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        SWNAMIAGY QRG
Subjt:  SWNAMIAGYAQRG

XP_022151045.1 putative pentatricopeptide repeat-containing protein At5g40405 [Momordica charantia] (e-value: 3.7e-33; identity: 66.37%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRTCAQLS CEAG TVHGALIKHGFEFDP VQ                          +++CQTA+VSACAKCGDT FAR+LFD M QRDS+
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        SWNAMIAGYAQRG
Subjt:  SWNAMIAGYAQRG

XP_022968643.1 putative pentatricopeptide repeat-containing protein At5g40405 [Cucurbita maxima] (e-value: 2.4e-32; identity: 65.49%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRT AQLS CEAG TVHGAL KHGFEFDP VQ                          +++CQTA+VSACAKCGDTD+AR+LFD M QRDSV
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        SWNAMIAGYAQRG
Subjt:  SWNAMIAGYAQRG

XP_023542265.1 putative pentatricopeptide repeat-containing protein At5g40405 [Cucurbita pepo subsp. pepo] (e-value: 2.4e-32; identity: 65.49%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRT AQLS CEAG TVHGAL KHGFEFDP VQ                          +++CQTA+VSACAKCGDTD+AR+LFD M QRDSV
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        SWNAMIAGYAQRG
Subjt:  SWNAMIAGYAQRG

XP_038891785.1 putative pentatricopeptide repeat-containing protein At5g40405 [Benincasa hispida] (e-value: 2.6e-34; identity: 69.03%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRTCAQLS CEAG TVHGALIKHGFEFDP VQ                          +I+CQTA+VSACAKCGDTDFAR LFD M QRDSV
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        SWNAMIAGYAQRG
Subjt:  SWNAMIAGYAQRG

TrEMBL top hits
A0A5A7SZ47 Putative pentatricopeptide repeat-containing protein (e-value: 2.3e-28; identity: 61.06%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRTCAQ S CEAG  VHGALIKHGFE+DP V+                          +++CQT +VSACAKCGD  FAR LFD M QRD V
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        SWNAMIAGYAQRG
Subjt:  SWNAMIAGYAQRG

A0A5N6R2U6 DYW_deaminase domain-containing protein (e-value: 2.3e-28; identity: 58.12%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRTC+QL   E G +VHGALIKHGFE DP VQ                          +++CQTA+VSACAKCGD  FARELFDVM QRD +
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRGNQGK
        +WNAMIAGYAQ G   K
Subjt:  SWNAMIAGYAQRGNQGK

A0A6J1DA54 putative pentatricopeptide repeat-containing protein At5g40405 (e-value: 1.8e-33; identity: 66.37%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRTCAQLS CEAG TVHGALIKHGFEFDP VQ                          +++CQTA+VSACAKCGDT FAR+LFD M QRDS+
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        SWNAMIAGYAQRG
Subjt:  SWNAMIAGYAQRG

A0A6J1FZH3 putative pentatricopeptide repeat-containing protein At5g40405 (e-value: 4.4e-32; identity: 63.72%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRT AQLS CEAG TVHGAL KHGFEFDP VQ                          +++CQTA+VSACAKCGDTD+AR+LFD M QRDS+
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        SWNAMIAGY QRG
Subjt:  SWNAMIAGYAQRG

A0A6J1HYM9 putative pentatricopeptide repeat-containing protein At5g40405 (e-value: 1.2e-32; identity: 65.49%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYTFNFLVRT AQLS CEAG TVHGAL KHGFEFDP VQ                          +++CQTA+VSACAKCGDTD+AR+LFD M QRDSV
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        SWNAMIAGYAQRG
Subjt:  SWNAMIAGYAQRG

SwissProt top hits
O64705 Pentatricopeptide repeat-containing protein At2g34400 (e-value: 3.1e-14; identity: 40.23%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG
        D +T+NF+   CA+L     G +VH +L K G E D  + +     +++   AKCG   +AR+LFD + +RD+VSWN+MI+GY++ G
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG

Q9FND7 Putative pentatricopeptide repeat-containing protein At5g40405 (e-value: 3.4e-21; identity: 45.13%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYT NFLV+ C  L   E GL VHG  I+ GF+ DP VQ                          + +C+TA+V+ACA+CGD  FAR+LF+ M +RD +
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        +WNAMI+GYAQ G
Subjt:  SWNAMIAGYAQRG

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 (e-value: 1.3e-12; identity: 41.86%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQR
        D YTF F +  CA+      G+ +HG ++K G+  D  VQN     ++V   A+CG+ D AR++FD M +R+ VSW +MI GYA+R
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQR

Q9LXF2 Pentatricopeptide repeat-containing protein At5g15300 (e-value: 1.7e-12; identity: 37.93%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG
        D YTF F+++ C++L     G   HG +++HGF  +  V+N     A++   A CGD   A ELFD   +   V+W++M +GYA+RG
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic (e-value: 5.2e-14; identity: 43.68%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG
        D YTF  ++RTC  +     G  VH  ++++G+E D DV N     A+++   KCGD   AR LFD M +RD +SWNAMI+GY + G
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG

Arabidopsis top hits
AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 3.7e-15; identity: 43.68%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG
        D YTF  ++RTC  +     G  VH  ++++G+E D DV N     A+++   KCGD   AR LFD M +RD +SWNAMI+GY + G
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG

AT2G34400.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e-value: 2.2e-15; identity: 40.23%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG
        D +T+NF+   CA+L     G +VH +L K G E D  + +     +++   AKCG   +AR+LFD + +RD+VSWN+MI+GY++ G
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885) (e-value: 9.2e-14; identity: 41.86%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQR
        D YTF F +  CA+      G+ +HG ++K G+  D  VQN     ++V   A+CG+ D AR++FD M +R+ VSW +MI GYA+R
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQR

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification (e-value: 9.2e-14; identity: 41.86%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQR
        D YTF F +  CA+      G+ +HG ++K G+  D  VQN     ++V   A+CG+ D AR++FD M +R+ VSW +MI GYA+R
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQR

AT5G40405.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 2.4e-22; identity: 45.13%)
Query:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV
        DNYT NFLV+ C  L   E GL VHG  I+ GF+ DP VQ                          + +C+TA+V+ACA+CGD  FAR+LF+ M +RD +
Subjt:  DNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQ--------------------------NIICQTAIVSACAKCGDTDFARELFDVMLQRDSV

Query:  SWNAMIAGYAQRG
        +WNAMI+GYAQ G
Subjt:  SWNAMIAGYAQRG


Sequences
CDS sequence
ATGCAGTCAAAAAAAGATATATCAGATGGTTCTGATTGCTGCACTTGGACTTTTATTCCTACTTTGATTGACAATTACACTTTCAATTTTCTGGTTCGCACTTGCGCCCA
ATTGTCTACTTGTGAAGCAGGTCTAACTGTTCATGGTGCACTTATCAAACATGGTTTTGAATTTGACCCAGATGTTCAAAATATAATTTGTCAGACGGCCATAGTGAGTG
CTTGTGCAAAATGTGGTGATACTGATTTTGCACGAGAGCTGTTCGACGTAATGCTTCAAAGGGATTCTGTGTCATGGAATGCTATGATTGCTGGTTATGCACAGAGGGGC
AATCAAGGGAAGCTTTGA
mRNA sequence
ATGCAGTCAAAAAAAGATATATCAGATGGTTCTGATTGCTGCACTTGGACTTTTATTCCTACTTTGATTGACAATTACACTTTCAATTTTCTGGTTCGCACTTGCGCCCA
ATTGTCTACTTGTGAAGCAGGTCTAACTGTTCATGGTGCACTTATCAAACATGGTTTTGAATTTGACCCAGATGTTCAAAATATAATTTGTCAGACGGCCATAGTGAGTG
CTTGTGCAAAATGTGGTGATACTGATTTTGCACGAGAGCTGTTCGACGTAATGCTTCAAAGGGATTCTGTGTCATGGAATGCTATGATTGCTGGTTATGCACAGAGGGGC
AATCAAGGGAAGCTTTGA
Protein sequence
MQSKKDISDGSDCCTWTFIPTLIDNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG
NQGKL
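
The CDS and protein sequences listed above can be cross-checked by translating one into the other. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE`, `translate`, `CDS`, and `PROTEIN` are illustrative and are not part of CuGenDB.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (assumption: standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# CDS of Lsi04G004630, copied from the record above.
CDS = (
    "ATGCAGTCAAAAAAAGATATATCAGATGGTTCTGATTGCTGCACTTGGACTTTTATTCCTACTTTGATTGACAATTACACTTTCAATTTTCTGGTTCGCACTTGCGCCCA"
    "ATTGTCTACTTGTGAAGCAGGTCTAACTGTTCATGGTGCACTTATCAAACATGGTTTTGAATTTGACCCAGATGTTCAAAATATAATTTGTCAGACGGCCATAGTGAGTG"
    "CTTGTGCAAAATGTGGTGATACTGATTTTGCACGAGAGCTGTTCGACGTAATGCTTCAAAGGGATTCTGTGTCATGGAATGCTATGATTGCTGGTTATGCACAGAGGGGC"
    "AATCAAGGGAAGCTTTGA"
)

# Protein sequence of Lsi04G004630, copied from the record above.
PROTEIN = (
    "MQSKKDISDGSDCCTWTFIPTLIDNYTFNFLVRTCAQLSTCEAGLTVHGALIKHGFEFDPDVQNIICQTAIVSACAKCGDTDFARELFDVMLQRDSVSWNAMIAGYAQRG"
    "NQGKL"
)

print(translate(CDS) == PROTEIN)
```

The comparison is a quick consistency check on the record; it says nothing about whether the underlying gene model is correct.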