CuGenDBv2

HG10005304 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10005304
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr07:1438323..1439457
RNA-Seq Expression: HG10005304
Synteny: HG10005304
Gene Ontology terms:
GO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR032867 - DYW domain


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031550.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 2.8e-34; % identity: 88.24)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSELK HGYVPDLGSVLHD+DNEEKEYNLAHHSEK AIAFALMNT ENVPIRVMKNLRVCNDCHNAIKCISRIRN EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

XP_008455289.1 PREDICTED: pentatricopeptide repeat-containing protein At2g41080 isoform X1 [Cucumis melo] (e value: 2.8e-34; % identity: 88.24)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSELK HGYVPDLGSVLHD+DNEEKEYNLAHHSEK AIAFALMNT ENVPIRVMKNLRVCNDCHNAIKCISRIRN EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

XP_016901687.1 PREDICTED: pentatricopeptide repeat-containing protein At2g41080 isoform X2 [Cucumis melo] (e value: 2.8e-34; % identity: 88.24)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSELK HGYVPDLGSVLHD+DNEEKEYNLAHHSEK AIAFALMNT ENVPIRVMKNLRVCNDCHNAIKCISRIRN EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

XP_038887926.1 pentatricopeptide repeat-containing protein At2g41080 isoform X1 [Benincasa hispida] (e value: 2.1e-34; % identity: 87.06)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSELKLHGYVPDLGSVLHD+DNEEKEYNLAHHSEKLAIAFALMNTPEN PIRVMKNLRVCNDCHNAIKC+S+IR  EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

XP_038887927.1 pentatricopeptide repeat-containing protein At2g41080 isoform X2 [Benincasa hispida] (e value: 2.1e-34; % identity: 87.06)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSELKLHGYVPDLGSVLHD+DNEEKEYNLAHHSEKLAIAFALMNTPEN PIRVMKNLRVCNDCHNAIKC+S+IR  EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K2D8 DYW_deaminase domain-containing protein (e value: 1.5e-33; % identity: 85.88)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSELK HGYVP+LGSVLHD+DNEEKEYNLAHHSEK AIAFALMNT ENVPIRVMKNLRVC+DCHNAIKCISRIRN EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

A0A1S3C1T8 pentatricopeptide repeat-containing protein At2g41080 isoform X1 (e value: 1.4e-34; % identity: 88.24)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSELK HGYVPDLGSVLHD+DNEEKEYNLAHHSEK AIAFALMNT ENVPIRVMKNLRVCNDCHNAIKCISRIRN EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

A0A1S4E0C2 pentatricopeptide repeat-containing protein At2g41080 isoform X2 (e value: 1.4e-34; % identity: 88.24)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSELK HGYVPDLGSVLHD+DNEEKEYNLAHHSEK AIAFALMNT ENVPIRVMKNLRVCNDCHNAIKCISRIRN EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

A0A5A7SLF2 Pentatricopeptide repeat-containing protein (e value: 1.4e-34; % identity: 88.24)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSELK HGYVPDLGSVLHD+DNEEKEYNLAHHSEK AIAFALMNT ENVPIRVMKNLRVCNDCHNAIKCISRIRN EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

A0A6J1HWJ8 pentatricopeptide repeat-containing protein At2g41080 (e value: 2.5e-33; % identity: 83.53)
Query:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        MSE+KLHGYVPD+GSVLHD+DNEEKEYNLAHHSEK AIAFALMN PE VPIRVMKNLRVCNDCH AIKCIS+IRN EII +  +R
Subjt:  MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

SwissProt top hits (e value, % identity, alignment)
P0C7R1 Pentatricopeptide repeat-containing protein DWY1, chloroplastic (e value: 9.1e-20; % identity: 56.63)
Query:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        E++  GYVP+   VLHD+D E KE  L HHSE+LAIAF ++NTP    IRVMKNLR+C DCHN IK +S I + EII +   R
Subjt:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

Q8S9M4 Pentatricopeptide repeat-containing protein At2g41080 (e value: 2.5e-22; % identity: 62.65)
Query:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        E+KL GY PD  SVLHD+D EEKE +L  HSEKLA+AFALM  PE  PIR++KNLRVC+DCH A K IS I+N EI  +  +R
Subjt:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

Q9FXB9 Pentatricopeptide repeat-containing protein At1g56690, mitochondrial (e value: 6.3e-21; % identity: 48.18)
Query:  LKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKARLIDLGIKSRKFAVRFGYN
        L+  GY PD   VLHDVD EEK  +L+ HSE+LA+A+ L+  PE VPIRVMKNLRVC DCH AIK IS++   EII +   R                +N
Subjt:  LKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKARLIDLGIKSRKFAVRFGYN

Query:  KQYCLCRDFF
           C CRD++
Subjt:  KQYCLCRDFF

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070 (e value: 1.9e-22; % identity: 48.65)
Query:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKARLIDLGIKSRKFAVRFGY
        E+K  GYVPD  SVLHD++ E KE  L HHSEKLAIAF L++TP+   +R+MKNLRVCNDCH AIK IS++   EII +   R                +
Subjt:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKARLIDLGIKSRKFAVRFGY

Query:  NKQYCLCRDFF
           +C CRD++
Subjt:  NKQYCLCRDFF

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g33170 (e value: 1.2e-19; % identity: 55.7)
Query:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFK
        ++K  GYVP+    L DV+ EEKE  L +HSEKLA+AF L++TP + PIRV+KNLRVC DCHNA+K I+++ N EI+ +
Subjt:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFK

Arabidopsis top hits (e value, % identity, alignment)
AT1G09410.1 pentatricopeptide (PPR) repeat-containing protein (e value: 8.4e-21; % identity: 57.69)
Query:  LKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFK
        L+  GY PD    LHDVD EEK  +L +HSE+LA+A+AL+   E +PIRVMKNLRVC+DCH AIK IS+++  EII +
Subjt:  LKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFK

AT1G47580.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 6.4e-21; % identity: 56.63)
Query:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        E++  GYVP+   VLHD+D E KE  L HHSE+LAIAF ++NTP    IRVMKNLR+C DCHN IK +S I + EII +   R
Subjt:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR

AT1G56690.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.5e-22; % identity: 48.18)
Query:  LKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKARLIDLGIKSRKFAVRFGYN
        L+  GY PD   VLHDVD EEK  +L+ HSE+LA+A+ L+  PE VPIRVMKNLRVC DCH AIK IS++   EII +   R                +N
Subjt:  LKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKARLIDLGIKSRKFAVRFGYN

Query:  KQYCLCRDFF
           C CRD++
Subjt:  KQYCLCRDFF

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein (e value: 1.4e-23; % identity: 48.65)
Query:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKARLIDLGIKSRKFAVRFGY
        E+K  GYVPD  SVLHD++ E KE  L HHSEKLAIAF L++TP+   +R+MKNLRVCNDCH AIK IS++   EII +   R                +
Subjt:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKARLIDLGIKSRKFAVRFGY

Query:  NKQYCLCRDFF
           +C CRD++
Subjt:  NKQYCLCRDFF

AT2G41080.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.8e-23; % identity: 62.65)
Query:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
        E+KL GY PD  SVLHD+D EEKE +L  HSEKLA+AFALM  PE  PIR++KNLRVC+DCH A K IS I+N EI  +  +R
Subjt:  ELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR
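
The % identity values reported for the hits appear consistent with the usual BLAST convention: the number of identical positions (letters on the middle line of an alignment) divided by the alignment length. The Python sketch below illustrates that calculation for the first GenBank hit (KAA0031550.1). The function name `percent_identity` is hypothetical, and the assumption that the database computes the value exactly this way is not confirmed by the record itself.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Return percent identity from a BLAST-style middle line.

    Letters in the middle line mark identical residues; '+' marks a
    conservative substitution and a space marks a mismatch or gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / aligned_length, 2)


# First GenBank hit (KAA0031550.1), copied from the record.
query = ("MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVP"
         "IRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKAR")
midline = ("MSELK HGYVPDLGSVLHD+DNEEKEYNLAHHSEK AIAFALMNT ENVP"
           "IRVMKNLRVCNDCHNAIKCISRIRN EII +  +R")

print(percent_identity(midline, len(query)))  # 75 identities over 85 positions -> 88.24
```

When copying a middle line from the page, leading and trailing spaces are easy to lose, which is why the alignment length is taken from the query line rather than from the middle line.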


Sequences
CDS sequence
ATGTCTGAATTGAAACTGCACGGTTACGTGCCAGACTTGGGCTCGGTTTTGCACGACGTGGACAATGAAGAAAAAGAATACAATTTGGCACATCATAGTGAGAAGTTAGC
AATTGCTTTTGCACTGATGAACACTCCAGAGAATGTCCCAATAAGGGTGATGAAGAACTTGCGGGTCTGCAATGACTGTCATAATGCCATTAAGTGCATATCAAGGATCA
GAAACGGAGAGATTATTTTCAAGTTCAAAGCCAGGCTGATTGATTTGGGTATTAAAAGTAGAAAATTTGCAGTAAGATTTGGCTATAACAAGCAATACTGCCTTTGTCGA
GACTTCTTTAAAAAAAGGGAGAAAATCAACAAGAACCAACCAAATCTAGAACAGCTTCCTCAAACCCAGCAGAAGAAACCCGGCGGTAGAAACAGACCGACAAAAACATA
CACAAAATCCCTGGTCAGGAGACAATGGACAGCACCATTACACACCCTATGA
mRNA sequence
ATGTCTGAATTGAAACTGCACGGTTACGTGCCAGACTTGGGCTCGGTTTTGCACGACGTGGACAATGAAGAAAAAGAATACAATTTGGCACATCATAGTGAGAAGTTAGC
AATTGCTTTTGCACTGATGAACACTCCAGAGAATGTCCCAATAAGGGTGATGAAGAACTTGCGGGTCTGCAATGACTGTCATAATGCCATTAAGTGCATATCAAGGATCA
GAAACGGAGAGATTATTTTCAAGTTCAAAGCCAGGCTGATTGATTTGGGTATTAAAAGTAGAAAATTTGCAGTAAGATTTGGCTATAACAAGCAATACTGCCTTTGTCGA
GACTTCTTTAAAAAAAGGGAGAAAATCAACAAGAACCAACCAAATCTAGAACAGCTTCCTCAAACCCAGCAGAAGAAACCCGGCGGTAGAAACAGACCGACAAAAACATA
CACAAAATCCCTGGTCAGGAGACAATGGACAGCACCATTACACACCCTATGA
Protein sequence
MSELKLHGYVPDLGSVLHDVDNEEKEYNLAHHSEKLAIAFALMNTPENVPIRVMKNLRVCNDCHNAIKCISRIRNGEIIFKFKARLIDLGIKSRKFAVRFGYNKQYCLCR
DFFKKREKINKNQPNLEQLPQTQQKKPGGRNRPTKTYTKSLVRRQWTAPLHTL
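
As a sanity check, the protein sequence can be reproduced by translating the CDS with the standard genetic code. The Python sketch below is a minimal illustration, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and use of the standard nuclear code is an assumption. Only the first and last few codons of the CDS are shown; the full sequence can be passed in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last six codons of the CDS in this record.
print(translate("ATGTCTGAATTGAAACTGCACGGTTAC"))  # MSELKLHGY
print(translate("CCATTACACACCCTATGA"))           # PLHTL
```

Both fragments match the start (`MSELKLHGY`) and end (`PLHTL`) of the protein sequence listed in the record.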