CuGenDBv2

HG10018937 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10018937
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: pentatricopeptide repeat-containing protein At4g18975, chloroplastic
Genome location: Chr04:11910249..11920810
RNA-Seq Expression: HG10018937
Synteny: HG10018937
Gene Ontology terms:
GO:0007165 - signal transduction (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR000488 - Death domain


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAA0031520.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]; e-value 5.4e-45; identity 55.39%
Query:  KLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ---------------------------------------------------
        +LLDLRDSKEAVYGALDAWVAWEQDFPIASLKH LAALEKEQQWHR+VQ                                                   
Subjt:  KLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ---------------------------------------------------

Query:  --------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNL
                            LFKDLEAFGRKPP+KSIVQRVADACE+LGL+EEKERVL+KYKYLF D+K  S+KKYK+VSFEK KRK KSTKG+E NSNL
Subjt:  --------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNL

Query:  MKAQ
        +K++
Subjt:  MKAQ

XP_004136857.2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis sativus]; e-value 8.3e-46; identity 55.67%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------
        LLDLRDSKEAVYGALDAWVAWEQDFPIA LKH LAALEKEQQWHR+VQ                                                    
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------

Query:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM
                           LFKDLEAFGRKPP+KSIVQRVADACE+LGL+EEKERVL+KYKYLF D+K+G +KKYK++SFEKSKRK KSTKGTE NSNL+
Subjt:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM

Query:  KAQ
        K++
Subjt:  KAQ

XP_008455250.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis melo]; e-value 7.1e-45; identity 55.67%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------
        LLDLRDSKEAVYGALDAWVAWEQDFPIASLKH LAALEKEQQWHR+VQ                                                    
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------

Query:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM
                           LFKDLEAFGRKPP+KSIVQRVADACE+LGL+EEKERVL+KYKYLF D+K  S+KKYK+VSFEK KRK KSTKG+E NSNL+
Subjt:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM

Query:  KAQ
        K++
Subjt:  KAQ

XP_038887984.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida]; e-value 6.2e-49; identity 59.11%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------
        LLDLRDSKEAVYGALDAWVAWEQDFPI SLKH L  LEKEQQWHRVVQ                                                    
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------

Query:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM
                           LFKDLEAFGRKPPEKSIVQRVADACEILGL+EEKERVLMKYKYLFTD+K+GSIKKYK+VSFEKSK K KSTK TE NSNLM
Subjt:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM

Query:  KAQ
        KAQ
Subjt:  KAQ

XP_038887985.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 [Benincasa hispida]; e-value 1.6e-49; identity 59.31%
Query:  KLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ---------------------------------------------------
        KLLDLRDSKEAVYGALDAWVAWEQDFPI SLKH L  LEKEQQWHRVVQ                                                   
Subjt:  KLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ---------------------------------------------------

Query:  --------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNL
                            LFKDLEAFGRKPPEKSIVQRVADACEILGL+EEKERVLMKYKYLFTD+K+GSIKKYK+VSFEKSK K KSTK TE NSNL
Subjt:  --------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNL

Query:  MKAQ
        MKAQ
Subjt:  MKAQ

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A1S3C174 pentatricopeptide repeat-containing protein At4g18975, chloroplastic; e-value 3.4e-45; identity 55.67%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------
        LLDLRDSKEAVYGALDAWVAWEQDFPIASLKH LAALEKEQQWHR+VQ                                                    
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------

Query:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM
                           LFKDLEAFGRKPP+KSIVQRVADACE+LGL+EEKERVL+KYKYLF D+K  S+KKYK+VSFEK KRK KSTKG+E NSNL+
Subjt:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM

Query:  KAQ
        K++
Subjt:  KAQ

A0A5A7SLC0 Pentatricopeptide repeat-containing protein; e-value 2.6e-45; identity 55.39%
Query:  KLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ---------------------------------------------------
        +LLDLRDSKEAVYGALDAWVAWEQDFPIASLKH LAALEKEQQWHR+VQ                                                   
Subjt:  KLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ---------------------------------------------------

Query:  --------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNL
                            LFKDLEAFGRKPP+KSIVQRVADACE+LGL+EEKERVL+KYKYLF D+K  S+KKYK+VSFEK KRK KSTKG+E NSNL
Subjt:  --------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNL

Query:  MKAQ
        +K++
Subjt:  MKAQ

A0A6J1HFH8 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2; e-value 3.3e-40; identity 53.23%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------
        L+DLRDSKEAVYGALDAWVAWEQDFPIASLKHALA LEKE QWHRVVQ                                                    
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------

Query:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM
                           LFK+LEAFGRKPPEKSIVQRVADACE+LGLVEEKERVL+KY YLFTD+K GSIKKY        K K KSTKG + NS+LM
Subjt:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM

Query:  K
        K
Subjt:  K

A0A6J1HSN2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2; e-value 8.7e-41; identity 53.73%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------
        L+DLRDSKEAVYGALDAWVAWEQDFPIASLKHALA LEKE QWHRVVQ                                                    
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------

Query:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM
                           LFKDLEAFGRKPPEKSIVQRVADACE+LGLVEEKERVL+KY YLFTD+K GSIKKY        K K KSTKG + NS+LM
Subjt:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM

Query:  K
        K
Subjt:  K

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1; e-value 8.7e-41; identity 53.73%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------
        L+DLRDSKEAVYGALDAWVAWEQDFPIASLKHALA LEKE QWHRVVQ                                                    
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------

Query:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM
                           LFKDLEAFGRKPPEKSIVQRVADACE+LGLVEEKERVL+KY YLFTD+K GSIKKY        K K KSTKG + NS+LM
Subjt:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLM

Query:  K
        K
Subjt:  K

SwissProt top hits (hit; e-value; % identity; alignment)
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic; e-value 1.7e-09; identity 58.82%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK
        L  L + KEAVYGAL+ WVAWE +FPI +   AL  L K  QWHRV+QL K
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK

Q8LG95 Pentatricopeptide repeat-containing protein At4g21190; e-value 3.6e-07; identity 47.92%
Query:  LRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK
        L + KE VYGALD+++AWE +FP+  +K AL  LE E++W +++Q+ K
Subjt:  LRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK

Arabidopsis top hits (hit; e-value; % identity; alignment)
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1); e-value 2.3e-25; identity 39.23%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------
        LLD+ D+KEAVYGALDAWVAWE++FPIASLK  +A+LEKE QWHR+VQ                                                    
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------

Query:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLF----TDDKDGSIKKYKK
                           LFKDLE++ RKPP+K IVQ VADA E+LG+++EKERV+ KY +L     +DDK     + KK
Subjt:  -------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLF----TDDKDGSIKKYKK

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4); e-value 5.1e-25; identity 38.59%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------
        LLD+ D+KEAVYGALDAWVAWE++FPIASLK  +A+LEKE QWHR+VQ                                                    
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQ----------------------------------------------------

Query:  ----------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLF----TDDKDGSIKKYKK
                              LFKDLE++ RKPP+K IVQ VADA E+LG+++EKERV+ KY +L     +DDK     + KK
Subjt:  ----------------------LFKDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLF----TDDKDGSIKKYKK

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein; e-value 1.2e-10; identity 58.82%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK
        L  L + KEAVYGAL+ WVAWE +FPI +   AL  L K  QWHRV+QL K
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein; e-value 1.2e-10; identity 58.82%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK
        L  L + KEAVYGAL+ WVAWE +FPI +   AL  L K  QWHRV+QL K
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein; e-value 1.2e-10; identity 58.82%
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK
        L  L + KEAVYGAL+ WVAWE +FPI +   AL  L K  QWHRV+QL K
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLFK


Sequences
CDS sequence
ATGGTTGTGTCTAGATTGTTTGCTCCTAACAACAAGAAGGAGATTAAAGGTGCTTTGGAAGATCATTTTACAGTCTCTATTTTGCTTAATCTTTTCACGACGGACAAGGC
TTTACTCAAACTTGATGATGGAATTGAATGGTCAAGGTTCTTGAAGGTATTACGAATTGGAGATGAAAAGCTTCTCGATCTGAGAGATAGTAAGGAGGCTGTCTATGGTG
CTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGCATCCCTTAAGCATGCATTGGCTGCCCTTGAGAAGGAACAGCAGTGGCATAGAGTTGTACAGCTTTTT
AAGGATCTTGAAGCTTTCGGACGTAAACCTCCAGAAAAATCAATAGTGCAGAGGGTAGCAGATGCTTGTGAGATTCTGGGCTTGGTTGAAGAGAAAGAGAGGGTACTAAT
GAAGTACAAATACCTTTTTACAGATGATAAGGATGGATCCATCAAGAAATATAAAAAGGTTTCGTTTGAGAAATCAAAGAGAAAAGGAAAATCGACCAAGGGCACTGAAG
TCAATAGCAACCTTATGAAGGCTCAATGA
mRNA sequence
ATGGTTGTGTCTAGATTGTTTGCTCCTAACAACAAGAAGGAGATTAAAGGTGCTTTGGAAGATCATTTTACAGTCTCTATTTTGCTTAATCTTTTCACGACGGACAAGGC
TTTACTCAAACTTGATGATGGAATTGAATGGTCAAGGTTCTTGAAGGTATTACGAATTGGAGATGAAAAGCTTCTCGATCTGAGAGATAGTAAGGAGGCTGTCTATGGTG
CTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGCATCCCTTAAGCATGCATTGGCTGCCCTTGAGAAGGAACAGCAGTGGCATAGAGTTGTACAGCTTTTT
AAGGATCTTGAAGCTTTCGGACGTAAACCTCCAGAAAAATCAATAGTGCAGAGGGTAGCAGATGCTTGTGAGATTCTGGGCTTGGTTGAAGAGAAAGAGAGGGTACTAAT
GAAGTACAAATACCTTTTTACAGATGATAAGGATGGATCCATCAAGAAATATAAAAAGGTTTCGTTTGAGAAATCAAAGAGAAAAGGAAAATCGACCAAGGGCACTGAAG
TCAATAGCAACCTTATGAAGGCTCAATGA
Protein sequence
MVVSRLFAPNNKKEIKGALEDHFTVSILLNLFTTDKALLKLDDGIEWSRFLKVLRIGDEKLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQLF
KDLEAFGRKPPEKSIVQRVADACEILGLVEEKERVLMKYKYLFTDDKDGSIKKYKKVSFEKSKRKGKSTKGTEVNSNLMKAQ
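
The protein sequence above appears to be the direct translation of the CDS under the standard genetic code. The following is a minimal sketch of that check, not part of the CuGenDB record: the `translate` helper and the constant names are hypothetical, and the code assumes NCBI translation table 1 and an input containing only A, C, G and T.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in as-is. Codons with ambiguous bases (e.g. N)
    raise KeyError in this sketch.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS listed above.
print(translate("ATGGTTGTGTCTAGATTGTTTGCTCCTAAC"))  # MVVSRLFAPN
# Last 9 nt of the CDS, ending in the TGA stop codon.
print(translate("GCTCAATGA"))  # AQ
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two entries agree.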