; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G008070 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G008070
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionpentatricopeptide repeat-containing protein At4g18975, chloroplastic
Genome locationchr03:12113461..12126343
RNA-Seq ExpressionLsi03G008070
SyntenyLsi03G008070
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031520.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]5.0e-6397.54Show/hide
Query:  MLQLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP
        MLQLLDLRDSKEAVYGALDAWVAWEQDFPIASLKH LAALEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP
Subjt:  MLQLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP

Query:  WQLCRSMISIYYRNKMLEDLVK
        WQLCRSMI+IYYRNKMLEDLVK
Subjt:  WQLCRSMISIYYRNKMLEDLVK

KAG6571702.1 Mediator of RNA polymerase II transcription subunit 15a, partial [Cucurbita argyrosperma subsp. sororia]4.7e-6197.48Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPIASLKHALA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVK
        CRSMISIYYRNKMLEDLVK
Subjt:  CRSMISIYYRNKMLEDLVK

XP_008455250.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis melo]2.1e-6197.48Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLDLRDSKEAVYGALDAWVAWEQDFPIASLKH LAALEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVK
        CRSMI+IYYRNKMLEDLVK
Subjt:  CRSMISIYYRNKMLEDLVK

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima]4.7e-6197.48Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPIASLKHALA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVK
        CRSMISIYYRNKMLEDLVK
Subjt:  CRSMISIYYRNKMLEDLVK

XP_022967611.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 [Cucurbita maxima]4.7e-6197.48Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPIASLKHALA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVK
        CRSMISIYYRNKMLEDLVK
Subjt:  CRSMISIYYRNKMLEDLVK

TrEMBL top hitse value%identityAlignment
A0A1S3C174 pentatricopeptide repeat-containing protein At4g18975, chloroplastic1.0e-6197.48Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLDLRDSKEAVYGALDAWVAWEQDFPIASLKH LAALEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVK
        CRSMI+IYYRNKMLEDLVK
Subjt:  CRSMISIYYRNKMLEDLVK

A0A5A7SLC0 Pentatricopeptide repeat-containing protein2.4e-6397.54Show/hide
Query:  MLQLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP
        MLQLLDLRDSKEAVYGALDAWVAWEQDFPIASLKH LAALEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP
Subjt:  MLQLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP

Query:  WQLCRSMISIYYRNKMLEDLVK
        WQLCRSMI+IYYRNKMLEDLVK
Subjt:  WQLCRSMISIYYRNKMLEDLVK

A0A6J1HFH8 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X22.3e-6197.48Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPIASLKHALA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVK
        CRSMISIYYRNKMLEDLVK
Subjt:  CRSMISIYYRNKMLEDLVK

A0A6J1HSN2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X22.3e-6197.48Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPIASLKHALA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVK
        CRSMISIYYRNKMLEDLVK
Subjt:  CRSMISIYYRNKMLEDLVK

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X12.3e-6197.48Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPIASLKHALA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVK
        CRSMISIYYRNKMLEDLVK
Subjt:  CRSMISIYYRNKMLEDLVK

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic4.6e-2746.03Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L  L + KEAVYGAL+ WVAWE +FPI +   AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+P +L
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKAMVTTPE
           MI++Y  + + + +++      E
Subjt:  CRSMISIYYRNKMLEDLVKAMVTTPE

Q8LG95 Pentatricopeptide repeat-containing protein At4g211901.3e-2443.9Show/hide
Query:  LRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRS
        L + KE VYGALD+++AWE +FP+  +K AL  LE E++W +++QV KWMLSKGQG TM  Y  L+ AL  D+R +EA + W       L   P +    
Subjt:  LRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRS

Query:  MISIYYRNKMLEDLVKAMVTTPE
        MISIYY+  M + L +      E
Subjt:  MISIYYRNKMLEDLVKAMVTTPE

Arabidopsis top hitse value%identityAlignment
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1)2.8e-5165.97Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLD+ D+KEAVYGALDAWVAWE++FPIASLK  +A+LEKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKAMVTTPEGLIDCNGKDRNTPRRHAV
        C  M+ IY+RN ML++LVK          D    DR  P +H V
Subjt:  CRSMISIYYRNKMLEDLVKAMVTTPEGLIDCNGKDRNTPRRHAV

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4)5.5e-5266.67Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLD+ D+KEAVYGALDAWVAWE++FPIASLK  +A+LEKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKAMVTTPEGLIDCNGKDRNTPRRHAV
        C  M+ IY+RN ML++LVK M    +   D    DR  P +H V
Subjt:  CRSMISIYYRNKMLEDLVKAMVTTPEGLIDCNGKDRNTPRRHAV

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-2846.03Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L  L + KEAVYGAL+ WVAWE +FPI +   AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+P +L
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKAMVTTPE
           MI++Y  + + + +++      E
Subjt:  CRSMISIYYRNKMLEDLVKAMVTTPE

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein3.3e-2846.03Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L  L + KEAVYGAL+ WVAWE +FPI +   AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+P +L
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKAMVTTPE
           MI++Y  + + + +++      E
Subjt:  CRSMISIYYRNKMLEDLVKAMVTTPE

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein3.3e-2846.03Show/hide
Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L  L + KEAVYGAL+ WVAWE +FPI +   AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+P +L
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKAMVTTPE
           MI++Y  + + + +++      E
Subjt:  CRSMISIYYRNKMLEDLVKAMVTTPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCAGCTTCTCGATCTGAGAGATAGTAAGGAGGCTGTCTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGCATCCCTTAAGCATGCATT
GGCTGCCCTTGAGAAGGAACAGCAGTGGCATAGAGTTGTACAGGTAATCAAATGGATGTTAAGCAAGGGGCAGGGAACCACAATGAATGTCTATGGGCAGTTAATACGGG
CTTTAGACATGGACCATCGAGCGGAAGAAGCACATAAGTTTTGGGTCATGAAAATTGGTTCAGATCTACATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATATCAATA
TACTACCGAAATAAAATGCTAGAAGATCTTGTAAAGGCTATGGTTACAACTCCTGAAGGCCTTATCGACTGTAATGGAAAAGATAGAAATACCCCTAGGAGGCATGCTGT
CACAGCGCTAGTGCACGACGCGAGGGATAGAAAGCGACCAGTAGCAAGGCACTGCGGCGCTGTAGTGGCCATAAGAAAACAGGGCGCACGCGACAGGCACGATAATGTGC
ATAAGGACCCACTTGCGCACATCTGCAAGGCAGACGCACGCGTGTCGCAGCGCTGCGAAAAGGGACAAGAGTGTCGCGGTGCCATTCTGTGTATAATCATTTACTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGCAGCTTCTCGATCTGAGAGATAGTAAGGAGGCTGTCTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGCATCCCTTAAGCATGCATT
GGCTGCCCTTGAGAAGGAACAGCAGTGGCATAGAGTTGTACAGGTAATCAAATGGATGTTAAGCAAGGGGCAGGGAACCACAATGAATGTCTATGGGCAGTTAATACGGG
CTTTAGACATGGACCATCGAGCGGAAGAAGCACATAAGTTTTGGGTCATGAAAATTGGTTCAGATCTACATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATATCAATA
TACTACCGAAATAAAATGCTAGAAGATCTTGTAAAGGCTATGGTTACAACTCCTGAAGGCCTTATCGACTGTAATGGAAAAGATAGAAATACCCCTAGGAGGCATGCTGT
CACAGCGCTAGTGCACGACGCGAGGGATAGAAAGCGACCAGTAGCAAGGCACTGCGGCGCTGTAGTGGCCATAAGAAAACAGGGCGCACGCGACAGGCACGATAATGTGC
ATAAGGACCCACTTGCGCACATCTGCAAGGCAGACGCACGCGTGTCGCAGCGCTGCGAAAAGGGACAAGAGTGTCGCGGTGCCATTCTGTGTATAATCATTTACTTTTAG
Protein sequenceShow/hide protein sequence
MLQLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHALAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISI
YYRNKMLEDLVKAMVTTPEGLIDCNGKDRNTPRRHAVTALVHDARDRKRPVARHCGAVVAIRKQGARDRHDNVHKDPLAHICKADARVSQRCEKGQECRGAILCIIIYF