CuGenDBv2

HG10007891 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10007891
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr10:16638263..16638583
RNA-Seq Expression: HG10007891
Synteny: HG10007891
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
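
As a rough illustration of how the genome location field can be used, the Python sketch below parses the `Chr10:16638263..16638583` string and computes the length of the interval. The function name `parse_location` is hypothetical, and the sketch assumes both coordinates are 1-based and inclusive, which the record does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'Chr10:16638263..16638583' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr10:16638263..16638583")

# Assumption: both coordinates are 1-based and inclusive, so the
# interval length is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the interval is 321 bp long, the same length as the CDS listed under Sequences.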


Homology
GenBank top hits | e value | % identity | Alignment
KAG6589945.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.6e-38 | % identity: 80.2
Query:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL
        +FKPDNVTFVGVLSACLHSN+IEQ Q +FDS SNQHGLTP+LDHYACM+NLLG   RIDQAV+LIKSMPHEPDFLIWS LL V ATKGDVA+A +A  HL
Subjt:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL

Query:  F
        F
Subjt:  F
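
The % identity column can be approximated from the middle line of an alignment. The Python sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a blank a mismatch) and that the alignment contains no gap columns; the function name `percent_identity` is hypothetical. It uses the midline of the KAG6589945.1 alignment above, with the two wrapped rows joined.

```python
def percent_identity(midline):
    """Percentage of alignment columns whose midline character is a letter."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline of the KAG6589945.1 alignment: first row (100 columns) plus the
# single wrapped column. Blanks and '+' signs are significant.
MIDLINE = (
    "+FKPDNVTFVGVLSACLHSN+IEQ Q +FDS SNQHGLTP+LDHYACM+NLLG   RIDQAV+LIKSMPHEPDFLIWS LL V ATKGDVA+A +A  HL"
    "F"
)

print(len(MIDLINE), round(percent_identity(MIDLINE), 2))
```

With 81 identical columns out of 101, the result rounds to the 80.2 reported for this hit.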

KAG7023609.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.6e-38 | % identity: 80.2
Query:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL
        +FKPDNVTFVGVLSACLHSN+IEQ Q +FDS SNQHGLTP+LDHYACM+NLLG   RIDQAV+LIKSMPHEPDFLIWS LL V ATKGDVA+A +A  HL
Subjt:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL

Query:  F
        F
Subjt:  F

XP_016902711.1 PREDICTED: pentatricopeptide repeat-containing protein At4g02750-like [Cucumis melo] | e value: 2.1e-38 | % identity: 74.51
Query:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH
        ++FKPDNVTF+G+LSACLH NWIEQ Q YFDS SNQHGLTPTLDHYACM+NLLG   RI+QAV+LIK+M HEPDFLIWS LL + +TKGD+ NA +AA H
Subjt:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH

Query:  LF
        LF
Subjt:  LF

XP_022145099.1 pentatricopeptide repeat-containing protein At4g02750-like [Momordica charantia] | e value: 1.2e-38 | % identity: 77.45
Query:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH
        ++FKPDNVTF+GVLSACLHSNWIE+ Q YFDS SNQHGL PT+DHYACM+NLLG L RIDQAV+LIKSMPHEPD LIWS LL V A KGD+ANA +AA +
Subjt:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH

Query:  LF
        LF
Subjt:  LF

XP_022987632.1 pentatricopeptide repeat-containing protein At4g02750-like isoform X1 [Cucurbita maxima] | e value: 1.2e-38 | % identity: 81.19
Query:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL
        +FKPDNVTFVGVLSACLHSN IEQ Q +FDS SNQHGLTP+LDHYACM+NLLG   RIDQAVNLIKSMPHEPDFLIWS LL V ATKGDVA A +A  HL
Subjt:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL

Query:  F
        F
Subjt:  F

TrEMBL top hits | e value | % identity | Alignment
A0A1S4E3A6 pentatricopeptide repeat-containing protein At4g02750-like | e value: 1.0e-38 | % identity: 74.51
Query:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH
        ++FKPDNVTF+G+LSACLH NWIEQ Q YFDS SNQHGLTPTLDHYACM+NLLG   RI+QAV+LIK+M HEPDFLIWS LL + +TKGD+ NA +AA H
Subjt:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH

Query:  LF
        LF
Subjt:  LF

A0A5A7UC76 Pentatricopeptide repeat-containing protein | e value: 1.0e-38 | % identity: 74.51
Query:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH
        ++FKPDNVTF+G+LSACLH NWIEQ Q YFDS SNQHGLTPTLDHYACM+NLLG   RI+QAV+LIK+M HEPDFLIWS LL + +TKGD+ NA +AA H
Subjt:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH

Query:  LF
        LF
Subjt:  LF

A0A6J1CU81 pentatricopeptide repeat-containing protein At4g02750-like | e value: 5.9e-39 | % identity: 77.45
Query:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH
        ++FKPDNVTF+GVLSACLHSNWIE+ Q YFDS SNQHGL PT+DHYACM+NLLG L RIDQAV+LIKSMPHEPD LIWS LL V A KGD+ANA +AA +
Subjt:  RRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWH

Query:  LF
        LF
Subjt:  LF

A0A6J1HBT0 pentatricopeptide repeat-containing protein At4g02750-like isoform X1 | e value: 2.2e-38 | % identity: 79.21
Query:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL
        +FKPDNVTFVGVLSACLHSN+IEQ Q +FDS SNQHGLTP+LDHYACM+NLLG   RIDQAV+LIKSMPHEPDFLIWS LL V ATKGDVA+A +   HL
Subjt:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL

Query:  F
        F
Subjt:  F

A0A6J1JAW4 pentatricopeptide repeat-containing protein At4g02750-like isoform X1 | e value: 5.9e-39 | % identity: 81.19
Query:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL
        +FKPDNVTFVGVLSACLHSN IEQ Q +FDS SNQHGLTP+LDHYACM+NLLG   RIDQAVNLIKSMPHEPDFLIWS LL V ATKGDVA A +A  HL
Subjt:  RFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL

Query:  F
        F
Subjt:  F

SwissProt top hits | e value | % identity | Alignment
O81767 Pentatricopeptide repeat-containing protein At4g33990 | e value: 7.7e-20 | % identity: 43.43
Query:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF
        KPD++TFV +LSAC HS  +++ QW F+     +G+TP+L HY CM+++ G   +++ A+  IKSM  +PD  IW ALL      G+V    IA+ HLF
Subjt:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF

P0C8Q8 Pentatricopeptide repeat-containing protein At5g19020, mitochondrial | e value: 4.5e-20 | % identity: 44.9
Query:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL
        KP+++TFVGVLSAC H+  +E  + YF+S  + HG+ P + HY CM++LLG   R+++A  +IK MP + D +IW  LL    T G+V  A +AA  L
Subjt:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL

Q9FFG8 Pentatricopeptide repeat-containing protein At5g44230 | e value: 5.5e-18 | % identity: 46.46
Query:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF
        KP+ VTFVG L AC HS  ++Q +  FDS     G+ PT DHY CM++LLG   R+ +A+ LIK+M  EP   +W ALL       +   A IAA HLF
Subjt:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF

Q9FNN7 Pentatricopeptide repeat-containing protein At5g08510 | e value: 1.5e-18 | % identity: 44.44
Query:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF
        KPD VTFVG+L AC+H   + + Q  F S    H ++P L+HY CMI+LLG + ++ +A +LIK+MP +PD ++W  LL   +  G+V  A IA+  LF
Subjt:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF

Q9LNU6 Pentatricopeptide repeat-containing protein At1g20230 | e value: 1.6e-17 | % identity: 42.31
Query:  MIRRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAA
        M  R KPD ++F  +LSAC      ++   YF   S ++G+ P L+HY+CM+NLLG   ++ +A +LIK MP EPD  +W ALL     + +V  A IAA
Subjt:  MIRRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAA

Query:  WHLF
          LF
Subjt:  WHLF

Arabidopsis top hits | e value | % identity | Alignment
AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.1e-18 | % identity: 42.31
Query:  MIRRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAA
        M  R KPD ++F  +LSAC      ++   YF   S ++G+ P L+HY+CM+NLLG   ++ +A +LIK MP EPD  +W ALL     + +V  A IAA
Subjt:  MIRRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAA

Query:  WHLF
          LF
Subjt:  WHLF

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 5.5e-21 | % identity: 43.43
Query:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF
        KPD++TFV +LSAC HS  +++ QW F+     +G+TP+L HY CM+++ G   +++ A+  IKSM  +PD  IW ALL      G+V    IA+ HLF
Subjt:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF

AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.0e-19 | % identity: 44.44
Query:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF
        KPD VTFVG+L AC+H   + + Q  F S    H ++P L+HY CMI+LLG + ++ +A +LIK+MP +PD ++W  LL   +  G+V  A IA+  LF
Subjt:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF

AT5G19020.1 mitochondrial editing factor 18 | e value: 3.2e-21 | % identity: 44.9
Query:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL
        KP+++TFVGVLSAC H+  +E  + YF+S  + HG+ P + HY CM++LLG   R+++A  +IK MP + D +IW  LL    T G+V  A +AA  L
Subjt:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHL

AT5G44230.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 3.9e-19 | % identity: 46.46
Query:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF
        KP+ VTFVG L AC HS  ++Q +  FDS     G+ PT DHY CM++LLG   R+ +A+ LIK+M  EP   +W ALL       +   A IAA HLF
Subjt:  KPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF


Sequences
CDS sequence
ATGGAGATGATAAGGAGATTTAAACCTGATAATGTAACTTTTGTAGGCGTTTTATCTGCTTGTCTCCATTCTAATTGGATCGAGCAATGGCAGTGGTACTTTGATTCTAC
AAGCAATCAACATGGACTGACACCAACTTTAGATCATTATGCATGTATGATCAATCTCTTAGGATGTTTGGCCCGCATCGATCAAGCAGTTAATCTAATAAAAAGTATGC
CCCATGAACCAGATTTCCTGATTTGGTCCGCACTTCTATATGTTATCGCAACAAAGGGTGATGTTGCAAATGCAGTAATAGCAGCTTGGCATCTCTTTTGA
mRNA sequence
ATGGAGATGATAAGGAGATTTAAACCTGATAATGTAACTTTTGTAGGCGTTTTATCTGCTTGTCTCCATTCTAATTGGATCGAGCAATGGCAGTGGTACTTTGATTCTAC
AAGCAATCAACATGGACTGACACCAACTTTAGATCATTATGCATGTATGATCAATCTCTTAGGATGTTTGGCCCGCATCGATCAAGCAGTTAATCTAATAAAAAGTATGC
CCCATGAACCAGATTTCCTGATTTGGTCCGCACTTCTATATGTTATCGCAACAAAGGGTGATGTTGCAAATGCAGTAATAGCAGCTTGGCATCTCTTTTGA
Protein sequence
MEMIRRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF
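
The relationship between the CDS and the protein sequence above can be checked with a short translation. The Python sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered by
# first, second, third base over T, C, A, G. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS codon by codon and drop the trailing stop codon."""
    codons = (cds[i:i + 3] for i in range(0, len(cds) - 2, 3))
    protein = "".join(CODON_TABLE[codon] for codon in codons)
    return protein.rstrip("*")


# CDS and protein copied from the record (CDS lines joined).
CDS = (
    "ATGGAGATGATAAGGAGATTTAAACCTGATAATGTAACTTTTGTAGGCGTTTTATCTGCTTGTCTCCATTCTAATTGGATCGAGCAATGGCAGTGGTACTTTGATTCTAC"
    "AAGCAATCAACATGGACTGACACCAACTTTAGATCATTATGCATGTATGATCAATCTCTTAGGATGTTTGGCCCGCATCGATCAAGCAGTTAATCTAATAAAAAGTATGC"
    "CCCATGAACCAGATTTCCTGATTTGGTCCGCACTTCTATATGTTATCGCAACAAAGGGTGATGTTGCAAATGCAGTAATAGCAGCTTGGCATCTCTTTTGA"
)
PROTEIN = (
    "MEMIRRFKPDNVTFVGVLSACLHSNWIEQWQWYFDSTSNQHGLTPTLDHYACMINLLGCLARIDQAVNLIKSMPHEPDFLIWSALLYVIATKGDVANAVIAAWHLF"
)

print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

The CDS is 321 nt (107 codons including the TGA stop) and the protein is 106 residues, so the two sequences are consistent in length under this assumption.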