; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018183 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018183
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr04:1371967..1373903
RNA-Seq ExpressionHG10018183
SyntenyHG10018183
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032594.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]5.7e-6365.44Show/hide
Query:  MVRTTFPFTCIFDFNSGGSCQFTNLLS------TRNLLHYS----------------------------YANSIASISVGNPQIWPLYAIKCLSHQSSTT
        MVRTT PFTCIFDF+SG  C+FTNLLS      TR L                                  N IASI VGN QIWPLYAIKC SHQSS+T
Subjt:  MVRTTFPFTCIFDFNSGGSCQFTNLLS------TRNLLHYS----------------------------YANSIASISVGNPQIWPLYAIKCLSHQSSTT

Query:  NISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSS
        NISPDEVKVGDEVLNQ+IA RENAS CSHE  DACID++C  G+LAAA+QLLKSLC+ K SL+SSKAYDMVLLAASERGDT LLCQVFK ++VS KSLSS
Subjt:  NISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSS

Query:  TSYMNFAKAFTRTNDSS
         SYM+FA+AFT+TNDSS
Subjt:  TSYMNFAKAFTRTNDSS

KAG6583593.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.7e-6281.76Show/hide
Query:  LLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDR
        LLS RN+LHYSY NSI S+ VGNPQ W LYAI+   HQ STTNISPDE KV DEVLNQ+ A RENAS CSHETFD CID+MCRS NL AA+QLLKSLCDR
Subjt:  LLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDR

Query:  KTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS
        K SLSSSKAYDMVLLAASERGDT LLCQVFKDSLVSRK LSSTSYMNFAKAF RT+DSS
Subjt:  KTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS

XP_022964991.1 pentatricopeptide repeat-containing protein At1g11900-like [Cucurbita moschata]1.1e-6181.13Show/hide
Query:  LLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDR
        LLS RN+LHYSY NSI S+ VGNPQ W LYAI+   HQ STTNISPDE KV DEVLNQ+ A RENAS CSHETFD CID+MCRS NL AA+QLLKS CDR
Subjt:  LLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDR

Query:  KTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS
        K SLSSSKAYDMVLLAASERGDT LLCQVFKDSLVSRK LSSTSYMNFAKAF RT+DSS
Subjt:  KTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS

XP_022970322.1 pentatricopeptide repeat-containing protein At1g11900 [Cucurbita maxima]3.0e-6483.12Show/hide
Query:  NLLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCD
        +LLS RN+LHYSY NSI SI VGNPQ W LYAI+   HQSST NISPDE KV DEVLNQ+ A RENASRCSHETFD CID+MCRSGNL AA+QLLKSLCD
Subjt:  NLLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCD

Query:  RKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS
        RK SLSSSKAYDMVLLAASERGDT LLCQVFKDSLVSRK LSSTSYMNFAKAF RT+DSS
Subjt:  RKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS

XP_023518970.1 pentatricopeptide repeat-containing protein At1g11900 [Cucurbita pepo subsp. pepo]1.4e-6181.25Show/hide
Query:  NLLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCD
        +LLS RN+LHYSY NSI SI VGNPQ W LYAI+   HQ STTNISPDE KV DEVLNQ    RENAS CSHETFD CID+MCRS NL AA+QLLKSLCD
Subjt:  NLLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCD

Query:  RKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS
        RK SLSSSKAYDMVLLAASERGDT LLCQVFKDSLVSRK LSSTSYMNFAKAF RT+DSS
Subjt:  RKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS

TrEMBL top hitse value%identityAlignment
A0A1S3CH34 pentatricopeptide repeat-containing protein At1g11900 isoform X22.6e-6181.29Show/hide
Query:  RNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSL
        RNLLHYSYAN IASI VGN QIWPLYAIKC SHQSS+TNISPDEVKVGDEVLNQ+IA RENAS CSHE  DACID++C  G+LAAA+QLLKSLC+ K SL
Subjt:  RNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSL

Query:  SSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS
        +SSKAYDMVLLAASERGDT LLCQVFK ++VS KSLSS SYM+FA+AFT+TNDSS
Subjt:  SSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS

A0A1S3CIJ8 pentatricopeptide repeat-containing protein At1g11900 isoform X12.6e-6181.29Show/hide
Query:  RNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSL
        RNLLHYSYAN IASI VGN QIWPLYAIKC SHQSS+TNISPDEVKVGDEVLNQ+IA RENAS CSHE  DACID++C  G+LAAA+QLLKSLC+ K SL
Subjt:  RNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSL

Query:  SSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS
        +SSKAYDMVLLAASERGDT LLCQVFK ++VS KSLSS SYM+FA+AFT+TNDSS
Subjt:  SSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS

A0A5D3DB98 Pentatricopeptide repeat-containing protein2.8e-6365.44Show/hide
Query:  MVRTTFPFTCIFDFNSGGSCQFTNLLS------TRNLLHYS----------------------------YANSIASISVGNPQIWPLYAIKCLSHQSSTT
        MVRTT PFTCIFDF+SG  C+FTNLLS      TR L                                  N IASI VGN QIWPLYAIKC SHQSS+T
Subjt:  MVRTTFPFTCIFDFNSGGSCQFTNLLS------TRNLLHYS----------------------------YANSIASISVGNPQIWPLYAIKCLSHQSSTT

Query:  NISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSS
        NISPDEVKVGDEVLNQ+IA RENAS CSHE  DACID++C  G+LAAA+QLLKSLC+ K SL+SSKAYDMVLLAASERGDT LLCQVFK ++VS KSLSS
Subjt:  NISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSS

Query:  TSYMNFAKAFTRTNDSS
         SYM+FA+AFT+TNDSS
Subjt:  TSYMNFAKAFTRTNDSS

A0A6J1HKH0 pentatricopeptide repeat-containing protein At1g11900-like5.2e-6281.13Show/hide
Query:  LLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDR
        LLS RN+LHYSY NSI S+ VGNPQ W LYAI+   HQ STTNISPDE KV DEVLNQ+ A RENAS CSHETFD CID+MCRS NL AA+QLLKS CDR
Subjt:  LLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCDR

Query:  KTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS
        K SLSSSKAYDMVLLAASERGDT LLCQVFKDSLVSRK LSSTSYMNFAKAF RT+DSS
Subjt:  KTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS

A0A6J1HYS7 pentatricopeptide repeat-containing protein At1g119001.5e-6483.12Show/hide
Query:  NLLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCD
        +LLS RN+LHYSY NSI SI VGNPQ W LYAI+   HQSST NISPDE KV DEVLNQ+ A RENASRCSHETFD CID+MCRSGNL AA+QLLKSLCD
Subjt:  NLLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGNLAAASQLLKSLCD

Query:  RKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS
        RK SLSSSKAYDMVLLAASERGDT LLCQVFKDSLVSRK LSSTSYMNFAKAF RT+DSS
Subjt:  RKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSS

SwissProt top hitse value%identityAlignment
Q5BIV3 Pentatricopeptide repeat-containing protein At1g119008.7e-0631.36Show/hide
Query:  DEVLNQVIAQRENASR-CSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLV--SRKSLSSTSYMNFA
        +E+L +++   E+ S+  S   +   +++  R GNL+ A  LL+SL ++   L  S  +  +L AA E  D  L C+VF++ L+   ++ LSS  Y+N A
Subjt:  DEVLNQVIAQRENASR-CSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLV--SRKSLSSTSYMNFA

Query:  KAFTRTNDSSSYRNMSKK
        +AF  T+D +   ++ K+
Subjt:  KAFTRTNDSSSYRNMSKK

Arabidopsis top hitse value%identityAlignment
AT1G11900.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.2e-0731.36Show/hide
Query:  DEVLNQVIAQRENASR-CSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLV--SRKSLSSTSYMNFA
        +E+L +++   E+ S+  S   +   +++  R GNL+ A  LL+SL ++   L  S  +  +L AA E  D  L C+VF++ L+   ++ LSS  Y+N A
Subjt:  DEVLNQVIAQRENASR-CSHETFDACIDEMCRSGNLAAASQLLKSLCDRKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLV--SRKSLSSTSYMNFA

Query:  KAFTRTNDSSSYRNMSKK
        +AF  T+D +   ++ K+
Subjt:  KAFTRTNDSSSYRNMSKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAACCACTTTCCCATTTACTTGCATTTTCGATTTCAATTCTGGCGGGTCATGTCAGTTTACGAACTTGCTATCTACTAGGAATCTTCTGCATTACTCGTACGC
TAATAGCATTGCATCTATCTCTGTTGGTAACCCTCAAATCTGGCCGCTTTATGCCATCAAATGCCTTAGCCATCAGTCATCTACTACAAATATCTCTCCTGATGAAGTGA
AAGTGGGGGATGAAGTCTTGAATCAGGTTATTGCCCAAAGGGAAAATGCCTCAAGGTGTAGCCATGAAACCTTTGATGCTTGCATTGATGAGATGTGTCGATCGGGAAAT
CTTGCAGCTGCTTCTCAATTACTTAAATCATTGTGCGATAGGAAAACATCTCTTAGCTCTTCCAAGGCTTATGATATGGTTTTGCTTGCAGCAAGTGAAAGGGGAGATAC
TGCCCTTTTATGTCAAGTCTTTAAAGATTCCCTGGTTTCCCGTAAATCATTGAGTTCAACCTCTTACATGAATTTTGCCAAGGCCTTTACCAGGACAAATGATAGTAGCA
GCTACCGGAATATGTCAAAGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGAACCACTTTCCCATTTACTTGCATTTTCGATTTCAATTCTGGCGGGTCATGTCAGTTTACGAACTTGCTATCTACTAGGAATCTTCTGCATTACTCGTACGC
TAATAGCATTGCATCTATCTCTGTTGGTAACCCTCAAATCTGGCCGCTTTATGCCATCAAATGCCTTAGCCATCAGTCATCTACTACAAATATCTCTCCTGATGAAGTGA
AAGTGGGGGATGAAGTCTTGAATCAGGTTATTGCCCAAAGGGAAAATGCCTCAAGGTGTAGCCATGAAACCTTTGATGCTTGCATTGATGAGATGTGTCGATCGGGAAAT
CTTGCAGCTGCTTCTCAATTACTTAAATCATTGTGCGATAGGAAAACATCTCTTAGCTCTTCCAAGGCTTATGATATGGTTTTGCTTGCAGCAAGTGAAAGGGGAGATAC
TGCCCTTTTATGTCAAGTCTTTAAAGATTCCCTGGTTTCCCGTAAATCATTGAGTTCAACCTCTTACATGAATTTTGCCAAGGCCTTTACCAGGACAAATGATAGTAGCA
GCTACCGGAATATGTCAAAGAAATAA
Protein sequenceShow/hide protein sequence
MVRTTFPFTCIFDFNSGGSCQFTNLLSTRNLLHYSYANSIASISVGNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQVIAQRENASRCSHETFDACIDEMCRSGN
LAAASQLLKSLCDRKTSLSSSKAYDMVLLAASERGDTALLCQVFKDSLVSRKSLSSTSYMNFAKAFTRTNDSSSYRNMSKK