; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014187 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014187
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description40S ribosomal protein S29
Genome locationChr02:8262711..8277079
RNA-Seq ExpressionHG10014187
SyntenyHG10014187
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0015935 - small ribosomal subunit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR001209 - Ribosomal protein S14
IPR023676 - Ribosomal protein S14, type Z, archaeal
IPR043140 - Ribosomal protein S14/S29


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1204029.1 40S ribosomal protein S29 [Morella rubra]4.2e-3938.05Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK------------------------MQDYEHSNPTRKNVRVGNVN--
        MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK                        ++++  ++P  + +++   N  
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK------------------------MQDYEHSNPTRKNVRVGNVN--

Query:  -----LYDYSPIDPVPSSKRSIKHGPIEHGSPLIPHMPNPSPPDQPQPVL-----------------------HV-----------DFGVLFLTAIADMS
             L          + + ++ + P    S L+P  P   P     P L                       H+             G+L+ +  A + 
Subjt:  -----LYDYSPIDPVPSSKRSIKHGPIEHGSPLIPHMPNPSPPDQPQPVL-----------------------HV-----------DFGVLFLTAIADMS

Query:  LNHHHVNPNKCLSDI-----------PLNRKLKFQTDEHYSLGR-KKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGD
         NH  ++   C + +             NR+LK   +  Y+LG  KK   G++NLDDY PIDPVPSSK S++PGPI+HGTPL+P++P P PP+ P D
Subjt:  LNHHHVNPNKCLSDI-----------PLNRKLKFQTDEHYSLGR-KKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGD

KAE8649878.1 hypothetical protein Csa_012522 [Cucumis sativus]1.7e-9383.33Show/hide
Query:  SRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIKMQDYEHSNPTRKNVRVGNVNLYDYSPIDPVPSSKRSIKHGPIEHGSPLIPHMPNPSPPDQPQP
        S+T RVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK+QD E SNPTRKNVRVG+VNLYDY PIDPVPSSKRSIKHGPIEHGSPLIPHMP+PSPPDQPQP
Subjt:  SRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIKMQDYEHSNPTRKNVRVGNVNLYDYSPIDPVPSSKRSIKHGPIEHGSPLIPHMPNPSPPDQPQP

Query:  VLHVDFGVLFLTAIADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQ
                       D++LNHHHVN NKCLSDI LNRKLKFQTDEH + GRKKVHP DLNLDDYHPIDPVPSSKTSVKPGPIEHG PLLPHMPNPPPP Q
Subjt:  VLHVDFGVLFLTAIADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQ

Query:  PGDY
        PG Y
Subjt:  PGDY

KAF5740625.1 hypothetical protein HS088_TW11G00702 [Tripterygium wilfordii]7.0e-4260.29Show/hide
Query:  MEWWSKMSSPLRRFTFRVAARLGFRKRGLVKLGRDVKACEYEDVHVMWEMLKRNET----------------------DPKTMGHSNVWNSHPKNYGPGS
        M+WW KM+ P+RR    V   L  R++GL+KL  DV+ACEYED+ +MWEML+++ET                      +  TMGHSNVWNSHPK+YGPGS
Subjt:  MEWWSKMSSPLRRFTFRVAARLGFRKRGLVKLGRDVKACEYEDVHVMWEMLKRNET----------------------DPKTMGHSNVWNSHPKNYGPGS

Query:  RTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK
        R CRVCGNPHG+IRKYGLMCCRQCFRSNAKEIGFIK
Subjt:  RTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK

KAG6591414.1 hypothetical protein SDJN03_13760, partial [Cucurbita argyrosperma subsp. sororia]3.0e-5345.3Show/hide
Query:  EAENSRMEWWSKMSSPLRRFTFRVAARLGFRKRGLVKLGRDVKACEYEDVHVMWEMLKRNETDPKTMGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKY
        EAE + M WWS+M+SPLRRF+FR+A+RLGFRKRGL+KLGRDV+ACEY DV VMWEM+KRNE                                       
Subjt:  EAENSRMEWWSKMSSPLRRFTFRVAARLGFRKRGLVKLGRDVKACEYEDVHVMWEMLKRNETDPKTMGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKY

Query:  GLMCCRQCFRSNAKEIGFIKMQDYEHSNPTRKNVRVGNVNLYDYSPIDPVPSSKRSIKHGPIEHGSPLIPHMPNPSPPDQPQPVLHVDFGVLFLTAIADM
                     KE+  + +Q+       R+  R   +N+  +                                                     +DM
Subjt:  GLMCCRQCFRSNAKEIGFIKMQDYEHSNPTRKNVRVGNVNLYDYSPIDPVPSSKRSIKHGPIEHGSPLIPHMPNPSPPDQPQPVLHVDFGVLFLTAIADM

Query:  SLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGDY
        +L+HHHVNPN+CLSDIP NRKLKFQTD+H SLG+KKVHPGDL LDDY  IDPVPSSKTSVKPGPIEHG PLLPHMP PPPPDQPGDY
Subjt:  SLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGDY

XP_038899330.1 uncharacterized protein LOC120086661 [Benincasa hispida]9.1e-4292.22Show/hide
Query:  ADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGDY
        +D++LN+HHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHG PLLPHMPN PPP QPGDY
Subjt:  ADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGDY

TrEMBL top hitse value%identityAlignment
A0A067KZM9 Uncharacterized protein1.7e-3846.12Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIKMQDYEHSNPTRKNVRVGNVNLYDYSPIDPVPSSKRSIKHGPIEHGS
        MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK+     S      + VG                             
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIKMQDYEHSNPTRKNVRVGNVNLYDYSPIDPVPSSKRSIKHGPIEHGS

Query:  PLIPHMPNPSPPDQPQPVLHVDFG-VLFLTAIADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPI
                         ++ ++F  V    +I+DM++           S + ++RKLK  +  H +   +  H   L+LDDYHPIDPVPSSK S+KPGPI
Subjt:  PLIPHMPNPSPPDQPQPVLHVDFG-VLFLTAIADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPI

Query:  EHGTPLLPHMPNPPPPDQP
        EHGTPL P++P   PP  P
Subjt:  EHGTPLLPHMPNPPPPDQP

A0A1S3BV38 uncharacterized protein LOC1034935957.8e-3979.59Show/hide
Query:  GVLFLTAIADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGDY
        G +  ++ +D++LN HHVN  KCLSD+PLNRKLKFQTDEH +LGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEH  PLLPHMPNPPPP+QPG Y
Subjt:  GVLFLTAIADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGDY

A0A5A7VB21 Uncharacterized protein1.3e-3886.67Show/hide
Query:  ADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGDY
        AD++LN HHVN  KCLSD+PLNRKLKFQTDEH +LGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEH  PLLPHMPNPPPP+QPG Y
Subjt:  ADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGDY

A0A6A1UUD7 40S ribosomal protein S292.0e-3938.05Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK------------------------MQDYEHSNPTRKNVRVGNVN--
        MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK                        ++++  ++P  + +++   N  
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK------------------------MQDYEHSNPTRKNVRVGNVN--

Query:  -----LYDYSPIDPVPSSKRSIKHGPIEHGSPLIPHMPNPSPPDQPQPVL-----------------------HV-----------DFGVLFLTAIADMS
             L          + + ++ + P    S L+P  P   P     P L                       H+             G+L+ +  A + 
Subjt:  -----LYDYSPIDPVPSSKRSIKHGPIEHGSPLIPHMPNPSPPDQPQPVL-----------------------HV-----------DFGVLFLTAIADMS

Query:  LNHHHVNPNKCLSDI-----------PLNRKLKFQTDEHYSLGR-KKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGD
         NH  ++   C + +             NR+LK   +  Y+LG  KK   G++NLDDY PIDPVPSSK S++PGPI+HGTPL+P++P P PP+ P D
Subjt:  LNHHHVNPNKCLSDI-----------PLNRKLKFQTDEHYSLGR-KKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGD

A0A7J7D2Q4 Uncharacterized protein3.4e-4260.29Show/hide
Query:  MEWWSKMSSPLRRFTFRVAARLGFRKRGLVKLGRDVKACEYEDVHVMWEMLKRNET----------------------DPKTMGHSNVWNSHPKNYGPGS
        M+WW KM+ P+RR    V   L  R++GL+KL  DV+ACEYED+ +MWEML+++ET                      +  TMGHSNVWNSHPK+YGPGS
Subjt:  MEWWSKMSSPLRRFTFRVAARLGFRKRGLVKLGRDVKACEYEDVHVMWEMLKRNET----------------------DPKTMGHSNVWNSHPKNYGPGS

Query:  RTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK
        R CRVCGNPHG+IRKYGLMCCRQCFRSNAKEIGFIK
Subjt:  RTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK

SwissProt top hitse value%identityAlignment
Q4PM47 40S ribosomal protein S291.6e-1769.09Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIKM
        MGH N+W SHP+ YGPGSR CRVC N HGLIRKYGL  CR+CFR  A +IGF K+
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIKM

Q5I7K3 40S ribosomal protein S291.0e-2490.74Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK
        MGHSNVWNSHPKNYG GSR CRVCGN HGLIRKYGLMCCRQCFRS+AK+IGFIK
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK

Q680P8 40S ribosomal protein S296.1e-2592.59Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK
        MGHSNVWNSHPK YGPGSR CRVCGN HGLIRKYGL CCRQCFRSNAKEIGFIK
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK

Q6F473 40S ribosomal protein S293.6e-1767.27Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIKM
        MGH+N+W SHP+ YG GSR+CR C N HGLIRKYGL  CRQCFR  A +IGF K+
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIKM

Q7XYB0 40S ribosomal protein S294.3e-1868.52Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK
        MGH N+W SHP+ YG GSR+C++CGN HGLIRKY +  CRQCFR  AK+IGFIK
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK

Arabidopsis top hitse value%identityAlignment
AT3G43980.1 Ribosomal protein S14p/S29e family protein4.4e-2692.59Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK
        MGHSNVWNSHPK YGPGSR CRVCGN HGLIRKYGL CCRQCFRSNAKEIGFIK
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK

AT3G44010.1 Ribosomal protein S14p/S29e family protein4.4e-2692.59Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK
        MGHSNVWNSHPK YGPGSR CRVCGN HGLIRKYGL CCRQCFRSNAKEIGFIK
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK

AT4G33865.1 Ribosomal protein S14p/S29e family protein4.4e-2692.59Show/hide
Query:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK
        MGHSNVWNSHPK YGPGSR CRVCGN HGLIRKYGL CCRQCFRSNAKEIGFIK
Subjt:  MGHSNVWNSHPKNYGPGSRTCRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIK

AT5G01075.1 Glycosyl hydrolase family 35 protein5.4e-0860.98Show/hide
Query:  LNLDDYH-PIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPP
        + L+DY+ P+DP P++K S+KPGPIEHGTPL P++P PP P
Subjt:  LNLDDYH-PIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPP

AT5G43150.1 unknown protein2.7e-1251.85Show/hide
Query:  EWWSKMSSPLRRFTFRVAARLGFRKRGLVKLGRDVKACEYEDVHVMWEMLKRNE
        EWW+ M+ P RR   R   R+GFR  GL++L  DV +CEYED+H+MW +L +NE
Subjt:  EWWSKMSSPLRRFTFRVAARLGFRKRGLVKLGRDVKACEYEDVHVMWEMLKRNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGGAGGCAATGGCCAGAAGGCTAAAATGGCTCGGGAGAGAAACCTGGAGAAACAAAAGTCATCCAGAGAAGCAGAGAACTCGAGAATGGAATGGTGGAGCAAGAT
GTCTTCTCCCTTGCGAAGATTTACCTTCAGAGTCGCCGCTCGTCTCGGATTTCGCAAACGCGGGCTAGTGAAACTTGGGCGAGATGTGAAGGCATGTGAATACGAGGACG
TGCACGTGATGTGGGAGATGCTGAAGAGAAACGAAACGGATCCGAAGACGATGGGTCATTCAAACGTGTGGAATTCACATCCAAAGAACTATGGACCTGGTTCTCGTACT
TGTCGAGTCTGCGGAAACCCCCATGGATTGATCCGCAAGTATGGCCTGATGTGCTGCAGGCAGTGCTTCCGTAGTAATGCCAAAGAAATTGGCTTCATTAAGATGCAGGA
TTACGAGCATAGTAACCCGACGAGGAAGAATGTCCGTGTGGGTAACGTAAATCTTTACGACTACAGCCCCATTGATCCAGTTCCAAGCTCAAAGAGATCCATAAAACATG
GACCAATAGAGCATGGCTCTCCTCTTATACCTCATATGCCAAATCCTTCCCCTCCTGATCAACCTCAGCCTGTTTTGCACGTTGATTTTGGTGTGCTTTTCTTGACTGCA
ATCGCAGATATGTCTCTGAACCACCACCATGTTAATCCAAACAAATGCTTGTCTGACATACCATTGAACAGGAAGCTGAAGTTCCAGACTGATGAACATTATAGCCTGGG
GAGAAAGAAGGTTCACCCAGGTGATCTAAATCTTGATGACTACCATCCCATTGATCCAGTTCCAAGTTCAAAGACATCAGTCAAACCCGGTCCAATAGAGCACGGCACTC
CTCTCCTGCCTCATATGCCAAATCCTCCACCTCCTGATCAGCCAGGCGATTACCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCGGAGGCAATGGCCAGAAGGCTAAAATGGCTCGGGAGAGAAACCTGGAGAAACAAAAGTCATCCAGAGAAGCAGAGAACTCGAGAATGGAATGGTGGAGCAAGAT
GTCTTCTCCCTTGCGAAGATTTACCTTCAGAGTCGCCGCTCGTCTCGGATTTCGCAAACGCGGGCTAGTGAAACTTGGGCGAGATGTGAAGGCATGTGAATACGAGGACG
TGCACGTGATGTGGGAGATGCTGAAGAGAAACGAAACGGATCCGAAGACGATGGGTCATTCAAACGTGTGGAATTCACATCCAAAGAACTATGGACCTGGTTCTCGTACT
TGTCGAGTCTGCGGAAACCCCCATGGATTGATCCGCAAGTATGGCCTGATGTGCTGCAGGCAGTGCTTCCGTAGTAATGCCAAAGAAATTGGCTTCATTAAGATGCAGGA
TTACGAGCATAGTAACCCGACGAGGAAGAATGTCCGTGTGGGTAACGTAAATCTTTACGACTACAGCCCCATTGATCCAGTTCCAAGCTCAAAGAGATCCATAAAACATG
GACCAATAGAGCATGGCTCTCCTCTTATACCTCATATGCCAAATCCTTCCCCTCCTGATCAACCTCAGCCTGTTTTGCACGTTGATTTTGGTGTGCTTTTCTTGACTGCA
ATCGCAGATATGTCTCTGAACCACCACCATGTTAATCCAAACAAATGCTTGTCTGACATACCATTGAACAGGAAGCTGAAGTTCCAGACTGATGAACATTATAGCCTGGG
GAGAAAGAAGGTTCACCCAGGTGATCTAAATCTTGATGACTACCATCCCATTGATCCAGTTCCAAGTTCAAAGACATCAGTCAAACCCGGTCCAATAGAGCACGGCACTC
CTCTCCTGCCTCATATGCCAAATCCTCCACCTCCTGATCAGCCAGGCGATTACCCTTAG
Protein sequenceShow/hide protein sequence
MGGGNGQKAKMARERNLEKQKSSREAENSRMEWWSKMSSPLRRFTFRVAARLGFRKRGLVKLGRDVKACEYEDVHVMWEMLKRNETDPKTMGHSNVWNSHPKNYGPGSRT
CRVCGNPHGLIRKYGLMCCRQCFRSNAKEIGFIKMQDYEHSNPTRKNVRVGNVNLYDYSPIDPVPSSKRSIKHGPIEHGSPLIPHMPNPSPPDQPQPVLHVDFGVLFLTA
IADMSLNHHHVNPNKCLSDIPLNRKLKFQTDEHYSLGRKKVHPGDLNLDDYHPIDPVPSSKTSVKPGPIEHGTPLLPHMPNPPPPDQPGDYP