CuGenDBv2

CmoCh16G003290 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh16G003290
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: protein SOB FIVE-LIKE 6-like isoform X1
Genome location: Cmo_Chr16:1499033..1500458
RNA-Seq Expression: CmoCh16G003290
Synteny: CmoCh16G003290
Gene Ontology terms: GO:0009691 - cytokinin biosynthetic process (biological process)
GO:0009736 - cytokinin-activated signaling pathway (biological process)
InterPro domains: IPR044670 - SOB-five-Like (SOFL) family


Homology
GenBank top hits (e value, % identity, alignment)
KAG6576907.1 hypothetical protein SDJN03_24481, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.0e-89; % identity: 97.71)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
        MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGS  EEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC

Query:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEG
        QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQT+HFEE SACTDHFGFLQASLSGNRLQKNQWFEGEG
Subjt:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEG

XP_022922888.1 uncharacterized protein LOC111430730 [Cucurbita moschata] (e value: 2.0e-97; % identity: 100)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
        MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC

Query:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEGRRDRDEMR
        QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEGRRDRDEMR
Subjt:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEGRRDRDEMR

XP_022985337.1 uncharacterized protein LOC111483377 isoform X1 [Cucurbita maxima] (e value: 1.4e-90; % identity: 97.14)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
        MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQ+ASKSATLGKRKGKKQRIQEHQC
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC

Query:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEG
        QGEPSSSFLDDTASSPALNYTANKFTNQASM+SFLGFSQT+HFE+ SACTDHFGFLQASLSGNRLQKNQWFEGEG
Subjt:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEG

XP_022985339.1 uncharacterized protein LOC111483377 isoform X2 [Cucurbita maxima] (e value: 2.9e-72; % identity: 97.92)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
        MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQ+ASKSATLGKRKGKKQRIQEHQC
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC

Query:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFE
        QGEPSSSFLDDTASSPALNYTANKFTNQASM+SFLGFSQT+HFE
Subjt:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFE

XP_023552790.1 uncharacterized protein LOC111810322 [Cucurbita pepo subsp. pepo] (e value: 1.1e-87; % identity: 96.55)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
        MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWK KGS  EEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC

Query:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGE
        QGEPSSSFLDDTASSPALNYTANKFT+QASMESFLGFSQT+HFEE SACTDHFGFLQASLSGNRLQKNQWFEGE
Subjt:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KV02 Uncharacterized protein (e value: 5.5e-69; % identity: 79.79)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIV---GG---EEYWKAKGSEEEEEEEE---EDLSMVSDASSGPPHFIEDEACSQE-------ASKSAT
        MDLLGSECSSGCESGWTLYLEQSF S G   HIV   GG   E YWK KG+EEE+EEEE   EDLSMVSDASSGPPHFIEDEACS E        SKSAT
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIV---GG---EEYWKAKGSEEEEEEEE---EDLSMVSDASSGPPHFIEDEACSQE-------ASKSAT

Query:  LGKRKGKKQRIQEHQCQGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFE
        LGKRKGKKQRI+E+QCQ EP SSFLDDTASSPALN+TAN FTNQASMESFLG SQTNHFEE SA T+HFGFLQ+SLSGNRLQKNQWFE
Subjt:  LGKRKGKKQRIQEHQCQGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFE

A0A6J1D7H0 uncharacterized protein LOC111017634 (e value: 2.5e-69; % identity: 78.07)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGE-----EYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQE-------ASKSATLGKR
        MDLLGSECSSGCESGWTLYLEQSF S G  D +VGGE      YWKAK SEEEEEEE EDLSMVSDASSGPPHFIED+ACS E       ASKSATLGKR
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGE-----EYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQE-------ASKSATLGKR

Query:  KGKKQRIQEHQCQGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEG
        KGKKQ+I+E+Q   EPSSSFLDDTASSPALN+T N FTNQASM+SF+ FSQT+HFEE SA TDHFGFLQ+SLSGNR+QKNQWFEG+G
Subjt:  KGKKQRIQEHQCQGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEG

A0A6J1E812 uncharacterized protein LOC111430730 (e value: 9.7e-98; % identity: 100)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
        MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC

Query:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEGRRDRDEMR
        QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEGRRDRDEMR
Subjt:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEGRRDRDEMR

A0A6J1J4L5 uncharacterized protein LOC111483377 isoform X1 (e value: 6.7e-91; % identity: 97.14)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
        MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQ+ASKSATLGKRKGKKQRIQEHQC
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC

Query:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEG
        QGEPSSSFLDDTASSPALNYTANKFTNQASM+SFLGFSQT+HFE+ SACTDHFGFLQASLSGNRLQKNQWFEGEG
Subjt:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEG

A0A6J1JD05 uncharacterized protein LOC111483377 isoform X2 (e value: 1.4e-72; % identity: 97.92)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC
        MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQ+ASKSATLGKRKGKKQRIQEHQC
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQC

Query:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFE
        QGEPSSSFLDDTASSPALNYTANKFTNQASM+SFLGFSQT+HFE
Subjt:  QGEPSSSFLDDTASSPALNYTANKFTNQASMESFLGFSQTNHFE

SwissProt top hits (e value, % identity, alignment)
B6IDH8 Protein SOB FIVE-LIKE 6 (e value: 2.0e-07; % identity: 32.8)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIED-------EACSQEASKSATLGKRKGKKQ
        MD    + S   +SGWT+YL  S  S   L H     +Y       E ++E +ED SMVSDASSGPP++ E+       +  +Q   KS +  K K KK 
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIED-------EACSQEASKSATLGKRKGKKQ

Query:  RIQEHQCQGEPSSSFLDDTASS-PALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQ-KNQWFEGEGRR
        ++ E Q   E  +S  DDTASS P     +    +Q   + F  F Q+     I     + GFLQ +   ++L   NQ  + + +R
Subjt:  RIQEHQCQGEPSSSFLDDTASS-PALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQ-KNQWFEGEGRR

Q8L9K4 Protein SOB FIVE-LIKE 5 (e value: 8.3e-14; % identity: 38.15)
Query:  LLGSECSSGCESGWTLYLEQS--------FRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQR
        +LGS  SSGCESGWTLYL+QS        FR     D     ++ W      +EEEEEE+DLSM+SDASSGP +  E+++      K   +G +K  K+ 
Subjt:  LLGSECSSGCESGWTLYLEQS--------FRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQR

Query:  IQEHQCQGEPSSSFLDDTASSPALNYT--------ANKFTNQASMESFLGFSQ---TNHFEEISACTDHFGFL
         ++ +   E  +S LDDTASSP  N+          NK   Q   ES L +SQ      F++ +A  +  G+L
Subjt:  IQEHQCQGEPSSSFLDDTASSPALNYT--------ANKFTNQASMESFLGFSQ---TNHFEEISACTDHFGFL

Arabidopsis top hits (e value, % identity, alignment)
AT1G26210.1 SOB five-like 1 (e value: 6.1e-04; % identity: 33.04)
Query:  SGCESGWTLYLEQSFRSQGCLDHIVGGEE-----------YWKAKG-SEEEEEEEEEDLSMVSDASSGP----PHFIEDEACSQEASKSATLGKRKGKKQ
        S CESGWT+Y+E +F        +V  ++           Y    G + ++  +EE D SM SDASSGP    P  I   A  +  SK   L KR+  ++
Subjt:  SGCESGWTLYLEQSFRSQGCLDHIVGGEE-----------YWKAKG-SEEEEEEEEEDLSMVSDASSGP----PHFIEDEACSQEASKSATLGKRKGKKQ

Query:  RIQEHQCQGEPS
         I     +GE S
Subjt:  RIQEHQCQGEPS

AT1G58460.1 unknown protein (e value: 1.4e-08; % identity: 32.8)
Query:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIED-------EACSQEASKSATLGKRKGKKQ
        MD    + S   +SGWT+YL  S  S   L H     +Y       E ++E +ED SMVSDASSGPP++ E+       +  +Q   KS +  K K KK 
Subjt:  MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIED-------EACSQEASKSATLGKRKGKKQ

Query:  RIQEHQCQGEPSSSFLDDTASS-PALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQ-KNQWFEGEGRR
        ++ E Q   E  +S  DDTASS P     +    +Q   + F  F Q+     I     + GFLQ +   ++L   NQ  + + +R
Subjt:  RIQEHQCQGEPSSSFLDDTASS-PALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQ-KNQWFEGEGRR

AT4G33800.1 unknown protein (e value: 5.9e-15; % identity: 38.15)
Query:  LLGSECSSGCESGWTLYLEQS--------FRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQR
        +LGS  SSGCESGWTLYL+QS        FR     D     ++ W      +EEEEEE+DLSM+SDASSGP +  E+++      K   +G +K  K+ 
Subjt:  LLGSECSSGCESGWTLYLEQS--------FRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQR

Query:  IQEHQCQGEPSSSFLDDTASSPALNYT--------ANKFTNQASMESFLGFSQ---TNHFEEISACTDHFGFL
         ++ +   E  +S LDDTASSP  N+          NK   Q   ES L +SQ      F++ +A  +  G+L
Subjt:  IQEHQCQGEPSSSFLDDTASSPALNYT--------ANKFTNQASMESFLGFSQ---TNHFEEISACTDHFGFL
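
The % identity values listed for the hits above can be checked against the alignment midlines. The following is a minimal Python sketch, not part of the database output. It assumes the usual BLAST-style convention that a letter in the midline marks an identical residue pair, `+` marks a conservative substitution, and a space marks a mismatch or gap column. The function name `percent_identity` is hypothetical, and the midline passed in must already be trimmed to the alignment columns (the leading padding under `Query:` removed, internal spaces kept).

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Assumed convention: a letter marks an identical residue pair, '+' marks
    a conservative substitution, and a space marks a mismatch or gap column.
    The midline must cover exactly the aligned columns.
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Hypothetical 10-column midline: 8 identities, one space, one '+'.
print(percent_identity("MDL GSE+SS"))  # 80.0
```

As a worked check on the first GenBank hit (KAG6576907.1), the two midline segments as displayed appear to contain 171 letters across 175 aligned columns, and 100 × 171 / 175 rounds to 97.71, which agrees with the listed value.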


Sequences
CDS sequence
ATGGATTTGTTGGGTTCTGAGTGCAGCAGTGGGTGTGAGTCTGGTTGGACTTTGTATTTGGAGCAGTCTTTTCGTTCACAAGGCTGCTTAGATCACATTGTTGGCGGAGA
GGAATATTGGAAAGCTAAAGGTTCTGAAGAAGAGGAAGAGGAAGAGGAAGAGGATCTTTCCATGGTTTCTGATGCATCTTCTGGGCCTCCACATTTCATAGAAGACGAAG
CTTGCTCCCAAGAAGCATCAAAATCAGCCACATTGGGGAAGAGAAAGGGCAAAAAACAGAGAATTCAAGAACATCAATGCCAGGGTGAGCCTTCCTCTTCGTTCCTAGAC
GACACTGCTAGCTCTCCTGCTCTCAACTACACTGCTAATAAGTTCACCAATCAAGCCTCAATGGAAAGCTTCTTGGGCTTCTCTCAAACTAACCACTTCGAGGAAATATC
GGCCTGTACAGACCATTTTGGGTTCCTTCAAGCTTCTCTATCAGGAAATAGATTGCAGAAGAACCAGTGGTTTGAAGGCGAAGGAAGGAGGGATAGGGATGAGATGAGAT
AA
mRNA sequence
CAACTTTGGTGAAAAATCTTGATAGATGTAAATAGATCATTGGTTGCTTATTGAAATCGTTTGTGGAAAGTTCGTATTCTCTGTTTTTTTTGGGTTCCCATTCTGATCTG
GGCTCTGTTTTCTTCTTGAATTTGCATTGCACACAACCTGTTTGATAATTTGCCCGAGAGAAAAAAAGGGTTGGGGAATATGGATTTGTTGGGTTCTGAGTGCAGCAGTG
GGTGTGAGTCTGGTTGGACTTTGTATTTGGAGCAGTCTTTTCGTTCACAAGGCTGCTTAGATCACATTGTTGGCGGAGAGGAATATTGGAAAGCTAAAGGTTCTGAAGAA
GAGGAAGAGGAAGAGGAAGAGGATCTTTCCATGGTTTCTGATGCATCTTCTGGGCCTCCACATTTCATAGAAGACGAAGCTTGCTCCCAAGAAGCATCAAAATCAGCCAC
ATTGGGGAAGAGAAAGGGCAAAAAACAGAGAATTCAAGAACATCAATGCCAGGGTGAGCCTTCCTCTTCGTTCCTAGACGACACTGCTAGCTCTCCTGCTCTCAACTACA
CTGCTAATAAGTTCACCAATCAAGCCTCAATGGAAAGCTTCTTGGGCTTCTCTCAAACTAACCACTTCGAGGAAATATCGGCCTGTACAGACCATTTTGGGTTCCTTCAA
GCTTCTCTATCAGGAAATAGATTGCAGAAGAACCAGTGGTTTGAAGGCGAAGGAAGGAGGGATAGGGATGAGATGAGATAAGTGGCCTTATTTTTGCATCACTTTCTTCT
TAAAAATGCGTGGGAAATGGATCTGCTGAAGAGATGGGCTTTGTTTTCCTGTTTTTTTGTTCATTTCTCTCCTTTTTGTAACGATCAATGCAATGGGAACCATTCGTGGT
GAGAGATTTGTGAATTGGAAACAAATCTGAGTAATTTCTAGAACAAAATCACAAGACCCACTAACTTTCATCCAAATAAGGCCTAAAACTTTAGCTTTTATTTGGTGGAA
TTTTATCAAATCCACACCAAAACTCCAAAAATATATATTTTTTTAATTTAATTGTAAATTTTGGTTCACCCAAAACTGTCT
Protein sequence
MDLLGSECSSGCESGWTLYLEQSFRSQGCLDHIVGGEEYWKAKGSEEEEEEEEEDLSMVSDASSGPPHFIEDEACSQEASKSATLGKRKGKKQRIQEHQCQGEPSSSFLD
DTASSPALNYTANKFTNQASMESFLGFSQTNHFEEISACTDHFGFLQASLSGNRLQKNQWFEGEGRRDRDEMR
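
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal Python sketch under that assumption. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB, and the two test strings are the first 33 and last 15 nucleotides of the CDS shown above.

```python
# Standard genetic code in compact form, with bases ordered T, C, A, G
# for the first, second, and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    can be pasted directly. Codons with non-ACGT letters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3))


cds_start = "ATGGATTTGTTGGGTTCTGAGTGCAGCAGTGGG"  # first 33 nt of the CDS above
cds_end = "GATGAGATGAGATAA"                      # last 15 nt of the CDS above
print(translate(cds_start))  # MDLLGSECSSG
print(translate(cds_end))    # DEMR*
```

Both fragments agree with the start (MDLLGSECSSG) and end (DEMR) of the protein sequence listed above. Translating the full CDS should give the same protein followed by `*` for the terminal stop codon.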