CuGenDBv2

ClCG05G020680 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G020680
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: protein SOB FIVE-LIKE 6-like
Genome location: CG_Chr05:32717284..32718086
RNA-Seq Expression: ClCG05G020680
Synteny: ClCG05G020680
Gene Ontology terms:
GO:0009691 - cytokinin biosynthetic process (biological process)
GO:0009736 - cytokinin-activated signaling pathway (biological process)
InterPro domains: IPR044670 - SOB-five-Like (SOFL) family


Homology
GenBank top hits
KAG6591429.1 hypothetical protein SDJN03_13775, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.4e-69; identity: 79.12%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN
        MNDWSATH CSG ESGWTMYLDQSYTSDHRFGGG  G         KEAKARA+E DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYA +AS+S IN
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN

Query:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHRKNEAISDSAAASN
        CSKEE+KKSKK KQNGRNQQ SYLDDTASSPVYGY KA+KI AATSNKA EENPVDFSQGFSATHFK   + + S S+++S+
Subjt:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHRKNEAISDSAAASN

KAG7024309.1 hypothetical protein SDJN02_13123, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 5.3e-69; identity: 85.03%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN
        MNDWSATH CSG ESGWTMYLDQSYTSDHRFGGG  G         KEAKARA+E DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYA +AS+S IN
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN

Query:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFK
        CSKEE+KKSKK KQNGRNQQ SYLDDTASSPVYGY KA+KI AATSNKALEENPVDFSQGFSATHFK
Subjt:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFK

XP_022936106.1 uncharacterized protein LOC111442806 [Cucurbita moschata] (e-value: 1.6e-68; identity: 83.43%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN
        MNDWSATH CSG ESGWTMYLDQSYTSDHRFGGG  G         KEAKARA+E DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYA +AS+S IN
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN

Query:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR
        CSKEE+KKSKK KQNGRNQQ SYLDDTASSPVYGY KA+KI AATSNKA EENPVDFSQGFSATHFK +
Subjt:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR

XP_022977119.1 uncharacterized protein LOC111477282 [Cucurbita maxima] (e-value: 5.9e-68; identity: 82.84%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN
        MNDWSATH CSG ESGWTMYLDQSYTSDHRFGGG  G         KEAKARA+E DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYA +AS+S IN
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN

Query:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR
        CSKEE+KKSKK KQNGRNQQ SYLDDTASSPVYGY KA+KI AA SNKA EENPVDFSQGFSATHFK +
Subjt:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR

XP_038897152.1 protein SOB FIVE-LIKE 6-like [Benincasa hispida] (e-value: 1.0e-72; identity: 84.71%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEED-EEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAI
        MNDWS+TH CSGSESGWTMYLDQSYTSDH F G GG GGVGENY+AKEAKAR ++ED EEDLSMVSDASSGPPHY+EDNEELFY NGYSSYA +ASESAI
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEED-EEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAI

Query:  NCSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR
        NC+KEE KK KKGKQNGRNQQ+SYLDDTASSPVYGY K +KIN A SNKALEENPVDFSQGFSATHFK +
Subjt:  NCSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR

TrEMBL top hits
A0A1S3BVI5 uncharacterized protein LOC103493609 (e-value: 8.3e-68; identity: 82.46%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEE--DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESA
        MNDWSATH CSGSESGWTMYLDQSYTSDHRF GG GG    ENY+AKEAKAR +EE  +EEDLSMVSDASSGPPHY+EDNEELFYNNGYSSYAI+ASE  
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEE--DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESA

Query:  INCSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR
              ERKKS+KGK+NGRNQQ+SYLDDTASSPVYGY KANKIN ATSNKALEENPVDFSQGFSATHFK +
Subjt:  INCSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR

A0A5D3D989 Uncharacterized protein (e-value: 8.3e-68; identity: 82.46%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEE--DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESA
        MNDWSATH CSGSESGWTMYLDQSYTSDHRF GG GG    ENY+AKEAKAR +EE  +EEDLSMVSDASSGPPHY+EDNEELFYNNGYSSYAI+ASE  
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEE--DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESA

Query:  INCSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR
              ERKKS+KGK+NGRNQQ+SYLDDTASSPVYGY KANKIN ATSNKALEENPVDFSQGFSATHFK +
Subjt:  INCSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR

A0A6J1CDG6 uncharacterized protein LOC111010447 isoform X1 (e-value: 1.4e-62; identity: 76.16%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEED-EEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAI
        MNDWS+ H CSG ESGWTMYLDQSY S+HRF    GGGGVGENYR KEAK R +EED EEDLSMVSDASSGPPHYLEDNE   YNNGYSSYA +A+ESA 
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEED-EEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAI

Query:  NCSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHRKN
        NC +E   K KKGK++GRNQQ+SYLDDTASSPVYGY KA+KINAA SN+A EEN VDFSQGFSATHFK + +
Subjt:  NCSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHRKN

A0A6J1F6K7 uncharacterized protein LOC111442806 (e-value: 7.5e-69; identity: 83.43%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN
        MNDWSATH CSG ESGWTMYLDQSYTSDHRFGGG  G         KEAKARA+E DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYA +AS+S IN
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN

Query:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR
        CSKEE+KKSKK KQNGRNQQ SYLDDTASSPVYGY KA+KI AATSNKA EENPVDFSQGFSATHFK +
Subjt:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR

A0A6J1IQJ9 uncharacterized protein LOC111477282 (e-value: 2.9e-68; identity: 82.84%)
Query:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN
        MNDWSATH CSG ESGWTMYLDQSYTSDHRFGGG  G         KEAKARA+E DEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYA +AS+S IN
Subjt:  MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAIN

Query:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR
        CSKEE+KKSKK KQNGRNQQ SYLDDTASSPVYGY KA+KI AA SNKA EENPVDFSQGFSATHFK +
Subjt:  CSKEERKKSKKGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHR

SwissProt top hits
B6IDH8 Protein SOB FIVE-LIKE 6 (e-value: 1.3e-04; identity: 31.95%)
Query:  DWSATHDCSGSESGWTMYLDQSYT-SDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLED---NEELFYNNGYSSYAIAASESA
        D+S        +SGWTMYL  S + S H F           +Y   E K    +E +ED SMVSDASSGPP+Y E+    + L  N  Y           
Subjt:  DWSATHDCSGSESGWTMYLDQSYT-SDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLED---NEELFYNNGYSSYAIAASESA

Query:  INCSKEERKKSKKGKQNGRNQQY-----SYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFS
          C  + + K+K  K+    Q Y     S  DDTASS   G     +++A   ++   +   DF Q +S
Subjt:  INCSKEERKKSKKGKQNGRNQQY-----SYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFS

Q8L9K4 Protein SOB FIVE-LIKE 5 (e-value: 3.8e-09; identity: 32.53%)
Query:  SGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEA-----KARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAINCSKEE
        SG ESGWT+YLDQS +S           G     R+K++       + +EE+E+DLSM+SDASSGP +  E++                S   IN    +
Subjt:  SGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEA-----KARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAINCSKEE

Query:  RKKSKKGKQNGRNQQYSYLDDTASSPVYGY--RKANKINAATSNKALEENPVDFSQGFSATHFKHR
        ++  ++ K+    +  S LDDTASSP++ +       +      +   E+ +D+SQGFSAT F+ +
Subjt:  RKKSKKGKQNGRNQQYSYLDDTASSPVYGY--RKANKINAATSNKALEENPVDFSQGFSATHFKHR

Arabidopsis top hits
AT1G58460.1 unknown protein (e-value: 8.9e-06; identity: 31.95%)
Query:  DWSATHDCSGSESGWTMYLDQSYT-SDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLED---NEELFYNNGYSSYAIAASESA
        D+S        +SGWTMYL  S + S H F           +Y   E K    +E +ED SMVSDASSGPP+Y E+    + L  N  Y           
Subjt:  DWSATHDCSGSESGWTMYLDQSYT-SDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLED---NEELFYNNGYSSYAIAASESA

Query:  INCSKEERKKSKKGKQNGRNQQY-----SYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFS
          C  + + K+K  K+    Q Y     S  DDTASS   G     +++A   ++   +   DF Q +S
Subjt:  INCSKEERKKSKKGKQNGRNQQY-----SYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFS

AT4G33800.1 unknown protein (e-value: 2.7e-10; identity: 32.53%)
Query:  SGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEA-----KARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAINCSKEE
        SG ESGWT+YLDQS +S           G     R+K++       + +EE+E+DLSM+SDASSGP +  E++                S   IN    +
Subjt:  SGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEA-----KARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAINCSKEE

Query:  RKKSKKGKQNGRNQQYSYLDDTASSPVYGY--RKANKINAATSNKALEENPVDFSQGFSATHFKHR
        ++  ++ K+    +  S LDDTASSP++ +       +      +   E+ +D+SQGFSAT F+ +
Subjt:  RKKSKKGKQNGRNQQYSYLDDTASSPVYGY--RKANKINAATSNKALEENPVDFSQGFSATHFKHR


Sequences
CDS sequence
ATGAATGATTGGTCTGCTACACATGATTGCAGTGGAAGTGAATCAGGTTGGACTATGTACTTAGACCAATCCTATACTTCAGACCATCGTTTTGGTGGCGGCGGCGGTGG
CGGTGGGGTTGGTGAAAATTATAGGGCAAAAGAAGCAAAAGCAAGAGCAGATGAGGAGGACGAAGAAGATCTATCAATGGTGTCTGATGCTTCCTCAGGTCCACCACATT
ACCTTGAGGACAATGAAGAATTGTTTTACAATAATGGGTATTCTTCCTATGCCATTGCAGCTTCAGAATCAGCAATAAATTGTAGTAAGGAGGAGAGAAAGAAGAGCAAG
AAAGGCAAACAAAATGGCAGAAATCAGCAATATTCTTACCTTGATGACACTGCTAGCTCCCCTGTATATGGCTACAGAAAAGCAAATAAAATCAATGCAGCAACAAGCAA
CAAAGCTTTGGAGGAGAATCCAGTAGATTTTTCTCAGGGATTCTCTGCAACCCACTTTAAGCACCGGAAGAATGAGGCAATTAGCGATTCCGCCGCAGCTTCCAACCCTT
TTTTTTGGGGGCCGTCAAGTAAATGA
mRNA sequence
ATGAATGATTGGTCTGCTACACATGATTGCAGTGGAAGTGAATCAGGTTGGACTATGTACTTAGACCAATCCTATACTTCAGACCATCGTTTTGGTGGCGGCGGCGGTGG
CGGTGGGGTTGGTGAAAATTATAGGGCAAAAGAAGCAAAAGCAAGAGCAGATGAGGAGGACGAAGAAGATCTATCAATGGTGTCTGATGCTTCCTCAGGTCCACCACATT
ACCTTGAGGACAATGAAGAATTGTTTTACAATAATGGGTATTCTTCCTATGCCATTGCAGCTTCAGAATCAGCAATAAATTGTAGTAAGGAGGAGAGAAAGAAGAGCAAG
AAAGGCAAACAAAATGGCAGAAATCAGCAATATTCTTACCTTGATGACACTGCTAGCTCCCCTGTATATGGCTACAGAAAAGCAAATAAAATCAATGCAGCAACAAGCAA
CAAAGCTTTGGAGGAGAATCCAGTAGATTTTTCTCAGGGATTCTCTGCAACCCACTTTAAGCACCGGAAGAATGAGGCAATTAGCGATTCCGCCGCAGCTTCCAACCCTT
TTTTTTGGGGGCCGTCAAGTAAATGA
Protein sequence
MNDWSATHDCSGSESGWTMYLDQSYTSDHRFGGGGGGGGVGENYRAKEAKARADEEDEEDLSMVSDASSGPPHYLEDNEELFYNNGYSSYAIAASESAINCSKEERKKSK
KGKQNGRNQQYSYLDDTASSPVYGYRKANKINAATSNKALEENPVDFSQGFSATHFKHRKNEAISDSAAASNPFFWGPSSK
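
The CDS and protein sequences listed in this record can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch in plain Python, not a CuGenDB tool; the names `CODON_TABLE` and `translate` are illustrative assumptions, and only the first 48 and last 36 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace is ignored; codons with unknown bases become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 nt and last 36 nt of the CDS sequence in this record.
cds_start = "ATGAATGATTGGTCTGCTACACATGATTGCAGTGGAAGTGAATCAGGT"
cds_end = "TCCAACCCTTTTTTTTGGGGGCCGTCAAGTAAATGA"

print(translate(cds_start))  # MNDWSATHDCSGSESG
print(translate(cds_end))    # SNPFFWGPSSK
```

Both fragments agree with the start and end of the protein sequence listed above. Passing the full 576-nt CDS to `translate` should give the complete 191-residue protein if the record is internally consistent.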