CuGenDBv2

Sgr028645 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028645
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein SOB FIVE-LIKE 5
Genome location: tig00153204:3057228..3058507
RNA-Seq Expression: Sgr028645
Synteny: Sgr028645
Gene Ontology terms: GO:0009691 - cytokinin biosynthetic process (biological process)
                     GO:0009736 - cytokinin-activated signaling pathway (biological process)
InterPro domains: IPR044670 - SOB-five-Like (SOFL) family
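
The genome location field uses a `contig:start..end` notation. As a minimal sketch, the following Python snippet splits such a string and computes the genomic span. It assumes the coordinates are 1-based and inclusive, which the record does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00153204:3057228..3058507")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene spans 1280 bp on the contig, which is longer than the 573-nt CDS listed in the Sequences section.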


Homology
GenBank top hits:
XP_004142900.1 uncharacterized protein LOC101210927 [Cucumis sativus] | e value: 4.7e-33 | % identity: 56.36
Query:  MSNLSGTHY-NSGCESGWTVYFEESLESEAERFGRSVVDYGGG---EEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAK-SAAKSKRKED
        MSN +G+H  NSGCESGWT+YFEES+E+E ERF  S VDYGGG   EEEE DLSM+SDASSGPRNG+ +       E+NCQS+ R+G K  A KSKR+E+
Subjt:  MSNLSGTHY-NSGCESGWTVYFEESLESEAERFGRSVVDYGGG---EEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAK-SAAKSKRKED

Query:  MGCRNQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNST
        MG RNQHSCLDDTASSPVFGLSK+   + Y +     G   N + F     RK+  K +    S+
Subjt:  MGCRNQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNST

XP_022132075.1 uncharacterized protein LOC111005036 isoform X1 [Momordica charantia] | e value: 2.5e-34 | % identity: 72.27
Query:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR
        MSN SGTH NSGC+SGWTVYF++S     ERF  SV DYGGG EEEGDLSMVSDASSGPRNGFGDG +GF  E N Q + RRNG KSAAK+KR++++G R
Subjt:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR

Query:  NQHSCLDDTASSPVFGLSK
        NQHS LDDTA+SPVF LSK
Subjt:  NQHSCLDDTASSPVFGLSK

XP_022132076.1 uncharacterized protein LOC111005036 isoform X2 [Momordica charantia] | e value: 1.1e-34 | % identity: 59.01
Query:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR
        MSN SGTH NSGC+SGWTVYF++S     ERF  SV DYGGG EEEGDLSMVSDASSGPRNGFGDG +GF  E N Q + RRNG KSAAK+KR++++G R
Subjt:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR

Query:  NQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNST
        NQHS LDDTA+SPVF LSK       N        ++N + F +   RK Q K S  ++S+
Subjt:  NQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNST

XP_022132077.1 uncharacterized protein LOC111005036 isoform X3 [Momordica charantia] | e value: 2.5e-34 | % identity: 72.27
Query:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR
        MSN SGTH NSGC+SGWTVYF++S     ERF  SV DYGGG EEEGDLSMVSDASSGPRNGFGDG +GF  E N Q + RRNG KSAAK+KR++++G R
Subjt:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR

Query:  NQHSCLDDTASSPVFGLSK
        NQHS LDDTA+SPVF LSK
Subjt:  NQHSCLDDTASSPVFGLSK

XP_038886134.1 uncharacterized protein LOC120076391 isoform X2 [Benincasa hispida] | e value: 2.1e-33 | % identity: 55.31
Query:  MSNLSGTH-YNSGCESGWTVYFEESLESEAERFGRSVVDYGGG--EEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAKSAA-KSKRKEDM
        MSN SG+H  NSGCESGWT+YFEES+E+E   F RS VDYGGG  EEEEGDLSM+SDASSGP NG+ +       E N Q +RRNG KSAA KSKRKE+M
Subjt:  MSNLSGTH-YNSGCESGWTVYFEESLESEAERFGRSVVDYGGG--EEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAKSAA-KSKRKEDM

Query:  GCRNQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNSTVAVAAGSSVRLAGQN
        G RNQHSCLDDTASSPVFGLS V  +  Y +     G ++N + F      + Q    Q      +    SSV+  G+N
Subjt:  GCRNQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNSTVAVAAGSSVRLAGQN

TrEMBL top hits:
A0A0A0LR31 Uncharacterized protein | e value: 2.3e-33 | % identity: 56.36
Query:  MSNLSGTHY-NSGCESGWTVYFEESLESEAERFGRSVVDYGGG---EEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAK-SAAKSKRKED
        MSN +G+H  NSGCESGWT+YFEES+E+E ERF  S VDYGGG   EEEE DLSM+SDASSGPRNG+ +       E+NCQS+ R+G K  A KSKR+E+
Subjt:  MSNLSGTHY-NSGCESGWTVYFEESLESEAERFGRSVVDYGGG---EEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAK-SAAKSKRKED

Query:  MGCRNQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNST
        MG RNQHSCLDDTASSPVFGLSK+   + Y +     G   N + F     RK+  K +    S+
Subjt:  MGCRNQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNST

A0A6J1BR82 uncharacterized protein LOC111005036 isoform X2 | e value: 5.4e-35 | % identity: 59.01
Query:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR
        MSN SGTH NSGC+SGWTVYF++S     ERF  SV DYGGG EEEGDLSMVSDASSGPRNGFGDG +GF  E N Q + RRNG KSAAK+KR++++G R
Subjt:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR

Query:  NQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNST
        NQHS LDDTA+SPVF LSK       N        ++N + F +   RK Q K S  ++S+
Subjt:  NQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNST

A0A6J1BRF5 uncharacterized protein LOC111005036 isoform X1 | e value: 1.2e-34 | % identity: 72.27
Query:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR
        MSN SGTH NSGC+SGWTVYF++S     ERF  SV DYGGG EEEGDLSMVSDASSGPRNGFGDG +GF  E N Q + RRNG KSAAK+KR++++G R
Subjt:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR

Query:  NQHSCLDDTASSPVFGLSK
        NQHS LDDTA+SPVF LSK
Subjt:  NQHSCLDDTASSPVFGLSK

A0A6J1BSU7 uncharacterized protein LOC111005036 isoform X3 | e value: 1.2e-34 | % identity: 72.27
Query:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR
        MSN SGTH NSGC+SGWTVYF++S     ERF  SV DYGGG EEEGDLSMVSDASSGPRNGFGDG +GF  E N Q + RRNG KSAAK+KR++++G R
Subjt:  MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSL-RRNGAKSAAKSKRKEDMGCR

Query:  NQHSCLDDTASSPVFGLSK
        NQHS LDDTA+SPVF LSK
Subjt:  NQHSCLDDTASSPVFGLSK

A0A6J1K5J3 uncharacterized protein LOC111492483 | e value: 5.1e-33 | % identity: 57.5
Query:  MSNLSGTH-YNSGCESGWTVYFEESLESEAERFGRSVVDYGG------GEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAK-SAAKSKR
        MS+ SG+H  N+ CESGWT+Y EES E+E  RF  S VDYGG       EEEEGDLSM+SDASSGPR+G+         EENCQS+RRNG K +AAKSKR
Subjt:  MSNLSGTH-YNSGCESGWTVYFEESLESEAERFGRSVVDYGG------GEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAK-SAAKSKR

Query:  KEDMGCRNQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPK
        KEDMG RN+HSCLDDTASSPVFGLSK    + Y +     G ++N + F     RK   K
Subjt:  KEDMGCRNQHSCLDDTASSPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPK

SwissProt top hits:
Q8L9K4 Protein SOB FIVE-LIKE 5 | e value: 3.8e-09 | % identity: 38.02
Query:  NSGCESGWTVYFEESLES------------EAERFGRSVVD----YGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAKSAAKSKRK
        +SGCESGWT+Y ++S+ S            ++ R  +   D    +   EEEE DLSM+SDASSGPRN         S E++ + +   G K   K ++K
Subjt:  NSGCESGWTVYFEESLES------------EAERFGRSVVD----YGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAKSAAKSKRK

Query:  EDMGCRNQHSCLDDTASSPVF
                +S LDDTASSP+F
Subjt:  EDMGCRNQHSCLDDTASSPVF

Arabidopsis top hits:
AT1G58460.1 unknown protein | e value: 1.7e-04 | % identity: 34.78
Query:  NLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGE---EEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAKSAAKSKRK--EDMG
        + S   Y+   +SGWT+Y   S       F     DY  GE   E + D SMVSDASSGP     +       ++N Q   ++ +K+  K+K+K  E+ G
Subjt:  NLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGE---EEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAKSAAKSKRK--EDMG

Query:  CRNQ-HSCLDDTASS
           + +S  DDTASS
Subjt:  CRNQ-HSCLDDTASS

AT4G33800.1 unknown protein | e value: 2.7e-10 | % identity: 38.02
Query:  NSGCESGWTVYFEESLES------------EAERFGRSVVD----YGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAKSAAKSKRK
        +SGCESGWT+Y ++S+ S            ++ R  +   D    +   EEEE DLSM+SDASSGPRN         S E++ + +   G K   K ++K
Subjt:  NSGCESGWTVYFEESLES------------EAERFGRSVVD----YGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAKSAAKSKRK

Query:  EDMGCRNQHSCLDDTASSPVF
                +S LDDTASSP+F
Subjt:  EDMGCRNQHSCLDDTASSPVF
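
The % identity values reported for the hits are conventionally the number of identical aligned positions divided by the alignment length. As a rough sketch of that arithmetic, the Python snippet below counts identities from the middle line of one alignment block, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `percent_identity` is a hypothetical helper, and the example covers only a single block from the Momordica charantia alignment, so it is not expected to reproduce the whole-alignment figure listed for that hit.

```python
def percent_identity(query, midline):
    """Percent identity for one alignment block.

    `query` is the aligned query string (gaps included) and `midline` is the
    match line of the same length: letters are identities, '+' and spaces
    are not.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    identities = sum(1 for symbol in midline if symbol.isalpha())
    return 100.0 * identities / len(query)


# Second block of the XP_022132075.1 alignment, copied from the record.
query_block = "NQHSCLDDTASSPVFGLSK"
match_block = "NQHS LDDTA+SPVF LSK"
print(round(percent_identity(query_block, match_block), 2))
```

For this block, 16 of 19 positions are identical.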


Sequences
CDS sequence
ATGAGTAATTTATCGGGTACGCATTATAATAGCGGGTGTGAGTCGGGTTGGACGGTGTATTTCGAGGAATCGTTAGAGTCTGAGGCGGAGAGATTTGGGCGGAGTGTGGT
GGATTACGGCGGAGGGGAGGAGGAAGAAGGGGACTTGTCGATGGTTTCCGATGCGTCGTCGGGGCCGCGGAATGGGTTTGGAGATGGACAAATTGGGTTTTCGTTTGAGG
AGAACTGCCAATCTCTCCGGCGGAATGGTGCGAAATCGGCGGCGAAAAGTAAGAGGAAAGAGGACATGGGCTGCCGGAACCAACATTCCTGCCTTGATGACACTGCTAGC
TCCCCCGTTTTCGGGCTTTCAAAGGTGACTTTACTTGATCTGTACAATCATAGGCCTTGGCCACTTGGAAATTTGCAAAACAGACGAGCTTTTTTCAATCTTCCTCTGCG
AAAAAATCAACCAAAAATGTCTCAGGTAAAAAACTCAACCGTCGCAGTTGCCGCCGGCAGCTCTGTTCGTCTCGCGGGGCAAAATGGGAATGACGTTGCAGCACAACAGA
AGGAGAATGGCCAGAAGAAGTAA
mRNA sequence
ATGAGTAATTTATCGGGTACGCATTATAATAGCGGGTGTGAGTCGGGTTGGACGGTGTATTTCGAGGAATCGTTAGAGTCTGAGGCGGAGAGATTTGGGCGGAGTGTGGT
GGATTACGGCGGAGGGGAGGAGGAAGAAGGGGACTTGTCGATGGTTTCCGATGCGTCGTCGGGGCCGCGGAATGGGTTTGGAGATGGACAAATTGGGTTTTCGTTTGAGG
AGAACTGCCAATCTCTCCGGCGGAATGGTGCGAAATCGGCGGCGAAAAGTAAGAGGAAAGAGGACATGGGCTGCCGGAACCAACATTCCTGCCTTGATGACACTGCTAGC
TCCCCCGTTTTCGGGCTTTCAAAGGTGACTTTACTTGATCTGTACAATCATAGGCCTTGGCCACTTGGAAATTTGCAAAACAGACGAGCTTTTTTCAATCTTCCTCTGCG
AAAAAATCAACCAAAAATGTCTCAGGTAAAAAACTCAACCGTCGCAGTTGCCGCCGGCAGCTCTGTTCGTCTCGCGGGGCAAAATGGGAATGACGTTGCAGCACAACAGA
AGGAGAATGGCCAGAAGAAGTAA
Protein sequence
MSNLSGTHYNSGCESGWTVYFEESLESEAERFGRSVVDYGGGEEEEGDLSMVSDASSGPRNGFGDGQIGFSFEENCQSLRRNGAKSAAKSKRKEDMGCRNQHSCLDDTAS
SPVFGLSKVTLLDLYNHRPWPLGNLQNRRAFFNLPLRKNQPKMSQVKNSTVAVAAGSSVRLAGQNGNDVAAQQKENGQKK
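
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal Python sketch that assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper. To keep the example short, it translates only the first 27 and last 15 nucleotides of the CDS as listed above rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGAGTAATTTATCGGGTACGCATTAT"  # first 27 nt of the CDS
cds_end = "GGCCAGAAGAAGTAA"                # last 15 nt of the CDS, in frame
print(translate(cds_start))  # start of the protein
print(translate(cds_end))    # end of the protein
```

The two fragments translate to `MSNLSGTHY` and `GQKK`, matching the beginning and end of the protein sequence listed above. Passing the full 573-nt CDS to the same function would be the complete check.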