; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014472 (gene) of Snake gourd v1 genome

Gene IDTan0014472
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein SOB FIVE-LIKE 6-like
Genome locationLG01:112806644..112807820
RNA-Seq ExpressionTan0014472
SyntenyTan0014472
Gene Ontology termsGO:0009691 - cytokinin biosynthetic process (biological process)
GO:0009736 - cytokinin-activated signaling pathway (biological process)
InterPro domainsIPR044670 - SOB-five-Like (SOFL) family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008452653.1 PREDICTED: uncharacterized protein LOC103493609 [Cucumis melo]4.9e-7580.75Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC
        MN+WSATHYCSG ESGWTMYLDQSYTS+H F GG G  ENY+ KEAK R EEEEDEEEDLSMVSDASSGPPHY+EDNEE FYNNGYSSYA SASE     
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
           E+KKS+KGK+NGRNQQHSYLDDTASSPVYGY KA+KIN A+SN+A+E+N V+FSQGFSATHFKRKSALRKH GFYRSEKSAPEE
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

XP_022139584.1 uncharacterized protein LOC111010447 isoform X1 [Momordica charantia]3.7e-7585.03Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC
        MN+WS+ HYCSG ESGWTMYLDQSY SEH F GGGG GENYREKEAKVR  EEEDEEEDLSMVSDASSGPPHYLEDNE   YNNGYSSYA+SA+ESA NC
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
         KEEKK  KKGK++GRNQQHSYLDDTASSPVYGYGKASKINAA SNEA E+NAV+FSQGFSATHFKRKS+LRKH GF RSEKSAPEE
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

XP_022139592.1 uncharacterized protein LOC111010447 isoform X2 [Momordica charantia]7.8e-7383.96Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC
        MN+WS+ HYCSG ESGWTMYLDQSY SEH F GGGG GENYREKEAKVR  EEEDEEEDLSMVSDASSGPPHYLEDNE   YNNGYSSYA+SA+ESA NC
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
         KEEKK  KKGK++GRNQQHSYLDDTASSPVYGYGK  KINAA SNEA E+NAV+FSQGFSATHFKRKS+LRKH GF RSEKSAPEE
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

XP_022977119.1 uncharacterized protein LOC111477282 [Cucurbita maxima]1.0e-7280.75Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC
        MN+WSATHYCSG ESGWTMYLDQSYTS+H FGGG         KEAK R EE   +EEDLSMVSDASSGPPHYLEDNEE FYNNGYSSYA+SAS+S +NC
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
         KEEKKKSKK KQNGRNQQ SYLDDTASSPVYGYGKASKI AA+SN+A E+N V+FSQGFSATHFKRKSA+RKH GFYRSEKSAPEE
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

XP_038897152.1 protein SOB FIVE-LIKE 6-like [Benincasa hispida]8.0e-7882.63Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEH---GFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESA
        MN+WS+THYCSG ESGWTMYLDQSYTS+H   G GG GG GENY+ KEAK R  E+EDEEEDLSMVSDASSGPPHY+EDNEE FY NGYSSYAFSASESA
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEH---GFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESA

Query:  LNCGKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
        +NC KEE KK KKGKQNGRNQQHSYLDDTASSPVYGY K SKIN A SN+A+E+N V+FSQGFSATHFKRKSALRKH GFYRSEKSAPEE
Subjt:  LNCGKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

TrEMBL top hitse value%identityAlignment
A0A1S3BVI5 uncharacterized protein LOC1034936092.4e-7580.75Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC
        MN+WSATHYCSG ESGWTMYLDQSYTS+H F GG G  ENY+ KEAK R EEEEDEEEDLSMVSDASSGPPHY+EDNEE FYNNGYSSYA SASE     
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
           E+KKS+KGK+NGRNQQHSYLDDTASSPVYGY KA+KIN A+SN+A+E+N V+FSQGFSATHFKRKSALRKH GFYRSEKSAPEE
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

A0A5D3D989 Uncharacterized protein2.4e-7580.75Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC
        MN+WSATHYCSG ESGWTMYLDQSYTS+H F GG G  ENY+ KEAK R EEEEDEEEDLSMVSDASSGPPHY+EDNEE FYNNGYSSYA SASE     
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
           E+KKS+KGK+NGRNQQHSYLDDTASSPVYGY KA+KIN A+SN+A+E+N V+FSQGFSATHFKRKSALRKH GFYRSEKSAPEE
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

A0A6J1CDG6 uncharacterized protein LOC111010447 isoform X11.8e-7585.03Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC
        MN+WS+ HYCSG ESGWTMYLDQSY SEH F GGGG GENYREKEAKVR  EEEDEEEDLSMVSDASSGPPHYLEDNE   YNNGYSSYA+SA+ESA NC
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
         KEEKK  KKGK++GRNQQHSYLDDTASSPVYGYGKASKINAA SNEA E+NAV+FSQGFSATHFKRKS+LRKH GF RSEKSAPEE
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

A0A6J1CED8 uncharacterized protein LOC111010447 isoform X23.8e-7383.96Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC
        MN+WS+ HYCSG ESGWTMYLDQSY SEH F GGGG GENYREKEAKVR  EEEDEEEDLSMVSDASSGPPHYLEDNE   YNNGYSSYA+SA+ESA NC
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
         KEEKK  KKGK++GRNQQHSYLDDTASSPVYGYGK  KINAA SNEA E+NAV+FSQGFSATHFKRKS+LRKH GF RSEKSAPEE
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

A0A6J1IQJ9 uncharacterized protein LOC1114772824.9e-7380.75Show/hide
Query:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC
        MN+WSATHYCSG ESGWTMYLDQSYTS+H FGGG         KEAK R EE   +EEDLSMVSDASSGPPHYLEDNEE FYNNGYSSYA+SAS+S +NC
Subjt:  MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE
         KEEKKKSKK KQNGRNQQ SYLDDTASSPVYGYGKASKI AA+SN+A E+N V+FSQGFSATHFKRKSA+RKH GFYRSEKSAPEE
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE

SwissProt top hitse value%identityAlignment
Q8L9K4 Protein SOB FIVE-LIKE 51.0e-1133.15Show/hide
Query:  SGDESGWTMYLDQSYTSEHG--FGGGGGAGENYREKEA---KVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNCGKEEK
        SG ESGWT+YLDQS +S     F    G     R K++       +EEE+EE+DLSM+SDASSGP +  E++                S   +N    +K
Subjt:  SGDESGWTMYLDQSYTSEHG--FGGGGGAGENYREKEA---KVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNCGKEEK

Query:  KKSKKGKQNGRNQQHSYLDDTASSPVYGYGK--ASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSE
        +  ++ K+    + +S LDDTASSP++ +       +      +   ++ +++SQGFSAT F+ K+A ++  G+   E
Subjt:  KKSKKGKQNGRNQQHSYLDDTASSPVYGYGK--ASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSE

Arabidopsis top hitse value%identityAlignment
AT1G58460.1 unknown protein8.2e-0430.43Show/hide
Query:  NWSATHYCSGDESGWTMYLDQSYT-SEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDN-EECFYNNGYSSYAFSASESALNC
        ++S   Y    +SGWTMYL  S + S H F    G              E +++ +ED SMVSDASSGPP+Y E+   E         +  S S++    
Subjt:  NWSATHYCSGDESGWTMYLDQSYT-SEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDN-EECFYNNGYSSYAFSASESALNC

Query:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFS
         K + KK    +Q    + +S  DDTASS   G     +++A   ++   +   +F Q +S
Subjt:  GKEEKKKSKKGKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFS

AT4G33800.1 unknown protein7.4e-1333.15Show/hide
Query:  SGDESGWTMYLDQSYTSEHG--FGGGGGAGENYREKEA---KVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNCGKEEK
        SG ESGWT+YLDQS +S     F    G     R K++       +EEE+EE+DLSM+SDASSGP +  E++                S   +N    +K
Subjt:  SGDESGWTMYLDQSYTSEHG--FGGGGGAGENYREKEA---KVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNCGKEEK

Query:  KKSKKGKQNGRNQQHSYLDDTASSPVYGYGK--ASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSE
        +  ++ K+    + +S LDDTASSP++ +       +      +   ++ +++SQGFSAT F+ K+A ++  G+   E
Subjt:  KKSKKGKQNGRNQQHSYLDDTASSPVYGYGK--ASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAATTGGTCTGCTACACATTATTGCAGTGGAGATGAATCAGGTTGGACTATGTACTTAGACCAATCCTATACTTCAGAGCATGGTTTCGGCGGTGGCGGCGGGGC
TGGTGAGAATTATAGGGAAAAAGAAGCAAAAGTAAGAGAAGAGGAGGAGGAGGATGAAGAAGAAGATCTATCAATGGTGTCTGATGCTTCCTCAGGTCCACCACATTACC
TTGAGGACAATGAAGAGTGCTTTTACAATAATGGGTATTCTTCCTATGCCTTTTCAGCTTCAGAATCAGCATTAAACTGTGGTAAGGAGGAGAAAAAGAAGAGCAAGAAG
GGAAAACAAAATGGAAGAAATCAGCAACATTCTTATCTTGATGACACTGCTAGCTCCCCTGTATATGGCTATGGCAAAGCAAGTAAAATCAATGCAGCATCAAGCAACGA
AGCAATGGAGAAGAATGCAGTAGAATTTTCTCAGGGATTCTCTGCAACCCACTTTAAGAGAAAATCTGCCCTGAGGAAACACTTTGGTTTTTATCGCTCTGAAAAATCAG
CACCAGAAGAATGA
mRNA sequenceShow/hide mRNA sequence
GCCTACTGTTGTTGCACATCCAAACGTTTGTCGCTTTTGATATCATCATTGTTGTTGATTTGCCCATTGTTCTTTGCCAACTGGCATCTCTGATGTGTCCACAAACAGAT
AAAGGAAGGAAAGGCCAAAAGATAAAACGAGGGATAAACTAAGTCCAGGATTGTTTTGATCCTTTTTTTTTCCTTCAATAGAAATGCCACAACTCAGAGAAAGAGAAAAG
ACAAGTTTTTGTTATATAAACATTCCAATCTTCATCACCTCTCCTGCTACCTGAAGGCAAAAATCCCCTCAAAGCCTAAGCCCCTCTGAATTGCCACCATGAATAATTGG
TCTGCTACACATTATTGCAGTGGAGATGAATCAGGTTGGACTATGTACTTAGACCAATCCTATACTTCAGAGCATGGTTTCGGCGGTGGCGGCGGGGCTGGTGAGAATTA
TAGGGAAAAAGAAGCAAAAGTAAGAGAAGAGGAGGAGGAGGATGAAGAAGAAGATCTATCAATGGTGTCTGATGCTTCCTCAGGTCCACCACATTACCTTGAGGACAATG
AAGAGTGCTTTTACAATAATGGGTATTCTTCCTATGCCTTTTCAGCTTCAGAATCAGCATTAAACTGTGGTAAGGAGGAGAAAAAGAAGAGCAAGAAGGGAAAACAAAAT
GGAAGAAATCAGCAACATTCTTATCTTGATGACACTGCTAGCTCCCCTGTATATGGCTATGGCAAAGCAAGTAAAATCAATGCAGCATCAAGCAACGAAGCAATGGAGAA
GAATGCAGTAGAATTTTCTCAGGGATTCTCTGCAACCCACTTTAAGAGAAAATCTGCCCTGAGGAAACACTTTGGTTTTTATCGCTCTGAAAAATCAGCACCAGAAGAAT
GAGGCTATTAGCAATTCCGCCGCAGCTTCCACCTTTTTTTTTTTTGGCCGTCAAGTAAATTGATAACGACATGTT
Protein sequenceShow/hide protein sequence
MNNWSATHYCSGDESGWTMYLDQSYTSEHGFGGGGGAGENYREKEAKVREEEEEDEEEDLSMVSDASSGPPHYLEDNEECFYNNGYSSYAFSASESALNCGKEEKKKSKK
GKQNGRNQQHSYLDDTASSPVYGYGKASKINAASSNEAMEKNAVEFSQGFSATHFKRKSALRKHFGFYRSEKSAPEE