; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy6G023630 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy6G023630
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionDNA-binding protein S1FA
Genome locationGy14Chr6:23730977..23735257
RNA-Seq ExpressionCsGy6G023630
SyntenyCsGy6G023630
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR006779 - DNA binding protein S1FA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143239.1 DNA-binding protein S1FA [Cucumis sativus]6.78e-56100Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_008449267.1 PREDICTED: DNA-binding protein S1FA-like [Cucumis melo]1.13e-5498.88Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRM NVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_022962636.1 DNA-binding protein S1FA-like [Cucurbita moschata]3.65e-5191.01Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRMGNVIR+ +ARGFNPGLIVLLVVGGLL AFL GNYALY+YAQKTLPPK+KKPVSKKKMKRERLKQG+SAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_022972581.1 DNA-binding protein S1FA-like [Cucurbita maxima]8.61e-5089.89Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRMGNVIRD +ARGFNPGLIVLLVVGGLL AFL GNYALY+YAQ TLP K+KKPVSKKKMKRERLKQG+SAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_038881351.1 DNA-binding protein S1FA-like [Benincasa hispida]7.64e-5394.38Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRMGNVIRD +ARGFNPGLIVLLVVGGLL AFL GNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQG+SAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

TrEMBL top hitse value%identityAlignment
A0A0A0KEM3 DNA-binding protein S1FA3.28e-56100Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A1S3BLN5 DNA-binding protein S1FA-like5.46e-5598.88Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRM NVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A5A7TZ88 DNA-binding protein S1FA-like5.46e-5598.88Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRM NVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A6J1HDT2 DNA-binding protein S1FA-like1.76e-5191.01Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRMGNVIR+ +ARGFNPGLIVLLVVGGLL AFL GNYALY+YAQKTLPPK+KKPVSKKKMKRERLKQG+SAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A6J1I6D0 DNA-binding protein S1FA-like4.17e-5089.89Show/hide
Query:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MEDDFDFGDKVPPAVNRMGNVIRD +ARGFNPGLIVLLVVGGLL AFL GNYALY+YAQ TLP K+KKPVSKKKMKRERLKQG+SAPGE
Subjt:  MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

SwissProt top hitse value%identityAlignment
P42552 DNA-binding protein S1FA2.4e-2276.47Show/hide
Query:  IRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        + + EA+G NPGLIVLLV+GGLL  FLVGN+ LY YAQK LPPKKKKP+SKKKMKRERLKQGV+ PGE
Subjt:  IRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

P42553 DNA-binding protein S1FA12.0e-2171.62Show/hide
Query:  NRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        ++  N+I +   +G NPG IVLLVV  LL  F VGNYALYMYAQKTLPP+KKKPVSKKK+KRE+LKQGVSAPGE
Subjt:  NRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

Q42337 DNA-binding protein S1FA27.7e-2175Show/hide
Query:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        EA+G NPGLIVLLV+GGLL  FL+ NY +YMYAQK LPP+KKKP+SKKK+KRE+LKQGV  PGE
Subjt:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

Q7XLX6 DNA-binding protein S1FA24.5e-2172.86Show/hide
Query:  NVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        NV+ +   +G NPG+IVL+VV   L  F VGNYALY+YAQKTLPP+KKKPVSKKKMKRE+LKQGVSAPGE
Subjt:  NVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

Q93VI0 DNA-binding protein S1FA31.3e-2075Show/hide
Query:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        E++G NPGLIVLLV+GGLL  FLVGN+ LY YAQK LPP+KKKPVSKKKMK+E++KQGV  PGE
Subjt:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

Arabidopsis top hitse value%identityAlignment
AT2G37120.1 S1FA-like DNA-binding protein5.5e-2275Show/hide
Query:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        EA+G NPGLIVLLV+GGLL  FL+ NY +YMYAQK LPP+KKKP+SKKK+KRE+LKQGV  PGE
Subjt:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

AT3G09735.1 S1FA-like DNA-binding protein9.3e-2275Show/hide
Query:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        E++G NPGLIVLLV+GGLL  FLVGN+ LY YAQK LPP+KKKPVSKKKMK+E++KQGV  PGE
Subjt:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

AT3G53370.1 S1FA-like DNA-binding protein1.6e-2176.56Show/hide
Query:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        EA+G NPGLIVLLVVGG L  FL+ NY LY+YAQK LPP+KKKPVSKKK+KRE+LKQGV  PGE
Subjt:  EARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE

AT3G53370.2 S1FA-like DNA-binding protein2.2e-0775.76Show/hide
Query:  YAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +A K LPP+KKKPVSKKK+KRE+LKQGV  PGE
Subjt:  YAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATGATTTCGACTTCGGCGACAAGGTTCCGCCGGCCGTCAACCGCATGGGGAATGTGATTAGAGATGGAGAAGCAAGAGGATTCAACCCAGGACTGATT
GTGCTGCTTGTAGTTGGTGGGTTGCTATTTGCATTTCTTGTTGGGAATTATGCTCTCTACATGTATGCGCAGAAAACACTCCCCCCAAAAAAGAAGAAACCAGTT
TCCAAAAAGAAGATGAAGAGGGAGAGATTGAAGCAAGGTGTCTCTGCACCTGGAGAGTAG
mRNA sequenceShow/hide mRNA sequence
TATATACTTCTTTTTTAATATAGTTTATGTACTCTTTCTTATGTTTGATTTTGAAGGTATAATTGGTAGAATTGTGGTTTTAAAGTATAAATTGATATTTGTAAA
AACAATAAACAGAACGGCGTCGTTTGAAACCCCCATGGGCGGAAATATCGTAAATCCACTGACTATCCGCCATTTTCATATCTTCGTTCCTTTTGTATTTTTTCT
CGAAGCTTCGTCAGTTCATCGTTTTGGTACATAACTTCAACACACCAGCTCTCTCCCATTCTCAATTTCAGCCATGGAAGATGATTTCGACTTCGGCGACAAGGT
TCCGCCGGCCGTCAACCGCATGGGGAATGTGATTAGAGATGGAGAAGCAAGAGGATTCAACCCAGGACTGATTGTGCTGCTTGTAGTTGGTGGGTTGCTATTTGC
ATTTCTTGTTGGGAATTATGCTCTCTACATGTATGCGCAGAAAACACTCCCCCCAAAAAAGAAGAAACCAGTTTCCAAAAAGAAGATGAAGAGGGAGAGATTGAA
GCAAGGTGTCTCTGCACCTGGAGAGTAGAAGTGTAGTGGTAGACTTCTCTTATGTTTTGTATGCCTGTCCCTGATTTGGTTGAACTAACAAGTTCTACCAAATGA
ATTTCAATGTTTGAAGAGTTGTTTTGCCTATGTTGTTAAGTTGGTTAGAATCTGAACATATAAGTAGAAGTTTATTGTCGCTTTGTATCCTTAGAGATGAGAAAT
AATAACAACATAGTTTATGTGGTATATTTTGATCAAATGCTAACCTTTTGGTGCCCTTCCTAAGATTTTCTTATCCACATTGTCTTATTAAATTAAAAAAAATAA
TAATAAAAGCTCAAAATCCTAACACACATAAAAGTAAGTAAACATTATGGCATTTGAAAATTAAAATTGACACTCAATATAATAAACTGAACATATTACGATATG
ATTACAGAAAGTAGTATTTCGTACAGAACAAGTGCTTTAGCCCATAGGGGACCAAAGGGGTTGGTGGTTTAAATCTCTTTATCCCATTTGTCCTATTTTTTTTAA
TAAAAAAAGTACACAAAATTAGTATGATCACATGCAAGTTTAGATTCTGCCACTTTTTGAGTTGGCTGCTATCACATGCAAGTTAGTCACACACGTATCGATAAG
ATTTCCAAAAAAAGGCACTGTCTTCCCTCTCCCAGATGGAAGAGCAGAGGCCTTCGTGTCCCCCTCCAAGCTTCCAAGATGAAATTTAGTCTTGGAAACATTTAT
ACTATCTTACCCACCTTCAGTTTGTTGCTATTCAAATGTTTGAATCCGAATGATATAATATAACCAAAGAAACATACAAATCGTAACTCAGAATTCAAACTAAAA
GAAATAGAGTTGGTTCTTGTGGCCATGAAACTTCAACACATTCTATACATAAAAGAACAATATTATCCACAAATATGATATGCCGTCTCATTCAGACAAAACCTT
GTCAAAATATTTTCCCCTTTGAAAGCTTAACATATGGGTCATCAACCCCCAATACATTGCAGATCAACAGCAAGATGCCACTACATGTAACATGAAAGAGTTAGG
AGGAGGAGGATCTCAGACTCCACGCCCCATGGACTGTTCTCCAGGTGGGATCATGACTAACACAGGACGAGCATTGAAACCTGGGTGGGGCATCCTCCGACGAAG
TTCACGACCTTCCTGGCAAAGTGCACACGTGTGACAGAAGACATGAGTTGCAAAGTCACAAACAGATTCACAGTGCTCACGTTGAACTTCATCTTCAACACAGAG
TCCGCAACAGCCACATGACCTATGAAGTGCCTCGCAGTTACCCTGCTCATAATAAAAAAAGGGATACAATTTATAAACTCTCCTATTAATTGAATTGAAACGTAA
CCACATTTTTTGCATCAAATAAATAGGTTCTTCAATGATGACCATGAACAATGCCCCCATAGTAGCTATTGTGAGGTAAAAGCAAAAGTCATCCGTCTCATTTTA
AGTCATAAGCCAACACAATGCATTAGTTCGCAGATATTCAGAGACAATAATAGATTATGGGTACAGGGTATCAAAGGAGTAGGTTTAACTTGCCTCCAAATTGAA
CATTCTACGAATAGCTGTACGAGTAGGATATGTAAACCATGGAGCCAGACAGTTCCAACCAAAGAAAGAGGTTCCAATTAAGTAGAGACCAGAATATGACATGCA
ATGGTTTGCAAAAGTCCCAGGAGTAGAAGACACAACTCTCTCTGCATTTGTTCCATACAGAATGCAAGGAGCCACACTTCCAAGAAGACCTACACAAACAGAGAT
TGTCAGTCACTTTGTTTTAGTTTCTCTAAATTCATTTCAACTTTTACCCATACTGATCCCTTTTCTATATGAAAATCTGAGTCCCTAATTCTCTCTGTAATCGAA
GTAGAAACCTAGGAGTGAACAGAAATCCCTCCCTGATAAAAAAAAGTTAACAGTAATACCCTCCACATTGAAGAGAAATCAAGAGAAACATGCCAACCAAGAAAA
TACAAGGATAGATAAGAATTTCTTTTGTTATTGTAATTCAAGCCATCTGACACCCAGGATTAGAATGGTTATTTATTGGAAGATTAAGGATTAGGCATAAGACTT
TTGAATTCTATGTAAAATATACATTGGAAACTTGATTATTGAAACCAATTATAAAGGCTGTGATTATGGTCACAAAACTCAGGCAAACAATCTTCAATGCCAAAC
CACCCATACAAGTTACAGCATGGTGTTAGAGACCAATAAATTCTTCATAGCCCAGATCTTATCTGAAACAAGTTATAGCATATTAAGATGTCGCATGCTTATATC
AAATATCCAATCTTCCGGAAAAAATTAGAGAGAAGATTTGAAAAGCCCAAATGCCATCCATTTGATCATTTCAGATTAATTTCCTCGTTAAGCAATAAAAAAATA
AATGGGATCAGTGGCCCATCGAATTCAAATTAATGAGAATTTTCATAATGAACAGTCAAGAAAAAAGAGATTCCCCTTCAGACGCAATCCCTAATCGCTGAAAAA
TCAGCAGAGTCTCAAACTCCAAAACCAGGGATAAAGAAGAATCACAGAACCCCAAAACCTAAGAATCCACCACCTTAAAAATCAATTCCAATAGAGAGGAAGAGA
GAGAGAGA
Protein sequenceShow/hide protein sequence
MEDDFDFGDKVPPAVNRMGNVIRDGEARGFNPGLIVLLVVGGLLFAFLVGNYALYMYAQKTLPPKKKKPVSKKKMKRERLKQGVSAPGE