CuGenDBv2

CmUC10G201500 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC10G201500
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: DNA-binding protein S1FA
Genome location: CmU531Chr10:33362503..33363661
RNA-Seq Expression: CmUC10G201500
Synteny: CmUC10G201500
Gene Ontology terms:
    GO:0006355 - regulation of transcription, DNA-templated (biological process)
    GO:0005634 - nucleus (cellular component)
    GO:0016021 - integral component of membrane (cellular component)
    GO:0003677 - DNA binding (molecular function)
InterPro domains: IPR006779 - DNA binding protein S1FA
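
The genome location field can be split into a sequence ID and a coordinate pair. The following is a minimal sketch, assuming the `seqid:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the record does not state this). The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split 'seqid:start..end' into (seqid, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(
        r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location.strip()
    )
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    return match["seqid"], start, end, end - start + 1


seqid, start, end, length = parse_location("CmU531Chr10:33362503..33363661")
print(seqid, start, end, length)
```

Under the inclusive-coordinate assumption, the gene span for this record works out to 1159 bp.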


Homology
GenBank top hits (e value, % identity, alignment)
KAE8651038.1 hypothetical protein Csa_000825 [Cucumis sativus] (e value: 3.6e-32; % identity: 47.09)
Query:  ISLTFSFPSTLCRRSDPYNCRYSGPSCRPIYPALSCSHGFSQWQGTMVERIPPPPPPLVSCLLRSISRDLCFFLFLIRFLIPCWNALFFDLIVWIDQCVS
        ISLTFS  STLC RS PYNCRYSG SCR IYPALS S                                                               
Subjt:  ISLTFSFPSTLCRRSDPYNCRYSGPSCRPIYPALSCSHGFSQWQGTMVERIPPPPPPLVSCLLRSISRDLCFFLFLIRFLIPCWNALFFDLIVWIDQCVS

Query:  RFVFFVFSESLISSGWAVIWLASADLAKYEPCRKIVISNRGISSDWVGKFEQGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKK
                            +ASA+                      GK   GNVIND+EAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKK
Subjt:  RFVFFVFSESLISSGWAVIWLASADLAKYEPCRKIVISNRGISSDWVGKFEQGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKK

Query:  KKPVSKKKMKRERLKQGVSAPGE
        KKPVSKKKMKRERLKQGVSAPGE
Subjt:  KKPVSKKKMKRERLKQGVSAPGE

XP_004137812.1 DNA-binding protein S1FA [Cucumis sativus] (e value: 1.2e-27; % identity: 97.22)
Query:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +GNVIND+EAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_008442666.1 PREDICTED: DNA-binding protein S1FA-like [Cucumis melo] (e value: 3.5e-27; % identity: 97.22)
Query:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +GNVINDVEAKGFNPALIVLLLVGGLLL FLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_022934038.1 DNA-binding protein S1FA-like [Cucurbita moschata] (e value: 1.0e-26; % identity: 95.83)
Query:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +GNVINDVEAKG NPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKR+RLKQGVSAPGE
Subjt:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_038905724.1 DNA-binding protein S1FA-like [Benincasa hispida] (e value: 9.1e-28; % identity: 98.61)
Query:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +GNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LAW3 DNA-binding protein S1FA (e value: 5.8e-28; % identity: 97.22)
Query:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +GNVIND+EAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A1S3B680 DNA-binding protein S1FA-like (e value: 1.7e-27; % identity: 97.22)
Query:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +GNVINDVEAKGFNPALIVLLLVGGLLL FLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A6J1CUM0 DNA-binding protein S1FA-like (e value: 6.4e-27; % identity: 94.44)
Query:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +GNVINDVEAKGFNP LIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKM++ERLKQGVSAPGE
Subjt:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A6J1F6J1 DNA-binding protein S1FA-like (e value: 4.9e-27; % identity: 95.83)
Query:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +GNVINDVEAKG NPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKR+RLKQGVSAPGE
Subjt:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A6J1IZV9 DNA-binding protein S1FA-like (e value: 4.9e-27; % identity: 95.83)
Query:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +GNVINDVEAKG NPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKR+RLKQGVSAPGE
Subjt:  QGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

SwissProt top hits (e value, % identity, alignment)
P42551 DNA-binding protein S1FA1 (e value: 3.3e-20; % identity: 78.12)
Query:  EAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        EAKG NP LIVLL+VGG LL+FL+ NY LY+YAQKNLPP+KKKPVSKKK+KRE+LKQGV  PGE
Subjt:  EAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

P42552 DNA-binding protein S1FA (e value: 3.1e-23; % identity: 80.88)
Query:  INDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +N+VEAKG NP LIVLL++GGLLL FLVGN+ LY YAQKNLPPKKKKP+SKKKMKRERLKQGV+ PGE
Subjt:  INDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

P42553 DNA-binding protein S1FA1 (e value: 1.6e-19; % identity: 68.49)
Query:  EQGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +  N+I +   KG NP  IVLL+V  LL++F VGNYALY+YAQK LPP+KKKPVSKKK+KRE+LKQGVSAPGE
Subjt:  EQGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

Q42337 DNA-binding protein S1FA2 (e value: 4.3e-20; % identity: 73.85)
Query:  VEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        VEAKG NP LIVLL++GGLL+ FL+ NY +Y+YAQKNLPP+KKKP+SKKK+KRE+LKQGV  PGE
Subjt:  VEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

Q93VI0 DNA-binding protein S1FA3 (e value: 2.5e-20; % identity: 75.38)
Query:  VEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +E+KG NP LIVLL++GGLLL FLVGN+ LY YAQKNLPP+KKKPVSKKKMK+E++KQGV  PGE
Subjt:  VEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

Arabidopsis top hits (e value, % identity, alignment)
AT2G37120.1 S1FA-like DNA-binding protein (e value: 3.0e-21; % identity: 73.85)
Query:  VEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        VEAKG NP LIVLL++GGLL+ FL+ NY +Y+YAQKNLPP+KKKP+SKKK+KRE+LKQGV  PGE
Subjt:  VEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

AT3G09735.1 S1FA-like DNA-binding protein (e value: 1.8e-21; % identity: 75.38)
Query:  VEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +E+KG NP LIVLL++GGLLL FLVGN+ LY YAQKNLPP+KKKPVSKKKMK+E++KQGV  PGE
Subjt:  VEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

AT3G53370.1 S1FA-like DNA-binding protein (e value: 2.3e-21; % identity: 78.12)
Query:  EAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        EAKG NP LIVLL+VGG LL+FL+ NY LY+YAQKNLPP+KKKPVSKKK+KRE+LKQGV  PGE
Subjt:  EAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

AT3G53370.2 S1FA-like DNA-binding protein (e value: 1.9e-07; % identity: 78.79)
Query:  YAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +A KNLPP+KKKPVSKKK+KRE+LKQGV  PGE
Subjt:  YAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE


Sequences
CDS sequence
ATCTCGCTGACATTTTCATTTCCGTCAACGCTTTGCCGCCGATCCGATCCGTACAATTGCCGATATTCAGGACCTTCTTGCCGGCCGATCTACCCGGCTCTGAGT
TGTAGTCATGGCTTCAGCCAATGGCAAGGTACGATGGTCGAAAGAATCCCCCCCCCCCCCCCCCCCCTAGTTTCTTGCCTTTTGCGTTCAATTTCACGAGATCTG
TGCTTCTTTCTTTTTTTAATTCGATTTCTGATTCCGTGTTGGAATGCCCTTTTCTTCGATTTGATTGTTTGGATCGATCAGTGTGTGTCGAGATTCGTCTTTTTC
GTCTTTTCTGAAAGCTTGATTTCATCTGGCTGGGCGGTGATTTGGTTAGCGAGTGCTGATCTAGCGAAGTATGAACCCTGCAGGAAAATTGTCATCTCAAATCGG
GGAATTTCGAGCGACTGGGTTGGAAAATTTGAACAGGGAAATGTGATCAATGATGTTGAAGCAAAAGGATTCAATCCAGCGCTAATAGTACTTCTTCTTGTTGGT
GGGTTGCTGCTGATTTTCCTCGTAGGGAACTATGCACTTTACTTGTATGCACAGAAGAATCTCCCTCCCAAGAAGAAAAAGCCAGTATCTAAAAAGAAGATGAAG
AGAGAAAGACTGAAGCAAGGAGTGTCTGCACCTGGAGAGTAA
mRNA sequence
ATCTCGCTGACATTTTCATTTCCGTCAACGCTTTGCCGCCGATCCGATCCGTACAATTGCCGATATTCAGGACCTTCTTGCCGGCCGATCTACCCGGCTCTGAGT
TGTAGTCATGGCTTCAGCCAATGGCAAGGTACGATGGTCGAAAGAATCCCCCCCCCCCCCCCCCCCCTAGTTTCTTGCCTTTTGCGTTCAATTTCACGAGATCTG
TGCTTCTTTCTTTTTTTAATTCGATTTCTGATTCCGTGTTGGAATGCCCTTTTCTTCGATTTGATTGTTTGGATCGATCAGTGTGTGTCGAGATTCGTCTTTTTC
GTCTTTTCTGAAAGCTTGATTTCATCTGGCTGGGCGGTGATTTGGTTAGCGAGTGCTGATCTAGCGAAGTATGAACCCTGCAGGAAAATTGTCATCTCAAATCGG
GGAATTTCGAGCGACTGGGTTGGAAAATTTGAACAGGGAAATGTGATCAATGATGTTGAAGCAAAAGGATTCAATCCAGCGCTAATAGTACTTCTTCTTGTTGGT
GGGTTGCTGCTGATTTTCCTCGTAGGGAACTATGCACTTTACTTGTATGCACAGAAGAATCTCCCTCCCAAGAAGAAAAAGCCAGTATCTAAAAAGAAGATGAAG
AGAGAAAGACTGAAGCAAGGAGTGTCTGCACCTGGAGAGTAA
Protein sequence
ISLTFSFPSTLCRRSDPYNCRYSGPSCRPIYPALSCSHGFSQWQGTMVERIPPPPPPLVSCLLRSISRDLCFFLFLIRFLIPCWNALFFDLIVWIDQCVSRFVFF
VFSESLISSGWAVIWLASADLAKYEPCRKIVISNRGISSDWVGKFEQGNVINDVEAKGFNPALIVLLLVGGLLLIFLVGNYALYLYAQKNLPPKKKKPVSKKKMK
RERLKQGVSAPGE
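
The CDS and protein sequences in this record can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and assuming the CDS is read in frame from its first base. The function name `translate` is hypothetical and this is not necessarily how the database derived its protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    unexpected characters are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS listed above, ending in the TAA stop codon.
print(translate("AGAGAAAGACTGAAGCAAGGAGTGTCTGCACCTGGAGAGTAA"))
```

Run on the last CDS line, the sketch yields RERLKQGVSAPGE, which matches the final line of the protein sequence. Applying it to the full CDS, with the lines joined, allows the same comparison over the whole record.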