; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019757 (gene) of Snake gourd v1 genome

Gene IDTan0019757
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-binding protein S1FA
Genome locationLG08:71321522..71323858
RNA-Seq ExpressionTan0019757
SyntenyTan0019757
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR006779 - DNA binding protein S1FA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651038.1 hypothetical protein Csa_000825 [Cucumis sativus]3.4e-3093.59Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGKGNV+ND+EAKGFNP LIVLLLV GLLLIFL+GNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_004137812.1 DNA-binding protein S1FA [Cucumis sativus]3.4e-3093.59Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGKGNV+ND+EAKGFNP LIVLLLV GLLLIFL+GNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_008442666.1 PREDICTED: DNA-binding protein S1FA-like [Cucumis melo]9.9e-3093.59Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGKGNV+NDVEAKGFNP LIVLLLV GLLL FL+GNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_023005220.1 DNA-binding protein S1FA-like isoform X1 [Cucurbita maxima]1.5e-3096.15Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGKGNVMNDV AKGFNPGL+VLLLV GLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

XP_038905724.1 DNA-binding protein S1FA-like [Benincasa hispida]2.6e-3094.87Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGKGNV+NDVEAKGFNP LIVLLLV GLLLIFL+GNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

TrEMBL top hitse value%identityAlignment
A0A0A0LAW3 DNA-binding protein S1FA1.6e-3093.59Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGKGNV+ND+EAKGFNP LIVLLLV GLLLIFL+GNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A1S3B680 DNA-binding protein S1FA-like4.8e-3093.59Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGKGNV+NDVEAKGFNP LIVLLLV GLLL FL+GNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A6J1GGV1 DNA-binding protein S1FA-like isoform X26.3e-3094.87Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGK NVMNDV AKGFNPGL+VLLLV GLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A6J1IZV9 DNA-binding protein S1FA-like1.4e-2992.31Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGKGNV+NDVEAKG NP LIVLLLV GLLLIFL+GNYALYLYAQKNLPPKKKKPVSKKKMKR+RLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

A0A6J1KWU8 DNA-binding protein S1FA-like isoform X17.4e-3196.15Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        MASANGKGNVMNDV AKGFNPGL+VLLLV GLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

SwissProt top hitse value%identityAlignment
P42551 DNA-binding protein S1FA18.7e-2179.69Show/hide
Query:  EAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        EAKG NPGLIVLL+V G LL+FLI NY LY+YAQKNLPP+KKKPVSKKK+KRE+LKQGV  PGE
Subjt:  EAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

P42552 DNA-binding protein S1FA2.5e-2379.41Show/hide
Query:  MNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +N+VEAKG NPGLIVLL++ GLLL FL+GN+ LY YAQKNLPPKKKKP+SKKKMKRERLKQGV+ PGE
Subjt:  MNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

Q42337 DNA-binding protein S1FA21.5e-2066.67Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        M+S    G  +  VEAKG NPGLIVLL++ GLL+ FLI NY +Y+YAQKNLPP+KKKP+SKKK+KRE+LKQGV  PGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

Q7XLX6 DNA-binding protein S1FA21.1e-2068Show/hide
Query:  ANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        A+   NV+ +   KG NPG+IVL++V+  LL+F +GNYALY+YAQK LPP+KKKPVSKKKMKRE+LKQGVSAPGE
Subjt:  ANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

Q93VI0 DNA-binding protein S1FA31.5e-2073.85Show/hide
Query:  VEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +E+KG NPGLIVLL++ GLLL FL+GN+ LY YAQKNLPP+KKKPVSKKKMK+E++KQGV  PGE
Subjt:  VEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

Arabidopsis top hitse value%identityAlignment
AT2G37120.1 S1FA-like DNA-binding protein1.1e-2166.67Show/hide
Query:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        M+S    G  +  VEAKG NPGLIVLL++ GLL+ FLI NY +Y+YAQKNLPP+KKKP+SKKK+KRE+LKQGV  PGE
Subjt:  MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

AT3G09735.1 S1FA-like DNA-binding protein1.1e-2173.85Show/hide
Query:  VEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +E+KG NPGLIVLL++ GLLL FL+GN+ LY YAQKNLPP+KKKPVSKKKMK+E++KQGV  PGE
Subjt:  VEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

AT3G53370.1 S1FA-like DNA-binding protein6.2e-2279.69Show/hide
Query:  EAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        EAKG NPGLIVLL+V G LL+FLI NY LY+YAQKNLPP+KKKPVSKKK+KRE+LKQGV  PGE
Subjt:  EAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE

AT3G53370.2 S1FA-like DNA-binding protein6.7e-0878.79Show/hide
Query:  YAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE
        +A KNLPP+KKKPVSKKK+KRE+LKQGV  PGE
Subjt:  YAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAGCCAATGGCAAGGGAAATGTGATGAATGATGTTGAAGCAAAAGGATTCAATCCAGGGCTAATTGTACTTCTTCTTGTTAGTGGGTTGCTGCTGATTTTCCT
CATAGGGAACTATGCACTTTACTTATATGCACAGAAGAATCTCCCTCCCAAGAAGAAAAAGCCAGTATCTAAAAAGAAGATGAAGAGGGAAAGACTGAAGCAAGGAGTGT
CTGCACCTGGAGAGTAA
mRNA sequenceShow/hide mRNA sequence
GTCGTACGTGAGCCCATTACGATGAAAAGAGCGGCCCATAGAAATCGCGGGACTCGGGTGGACCGAATCGCCGATTTATATGAATCTTCTCCTTTTTCTTAGATCTCGCT
GACATTTTTCCGTCAACGCTTCACCGCCGATCTGATCCGTACAATTTCCTATATTCAGGATCTTCCTGCCGGCCGATCTATCCGGCTCTAAGTTGTAGTCATGGCTTCAG
CCAATGGCAAGGGAAATGTGATGAATGATGTTGAAGCAAAAGGATTCAATCCAGGGCTAATTGTACTTCTTCTTGTTAGTGGGTTGCTGCTGATTTTCCTCATAGGGAAC
TATGCACTTTACTTATATGCACAGAAGAATCTCCCTCCCAAGAAGAAAAAGCCAGTATCTAAAAAGAAGATGAAGAGGGAAAGACTGAAGCAAGGAGTGTCTGCACCTGG
AGAGTAAAAAAAACCCAAGAAAAAATATTCAACCTCCCAAGGTTTCCTTCTCTGTAGAACGTTTTGATTTTCAAAAATTTTCCTTCATAGACTGCCTAGAAAGTTGTTAC
AGTTGTTTATTTCTTTTATCTACTGTTTTAACTCTTCAGAAATCGAACTTGATTGTGAGTTTTTAGCTCATATTACATTACATGGGCTCTTTTCTCAAGGTTCCATTGAC
TCTTTTCTACTTATTCCCTTCTTTTTGGTACACACCAGAAAGATTTCTGTTTCTCCATCTTTAAACAGCCTTAACTCAGTCCCTTTTACTGTTAGAATCTTTTCTTTTTT
TGCCTTTAATCTTTCAATAAATGGTGGGGTTAGATTGTAGACTCTACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGATTGCATGGTTGAGTGATGGATCTGTGTAAC
TCTCTGTTTAGAAGGTACAAAACACTCTTTTCAACAGGAAGACACTCATAAATTTGTCACAAGGAACATGTAGATTGTAGAGTCAACTAAGGAAAAGACATGGTTCTTCA
AGGGGCCATTACTTGTTTGAATCACGTGGAACACTAGAGAACATTGAAGGCCAGAAAAGCTTTCCTTTTACTGAAGCAAGTAAGTCATCTTGTCCCCTTGTGTTTTTGTG
GGATTCATTGTTTTCAAGTCTCGATCAATATAATTGTGAATTCTGTCCAAACAGGCAATGCTGATATAAAACACTCAGCTTTTAAGCCTTTTACTATTTTGTTATGTTGG
CTTCTAGCTCAGTTGCTTGCTTTGCTATACTCCCGCAGGCAAGTCAGTGGCTTTTGATATGGGCAAAAGCAGCCCATTTTTGCTTTTAACATTACTTGTGGTGAAGTCAC
AGTACCTGATTTCATTAAATTTAGGTTGTGTGTATTTGTGTTTTGTTGGAATGGTAGATGGGGGACTAAATTCTTGGCTTGTTTTATTTATATATTGTCATCTTGTTTAA
GTACCTCGGCATAGTTCAAATGACTAAGATATATATTGTCGACCAAGATGTTTGAGGTTCGAATCTCCACTCTACATGGTGTTGCCTAAAGAGC
Protein sequenceShow/hide protein sequence
MASANGKGNVMNDVEAKGFNPGLIVLLLVSGLLLIFLIGNYALYLYAQKNLPPKKKKPVSKKKMKRERLKQGVSAPGE