CuGenDBv2

Sed0015901 (gene) of Chayote v1 genome

Gene ID: Sed0015901
Organism: Sechium edule (Chayote v1)
Description: DNA binding; sequence-specific DNA binding transcription factors
Genome location: LG10:3462055..3462474
RNA-Seq Expression: Sed0015901
Synteny: Sed0015901
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025593.1 hypothetical protein E6C27_scaffold253G00830 [Cucumis melo var. makuwa] | e value: 2.5e-23 | % identity: 51.35
Query:  SSPPQIGNGEKRQCGGRGAS-SSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFED------------SFDGVSGGTIET-AFTEFSGVWCDLP
        SSPP+I N +++         SS G   D DL   LSFGVFDFPWLK+ LI SKS+DWKFED            +FD  +  TI T AFTEF      LP
Subjt:  SSPPQIGNGEKRQCGGRGAS-SSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFED------------SFDGVSGGTIET-AFTEFSGVWCDLP

Query:  DPWAVECEA------VPPLDGGGTAAEGMDCVWRSLLNQPLQQRSSAL
        DPW  + EA       P LDGG    EGMDC+WRS+LNQPLQQ SSAL
Subjt:  DPWAVECEA------VPPLDGGGTAAEGMDCVWRSLLNQPLQQRSSAL

KAG6594718.1 hypothetical protein SDJN03_11271, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.0e-29 | % identity: 59.59
Query:  RTRDSSPPQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFED----SFDGVSGGTIETAFTEFSG--VWCDLPDPWA
        R   SS P+IGNGEKRQ          GRL     DLDLSFGVFDFPWLK+SLI SK EDWK ED    SFDGVS  T + A TE+S   VWC+LPDPW 
Subjt:  RTRDSSPPQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFED----SFDGVSGGTIETAFTEFSG--VWCDLPDPWA

Query:  VECEA-VPPL--------DGGGTAAEGMDCVWRSLLNQPLQQRSSA
         E EA V P         DG G A EGMDCVW SLLNQPLQQ SS+
Subjt:  VECEA-VPPL--------DGGGTAAEGMDCVWRSLLNQPLQQRSSA

KAG6603846.1 hypothetical protein SDJN03_04455, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.3e-29 | % identity: 54.78
Query:  MDRHSSKRTRDSSP-----PQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGGTIETAFTEFS
        MDR S+KR R+SS      P+IGNG+K++  G G+  +   L     D DLSFGVFDFPWLKESLI S+SEDWK +D F     +G+S     T  TE S
Subjt:  MDRHSSKRTRDSSP-----PQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGGTIETAFTEFS

Query:  G---VWCDLPDPWAVECEA-VPP--LD--GGGTAAEGMDCVWRSLLNQPLQQRSSAL
            VW +LPDPW  E EA VPP  LD  GGG A EG+DC+W SLLNQPLQQ S AL
Subjt:  G---VWCDLPDPWAVECEA-VPP--LD--GGGTAAEGMDCVWRSLLNQPLQQRSSAL

KGN49015.1 hypothetical protein Csa_004415 [Cucumis sativus] | e value: 1.2e-25 | % identity: 47.65
Query:  MDRHSSKRTRDSSP--------------PQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGG-
        M+RH++KR R+SS               P+I N +++        SS G   D DL   LSFGVFDFPWLK+ LI SKS+DWKFED F     +G S   
Subjt:  MDRHSSKRTRDSSP--------------PQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGG-

Query:  -----TIETAFTEFSGVWCDLPDPWAVECEA------VPPLDGGGTAAEGMDCVWRSLLNQPLQQRSSAL
             TI TAFTEF      LPDPW  + EA       P LDGG    E MDC+WRS+LNQPLQQ SSAL
Subjt:  -----TIETAFTEFSGVWCDLPDPWAVECEA------VPPLDGGGTAAEGMDCVWRSLLNQPLQQRSSAL

XP_022977867.1 uncharacterized protein LOC111478029 [Cucurbita maxima] | e value: 3.3e-28 | % identity: 54.14
Query:  MDRHSSKRTRDSSPPQ-----IGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGGTIETAFTEFS
        M+R S+KR R+SS  Q     IGNG+K++  G G+  +   L     D DLSFGVFDFPWLKESLI S+SEDWK +D F     +G+S     +  TE S
Subjt:  MDRHSSKRTRDSSPPQ-----IGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGGTIETAFTEFS

Query:  G---VWCDLPDPWAVECEA-VPP--LD--GGGTAAEGMDCVWRSLLNQPLQQRSSAL
            VW +LPDPW  E EA VPP  LD  GGG A EGMDC+W SLLNQPLQQ S AL
Subjt:  G---VWCDLPDPWAVECEA-VPP--LD--GGGTAAEGMDCVWRSLLNQPLQQRSSAL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KH39 Uncharacterized protein | e value: 5.7e-26 | % identity: 47.65
Query:  MDRHSSKRTRDSSP--------------PQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGG-
        M+RH++KR R+SS               P+I N +++        SS G   D DL   LSFGVFDFPWLK+ LI SKS+DWKFED F     +G S   
Subjt:  MDRHSSKRTRDSSP--------------PQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGG-

Query:  -----TIETAFTEFSGVWCDLPDPWAVECEA------VPPLDGGGTAAEGMDCVWRSLLNQPLQQRSSAL
             TI TAFTEF      LPDPW  + EA       P LDGG    E MDC+WRS+LNQPLQQ SSAL
Subjt:  -----TIETAFTEFSGVWCDLPDPWAVECEA------VPPLDGGGTAAEGMDCVWRSLLNQPLQQRSSAL

A0A5A7SLQ6 Uncharacterized protein | e value: 1.2e-23 | % identity: 51.35
Query:  SSPPQIGNGEKRQCGGRGAS-SSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFED------------SFDGVSGGTIET-AFTEFSGVWCDLP
        SSPP+I N +++         SS G   D DL   LSFGVFDFPWLK+ LI SKS+DWKFED            +FD  +  TI T AFTEF      LP
Subjt:  SSPPQIGNGEKRQCGGRGAS-SSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFED------------SFDGVSGGTIET-AFTEFSGVWCDLP

Query:  DPWAVECEA------VPPLDGGGTAAEGMDCVWRSLLNQPLQQRSSAL
        DPW  + EA       P LDGG    EGMDC+WRS+LNQPLQQ SSAL
Subjt:  DPWAVECEA------VPPLDGGGTAAEGMDCVWRSLLNQPLQQRSSAL

A0A6J1BUL5 uncharacterized protein LOC111005508 | e value: 1.7e-14 | % identity: 45.99
Query:  MDRHSSKRTRD-------------------SSPPQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF------D
        MDRH++KR R+                   SSPP+IGN +K+   G G   + GR  DLD        VFDFPWLK+SLI SKSEDWKFED F      +
Subjt:  MDRHSSKRTRD-------------------SSPPQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF------D

Query:  GVSGG----TIETAFTEFSG--VWCD---LPDPWAVE
        G S      TI TAFTE SG  +WCD   LPDP+  E
Subjt:  GVSGG----TIETAFTEFSG--VWCD---LPDPWAVE

A0A6J1IJM6 uncharacterized protein LOC111478029 | e value: 1.6e-28 | % identity: 54.14
Query:  MDRHSSKRTRDSSPPQ-----IGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGGTIETAFTEFS
        M+R S+KR R+SS  Q     IGNG+K++  G G+  +   L     D DLSFGVFDFPWLKESLI S+SEDWK +D F     +G+S     +  TE S
Subjt:  MDRHSSKRTRDSSPPQ-----IGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSF-----DGVSGGTIETAFTEFS

Query:  G---VWCDLPDPWAVECEA-VPP--LD--GGGTAAEGMDCVWRSLLNQPLQQRSSAL
            VW +LPDPW  E EA VPP  LD  GGG A EGMDC+W SLLNQPLQQ S AL
Subjt:  G---VWCDLPDPWAVECEA-VPP--LD--GGGTAAEGMDCVWRSLLNQPLQQRSSAL

A0A6P9ENN9 uncharacterized protein LOC108996550 isoform X2 | e value: 8.6e-14 | % identity: 48.15
Query:  LDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSFDG---VSGGTIETAFTEFSG-VWCDLPD--PWAVE-------CEAVPPLDGGGTAAEGMDCVWRS
        LD  + L  GVFDFPWLK+ +I SKSEDW+FED+F         +  TA  EFSG   C  P+  P+  E       C AVP  +G G   EG+DC+W S
Subjt:  LDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSFDG---VSGGTIETAFTEFSG-VWCDLPD--PWAVE-------CEAVPPLDGGGTAAEGMDCVWRS

Query:  LLNQPLQQ
        LL+QPLQQ
Subjt:  LLNQPLQQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G64800.1 DNA binding; sequence-specific DNA binding transcription factors | e value: 8.0e-04 | % identity: 36.14
Query:  SFGVFDFPWLKESLINSKSEDWKFEDSFDGVSGGTIETAFTEFSGVWCDLPDPWAVECEAVPPLDGGGTAAEGMDCVWRSLLN
        S GVF+FPW+KES+I S S DW   +S   V     E  F E S V       W V+  +   L+      E  +C+W S+L+
Subjt:  SFGVFDFPWLKESLINSKSEDWKFEDSFDGVSGGTIETAFTEFSGVWCDLPDPWAVECEAVPPLDGGGTAAEGMDCVWRSLLN


Sequences
CDS sequence
ATGGATCGCCATTCCTCCAAGCGAACGAGAGATTCGTCGCCGCCGCAGATCGGTAACGGAGAGAAGCGCCAGTGCGGCGGCAGAGGAGCTTCTTCTTCTGGTGGCAGATT
GTTGGATTTGGATTTGGATTTGGATCTGTCGTTCGGCGTCTTCGATTTCCCGTGGCTGAAGGAGAGTTTGATCAATTCCAAATCGGAGGACTGGAAATTTGAGGATTCCT
TCGACGGAGTGTCCGGCGGCACGATTGAGACCGCGTTTACGGAGTTTTCCGGCGTCTGGTGTGATCTGCCGGATCCTTGGGCGGTGGAGTGTGAGGCGGTTCCGCCGCTC
GACGGCGGAGGCACGGCGGCGGAGGGGATGGACTGTGTTTGGAGATCGCTGCTGAATCAACCGCTTCAACAAAGAAGTAGCGCGTTGTAG
mRNA sequence
ATGGATCGCCATTCCTCCAAGCGAACGAGAGATTCGTCGCCGCCGCAGATCGGTAACGGAGAGAAGCGCCAGTGCGGCGGCAGAGGAGCTTCTTCTTCTGGTGGCAGATT
GTTGGATTTGGATTTGGATTTGGATCTGTCGTTCGGCGTCTTCGATTTCCCGTGGCTGAAGGAGAGTTTGATCAATTCCAAATCGGAGGACTGGAAATTTGAGGATTCCT
TCGACGGAGTGTCCGGCGGCACGATTGAGACCGCGTTTACGGAGTTTTCCGGCGTCTGGTGTGATCTGCCGGATCCTTGGGCGGTGGAGTGTGAGGCGGTTCCGCCGCTC
GACGGCGGAGGCACGGCGGCGGAGGGGATGGACTGTGTTTGGAGATCGCTGCTGAATCAACCGCTTCAACAAAGAAGTAGCGCGTTGTAG
Protein sequence
MDRHSSKRTRDSSPPQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSFDGVSGGTIETAFTEFSGVWCDLPDPWAVECEAVPPL
DGGGTAAEGMDCVWRSLLNQPLQQRSSAL
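
The three sequence fields and the genome location can be cross-checked against each other. The following is a minimal Python sketch, not part of CuGenDBv2, that assumes the standard genetic code (NCBI translation table 1) and a forward-strand, single-exon gene. The names `translate`, `CDS`, `PROTEIN`, `START`, and `END` are illustrative. It translates the CDS listed above and compares the result with the listed protein sequence, and it compares the CDS length with the span LG10:3462055..3462474.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated over the base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# CDS and protein copied from the record above (line breaks removed).
CDS = (
    "ATGGATCGCCATTCCTCCAAGCGAACGAGAGATTCGTCGCCGCCGCAGATCGGTAACGGAGAGAAGCGCCAGTGCGGCGGCAGAGGAGCTTCTTCTTCTGGTGGCAGATT"
    "GTTGGATTTGGATTTGGATTTGGATCTGTCGTTCGGCGTCTTCGATTTCCCGTGGCTGAAGGAGAGTTTGATCAATTCCAAATCGGAGGACTGGAAATTTGAGGATTCCT"
    "TCGACGGAGTGTCCGGCGGCACGATTGAGACCGCGTTTACGGAGTTTTCCGGCGTCTGGTGTGATCTGCCGGATCCTTGGGCGGTGGAGTGTGAGGCGGTTCCGCCGCTC"
    "GACGGCGGAGGCACGGCGGCGGAGGGGATGGACTGTGTTTGGAGATCGCTGCTGAATCAACCGCTTCAACAAAGAAGTAGCGCGTTGTAG"
)
PROTEIN = (
    "MDRHSSKRTRDSSPPQIGNGEKRQCGGRGASSSGGRLLDLDLDLDLSFGVFDFPWLKESLINSKSEDWKFEDSFDGVSGGTIETAFTEFSGVWCDLPDPWAVECEAVPPL"
    "DGGGTAAEGMDCVWRSLLNQPLQQRSSAL"
)

# Genome location LG10:3462055..3462474 (1-based, inclusive coordinates).
START, END = 3462055, 3462474

if __name__ == "__main__":
    print("CDS length equals genomic span:", len(CDS) == END - START + 1)
    print("Translation equals listed protein:", translate(CDS) == PROTEIN)
```

Because the CDS and mRNA sequences in this record are identical, the same check applies to the mRNA sequence unchanged.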