CuGenDBv2

Sed0025179 (gene) of Chayote v1 genome

Gene ID: Sed0025179
Organism: Sechium edule (Chayote v1)
Description: transcription factor SCREAM2-like isoform X1
Genome location: LG03:2138423..2140236
RNA-Seq Expression: Sed0025179
Synteny: Sed0025179
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005488 - binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains: IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology
GenBank top hits | e value | % identity
XP_022939238.1 uncharacterized protein LOC111445214 isoform X2 [Cucurbita moschata] | 6.0e-58 | 82.28
Query:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV
        MVSRE NNA+LH  LQLLRSITNSHA NKASIIVDASKYI+ELK KVERLNQDIST   S      HPMQVTVE+L KGFSINVFSEKSCQGLLVSIL+ 
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV

Query:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
        FE+L LNV+EARVSCTD+FQLQAI EIEEQGEEAI+AQ VKEAVVQAIK WSQSGEQD
Subjt:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

XP_022964712.1 uncharacterized protein LOC111464709 isoform X1 [Cucurbita moschata] | 2.7e-58 | 81.01
Query:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV
        MVSRE   A LHEKLQLLRSITNSHA NK SIIVDASKYI+ELK KVERLNQDI+T  + N  H +HPMQVTVE LVKGFSINVFSEKSCQGLLVSIL+ 
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV

Query:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
        FE+L LNV+EARVSCTDTFQLQA  EIEEQGEEA++AQ VKEAVV+AIKSWSQ+GEQD
Subjt:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

XP_023551077.1 uncharacterized protein LOC111809011 isoform X2 [Cucurbita pepo subsp. pepo] | 3.5e-58 | 82.28
Query:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV
        MVSRE NNA+LH  LQLLRSITNSHA NKASIIVDASKYI+ELK KVERLNQDIST   S      HPMQVTVE+L KGFSINVFSEKSCQGLLVSIL+ 
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV

Query:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
        FE+L LNV+EARVSCTD+FQLQAI EIEEQGEEAI+AQ VKEAVVQAIK WSQSGEQD
Subjt:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

XP_038884496.1 uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida] | 1.2e-58 | 82.82
Query:  MVSRELNNATLHEKLQLLRSITNSHAQ-NKASIIVDASKYIKELKHKVERLNQDISTNSLSNH----HHHHHPMQVTVETLVKGFSINVFSEKSCQGLLV
        MVSRE   A LHEKLQLLRSITNSHAQ NKASIIVDASKYI+ELK KVERLNQDIST   S H     H + PMQVTVE LVKGFSINVFSEKSCQGLLV
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQ-NKASIIVDASKYIKELKHKVERLNQDISTNSLSNH----HHHHHPMQVTVETLVKGFSINVFSEKSCQGLLV

Query:  SILQVFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
        SIL+VFE+L LNVIEARVSCTDTFQLQAI EIEE+GEEAI+AQ VKEAVVQAIKSW QSGEQD
Subjt:  SILQVFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida] | 3.2e-59 | 82.72
Query:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNH----HHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVS
        MVSRE   A LHEKLQLLRSITNSHA NKASIIVDASKYI+ELK KVERLNQDIST   S H     H + PMQVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNH----HHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVS

Query:  ILQVFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
        IL+VFE+L LNVIEARVSCTDTFQLQAI EIEE+GEEAI+AQ VKEAVVQAIKSW QSGEQD
Subjt:  ILQVFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

TrEMBL top hits | e value | % identity
A0A6J1FG89 uncharacterized protein LOC111445214 isoform X1 | 1.1e-57 | 82.39
Query:  MVSRELNNATLHEKLQLLRSITNSHAQ-NKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQ
        MVSRE NNA+LH  LQLLRSITNSHAQ NKASIIVDASKYI+ELK KVERLNQDIST   S      HPMQVTVE+L KGFSINVFSEKSCQGLLVSIL+
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQ-NKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQ

Query:  VFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
         FE+L LNV+EARVSCTD+FQLQAI EIEEQGEEAI+AQ VKEAVVQAIK WSQSGEQD
Subjt:  VFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

A0A6J1FL41 uncharacterized protein LOC111445214 isoform X2 | 2.9e-58 | 82.28
Query:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV
        MVSRE NNA+LH  LQLLRSITNSHA NKASIIVDASKYI+ELK KVERLNQDIST   S      HPMQVTVE+L KGFSINVFSEKSCQGLLVSIL+ 
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV

Query:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
        FE+L LNV+EARVSCTD+FQLQAI EIEEQGEEAI+AQ VKEAVVQAIK WSQSGEQD
Subjt:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X1 | 1.3e-58 | 81.01
Query:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV
        MVSRE   A LHEKLQLLRSITNSHA NK SIIVDASKYI+ELK KVERLNQDI+T  + N  H +HPMQVTVE LVKGFSINVFSEKSCQGLLVSIL+ 
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV

Query:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
        FE+L LNV+EARVSCTDTFQLQA  EIEEQGEEA++AQ VKEAVV+AIKSWSQ+GEQD
Subjt:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X1 | 1.3e-58 | 81.01
Query:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV
        MVSRE   A LHEKLQLLRSITNSHA NK SIIVDASKYI+ELK KVERLNQDI+T  + N  H +HPMQVTVE LVKGFSINVFSEKSCQGLLVSIL+ 
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV

Query:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
        FE+L LNV+EARVSCTDTFQLQA  EIEEQGEEA++AQ VKEAVV+AIKSWSQ+GEQD
Subjt:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

A0A6J1K013 uncharacterized protein LOC111489817 isoform X2 | 4.9e-58 | 81.65
Query:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV
        MVSRE NNA LH  LQLLRSITNSHA NKASIIVDASKYI+ELK KVERLNQDIST   S      HPMQVTVE+L KGFSINVFSEKSCQGLLVSIL+ 
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV

Query:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD
        FE+L LNV+EARVSCTD+FQLQAI EIEE+GEEAI+AQ VKEAVVQAIK WSQSGEQD
Subjt:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G29270.1 unknown protein | 1.0e-07 | 29.37
Query:  MVSRELNNATLHEKLQLLRSITN--SHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSIL
        MV+ E        K   L+++T+       ++ +I +A  YI  LK ++E L ++     ++     H   +V VE + + F + + S +  +  LV+IL
Subjt:  MVSRELNNATLHEKLQLLRSITN--SHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSIL

Query:  QVFEDLRLNVIEARVSCTDTFQLQAI
        + FE++ LNV +AR SC D+F ++AI
Subjt:  QVFEDLRLNVIEARVSCTDTFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1) | 1.9e-38 | 57.52
Query:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV
        MVSRE    +L EK QLLRSITNSHA+N  SII+DASKYI++LK KVER NQD +    S+         VTVETL KGF INVFS K+  G+LVS+L+ 
Subjt:  MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQV

Query:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQ
        FED+ LNV+EAR SCTD+F L A+G   E GE  ++A+ VK+AV  AI+SW +
Subjt:  FEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQ

AT3G56220.1 transcription regulators | 3.4e-35 | 54.19
Query:  MVSRE-LNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQ
        MVSRE    ++L EK  LLRSIT+SHA+++ SIIVDASKYIK+LK KVE++N   ++          +PM VTVETL KGF I V S K+  G+LV +L+
Subjt:  MVSRE-LNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQ

Query:  VFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQS
         FEDL L+V+EARVSCTDTF L AIG       + I+A+ VK+AV +AI++WS S
Subjt:  VFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQS

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | 1.1e-04 | 29.66
Query:  LHEKLQLLRSITNSHAQ-NKASIIVDASKYIKELKHKVERLNQD---ISTNS-LSNHHHHHHPMQVTVETLVKGFSINVFSEKSC---QGLLVSILQVFE
        L+++L LLRSI     + ++ SI+ DA  Y+KEL  K+ +L +D   + +NS LS    +   ++ +++  V    +N   +  C    GL+VS +   E
Subjt:  LHEKLQLLRSITNSHAQ-NKASIIVDASKYIKELKHKVERLNQD---ISTNS-LSNHHHHHHPMQVTVETLVKGFSINVFSEKSC---QGLLVSILQVFE

Query:  DLRLNVIEARVSCTDTFQLQA-IGEIEEQGEEAINAQVVKEAVVQ
         L L + +  +SC   F LQA   E+ EQ    + ++  K+A+++
Subjt:  DLRLNVIEARVSCTDTFQLQA-IGEIEEQGEEAINAQVVKEAVVQ

AT5G65640.1 beta HLH protein 93 | 6.9e-04 | 28.48
Query:  LHEKLQLLRSITNSHAQ-NKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPM-----QVTVETLVKG---FSINVFSEKS-----CQ---GL
        L+++L +LRSI    ++ ++ SI+ DA  Y+KEL  K+ +L  +      SN+ HH             E LV+    F I+   E +     C    GL
Subjt:  LHEKLQLLRSITNSHAQ-NKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPM-----QVTVETLVKG---FSINVFSEKS-----CQ---GL

Query:  LVSILQVFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAV
        L+S +   E L L + +  +SC   F LQA      +  + I ++ +K+A+
Subjt:  LVSILQVFEDLRLNVIEARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAV


Sequences
CDS sequence
ATGGTTTCTAGAGAGCTCAACAATGCAACTCTTCATGAAAAGCTTCAATTACTTCGCTCTATTACCAACTCTCATGCTCAAAACAAGGCTTCAATTATAGTGGATGCATC
AAAATATATCAAGGAGCTAAAACACAAAGTAGAAAGATTGAATCAAGACATATCAACCAACTCACTTTCTAATCATCATCATCATCATCATCCCATGCAGGTTACAGTGG
AAACCCTAGTAAAGGGATTTTCTATAAATGTATTTTCAGAGAAAAGCTGCCAAGGTCTCCTTGTCTCAATATTACAAGTCTTTGAAGACCTGAGGCTTAATGTTATTGAA
GCTAGGGTTTCTTGTACTGACACTTTCCAATTACAAGCCATTGGAGAAATTGAGGAACAAGGAGAAGAAGCCATTAATGCTCAAGTTGTAAAAGAAGCAGTAGTTCAAGC
TATAAAGAGCTGGAGTCAAAGCGGTGAACAAGATTAA
mRNA sequence
GGGGAAAGCTAGCAAACAGAAGATATCATATATAGATTTGAGAGCTCATATTATGATGCCCCTTATAAAAACACACTCTCACAGAAATTTTTGCAGAGTGAAAATTAAGA
ACAAAAAAGAAAAAAGAAAAAAAAGAAGATATATTTGAATCCATGGTTTCTAGAGAGCTCAACAATGCAACTCTTCATGAAAAGCTTCAATTACTTCGCTCTATTACCAA
CTCTCATGCTCAAAACAAGGCTTCAATTATAGTGGATGCATCAAAATATATCAAGGAGCTAAAACACAAAGTAGAAAGATTGAATCAAGACATATCAACCAACTCACTTT
CTAATCATCATCATCATCATCATCCCATGCAGGTTACAGTGGAAACCCTAGTAAAGGGATTTTCTATAAATGTATTTTCAGAGAAAAGCTGCCAAGGTCTCCTTGTCTCA
ATATTACAAGTCTTTGAAGACCTGAGGCTTAATGTTATTGAAGCTAGGGTTTCTTGTACTGACACTTTCCAATTACAAGCCATTGGAGAAATTGAGGAACAAGGAGAAGA
AGCCATTAATGCTCAAGTTGTAAAAGAAGCAGTAGTTCAAGCTATAAAGAGCTGGAGTCAAAGCGGTGAACAAGATTAA
Protein sequence
MVSRELNNATLHEKLQLLRSITNSHAQNKASIIVDASKYIKELKHKVERLNQDISTNSLSNHHHHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSILQVFEDLRLNVIE
ARVSCTDTFQLQAIGEIEEQGEEAINAQVVKEAVVQAIKSWSQSGEQD