; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027815 (gene) of Chayote v1 genome

Gene IDSed0027815
OrganismSechium edule (Chayote v1)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationLG08:2356726..2358646
RNA-Seq ExpressionSed0027815
SyntenySed0027815
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589626.1 Myb-like protein L, partial [Cucurbita argyrosperma subsp. sororia]6.3e-4870.3Show/hide
Query:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE
        AGVN EDYIN       AGD     D DDVDD +L+RNIQ +FSIA D++PLS LPP   DEEEDDFETLR IQ+RF AY S  LSNK DQSCD DGPL+
Subjt:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE

Query:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        MDS++T  ER TSS+RSSM+ FEKGS+PKAAL+FIDAIKKNRSQQKFI +KMIHLE RIEENK+L
Subjt:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

KAG7023314.1 Myb-like protein L [Cucurbita argyrosperma subsp. argyrosperma]6.3e-4870.3Show/hide
Query:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE
        AGVN EDYIN       AGD     D DDVDD +L+RNIQ +FSIA D++PLS LPP   DEEEDDFETLR IQ+RF AY S  LSNK DQSCD DGPL+
Subjt:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE

Query:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        MDS++T  ER TSS+RSSM+ FEKGS+PKAAL+FIDAIKKNRSQQKFI +KMIHLE RIEENK+L
Subjt:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

XP_022987958.1 uncharacterized protein LOC111485355 isoform X1 [Cucurbita maxima]6.3e-4869.7Show/hide
Query:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE
        AGVN EDY+N       AGD     D DDVDD +L+RNIQ +FSIA D+QPLS LPP   DEEEDDFETLR IQ+RF AY S  LSNK DQSCD DGPL+
Subjt:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE

Query:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        MDS++T  ER TSS+RSSM+ FEKGS+PKAAL+FIDAIKKNRSQQKF+ +KMIHLE RIEENK+L
Subjt:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

XP_022987959.1 uncharacterized protein LOC111485355 isoform X2 [Cucurbita maxima]6.3e-4869.7Show/hide
Query:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE
        AGVN EDY+N       AGD     D DDVDD +L+RNIQ +FSIA D+QPLS LPP   DEEEDDFETLR IQ+RF AY S  LSNK DQSCD DGPL+
Subjt:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE

Query:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        MDS++T  ER TSS+RSSM+ FEKGS+PKAAL+FIDAIKKNRSQQKF+ +KMIHLE RIEENK+L
Subjt:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]1.4e-4770.3Show/hide
Query:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE
        AGVN EDYIN       AGD     D DDVDD +L+RNIQ +FSIA D+QPLS LPP   DEEEDDFE LR IQ+RF AY S  LSNK DQSCD DGPL+
Subjt:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE

Query:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        MDS +T  ER TSS+RSSM+ FEKGS+PKAAL+FIDAIKKNRSQQKFI +KMIHLE RIEENK+L
Subjt:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

TrEMBL top hitse value%identityAlignment
A0A6J1C075 uncharacterized protein LOC111007172 isoform X22.0e-4464Show/hide
Query:  GVNPEDYIN--------------SVAGDDPDDVDDFQLVRNIQKQFSIADDDQ-------PLSALP---PDEEEDDFETLRRIQQRFGAYGSGTLSNKHD
        G NPE+Y N              S  G D DDVDD +LVRNI+ +FSIA DD+       PLS LP   PDEEEDDFETLR IQ+RF AY S  LSN  D
Subjt:  GVNPEDYIN--------------SVAGDDPDDVDDFQLVRNIQKQFSIADDDQ-------PLSALP---PDEEEDDFETLRRIQQRFGAYGSGTLSNKHD

Query:  QSCDFDGPLEMDSNSTE--RSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        QSCDF GPLEMDSN T+  R TSS RSSML  EKG++PKAAL+FIDAIKKNRSQQKFI +KMIHLE RIEENK+L
Subjt:  QSCDFDGPLEMDSNSTE--RSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X29.8e-4769.09Show/hide
Query:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE
        AGVN ED IN       AGD     D DDVDD +L+RNIQ +FS A D+QPLS LPP   DEEEDDFETLR IQ+RF AY S  LSNK DQSCD DGPL+
Subjt:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE

Query:  MDSNSTE--RSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        MDS++T+  R TSS+RSSM+ FEKGS+PKAAL+FIDAIKKNRSQQKFI +KMIHLE RIEENK+L
Subjt:  MDSNSTE--RSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X19.8e-4769.09Show/hide
Query:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE
        AGVN ED IN       AGD     D DDVDD +L+RNIQ +FS A D+QPLS LPP   DEEEDDFETLR IQ+RF AY S  LSNK DQSCD DGPL+
Subjt:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE

Query:  MDSNSTE--RSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        MDS++T+  R TSS+RSSM+ FEKGS+PKAAL+FIDAIKKNRSQQKFI +KMIHLE RIEENK+L
Subjt:  MDSNSTE--RSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X23.0e-4869.7Show/hide
Query:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE
        AGVN EDY+N       AGD     D DDVDD +L+RNIQ +FSIA D+QPLS LPP   DEEEDDFETLR IQ+RF AY S  LSNK DQSCD DGPL+
Subjt:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE

Query:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        MDS++T  ER TSS+RSSM+ FEKGS+PKAAL+FIDAIKKNRSQQKF+ +KMIHLE RIEENK+L
Subjt:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X13.0e-4869.7Show/hide
Query:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE
        AGVN EDY+N       AGD     D DDVDD +L+RNIQ +FSIA D+QPLS LPP   DEEEDDFETLR IQ+RF AY S  LSNK DQSCD DGPL+
Subjt:  AGVNPEDYIN-----SVAGD-----DPDDVDDFQLVRNIQKQFSIADDDQPLSALPP---DEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFDGPLE

Query:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL
        MDS++T  ER TSS+RSSM+ FEKGS+PKAAL+FIDAIKKNRSQQKF+ +KMIHLE RIEENK+L
Subjt:  MDSNST--ERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G18100.1 myb domain protein 4r11.0e-1129.53Show/hide
Query:  TIRSPPARKTTTWKSSEEPAGVNPEDYINSVAGDDPDDVDDFQLVRNIQKQFSIA-DDDQPLSALPPDEEEDDFETLRRIQQRFGAY------------G
        TI+S  A      +SS  P G++           D +  DDF+++R+I+ Q S++ D   P   L  DEE+D FETLR I++RF AY             
Subjt:  TIRSPPARKTTTWKSSEEPAGVNPEDYINSVAGDDPDDVDDFQLVRNIQKQFSIA-DDDQPLSALPPDEEEDDFETLRRIQQRFGAY------------G

Query:  SGTLSNKHDQSCDFDGPLEMDSNSTERSTSSKRSSM------------LTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKR
         G     H+   +    +   SN+ E      +S +            +     S P+AA +F+DAI++NR+ QKF+  K+  +E  IE+N++
Subjt:  SGTLSNKHDQSCDFDGPLEMDSNSTERSTSSKRSSM------------LTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKR

AT3G18100.2 myb domain protein 4r13.5e-0446.15Show/hide
Query:  SMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKR
        S P+AA +F+DAI++NR+ QKF+  K+  +E  IE+N++
Subjt:  SMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKR

AT3G18100.3 myb domain protein 4r11.1e-0537.93Show/hide
Query:  TIRSPPARKTTTWKSSEEPAGVNPEDYINSVAGDDPDDVDDFQLVRNIQKQFSIA-DDDQPLSALPPDEEEDDFETLRRIQQRFGAY
        TI+S  A      +SS  P G++           D +  DDF+++R+I+ Q S++ D   P   L  DEE+D FETLR I++RF AY
Subjt:  TIRSPPARKTTTWKSSEEPAGVNPEDYINSVAGDDPDDVDDFQLVRNIQKQFSIA-DDDQPLSALPPDEEEDDFETLRRIQQRFGAY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGCGGCGGCGGCGGCGGCGACGATTCGCTCACCGCCGGCGAGGAAGACGACGACATGGAAGTCCTCAGAGGAGCCTGCCGGCGTTAATCCTGAGGATTACATTAA
TTCTGTCGCCGGAGATGATCCAGATGATGTTGATGATTTTCAACTTGTTCGGAATATTCAGAAACAGTTCTCGATTGCCGACGATGATCAGCCGTTGAGTGCTCTCCCAC
CGGACGAGGAGGAAGACGATTTCGAGACGCTTCGTCGGATTCAGCAGCGCTTTGGTGCGTATGGAAGCGGTACTTTGAGCAATAAACATGATCAGTCTTGTGACTTTGAT
GGGCCTCTCGAGATGGATTCTAACAGCACAGAGAGATCGACATCCTCAAAAAGGTCATCTATGCTAACCTTTGAAAAGGGAAGCATGCCAAAGGCTGCATTGTCATTTAT
CGATGCTATCAAGAAGAATAGGTCACAACAGAAGTTTATATGTAATAAGATGATTCATCTTGAAACTAGAATTGAGGAGAACAAAAGGCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGGCGGCGGCGGCGGCGGCGACGATTCGCTCACCGCCGGCGAGGAAGACGACGACATGGAAGTCCTCAGAGGAGCCTGCCGGCGTTAATCCTGAGGATTACATTAA
TTCTGTCGCCGGAGATGATCCAGATGATGTTGATGATTTTCAACTTGTTCGGAATATTCAGAAACAGTTCTCGATTGCCGACGATGATCAGCCGTTGAGTGCTCTCCCAC
CGGACGAGGAGGAAGACGATTTCGAGACGCTTCGTCGGATTCAGCAGCGCTTTGGTGCGTATGGAAGCGGTACTTTGAGCAATAAACATGATCAGTCTTGTGACTTTGAT
GGGCCTCTCGAGATGGATTCTAACAGCACAGAGAGATCGACATCCTCAAAAAGGTCATCTATGCTAACCTTTGAAAAGGGAAGCATGCCAAAGGCTGCATTGTCATTTAT
CGATGCTATCAAGAAGAATAGGTCACAACAGAAGTTTATATGTAATAAGATGATTCATCTTGAAACTAGAATTGAGGAGAACAAAAGGCTCTGA
Protein sequenceShow/hide protein sequence
MTAAAAAATIRSPPARKTTTWKSSEEPAGVNPEDYINSVAGDDPDDVDDFQLVRNIQKQFSIADDDQPLSALPPDEEEDDFETLRRIQQRFGAYGSGTLSNKHDQSCDFD
GPLEMDSNSTERSTSSKRSSMLTFEKGSMPKAALSFIDAIKKNRSQQKFICNKMIHLETRIEENKRL