CuGenDBv2

HG10002357 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10002357
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: snRNA-activating protein complex subunit 4
Genome location: Chr11:5942288..5943533
RNA-Seq Expression: HG10002357
Synteny: HG10002357
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domains: NA


Homology
GenBank top hits
XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus] (e value: 5.9e-39; % identity: 77.12)
Query:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS RNH +E  +E PA K+D VVDED+E L RAYRL GVNPED INPRLSSPAAGDA+ GSDSDD+DDFELLR+IQ+RFSI+ADEQP ST  PVS  EEE
Subjt:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYKS
        DEFEMLRSIQRRFAAY+S
Subjt:  DEFEMLRSIQRRFAAYKS

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 5.0e-38; % identity: 70.97)
Query:  MSHRNHDEEGAIELPAGK---KDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPH
        MS R+H + G  ELPA +   +DD+VD+D+E L RA RL GVN ED INPRLS PAAGDANLGSDSDD+DD ELLRNIQ+RFSI ADEQPLS LPPV+  
Subjt:  MSHRNHDEEGAIELPAGK---KDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPH

Query:  EEEDEFEMLRSIQRRFAAYKSGLI
        EEED+FEMLRSIQRRFAAY+S ++
Subjt:  EEEDEFEMLRSIQRRFAAYKSGLI

XP_023515736.1 uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 5.0e-38; % identity: 70.97)
Query:  MSHRNHDEEGAIELPAGK---KDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPH
        MS R+H + G  ELPA +   +DD+VD+D+E L RA RL GVN ED INPRLS PAAGDANLGSDSDD+DD ELLRNIQ+RFSI ADEQPLS LPPV+  
Subjt:  MSHRNHDEEGAIELPAGK---KDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPH

Query:  EEEDEFEMLRSIQRRFAAYKSGLI
        EEED+FEMLRSIQRRFAAY+S ++
Subjt:  EEEDEFEMLRSIQRRFAAYKSGLI

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida] (e value: 1.3e-43; % identity: 80.17)
Query:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS  NH +EG +ELPA K+DDVVDED+E L RAYRLVGVNPED INPRLSSPA GDAN G DSDD DDFELLRNIQ+RFSIV DEQPLSTLPPVS  EEE
Subjt:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYKSGLI
        DEFEMLRSIQRRFAAY+S ++
Subjt:  DEFEMLRSIQRRFAAYKSGLI

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida] (e value: 1.3e-43; % identity: 80.17)
Query:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS  NH +EG +ELPA K+DDVVDED+E L RAYRLVGVNPED INPRLSSPA GDAN G DSDD DDFELLRNIQ+RFSIV DEQPLSTLPPVS  EEE
Subjt:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYKSGLI
        DEFEMLRSIQRRFAAY+S ++
Subjt:  DEFEMLRSIQRRFAAYKSGLI

TrEMBL top hits
A0A0A0L2R2 Uncharacterized protein (e value: 2.8e-39; % identity: 77.12)
Query:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS RNH +E  +E PA K+D VVDED+E L RAYRL GVNPED INPRLSSPAAGDA+ GSDSDD+DDFELLR+IQ+RFSI+ADEQP ST  PVS  EEE
Subjt:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYKS
        DEFEMLRSIQRRFAAY+S
Subjt:  DEFEMLRSIQRRFAAYKS

A0A1S3BUG0 snRNA-activating protein complex subunit 4 (e value: 4.1e-38; % identity: 75.42)
Query:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS  NH +E  +E  A K+D VVDED+E L RAYRLVGVNPED I+PR SS  AGDA+ GSDSDD+DDFELLR+IQ+RFSIVADEQPLSTL PVS  EEE
Subjt:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYKS
        DEFEMLRSIQRRFAAY+S
Subjt:  DEFEMLRSIQRRFAAYKS

A0A5D3BLR5 snRNA-activating protein complex subunit 4 (e value: 1.6e-37; % identity: 74.58)
Query:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS  NH +E  +E  A K+D VVDED+E L RAYRLVGVNPED I+PR SS  AGDA+ GSDS+D+DDFELLR+IQ+RFSIVADEQPLSTL PVS  EEE
Subjt:  MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYKS
        DEFEMLRSIQRRFAAY+S
Subjt:  DEFEMLRSIQRRFAAYKS

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X2 (e value: 3.5e-37; % identity: 69.35)
Query:  MSHRNHDEEGAIELPAGK---KDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPH
        MS R+H + G  ELPA +   +DD+VD+D+E L RA RL GVN ED INPRLS PAAGDANLGSDSDD+DD ELLRNIQ+RFS  ADEQPLS LPPV+  
Subjt:  MSHRNHDEEGAIELPAGK---KDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPH

Query:  EEEDEFEMLRSIQRRFAAYKSGLI
        EEED+FE LRSIQRRFAAY+S ++
Subjt:  EEEDEFEMLRSIQRRFAAYKSGLI

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X1 (e value: 3.5e-37; % identity: 69.35)
Query:  MSHRNHDEEGAIELPAGK---KDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPH
        MS R+H + G  ELPA +   +DD+VD+D+E L RA RL GVN ED INPRLS PAAGDANLGSDSDD+DD ELLRNIQ+RFS  ADEQPLS LPPV+  
Subjt:  MSHRNHDEEGAIELPAGK---KDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPH

Query:  EEEDEFEMLRSIQRRFAAYKSGLI
        EEED+FE LRSIQRRFAAY+S ++
Subjt:  EEEDEFEMLRSIQRRFAAYKSGLI

SwissProt top hits
No hits found

Arabidopsis top hits
AT3G18100.1 myb domain protein 4r1 (e value: 5.2e-09; % identity: 47.95)
Query:  RLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPV--SPHEEEDEFEMLRSIQRRFAAYKS
        R S P  G  +L SDS+  DDFE++R+I+S+ S+  D     +LPP+  S  EE+D FE LR+I+RRF+AYK+
Subjt:  RLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPV--SPHEEEDEFEMLRSIQRRFAAYKS

AT3G18100.3 myb domain protein 4r1 (e value: 5.2e-09; % identity: 47.95)
Query:  RLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPV--SPHEEEDEFEMLRSIQRRFAAYKS
        R S P  G  +L SDS+  DDFE++R+I+S+ S+  D     +LPP+  S  EE+D FE LR+I+RRF+AYK+
Subjt:  RLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPV--SPHEEEDEFEMLRSIQRRFAAYKS
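
In the alignments above, the middle line of each block appears to follow the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention, and assuming the reported % identity is identical positions divided by aligned columns, the value can be recomputed with a minimal Python sketch such as the one below. The function name `percent_identity` and the (query, midline) pairing are hypothetical choices for illustration, not part of CuGenDB.

```python
from typing import List, Tuple


def percent_identity(blocks: List[Tuple[str, str]]) -> float:
    """Recompute % identity from (query_segment, midline) pairs.

    Assumption: a letter in the midline is an identity; '+' and spaces are not.
    The query segment supplies the block width, so trailing spaces lost from
    the midline during copy and paste do not change the result.
    """
    identities = 0
    columns = 0
    for query, midline in blocks:
        identities += sum(ch.isalpha() for ch in midline)
        columns += len(query)
    return round(100.0 * identities / columns, 2)


# Second block of the first GenBank hit: 17 identical residues in 18 columns.
example = [("DEFEMLRSIQRRFAAYKS", "DEFEMLRSIQRRFAAY+S")]
print(percent_identity(example))
```

To reproduce the figure reported for a hit, pass every block of that hit's alignment in one list, since the percentage is taken over the full alignment rather than per block.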


Sequences
CDS sequence
ATGTCTCACCGCAACCATGACGAGGAAGGTGCCATAGAGCTTCCTGCCGGCAAGAAAGATGATGTGGTTGATGAGGACGTGGAAGCCCTTTGGAGAGCCTATAGGCTTGT
TGGAGTTAATCCTGAGGATTGCATTAATCCTAGGTTGTCATCACCTGCTGCTGGAGATGCTAATCTTGGTTCTGATTCTGACGATCTTGATGATTTCGAACTTCTTCGAA
ATATTCAGAGTCGGTTCTCGATTGTGGCTGATGAGCAACCGTTGAGTACTCTCCCACCAGTGTCCCCACACGAGGAGGAAGATGAATTCGAGATGCTTCGTTCAATTCAG
CGGCGCTTTGCAGCGTATAAAAGCGGACTCATTTATCTACTTTATCTCCTTTTATGTGTTGAGGCTAATGCTATTCTAGAAGGTCTTCGACTGGTGAAAGGATTGGTATT
CGCTCGCTGA
mRNA sequence
ATGTCTCACCGCAACCATGACGAGGAAGGTGCCATAGAGCTTCCTGCCGGCAAGAAAGATGATGTGGTTGATGAGGACGTGGAAGCCCTTTGGAGAGCCTATAGGCTTGT
TGGAGTTAATCCTGAGGATTGCATTAATCCTAGGTTGTCATCACCTGCTGCTGGAGATGCTAATCTTGGTTCTGATTCTGACGATCTTGATGATTTCGAACTTCTTCGAA
ATATTCAGAGTCGGTTCTCGATTGTGGCTGATGAGCAACCGTTGAGTACTCTCCCACCAGTGTCCCCACACGAGGAGGAAGATGAATTCGAGATGCTTCGTTCAATTCAG
CGGCGCTTTGCAGCGTATAAAAGCGGACTCATTTATCTACTTTATCTCCTTTTATGTGTTGAGGCTAATGCTATTCTAGAAGGTCTTCGACTGGTGAAAGGATTGGTATT
CGCTCGCTGA
Protein sequence
MSHRNHDEEGAIELPAGKKDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEEDEFEMLRSIQ
RRFAAYKSGLIYLLYLLLCVEANAILEGLRLVKGLVFAR
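
The protein sequence above should be the conceptual translation of the CDS. As a minimal sketch for checking this, assuming the standard nuclear genetic code, the snippet below translates a CDS string codon by codon and stops at the first stop codon. The helper name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the HG10002357 CDS listed above.
print(translate("ATGTCTCACCGCAACCATGACGAGGAAGGT"))  # MSHRNHDEEG
```

Pasting the full CDS, line breaks included, into `translate` and comparing the result with the listed protein sequence gives a quick consistency check on the record.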