; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G005910 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G005910
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationchr07:6272001..6285830
RNA-Seq ExpressionLsi07G005910
SyntenyLsi07G005910
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060521.1 snRNA-activating protein complex subunit 4 [Cucumis melo var. makuwa]1.1e-3975.4Show/hide
Query:  HSSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPH
        H++MS  NH +E  VE  ADK+D VVDEDME L+R YRLVGVNPED I+PR SS  AGDA+ GSDS+DVDDFELLR+IQ+RFSIVADEQPLSTL PVS  
Subjt:  HSSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPH

Query:  EEEDEFEMLRSIQRRFAAYESDYFVN
        EEEDEFEMLRSIQRRFAAYESD   N
Subjt:  EEEDEFEMLRSIQRRFAAYESDYFVN

XP_008452207.1 PREDICTED: snRNA-activating protein complex subunit 4 [Cucumis melo]4.2e-3977.24Show/hide
Query:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS  NH +E  VE  ADK+D VVDEDME L+R YRLVGVNPED I+PR SS  AGDA+ GSDSDDVDDFELLR+IQ+RFSIVADEQPLSTL PVS  EEE
Subjt:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYESDYFVN
        DEFEMLRSIQRRFAAYESD   N
Subjt:  DEFEMLRSIQRRFAAYESDYFVN

XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]1.1e-3978.05Show/hide
Query:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS  NH +E  VE PADK+D VVDEDME L+R YRL GVNPED INPRLSSPAAGDA+ GSDSDDVDDFELLR+IQ+RFSI+ADEQP ST  PVS  EEE
Subjt:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYESDYFVN
        DEFEMLRSIQRRFAAYESD   N
Subjt:  DEFEMLRSIQRRFAAYESDYFVN

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]4.7e-4680Show/hide
Query:  VRLPHSSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPP
        VR+ H++MS  NH +EG VELPA+K+DDVVDEDME L+R YRLVGVNPED INPRLSSPA GDAN G DSDD DDFELLRNIQ+RFSIV DEQPLSTLPP
Subjt:  VRLPHSSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPP

Query:  VSPHEEEDEFEMLRSIQRRFAAYESDYFVN
        VS  EEEDEFEMLRSIQRRFAAYESD   N
Subjt:  VSPHEEEDEFEMLRSIQRRFAAYESDYFVN

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]7.5e-4482.11Show/hide
Query:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS  NH +EG VELPA+K+DDVVDEDME L+R YRLVGVNPED INPRLSSPA GDAN G DSDD DDFELLRNIQ+RFSIV DEQPLSTLPPVS  EEE
Subjt:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYESDYFVN
        DEFEMLRSIQRRFAAYESD   N
Subjt:  DEFEMLRSIQRRFAAYESDYFVN

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein5.4e-4078.05Show/hide
Query:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS  NH +E  VE PADK+D VVDEDME L+R YRL GVNPED INPRLSSPAAGDA+ GSDSDDVDDFELLR+IQ+RFSI+ADEQP ST  PVS  EEE
Subjt:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYESDYFVN
        DEFEMLRSIQRRFAAYESD   N
Subjt:  DEFEMLRSIQRRFAAYESDYFVN

A0A1S3BUG0 snRNA-activating protein complex subunit 42.0e-3977.24Show/hide
Query:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE
        MS  NH +E  VE  ADK+D VVDEDME L+R YRLVGVNPED I+PR SS  AGDA+ GSDSDDVDDFELLR+IQ+RFSIVADEQPLSTL PVS  EEE
Subjt:  MSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEE

Query:  DEFEMLRSIQRRFAAYESDYFVN
        DEFEMLRSIQRRFAAYESD   N
Subjt:  DEFEMLRSIQRRFAAYESDYFVN

A0A5D3BLR5 snRNA-activating protein complex subunit 45.4e-4075.4Show/hide
Query:  HSSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPH
        H++MS  NH +E  VE  ADK+D VVDEDME L+R YRLVGVNPED I+PR SS  AGDA+ GSDS+DVDDFELLR+IQ+RFSIVADEQPLSTL PVS  
Subjt:  HSSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPH

Query:  EEEDEFEMLRSIQRRFAAYESDYFVN
        EEEDEFEMLRSIQRRFAAYESD   N
Subjt:  EEEDEFEMLRSIQRRFAAYESDYFVN

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X26.6e-3872.22Show/hide
Query:  MSHVNHDEEGAVELPA---DKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPH
        MS  +H + G  ELPA   D +DD+VD+DME LRR  RL GVN ED INPRLS PAAGDANLGSDSDDVDD ELLRNIQ+RFS  ADEQPLS LPPV+  
Subjt:  MSHVNHDEEGAVELPA---DKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPH

Query:  EEEDEFEMLRSIQRRFAAYESDYFVN
        EEED+FE LRSIQRRFAAYESD   N
Subjt:  EEEDEFEMLRSIQRRFAAYESDYFVN

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X16.6e-3872.22Show/hide
Query:  MSHVNHDEEGAVELPA---DKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPH
        MS  +H + G  ELPA   D +DD+VD+DME LRR  RL GVN ED INPRLS PAAGDANLGSDSDDVDD ELLRNIQ+RFS  ADEQPLS LPPV+  
Subjt:  MSHVNHDEEGAVELPA---DKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPH

Query:  EEEDEFEMLRSIQRRFAAYESDYFVN
        EEED+FE LRSIQRRFAAYESD   N
Subjt:  EEEDEFEMLRSIQRRFAAYESDYFVN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G18100.1 myb domain protein 4r12.9e-0935.25Show/hide
Query:  SSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPV--SP
        S    +  +  G  E+P+D ++   ++D E LR +   +  + +     R S P  G  +L SDS+  DDFE++R+I+S+ S+  D     +LPP+  S 
Subjt:  SSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPV--SP

Query:  HEEEDEFEMLRSIQRRFAAYES
         EE+D FE LR+I+RRF+AY++
Subjt:  HEEEDEFEMLRSIQRRFAAYES

AT3G18100.3 myb domain protein 4r12.9e-0935.25Show/hide
Query:  SSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPV--SP
        S    +  +  G  E+P+D ++   ++D E LR +   +  + +     R S P  G  +L SDS+  DDFE++R+I+S+ S+  D     +LPP+  S 
Subjt:  SSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPV--SP

Query:  HEEEDEFEMLRSIQRRFAAYES
         EE+D FE LR+I+RRF+AY++
Subjt:  HEEEDEFEMLRSIQRRFAAYES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGAATGAAGAAAATGAAGAAGATGAAGCAAAGGAAGGAGATGAAGAAATTCTGGATCTGTGAAAAAGAAGATGAAGAAGGGAAGGAAATGTTGGATCGGTTGCT
TGAAGTTGGATTTTTCGAATTTAAAGATCATATCAAGTTTTTCATCGGCAAGTTCTCAAATCATTCATTCGTTTTGTTTCATTATCTCGCCCGAGATTCATCATATAAAC
TAGGTAGAAGATTGAGCCAAAAAGTACCCCAAGCATTTAAGGACAGACCTGGTCGGGCCACCAACGACAACCATGACGAGGAAGGTGCCATAGAGCTTCCTGCCGGCAAG
AAAGATGATGTGGTTGATGAGGACGTGGAAGCCCTTTGGAGAGCCTATAGGCTTGTTGGAGTTAATCCTGAGGATTGCATTAATCCTAGGTTGTCATCACCTGCTGCTGG
AGATGCTAATCTTGGTTCTGATTCTGACGATCTTGATGATTTCGAACTTCTTCGAAATATTCAGAGTCGGTTCTCGATTGTGGCTGATGAGCAACCGTTGAGTACTCTCC
CACCAGTGTCCCCACACGAGGAGGAAGATGAATTCGAGATGCTTCGACTCATTTATCTACTTTATCTCCTTTTATGTGTTGAGGCTAATGCTATTCTAGAAGGTCTTCGA
CTGGTGAAAGGATTGCAAATCTATCGCTGGAGTAGCCGGTTGTCTGTTCGTGCAGCAACCGATCGTCGATTCTGCAGCTGGCCGTCCGTTTGTGCAACAGCGGCCGTCCA
TTCGTGCAACAGCGGCCGTCCATTACGGTCATCCGTTTTGTTCGTGCAGCCGCGACCGTCTGTTCGTCAGTTCGTTTGTGAAGCAGTCGTTCGTTCGTTCGTGCAATCGT
CGGGGTTGTCTGTTTGTTTGTTTGTTTATTTGTCATCGCCTTCACCAGATGTTCTTAACAATTCTTTTTACATAACTTGCGCAACAATCATTTCAATTTTTGAGCTCGCA
TCGATTAAGGTGCTCGAAGTGCATGCAAACCCTTTTTCTCTTTTTTTTCTTCTTCTTTCGCGTGAGACCGCCGCCCGTACTTCACAGCTGAGTCGCTCGCCGATCAGTCA
CGCCGTCGGCCCCGCCCGTCCGCGCGCCGATCAGTCTCATCGTCCGCGCCGATCAGTCTCATCGTCCGCGCCGATCAGTCTCGCCGCCGCGCGCTCCGCCCGTCGTTTTC
GTCCAGTCGAAGCTCGCCGCCGCACGCTCCGCCAGTCCGCGTCAGACCACGCTGTTCCAGCCGCCGCGCGCTCAGTCGAACCTCGCCGCCGTCTACGCGAGCCGCCGCGC
GCTCCGCCAGTCTGCGCTGCCGCGCCGTTTCGCCAGCCGCCGCTCCACCCAGATCGCGCCGCCGCCGCCAGTCACCGCGCGCTGCCATTCTGCCGCTGCCATCGTCGCAG
CGGGATTTGGGGTTTCGTTTGGGGTGTCGCCCAACACGGTAACCTTGGGTTGCAGGTTCGAAGAGCGGAAGACCTCTATCGAGTACGTCTTCCTCATTCATCCATGTCTC
ACGTCAACCATGACGAGGAAGGTGCCGTTGAGCTTCCTGCCGACAAGAAAGATGATGTGGTTGATGAGGACATGGAAGCCCTTCGGAGAGTCTATAGGCTTGTTGGAGTT
AATCCTGAGGATTGCATTAATCCTAGGTTGTCATCACCTGCTGCTGGAGATGCTAATCTTGGTTCTGATTCTGATGATGTTGATGATTTCGAACTTCTTCGAAATATTCA
GAGTCGGTTCTCGATTGTGGCCGATGAGCAACCGTTGAGTACTCTCCCACCAGTGTCCCCACACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATTCAGCGGCGCT
TTGCAGCGTATGAAAGCGATTACTTTGTAAATGTCATATTGTATTGGCTGTTTATGACGTCGTTCGTTTTGAAATTCCTTGGTTGTGATAATATTACTCCGTTTTCCAAT
GACTTTGGATTACTTGGGAGGAATTGTATGTTCCTTCCCCAACTTCTAAGAAAATGCAATACTAAGCCTTACAGGTGGTCAATATCTTCCATCTCTCCGAAACACAATCT
ATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAGAATGAAGAAAATGAAGAAGATGAAGCAAAGGAAGGAGATGAAGAAATTCTGGATCTGTGAAAAAGAAGATGAAGAAGGGAAGGAAATGTTGGATCGGTTGCT
TGAAGTTGGATTTTTCGAATTTAAAGATCATATCAAGTTTTTCATCGGCAAGTTCTCAAATCATTCATTCGTTTTGTTTCATTATCTCGCCCGAGATTCATCATATAAAC
TAGGTAGAAGATTGAGCCAAAAAGTACCCCAAGCATTTAAGGACAGACCTGGTCGGGCCACCAACGACAACCATGACGAGGAAGGTGCCATAGAGCTTCCTGCCGGCAAG
AAAGATGATGTGGTTGATGAGGACGTGGAAGCCCTTTGGAGAGCCTATAGGCTTGTTGGAGTTAATCCTGAGGATTGCATTAATCCTAGGTTGTCATCACCTGCTGCTGG
AGATGCTAATCTTGGTTCTGATTCTGACGATCTTGATGATTTCGAACTTCTTCGAAATATTCAGAGTCGGTTCTCGATTGTGGCTGATGAGCAACCGTTGAGTACTCTCC
CACCAGTGTCCCCACACGAGGAGGAAGATGAATTCGAGATGCTTCGACTCATTTATCTACTTTATCTCCTTTTATGTGTTGAGGCTAATGCTATTCTAGAAGGTCTTCGA
CTGGTGAAAGGATTGCAAATCTATCGCTGGAGTAGCCGGTTGTCTGTTCGTGCAGCAACCGATCGTCGATTCTGCAGCTGGCCGTCCGTTTGTGCAACAGCGGCCGTCCA
TTCGTGCAACAGCGGCCGTCCATTACGGTCATCCGTTTTGTTCGTGCAGCCGCGACCGTCTGTTCGTCAGTTCGTTTGTGAAGCAGTCGTTCGTTCGTTCGTGCAATCGT
CGGGGTTGTCTGTTTGTTTGTTTGTTTATTTGTCATCGCCTTCACCAGATGTTCTTAACAATTCTTTTTACATAACTTGCGCAACAATCATTTCAATTTTTGAGCTCGCA
TCGATTAAGGTGCTCGAAGTGCATGCAAACCCTTTTTCTCTTTTTTTTCTTCTTCTTTCGCGTGAGACCGCCGCCCGTACTTCACAGCTGAGTCGCTCGCCGATCAGTCA
CGCCGTCGGCCCCGCCCGTCCGCGCGCCGATCAGTCTCATCGTCCGCGCCGATCAGTCTCATCGTCCGCGCCGATCAGTCTCGCCGCCGCGCGCTCCGCCCGTCGTTTTC
GTCCAGTCGAAGCTCGCCGCCGCACGCTCCGCCAGTCCGCGTCAGACCACGCTGTTCCAGCCGCCGCGCGCTCAGTCGAACCTCGCCGCCGTCTACGCGAGCCGCCGCGC
GCTCCGCCAGTCTGCGCTGCCGCGCCGTTTCGCCAGCCGCCGCTCCACCCAGATCGCGCCGCCGCCGCCAGTCACCGCGCGCTGCCATTCTGCCGCTGCCATCGTCGCAG
CGGGATTTGGGGTTTCGTTTGGGGTGTCGCCCAACACGGTAACCTTGGGTTGCAGGTTCGAAGAGCGGAAGACCTCTATCGAGTACGTCTTCCTCATTCATCCATGTCTC
ACGTCAACCATGACGAGGAAGGTGCCGTTGAGCTTCCTGCCGACAAGAAAGATGATGTGGTTGATGAGGACATGGAAGCCCTTCGGAGAGTCTATAGGCTTGTTGGAGTT
AATCCTGAGGATTGCATTAATCCTAGGTTGTCATCACCTGCTGCTGGAGATGCTAATCTTGGTTCTGATTCTGATGATGTTGATGATTTCGAACTTCTTCGAAATATTCA
GAGTCGGTTCTCGATTGTGGCCGATGAGCAACCGTTGAGTACTCTCCCACCAGTGTCCCCACACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATTCAGCGGCGCT
TTGCAGCGTATGAAAGCGATTACTTTGTAAATGTCATATTGTATTGGCTGTTTATGACGTCGTTCGTTTTGAAATTCCTTGGTTGTGATAATATTACTCCGTTTTCCAAT
GACTTTGGATTACTTGGGAGGAATTGTATGTTCCTTCCCCAACTTCTAAGAAAATGCAATACTAAGCCTTACAGGTGGTCAATATCTTCCATCTCTCCGAAACACAATCT
ATAG
Protein sequenceShow/hide protein sequence
MERMKKMKKMKQRKEMKKFWICEKEDEEGKEMLDRLLEVGFFEFKDHIKFFIGKFSNHSFVLFHYLARDSSYKLGRRLSQKVPQAFKDRPGRATNDNHDEEGAIELPAGK
KDDVVDEDVEALWRAYRLVGVNPEDCINPRLSSPAAGDANLGSDSDDLDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEEDEFEMLRLIYLLYLLLCVEANAILEGLR
LVKGLQIYRWSSRLSVRAATDRRFCSWPSVCATAAVHSCNSGRPLRSSVLFVQPRPSVRQFVCEAVVRSFVQSSGLSVCLFVYLSSPSPDVLNNSFYITCATIISIFELA
SIKVLEVHANPFSLFFLLLSRETAARTSQLSRSPISHAVGPARPRADQSHRPRRSVSSSAPISLAAARSARRFRPVEARRRTLRQSASDHAVPAAARSVEPRRRLREPPR
APPVCAAAPFRQPPLHPDRAAAASHRALPFCRCHRRSGIWGFVWGVAQHGNLGLQVRRAEDLYRVRLPHSSMSHVNHDEEGAVELPADKKDDVVDEDMEALRRVYRLVGV
NPEDCINPRLSSPAAGDANLGSDSDDVDDFELLRNIQSRFSIVADEQPLSTLPPVSPHEEEDEFEMLRSIQRRFAAYESDYFVNVILYWLFMTSFVLKFLGCDNITPFSN
DFGLLGRNCMFLPQLLRKCNTKPYRWSISSISPKHNL