; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G191170 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G191170
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationCla97Chr10:10344952..10348782
RNA-Seq ExpressionCla97C10G191170
SyntenyCla97C10G191170
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060521.1 snRNA-activating protein complex subunit 4 [Cucumis melo var. makuwa]3.2e-7278.89Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS  NH DE DVE    KED VVDEDME L+RAY+ VGVNPEDYI+PR SS  AGDA+P SDS+DVDDFELLR+IQNRFS VADEQPLSTL P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK
        DEFEMLRSIQRRFAAYESDTLSNKP+Q           S D +VESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK

XP_008452207.1 PREDICTED: snRNA-activating protein complex subunit 4 [Cucumis melo]8.5e-7379.4Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS  NH DE DVE    KED VVDEDME L+RAY+ VGVNPEDYI+PR SS  AGDA+P SDSDDVDDFELLR+IQNRFS VADEQPLSTL P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK
        DEFEMLRSIQRRFAAYESDTLSNKP+Q           S D +VESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK

XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]4.5e-7480.4Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS RNH DE DVE P  KED VVDEDME L+RAY+  GVNPEDYINPRLSSP AGDA+P SDSDDVDDFELLR+IQNRFS +ADEQP ST  P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK
        DEFEMLRSIQRRFAAYESDTLSNKPNQ           S D +VESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]1.6e-7680.9Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS  NH DEGDVELP +KEDDVVDEDME L+RAY+ VGVNPEDYINPRLSSP  GDAN   DSDD DDFELLRNIQNRFS V DEQPLSTLPP+SLDEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQSCD-----------PSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK
        DEFEMLRSIQRRFAAYESD LSNKPN+S D            + ESQT+SKRPSM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQSCD-----------PSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]1.6e-7680.9Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS  NH DEGDVELP +KEDDVVDEDME L+RAY+ VGVNPEDYINPRLSSP  GDAN   DSDD DDFELLRNIQNRFS V DEQPLSTLPP+SLDEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQSCD-----------PSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK
        DEFEMLRSIQRRFAAYESD LSNKPN+S D            + ESQT+SKRPSM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQSCD-----------PSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein2.2e-7480.4Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS RNH DE DVE P  KED VVDEDME L+RAY+  GVNPEDYINPRLSSP AGDA+P SDSDDVDDFELLR+IQNRFS +ADEQP ST  P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK
        DEFEMLRSIQRRFAAYESDTLSNKPNQ           S D +VESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK

A0A1S3BUG0 snRNA-activating protein complex subunit 44.1e-7379.4Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS  NH DE DVE    KED VVDEDME L+RAY+ VGVNPEDYI+PR SS  AGDA+P SDSDDVDDFELLR+IQNRFS VADEQPLSTL P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK
        DEFEMLRSIQRRFAAYESDTLSNKP+Q           S D +VESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK

A0A5D3BLR5 snRNA-activating protein complex subunit 41.6e-7278.89Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS  NH DE DVE    KED VVDEDME L+RAY+ VGVNPEDYI+PR SS  AGDA+P SDS+DVDDFELLR+IQNRFS VADEQPLSTL P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK
        DEFEMLRSIQRRFAAYESDTLSNKP+Q           S D +VESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQ-----------SCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENK

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X22.3e-6869.91Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDY+NP+LS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSCD-----------PSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QSCD            +VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSCD-----------PSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  N-KAQKTFQNSQRFPG
        N K +K F+  + F G
Subjt:  N-KAQKTFQNSQRFPG

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X12.3e-6869.91Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDY+NP+LS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSCD-----------PSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QSCD            +VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSCD-----------PSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  N-KAQKTFQNSQRFPG
        N K +K F+  + F G
Subjt:  N-KAQKTFQNSQRFPG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G18100.1 myb domain protein 4r11.3e-1532.72Show/hide
Query:  GDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL--DEEEDEFEMLR
        G  E+P   E+   ++D E LR     +  + +     R S P  G  +  SDS+  DDFE++R+I+++ S   D     +LPP+ L  DEE+D FE LR
Subjt:  GDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL--DEEEDEFEMLR

Query:  SIQRRFAAYESDTLSNK------------PNQSCDPSVE----SQTASKRP-----------------SMLAFEKGSLPKAALAFIDAIKKNRSQQKFIR
        +I+RRF+AY++     K             N   +PS E    S T    P                   +     S P+AA AF+DAI++NR+ QKF+R
Subjt:  SIQRRFAAYESDTLSNK------------PNQSCDPSVE----SQTASKRP-----------------SMLAFEKGSLPKAALAFIDAIKKNRSQQKFIR

Query:  SKMIHLEARIEENKAQK
         K+  +EA IE+N+  K
Subjt:  SKMIHLEARIEENKAQK

AT3G18100.2 myb domain protein 4r15.6e-0653.66Show/hide
Query:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKAQK
        S P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+  K
Subjt:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKAQK

AT3G18100.3 myb domain protein 4r18.6e-0734.78Show/hide
Query:  DDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL-----DEE
        DD+ D       E+D + ED+E LRRA     VN + + +    +     G     SDS++ DDFE+LR I+++ +   D    S+ PPM L      E 
Subjt:  DDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL-----DEE

Query:  EDEFEMLRSIQRRFA
        ED+FEM+RSI+ + +
Subjt:  EDEFEMLRSIQRRFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCACCGCAACCATGACGATGAAGGTGACGTTGAGCTTCCTGTCAGCAAGGAAGATGATGTGGTTGATGAGGACATGGAAGCCCTTCGGAGAGCCTATAAGCATGT
TGGAGTTAATCCTGAGGATTACATTAATCCTAGGTTGTCATCACCTGTTGCCGGAGATGCTAATCCTAGTTCTGATTCTGATGATGTTGATGATTTTGAACTTCTTCGAA
ATATTCAGAACCGGTTCTCATGTGTGGCTGATGAGCAGCCGTTGAGTACTCTCCCACCAATGTCCCTAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATTCAG
CGGCGCTTTGCAGCGTACGAAAGTGATACTTTGAGCAATAAACCCAATCAGTCGTGTGACCCATCTGTTGAGAGTCAGACAGCCTCAAAAAGGCCATCCATGCTAGCCTT
TGAAAAGGGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATAAAGAAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATCTTGAAGCTAGAA
TTGAGGAGAACAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAGGGTTCATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCACCGCAACCATGACGATGAAGGTGACGTTGAGCTTCCTGTCAGCAAGGAAGATGATGTGGTTGATGAGGACATGGAAGCCCTTCGGAGAGCCTATAAGCATGT
TGGAGTTAATCCTGAGGATTACATTAATCCTAGGTTGTCATCACCTGTTGCCGGAGATGCTAATCCTAGTTCTGATTCTGATGATGTTGATGATTTTGAACTTCTTCGAA
ATATTCAGAACCGGTTCTCATGTGTGGCTGATGAGCAGCCGTTGAGTACTCTCCCACCAATGTCCCTAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATTCAG
CGGCGCTTTGCAGCGTACGAAAGTGATACTTTGAGCAATAAACCCAATCAGTCGTGTGACCCATCTGTTGAGAGTCAGACAGCCTCAAAAAGGCCATCCATGCTAGCCTT
TGAAAAGGGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATAAAGAAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATCTTGAAGCTAGAA
TTGAGGAGAACAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAGGGTTCATGTAA
Protein sequenceShow/hide protein sequence
MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEEDEFEMLRSIQ
RRFAAYESDTLSNKPNQSCDPSVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKAQKTFQNSQRFPGFM