; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008687 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008687
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionH15 domain-containing protein
Genome locationchr9:27951232..27951936
RNA-Seq ExpressionLag0008687
SyntenyLag0008687
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR005818 - Linker histone H1/H5, domain H15
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579248.1 hypothetical protein SDJN03_23696, partial [Cucurbita argyrosperma subsp. sororia]1.0e-5062.64Show/hide
Query:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE
        M+NS+P LSTIP P EN P  S  TPHSDHR+SL+  +FRDALFSA AAKYATNGS HSLPFPSEQ K+ IE  LHE FPSF TPTHLPYASMIQ+AI+E
Subjt:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE

Query:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE
        +GEEDGLSEE ISEFIVNE+ D PWAH AFL RHLGK  E  +           + EG++ K++K+   +A   RRR  E +
Subjt:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE

KAG7016763.1 hypothetical protein SDJN02_21873, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-5062.64Show/hide
Query:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE
        M+NS+P LSTIP P EN P  S  TPHSDHR+SL+  +FRDALFSA AAKYATNGS HSLPFPSEQ K+ IE  LHE FPSF TPTHLPYASMIQ+AI+E
Subjt:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE

Query:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE
        +GEEDGLSEE ISEFIVNE+ D PWAH AFL RHLGK  E  +           + EG++ K++K+   +A   RRR  E +
Subjt:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE

XP_022938936.1 uncharacterized protein LOC111444998 isoform X1 [Cucurbita moschata]2.3e-5062.09Show/hide
Query:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE
        M+NS+P LSTIP P EN P  S  TPHSDHR+SL+  +FRDALFSA AAKYATNGS HSLPFPSEQ K+ IE  LH+ FPSF TPTHLPYASMIQ+AI+E
Subjt:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE

Query:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE
        +GEEDGLSEE ISEFIVNE+ D PWAH AFL RHLGK  E  +           + EG++ K++K+   +A   RRR  E +
Subjt:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE

XP_022993719.1 uncharacterized protein LOC111489634 isoform X1 [Cucurbita maxima]4.6e-5163.19Show/hide
Query:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE
        M+NS+P LSTIP P EN P  S  TPHSDHR+SL+  +FRDALFSA AAKYATNGS HSLPFPSEQ K+ IE  LHE FPSF TPTHLPYASMIQ+AI+E
Subjt:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE

Query:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE
        VGEEDGLSEE ISEFIVNE+ D PWAH AFL RHLGK  E  +           + EG++ K++K+   +A   RRR  E +
Subjt:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE

XP_023549578.1 uncharacterized protein LOC111808038 isoform X1 [Cucurbita pepo subsp. pepo]1.3e-5062.64Show/hide
Query:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE
        M+NS+P LSTIP P EN P  S  TPHSDHR+SL+  +FRDALFSA AAKYATNGS HSLPFPSEQ K+ IE  LH+ FPSF TPTHLPYASMIQ+AI+E
Subjt:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE

Query:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE
        VGEEDGLSEE ISEFIVNE+ D PWAH AFL RHLGK  E  +           + EG++ K++K+   +A   RRR  E +
Subjt:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE

TrEMBL top hitse value%identityAlignment
A0A0A0KPL0 H15 domain-containing protein2.4e-4559.78Show/hide
Query:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYA-TNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAIS
        M+ S+ QLS+I  P +NL +SS   PHSDHRHSL+  +FRDALFSAVAAKY+  NG+ HSLPF S+Q K+ I+ R+HE FPSF TPTHLPYASMI RAI+
Subjt:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYA-TNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAIS

Query:  EVGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWER---------RKRKEGEDDKKRKKSPTTASGGRRRYEE
        EVGEEDGLSEESIS FI+NE+ D PWAH+A+L RHLGK  E          R   + ED   ++K     SGGR RY E
Subjt:  EVGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWER---------RKRKEGEDDKKRKKSPTTASGGRRRYEE

A0A5D3E3L6 Transcription regulatory protein SNF2-like isoform X32.0e-4761.11Show/hide
Query:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE
        M+ S  QLS+I  P ENL + S   PHSDHRHSL+  + RDALFSAVAAKY+TNG+ HSLPF S+Q K+ I+ RL E FPSF TPTHLPYASMIQRAI+E
Subjt:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE

Query:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWER---------RKRKEGEDDKKRKKSPTTASGGRRRYEEEE
        VGEEDGLSEESISEFIVNE++D PWAH+A+L RHLGK  E          R   + ED   ++K     +GGR RY E E
Subjt:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWER---------RKRKEGEDDKKRKKSPTTASGGRRRYEEEE

A0A6J1DFJ3 uncharacterized protein LOC1110200325.5e-4264.05Show/hide
Query:  SDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISEVGEEDGLSEESISEFIVNEHDDFPWAH
        SD RHSLV  K RD LFSA+  KYAT+GS+ SLPFPSE+LK+++ERRLHE  PSFHTPTHLPYASMIQRAI+EVGEEDGLSEESIS+FIVNE++D PWAH
Subjt:  SDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISEVGEEDGLSEESISEFIVNEHDDFPWAH

Query:  AAFLHRHLGKFWER---------RKRKEGEDDKKRKKSPTTASGGRRRYEEEE
        AA L RHLGK  E          R   E ED   ++K     S GR RY   E
Subjt:  AAFLHRHLGKFWER---------RKRKEGEDDKKRKKSPTTASGGRRRYEEEE

A0A6J1FEI4 uncharacterized protein LOC111444998 isoform X11.1e-5062.09Show/hide
Query:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE
        M+NS+P LSTIP P EN P  S  TPHSDHR+SL+  +FRDALFSA AAKYATNGS HSLPFPSEQ K+ IE  LH+ FPSF TPTHLPYASMIQ+AI+E
Subjt:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE

Query:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE
        +GEEDGLSEE ISEFIVNE+ D PWAH AFL RHLGK  E  +           + EG++ K++K+   +A   RRR  E +
Subjt:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE

A0A6J1K0W5 uncharacterized protein LOC111489634 isoform X12.2e-5163.19Show/hide
Query:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE
        M+NS+P LSTIP P EN P  S  TPHSDHR+SL+  +FRDALFSA AAKYATNGS HSLPFPSEQ K+ IE  LHE FPSF TPTHLPYASMIQ+AI+E
Subjt:  MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISE

Query:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE
        VGEEDGLSEE ISEFIVNE+ D PWAH AFL RHLGK  E  +           + EG++ K++K+   +A   RRR  E +
Subjt:  VGEEDGLSEESISEFIVNEHDDFPWAHAAFLHRHLGKFWERRK-----------RKEGEDDKKRKKSPTTASGGRRRYEEEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAACTCCAAACCCCAACTCTCTACCATTCCACAGCCGCTAGAAAATCTTCCAACTTCTTCTCCTCCGACGCCACATTCCGATCACCGACATTCACTTGTAGTCGA
AAAGTTCAGAGATGCCCTCTTCTCCGCCGTCGCCGCCAAATATGCGACCAACGGCAGCGACCACTCCTTGCCTTTCCCCTCCGAGCAGCTCAAGGCAAATATCGAGCGTC
GCCTTCACGAGTGTTTCCCCTCCTTCCACACTCCAACTCATCTTCCCTATGCCTCGATGATACAAAGGGCAATATCTGAAGTGGGAGAGGAAGATGGGTTGAGTGAGGAG
TCTATATCGGAGTTTATTGTGAATGAACATGATGATTTTCCATGGGCACACGCTGCTTTTTTGCATCGTCATTTGGGGAAGTTCTGGGAAAGAAGGAAAAGAAAAGAAGG
GGAAGATGATAAAAAAAGAAAAAAGTCGCCAACGACAGCGAGCGGTGGCCGCCGGAGATATGAAGAAGAAGAAAAAGGAAGAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATAACTCCAAACCCCAACTCTCTACCATTCCACAGCCGCTAGAAAATCTTCCAACTTCTTCTCCTCCGACGCCACATTCCGATCACCGACATTCACTTGTAGTCGA
AAAGTTCAGAGATGCCCTCTTCTCCGCCGTCGCCGCCAAATATGCGACCAACGGCAGCGACCACTCCTTGCCTTTCCCCTCCGAGCAGCTCAAGGCAAATATCGAGCGTC
GCCTTCACGAGTGTTTCCCCTCCTTCCACACTCCAACTCATCTTCCCTATGCCTCGATGATACAAAGGGCAATATCTGAAGTGGGAGAGGAAGATGGGTTGAGTGAGGAG
TCTATATCGGAGTTTATTGTGAATGAACATGATGATTTTCCATGGGCACACGCTGCTTTTTTGCATCGTCATTTGGGGAAGTTCTGGGAAAGAAGGAAAAGAAAAGAAGG
GGAAGATGATAAAAAAAGAAAAAAGTCGCCAACGACAGCGAGCGGTGGCCGCCGGAGATATGAAGAAGAAGAAAAAGGAAGAAGATGA
Protein sequenceShow/hide protein sequence
MDNSKPQLSTIPQPLENLPTSSPPTPHSDHRHSLVVEKFRDALFSAVAAKYATNGSDHSLPFPSEQLKANIERRLHECFPSFHTPTHLPYASMIQRAISEVGEEDGLSEE
SISEFIVNEHDDFPWAHAAFLHRHLGKFWERRKRKEGEDDKKRKKSPTTASGGRRRYEEEEKGRR