CuGenDBv2

HG10020993 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020993
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Centromere protein S-like
Genome location: Chr05:4347001..4349048
RNA-Seq Expression: HG10020993
Synteny: HG10020993
Gene Ontology terms:
GO:0000712 - resolution of meiotic recombination intermediates (biological process)
GO:0006312 - mitotic recombination (biological process)
GO:0007129 - synapsis (biological process)
GO:0031297 - replication fork processing (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0043240 - Fanconi anaemia nuclear complex (cellular component)
GO:0071821 - FANCM-MHF complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domains:
IPR009072 - Histone-fold
IPR029003 - CENP-S/Mhf1


Homology
GenBank top hits (e value, % identity, alignment)
XP_004154006.1 protein MHF1 homolog isoform X2 [Cucumis sativus] (e value: 2.2e-48; % identity: 91.3)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVMTCVADLAFKYTKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQGERARKGTKK
        TKEPQ ER RK   K
Subjt:  TKEPQGERARKGTKK

XP_008440159.1 PREDICTED: centromere protein S isoform X2 [Cucumis melo] (e value: 7.1e-47; % identity: 89.57)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVMTCVADLAFK+TKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQGERARKGTKK
         KEPQ ER RK   K
Subjt:  TKEPQGERARKGTKK

XP_011652290.1 protein MHF1 homolog isoform X1 [Cucumis sativus] (e value: 1.6e-46; % identity: 89.66)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL
        METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVMTCVADLAFKY T+QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDL
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL

Query:  KTKEPQGERARKGTKK
        KTKEPQ ER RK   K
Subjt:  KTKEPQGERARKGTKK

XP_038894698.1 protein MHF1 homolog isoform X1 [Benincasa hispida] (e value: 4.6e-46; % identity: 89.66)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL
        M+TG+EED SASELLRDRFRLSTISIAEAEA RSGMEISEPVMTCVADLAFKY T+QLAKDLELFVQHAGRKSVNTEDVILSAHRNEHL+AILTSICNDL
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL

Query:  KTKEPQGERARKGTKK
        KTKEPQ ER RK   K
Subjt:  KTKEPQGERARKGTKK

XP_038894699.1 protein MHF1 homolog isoform X2 [Benincasa hispida] (e value: 6.4e-48; % identity: 91.3)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        M+TG+EED SASELLRDRFRLSTISIAEAEA RSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHL+AILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQGERARKGTKK
        TKEPQ ER RK   K
Subjt:  TKEPQGERARKGTKK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LR97 Uncharacterized protein (e value: 1.1e-48; % identity: 91.3)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVMTCVADLAFKYTKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQGERARKGTKK
        TKEPQ ER RK   K
Subjt:  TKEPQGERARKGTKK

A0A1S3B159 centromere protein S isoform X1 (e value: 2.5e-45; % identity: 87.93)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVMTCVADLAFK+ T+QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDL
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL

Query:  KTKEPQGERARKGTKK
        K KEPQ ER RK   K
Subjt:  KTKEPQGERARKGTKK

A0A1S3B173 centromere protein S isoform X2 (e value: 3.4e-47; % identity: 89.57)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVMTCVADLAFK+TKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQGERARKGTKK
         KEPQ ER RK   K
Subjt:  TKEPQGERARKGTKK

A0A5A7UM53 Centromere protein S isoform X1 (e value: 2.5e-45; % identity: 87.93)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVMTCVADLAFK+ T+QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDL
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL

Query:  KTKEPQGERARKGTKK
        K KEPQ ER RK   K
Subjt:  KTKEPQGERARKGTKK

A0A5D3BHQ8 Centromere protein S isoform X2 (e value: 3.4e-47; % identity: 89.57)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVMTCVADLAFK+TKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQGERARKGTKK
         KEPQ ER RK   K
Subjt:  TKEPQGERARKGTKK

SwissProt top hits (e value, % identity, alignment)
O74807 Inner kinetochore subunit mhf1 (e value: 2.4e-05; % identity: 28.97)
Query:  MEEDDSASELLRDRFRLSTISIAE-AEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICND-LKTK
        MEE+   +E+      +   + +E  E++   + + E     V ++ ++  + LAKD+E F +HAGRK+V  +DV+L   RNE L  I+ +   + +K+K
Subjt:  MEEDDSASELLRDRFRLSTISIAE-AEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICND-LKTK

Query:  EPQGERA
        + + E +
Subjt:  EPQGERA

Q2TBR7 Centromere protein S (e value: 6.3e-06; % identity: 33.78)
Query:  IAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL
        + E  A    M+ S+  +  ++++ F   +  AKDLE+F +HA R ++NTEDV L A R+  L   +T    D+
Subjt:  IAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL

Q6NRI8 Centromere protein S (e value: 1.1e-05; % identity: 27.05)
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTS-----I
        M  G EE  S ++ L+        S+ +  A    ++ S+  +  ++++ F+  +  AKDLE+F +HA R ++N +DV L A R+  L A ++       
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTS-----I

Query:  CNDLKTKEPQGERARKGTKKGR
         N L+ KE + +++  G    R
Subjt:  CNDLKTKEPQGERARKGTKKGR

Q8N2Z9 Centromere protein S (e value: 3.7e-06; % identity: 28.1)
Query:  ETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICND---
        ET  ++  S  + L+     +   + E  A    M+ S+  +  +++L F+  +  AKDLE+F +HA R ++NTEDV L A R+  L   +T    +   
Subjt:  ETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICND---

Query:  --LKTKEPQGERARKGTKKGR
          L+ K  + +++  G+K  R
Subjt:  --LKTKEPQGERARKGTKKGR

Q9FI55 Protein MHF1 homolog (e value: 1.0e-35; % identity: 58.82)
Query:  PNAVNCESRSLCSAMETGME-----------EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNT
        P     +  S C AM+ G E           E+ S  +L+RDRFRLS ISIAEAEAK++GMEI  PV+ CVADLAFKY + +AKDLELF  HAGRK VN 
Subjt:  PNAVNCESRSLCSAMETGME-----------EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNT

Query:  EDVILSAHRNEHLAAILTSICNDLKTKEPQGERARK
        +DV+LSAHRN++LAA L S+CN+LK KEPQ ER RK
Subjt:  EDVILSAHRNEHLAAILTSICNDLKTKEPQGERARK

Arabidopsis top hits (e value, % identity, alignment)
AT5G50930.1 Histone superfamily protein (e value: 7.1e-37; % identity: 58.82)
Query:  PNAVNCESRSLCSAMETGME-----------EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNT
        P     +  S C AM+ G E           E+ S  +L+RDRFRLS ISIAEAEAK++GMEI  PV+ CVADLAFKY + +AKDLELF  HAGRK VN 
Subjt:  PNAVNCESRSLCSAMETGME-----------EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNT

Query:  EDVILSAHRNEHLAAILTSICNDLKTKEPQGERARK
        +DV+LSAHRN++LAA L S+CN+LK KEPQ ER RK
Subjt:  EDVILSAHRNEHLAAILTSICNDLKTKEPQGERARK


Sequences
CDS sequence
ATGGAAGCAGTGACAGTGCCGCCAAATGCAGTAAATTGTGAATCCAGAAGCCTCTGCTCTGCAATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAG
GGACAGATTCCGACTCTCCACCATTTCTATCGCTGAAGCTGAAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGACTTGTGTCGCTGATTTAGCCTTCAAAT
ATACAAAACAGTTGGCGAAGGACCTTGAGTTATTTGTTCAGCATGCTGGTCGGAAATCTGTGAATACAGAAGATGTAATACTATCAGCCCATAGAAATGAGCATTTGGCT
GCCATATTAACATCCATCTGCAATGATCTAAAGACTAAAGAACCTCAAGGTGAGAGAGCGAGAAAAGGCACCAAAAAAGGAAGATAG
mRNA sequence
ATGGAAGCAGTGACAGTGCCGCCAAATGCAGTAAATTGTGAATCCAGAAGCCTCTGCTCTGCAATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAG
GGACAGATTCCGACTCTCCACCATTTCTATCGCTGAAGCTGAAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGACTTGTGTCGCTGATTTAGCCTTCAAAT
ATACAAAACAGTTGGCGAAGGACCTTGAGTTATTTGTTCAGCATGCTGGTCGGAAATCTGTGAATACAGAAGATGTAATACTATCAGCCCATAGAAATGAGCATTTGGCT
GCCATATTAACATCCATCTGCAATGATCTAAAGACTAAAGAACCTCAAGGTGAGAGAGCGAGAAAAGGCACCAAAAAAGGAAGATAG
Protein sequence
MEAVTVPPNAVNCESRSLCSAMETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLA
AILTSICNDLKTKEPQGERARKGTKKGR
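
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1). The `translate` function and the variable names are illustrative and are not part of CuGenDB; only the first and last few codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS listed above.
cds_start = "ATGGAAGCAGTGACAGTGCCGCCA"
cds_end = "AAAGGAAGATAG"

print(translate(cds_start))  # MEAVTVPP
print(translate(cds_end))    # KGR
```

Passing the full CDS (with its line breaks joined) to `translate` allows the same comparison against the complete protein sequence.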