; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036921 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036921
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCentromere protein S-like
Genome locationscaffold5:43989922..43992088
RNA-Seq ExpressionSpg036921
SyntenySpg036921
Gene Ontology termsGO:0000712 - resolution of meiotic recombination intermediates (biological process)
GO:0006312 - mitotic recombination (biological process)
GO:0007129 - synapsis (biological process)
GO:0031297 - replication fork processing (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0043240 - Fanconi anaemia nuclear complex (cellular component)
GO:0071821 - FANCM-MHF complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR009072 - Histone-fold
IPR029003 - CENP-S/Mhf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004154006.1 protein MHF1 homolog isoform X2 [Cucumis sativus]5.8e-4978.83Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        METGMEE+DS SE LRDRFRLS+ISIAEAEAN+SGMEISE VM CVA+LAFKYT         +QLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA 
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT  CNDLKTKEPQSE+KRKKAPKK+DRDRG VHI D
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

XP_011652290.1 protein MHF1 homolog isoform X1 [Cucumis sativus]2.0e-4979.56Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        METGMEE+DS SE LRDRFRLS+ISIAEAEAN+SGMEISE VM CVA+LAFKYT         EQLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA 
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT  CNDLKTKEPQSE+KRKKAPKK+DRDRG VHI D
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

XP_022137214.1 protein MHF1 homolog [Momordica charantia]9.9e-4979.56Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        METG EE+D+ +E L DRFRLSTISIAEAEA R+GMEISE VM CVAELAFKYT         EQLAKDLELF+QH GRK+VNTEDVILSAHRNEHLAAS
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT FCNDLKTKEPQ+E+KRKKA KKEDRDRGVVHI D
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

XP_038894698.1 protein MHF1 homolog isoform X1 [Benincasa hispida]2.2e-4879.56Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        M+TG+EE+ S SE LRDRFRLSTISIAEAEANRSGMEISE VM CVA+LAFKYT         EQLAKDLELF QH GRK+VNTEDVILSAHRNEHL+A 
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT  CNDLKTKEPQSE+KRKKAPKKEDRDRG VHI D
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

XP_038894699.1 protein MHF1 homolog isoform X2 [Benincasa hispida]6.4e-4878.83Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        M+TG+EE+ S SE LRDRFRLSTISIAEAEANRSGMEISE VM CVA+LAFKYT         +QLAKDLELF QH GRK+VNTEDVILSAHRNEHL+A 
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT  CNDLKTKEPQSE+KRKKAPKKEDRDRG VHI D
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

TrEMBL top hitse value%identityAlignment
A0A0A0LR97 Uncharacterized protein2.8e-4978.83Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        METGMEE+DS SE LRDRFRLS+ISIAEAEAN+SGMEISE VM CVA+LAFKYT         +QLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA 
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT  CNDLKTKEPQSE+KRKKAPKK+DRDRG VHI D
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

A0A1S3B159 centromere protein S isoform X19.0e-4877.37Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        MET MEE+DS SE LRDRFRLSTISIAEAEAN+SGMEISE VM CVA+LAFK+T         EQLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA 
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT  CNDLK KEPQSE+KRKKAPKK+DRDRG VHI +
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

A0A5A7UM53 Centromere protein S isoform X19.0e-4877.37Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        MET MEE+DS SE LRDRFRLSTISIAEAEAN+SGMEISE VM CVA+LAFK+T         EQLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA 
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT  CNDLK KEPQSE+KRKKAPKK+DRDRG VHI +
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

A0A5D3BHQ8 Centromere protein S isoform X22.6e-4776.64Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        MET MEE+DS SE LRDRFRLSTISIAEAEAN+SGMEISE VM CVA+LAFK+T         +QLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA 
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT  CNDLK KEPQSE+KRKKAPKK+DRDRG VHI +
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

A0A6J1C623 protein MHF1 homolog4.8e-4979.56Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        METG EE+D+ +E L DRFRLSTISIAEAEA R+GMEISE VM CVAELAFKYT         EQLAKDLELF+QH GRK+VNTEDVILSAHRNEHLAAS
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD
        LT FCNDLKTKEPQ+E+KRKKA KKEDRDRGVVHI D
Subjt:  LTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVVHITD

SwissProt top hitse value%identityAlignment
E1BSW7 Centromere protein S8.2e-0628.57Show/hide
Query:  GMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTF
        G E+ + + + LR     +T  + +  A   G+  S+  +A ++E+ F+           E  A+DLE+F++H  R T+ +EDV L A R+  L   +T 
Subjt:  GMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTF

Query:  FCNDLKTKE-PQSEKKRKKAPKKEDR
          ++L +    Q EKK+KK+   + R
Subjt:  FCNDLKTKE-PQSEKKRKKAPKKEDR

Q2TBR7 Centromere protein S1.7e-0632.12Show/hide
Query:  MEEED--------SMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEH
        MEEE+        S  + L+     +   + E  A+   M+ S+  +A ++E+ F            E  AKDLE+F++H  R T+NTEDV L A R+  
Subjt:  MEEED--------SMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEH

Query:  LAASLTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVV
        L   +T    D+   +   EKK KK  K ED +R  V
Subjt:  LAASLTFFCNDLKTKEPQSEKKRKKAPKKEDRDRGVV

Q6NRI8 Centromere protein S1.3e-0629.27Show/hide
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS
        M  G EE  S ++ L+        S+ +  A+   ++ S+  +A ++E+ F+           E  AKDLE+F++H  R T+N +DV L A R+  L A 
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAAS

Query:  LTFFCNDLKTKE-PQSEKKRKKA
        ++   +++      Q EKK+KK+
Subjt:  LTFFCNDLKTKE-PQSEKKRKKA

Q8N2Z9 Centromere protein S1.4e-0530.16Show/hide
Query:  ETGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASL
        ET  ++  S  + L+     +   + E  A    M+ S+  +A ++EL F+           E  AKDLE+F++H  R T+NTEDV L A R+  L   +
Subjt:  ETGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASL

Query:  TFFCNDLKTKEPQSEKKRKKAPKKED
        T    ++   +   E+K +K  K ED
Subjt:  TFFCNDLKTKEPQSEKKRKKAPKKED

Q9FI55 Protein MHF1 homolog3.5e-3364.23Show/hide
Query:  EEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCN
        EE SM + +RDRFRLS ISIAEAEA ++GMEI   V+ACVA+LAFKY         AE +AKDLELF+ H GRK VN +DV+LSAHRN++LAASL   CN
Subjt:  EEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCN

Query:  DLKTKEPQSEKKRKK-APKKEDR
        +LK KEPQSE+KRKK + KKED+
Subjt:  DLKTKEPQSEKKRKK-APKKEDR

Arabidopsis top hitse value%identityAlignment
AT5G50930.1 Histone superfamily protein2.5e-3464.23Show/hide
Query:  EEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCN
        EE SM + +RDRFRLS ISIAEAEA ++GMEI   V+ACVA+LAFKY         AE +AKDLELF+ H GRK VN +DV+LSAHRN++LAASL   CN
Subjt:  EEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCN

Query:  DLKTKEPQSEKKRKK-APKKEDR
        +LK KEPQSE+KRKK + KKED+
Subjt:  DLKTKEPQSEKKRKK-APKKEDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACTGGGATGGAAGAAGAAGACTCCATGTCCGAACACTTGAGGGACCGATTCCGTCTCTCCACCATTTCCATCGCTGAAGCTGAAGCGAATAGAAGCGGCATGGA
AATCTCTGAAACTGTGATGGCTTGTGTCGCTGAGTTAGCGTTCAAATATACAAATAACGGACTGGTAACTTGTGTAGCAGAACAGTTGGCAAAAGACCTTGAGTTATTTT
CTCAGCATGGTGGTCGGAAAACTGTGAATACGGAAGACGTCATACTATCAGCCCATAGAAACGAGCATCTGGCTGCCTCGTTAACATTCTTCTGCAATGATCTAAAGACA
AAAGAACCTCAAAGTGAGAAAAAGCGGAAAAAGGCACCGAAAAAGGAAGATAGAGATAGAGGTGTAGTGCATATCACCGATGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAACTGGGATGGAAGAAGAAGACTCCATGTCCGAACACTTGAGGGACCGATTCCGTCTCTCCACCATTTCCATCGCTGAAGCTGAAGCGAATAGAAGCGGCATGGA
AATCTCTGAAACTGTGATGGCTTGTGTCGCTGAGTTAGCGTTCAAATATACAAATAACGGACTGGTAACTTGTGTAGCAGAACAGTTGGCAAAAGACCTTGAGTTATTTT
CTCAGCATGGTGGTCGGAAAACTGTGAATACGGAAGACGTCATACTATCAGCCCATAGAAACGAGCATCTGGCTGCCTCGTTAACATTCTTCTGCAATGATCTAAAGACA
AAAGAACCTCAAAGTGAGAAAAAGCGGAAAAAGGCACCGAAAAAGGAAGATAGAGATAGAGGTGTAGTGCATATCACCGATGTCTAG
Protein sequenceShow/hide protein sequence
METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTNNGLVTCVAEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLKT
KEPQSEKKRKKAPKKEDRDRGVVHITDV