CuGenDBv2

Lcy06g005200 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy06g005200
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Centromere protein S-like
Genome location: Chr06:4813233..4815697
RNA-Seq Expression: Lcy06g005200
Synteny: Lcy06g005200
Gene Ontology terms:
  GO:0000712 - resolution of meiotic recombination intermediates (biological process)
  GO:0006312 - mitotic recombination (biological process)
  GO:0007129 - synapsis (biological process)
  GO:0031297 - replication fork processing (biological process)
  GO:0036297 - interstrand cross-link repair (biological process)
  GO:0043240 - Fanconi anaemia nuclear complex (cellular component)
  GO:0071821 - FANCM-MHF complex (cellular component)
  GO:0003682 - chromatin binding (molecular function)
  GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domains:
  IPR009072 - Histone-fold
  IPR029003 - CENP-S/Mhf1
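
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes both coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr06:4813233..4815697' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr06:4813233..4815697")

# Assumption: coordinates are 1-based and inclusive, so the locus
# covers end - start + 1 bases.
span = end - start + 1
print(chrom, span)  # Chr06 2465
```

Under that assumption the locus spans 2465 bp on Chr06, which is longer than the mRNA listed under Sequences.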


Homology
GenBank top hits (e value, % identity, alignment)
XP_004154006.1 protein MHF1 homolog isoform X2 [Cucumis sativus]; e value 1.4e-49; % identity 83.72
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        METGMEE+DS SE LRDRFRLS+ISIAEAEAN+SGMEISE VM CVA+LAFKY T+QLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA LT  CNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        KTKEPQSE+KRKKAPKK+DRDRG VHI D
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
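
The reported % identity can be cross-checked against the alignment midline. The sketch below assumes the usual BLAST convention that a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch; the function name `percent_identity` is hypothetical. It uses the query and midline rows of the XP_004154006.1 alignment above.

```python
def percent_identity(query_segments, midline_segments):
    """Percent identity = identical columns / aligned columns * 100."""
    # Every query character (residue or gap) is one aligned column.
    aligned = sum(len(seg) for seg in query_segments)
    # Only letters in the midline count as identities; '+' and ' ' do not.
    identical = sum(ch.isalpha() for seg in midline_segments for ch in seg)
    return 100.0 * identical / aligned

query = [
    "METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL",
    "KTKEPQSEKKRKKAPKKEDRDRGVVHITD",
]
midline = [
    "METGMEE+DS SE LRDRFRLS+ISIAEAEAN+SGMEISE VM CVA+LAFKY T+QLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA LT  CNDL",
    "KTKEPQSE+KRKKAPKK+DRDRG VHI D",
]

pct = percent_identity(query, midline)
print(round(pct, 2))  # 83.72
```

Here 108 of 129 aligned columns are identical, which reproduces the listed value of 83.72.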

XP_008440142.1 PREDICTED: centromere protein S isoform X1 [Cucumis melo]; e value 4.9e-50; % identity 82.95
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        MET MEE+DS SE LRDRFRLSTISIAEAEAN+SGMEISE VM CVA+LAFK+TTEQLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA LT  CNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        K KEPQSE+KRKKAPKK+DRDRG VHI +
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD

XP_011652290.1 protein MHF1 homolog isoform X1 [Cucumis sativus]; e value 5.3e-52; % identity 85.27
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        METGMEE+DS SE LRDRFRLS+ISIAEAEAN+SGMEISE VM CVA+LAFKYTTEQLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA LT  CNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        KTKEPQSE+KRKKAPKK+DRDRG VHI D
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD

XP_022137214.1 protein MHF1 homolog [Momordica charantia]; e value 2.6e-51; % identity 85.27
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        METG EE+D+ +E L DRFRLSTISIAEAEA R+GMEISE VM CVAELAFKYTTEQLAKDLELF+QH GRK+VNTEDVILSAHRNEHLAASLT FCNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        KTKEPQ+E+KRKKA KKEDRDRGVVHI D
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD

XP_038894698.1 protein MHF1 homolog isoform X1 [Benincasa hispida]; e value 5.8e-51; % identity 85.27
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        M+TG+EE+ S SE LRDRFRLSTISIAEAEANRSGMEISE VM CVA+LAFKYTTEQLAKDLELF QH GRK+VNTEDVILSAHRNEHL+A LT  CNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        KTKEPQSE+KRKKAPKKEDRDRG VHI D
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LR97 Uncharacterized protein; e value 7.0e-50; % identity 83.72
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        METGMEE+DS SE LRDRFRLS+ISIAEAEAN+SGMEISE VM CVA+LAFKY T+QLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA LT  CNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        KTKEPQSE+KRKKAPKK+DRDRG VHI D
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD

A0A1S3B159 centromere protein S isoform X1; e value 2.4e-50; % identity 82.95
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        MET MEE+DS SE LRDRFRLSTISIAEAEAN+SGMEISE VM CVA+LAFK+TTEQLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA LT  CNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        K KEPQSE+KRKKAPKK+DRDRG VHI +
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD

A0A5A7UM53 Centromere protein S isoform X1; e value 2.4e-50; % identity 82.95
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        MET MEE+DS SE LRDRFRLSTISIAEAEAN+SGMEISE VM CVA+LAFK+TTEQLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA LT  CNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        K KEPQSE+KRKKAPKK+DRDRG VHI +
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD

A0A5D3BHQ8 Centromere protein S isoform X2; e value 6.5e-48; % identity 81.4
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        MET MEE+DS SE LRDRFRLSTISIAEAEAN+SGMEISE VM CVA+LAFK+ T+QLAKDLELF+QH GRK+VNTEDVIL+AHRNEHLAA LT  CNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        K KEPQSE+KRKKAPKK+DRDRG VHI +
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD

A0A6J1C623 protein MHF1 homolog; e value 1.3e-51; % identity 85.27
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        METG EE+D+ +E L DRFRLSTISIAEAEA R+GMEISE VM CVAELAFKYTTEQLAKDLELF+QH GRK+VNTEDVILSAHRNEHLAASLT FCNDL
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD
        KTKEPQ+E+KRKKA KKEDRDRGVVHI D
Subjt:  KTKEPQSEKKRKKAPKKEDRDRGVVHITD

SwissProt top hits (e value, % identity, alignment)
E1BSW7 Centromere protein S; e value 2.7e-06; % identity 30.51
Query:  GMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLKTK
        G E+ + + + LR     +T  + +  A   G+  S+  +A ++E+ F+   E  A+DLE+F++H  R T+ +EDV L A R+  L   +T   ++L + 
Subjt:  GMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLKTK

Query:  E-PQSEKKRKKAPKKEDR
           Q EKK+KK+   + R
Subjt:  E-PQSEKKRKKAPKKEDR

Q2TBR7 Centromere protein S; e value 5.4e-07; % identity 34.11
Query:  MEEED--------SMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFF
        MEEE+        S  + L+     +   + E  A+   M+ S+  +A ++E+ F    E  AKDLE+F++H  R T+NTEDV L A R+  L   +T  
Subjt:  MEEED--------SMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFF

Query:  CNDLKTKEPQSEKKRKKAPKKEDRDRGVV
          D+   +   EKK KK  K ED +R  V
Subjt:  CNDLKTKEPQSEKKRKKAPKKEDRDRGVV

Q6NRI8 Centromere protein S; e value 3.1e-07; % identity 31.3
Query:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL
        M  G EE  S ++ L+        S+ +  A+   ++ S+  +A ++E+ F+   E  AKDLE+F++H  R T+N +DV L A R+  L A ++   +++
Subjt:  METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDL

Query:  KTKE-PQSEKKRKKA
              Q EKK+KK+
Subjt:  KTKE-PQSEKKRKKA

Q8N2Z9 Centromere protein S; e value 4.5e-06; % identity 32.2
Query:  ETGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLK
        ET  ++  S  + L+     +   + E  A    M+ S+  +A ++EL F+   E  AKDLE+F++H  R T+NTEDV L A R+  L   +T    ++ 
Subjt:  ETGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLK

Query:  TKEPQSEKKRKKAPKKED
          +   E+K +K  K ED
Subjt:  TKEPQSEKKRKKAPKKED

Q9FI55 Protein MHF1 homolog; e value 2.0e-33; % identity 67.83
Query:  EEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLKTKEPQ
        EE SM + +RDRFRLS ISIAEAEA ++GMEI   V+ACVA+LAFKY  E +AKDLELF+ H GRK VN +DV+LSAHRN++LAASL   CN+LK KEPQ
Subjt:  EEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLKTKEPQ

Query:  SEKKRKK-APKKEDR
        SE+KRKK + KKED+
Subjt:  SEKKRKK-APKKEDR

Arabidopsis top hits (e value, % identity, alignment)
AT5G50930.1 Histone superfamily protein; e value 1.4e-34; % identity 67.83
Query:  EEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLKTKEPQ
        EE SM + +RDRFRLS ISIAEAEA ++GMEI   V+ACVA+LAFKY  E +AKDLELF+ H GRK VN +DV+LSAHRN++LAASL   CN+LK KEPQ
Subjt:  EEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLKTKEPQ

Query:  SEKKRKK-APKKEDR
        SE+KRKK + KKED+
Subjt:  SEKKRKK-APKKEDR


Sequences
CDS sequence
ATGGAAACTGGGATGGAAGAAGAAGACTCCATGTCCGAACACTTGAGGGACCGATTCCGTCTCTCCACCATTTCCATCGCTGAAGCTGAAGCGAATAGAAGCGGCATGGA
AATCTCTGAAACTGTGATGGCTTGTGTCGCTGAGTTAGCGTTCAAATATACAACAGAACAGTTGGCAAAAGACCTTGAGTTATTTTCTCAGCATGGTGGTCGGAAAACTG
TGAATACGGAAGACGTCATACTATCAGCCCATAGAAACGAGCATCTGGCTGCCTCGTTAACATTCTTCTGCAATGATCTAAAGACAAAAGAACCTCAAAGTGAGAAAAAG
CGGAAAAAGGCACCGAAAAAGGAAGATAGAGATAGAGGTGTAGTGCATATCACCGATGTCTAG
mRNA sequence
TTCGTTTAAGGCCATTTCCCGTCGCCAGTGACCAGTGCCAATTCTGGAGGAAATAAAGGCGGATTGAAACAGGTGCAGTCAAAAGTAGTAAAATCTGTGTCCAGAAGGCT
CTGCAATGGAAACTGGGATGGAAGAAGAAGACTCCATGTCCGAACACTTGAGGGACCGATTCCGTCTCTCCACCATTTCCATCGCTGAAGCTGAAGCGAATAGAAGCGGC
ATGGAAATCTCTGAAACTGTGATGGCTTGTGTCGCTGAGTTAGCGTTCAAATATACAACAGAACAGTTGGCAAAAGACCTTGAGTTATTTTCTCAGCATGGTGGTCGGAA
AACTGTGAATACGGAAGACGTCATACTATCAGCCCATAGAAACGAGCATCTGGCTGCCTCGTTAACATTCTTCTGCAATGATCTAAAGACAAAAGAACCTCAAAGTGAGA
AAAAGCGGAAAAAGGCACCGAAAAAGGAAGATAGAGATAGAGGTGTAGTGCATATCACCGATGTCTAGTCACATCTCCCCTCTGGGCATATCTCTAGTTAACAAGGGTAC
ATACCGATAAGTAATGACAATAGTATATGATTCTCTTCCTGATTCTAGGTTTTAGAAATCAGTAATGCCATTTTCACATTGTATGGGCCTATAGAGTATTACTCTGATTC
AAAGGTTAAGCTTCTGCATCATGCAAATTCATTAAAGTGTTTCCAAACGCAGGAAATATAGCTTTTTGTTTGTTTTAGGTATTTGAGATCGTCGAAGCATGAGTTTTTCA
ATGTTTTTGTGATCGGGAAGCGAGTTGAAATCCTGAACAGTTGAAAAGGTTGATAAATTGTTCTTTTTGCATTTAAAACACAATTTTGTGATGTAGACTTTTAGTTAAGT
GCACCACATTTTGATAATGTGAAGCATAGCGTTCATGGTGTAAATATAATAGCCTAGTTCTGACAGATATGAGAATGGGATTCGGCCCTCTCTCT
Protein sequence
METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLKTKEPQSEKK
RKKAPKKEDRDRGVVHITDV
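
The CDS and protein sequences above can be checked for consistency by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, including the stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

CDS = (
    "ATGGAAACTGGGATGGAAGAAGAAGACTCCATGTCCGAACACTTGAGGGACCGATTCCGTCTCTCCACCATTTCCATCGCTGAAGCTGAAGCGAATAGAAGCGGCATGGA"
    "AATCTCTGAAACTGTGATGGCTTGTGTCGCTGAGTTAGCGTTCAAATATACAACAGAACAGTTGGCAAAAGACCTTGAGTTATTTTCTCAGCATGGTGGTCGGAAAACTG"
    "TGAATACGGAAGACGTCATACTATCAGCCCATAGAAACGAGCATCTGGCTGCCTCGTTAACATTCTTCTGCAATGATCTAAAGACAAAAGAACCTCAAAGTGAGAAAAAG"
    "CGGAAAAAGGCACCGAAAAAGGAAGATAGAGATAGAGGTGTAGTGCATATCACCGATGTCTAG"
)
PROTEIN = (
    "METGMEEEDSMSEHLRDRFRLSTISIAEAEANRSGMEISETVMACVAELAFKYTTEQLAKDLELFSQHGGRKTVNTEDVILSAHRNEHLAASLTFFCNDLKTKEPQSEKK"
    "RKKAPKKEDRDRGVVHITDV"
)

translated = translate(CDS)
print(len(CDS), len(PROTEIN))         # 393 130
print(translated == PROTEIN + "*")    # True
```

The 393-nucleotide CDS encodes 130 residues followed by a TAG stop codon, in agreement with the protein sequence listed above.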