CuGenDBv2

Clc03G04280 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc03G04280
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Centromere protein S-like
Genome location: ClcChr03:4217928..4220321
RNA-Seq Expression: Clc03G04280
Synteny: Clc03G04280
Gene Ontology terms:
GO:0000712 - resolution of meiotic recombination intermediates (biological process)
GO:0006312 - mitotic recombination (biological process)
GO:0007129 - synapsis (biological process)
GO:0031297 - replication fork processing (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0043240 - Fanconi anaemia nuclear complex (cellular component)
GO:0071821 - FANCM-MHF complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domains:
IPR009072 - Histone-fold
IPR029003 - CENP-S/Mhf1
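
The genome location field above has the form chromosome:start..end. As a minimal sketch, it can be split into its parts and the span of the locus computed. The helper name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span_length)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so both endpoints count.
    return chrom, start, end, end - start + 1


print(parse_location("ClcChr03:4217928..4220321"))
# ('ClcChr03', 4217928, 4220321, 2394)
```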


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
XP_004154006.1 protein MHF1 homolog isoform X2 [Cucumis sativus]; e-value 1.2e-56; identity 93.02%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVM CVADLAFKYTKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQSERKRKKAQKKEDRDRGAVHITDA
        TKEPQSERKRKKA KK+DRDRGAVHI DA
Subjt:  TKEPQSERKRKKAQKKEDRDRGAVHITDA
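
The % identity value of a hit can be checked against the middle row of its alignment. The sketch below assumes the usual BLAST convention: a letter on the middle row marks an identical residue, `+` marks a similar one, and a blank marks a mismatch or gap. The function name `percent_identity` is hypothetical; the rows are copied from the XP_004154006.1 alignment.

```python
def percent_identity(query_rows, midline_rows):
    """Identical positions divided by alignment length, as a percentage."""
    identical = 0
    aligned = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing blanks of a middle row are easily lost; pad them back.
        midline = midline.ljust(len(query))
        aligned += len(query)
        # Only letters count; '+' (similar) and ' ' (mismatch or gap) do not.
        identical += sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / aligned


query_rows = [
    "METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK",
    "TKEPQSERKRKKAQKKEDRDRGAVHITDA",
]
midline_rows = [
    "METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVM CVADLAFKYTKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK",
    "TKEPQSERKRKKA KK+DRDRGAVHI DA",
]

# 120 identical positions over 129 aligned columns
print(round(percent_identity(query_rows, midline_rows), 2))  # 93.02
```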

XP_008440159.1 PREDICTED: centromere protein S isoform X2 [Cucumis melo]; e-value 1.2e-54; identity 90.7%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+TKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQSERKRKKAQKKEDRDRGAVHITDA
         KEPQSERKRKKA KK+DRDRGAVHI +A
Subjt:  TKEPQSERKRKKAQKKEDRDRGAVHITDA

XP_011652290.1 protein MHF1 homolog isoform X1 [Cucumis sativus]; e-value 9.0e-55; identity 91.54%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL
        METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVM CVADLAFKY T+QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDL
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL

Query:  KTKEPQSERKRKKAQKKEDRDRGAVHITDA
        KTKEPQSERKRKKA KK+DRDRGAVHI DA
Subjt:  KTKEPQSERKRKKAQKKEDRDRGAVHITDA

XP_038894698.1 protein MHF1 homolog isoform X1 [Benincasa hispida]; e-value 1.5e-54; identity 92.31%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL
        M+TG+EED SASELLRDRFRLSTISIAEAEA RSGMEISEPVM CVADLAFKY T+QLAKDLELFVQHAGRKSVNTEDVILSAHRNEHL+AILTSICNDL
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL

Query:  KTKEPQSERKRKKAQKKEDRDRGAVHITDA
        KTKEPQSERKRKKA KKEDRDRGAVHI DA
Subjt:  KTKEPQSERKRKKAQKKEDRDRGAVHITDA

XP_038894699.1 protein MHF1 homolog isoform X2 [Benincasa hispida]; e-value 2.1e-56; identity 93.8%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        M+TG+EED SASELLRDRFRLSTISIAEAEA RSGMEISEPVM CVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHL+AILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQSERKRKKAQKKEDRDRGAVHITDA
        TKEPQSERKRKKA KKEDRDRGAVHI DA
Subjt:  TKEPQSERKRKKAQKKEDRDRGAVHITDA

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0LR97 Uncharacterized protein; e-value 6.1e-57; identity 93.02%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVM CVADLAFKYTKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQSERKRKKAQKKEDRDRGAVHITDA
        TKEPQSERKRKKA KK+DRDRGAVHI DA
Subjt:  TKEPQSERKRKKAQKKEDRDRGAVHITDA

A0A1S3B159 centromere protein S isoform X1; e-value 4.1e-53; identity 89.23%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+ T+QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDL
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL

Query:  KTKEPQSERKRKKAQKKEDRDRGAVHITDA
        K KEPQSERKRKKA KK+DRDRGAVHI +A
Subjt:  KTKEPQSERKRKKAQKKEDRDRGAVHITDA

A0A1S3B173 centromere protein S isoform X2; e-value 5.7e-55; identity 90.7%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+TKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQSERKRKKAQKKEDRDRGAVHITDA
         KEPQSERKRKKA KK+DRDRGAVHI +A
Subjt:  TKEPQSERKRKKAQKKEDRDRGAVHITDA

A0A5A7UM53 Centromere protein S isoform X1; e-value 4.1e-53; identity 89.23%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+ T+QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDL
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKY-TKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDL

Query:  KTKEPQSERKRKKAQKKEDRDRGAVHITDA
        K KEPQSERKRKKA KK+DRDRGAVHI +A
Subjt:  KTKEPQSERKRKKAQKKEDRDRGAVHITDA

A0A5D3BHQ8 Centromere protein S isoform X2; e-value 5.7e-55; identity 90.7%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+TKQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQSERKRKKAQKKEDRDRGAVHITDA
         KEPQSERKRKKA KK+DRDRGAVHI +A
Subjt:  TKEPQSERKRKKAQKKEDRDRGAVHITDA

SwissProt top hits (columns: hit, e-value, % identity, alignment)
E1BSW7 Centromere protein S; e-value 1.0e-05; identity 27.35%
Query:  GMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKE
        G E+ +   + LR     +T  + +  A+  G+  S+  +  ++++ F+  +  A+DLE+F +HA R ++ +EDV L A R+  L   +T   ++L +  
Subjt:  GMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKE

Query:  -PQSERKRKKAQKKEDR
          Q E+K+KK+   + R
Subjt:  -PQSERKRKKAQKKEDR

Q2TBR7 Centromere protein S; e-value 6.5e-08; identity 33.65%
Query:  IAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRKKAQKKEDRDRGAVH
        + E  A    M+ S+  +  ++++ F   +  AKDLE+F +HA R ++NTEDV L A R+  L   +T    D+   +   E+K KK +K ED +R +V 
Subjt:  IAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRKKAQKKEDRDRGAVH

Query:  ITDA
          +A
Subjt:  ITDA

Q6NRI8 Centromere protein S; e-value 2.1e-06; identity 26.79%
Query:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK
        M  G EE  S ++ L+        S+ +  A    ++ S+  +  ++++ F+  +  AKDLE+F +HA R ++N +DV L A R+  L A ++   +++ 
Subjt:  METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLK

Query:  TKEPQSERKRKK
            + + K+KK
Subjt:  TKEPQSERKRKK

Q8N2Z9 Centromere protein S; e-value 2.7e-06; identity 29.51%
Query:  ETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKT
        ET  ++  S  + L+     +   + E  A    M+ S+  +  +++L F+  +  AKDLE+F +HA R ++NTEDV L A R+  L   +T    ++  
Subjt:  ETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKT

Query:  KEPQSERKRKKAQKKEDRDRGA
         +   ERK +K +K ED  + +
Subjt:  KEPQSERKRKKAQKKEDRDRGA

Q9FI55 Protein MHF1 homolog; e-value 1.5e-36; identity 67.74%
Query:  EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQS
        E+ S  +L+RDRFRLS ISIAEAEAK++GMEI  PV+ CVADLAFKY + +AKDLELF  HAGRK VN +DV+LSAHRN++LAA L S+CN+LK KEPQS
Subjt:  EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQS

Query:  ERKRKK-AQKKEDR--DRGAVHIT
        ERKRKK + KKED+     AV IT
Subjt:  ERKRKK-AQKKEDR--DRGAVHIT

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT5G50930.1 Histone superfamily protein; e-value 1.1e-37; identity 67.74%
Query:  EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQS
        E+ S  +L+RDRFRLS ISIAEAEAK++GMEI  PV+ CVADLAFKY + +AKDLELF  HAGRK VN +DV+LSAHRN++LAA L S+CN+LK KEPQS
Subjt:  EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQS

Query:  ERKRKK-AQKKEDR--DRGAVHIT
        ERKRKK + KKED+     AV IT
Subjt:  ERKRKK-AQKKEDR--DRGAVHIT


Sequences
CDS sequence
ATGAAAAAAAAAAATAATGGTGAAACGATTCGTTTAAGACCATCCCGTCGGGAGTGGCATTTCTGGAGGGAAATAAAGGCGGATGAAAACAGTGAGACTGCAGCAAAATT
CCGTAAATTAACTCCAGAAGCCTCTGCAATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAGGGACAGATTCCGACTCTCCACCATTTCTATTGCTG
AAGCTGAAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGATTTGTGTTGCTGATTTAGCCTTCAAATATACAAAACAGTTGGCAAAGGACCTTGAGTTATTT
GTTCAGCATGCTGGTCGAAAATCTGTGAATACAGAAGATGTAATACTATCAGCCCATAGAAACGAGCATTTGGCTGCTATATTAACATCCATTTGCAATGATCTAAAGAC
AAAAGAACCTCAAAGTGAGAGAAAGCGAAAAAAGGCACAAAAAAAAGAAGATAGAGATAGAGGTGCAGTGCATATTACCGACGCCTAA
mRNA sequence
ATGAAAAAAAAAAATAATGGTGAAACGATTCGTTTAAGACCATCCCGTCGGGAGTGGCATTTCTGGAGGGAAATAAAGGCGGATGAAAACAGTGAGACTGCAGCAAAATT
CCGTAAATTAACTCCAGAAGCCTCTGCAATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAGGGACAGATTCCGACTCTCCACCATTTCTATTGCTG
AAGCTGAAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGATTTGTGTTGCTGATTTAGCCTTCAAATATACAAAACAGTTGGCAAAGGACCTTGAGTTATTT
GTTCAGCATGCTGGTCGAAAATCTGTGAATACAGAAGATGTAATACTATCAGCCCATAGAAACGAGCATTTGGCTGCTATATTAACATCCATTTGCAATGATCTAAAGAC
AAAAGAACCTCAAAGTGAGAGAAAGCGAAAAAAGGCACAAAAAAAAGAAGATAGAGATAGAGGTGCAGTGCATATTACCGACGCCTAATCACATCTCCCATCGGGCTGAT
CTAGTTAACAAGGGTGCATACCAACAAGCAATGGCAATAGCTTATGATCTCTTTCCTTTTTCTTGGTTATACTAATTAGTAATACTGTTTTCACATCACATGGGCCTATG
GAGTTCTTCTGATTTAAAGGATAAGCTTCTGCACCATGCAAATTCATTTGTGTTTCCAAACGCAGGAAACATGGCTTTTGGTTTGTTTTAGGTATTTGAGATTGTCGAAG
CTTGAGTTTCAATGTTCATTGGTGACCAACTCTGAACGGGGTGATAAATTGCTCTTTGTAAATCTACAGCACAAATGTATGATTTAAGACTTTTAGTTAAGTACTCAAAT
TTGAAGCATATTTTTGCTTGTGCATGGTGCAAATATAATAGCCTAGTTATGCACGGATTACAATGGTTCATATCTCTCTCTCTCTCTGCTTTGGAAGAAAGAACCAATTC
GTTTTATCAATCCATCTTTCAAGCAGCTGAGAACTCGAGTTGGAGCGGGG
Protein sequence
MKKKNNGETIRLRPSRREWHFWREIKADENSETAAKFRKLTPEASAMETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTKQLAKDLELF
VQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRKKAQKKEDRDRGAVHITDA
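
The protein sequence corresponds to the CDS read three bases at a time. As a minimal sketch, assuming the standard genetic code (the page does not name the translation table it used), the translation can be reproduced as follows. The names `CODON_TABLE` and `translate` are hypothetical, and the two test strings are the first nine and last five codons of the CDS, spaced for readability.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop whitespace and line breaks so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATG AAA AAA AAA AAT AAT GGT GAA ACG"))  # MKKKNNGET
print(translate("ATT ACC GAC GCC TAA"))                  # ITDA
```

Passing the full CDS listed above to `translate` should give the protein sequence shown, provided the standard-code assumption holds.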