; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS027803 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS027803
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionCentromere protein S-like
Genome locationscaffold22:419031..420847
RNA-Seq ExpressionMS027803
SyntenyMS027803
Gene Ontology termsGO:0000712 - resolution of meiotic recombination intermediates (biological process)
GO:0006312 - mitotic recombination (biological process)
GO:0007129 - synapsis (biological process)
GO:0031297 - replication fork processing (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0043240 - Fanconi anaemia nuclear complex (cellular component)
GO:0071821 - FANCM-MHF complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR009072 - Histone-fold
IPR029003 - CENP-S/Mhf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004154006.1 protein MHF1 homolog isoform X2 [Cucumis sativus]1.3e-5084.62Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        METG EEDD+A++LLRDRFRLS+ISIAEAEA ++GMEISEPVMTCVA+LAFKY T+QLAKDLELFAQHAGRKSVN EDVIL+AHRNEHLAA LTS CNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        KTKEPQ+ERKRKKA KK+DRDRG VHI DA
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

XP_008440142.1 PREDICTED: centromere protein S isoform X1 [Cucumis melo]4.5e-5183.85Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        MET  EEDD+A++LLRDRFRLSTISIAEAEA ++GMEISEPVMTCVA+LAFK+TTEQLAKDLELFAQHAGRKSVN EDVIL+AHRNEHLAA LTS CNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        K KEPQ+ERKRKKA KK+DRDRG VHI +A
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

XP_011652290.1 protein MHF1 homolog isoform X1 [Cucumis sativus]4.8e-5386.15Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        METG EEDD+A++LLRDRFRLS+ISIAEAEA ++GMEISEPVMTCVA+LAFKYTTEQLAKDLELFAQHAGRKSVN EDVIL+AHRNEHLAA LTS CNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        KTKEPQ+ERKRKKA KK+DRDRG VHI DA
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

XP_022137214.1 protein MHF1 homolog [Momordica charantia]3.1e-6096.92Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        METGREEDDTAT+LL DRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVN EDVILSAHRNEHLAASLTSFCNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        KTKEPQ ERKRKKASKKEDRDRGVVHINDA
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

XP_038894698.1 protein MHF1 homolog isoform X1 [Benincasa hispida]4.0e-5286.15Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        M+TG EED +A++LLRDRFRLSTISIAEAEA R+GMEISEPVMTCVA+LAFKYTTEQLAKDLELF QHAGRKSVN EDVILSAHRNEHL+A LTS CNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        KTKEPQ+ERKRKKA KKEDRDRG VHI DA
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

TrEMBL top hitse value%identityAlignment
A0A0A0LR97 Uncharacterized protein6.3e-5184.62Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        METG EEDD+A++LLRDRFRLS+ISIAEAEA ++GMEISEPVMTCVA+LAFKY T+QLAKDLELFAQHAGRKSVN EDVIL+AHRNEHLAA LTS CNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        KTKEPQ+ERKRKKA KK+DRDRG VHI DA
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

A0A1S3B159 centromere protein S isoform X12.2e-5183.85Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        MET  EEDD+A++LLRDRFRLSTISIAEAEA ++GMEISEPVMTCVA+LAFK+TTEQLAKDLELFAQHAGRKSVN EDVIL+AHRNEHLAA LTS CNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        K KEPQ+ERKRKKA KK+DRDRG VHI +A
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

A0A5A7UM53 Centromere protein S isoform X12.2e-5183.85Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        MET  EEDD+A++LLRDRFRLSTISIAEAEA ++GMEISEPVMTCVA+LAFK+TTEQLAKDLELFAQHAGRKSVN EDVIL+AHRNEHLAA LTS CNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        K KEPQ+ERKRKKA KK+DRDRG VHI +A
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

A0A5D3BHQ8 Centromere protein S isoform X25.9e-4982.31Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        MET  EEDD+A++LLRDRFRLSTISIAEAEA ++GMEISEPVMTCVA+LAFK+ T+QLAKDLELFAQHAGRKSVN EDVIL+AHRNEHLAA LTS CNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        K KEPQ+ERKRKKA KK+DRDRG VHI +A
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

A0A6J1C623 protein MHF1 homolog1.5e-6096.92Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        METGREEDDTAT+LL DRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVN EDVILSAHRNEHLAASLTSFCNDL
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKASKKEDRDRGVVHINDA
        KTKEPQ ERKRKKASKKEDRDRGVVHINDA
Subjt:  KTKEPQAERKRKKASKKEDRDRGVVHINDA

SwissProt top hitse value%identityAlignment
E1BSW7 Centromere protein S5.9e-0630.51Show/hide
Query:  GREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTK
        G E+ +     LR     +T  + +  A+  G+  S+  +  ++E+ F+   E  A+DLE+FA+HA R ++  EDV L A R+  L   +T   ++L + 
Subjt:  GREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTK

Query:  E-PQAERKRKKASKKEDR
           Q E+K+KK+S  + R
Subjt:  E-PQAERKRKKASKKEDR

Q2TBR7 Centromere protein S1.2e-0635.24Show/hide
Query:  IAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTKEPQAERKRKKASKKEDRDRGVV
        + E  A    M+ S+  +  ++E+ F    E  AKDLE+FA+HA R ++N EDV L A R+  L   +T    D+   +   E+K KK  K ED +R  V
Subjt:  IAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTKEPQAERKRKKASKKEDRDRGVV

Query:  HINDA
           +A
Subjt:  HINDA

Q6NRI8 Centromere protein S1.8e-0730.43Show/hide
Query:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL
        M  G+EE  + T  L+        S+ +  A    ++ S+  +  ++E+ F+   E  AKDLE+FA+HA R ++NM+DV L A R+  L A ++   +++
Subjt:  METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDL

Query:  KTKEPQAERKRKKAS
             + + K+KK S
Subjt:  KTKEPQAERKRKKAS

Q8N2Z9 Centromere protein S7.8e-0636.17Show/hide
Query:  IAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTKEPQAERKRKKASKKED
        + E  A    M+ S+  +  ++EL F+   E  AKDLE+FA+HA R ++N EDV L A R+  L   +T    ++   +   ERK +K  K ED
Subjt:  IAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTKEPQAERKRKKASKKED

Q9FI55 Protein MHF1 homolog2.9e-3773.04Show/hide
Query:  EDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTKEPQ
        E+ +  DL+RDRFRLS ISIAEAEAK+NGMEI  PV+ CVA+LAFKY  E +AKDLELFA HAGRK VNM+DV+LSAHRN++LAASL S CN+LK KEPQ
Subjt:  EDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTKEPQ

Query:  AERKRKKAS-KKEDR
        +ERKRKK S KKED+
Subjt:  AERKRKKAS-KKEDR

Arabidopsis top hitse value%identityAlignment
AT5G50930.1 Histone superfamily protein2.1e-3873.04Show/hide
Query:  EDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTKEPQ
        E+ +  DL+RDRFRLS ISIAEAEAK+NGMEI  PV+ CVA+LAFKY  E +AKDLELFA HAGRK VNM+DV+LSAHRN++LAASL S CN+LK KEPQ
Subjt:  EDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTKEPQ

Query:  AERKRKKAS-KKEDR
        +ERKRKK S KKED+
Subjt:  AERKRKKAS-KKEDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACTGGGCGGGAGGAAGACGACACCGCCACCGATCTCTTGAGGGACCGATTCCGACTCTCCACCATTTCTATCGCTGAAGCTGAAGCGAAGAGAAACGGCATGGA
AATCTCTGAACCCGTGATGACTTGTGTCGCTGAATTAGCGTTCAAATATACAACAGAACAGTTGGCAAAGGACCTTGAGTTATTTGCTCAACATGCTGGTAGGAAATCTG
TGAATATGGAAGACGTCATACTATCAGCCCATAGAAACGAGCATCTCGCTGCCTCATTAACATCCTTCTGCAATGATCTAAAGACAAAAGAACCTCAAGCTGAGAGGAAG
CGGAAAAAGGCATCGAAAAAGGAAGACAGAGATAGAGGTGTAGTGCATATTAATGACGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACTGGGCGGGAGGAAGACGACACCGCCACCGATCTCTTGAGGGACCGATTCCGACTCTCCACCATTTCTATCGCTGAAGCTGAAGCGAAGAGAAACGGCATGGA
AATCTCTGAACCCGTGATGACTTGTGTCGCTGAATTAGCGTTCAAATATACAACAGAACAGTTGGCAAAGGACCTTGAGTTATTTGCTCAACATGCTGGTAGGAAATCTG
TGAATATGGAAGACGTCATACTATCAGCCCATAGAAACGAGCATCTCGCTGCCTCATTAACATCCTTCTGCAATGATCTAAAGACAAAAGAACCTCAAGCTGAGAGGAAG
CGGAAAAAGGCATCGAAAAAGGAAGACAGAGATAGAGGTGTAGTGCATATTAATGACGCATAA
Protein sequenceShow/hide protein sequence
METGREEDDTATDLLRDRFRLSTISIAEAEAKRNGMEISEPVMTCVAELAFKYTTEQLAKDLELFAQHAGRKSVNMEDVILSAHRNEHLAASLTSFCNDLKTKEPQAERK
RKKASKKEDRDRGVVHINDA