; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009927 (gene) of Snake gourd v1 genome

Gene IDTan0009927
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptioncentromere protein S isoform X2
Genome locationLG05:82032539..82034903
RNA-Seq ExpressionTan0009927
SyntenyTan0009927
Gene Ontology termsGO:0000712 - resolution of meiotic recombination intermediates (biological process)
GO:0006312 - mitotic recombination (biological process)
GO:0007129 - synapsis (biological process)
GO:0031297 - replication fork processing (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0043240 - Fanconi anaemia nuclear complex (cellular component)
GO:0071821 - FANCM-MHF complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR009072 - Histone-fold
IPR029003 - CENP-S/Mhf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573152.1 Protein MHF1-like protein, partial [Cucurbita argyrosperma subsp. sororia]5.7e-3386.17Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS
        METGMEED+S AELLRDRFRLS ISIAEAEAKRSGMEIS PVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS   R ++L A  S
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS

KAG7012337.1 Protein MHF1-like protein [Cucurbita argyrosperma subsp. argyrosperma]2.8e-3285.11Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS
        METGMEED+S AELLRDRFRLS I IAEAEAKRSGMEIS PVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS   R ++L A  S
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS

XP_004154006.1 protein MHF1 homolog isoform X2 [Cucumis sativus]5.9e-3076.84Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTSL
        METGMEED+S +ELLRDRFRLS+ISIAEAEA +SGMEISEPVMTCVA+LAFKYTKQLAKDLELFA H GRKSVNTEDVIL+    +      TS+
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTSL

XP_022955266.1 protein MHF1 homolog [Cucurbita moschata]4.8e-3285.11Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS
        METGMEED+S AELLR RFRLS ISIAEAEAKRSGMEIS PVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS   R ++L A  S
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS

XP_022994496.1 protein MHF1 homolog [Cucurbita maxima]7.5e-3386.17Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS
        METGMEED+S AELLRDRFRLS ISIAEAEAKRSGMEIS PVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS   R ++L A  S
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS

TrEMBL top hitse value%identityAlignment
A0A0A0LR97 Uncharacterized protein2.9e-3076.84Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTSL
        METGMEED+S +ELLRDRFRLS+ISIAEAEA +SGMEISEPVMTCVA+LAFKYTKQLAKDLELFA H GRKSVNTEDVIL+    +      TS+
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTSL

A0A1S3B173 centromere protein S isoform X22.4e-2975.79Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTSL
        MET MEED+S +ELLRDRFRLSTISIAEAEA +SGMEISEPVMTCVA+LAFK+TKQLAKDLELFA H GRKSVNTEDVIL+    +      TS+
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTSL

A0A5D3BHQ8 Centromere protein S isoform X22.4e-2975.79Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTSL
        MET MEED+S +ELLRDRFRLSTISIAEAEA +SGMEISEPVMTCVA+LAFK+TKQLAKDLELFA H GRKSVNTEDVIL+    +      TS+
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTSL

A0A6J1GTB4 protein MHF1 homolog2.3e-3285.11Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS
        METGMEED+S AELLR RFRLS ISIAEAEAKRSGMEIS PVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS   R ++L A  S
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS

A0A6J1JZB0 protein MHF1 homolog3.6e-3386.17Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS
        METGMEED+S AELLRDRFRLS ISIAEAEAKRSGMEIS PVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS   R ++L A  S
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTS

SwissProt top hitse value%identityAlignment
E1BSW7 Centromere protein S7.6e-0428.57Show/hide
Query:  GMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVIL
        G E+   + + LR     +T  + +  A+  G+  S+  +  ++E+ F+  +  A+DLE+FA H  R ++ +EDV L
Subjt:  GMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVIL

Q2TBR7 Centromere protein S3.4e-0438.18Show/hide
Query:  IAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVIL
        + E  A    M+ S+  +  ++E+ F   +  AKDLE+FA H  R ++NTEDV L
Subjt:  IAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVIL

Q6NRI8 Centromere protein S9.0e-0530.34Show/hide
Query:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDV-ILSDFSRKQY
        M  G EE  S  + L+        S+ +  A    ++ S+  +  ++E+ F+  +  AKDLE+FA H  R ++N +DV +L+  SR  Y
Subjt:  METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDV-ILSDFSRKQY

Q8N2Z9 Centromere protein S9.0e-0532.91Show/hide
Query:  ETGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVIL
        ET  ++  S  + L+     +   + E  A    M+ S+  +  ++EL F+  +  AKDLE+FA H  R ++NTEDV L
Subjt:  ETGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVIL

Q9FI55 Protein MHF1 homolog3.3e-2370.67Show/hide
Query:  EDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS
        E+ SM +L+RDRFRLS ISIAEAEAK++GMEI  PV+ CVA+LAFKY + +AKDLELFAHH GRK VN +DV+LS
Subjt:  EDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS

Arabidopsis top hitse value%identityAlignment
AT5G50930.1 Histone superfamily protein2.3e-2470.67Show/hide
Query:  EDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS
        E+ SM +L+RDRFRLS ISIAEAEAK++GMEI  PV+ CVA+LAFKY + +AKDLELFAHH GRK VN +DV+LS
Subjt:  EDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACTGGGATGGAAGAAGACAACTCCATGGCCGAACTCTTGAGGGACCGATTCCGACTCTCCACCATTTCCATCGCTGAAGCTGAAGCGAAGAGAAGCGGCATGGA
AATCTCCGAACCTGTGATGACCTGTGTCGCTGAGTTAGCGTTCAAATATACAAAACAGTTGGCAAAGGACCTTGAGTTATTTGCTCATCATGGTGGTCGGAAATCTGTGA
ATACGGAAGATGTCATACTCTCAGACTTTTCTAGGAAGCAGTACTTAATCGCTCGAACCTCTTTGCTTCTGCTTCATTTTGGTCTCAGCCCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGACTGGGATGGAAGAAGACAACTCCATGGCCGAACTCTTGAGGGACCGATTCCGACTCTCCACCATTTCCATCGCTGAAGCTGAAGCGAAGAGAAGCGGCATGGA
AATCTCCGAACCTGTGATGACCTGTGTCGCTGAGTTAGCGTTCAAATATACAAAACAGTTGGCAAAGGACCTTGAGTTATTTGCTCATCATGGTGGTCGGAAATCTGTGA
ATACGGAAGATGTCATACTCTCAGACTTTTCTAGGAAGCAGTACTTAATCGCTCGAACCTCTTTGCTTCTGCTTCATTTTGGTCTCAGCCCATAGAAACGATCATCTGGC
TGCCTCATTAGCATCTTTCTGCAATGATCTAAAGACAAAAGAACCTCAAAGTGAGAAAAAGCGGAAAAAGGCACCGAAAAAGGAAGATAGAGATAGAGGTGTTGTGCATA
TTACTGACGCATAATCACATCCCCCAGTGGGCAGATCTCTAGTTAACAAGGGTACATACCGATAAGTAATGACAATATTATATGATTCTCTTCCTGTTTCTAGGTTATAC
TAATTAGTAATACCATTTTCACATCGTATGGGCCTATGGGTTTTTTTCTGATTTGAAGGATAAGCTTCTGCCATAGCATCTTTCTGCAATGTTTAAAGTGTTTCCAAACG
CAGGAAACGGGGCTTTTAAACCGTTTTAGGCATTTGATATTGTTGAAGCATGAGTTTCAATGTTTAGTGGTAGTTTTCATTGTGAACGGAAAATGAATTGGAATCCTGAA
CAGTTGAAAAGGGTGATAAATTGTTCTTTGTAAATTTACAGCACAATTTTATGATTTAGACTTTTAGTAGAGTGCCCAACATTTTGATCTACTTGGAATGAGCGCAATTC
GTGTATCTTTACTGATAAGGAATTAGATACACACTCCCTTTGCACTTCAATTCAATTACTTTTTTG
Protein sequenceShow/hide protein sequence
METGMEEDNSMAELLRDRFRLSTISIAEAEAKRSGMEISEPVMTCVAELAFKYTKQLAKDLELFAHHGGRKSVNTEDVILSDFSRKQYLIARTSLLLLHFGLSP