CuGenDBv2

Tan0014648 (gene) of Snake gourd v1 genome

Gene ID: Tan0014648
Organism: Trichosanthes anguina (Snake gourd v1)
Description: INVOLVED IN: chromosome segregation, cell division; LOCATED IN: chromosome, centromeric region, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages;
Genome location: LG02:94759505..94781675
RNA-Seq Expression: Tan0014648
Synteny: Tan0014648
Gene Ontology terms:
GO:0006979 - response to oxidative stress (biological process)
GO:0034508 - centromere complex assembly (biological process)
GO:0042744 - hydrogen peroxide catabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0005634 - nucleus (cellular component)
GO:0031511 - Mis6-Sim4 complex (cellular component)
GO:0004601 - peroxidase activity (molecular function)
GO:0020037 - heme binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
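The genome location field above follows a `SEQID:START..END` pattern. A minimal sketch of splitting it into parts and computing the span is given here; the function name `parse_location` is hypothetical, and the span assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'SEQID:START..END' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG02:94759505..94781675")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 22171 bp on LG02, considerably longer than the mRNA listed further down, which would be consistent with a gene model containing introns.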


Homology
GenBank top hits (e value, % identity, alignment)
KAG6598812.1 hypothetical protein SDJN03_08590, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.9e-165 | % identity: 90.64
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDIEES KK+ R+NPH GESSRKS   +ED  LETTRAR SN LKRHSELTERLSRDSDKMIFERLQKEFEAARASQ QE+YLDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEAERKAMPEDADMLP EKITYKVG KVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLPVRE+ENDLLSSNAMKFID+
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTIN--SSILSIKKEN-GGTVS
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHMI FVLDDSDCTVTVSLRYADLISVLPT+ISVLAWPMP++KKNT N  SS LSIKKEN GGTVS
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTIN--SSILSIKKEN-GGTVS

Query:  HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP
        HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFP KP
Subjt:  HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP

XP_004138767.1 uncharacterized protein LOC101206507 [Cucumis sativus] | e value: 4.7e-169 | % identity: 91.45
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDIEES +K+ RKNPH GESSRKS   +ED  +ETTRAR SN LKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQE+YLDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEAERKAMPEDAD+LPQEKITYKVGTKVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHMI FV+DDSDCTVTVSLRYADLI VLPT+ISVLAWPMP +KKNT NSSILSIKKENGGTVSHPI
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI

Query:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP
        PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQ+FP KP
Subjt:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP

XP_008445088.1 PREDICTED: uncharacterized protein LOC103488232 [Cucumis melo] | e value: 3.4e-167 | % identity: 90.27
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDI+ES +K+ RKNPH GESSRKS   +ED  +ETTRAR SN LKRHSELTERLSRDSDKM+FERLQKEFEAARASQTQE+YLDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEAERKAMPEDAD+LP EKITYKVGTKVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHMI FV+DDSDCTVTVSLRYADLI VLPT+ISVLAWPMP++KKNT NSSI SIKKENGGTVSHPI
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI

Query:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP
        PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQ+FP KP
Subjt:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP

XP_023546279.1 uncharacterized protein LOC111805418 [Cucurbita pepo subsp. pepo] | e value: 1.9e-165 | % identity: 90.64
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDIEES KK+ R+NPH GESSRKS   +ED  LETTRAR SN LKRHSELTERLSRDSDKMIFERLQKEFEAARASQ QE+YLDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEAERKAMPEDADMLP EKITYKVG KVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLPVRE+ENDLLSSNAMKFID+
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTIN--SSILSIKKEN-GGTVS
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHMI FVLDDSDCTVTVSLRYADLISVLPT+ISVLAWPMP++KKNT N  SS LSIKKEN GGTVS
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTIN--SSILSIKKEN-GGTVS

Query:  HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP
        HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFP KP
Subjt:  HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP

XP_038884617.1 uncharacterized protein LOC120075366 [Benincasa hispida] | e value: 5.2e-168 | % identity: 91.42
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDIEES +K+ RKNPH GESSRKS   +ED  +ETTRAR SN LKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQE+YLDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEAERKAMPEDAD+LPQEKITYKVGTKVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHMI F LDDSDCTVTVSLRYADLI VLPT+ISVLAWPMP++KKNT NSSILSIKKENGGTVS+PI
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI

Query:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQK
        PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFP K
Subjt:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LLQ7 Uncharacterized protein | e value: 2.3e-169 | % identity: 91.45
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDIEES +K+ RKNPH GESSRKS   +ED  +ETTRAR SN LKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQE+YLDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEAERKAMPEDAD+LPQEKITYKVGTKVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHMI FV+DDSDCTVTVSLRYADLI VLPT+ISVLAWPMP +KKNT NSSILSIKKENGGTVSHPI
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI

Query:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP
        PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQ+FP KP
Subjt:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP

A0A1S3BBE0 uncharacterized protein LOC103488232 | e value: 1.6e-167 | % identity: 90.27
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDI+ES +K+ RKNPH GESSRKS   +ED  +ETTRAR SN LKRHSELTERLSRDSDKM+FERLQKEFEAARASQTQE+YLDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEAERKAMPEDAD+LP EKITYKVGTKVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHMI FV+DDSDCTVTVSLRYADLI VLPT+ISVLAWPMP++KKNT NSSI SIKKENGGTVSHPI
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI

Query:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP
        PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQ+FP KP
Subjt:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP

A0A6J1BR87 uncharacterized protein LOC111005041 | e value: 5.3e-158 | % identity: 86.51
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDIEES KK+ RKNPH GESSRKS   EED +LETTRAR SN LKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQE+ LDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEA+RKAMP ++DMLP EKITYKVGTKVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLP+REAENDLLSSNAMKFIDH
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHM+ FVLDD DCTVTVSLRYADLI VLPT+ISVLAWPMP+ +K             NGGT SHPI
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPI

Query:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKPPT
        PARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQMFP KPPT
Subjt:  PARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKPPT

A0A6J1HHI2 uncharacterized protein LOC111462989 | e value: 2.0e-165 | % identity: 90.64
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDIEES KK+ R+NPH GESSRKS   +ED  LETTRAR SN LKRHSELTERLSRDSDKMIFERLQKEFEAARASQ QE YLDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEAERKAMPEDADMLP EKITYKVG KVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLPVRE+ENDLLSSNAMKFID+
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTIN--SSILSIKKEN-GGTVS
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHMI FVLDDSDCTVTVSLRYADLISVLPT+ISVLAWPMP++KKNT N  SS LSIKKEN GGTVS
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTIN--SSILSIKKEN-GGTVS

Query:  HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP
        HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFP KP
Subjt:  HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP

A0A6J1K513 uncharacterized protein LOC111492357 | e value: 5.8e-165 | % identity: 90.62
Query:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE
        MESDIEES KK+ R+NPH GESSRKS   +ED  LETTRAR SN LKRHSELTERLSRDSDKMIFERLQKEFEAARASQ QE+YLDGEQWNDGLLATIRE
Subjt:  MESDIEESFKKRARKNPHHGESSRKS---EEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRE

Query:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH
        RVHMEAERKAMPEDADMLP EKITYKVG KVICCLEGARIGIQYETSFAG+PCELYHCVLESKSFLEKMTVLEHTIPFFLPVRE+ENDLLSSNAMKFID+
Subjt:  RVHMEAERKAMPEDADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDH

Query:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTIN-SSILSIKKEN-GGTVSH
        IGELLQAYVDRREQVRLIKELYGNQIRELYH+L FHMI FVLDDSDCTVTVSLRYADLISVLPT+ISVLAWPMP++KKNT N SS LSIKKEN GGTVSH
Subjt:  IGELLQAYVDRREQVRLIKELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTIN-SSILSIKKEN-GGTVSH

Query:  PIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP
        PIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMF  KP
Subjt:  PIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G10710.1 INVOLVED IN: chromosome segregation, cell division; LOCATED IN: chromosome, centromeric region, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Centromere protein Cenp-O (InterPro:IPR018464); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). | e value: 3.2e-107 | % identity: 62.31
Query:  GESSRKSEEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEDADML----
        GE     ++D  L+TTRARLSN LKRH EL++RL+RDSDK + +RL KEFEAAR SQ+QEV+LDGE+WNDGLLAT+RERVHMEA+RKA   +A       
Subjt:  GESSRKSEEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEDADML----

Query:  PQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQVRLI
        P+E+ITY+VG KVICCL+G+RIGIQ+ETS AG+  E+YHCVLESKSFLEKM VLEHTIPFFLP+ + ENDLL SNA KFID++G+LLQAYVDR+EQVRLI
Subjt:  PQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQVRLI

Query:  KELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPIPARLSYAEDALRTMSLPE
        KEL+G+QI E+YH+L +HMI F +DD DC   VSLRY DL+  LPT++ +L WPM  L            KK+     S  IP RL +AEDA R  SLPE
Subjt:  KELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPIPARLSYAEDALRTMSLPE

Query:  AYAEIVLNLPQAIQQMFPQKP
        AYAEI+ N+P  I+Q+F   P
Subjt:  AYAEIVLNLPQAIQQMFPQKP

AT5G10710.2 INVOLVED IN: chromosome segregation, cell division; LOCATED IN: chromosome, centromeric region, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Centromere protein Cenp-O (InterPro:IPR018464); Has 43 Blast hits to 43 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 31; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). | e value: 3.2e-107 | % identity: 62.31
Query:  GESSRKSEEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEDADML----
        GE     ++D  L+TTRARLSN LKRH EL++RL+RDSDK + +RL KEFEAAR SQ+QEV+LDGE+WNDGLLAT+RERVHMEA+RKA   +A       
Subjt:  GESSRKSEEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEDADML----

Query:  PQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQVRLI
        P+E+ITY+VG KVICCL+G+RIGIQ+ETS AG+  E+YHCVLESKSFLEKM VLEHTIPFFLP+ + ENDLL SNA KFID++G+LLQAYVDR+EQVRLI
Subjt:  PQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQVRLI

Query:  KELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPIPARLSYAEDALRTMSLPE
        KEL+G+QI E+YH+L +HMI F +DD DC   VSLRY DL+  LPT++ +L WPM  L            KK+     S  IP RL +AEDA R  SLPE
Subjt:  KELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPIPARLSYAEDALRTMSLPE

Query:  AYAEIVLNLPQAIQQMFPQKP
        AYAEI+ N+P  I+Q+F   P
Subjt:  AYAEIVLNLPQAIQQMFPQKP

AT5G10710.3 INVOLVED IN: chromosome segregation, cell division; LOCATED IN: chromosome, centromeric region, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Centromere protein Cenp-O (InterPro:IPR018464). | e value: 5.3e-94 | % identity: 57.63
Query:  GESSRKSEEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEDADML----
        GE     ++D  L+TTRARLSN LKRH EL++RL+RDSDK + +RL KEFEAAR SQ+QEV+              +  VHMEA+RKA   +A       
Subjt:  GESSRKSEEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEDADML----

Query:  PQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQVRLI
        P+E+ITY+VG KVICCL+G+RIGIQ+ETS AG+  E+YHCVLESKSFLEKM VLEHTIPFFLP+ + ENDLL SNA KFID++G+LLQAYVDR+EQVRLI
Subjt:  PQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQVRLI

Query:  KELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPIPARLSYAEDALRTMSLPE
        KEL+G+QI E+YH+L +HMI F +DD DC   VSLRY DL+  LPT++ +L WPM  L            KK+     S  IP RL +AEDA R  SLPE
Subjt:  KELYGNQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPIPARLSYAEDALRTMSLPE

Query:  AYAEIVLNLPQAIQQMFPQKP
        AYAEI+ N+P  I+Q+F   P
Subjt:  AYAEIVLNLPQAIQQMFPQKP
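
In the alignments above, the middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below counts identities for a single block, using the last block of the KAG6598812.1 alignment as input. The function name `midline_identity` is hypothetical, and the % identity values reported for each hit cover the whole alignment, not one block.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A midline letter that repeats the query residue marks an identity;
    # '+' (similar residue) and ' ' (mismatch or gap) are not counted.
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, len(query)


query = "HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFPQKP"
midline = "HPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQMFP KP"
identical, columns = midline_identity(query, midline)
print(f"{identical}/{columns} identical ({100 * identical / columns:.2f}%)")
```

Summing the two counts over every block of a hit, then dividing, should approximate the reported % identity, assuming the denominator is the full alignment length including gap columns.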


Sequences
CDS sequence
ATGGAAAGCGATATCGAAGAATCATTCAAGAAGAGAGCGAGAAAGAATCCACACCATGGCGAATCTTCTCGCAAGAGTGAAGAAGACAGTACATTGGAGACTACACGAGC
AAGACTTTCAAATGCGCTCAAAAGGCACAGTGAATTAACTGAGCGTCTCTCCAGGGACTCTGACAAGATGATATTTGAGCGCTTACAAAAAGAATTTGAAGCTGCTAGAG
CATCTCAGACTCAAGAAGTATATTTGGACGGTGAACAATGGAATGATGGACTTTTAGCAACAATAAGAGAGCGGGTTCATATGGAAGCAGAGAGAAAGGCCATGCCTGAG
GATGCAGATATGTTGCCACAGGAAAAAATCACCTATAAAGTTGGAACCAAGGTTATTTGCTGCTTGGAAGGAGCGAGGATTGGCATACAATATGAGACATCTTTTGCGGG
TGACCCCTGTGAACTTTATCATTGCGTGCTAGAAAGCAAGTCATTTCTTGAAAAGATGACTGTCCTAGAACACACAATTCCATTCTTTCTGCCAGTACGAGAAGCAGAAA
ATGATCTTCTCTCCTCTAACGCGATGAAATTTATAGATCATATTGGAGAACTTTTGCAGGCCTATGTGGATAGAAGGGAACAGGTTCGACTTATCAAGGAGTTGTATGGA
AACCAAATCAGGGAATTGTATCATAACCTTTCATTCCATATGATTGTATTTGTGCTAGATGATTCTGACTGCACGGTGACTGTCAGTCTGAGATATGCAGATCTTATCTC
TGTGCTGCCAACTGAAATCAGTGTGCTTGCATGGCCAATGCCTCGGTTGAAGAAGAATACCATAAACTCATCAATCTTGAGCATCAAGAAGGAAAATGGAGGAACTGTAA
GTCATCCTATCCCAGCTCGTCTATCATATGCAGAGGATGCTTTACGAACCATGAGCTTACCAGAAGCATATGCAGAGATCGTGTTGAATTTGCCCCAAGCTATACAACAG
ATGTTTCCGCAGAAACCTCCCACATAG
mRNA sequence
CTAAAGTATTGGCATTATGATGCATATTTGAATTTTCGTTTTCATATTTTTGGATTCTAATCTAAAAAAAGAGCGGGTATCCTTCAAAAAAACAGCGGGCATATTTCAAA
GCCCTAATTTCTGTGCCGGGAAAACCCGCCGAAAATTTGTTGCAGTTCACTGGACGCGAAGTGCGAAGCTTTTCTTCTCAAATTCGTTACGATTCTGTCACGGAGTTGAG
AAATGGAAAGCGATATCGAAGAATCATTCAAGAAGAGAGCGAGAAAGAATCCACACCATGGCGAATCTTCTCGCAAGAGTGAAGAAGACAGTACATTGGAGACTACACGA
GCAAGACTTTCAAATGCGCTCAAAAGGCACAGTGAATTAACTGAGCGTCTCTCCAGGGACTCTGACAAGATGATATTTGAGCGCTTACAAAAAGAATTTGAAGCTGCTAG
AGCATCTCAGACTCAAGAAGTATATTTGGACGGTGAACAATGGAATGATGGACTTTTAGCAACAATAAGAGAGCGGGTTCATATGGAAGCAGAGAGAAAGGCCATGCCTG
AGGATGCAGATATGTTGCCACAGGAAAAAATCACCTATAAAGTTGGAACCAAGGTTATTTGCTGCTTGGAAGGAGCGAGGATTGGCATACAATATGAGACATCTTTTGCG
GGTGACCCCTGTGAACTTTATCATTGCGTGCTAGAAAGCAAGTCATTTCTTGAAAAGATGACTGTCCTAGAACACACAATTCCATTCTTTCTGCCAGTACGAGAAGCAGA
AAATGATCTTCTCTCCTCTAACGCGATGAAATTTATAGATCATATTGGAGAACTTTTGCAGGCCTATGTGGATAGAAGGGAACAGGTTCGACTTATCAAGGAGTTGTATG
GAAACCAAATCAGGGAATTGTATCATAACCTTTCATTCCATATGATTGTATTTGTGCTAGATGATTCTGACTGCACGGTGACTGTCAGTCTGAGATATGCAGATCTTATC
TCTGTGCTGCCAACTGAAATCAGTGTGCTTGCATGGCCAATGCCTCGGTTGAAGAAGAATACCATAAACTCATCAATCTTGAGCATCAAGAAGGAAAATGGAGGAACTGT
AAGTCATCCTATCCCAGCTCGTCTATCATATGCAGAGGATGCTTTACGAACCATGAGCTTACCAGAAGCATATGCAGAGATCGTGTTGAATTTGCCCCAAGCTATACAAC
AGATGTTTCCGCAGAAACCTCCCACATAGAAGTATTTCAGCAACTCCTAGAGATATGTGGAACTGTACAAATTTATAGCAGAGGCTCTTCCAATACACTATTGGATCAAT
TCATCAGCAAGCTAAACACGACTTTTGCATCTTGGTCTCTTACGTTACTTTCTTGGGATTCAAGTTCATCCTTCCTCTAGTGGCATTGTGCTCTTTCAAAGCCAAATACA
TTCAAGATCTCTTGCAGCGACTGCACTTCCAACATCTAAAGCCTATTTCAGTTCCTATGGTCACGGGCAAACAACTTTATCATAATGGTGTATCCCTCTTGGAGACTTTT
CAATTGTATCAAAGCACTCTTGGATCTCTCCGGTATCTATTGCACACTAATCCCGATATTACATTTGTCGTTAACAAGCTCAGCCAATTCAATCATGCTCTCACTGAAAC
TCATTGGTAAGCCCTCAAACGCGTCCTCTGTTACCTCCATGGCACTTAGAATTCAGGCCTCCACATTCGACCTGCTCCATTTTTAGCCTTGACTGGGTACTCTGATGACA
ACTGAGTTGCATCCCCTGATGATCGTGAATCCATTGGAGGTTATTGTTGTGGGCCAAGGCAATAGCAATGATGCTATCGCACCAGATAATAGGAACGCACCAGGTAATAG
GAACTTTGGGTCTCACTTGTGAAATTCTAAACATGTTGAATTGGATGCAAATGCATTTGCGATCAAGTTCCTCAATTAGTGTATTTAGATTTGTTATATTGCTTCTCATG
AGCAAATAGCTTATTGCCTCATCAAATCTTTGTCTCAATGTGAACGTCAGATTCTTCGG
Protein sequence
MESDIEESFKKRARKNPHHGESSRKSEEDSTLETTRARLSNALKRHSELTERLSRDSDKMIFERLQKEFEAARASQTQEVYLDGEQWNDGLLATIRERVHMEAERKAMPE
DADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQVRLIKELYG
NQIRELYHNLSFHMIVFVLDDSDCTVTVSLRYADLISVLPTEISVLAWPMPRLKKNTINSSILSIKKENGGTVSHPIPARLSYAEDALRTMSLPEAYAEIVLNLPQAIQQ
MFPQKPPT
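
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (the record does not name a translation table) and, to stay short, translates only the first 45 nt and the final line of the CDS shown above. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGAAAGCGATATCGAAGAATCATTCAAGAAGAGAGCGAGAAAG"  # first 45 nt
cds_end = "ATGTTTCCGCAGAAACCTCCCACATAG"  # final line of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MESDIEESFKKRARK` and `MFPQKPPT*`, matching the start and end of the listed protein, with the trailing `*` corresponding to the `TAG` stop codon.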