; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025678 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025678
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionINVOLVED IN: chromosome segregation, cell division; LOCATED IN: chromosome, centromeric region, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages;
Genome locationscaffold13:31386595..31406808
RNA-Seq ExpressionSpg025678
SyntenySpg025678
Gene Ontology termsGO:0034508 - centromere complex assembly (biological process)
GO:0000776 - kinetochore (cellular component)
InterPro domainsIPR018464 - Centromere protein O


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598812.1 hypothetical protein SDJN03_08590, partial [Cucurbita argyrosperma subsp. sororia]1.3e-13655.25Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDIEES KKK R+NPH GESSRK G                          Q  +   LE   + F                                
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
                  SNVLKRHS+L+ERLSRDSDKMIFERLQKEFEAARAS+ QE+YLDGEQWNDGLLATIRERVHMEAERKAMPE+ADMLP EKITYKVG KVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLPVRE+ENDLLSSNAMKFID+IGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTIN--SSILSIKK
                             QVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLI VLP++ISVLAWPMPQMKKNT N  SS LSIKK
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTIN--SSILSIKK

Query:  EN-GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        EN GGTV HPIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQMFPPKP +
Subjt:  EN-GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

XP_004138767.1 uncharacterized protein LOC101206507 [Cucumis sativus]3.2e-14356.1Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDIEES +KK RKNPH GESSRK G      LD  I                                                              
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
         R +++    SNVLKRHS+L+ERLSRDSDKMIFERLQKEFEAARAS+TQE+YLDGEQWNDGLLATIRERVHMEAERKAMPE+AD+LPQEKITYKVGTKVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+IGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN
                             QVRLIKELYGNQIRELYHSLPFHMIEFV+DDSDCTVTVSLRYADLIYVLP++ISVLAWPMP MKKNT NSSILSIKKEN
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN

Query:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        GGTV HPIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQ+FPPKPH+
Subjt:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

XP_008445088.1 PREDICTED: uncharacterized protein LOC103488232 [Cucumis melo]8.0e-14255.56Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDI+ES +KK RKNPH GESSRK G      LD  I                                                              
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
         R +++    SNVLKRHS+L+ERLSRDSDKM+FERLQKEFEAARAS+TQE+YLDGEQWNDGLLATIRERVHMEAERKAMPE+AD+LP EKITYKVGTKVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+IGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN
                             QVRLIKELYGNQIRELYHSLPFHMIEFV+DDSDCTVTVSLRYADLIYVLP++ISVLAWPMPQMKKNT NSSI SIKKEN
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN

Query:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        GGTV HPIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQ+FPPKPH+
Subjt:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

XP_023546279.1 uncharacterized protein LOC111805418 [Cucurbita pepo subsp. pepo]5.9e-13755.25Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDIEES KKK R+NPH GESSRK G                            LQ   + L+ +  R                               
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
                  SNVLKRHS+L+ERLSRDSDKMIFERLQKEFEAARAS+ QE+YLDGEQWNDGLLATIRERVHMEAERKAMPE+ADMLP EKITYKVG KVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLPVRE+ENDLLSSNAMKFID+IGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTIN--SSILSIKK
                             QVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLI VLP++ISVLAWPMPQMKKNT N  SS LSIKK
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTIN--SSILSIKK

Query:  EN-GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        EN GGTV HPIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQMFPPKP +
Subjt:  EN-GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

XP_038884617.1 uncharacterized protein LOC120075366 [Benincasa hispida]3.0e-14155.74Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDIEES +KK RKNPH GESSRK G      +D  I                                                              
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
         R +++    SNVLKRHS+L+ERLSRDSDKMIFERLQKEFEAARAS+TQE+YLDGEQWNDGLLATIRERVHMEAERKAMPE+AD+LPQEKITYKVGTKVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+IGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN
                             QVRLIKELYGNQIRELYHSLPFHMIEF LDDSDCTVTVSLRYADLIYVLP++ISVLAWPMPQMKKNT NSSILSIKKEN
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN

Query:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        GGTV +PIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQMFPPK ++
Subjt:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

TrEMBL top hitse value%identityAlignment
A0A0A0LLQ7 Uncharacterized protein1.6e-14356.1Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDIEES +KK RKNPH GESSRK G      LD  I                                                              
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
         R +++    SNVLKRHS+L+ERLSRDSDKMIFERLQKEFEAARAS+TQE+YLDGEQWNDGLLATIRERVHMEAERKAMPE+AD+LPQEKITYKVGTKVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+IGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN
                             QVRLIKELYGNQIRELYHSLPFHMIEFV+DDSDCTVTVSLRYADLIYVLP++ISVLAWPMP MKKNT NSSILSIKKEN
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN

Query:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        GGTV HPIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQ+FPPKPH+
Subjt:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

A0A1S3BBE0 uncharacterized protein LOC1034882323.9e-14255.56Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDI+ES +KK RKNPH GESSRK G      LD  I                                                              
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
         R +++    SNVLKRHS+L+ERLSRDSDKM+FERLQKEFEAARAS+TQE+YLDGEQWNDGLLATIRERVHMEAERKAMPE+AD+LP EKITYKVGTKVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFID+IGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN
                             QVRLIKELYGNQIRELYHSLPFHMIEFV+DDSDCTVTVSLRYADLIYVLP++ISVLAWPMPQMKKNT NSSI SIKKEN
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN

Query:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        GGTV HPIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQ+FPPKPH+
Subjt:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

A0A6J1BR87 uncharacterized protein LOC1110050411.3e-12952.82Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDIEES KKK RKNPH GESSRK                              +Q+  + L+ +  R                               
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
                  SNVLKRHS+L+ERLSRDSDKMIFERLQKEFEAARAS+TQE+ LDGEQWNDGLLATIRERVHMEA+RKAMP E+DMLP EKITYKVGTKVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLP+REAENDLLSSNAMKFIDHIGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN
                             QVRLIKELYGNQIRELYHSLPFHM+EFVLDD DCTVTVSLRYADLIYVLP++ISVLAWPMPQ +K             N
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKEN

Query:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        GGT  HPIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQMFPPKP T
Subjt:  GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

A0A6J1HHI2 uncharacterized protein LOC1114629891.4e-13655.25Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDIEES KKK R+NPH GESSRK G                          Q  +   LE   + F                                
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
                  SNVLKRHS+L+ERLSRDSDKMIFERLQKEFEAARAS+ QE YLDGEQWNDGLLATIRERVHMEAERKAMPE+ADMLP EKITYKVG KVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLPVRE+ENDLLSSNAMKFID+IGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTIN--SSILSIKK
                             QVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLI VLP++ISVLAWPMPQMKKNT N  SS LSIKK
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTIN--SSILSIKK

Query:  EN-GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        EN GGTV HPIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQMFPPKP +
Subjt:  EN-GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

A0A6J1K513 uncharacterized protein LOC1114923571.9e-13655.17Show/hide
Query:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML
        MESDIEES KKK R+NPH GESSRK G                            LQ   + L+ +  R                               
Subjt:  MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLML

Query:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI
                  SNVLKRHS+L+ERLSRDSDKMIFERLQKEFEAARAS+ QE+YLDGEQWNDGLLATIRERVHMEAERKAMPE+ADMLP EKITYKVG KVI
Subjt:  QRTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVI

Query:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK
        CCLEGARIGIQYETSF                                                                     AG+PCELYHCVLESK
Subjt:  CCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESK

Query:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV
        SFLEKMTVLEHTIPFFLPVRE+ENDLLSSNAMKFID+IGELLQAYVDRRE                                                  
Subjt:  SFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIV

Query:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTIN-SSILSIKKE
                             QVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLI VLP++ISVLAWPMPQMKKNT N SS LSIKKE
Subjt:  GESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTIN-SSILSIKKE

Query:  N-GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT
        N GGTV HPIPARLSYAEDALRTMSLPEAYAEIVLNLP+AIQQMF PKP +
Subjt:  N-GGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein8.8e-0645.65Show/hide
Query:  LRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGV
        +R +E+P+F G   YEW  +VER+F V R  DS K++   L L+GV
Subjt:  LRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGV

AT5G10710.1 INVOLVED IN: chromosome segregation, cell division; LOCATED IN: chromosome, centromeric region, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Centromere protein Cenp-O (InterPro:IPR018464); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).1.3e-8642.67Show/hide
Query:  RTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADML----PQEKITYKVGT
        R D++   +SN+LKRH +LS+RL+RDSDK + +RL KEFEAAR S++QEV+LDGE+WNDGLLAT+RERVHMEA+RKA    A       P+E+ITY+VG 
Subjt:  RTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADML----PQEKITYKVGT

Query:  KVICCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVL
        KVICCL+G+RIGIQ+ETS                                                                     +AG+  E+YHCVL
Subjt:  KVICCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVL

Query:  ESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLM
        ESKSFLEKM VLEHTIPFFLP+ + ENDLL SNA KFID++G+LLQAYVDR+E                                               
Subjt:  ESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLM

Query:  MIVGESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIK
                                QVRLIKEL+G+QI E+YHSLP+HMIEF +DD DC   VSLRY DL+  LP+++ +L WPM  + K    S      
Subjt:  MIVGESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIK

Query:  KENGGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKP
            G+    IP RL +AEDA R  SLPEAYAEI+ N+P  I+Q+F   P
Subjt:  KENGGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKP

AT5G10710.2 INVOLVED IN: chromosome segregation, cell division; LOCATED IN: chromosome, centromeric region, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Centromere protein Cenp-O (InterPro:IPR018464); Has 43 Blast hits to 43 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 31; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).1.3e-8642.67Show/hide
Query:  RTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADML----PQEKITYKVGT
        R D++   +SN+LKRH +LS+RL+RDSDK + +RL KEFEAAR S++QEV+LDGE+WNDGLLAT+RERVHMEA+RKA    A       P+E+ITY+VG 
Subjt:  RTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADML----PQEKITYKVGT

Query:  KVICCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVL
        KVICCL+G+RIGIQ+ETS                                                                     +AG+  E+YHCVL
Subjt:  KVICCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVL

Query:  ESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLM
        ESKSFLEKM VLEHTIPFFLP+ + ENDLL SNA KFID++G+LLQAYVDR+E                                               
Subjt:  ESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLM

Query:  MIVGESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIK
                                QVRLIKEL+G+QI E+YHSLP+HMIEF +DD DC   VSLRY DL+  LP+++ +L WPM  + K    S      
Subjt:  MIVGESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIK

Query:  KENGGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKP
            G+    IP RL +AEDA R  SLPEAYAEI+ N+P  I+Q+F   P
Subjt:  KENGGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKP

AT5G10710.3 INVOLVED IN: chromosome segregation, cell division; LOCATED IN: chromosome, centromeric region, nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Centromere protein Cenp-O (InterPro:IPR018464).2.2e-7339.33Show/hide
Query:  RTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADML----PQEKITYKVGT
        R D++   +SN+LKRH +LS+RL+RDSDK + +RL KEFEAAR S++QEV+              +  VHMEA+RKA    A       P+E+ITY+VG 
Subjt:  RTDSSSFAVSNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADML----PQEKITYKVGT

Query:  KVICCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVL
        KVICCL+G+RIGIQ+ETS                                                                     +AG+  E+YHCVL
Subjt:  KVICCLEGARIGIQYETSFAAQLTKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVL

Query:  ESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLM
        ESKSFLEKM VLEHTIPFFLP+ + ENDLL SNA KFID++G+LLQAYVDR+E                                               
Subjt:  ESKSFLEKMTVLEHTIPFFLPVREAENDLLSSNAMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLM

Query:  MIVGESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIK
                                QVRLIKEL+G+QI E+YHSLP+HMIEF +DD DC   VSLRY DL+  LP+++ +L WPM  + K    S      
Subjt:  MIVGESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHSLPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIK

Query:  KENGGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKP
            G+    IP RL +AEDA R  SLPEAYAEI+ N+P  I+Q+F   P
Subjt:  KENGGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGCGATATCGAAGAATCATTCAAGAAGAAAGCGAGAAAGAATCCGCACCATGGCGAATCTTCTCGAAAGCTTGGGTTTCTGTTTACCTCGAGACTTGATTCTGT
GATCACTTTTTATGATCTGAAGTCTCCAAGTGTAGTTTTTTGCATGCATCAGCCTCTTCAGAAGTATCATCTAGAACTTCAGTTTTCCAGTTTTCGGGTTTTGAGGATTC
CTGTAAGTTTCGTTCCACCGATAACAAGAGTGGAGTCCTTATGGAAGAAGGCCATACTTGCTCACTGCAATCTTATGCTACAAAGAACTGATAGTAGCTCTTTTGCAGTT
TCAAATGTCCTCAAAAGGCATAGTGATTTATCCGAGCGTCTCTCCAGGGATTCTGACAAGATGATTTTTGAGCGCTTACAAAAAGAATTTGAAGCTGCTAGAGCATCTGA
GACTCAAGAAGTATATTTGGATGGTGAACAATGGAATGATGGACTTTTAGCAACAATAAGAGAGCGGGTTCATATGGAAGCAGAGAGAAAGGCCATGCCTGAGGAAGCAG
ATATGTTGCCACAGGAAAAAATCACCTATAAAGTTGGAACCAAGGTTATTTGCTGCTTGGAAGGAGCGAGGATTGGCATACAATATGAGACATCTTTTGCAGCTCAGTTG
ACAAAGGCGTTGGCTCCAGTCTTTGATCTTCGACTTCGCGAACTGGAAATTCCTCTGTTTTCGGGATTAAATCCGTATGAGTGGCTTCACCAAGTCGAGCGATACTTTGT
GGTGAATCGTATGACAGATTCAGCGAAAGTCGAGGCGGCCGTTCTTTGTCTTGATGGTGTCAAGTTCCTATCAGTTTCATTTAGTGCTGGTGATCCCTGTGAACTTTATC
ATTGCGTGCTTGAAAGCAAGTCATTTCTTGAAAAGATGACTGTCCTAGAACACACAATTCCATTCTTTCTGCCAGTACGAGAAGCAGAAAATGATCTTCTCTCCTCTAAC
GCAATGAAATTTATAGATCATATTGGAGAACTTTTGCAGGCCTATGTGGATAGAAGGGAACAGACCCCTGTGCATGTTCAAGTTAAGATGGCGTTGAAGTATTTCTTCCC
TTTGTATGCTTTTACCCACTTCTTGGCAAGTATAAATGTTGGCATTACCTGGCCTGTTGGTTGTGCTAAAGTTCACCAACTTCTTTTGATGATGATTGTAGGGGAGAGCT
GCATCTTCCATACTCACAAAGAGAAGGTTTATGTTTTGTGTTCACAAATCATTCAGGTTCGGCTTATCAAGGAGTTGTATGGAAACCAAATCAGGGAGTTGTATCACAGC
CTTCCATTCCATATGATTGAATTTGTGCTAGATGATTCTGACTGCACAGTGACTGTCAGTCTGAGATACGCAGATCTTATCTATGTGCTGCCGTCTGAAATTAGCGTGCT
TGCATGGCCAATGCCCCAGATGAAGAAAAATACCATAAATTCATCAATCTTGAGCATCAAGAAGGAAAATGGAGGAACTGTAGTTCATCCTATTCCAGCTCGTCTATCAT
ATGCAGAGGATGCTTTACGAACCATGAGCTTACCAGAAGCATATGCAGAGATCGTGTTGAATTTGCCCCGAGCTATACAACAGATGTTTCCGCCAAAACCTCACACATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGCGATATCGAAGAATCATTCAAGAAGAAAGCGAGAAAGAATCCGCACCATGGCGAATCTTCTCGAAAGCTTGGGTTTCTGTTTACCTCGAGACTTGATTCTGT
GATCACTTTTTATGATCTGAAGTCTCCAAGTGTAGTTTTTTGCATGCATCAGCCTCTTCAGAAGTATCATCTAGAACTTCAGTTTTCCAGTTTTCGGGTTTTGAGGATTC
CTGTAAGTTTCGTTCCACCGATAACAAGAGTGGAGTCCTTATGGAAGAAGGCCATACTTGCTCACTGCAATCTTATGCTACAAAGAACTGATAGTAGCTCTTTTGCAGTT
TCAAATGTCCTCAAAAGGCATAGTGATTTATCCGAGCGTCTCTCCAGGGATTCTGACAAGATGATTTTTGAGCGCTTACAAAAAGAATTTGAAGCTGCTAGAGCATCTGA
GACTCAAGAAGTATATTTGGATGGTGAACAATGGAATGATGGACTTTTAGCAACAATAAGAGAGCGGGTTCATATGGAAGCAGAGAGAAAGGCCATGCCTGAGGAAGCAG
ATATGTTGCCACAGGAAAAAATCACCTATAAAGTTGGAACCAAGGTTATTTGCTGCTTGGAAGGAGCGAGGATTGGCATACAATATGAGACATCTTTTGCAGCTCAGTTG
ACAAAGGCGTTGGCTCCAGTCTTTGATCTTCGACTTCGCGAACTGGAAATTCCTCTGTTTTCGGGATTAAATCCGTATGAGTGGCTTCACCAAGTCGAGCGATACTTTGT
GGTGAATCGTATGACAGATTCAGCGAAAGTCGAGGCGGCCGTTCTTTGTCTTGATGGTGTCAAGTTCCTATCAGTTTCATTTAGTGCTGGTGATCCCTGTGAACTTTATC
ATTGCGTGCTTGAAAGCAAGTCATTTCTTGAAAAGATGACTGTCCTAGAACACACAATTCCATTCTTTCTGCCAGTACGAGAAGCAGAAAATGATCTTCTCTCCTCTAAC
GCAATGAAATTTATAGATCATATTGGAGAACTTTTGCAGGCCTATGTGGATAGAAGGGAACAGACCCCTGTGCATGTTCAAGTTAAGATGGCGTTGAAGTATTTCTTCCC
TTTGTATGCTTTTACCCACTTCTTGGCAAGTATAAATGTTGGCATTACCTGGCCTGTTGGTTGTGCTAAAGTTCACCAACTTCTTTTGATGATGATTGTAGGGGAGAGCT
GCATCTTCCATACTCACAAAGAGAAGGTTTATGTTTTGTGTTCACAAATCATTCAGGTTCGGCTTATCAAGGAGTTGTATGGAAACCAAATCAGGGAGTTGTATCACAGC
CTTCCATTCCATATGATTGAATTTGTGCTAGATGATTCTGACTGCACAGTGACTGTCAGTCTGAGATACGCAGATCTTATCTATGTGCTGCCGTCTGAAATTAGCGTGCT
TGCATGGCCAATGCCCCAGATGAAGAAAAATACCATAAATTCATCAATCTTGAGCATCAAGAAGGAAAATGGAGGAACTGTAGTTCATCCTATTCCAGCTCGTCTATCAT
ATGCAGAGGATGCTTTACGAACCATGAGCTTACCAGAAGCATATGCAGAGATCGTGTTGAATTTGCCCCGAGCTATACAACAGATGTTTCCGCCAAAACCTCACACATAG
Protein sequenceShow/hide protein sequence
MESDIEESFKKKARKNPHHGESSRKLGFLFTSRLDSVITFYDLKSPSVVFCMHQPLQKYHLELQFSSFRVLRIPVSFVPPITRVESLWKKAILAHCNLMLQRTDSSSFAV
SNVLKRHSDLSERLSRDSDKMIFERLQKEFEAARASETQEVYLDGEQWNDGLLATIRERVHMEAERKAMPEEADMLPQEKITYKVGTKVICCLEGARIGIQYETSFAAQL
TKALAPVFDLRLRELEIPLFSGLNPYEWLHQVERYFVVNRMTDSAKVEAAVLCLDGVKFLSVSFSAGDPCELYHCVLESKSFLEKMTVLEHTIPFFLPVREAENDLLSSN
AMKFIDHIGELLQAYVDRREQTPVHVQVKMALKYFFPLYAFTHFLASINVGITWPVGCAKVHQLLLMMIVGESCIFHTHKEKVYVLCSQIIQVRLIKELYGNQIRELYHS
LPFHMIEFVLDDSDCTVTVSLRYADLIYVLPSEISVLAWPMPQMKKNTINSSILSIKKENGGTVVHPIPARLSYAEDALRTMSLPEAYAEIVLNLPRAIQQMFPPKPHT