CuGenDBv2

Sgr024730 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024730
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein CDC73 homolog
Genome location: tig00002486:2294574..2298568
RNA-Seq Expression: Sgr024730
Synteny: Sgr024730
Gene Ontology terms:
GO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0016570 - histone modification (biological process)
GO:0016593 - Cdc73/Paf1 complex (cellular component)
InterPro domains:
IPR007852 - Cdc73/Parafibromin
IPR031336 - Cell division control protein 73, C-terminal
IPR032041 - Paf1 complex subunit Cdc73, N-terminal domain
IPR038103 - Cell division control protein 73, C-terminal domain superfamily
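The genome location above follows a contig:start..end pattern. As a minimal sketch, the string can be split into its parts and the gene span computed. The names Location and parse_location are hypothetical helpers introduced here, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'tig00002486:2294574..2298568'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00002486:2294574..2298568")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 3995 bp on contig tig00002486; with half-open coordinates the figure would be one less.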


Homology
GenBank top hits | e value | % identity | Alignment
XP_004134132.1 protein CDC73 homolog [Cucumis sativus] | 3.0e-188 | 85.75
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVNDEFRF SDYSFPCS ETAYRSKQGNLYT+ETLVYYIKNHH+KHTEYLQNARTQGI  VTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAA+DAVDEDDGFKDSTNVDYM+MIRAIERPLKDRESLLECKNRNFYNVLV STKREEERQR+ESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRGLV                                     TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

XP_008438653.1 PREDICTED: protein CDC73 homolog [Cucumis melo] | 2.3e-188 | 85.75
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVNDEFRF SDYSFPCS ETAYRSKQGNLYT+ETLVYYIKNHH+KHTEYLQNARTQGI  VTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAA+DAVDEDDGFKDSTNVDYM+MIRAIERPLKDRESLLECKNRNFYNVLV STKREEERQR+ESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRGLV                                     TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

XP_022138180.1 protein CDC73 homolog [Momordica charantia] | 1.4e-190 | 87.25
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVNDEFRF +DYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDED G KDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRGLV                                     TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

XP_022934371.1 protein CDC73 homolog [Cucurbita moschata] | 6.1e-189 | 86.75
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVN EFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGI  VTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYM+MIRAIERPLKDRESLLECKNRNFYNVLV STKREEERQRIESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGL-----------------------------------VDMTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRGL                                      TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGL-----------------------------------VDMTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

XP_023539477.1 protein CDC73 homolog [Cucurbita pepo subsp. pepo] | 1.0e-188 | 86.5
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVN EFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGI  VTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYM+MIRAIERPLKDRESLLECKNRNFYNVLV STKREEERQRIESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGL-----------------------------------VDMTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRG+                                      TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGL-----------------------------------VDMTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

TrEMBL top hits | e value | % identity | Alignment
A0A1S3AWX0 protein CDC73 homolog | 1.1e-188 | 85.75
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVNDEFRF SDYSFPCS ETAYRSKQGNLYT+ETLVYYIKNHH+KHTEYLQNARTQGI  VTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAA+DAVDEDDGFKDSTNVDYM+MIRAIERPLKDRESLLECKNRNFYNVLV STKREEERQR+ESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRGLV                                     TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

A0A5A7U760 Protein CDC73-like protein | 1.1e-188 | 85.75
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVNDEFRF SDYSFPCS ETAYRSKQGNLYT+ETLVYYIKNHH+KHTEYLQNARTQGI  VTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAA+DAVDEDDGFKDSTNVDYM+MIRAIERPLKDRESLLECKNRNFYNVLV STKREEERQR+ESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRGLV                                     TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

A0A6J1C8Q5 protein CDC73 homolog | 7.0e-191 | 87.25
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVNDEFRF +DYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDED G KDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRGLV                                     TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGLVD-----------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

A0A6J1F1M2 protein CDC73 homolog | 2.9e-189 | 86.75
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVN EFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGI  VTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYM+MIRAIERPLKDRESLLECKNRNFYNVLV STKREEERQRIESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGL-----------------------------------VDMTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRGL                                      TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGL-----------------------------------VDMTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

A0A6J1IB78 protein CDC73 homolog | 2.9e-189 | 86.75
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLSALRDFTIRGELDKIVRVN EFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGI  VTFPDRKPLLDYLTGKVSSS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD
        DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYM+MIRAIERPLKDRESLLECKNRNFYNVLV STKREEERQRIESQQRKD
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKD

Query:  GLVAKSRLMGSDDRGL-----------------------------------VDMTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
        GLVAKSRLMGSDDRGL                                      TLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD
Subjt:  GLVAKSRLMGSDDRGL-----------------------------------VDMTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRD

Query:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
        RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK         +DSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF
Subjt:  RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF

SwissProt top hits | e value | % identity | Alignment
Q4V8C8 Parafibromin | 8.4e-24 | 24.28
Query:  DPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAY----RSKQG---NLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLT
        D LS LR + I+ +  +IV   DE  FG ++S+P + +T Y      K+G     YT++++++ + N H+ H  Y++ A T+ IP V  PDRK LL YL 
Subjt:  DPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAY----RSKQG---NLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLT

Query:  GKVSSSDAIEFLVP----------------------QNPKFPD---------------------LPSVDEYRPEDPVIVGAAIDAV--------------
        G+ S+S +I+   P                      + P+  D                     +   ++ R     ++   I A+              
Subjt:  GKVSSSDAIEFLVP----------------------QNPKFPD---------------------LPSVDEYRPEDPVIVGAAIDAV--------------

Query:  DEDD--------GFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNF----YNVLVASTKREEERQ----------------------------------
        D DD         F D+  VD    I + ER  + R ++L+   +NF    + +L +   REE R                                   
Subjt:  DEDD--------GFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNF----YNVLVASTKREEERQ----------------------------------

Query:  ---------RIESQQRKDGLVAKSRLMGSDDR----------------------------------GLVDMTLITIYNVKEFLEDGVFIPTDVKVKQMKG
                 +I++     G+  KS   G+  R                                       +LIT+ N K+ L+D  F+P+D K KQ   
Subjt:  ---------RIESQQRKDGLVAKSRLMGSDDR----------------------------------GLVDMTLITIYNVKEFLEDGVFIPTDVKVKQMKG

Query:  ARPDCVTVQKKFSRDRD----RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPF----KDHVEIFN-------KNDSLESAKNVKQWNVKIIS
           + +  ++K           V   Y V D+P  L  +DWDRVVAVFV G  WQFK WP+       V+IF        K D +    NV++W+V ++ 
Subjt:  ARPDCVTVQKKFSRDRD----RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPF----KDHVEIFN-------KNDSLESAKNVKQWNVKIIS

Query:  ISKNKRHQDRAAALEVWDRLEEF
        +S +KRH DR   L  W+ L+ +
Subjt:  ISKNKRHQDRAAALEVWDRLEEF

Q5ZLM0 Parafibromin | 7.1e-23 | 24.52
Query:  DPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAY----RSKQG---NLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLT
        D LS LR +  + +  +IV   DE  FG ++S+P + +T Y      K+G     YT++++++ + N H+ H  Y++ A T+ IP V  PDRK LL YL 
Subjt:  DPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAY----RSKQG---NLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLT

Query:  GKVSSSDAIEFLVP----------------------QNPKFPD---------------------------LPSVDEYRPEDPVIVGAAIDA---------
        G+ S+S +I+   P                      + P+  D                           + S+ E    + +   AAI A         
Subjt:  GKVSSSDAIEFLVP----------------------QNPKFPD---------------------------LPSVDEYRPEDPVIVGAAIDA---------

Query:  --VDEDD--------GFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNF----YNVLVASTKREEERQ-------------------------------
           D DD         F D+  VD    I + ER  + R ++L+   +NF    + +L +   REE R                                
Subjt:  --VDEDD--------GFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNF----YNVLVASTKREEERQ-------------------------------

Query:  ------------RIESQQRKDGLVAKSRLMGSDDR----------------------------------GLVDMTLITIYNVKEFLEDGVFIPTDVKVKQ
                    +I++     G+  KS   G+  R                                       +LIT+ N K+ L+D  F+P+D K KQ
Subjt:  ------------RIESQQRKDGLVAKSRLMGSDDR----------------------------------GLVDMTLITIYNVKEFLEDGVFIPTDVKVKQ

Query:  MKGARPDCVTVQKKFSRDRD----RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPF----KDHVEIFN-------KNDSLESAKNVKQWNVK
              + +  ++K           V   Y V D+P  L  +DWDRVVAVFV G  WQFK WP+       V+IF        K D +    NV++W+V 
Subjt:  MKGARPDCVTVQKKFSRDRD----RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPF----KDHVEIFN-------KNDSLESAKNVKQWNVK

Query:  IISISKNKRHQDRAAALEVWDRLEEF
        ++ +S +KRH DR   L  W+ L+ +
Subjt:  IISISKNKRHQDRAAALEVWDRLEEF

Q6P1J9 Parafibromin | 2.4e-23 | 24.71
Query:  DPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAY----RSKQG---NLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLT
        D LS LR + I+ +  +IV   DE  FG ++S+P + +T Y      K+G     YT++++++ + N H+ H  Y++ A T+ IP V  PDRK LL YL 
Subjt:  DPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAY----RSKQG---NLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLT

Query:  GKVSSSDAIEFLVP----------------------QNPKFPD---------------------------LPSVDEYRPEDPVIVGAAIDA---------
        G+ S+S +I+   P                      + P+  D                           + S+ E    + +   AAI A         
Subjt:  GKVSSSDAIEFLVP----------------------QNPKFPD---------------------------LPSVDEYRPEDPVIVGAAIDA---------

Query:  --VDEDD--------GFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNF----YNVLVASTKREEERQ-------------------------------
           D DD         F D+  VD    I + ER  + R ++L+   +NF    + +L +   REE R                                
Subjt:  --VDEDD--------GFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNF----YNVLVASTKREEERQ-------------------------------

Query:  ------------RIESQQRKDGLVAKSRLMGSDDR----------------------------------GLVDMTLITIYNVKEFLEDGVFIPTDVKVKQ
                    +I++     G+  KS   G+  R                                       +LIT+ N K+ L+D  F+P+D K KQ
Subjt:  ------------RIESQQRKDGLVAKSRLMGSDDR----------------------------------GLVDMTLITIYNVKEFLEDGVFIPTDVKVKQ

Query:  MKGARPDCVTVQKKFSRDRD----RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPF----KDHVEIFN-------KNDSLESAKNVKQWNVK
              + +  ++K           V   Y V D+P  L  +DWDRVVAVFV G  WQFK WP+       V+IF        K D +    NV++W+V 
Subjt:  MKGARPDCVTVQKKFSRDRD----RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPF----KDHVEIFN-------KNDSLESAKNVKQWNVK

Query:  IISISKNKRHQDRAAALEVWDRLEEF
        ++ +S +KRH DR   L  W+ L+ +
Subjt:  IISISKNKRHQDRAAALEVWDRLEEF

Q8JZM7 Parafibromin | 2.4e-23 | 24.71
Query:  DPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAY----RSKQG---NLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLT
        D LS LR + I+ +  +IV   DE  FG ++S+P + +T Y      K+G     YT++++++ + N H+ H  Y++ A T+ IP V  PDRK LL YL 
Subjt:  DPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAY----RSKQG---NLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLT

Query:  GKVSSSDAIEFLVP----------------------QNPKFPD---------------------------LPSVDEYRPEDPVIVGAAIDA---------
        G+ S+S +I+   P                      + P+  D                           + S+ E    + +   AAI A         
Subjt:  GKVSSSDAIEFLVP----------------------QNPKFPD---------------------------LPSVDEYRPEDPVIVGAAIDA---------

Query:  --VDEDD--------GFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNF----YNVLVASTKREEERQ-------------------------------
           D DD         F D+  VD    I + ER  + R ++L+   +NF    + +L +   REE R                                
Subjt:  --VDEDD--------GFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNF----YNVLVASTKREEERQ-------------------------------

Query:  ------------RIESQQRKDGLVAKSRLMGSDDR----------------------------------GLVDMTLITIYNVKEFLEDGVFIPTDVKVKQ
                    +I++     G+  KS   G+  R                                       +LIT+ N K+ L+D  F+P+D K KQ
Subjt:  ------------RIESQQRKDGLVAKSRLMGSDDR----------------------------------GLVDMTLITIYNVKEFLEDGVFIPTDVKVKQ

Query:  MKGARPDCVTVQKKFSRDRD----RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPF----KDHVEIFN-------KNDSLESAKNVKQWNVK
              + +  ++K           V   Y V D+P  L  +DWDRVVAVFV G  WQFK WP+       V+IF        K D +    NV++W+V 
Subjt:  MKGARPDCVTVQKKFSRDRD----RVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPF----KDHVEIFN-------KNDSLESAKNVKQWNVK

Query:  IISISKNKRHQDRAAALEVWDRLEEF
        ++ +S +KRH DR   L  W+ L+ +
Subjt:  IISISKNKRHQDRAAALEVWDRLEEF

Q9LJ87 Protein CDC73 homolog | 7.3e-137 | 61.52
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLS L++FTIRG++DKI RV   +RFGS+YSFPC+ ETAYRSK G+LYT+E LV+Y+KN  +KH EY+Q+     +P VT PDRKPLLDYLTG+V+SS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDP----VIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQ
        D+I+FL+ Q          +EYRP+      V    AI  ++ +D  K   +VDY+ +IR+ ERPLK R+++L+CKNR+FY+VLV STKREEERQRIES 
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDP----VIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQ

Query:  QRKDGLVAKSRLMGSDDRGLVD---------------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQ
        QRKDGLVAKSRLMG+++RG+V                                         TLITIYNVKEFLEDGV+IP DVK K+MKG +PDC+TVQ
Subjt:  QRKDGLVAKSRLMGSDDRGLVD---------------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQ

Query:  KKFSRDRDRVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALE
        KKFSRDR+RVVTAYEVRDKPSALK +DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNK         +DS+ESAK VKQWNVKIISISKNKRHQDRAAALE
Subjt:  KKFSRDRDRVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALE

Query:  VWDRLEEF
        VW++LEEF
Subjt:  VWDRLEEF

Arabidopsis top hits | e value | % identity | Alignment
AT3G22590.1 PLANT HOMOLOGOUS TO PARAFIBROMIN | 5.2e-138 | 61.52
Query:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS
        MDPLS L++FTIRG++DKI RV   +RFGS+YSFPC+ ETAYRSK G+LYT+E LV+Y+KN  +KH EY+Q+     +P VT PDRKPLLDYLTG+V+SS
Subjt:  MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSS

Query:  DAIEFLVPQNPKFPDLPSVDEYRPEDP----VIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQ
        D+I+FL+ Q          +EYRP+      V    AI  ++ +D  K   +VDY+ +IR+ ERPLK R+++L+CKNR+FY+VLV STKREEERQRIES 
Subjt:  DAIEFLVPQNPKFPDLPSVDEYRPEDP----VIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQ

Query:  QRKDGLVAKSRLMGSDDRGLVD---------------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQ
        QRKDGLVAKSRLMG+++RG+V                                         TLITIYNVKEFLEDGV+IP DVK K+MKG +PDC+TVQ
Subjt:  QRKDGLVAKSRLMGSDDRGLVD---------------------------------------MTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQ

Query:  KKFSRDRDRVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALE
        KKFSRDR+RVVTAYEVRDKPSALK +DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNK         +DS+ESAK VKQWNVKIISISKNKRHQDRAAALE
Subjt:  KKFSRDRDRVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK---------NDSLESAKNVKQWNVKIISISKNKRHQDRAAALE

Query:  VWDRLEEF
        VW++LEEF
Subjt:  VWDRLEEF
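
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. A minimal sketch of how identity and positive counts could be read off one block follows. The function name alignment_stats is hypothetical, and the denominator used here (the block length) is an assumption; the % identity values reported on the page may be computed over a different length.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a query line and its midline.

    Assumes both strings cover the same alignment columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter in the midline that equals the query residue is an identity.
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and q == m)
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100.0 * identities / length,
    }


# Two short slices taken from the first GenBank alignment block.
stats_a = alignment_stats("RFGSDYSFPCSAETAY", "RF SDYSFPCS ETAY")
stats_b = alignment_stats("LYTIETLVYY", "LYT+ETLVYY")
print(stats_a)
print(stats_b)
```

Applied per block and summed over a whole alignment, the same counts give an overall identity figure, subject to how gap columns are treated.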


Sequences
CDS sequence
ATGGATCCTCTCTCAGCTCTCAGAGACTTCACCATCCGAGGCGAGCTCGATAAAATCGTCCGAGTCAACGACGAGTTCCGTTTCGGCTCCGACTATTCTTTCCCTTGCTC
CGCCGAGACTGCTTACCGGTCCAAGCAGGGCAATCTTTACACAATTGAAACCCTCGTCTACTACATAAAGAACCACCATATCAAGCACACAGAGTATCTTCAGAACGCTC
GCACTCAGGGAATTCCTCCTGTCACTTTCCCCGACCGGAAACCCCTCCTCGATTATCTTACCGGCAAGGTTTCCTCCTCCGACGCCATCGAGTTTCTTGTCCCTCAAAAT
CCGAAGTTCCCAGATTTGCCCTCCGTGGATGAGTATCGCCCTGAAGACCCGGTGATCGTTGGCGCAGCCATCGACGCGGTGGATGAGGACGACGGTTTTAAGGATTCTAC
CAATGTCGATTATATGTCGATGATTAGGGCTATCGAGAGACCGTTGAAGGACCGAGAATCGTTGTTGGAATGCAAGAATAGGAATTTCTATAACGTGCTCGTGGCGTCGA
CGAAGCGCGAGGAGGAAAGGCAGCGTATTGAATCGCAACAGAGGAAGGACGGTTTGGTAGCCAAGAGTAGGTTGATGGGTTCCGACGACAGGGGTTTGGTGGATATGACT
CTAATCACGATATATAATGTGAAAGAGTTTTTAGAAGATGGTGTTTTTATACCTACAGATGTCAAGGTCAAGCAGATGAAGGGGGCGAGGCCCGACTGTGTGACCGTGCA
GAAGAAGTTCAGTAGGGACAGAGATAGGGTAGTGACGGCGTACGAGGTTAGGGACAAGCCTTCGGCTCTGAAATCGGAGGACTGGGACCGGGTTGTAGCTGTTTTCGTAT
TGGGAAAGGAATGGCAGTTCAAAGATTGGCCTTTTAAGGACCATGTTGAGATTTTTAATAAAAACGACAGTTTGGAATCAGCTAAGAATGTGAAGCAGTGGAATGTTAAA
ATTATTTCGATTAGCAAGAACAAGCGGCATCAAGACCGAGCTGCAGCATTGGAGGTGTGGGATAGGCTAGAAGAATTTGCTAACTGGCAAGAAGATCAATACAACTTCGA
AGCTGAAGCATATATAGAAGCCAGCAAGATGGAGAATAGCAGGGGTTTTGACAAATGGATGGAGCTTCCGTCTGATGAGTTGTTGTCTGTTGATGGAATAGTCTTAGACC
CAGTTCCTGCACAAACAGTTAGATTCTGGTGCTCCTGCCGGAGAGTCAGTAGGAGAAGCAACTGGCAAAATTTTCAGAAGTTTTGCACATGCATGCTTACCGTTGCAGTT
GCTGATGGAAGGAGTCTGAACATTGCAAGCTCGAGGCAGGGCGAGAGCCTGGGTTTGGTTGATATTGATCCCAGCGATGCGCCGCCACCATTTAGGACCTGGCACAAGCA
CTGTGGTTGTGAGCGGACAACGCTTGCAAGCTGTGTGCAGCATGCTGATGATGGGGTAGAGGAGTTCCCTGTAATGTAA
mRNA sequence
ATGGATCCTCTCTCAGCTCTCAGAGACTTCACCATCCGAGGCGAGCTCGATAAAATCGTCCGAGTCAACGACGAGTTCCGTTTCGGCTCCGACTATTCTTTCCCTTGCTC
CGCCGAGACTGCTTACCGGTCCAAGCAGGGCAATCTTTACACAATTGAAACCCTCGTCTACTACATAAAGAACCACCATATCAAGCACACAGAGTATCTTCAGAACGCTC
GCACTCAGGGAATTCCTCCTGTCACTTTCCCCGACCGGAAACCCCTCCTCGATTATCTTACCGGCAAGGTTTCCTCCTCCGACGCCATCGAGTTTCTTGTCCCTCAAAAT
CCGAAGTTCCCAGATTTGCCCTCCGTGGATGAGTATCGCCCTGAAGACCCGGTGATCGTTGGCGCAGCCATCGACGCGGTGGATGAGGACGACGGTTTTAAGGATTCTAC
CAATGTCGATTATATGTCGATGATTAGGGCTATCGAGAGACCGTTGAAGGACCGAGAATCGTTGTTGGAATGCAAGAATAGGAATTTCTATAACGTGCTCGTGGCGTCGA
CGAAGCGCGAGGAGGAAAGGCAGCGTATTGAATCGCAACAGAGGAAGGACGGTTTGGTAGCCAAGAGTAGGTTGATGGGTTCCGACGACAGGGGTTTGGTGGATATGACT
CTAATCACGATATATAATGTGAAAGAGTTTTTAGAAGATGGTGTTTTTATACCTACAGATGTCAAGGTCAAGCAGATGAAGGGGGCGAGGCCCGACTGTGTGACCGTGCA
GAAGAAGTTCAGTAGGGACAGAGATAGGGTAGTGACGGCGTACGAGGTTAGGGACAAGCCTTCGGCTCTGAAATCGGAGGACTGGGACCGGGTTGTAGCTGTTTTCGTAT
TGGGAAAGGAATGGCAGTTCAAAGATTGGCCTTTTAAGGACCATGTTGAGATTTTTAATAAAAACGACAGTTTGGAATCAGCTAAGAATGTGAAGCAGTGGAATGTTAAA
ATTATTTCGATTAGCAAGAACAAGCGGCATCAAGACCGAGCTGCAGCATTGGAGGTGTGGGATAGGCTAGAAGAATTTGCTAACTGGCAAGAAGATCAATACAACTTCGA
AGCTGAAGCATATATAGAAGCCAGCAAGATGGAGAATAGCAGGGGTTTTGACAAATGGATGGAGCTTCCGTCTGATGAGTTGTTGTCTGTTGATGGAATAGTCTTAGACC
CAGTTCCTGCACAAACAGTTAGATTCTGGTGCTCCTGCCGGAGAGTCAGTAGGAGAAGCAACTGGCAAAATTTTCAGAAGTTTTGCACATGCATGCTTACCGTTGCAGTT
GCTGATGGAAGGAGTCTGAACATTGCAAGCTCGAGGCAGGGCGAGAGCCTGGGTTTGGTTGATATTGATCCCAGCGATGCGCCGCCACCATTTAGGACCTGGCACAAGCA
CTGTGGTTGTGAGCGGACAACGCTTGCAAGCTGTGTGCAGCATGCTGATGATGGGGTAGAGGAGTTCCCTGTAATGTAA
Protein sequence
MDPLSALRDFTIRGELDKIVRVNDEFRFGSDYSFPCSAETAYRSKQGNLYTIETLVYYIKNHHIKHTEYLQNARTQGIPPVTFPDRKPLLDYLTGKVSSSDAIEFLVPQN
PKFPDLPSVDEYRPEDPVIVGAAIDAVDEDDGFKDSTNVDYMSMIRAIERPLKDRESLLECKNRNFYNVLVASTKREEERQRIESQQRKDGLVAKSRLMGSDDRGLVDMT
LITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQKKFSRDRDRVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKNDSLESAKNVKQWNVK
IISISKNKRHQDRAAALEVWDRLEEFANWQEDQYNFEAEAYIEASKMENSRGFDKWMELPSDELLSVDGIVLDPVPAQTVRFWCSCRRVSRRSNWQNFQKFCTCMLTVAV
ADGRSLNIASSRQGESLGLVDIDPSDAPPPFRTWHKHCGCERTTLASCVQHADDGVEEFPVM
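
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the page does not state which table was used. The translate function is a hypothetical helper, and the two fragments are the first 27 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGATCCTCTCTCAGCTCTCAGAGAC"  # first 9 codons of the CDS
cds_end = "GAGTTCCCTGTAATGTAA"            # last 5 codons plus the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MDPLSALRD and EFPVM, which match the first and last residues of the protein sequence listed above. Passing the full CDS with its line breaks to the same function is a quick consistency check on the record.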