CuGenDBv2

CmoCh14G020040 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh14G020040
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: WD-40 repeat-containing protein MSI1-like
Genome location: Cmo_Chr14:14695947..14699323
RNA-Seq Expression: CmoCh14G020040
Synteny: CmoCh14G020040
Gene Ontology terms: GO:0010468 - regulation of gene expression (biological process)
GO:0005634 - nucleus (cellular component)
GO:0042393 - histone binding (molecular function)
InterPro domains: IPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR022052 - Histone-binding protein RBBP4, N-terminal
IPR036322 - WD40-repeat-containing domain superfamily
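
The genome location in the record above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the string could be split and the gene span computed as follows. `parse_location` is a hypothetical helper name, not something provided by CuGenDB.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumption: start and end are 1-based and inclusive,
    so the span length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    length = end - start + 1
    return chrom, start, end, length

chrom, start, end, length = parse_location("Cmo_Chr14:14695947..14699323")
print(chrom, start, end, length)
```

Under that assumption the gene spans 3377 bp on Cmo_Chr14, which includes introns and untranslated regions as well as the coding sequence.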


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6582525.1 WD-40 repeat-containing protein MSI1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.2e-166 | % identity: 92.21
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKM LGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRASAGGFGCANGKVQIIQHINHDGEVNRAR MPQNPFIIATKTVSAEVLVFDYS+HPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFK---------------------GHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNC
        SHICLWDINATPDNKTLEAMQIFK                     GHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNC
Subjt:  SHICLWDINATPDNKTLEAMQIFK---------------------GHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNC

Query:  LTFNPFNE
        LTFNPFNE
Subjt:  LTFNPFNE

KAG7018909.1 WD-40 repeat-containing protein MSI1, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.0e-152 | % identity: 90.69
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKM LGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRASAGGFGCANGKVQIIQHINHDGEVNRAR MPQNPFIIATKTVSAEVLVFDYS+HPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQ---SEVNCLTFNPFNE
        SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLH                +V H+     VNCLTFNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQ---SEVNCLTFNPFNE

XP_004133950.1 WD-40 repeat-containing protein MSI1 [Cucumis sativus] | e value: 6.0e-149 | % identity: 87.11
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        M K D+E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKDYSVQKM LGTH+ +N+PNYLMLAQVQLPLE+SENDARHY 
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRA AGGFGCANGKVQIIQ INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYS+HPSKPPLDGTCNPDLRLRGH +EGYGLSW+KFKQGHLLSGS+D
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        + ICLWDINATP NKTLEAMQIFK HEGVV DVAWH+RHEYLFGSVGDD+YL VWDLR+ P ANKPVQSVVAHQSEVNCL FNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

XP_022980301.1 WD-40 repeat-containing protein MSI1-like [Cucurbita maxima] | e value: 5.6e-163 | % identity: 95.47
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKM LGTHSFDNKPNYLMLAQVQLPLE+SENDARHYR
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRAS GGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEV VFDYS+HPS PPLDG CNPDLRLRGHKSEGYGLSW+KFKQGHLLSGSED
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        SHICLWDINATPDNKTLEAMQIFKGHEGVVGD+AWHMRHEYLFGSVG+DRYLHVWDLRS P ANKPVQSVVAHQSEV CLTFNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

XP_023528376.1 WD-40 repeat-containing protein MSI1-like [Cucurbita pepo subsp. pepo] | e value: 9.2e-166 | % identity: 97.21
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        MEKGD+EIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKM LGTHSFDNKPNYLM+AQVQL LENSENDARHYR
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYS+HPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHM HEYLFGSVGDDRYLHVWDLRS P ANKPVQSVVAHQSEVNCLTFNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L971 WD_REPEATS_REGION domain-containing protein | e value: 2.9e-149 | % identity: 87.11
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        M K D+E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKDYSVQKM LGTH+ +N+PNYLMLAQVQLPLE+SENDARHY 
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRA AGGFGCANGKVQIIQ INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYS+HPSKPPLDGTCNPDLRLRGH +EGYGLSW+KFKQGHLLSGS+D
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        + ICLWDINATP NKTLEAMQIFK HEGVV DVAWH+RHEYLFGSVGDD+YL VWDLR+ P ANKPVQSVVAHQSEVNCL FNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

A0A1S3AWJ3 WD-40 repeat-containing protein MSI1 | e value: 2.9e-149 | % identity: 87.11
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        M K D+E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKDYSVQKM LGTH+ +N+PNYLMLAQVQLPLE+SENDARHY 
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRA AGGFGCANGKVQIIQ INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYS+HPSKPPLDGTCNPDLRLRGH +EGYGLSW+KFKQGHLLSGS+D
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        + ICLWDINATP NKTLEAMQIFK HEGVV DVAWH+RHEYLFGSVGDD+YL VWDLR+ P ANKPVQSVVAHQSEVNCL FNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

A0A5A7U4A1 WD-40 repeat-containing protein MSI1 | e value: 2.9e-149 | % identity: 87.11
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        M K D+E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKDYSVQKM LGTH+ +N+PNYLMLAQVQLPLE+SENDARHY 
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRA AGGFGCANGKVQIIQ INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYS+HPSKPPLDGTCNPDLRLRGH +EGYGLSW+KFKQGHLLSGS+D
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        + ICLWDINATP NKTLEAMQIFK HEGVV DVAWH+RHEYLFGSVGDD+YL VWDLR+ P ANKPVQSVVAHQSEVNCL FNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

A0A6J1ICW4 WD-40 repeat-containing protein MSI1 | e value: 8.5e-149 | % identity: 86.41
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        M K D+E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKDYSVQKM LGTH+ +N+PNYLMLAQVQLPLE+SENDARHY 
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRA AGGFGCANGKVQIIQ INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYS+HPSKPPLDGTCNPDLRLRGH +EGYGLSW+KFKQGHLLSGS+D
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        + ICLWDINATP NKTLEAMQIFK HEGVV DVAWH+RHEYLFGSVGDD+YL +WDLR+ P  NKPVQSVVAHQSEVNCL FNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

A0A6J1IT85 WD-40 repeat-containing protein MSI1-like | e value: 2.7e-163 | % identity: 95.47
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKM LGTHSFDNKPNYLMLAQVQLPLE+SENDARHYR
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDRAS GGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEV VFDYS+HPS PPLDG CNPDLRLRGHKSEGYGLSW+KFKQGHLLSGSED
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        SHICLWDINATPDNKTLEAMQIFKGHEGVVGD+AWHMRHEYLFGSVG+DRYLHVWDLRS P ANKPVQSVVAHQSEV CLTFNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

SwissProt top hits (columns: e value, % identity, alignment)
O22466 WD-40 repeat-containing protein MSI1 | e value: 2.4e-148 | % identity: 84.32
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        M K + E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EP GKDYSVQKM LGTH+ +N+PNYLMLAQVQLPLE++ENDARHY 
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDR+  GGFGCANGKVQIIQ INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYS+HPSKPPLDG CNPDLRLRGH +EGYGLSW++FKQGHLLSGS+D
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        SHICLWDINATP NK LEAMQIFK HEGVV DVAWH+RHEYLFGSVGDD+YLHVWDLR+ P   KP+QSVVAHQSEVNCL FNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

O22467 Histone-binding protein MSI1 | e value: 7.4e-142 | % identity: 80.49
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        M K ++E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EP GKDYSVQKM LGTH+ +++PNYLMLAQVQLPL+++E++AR Y 
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDR+  GGFGCA GKVQIIQ INHDGEVNRARYMPQNPFIIATKTV+AEV VFDYS+HPSKPPLDG CNPDL+LRGH SEGYGLSW+KFKQGHLLSGS+D
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        + ICLWDINATP NK+L+A QIFK HEGVV DVAWH+RHEYLFGSVGDD+YL +WDLRS P A+KPVQSVVAH  EVNCL FNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

Q10G81 Histone-binding protein MSI1 homolog | e value: 6.7e-135 | % identity: 78.09
Query:  DKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRA
        ++E R +  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTV+WLPDR EP GKD+SVQKM LGTH+ DN+PNYLMLAQVQLPL+++E DARHY DD A
Subjt:  DKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRA

Query:  SAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHIC
          GGFG A+GKVQI+Q INHDGEVNRARYMPQN FIIATKTVSAEV VFDYS+HPSKPPLDG CNPDLRL+GH SEGYGLSW+ FK+GHLLSGS+D+ IC
Subjt:  SAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHIC

Query:  LWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        LWDI A   NKTL+A+QIFK H+GVV DVAWH+RHEYLFGSVGDD  L +WDLRS P++ KPVQSV AHQ EVNCL FNPFNE
Subjt:  LWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

Q3MHL3 Histone-binding protein RBBP4 | e value: 2.8e-101 | % identity: 61.82
Query:  ERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFGCAN
        ER++NEEYKIWKKNTPFLYDL++THALEWPSLT +WLPD   P GKD+S+ ++ LGTH+ D + N+L++A VQLP ++++ DA HY  ++   GGFG  +
Subjt:  ERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFGCAN

Query:  GKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATP-
        GK++I   INH+GEVNRARYMPQNP IIATKT S++VLVFDY++HPSKP   G CNPDLRLRGH+ EGYGLSWN    GHLLS S+D  ICLWDI+A P 
Subjt:  GKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATP-

Query:  DNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        + K ++A  IF GH  VV DV+WH+ HE LFGSV DD+ L +WD RS    +KP  SV AH +EVNCL+FNP++E
Subjt:  DNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

Q9W7I5 Histone-binding protein RBBP4 | e value: 2.8e-101 | % identity: 61.82
Query:  ERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFGCAN
        ER++NEEYKIWKKNTPFLYDL++THALEWPSLT +WLPD   P GKD+S+ ++ LGTH+ D + N+L++A VQLP ++++ DA HY  ++   GGFG  +
Subjt:  ERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFGCAN

Query:  GKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATP-
        GK++I   INH+GEVNRARYMPQNP IIATKT S++VLVFDY++HPSKP   G CNPDLRLRGH+ EGYGLSWN    GHLLS S+D  ICLWDI+A P 
Subjt:  GKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATP-

Query:  DNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        + K ++A  IF GH  VV DV+WH+ HE LFGSV DD+ L +WD RS    +KP  SV AH +EVNCL+FNP++E
Subjt:  DNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G16780.1 Transducin family protein / WD-40 repeat family protein | e value: 9.4e-76 | % identity: 48.61
Query:  EIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD--YSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRA
        E + +T    V E++ +WKKNTPFLYDL+I+H LEWPSLTV W+P    P   D  + V K+ LGTH+  +  ++LM+A V  P  N+E           
Subjt:  EIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD--YSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRA

Query:  SAGGFGCAN-----GKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSE
           G G AN      KV+I Q I  DGEVNRAR MPQ P ++  KT   EV +FDY++H +K      C+PDLRL GH  EGYGLSW+ FK+G+LLSGS+
Subjt:  SAGGFGCAN-----GKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSE

Query:  DSHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        D  ICLWD++ATP +K L AM +++GHE  + DV+WHM++E LFGS G+D  L +WD R+    N+    V  H+ EVN L+FNPFNE
Subjt:  DSHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

AT2G19520.1 Transducin family protein / WD-40 repeat family protein | e value: 1.7e-29 | % identity: 30.45
Query:  KEIRGKTNER-LVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLA--QVQLPLENSENDARHYRDD
        KE   KT +   V+E+Y  WK   P LYD +  H L WPSL+  W P  ++   K+   Q++ L   +  + PN L++A  +V  P   +      + ++
Subjt:  KEIRGKTNER-LVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLA--QVQLPLENSENDARHYRDD

Query:  RASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCN--PDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
          S          V+  + I H GEVNR R +PQN  I+AT T S +VL++D    P++  + G  N  PDL L GH+            +  +LSG +D
Subjt:  RASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCN--PDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDI----------------------NATPDNK--TLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSV-VAHQS
          + LW I                        T  N+  T+    ++ GHE  V DVA+       F SVGDD  L +WD R+      PV  V  AH +
Subjt:  SHICLWDI----------------------NATPDNK--TLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSV-VAHQS

Query:  EVNCLTFNPFNE
        +++C+ +NP ++
Subjt:  EVNCLTFNPFNE

AT4G29730.1 nucleosome/chromatin assembly factor group C5 | e value: 1.1e-26 | % identity: 28.24
Query:  KTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFG
        ++ +  V++ Y  WK   P LYD  + H L WPSL+  W P  ++   K    Q++ L   +  + PN L++A  +  +    N+  H            
Subjt:  KTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFG

Query:  CANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGT--CNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDI
          +  V+  + I H GEVNR R +PQN  I+AT T S ++L+++    P +  + G     PDL L GH+ +          +  +LSG +D  + LW+I
Subjt:  CANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGT--CNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDI

Query:  N-----ATPDNK-------------------TLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSV-VAHQSEVNCLTFN
              A  D+K                   ++    I+ GH+  V DVA+       F SVGDD  L +WD R+      P   V  AH ++++C+ +N
Subjt:  N-----ATPDNK-------------------TLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSV-VAHQSEVNCLTFN

Query:  P
        P
Subjt:  P

AT4G35050.1 Transducin family protein / WD-40 repeat family protein | e value: 1.4e-74 | % identity: 48.72
Query:  VNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD--YSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFGCANG
        V EE+ IWK+NTPFLYDL+I+H LEWPSLT+ W+P    P  KD  ++V K+ LGTH+     ++LM+A V +P  ++E      RD             
Subjt:  VNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD--YSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFGCANG

Query:  KVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDN
        KV+I Q I  DGEVNRAR MPQ P ++  KT  +EV +FDY++   KP     C+PDLRL GH+ EGYGL+W+ FK+G+LLSGS+D  ICLWD++AT  +
Subjt:  KVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDN

Query:  KTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        K L  M +++GH+ ++ DVAWHM++E +FGS GDD  L +WDLR+    N+    V  H+ E+N L+FNPFNE
Subjt:  KTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE

AT5G58230.1 Transducin/WD40 repeat-like superfamily protein | e value: 5.2e-143 | % identity: 80.49
Query:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR
        M K ++E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EP GKDYSVQKM LGTH+ +++PNYLMLAQVQLPL+++E++AR Y 
Subjt:  MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYR

Query:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED
        DDR+  GGFGCA GKVQIIQ INHDGEVNRARYMPQNPFIIATKTV+AEV VFDYS+HPSKPPLDG CNPDL+LRGH SEGYGLSW+KFKQGHLLSGS+D
Subjt:  DDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSED

Query:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
        + ICLWDINATP NK+L+A QIFK HEGVV DVAWH+RHEYLFGSVGDD+YL +WDLRS P A+KPVQSVVAH  EVNCL FNPFNE
Subjt:  SHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
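
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, percent identity over a stretch of alignment could be estimated by counting the letters in the middle line. `midline_identity` is a hypothetical helper, and the ten-column excerpt is taken from the start of the last block of the Q10G81 alignment, so the result is not comparable to the full-length % identity values reported in the hit tables.

```python
def midline_identity(midline, aligned_length):
    """Percent identity estimated from a BLAST-style middle line.

    Letters count as identical positions; '+' and spaces do not.
    aligned_length is the number of alignment columns covered.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length

# First ten columns of the final Q10G81 block shown above.
query = "LWDINATPDN"
midline = "LWDI A   N"
print(midline_identity(midline, len(query)))  # prints 60.0
```

Trailing spaces in a middle line are easy to lose when copying text, which is why the column count is passed separately rather than taken from the middle line's own length.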


Sequences
CDS sequence
ATGGAGAAGGGCGATAAAGAGATACGTGGAAAAACGAACGAGAGACTGGTAAATGAGGAGTACAAGATTTGGAAGAAGAATACTCCATTTCTTTACGATTTGATCATCAC
TCATGCCCTAGAGTGGCCTTCACTCACCGTCGAGTGGTTGCCGGACCGGGATGAGCCTCCGGGAAAGGATTACTCTGTTCAGAAGATGAATTTGGGGACTCATAGCTTCG
ATAATAAGCCGAATTACCTCATGCTTGCTCAGGTTCAGCTTCCGCTTGAGAATTCGGAAAACGATGCGCGACATTACCGTGACGATCGTGCCAGTGCGGGTGGCTTTGGG
TGTGCGAACGGCAAGGTACAAATAATCCAGCATATAAATCACGACGGCGAAGTCAATAGAGCCCGTTATATGCCTCAAAATCCATTTATTATTGCCACAAAGACTGTCAG
CGCTGAAGTCTTGGTTTTTGACTACAGTCAACACCCATCCAAACCACCTCTAGATGGTACATGTAATCCTGACTTGAGGTTGAGGGGTCACAAGTCCGAAGGTTATGGTT
TATCATGGAACAAGTTCAAGCAGGGCCATTTACTTAGTGGCTCTGAGGATTCACATATTTGTTTATGGGACATTAATGCTACTCCGGATAACAAAACCCTCGAGGCTATG
CAAATTTTTAAGGGTCATGAAGGTGTTGTGGGAGACGTTGCTTGGCATATGAGGCATGAATACTTATTTGGTTCAGTCGGTGATGATCGATACCTACATGTATGGGATCT
GCGAAGTCGTCCTTTAGCTAATAAGCCTGTACAGTCTGTAGTTGCTCATCAAAGTGAGGTAAATTGCTTGACATTCAATCCCTTCAATGAGTGA
mRNA sequence
TTCAGCACTACCTCTTCTTGAAGACATATTGGCGGGAACTTCACCGGGGAACTGAGCTCCCAACTTTGCTTCTAACCGACGAAATCGACCCTCAAACTCTGCAAATTTTG
CTTCATGAGAAGACAATGGAGAAGGGCGATAAAGAGATACGTGGAAAAACGAACGAGAGACTGGTAAATGAGGAGTACAAGATTTGGAAGAAGAATACTCCATTTCTTTA
CGATTTGATCATCACTCATGCCCTAGAGTGGCCTTCACTCACCGTCGAGTGGTTGCCGGACCGGGATGAGCCTCCGGGAAAGGATTACTCTGTTCAGAAGATGAATTTGG
GGACTCATAGCTTCGATAATAAGCCGAATTACCTCATGCTTGCTCAGGTTCAGCTTCCGCTTGAGAATTCGGAAAACGATGCGCGACATTACCGTGACGATCGTGCCAGT
GCGGGTGGCTTTGGGTGTGCGAACGGCAAGGTACAAATAATCCAGCATATAAATCACGACGGCGAAGTCAATAGAGCCCGTTATATGCCTCAAAATCCATTTATTATTGC
CACAAAGACTGTCAGCGCTGAAGTCTTGGTTTTTGACTACAGTCAACACCCATCCAAACCACCTCTAGATGGTACATGTAATCCTGACTTGAGGTTGAGGGGTCACAAGT
CCGAAGGTTATGGTTTATCATGGAACAAGTTCAAGCAGGGCCATTTACTTAGTGGCTCTGAGGATTCACATATTTGTTTATGGGACATTAATGCTACTCCGGATAACAAA
ACCCTCGAGGCTATGCAAATTTTTAAGGGTCATGAAGGTGTTGTGGGAGACGTTGCTTGGCATATGAGGCATGAATACTTATTTGGTTCAGTCGGTGATGATCGATACCT
ACATGTATGGGATCTGCGAAGTCGTCCTTTAGCTAATAAGCCTGTACAGTCTGTAGTTGCTCATCAAAGTGAGGTAAATTGCTTGACATTCAATCCCTTCAATGAGTGAT
CAGCTCATCCCTCCATACCTTTTGACTGTCACGAGGAGGAGGTTTTTCAGGTGGGCTGGCATCCAAAGAACGAAACGATCTTAGCTTCTTGTTGTCGTGGAAGGAGACTC
ATGGTTTGGGACCTTAGCAGGATCGAGGAGGAGCAGACACCGGAGGACGTAGAAGATGGGCCACCCGAATTGCTGTTCATTCACGGTGGTCATACCAATACAATATCAGA
CTTCTCTTGGAATCCCTGTGAGGAGTGGGTCGTTGCTAGTGTAGCTGAAGATAACATACTACAAGTCTGGCAGATGGCTGAGAACGTCTACTATGGTGAAGATGATTTGC
TTGAGGAACCTCCAAAGCTCTCTTAG
Protein sequence
MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMNLGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFG
CANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSQHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAM
QIFKGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEVNCLTFNPFNE
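
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code applies (the record does not name a translation table), the correspondence can be checked on the first and last few codons of the CDS. `translate` is a hypothetical helper written for this illustration, not a CuGenDB tool.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGAGAAGGGCGATAAAGAGATACGTGGA"  # first 30 nt of the CDS above
cds_end = "AATGAGTGA"                         # last 9 nt, including the stop codon
print(translate(cds_start))  # prints MEKGDKEIRG
print(translate(cds_end))    # prints NE
```

Both results agree with the start (`MEKGDKEIRG`) and end (`NE`) of the protein sequence listed above.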