CuGenDBv2

Lsi05G019100 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G019100
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: WD-40 repeat-containing protein MSI1
Genome location: chr05:26292101..26303663
RNA-Seq Expression: Lsi05G019100
Synteny: Lsi05G019100
Gene Ontology terms:
GO:0010468 - regulation of gene expression (biological process)
GO:0048034 - heme O biosynthetic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008495 - protoheme IX farnesyltransferase activity (molecular function)
GO:0042393 - histone binding (molecular function)
InterPro domains:
IPR000537 - UbiA prenyltransferase family
IPR001680 - WD40 repeat
IPR006369 - Protohaem IX farnesyltransferase
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR020472 - G-protein beta WD-40 repeat
IPR022052 - Histone-binding protein RBBP4, N-terminal
IPR036322 - WD40-repeat-containing domain superfamily
IPR044878 - UbiA prenyltransferase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582525.1 WD-40 repeat-containing protein MSI1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 75.2)
Query:  MWRNSRSFSSKLRSSSSSPNPS-TTTSISTFYRHVGVAQ---RALYPYSSSPYSSLSSSPSYSDPIRLGCSNSHGFRVFSSVADPSSLASAPVVSRAREA
        MWRNSRSFSSKLRSSSS P PS TTTSISTFYRH+ VA+   RA YPYSSS +S LS SPS SDP R G SNSHG RVFSSVADPSSLASA +VSRAREA
Subjt:  MWRNSRSFSSKLRSSSSSPNPS-TTTSISTFYRHVGVAQ---RALYPYSSSPYSSLSSSPSYSDPIRLGCSNSHGFRVFSSVADPSSLASAPVVSRAREA

Query:  VDLARHYGRCYWELSKARLSMLVVATSGTGFVLGSGSTMDLGGLCWTCAGTMMVAASANSLNQVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATSVGL
        VDLARHYGRCYWELSKARLSMLVVATSGTGFVLGSGST+DLGGLCWTCAGTMMVAASANSLNQ                                     
Subjt:  VDLARHYGRCYWELSKARLSMLVVATSGTGFVLGSGSTMDLGGLCWTCAGTMMVAASANSLNQVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATSVGL

Query:  AGTAMLAAKVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATSVGLAGTAMLAAKTNILAAGLAASNLILYAFVYTPLKQIHPVNTWVGAIVGAIPPLLG
                 VFEIKNDAKMKRT RRPLPSGRIT PHA+TWATSVGLAGTAMLA K NILAAGLAASNLILYAFVYTPLKQIHPVNTWVGAIVGAIPPLLG
Subjt:  AGTAMLAAKVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATSVGLAGTAMLAAKTNILAAGLAASNLILYAFVYTPLKQIHPVNTWVGAIVGAIPPLLG

Query:  WAAASGQISLNAMILPAALYFWQIPHFMALAYLCRDDYAAGG--------------------------GIDIEFRKGGITSGWFCLESSLLTLAISATAF
        WAAASGQISLN+MILPAALYFWQIPHFMALAYLCRDDYAAGG                           +       GITSGWFCLESS+LTLAISATAF
Subjt:  WAAASGQISLNAMILPAALYFWQIPHFMALAYLCRDDYAAGG--------------------------GIDIEFRKGGITSGWFCLESSLLTLAISATAF

Query:  SFYRHCTMQKARRMFHASLLYLPVFMSGLLVHRLSDNEQTMEEDSSERMLDGLVQEDRYIAQKNRTEHSRALASLEDILAGTS-------PANSALNFPS
        SFYRHCTMQKARRMFHASLLYLPVFMSGLL HRLSDNEQTMEEDSSE MLDGL+QEDRYIAQKN+TEH R  A     +A  S       P    L   +
Subjt:  SFYRHCTMQKARRMFHASLLYLPVFMSGLLVHRLSDNEQTMEEDSSERMLDGLVQEDRYIAQKNRTEHSRALASLEDILAGTS-------PANSALNFPS

Query:  HRR--NRPSNSPNFASSEAI-------------MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMI
        + R  +R +  P    ++ I             M K D+E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKDYSVQKMI
Subjt:  HRR--NRPSNSPNFASSEAI-------------MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMI

Query:  LGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGT
        LGTH+ +N+PNYLMLAQVQLPLE+SENDARHY DDRA AGGFGCANGKVQIIQ INHDGEVNRAR MPQNPFIIATKTVSAEV VFDYSKHPSKPPLDGT
Subjt:  LGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGT

Query:  CNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKTLEAMQIFKV---------------------HEGVVEDVAWHLRHEYLFGSV
        CNPDLRLRGH +EGYGLSW+KFKQGHLLSGS+D+ ICLWDINATP NKTLEAMQIFK                      HEGVV DVAWH+RHEYLFGSV
Subjt:  CNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKTLEAMQIFKV---------------------HEGVVEDVAWHLRHEYLFGSV

Query:  GDDQYLLVWDLRT-PSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVW
        GDD+YL VWDLR+ P ANKPVQSVVAHQSEVNCL FNPFNEW+VATGSTDK VKLFDLRKISS+LHTFDCH+EEVFQVGW+PKNETILASCC GRRLMVW
Subjt:  GDDQYLLVWDLRT-PSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVW

Query:  DLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYHDEDDLPEEPPK
        DLSRI+EEQTPED EDGPPELLFIHGGHT+ ISDFSWNPCE+WVVASVAEDNILQ+WQMAEN+Y+ EDDL EEPPK
Subjt:  DLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYHDEDDLPEEPPK

XP_004133950.1 WD-40 repeat-containing protein MSI1 [Cucumis sativus] (e value: 1.5e-257; % identity: 99.76)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKIS+ALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLPEEPPKA
        IWQMAENIYHDEDDLPEEPPKA
Subjt:  IWQMAENIYHDEDDLPEEPPKA

XP_007218065.1 WD-40 repeat-containing protein MSI1 [Prunus persica] (e value: 3.9e-250; % identity: 96.44)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKD+EEMRGE+EERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLED+ENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDRA+ GGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTV+AEVFVFDYSKHPSKPPLDG C+PDLRLRGH+TEGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLL+WDLRTPS  KPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKI++ALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLPEEPPK
        IWQMAENIYHDEDDLPEEP K
Subjt:  IWQMAENIYHDEDDLPEEPPK

XP_022147047.1 WD-40 repeat-containing protein MSI1 [Momordica charantia] (e value: 2.4e-255; % identity: 99.05)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKDDE MRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQN FIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTP+ NKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLPEEPPKA
        IWQMAENIYHDEDDLPEEPPKA
Subjt:  IWQMAENIYHDEDDLPEEPPKA

XP_022937662.1 WD-40 repeat-containing protein MSI1 [Cucurbita moschata] (e value: 5.6e-257; % identity: 99.29)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLL+WDLRTP+ NKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLPEEPPKA
        IWQMAENIYHDEDDLPEEPPKA
Subjt:  IWQMAENIYHDEDDLPEEPPKA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L971 WD_REPEATS_REGION domain-containing protein (e value: 7.2e-258; % identity: 99.76)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKIS+ALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLPEEPPKA
        IWQMAENIYHDEDDLPEEPPKA
Subjt:  IWQMAENIYHDEDDLPEEPPKA

A0A1S3AWJ3 WD-40 repeat-containing protein MSI1 (e value: 7.2e-258; % identity: 99.76)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKIS+ALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLPEEPPKA
        IWQMAENIYHDEDDLPEEPPKA
Subjt:  IWQMAENIYHDEDDLPEEPPKA

A0A5A7U4A1 WD-40 repeat-containing protein MSI1 (e value: 7.2e-258; % identity: 99.76)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKIS+ALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLPEEPPKA
        IWQMAENIYHDEDDLPEEPPKA
Subjt:  IWQMAENIYHDEDDLPEEPPKA

A0A6J1FBV1 WD-40 repeat-containing protein MSI1 (e value: 2.7e-257; % identity: 99.29)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLL+WDLRTP+ NKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLPEEPPKA
        IWQMAENIYHDEDDLPEEPPKA
Subjt:  IWQMAENIYHDEDDLPEEPPKA

A0A6J1ICW4 WD-40 repeat-containing protein MSI1 (e value: 2.7e-257; % identity: 99.29)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLL+WDLRTP+ NKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLPEEPPKA
        IWQMAENIYHDEDDLPEEPPKA
Subjt:  IWQMAENIYHDEDDLPEEPPKA

SwissProt top hits (e value, % identity, alignment)
O22466 WD-40 repeat-containing protein MSI1 (e value: 2.5e-247; % identity: 93.87)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKD++EMRGE+EERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEP GKDYSVQKMILGTHTSENEPNYLMLAQVQLPLED+ENDARHYD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDR++ GGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEV+VFDYSKHPSKPPLDG CNPDLRLRGH+TEGYGLSWS+FKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        + ICLWDINATPKNK LEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYL VWDLRTPS  KP+QSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRKIS+ALHT DCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLP-EEPPKAP
        IWQMAENIYHDEDDLP ++ PK P
Subjt:  IWQMAENIYHDEDDLP-EEPPKAP

O22467 Histone-binding protein MSI1 (e value: 1.7e-240; % identity: 91.25)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKD+EEMRGE+EERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEP GKDYSVQKMILGTHTSE+EPNYLMLAQVQLPL+D+E++AR YD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDR++ GGFGCA GKVQIIQQINHDGEVNRARYMPQNPFIIATKTV+AEV+VFDYSKHPSKPPLDG CNPDL+LRGH++EGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNK+L+A QIFK HEGVVEDVAWHLRHEYLFGSVGDDQYLL+WDLR+PSA+KPVQSVVAH  EVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRK+S+ALHTFD HKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQT EDAEDGPPELLFIHGGHTSKISDFSWNPCEDWV++SVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLP-EEPPKA
        IWQMAENIYHDEDD P EEP KA
Subjt:  IWQMAENIYHDEDDLP-EEPPKA

Q09028 Histone-binding protein RBBP4 (e value: 3.0e-168; % identity: 67)
Query:  MEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGC
        +EER+INEEYKIWKKNTPFLYDLV+THALEWPSLT +WLPD   P GKD+S+ +++LGTHTS+ E N+L++A VQLP +D++ DA HYD ++ + GGFG 
Subjt:  MEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGC

Query:  ANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDINAT
         +GK++I  +INH+GEVNRARYMPQNP IIATKT S++V VFDY+KHPSKP   G CNPDLRLRGH  EGYGLSW+    GHLLS SDD  ICLWDI+A 
Subjt:  ANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDINAT

Query:  PK-NKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISSAL
        PK  K ++A  IF  H  VVEDV+WHL HE LFGSV DDQ L++WD R+ + +KP  SV AH +EVNCL+FNP++E+++ATGS DKTV L+DLR +   L
Subjt:  PK-NKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISSAL

Query:  HTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYH
        H+F+ HK+E+FQV W+P NETILAS    RRL VWDLS+I EEQ+PEDAEDGPPELLFIHGGHT+KISDFSWNP E WV+ SV+EDNI+Q+WQMAENIY+
Subjt:  HTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYH

Query:  DED
        DED
Subjt:  DED

Q10G81 Histone-binding protein MSI1 homolog (e value: 9.4e-231; % identity: 87.17)
Query:  DDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDR
        ++EE R E+EERLINEEYKIWKKNTPFLYDLVITHALEWPSLTV+WLPDR EP GKD+SVQKM+LGTHTS+NEPNYLMLAQVQLPL+D+E DARHYDDD 
Subjt:  DDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDR

Query:  ADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQI
        A+ GGFG A+GKVQI+QQINHDGEVNRARYMPQN FIIATKTVSAEV+VFDYSKHPSKPPLDG CNPDLRL+GHN+EGYGLSWS FK+GHLLSGSDDAQI
Subjt:  ADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQI

Query:  CLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDL
        CLWDI A  KNKTL+A+QIFK H+GVVEDVAWHLRHEYLFGSVGDD  LL+WDLR+P + KPVQSV AHQ EVNCLAFNPFNEWVVATGSTDKTVKLFDL
Subjt:  CLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDL

Query:  RKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQ
        RKI ++LHTFDCHKEEVFQVGW+PKNETILASCCLGRRLMVWDLSRID+EQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWV+ASVAEDNILQIWQ
Subjt:  RKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQ

Query:  MAENIYHDEDDLP-EEPPKAP
        MAENIYHDEDD+P ++P KAP
Subjt:  MAENIYHDEDDLP-EEPPKAP

Q3MHL3 Histone-binding protein RBBP4 (e value: 3.0e-168; % identity: 67)
Query:  MEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGC
        +EER+INEEYKIWKKNTPFLYDLV+THALEWPSLT +WLPD   P GKD+S+ +++LGTHTS+ E N+L++A VQLP +D++ DA HYD ++ + GGFG 
Subjt:  MEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGC

Query:  ANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDINAT
         +GK++I  +INH+GEVNRARYMPQNP IIATKT S++V VFDY+KHPSKP   G CNPDLRLRGH  EGYGLSW+    GHLLS SDD  ICLWDI+A 
Subjt:  ANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDINAT

Query:  PK-NKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISSAL
        PK  K ++A  IF  H  VVEDV+WHL HE LFGSV DDQ L++WD R+ + +KP  SV AH +EVNCL+FNP++E+++ATGS DKTV L+DLR +   L
Subjt:  PK-NKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISSAL

Query:  HTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYH
        H+F+ HK+E+FQV W+P NETILAS    RRL VWDLS+I EEQ+PEDAEDGPPELLFIHGGHT+KISDFSWNP E WV+ SV+EDNI+Q+WQMAENIY+
Subjt:  HTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYH

Query:  DED
        DED
Subjt:  DED

Arabidopsis top hits (e value, % identity, alignment)
AT2G16780.1 Transducin family protein / WD-40 repeat family protein (e value: 6.0e-132; % identity: 55.53)
Query:  INEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKD--YSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCAN-
        + E++ +WKKNTPFLYDL+I+H LEWPSLTV W+P    P   D  + V K+ILGTHTS +  ++LM+A V  P  ++E              G G AN 
Subjt:  INEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKD--YSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCAN-

Query:  ----GKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDIN
             KV+I Q+I  DGEVNRAR MPQ P ++  KT   EVF+FDY+KH +K      C+PDLRL GH+ EGYGLSWS FK+G+LLSGS D +ICLWD++
Subjt:  ----GKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDIN

Query:  ATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISSA
        ATP++K L AM +++ HE  + DV+WH+++E LFGS G+D  L++WD RT   N+    V  H+ EVN L+FNPFNEWV+AT S+D TV LFDLRK+++ 
Subjt:  ATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISSA

Query:  LHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQ--TPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAEN
        LH    H+ EVFQV W+P +ET+LAS    RRLMVWDL+R+ EEQ     DAEDGPPELLF HGGH +KISDF+WN  E WV+ASVAEDN LQ+WQMAE+
Subjt:  LHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQ--TPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAEN

Query:  IYHDEDD
        IY DE+D
Subjt:  IYHDEDD

AT2G19520.1 Transducin family protein / WD-40 repeat family protein (e value: 1.2e-58; % identity: 30.71)
Query:  AGTSPANSALNFPSHRRNRP---SNSPNFASSEAIMGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQK
        A T    +  + P  R  +P    +S   +S +    K  E  +   +   ++E+Y  WK   P LYD +  H L WPSL+  W P  E+   K+   Q+
Subjt:  AGTSPANSALNFPSHRRNRP---SNSPNFASSEAIMGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQK

Query:  MILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLD
        + L   T  + PN L++A  ++ ++     A H      +A      +  V+  + I H GEVNR R +PQN  I+AT T S +V ++D    P++  + 
Subjt:  MILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLD

Query:  GTCN--PDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDI----------------------NATPKNK--TLEAMQIFKVHEGVVEDVAWHLRH
        G  N  PDL L GH             +  +LSG  D  + LW I                        T KN+  T+    ++  HE  VEDVA+    
Subjt:  GTCN--PDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDI----------------------NATPKNK--TLEAMQIFKVHEGVVEDVAWHLRH

Query:  EYLFGSVGDDQYLLVWDLRTPSANKPVQSV-VAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRK-----ISSALHTFDCHKEEVFQVGWNPKNETIL
           F SVGDD  L++WD RT     PV  V  AH ++++C+ +NP ++ ++ TGS D TV+LFD RK     + S ++ F+ HK  V  V W+P   ++ 
Subjt:  EYLFGSVGDDQYLLVWDLRTPSANKPVQSV-VAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRK-----ISSALHTFDCHKEEVFQVGWNPKNETIL

Query:  ASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAED-------NILQIWQMAENIYHDEDDLPEEPPK
         S      L +WD  R+ ++   + A   P  L F H GH  K+ DF WN  + W + SV++D         LQIW+M++ IY  E+++  E  K
Subjt:  ASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAED-------NILQIWQMAENIYHDEDDLPEEPPK

AT2G44520.1 cytochrome c oxidase 10 (e value: 1.3e-105; % identity: 52.12)
Query:  MWRNS--RSFSSKLRSSSSSPNPSTTTSISTFYRHVGVAQRALYPYSSSPYSSLSSSPSYSDPIRLGCS--NSHGFRVF--SSVADPSSLASAPVVSRAR
        MWR S    FSS++  SSS PNP     +  + R +     A+  +S  P S+ S++       +LG +   S   RVF  ++ A  ++  +  + SR  
Subjt:  MWRNS--RSFSSKLRSSSSSPNPSTTTSISTFYRHVGVAQRALYPYSSSPYSSLSSSPSYSDPIRLGCS--NSHGFRVF--SSVADPSSLASAPVVSRAR

Query:  EAVDLARHYGRCYWELSKARLSMLVVATSGTGFVLGSG-STMDLGGLCWTCAGTMMVAASANSLNQVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATS
            L  HY RCYWELSKA+LSMLVVATSGTG++LG+G + +   GLC+TCAGTMM+AASANSLNQ+FEI ND+KMKRT                     
Subjt:  EAVDLARHYGRCYWELSKARLSMLVVATSGTGFVLGSG-STMDLGGLCWTCAGTMMVAASANSLNQVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATS

Query:  VGLAGTAMLAAKVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATSVGLAGTAMLAAKTNILAAGLAASNLILYAFVYTPLKQIHPVNTWVGAIVGAIPP
               ML                  RPLPSGRI+ PHA+ WAT  G +G  +LA+KTN+LAAGLA++NL+LYAFVYTPLKQ+HP+NTWVGA+VGAIPP
Subjt:  VGLAGTAMLAAKVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATSVGLAGTAMLAAKTNILAAGLAASNLILYAFVYTPLKQIHPVNTWVGAIVGAIPP

Query:  LLGWAAASGQISLNAMILPAALYFWQIPHFMALAYLCRDDYAAGG------------------------GIDIEF--RKGGITSGWFCLESSLLTLAISA
        LLGWAAASGQIS N+MILPAALYFWQIPHFMALA+LCR+DYAAGG                         I + F     G+TS WFCLES+LLTLAI+A
Subjt:  LLGWAAASGQISLNAMILPAALYFWQIPHFMALAYLCRDDYAAGG------------------------GIDIEF--RKGGITSGWFCLESSLLTLAISA

Query:  TAFSFYRHCTMQKARRMFHASLLYLPVFMSGLLVHRLS-DNEQTMEEDS
        TAFSFYR  TM KAR+MFHASLL+LPVFMSGLL+HR+S DN+Q + E++
Subjt:  TAFSFYRHCTMQKARRMFHASLLYLPVFMSGLLVHRLS-DNEQTMEEDS

AT4G35050.1 Transducin family protein / WD-40 repeat family protein (e value: 2.5e-130; % identity: 53.27)
Query:  EEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKD--YSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDR
        EE + E     + EE+ IWK+NTPFLYDL+I+H LEWPSLT+ W+P    P  KD  ++V K+ILGTHTS    ++LM+A V +P  D+E      D + 
Subjt:  EEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKD--YSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDR

Query:  ADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQI
                   KV+I Q+I  DGEVNRAR MPQ P ++  KT  +EVF+FDY++   KP     C+PDLRL GH  EGYGL+WS FK+G+LLSGS D +I
Subjt:  ADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQI

Query:  CLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDL
        CLWD++AT  +K L  M +++ H+ ++EDVAWH+++E +FGS GDD  L++WDLRT   N+    V  H+ E+N L+FNPFNEWV+AT S+D TV LFDL
Subjt:  CLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDL

Query:  RKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQ--TPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQI
        RK+++ LH    H+ EVFQV W+P +ET+LAS    RRLMVWD++R+ +EQ     DAEDGPPELLF HGGH +KISDF+WN  E WV++SVAEDN LQ+
Subjt:  RKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQ--TPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQI

Query:  WQMAENIYHDEDD
        WQMAE+IY ++D+
Subjt:  WQMAENIYHDEDD

AT5G58230.1 Transducin/WD40 repeat-like superfamily protein (e value: 1.2e-241; % identity: 91.25)
Query:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD
        MGKD+EEMRGE+EERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEP GKDYSVQKMILGTHTSE+EPNYLMLAQVQLPL+D+E++AR YD
Subjt:  MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYD

Query:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD
        DDR++ GGFGCA GKVQIIQQINHDGEVNRARYMPQNPFIIATKTV+AEV+VFDYSKHPSKPPLDG CNPDL+LRGH++EGYGLSWSKFKQGHLLSGSDD
Subjt:  DDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDD

Query:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL
        AQICLWDINATPKNK+L+A QIFK HEGVVEDVAWHLRHEYLFGSVGDDQYLL+WDLR+PSA+KPVQSVVAH  EVNCLAFNPFNEWVVATGSTDKTVKL
Subjt:  AQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL

Query:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ
        FDLRK+S+ALHTFD HKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQT EDAEDGPPELLFIHGGHTSKISDFSWNPCEDWV++SVAEDNILQ
Subjt:  FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQ

Query:  IWQMAENIYHDEDDLP-EEPPKA
        IWQMAENIYHDEDD P EEP KA
Subjt:  IWQMAENIYHDEDDLP-EEPPKA


Sequences
CDS sequence
ATGCTCATTATCGACCGCGTTAGATCCCAAATTCGTTTCCACGCCATCGCCGCTGCCGCCATGTGGAGGAACTCCCGGAGTTTCTCATCCAAACTCCGGTCATCATCTTC
ATCTCCTAACCCTTCGACCACAACCTCCATTTCCACCTTCTATCGTCACGTCGGCGTCGCTCAGCGCGCCCTCTATCCTTATTCATCATCGCCGTACTCTTCTTTATCAT
CATCGCCGTCCTATTCCGATCCCATCAGACTAGGTTGTTCGAATTCTCATGGATTTCGGGTCTTTTCGAGCGTTGCTGATCCGTCGTCTTTGGCTTCCGCACCGGTGGTT
TCCAGAGCCAGGGAGGCCGTGGACTTAGCTCGCCACTATGGTCGCTGTTACTGGGAGCTCTCGAAGGCTCGTCTTAGCATGCTAGTTGTTGCAACTTCTGGGACTGGATT
TGTTCTAGGGAGTGGAAGTACTATGGACCTTGGGGGACTTTGTTGGACTTGTGCTGGTACCATGATGGTTGCAGCATCTGCAAACTCCTTAAATCAGGTTTTCGAAATAA
AAAATGATGCTAAAATGAAGAGAACACGGCGAAGACCACTACCCTCAGGGCGCATCACAACTCCACATGCAATCACCTGGGCAACTTCTGTTGGTTTAGCTGGGACTGCT
ATGTTAGCTGCCAAGGTTTTCGAAATAAAAAATGATGCTAAAATGAAGAGAACACGGCGAAGACCACTACCCTCAGGGCGCATCACAACTCCACATGCAATCACCTGGGC
AACTTCTGTTGGTTTAGCTGGGACTGCTATGTTAGCTGCCAAGACTAATATCTTGGCAGCTGGTCTTGCAGCTTCAAATTTGATTCTCTATGCATTTGTATATACCCCAC
TCAAGCAGATTCACCCTGTCAATACGTGGGTTGGTGCCATAGTTGGTGCCATTCCACCACTTCTTGGGTGGGCTGCTGCTTCAGGACAAATATCTCTAAATGCAATGATT
CTCCCCGCTGCCCTTTACTTTTGGCAAATACCCCATTTTATGGCTCTTGCATATTTGTGCCGTGATGATTATGCTGCTGGAGGTGGCATAGACATAGAGTTTAGGAAAGG
GGGCATCACCTCGGGATGGTTTTGTCTAGAATCATCCCTCCTCACACTTGCAATCAGTGCGACTGCATTTTCGTTCTACCGACACTGCACGATGCAGAAAGCAAGAAGGA
TGTTTCATGCTAGCCTCTTGTATCTTCCTGTTTTCATGTCAGGACTTCTGGTTCACCGTCTTTCTGATAACGAGCAGACAATGGAAGAAGATAGTTCTGAAAGGATGTTG
GATGGTCTAGTACAGGAGGATAGATATATTGCCCAAAAGAACAGGACGGAGCATAGTCGAGCCCTAGCTTCGCTCGAAGACATATTGGCGGGAACTTCACCGGCAAACTC
AGCACTCAACTTTCCTTCTCACCGACGAAATCGACCCTCAAACTCTCCAAATTTTGCTTCATCGGAGGCGATAATGGGGAAGGACGATGAAGAGATGCGTGGAGAAATGG
AGGAAAGGCTGATAAACGAAGAGTACAAGATTTGGAAGAAGAATACTCCATTTCTTTACGATTTGGTCATCACTCATGCCCTAGAGTGGCCTTCACTTACTGTAGAGTGG
TTGCCGGACCGGGAGGAGCCTCCGGGCAAGGATTACTCCGTTCAGAAGATGATTTTGGGGACTCATACCTCCGAGAATGAACCGAATTACCTCATGCTCGCTCAGGTTCA
GCTTCCGCTTGAGGATTCGGAGAACGATGCGCGACATTACGATGACGACCGTGCTGATGCGGGTGGCTTCGGCTGTGCGAACGGCAAGGTACAAATAATCCAGCAAATAA
ATCACGATGGCGAGGTCAATAGAGCCCGTTATATGCCTCAAAATCCATTTATTATTGCTACAAAGACTGTCAGCGCCGAAGTCTTTGTTTTTGACTATAGTAAACACCCA
TCCAAACCACCTCTAGATGGTACATGCAATCCCGATTTGAGATTGAGGGGTCACAATACCGAAGGTTATGGTTTATCGTGGAGTAAGTTCAAGCAGGGCCATTTACTTAG
TGGTTCTGATGATGCACAGATTTGTTTATGGGACATTAATGCTACTCCAAAGAATAAAACCCTTGAGGCTATGCAAATTTTTAAGGTTCATGAAGGTGTTGTGGAAGACG
TTGCATGGCATCTTAGGCATGAATACTTATTTGGTTCAGTAGGTGATGACCAATACCTGCTCGTATGGGATTTGCGAACTCCTTCAGCTAATAAGCCTGTACAGTCTGTA
GTTGCTCATCAAAGTGAGGTTAATTGCTTGGCATTCAATCCCTTCAATGAGTGGGTTGTAGCCACAGGGTCAACTGATAAGACGGTTAAGTTGTTTGATCTACGTAAAAT
CAGTTCTGCACTCCATACCTTTGACTGTCACAAGGAGGAGGTTTTCCAGGTTGGCTGGAATCCAAAGAATGAAACGATCTTAGCTTCTTGTTGTCTTGGTAGGAGACTCA
TGGTTTGGGACCTTAGCAGGATTGACGAGGAGCAGACACCTGAGGATGCAGAAGATGGCCCGCCCGAATTGCTGTTCATTCATGGTGGTCATACCAGTAAAATATCAGAC
TTTTCTTGGAATCCCTGTGAGGATTGGGTGGTTGCTAGTGTAGCAGAAGATAACATACTACAAATCTGGCAGATGGCTGAGAACATCTACCATGATGAAGATGATTTGCC
TGAGGAACCTCCAAAGGCCCCCTAG
mRNA sequence
ATGCTCATTATCGACCGCGTTAGATCCCAAATTCGTTTCCACGCCATCGCCGCTGCCGCCATGTGGAGGAACTCCCGGAGTTTCTCATCCAAACTCCGGTCATCATCTTC
ATCTCCTAACCCTTCGACCACAACCTCCATTTCCACCTTCTATCGTCACGTCGGCGTCGCTCAGCGCGCCCTCTATCCTTATTCATCATCGCCGTACTCTTCTTTATCAT
CATCGCCGTCCTATTCCGATCCCATCAGACTAGGTTGTTCGAATTCTCATGGATTTCGGGTCTTTTCGAGCGTTGCTGATCCGTCGTCTTTGGCTTCCGCACCGGTGGTT
TCCAGAGCCAGGGAGGCCGTGGACTTAGCTCGCCACTATGGTCGCTGTTACTGGGAGCTCTCGAAGGCTCGTCTTAGCATGCTAGTTGTTGCAACTTCTGGGACTGGATT
TGTTCTAGGGAGTGGAAGTACTATGGACCTTGGGGGACTTTGTTGGACTTGTGCTGGTACCATGATGGTTGCAGCATCTGCAAACTCCTTAAATCAGGTTTTCGAAATAA
AAAATGATGCTAAAATGAAGAGAACACGGCGAAGACCACTACCCTCAGGGCGCATCACAACTCCACATGCAATCACCTGGGCAACTTCTGTTGGTTTAGCTGGGACTGCT
ATGTTAGCTGCCAAGGTTTTCGAAATAAAAAATGATGCTAAAATGAAGAGAACACGGCGAAGACCACTACCCTCAGGGCGCATCACAACTCCACATGCAATCACCTGGGC
AACTTCTGTTGGTTTAGCTGGGACTGCTATGTTAGCTGCCAAGACTAATATCTTGGCAGCTGGTCTTGCAGCTTCAAATTTGATTCTCTATGCATTTGTATATACCCCAC
TCAAGCAGATTCACCCTGTCAATACGTGGGTTGGTGCCATAGTTGGTGCCATTCCACCACTTCTTGGGTGGGCTGCTGCTTCAGGACAAATATCTCTAAATGCAATGATT
CTCCCCGCTGCCCTTTACTTTTGGCAAATACCCCATTTTATGGCTCTTGCATATTTGTGCCGTGATGATTATGCTGCTGGAGGTGGCATAGACATAGAGTTTAGGAAAGG
GGGCATCACCTCGGGATGGTTTTGTCTAGAATCATCCCTCCTCACACTTGCAATCAGTGCGACTGCATTTTCGTTCTACCGACACTGCACGATGCAGAAAGCAAGAAGGA
TGTTTCATGCTAGCCTCTTGTATCTTCCTGTTTTCATGTCAGGACTTCTGGTTCACCGTCTTTCTGATAACGAGCAGACAATGGAAGAAGATAGTTCTGAAAGGATGTTG
GATGGTCTAGTACAGGAGGATAGATATATTGCCCAAAAGAACAGGACGGAGCATAGTCGAGCCCTAGCTTCGCTCGAAGACATATTGGCGGGAACTTCACCGGCAAACTC
AGCACTCAACTTTCCTTCTCACCGACGAAATCGACCCTCAAACTCTCCAAATTTTGCTTCATCGGAGGCGATAATGGGGAAGGACGATGAAGAGATGCGTGGAGAAATGG
AGGAAAGGCTGATAAACGAAGAGTACAAGATTTGGAAGAAGAATACTCCATTTCTTTACGATTTGGTCATCACTCATGCCCTAGAGTGGCCTTCACTTACTGTAGAGTGG
TTGCCGGACCGGGAGGAGCCTCCGGGCAAGGATTACTCCGTTCAGAAGATGATTTTGGGGACTCATACCTCCGAGAATGAACCGAATTACCTCATGCTCGCTCAGGTTCA
GCTTCCGCTTGAGGATTCGGAGAACGATGCGCGACATTACGATGACGACCGTGCTGATGCGGGTGGCTTCGGCTGTGCGAACGGCAAGGTACAAATAATCCAGCAAATAA
ATCACGATGGCGAGGTCAATAGAGCCCGTTATATGCCTCAAAATCCATTTATTATTGCTACAAAGACTGTCAGCGCCGAAGTCTTTGTTTTTGACTATAGTAAACACCCA
TCCAAACCACCTCTAGATGGTACATGCAATCCCGATTTGAGATTGAGGGGTCACAATACCGAAGGTTATGGTTTATCGTGGAGTAAGTTCAAGCAGGGCCATTTACTTAG
TGGTTCTGATGATGCACAGATTTGTTTATGGGACATTAATGCTACTCCAAAGAATAAAACCCTTGAGGCTATGCAAATTTTTAAGGTTCATGAAGGTGTTGTGGAAGACG
TTGCATGGCATCTTAGGCATGAATACTTATTTGGTTCAGTAGGTGATGACCAATACCTGCTCGTATGGGATTTGCGAACTCCTTCAGCTAATAAGCCTGTACAGTCTGTA
GTTGCTCATCAAAGTGAGGTTAATTGCTTGGCATTCAATCCCTTCAATGAGTGGGTTGTAGCCACAGGGTCAACTGATAAGACGGTTAAGTTGTTTGATCTACGTAAAAT
CAGTTCTGCACTCCATACCTTTGACTGTCACAAGGAGGAGGTTTTCCAGGTTGGCTGGAATCCAAAGAATGAAACGATCTTAGCTTCTTGTTGTCTTGGTAGGAGACTCA
TGGTTTGGGACCTTAGCAGGATTGACGAGGAGCAGACACCTGAGGATGCAGAAGATGGCCCGCCCGAATTGCTGTTCATTCATGGTGGTCATACCAGTAAAATATCAGAC
TTTTCTTGGAATCCCTGTGAGGATTGGGTGGTTGCTAGTGTAGCAGAAGATAACATACTACAAATCTGGCAGATGGCTGAGAACATCTACCATGATGAAGATGATTTGCC
TGAGGAACCTCCAAAGGCCCCCTAGTGCTTTATTCTCTGTATTTTAAGTAGTACTCAAAACTATTAAAATCTGTCAAGGTGTCATTTAGGTCTTTCAATCTTCCGTAGCC
CTCTTCTTCTCCATGTTCATCTTATCTTTCTTAGACAAAAAGGAAAGGAAAAAGAACAAAAAACAAAAAAAAAAAACCCCACCCCAAGTGATGTTTTAGGAATCTGATTT
CCCTTTTCTTTTCAGTTCTAAATATAAAGAGGCGTTTTTGAGGCTTTCGGTTTCGGTGATTCACGCTATGGATATGGATATGGGTAAAAAGGAGAAACTTTGTAGAGCTC
GGCGGGGTTGCAGGTGTTGGAAGAAAAGATTATATGCAGGATGAGCAATGATGTGTTGTAGGTCTGTCCTTCCTAAACTTTAGTTCAGTGCCTTAAGTAGTATACTTGTT
CATTGTTAGGGTTTATAATGATTCACTACTAACTATTCCGGGATCAAGGGCCTTGGGAAGATGATTTGTTTTAGATCTTTATTCTTCAACCTTCTCTCCTTGTAGGTGGC
TCATCAGAAATATGGTTCCCTTTGATTTTGATGTGATTTGAGTTTTGTGCCATTTGAGCTTTGAAATGATTTAGACAGAGACCTAACAAAGATGAATCACAATCCAAAGG
ACTGTAAAATGACTTTTACCATCTTTAATAGAATACTTCACTCTCAATTTAGTAGGTAT
Protein sequence
MLIIDRVRSQIRFHAIAAAAMWRNSRSFSSKLRSSSSSPNPSTTTSISTFYRHVGVAQRALYPYSSSPYSSLSSSPSYSDPIRLGCSNSHGFRVFSSVADPSSLASAPVV
SRAREAVDLARHYGRCYWELSKARLSMLVVATSGTGFVLGSGSTMDLGGLCWTCAGTMMVAASANSLNQVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATSVGLAGTA
MLAAKVFEIKNDAKMKRTRRRPLPSGRITTPHAITWATSVGLAGTAMLAAKTNILAAGLAASNLILYAFVYTPLKQIHPVNTWVGAIVGAIPPLLGWAAASGQISLNAMI
LPAALYFWQIPHFMALAYLCRDDYAAGGGIDIEFRKGGITSGWFCLESSLLTLAISATAFSFYRHCTMQKARRMFHASLLYLPVFMSGLLVHRLSDNEQTMEEDSSERML
DGLVQEDRYIAQKNRTEHSRALASLEDILAGTSPANSALNFPSHRRNRPSNSPNFASSEAIMGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEW
LPDREEPPGKDYSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQQINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHP
SKPPLDGTCNPDLRLRGHNTEGYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHEYLFGSVGDDQYLLVWDLRTPSANKPVQSV
VAHQSEVNCLAFNPFNEWVVATGSTDKTVKLFDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAEDGPPELLFIHGGHTSKISD
FSWNPCEDWVVASVAEDNILQIWQMAENIYHDEDDLPEEPPKAP