CuGenDBv2

Lag0034554 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034554
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: WD-40 repeat-containing protein MSI4
Genome location: chr3:8353668..8359571
RNA-Seq Expression: Lag0034554
Synteny: Lag0034554
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR001680 - WD40 repeat
    IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
    IPR022052 - Histone-binding protein RBBP4, N-terminal
    IPR036322 - WD40-repeat-containing domain superfamily
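
The genome location field above is a `chromosome:start..end` string. The following is a minimal sketch of parsing it and measuring the span, assuming the coordinates are 1-based and inclusive (the page does not state this). `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

chrom, start, end = parse_location("chr3:8353668..8359571")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 5,904 bp on chr3; with 0-based half-open coordinates it would be 5,903 bp.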


Homology
GenBank top hits | e value | % identity | Alignment
KAG6588040.1 WD-40 repeat-containing protein MSI4, partial [Cucurbita argyrosperma subsp. sororia] | 4.6e-293 | 96.89
Query:  MDSSQSQQQQQPQQQPQ------QQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQ
        MDSSQSQQQQQ QQQPQ      QQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQ
Subjt:  MDSSQSQQQQQPQQQPQ------QQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQ

Query:  ATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHA
        ATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHA
Subjt:  ATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHA

Query:  VLGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTV
        VLGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITT+A+D AASKSPGSGGSIIKKAGEANDK A+GPSIGPRGVYHGHEDTV
Subjt:  VLGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTV

Query:  EDVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWS
        EDVTFCPSNAQEFCSVGDDSCLILWDARTG++PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWS
Subjt:  EDVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWS

Query:  PDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTE
        PDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWT+VSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVL E
Subjt:  PDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTE

Query:  LEKFKSHVIECAAKP
        LEKFKSHVIECAAKP
Subjt:  LEKFKSHVIECAAKP

XP_022147703.1 WD-40 repeat-containing protein MSI4-like [Momordica charantia] | 5.1e-292 | 97.45
Query:  MDSSQSQQQQQPQQQPQQQ-QPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN
        MDSSQ QQQQQ  Q PQQQ QPVVKKKETRGRKPKPKDEKKDEQ AKKMKAQ QPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN
Subjt:  MDSSQSQQQQQPQQQPQQQ-QPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN

Query:  RQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGAT
        RQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGAT
Subjt:  RQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGAT

Query:  NSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTF
        NSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGE NDKA+DGPS+GPRGVYHGHEDTVEDVTF
Subjt:  NSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTF

Query:  CPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSS
        CPSNAQEFCSVGDDSCLILWDAR GS PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSS
Subjt:  CPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSS

Query:  VFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFK
        VFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVL ELEKFK
Subjt:  VFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFK

Query:  SHVIECAAKP
        SHVIECAAKP
Subjt:  SHVIECAAKP

XP_022933315.1 WD-40 repeat-containing protein MSI4 [Cucurbita moschata] | 1.7e-292 | 96.89
Query:  MDSSQSQQQQQPQQQPQQQQP-----VVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQA
        MDSSQSQQQQQ QQQPQQQQP      VKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQA
Subjt:  MDSSQSQQQQQPQQQPQQQQP-----VVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQA

Query:  TYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAV
        TYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAV
Subjt:  TYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAV

Query:  LGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVE
        LGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITT+A+D AASKSPGSGGSIIKKAGEANDK A+GPSIGPRGVYHGHEDTVE
Subjt:  LGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVE

Query:  DVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP
        DVTFCPSNAQEFCSVGDDSCLILWDARTG++PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP
Subjt:  DVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP

Query:  DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTEL
        DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWT+VSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVL EL
Subjt:  DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTEL

Query:  EKFKSHVIECAAKP
        EKFKSHVIECAAKP
Subjt:  EKFKSHVIECAAKP

XP_022967203.1 WD-40 repeat-containing protein MSI4 [Cucurbita maxima] | 1.5e-291 | 97.08
Query:  MDSSQSQQQQQ---PQQQPQ-QQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQAT
        MDSSQSQQQQQ    QQQPQ QQQPVVKKKETRGRKPKPK+EKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQAT
Subjt:  MDSSQSQQQQQ---PQQQPQ-QQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQAT

Query:  YKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVL
        YKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVL
Subjt:  YKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVL

Query:  GATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVED
        GATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITT+A+D AASKSPGSGGSIIKKAGEANDK A+GPSIGPRGVYHGHEDTVED
Subjt:  GATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVED

Query:  VTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPD
        VTFCPSNAQEFCSVGDDSCLILWDARTG++PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPD
Subjt:  VTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPD

Query:  KSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELE
        KSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWT+VSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVL ELE
Subjt:  KSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELE

Query:  KFKSHVIECAAKP
        KFKSHVIECAAKP
Subjt:  KFKSHVIECAAKP

XP_038880551.1 WD-40 repeat-containing protein MSI4-like [Benincasa hispida] | 2.3e-292 | 96.9
Query:  MDSSQS-QQQQQPQQQPQQ------QQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLE
        MDSSQS QQQQQ QQQPQQ      QQPVVKKKETRGRKPKPKDEKKDEQQ KKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLE
Subjt:  MDSSQS-QQQQQPQQQPQQ------QQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLE

Query:  QATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRH
        QATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQN+RIVATHTDSPDVLIWDVEAQPNRH
Subjt:  QATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRH

Query:  AVLGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDT
        AVLGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKK GEANDKA+DGPSIGPRGVYHGHEDT
Subjt:  AVLGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDT

Query:  VEDVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQW
        VEDVTFCPSNAQEFCSVGDDSCLILWDARTGS PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQW
Subjt:  VEDVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQW

Query:  SPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLT
        SPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNA+DPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPE+EVL 
Subjt:  SPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLT

Query:  ELEKFKSHVIECAAKP
        ELEKFKSHVIECAAKP
Subjt:  ELEKFKSHVIECAAKP
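
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those positions for one segment. `alignment_stats` and `percent_identity` are hypothetical helpers. The function takes the query line as well, because trailing spaces on the middle line are easily lost in plain text.

```python
def alignment_stats(query, midline):
    """Return (identical, positives, length) for one alignment segment."""
    length = len(query)
    # Pad the match line: trailing mismatches appear as spaces that may be stripped.
    midline = midline.ljust(length)[:length]
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, length

def percent_identity(query, midline):
    identical, _, length = alignment_stats(query, midline)
    return 100.0 * identical / length if length else 0.0

# Tail of one segment from the XP_038880551.1 alignment above.
query = "RMSDLIYRPEDEVLT"
midline = "RMSDLIYRPE+EVL"
print(alignment_stats(query, midline))
print(round(percent_identity(query, midline), 2))
```

This gives a per-segment figure only. The % identity column in the hit tables is reported for the whole alignment, so reproducing it would require concatenating every segment of a hit first.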

TrEMBL top hits | e value | % identity | Alignment
A0A6J1D211 WD-40 repeat-containing protein MSI4-like | 2.5e-292 | 97.45
Query:  MDSSQSQQQQQPQQQPQQQ-QPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN
        MDSSQ QQQQQ  Q PQQQ QPVVKKKETRGRKPKPKDEKKDEQ AKKMKAQ QPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN
Subjt:  MDSSQSQQQQQPQQQPQQQ-QPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN

Query:  RQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGAT
        RQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGAT
Subjt:  RQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGAT

Query:  NSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTF
        NSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGE NDKA+DGPS+GPRGVYHGHEDTVEDVTF
Subjt:  NSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTF

Query:  CPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSS
        CPSNAQEFCSVGDDSCLILWDAR GS PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSS
Subjt:  CPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSS

Query:  VFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFK
        VFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVL ELEKFK
Subjt:  VFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFK

Query:  SHVIECAAKP
        SHVIECAAKP
Subjt:  SHVIECAAKP

A0A6J1F4J3 WD-40 repeat-containing protein MSI4 | 8.4e-293 | 96.89
Query:  MDSSQSQQQQQPQQQPQQQQP-----VVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQA
        MDSSQSQQQQQ QQQPQQQQP      VKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQA
Subjt:  MDSSQSQQQQQPQQQPQQQQP-----VVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQA

Query:  TYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAV
        TYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAV
Subjt:  TYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAV

Query:  LGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVE
        LGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITT+A+D AASKSPGSGGSIIKKAGEANDK A+GPSIGPRGVYHGHEDTVE
Subjt:  LGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVE

Query:  DVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP
        DVTFCPSNAQEFCSVGDDSCLILWDARTG++PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP
Subjt:  DVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP

Query:  DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTEL
        DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWT+VSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVL EL
Subjt:  DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTEL

Query:  EKFKSHVIECAAKP
        EKFKSHVIECAAKP
Subjt:  EKFKSHVIECAAKP

A0A6J1H857 WD-40 repeat-containing protein MSI4 | 2.1e-291 | 96.86
Query:  MDSSQSQQQQQ-PQQQPQQQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN
        MDS QSQQQQQ  QQQ QQQQPVVKKKETRGRKPKPK+EKKDEQQAKKMKA QQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN
Subjt:  MDSSQSQQQQQ-PQQQPQQQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN

Query:  RQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGAT
        RQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQN+RIVATHTDSPDVLIWDVEAQPNRHAVLGAT
Subjt:  RQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGAT

Query:  NSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTF
        NSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPA SKSPGSGGSIIKKAGE  DK +DGPSIGPRGVYHGHEDTVEDVTF
Subjt:  NSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTF

Query:  CPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSS
        CPSNAQEFCSVGDDSCLILWDARTGS+PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIR+FDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSS
Subjt:  CPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSS

Query:  VFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFK
        VFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNA+DPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVL ELEKFK
Subjt:  VFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFK

Query:  SHVIECAAKP
        SHVIECAAKP
Subjt:  SHVIECAAKP

A0A6J1HUE9 WD-40 repeat-containing protein MSI4 | 7.1e-292 | 97.08
Query:  MDSSQSQQQQQ---PQQQPQ-QQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQAT
        MDSSQSQQQQQ    QQQPQ QQQPVVKKKETRGRKPKPK+EKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQAT
Subjt:  MDSSQSQQQQQ---PQQQPQ-QQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQAT

Query:  YKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVL
        YKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVL
Subjt:  YKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVL

Query:  GATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVED
        GATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITT+A+D AASKSPGSGGSIIKKAGEANDK A+GPSIGPRGVYHGHEDTVED
Subjt:  GATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVED

Query:  VTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPD
        VTFCPSNAQEFCSVGDDSCLILWDARTG++PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPD
Subjt:  VTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPD

Query:  KSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELE
        KSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWT+VSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVL ELE
Subjt:  KSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELE

Query:  KFKSHVIECAAKP
        KFKSHVIECAAKP
Subjt:  KFKSHVIECAAKP

A0A6J1JHL6 WD-40 repeat-containing protein MSI4 | 1.0e-290 | 96.48
Query:  MDSSQSQQQQ--QPQQQPQQQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYK
        MDS QSQQQQ  Q QQQ QQQQPVVKKKETRGRKPKPK+EKKDEQQAKKMKA Q PSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYK
Subjt:  MDSSQSQQQQ--QPQQQPQQQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYK

Query:  NRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGA
        NRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQN+RIVATHTDSPDVLIWDVEAQPNRHAVLGA
Subjt:  NRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGA

Query:  TNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVT
        TNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPA SKSPGSGGSIIKKAGE  DK +DGPSIGPRGVYHGHEDTVEDVT
Subjt:  TNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVT

Query:  FCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS
        FCPSNAQEFCSVGDDSCLILWDARTGS+PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIR+FDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS
Subjt:  FCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS

Query:  SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKF
        SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNA+DPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVL ELEKF
Subjt:  SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKF

Query:  KSHVIECAAKP
        KSHVIECAAKP
Subjt:  KSHVIECAAKP

SwissProt top hits | e value | % identity | Alignment
O22607 WD-40 repeat-containing protein MSI4 | 1.8e-244 | 82.65
Query:  RGRKPKPKDEKK----DEQQAKKM-----KAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI
        RGRKPK K++ +     +Q   KM     K QQ PSVDE+Y+QWK LVP+LYDW ANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI
Subjt:  RGRKPKPKDEKK----DEQQAKKM-----KAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI

Query:  ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALA
        ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQN++IVATHTDSPDVLIWDVE QPNRHAVLGA NSRPDLILTGHQ+NAEFALA
Subjt:  ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALA

Query:  MCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILW
        MCPTEP+VLSGGKDK VVLWSIQDHITT  TD  +S      GSIIK+ GE  DK  + P++GPRGVYHGHEDTVEDV F P++AQEFCSVGDDSCLILW
Subjt:  MCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILW

Query:  DARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG
        DARTG+NP  KVEKAH+ADLHCVDWNPHDDNLI+TGSADN++RLFDRR LT+NGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYD+V 
Subjt:  DARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG

Query:  KKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFKSHVIECAAKP
        KK++RA ++PA   GLFFQHAGHRDKVVDFHWNASDPWT+VSVSDDC+TTGGGGTLQIWRMSDLIYRPE+EV+ ELEKFKSHV+ CA+KP
Subjt:  KKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFKSHVIECAAKP

Q09028 Histone-binding protein RBBP4 | 6.7e-61 | 32.97
Query:  EQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYK--NRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN
        +++A    A ++  ++E Y  WK   P LYD    H L WPSL+ +W P + +   K  +  RL L   T     N LVIA+ ++  P   A    S ++
Subjt:  EQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYK--NRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN

Query:  EE--------ARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPT-EPYVLSG
         E        + S  ++    I H GEVNR R +PQN  I+AT T S DVL++D    P++    G  N  PDL L GHQ+   + L+  P    ++LS 
Subjt:  EE--------ARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPT-EPYVLSG

Query:  GKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTG--SNPA
          D  + LW I                     S + K G+  D          + ++ GH   VEDV++   +   F SV DD  L++WD R+   S P+
Subjt:  GKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTG--SNPA

Query:  VKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRT
          V+ AH A+++C+ +NP+ + ++ TGSAD ++ L+D RNL        ++ FE HK  +  VQWSP   ++  SS  D  LN+WD  K+G++ +     
Subjt:  VKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRT

Query:  PAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPED
           PP L F H GH  K+ DF WN ++PW + SVS+D         +Q+W+M++ IY  ED
Subjt:  PAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPED

Q3MHL3 Histone-binding protein RBBP4 | 6.7e-61 | 32.97
Query:  EQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYK--NRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN
        +++A    A ++  ++E Y  WK   P LYD    H L WPSL+ +W P + +   K  +  RL L   T     N LVIA+ ++  P   A    S ++
Subjt:  EQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYK--NRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN

Query:  EE--------ARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPT-EPYVLSG
         E        + S  ++    I H GEVNR R +PQN  I+AT T S DVL++D    P++    G  N  PDL L GHQ+   + L+  P    ++LS 
Subjt:  EE--------ARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPT-EPYVLSG

Query:  GKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTG--SNPA
          D  + LW I                     S + K G+  D          + ++ GH   VEDV++   +   F SV DD  L++WD R+   S P+
Subjt:  GKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTG--SNPA

Query:  VKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRT
          V+ AH A+++C+ +NP+ + ++ TGSAD ++ L+D RNL        ++ FE HK  +  VQWSP   ++  SS  D  LN+WD  K+G++ +     
Subjt:  VKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRT

Query:  PAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPED
           PP L F H GH  K+ DF WN ++PW + SVS+D         +Q+W+M++ IY  ED
Subjt:  PAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPED

Q60972 Histone-binding protein RBBP4 | 6.7e-61 | 32.97
Query:  EQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYK--NRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN
        +++A    A ++  ++E Y  WK   P LYD    H L WPSL+ +W P + +   K  +  RL L   T     N LVIA+ ++  P   A    S ++
Subjt:  EQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYK--NRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN

Query:  EE--------ARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPT-EPYVLSG
         E        + S  ++    I H GEVNR R +PQN  I+AT T S DVL++D    P++    G  N  PDL L GHQ+   + L+  P    ++LS 
Subjt:  EE--------ARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPT-EPYVLSG

Query:  GKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTG--SNPA
          D  + LW I                     S + K G+  D          + ++ GH   VEDV++   +   F SV DD  L++WD R+   S P+
Subjt:  GKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTG--SNPA

Query:  VKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRT
          V+ AH A+++C+ +NP+ + ++ TGSAD ++ L+D RNL        ++ FE HK  +  VQWSP   ++  SS  D  LN+WD  K+G++ +     
Subjt:  VKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRT

Query:  PAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPED
           PP L F H GH  K+ DF WN ++PW + SVS+D         +Q+W+M++ IY  ED
Subjt:  PAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPED

Q9SU78 WD-40 repeat-containing protein MSI5 | 1.9e-209 | 70.81
Query:  DSSQSQQQQQPQQQPQQQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQ
        +++ + Q  +P++ P+     +   + R RKPK  +E    Q    ++  Q+ +VD+ Y+QWK+L+P+LYD F NH LVWPSLSCRWGPQLEQA  K  Q
Subjt:  DSSQSQQQQQPQQQPQQQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQ

Query:  RLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNS
        RLYLSEQT+GSVPNTLVIANCE V           Q NE+A SPFVKKYKTIIHPGEVNRIRELPQN++IVATHTDSPD+LIW+ E QP+R+AVLGA +S
Subjt:  RLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNS

Query:  RPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCP
        RPDL+L GHQ++AEFALAMCPTEP+VLSGGKDK V+LW+IQDHIT + +D   SKSPGS     K+ GE +DK   GPS+GPRG+Y+GH+DTVEDV FCP
Subjt:  RPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCP

Query:  SNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVF
        S+AQEFCSVGDDSCL+LWDARTG++PA+KVEKAH+ADLHCVDWNPHD+NLI+TGSADN++R+FDRRNLTSNGVGSP+YKFEGH+AAVLCVQWSPDKSSVF
Subjt:  SNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVF

Query:  GSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFKSH
        GSSAEDGLLNIWD D+VGKK+ERAT+T   P GLFFQHAGHRDKVVDFHW+  +PWT+VSVSD+C++ GGGGTLQIWRMSDLIYRPEDEVLTELEKFKSH
Subjt:  GSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFKSH

Query:  VIECAAK
        V  C +K
Subjt:  VIECAAK

Arabidopsis top hits | e value | % identity | Alignment
AT2G16780.1 Transducin family protein / WD-40 repeat family protein | 1.5e-60 | 31.9
Query:  VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQA----TYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYK
        V+E ++ WK   P LYD   +H L WPSL+  W P         +Y    +L L   T GS  + L++A  +VV P   A   I   N++   P V+  +
Subjt:  VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQA----TYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYK

Query:  TIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNS-RPDLILTGHQENAEFALAMCP-TEPYVLSGGKDKLVVLWSIQDHITTSA
         I   GEVNR R +PQ   +V   T   +V ++D      +HA    T+   PDL L GH +   + L+  P  E Y+LSG +D+ + LW +      SA
Subjt:  TIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNS-RPDLILTGHQENAEFALAMCP-TEPYVLSGGKDKLVVLWSIQDHITTSA

Query:  TDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDD
        T                      DK      +    VY GHE  + DV++   N   F S G+D  L++WD RT  N      K H  +++ + +NP ++
Subjt:  TDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDD

Query:  NLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG-KKTERATRTPAAPPGLFFQHAGHRDKVVD
         ++ T S+D+++ LFD R L      +P++    H+  V  V+W P+  +V  SS ED  L +WD ++VG ++ E        PP L F H GH+ K+ D
Subjt:  NLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG-KKTERATRTPAAPPGLFFQHAGHRDKVVD

Query:  FHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDE
        F WN ++PW + SV++D        +LQ+W+M++ IYR E++
Subjt:  FHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDE

AT2G19520.1 Transducin family protein / WD-40 repeat family protein | 1.3e-245 | 82.65
Query:  RGRKPKPKDEKK----DEQQAKKM-----KAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI
        RGRKPK K++ +     +Q   KM     K QQ PSVDE+Y+QWK LVP+LYDW ANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI
Subjt:  RGRKPKPKDEKK----DEQQAKKM-----KAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI

Query:  ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALA
        ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQN++IVATHTDSPDVLIWDVE QPNRHAVLGA NSRPDLILTGHQ+NAEFALA
Subjt:  ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALA

Query:  MCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILW
        MCPTEP+VLSGGKDK VVLWSIQDHITT  TD  +S      GSIIK+ GE  DK  + P++GPRGVYHGHEDTVEDV F P++AQEFCSVGDDSCLILW
Subjt:  MCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILW

Query:  DARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG
        DARTG+NP  KVEKAH+ADLHCVDWNPHDDNLI+TGSADN++RLFDRR LT+NGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYD+V 
Subjt:  DARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG

Query:  KKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFKSHVIECAAKP
        KK++RA ++PA   GLFFQHAGHRDKVVDFHWNASDPWT+VSVSDDC+TTGGGGTLQIWRMSDLIYRPE+EV+ ELEKFKSHV+ CA+KP
Subjt:  KKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFKSHVIECAAKP

AT4G29730.1 nucleosome/chromatin assembly factor group C5 | 1.4e-210 | 70.81
Query:  DSSQSQQQQQPQQQPQQQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQ
        +++ + Q  +P++ P+     +   + R RKPK  +E    Q    ++  Q+ +VD+ Y+QWK+L+P+LYD F NH LVWPSLSCRWGPQLEQA  K  Q
Subjt:  DSSQSQQQQQPQQQPQQQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQ

Query:  RLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNS
        RLYLSEQT+GSVPNTLVIANCE V           Q NE+A SPFVKKYKTIIHPGEVNRIRELPQN++IVATHTDSPD+LIW+ E QP+R+AVLGA +S
Subjt:  RLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNS

Query:  RPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCP
        RPDL+L GHQ++AEFALAMCPTEP+VLSGGKDK V+LW+IQDHIT + +D   SKSPGS     K+ GE +DK   GPS+GPRG+Y+GH+DTVEDV FCP
Subjt:  RPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCP

Query:  SNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVF
        S+AQEFCSVGDDSCL+LWDARTG++PA+KVEKAH+ADLHCVDWNPHD+NLI+TGSADN++R+FDRRNLTSNGVGSP+YKFEGH+AAVLCVQWSPDKSSVF
Subjt:  SNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVF

Query:  GSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFKSH
        GSSAEDGLLNIWD D+VGKK+ERAT+T   P GLFFQHAGHRDKVVDFHW+  +PWT+VSVSD+C++ GGGGTLQIWRMSDLIYRPEDEVLTELEKFKSH
Subjt:  GSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFKSH

Query:  VIECAAK
        V  C +K
Subjt:  VIECAAK

AT4G35050.1 Transducin family protein / WD-40 repeat family protein | 2.0e-60 | 31.59
Query:  VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQ----LEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYK
        V+E ++ WK   P LYD   +H L WPSL+  W P       +  Y    +L L   T G   + L++A  +VV P   A   +   ++E   P V+  +
Subjt:  VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQ----LEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYK

Query:  TIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATD
         I   GEVNR R +PQ   +V   T   +V ++D      +      +   PDL L GH++           E Y+LSG +D+ + LW +      SAT 
Subjt:  TIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATD

Query:  PAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNL
                           A DK      + P  VY GH+  +EDV +   N   F S GDD  L++WD RT  N      K H  +++ + +NP ++ +
Subjt:  PAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVKVEKAHNADLHCVDWNPHDDNL

Query:  IITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG-KKTERATRTPAAPPGLFFQHAGHRDKVVDFH
        + T S+D+++ LFD R LT     +P++    H+  V  V+W P+  +V  SS ED  L +WD ++VG ++ E        PP L F H GH+ K+ DF 
Subjt:  IITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG-KKTERATRTPAAPPGLFFQHAGHRDKVVDFH

Query:  WNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDE
        WN  +PW + SV++D        +LQ+W+M++ IYR +DE
Subjt:  WNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDE

AT5G58230.1 Transducin/WD40 repeat-like superfamily protein (e value: 4.6e-57; % identity: 30.17)
Query:  KDEQQAKKMKAQ-QQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNR--QRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHIS
        KDE++   M+ + ++  ++E Y  WK   P LYD    H L WPSL+  W P  E+ + K+   Q++ L   T  S PN L++A  +V  P         
Subjt:  KDEQQAKKMKAQ-QQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNR--QRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHIS

Query:  QFNEEARSPF---------VKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPTEPYV
        Q++++ RS F         V+  + I H GEVNR R +PQN  I+AT T + +V ++D    P++  + GA N  PDL L GH             + ++
Subjt:  QFNEEARSPF---------VKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAMCPTEPYV

Query:  LSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGS-N
        LSG  D  + LW I                               +      S+  + ++  HE  VEDV +   +   F SVGDD  L++WD R+ S +
Subjt:  LSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGS-N

Query:  PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERAT
          V+   AH+ +++C+ +NP ++ ++ TGS D +++LFD R L+     + ++ F+ HK  V  V W+P   ++  S      L +WD  ++ ++ +   
Subjt:  PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERAT

Query:  RTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDE
             PP L F H GH  K+ DF WN  + W + SV++D         LQIW+M++ IY  ED+
Subjt:  RTPAAPPGLFFQHAGHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDE


Sequences
CDS sequence
ATGGACTCTTCTCAGTCGCAGCAGCAGCAACAGCCGCAGCAGCAGCCGCAGCAGCAGCAGCCTGTTGTCAAGAAGAAGGAGACCAGAGGCCGCAAGCCCAAGCCTAAGGA
CGAGAAGAAGGACGAGCAGCAAGCTAAGAAGATGAAGGCCCAGCAACAACCCTCCGTCGATGAGCGCTACACCCAGTGGAAGTCTCTTGTTCCTGTTCTTTACGACTGGT
TCGCCAACCACAATCTCGTTTGGCCTTCTCTCTCCTGCCGGTGGGGTCCTCAGCTTGAGCAAGCGACGTATAAGAATCGACAGCGACTTTATCTTTCTGAACAGACTGAT
GGTAGTGTTCCGAATACACTGGTCATTGCAAATTGTGAAGTTGTGAAGCCTAGGGTTGCAGCTGCAGAGCACATTTCTCAGTTCAATGAAGAAGCACGCTCTCCATTTGT
AAAGAAGTACAAGACTATCATACACCCTGGTGAGGTTAACAGAATTAGGGAACTTCCCCAGAATGCTAGAATCGTTGCCACACACACAGATAGTCCAGATGTCCTCATTT
GGGATGTTGAGGCACAACCTAATCGTCATGCTGTCCTTGGTGCCACAAATTCTCGCCCAGATTTGATTCTGACTGGTCATCAAGAAAATGCCGAGTTTGCTCTGGCAATG
TGCCCCACTGAACCTTATGTTCTCTCTGGAGGGAAGGACAAGTTAGTGGTTTTATGGAGTATCCAGGACCATATAACAACTTCTGCCACAGACCCTGCTGCTTCAAAATC
ACCAGGATCTGGTGGTTCTATCATAAAAAAGGCTGGAGAGGCAAATGATAAAGCTGCTGATGGGCCTTCTATTGGGCCACGAGGAGTTTACCATGGCCACGAGGATACTG
TTGAAGATGTGACCTTCTGTCCATCCAATGCACAGGAGTTTTGCAGTGTGGGAGATGATTCTTGTCTAATATTATGGGATGCCCGTACAGGCTCTAACCCAGCTGTCAAG
GTTGAAAAAGCACATAATGCTGATCTTCATTGTGTTGATTGGAATCCCCATGACGATAATCTTATCATAACAGGGTCAGCCGATAATTCTATTCGCTTGTTTGATCGTCG
AAATCTCACTTCTAATGGAGTTGGTTCACCTATCTATAAATTTGAGGGCCACAAAGCAGCCGTTCTTTGTGTTCAGTGGTCCCCAGATAAATCATCTGTCTTTGGAAGTT
CTGCGGAGGATGGACTGTTAAATATTTGGGATTACGATAAGGTTGGTAAAAAGACAGAGCGAGCTACAAGGACGCCTGCTGCTCCTCCAGGCTTATTTTTCCAGCATGCT
GGGCACAGGGATAAAGTCGTTGACTTCCATTGGAATGCATCTGATCCATGGACTGTTGTCAGTGTGTCTGATGATTGTGATACAACTGGTGGAGGAGGAACGTTGCAGAT
ATGGCGCATGAGTGATCTAATCTATCGGCCAGAAGATGAGGTGTTAACTGAGCTTGAAAAATTCAAATCTCACGTAATTGAATGTGCTGCAAAGCCTTGA
mRNA sequence
ATGGACTCTTCTCAGTCGCAGCAGCAGCAACAGCCGCAGCAGCAGCCGCAGCAGCAGCAGCCTGTTGTCAAGAAGAAGGAGACCAGAGGCCGCAAGCCCAAGCCTAAGGA
CGAGAAGAAGGACGAGCAGCAAGCTAAGAAGATGAAGGCCCAGCAACAACCCTCCGTCGATGAGCGCTACACCCAGTGGAAGTCTCTTGTTCCTGTTCTTTACGACTGGT
TCGCCAACCACAATCTCGTTTGGCCTTCTCTCTCCTGCCGGTGGGGTCCTCAGCTTGAGCAAGCGACGTATAAGAATCGACAGCGACTTTATCTTTCTGAACAGACTGAT
GGTAGTGTTCCGAATACACTGGTCATTGCAAATTGTGAAGTTGTGAAGCCTAGGGTTGCAGCTGCAGAGCACATTTCTCAGTTCAATGAAGAAGCACGCTCTCCATTTGT
AAAGAAGTACAAGACTATCATACACCCTGGTGAGGTTAACAGAATTAGGGAACTTCCCCAGAATGCTAGAATCGTTGCCACACACACAGATAGTCCAGATGTCCTCATTT
GGGATGTTGAGGCACAACCTAATCGTCATGCTGTCCTTGGTGCCACAAATTCTCGCCCAGATTTGATTCTGACTGGTCATCAAGAAAATGCCGAGTTTGCTCTGGCAATG
TGCCCCACTGAACCTTATGTTCTCTCTGGAGGGAAGGACAAGTTAGTGGTTTTATGGAGTATCCAGGACCATATAACAACTTCTGCCACAGACCCTGCTGCTTCAAAATC
ACCAGGATCTGGTGGTTCTATCATAAAAAAGGCTGGAGAGGCAAATGATAAAGCTGCTGATGGGCCTTCTATTGGGCCACGAGGAGTTTACCATGGCCACGAGGATACTG
TTGAAGATGTGACCTTCTGTCCATCCAATGCACAGGAGTTTTGCAGTGTGGGAGATGATTCTTGTCTAATATTATGGGATGCCCGTACAGGCTCTAACCCAGCTGTCAAG
GTTGAAAAAGCACATAATGCTGATCTTCATTGTGTTGATTGGAATCCCCATGACGATAATCTTATCATAACAGGGTCAGCCGATAATTCTATTCGCTTGTTTGATCGTCG
AAATCTCACTTCTAATGGAGTTGGTTCACCTATCTATAAATTTGAGGGCCACAAAGCAGCCGTTCTTTGTGTTCAGTGGTCCCCAGATAAATCATCTGTCTTTGGAAGTT
CTGCGGAGGATGGACTGTTAAATATTTGGGATTACGATAAGGTTGGTAAAAAGACAGAGCGAGCTACAAGGACGCCTGCTGCTCCTCCAGGCTTATTTTTCCAGCATGCT
GGGCACAGGGATAAAGTCGTTGACTTCCATTGGAATGCATCTGATCCATGGACTGTTGTCAGTGTGTCTGATGATTGTGATACAACTGGTGGAGGAGGAACGTTGCAGAT
ATGGCGCATGAGTGATCTAATCTATCGGCCAGAAGATGAGGTGTTAACTGAGCTTGAAAAATTCAAATCTCACGTAATTGAATGTGCTGCAAAGCCTTGA
Protein sequence
MDSSQSQQQQQPQQQPQQQQPVVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTD
GSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLILTGHQENAEFALAM
CPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEANDKAADGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSNPAVK
VEKAHNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHA
GHRDKVVDFHWNASDPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLTELEKFKSHVIECAAKP