CuGenDBv2

MS009428 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS009428
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: WD repeat-containing protein 43
Genome location: scaffold813:1225725..1228946
RNA-Seq Expression: MS009428
Synteny: MS009428
Gene Ontology terms:
GO:0006364 - rRNA processing (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR007148 - Small-subunit processome, Utp12
IPR011047 - Quinoprotein alcohol dehydrogenase-like superfamily
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
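The genome location above uses a `seqid:start..end` form. As a minimal sketch, the snippet below splits that string and computes the span of the gene on the scaffold. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("scaffold813:1225725..1228946")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```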


Homology
GenBank top hits (columns: e value, %identity, alignment)
XP_004136081.1 WD repeat-containing protein 43 isoform X2 [Cucumis sativus] (e value: 5.8e-274; %identity: 82.3)
Query:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC
        MK E+LKS PI AFTPDGDYLAI+S N T  IWS RDGSLLAEWKD +GK D GYSC+ACCF+ KK+KSS CV+AIGT++GDVLAVNAS+GE KWVSAGC
Subjt:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC

Query:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII
        H GGVIGLSFANKG RLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFS DE+YL VAGKKL+ILSTD+GDEL+VH DKLGPVKLVS+SDDAK II
Subjt:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII

Query:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS
        TSE GAKHLQVWWC++SA K SRGP+LSMKHPPFVSEC+N+SN+EDS+VVLSVSVSG AY+W+LK+LSEDEV+P KV+VKAND QSAEENHGSAKKNR S
Subjt:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS

Query:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD
        V+ASRI G GDNEVSVLVTHGS+D PQ +L +IGY+VKED NTAHE KTLQQND  S QGPHEIEQ V TPKSKKSKKKRAAS++DSLTAGDVSDVGNGD
Subjt:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD

Query:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI
         SDVLFNDD+NEP+MGEKLASLNL DQ++D   EQE+PSVP IPPSADSVQVLLKQALHADDRALLLECLYTKD KVISKSIAQLNSSDVL LLH+LIS 
Subjt:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI

Query:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET
        IQSRGAILVCALPWLR L+LQHAS+IMSQESSLLALNSLYQLIESR STFQSA+LLSSSLDFLYT VLD+E +DND IVPIIYE E+DSD+ E+GDEMET
Subjt:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET

Query:  DEDEEEEEREEAFGDLSAGEV-DDMSE
        +ED+E +E  EAF DLSAGEV DDMSE
Subjt:  DEDEEEEEREEAFGDLSAGEV-DDMSE

XP_008461080.1 PREDICTED: WD repeat-containing protein 43 [Cucumis melo] (e value: 9.3e-280; %identity: 83.73)
Query:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC
        MKKE+LKSPPITAFTPDGDYLAI S N T  IWS RDGSLLAEWKD +GK D GYSC+ACC + KK+KSS C++AIGT+NGDVLAVNAS+GE KWVS GC
Subjt:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC

Query:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII
        H GGVIGLSFAN+GRRL TVGSNGMASEMDTETGNIIKEFKASKKSISSSAFS DEKYLAVAGKKL+ILS D+GDEL+VH DKL PVKLVSISDDAK I+
Subjt:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII

Query:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS
        TSE GAKHLQVWWCDMSA K SRGPVLSM HPPFVSEC+N+SN+EDS+VVLSVSVSG AY+W+LK+LSEDEV P KV+VKAND QSAEENHGSAKKNR+S
Subjt:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS

Query:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD
        V+AS+I   GDNEVSVLVTHGS+D PQ SL +IGY+VKED NTAHE KTLQQND  S QGPHEIEQ V TPKSKKSKKKRAASD+DS TAGDVSDVGNGD
Subjt:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD

Query:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI
        ASDV+FNDD+NEP+MGEKLASLNL DQ+ED   EQE+PSVP IPPSADSVQVLLKQALHADD ALLLECLYTKDDKVISKSIAQLNSSDVLKLLHS+IS 
Subjt:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI

Query:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET
        IQSRGAILVCALPWLRGLLLQHAS+IMSQESSLLALNSLYQLIE+RISTFQSA+LLSSSLDFLYTGVLDEE +DNDAIVPIIYE E+DSD+ E+GDEMET
Subjt:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET

Query:  DEDEEEEEREEAFGDLSAGEV-DDMSE
        DED+E +E  EAF DLSAGEV DDMSE
Subjt:  DEDEEEEEREEAFGDLSAGEV-DDMSE

XP_022150139.1 WD repeat-containing protein 43 [Momordica charantia] (e value: 0.0e+00; %identity: 98.89)
Query:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRK---KKKSSCVIAIGTDNGDVLAVNASSGETKWVSA
        MKKEMLKSPPITAFTPDGDYLAILSPNET  IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRK   KKKSSCVIAIGTDNGDVLAVNASSGETKWVSA
Subjt:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRK---KKKSSCVIAIGTDNGDVLAVNASSGETKWVSA

Query:  GCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKA
        GCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKA
Subjt:  GCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKA

Query:  IITSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNR
        IITSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNR
Subjt:  IITSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNR

Query:  ISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGN
        ISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVS VGN
Subjt:  ISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGN

Query:  GDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLI
        GDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLI
Subjt:  GDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLI

Query:  SIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEM
        SIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEM
Subjt:  SIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEM

Query:  ETDEDEEEEEREEAFGDLSAGEVDDMSE
        ETDEDEEEEEREEAFGDLSAGEVDD+SE
Subjt:  ETDEDEEEEEREEAFGDLSAGEVDDMSE

XP_031744699.1 WD repeat-containing protein 43 isoform X1 [Cucumis sativus] (e value: 9.3e-272; %identity: 81.26)
Query:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC
        MK E+LKS PI AFTPDGDYLAI+S N T  IWS RDGSLLAEWKD +GK D GYSC+ACCF+ KK+KSS CV+AIGT++GDVLAVNAS+GE KWVSAGC
Subjt:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC

Query:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII
        H GGVIGLSFANKG RLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFS DE+YL VAGKKL+ILSTD+GDEL+VH DKLGPVKLVS+SDDAK II
Subjt:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII

Query:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS
        TSE GAKHLQVWWC++SA K SRGP+LSMKHPPFVSEC+N+SN+EDS+VVLSVSVSG AY+W+LK+LSEDEV+P KV+VKAND QSAEENHGSAKKNR S
Subjt:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS

Query:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSD-----
        V+ASRI G GDNEVSVLVTHGS+D PQ +L +IGY+VKED NTAHE KTLQQND  S QGPHEIEQ V TPKSKKSKKKRAAS++DSLTAGDVSD     
Subjt:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSD-----

Query:  ---VGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLK
           VGNGD SDVLFNDD+NEP+MGEKLASLNL DQ++D   EQE+PSVP IPPSADSVQVLLKQALHADDRALLLECLYTKD KVISKSIAQLNSSDVL 
Subjt:  ---VGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLK

Query:  LLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQ
        LLH+LIS IQSRGAILVCALPWLR L+LQHAS+IMSQESSLLALNSLYQLIESR STFQSA+LLSSSLDFLYT VLD+E +DND IVPIIYE E+DSD+ 
Subjt:  LLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQ

Query:  ESGDEMETDEDEEEEEREEAFGDLSAGEV-DDMSE
        E+GDEMET+ED+E +E  EAF DLSAGEV DDMSE
Subjt:  ESGDEMETDEDEEEEEREEAFGDLSAGEV-DDMSE

XP_038899613.1 WD repeat-containing protein 43 [Benincasa hispida] (e value: 2.4e-288; %identity: 85.65)
Query:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC
        MKKE+LKSPPITAFTPDGDYLAILS N T  IWS RDGSLLAEWKD +GK DVGYSC+ACCF  KK+K+S CV+A+GT++GDVLAVNAS+GE KWVSAGC
Subjt:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC

Query:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII
        H+GGVIGLSFANKGRRL  VGSNG  SEMDTETGNIIKEFKASKKSISSS+FS DEKYLAVAGKKL+ILS D+GDELMVH DKLGPVKLVS+SDDAK II
Subjt:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII

Query:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS
        TSE GAKHLQVWWCDMSA KLSRGPVLSMKHPPFVSEC+N+SN+ED++VVLSVSVSGVAY+W+LKILSEDEVSP KV+VKAND QSAEENHGSAKKNR+S
Subjt:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS

Query:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD
        VIASRI+G GDNEVSVLVTHGSMD PQ SLF+IGYSVKED+NTA   KTLQQNDD S QGPHE+EQ V TPKSKK KKKRAASD+DSLT GD+SDVGNGD
Subjt:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD

Query:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI
        ASDV+FNDD+NEP+MGEKLASLNL+DQ+EDE  E +EPSVPAIPPSADSVQVLLKQALHA+DR LLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLIS 
Subjt:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI

Query:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET
        IQSRGAILVC LPWLRGLLLQHAS+IMSQESSLLALNSLYQLIESRISTFQSA+LLSSSLDFLY+GVLDEEVD+NDAIVPIIYE EDDSDD+ESGDEMET
Subjt:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET

Query:  DEDEEEEEREEAFGDLSAGEV-DDMSE
        DEDEE +E  EAF DLSAGEV DDMSE
Subjt:  DEDEEEEEREEAFGDLSAGEV-DDMSE
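The unlabeled middle line of each alignment block appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a similar residue, and a blank marks a mismatch or gap. Under that assumption, the sketch below counts identities and positives for the final block of the Benincasa hispida alignment above. The helper name `midline_counts` is hypothetical, and the result describes only this one block, not the whole-alignment %identity reported in the table.

```python
def midline_counts(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # Assumption: a letter in the midline means the residues are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Assumption: '+' means a conservative (similar) substitution.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "DEDEEEEEREEAFGDLSAGEV-DDMSE"
midline = "DEDEE +E  EAF DLSAGEV DDMSE"

identities, positives, columns = midline_counts(query, midline)
percent_identity = 100.0 * identities / columns
print(identities, positives, columns, round(percent_identity, 2))
```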

TrEMBL top hits (columns: e value, %identity, alignment)
A0A0A0KBW9 Utp12 domain-containing protein (e value: 2.8e-274; %identity: 82.3)
Query:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC
        MK E+LKS PI AFTPDGDYLAI+S N T  IWS RDGSLLAEWKD +GK D GYSC+ACCF+ KK+KSS CV+AIGT++GDVLAVNAS+GE KWVSAGC
Subjt:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC

Query:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII
        H GGVIGLSFANKG RLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFS DE+YL VAGKKL+ILSTD+GDEL+VH DKLGPVKLVS+SDDAK II
Subjt:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII

Query:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS
        TSE GAKHLQVWWC++SA K SRGP+LSMKHPPFVSEC+N+SN+EDS+VVLSVSVSG AY+W+LK+LSEDEV+P KV+VKAND QSAEENHGSAKKNR S
Subjt:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS

Query:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD
        V+ASRI G GDNEVSVLVTHGS+D PQ +L +IGY+VKED NTAHE KTLQQND  S QGPHEIEQ V TPKSKKSKKKRAAS++DSLTAGDVSDVGNGD
Subjt:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD

Query:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI
         SDVLFNDD+NEP+MGEKLASLNL DQ++D   EQE+PSVP IPPSADSVQVLLKQALHADDRALLLECLYTKD KVISKSIAQLNSSDVL LLH+LIS 
Subjt:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI

Query:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET
        IQSRGAILVCALPWLR L+LQHAS+IMSQESSLLALNSLYQLIESR STFQSA+LLSSSLDFLYT VLD+E +DND IVPIIYE E+DSD+ E+GDEMET
Subjt:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET

Query:  DEDEEEEEREEAFGDLSAGEV-DDMSE
        +ED+E +E  EAF DLSAGEV DDMSE
Subjt:  DEDEEEEEREEAFGDLSAGEV-DDMSE

A0A1S3CDX9 WD repeat-containing protein 43 (e value: 4.5e-280; %identity: 83.73)
Query:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC
        MKKE+LKSPPITAFTPDGDYLAI S N T  IWS RDGSLLAEWKD +GK D GYSC+ACC + KK+KSS C++AIGT+NGDVLAVNAS+GE KWVS GC
Subjt:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGC

Query:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII
        H GGVIGLSFAN+GRRL TVGSNGMASEMDTETGNIIKEFKASKKSISSSAFS DEKYLAVAGKKL+ILS D+GDEL+VH DKL PVKLVSISDDAK I+
Subjt:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII

Query:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS
        TSE GAKHLQVWWCDMSA K SRGPVLSM HPPFVSEC+N+SN+EDS+VVLSVSVSG AY+W+LK+LSEDEV P KV+VKAND QSAEENHGSAKKNR+S
Subjt:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS

Query:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD
        V+AS+I   GDNEVSVLVTHGS+D PQ SL +IGY+VKED NTAHE KTLQQND  S QGPHEIEQ V TPKSKKSKKKRAASD+DS TAGDVSDVGNGD
Subjt:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD

Query:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI
        ASDV+FNDD+NEP+MGEKLASLNL DQ+ED   EQE+PSVP IPPSADSVQVLLKQALHADD ALLLECLYTKDDKVISKSIAQLNSSDVLKLLHS+IS 
Subjt:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI

Query:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET
        IQSRGAILVCALPWLRGLLLQHAS+IMSQESSLLALNSLYQLIE+RISTFQSA+LLSSSLDFLYTGVLDEE +DNDAIVPIIYE E+DSD+ E+GDEMET
Subjt:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET

Query:  DEDEEEEEREEAFGDLSAGEV-DDMSE
        DED+E +E  EAF DLSAGEV DDMSE
Subjt:  DEDEEEEEREEAFGDLSAGEV-DDMSE

A0A5A7UYX1 WD repeat-containing protein 43 (e value: 4.4e-267; %identity: 84.23)
Query:  IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGCHIGGVIGLSFANKGRRLRTVGSNGMASEMDT
        IWS RDGSLLAEWKD +GK D GYSC+ACC + KK+KSS CV+AIGT+NGDVLAVNAS+GE KWVS GCH GGVIGLSFAN+GRRL TVGSNGMASEMDT
Subjt:  IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSS-CVIAIGTDNGDVLAVNASSGETKWVSAGCHIGGVIGLSFANKGRRLRTVGSNGMASEMDT

Query:  ETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAIITSEFGAKHLQVWWCDMSARKLSRGPVLSMKH
        ETGNIIKEFKASKKSISSSAFS DEKYLAVAGKKL+ILS D+GDEL+VH DKL PVKLVSISDDAK IITSE GAKHLQVWWCDMSA K SRGPVLSM H
Subjt:  ETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAIITSEFGAKHLQVWWCDMSARKLSRGPVLSMKH

Query:  PPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLF
        PPFVSEC+N+SN+EDS+VVLSVSVSG AY+W+LK+LSEDEV P KV+VKAND QSAEENHGSAKKNR+SV+ASRI   GDNEVSVLVTHGS+D PQ SL 
Subjt:  PPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLF

Query:  NIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDE
        +IGY+VKED NTAHE KTLQQND  S QGPHEIEQ V  PKSKKSKKKRAASD+DS TAGDVSDVGNGDASDV+FNDD+NEP+MGEKLASLNL DQ+ED 
Subjt:  NIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDE

Query:  SHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQES
          EQE+PSVP IPPSADSVQVLLKQALHADD ALLLECLYTKDDKVISKSIAQLNSSDVLKLLHS+IS IQSRGAILVCALPWLRGLLLQHAS+IMSQES
Subjt:  SHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQES

Query:  SLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMETDEDEEEEEREEAFGDLSAGEV-DDMSE
        SLLALNSLYQLIE+RISTFQSA+LLSSSLDFLYTGVLDEE +DNDAIVPIIYE E+DSD+ E+GDEMETDED+E +E  EAF DLSAGEV DDMSE
Subjt:  SLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMETDEDEEEEEREEAFGDLSAGEV-DDMSE

A0A6J1D938 WD repeat-containing protein 43 (e value: 0.0e+00; %identity: 98.89)
Query:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRK---KKKSSCVIAIGTDNGDVLAVNASSGETKWVSA
        MKKEMLKSPPITAFTPDGDYLAILSPNET  IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRK   KKKSSCVIAIGTDNGDVLAVNASSGETKWVSA
Subjt:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRK---KKKSSCVIAIGTDNGDVLAVNASSGETKWVSA

Query:  GCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKA
        GCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKA
Subjt:  GCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKA

Query:  IITSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNR
        IITSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNR
Subjt:  IITSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNR

Query:  ISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGN
        ISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVS VGN
Subjt:  ISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGN

Query:  GDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLI
        GDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLI
Subjt:  GDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLI

Query:  SIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEM
        SIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEM
Subjt:  SIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEM

Query:  ETDEDEEEEEREEAFGDLSAGEVDDMSE
        ETDEDEEEEEREEAFGDLSAGEVDD+SE
Subjt:  ETDEDEEEEEREEAFGDLSAGEVDDMSE

A0A6J1HJF3 WD repeat-containing protein 43 isoform X1 (e value: 2.2e-266; %identity: 80.86)
Query:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKK-SSCVIAIGTDNGDVLAVNASSGETKWVSAGC
        MK E  KSPPITAFTP+GDYLAILS N T  IW+  DGSLLAEWKD +GKTD GYSC+ACCFV KK+K SSC+IAIGT+ GDVL VNASSGETKWVSAGC
Subjt:  MKKEMLKSPPITAFTPDGDYLAILSPNET--IWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKK-SSCVIAIGTDNGDVLAVNASSGETKWVSAGC

Query:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII
        H+GGVIGLSFA+KGRRL TVGSNG+A +M+ ETG+II EFKASKKSISSSAFS DEKYLAVAGKKL+ILSTD+G ELMVH DKLGPVKL SISDDAK II
Subjt:  HIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAII

Query:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS
        TSE GAKH+QVWWCDMSA KLSRGPVLSMKHPPFVSEC+NI+N EDSIVVLSVSVSGVAY+W+LK LSED+V+P KVTVK N+ +SAEENHGSAKKNRIS
Subjt:  TSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRIS

Query:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD
        V++S I G GDNEVSVLVTHGSMD PQ ++ NIGY  KED N A EK           +GPHEI+QAVT+PKSKKSKKKRAASD+DS  AGDVSDVGN D
Subjt:  VIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGD

Query:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI
         S+VLFNDD+NEPTMG+KLASLNL +Q+EDE+HEQ+EPSVPAIPPSADSVQVLLKQAL ADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLH+LISI
Subjt:  ASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISI

Query:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET
        IQSRGAILVCA+PWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSA+LLSSSLDFLYTGVLDEE ++NDAIVPIIYE++D  D + SGDEMET
Subjt:  IQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMET

Query:  DEDEEEEEREEAFGDLSAGEV-DDMSE
        DE+    E  EAF DLSAGEV DDMSE
Subjt:  DEDEEEEEREEAFGDLSAGEV-DDMSE

SwissProt top hits (columns: e value, %identity, alignment)
Q15061 WD repeat-containing protein 43 (e value: 2.5e-09; %identity: 23.97)
Query:  RKKKKSSCV--------IAIGTDNGDVLAVNASSGE--TKWVSAGCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSP
        RKK+KS  V        +A+GT  G +L  +   GE  +K +S G H   V  + +      L +   +    E + +T  +  ++K    S+SS   SP
Subjt:  RKKKKSSCV--------IAIGTDNGDVLAVNASSGE--TKWVSAGCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSP

Query:  DEKYLAVAGK--KLRILSTD------NGDELMVHSDKLGPVKLVSISDDAKAIITSEF--GAKH---LQVWWCDMSARKLSRGPVLSMKHPPFVSECKNI
        D K L  AG+  KL +L T        G    V S     ++  + S     I    F  GA H   L VW      ++ S     ++   P   +    
Subjt:  DEKYLAVAGK--KLRILSTD------NGDELMVHSDKLGPVKLVSISDDAKAIITSEF--GAKH---LQVWWCDMSARKLSRGPVLSMKHPPFVSECKNI

Query:  SNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDL
         N+E+ + +  V   G  +++   IL+     P       ++        G  KK+    I     G   +++S+L+ +GS  QP +         +  L
Subjt:  SNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDL

Query:  NTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKS--KKSKKKRAASDIDSLTAGDVSDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPS
        N+      L +  D S     ++E A+T  ++    S+ K     I    A         +  +       NE ++ E+L ++++      ++H++ +  
Subjt:  NTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKS--KKSKKKRAASDIDSLTAGDVSDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPS

Query:  VPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSL
        +       +S  VLL Q L ++D  +L + L T++  +I K++ ++    ++ LL  L   +Q      V  + WL+ +L  HAS + +    +  L +L
Subjt:  VPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSL

Query:  YQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVP-----IIYEDE-------DDSDDQESGDEMETDEDEEEEEREE
        YQL+ESR+ TFQ    L   L  L T V   E     A  P     ++YE+E       D+  D++S D  + DE+E E E++E
Subjt:  YQLIESRISTFQSAVLLSSSLDFLYTGVLDEEVDDNDAIVP-----IIYEDE-------DDSDDQESGDEMETDEDEEEEEREE

Q6ZQL4 WD repeat-containing protein 43 (e value: 2.3e-07; %identity: 22.73)
Query:  AFTPDGD-YLAILSPNE--TIWSARDGSLLAEWKDSEGKTDVGYSCLACC---------FVRKKKKSSC--------VIAIGTDNGDVLAVNASSGET-K
        AF+PD   Y A+ S +    +W   +  L  E+  S   +    +CLA             RKK+KS          ++A+GT  G +L  +   GE   
Subjt:  AFTPDGD-YLAILSPNE--TIWSARDGSLLAEWKDSEGKTDVGYSCLACC---------FVRKKKKSSC--------VIAIGTDNGDVLAVNASSGET-K

Query:  WVSAGCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGD---ELMVHSDKLGPVKLVS
         +++G H   V  + +      L +   +    E  T+T  +  ++K    S+SS   SPD K L  AG+ +++   +  +       H+  +  ++  +
Subjt:  WVSAGCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGD---ELMVHSDKLGPVKLVS

Query:  I----SDDAKAIITSEF--GAKH---LQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKAN
        I    S  +  I    F  GA H   L VW      ++ S     ++   P   +     N+E+ + +  V   G  +++   IL+     P        
Subjt:  I----SDDAKAIITSEF--GAKH---LQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKAN

Query:  DAQSAEENHGSAKKNRISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKS--KKSKKKR
         A   +    + K   I   +  +D     ++S+L+ +G+  QP +         +  LN+      L++  D S      +E A+T  K+    S+ K 
Subjt:  DAQSAEENHGSAKKNRISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKS--KKSKKKR

Query:  AASDIDSLTAGDVSDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISK
            I    A           ++        E T+ E+L +++L     D    +++          +S  VLL Q L ++D  +L + L TK+  +I +
Subjt:  AASDIDSLTAGDVSDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAIPPSADSVQVLLKQALHADDRALLLECLYTKDDKVISK

Query:  SIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEE----VDDND
        ++ ++    V+ LL  L   +Q         + WL+ +L  HAS + +    +  L +LYQL+ESR+ TFQ    L   L  L T V   E    +    
Subjt:  SIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLDEE----VDDND

Query:  AIVPIIYEDEDDSDDQESGDEM-ETDEDEEEEEREEAFGDLSAGEVDDMSE
            ++YE+E  S ++ES DE+ E D D+  +E E+   +   G  +D  E
Subjt:  AIVPIIYEDEDDSDDQESGDEM-ETDEDEEEEEREEAFGDLSAGEVDDMSE

Arabidopsis top hits (columns: e value, %identity, alignment)
AT1G15420.1 CONTAINS InterPro DOMAIN/s: Small-subunit processome, Utp12 (InterPro:IPR007148); Has 764 Blast hits to 656 proteins in 193 species: Archae - 0; Bacteria - 42; Metazoa - 237; Fungi - 154; Plants - 85; Viruses - 23; Other Eukaryotes - 223 (source: NCBI BLink). (e value: 7.8e-51; %identity: 52.12)
Query:  KSKKSKKKRAASDIDSLTAGDVSDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPA--IPPSADSVQVLLKQALHADDRALLLEC
        K KK  KKRA  + D  +  D     + D   VL +D +NEPT+G+KL SL+LL+ ++  S E    S P    PP+A SV VLL+QALHADDR+LLL+C
Subjt:  KSKKSKKKRAASDIDSLTAGDVSDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPA--IPPSADSVQVLLKQALHADDRALLLEC

Query:  LYTKDDKVISKSIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLD
        LY +D++VI+ S+A+LNS++VLKLL++L+ I+QSRGAIL C +PW++ LLL H+S IMSQESSLLALN++YQLIESR+ST  +AV +SS LD L    LD
Subjt:  LYTKDDKVISKSIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGVLD

Query:  EEVDDNDAIVPIIYEDEDDSDDQESGDE--METDEDEEEEEREEAFGDLSAGEVDDMSE
        EE D+     P+IYED+D  +D+E G E  METDE+ ++   E A G       DDMS+
Subjt:  EEVDDNDAIVPIIYEDEDDSDDQESGDE--METDEDEEEEEREEAFGDLSAGEVDDMSE

AT5G11240.1 transducin family protein / WD-40 repeat family protein (e value: 1.3e-34; %identity: 25.43)
Query:  ITAFTPDGDYLAILSPNE--TIWSARDGSLLAEWKD-------------SEGKTDVGYSCLACCFVRKKKK---SSCVIAIGTDNGDVLAVNASSGETKW
        +T+F+P  DYLA+ + +    IW    G +  E+ D              +G   V Y+C+    + KKKK    + V+ +GT  GDVLA++ +SG+ KW
Subjt:  ITAFTPDGDYLAILSPNE--TIWSARDGSLLAEWKD-------------SEGKTDVGYSCLACCFVRKKKK---SSCVIAIGTDNGDVLAVNASSGETKW

Query:  VSAGCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDD
          + CH GGV  +S + K   + + G++GM  ++D  +GN+I++FKAS K++SS   SPD K L  A  +L+  +  +  ++   +   G V+ V+ ++D
Subjt:  VSAGCHIGGVIGLSFANKGRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDD

Query:  AKAIITSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSE-DEVSPNKVTVKANDAQSAEENHGSA
         K +++S  G +++ VW  D  A+K S   VL+++HPP   +    +NE+  + VL++S  GV Y W    + E    +P KV +      +A+ +    
Subjt:  AKAIITSEFGAKHLQVWWCDMSARKLSRGPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSE-DEVSPNKVTVKANDAQSAEENHGSA

Query:  KKNRISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQ-QNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDV
        K +   + A+++ G        ++  GS      +    G  VK     + +K  LQ  ND         I   +T   SK SK++   + + +L     
Subjt:  KKNRISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNIGYSVKEDLNTAHEKKTLQ-QNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDV

Query:  SDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDE---DESHEQEEPSVPAIPPSADSVQVLLKQALHAD-DRALLLECLYTK---DDKVISKSIAQLN
         D        +    D++E    +K   L+  D+D    D+SH     +  ++     S+ +L     H +   A +++    K     K +  ++  + 
Subjt:  SDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDE---DESHEQEEPSVPAIPPSADSVQVLLKQALHAD-DRALLLECLYTK---DDKVISKSIAQLN

Query:  SSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQE-SSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGV----------------LD
         S   K L +L ++ Q+R       LPW+  +++ H+  IMSQE  +   LN+L ++ +SR +  Q  + LS  L  +   +                +D
Subjt:  SSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQE-SSLLALNSLYQLIESRISTFQSAVLLSSSLDFLYTGV----------------LD

Query:  EEVDDNDAIVPIIYEDEDDSDDQESGDEMETDE
        E  D+ + +    Y + D+  D  S D  + D+
Subjt:  EEVDDNDAIVPIIYEDEDDSDDQESGDEMETDE


Sequences
CDS sequence
ATGAAGAAGGAAATGCTGAAATCGCCGCCCATCACAGCTTTCACACCCGACGGCGACTATCTGGCTATCTTGTCCCCCAACGAAACTATTTGGAGTGCTCGTGATGGAAG
TTTACTGGCAGAGTGGAAGGATTCTGAGGGAAAAACCGATGTGGGTTATTCCTGTTTGGCCTGCTGTTTTGTGCGGAAAAAGAAAAAGAGTTCTTGTGTAATTGCCATCG
GTACCGATAATGGGGATGTGTTGGCTGTAAATGCTTCAAGTGGTGAGACGAAGTGGGTATCTGCAGGTTGCCATATTGGTGGAGTTATTGGCCTTTCTTTTGCGAACAAA
GGCCGTAGACTGCGGACGGTTGGAAGTAATGGAATGGCATCTGAGATGGACACTGAAACAGGAAACATTATCAAGGAGTTCAAAGCTTCGAAAAAATCAATCTCTTCTTC
AGCCTTTTCACCTGATGAGAAGTACTTAGCTGTTGCTGGCAAAAAGTTGAGGATTTTAAGCACAGATAACGGGGATGAGCTTATGGTGCACTCTGATAAATTGGGTCCTG
TGAAGCTTGTTTCTATATCTGATGATGCCAAAGCAATAATTACATCAGAATTTGGAGCCAAACATCTTCAAGTGTGGTGGTGTGACATGAGTGCAAGAAAACTTAGTAGA
GGTCCTGTTCTTTCCATGAAACATCCTCCATTTGTTTCTGAGTGCAAAAATATTAGCAATGAAGAAGATAGCATAGTTGTCTTGTCAGTATCAGTATCGGGTGTAGCTTA
TGTATGGAGATTAAAGATTCTATCAGAAGACGAGGTTAGTCCAAATAAAGTCACTGTTAAAGCTAATGACGCCCAATCAGCTGAGGAAAACCATGGAAGTGCTAAGAAGA
ATCGAATTTCTGTCATCGCTTCCAGAATAGATGGTTCAGGAGACAATGAAGTGTCAGTTCTTGTTACTCATGGCTCCATGGACCAACCGCAGCTTAGTCTTTTTAATATT
GGTTATTCCGTGAAGGAAGACCTAAATACTGCGCATGAGAAGAAAACCCTCCAACAAAATGATGATTTTTCTGGACAAGGTCCCCATGAGATCGAACAAGCAGTTACTAC
GCCTAAAAGTAAGAAAAGCAAAAAGAAAAGAGCAGCATCTGATATTGATAGTCTGACAGCTGGAGATGTCAGCGATGTTGGCAATGGAGACGCATCTGATGTTTTATTCA
ATGATGATATAAATGAGCCAACCATGGGAGAGAAACTTGCAAGTTTGAATCTGCTAGACCAGGACGAAGATGAGAGCCATGAACAAGAAGAACCTTCCGTCCCTGCAATA
CCACCAAGTGCAGACTCTGTTCAGGTTTTGCTCAAGCAAGCACTACATGCTGACGATCGCGCCCTTTTGCTAGAATGCTTATATACCAAGGATGATAAGGTTATCTCAAA
ATCAATAGCACAATTGAATTCATCTGATGTTCTTAAGCTTTTGCACTCTCTGATATCCATTATCCAGTCAAGAGGGGCAATTCTTGTATGTGCCCTCCCTTGGCTGAGAG
GTTTACTTCTCCAACATGCAAGTAGAATAATGTCCCAAGAATCTTCTCTGCTCGCCCTGAATTCTCTATATCAGCTCATTGAGTCTAGAATTTCAACTTTCCAATCCGCT
GTTCTGCTATCAAGTAGCTTGGACTTCCTTTACACGGGGGTTCTTGATGAGGAGGTGGATGACAATGATGCCATTGTGCCGATTATTTACGAGGACGAGGACGACAGCGA
TGATCAGGAATCGGGAGATGAAATGGAAACTGATGAAGATGAAGAAGAAGAAGAAAGAGAAGAAGCTTTTGGTGATCTTAGTGCTGGTGAAGTTGATGACATGAGTGAG
mRNA sequence
ATGAAGAAGGAAATGCTGAAATCGCCGCCCATCACAGCTTTCACACCCGACGGCGACTATCTGGCTATCTTGTCCCCCAACGAAACTATTTGGAGTGCTCGTGATGGAAG
TTTACTGGCAGAGTGGAAGGATTCTGAGGGAAAAACCGATGTGGGTTATTCCTGTTTGGCCTGCTGTTTTGTGCGGAAAAAGAAAAAGAGTTCTTGTGTAATTGCCATCG
GTACCGATAATGGGGATGTGTTGGCTGTAAATGCTTCAAGTGGTGAGACGAAGTGGGTATCTGCAGGTTGCCATATTGGTGGAGTTATTGGCCTTTCTTTTGCGAACAAA
GGCCGTAGACTGCGGACGGTTGGAAGTAATGGAATGGCATCTGAGATGGACACTGAAACAGGAAACATTATCAAGGAGTTCAAAGCTTCGAAAAAATCAATCTCTTCTTC
AGCCTTTTCACCTGATGAGAAGTACTTAGCTGTTGCTGGCAAAAAGTTGAGGATTTTAAGCACAGATAACGGGGATGAGCTTATGGTGCACTCTGATAAATTGGGTCCTG
TGAAGCTTGTTTCTATATCTGATGATGCCAAAGCAATAATTACATCAGAATTTGGAGCCAAACATCTTCAAGTGTGGTGGTGTGACATGAGTGCAAGAAAACTTAGTAGA
GGTCCTGTTCTTTCCATGAAACATCCTCCATTTGTTTCTGAGTGCAAAAATATTAGCAATGAAGAAGATAGCATAGTTGTCTTGTCAGTATCAGTATCGGGTGTAGCTTA
TGTATGGAGATTAAAGATTCTATCAGAAGACGAGGTTAGTCCAAATAAAGTCACTGTTAAAGCTAATGACGCCCAATCAGCTGAGGAAAACCATGGAAGTGCTAAGAAGA
ATCGAATTTCTGTCATCGCTTCCAGAATAGATGGTTCAGGAGACAATGAAGTGTCAGTTCTTGTTACTCATGGCTCCATGGACCAACCGCAGCTTAGTCTTTTTAATATT
GGTTATTCCGTGAAGGAAGACCTAAATACTGCGCATGAGAAGAAAACCCTCCAACAAAATGATGATTTTTCTGGACAAGGTCCCCATGAGATCGAACAAGCAGTTACTAC
GCCTAAAAGTAAGAAAAGCAAAAAGAAAAGAGCAGCATCTGATATTGATAGTCTGACAGCTGGAGATGTCAGCGATGTTGGCAATGGAGACGCATCTGATGTTTTATTCA
ATGATGATATAAATGAGCCAACCATGGGAGAGAAACTTGCAAGTTTGAATCTGCTAGACCAGGACGAAGATGAGAGCCATGAACAAGAAGAACCTTCCGTCCCTGCAATA
CCACCAAGTGCAGACTCTGTTCAGGTTTTGCTCAAGCAAGCACTACATGCTGACGATCGCGCCCTTTTGCTAGAATGCTTATATACCAAGGATGATAAGGTTATCTCAAA
ATCAATAGCACAATTGAATTCATCTGATGTTCTTAAGCTTTTGCACTCTCTGATATCCATTATCCAGTCAAGAGGGGCAATTCTTGTATGTGCCCTCCCTTGGCTGAGAG
GTTTACTTCTCCAACATGCAAGTAGAATAATGTCCCAAGAATCTTCTCTGCTCGCCCTGAATTCTCTATATCAGCTCATTGAGTCTAGAATTTCAACTTTCCAATCCGCT
GTTCTGCTATCAAGTAGCTTGGACTTCCTTTACACGGGGGTTCTTGATGAGGAGGTGGATGACAATGATGCCATTGTGCCGATTATTTACGAGGACGAGGACGACAGCGA
TGATCAGGAATCGGGAGATGAAATGGAAACTGATGAAGATGAAGAAGAAGAAGAAAGAGAAGAAGCTTTTGGTGATCTTAGTGCTGGTGAAGTTGATGACATGAGTGAG
Protein sequence
MKKEMLKSPPITAFTPDGDYLAILSPNETIWSARDGSLLAEWKDSEGKTDVGYSCLACCFVRKKKKSSCVIAIGTDNGDVLAVNASSGETKWVSAGCHIGGVIGLSFANK
GRRLRTVGSNGMASEMDTETGNIIKEFKASKKSISSSAFSPDEKYLAVAGKKLRILSTDNGDELMVHSDKLGPVKLVSISDDAKAIITSEFGAKHLQVWWCDMSARKLSR
GPVLSMKHPPFVSECKNISNEEDSIVVLSVSVSGVAYVWRLKILSEDEVSPNKVTVKANDAQSAEENHGSAKKNRISVIASRIDGSGDNEVSVLVTHGSMDQPQLSLFNI
GYSVKEDLNTAHEKKTLQQNDDFSGQGPHEIEQAVTTPKSKKSKKKRAASDIDSLTAGDVSDVGNGDASDVLFNDDINEPTMGEKLASLNLLDQDEDESHEQEEPSVPAI
PPSADSVQVLLKQALHADDRALLLECLYTKDDKVISKSIAQLNSSDVLKLLHSLISIIQSRGAILVCALPWLRGLLLQHASRIMSQESSLLALNSLYQLIESRISTFQSA
VLLSSSLDFLYTGVLDEEVDDNDAIVPIIYEDEDDSDDQESGDEMETDEDEEEEEREEAFGDLSAGEVDDMSE
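The protein sequence should correspond to a codon-by-codon translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard genetic code; the record does not name the translation table used, and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 60 bases of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 60 bases of the CDS sequence listed above.
cds_start = "ATGAAGAAGGAAATGCTGAAATCGCCGCCCATCACAGCTTTCACACCCGACGGCGACTAT"
protein_start = translate(cds_start)
print(protein_start)
```

To check the whole record, join the CDS lines into one string, translate it, and compare the result with the joined protein lines.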