; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G002010 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G002010
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionWD repeat-containing protein 74-like
Genome locationCmo_Chr18:1345647..1352433
RNA-Seq ExpressionCmoCh18G002010
SyntenyCmoCh18G002010
Gene Ontology termsGO:0042273 - ribosomal large subunit biogenesis (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0030687 - preribosome, large subunit precursor (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR036322 - WD40-repeat-containing domain superfamily
IPR037379 - WDR74/Nsa1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573118.1 WD repeat-containing protein 74, partial [Cucurbita argyrosperma subsp. sororia]7.8e-23699.06Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP

Query:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
        KDDAIIGMHLF KDELALASR CTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVC SGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
Subjt:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA

Query:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
        KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
Subjt:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC

Query:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE-DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK
        SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK
Subjt:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE-DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK

Query:  RKGASKENKKSKKKSHGETESKQR
        RKGASKENKKSKKKSHGETESKQR
Subjt:  RKGASKENKKSKKKSHGETESKQR

KAG7012304.1 WD repeat-containing protein 74 [Cucurbita argyrosperma subsp. argyrosperma]1.0e-23598.82Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP

Query:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
        KDDAIIGMHLF KDELALASR CTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVC SGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
Subjt:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA

Query:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
        KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
Subjt:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC

Query:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE--DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGK
        SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE  DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGK
Subjt:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE--DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGK

Query:  KRKGASKENKKSKKKSHGETESKQR
        KRKGASKENKKSKKKSHGETESKQR
Subjt:  KRKGASKENKKSKKKSHGETESKQR

XP_022955240.1 WD repeat-containing protein 74-like isoform X1 [Cucurbita moschata]6.4e-23899.76Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP

Query:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
        KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
Subjt:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA

Query:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
        KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
Subjt:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC

Query:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE-DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK
        SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK
Subjt:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE-DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK

Query:  RKGASKENKKSKKKSHGETESKQR
        RKGASKENKKSKKKSHGETESKQR
Subjt:  RKGASKENKKSKKKSHGETESKQR

XP_022955242.1 WD repeat-containing protein 74-like isoform X2 [Cucurbita moschata]2.6e-239100Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP

Query:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
        KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
Subjt:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA

Query:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
        KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
Subjt:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC

Query:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR
        SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR
Subjt:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR

Query:  KGASKENKKSKKKSHGETESKQR
        KGASKENKKSKKKSHGETESKQR
Subjt:  KGASKENKKSKKKSHGETESKQR

XP_022994307.1 WD repeat-containing protein 74 isoform X2 [Cucurbita maxima]8.1e-23397.4Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKD EIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTD+SPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP

Query:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
        KDDAIIGMHLF KDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTN VKTWKVC SGDV+CSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
Subjt:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA

Query:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
        KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
Subjt:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC

Query:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR
        SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGED THSAV+SIQQDIEI QT SEEEHTPQKRKKASKEDGEGKKR
Subjt:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR

Query:  KGASKENKKSKKKSHGETESKQR
        KGASKENKK KKKSHGETESKQR
Subjt:  KGASKENKKSKKKSHGETESKQR

TrEMBL top hitse value%identityAlignment
A0A6J1EA63 WD repeat-containing protein 74-like4.7e-21088.06Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSS-P
        MPRTTKVDCPGCPPLRAL FDVLGL+KVI+A+GK+GEIPKVVERWGEPD+SKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTD+S P
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSS-P

Query:  PKDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWT
        PKD+AI+GMHL  KDE  LASRRCTLLSCT KGNASMR+I+FSSSSSED STN  +TWK+CSSGDVMCSKVDGSETHALFGGKGVEVN WNLEQCTKIWT
Subjt:  PKDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGK
        AKAPKKNSLGIFTPT FTS TFLSKDDHRKFAAGT+SHQVRLYDISAQKRPVISFDF ETPIKALAEDVDGNTIFVGNA+GDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK
        CSGSIRSIARHPEF +IASCGLDSYVRFWDIKTRQLLSAVFLKQHL GVVFDSHFV ED+THSAVE IQQ+ E+ QTV+EEEH PQKRKKA KEDGEG K
Subjt:  CSGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK

Query:  RKGA---SKENKKSKKKSHGETESKQR
        RKG+    KENKKS++KSH E ESKQR
Subjt:  RKGA---SKENKKSKKKSHGETESKQR

A0A6J1GT09 WD repeat-containing protein 74-like isoform X21.3e-239100Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP

Query:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
        KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
Subjt:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA

Query:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
        KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
Subjt:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC

Query:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR
        SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR
Subjt:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR

Query:  KGASKENKKSKKKSHGETESKQR
        KGASKENKKSKKKSHGETESKQR
Subjt:  KGASKENKKSKKKSHGETESKQR

A0A6J1GUM0 WD repeat-containing protein 74-like isoform X13.1e-23899.76Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP

Query:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
        KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
Subjt:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA

Query:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
        KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
Subjt:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC

Query:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE-DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK
        SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK
Subjt:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE-DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK

Query:  RKGASKENKKSKKKSHGETESKQR
        RKGASKENKKSKKKSHGETESKQR
Subjt:  RKGASKENKKSKKKSHGETESKQR

A0A6J1JVF3 WD repeat-containing protein 74 isoform X23.9e-23397.4Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKD EIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTD+SPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP

Query:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
        KDDAIIGMHLF KDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTN VKTWKVC SGDV+CSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
Subjt:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA

Query:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
        KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
Subjt:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC

Query:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR
        SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGED THSAV+SIQQDIEI QT SEEEHTPQKRKKASKEDGEGKKR
Subjt:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR

Query:  KGASKENKKSKKKSHGETESKQR
        KGASKENKK KKKSHGETESKQR
Subjt:  KGASKENKKSKKKSHGETESKQR

A0A6J1JYR5 WD repeat-containing protein 74 isoform X19.7e-23297.17Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKD EIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTD+SPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPP

Query:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
        KDDAIIGMHLF KDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTN VKTWKVC SGDV+CSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA
Subjt:  KDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTA

Query:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
        KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC
Subjt:  KAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKC

Query:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE-DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK
        SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE D THSAV+SIQQDIEI QT SEEEHTPQKRKKASKEDGEGKK
Subjt:  SGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGE-DITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKK

Query:  RKGASKENKKSKKKSHGETESKQR
        RKGASKENKK KKKSHGETESKQR
Subjt:  RKGASKENKKSKKKSHGETESKQR

SwissProt top hitse value%identityAlignment
Q54FW9 WD repeat-containing protein DDB_G02905556.4e-1524.71Show/hide
Query:  SSSSEDISTNPVKT--WKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTAKAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVR
        ++ S+ +   PV T  + +  + ++    ++ S     FGGK V +  W+LE+  K ++AK  K + L +  P     V +++ D   K   G S  +++
Subjt:  SSSSEDISTNPVKT--WKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTAKAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVR

Query:  LYDISAQ--KRPVISFDFRETPIKALA-EDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLS
         YD+ ++  +   +   F + PI+++   +   +  +  ++ G +  +D+R  + +G F    +GS++ IA HP  P++A+ GLD ++R +++  R++L 
Subjt:  LYDISAQ--KRPVISFDFRETPIKALA-EDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLS

Query:  AVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR
         +FLKQ L+ V+F      E+ T+   E  Q++ EI + + E ++           +G  KK+
Subjt:  AVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKR

Q58D06 WD repeat-containing protein 745.1e-2029.14Show/hide
Query:  TLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMC-SKVDGSETHAL-FGGKGVEVNTWNLEQCTK-IWTAKAPKKNSLGIFTPTCFTSVT
        TL++C   G      +   +   ++ S++PV   +V   G  +C  + D +  H +  GGK   +  W+L+   + ++ AK  + + L +  P     + 
Subjt:  TLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMC-SKVDGSETHAL-FGGKGVEVNTWNLEQCTK-IWTAKAPKKNSLGIFTPTCFTSVT

Query:  FLSKDDHRKFAAGTSSHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPIIASC
        FL   + +K    T  HQVR+YD  S Q+RPV+   + E P+ A+    +GN++ VGN  G LA  D+R G+LLGC  G  +GS+R +  HP  P++ASC
Subjt:  FLSKDDHRKFAAGTSSHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPIIASC

Query:  GLDSYVRFWDIKT-RQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKRKGASKENKKSKKKSHGET
        GLD  +R   I+  R L   V+LK  LN ++      G D      +  Q+  ++    +E +      + A+K      ++   + + ++ KKK  G T
Subjt:  GLDSYVRFWDIKT-RQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKRKGASKENKKSKKKSHGET

Query:  ES
         S
Subjt:  ES

Q6BUJ2 Ribosome biogenesis protein NSA15.5e-0625.63Show/hide
Query:  IWTAKAPKKNSLGIFTPTCFTSVTFLSK--DDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRN-----
        I+TA+  K + L +  P   +S+ F  +   D  KF   T   QVR+YD +  KRP+  +   E PI  L        + V +    +A + +       
Subjt:  IWTAKAPKKNSLGIFTPTCFTSVTFLSK--DDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRN-----

Query:  ---------------GKLLGCF-LGKCSGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGV-VFDSHFVGEDITHSAV--------
                        KLLG F  G  +G+I  +    +  II++ GLD Y+R +DI +R++L+ V+L   ++ V + DS    E+   S +        
Subjt:  ---------------GKLLGCF-LGKCSGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGV-VFDSHFVGEDITHSAV--------

Query:  -ESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKRK
         +  ++ +E  +  S+EE    + ++ +K   E KKR+
Subjt:  -ESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKRK

Q6RFH5 WD repeat-containing protein 743.9e-2029.41Show/hide
Query:  LASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMC-SKVDGSETHAL-FGGKGVEVNTWNLEQCTK-IWTAKAPKKNSLGIFTPT
        LA    TL++C   G      +       +D S++P+   +V   G  +C  + D +  H +  GGK   +  W+L+   + ++ AK  + + L +  P 
Subjt:  LASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMC-SKVDGSETHAL-FGGKGVEVNTWNLEQCTK-IWTAKAPKKNSLGIFTPT

Query:  CFTSVTFLSKDDHRKFAAGTSSHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEF
            + FL     +K    T  HQVR+YD  S Q+RPV+   + E P+ A+     GN++ VGN  G LA  D+R G+LLGC  G  +GS+R +  HP  
Subjt:  CFTSVTFLSKDDHRKFAAGTSSHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEF

Query:  PIIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKRKGASKENKKSKK
        P++ASCGLD  +R   I+  R L   V+LK  LN ++      G D      +  Q+  ++    +E +      + A+K    G ++   + + ++ KK
Subjt:  PIIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKRKGASKENKKSKK

Query:  KSHGET
        K  G T
Subjt:  KSHGET

Q8VCG3 WD repeat-containing protein 748.7e-2030.33Show/hide
Query:  LASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMC-SKVDGSETHALFG-GKGVEVNTWNLEQCTK-IWTAKAPKKNSLGIFTPT
        LA    TL++C   G      +     + ++ S++P+   KV   G  +C  + D + TH +   GK   +  W+L+   + ++ AK  + + L +  P 
Subjt:  LASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMC-SKVDGSETHALFG-GKGVEVNTWNLEQCTK-IWTAKAPKKNSLGIFTPT

Query:  CFTSVTFLSKDDHRKFAAGTSSHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEF
              FL     +K    T  HQVR+YD +S Q+RPV+   + E P+ A+    +GN++ VGN  G LA  D R G+LLGC  G  +GS+R +  HP  
Subjt:  CFTSVTFLSKDDHRKFAAGTSSHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEF

Query:  PIIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKRKGASKENKKSKK
        P++ASCGLD  +R   I+  R L   V+LK  LN ++       ED      E  Q   E  +T   +E        A ++  +  + +GA +  KK K+
Subjt:  PIIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKRKGASKENKKSKK

Arabidopsis top hitse value%identityAlignment
AT1G29320.1 Transducin/WD40 repeat-like superfamily protein1.4e-12955.36Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAIS-DNTDSSP
        MPR    +  GCPP RALTFD LGL+KV +A+G++  IP VV  WGE + S+SVLAAS+ DR  +PLLAVARK+G +EV+NP NG+LH + S    D   
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAIS-DNTDSSP

Query:  PKDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWT
        P+D+ I  +HLF K       R CTLL+CTKKG+ S+RS+ F  +          KTWK C SG+++  KVDGSE  +LFGGK VE N W+LEQCTKIW+
Subjt:  PKDDAIIGMHLFEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGK
        AK P KN+LGIFTPT FTS TFLSKDDHRKF  GT SHQVRLYDIS Q+RPV+SFDFRET I ++AED DG+TI+VGNA+ DLASFDIR GKLLG FLGK
Subjt:  AKAPKKNSLGIFTPTCFTSVTFLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGED--ITHSAVESIQQD--IEIGQTVSEEEHTPQKRKKASKE--
        CSGSIRS+ RHP+  +IASCGLD Y+R +D+KTRQL+SAVFLKQHL G+VFDS F GE+  + ++  E+  ++    + Q   E E  P KRKK+ KE  
Subjt:  CSGSIRSIARHPEFPIIASCGLDSYVRFWDIKTRQLLSAVFLKQHLNGVVFDSHFVGED--ITHSAVESIQQD--IEIGQTVSEEEHTPQKRKKASKE--

Query:  ------DGEGK------------KRKGASKENKKSKKKSHGETESKQR
              +GE              K K + KE +  +K S GE + + R
Subjt:  ------DGEGK------------KRKGASKENKKSKKKSHGETESKQR

AT5G23430.1 Transducin/WD40 repeat-like superfamily protein6.2e-0524.41Show/hide
Query:  LSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPIIASCGL
        + +   R    G   H+V L+ I  +   ++S     + I ++  D     +  G A+G +  +D+   K++    G  S  I S+  HP     AS  L
Subjt:  LSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPIIASCGL

Query:  DSYVRFWDIKTRQLLSAVFLKQHLNGV
        D+ ++ WDI+ +  +     K H  GV
Subjt:  DSYVRFWDIKTRQLLSAVFLKQHLNGV

AT5G23430.2 Transducin/WD40 repeat-like superfamily protein6.2e-0524.41Show/hide
Query:  LSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPIIASCGL
        + +   R    G   H+V L+ I  +   ++S     + I ++  D     +  G A+G +  +D+   K++    G  S  I S+  HP     AS  L
Subjt:  LSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPIIASCGL

Query:  DSYVRFWDIKTRQLLSAVFLKQHLNGV
        D+ ++ WDI+ +  +     K H  GV
Subjt:  DSYVRFWDIKTRQLLSAVFLKQHLNGV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGCACGACAAAGGTTGATTGCCCTGGCTGTCCTCCGCTTCGTGCCCTAACGTTTGATGTGCTTGGTCTCGTCAAAGTCATCCAAGCTCAGGGGAAGGACGGTGA
GATTCCAAAGGTGGTCGAGAGGTGGGGCGAACCTGATTACTCCAAATCGGTGCTTGCAGCTTCTCTTGTCGATCGTAAATTCGACCCTTTACTAGCTGTTGCTCGAAAAA
ATGGACTGATTGAAGTTCTTAATCCATTGAATGGCAATCTTCATGTTGCAATTTCGGACAATACTGATAGTTCTCCACCGAAAGATGATGCTATTATTGGAATGCATTTA
TTTGAAAAAGATGAATTGGCGTTGGCGTCTAGGCGTTGCACCTTGCTTTCGTGTACAAAAAAAGGAAATGCGAGCATGAGGTCAATTGATTTTTCTAGCTCATCTTCAGA
AGATATCTCTACCAATCCTGTAAAAACGTGGAAAGTATGTAGTTCAGGTGATGTTATGTGCTCTAAAGTTGATGGAAGTGAAACCCATGCATTGTTTGGAGGGAAGGGTG
TTGAAGTTAATACATGGAATCTGGAACAGTGTACTAAGATTTGGACAGCAAAAGCGCCGAAGAAGAACAGTCTTGGTATTTTCACACCAACTTGCTTCACATCAGTGACA
TTTCTTAGTAAAGATGACCATCGTAAGTTTGCTGCTGGTACTAGCAGCCATCAGGTTCGATTGTATGACATCTCTGCTCAGAAGAGACCTGTTATCTCGTTTGATTTTCG
AGAGACTCCTATTAAAGCCTTGGCTGAAGATGTAGATGGTAACACAATATTTGTGGGGAATGCAACTGGTGATCTTGCATCCTTTGATATTCGCAATGGAAAGCTATTGG
GTTGCTTCTTGGGGAAATGTTCTGGCAGCATAAGGTCCATCGCCAGGCATCCAGAGTTCCCGATCATTGCATCATGTGGACTGGATAGTTATGTTCGCTTCTGGGATATA
AAGACGAGGCAACTTCTGTCTGCGGTATTCCTAAAGCAGCATCTTAATGGTGTTGTCTTCGATTCCCATTTTGTTGGAGAAGATATAACACACTCTGCAGTAGAGTCAAT
CCAACAGGACATAGAAATAGGCCAAACCGTTAGCGAGGAGGAACACACGCCTCAGAAAAGAAAAAAGGCATCGAAAGAGGACGGTGAAGGCAAAAAGAGGAAGGGTGCAA
GCAAAGAAAATAAAAAAAGCAAAAAGAAGTCGCATGGCGAAACCGAAAGTAAGCAGAGATACTGTGAGAAATTAGAGGGTATGATTACATTCGAAGTAGTTCAAGCACGT
TCGACTCGAGTATCCCGTGTCAAATTTTACTTTACTAGTAGTTAG
mRNA sequenceShow/hide mRNA sequence
TTTATCTGTGTTCCGCGCCATTCTTCTTCTTCACGCCGTCTCCTACTCTGTTTCGCCGCCGCCTCTCTACCGTTCACTGCTGCCGCTGCCGTCGTCGTCGTCTTTCTTGC
GTTCGTGCCCTGAAGTCTCCTGCCTCTTAGGGGTTTTGCTCTGATTTTCTACTTCCTTGTTGTTGCCGAATATGCCTCGCACGACAAAGGTTGATTGCCCTGGCTGTCCT
CCGCTTCGTGCCCTAACGTTTGATGTGCTTGGTCTCGTCAAAGTCATCCAAGCTCAGGGGAAGGACGGTGAGATTCCAAAGGTGGTCGAGAGGTGGGGCGAACCTGATTA
CTCCAAATCGGTGCTTGCAGCTTCTCTTGTCGATCGTAAATTCGACCCTTTACTAGCTGTTGCTCGAAAAAATGGACTGATTGAAGTTCTTAATCCATTGAATGGCAATC
TTCATGTTGCAATTTCGGACAATACTGATAGTTCTCCACCGAAAGATGATGCTATTATTGGAATGCATTTATTTGAAAAAGATGAATTGGCGTTGGCGTCTAGGCGTTGC
ACCTTGCTTTCGTGTACAAAAAAAGGAAATGCGAGCATGAGGTCAATTGATTTTTCTAGCTCATCTTCAGAAGATATCTCTACCAATCCTGTAAAAACGTGGAAAGTATG
TAGTTCAGGTGATGTTATGTGCTCTAAAGTTGATGGAAGTGAAACCCATGCATTGTTTGGAGGGAAGGGTGTTGAAGTTAATACATGGAATCTGGAACAGTGTACTAAGA
TTTGGACAGCAAAAGCGCCGAAGAAGAACAGTCTTGGTATTTTCACACCAACTTGCTTCACATCAGTGACATTTCTTAGTAAAGATGACCATCGTAAGTTTGCTGCTGGT
ACTAGCAGCCATCAGGTTCGATTGTATGACATCTCTGCTCAGAAGAGACCTGTTATCTCGTTTGATTTTCGAGAGACTCCTATTAAAGCCTTGGCTGAAGATGTAGATGG
TAACACAATATTTGTGGGGAATGCAACTGGTGATCTTGCATCCTTTGATATTCGCAATGGAAAGCTATTGGGTTGCTTCTTGGGGAAATGTTCTGGCAGCATAAGGTCCA
TCGCCAGGCATCCAGAGTTCCCGATCATTGCATCATGTGGACTGGATAGTTATGTTCGCTTCTGGGATATAAAGACGAGGCAACTTCTGTCTGCGGTATTCCTAAAGCAG
CATCTTAATGGTGTTGTCTTCGATTCCCATTTTGTTGGAGAAGATATAACACACTCTGCAGTAGAGTCAATCCAACAGGACATAGAAATAGGCCAAACCGTTAGCGAGGA
GGAACACACGCCTCAGAAAAGAAAAAAGGCATCGAAAGAGGACGGTGAAGGCAAAAAGAGGAAGGGTGCAAGCAAAGAAAATAAAAAAAGCAAAAAGAAGTCGCATGGCG
AAACCGAAAGTAAGCAGAGATACTGTGAGAAATTAGAGGGTATGATTACATTCGAAGTAGTTCAAGCACGTTCGACTCGAGTATCCCGTGTCAAATTTTACTTTACTAGT
AGTTAG
Protein sequenceShow/hide protein sequence
MPRTTKVDCPGCPPLRALTFDVLGLVKVIQAQGKDGEIPKVVERWGEPDYSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDSSPPKDDAIIGMHL
FEKDELALASRRCTLLSCTKKGNASMRSIDFSSSSSEDISTNPVKTWKVCSSGDVMCSKVDGSETHALFGGKGVEVNTWNLEQCTKIWTAKAPKKNSLGIFTPTCFTSVT
FLSKDDHRKFAAGTSSHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPIIASCGLDSYVRFWDI
KTRQLLSAVFLKQHLNGVVFDSHFVGEDITHSAVESIQQDIEIGQTVSEEEHTPQKRKKASKEDGEGKKRKGASKENKKSKKKSHGETESKQRYCEKLEGMITFEVVQAR
STRVSRVKFYFTSS