; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0008572 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0008572
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionWD repeat-containing protein 74
Genome locationchr12:23017139..23021228
RNA-Seq ExpressionIVF0008572
SyntenyIVF0008572
Gene Ontology termsGO:0042273 - ribosomal large subunit biogenesis (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0030687 - preribosome, large subunit precursor (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR036322 - WD40-repeat-containing domain superfamily
IPR037379 - WDR74/Nsa1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055318.1 WD repeat-containing protein 74 [Cucumis melo var. makuwa]1.10e-28198.73Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNG LHVAISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKNNLGIFTPTWFTSATFLSK DHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLK
        CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRK   K
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLK

KGN64392.1 hypothetical protein Csa_013665 [Cucumis sativus]1.43e-27992.24Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNGNLH+AISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSK+ELEVESRRCTLLSCTTKGNASMRSI FSSS S+DAST+LVKTWKVCGSGDV C+KVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER
        CSGSIRSIARHPELPVIASCGLDSYVRFWDI TRQLLSAVFLKQHLTGVVFDSHFV EDVTQTAVE IQQETEAAQTVSEEEH+PRKRK   K+  +  +
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER

Query:  GKVARPQTKKVKKSRRKSHGEAERK
         K  +   K+ KKSRRKSHGE ERK
Subjt:  GKVARPQTKKVKKSRRKSHGEAERK

XP_008439461.1 PREDICTED: WD repeat-containing protein 74 [Cucumis melo]1.15e-28399.24Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLK
        CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRK   K
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLK

XP_011652016.1 LOW QUALITY PROTEIN: WD repeat-containing protein 74 [Cucumis sativus]3.23e-27495.93Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNGNLH+AISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSK+ELEVESRRCTLLSCTTKGNASMRSI FSSS S+DAST+LVKTWKVCGSGDV C+KVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLK
        CSGSIRSIARHPELPVIASCGLDSYVRFWDI TRQLLSAVFLKQHLTGVVFDSHFV EDVTQTAVE IQQETEAAQTVSEEEH+PRKRK   K
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLK

XP_038894722.1 LOW QUALITY PROTEIN: WD repeat-containing protein 74 [Benincasa hispida]2.02e-26694.34Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTT+DC GCPPLRALTFDVLGLVKVIEARGKEGEIPKVV+RWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNG+LHV ISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSKDELEVESR CTLLSCTTKGNASMRS EFSSSSS+D STNLV++WKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKN+LGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRK
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQLLSAVFLKQHLTGVVFDSHFV EDVTQ AVE IQQETE AQT++EE HVPRKRK
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRK

TrEMBL top hitse value%identityAlignment
A0A0A0LUB9 WD_REPEATS_REGION domain-containing protein4.7e-22091.8Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNGNLH+AISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSK+ELEVESRRCTLLSCTTKGNASMRSI FSSS S+DAST+LVKTWKVCGSGDV C+KVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER
        CSGSIRSIARHPELPVIASCGLDSYVRFWDI TRQLLSAVFLKQHLTGVVFDSHFV EDVTQTAVE IQQETEAAQTVSEEEH+PRKRK   K+  +  +
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER

Query:  GKVARPQTKKVKKSRRKSHGEAERKRR
         K  +   K+ KKSRRKSHGE ERK++
Subjt:  GKVARPQTKKVKKSRRKSHGEAERKRR

A0A1S3AYE7 WD repeat-containing protein 747.7e-22394.48Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER
        CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRK   K+  +  +
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER

Query:  GKVARPQTKKVKKSRRK
         K ++   K+ KK +++
Subjt:  GKVARPQTKKVKKSRRK

A0A5A7UJY9 WD repeat-containing protein 742.5e-22194Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNG LHVAISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKNNLGIFTPTWFTSATFLSK DHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER
        CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRK   K+  +  +
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER

Query:  GKVARPQTKKVKKSRRK
         K ++   K+ KK +++
Subjt:  GKVARPQTKKVKKSRRK

A0A5D3BIZ1 WD repeat-containing protein 747.7e-22394.48Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER
        CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRK   K+  +  +
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER

Query:  GKVARPQTKKVKKSRRK
         K ++   K+ KK +++
Subjt:  GKVARPQTKKVKKSRRK

E5GBV4 WD-repeat protein7.7e-22394.48Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER
        CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRK   K+  +  +
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER

Query:  GKVARPQTKKVKKSRRK
         K ++   K+ KK +++
Subjt:  GKVARPQTKKVKKSRRK

SwissProt top hitse value%identityAlignment
A5DKC4 Ribosome biogenesis protein NSA11.4e-0825Show/hide
Query:  LEQCTKIWTAKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSL--AEDVDGNTIFVGN----ASGDLAS
        LE    ++ AK  K ++L +  P W T   F+S  +  K    T   Q+R+YD +  ++P   +     PI +L  A       I   N    A   L S
Subjt:  LEQCTKIWTAKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSL--AEDVDGNTIFVGN----ASGDLAS

Query:  FDIRNGK---------------LLGCFL-GKCSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQ
         D +  K               LLG +  G  +G+I  ++   +   +A+ GLD Y+R +D+ TR+++S V++   +  ++F     GE+  ++  E+ +
Subjt:  FDIRNGK---------------LLGCFL-GKCSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQ

Query:  QETEAAQTVSEEEHVPRKRK
        +E   A     E + P K++
Subjt:  QETEAAQTVSEEEHVPRKRK

Q54FW9 WD repeat-containing protein DDB_G02905553.8e-1723.73Show/hide
Query:  GEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPPPKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSS
        GE D+ +S+ A    +   D LL VA +NGL++V               T T+P     A + +   +K +    +   + +    K    +     + S
Subjt:  GEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPPPKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSS

Query:  SSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWTAKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDI
         S + +TNL        SG  M    + S     FGGK V + +W+LE+  K ++AK  K + L +  P       +++ D   K   G++  +++ YD+
Subjt:  SSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWTAKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDI

Query:  SAQ--KRPVISFDFRETPIKSLA-EDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFL
         ++  +   +   F + PI+S+   +   +  +  ++ G +  +D+R  + +G F    +GS++ IA HP LP++A+ GLD ++R ++++ R++L  +FL
Subjt:  SAQ--KRPVISFDFRETPIKSLA-EDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFL

Query:  KQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTV
        KQ L+ V+F       ++ Q   E+ +   E    ++ +++      +  KK++
Subjt:  KQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTV

Q58D06 WD repeat-containing protein 747.9e-2330.82Show/hide
Query:  TLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHAL-FGGKGVEVNMWNLEQCTK-IWTAKAPKKNNLGIFTPTWFTSATF
        TL++C   G      +   +   ++AS++ V   +V G G V   + D +  H +  GGK   + +W+L+   + ++ AK  + + L +  P W     F
Subjt:  TLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHAL-FGGKGVEVNMWNLEQCTK-IWTAKAPKKNNLGIFTPTWFTSATF

Query:  LSKDDHRKFAAGTNSHQVRLYD-ISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPELPVIASCG
        L   + +K    T  HQVR+YD  S Q+RPV+   + E P+ ++    +GN++ VGN  G LA  D+R G+LLGC  G  +GS+R +  HP  P++ASCG
Subjt:  LSKDDHRKFAAGTNSHQVRLYD-ISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPELPVIASCG

Query:  LDSYVRFWDI-NTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVE---LIQQETEAAQTVSEEEHVPRKRKSHLKKT---VKAERGKVARP
        LD  +R   I N R L   V+LK  L  ++       ED  Q   E   +  ++TE  +  +  E   +++    ++T   ++A R K  RP
Subjt:  LDSYVRFWDI-NTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVE---LIQQETEAAQTVSEEEHVPRKRKSHLKKT---VKAERGKVARP

Q6RFH5 WD repeat-containing protein 741.1e-2130.48Show/hide
Query:  TLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHAL-FGGKGVEVNMWNLEQCTK-IWTAKAPKKNNLGIFTPTWFTSATF
        TL++C   G      +       +D S++ +   +V G G V   + D +  H +  GGK   + +W+L+   + ++ AK  + + L +  P W     F
Subjt:  TLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHAL-FGGKGVEVNMWNLEQCTK-IWTAKAPKKNNLGIFTPTWFTSATF

Query:  LSKDDHRKFAAGTNSHQVRLYD-ISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPELPVIASCG
        L     +K    T  HQVR+YD  S Q+RPV+   + E P+ ++     GN++ VGN  G LA  D+R G+LLGC  G  +GS+R +  HP  P++ASCG
Subjt:  LSKDDHRKFAAGTNSHQVRLYD-ISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPELPVIASCG

Query:  LDSYVRFWDI-NTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVE---LIQQETEAAQTVSEEEHVPRKRKSHLKK---TVKAERGKVARP
        LD  +R   I N R L   V+LK  L  ++       ED  Q   E   +  ++TE  +  +  E   +++ S L++    ++  R K  RP
Subjt:  LDSYVRFWDI-NTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVE---LIQQETEAAQTVSEEEHVPRKRKSHLKK---TVKAERGKVARP

Q8VCG3 WD repeat-containing protein 742.3e-2231.06Show/hide
Query:  TLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFG-GKGVEVNMWNLEQCTK-IWTAKAPKKNNLGIFTPTWFTSATF
        TL++C   G      +     + ++AS++ +   KV G G V   + D + TH +   GK   + +W+L+   + ++ AK  + + L +  P W     F
Subjt:  TLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFG-GKGVEVNMWNLEQCTK-IWTAKAPKKNNLGIFTPTWFTSATF

Query:  LSKDDHRKFAAGTNSHQVRLYD-ISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPELPVIASCG
        L     +K    T  HQVR+YD +S Q+RPV+   + E P+ ++    +GN++ VGN  G LA  D R G+LLGC  G  +GS+R +  HP  P++ASCG
Subjt:  LSKDDHRKFAAGTNSHQVRLYD-ISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPELPVIASCG

Query:  LDSYVRFWDI-NTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQ---QETEAAQTVSEEEHVPRKRKSHLKKTVKAERGKVARPQTKK
        LD  +R   I N R L   V+LK  L  ++       ED  Q   E  Q   ++TE  +  +  E   +++   L +T    +G + R + KK
Subjt:  LDSYVRFWDI-NTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQ---QETEAAQTVSEEEHVPRKRKSHLKKTVKAERGKVARPQTKK

Arabidopsis top hitse value%identityAlignment
AT1G29320.1 Transducin/WD40 repeat-like superfamily protein3.2e-13660Show/hide
Query:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP
        MPR    +  GCPP RALTFD LGL+KV EARG+E  IP VV  WGE + S+SVLAAS+ DR  +PLLAVARK+G +EV+NP NG+LH + S   D    
Subjt:  MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPP

Query:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        P+D  I  +HLF K   +   R CTLL+CT KG+ S+RS++F  +          KTWK CGSG+++  KVDGSE  +LFGGK VE N+W+LEQCTKIW+
Subjt:  PKDEAIVGMHLFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK
        AK P KNNLGIFTPTWFTSATFLSKDDHRKF  GT SHQVRLYDIS Q+RPV+SFDFRET I S+AED DG+TI+VGNAS DLASFDIR GKLLG FLGK
Subjt:  AKAPKKNNLGIFTPTWFTSATFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER
        CSGSIRS+ RHP+  VIASCGLD Y+R +D+ TRQL+SAVFLKQHLTG+VFDS F GE+ T  A  + +  TE   T+ ++E              + E+
Subjt:  CSGSIRSIARHPELPVIASCGLDSYVRFWDINTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAER

Query:  GKVARPQTKKVKKSR
          V R ++KK K+SR
Subjt:  GKVARPQTKKVKKSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGTACGACGACGCTTGATTGCCCTGGCTGTCCTCCGCTTCGTGCCCTAACTTTCGATGTTCTTGGTCTCGTTAAAGTCATTGAAGCTCGGGGCAAGGAAGGAGA
GATCCCCAAAGTTGTTGAGAGATGGGGTGAACCTGATTTCTCCAAATCCGTGCTTGCAGCTTCTCTTGCTGATCGTAAATTCGACCCCTTATTAGCTGTTGCCCGAAAAA
ATGGCCTGATTGAAGTTCTTAATCCTTTGAATGGCAATCTTCATGTTGCTATTTCGGACAATACCGATACTTCCCCTCCACCAAAAGATGAAGCCATTGTTGGAATGCAT
TTATTTTCAAAAGATGAATTGGAGGTGGAATCTAGGAGGTGCACCTTGCTTTCATGTACAACAAAAGGAAATGCAAGCATGAGGTCGATTGAATTTTCTAGTTCATCGTC
AAGAGATGCCTCTACCAATCTTGTAAAAACGTGGAAAGTATGTGGTTCTGGTGATGTTATGTGTTCTAAAGTTGATGGAAGTGAAACCCATGCATTGTTTGGAGGGAAGG
GTGTTGAAGTTAATATGTGGAATCTAGAACAGTGTACGAAGATTTGGACAGCAAAAGCACCGAAGAAGAACAACCTTGGAATTTTCACACCAACTTGGTTCACTTCAGCA
ACATTCCTTAGTAAAGATGATCACCGTAAGTTTGCAGCGGGTACAAACAGCCATCAGGTTCGATTGTATGACATTTCTGCTCAAAAGAGACCTGTCATCTCATTTGATTT
TCGAGAAACTCCTATTAAATCCTTGGCCGAAGATGTTGATGGTAACACAATATTTGTGGGGAATGCGTCTGGTGATCTTGCATCTTTTGATATTCGCAATGGAAAGTTAT
TGGGTTGCTTCTTGGGGAAATGTTCTGGGAGCATAAGATCCATCGCCAGGCACCCGGAATTACCAGTCATAGCTTCATGTGGACTAGATAGTTACGTGCGCTTCTGGGAT
ATAAACACAAGGCAACTTCTGTCTGCGGTATTCCTAAAACAGCATCTCACCGGTGTCGTCTTTGATTCTCATTTTGTGGGAGAAGACGTAACACAAACTGCCGTAGAGTT
AATCCAACAGGAAACGGAGGCAGCTCAAACTGTCAGTGAGGAAGAACACGTGCCTCGGAAAAGAAAAAGTCATCTAAAGAAGACGGTGAAGGCAGAAAGAGGAAAGGTAG
CAAGACCACAAACAAAGAAAGTAAAAAAAAGCAGAAGGAAGTCACATGGCGAAGCTGAAAGAAAGCGGAGGTAG
mRNA sequenceShow/hide mRNA sequence
GTGTGGAGTCTCTCTCAATCTCCTAAGCTCAACTGTTCTTGCAAAGTATGCCTCGTACGACGACGCTTGATTGCCCTGGCTGTCCTCCGCTTCGTGCCCTAACTTTCGAT
GTTCTTGGTCTCGTTAAAGTCATTGAAGCTCGGGGCAAGGAAGGAGAGATCCCCAAAGTTGTTGAGAGATGGGGTGAACCTGATTTCTCCAAATCCGTGCTTGCAGCTTC
TCTTGCTGATCGTAAATTCGACCCCTTATTAGCTGTTGCCCGAAAAAATGGCCTGATTGAAGTTCTTAATCCTTTGAATGGCAATCTTCATGTTGCTATTTCGGACAATA
CCGATACTTCCCCTCCACCAAAAGATGAAGCCATTGTTGGAATGCATTTATTTTCAAAAGATGAATTGGAGGTGGAATCTAGGAGGTGCACCTTGCTTTCATGTACAACA
AAAGGAAATGCAAGCATGAGGTCGATTGAATTTTCTAGTTCATCGTCAAGAGATGCCTCTACCAATCTTGTAAAAACGTGGAAAGTATGTGGTTCTGGTGATGTTATGTG
TTCTAAAGTTGATGGAAGTGAAACCCATGCATTGTTTGGAGGGAAGGGTGTTGAAGTTAATATGTGGAATCTAGAACAGTGTACGAAGATTTGGACAGCAAAAGCACCGA
AGAAGAACAACCTTGGAATTTTCACACCAACTTGGTTCACTTCAGCAACATTCCTTAGTAAAGATGATCACCGTAAGTTTGCAGCGGGTACAAACAGCCATCAGGTTCGA
TTGTATGACATTTCTGCTCAAAAGAGACCTGTCATCTCATTTGATTTTCGAGAAACTCCTATTAAATCCTTGGCCGAAGATGTTGATGGTAACACAATATTTGTGGGGAA
TGCGTCTGGTGATCTTGCATCTTTTGATATTCGCAATGGAAAGTTATTGGGTTGCTTCTTGGGGAAATGTTCTGGGAGCATAAGATCCATCGCCAGGCACCCGGAATTAC
CAGTCATAGCTTCATGTGGACTAGATAGTTACGTGCGCTTCTGGGATATAAACACAAGGCAACTTCTGTCTGCGGTATTCCTAAAACAGCATCTCACCGGTGTCGTCTTT
GATTCTCATTTTGTGGGAGAAGACGTAACACAAACTGCCGTAGAGTTAATCCAACAGGAAACGGAGGCAGCTCAAACTGTCAGTGAGGAAGAACACGTGCCTCGGAAAAG
AAAAAGTCATCTAAAGAAGACGGTGAAGGCAGAAAGAGGAAAGGTAGCAAGACCACAAACAAAGAAAGTAAAAAAAAGCAGAAGGAAGTCACATGGCGAAGCTGAAAGAA
AGCGGAGGTAGTGGGAGTTACATTCAAATGAAGGAAAGATGGTTTAGAATATGTTATTCTTGGTTCTTGAGAAAGCAGAAAGGTATTCTTCTTCTGATCTATTCAGTTAG
CAAATAGTGTTGGAAATTGAGGGTACAAGAAAAAGTTATTTATCTCATCAATGCATTATTTTGGCCCAAATCATGTGTGTTAGATATGCAGGCTCCATAGTTTTTGGAGT
TTTGTCCTGCGAAGTAGGAATGATTCCAATTCGAAGCAGTTCAAGGAGGGCTCCCATGTAAACTCTTCTTTTTTATTTTTATTTTTAAAAAAATTTTATTTATACCATAT
TACTTGAGGTTAGAAGTATTTGAGTAGAATACGTATAACGTTTTAGATTTGCCCCTTCATTTTAGGCCTAATTATTGGAAAATTGGGGCACCTTATGCTTAACTTATGAG
CTCATCAATTGTCGAGGAGAATTCGGCAAGTCCTTGGAATATTATTGTTTTTTTTATATAGTAATATATAGCACCGG
Protein sequenceShow/hide protein sequence
MPRTTTLDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLADRKFDPLLAVARKNGLIEVLNPLNGNLHVAISDNTDTSPPPKDEAIVGMH
LFSKDELEVESRRCTLLSCTTKGNASMRSIEFSSSSSRDASTNLVKTWKVCGSGDVMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWTAKAPKKNNLGIFTPTWFTSA
TFLSKDDHRKFAAGTNSHQVRLYDISAQKRPVISFDFRETPIKSLAEDVDGNTIFVGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPELPVIASCGLDSYVRFWD
INTRQLLSAVFLKQHLTGVVFDSHFVGEDVTQTAVELIQQETEAAQTVSEEEHVPRKRKSHLKKTVKAERGKVARPQTKKVKKSRRKSHGEAERKRR