; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021782 (gene) of Snake gourd v1 genome

Gene IDTan0021782
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionWD repeat-containing protein 74
Genome locationLG05:81566722..81572335
RNA-Seq ExpressionTan0021782
SyntenyTan0021782
Gene Ontology termsGO:0042273 - ribosomal large subunit biogenesis (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0030687 - preribosome, large subunit precursor (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR036322 - WD40-repeat-containing domain superfamily
IPR037379 - WDR74/Nsa1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055318.1 WD repeat-containing protein 74 [Cucumis melo var. makuwa]2.9e-21689.76Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEG IPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNG +HVAISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHLF+KDELE+ +RRCTLLSCTTKGNASMRSI+FSSSSS+D STNL+K WKVCGSGD+MCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKN+LGIFTPTWFTSATFLSK DHRKFAAGTN+HQVRLYDISAQKRPVISFDFRETPIK+LAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQ LSAVFLKQHL  VVFDSHFV EDVT +AVE IQQETE AQTV EEEHVPRKRKK+SKEDGEG+ 
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKEVAW
        RK SKT  KE+KKK+KEV W
Subjt:  RKASKTAGKENKKKKKEVAW

KAG6584141.1 WD repeat-containing protein 74, partial [Cucurbita argyrosperma subsp. sororia]1.6e-21490.17Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTTKVDCPGCPPLRALTFDVLGL+KVIEARGKEG IPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGN+HVAISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHL +KDE EL +RRCTLLSCTTKGNASMR+I+FSSSSS+DTSTNL + WK+C SGD+MCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKNS GIFTPTWFTSATFLSKDDHRKFAAGTN+HQVRLYDISA+KRPVISFDFRETPIKALAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQ LSAVFLKQHL  VVFDSHFVEEDVTHSAVE IQQETE+ QTV EEEHVP+KRKKA KEDGEG  
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKE
        RK SKT  KENKK +++
Subjt:  RKASKTAGKENKKKKKE

XP_008439461.1 PREDICTED: WD repeat-containing protein 74 [Cucumis melo]8.9e-21890.24Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEG IPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNGN+HVAISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHLF+KDELE+ +RRCTLLSCTTKGNASMRSI+FSSSSS+D STNL+K WKVCGSGD+MCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKN+LGIFTPTWFTSATFLSKDDHRKFAAGTN+HQVRLYDISAQKRPVISFDFRETPIK+LAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQ LSAVFLKQHL  VVFDSHFV EDVT +AVE IQQETE AQTV EEEHVPRKRKK+SKEDGEG+ 
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKEVAW
        RK SKT  KE+KKK+KEV W
Subjt:  RKASKTAGKENKKKKKEVAW

XP_011652016.1 LOW QUALITY PROTEIN: WD repeat-containing protein 74 [Cucumis sativus]1.2e-21489.05Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEG IPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGN+H+AISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHLF+K+ELE+ +RRCTLLSCTTKGNASMRSI FSSS SKD ST+L+K WKVCGSGD+ C+KVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKN+LGIFTPTWFTSATFLSKDDHRKFAAGTN+HQVRLYDISAQKRPVISFDFRETPIK+LAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPE PVIASCGLDSYVRFWDIKTRQ LSAVFLKQHL  VVFDSHFVEEDVT +AVESIQQETE AQTV EEEH+PRKRKK+SKE GEG  
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKEVAW
        RK +KT  KE+ KK+KEV W
Subjt:  RKASKTAGKENKKKKKEVAW

XP_038894722.1 LOW QUALITY PROTEIN: WD repeat-containing protein 74 [Benincasa hispida]1.4e-21590Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTT VDC GCPPLRALTFDVLGLVKVIEARGKEG IPKVV+RWGEPDFSKSVLAASL+DRKFDPLLAVARKNGLIEVLNPLNG++HV ISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHLF+KDELE+ +R CTLLSCTTKGNASMRS +FSSSSSKDTSTNL++ WKVCGSGD+MCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTN+HQVRLYDISAQKRPVISFDFRETPIK+LAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQ LSAVFLKQHL  VVFDSHFVEEDVT  AVESIQQETE+AQT+  EEHVPRKRKKASKED EG  
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKEVAW
        RK SKTA KEN KK+ EV W
Subjt:  RKASKTAGKENKKKKKEVAW

TrEMBL top hitse value%identityAlignment
A0A1S3AYE7 WD repeat-containing protein 744.3e-21890.24Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEG IPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNGN+HVAISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHLF+KDELE+ +RRCTLLSCTTKGNASMRSI+FSSSSS+D STNL+K WKVCGSGD+MCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKN+LGIFTPTWFTSATFLSKDDHRKFAAGTN+HQVRLYDISAQKRPVISFDFRETPIK+LAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQ LSAVFLKQHL  VVFDSHFV EDVT +AVE IQQETE AQTV EEEHVPRKRKK+SKEDGEG+ 
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKEVAW
        RK SKT  KE+KKK+KEV W
Subjt:  RKASKTAGKENKKKKKEVAW

A0A5A7UJY9 WD repeat-containing protein 741.4e-21689.76Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEG IPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNG +HVAISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHLF+KDELE+ +RRCTLLSCTTKGNASMRSI+FSSSSS+D STNL+K WKVCGSGD+MCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKN+LGIFTPTWFTSATFLSK DHRKFAAGTN+HQVRLYDISAQKRPVISFDFRETPIK+LAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQ LSAVFLKQHL  VVFDSHFV EDVT +AVE IQQETE AQTV EEEHVPRKRKK+SKEDGEG+ 
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKEVAW
        RK SKT  KE+KKK+KEV W
Subjt:  RKASKTAGKENKKKKKEVAW

A0A5D3BIZ1 WD repeat-containing protein 744.3e-21890.24Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEG IPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNGN+HVAISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHLF+KDELE+ +RRCTLLSCTTKGNASMRSI+FSSSSS+D STNL+K WKVCGSGD+MCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKN+LGIFTPTWFTSATFLSKDDHRKFAAGTN+HQVRLYDISAQKRPVISFDFRETPIK+LAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQ LSAVFLKQHL  VVFDSHFV EDVT +AVE IQQETE AQTV EEEHVPRKRKK+SKEDGEG+ 
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKEVAW
        RK SKT  KE+KKK+KEV W
Subjt:  RKASKTAGKENKKKKKEVAW

A0A6J1EA63 WD repeat-containing protein 74-like1.1e-21389.93Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTTKVDCPGCPPLRAL FDVLGL+KVIEARGKEG IPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGN+HVAISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHL +KDE EL +RRCTLLSCTTKGNASMR+I+FSSSSS+DTSTNL + WK+C SGD+MCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTN+HQVRLYDISAQKRPVISFDF ETPIKALAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPEF VIASCGLDSYVRFWDIKTRQ LSAVFLKQHL  VVFDSHFVEEDVTHSAVE IQQETE+ QTV EEEHVP+KRKKA KEDGEG  
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKE
        RK SKT  KENKK +++
Subjt:  RKASKTAGKENKKKKKE

E5GBV4 WD-repeat protein4.3e-21890.24Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEG IPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARKNGLIEVLNPLNGN+HVAISDNTDTSPP
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        PKDE IVGMHLF+KDELE+ +RRCTLLSCTTKGNASMRSI+FSSSSS+D STNL+K WKVCGSGD+MCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AKAPKKN+LGIFTPTWFTSATFLSKDDHRKFAAGTN+HQVRLYDISAQKRPVISFDFRETPIK+LAEDVDGNTIFVGNA+GDLASFD+RNGKLLGCFLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQ LSAVFLKQHL  VVFDSHFV EDVT +AVE IQQETE AQTV EEEHVPRKRKK+SKEDGEG+ 
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKT

Query:  RKASKTAGKENKKKKKEVAW
        RK SKT  KE+KKK+KEV W
Subjt:  RKASKTAGKENKKKKKEVAW

SwissProt top hitse value%identityAlignment
Q54FW9 WD repeat-containing protein DDB_G02905553.5e-1527.63Show/hide
Query:  FGGKGVEVNMWNLEQCTKIWTAKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQ--KRPVISFDFRETPIKALA-EDVDGNTIFV
        FGGK V + +W+LE+  K ++AK  K + L +  P       +++ D   K   G ++ +++ YD+ ++  +   +   F + PI+++   +   +  + 
Subjt:  FGGKGVEVNMWNLEQCTKIWTAKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQ--KRPVISFDFRETPIKALA-EDVDGNTIFV

Query:  GNATGDLASFDVRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQ
         ++ G +  +DVR  + +G F    +GS++ IA HP  P++A+ GLD ++R +++  R+ L  +FLKQ L+ V+F     +E+ T+   E  Q+E EI +
Subjt:  GNATGDLASFDVRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQ

Query:  TVGEEEHVPRKRKKASKEDGEGKTRKAS
         +  EE+  R     + ++ EG  +K S
Subjt:  TVGEEEHVPRKRKKASKEDGEGKTRKAS

Q58D06 WD repeat-containing protein 746.1e-2029.97Show/hide
Query:  TLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHAL-FGGKGVEVNMWNLEQCTK-IWTAKAPKKNSLGIFTPTWFTSATF
        TL++C   G   + + D    +S D    L     VC        + D +  H +  GGK   + +W+L+   + ++ AK  + + L +  P W     F
Subjt:  TLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHAL-FGGKGVEVNMWNLEQCTK-IWTAKAPKKNSLGIFTPTWFTSATF

Query:  LSKDDHRKFAAGTNNHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCG
        L   + +K    T  HQVR+YD  S Q+RPV+   + E P+ A+    +GN++ VGN  G LA  D+R G+LLGC  G  +GS+R +  HP  P++ASCG
Subjt:  LSKDDHRKFAAGTNNHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCG

Query:  LDSYVRFWDIKTRQPLS-AVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQT--VGEEEHVPRKRKKASKEDGEGKTRKASKTAGKENKKKK
        LD  +R   I+  + L   V+LK  LN ++       ED      E  +  +E  +T  +        KRK    E  +G  +       +  KKK+
Subjt:  LDSYVRFWDIKTRQPLS-AVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQT--VGEEEHVPRKRKKASKEDGEGKTRKASKTAGKENKKKK

Q6CEC9 Ribosome biogenesis protein NSA17.0e-0820.98Show/hide
Query:  AKAPKKNSLGIFTPTWFTSATFLSKD-DHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLL-----
        A+  K N + +  P W +   F + D D  +    T + Q+R+Y+    KRP   F   + P++ LA  +D + +   +A      F+  + + +     
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKD-DHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLL-----

Query:  ------------GCFLGKCSGSIRSIAR--HPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVF-DSHFVEEDVTHSAVESIQQETEIAQTVG
                     C + K  GS+ ++      +  ++A+ GLD Y+R +D++T +  + +F+   ++S++F D+        H    + +++ E+   + 
Subjt:  ------------GCFLGKCSGSIRSIAR--HPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVF-DSHFVEEDVTHSAVESIQQETEIAQTVG

Query:  EEEHVPRKRKKASKEDGEGKTRKA
        + E    KR+ A  ++G+ K + A
Subjt:  EEEHVPRKRKKASKEDGEGKTRKA

Q6RFH5 WD repeat-containing protein 748.6e-2229.78Show/hide
Query:  HLFAKDELELGTRRC---------------TLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHAL-FGGKGVEVNMWNLE
        H   +D +  G R C               TL++C   G      +       KDTS++ +   +V G G +   + D +  H +  GGK   + +W+L+
Subjt:  HLFAKDELELGTRRC---------------TLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHAL-FGGKGVEVNMWNLE

Query:  QCTK-IWTAKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNG
           + ++ AK  + + L +  P W     FL     +K    T  HQVR+YD  S Q+RPV+   + E P+ A+     GN++ VGN  G LA  D+R G
Subjt:  QCTK-IWTAKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNG

Query:  KLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLS-AVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQT--VGEEEHVPRKR
        +LLGC  G  +GS+R +  HP  P++ASCGLD  +R   I+  + L   V+LK  LN ++       ED      E  +   E  +T  +        KR
Subjt:  KLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLS-AVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQT--VGEEEHVPRKR

Query:  KKASKEDGEG--KTRKASK
        K +  E  +G  +TR+  K
Subjt:  KKASKEDGEG--KTRKASK

Q8VCG3 WD repeat-containing protein 743.8e-2230.82Show/hide
Query:  TLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFG-GKGVEVNMWNLEQCTK-IWTAKAPKKNSLGIFTPTWFTSATF
        TL++C   G      +     + K+ S++ +   KV G G +   + D + TH +   GK   + +W+L+   + ++ AK  + + L +  P W     F
Subjt:  TLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFG-GKGVEVNMWNLEQCTK-IWTAKAPKKNSLGIFTPTWFTSATF

Query:  LSKDDHRKFAAGTNNHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCG
        L     +K    T  HQVR+YD +S Q+RPV+   + E P+ A+    +GN++ VGN  G LA  D R G+LLGC  G  +GS+R +  HP  P++ASCG
Subjt:  LSKDDHRKFAAGTNNHQVRLYD-ISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCG

Query:  LDSYVRFWDIKTRQPLS-AVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQT----VGEEEHVPRKRKKASKEDGEGKTRKASKTAG
        LD  +R   I+  + L   V+LK  LN ++       ED      E  Q  +E  +T       E    RK     +  G  + RK  K  G
Subjt:  LDSYVRFWDIKTRQPLS-AVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQT----VGEEEHVPRKRKKASKEDGEGKTRKASKTAG

Arabidopsis top hitse value%identityAlignment
AT1G29320.1 Transducin/WD40 repeat-like superfamily protein2.3e-13157.41Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP
        MPR    +  GCPP RALTFD LGL+KV EARG+E  IP VV  WGE + S+SVLAAS+ DR  +PLLAVARK+G +EV+NP NG++H + S   D    
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPP

Query:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT
        P+D  I  +HLF K   +   R CTLL+CT KG+ S+RS+ F  +    T     K WK CGSG+I+  KVDGSE  +LFGGK VE N+W+LEQCTKIW+
Subjt:  PKDEPIVGMHLFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWT

Query:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK
        AK P KN+LGIFTPTWFTSATFLSKDDHRKF  GT +HQVRLYDIS Q+RPV+SFDFRET I ++AED DG+TI+VGNA+ DLASFD+R GKLLG FLGK
Subjt:  AKAPKKNSLGIFTPTWFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEED--VTHSAVESIQQE--TEIAQTVGEEEHVPRKRKKASK---
        CSGSIRS+ RHP+  VIASCGLD Y+R +D+KTRQ +SAVFLKQHL  +VFDS F  E+  V ++  E+  +E  T + Q   E E  P KRKK+ K   
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSVVFDSHFVEED--VTHSAVESIQQE--TEIAQTVGEEEHVPRKRKKASK---

Query:  --------EDGEGKTRKASKTAGKENKKKKKE
                ED E    +  K   K  K KK++
Subjt:  --------EDGEGKTRKASKTAGKENKKKKKE

AT4G15900.1 pleiotropic regulatory locus 11.3e-0423.88Show/hide
Query:  WFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGKCSGSIRSIARHPEFP
        W  S  F     +  F  G+ +  ++++D++     +      E  ++ LA       +F       +  +D+   K++  + G  SG +  +A HP   
Subjt:  WFTSATFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGKCSGSIRSIARHPEFP

Query:  VIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSV
        V+ + G DS  R WDI+T+  + A  L  H N+V
Subjt:  VIASCGLDSYVRFWDIKTRQPLSAVFLKQHLNSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGCACGACGAAGGTTGATTGCCCCGGCTGTCCTCCACTTCGTGCCCTAACTTTCGATGTGCTTGGTCTCGTCAAAGTCATCGAAGCTCGGGGCAAGGAAGGCGC
GATTCCAAAGGTGGTTGAGAGGTGGGGCGAACCTGATTTCTCCAAATCAGTGCTTGCCGCTTCTCTTGTTGATCGTAAATTCGACCCTTTATTAGCTGTTGCTCGAAAAA
ATGGTCTGATTGAAGTTCTTAATCCTTTGAATGGAAATGTTCACGTTGCAATTTCGGACAATACTGATACTTCTCCTCCACCAAAAGATGAACCTATTGTTGGAATGCAT
TTATTTGCAAAAGATGAATTGGAGTTGGGAACTAGGCGTTGCACCTTGCTTTCATGTACAACGAAAGGAAATGCAAGCATGAGGTCGATTGATTTTTCTAGTTCATCTTC
AAAAGATACCTCTACCAATCTTATGAAAATGTGGAAAGTATGTGGTTCAGGTGATATTATGTGTTCTAAAGTTGATGGAAGTGAAACACACGCATTGTTTGGAGGGAAGG
GCGTTGAAGTTAATATGTGGAATCTAGAACAGTGTACTAAGATTTGGACAGCCAAAGCACCAAAGAAGAACAGCCTTGGTATTTTTACACCAACTTGGTTCACATCAGCG
ACATTTCTTAGTAAAGATGACCACCGTAAGTTTGCAGCTGGTACCAACAACCATCAGGTTCGACTGTATGACATTTCTGCTCAGAAGAGACCTGTCATCTCATTTGATTT
CCGAGAGACTCCCATTAAAGCCTTGGCAGAAGATGTAGATGGTAACACAATATTTGTGGGGAATGCAACTGGTGATCTTGCATCCTTTGATGTTCGCAATGGAAAGCTAT
TGGGTTGCTTCTTGGGGAAATGTTCTGGCAGCATAAGATCCATCGCCAGGCATCCAGAGTTCCCGGTCATAGCATCGTGTGGACTGGATAGTTATGTGCGCTTCTGGGAT
ATAAAGACAAGGCAACCTCTGTCTGCGGTATTCCTAAAGCAGCATCTTAATAGTGTTGTCTTTGATTCCCATTTTGTTGAAGAAGATGTAACACACTCCGCAGTAGAGTC
AATCCAACAGGAAACAGAGATAGCCCAAACTGTCGGCGAGGAGGAACACGTGCCTCGGAAAAGAAAAAAGGCATCTAAAGAAGACGGTGAAGGCAAAACGAGGAAAGCTA
GCAAGACTGCAGGCAAAGAAAATAAAAAGAAGAAAAAAGAAGTTGCATGGTGA
mRNA sequenceShow/hide mRNA sequence
GAAATTCGTTTCCTTCAACCGAAGCTCACCCCCAAATTCCTTCGGTAGGGTTTATCAGTCCAACGAAGTTCTCTCTGTTTCTCAATCGTTCGTGCTTAACTGGTGTGCAG
AATATGCCTCGCACGACGAAGGTTGATTGCCCCGGCTGTCCTCCACTTCGTGCCCTAACTTTCGATGTGCTTGGTCTCGTCAAAGTCATCGAAGCTCGGGGCAAGGAAGG
CGCGATTCCAAAGGTGGTTGAGAGGTGGGGCGAACCTGATTTCTCCAAATCAGTGCTTGCCGCTTCTCTTGTTGATCGTAAATTCGACCCTTTATTAGCTGTTGCTCGAA
AAAATGGTCTGATTGAAGTTCTTAATCCTTTGAATGGAAATGTTCACGTTGCAATTTCGGACAATACTGATACTTCTCCTCCACCAAAAGATGAACCTATTGTTGGAATG
CATTTATTTGCAAAAGATGAATTGGAGTTGGGAACTAGGCGTTGCACCTTGCTTTCATGTACAACGAAAGGAAATGCAAGCATGAGGTCGATTGATTTTTCTAGTTCATC
TTCAAAAGATACCTCTACCAATCTTATGAAAATGTGGAAAGTATGTGGTTCAGGTGATATTATGTGTTCTAAAGTTGATGGAAGTGAAACACACGCATTGTTTGGAGGGA
AGGGCGTTGAAGTTAATATGTGGAATCTAGAACAGTGTACTAAGATTTGGACAGCCAAAGCACCAAAGAAGAACAGCCTTGGTATTTTTACACCAACTTGGTTCACATCA
GCGACATTTCTTAGTAAAGATGACCACCGTAAGTTTGCAGCTGGTACCAACAACCATCAGGTTCGACTGTATGACATTTCTGCTCAGAAGAGACCTGTCATCTCATTTGA
TTTCCGAGAGACTCCCATTAAAGCCTTGGCAGAAGATGTAGATGGTAACACAATATTTGTGGGGAATGCAACTGGTGATCTTGCATCCTTTGATGTTCGCAATGGAAAGC
TATTGGGTTGCTTCTTGGGGAAATGTTCTGGCAGCATAAGATCCATCGCCAGGCATCCAGAGTTCCCGGTCATAGCATCGTGTGGACTGGATAGTTATGTGCGCTTCTGG
GATATAAAGACAAGGCAACCTCTGTCTGCGGTATTCCTAAAGCAGCATCTTAATAGTGTTGTCTTTGATTCCCATTTTGTTGAAGAAGATGTAACACACTCCGCAGTAGA
GTCAATCCAACAGGAAACAGAGATAGCCCAAACTGTCGGCGAGGAGGAACACGTGCCTCGGAAAAGAAAAAAGGCATCTAAAGAAGACGGTGAAGGCAAAACGAGGAAAG
CTAGCAAGACTGCAGGCAAAGAAAATAAAAAGAAGAAAAAAGAAGTTGCATGGTGAAACTGAAAGTAAGCAGAGGGTAGGAGTTAAAATCAAAGAAAGGCAGGAAGTGTA
AAATATGATGTTCTTGGTTCTTGAGTAAGCAGAAAGGTTGGTTCTTGTTCTGATCTATTCAGTCTGCAGATGGTGTGGGAAATTGGAGGGTAAAAGATAAAAGTTATCTA
TCTCATCAATGCATTTAATTTTGGCTCGAATCATGATTTTTACTCGAGAGGCTCGGTAAAAACCTGAATGCTGGAAATGCAGGCTCCTTTGTTTTTGGATTTTTGTCCTT
CGAAGTAGGAATGATTTCATTGGAAGCAGTTCAAGCACGTACTACTTGAGTCTACCATGTCAAATTTTATTTTATTTGTCATATTACTAGTGGTTAGGATTATTTTAGTA
GGATTAGGATACATCAATTTTGAGCCCCTTCATTTTTGGACCTTGTTATTAGAAAGTTGGGGCAGCTTAACTTATGAGTTCATCAATTTCCGAGAACAATTCAGTGAATC
CTTGAAATATTAGTTCTTTTACAAAGTTAAATATAACCCGATTTGTATTTGTGAGTTCAA
Protein sequenceShow/hide protein sequence
MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGAIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKNGLIEVLNPLNGNVHVAISDNTDTSPPPKDEPIVGMH
LFAKDELELGTRRCTLLSCTTKGNASMRSIDFSSSSSKDTSTNLMKMWKVCGSGDIMCSKVDGSETHALFGGKGVEVNMWNLEQCTKIWTAKAPKKNSLGIFTPTWFTSA
TFLSKDDHRKFAAGTNNHQVRLYDISAQKRPVISFDFRETPIKALAEDVDGNTIFVGNATGDLASFDVRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWD
IKTRQPLSAVFLKQHLNSVVFDSHFVEEDVTHSAVESIQQETEIAQTVGEEEHVPRKRKKASKEDGEGKTRKASKTAGKENKKKKKEVAW