; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019944 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019944
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionWD repeat-containing protein 74
Genome locationscaffold22:700504..705961
RNA-Seq ExpressionMS019944
SyntenyMS019944
Gene Ontology termsGO:0042273 - ribosomal large subunit biogenesis (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0030687 - preribosome, large subunit precursor (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR036322 - WD40-repeat-containing domain superfamily
IPR037379 - WDR74/Nsa1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6584141.1 WD repeat-containing protein 74, partial [Cucurbita argyrosperma subsp. sororia]2.4e-20285.98Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTTKVDCPGCPPLRALTFDVLGL+KVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARK+GLIEVLNPLNGN H AISDNTD SP 
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
         KD+AI+G+HL +K+E EL SRRCTLLSCTTKGNASMR+I+FSSSSS+ TSTNL +TW +C SGDV CSKVDGSET ALFGGKGVEVNMWNLEQCTKIW 
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AK+PKKNS GIFTPTWFTS TFLSKDDHRKFA+GTN+H+VRLYDISA++RPVISFDFRETPIKALA DVDGNTIF+GNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG
        CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLT VVFDSHFV E DVT+SAVE IQQET+  + V EEEHVPQKRKKA K DGE G
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG

Query:  KRKGSKTADKENKKKKKKLRGETENKQR
        KRKGSKT DKENKK ++K   E E+KQR
Subjt:  KRKGSKTADKENKKKKKKLRGETENKQR

KGN64392.1 hypothetical protein Csa_013665 [Cucumis sativus]3.9e-20585.98Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARK+GLIEVLNPLNGN H AISDNTD SP 
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
         KD+AI+G+HLF+K ELE+ESRRCTLLSCTTKGNASMRSI FSSS SK  ST+L+KTW VCGSGDVTC+KVDGSET ALFGGKGVEVNMWNLEQCTKIW 
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AK+PKKN+LGIFTPTWFTS TFLSKDDHRKFA+GTN+H+VRLYDISAQ+RPVISFDFRETPIK+LA DVDGNTIF+GNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKD-GESG
        CSGSIRSIARHPE PVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLT VVFDSHFV E DVT +AVESIQQET+AA+ V EEEH+P+KRKK+SK+ GE G
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKD-GESG

Query:  KRKGSKTADKENKKKKKKLRGETENKQR
        KRKG+KT DKE+KK ++K  GETE KQ+
Subjt:  KRKGSKTADKENKKKKKKLRGETENKQR

XP_008439461.1 PREDICTED: WD repeat-containing protein 74 [Cucumis melo]3.7e-20386.63Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARK+GLIEVLNPLNGN H AISDNTD SP 
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
         KD+AI+G+HLF+K+ELE+ESRRCTLLSCTTKGNASMRSI+FSSSSS+  STNL+KTW VCGSGDV CSKVDGSET ALFGGKGVEVNMWNLEQCTKIW 
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AK+PKKN+LGIFTPTWFTS TFLSKDDHRKFA+GTN+H+VRLYDISAQ+RPVISFDFRETPIK+LA DVDGNTIF+GNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQLLSAVFLKQHLT VVFDSHFVGE DVT +AVE IQQET+AA+ V EEEHVP+KRKK+SK DGE  
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG

Query:  KRKGSKTADKENKKKKKKL
        KRKGSKT +KE+KKK+K++
Subjt:  KRKGSKTADKENKKKKKKL

XP_011652016.1 LOW QUALITY PROTEIN: WD repeat-containing protein 74 [Cucumis sativus]1.8e-20286.4Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARK+GLIEVLNPLNGN H AISDNTD SP 
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
         KD+AI+G+HLF+K ELE+ESRRCTLLSCTTKGNASMRSI FSSS SK  ST+L+KTW VCGSGDVTC+KVDGSET ALFGGKGVEVNMWNLEQCTKIW 
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AK+PKKN+LGIFTPTWFTS TFLSKDDHRKFA+GTN+H+VRLYDISAQ+RPVISFDFRETPIK+LA DVDGNTIF+GNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKD-GESG
        CSGSIRSIARHPE PVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLT VVFDSHFV E DVT +AVESIQQET+AA+ V EEEH+P+KRKK+SK+ GE G
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKD-GESG

Query:  KRKGSKTADKENKKKKKKL
        KRKG+KT DKE+ KK+K++
Subjt:  KRKGSKTADKENKKKKKKL

XP_022137375.1 WD repeat-containing protein 74 [Momordica charantia]7.6e-23398.13Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
        SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSET ALFGGKGVEVNMWNLEQCTKIWN
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIF+GNASGDLASFDIRNG+LLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKDGESGK
        CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGE DVTNSAVESIQQETDAAEIV EEE+VPQKRKKASKDGESG 
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKDGESGK

Query:  RKGSKTADKENKKKKKKLRGETENKQR
        RKGSKTADKENKKKKKKL GETENKQR
Subjt:  RKGSKTADKENKKKKKKLRGETENKQR

TrEMBL top hitse value%identityAlignment
A0A0A0LUB9 WD_REPEATS_REGION domain-containing protein1.9e-20585.98Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARK+GLIEVLNPLNGN H AISDNTD SP 
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
         KD+AI+G+HLF+K ELE+ESRRCTLLSCTTKGNASMRSI FSSS SK  ST+L+KTW VCGSGDVTC+KVDGSET ALFGGKGVEVNMWNLEQCTKIW 
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AK+PKKN+LGIFTPTWFTS TFLSKDDHRKFA+GTN+H+VRLYDISAQ+RPVISFDFRETPIK+LA DVDGNTIF+GNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKD-GESG
        CSGSIRSIARHPE PVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLT VVFDSHFV E DVT +AVESIQQET+AA+ V EEEH+P+KRKK+SK+ GE G
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKD-GESG

Query:  KRKGSKTADKENKKKKKKLRGETENKQR
        KRKG+KT DKE+KK ++K  GETE KQ+
Subjt:  KRKGSKTADKENKKKKKKLRGETENKQR

A0A1S3AYE7 WD repeat-containing protein 741.8e-20386.63Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARK+GLIEVLNPLNGN H AISDNTD SP 
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
         KD+AI+G+HLF+K+ELE+ESRRCTLLSCTTKGNASMRSI+FSSSSS+  STNL+KTW VCGSGDV CSKVDGSET ALFGGKGVEVNMWNLEQCTKIW 
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AK+PKKN+LGIFTPTWFTS TFLSKDDHRKFA+GTN+H+VRLYDISAQ+RPVISFDFRETPIK+LA DVDGNTIF+GNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQLLSAVFLKQHLT VVFDSHFVGE DVT +AVE IQQET+AA+ V EEEHVP+KRKK+SK DGE  
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG

Query:  KRKGSKTADKENKKKKKKL
        KRKGSKT +KE+KKK+K++
Subjt:  KRKGSKTADKENKKKKKKL

A0A5D3BIZ1 WD repeat-containing protein 741.8e-20386.63Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARK+GLIEVLNPLNGN H AISDNTD SP 
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
         KD+AI+G+HLF+K+ELE+ESRRCTLLSCTTKGNASMRSI+FSSSSS+  STNL+KTW VCGSGDV CSKVDGSET ALFGGKGVEVNMWNLEQCTKIW 
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AK+PKKN+LGIFTPTWFTS TFLSKDDHRKFA+GTN+H+VRLYDISAQ+RPVISFDFRETPIK+LA DVDGNTIF+GNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQLLSAVFLKQHLT VVFDSHFVGE DVT +AVE IQQET+AA+ V EEEHVP+KRKK+SK DGE  
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG

Query:  KRKGSKTADKENKKKKKKL
        KRKGSKT +KE+KKK+K++
Subjt:  KRKGSKTADKENKKKKKKL

A0A6J1C6G8 WD repeat-containing protein 743.7e-23398.13Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
        SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSET ALFGGKGVEVNMWNLEQCTKIWN
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIF+GNASGDLASFDIRNG+LLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKDGESGK
        CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGE DVTNSAVESIQQETDAAEIV EEE+VPQKRKKASKDGESG 
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKDGESGK

Query:  RKGSKTADKENKKKKKKLRGETENKQR
        RKGSKTADKENKKKKKKL GETENKQR
Subjt:  RKGSKTADKENKKKKKKLRGETENKQR

E5GBV4 WD-repeat protein1.8e-20386.63Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPRTT +DCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASL DRKFDPLLAVARK+GLIEVLNPLNGN H AISDNTD SP 
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
         KD+AI+G+HLF+K+ELE+ESRRCTLLSCTTKGNASMRSI+FSSSSS+  STNL+KTW VCGSGDV CSKVDGSET ALFGGKGVEVNMWNLEQCTKIW 
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AK+PKKN+LGIFTPTWFTS TFLSKDDHRKFA+GTN+H+VRLYDISAQ+RPVISFDFRETPIK+LA DVDGNTIF+GNASGDLASFDIRNGKLLGCFLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG
        CSGSIRSIARHPE PVIASCGLDSYVRFWDI TRQLLSAVFLKQHLT VVFDSHFVGE DVT +AVE IQQET+AA+ V EEEHVP+KRKK+SK DGE  
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASK-DGESG

Query:  KRKGSKTADKENKKKKKKL
        KRKGSKT +KE+KKK+K++
Subjt:  KRKGSKTADKENKKKKKKL

SwissProt top hitse value%identityAlignment
O94698 Ribosome biogenesis protein nsa11.7e-0925Show/hide
Query:  LRALTFDVLGLVKVIEAR------GKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPLSKDDAIIG
        ++ L  D +G +K IE +        E E P V++++GE D  K VL       K +  + VARK+G IE  N     P  +     D+S L  + A I 
Subjt:  LRALTFDVLGLVKVIEAR------GKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPLSKDDAIIG

Query:  IHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWNAKSPKKNS
           ++   L L      LL    + ++ +R +      S V     +      G  +        + TC        E+ +W  E   K++  K+ K +S
Subjt:  IHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWNAKSPKKNS

Query:  LGIFTPTWFTSTTFL----------SKDDHR---KFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLG
        L +    W T   F           S+DD      FA+ T+  ++R YD    RRPV +FD   +P+  +        ++  +    ++ FD    K++G
Subjt:  LGIFTPTWFTSTTFL----------SKDDHR---KFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLG

Query:  CFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKD
         F G   G+  SI  H    V+A  GLD  VR +D   + L +A ++K   TS++     + E D               AEI+ +EE +    + A ++
Subjt:  CFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKD

Query:  GESGKRKGSKTADKENKKKKKKLR
         E   R   +  D E+KK  K+++
Subjt:  GESGKRKGSKTADKENKKKKKKLR

Q54FW9 WD repeat-containing protein DDB_G02905554.1e-1125Show/hide
Query:  FGGKGVEVNMWNLEQCTKIWNAKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQ--RRPVISFDFRETPIKALA-ADVDGNTIFL
        FGGK V + +W+LE+  K ++AK  K + L +  P       +++ D   K   G ++  ++ YD+ ++  R   +   F + PI+++   +   +  + 
Subjt:  FGGKGVEVNMWNLEQCTKIWNAKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQ--RRPVISFDFRETPIKALA-ADVDGNTIFL

Query:  GNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAA
         ++ G +  +D+R  + +G F    +GS++ IA HP  P++A+ GLD ++R +++  R++L  +FLKQ L+ V+F                   +E    
Subjt:  GNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAA

Query:  EIVVEEEHV----PQKRKKASKDGESGKRKGSKTADKENKKKKKKLRGETEN
        EI  EEE +     + + + + D      +G       N KKK      T+N
Subjt:  EIVVEEEHV----PQKRKKASKDGESGKRKGSKTADKENKKKKKKLRGETEN

Q58D06 WD repeat-containing protein 744.0e-1930.52Show/hide
Query:  GGKGVEVNMWNLEQCTK-IWNAKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYD-ISAQRRPVISFDFRETPIKALAADVDGNTIFLGN
        GGK   + +W+L+   + ++ AK+ + + L +  P W     FL   + +K  + T  H+VR+YD  S QRRPV+   + E P+ A+    +GN++ +GN
Subjt:  GGKGVEVNMWNLEQCTK-IWNAKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYD-ISAQRRPVISFDFRETPIKALAADVDGNTIFLGN

Query:  ASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLTSVVFD--SHFVGEADVTNSAVESIQQETDA
          G LA  D+R G+LLGC  G  +GS+R +  HP  P++ASCGLD  +R   I+  R L   V+LK  L  ++     ++  E        +   ++T+ 
Subjt:  ASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLTSVVFD--SHFVGEADVTNSAVESIQQETDA

Query:  AEIVVEEEHVPQKRKKASKDGESGKRKGSKTADKENKKKKKKLRGETEN
         E+    E    KRK    +   G  +         +++KKK  G T +
Subjt:  AEIVVEEEHVPQKRKKASKDGESGKRKGSKTADKENKKKKKKLRGETEN

Q6RFH5 WD repeat-containing protein 741.2e-1830.77Show/hide
Query:  GGKGVEVNMWNLEQCTK-IWNAKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYD-ISAQRRPVISFDFRETPIKALAADVDGNTIFLGN
        GGK   + +W+L+   + ++ AK+ + + L +  P W     FL     +K  + T  H+VR+YD  S QRRPV+   + E P+ A+     GN++ +GN
Subjt:  GGKGVEVNMWNLEQCTK-IWNAKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYD-ISAQRRPVISFDFRETPIKALAADVDGNTIFLGN

Query:  ASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLTSVVFD--SHFVGEADVTNSAVESIQQETDA
          G LA  D+R G+LLGC  G  +GS+R +  HP  P++ASCGLD  +R   I+  R L   V+LK  L  ++     ++  E        +   ++T+ 
Subjt:  ASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLTSVVFD--SHFVGEADVTNSAVESIQQETDA

Query:  AEIVVEEEHVPQKRKKASKDGESGKRKGSKTADKENKKKKKKLRGET
         E+    E    KRK +  +   G          + +++KKK  G T
Subjt:  AEIVVEEEHVPQKRKKASKDGESGKRKGSKTADKENKKKKKKLRGET

Q8VCG3 WD repeat-containing protein 742.8e-2031.71Show/hide
Query:  GKGVEVNMWNLEQCTK-IWNAKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYD-ISAQRRPVISFDFRETPIKALAADVDGNTIFLGNA
        GK   + +W+L+   + ++ AK+ + + L +  P W   T FL     +K  + T  H+VR+YD +S QRRPV+   + E P+ A+    +GN++ +GN 
Subjt:  GKGVEVNMWNLEQCTK-IWNAKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYD-ISAQRRPVISFDFRETPIKALAADVDGNTIFLGNA

Query:  SGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLTSVVFD--SHFVGEADVTNSAVESIQQETDAA
         G LA  D R G+LLGC  G  +GS+R +  HP  P++ASCGLD  +R   I+  R L   V+LK  L  ++     ++  E        +   ++T+  
Subjt:  SGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWDIKT-RQLLSAVFLKQHLTSVVFD--SHFVGEADVTNSAVESIQQETDAA

Query:  EIVVEEEHVPQKRKKASKDGESGKRKGSKTADKENKKKKKKLRGET
        E+    E    KRK    D   G  +         ++KKKK  G T
Subjt:  EIVVEEEHVPQKRKKASKDGESGKRKGSKTADKENKKKKKKLRGET

Arabidopsis top hitse value%identityAlignment
AT1G29320.1 Transducin/WD40 repeat-like superfamily protein1.9e-12857.44Show/hide
Query:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL
        MPR    +  GCPP RALTFD LGL+KV EARG+E  IP VV  WGE + S+SVLAAS+ DR  +PLLAVARKDG +EV+NP NG+ H + S   D    
Subjt:  MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPL

Query:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN
         +D+ I  +HLF K   +   R CTLL+CT KG+ S+RS+ F  +    T     KTW  CGSG++   KVDGSE  +LFGGK VE N+W+LEQCTKIW+
Subjt:  SKDDAIIGIHLFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWN

Query:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK
        AK P KN+LGIFTPTWFTS TFLSKDDHRKF +GT +H+VRLYDIS QRRPV+SFDFRET I ++A D DG+TI++GNAS DLASFDIR GKLLG FLGK
Subjt:  AKSPKKNSLGIFTPTWFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGK

Query:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVG-EADVTNSAVESIQQE--TDAAEIVVEEEHVPQKRKKASKDGE
        CSGSIRS+ RHP+  VIASCGLD Y+R +D+KTRQL+SAVFLKQHLT +VFDS F G E  V N+  E+  +E  T   +   E E  P KRKK+ K+  
Subjt:  CSGSIRSIARHPEFPVIASCGLDSYVRFWDIKTRQLLSAVFLKQHLTSVVFDSHFVG-EADVTNSAVESIQQE--TDAAEIVVEEEHVPQKRKKASKDGE

Query:  SGKRKGSKTADKENKKKKKKLRGETENKQR
        S +       D+EN+ + +K   +T+  ++
Subjt:  SGKRKGSKTADKENKKKKKKLRGETENKQR

AT4G15900.1 pleiotropic regulatory locus 13.8e-0422.4Show/hide
Query:  WFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFP
        W  S  F     +  F +G+ +  ++++D++     +      E  ++ LA       +F       +  +D+   K++  + G  SG +  +A HP   
Subjt:  WFTSTTFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFP

Query:  VIASCGLDSYVRFWDIKTRQLLSAV
        V+ + G DS  R WDI+T+  + A+
Subjt:  VIASCGLDSYVRFWDIKTRQLLSAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGTACAACGAAGGTTGATTGCCCTGGTTGTCCTCCGCTTCGTGCCCTAACGTTCGATGTGCTTGGTCTCGTTAAAGTGATCGAAGCTCGGGGGAAGGAAGGTGA
GATTCCAAAGGTGGTTGAGAGGTGGGGCGAACCCGATTTCTCCAAATCAGTGCTCGCAGCTTCTCTTGTTGATCGTAAATTCGATCCTTTACTAGCTGTTGCACGGAAAG
ATGGTCTGATTGAAGTTCTTAATCCTTTGAATGGGAATCCTCACGCTGCAATTTCTGACAATACGGATGCCTCTCCTCTATCAAAAGATGATGCTATTATTGGAATACAT
TTATTTGCTAAAAATGAGTTGGAGTTGGAATCCAGGCGTTGCACCTTGCTTTCATGTACAACAAAAGGAAATGCAAGCATGAGGTCGATTGATTTTTCTAGTTCATCTTC
AAAAGTTACCTCTACCAATCTTTTAAAAACATGGAACGTATGTGGTTCGGGTGATGTTACGTGTTCTAAAGTTGATGGAAGTGAAACCTGTGCATTATTTGGAGGGAAGG
GTGTTGAAGTTAATATGTGGAATCTAGAACAGTGTACTAAGATTTGGAATGCAAAATCACCGAAAAAGAACAGCCTTGGTATTTTTACCCCAACATGGTTCACATCAACG
ACATTTCTTAGTAAGGATGACCACCGTAAGTTTGCATCTGGTACCAACAACCACGAGGTTCGACTATATGACATCTCTGCTCAGAGGAGACCTGTTATCTCATTTGATTT
TCGGGAGACTCCTATTAAAGCTTTGGCAGCAGATGTAGATGGTAATACAATATTTTTGGGGAATGCATCTGGTGATCTCGCATCTTTCGATATTCGCAATGGTAAGCTAT
TGGGTTGCTTCTTGGGGAAATGTTCTGGCAGCATAAGATCCATAGCAAGGCACCCAGAGTTCCCGGTCATAGCATCATGTGGATTAGATAGCTATGTGCGCTTCTGGGAT
ATTAAGACAAGGCAACTTCTGTCTGCGGTATTCCTAAAGCAGCATCTTACCAGTGTTGTCTTTGATTCCCATTTTGTTGGAGAAGCAGATGTAACAAACTCTGCAGTAGA
GTCAATCCAACAGGAAACAGACGCAGCCGAAATTGTCGTTGAGGAGGAACATGTGCCTCAGAAAAGAAAAAAGGCATCTAAAGATGGAGAAAGCGGAAAGAGAAAGGGTA
GCAAGACTGCAGACAAAGAAAACAAAAAGAAAAAAAAGAAGTTACGTGGCGAAACCGAAAATAAGCAGAGG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCGTACAACGAAGGTTGATTGCCCTGGTTGTCCTCCGCTTCGTGCCCTAACGTTCGATGTGCTTGGTCTCGTTAAAGTGATCGAAGCTCGGGGGAAGGAAGGTGA
GATTCCAAAGGTGGTTGAGAGGTGGGGCGAACCCGATTTCTCCAAATCAGTGCTCGCAGCTTCTCTTGTTGATCGTAAATTCGATCCTTTACTAGCTGTTGCACGGAAAG
ATGGTCTGATTGAAGTTCTTAATCCTTTGAATGGGAATCCTCACGCTGCAATTTCTGACAATACGGATGCCTCTCCTCTATCAAAAGATGATGCTATTATTGGAATACAT
TTATTTGCTAAAAATGAGTTGGAGTTGGAATCCAGGCGTTGCACCTTGCTTTCATGTACAACAAAAGGAAATGCAAGCATGAGGTCGATTGATTTTTCTAGTTCATCTTC
AAAAGTTACCTCTACCAATCTTTTAAAAACATGGAACGTATGTGGTTCGGGTGATGTTACGTGTTCTAAAGTTGATGGAAGTGAAACCTGTGCATTATTTGGAGGGAAGG
GTGTTGAAGTTAATATGTGGAATCTAGAACAGTGTACTAAGATTTGGAATGCAAAATCACCGAAAAAGAACAGCCTTGGTATTTTTACCCCAACATGGTTCACATCAACG
ACATTTCTTAGTAAGGATGACCACCGTAAGTTTGCATCTGGTACCAACAACCACGAGGTTCGACTATATGACATCTCTGCTCAGAGGAGACCTGTTATCTCATTTGATTT
TCGGGAGACTCCTATTAAAGCTTTGGCAGCAGATGTAGATGGTAATACAATATTTTTGGGGAATGCATCTGGTGATCTCGCATCTTTCGATATTCGCAATGGTAAGCTAT
TGGGTTGCTTCTTGGGGAAATGTTCTGGCAGCATAAGATCCATAGCAAGGCACCCAGAGTTCCCGGTCATAGCATCATGTGGATTAGATAGCTATGTGCGCTTCTGGGAT
ATTAAGACAAGGCAACTTCTGTCTGCGGTATTCCTAAAGCAGCATCTTACCAGTGTTGTCTTTGATTCCCATTTTGTTGGAGAAGCAGATGTAACAAACTCTGCAGTAGA
GTCAATCCAACAGGAAACAGACGCAGCCGAAATTGTCGTTGAGGAGGAACATGTGCCTCAGAAAAGAAAAAAGGCATCTAAAGATGGAGAAAGCGGAAAGAGAAAGGGTA
GCAAGACTGCAGACAAAGAAAACAAAAAGAAAAAAAAGAAGTTACGTGGCGAAACCGAAAATAAGCAGAGG
Protein sequenceShow/hide protein sequence
MPRTTKVDCPGCPPLRALTFDVLGLVKVIEARGKEGEIPKVVERWGEPDFSKSVLAASLVDRKFDPLLAVARKDGLIEVLNPLNGNPHAAISDNTDASPLSKDDAIIGIH
LFAKNELELESRRCTLLSCTTKGNASMRSIDFSSSSSKVTSTNLLKTWNVCGSGDVTCSKVDGSETCALFGGKGVEVNMWNLEQCTKIWNAKSPKKNSLGIFTPTWFTST
TFLSKDDHRKFASGTNNHEVRLYDISAQRRPVISFDFRETPIKALAADVDGNTIFLGNASGDLASFDIRNGKLLGCFLGKCSGSIRSIARHPEFPVIASCGLDSYVRFWD
IKTRQLLSAVFLKQHLTSVVFDSHFVGEADVTNSAVESIQQETDAAEIVVEEEHVPQKRKKASKDGESGKRKGSKTADKENKKKKKKLRGETENKQR