; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011650 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011650
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTransducin/WD40 repeat-like superfamily protein
Genome locationtig00153017:76515..85302
RNA-Seq ExpressionSgr011650
SyntenySgr011650
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140344.3 protein JINGUBANG [Cucumis sativus]2.3e-15069.14Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MD  V EAATPLLHSTSS+     SSS+AD+H+P TSYRF FK ++F   DF  KS SG+ SYRPLAVL GHIGSVS LALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVGNRVFTAHQDGKIRVWKVSRRSEN FRL                ++++    R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVAC G+VYSASADGKIKAWGR+K+E+E+ E+     H LLGILEGHKDVSINSVVVS DGKWV+GG SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW
        GF+MGWEK+ E                             GSADKSIGIWRRE FGRLCK+GVINGHEGPIKCLQAAPN VGEGFLLYSGSLDKSLRVWW
Subjt:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW

Query:  VPKA------SSSSSSSSSAMGVAGFVAEDS
        V KA      SSSSSSSSSAMGV  FVAEDS
Subjt:  VPKA------SSSSSSSSSAMGVAGFVAEDS

XP_008465822.1 PREDICTED: myosin heavy chain kinase B [Cucumis melo]2.4e-15269.79Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MD  V EAATPLLHSTSS+     SSS+AD+H+P +SYRF+FKD++F   DF  KSLSG+ SYRPLAVL GHIGSVSCLALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVGNRVFTAHQDGKIRVWKVSRRSEN FRL                ++++    R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVAC G+VYSASADGKIKAWGR+K+++E+ E+     H LLGILEGHKDVSINSVVVS DGKWV+GG SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW
        GF+MGWEK+ E                             GSADKSIGIWRRE FGRLCK+GVINGHEGPIKCLQAAPN VGEGFLLYSGSLDKSLRVWW
Subjt:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW

Query:  VPKASSSSSSS--SSAMGVAGFVAEDS
        V KAS SSSSS  SSAMGV  FVAEDS
Subjt:  VPKASSSSSSS--SSAMGVAGFVAEDS

XP_022146334.1 protein JINGUBANG [Momordica charantia]3.3e-15770.82Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MDR+V EAATPLLHS+SS+  SSSSSS+A+  SPA S RFDF+DLR +SYD+ CKS SGF SYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHF--------------------RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVG+RVFTAHQDGKIRVWKVSRRSEN+FRL                        R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHF--------------------RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGR+KEEK++EE+G G  H LLGILEGHKDVS+NSVVVSEDGKWVYGG+SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW
        GFIMGWEK+ E                             GSADKSIGIWRRE FGRLC + VINGHEGPI+CLQAA N VG GFLLYSGSLD+SLRVWW
Subjt:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW

Query:  VPKASSSSSSSSSAMGVAGFVAEDS
        VPK SSSSSS+    G A  VAEDS
Subjt:  VPKASSSSSSSSSAMGVAGFVAEDS

XP_022993783.1 protein JINGUBANG [Cucurbita maxima]1.7e-14566.43Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MD LV EAATPLLHSTSSD     +SS+ADE++P TSY FD K + FR  DF  KS SG+C YR LAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHF--------------------RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVG++VFTAHQDGKIRVWKVSRRSEN FRL                        R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHF--------------------RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLK+WRVSDLKCLESIKAHDDAING+V C G+VYSASADGKIKAWGRKK+E+E+        HCLLGILEGHKD S+N VVVS DGKWVYGG SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE-----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVW
        GF+MGWEK  E                              GSADKSIGIWRRE FGRLCKVGVINGH+GPIKCLQ APN VGEGFLLYSGSLD+SLRVW
Subjt:  GFIMGWEKMVE-----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVW

Query:  WVPKASSSSSSSSSAMGVAGFVAEDS
        WV KA        SAMGV   VAEDS
Subjt:  WVPKASSSSSSSSSAMGVAGFVAEDS

XP_038875529.1 protein JINGUBANG [Benincasa hispida]8.4e-15368.91Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MD LV EAA PLLHSTSS+     SSS+AD+H+P TSYRF+ K++RFR  +F CKS SG+ SYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVG+RVFTAHQDGKIRVWKVSRRSEN FRL                ++++    R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVAC G+VYSASADGKIKAWGR+K++++++E+     H LLGILEGHKDVSINSVVVS DGKWV+GG SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW
        GF+MGWEK+ E                             GSADKSIGIWRRE FGRLCK+GVINGHEGPIKCLQAAPN VGEGFLLYSGSLDKSLRVWW
Subjt:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW

Query:  VPKA------SSSSSSSSSAMGVAGFVAEDS
        V KA      SSSSSSSSSAM V+ FVAEDS
Subjt:  VPKA------SSSSSSSSSAMGVAGFVAEDS

TrEMBL top hitse value%identityAlignment
A0A0A0KN90 WD_REPEATS_REGION domain-containing protein1.1e-15069.14Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MD  V EAATPLLHSTSS+     SSS+AD+H+P TSYRF FK ++F   DF  KS SG+ SYRPLAVL GHIGSVS LALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVGNRVFTAHQDGKIRVWKVSRRSEN FRL                ++++    R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVAC G+VYSASADGKIKAWGR+K+E+E+ E+     H LLGILEGHKDVSINSVVVS DGKWV+GG SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW
        GF+MGWEK+ E                             GSADKSIGIWRRE FGRLCK+GVINGHEGPIKCLQAAPN VGEGFLLYSGSLDKSLRVWW
Subjt:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW

Query:  VPKA------SSSSSSSSSAMGVAGFVAEDS
        V KA      SSSSSSSSSAMGV  FVAEDS
Subjt:  VPKA------SSSSSSSSSAMGVAGFVAEDS

A0A1S3CR76 myosin heavy chain kinase B1.2e-15269.79Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MD  V EAATPLLHSTSS+     SSS+AD+H+P +SYRF+FKD++F   DF  KSLSG+ SYRPLAVL GHIGSVSCLALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVGNRVFTAHQDGKIRVWKVSRRSEN FRL                ++++    R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVAC G+VYSASADGKIKAWGR+K+++E+ E+     H LLGILEGHKDVSINSVVVS DGKWV+GG SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW
        GF+MGWEK+ E                             GSADKSIGIWRRE FGRLCK+GVINGHEGPIKCLQAAPN VGEGFLLYSGSLDKSLRVWW
Subjt:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW

Query:  VPKASSSSSSS--SSAMGVAGFVAEDS
        V KAS SSSSS  SSAMGV  FVAEDS
Subjt:  VPKASSSSSSS--SSAMGVAGFVAEDS

A0A5D3BEH9 Myosin heavy chain kinase B1.2e-15269.79Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MD  V EAATPLLHSTSS+     SSS+AD+H+P +SYRF+FKD++F   DF  KSLSG+ SYRPLAVL GHIGSVSCLALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVGNRVFTAHQDGKIRVWKVSRRSEN FRL                ++++    R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVAC G+VYSASADGKIKAWGR+K+++E+ E+     H LLGILEGHKDVSINSVVVS DGKWV+GG SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW
        GF+MGWEK+ E                             GSADKSIGIWRRE FGRLCK+GVINGHEGPIKCLQAAPN VGEGFLLYSGSLDKSLRVWW
Subjt:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW

Query:  VPKASSSSSSS--SSAMGVAGFVAEDS
        V KAS SSSSS  SSAMGV  FVAEDS
Subjt:  VPKASSSSSSS--SSAMGVAGFVAEDS

A0A6J1CXU2 protein JINGUBANG9.3e-15871.06Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MDR+V EAATPLLHS+SSD  SSSSSS+A+  SPA S RFDF+DLR +SYD+ CKS SGF SYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHF--------------------RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVG+RVFTAHQDGKIRVWKVSRRSEN+FRL                        R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHF--------------------RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGR+KEEK++EE+G G  H LLGILEGHKDVS+NSVVVSEDGKWVYGG+SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW
        GFIMGWEK+ E                             GSADKSIGIWRRE FGRLC + VINGHEGPI+CLQAA N VG GFLLYSGSLD+SLRVWW
Subjt:  GFIMGWEKMVE----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWW

Query:  VPKASSSSSSSSSAMGVAGFVAEDS
        VPK SSSSSS+    G A  VAEDS
Subjt:  VPKASSSSSSSSSAMGVAGFVAEDS

A0A6J1K3A6 protein JINGUBANG8.2e-14666.43Show/hide
Query:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
        MD LV EAATPLLHSTSSD     +SS+ADE++P TSY FD K + FR  DF  KS SG+C YR LAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ
Subjt:  MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQ

Query:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHF--------------------RPQRTIW----------GIHNGLIYS
        PDLR+FTRFG GEGSVKA+VAVG++VFTAHQDGKIRVWKVSRRSEN FRL                        R  + +W           +HNGLIYS
Subjt:  PDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHF--------------------RPQRTIW----------GIHNGLIYS

Query:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD
        GSWDKTLK+WRVSDLKCLESIKAHDDAING+V C G+VYSASADGKIKAWGRKK+E+E+        HCLLGILEGHKD S+N VVVS DGKWVYGG SD
Subjt:  GSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSD

Query:  GFIMGWEKMVE-----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVW
        GF+MGWEK  E                              GSADKSIGIWRRE FGRLCKVGVINGH+GPIKCLQ APN VGEGFLLYSGSLD+SLRVW
Subjt:  GFIMGWEKMVE-----------------------------HGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVW

Query:  WVPKASSSSSSSSSAMGVAGFVAEDS
        WV KA        SAMGV   VAEDS
Subjt:  WVPKASSSSSSSSSAMGVAGFVAEDS

SwissProt top hitse value%identityAlignment
O48716 Protein JINGUBANG1.6e-5035.42Show/hide
Query:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFR-----------LETHFRPQR----
        G +  LA   + + + S  K+I VW+  +L+ F+ F C  G VKA+V  G ++FT HQDGKIRVWKVS +++++ +            +   +P+     
Subjt:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFR-----------LETHFRPQR----

Query:  -----TIWGIH------------NGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCL
              +W  H             GL+YS SWD+T+KVWR++D KCLESI AHDDA+N VV+  + +V+S SADG +KAW R ++ K  +       H L
Subjt:  -----TIWGIH------------NGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCL

Query:  LGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-------------------------MVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKC
        +  L   ++ ++ ++ VS++G  VY GSSDG +  WE+                         +V  GSADK+I +W+R+G    C + V+ GH GP+KC
Subjt:  LGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-------------------------MVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKC

Query:  L-----QAAPNVVGEGFLLYSGSLDKSLRVWWVPKA
        L     + A     + +++YSGSLDKS++VW V ++
Subjt:  L-----QAAPNVVGEGFLLYSGSLDKSLRVWWVPKA

P90648 Myosin heavy chain kinase B1.0e-1529.11Show/hide
Query:  RPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHFRPQRTIWGI
        R +  L GH   V  + L  +++ S S  K I VW    L           +VK L   G  +F+   D  I+VW +     N + L+ H +   TI  +
Subjt:  RPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHFRPQRTIWGI

Query:  HNGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKW
           L YSGS+DKT++VW +  L+C  +++ HD  +  +V C  ++++AS D  IK W  +                    LEGH + ++  + V ED K 
Subjt:  HNGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKW

Query:  VYGGSSDGFIMGW
        V   S D  I  W
Subjt:  VYGGSSDGFIMGW

Q86VZ2 WD repeat-containing protein 5B6.1e-1326.19Show/hide
Query:  SYRPLAVLSGHIGSVSCLALC--GEFILSASQGKDIIVWQQPDLRL-FTRFGCG-EGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETH----
        +Y     L GH  +VS +     GE++ S+S  + II+W   D +   T +G   E S  A  +  +R+ +A  D  +++W V R  + +  L+ H    
Subjt:  SYRPLAVLSGHIGSVSCLALC--GEFILSASQGKDIIVWQQPDLRL-FTRFGCG-EGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETH----

Query:  ----FRPQRTIWGIHNGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGV-VACKG-VVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGH
            F P        + LI SGS+D+T+K+W V   KCL+++ AH D ++ V   C G ++ S S DG  + W                G CL  +++  
Subjt:  ----FRPQRTIWGIHNGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGV-VACKG-VVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGH

Query:  KDVSINSVVVSEDGKWVYGGSSDGFIMGWEKMVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVW
         +  ++ V  S +GK++   + D  +  W+                    GR  K     GH+    C+ A  +V G G  + SGS D  + +W
Subjt:  KDVSINSVVVSEDGKWVYGGSSDGFIMGWEKMVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVW

Q8YV57 Uncharacterized WD repeat-containing protein all21248.5e-1526.75Show/hide
Query:  LAVLSGHIGSVSCLALC--GEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAV--GNRVFTAHQDGKIRVWKVSRRSENVFRLETHFRPQRTIW
        L   +GH G V  +        I SAS    I +WQ+P +        G   V A+  +  G+ + TA  DG I++W     S++   L+T     + I+
Subjt:  LAVLSGHIGSVSCLALC--GEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAV--GNRVFTAHQDGKIRVWKVSRRSENVFRLETHFRPQRTIW

Query:  GI----HNGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKG--VVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSV
        GI       LI S + DKT+K+WRV D K L+++  HD+ +N V        + SAS D  +K W     + +K              L+GH D  +  V
Subjt:  GI----HNGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKG--VVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSV

Query:  VVSEDGKWVYGGSSDGFIMGWE--------------------------KMVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLL
          S DGK +   S+D  I  W+                           M+   SADK++ +WR    G L  +   +GH   +     +P    +G  +
Subjt:  VVSEDGKWVYGGSSDGFIMGWE--------------------------KMVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLL

Query:  YSGSLDKSLRVWWV
         S S DK++++W +
Subjt:  YSGSLDKSLRVWWV

Q9SY00 COMPASS-like H3K4 histone methylase component WDR5B6.9e-1726.12Show/hide
Query:  YRPLAVLSGHIGSVSCLALC--GEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFT--AHQDGKIRVWKVSRRSE--NVFRLETHFRP
        YR L  L GH  ++SC+     G  + SAS  K +I+W   +  L  R+      +  L    +  +T  A  D  +R+W      E   V R  T+F  
Subjt:  YRPLAVLSGHIGSVSCLALC--GEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFT--AHQDGKIRVWKVSRRSE--NVFRLETHFRP

Query:  QRTIWGIH----NGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACK--GVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDV
           ++ ++    + LI SGS+D+T+++W V   KC+  IKAH   I+ V   +   ++ SAS DG  K W  K+            G CL  +++  K  
Subjt:  QRTIWGIH----NGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACK--GVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCLLGILEGHKDV

Query:  SINSVVVSEDGKWVYGGSSDGFIMGWEKMVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVW
        +++    S +GK++   + D  +    K+  + +             G+  K  V  GH   + C+ +A +V   G  + SGS D  + +W
Subjt:  SINSVVVSEDGKWVYGGSSDGFIMGWEKMVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVW

Arabidopsis top hitse value%identityAlignment
AT2G26490.1 Transducin/WD40 repeat-like superfamily protein1.2e-5135.42Show/hide
Query:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFR-----------LETHFRPQR----
        G +  LA   + + + S  K+I VW+  +L+ F+ F C  G VKA+V  G ++FT HQDGKIRVWKVS +++++ +            +   +P+     
Subjt:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFR-----------LETHFRPQR----

Query:  -----TIWGIH------------NGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCL
              +W  H             GL+YS SWD+T+KVWR++D KCLESI AHDDA+N VV+  + +V+S SADG +KAW R ++ K  +       H L
Subjt:  -----TIWGIH------------NGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCL

Query:  LGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-------------------------MVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKC
        +  L   ++ ++ ++ VS++G  VY GSSDG +  WE+                         +V  GSADK+I +W+R+G    C + V+ GH GP+KC
Subjt:  LGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-------------------------MVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKC

Query:  L-----QAAPNVVGEGFLLYSGSLDKSLRVWWVPKA
        L     + A     + +++YSGSLDKS++VW V ++
Subjt:  L-----QAAPNVVGEGFLLYSGSLDKSLRVWWVPKA

AT3G18950.1 Transducin/WD40 repeat-like superfamily protein1.3e-4736.73Show/hide
Query:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVG-NRVFTAHQDGKIRVWKVS-------------------------------
        G V  LA  G+ + + S  K+I VW+  DL+  T F    G VKA+V  G NR+FT HQDGKIRVW+ S                               
Subjt:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVG-NRVFTAHQDGKIRVWKVS-------------------------------

Query:  -RRSENVFRLETHFRPQRTIWGIHNGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHC
         RR +NV ++  +            GL+YSGSWDKTLKVWR+SD KCLESI+AHDDAIN V A    ++++ SADG +K W       ++E  G G  H 
Subjt:  -RRSENVFRLETHFRPQRTIWGIHNGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHC

Query:  LLGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWE-------------------------KMVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIK
        L+ +L   ++ ++ ++ V+     VY GSSDG +  WE                          +V  G ADK+I +WRR G G    + V+  H GP+K
Subjt:  LLGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWE-------------------------KMVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIK

Query:  CLQAAPNVVGEG--------FLLYSGSLDKSLRVWWVPKASSS
        CL A  +  GEG        +++YSGSLDKS++VW V +++S+
Subjt:  CLQAAPNVVGEG--------FLLYSGSLDKSLRVWWVPKASSS

AT3G50390.1 Transducin/WD40 repeat-like superfamily protein5.6e-4634.12Show/hide
Query:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENV-----------------------FRL
        G +  LA  G+ + + S  K+I VW+  +   F+ F    G VKA+V  G+++FT HQDGKIRVWK + +  NV                       F  
Subjt:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENV-----------------------FRL

Query:  ETHFRPQRTIWGIH------------NGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGG
            R    +   H              L+YSGSWDKT KVWRVSDL+C+ES+ AH+DA+N VV+   G+V++ SADG +K W R+ + K+ +       
Subjt:  ETHFRPQRTIWGIH------------NGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGG

Query:  HCLLGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-------------------------MVEHGSADKSIGIWRR-EGFGRLCKVGVINGHEG
        H     L   +D ++ ++ V +    VY GSSDG +  WE+                         ++  GSAD  I +WRR EG G    + V+ GH G
Subjt:  HCLLGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-------------------------MVEHGSADKSIGIWRR-EGFGRLCKVGVINGHEG

Query:  PIKCL---QAAPNVVGE-GFLLYSGSLDKSLRVWWVPKAS
        P+KCL   +   +V GE  +++YSGSLD+S+++W V ++S
Subjt:  PIKCL---QAAPNVVGE-GFLLYSGSLDKSLRVWWVPKAS

AT3G51930.1 Transducin/WD40 repeat-like superfamily protein1.1e-11053.81Show/hide
Query:  LLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGC
        L+ S+SS + S+S+ S+      A++  F    LR          ++G  +Y+PLAVLS H+GSVS LALCGEF+LSASQGKDIIVWQQPDL++F +FG 
Subjt:  LLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGC

Query:  GEGSVKALVAVGNRVFTAHQDGKIRVWKVSRR-SENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYSGSWDKTLKVW
        G+GSVKALV+VG++VFTAHQD +IRVWKVSRR SEN FRL                ++++    R  + +W           +H G+IYSGSWDKTLKVW
Subjt:  GEGSVKALVAVGNRVFTAHQDGKIRVWKVSRR-SENVFRL----------------ETHF----RPQRTIW----------GIHNGLIYSGSWDKTLKVW

Query:  RVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEK-EKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-
        R+SDLKCLESIKAHDDAING+VA  G VYSASADGK+K WG++K ++ E         H L   LEG  +VS+NSVVVS DG WVYGG SDGF++GWEK 
Subjt:  RVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAWGRKKEEK-EKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-

Query:  ------------------------------MVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWWVPKASS
                                      MV  GSADKSIG+WRRE  G LCK GVI+GHEGP+KCLQA+PN VG GF+LYSG LDKSLRVWWVPK  +
Subjt:  ------------------------------MVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLYSGSLDKSLRVWWVPKASS

Query:  SSSSSSS
             SS
Subjt:  SSSSSSS

AT4G34380.1 Transducin/WD40 repeat-like superfamily protein1.1e-4635.62Show/hide
Query:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRS----ENVFRLET------------HF---
        G +  LA  G+ + + S  K+I VW+  +L+    F    G +KA+V  G+R+FT HQDGKIR+WKVS+R     + V  L T            HF   
Subjt:  GSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFGCGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRS----ENVFRLET------------HF---

Query:  -RPQRTIWGIHN------------GLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCL
         R + ++   HN            GL+YS SWD T+KVWR++D KCLESI AHDDAIN V++    +V++ SADG +K W       ++E  G G  H L
Subjt:  -RPQRTIWGIHN------------GLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVA-CKGVVYSASADGKIKAWGRKKEEKEKEEDGGGGGHCL

Query:  LGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-------------------------MVEHGSADKSIGIWRREGFGRLCK-VGVINGHEGPIK
          +L   ++ ++ ++ V      VY GSSDG +  WE+                         ++  GSADK+I +WRR+   +  + + V+ GH GP+K
Subjt:  LGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEK-------------------------MVEHGSADKSIGIWRREGFGRLCK-VGVINGHEGPIK

Query:  CL---------QAAPNVVGEG---FLLYSGSLDKSLRVWWV------------PKASSSSSSSSS
        CL         Q A   V EG   +++YSGSLDKS++VW V            P ASS   SSSS
Subjt:  CL---------QAAPNVVGEG---FLLYSGSLDKSLRVWWV------------PKASSSSSSSSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCGTCTTGTAGTTGAAGCGGCCACGCCATTGCTGCACTCAACAAGCTCCGACAGCGTTAGCAGCAGCAGCAGCAGCGAGGCCGACGAGCACAGTCCGGCGACCTC
CTACAGATTCGATTTCAAAGACCTCAGATTCAGATCGTATGACTTTCCCTGCAAATCTCTATCTGGGTTTTGCTCCTACCGGCCGCTGGCGGTTCTCTCCGGCCACATTG
GATCAGTTTCTTGCTTGGCTTTGTGCGGCGAGTTCATCCTCAGCGCTTCGCAAGGGAAGGACATTATCGTCTGGCAGCAGCCGGACTTGAGGCTCTTCACCAGGTTCGGC
TGCGGCGAGGGCTCGGTGAAGGCGCTGGTCGCCGTCGGGAACCGGGTTTTCACGGCCCACCAAGACGGAAAAATCAGAGTCTGGAAGGTTTCGAGGCGGTCGGAGAACGT
TTTCCGGCTGGAAACACACTTCCGACCACAAAGGACTATTTGGGGAATCCACAATGGGTTGATTTACTCTGGCTCTTGGGACAAGACCCTGAAAGTTTGGAGGGTTTCTG
ATCTCAAGTGCTTGGAATCCATTAAAGCTCATGATGATGCCATTAATGGGGTGGTGGCTTGTAAAGGGGTTGTGTACTCTGCTTCTGCAGATGGGAAAATCAAAGCATGG
GGAAGAAAGAAGGAAGAAAAAGAAAAAGAAGAAGACGGAGGAGGAGGTGGCCACTGTTTGCTGGGGATTTTGGAGGGGCATAAGGATGTTTCAATCAATTCTGTGGTGGT
TTCTGAGGATGGGAAATGGGTATATGGAGGGAGTTCAGATGGGTTCATAATGGGTTGGGAAAAAATGGTGGAACATGGGTCAGCTGATAAGAGCATTGGGATATGGAGAA
GAGAGGGTTTTGGGAGGCTGTGTAAAGTTGGGGTGATAAATGGCCATGAAGGACCAATCAAATGCTTACAGGCAGCTCCAAATGTTGTGGGTGAGGGATTCTTGTTGTAT
AGTGGAAGCCTTGACAAAAGCTTGAGAGTTTGGTGGGTTCCTAAAGCTTCTTCTTCTTCTTCTTCTTCTTCCTCTGCCATGGGAGTAGCTGGTTTTGTTGCAGAGGATTC
AGGAAGTCTGTCATCTCCTGCTGAGCAAGAAAGAGCTCCGTTCAATTGGATTTGGGTCAGAGTATCTCATAGGCCGATCGAACTCCATCTGAGAGTCTTCTTGGGCCTTC
CATGGTCATCCCATCAGCCGCGCCCGTTGCTACATCATACAAATTTCAGCCTCCCACCATGTCCATTCCATTCGATTCCATACAGCTCAAGAGCTATGGCGAGGCTCCTT
ATCTCGAGCCGAATCTTTCCCAGTCCCTCTCTCCCCGTTGCTCCTGCACGGGCATTCTTCAAACCCTCCACGTTGTCGCCGGCGGTTAGATTCTCCGGTGACCCAGCAGC
ACCAGAGGCCTACGACTGGTTACTCGTGCCGGTGCCGGTGCCAGCAGTTACATCTTCGCCTTCTCCATCCCCTTCTCTCTTATTCTGGTCACTGTCCTCACCGCTCTTAA
AATGGGCGATAACCTCGACAAGAAGTTTCTTGAGGAGTTTCTTTTTCCTTGTTTTTGCGATCTTCAACTTTATCGTTATCTGCTCGGCAGCGGTTATACTTCTGAATAAC
CTGTTCGATTCAATGCTTACTCTTGCTGTTAATCAAGCCATCATGGAGGCAAATGAGGAGAACGAAGATGGTAATTCAGAAATATCTTTTGAAGAGAAGCCTTCTCTTCC
AAGAACTCGCAACCGCCCTAAAAGGGGAGCTGAGGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCGTCTTGTAGTTGAAGCGGCCACGCCATTGCTGCACTCAACAAGCTCCGACAGCGTTAGCAGCAGCAGCAGCAGCGAGGCCGACGAGCACAGTCCGGCGACCTC
CTACAGATTCGATTTCAAAGACCTCAGATTCAGATCGTATGACTTTCCCTGCAAATCTCTATCTGGGTTTTGCTCCTACCGGCCGCTGGCGGTTCTCTCCGGCCACATTG
GATCAGTTTCTTGCTTGGCTTTGTGCGGCGAGTTCATCCTCAGCGCTTCGCAAGGGAAGGACATTATCGTCTGGCAGCAGCCGGACTTGAGGCTCTTCACCAGGTTCGGC
TGCGGCGAGGGCTCGGTGAAGGCGCTGGTCGCCGTCGGGAACCGGGTTTTCACGGCCCACCAAGACGGAAAAATCAGAGTCTGGAAGGTTTCGAGGCGGTCGGAGAACGT
TTTCCGGCTGGAAACACACTTCCGACCACAAAGGACTATTTGGGGAATCCACAATGGGTTGATTTACTCTGGCTCTTGGGACAAGACCCTGAAAGTTTGGAGGGTTTCTG
ATCTCAAGTGCTTGGAATCCATTAAAGCTCATGATGATGCCATTAATGGGGTGGTGGCTTGTAAAGGGGTTGTGTACTCTGCTTCTGCAGATGGGAAAATCAAAGCATGG
GGAAGAAAGAAGGAAGAAAAAGAAAAAGAAGAAGACGGAGGAGGAGGTGGCCACTGTTTGCTGGGGATTTTGGAGGGGCATAAGGATGTTTCAATCAATTCTGTGGTGGT
TTCTGAGGATGGGAAATGGGTATATGGAGGGAGTTCAGATGGGTTCATAATGGGTTGGGAAAAAATGGTGGAACATGGGTCAGCTGATAAGAGCATTGGGATATGGAGAA
GAGAGGGTTTTGGGAGGCTGTGTAAAGTTGGGGTGATAAATGGCCATGAAGGACCAATCAAATGCTTACAGGCAGCTCCAAATGTTGTGGGTGAGGGATTCTTGTTGTAT
AGTGGAAGCCTTGACAAAAGCTTGAGAGTTTGGTGGGTTCCTAAAGCTTCTTCTTCTTCTTCTTCTTCTTCCTCTGCCATGGGAGTAGCTGGTTTTGTTGCAGAGGATTC
AGGAAGTCTGTCATCTCCTGCTGAGCAAGAAAGAGCTCCGTTCAATTGGATTTGGGTCAGAGTATCTCATAGGCCGATCGAACTCCATCTGAGAGTCTTCTTGGGCCTTC
CATGGTCATCCCATCAGCCGCGCCCGTTGCTACATCATACAAATTTCAGCCTCCCACCATGTCCATTCCATTCGATTCCATACAGCTCAAGAGCTATGGCGAGGCTCCTT
ATCTCGAGCCGAATCTTTCCCAGTCCCTCTCTCCCCGTTGCTCCTGCACGGGCATTCTTCAAACCCTCCACGTTGTCGCCGGCGGTTAGATTCTCCGGTGACCCAGCAGC
ACCAGAGGCCTACGACTGGTTACTCGTGCCGGTGCCGGTGCCAGCAGTTACATCTTCGCCTTCTCCATCCCCTTCTCTCTTATTCTGGTCACTGTCCTCACCGCTCTTAA
AATGGGCGATAACCTCGACAAGAAGTTTCTTGAGGAGTTTCTTTTTCCTTGTTTTTGCGATCTTCAACTTTATCGTTATCTGCTCGGCAGCGGTTATACTTCTGAATAAC
CTGTTCGATTCAATGCTTACTCTTGCTGTTAATCAAGCCATCATGGAGGCAAATGAGGAGAACGAAGATGGTAATTCAGAAATATCTTTTGAAGAGAAGCCTTCTCTTCC
AAGAACTCGCAACCGCCCTAAAAGGGGAGCTGAGGTGTAA
Protein sequenceShow/hide protein sequence
MDRLVVEAATPLLHSTSSDSVSSSSSSEADEHSPATSYRFDFKDLRFRSYDFPCKSLSGFCSYRPLAVLSGHIGSVSCLALCGEFILSASQGKDIIVWQQPDLRLFTRFG
CGEGSVKALVAVGNRVFTAHQDGKIRVWKVSRRSENVFRLETHFRPQRTIWGIHNGLIYSGSWDKTLKVWRVSDLKCLESIKAHDDAINGVVACKGVVYSASADGKIKAW
GRKKEEKEKEEDGGGGGHCLLGILEGHKDVSINSVVVSEDGKWVYGGSSDGFIMGWEKMVEHGSADKSIGIWRREGFGRLCKVGVINGHEGPIKCLQAAPNVVGEGFLLY
SGSLDKSLRVWWVPKASSSSSSSSSAMGVAGFVAEDSGSLSSPAEQERAPFNWIWVRVSHRPIELHLRVFLGLPWSSHQPRPLLHHTNFSLPPCPFHSIPYSSRAMARLL
ISSRIFPSPSLPVAPARAFFKPSTLSPAVRFSGDPAAPEAYDWLLVPVPVPAVTSSPSPSPSLLFWSLSSPLLKWAITSTRSFLRSFFFLVFAIFNFIVICSAAVILLNN
LFDSMLTLAVNQAIMEANEENEDGNSEISFEEKPSLPRTRNRPKRGAEV