; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G018440 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G018440
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionGTD-binding domain-containing protein
Genome locationchr05:25693622..25697562
RNA-Seq ExpressionLsi05G018440
SyntenyLsi05G018440
Gene Ontology termsGO:0006511 - ubiquitin-dependent protein catabolic process (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0080115 - myosin XI tail binding (molecular function)
InterPro domainsIPR007656 - GTD-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134377.1 uncharacterized protein LOC101204513 [Cucumis sativus]0.0e+0080.11Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLL ASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSS+KISSV+HST +KFPL+S+ D PKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTE-DGFFSVD
         KSMLVHERNVK + VELEGE SGSSSFK RS Q M+YGD+PSV ELHCG GGD RKVISAS   ISQ DVELEDLS SPSSFSGFG++NTE DGFFSVD
Subjt:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTE-DGFFSVD

Query:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI
        SGDERE SS+NSD+YKVFPDLELDDS D+K+C EM EASV EAGNSCRRE  +DGNESDTIK LEQALEEEQ+ RA LY+ELEKER+AAATAADEAMAMI
Subjt:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS
        LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKE+EA R SFF+ DGVGVDMLDSE TP R PSF YPTEDP     CIN  
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES
                 K HSLQ+EIPSV S+ LTFEFGEESP I ADE ADAAKARGMLL QV D +KGSEEI++ELQGK MVEDENL VVPG + +L+PYLQSNES
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES

Query:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV
        N + KVEKC ELIADEQ +V EVSYD    AKT LPC     EKNGDHQ  RT+DL SVN TDPH HDIHVVEDEA+ SNEA+ NASEEPLVNGTS+IP 
Subjt:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV

Query:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN
        KCDSPSF+LLQ+EL+  R+SSDA+GRFPP+ARS+S+S+RS+LRRNSMSAVDYERSKIGNEVEWL+GRLKIVQEGREKLK SVEHKEK ++QFQLLENITN
Subjt:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN

Query:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS
          REIRQLTDP K  LQAPLPPSS DV KKRCWRSSSLSVHRSS
Subjt:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS

XP_016898951.1 PREDICTED: ubiquitin carboxyl-terminal hydrolase 2 [Cucumis melo]0.0e+0080.57Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLS DHCFQKLLVD SS+KISSV+HST +KFPL+S+ DEPKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTE-DGFFSVD
        FKSMLVHER+VKE  V+LEGE SGSSS K RS Q MIYGD+PSV ELHC  GGD RKVISASPN I Q DVELEDLS SPSSFSGFGD+NTE DGFFSVD
Subjt:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTE-DGFFSVD

Query:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI
        SGDEREASS+NSD+YKVFP LELDDS D+K+C EM EASVEEAGNSCRR+ C+DGNESDTIK LEQALEEEQ+ RATLY+ELEKER+AAATAADEAMAMI
Subjt:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS
        LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEA R SFF+ DGVGVDMLDSE TP R PSF YPTEDP     C N S
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES
                   HSLQ+EIPSVES+ LTFEF EESP I+ADEIADAAKAR +LL QV DNFKGSEEI++ELQGKGM+EDENL +VPG + +L PYLQSNES
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES

Query:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV
        NG+ KV+KC ELIADEQ KVD+VSYD   LAKTILPC     EKNGDHQ  RT+DL SVN TDPHPHDIHVVEDEA+ SNEA+ N SEEPLVNGTS+IPV
Subjt:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV

Query:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN
        KCDSPSF+LLQSEL+I R+SSDATGRFPP+ARS+S+S++S+LRRNSMSAVDYERSKIGNEVEWL+GRLKIVQEGREKLK SVEHKEK ++QFQLLENITN
Subjt:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN

Query:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKR
        HLREIRQLTDP K  L APLPPSS   + K+
Subjt:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKR

XP_022980235.1 uncharacterized protein LOC111479667 isoform X1 [Cucurbita maxima]4.6e-30275.67Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWD-EPKC
        MACEAIQLWTFNGLVAAFLDLGIAFLLLCA+SLVFFTSKFLALFG CLPCPCDGLFG+L SDHCFQKLLVD SSKKISSVLHST +KFPL+S+WD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWD-EPKC

Query:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD
        CFKSM VH+RNVKE  VE + E SG S FKTRS +GMIYGD  ++ E    GG   RK+ S SPND+ Q DVELEDL HSPSSF GFGDNN EDGFFSVD
Subjt:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD

Query:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI
        SGDE EAS +NS++YKVFPDLELDDSYD+K+C EMY AS EEA N+CR E C+DGNESD IKLLEQ+LEEEQ ARATLY+ELEKER+AAATAADEAMAMI
Subjt:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS
        LRLQEEKASIEM+ARQYQRMIEEKTAYDAEEMSILKEILVRRE+EMHFL+KEV AFR+SFFD+ GV VDMLD+EFTP   PS PYPTEDPSHMLQCIN S
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES
          D+Q          +E+PSVESRNL FEFGEESP IQA E ADAAKARGMLLHQV DNF+G EEI+ ELQGKGMVED+NL +VPG++N+L+PY +SN S
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES

Query:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV
        N + KVE+C E   DEQ KV + S D    A+T LPC+EYNLEK  DHQ+  T+D  SVN TD  PHDIHV+EDEAR+ NEA ANA EE LVNG+SSIPV
Subjt:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV

Query:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN
         CDS SF+LLQ+EL+I R+SSDATGRFPPMARS+SNSLRSELRRNSMSAVDYERSKIGNEVEWL+GRLKIVQE REK K+SVE KEK N+Q QLLENIT 
Subjt:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN

Query:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS
             + L+DP K  LQAPLPPSS DV KKRCWRSSSL +HRSS
Subjt:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS

XP_038893600.1 uncharacterized protein LOC120082482 isoform X1 [Benincasa hispida]0.0e+0087.89Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLLCAS+LVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSS+KISSVL ST KKFPL+SVWDEPKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVDS
        FKS+LVHERNVKE HVELEGE SG SSFK+RS QGM+YGDFPSV +L  GGG D RKVISAS N+ISQ DVELEDLSHSPSSFSGFGDNNTEDGFFSVDS
Subjt:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVDS

Query:  GDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMIL
        GDEREAS +NSD+YKVFP+LELDDSYD K+C EMYEASVEEAGN CR E C+DGNESDTIKLLEQALEEEQTARATLY+ELEKER+AAATAADEAMAMIL
Subjt:  GDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRST
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDG GVDMLDSEFTPARTP  PYPTEDPSHMLQCINRS 
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRST

Query:  RDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNESN
        RDE+ +NYKIHSLQY IPSVESRNLTFEFGEES  I ADE+A AAKARGMLLHQV DNFK SEEI++ELQGKGM+EDE L +VPG++N+LDPYLQSNESN
Subjt:  RDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNESN

Query:  GILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPVK
        G+ KVEKC E+IADEQ KVDEVSYD+  L KTI PCLEYNLEKNGDHQ+MRT+D+DSVN TDPHPHDIHVVEDEARI NEAIANASEEP VNGTSSIPVK
Subjt:  GILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPVK

Query:  CDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITNH
        CDSPSF LLQSELEIMRTSSDAT RFPPMARS+SNSLRSELRRNSMSAVDYERSKIGNEVEWL+GRLKIVQEGREKLK SVEHKEK N+Q QLLENITNH
Subjt:  CDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITNH

Query:  LREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS
        LREI QL DP K  LQAPLPPSS DV KKRCWRSSSLSVHRSS
Subjt:  LREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS

XP_038893608.1 uncharacterized protein LOC120082482 isoform X2 [Benincasa hispida]0.0e+0084.66Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLLCAS+LVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSS+KISSVL ST KKFPL+SVWDEPKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVDS
        FKS+LVHERNVKE HVELEGE SG SSFK+RS QGM+YGDFPSV +L  GGG D RKVISAS N+ISQ DVELEDLSHSPSSFSGFGDNNTEDGFFSVDS
Subjt:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVDS

Query:  GDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMIL
        G                               EMYEASVEEAGN CR E C+DGNESDTIKLLEQALEEEQTARATLY+ELEKER+AAATAADEAMAMIL
Subjt:  GDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRST
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDG GVDMLDSEFTPARTP  PYPTEDPSHMLQCINRS 
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRST

Query:  RDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNESN
        RDE+ +NYKIHSLQY IPSVESRNLTFEFGEES  I ADE+A AAKARGMLLHQV DNFK SEEI++ELQGKGM+EDE L +VPG++N+LDPYLQSNESN
Subjt:  RDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNESN

Query:  GILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPVK
        G+ KVEKC E+IADEQ KVDEVSYD+  L KTI PCLEYNLEKNGDHQ+MRT+D+DSVN TDPHPHDIHVVEDEARI NEAIANASEEP VNGTSSIPVK
Subjt:  GILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPVK

Query:  CDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITNH
        CDSPSF LLQSELEIMRTSSDAT RFPPMARS+SNSLRSELRRNSMSAVDYERSKIGNEVEWL+GRLKIVQEGREKLK SVEHKEK N+Q QLLENITNH
Subjt:  CDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITNH

Query:  LREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS
        LREI QL DP K  LQAPLPPSS DV KKRCWRSSSLSVHRSS
Subjt:  LREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS

TrEMBL top hitse value%identityAlignment
A0A0A0L4N9 GTD-binding domain-containing protein0.0e+0080.11Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLL ASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSS+KISSV+HST +KFPL+S+ D PKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTE-DGFFSVD
         KSMLVHERNVK + VELEGE SGSSSFK RS Q M+YGD+PSV ELHCG GGD RKVISAS   ISQ DVELEDLS SPSSFSGFG++NTE DGFFSVD
Subjt:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTE-DGFFSVD

Query:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI
        SGDERE SS+NSD+YKVFPDLELDDS D+K+C EM EASV EAGNSCRRE  +DGNESDTIK LEQALEEEQ+ RA LY+ELEKER+AAATAADEAMAMI
Subjt:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS
        LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKE+EA R SFF+ DGVGVDMLDSE TP R PSF YPTEDP     CIN  
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES
                 K HSLQ+EIPSV S+ LTFEFGEESP I ADE ADAAKARGMLL QV D +KGSEEI++ELQGK MVEDENL VVPG + +L+PYLQSNES
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES

Query:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV
        N + KVEKC ELIADEQ +V EVSYD    AKT LPC     EKNGDHQ  RT+DL SVN TDPH HDIHVVEDEA+ SNEA+ NASEEPLVNGTS+IP 
Subjt:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV

Query:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN
        KCDSPSF+LLQ+EL+  R+SSDA+GRFPP+ARS+S+S+RS+LRRNSMSAVDYERSKIGNEVEWL+GRLKIVQEGREKLK SVEHKEK ++QFQLLENITN
Subjt:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN

Query:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS
          REIRQLTDP K  LQAPLPPSS DV KKRCWRSSSLSVHRSS
Subjt:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS

A0A1S4DSI6 Ubiquitinyl hydrolase 10.0e+0080.57Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLS DHCFQKLLVD SS+KISSV+HST +KFPL+S+ DEPKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTE-DGFFSVD
        FKSMLVHER+VKE  V+LEGE SGSSS K RS Q MIYGD+PSV ELHC  GGD RKVISASPN I Q DVELEDLS SPSSFSGFGD+NTE DGFFSVD
Subjt:  FKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTE-DGFFSVD

Query:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI
        SGDEREASS+NSD+YKVFP LELDDS D+K+C EM EASVEEAGNSCRR+ C+DGNESDTIK LEQALEEEQ+ RATLY+ELEKER+AAATAADEAMAMI
Subjt:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS
        LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEA R SFF+ DGVGVDMLDSE TP R PSF YPTEDP     C N S
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES
                   HSLQ+EIPSVES+ LTFEF EESP I+ADEIADAAKAR +LL QV DNFKGSEEI++ELQGKGM+EDENL +VPG + +L PYLQSNES
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES

Query:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV
        NG+ KV+KC ELIADEQ KVD+VSYD   LAKTILPC     EKNGDHQ  RT+DL SVN TDPHPHDIHVVEDEA+ SNEA+ N SEEPLVNGTS+IPV
Subjt:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV

Query:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN
        KCDSPSF+LLQSEL+I R+SSDATGRFPP+ARS+S+S++S+LRRNSMSAVDYERSKIGNEVEWL+GRLKIVQEGREKLK SVEHKEK ++QFQLLENITN
Subjt:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN

Query:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKR
        HLREIRQLTDP K  L APLPPSS   + K+
Subjt:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKR

A0A6J1FVE4 probable myosin-binding protein 53.9e-29974.16Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWD-EPKC
        MACEAIQ WTFNGLVAAFLDLGIA+L+LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCF K+LVDRSSKK+SSV+HS  +K PLNS+ D EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWD-EPKC

Query:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD
        CFKS L+HERN  E HVE EGE SG S FKTRS Q MIYGDF SV E  C G  D +KV S SPNDISQLD  +EDLS SPSSFSGFGDNNTEDGFFSVD
Subjt:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD

Query:  SGDER-EASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAM
        SGDER E+SS++SDR KVFP        D+K+  E + AS EEAGN+CR E C+DGNESDTIKLLEQALEEEQ ARATLY+ELEKER+AAATA DEAMAM
Subjt:  SGDER-EASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAM

Query:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINR
        ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAF+KSFF+DDGV VDMLD E TP   PSF Y ++ PSHMLQCI+R
Subjt:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINR

Query:  STRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNE
        S RD+QD+NY   S QYEIPS+ESR LTFEF +ESPFI +DE ADAA+A GMLLHQ  DNF   EE ++ELQ +GMVEDENL ++ G++N+L+PYLQSN 
Subjt:  STRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNE

Query:  SNGILKVEKCMEL--IADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSS
        SNG+ KVEK  EL  IADE+ KVDEVSYD    AKTI PC+EYNLEKNGDHQ+ + KDLD +   DP  HDIHV+++EA   ++  A+AS+EP++NGTSS
Subjt:  SNGILKVEKCMEL--IADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSS

Query:  IPVKCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLEN
         P  C SPSF+LLQSEL+I R++SDA+GRFPP ARS+SN LRS+LRRNSMSAVDYERSKIG+EVE L+ RLKIVQE REKL+ S+EHK+K N+Q QLLE+
Subjt:  IPVKCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLEN

Query:  ITNHLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS
        ITNHLREIR LTDP K +LQAP PPSS  V KKRCWRSSSLS+HRSS
Subjt:  ITNHLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS

A0A6J1IHY7 uncharacterized protein LOC1114738541.4e-29673.63Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWD-EPKC
        MACEAIQ WTFNGLV AFLDLGIA+LLLCASSLVFFTSKFLALFGL LPCPCDGLFGNLSSDHCFQK+L DRSSKK+SSV+HS I+K PLNS+WD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWD-EPKC

Query:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD
        C K+ L+HERN  E HVE EGE SG S FKTRS Q  IYGDF SV E HC G  D +KV S SPNDISQLD  +EDLS SPSSFSGFGDNNTEDGFFSVD
Subjt:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD

Query:  SGDER-EASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAM
        SGDER E+SS+NSD+ KVFP        D+K+  E + AS EEAGN+CR E C+D NESDTIKLLEQALEEEQ ARATLY+ELEKER+AAATA DEAMAM
Subjt:  SGDER-EASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAM

Query:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINR
        ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRRERE+HFLE EVEAF+KSFF+DDGV VDMLD E TP   PSF Y ++ PSHML+CI+R
Subjt:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINR

Query:  STRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNE
        S RD+QD+NY   S QYEIPS+ESR  TFEF +ESPFI +DE ADAA+   MLLHQ  DNF   EE ++ELQ KGMVEDENL ++  ++N+L+PYLQSN 
Subjt:  STRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNE

Query:  SNGILKVEKCM--ELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSS
        SNG+ KVEK    E IADE+ KV EVSYD    AKTI+PC+EYNLEKNGDHQ+ +TKDLDS+   DP  HDIH++++EA   NE  A+AS+EP++NGTSS
Subjt:  SNGILKVEKCM--ELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSS

Query:  IPVKCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLEN
         P  C SPSF+LLQSEL+I R++SDA+GRFPP ARS+SN LRSELRRNSMSAVDYERSKIG+EVE L+ RLKIVQE REKL+ SVEHK+K N+Q QLLE+
Subjt:  IPVKCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLEN

Query:  ITNHLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS
        ITNHLREIRQLTDP K +LQAP PPSS  V KKRCWRSSSLS+HRSS
Subjt:  ITNHLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS

A0A6J1IVQ3 uncharacterized protein LOC111479667 isoform X12.2e-30275.67Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWD-EPKC
        MACEAIQLWTFNGLVAAFLDLGIAFLLLCA+SLVFFTSKFLALFG CLPCPCDGLFG+L SDHCFQKLLVD SSKKISSVLHST +KFPL+S+WD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWD-EPKC

Query:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD
        CFKSM VH+RNVKE  VE + E SG S FKTRS +GMIYGD  ++ E    GG   RK+ S SPND+ Q DVELEDL HSPSSF GFGDNN EDGFFSVD
Subjt:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD

Query:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI
        SGDE EAS +NS++YKVFPDLELDDSYD+K+C EMY AS EEA N+CR E C+DGNESD IKLLEQ+LEEEQ ARATLY+ELEKER+AAATAADEAMAMI
Subjt:  SGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS
        LRLQEEKASIEM+ARQYQRMIEEKTAYDAEEMSILKEILVRRE+EMHFL+KEV AFR+SFFD+ GV VDMLD+EFTP   PS PYPTEDPSHMLQCIN S
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES
          D+Q          +E+PSVESRNL FEFGEESP IQA E ADAAKARGMLLHQV DNF+G EEI+ ELQGKGMVED+NL +VPG++N+L+PY +SN S
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNES

Query:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV
        N + KVE+C E   DEQ KV + S D    A+T LPC+EYNLEK  DHQ+  T+D  SVN TD  PHDIHV+EDEAR+ NEA ANA EE LVNG+SSIPV
Subjt:  NGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPV

Query:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN
         CDS SF+LLQ+EL+I R+SSDATGRFPPMARS+SNSLRSELRRNSMSAVDYERSKIGNEVEWL+GRLKIVQE REK K+SVE KEK N+Q QLLENIT 
Subjt:  KCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEK-NDQFQLLENITN

Query:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS
             + L+DP K  LQAPLPPSS DV KKRCWRSSSL +HRSS
Subjt:  HLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS

SwissProt top hitse value%identityAlignment
F4HVS6 Probable myosin-binding protein 66.4e-1224.04Show/hide
Query:  LVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPC----PCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCCFKSMLVHER
        L+   L+  +   L     + F +++F   F L +PC      D +         + + + D   KK+SS+ +  + K  L+ +    + C  S    + 
Subjt:  LVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPC----PCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCCFKSMLVHER

Query:  N------------VKEEHVELEGETSGSSSFK-----TRSQQGMIYGDFPSV----YELHCGGGGDFRKVIS------------------ASPNDISQLD
        +             K+  + ++ E     +FK      ++ + ++     ++     + HC   G+  K+ S                   S N +S+ +
Subjt:  N------------VKEEHVELEGETSGSSSFK-----TRSQQGMIYGDFPSV----YELHCGGGGDFRKVIS------------------ASPNDISQLD

Query:  VELEDL--SHSPSSFSGFGDNNTEDGFFSVDSGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALE
         E +D+    +PS   G       + FF +   D    S++NS R+ V        S  K +  +   AS              D      +  L++ + 
Subjt:  VELEDL--SHSPSSFSGFGDNNTEDGFFSVDSGDEREASSENSDRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALE

Query:  EEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSF
         ++ +   LYMEL++ER+A+A AA+EAMAMI RLQ EKA+++M+A QYQRM++E+  YD E +  +   L +RE EM  LE E E +R+ +
Subjt:  EEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSF

Q0WNW4 Myosin-binding protein 31.0e-1745.45Show/hide
Query:  ASVEEAGNSCRRESCVDGNES-DTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILK
        A+ E+AG+     S +DG +   TI+ L + +  EQ A   LY ELE+ER+A+A +A++ MAMI RLQEEKA ++M+A QYQRM+EE+  YD E + +L 
Subjt:  ASVEEAGNSCRRESCVDGNES-DTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILK

Query:  EILVRREREMHFLEKEVEAFR
         ++V+RE+E   L++E+E +R
Subjt:  EILVRREREMHFLEKEVEAFR

Q9CAC4 Myosin-binding protein 22.3e-1746.15Show/hide
Query:  VEMYEASVEEAGNSCRRESCVDGNESD----TIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYD
        +E   +S+  A     +   VDG E      T+  L+  L+EE+ A   LY ELE ERNA+A AA E MAMI RL EEKA+++M+A QYQRM+EE+  +D
Subjt:  VEMYEASVEEAGNSCRRESCVDGNESD----TIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYD

Query:  AEEMSILKEILVRREREMHFLEKEVEAFRK
         E + +L E++V RE+E   LEKE+E +RK
Subjt:  AEEMSILKEILVRREREMHFLEKEVEAFRK

Q9FG14 Myosin-binding protein 71.3e-1234.67Show/hide
Query:  ENSDRYKVF--PDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEK
        E  +  K+F  P+LELD S + K+ +E                        + ++LL + +  +Q +   LY EL++ERNAA+TAA EAM+MILRLQ +K
Subjt:  ENSDRYKVF--PDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEK

Query:  ASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFR
        A ++M+ RQ++R  EEK  +D +E+  L++++ +RE+ +  L  E +A++
Subjt:  ASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFR

Q9LMC8 Probable myosin-binding protein 56.2e-1532.35Show/hide
Query:  LEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSF---
        L + +  ++ +   LYMEL++ER+A+A AA+ AMAMI RLQ EKA+++M+A QYQRM++E+  YD E +  +  +LV+RE EM  LE  +E +R  +   
Subjt:  LEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSF---

Query:  FDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNF
         ++ G   + LD E  P      P  + +    L+ +  S  D   +N  +      I   E  N + +        +  E  +A +++G LL Q++D  
Subjt:  FDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNF

Query:  KGSE
          SE
Subjt:  KGSE

Arabidopsis top hitse value%identityAlignment
AT1G04890.1 Protein of unknown function, DUF5935.7e-3235.35Show/hide
Query:  NSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRRER
        NS   +S V   E  +++ LE+ L+EE+ ARAT+ +EL+KER+AAA+AADEAMAMI RLQ+EKA+IEM+ARQ+QR++EE++ +DAEEM ILK+IL+RRER
Subjt:  NSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRRER

Query:  EMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHM--LQCINRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEI
        E HFLEKEVEA+R+   + + +   ++  +  P           +P H     C  R     Q+ +  +  + Y       +N          +    E+
Subjt:  EMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHM--LQCINRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEI

Query:  ADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVE-DENLCVVPGDINKLDPYL----QSNESNGI------------LKVEKCMELIADEQGKVDE
        A +      ++   T+N    + +E    GK     DEN  +V G   KL P      +S  S+G             LK+E  +EL+ +    V E
Subjt:  ADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVE-DENLCVVPGDINKLDPYL----QSNESNGI------------LKVEKCMELIADEQGKVDE

AT1G04890.1 Protein of unknown function, DUF5931.7e-0761.11Show/hide
Query:  RFPPMARSQSNSLRSE-LRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKL
        + PP+ R +  SL S   RR SMSAVDYER KI NEVE L+ RLK VQE RE+L
Subjt:  RFPPMARSQSNSLRSE-LRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKL

AT4G13160.1 Protein of unknown function, DUF5936.5e-2864.15Show/hide
Query:  DTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRK
        D ++LLE A+E+E+ A+A L +ELE+ER A+A+AADEAMAMILRLQ +KAS+EM+ +QY+RMI+EK AYD EEM+ILKEIL +RERE HFLEKE+E ++ 
Subjt:  DTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRK

Query:  SFFDDD
           DDD
Subjt:  SFFDDD

AT4G13160.1 Protein of unknown function, DUF5934.5e-0546.77Show/hide
Query:  TFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVD
        TF G++ AF++L  A+ LLC S+ VF TSK L    L +PC      G  +SD C QKLL D
Subjt:  TFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVD

AT4G13630.1 Protein of unknown function, DUF5934.0e-6233.01Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSV-WDEPKC
        M C+ ++ WTF GLVAAF+DL +AF LLCAS +V+ TSKFL LFGL LPCPCDGL+       CFQ+ L +   KKISSV  S   + P +S+ ++  K 
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSV-WDEPKC

Query:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD
                +R  +   V+LE E S ++               PSV +      G    +++A         V+ + LS   S +       +  G  S +
Subjt:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD

Query:  -SGDEREASSENS-DRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVD-GNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAM
         S DE +    NS D  K   D+ L  S            S+   G      +C   G    T+++ EQ L EE+ ARA+L +ELEKERNAAA+AADEA+
Subjt:  -SGDEREASSENS-DRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVD-GNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAM

Query:  AMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTP-SFPYPTEDPSHMLQC
         MILRLQEEKASIEM+ARQYQRMIEEK+A+DAEEMSILKEIL+RRERE HFLEKEV+ +R+ F + +            P  TP S P   E     LQ 
Subjt:  AMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTP-SFPYPTEDPSHMLQC

Query:  INRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQ
          + T    D      S  +EI + +  N   +     PF    +I    +  G  L+            E   +   +   + LC  P DI        
Subjt:  INRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQ

Query:  SNESNGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTS
                                                                          D H HDIHVV DE                 +   
Subjt:  SNESNGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTS

Query:  SIPVKCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEKNDQFQLLEN
         + V  D  + +L   +L+  ++ SD +  FP   + +SN + + +RRNSMSA+DYER KI +EV  L+GRL+ VQ+GREK+  S + + K+ Q Q   +
Subjt:  SIPVKCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEKNDQFQLLEN

Query:  ITNHLREIRQ-----LTDPEKTVLQAPLPP
         T+   E R+      + P  T+++A   P
Subjt:  ITNHLREIRQ-----LTDPEKTVLQAPLPP

AT4G13630.2 Protein of unknown function, DUF5934.0e-6233.01Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSV-WDEPKC
        M C+ ++ WTF GLVAAF+DL +AF LLCAS +V+ TSKFL LFGL LPCPCDGL+       CFQ+ L +   KKISSV  S   + P +S+ ++  K 
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSV-WDEPKC

Query:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD
                +R  +   V+LE E S ++               PSV +      G    +++A         V+ + LS   S +       +  G  S +
Subjt:  CFKSMLVHERNVKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVD

Query:  -SGDEREASSENS-DRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVD-GNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAM
         S DE +    NS D  K   D+ L  S            S+   G      +C   G    T+++ EQ L EE+ ARA+L +ELEKERNAAA+AADEA+
Subjt:  -SGDEREASSENS-DRYKVFPDLELDDSYDKKLCVEMYEASVEEAGNSCRRESCVD-GNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAM

Query:  AMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTP-SFPYPTEDPSHMLQC
         MILRLQEEKASIEM+ARQYQRMIEEK+A+DAEEMSILKEIL+RRERE HFLEKEV+ +R+ F + +            P  TP S P   E     LQ 
Subjt:  AMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTP-SFPYPTEDPSHMLQC

Query:  INRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQ
          + T    D      S  +EI + +  N   +     PF    +I    +  G  L+            E   +   +   + LC  P DI        
Subjt:  INRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQ

Query:  SNESNGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTS
                                                                          D H HDIHVV DE                 +   
Subjt:  SNESNGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEMRTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTS

Query:  SIPVKCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEKNDQFQLLEN
         + V  D  + +L   +L+  ++ SD +  FP   + +SN + + +RRNSMSA+DYER KI +EV  L+GRL+ VQ+GREK+  S + + K+ Q Q   +
Subjt:  SIPVKCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEVEWLKGRLKIVQEGREKLKVSVEHKEKNDQFQLLEN

Query:  ITNHLREIRQ-----LTDPEKTVLQAPLPP
         T+   E R+      + P  T+++A   P
Subjt:  ITNHLREIRQ-----LTDPEKTVLQAPLPP

AT5G16720.1 Protein of unknown function, DUF5937.2e-1945.45Show/hide
Query:  ASVEEAGNSCRRESCVDGNES-DTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILK
        A+ E+AG+     S +DG +   TI+ L + +  EQ A   LY ELE+ER+A+A +A++ MAMI RLQEEKA ++M+A QYQRM+EE+  YD E + +L 
Subjt:  ASVEEAGNSCRRESCVDGNES-DTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILK

Query:  EILVRREREMHFLEKEVEAFR
         ++V+RE+E   L++E+E +R
Subjt:  EILVRREREMHFLEKEVEAFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTGTGAAGCTATACAACTGTGGACATTTAATGGATTAGTGGCTGCATTTCTTGATCTTGGTATAGCTTTCTTATTATTATGTGCATCAAGTCTTGTTTTCTTTAC
ATCCAAATTTCTTGCACTGTTTGGATTATGTCTACCTTGCCCTTGTGATGGGCTATTTGGGAACCTTAGTAGTGATCACTGCTTCCAAAAGTTACTTGTGGATCGTTCGT
CCAAAAAAATATCTTCAGTCTTACATTCGACTATAAAAAAGTTCCCATTAAATTCCGTGTGGGATGAGCCAAAATGTTGTTTTAAGTCAATGTTGGTGCACGAGAGGAAT
GTCAAAGAGGAACATGTTGAATTGGAAGGTGAAACATCAGGTAGTTCCTCTTTTAAAACTAGATCACAACAGGGTATGATTTATGGAGACTTTCCCAGTGTCTACGAATT
GCATTGTGGAGGTGGTGGGGATTTCCGGAAGGTCATATCAGCGTCTCCGAACGACATTTCACAGTTGGATGTGGAACTGGAAGACCTTTCTCATTCTCCTTCAAGCTTCA
GTGGATTTGGGGATAACAATACTGAGGACGGCTTCTTTTCTGTTGATTCTGGAGATGAAAGGGAGGCTTCATCAGAAAACAGTGATCGATATAAAGTATTTCCCGATCTT
GAACTGGATGATTCTTATGATAAGAAACTATGCGTGGAGATGTATGAAGCATCTGTTGAAGAGGCTGGGAATAGCTGCAGAAGGGAGTCATGTGTGGATGGTAATGAGAG
TGATACAATCAAACTATTGGAACAAGCACTTGAAGAAGAGCAGACGGCTCGCGCTACCCTGTACATGGAGCTCGAGAAAGAGAGAAATGCTGCTGCCACTGCTGCTGATG
AGGCCATGGCTATGATATTACGTCTTCAGGAGGAGAAGGCATCGATAGAAATGGATGCTAGGCAATACCAGAGGATGATAGAGGAGAAGACTGCTTATGATGCGGAGGAA
ATGAGTATTCTTAAAGAAATTCTAGTGAGAAGAGAGCGGGAAATGCATTTCTTAGAGAAGGAAGTTGAAGCTTTTCGAAAAAGTTTCTTTGACGATGATGGAGTGGGTGT
TGATATGCTTGACTCGGAATTTACACCTGCAAGGACCCCTTCTTTTCCTTATCCAACTGAAGATCCATCGCATATGCTTCAATGCATTAATAGATCCACTAGAGACGAGC
AAGATTCCAATTACAAAATACATTCTTTGCAATATGAGATACCATCAGTGGAGTCACGAAACTTAACTTTTGAGTTTGGGGAAGAGTCGCCATTTATTCAAGCGGATGAA
ATTGCTGATGCTGCAAAAGCTAGGGGCATGTTGTTGCACCAAGTTACTGATAACTTCAAGGGCAGTGAAGAGATTGAGCATGAGTTACAAGGAAAGGGCATGGTAGAAGA
TGAGAATTTATGCGTTGTACCAGGAGACATAAATAAACTGGATCCATATCTCCAAAGCAATGAGTCAAATGGTATACTTAAGGTTGAGAAATGCATGGAATTAATTGCAG
ATGAACAAGGAAAAGTCGACGAAGTTTCGTATGACGAATCGGTATTGGCTAAAACAATTCTTCCTTGTCTTGAGTACAATTTGGAAAAGAATGGTGACCATCAAGAGATG
CGGACAAAAGATCTTGACTCTGTGAACGCTACAGATCCCCATCCTCATGATATTCATGTCGTTGAAGATGAAGCTAGAATTTCAAATGAAGCAATTGCTAATGCAAGTGA
AGAACCATTGGTTAATGGTACCTCGAGTATTCCGGTTAAATGTGACAGTCCATCCTTCAACTTGCTGCAGAGTGAACTTGAAATCATGAGAACTAGCTCAGATGCCACTG
GAAGATTTCCACCAATGGCTCGTTCTCAAAGCAATTCCTTGCGTTCTGAATTGCGTAGAAATTCGATGTCTGCTGTCGATTATGAAAGGTCAAAAATTGGCAATGAAGTT
GAGTGGCTCAAAGGAAGGCTAAAAATTGTTCAAGAGGGAAGAGAAAAACTCAAGGTCTCTGTGGAGCACAAAGAGAAGAACGATCAGTTTCAACTTTTAGAAAACATAAC
AAATCACCTTCGCGAGATCCGCCAACTGACGGATCCTGAAAAGACAGTTCTGCAGGCTCCATTGCCTCCATCCTCAACGGATGTGTTGAAGAAACGTTGCTGGCGAAGCT
CATCCTTGAGCGTTCACAGAAGCAGCTAG
mRNA sequenceShow/hide mRNA sequence
GCATTTTCGAATTTAATCTTTCGATTTCTCTTCCTACTTTCTCAGCGTTATGATTCTCTTCCTCACACGACGTTACTAAATCAAACTTTTATGGACATTTATTAGGGTTT
TTATATTCAAGAAATCGAAAGTGGAAACCGCCACCATTGAAAGCATATTGTTGTTAAGTTGCTGAGAGTTGAAAGCTAATGGAAGTTTGATTGAAATGTTGGTTTCTGTT
CGATTTTTACATTTCTGCTGTTTAAGGTTTTGTGATTACAGTTGATGCTCACTGTAGCGCAAGAGTGAAGTCTGCGGCTGCATATAGTATAGATTTGTAGATTCAGTAAG
GGACTTGTTCGATTGTCTGAACTTTTCACCGGCGAAAACGGGGTTTTGTTTTATATTCTGTTTTTTTTTTCTTTTTTCTTTGGCTTGATTCATAGTATCGATCTTATTTA
TGTTCGTCAGCGACTAGAAAAGTAGTGGCTGCATTGGATATTGAAGTTATTTTTGTAGGAAATGGCTTGTGAAGCTATACAACTGTGGACATTTAATGGATTAGTGGCTG
CATTTCTTGATCTTGGTATAGCTTTCTTATTATTATGTGCATCAAGTCTTGTTTTCTTTACATCCAAATTTCTTGCACTGTTTGGATTATGTCTACCTTGCCCTTGTGAT
GGGCTATTTGGGAACCTTAGTAGTGATCACTGCTTCCAAAAGTTACTTGTGGATCGTTCGTCCAAAAAAATATCTTCAGTCTTACATTCGACTATAAAAAAGTTCCCATT
AAATTCCGTGTGGGATGAGCCAAAATGTTGTTTTAAGTCAATGTTGGTGCACGAGAGGAATGTCAAAGAGGAACATGTTGAATTGGAAGGTGAAACATCAGGTAGTTCCT
CTTTTAAAACTAGATCACAACAGGGTATGATTTATGGAGACTTTCCCAGTGTCTACGAATTGCATTGTGGAGGTGGTGGGGATTTCCGGAAGGTCATATCAGCGTCTCCG
AACGACATTTCACAGTTGGATGTGGAACTGGAAGACCTTTCTCATTCTCCTTCAAGCTTCAGTGGATTTGGGGATAACAATACTGAGGACGGCTTCTTTTCTGTTGATTC
TGGAGATGAAAGGGAGGCTTCATCAGAAAACAGTGATCGATATAAAGTATTTCCCGATCTTGAACTGGATGATTCTTATGATAAGAAACTATGCGTGGAGATGTATGAAG
CATCTGTTGAAGAGGCTGGGAATAGCTGCAGAAGGGAGTCATGTGTGGATGGTAATGAGAGTGATACAATCAAACTATTGGAACAAGCACTTGAAGAAGAGCAGACGGCT
CGCGCTACCCTGTACATGGAGCTCGAGAAAGAGAGAAATGCTGCTGCCACTGCTGCTGATGAGGCCATGGCTATGATATTACGTCTTCAGGAGGAGAAGGCATCGATAGA
AATGGATGCTAGGCAATACCAGAGGATGATAGAGGAGAAGACTGCTTATGATGCGGAGGAAATGAGTATTCTTAAAGAAATTCTAGTGAGAAGAGAGCGGGAAATGCATT
TCTTAGAGAAGGAAGTTGAAGCTTTTCGAAAAAGTTTCTTTGACGATGATGGAGTGGGTGTTGATATGCTTGACTCGGAATTTACACCTGCAAGGACCCCTTCTTTTCCT
TATCCAACTGAAGATCCATCGCATATGCTTCAATGCATTAATAGATCCACTAGAGACGAGCAAGATTCCAATTACAAAATACATTCTTTGCAATATGAGATACCATCAGT
GGAGTCACGAAACTTAACTTTTGAGTTTGGGGAAGAGTCGCCATTTATTCAAGCGGATGAAATTGCTGATGCTGCAAAAGCTAGGGGCATGTTGTTGCACCAAGTTACTG
ATAACTTCAAGGGCAGTGAAGAGATTGAGCATGAGTTACAAGGAAAGGGCATGGTAGAAGATGAGAATTTATGCGTTGTACCAGGAGACATAAATAAACTGGATCCATAT
CTCCAAAGCAATGAGTCAAATGGTATACTTAAGGTTGAGAAATGCATGGAATTAATTGCAGATGAACAAGGAAAAGTCGACGAAGTTTCGTATGACGAATCGGTATTGGC
TAAAACAATTCTTCCTTGTCTTGAGTACAATTTGGAAAAGAATGGTGACCATCAAGAGATGCGGACAAAAGATCTTGACTCTGTGAACGCTACAGATCCCCATCCTCATG
ATATTCATGTCGTTGAAGATGAAGCTAGAATTTCAAATGAAGCAATTGCTAATGCAAGTGAAGAACCATTGGTTAATGGTACCTCGAGTATTCCGGTTAAATGTGACAGT
CCATCCTTCAACTTGCTGCAGAGTGAACTTGAAATCATGAGAACTAGCTCAGATGCCACTGGAAGATTTCCACCAATGGCTCGTTCTCAAAGCAATTCCTTGCGTTCTGA
ATTGCGTAGAAATTCGATGTCTGCTGTCGATTATGAAAGGTCAAAAATTGGCAATGAAGTTGAGTGGCTCAAAGGAAGGCTAAAAATTGTTCAAGAGGGAAGAGAAAAAC
TCAAGGTCTCTGTGGAGCACAAAGAGAAGAACGATCAGTTTCAACTTTTAGAAAACATAACAAATCACCTTCGCGAGATCCGCCAACTGACGGATCCTGAAAAGACAGTT
CTGCAGGCTCCATTGCCTCCATCCTCAACGGATGTGTTGAAGAAACGTTGCTGGCGAAGCTCATCCTTGAGCGTTCACAGAAGCAGCTAGCTTGTTTGAAATCCCCACCT
AAAAAAAACAATAAGAATAAAATGAAACACCATAGCATATCACATGATACAGCATCCTCCAGGTTAGAAATACTAGTCTTTTCCCCCACTGCCATATAAGACTACAAGTC
ACATGTAAATTAGGAATAAATTGACATATTATGTCTTATAGTTTGACAGATTATACCTTTTGTACATTCTTTGAAATTTTTTTGATGCAGCTGGTAGAATGTAGTTGTCA
TATATGCATGAGAATGAGATGGTCAGATACTTTTGGTTATTCATCGAAGAATTTGTAGTAGCAGCAGCCATATTCGATCGATTCAAATAACTTTTTCGATGTCTTCCGGC
ATCGAAATTAAACTGAACCATGCTAAAATAAAATAGAGTTTGGAGGATCA
Protein sequenceShow/hide protein sequence
MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSTIKKFPLNSVWDEPKCCFKSMLVHERN
VKEEHVELEGETSGSSSFKTRSQQGMIYGDFPSVYELHCGGGGDFRKVISASPNDISQLDVELEDLSHSPSSFSGFGDNNTEDGFFSVDSGDEREASSENSDRYKVFPDL
ELDDSYDKKLCVEMYEASVEEAGNSCRRESCVDGNESDTIKLLEQALEEEQTARATLYMELEKERNAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEE
MSILKEILVRREREMHFLEKEVEAFRKSFFDDDGVGVDMLDSEFTPARTPSFPYPTEDPSHMLQCINRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADE
IADAAKARGMLLHQVTDNFKGSEEIEHELQGKGMVEDENLCVVPGDINKLDPYLQSNESNGILKVEKCMELIADEQGKVDEVSYDESVLAKTILPCLEYNLEKNGDHQEM
RTKDLDSVNATDPHPHDIHVVEDEARISNEAIANASEEPLVNGTSSIPVKCDSPSFNLLQSELEIMRTSSDATGRFPPMARSQSNSLRSELRRNSMSAVDYERSKIGNEV
EWLKGRLKIVQEGREKLKVSVEHKEKNDQFQLLENITNHLREIRQLTDPEKTVLQAPLPPSSTDVLKKRCWRSSSLSVHRSS