; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0024186 (gene) of Chayote v1 genome

Gene IDSed0024186
OrganismSechium edule (Chayote v1)
DescriptionGTD-binding domain-containing protein
Genome locationLG12:2295277..2299722
RNA-Seq ExpressionSed0024186
SyntenySed0024186
Gene Ontology termsGO:0006511 - ubiquitin-dependent protein catabolic process (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0080115 - myosin XI tail binding (molecular function)
InterPro domainsIPR007656 - GTD-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597288.1 Myosin-binding protein 3, partial [Cucurbita argyrosperma subsp. sororia]4.9e-29674.63Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MACEAIQ WTFNGLVAAFLDL IAYL+LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQK+LV+RSSKK+SSV+ SAR+K PLNSMWD EP+C
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVGRM-------NDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG
        CF+S L+ ERNG EAHVE EGE S  SF KTRS + MI GDFHSVKES C G V          NDI Q ++ED+S SPSSFSGFGDNNTEDGFFSVDSG
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVGRM-------NDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG

Query:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL
        D RE++SSD+ ++ KVFP        DEK+WAEK+   FEE GNNCRG+LCLDGNESDTI+LLE+A+EEEQ ARATLYLELEKERSAAATA DEAMAMIL
Subjt:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREM+FL KEVEAFQKSFFEDDGVDVD+ + E+TPQ  PSF YS+  PSHMLR I+RSI
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI

Query:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN
        RD Q+A+Y K S+QYEIPS+ESRKLTFEFE+ESPFI +DE ADAA+AGG LLHQAADNF         ++E+   ++E++YILQ EVNELEPYL  N SN
Subjt:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN

Query:  GLGKVEKSM--ELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP
        GL KVEKS   E IAD++EKVDEV YDGLASAK I+PCVEYN EKNGDHQKQQ +DLDS+T+IDP  HDIHVI++EA+  DE SA ASKEPV+ GTSS P
Subjt:  GLGKVEKSM--ELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP

Query:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT
        A C  PS SLL SELDITRS+SDA+ RFPP ARSR +    ++RRNSMSAVDYERSKI +EVE LRERLKIVQE REKL+FSVEHKEKENNQ +LLE IT
Subjt:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT

Query:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN
        N LREIRQLTDP K  L+AP P  SK +SKK  WRSSSLSIHRS+
Subjt:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN

XP_022942015.1 probable myosin-binding protein 5 [Cucurbita moschata]2.3e-30676.38Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MACEAIQ WTFNGLVAAFLDL IAYL+LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCF K+LV+RSSKK+SSV+ SAR+K PLNSM D EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVGRM-------NDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG
        CF+S L+ ERNG EAHVE EGE S  SFFKTRS + MI GDFHSVKES C G V          NDI Q D+ED+S SPSSFSGFGDNNTEDGFFSVDSG
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVGRM-------NDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG

Query:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL
        D RE++SSD+ ++ KVFP        DEK+WAEK+   FEE GNNCRG+LCLDGNESDTIKLLE+A+EEEQ ARATLYLELEKERSAAATA DEAMAMIL
Subjt:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFL KEVEAFQKSFFEDDGVDVD+L+ E+TPQ  PSF YS+  PSHML+ I+RSI
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI

Query:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN
        RDKQ+A+Y K S+QYEIPS+ESRKLTFEFE+ESPFI +DE ADAA+AGG LLHQAADNF      D ELQE+GMVEDE++YILQGEVNELEPYL  N SN
Subjt:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN

Query:  GLGKVEKSMEL--IADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP
        GL KVEKS EL  IAD++EKVDEV YDGLASAK I PCVEYN EKNGDHQKQQ +DLD +T++DP  HDIHVI++EA+  D+ SA+ASKEPV+ GTSS P
Subjt:  GLGKVEKSMEL--IADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP

Query:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT
        A C  PSFSLL SELDITRS+SDA+ RFPP ARSR N    D+RRNSMSAVDYERSKI +EVE LRERLKIVQE REKL+FS+EHK+KENNQ QLLE IT
Subjt:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT

Query:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN
        N LREIR LTDP K  L+AP P  SK +SKK  WRSSSLSIHRS+
Subjt:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN

XP_022974988.1 uncharacterized protein LOC111473854 [Cucurbita maxima]3.9e-30175.44Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MACEAIQ WTFNGLV AFLDL IAYLLLCASSLVFFTSKFLALFGL LPCPCDGLFGNLSSDHCFQK+L +RSSKK+SSV+ SA +K PLNSMWD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVG-------RMNDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG
        C ++ L+ ERNG EAHVE EGE S  SFFKTRS +  I GDFHSVKES C+G V          NDI Q D+ED+S SPSSFSGFGDNNTEDGFFSVDSG
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVG-------RMNDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG

Query:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL
        D RE++SSDN +Q KVFP        DEK+WAEK+   FEE GNNCRG+LCLD NESDTIKLLE+A+EEEQ ARATLYLELEKERSAAATA DEAMAMIL
Subjt:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRRERE+HFL  EVEAFQKSFFEDDGVDVD+L+ E+TPQ  PSF YS+  PSHMLR I+RSI
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI

Query:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN
        RDKQ+A+Y K S+QYEIPS+ESRK TFEFE+ESPFI +DE ADAA+ G  LLHQAADNF      D ELQEKGMVEDE++YILQ EVNELEPYL  N SN
Subjt:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN

Query:  GLGKVEKSM--ELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP
        GL KVEKS   E IAD++EKV EV YDGLASAK I+PCVEYN EKNGDHQKQQT+DLDS+T+IDP  HDIH+I++EA+  +E SA+ASKEPV+ GTSS P
Subjt:  GLGKVEKSM--ELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP

Query:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT
        A C  PSFSLL SELDITRS+SDA+ RFPP ARSR N    ++RRNSMSAVDYERSKI +EVE LRERLKIVQE REKL+FSVEHK+KENNQ QLLE IT
Subjt:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT

Query:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN
        N LREIRQLTDP K  L+AP P  SK ++KK  WRSSSLSIHRS+
Subjt:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN

XP_023539873.1 uncharacterized protein LOC111800419 [Cucurbita pepo subsp. pepo]5.6e-30876.64Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MACEAIQ WTFNGLVAAFLDL IAYL+LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQK+LV+RSSKK+SSV+ SAR+K PLNSMWD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVGRM-------NDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG
        CF+S L+ ERNG +AHVE EGE S  SF KTRS + MI GDFHSVKES C G V          NDI Q D+ED+S SPSSFSGFGDNNTEDGFFSVDSG
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVGRM-------NDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG

Query:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL
        D RE++SSDN +Q KVFP        DEK+WAEK+   FEE G+ CRG+LCLDGNESDTIKLLE+A+EEEQ ARATLYLELEKERSAAATA DEAMAMIL
Subjt:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFL KEVEAFQKSFFEDDGVDVD+L+ E+TPQ  PSF YS+  PSHML+ I+RSI
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI

Query:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN
        RDKQ+A+Y K S+QYEIPS+E RKLTFEFE+ESPFI +DE ADAA+AGG LLHQAADNF      D ELQEKGMVEDE++YILQGEVNELEPYL  N SN
Subjt:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN

Query:  GLGKVEKSMEL--IADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP
        GL KVEKS EL  IAD++EKVDEV YDGLASAK I+PCVEYN EKNGDHQKQQ +DLDS+T+IDP  HDIHVI++EA+  D+ SA+ASKEPV+ GTSS P
Subjt:  GLGKVEKSMEL--IADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP

Query:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT
        A C  PSFSLL SELDITRS+SDA+ RFPP ARSR N    ++RRNSMSAVDYERSKI +EVE LRERLKIVQE REKL+FS+EHK+KENNQ QLLE IT
Subjt:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT

Query:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN
        N LREIRQLTDP K  L+AP P  SK +SKK  WRSSSLSIHRS+
Subjt:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN

XP_038893600.1 uncharacterized protein LOC120082482 isoform X1 [Benincasa hispida]1.1e-29574.9Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MACEAI+LWTFNGLVAAFLDL IA+LLLCAS+LVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLV+RSS+KISSV+ S R KFPL+S+WD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGV-------GRMNDILQSDV--EDVSHSPSSFSGFGDNNTEDGFFSVD
        CF+S+LV ERN KEAHVELEGEAS  S FK+RS + M+ GDF SV +    GG+         +N+I QSDV  ED+SHSPSSFSGFGDNNTEDGFFSVD
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGV-------GRMNDILQSDV--EDVSHSPSSFSGFGDNNTEDGFFSVD

Query:  SGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAM
        SGD RE AS DN +QYKVFP LELD+SYD K+ AE Y    EE GN CRG+LCLDGNESDTIKLLE+A+EEEQTARATLYLELEKERSAAATAADEAMAM
Subjt:  SGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINR
        ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFL KEVEAF+KSFF+DDG  VD+L+SE TP R P   Y T DPSHML+ INR
Subjt:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINR

Query:  SIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNE
        S RD++ A+Y  HS QY IPS+ESR LTFEF EES  I ADE A AAKA G LLHQ ADNFK +  +DYELQ KGM+EDE +YI+ GEVNEL+PYL  NE
Subjt:  SIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNE

Query:  SNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP
        SNGLGKVEK  E+IAD+QEKVDEV YD LA  K I PC+EYN EKNGDHQK +TRD+DSV   DP+PHDIHV+ DEA + +EA A AS+EP V GTSS+P
Subjt:  SNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP

Query:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT
         KC  PSF LL SEL+I R+SSDAT RFPP+ARSR N    ++RRNSMSAVDYERSKI NEVEWLR RLKIVQEGREKLKFSVEHKEKENNQ QLLE IT
Subjt:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT

Query:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN
        N LREI QL DP K  L+APLP  SK +SKK  WRSSSLS+HRS+
Subjt:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN

TrEMBL top hitse value%identityAlignment
A0A1S4DSI6 Ubiquitinyl hydrolase 11.3e-27072.85Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MACEAI+LWTFNGLVAAFLDL IA+LLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLS DHCFQKLLV+ SS+KISSVV S R+KFPL+S+ D EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKE-SCCEGGVGRM------NDILQSDV--EDVSHSPSSFSGFGDNNTE-DGFFSV
        CF+SMLV ER+ KEA V+LEGEAS SS  K RS + MI GD+ SV E  C +GG  R       N ILQSDV  ED+S SPSSFSGFGD+NTE DGFFSV
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKE-SCCEGGVGRM------NDILQSDV--EDVSHSPSSFSGFGDNNTE-DGFFSV

Query:  DSGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMA
        DSGD RE ASSDN +QYKVFP LELD+S DEK+ AE      EE GN+CR +LCLDGNESDTIK LE+A+EEEQ+ RATLY+ELEKERSAAATAADEAMA
Subjt:  DSGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMA

Query:  MILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRIN
        MILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFL KEVEA + SFFE DGV VD+L+SEVTP R PSF Y T DP        
Subjt:  MILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRIN

Query:  RSIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRN
               N S  KHS Q+EIPS+ES+KLTFEFEEESP I+ADE ADAAKA   LL Q  DNFKG+  +DYELQ KGM+EDE++YI+ G+V EL PYL  N
Subjt:  RSIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRN

Query:  ESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSV
        ESNGLGKV+K  ELIAD+QEKVD+V YDGLA AK ILPC     EKNGDHQ+  TRDL SV + DP+PHDIHV+ DEA   +EA    S+EP+V GTS++
Subjt:  ESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSV

Query:  PAKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEII
        P KC  PSFSLL SELDITRSSSDAT RFPPIARSR +     +RRNSMSAVDYERSKI NEVEWLR RLKIVQEGREKLKFSVEHKEKE+NQ QLLE I
Subjt:  PAKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEII

Query:  TNQLREIRQLTDPRKETLRAPLPRCSK-AMSKK
        TN LREIRQLTDP K +L APLP  SK  M KK
Subjt:  TNQLREIRQLTDPRKETLRAPLPRCSK-AMSKK

A0A6J1CZ28 probable myosin-binding protein 51.3e-28170.82Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MAC+AIQLWTFNGLVAAFLDL IAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLV+RSSKKIS+V+ S R+KFPL+SMWD EPK 
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKE---------------------------------SCCEGGVGR-------MNDI
        CF+SMLV  RN +EA+VELEGEAS SSF KTR     I GDF  V +                                   C+GGVG          DI
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKE---------------------------------SCCEGGVGR-------MNDI

Query:  LQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAV
        L+ D+EDVS SPSSFS FGD++TED FFSVDS DG E AS DN +Q KVFP LELD+S DEK+  E Y +  ++ GNN   +L LDGNESDTIKLLERA+
Subjt:  LQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAV

Query:  EEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFED--DGV
        EEEQ ARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKT YDAEEMSILKEILVRREREMHFL KE+EA++KSF  D  DG+
Subjt:  EEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFED--DGV

Query:  DVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANV
        DVD+L+ EVTPQR PSF YST DPSHML+ I+RSI +KQ+ +Y KHS Q+E PS+ESR LTFEF EES F Q DE AD AKAGG  L Q AD+ K    +
Subjt:  DVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANV

Query:  DYELQEKGMVEDEDIYILQGEVNELEPYLHRNESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYP
        D ELQEKGMVEDE+ YILQGE NELEPYL  N+SNGLG VEKS ELIAD+ EKVD+V YD LAS+K ILP  +YN EKNGDHQ Q+T  L+SVT  D +P
Subjt:  DYELQEKGMVEDEDIYILQGEVNELEPYLHRNESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYP

Query:  HDIHVINDEATMDDEASAEASKEPVVKGTSSVPAKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRE
        HDIHVI      DDEASA ASK+PV  GTSS P KC GPSFSLL SELDITRSSSDAT RFPP+A SR N    ++RRNSMSAVDYERSKI NEVEWLRE
Subjt:  HDIHVINDEATMDDEASAEASKEPVVKGTSSVPAKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRE

Query:  RLKIVQEGREKLKFSVEHKEKENNQSQLLEIITNQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN
        RLKIVQEGREKLKF+VEH+EKENNQ QLLE ITN L EIRQLTDP K TL+APLP  SKA+SKK  WRSSSLSIHRS+
Subjt:  RLKIVQEGREKLKFSVEHKEKENNQSQLLEIITNQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN

A0A6J1FVE4 probable myosin-binding protein 51.1e-30676.38Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MACEAIQ WTFNGLVAAFLDL IAYL+LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCF K+LV+RSSKK+SSV+ SAR+K PLNSM D EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVGRM-------NDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG
        CF+S L+ ERNG EAHVE EGE S  SFFKTRS + MI GDFHSVKES C G V          NDI Q D+ED+S SPSSFSGFGDNNTEDGFFSVDSG
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVGRM-------NDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG

Query:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL
        D RE++SSD+ ++ KVFP        DEK+WAEK+   FEE GNNCRG+LCLDGNESDTIKLLE+A+EEEQ ARATLYLELEKERSAAATA DEAMAMIL
Subjt:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFL KEVEAFQKSFFEDDGVDVD+L+ E+TPQ  PSF YS+  PSHML+ I+RSI
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI

Query:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN
        RDKQ+A+Y K S+QYEIPS+ESRKLTFEFE+ESPFI +DE ADAA+AGG LLHQAADNF      D ELQE+GMVEDE++YILQGEVNELEPYL  N SN
Subjt:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN

Query:  GLGKVEKSMEL--IADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP
        GL KVEKS EL  IAD++EKVDEV YDGLASAK I PCVEYN EKNGDHQKQQ +DLD +T++DP  HDIHVI++EA+  D+ SA+ASKEPV+ GTSS P
Subjt:  GLGKVEKSMEL--IADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP

Query:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT
        A C  PSFSLL SELDITRS+SDA+ RFPP ARSR N    D+RRNSMSAVDYERSKI +EVE LRERLKIVQE REKL+FS+EHK+KENNQ QLLE IT
Subjt:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT

Query:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN
        N LREIR LTDP K  L+AP P  SK +SKK  WRSSSLSIHRS+
Subjt:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN

A0A6J1IHY7 uncharacterized protein LOC1114738541.9e-30175.44Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MACEAIQ WTFNGLV AFLDL IAYLLLCASSLVFFTSKFLALFGL LPCPCDGLFGNLSSDHCFQK+L +RSSKK+SSV+ SA +K PLNSMWD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVG-------RMNDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG
        C ++ L+ ERNG EAHVE EGE S  SFFKTRS +  I GDFHSVKES C+G V          NDI Q D+ED+S SPSSFSGFGDNNTEDGFFSVDSG
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVG-------RMNDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSG

Query:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL
        D RE++SSDN +Q KVFP        DEK+WAEK+   FEE GNNCRG+LCLD NESDTIKLLE+A+EEEQ ARATLYLELEKERSAAATA DEAMAMIL
Subjt:  DGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRRERE+HFL  EVEAFQKSFFEDDGVDVD+L+ E+TPQ  PSF YS+  PSHMLR I+RSI
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSI

Query:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN
        RDKQ+A+Y K S+QYEIPS+ESRK TFEFE+ESPFI +DE ADAA+ G  LLHQAADNF      D ELQEKGMVEDE++YILQ EVNELEPYL  N SN
Subjt:  RDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESN

Query:  GLGKVEKSM--ELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP
        GL KVEKS   E IAD++EKV EV YDGLASAK I+PCVEYN EKNGDHQKQQT+DLDS+T+IDP  HDIH+I++EA+  +E SA+ASKEPV+ GTSS P
Subjt:  GLGKVEKSM--ELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP

Query:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT
        A C  PSFSLL SELDITRS+SDA+ RFPP ARSR N    ++RRNSMSAVDYERSKI +EVE LRERLKIVQE REKL+FSVEHK+KENNQ QLLE IT
Subjt:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT

Query:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN
        N LREIRQLTDP K  L+AP P  SK ++KK  WRSSSLSIHRS+
Subjt:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN

A0A6J1IVQ3 uncharacterized protein LOC111479667 isoform X16.1e-26870.2Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        MACEAIQLWTFNGLVAAFLDL IA+LLLCA+SLVFFTSKFLALFG CLPCPCDGLFG+L SDHCFQKLLV+ SSKKISSV+ S R+KFPL+SMWD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVG-------RMNDILQSDV--EDVSHSPSSFSGFGDNNTEDGFFSVD
        CF+SM V +RN KEA VE + EAS  S+FKTRS R MI GD  ++ ES  +GGVG         ND+ QSDV  ED+ HSPSSF GFGDNN EDGFFSVD
Subjt:  CFESMLVQERNGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVG-------RMNDILQSDV--EDVSHSPSSFSGFGDNNTEDGFFSVD

Query:  SGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAM
        SGD  E AS DN NQYKVFP LELD+SYDEK+ AE YG   EE  NNCRG+ CLDGNESD IKLLE+++EEEQ ARATLYLELEKERSAAATAADEAMAM
Subjt:  SGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINR
        ILRLQEEKASIEM+ARQYQRMIEEKTAYDAEEMSILKEILVRRE+EMHFL KEV AF++SFF++ GV VD+L++E TP   PS  Y T DPSHML+ IN 
Subjt:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINR

Query:  SIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNE
        SI DKQ+         +E+PS+ESR L FEF EESP IQA EFADAAKA G LLHQ ADNF+G   +D ELQ KGMVED+++YI+ GEVNELEPY   N 
Subjt:  SIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNE

Query:  SNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP
        SN LGKVE+  E   D+QEKV +   DGL SA+  LPCVEYN EK  DHQKQ TRD  SV D D  PHDIHVI DEA M +EASA A +E +V G+SS+P
Subjt:  SNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVP

Query:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT
          C   SFSLL +ELDITRSSSDAT RFPP+ARSR N    ++RRNSMSAVDYERSKI NEVEWLR RLKIVQE REK K SVE KEKENNQ QLLE IT
Subjt:  AKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQSQLLEIIT

Query:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN
              + L+DP K  L+APLP  SK +SKK  WRSSSL IHRS+
Subjt:  NQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN

SwissProt top hitse value%identityAlignment
F4HVS6 Probable myosin-binding protein 61.5e-1342.06Show/hide
Query:  DGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEV
        D      +  L++ V  ++ +   LY+EL++ERSA+A AA+EAMAMI RLQ EKA+++M+A QYQRM++E+  YD E +  +   L +RE EM  L  E 
Subjt:  DGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEV

Query:  EAFQKSF
        E +++ +
Subjt:  EAFQKSF

Q0WNW4 Myosin-binding protein 35.0e-1747.22Show/hide
Query:  GNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVE
        G+   TI+ L   V  EQ A   LY ELE+ERSA+A +A++ MAMI RLQEEKA ++M+A QYQRM+EE+  YD E + +L  ++V+RE+E   L +E+E
Subjt:  GNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVE

Query:  AFQKSFFE
         ++    E
Subjt:  AFQKSFFE

Q9CAC4 Myosin-binding protein 22.7e-1548.54Show/hide
Query:  TIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKS
        T+  L+  ++EE+ A   LY ELE ER+A+A AA E MAMI RL EEKA+++M+A QYQRM+EE+  +D E + +L E++V RE+E   L KE+E ++K 
Subjt:  TIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKS

Query:  FFE
          E
Subjt:  FFE

Q9FG14 Myosin-binding protein 71.3e-1235.48Show/hide
Query:  IKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSF
        ++LL   V  +Q +   LY EL++ER+AA+TAA EAM+MILRLQ +KA ++M+ RQ++R  EEK  +D +E+  L++++ +RE+ +  L  E +A++   
Subjt:  IKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSF

Query:  FEDDGVDVDLLESEVTPQRPPSFL
              + ++   +    R PS +
Subjt:  FEDDGVDVDLLESEVTPQRPPSFL

Q9LMC8 Probable myosin-binding protein 53.0e-1440.62Show/hide
Query:  LDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKE
        LDG+    ++ L R V  ++ +   LY+EL++ERSA+A AA+ AMAMI RLQ EKA+++M+A QYQRM++E+  YD E +  +  +LV+RE EM  L   
Subjt:  LDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKE

Query:  VEAFQKSF---FEDDGVDVDLLESEVTP
        +E ++  +    E+ G   + L+ E  P
Subjt:  VEAFQKSF---FEDDGVDVDLLESEVTP

Arabidopsis top hitse value%identityAlignment
AT1G04890.1 Protein of unknown function, DUF5934.0e-3050.68Show/hide
Query:  EKYGTFFEEVGNNCRGKLCLDGN-ESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEM
        EK  +  ++V +N   +     N E  +++ LE  ++EE+ ARAT+ +EL+KERSAAA+AADEAMAMI RLQ+EKA+IEM+ARQ+QR++EE++ +DAEEM
Subjt:  EKYGTFFEEVGNNCRGKLCLDGN-ESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEM

Query:  SILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQ
         ILK+IL+RRERE HFL KEVEA+++   E + ++  L++ +  P+
Subjt:  SILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQ

AT1G04890.1 Protein of unknown function, DUF5931.3e-0728.68Show/hide
Query:  ADAAKAGGALLHQAADNFKGNANVDYE-------LQEKGMVEDEDIYILQGEVNELEPYLHRNESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLIL
        A AA    A++H+  D     A ++ E       ++E+   + E++ IL+  +   E   H  E     +VE   +L+ + +E     L   L   K + 
Subjt:  ADAAKAGGALLHQAADNFKGNANVDYE-------LQEKGMVEDEDIYILQGEVNELEPYLHRNESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLIL

Query:  PCVEYNWEKNGDHQKQQ---TRDLDSVTDIDPY--------PHDIHVINDE---ATMDDEASAEASKEPVVKGTSSVPAKCGGPSFSLLNSELDITRSSS
           E   ++N D Q+++    ++LD      PY          D++  + E   + + D    +   E + K  +   +  G P  SL  + + +    S
Subjt:  PCVEYNWEKNGDHQKQQ---TRDLDSVTDIDPY--------PHDIHVINDE---ATMDDEASAEASKEPVVKGTSSVPAKCGGPSFSLLNSELDITRSSS

Query:  DATVRFPPIARSR-RNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKL
            + PP+ R R ++ S    RR SMSAVDYER KI+NEVE LRERLK VQE RE+L
Subjt:  DATVRFPPIARSR-RNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKL

AT4G13160.1 Protein of unknown function, DUF5931.9e-2765.66Show/hide
Query:  DTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQ
        D ++LLE AVE+E+ A+A L +ELE+ER+A+A+AADEAMAMILRLQ +KAS+EM+ +QY+RMI+EK AYD EEM+ILKEIL +RERE HFL KE+E ++
Subjt:  DTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQ

AT4G13160.1 Protein of unknown function, DUF5931.5e-0541.25Show/hide
Query:  TFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFP
        TF G++ AF++L+ AY LLC S+ VF TSK L    L +PC      G  +SD C QKLL +   + I  V + A    P
Subjt:  TFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFP

AT4G13630.1 Protein of unknown function, DUF5931.4e-6234Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        M C+ ++ WTF GLVAAF+DLS+A+ LLCAS +V+ TSKFL LFGL LPCPCDGL+       CFQ+ L     KKISSV RS +++ P +S+  +  K 
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEAS-----------RSSFFKTRSSRCMICGDF--HSVKESCCEGGVGRMNDILQSDVEDVSHSPSSFSGFGDNNTEDGF
                +R  +   V+LE E S           ++S F   +++ +  G F   S + S      G  N   QS +        SF G  D N     
Subjt:  CFESMLVQERNGKEAHVELEGEAS-----------RSSFFKTRSSRCMICGDF--HSVKESCCEGGVGRMNDILQSDVEDVSHSPSSFSGFGDNNTEDGF

Query:  FSVDSGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNC-RGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAAD
         S DSG   ED S                          +       VG     G     G    T+++ E+ + EE+ ARA+L LELEKER+AAA+AAD
Subjt:  FSVDSGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNC-RGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAAD

Query:  EAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHML
        EA+ MILRLQEEKASIEM+ARQYQRMIEEK+A+DAEEMSILKEIL+RRERE HFL KEV+ +++ F E +       +   TP   P+ +     P    
Subjt:  EAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHML

Query:  RRINRSIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPY
        ++I     D +    +  S+ +EI + +      +     PF                          +A  D EL+E+   E E +Y            
Subjt:  RRINRSIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPY

Query:  LHRNESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKG
                        EL++           D   S KL                       +   DID + HDIHV+ DE                 KG
Subjt:  LHRNESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKG

Query:  TSSVPAKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQ
          +VP+       +  + +LD ++S SD +  FP   + + N S  +MRRNSMSA+DYER KI++EV  LR RL+ VQ+GREK+ FS + + K   Q
Subjt:  TSSVPAKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQ

AT4G13630.2 Protein of unknown function, DUF5931.4e-6234Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC
        M C+ ++ WTF GLVAAF+DLS+A+ LLCAS +V+ TSKFL LFGL LPCPCDGL+       CFQ+ L     KKISSV RS +++ P +S+  +  K 
Subjt:  MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKC

Query:  CFESMLVQERNGKEAHVELEGEAS-----------RSSFFKTRSSRCMICGDF--HSVKESCCEGGVGRMNDILQSDVEDVSHSPSSFSGFGDNNTEDGF
                +R  +   V+LE E S           ++S F   +++ +  G F   S + S      G  N   QS +        SF G  D N     
Subjt:  CFESMLVQERNGKEAHVELEGEAS-----------RSSFFKTRSSRCMICGDF--HSVKESCCEGGVGRMNDILQSDVEDVSHSPSSFSGFGDNNTEDGF

Query:  FSVDSGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNC-RGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAAD
         S DSG   ED S                          +       VG     G     G    T+++ E+ + EE+ ARA+L LELEKER+AAA+AAD
Subjt:  FSVDSGDGREDASSDNDNQYKVFPALELDESYDEKLWAEKYGTFFEEVGNNC-RGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAAD

Query:  EAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHML
        EA+ MILRLQEEKASIEM+ARQYQRMIEEK+A+DAEEMSILKEIL+RRERE HFL KEV+ +++ F E +       +   TP   P+ +     P    
Subjt:  EAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHML

Query:  RRINRSIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPY
        ++I     D +    +  S+ +EI + +      +     PF                          +A  D EL+E+   E E +Y            
Subjt:  RRINRSIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKAGGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPY

Query:  LHRNESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKG
                        EL++           D   S KL                       +   DID + HDIHV+ DE                 KG
Subjt:  LHRNESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDSVTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKG

Query:  TSSVPAKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQ
          +VP+       +  + +LD ++S SD +  FP   + + N S  +MRRNSMSA+DYER KI++EV  LR RL+ VQ+GREK+ FS + + K   Q
Subjt:  TSSVPAKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERLKIVQEGREKLKFSVEHKEKENNQ

AT5G16720.1 Protein of unknown function, DUF5933.6e-1847.22Show/hide
Query:  GNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVE
        G+   TI+ L   V  EQ A   LY ELE+ERSA+A +A++ MAMI RLQEEKA ++M+A QYQRM+EE+  YD E + +L  ++V+RE+E   L +E+E
Subjt:  GNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLAKEVE

Query:  AFQKSFFE
         ++    E
Subjt:  AFQKSFFE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTGTGAAGCTATACAACTGTGGACATTTAATGGATTAGTGGCTGCATTTCTTGATCTTAGTATAGCTTATCTTTTATTATGTGCATCTAGTCTTGTTTTCTTTAC
ATCCAAATTTCTTGCACTGTTTGGATTATGTCTGCCTTGCCCTTGTGATGGGCTATTTGGGAACCTTAGTAGTGATCATTGCTTCCAGAAGTTACTAGTAGAACGATCGT
CGAAAAAAATATCTTCAGTCGTACGTTCGGCTAGAGACAAGTTCCCATTGAATTCCATGTGGGATCATGAGCCAAAATGTTGTTTTGAGTCAATGTTAGTGCAAGAGAGG
AATGGAAAGGAGGCACATGTTGAATTGGAAGGTGAAGCATCTCGTAGTTCCTTTTTTAAGACCCGATCGTCTCGATGTATGATTTGTGGAGACTTTCACAGTGTGAAAGA
ATCGTGTTGTGAAGGCGGTGTGGGTCGTATGAATGACATTTTACAGTCGGATGTGGAAGATGTTTCTCACTCTCCTTCAAGCTTCAGTGGATTTGGGGATAATAATACAG
AGGACGGCTTCTTTTCTGTTGATTCTGGAGATGGAAGAGAGGATGCTTCATCAGACAACGATAATCAGTATAAAGTATTTCCTGCTCTCGAACTTGATGAATCTTATGAT
GAGAAACTATGGGCAGAGAAGTATGGAACATTTTTTGAAGAGGTAGGGAACAACTGTAGGGGGAAGTTATGCTTGGATGGTAATGAGAGTGATACAATCAAACTATTGGA
ACGAGCAGTCGAAGAAGAGCAGACAGCTCGTGCTACCCTATACCTGGAGCTTGAGAAAGAGAGAAGTGCTGCTGCCACTGCTGCTGATGAGGCCATGGCTATGATATTAC
GTCTTCAGGAGGAGAAAGCATCAATAGAAATGGATGCTAGGCAATACCAGAGGATGATAGAGGAGAAAACTGCTTATGATGCTGAGGAAATGAGTATTCTTAAAGAAATC
CTAGTGAGGAGAGAACGGGAAATGCATTTCCTGGCGAAGGAAGTGGAAGCTTTTCAGAAAAGTTTCTTCGAAGATGATGGAGTTGATGTTGATCTGCTCGAGTCGGAAGT
TACACCCCAAAGGCCCCCTTCTTTCCTATATTCAACTAATGATCCATCTCATATGCTTCGACGCATTAATAGATCCATTAGAGACAAACAAAATGCAAGTTACATAAAAC
ATTCTGCACAATATGAGATTCCATCGATGGAGTCACGAAAATTAACTTTTGAATTCGAGGAAGAGTCGCCATTTATTCAAGCAGATGAATTTGCTGATGCTGCAAAAGCT
GGGGGTGCGTTGTTGCACCAAGCTGCTGATAACTTCAAGGGTAATGCAAACGTTGACTATGAGTTACAAGAAAAGGGCATGGTAGAAGATGAGGATATATACATTCTACA
AGGAGAAGTAAATGAACTGGAACCATATCTACACAGAAACGAGTCAAATGGTCTTGGTAAGGTTGAGAAATCCATGGAACTGATTGCTGATGATCAAGAAAAAGTCGACG
AAGTTTTATATGATGGGTTGGCATCTGCTAAATTAATTCTTCCCTGTGTTGAGTACAATTGGGAAAAGAATGGTGACCATCAAAAGCAGCAAACAAGAGATCTAGACTCT
GTGACTGATATAGATCCCTATCCTCACGACATTCATGTCATCAATGATGAAGCTACCATGGACGATGAAGCGAGTGCCGAAGCAAGTAAGGAACCGGTGGTCAAGGGTAC
CTCGAGTGTTCCAGCAAAATGTGGTGGTCCATCCTTCAGCTTGTTGAATAGTGAACTAGACATCACAAGAAGCAGCTCAGATGCCACTGTTAGATTTCCACCAATAGCTC
GTTCTCGAAGAAATTTCTCTCATTTCGACATGCGTAGAAATTCAATGTCTGCTGTCGATTATGAAAGGTCAAAAATTGACAATGAAGTTGAGTGGCTCAGAGAAAGGTTG
AAGATTGTTCAGGAGGGAAGAGAAAAACTCAAGTTCTCTGTGGAACACAAAGAGAAGGAGAACAATCAGTCACAACTCTTAGAAATTATAACAAATCAGCTGCGTGAAAT
CCGACAACTGACGGATCCTAGAAAGGAAACTCTGCGGGCTCCATTGCCTCGATGCTCTAAGGCTATGTCAAAGAAAGGATGGTGGCGAAGCTCATCTTTAAGCATTCACA
GAAGCAACTAG
mRNA sequenceShow/hide mRNA sequence
CAAATTATTATATCTCTTTCTTAGCGTTATGATCTTCACACAACGTTCTCGAATCGACGTTTATGGAAATTTATTAGGGCTCTTAAATCCGAGAGAGCGAAATAGGGAAC
CGCCATCCATTGATTCGGTCACTCTCATTTCAACATTTCAGTAAACTGTTCTTCAAGATTTCCGTGATTTTGCTGTTGCGTTGCTGAGAATTGAAAGCTTATGGAACTTC
CAGAAGAAATTATGGTTTCTGTTCGATTCTTACCCTTCGGCTGTTTAGTGAAGTGAAGTTTGAACTTAGAATCGTAGATTCAGTTAGGTATTTGATTGTTTGAATTTTTC
TGTACTGTGTGTTTTTTGTGGTTTGATTAATAGTATTGTTCTTATTTCTGTTCGTAGTATTAGCTGCTAGAGAAGTAGTTTTTCGAATTTGTTTGATTGGATTCATTTGG
ATATTTGAAGTTGTTTTTGTGGAAAATGGCTTGTGAAGCTATACAACTGTGGACATTTAATGGATTAGTGGCTGCATTTCTTGATCTTAGTATAGCTTATCTTTTATTAT
GTGCATCTAGTCTTGTTTTCTTTACATCCAAATTTCTTGCACTGTTTGGATTATGTCTGCCTTGCCCTTGTGATGGGCTATTTGGGAACCTTAGTAGTGATCATTGCTTC
CAGAAGTTACTAGTAGAACGATCGTCGAAAAAAATATCTTCAGTCGTACGTTCGGCTAGAGACAAGTTCCCATTGAATTCCATGTGGGATCATGAGCCAAAATGTTGTTT
TGAGTCAATGTTAGTGCAAGAGAGGAATGGAAAGGAGGCACATGTTGAATTGGAAGGTGAAGCATCTCGTAGTTCCTTTTTTAAGACCCGATCGTCTCGATGTATGATTT
GTGGAGACTTTCACAGTGTGAAAGAATCGTGTTGTGAAGGCGGTGTGGGTCGTATGAATGACATTTTACAGTCGGATGTGGAAGATGTTTCTCACTCTCCTTCAAGCTTC
AGTGGATTTGGGGATAATAATACAGAGGACGGCTTCTTTTCTGTTGATTCTGGAGATGGAAGAGAGGATGCTTCATCAGACAACGATAATCAGTATAAAGTATTTCCTGC
TCTCGAACTTGATGAATCTTATGATGAGAAACTATGGGCAGAGAAGTATGGAACATTTTTTGAAGAGGTAGGGAACAACTGTAGGGGGAAGTTATGCTTGGATGGTAATG
AGAGTGATACAATCAAACTATTGGAACGAGCAGTCGAAGAAGAGCAGACAGCTCGTGCTACCCTATACCTGGAGCTTGAGAAAGAGAGAAGTGCTGCTGCCACTGCTGCT
GATGAGGCCATGGCTATGATATTACGTCTTCAGGAGGAGAAAGCATCAATAGAAATGGATGCTAGGCAATACCAGAGGATGATAGAGGAGAAAACTGCTTATGATGCTGA
GGAAATGAGTATTCTTAAAGAAATCCTAGTGAGGAGAGAACGGGAAATGCATTTCCTGGCGAAGGAAGTGGAAGCTTTTCAGAAAAGTTTCTTCGAAGATGATGGAGTTG
ATGTTGATCTGCTCGAGTCGGAAGTTACACCCCAAAGGCCCCCTTCTTTCCTATATTCAACTAATGATCCATCTCATATGCTTCGACGCATTAATAGATCCATTAGAGAC
AAACAAAATGCAAGTTACATAAAACATTCTGCACAATATGAGATTCCATCGATGGAGTCACGAAAATTAACTTTTGAATTCGAGGAAGAGTCGCCATTTATTCAAGCAGA
TGAATTTGCTGATGCTGCAAAAGCTGGGGGTGCGTTGTTGCACCAAGCTGCTGATAACTTCAAGGGTAATGCAAACGTTGACTATGAGTTACAAGAAAAGGGCATGGTAG
AAGATGAGGATATATACATTCTACAAGGAGAAGTAAATGAACTGGAACCATATCTACACAGAAACGAGTCAAATGGTCTTGGTAAGGTTGAGAAATCCATGGAACTGATT
GCTGATGATCAAGAAAAAGTCGACGAAGTTTTATATGATGGGTTGGCATCTGCTAAATTAATTCTTCCCTGTGTTGAGTACAATTGGGAAAAGAATGGTGACCATCAAAA
GCAGCAAACAAGAGATCTAGACTCTGTGACTGATATAGATCCCTATCCTCACGACATTCATGTCATCAATGATGAAGCTACCATGGACGATGAAGCGAGTGCCGAAGCAA
GTAAGGAACCGGTGGTCAAGGGTACCTCGAGTGTTCCAGCAAAATGTGGTGGTCCATCCTTCAGCTTGTTGAATAGTGAACTAGACATCACAAGAAGCAGCTCAGATGCC
ACTGTTAGATTTCCACCAATAGCTCGTTCTCGAAGAAATTTCTCTCATTTCGACATGCGTAGAAATTCAATGTCTGCTGTCGATTATGAAAGGTCAAAAATTGACAATGA
AGTTGAGTGGCTCAGAGAAAGGTTGAAGATTGTTCAGGAGGGAAGAGAAAAACTCAAGTTCTCTGTGGAACACAAAGAGAAGGAGAACAATCAGTCACAACTCTTAGAAA
TTATAACAAATCAGCTGCGTGAAATCCGACAACTGACGGATCCTAGAAAGGAAACTCTGCGGGCTCCATTGCCTCGATGCTCTAAGGCTATGTCAAAGAAAGGATGGTGG
CGAAGCTCATCTTTAAGCATTCACAGAAGCAACTAGTTTGACCGCCCCCGACACCGAAAACGGAAGAGAACAGGAAACAGGAAACACCATTGCAAAACATGATACAGAAT
CCACCAGGTTAGAAATTCTAGGTCTTTTTGCTCCACTGCCATACAAGTCATGTGTATTGGGAATAAGTTGGCAGGTTATGTCTTATAGTTTAACAGATCATCCCTTTTGT
ACTTTCTCTAAAAAAAAATTGATGCGGCTGATAGAATGTACTATTTAGTTGTCATATTGAGATTGTTAGATGCTTTTGGCGTCGAGATTAAAATCTCATTTTGATTTTTA
TATTTTGGATTTCATTTAGAGCAGGTGAGAATTATATATTTTTATTATTTTTCTTTTACATTCGTGAGTGTATAGGTCATAGGTCAGGTTACGCAGAC
Protein sequenceShow/hide protein sequence
MACEAIQLWTFNGLVAAFLDLSIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVERSSKKISSVVRSARDKFPLNSMWDHEPKCCFESMLVQER
NGKEAHVELEGEASRSSFFKTRSSRCMICGDFHSVKESCCEGGVGRMNDILQSDVEDVSHSPSSFSGFGDNNTEDGFFSVDSGDGREDASSDNDNQYKVFPALELDESYD
EKLWAEKYGTFFEEVGNNCRGKLCLDGNESDTIKLLERAVEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEI
LVRREREMHFLAKEVEAFQKSFFEDDGVDVDLLESEVTPQRPPSFLYSTNDPSHMLRRINRSIRDKQNASYIKHSAQYEIPSMESRKLTFEFEEESPFIQADEFADAAKA
GGALLHQAADNFKGNANVDYELQEKGMVEDEDIYILQGEVNELEPYLHRNESNGLGKVEKSMELIADDQEKVDEVLYDGLASAKLILPCVEYNWEKNGDHQKQQTRDLDS
VTDIDPYPHDIHVINDEATMDDEASAEASKEPVVKGTSSVPAKCGGPSFSLLNSELDITRSSSDATVRFPPIARSRRNFSHFDMRRNSMSAVDYERSKIDNEVEWLRERL
KIVQEGREKLKFSVEHKEKENNQSQLLEIITNQLREIRQLTDPRKETLRAPLPRCSKAMSKKGWWRSSSLSIHRSN