; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017235 (gene) of Chayote v1 genome

Gene IDSed0017235
OrganismSechium edule (Chayote v1)
Descriptionbeta-glucosidase BoGH3B isoform X1
Genome locationLG06:43791951..43805829
RNA-Seq ExpressionSed0017235
SyntenySed0017235
Gene Ontology termsGO:0009251 - glucan catabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domainsIPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR002772 - Glycoside hydrolase family 3 C-terminal domain
IPR017853 - Glycoside hydrolase superfamily
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446716.1 PREDICTED: beta-glucosidase BoGH3B isoform X1 [Cucumis melo]0.0e+0089.26Show/hide
Query:  MTLNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSV
        M LN  LL KWK+CGLN++ KKMAKI VQVVVILCLGW  W  MV AENLKYKDPKQPVGVRVKDLLGRMTL EKIGQM+QIDRSVANATVMKDYFIGS+
Subjt:  MTLNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSV

Query:  LSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRD
        LSGGGSVPLPDARA+DWVDMINDFQKGSLSSRLGIPM YGIDAVHGHNNVYNATVFPHNVGLGATRNPDL R IGAATALEVRATGISY FAPC+AVCRD
Subjt:  LSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRD

Query:  PRWGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSY
        PRWGRCYESYSEDPKIV+ MTEIIIGLQGEPPANYRKGTPYVG TKKVIACAKHFVGDGGTT GINENNTVI++HGLLSIHMPAYLDSIIKGVSSVM SY
Subjt:  PRWGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSY

Query:  SSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVK
        SSWNGVKMHA+R+LITDFLKG LKFKGFVISDWEGLDRITS PHSNYTYSVQAAILAGIDMVM+PYKY EFIDDL FLVK+NVIPMDRIDDAV RIL+VK
Subjt:  SSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVK

Query:  FTMGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKS
        FTMGLFESP+ DYSLVNE+GSQAHRDLARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNN TRGT+ILAAIKS
Subjt:  FTMGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKS

Query:  TVDPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGV
        TVDP+TEVVFREDP+SDFVKSNDFSYAIVVIGE PYAET GDS TLTMLDPGP+IIKNVC+ V+CVV++ISGRPIV+EPYISSIDALVAAWLPGTEG GV
Subjt:  TVDPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGV

Query:  TDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        TDALYGDHGFSGKLPRTWFKSVDQLPMNVGD HYDPL+P GFGLTTGSVKD +ARSTSAGI+GT S IA I+ AI +CIL
Subjt:  TDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

XP_022945501.1 uncharacterized protein LOC111449719 isoform X1 [Cucurbita moschata]0.0e+0091.15Show/hide
Query:  LNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLS
        LN  L  KWK+CG NS GKKMAK  VQVVVILCLGWW W IMVGAENLKYKDPKQPV VRVKDLLGRMTL EKIGQM+QIDRSVANATVMK+YFIGSVLS
Subjt:  LNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLS

Query:  GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPR
        GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+R IGAATALEVRATGISY FAPC+AVCRDPR
Subjt:  GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPR

Query:  WGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSS
        WGRCYESYSEDPK+V+NMTEIIIGLQGEPPAN+RKG PYVG TKKVIACAKHFVGDGGTT GINENNTVID+HGLL IHMPAYLDSIIKGVSSVMVSYSS
Subjt:  WGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSS

Query:  WNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFT
        WNGVKMHA+R+LIT FLK TLKFKGFVISDWEGLDRITS PHSNYTYSVQAAI AGIDMVMVPYKY EFIDDLT LVKNNV+ MDRIDDAV RILSVKFT
Subjt:  WNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFT

Query:  MGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV
        MGLFESPLGDYSLVNE+GSQAHRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGT+IL AIKSTV
Subjt:  MGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV

Query:  DPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTD
        DP+TEVVFREDP+SDFVKSN FSYAIVVIGE PYAET GDS TLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPY+SS+DALVAAWLPGTEGLGVTD
Subjt:  DPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTD

Query:  ALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        ALYGDHGFSGKLPRTWFKSVDQLPMN GDRHYDPL+PLGFGLTTGSVKD VARSTSAG Q T SFIAMI+A I IC+L
Subjt:  ALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

XP_022968399.1 uncharacterized protein LOC111467651 isoform X1 [Cucurbita maxima]0.0e+0092.55Show/hide
Query:  MAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKI VQVVVILCLGWW W IMV AENLKYKDPKQPV VRVKDLLGRMTL EKIGQM+QIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+R IGAATALEVRATGISY FAPC+AVCRDPRWGRCYESYSEDPK+V+NMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTE

Query:  IIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGT
        IIIGLQGEPPANYRKG PYVG TKKVIACAKHFVGDGGTT GINENNTVID+HGLL IHMPAYLDSIIKGVSSVMVSYSSWNGVKMHA+RDLIT FLKGT
Subjt:  IIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGT

Query:  LKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQ
        LKFKGFVISDWEGLDRITS PHSNYTYSVQAAI AGIDMVMVPYKY EFIDDL  LVKNNV+PMDRIDDAV RILSVKFTMGLFESPLGDYSLVNE+GSQ
Subjt:  LKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTVDP+TEVVFREDP+SDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSN

Query:  DFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
         FSYAIVVIGE PYAET GDS TLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISS+DALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  DFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        DQLPMN GDRHYDPL+PLGFGLTTGSVKD VARSTSAG +GT SFIAMI+A I +C+L
Subjt:  DQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

XP_022968400.1 uncharacterized protein LOC111467651 isoform X2 [Cucurbita maxima]0.0e+0091.89Show/hide
Query:  LNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLS
        LN  L  KWK+CGLNS GKKMAKI VQVVVILCLGWW W IMV AENLKYKDPKQPV VRVKDLLGRMTL EKIGQM+QIDRSVANATVMK+YFIGSVLS
Subjt:  LNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLS

Query:  GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPR
        GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+R IGAATALEVRATGISY FAPC+AVCRDPR
Subjt:  GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPR

Query:  WGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSS
        WGRCYESYSEDPK+V+NMTEIIIGLQGEPPANYRKG PYVG TKKVIACAKHFVGDGGTT GINENNTVID+HGLL IHMPAYLDSIIKGVSSVMVSYSS
Subjt:  WGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSS

Query:  WNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFT
        WNGVKMHA+RDLIT FLKGTLKFKGFVISDWEGLDRITS PHSNYTYSVQAAI AGIDMVMVPYKY EFIDDL  LVKNNV+PMDRIDDAV RILSVKFT
Subjt:  WNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFT

Query:  MGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV
        MGLFESPLGDYSLVNE+GSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTV
Subjt:  MGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV

Query:  DPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTD
        DP+TEVVFREDP+SDFVKSN FSYAIVVIGE PYAET GDS TLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISS+DALVAAWLPGTEGLGVTD
Subjt:  DPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTD

Query:  ALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        ALYGDHGFSGKLPRTWFKSVDQLPMN GDRHYDPL+PLGFGLTTGSVKD VARSTSAG +GT SFIAMI+A I +C+L
Subjt:  ALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

XP_023541708.1 uncharacterized protein LOC111801784 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0092.1Show/hide
Query:  KWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPL
        KWK+CG NS GKKMAKI VQVVVILCLGWW W IMVGAENLKYKDPKQPV VRVKDLLGRMTL EKIGQM+QIDRSVANATVMK+YFIGSVLSGGGSVPL
Subjt:  KWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPL

Query:  PDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYES
        PDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+R IGAATALEVRATGISY FAPC+AVCRDPRWGRCYES
Subjt:  PDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYES

Query:  YSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMH
        YSEDPK+V+NMTEIIIGLQGEPPANYRKG PYVG TKKVIACAKHFVGDGGTT GINENNTVID+HGLL +HMPAYLDSIIKGVSSVMVSYSSWNGVKMH
Subjt:  YSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMH

Query:  ADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESP
        A+R+LIT FLKGTLKFKGFVISDWEGLDRITS PHSNYTYSVQAAI AGIDMVMVPYKY EFIDDLT LVKNNV+ MDRIDDAV RILSVKFTMGLFESP
Subjt:  ADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESP

Query:  LGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVV
        LGDYSLVNE+GSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV+GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTVDP+TEVV
Subjt:  LGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVV

Query:  FREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHG
        FREDP+SDFVKSN FSYAIVVIGE PYAET GDS TLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPY+SS+DALVAAWLPGTEGLGVTDALYGDHG
Subjt:  FREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHG

Query:  FSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        FSGKLPRTWFKSVDQLPMN GDRHYDPL+PLGFGLTTGSVKD VARSTSAG Q T SFIAMI+A I IC+L
Subjt:  FSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

TrEMBL top hitse value%identityAlignment
A0A1S3BGE4 beta-glucosidase BoGH3B isoform X10.0e+0089.26Show/hide
Query:  MTLNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSV
        M LN  LL KWK+CGLN++ KKMAKI VQVVVILCLGW  W  MV AENLKYKDPKQPVGVRVKDLLGRMTL EKIGQM+QIDRSVANATVMKDYFIGS+
Subjt:  MTLNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSV

Query:  LSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRD
        LSGGGSVPLPDARA+DWVDMINDFQKGSLSSRLGIPM YGIDAVHGHNNVYNATVFPHNVGLGATRNPDL R IGAATALEVRATGISY FAPC+AVCRD
Subjt:  LSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRD

Query:  PRWGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSY
        PRWGRCYESYSEDPKIV+ MTEIIIGLQGEPPANYRKGTPYVG TKKVIACAKHFVGDGGTT GINENNTVI++HGLLSIHMPAYLDSIIKGVSSVM SY
Subjt:  PRWGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSY

Query:  SSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVK
        SSWNGVKMHA+R+LITDFLKG LKFKGFVISDWEGLDRITS PHSNYTYSVQAAILAGIDMVM+PYKY EFIDDL FLVK+NVIPMDRIDDAV RIL+VK
Subjt:  SSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVK

Query:  FTMGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKS
        FTMGLFESP+ DYSLVNE+GSQAHRDLARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNN TRGT+ILAAIKS
Subjt:  FTMGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKS

Query:  TVDPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGV
        TVDP+TEVVFREDP+SDFVKSNDFSYAIVVIGE PYAET GDS TLTMLDPGP+IIKNVC+ V+CVV++ISGRPIV+EPYISSIDALVAAWLPGTEG GV
Subjt:  TVDPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGV

Query:  TDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        TDALYGDHGFSGKLPRTWFKSVDQLPMNVGD HYDPL+P GFGLTTGSVKD +ARSTSAGI+GT S IA I+ AI +CIL
Subjt:  TDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

A0A6J1G118 uncharacterized protein LOC111449719 isoform X10.0e+0091.15Show/hide
Query:  LNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLS
        LN  L  KWK+CG NS GKKMAK  VQVVVILCLGWW W IMVGAENLKYKDPKQPV VRVKDLLGRMTL EKIGQM+QIDRSVANATVMK+YFIGSVLS
Subjt:  LNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLS

Query:  GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPR
        GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+R IGAATALEVRATGISY FAPC+AVCRDPR
Subjt:  GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPR

Query:  WGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSS
        WGRCYESYSEDPK+V+NMTEIIIGLQGEPPAN+RKG PYVG TKKVIACAKHFVGDGGTT GINENNTVID+HGLL IHMPAYLDSIIKGVSSVMVSYSS
Subjt:  WGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSS

Query:  WNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFT
        WNGVKMHA+R+LIT FLK TLKFKGFVISDWEGLDRITS PHSNYTYSVQAAI AGIDMVMVPYKY EFIDDLT LVKNNV+ MDRIDDAV RILSVKFT
Subjt:  WNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFT

Query:  MGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV
        MGLFESPLGDYSLVNE+GSQAHRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGT+IL AIKSTV
Subjt:  MGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV

Query:  DPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTD
        DP+TEVVFREDP+SDFVKSN FSYAIVVIGE PYAET GDS TLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPY+SS+DALVAAWLPGTEGLGVTD
Subjt:  DPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTD

Query:  ALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        ALYGDHGFSGKLPRTWFKSVDQLPMN GDRHYDPL+PLGFGLTTGSVKD VARSTSAG Q T SFIAMI+A I IC+L
Subjt:  ALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

A0A6J1G143 uncharacterized protein LOC111449719 isoform X20.0e+0091.95Show/hide
Query:  MAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAK  VQVVVILCLGWW W IMVGAENLKYKDPKQPV VRVKDLLGRMTL EKIGQM+QIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+R IGAATALEVRATGISY FAPC+AVCRDPRWGRCYESYSEDPK+V+NMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTE

Query:  IIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGT
        IIIGLQGEPPAN+RKG PYVG TKKVIACAKHFVGDGGTT GINENNTVID+HGLL IHMPAYLDSIIKGVSSVMVSYSSWNGVKMHA+R+LIT FLK T
Subjt:  IIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGT

Query:  LKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQ
        LKFKGFVISDWEGLDRITS PHSNYTYSVQAAI AGIDMVMVPYKY EFIDDLT LVKNNV+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNE+GSQ
Subjt:  LKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGT+IL AIKSTVDP+TEVVFREDP+SDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSN

Query:  DFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
         FSYAIVVIGE PYAET GDS TLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPY+SS+DALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  DFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        DQLPMN GDRHYDPL+PLGFGLTTGSVKD VARSTSAG Q T SFIAMI+A I IC+L
Subjt:  DQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

A0A6J1HX36 uncharacterized protein LOC111467651 isoform X20.0e+0091.89Show/hide
Query:  LNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLS
        LN  L  KWK+CGLNS GKKMAKI VQVVVILCLGWW W IMV AENLKYKDPKQPV VRVKDLLGRMTL EKIGQM+QIDRSVANATVMK+YFIGSVLS
Subjt:  LNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLS

Query:  GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPR
        GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+R IGAATALEVRATGISY FAPC+AVCRDPR
Subjt:  GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPR

Query:  WGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSS
        WGRCYESYSEDPK+V+NMTEIIIGLQGEPPANYRKG PYVG TKKVIACAKHFVGDGGTT GINENNTVID+HGLL IHMPAYLDSIIKGVSSVMVSYSS
Subjt:  WGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSS

Query:  WNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFT
        WNGVKMHA+RDLIT FLKGTLKFKGFVISDWEGLDRITS PHSNYTYSVQAAI AGIDMVMVPYKY EFIDDL  LVKNNV+PMDRIDDAV RILSVKFT
Subjt:  WNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFT

Query:  MGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV
        MGLFESPLGDYSLVNE+GSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTV
Subjt:  MGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV

Query:  DPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTD
        DP+TEVVFREDP+SDFVKSN FSYAIVVIGE PYAET GDS TLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISS+DALVAAWLPGTEGLGVTD
Subjt:  DPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTD

Query:  ALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        ALYGDHGFSGKLPRTWFKSVDQLPMN GDRHYDPL+PLGFGLTTGSVKD VARSTSAG +GT SFIAMI+A I +C+L
Subjt:  ALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

A0A6J1HZJ2 uncharacterized protein LOC111467651 isoform X10.0e+0092.55Show/hide
Query:  MAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKI VQVVVILCLGWW W IMV AENLKYKDPKQPV VRVKDLLGRMTL EKIGQM+QIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+R IGAATALEVRATGISY FAPC+AVCRDPRWGRCYESYSEDPK+V+NMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTE

Query:  IIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGT
        IIIGLQGEPPANYRKG PYVG TKKVIACAKHFVGDGGTT GINENNTVID+HGLL IHMPAYLDSIIKGVSSVMVSYSSWNGVKMHA+RDLIT FLKGT
Subjt:  IIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGT

Query:  LKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQ
        LKFKGFVISDWEGLDRITS PHSNYTYSVQAAI AGIDMVMVPYKY EFIDDL  LVKNNV+PMDRIDDAV RILSVKFTMGLFESPLGDYSLVNE+GSQ
Subjt:  LKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTVDP+TEVVFREDP+SDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSN

Query:  DFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
         FSYAIVVIGE PYAET GDS TLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISS+DALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  DFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL
        DQLPMN GDRHYDPL+PLGFGLTTGSVKD VARSTSAG +GT SFIAMI+A I +C+L
Subjt:  DQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICIL

SwissProt top hitse value%identityAlignment
A7LXU3 Beta-glucosidase BoGH3B3.0e-7730.85Show/hide
Query:  MAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVAN-----------------ATVMKDYFIGSVLSGGG
        M  I   V+V    G  L P    A  +   DP   +   +++ L +MTL +KIGQM +I   V +                  TV+  Y +GS+L    
Subjt:  MAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVAN-----------------ATVMKDYFIGSVLSGGG

Query:  SVPLPDARAQD-WVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWG
        +VPL  A+ ++ W + I   Q+ S+   +GIP IYG+D +HG     + T+FP  + +GAT N +L R     +A E +A  I + FAP + + RDPRW 
Subjt:  SVPLPDARAQD-WVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWG

Query:  RCYESYSEDPKIVRNM-TEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSW
        R +E+Y ED  +   M    + G QGE P         +G    V AC KH++G G    G +   + I +  +   H   +L ++ +G  SVMV+    
Subjt:  RCYESYSEDPKIVRNM-TEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSW

Query:  NGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYT--YSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKF
        NG+  HA+R+L+T++LK  L + G +++DW  ++ + +  H   T   +V+  I AGIDM MVPY+   F D L  LV+   + M+RIDDAV R+L +K+
Subjt:  NGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPPHSNYT--YSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKF

Query:  TMGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNN-ATRGTSILAAI--
         +GLF+ P  D    ++ GS+    +A  A  +S VLLKN  N    +LP++ K  KIL+ G +A+++    GGW+ +WQG   +  A    +I  A+  
Subjt:  TMGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNN-ATRGTSILAAI--

Query:  ---KSTVDPNTEVVF------------REDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVIS-GRPIVMEPYIS
           K  +     V +            + + E     +      I  IGE  Y ET G+   LT+ +   +++K +  + K +V+V++ GRP ++   + 
Subjt:  ---KSTVDPNTEVVF------------REDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVIS-GRPIVMEPYIS

Query:  SIDALVAAWLPGT-EGLGVTDALYGDHGFSGKLPRTW-----------FKSVDQLPMNVGDRHYDPL----YPLGFGLTTGSVK
           A+V   LP    G  + + L GD  FSGK+P T+           +K  + +    G+ +YD +    +P GFGL+  + K
Subjt:  SIDALVAAWLPGT-EGLGVTDALYGDHGFSGKLPRTW-----------FKSVDQLPMNVGDRHYDPL----YPLGFGLTTGSVK

P33363 Periplasmic beta-glucosidase4.9e-5929.09Show/hide
Query:  VKDLLGRMTLAEKIGQMIQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFP
        V +LL +MT+ EKIGQ+  I     N       ++KD  +G++ +   +V   D RA    D + +       SRL IP+ +  D +HG       TVFP
Subjt:  VKDLLGRMTLAEKIGQMIQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFP

Query:  HNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIII-GLQGEPPANYRKGTPYVGRTKKVIACAKHFV
         ++GL ++ N D ++++G  +A E    G++  +AP + V RDPRWGR  E + ED  +   M + ++  +QG+ PA+             V+   KHF 
Subjt:  HNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIII-GLQGEPPANYRKGTPYVGRTKKVIACAKHFV

Query:  GDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGL-DRITSPPHSNYTYSVQAAI
          G    G   N   +    L + +MP Y   +  G  +VMV+ +S NG    +D  L+ D L+    FKG  +SD   + + I     ++   +V+ A+
Subjt:  GDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGL-DRITSPPHSNYTYSVQAAI

Query:  LAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYS------LVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPL
         +GI+M M    Y ++   L  L+K+  + M  +DDA   +L+VK+ MGLF  P           +     S+ HR  AR+  R+SLVLLKN        
Subjt:  LAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYS------LVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPL

Query:  LPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVF-----------------------REDPES---------D
        LPL KK+  I V G  AD+     G W+ A        A +  ++L  IK+ V  N +V++                       + DP S          
Subjt:  LPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVF-----------------------REDPES---------D

Query:  FVKSNDFSYAIVVIGETP-YAETVGDSMTLTMLDPGPSIIKNVCESVK-CVVVVISGRPIVMEPYISSIDALVAAWLPGTE-GLGVTDALYGDHGFSGKL
          K +D    + V+GE    A        +T+      +I  +  + K  V+V+++GRP+ +       DA++  W  GTE G  + D L+GD+  SGKL
Subjt:  FVKSNDFSYAIVVIGETP-YAETVGDSMTLTMLDPGPSIIKNVCESVK-CVVVVISGRPIVMEPYISSIDALVAAWLPGTE-GLGVTDALYGDHGFSGKL

Query:  PRTWFKSVDQLP-----MNVG------------DRHYD----PLYPLGFGL--TTGSVKD
        P ++ +SV Q+P     +N G             R++D     LYP G+GL  TT +V D
Subjt:  PRTWFKSVDQLP-----MNVG------------DRHYD----PLYPLGFGL--TTGSVKD

Q23892 Lysosomal beta glucosidase5.0e-7231.06Show/hide
Query:  VKDLLGRMTLAEKIGQMIQIDRS--------VANATVM----KDYFIGSVL----SGGGSVPLPDARAQDWVDMINDFQKGSL-SSRLGIPMIYGIDAVH
        V +L+ +M++ EKIGQM Q+D +          N T +    K Y+IGS L    SGG +  +    +  W+DMIN  Q   +  S   IPMIYG+D+VH
Subjt:  VKDLLGRMTLAEKIGQMIQIDRS--------VANATVM----KDYFIGSVL----SGGGSVPLPDARAQDWVDMINDFQKGSL-SSRLGIPMIYGIDAVH

Query:  GHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNM-TEIIIGLQGEPPANYRKGTPYVGR
        G N V+ AT+FPHN GL AT N +   +    T+ +  A GI + FAP + +   P W R YE++ EDP +   M    + G QG    N     P    
Subjt:  GHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNM-TEIIIGLQGEPPANYRKGTPYVGR

Query:  TKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSII-KGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPP
            +  AKH+ G    T G +     I +  L    +P++ ++I   G  ++M++    NGV MH     +T+ L+G L+F+G  ++DW+ ++++    
Subjt:  TKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSII-KGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSPP

Query:  HS--NYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPL--GDYSLVNEVGSQAHRDLARDAVRQSLVLL
        H+  +   ++  A+ AGIDM MVP     F   L  +V    +P  R+D +V RIL++K+ +GLF +P    + ++V+ +G    R+ A     +S+ LL
Subjt:  HS--NYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPL--GDYSLVNEVGSQAHRDLARDAVRQSLVLL

Query:  KNGKNDSDPLLPLSKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNATRGTSILAAIK----STVDPNTEVVFRE-----------DPESDFVKSN
        +N  N    +LPL+    K +L+ G  AD++    GGW++ WQG +  +    GTSIL  ++     T D N +                D   +  +S+
Subjt:  KNGKNDSDPLLPLSKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNATRGTSILAAIK----STVDPNTEVVFRE-----------DPESDFVKSN

Query:  DFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVV-VVISGRPIVMEP-YISSIDALVAAWLPGTE-GLGVTDALYGDHGFSGKLPRTWF
        D    +VVIGE P AET GD   L+M      +++ + ++ K VV +++  RP ++ P  + S  A++ A+LPG+E G  + + L G+   SG+LP T+ 
Subjt:  DFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVV-VVISGRPIVMEP-YISSIDALVAAWLPGTE-GLGVTDALYGDHGFSGKLPRTWF

Query:  KSVDQLPMNVGDRHYD-----PLYPLGFGLT
         +   + +    ++ +     PL+  G GL+
Subjt:  KSVDQLPMNVGDRHYD-----PLYPLGFGLT

Q56078 Periplasmic beta-glucosidase3.6e-6229.62Show/hide
Query:  AENLKYKDPKQPVG--VRVKDLLGRMTLAEKIGQMIQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLPDAR-AQDWVDMINDFQKGSLSSRLGIPMI
        AENL    P  P      V DLL +MT+ EKIGQ+  I     N       ++KD  +G++ +   +V   D R  QD V  +         SRL IP+ 
Subjt:  AENLKYKDPKQPVG--VRVKDLLGRMTLAEKIGQMIQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLPDAR-AQDWVDMINDFQKGSLSSRLGIPMI

Query:  YGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIII-GLQGEPPANYRK
        +  D VHG       TVFP ++GL ++ N D +R++G  +A E    G++  +AP + V RDPRWGR  E + ED  +   M E ++  +QG+ PA+   
Subjt:  YGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIII-GLQGEPPANYRK

Query:  GTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGL-
                  V+   KHF   G    G   N   +    L + +MP Y   +  G  +VMV+ +S NG    +D  L+ D L+    FKG  +SD   + 
Subjt:  GTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGL-

Query:  DRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYS------LVNEVGSQAHRDLARD
        + I     ++   +V+ A+ AG+DM M    Y ++   L  L+K+  + M  +DDA   +L+VK+ MGLF  P           +     S+ HR  AR+
Subjt:  DRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYS------LVNEVGSQAHRDLARD

Query:  AVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFRE------------------
          R+S+VLLKN        LPL KK+  I V G  AD+     G W+ A        A +  ++LA I++ V    ++++ +                  
Subjt:  AVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFRE------------------

Query:  -----DPESDFVKSNDFSYA-------IVVIGETP-YAETVGDSMTLTMLDPGPSIIKNVCESVK-CVVVVISGRPIVMEPYISSIDALVAAWLPGTE-G
             DP S     ++   A       + V+GE+   A        +T+      +I  +  + K  V+V+++GRP+ +       DA++  W  GTE G
Subjt:  -----DPESDFVKSNDFSYA-------IVVIGETP-YAETVGDSMTLTMLDPGPSIIKNVCESVK-CVVVVISGRPIVMEPYISSIDALVAAWLPGTE-G

Query:  LGVTDALYGDHGFSGKLPRTWFKSVDQLP-----MNVG------------DRHYD----PLYPLGFGL--TTGSVKDFVARS
          + D L+GD+  SGKLP ++ +SV Q+P     +N G             R++D    PLYP G+GL  TT +V D    S
Subjt:  LGVTDALYGDHGFSGKLPRTWFKSVDQLP-----MNVG------------DRHYD----PLYPLGFGL--TTGSVKDFVARS

T2KMH0 Beta-xylosidase3.7e-5129.47Show/hide
Query:  QKGSLSSRLGIPMIYGIDAVHGHNNVY----NATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAV-CRDPRWGRCYESYSEDPKIVRN
        Q    + RLGIP +   +A+HG   V     N TV+P  V   +T  P+LI+ + + TA E RA G+++ ++P + V   D R+GR  ESY EDP +V  
Subjt:  QKGSLSSRLGIPMIYGIDAVHGHNNVY----NATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAV-CRDPRWGRCYESYSEDPKIVRN

Query:  M-TEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIK-GVSSVMVSYSSWNGVKMHADRDLITD
        M    I GLQG     + +          VIA AKHFVG     RGIN   + + +  L  +++P +  ++ + GV SVM  +  +NGV  H +  L+ D
Subjt:  M-TEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIK-GVSSVMVSYSSWNGVKMHADRDLITD

Query:  FLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAIL---AGIDMVMVPYKYEEFIDDLTFLVKNNVIP----MDRIDDAVVRILSVKFTMGLFES-P
         L+  L F GF++SD   + R+ +  H       +AAIL   AG+DM +V  K  E     T ++K+ ++     M  ID A  RIL+ K+ +GLF++ P
Subjt:  FLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAIL---AGIDMVMVPYKYEEFIDDLTFLVKNNVIP----MDRIDDAVVRILSVKFTMGLFES-P

Query:  LGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLS-KKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEV
            +   E G+  HR+ A +   +S+++LKN  N    LLPL   K   + V G +A     + G + +   G+SG       S+L  +K  V  + ++
Subjt:  LGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLS-KKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEV

Query:  VFRED-----------PESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVV-ISGRPIVMEPYISSIDALVAAWLPGTE
         + +            PE+     N  +  +VV          GD   L +      +++ + ++ K V+VV I+GRP+ +     +I +++  W  G  
Subjt:  VFRED-----------PESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVV-ISGRPIVMEPYISSIDALVAAWLPGTE

Query:  -GLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR--------------HYDPLYPLGFGLTTGSVKDFVAR--STSAGIQGTTS
         G  V + ++GD    GKL  ++ + V Q+P+   +R                 PL+P GFGL+  + K    +  +TS    GTT+
Subjt:  -GLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR--------------HYDPLYPLGFGLTTGSVKDFVAR--STSAGIQGTTS

Arabidopsis top hitse value%identityAlignment
AT3G47000.1 Glycosyl hydrolase family protein1.9e-21258.13Show/hide
Query:  IMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID
        ++V   +  YK+   PV  RVKDLL RMTL EKIGQM QI+R VA+ +   D+FIGSVL+ GGSVP  DA++ DW DMI+ FQ+ +L+SRLGIP+IYG D
Subjt:  IMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID

Query:  AVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYV
        AVHG+NNVY ATVFPHN+GLGATR+ DL+R IGAATALEVRA+G+ +AF+PC+AV RDPRWGRCYESY EDP++V  MT ++ GLQG PP  +  G P+V
Subjt:  AVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIIIGLQGEPPANYRKGTPYV

Query:  GRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSP
             V+AC KHFVGDGGT +GINE NT+     L  IH+P YL  + +GVS+VM SYSSWNG ++HADR L+T+ LK  L FKGF++SDWEGLDR++ P
Subjt:  GRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISDWEGLDRITSP

Query:  PHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNG
          SNY Y ++ A+ AGIDMVMVP+KYE+FI D+T LV++  IPM RI+DAV RIL VKF  GLF  PL D SL+  VG + HR+LA++AVR+SLVLLK+G
Subjt:  PHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLKNG

Query:  KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSND-FSYAIVVIGETPYAETVG
        KN   P LPL + A +ILV GTHAD+LGYQCGGWT  W G SG   T GT++L AIK  V   TEV++ + P  + + S++ FSYAIV +GE PYAET+G
Subjt:  KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSND-FSYAIVVIGETPYAETVG

Query:  DSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEP-YISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPL
        D+  L +   G  I+  V E +  +V++ISGRP+V+EP  +   +ALVAAWLPGTEG GV D ++GD+ F GKLP +WFK V+ LP++     YDPL+P 
Subjt:  DSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEP-YISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPL

Query:  GFGLTTGSV
        GFGL +  V
Subjt:  GFGLTTGSV

AT5G04885.1 Glycosyl hydrolase family protein2.0e-28670.97Show/hide
Query:  MAKILVQVVVILCLGWWLWPIMVG-AENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMI
        M++  V++V +L L   +W    G  E L YKDPKQ V  RV DL GRMTL EKIGQM+QIDRSVA   +M+DYFIGSVLSGGGS PLP+A AQ+WVDMI
Subjt:  MAKILVQVVVILCLGWWLWPIMVG-AENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMI

Query:  NDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMT
        N++QKG+L SRLGIPMIYGIDAVHGHNNVYNAT+FPHNVGLGATR+PDL++ IGAATA+EVRATGI Y FAPCIAVCRDPRWGRCYESYSED K+V +MT
Subjt:  NDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMT

Query:  EIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKG
        ++I+GLQGEPP+NY+ G P+VG   KV ACAKH+VGDGGTTRG+NENNTV D HGLLS+HMPAY D++ KGVS+VMVSYSSWNG KMHA+ +LIT +LKG
Subjt:  EIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKG

Query:  TLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGS
        TLKFKGFVISDW+G+D+I++PPH++YT SV+AAI AGIDMVMVP+ + EF++DLT LVKNN IP+ RIDDAV RIL VKFTMGLFE+PL DYS  +E+GS
Subjt:  TLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGS

Query:  QAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKS
        QAHRDLAR+AVR+SLVLLKNG N ++P+LPL +K  KILVAGTHADNLGYQCGGWTI WQGFSGN  TRGT++L+A+KS VD +TEVVFRE+P+++F+KS
Subjt:  QAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKS

Query:  NDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKS
        N+F+YAI+ +GE PYAET GDS  LTMLDPGP+II + C++VKCVVVVISGRP+VMEPY++SIDALVAAWLPGTEG G+TDAL+GDHGFSGKLP TWF++
Subjt:  NDFSYAIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKS

Query:  VDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICI
         +QLPM+ GD HYDPL+  G GL T SV   VARSTSA    T   +  ++ +  +C+
Subjt:  VDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAGIQGTTSFIAMIIAAIVICI

AT5G20940.1 Glycosyl hydrolase family protein1.2e-24666.61Show/hide
Query:  LVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK
        L+Q + +L L   +    V   N KYKDPK+P+GVR+K+L+  MTL EKIGQM+Q++R  A   VM+ YF+GSV SGGGSVP P    + WV+M+N+ QK
Subjt:  LVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK

Query:  GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIIIG
         +LS+RLGIP+IYGIDAVHGHN VYNAT+FPHNVGLG TR+P L++ IG ATALEVRATGI Y FAPCIAVCRDPRWGRCYESYSED KIV+ MTEII G
Subjt:  GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIIIG

Query:  LQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFK
        LQG+ P   +KG P+V    KV ACAKHFVGDGGT RG+N NNTVI+ +GLL IHMPAY D++ KGV++VMVSYSS NG+KMHA++ LIT FLK  LKF+
Subjt:  LQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFK

Query:  GFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQAHRD
        G VISD+ G+D+I +P  +NY++SV AA  AG+DM M      + ID+LT  VK   IPM RIDDAV RIL VKFTMGLFE+P+ D+SL  ++GS+ HR+
Subjt:  GFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQAHRD

Query:  LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSNDFSY
        LAR+AVR+SLVLLKNG+N   PLLPL KKA KILVAGTHADNLGYQCGGWTI WQG +GNN T GT+ILAA+K TVDP T+V++ ++P+++FVK+ DF Y
Subjt:  LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSNDFSY

Query:  AIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLP
        AIV +GE PYAE  GDS  LT+ +PGPS I NVC SVKCVVVV+SGRP+VM+  IS+IDALVAAWLPGTEG GV D L+GD+GF+GKL RTWFK+VDQLP
Subjt:  AIVVIGETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLP

Query:  MNVGDRHYDPLYPLGFGLTT
        MNVGD HYDPLYP GFGL T
Subjt:  MNVGDRHYDPLYPLGFGLTT

AT5G20950.1 Glycosyl hydrolase family protein1.8e-26669.74Show/hide
Query:  ILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR
        +LCL      +      LKYKDPKQP+G R++DL+ RMTL EKIGQM+QI+RSVA   VMK YFIGSVLSGGGSVP   A  + WV+M+N+ QK SLS+R
Subjt:  ILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR

Query:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIIIGLQGEPP
        LGIPMIYGIDAVHGHNNVY AT+FPHNVGLG TR+P+L++ IGAATALEVRATGI YAFAPCIAVCRDPRWGRCYESYSED +IV+ MTEII GLQG+ P
Subjt:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIIIGLQGEPP

Query:  ANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISD
           RKG P+VG   KV ACAKHFVGDGGT RGI+ENNTVID  GL  IHMP Y +++ KGV+++MVSYS+WNG++MHA+++L+T FLK  LKF+GFVISD
Subjt:  ANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISD

Query:  WEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQAHRDLARDAV
        W+G+DRIT+PPH NY+YSV A I AGIDM+MVPY Y EFID+++  ++  +IP+ RIDDA+ RIL VKFTMGLFE PL D S  N++GS+ HR+LAR+AV
Subjt:  WEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQAHRDLARDAV

Query:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSNDFSYAIVVIG
        R+SLVLLKNGK  + PLLPL KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P T+VV+ ++P+++FVKS  F YAIVV+G
Subjt:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSNDFSYAIVVIG

Query:  ETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR
        E PYAE  GD+  LT+ DPGPSII NVC SVKCVVVV+SGRP+V++PY+S+IDALVAAWLPGTEG GV DAL+GD+GF+GKL RTWFKSV QLPMNVGDR
Subjt:  ETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR

Query:  HYDPLYPLGFGLTTGSVK
        HYDPLYP GFGLTT   K
Subjt:  HYDPLYPLGFGLTTGSVK

AT5G20950.2 Glycosyl hydrolase family protein1.8e-26669.74Show/hide
Query:  ILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR
        +LCL      +      LKYKDPKQP+G R++DL+ RMTL EKIGQM+QI+RSVA   VMK YFIGSVLSGGGSVP   A  + WV+M+N+ QK SLS+R
Subjt:  ILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR

Query:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIIIGLQGEPP
        LGIPMIYGIDAVHGHNNVY AT+FPHNVGLG TR+P+L++ IGAATALEVRATGI YAFAPCIAVCRDPRWGRCYESYSED +IV+ MTEII GLQG+ P
Subjt:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNMTEIIIGLQGEPP

Query:  ANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISD
           RKG P+VG   KV ACAKHFVGDGGT RGI+ENNTVID  GL  IHMP Y +++ KGV+++MVSYS+WNG++MHA+++L+T FLK  LKF+GFVISD
Subjt:  ANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVISD

Query:  WEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQAHRDLARDAV
        W+G+DRIT+PPH NY+YSV A I AGIDM+MVPY Y EFID+++  ++  +IP+ RIDDA+ RIL VKFTMGLFE PL D S  N++GS+ HR+LAR+AV
Subjt:  WEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQAHRDLARDAV

Query:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSNDFSYAIVVIG
        R+SLVLLKNGK  + PLLPL KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P T+VV+ ++P+++FVKS  F YAIVV+G
Subjt:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSNDFSYAIVVIG

Query:  ETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR
        E PYAE  GD+  LT+ DPGPSII NVC SVKCVVVV+SGRP+V++PY+S+IDALVAAWLPGTEG GV DAL+GD+GF+GKL RTWFKSV QLPMNVGDR
Subjt:  ETPYAETVGDSMTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR

Query:  HYDPLYPLGFGLTTGSVK
        HYDPLYP GFGLTT   K
Subjt:  HYDPLYPLGFGLTTGSVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCTCAACTCAAATCTACTGTCGAAATGGAAAAAGTGTGGCTTAAACTCTCGGGGAAAGAAAATGGCCAAGATTTTGGTTCAGGTGGTTGTGATTCTGTGCTTGGG
TTGGTGGCTGTGGCCAATAATGGTGGGCGCCGAGAACTTGAAGTATAAAGATCCTAAGCAACCAGTTGGTGTTCGAGTTAAGGACCTTCTTGGCCGGATGACTCTCGCAG
AGAAAATTGGTCAGATGATTCAGATTGACAGGAGCGTTGCCAATGCTACGGTTATGAAAGATTATTTTATTGGAAGTGTGCTAAGTGGCGGTGGAAGTGTGCCGCTTCCA
GATGCTCGTGCTCAAGATTGGGTTGACATGATAAACGATTTCCAGAAGGGCTCTCTTTCTAGCCGATTGGGCATCCCTATGATTTATGGCATTGATGCTGTTCACGGCCA
TAACAATGTTTACAATGCTACAGTATTTCCTCATAATGTGGGACTTGGAGCTACCAGAAACCCTGACCTAATTCGAAGCATTGGTGCTGCAACGGCACTTGAAGTTAGAG
CGACAGGGATTTCTTATGCCTTTGCTCCATGCATTGCGGTTTGTAGGGACCCGAGGTGGGGGCGGTGTTACGAAAGCTACAGCGAGGATCCGAAAATTGTGCGAAATATG
ACTGAGATTATAATTGGTCTGCAAGGAGAGCCTCCTGCTAACTATCGGAAAGGCACTCCATATGTTGGTCGAACTAAAAAGGTTATCGCCTGTGCAAAACACTTTGTTGG
GGATGGTGGGACGACTCGCGGCATCAATGAGAATAATACCGTTATTGACAAGCATGGGCTGCTCAGCATTCACATGCCTGCCTATCTAGATTCCATCATCAAGGGGGTTT
CATCGGTAATGGTTTCGTATTCTAGTTGGAATGGAGTGAAGATGCATGCAGACCGTGATCTGATTACTGACTTCCTTAAGGGCACCCTTAAATTTAAGGGTTTTGTCATC
TCAGATTGGGAAGGTCTGGATAGAATTACTTCTCCACCACATTCTAATTACACGTACTCTGTCCAAGCTGCAATTTTAGCTGGCATTGATATGGTCATGGTTCCTTACAA
GTATGAAGAGTTCATTGATGATCTTACTTTTCTGGTGAAGAACAATGTCATCCCAATGGATCGAATTGATGATGCTGTTGTGAGAATCTTATCAGTAAAGTTCACGATGG
GTCTTTTTGAAAGCCCCTTGGGCGATTACAGCCTTGTCAATGAGGTTGGGAGCCAGGCACATAGAGACTTGGCAAGAGATGCTGTGAGGCAGTCACTTGTACTGCTGAAG
AATGGGAAAAACGATAGCGACCCGTTGCTACCCCTTTCAAAGAAGGCCCCGAAGATTCTTGTTGCTGGCACTCATGCTGATAATTTAGGATATCAATGTGGTGGATGGAC
AATTGCATGGCAAGGATTCAGTGGCAATAATGCTACAAGGGGAACTAGCATCCTCGCTGCTATCAAATCAACAGTTGATCCAAACACAGAGGTTGTATTCCGTGAGGATC
CCGAGAGTGACTTTGTTAAGTCCAATGACTTCTCATATGCCATAGTTGTTATCGGCGAAACTCCATATGCCGAGACCGTAGGGGATAGTATGACACTTACCATGTTAGAT
CCGGGTCCAAGCATCATAAAAAATGTTTGTGAATCTGTAAAGTGTGTGGTGGTTGTCATTTCTGGAAGGCCAATAGTGATGGAACCATACATTTCATCAATTGATGCTCT
TGTAGCAGCTTGGTTACCTGGCACTGAAGGCCTAGGAGTCACTGATGCCCTTTATGGAGACCATGGATTTAGTGGTAAGCTTCCAAGAACTTGGTTTAAATCTGTAGATC
AACTGCCAATGAACGTTGGAGATCGACATTATGATCCACTTTACCCGCTTGGTTTTGGACTCACAACTGGATCGGTTAAGGATTTTGTCGCGAGGTCGACATCGGCCGGG
ATTCAAGGAACGACATCCTTTATTGCAATGATCATTGCTGCAATAGTGATTTGTATATTGTAG
mRNA sequenceShow/hide mRNA sequence
TTTATACATACTGAAAGAGAAGCCAAGATCTGATCCATCTTATATCTTCTTCTTTCTTATCGACTCTGAGTTCAATTTATTTCTTTCATTTTTTTCAATTCCATCTTTTT
CTTCTTCATTCTTCGCGATCGGATCGGCCGTAGAACCAGAGAACTGCTGTCCGGTTTCAGTCGATCCTCGGCTTCAGGGCGTGTTTGTTTTTTGGACATGAGGTAAAAAA
CCCTTGGTTTGTTCTTTTTGTAGTAAACAATTACTGGATTACATTCATGACCCTCAACTCAAATCTACTGTCGAAATGGAAAAAGTGTGGCTTAAACTCTCGGGGAAAGA
AAATGGCCAAGATTTTGGTTCAGGTGGTTGTGATTCTGTGCTTGGGTTGGTGGCTGTGGCCAATAATGGTGGGCGCCGAGAACTTGAAGTATAAAGATCCTAAGCAACCA
GTTGGTGTTCGAGTTAAGGACCTTCTTGGCCGGATGACTCTCGCAGAGAAAATTGGTCAGATGATTCAGATTGACAGGAGCGTTGCCAATGCTACGGTTATGAAAGATTA
TTTTATTGGAAGTGTGCTAAGTGGCGGTGGAAGTGTGCCGCTTCCAGATGCTCGTGCTCAAGATTGGGTTGACATGATAAACGATTTCCAGAAGGGCTCTCTTTCTAGCC
GATTGGGCATCCCTATGATTTATGGCATTGATGCTGTTCACGGCCATAACAATGTTTACAATGCTACAGTATTTCCTCATAATGTGGGACTTGGAGCTACCAGAAACCCT
GACCTAATTCGAAGCATTGGTGCTGCAACGGCACTTGAAGTTAGAGCGACAGGGATTTCTTATGCCTTTGCTCCATGCATTGCGGTTTGTAGGGACCCGAGGTGGGGGCG
GTGTTACGAAAGCTACAGCGAGGATCCGAAAATTGTGCGAAATATGACTGAGATTATAATTGGTCTGCAAGGAGAGCCTCCTGCTAACTATCGGAAAGGCACTCCATATG
TTGGTCGAACTAAAAAGGTTATCGCCTGTGCAAAACACTTTGTTGGGGATGGTGGGACGACTCGCGGCATCAATGAGAATAATACCGTTATTGACAAGCATGGGCTGCTC
AGCATTCACATGCCTGCCTATCTAGATTCCATCATCAAGGGGGTTTCATCGGTAATGGTTTCGTATTCTAGTTGGAATGGAGTGAAGATGCATGCAGACCGTGATCTGAT
TACTGACTTCCTTAAGGGCACCCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAAGGTCTGGATAGAATTACTTCTCCACCACATTCTAATTACACGTACTCTGTCC
AAGCTGCAATTTTAGCTGGCATTGATATGGTCATGGTTCCTTACAAGTATGAAGAGTTCATTGATGATCTTACTTTTCTGGTGAAGAACAATGTCATCCCAATGGATCGA
ATTGATGATGCTGTTGTGAGAATCTTATCAGTAAAGTTCACGATGGGTCTTTTTGAAAGCCCCTTGGGCGATTACAGCCTTGTCAATGAGGTTGGGAGCCAGGCACATAG
AGACTTGGCAAGAGATGCTGTGAGGCAGTCACTTGTACTGCTGAAGAATGGGAAAAACGATAGCGACCCGTTGCTACCCCTTTCAAAGAAGGCCCCGAAGATTCTTGTTG
CTGGCACTCATGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGCATGGCAAGGATTCAGTGGCAATAATGCTACAAGGGGAACTAGCATCCTCGCTGCTATC
AAATCAACAGTTGATCCAAACACAGAGGTTGTATTCCGTGAGGATCCCGAGAGTGACTTTGTTAAGTCCAATGACTTCTCATATGCCATAGTTGTTATCGGCGAAACTCC
ATATGCCGAGACCGTAGGGGATAGTATGACACTTACCATGTTAGATCCGGGTCCAAGCATCATAAAAAATGTTTGTGAATCTGTAAAGTGTGTGGTGGTTGTCATTTCTG
GAAGGCCAATAGTGATGGAACCATACATTTCATCAATTGATGCTCTTGTAGCAGCTTGGTTACCTGGCACTGAAGGCCTAGGAGTCACTGATGCCCTTTATGGAGACCAT
GGATTTAGTGGTAAGCTTCCAAGAACTTGGTTTAAATCTGTAGATCAACTGCCAATGAACGTTGGAGATCGACATTATGATCCACTTTACCCGCTTGGTTTTGGACTCAC
AACTGGATCGGTTAAGGATTTTGTCGCGAGGTCGACATCGGCCGGGATTCAAGGAACGACATCCTTTATTGCAATGATCATTGCTGCAATAGTGATTTGTATATTGTAGG
TAAATTTCTAGTTCTAATCAGCAGTAAGCATTGGTTTGTTGAAAACTTGAGTAGATATTAGTTTGTGACTGTG
Protein sequenceShow/hide protein sequence
MTLNSNLLSKWKKCGLNSRGKKMAKILVQVVVILCLGWWLWPIMVGAENLKYKDPKQPVGVRVKDLLGRMTLAEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLP
DARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLIRSIGAATALEVRATGISYAFAPCIAVCRDPRWGRCYESYSEDPKIVRNM
TEIIIGLQGEPPANYRKGTPYVGRTKKVIACAKHFVGDGGTTRGINENNTVIDKHGLLSIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHADRDLITDFLKGTLKFKGFVI
SDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYEEFIDDLTFLVKNNVIPMDRIDDAVVRILSVKFTMGLFESPLGDYSLVNEVGSQAHRDLARDAVRQSLVLLK
NGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPNTEVVFREDPESDFVKSNDFSYAIVVIGETPYAETVGDSMTLTMLD
PGPSIIKNVCESVKCVVVVISGRPIVMEPYISSIDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLYPLGFGLTTGSVKDFVARSTSAG
IQGTTSFIAMIIAAIVICIL