; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028903 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028903
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionbeta-glucosidase BoGH3B isoform X1
Genome locationtig00153210:1356889..1377676
RNA-Seq ExpressionSgr028903
SyntenySgr028903
Gene Ontology termsGO:0009251 - glucan catabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domainsIPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR002772 - Glycoside hydrolase family 3 C-terminal domain
IPR017853 - Glycoside hydrolase superfamily
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446716.1 PREDICTED: beta-glucosidase BoGH3B isoform X1 [Cucumis melo]0.0e+0088.25Show/hide
Query:  FMMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGS
        FM+L LKLLW WK+CG  +Q +KMAKIFVQVV ILCLGW WWATMVDAE LKYKDP QPV VRVKDLLGRMTLEEKIGQMVQIDRSVAN TVMKDYFIGS
Subjt:  FMMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGS

Query:  VLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCR
        +LSGGGSVPLPDA A+DW++MINDFQKGSLSSRLGIPM YGIDAVHGHN VYNAT+FPHNVGLGATR+P L RRIGAATALEVRATGIS+ FAPC+AVCR
Subjt:  VLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCR

Query:  DPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMAS
        DPRWGRCYESYSEDPKIV+EMTEII GLQGEPPANYRKG PYVGGTKKVIACAKHFVGDGGTT+GINENNTVI++HGLLSIHMPAYLDSIIKGVS+VMAS
Subjt:  DPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMAS

Query:  YSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSV
        YSSWNGVKMHAN ELIT FLKG LKFKGFVISDWEGLDRITS PH+NYTYSVQAAILAGIDMVM+PYKY EFIDDL +LVKSNVIPMDRIDDAVGRIL+V
Subjt:  YSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSV

Query:  KFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIK
        KFTMGLFESP+ DYSLVNELGSQAHRDLAR+AVRQSLVLLKNGKNDS+PLLPL KK+PKILVAGTHADNLGYQCGGWTI WQGFSGNNGTRGT+ILAAIK
Subjt:  KFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIK

Query:  SMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLG
        S VDPSTEVVFREDPDSDFVKSNDFSYAIVV+GE PYAET GDSTTLTMLDPGP+IIKNVCD V+CVV++ISGRPIVIEPYISSIDALVAAWLPG+EG G
Subjt:  SMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLG

Query:  VTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICIL
        VTD LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTT SVKD+++RSTS GIR TPS +A IV  I +CIL
Subjt:  VTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICIL

XP_022150694.1 uncharacterized protein LOC111018764 isoform X1 [Momordica charantia]0.0e+0091.63Show/hide
Query:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL
        MLKLKLLW W+QCGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+VRV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVL
Subjt:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL

Query:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP
        SGGGSVPLPDA A+DW+NMIN+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALEVRATGISFAFAPCIAVCRDP
Subjt:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP

Query:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS
        RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYS
Subjt:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS

Query:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF
        SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDMVMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF
Subjt:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF

Query:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM
        +MGLFE+P+GDYSLVNELGSQ HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQGFSGNNGTRGTSILAAIKS 
Subjt:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM

Query:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT
        VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ GDSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVT
Subjt:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT

Query:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ
        D LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GIR T S +A +IVA +AICILQ
Subjt:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ

XP_022150697.1 uncharacterized protein LOC111018764 isoform X2 [Momordica charantia]0.0e+0091.64Show/hide
Query:  QCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDA
        +CGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+VRV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVLSGGGSVPLPDA
Subjt:  QCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDA

Query:  SAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSE
         A+DW+NMIN+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSE
Subjt:  SAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSE

Query:  DPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANH
        DPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYSSWNGVKMHANH
Subjt:  DPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANH

Query:  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGD
        ELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDMVMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF+MGLFE+P+GD
Subjt:  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGD

Query:  YSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFRE
        YSLVNELGSQ HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQGFSGNNGTRGTSILAAIKS VDPSTEVVF E
Subjt:  YSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFRE

Query:  DPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSG
        DPDS+FVKSNDFSYAIVVVGE PYAE+ GDSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVTD LYGD+GFSG
Subjt:  DPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSG

Query:  KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ
        KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GIR T S +A +IVA +AICILQ
Subjt:  KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ

XP_022150698.1 uncharacterized protein LOC111018764 isoform X3 [Momordica charantia]0.0e+0091.63Show/hide
Query:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL
        MLKLKLLW W+QCGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+VRV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVL
Subjt:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL

Query:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP
        SGGGSVPLPDA A+DW+NMIN+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALEVRATGISFAFAPCIAVCRDP
Subjt:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP

Query:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS
        RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYS
Subjt:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS

Query:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF
        SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDMVMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF
Subjt:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF

Query:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM
        +MGLFE+P+GDYSLVNELGSQ HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQGFSGNNGTRGTSILAAIKS 
Subjt:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM

Query:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT
        VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ GDSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVT
Subjt:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT

Query:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ
        D LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GIR T S +A +IVA +AICILQ
Subjt:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ

XP_038892436.1 beta-glucosidase BoGH3B-like isoform X1 [Benincasa hispida]0.0e+0089.51Show/hide
Query:  MMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSV
        MML LKLLW WK+CG  +Q +KMAKIFVQVV ILCLGW WW TMVDAE LKYKDP Q V+VRVKDLLGRMT+EEKIGQM+QIDRSVAN TVMKDYFIGSV
Subjt:  MMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSV

Query:  LSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRD
        LSGGGSVPLPDA A+DW+NMINDFQKGSLSSRLGIPM YGIDAVHGHN VYNAT+FPHNVGLGATR+P L RRIGAATALEVRATGIS+ FAPC+AVCRD
Subjt:  LSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRD

Query:  PRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASY
        PRWGRCYESYSEDPKIV+EMTEII GLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTT+GINENNTVI++HGLLSIHMPAYLDSIIKGVS+VMASY
Subjt:  PRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASY

Query:  SSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVK
        SSWNGVKMHAN ELIT FLKGTLK+KGFVISDWEGLDRITS PH+NYTYS+QAAILAGIDMVM+PYKY EFIDDLT+LVKSNVIPMDRIDDAVGRILSVK
Subjt:  SSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVK

Query:  FTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRP-LLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIK
        FTMGLFESPL DYSLVNELGSQAHRDLAR+AVRQSLVLLKNGKNDS P LLPL KKAPKILVAGTHADNLGYQCGGWTI WQGFSGNNGTRGT+ILAAIK
Subjt:  FTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRP-LLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIK

Query:  SMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLG
        S VDPSTEVVFREDPDSDFVKSNDFSYAIVV+GE PYAET GDSTTLTMLDPGPSIIKNVC SVKCVVVVISGRPIV+EPYISS+DALVAAWLPG+EGLG
Subjt:  SMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLG

Query:  VTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIA
        VTD LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTT SVKD V+RSTS GI  TPS +A IVATIA
Subjt:  VTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIA

TrEMBL top hitse value%identityAlignment
A0A1S3BGE4 beta-glucosidase BoGH3B isoform X10.0e+0088.25Show/hide
Query:  FMMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGS
        FM+L LKLLW WK+CG  +Q +KMAKIFVQVV ILCLGW WWATMVDAE LKYKDP QPV VRVKDLLGRMTLEEKIGQMVQIDRSVAN TVMKDYFIGS
Subjt:  FMMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGS

Query:  VLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCR
        +LSGGGSVPLPDA A+DW++MINDFQKGSLSSRLGIPM YGIDAVHGHN VYNAT+FPHNVGLGATR+P L RRIGAATALEVRATGIS+ FAPC+AVCR
Subjt:  VLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCR

Query:  DPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMAS
        DPRWGRCYESYSEDPKIV+EMTEII GLQGEPPANYRKG PYVGGTKKVIACAKHFVGDGGTT+GINENNTVI++HGLLSIHMPAYLDSIIKGVS+VMAS
Subjt:  DPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMAS

Query:  YSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSV
        YSSWNGVKMHAN ELIT FLKG LKFKGFVISDWEGLDRITS PH+NYTYSVQAAILAGIDMVM+PYKY EFIDDL +LVKSNVIPMDRIDDAVGRIL+V
Subjt:  YSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSV

Query:  KFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIK
        KFTMGLFESP+ DYSLVNELGSQAHRDLAR+AVRQSLVLLKNGKNDS+PLLPL KK+PKILVAGTHADNLGYQCGGWTI WQGFSGNNGTRGT+ILAAIK
Subjt:  KFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIK

Query:  SMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLG
        S VDPSTEVVFREDPDSDFVKSNDFSYAIVV+GE PYAET GDSTTLTMLDPGP+IIKNVCD V+CVV++ISGRPIVIEPYISSIDALVAAWLPG+EG G
Subjt:  SMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLG

Query:  VTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICIL
        VTD LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTT SVKD+++RSTS GIR TPS +A IV  I +CIL
Subjt:  VTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICIL

A0A6J1DA47 uncharacterized protein LOC111018764 isoform X30.0e+0091.63Show/hide
Query:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL
        MLKLKLLW W+QCGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+VRV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVL
Subjt:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL

Query:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP
        SGGGSVPLPDA A+DW+NMIN+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALEVRATGISFAFAPCIAVCRDP
Subjt:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP

Query:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS
        RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYS
Subjt:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS

Query:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF
        SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDMVMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF
Subjt:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF

Query:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM
        +MGLFE+P+GDYSLVNELGSQ HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQGFSGNNGTRGTSILAAIKS 
Subjt:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM

Query:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT
        VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ GDSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVT
Subjt:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT

Query:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ
        D LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GIR T S +A +IVA +AICILQ
Subjt:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ

A0A6J1DBF5 uncharacterized protein LOC111018764 isoform X20.0e+0091.64Show/hide
Query:  QCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDA
        +CGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+VRV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVLSGGGSVPLPDA
Subjt:  QCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDA

Query:  SAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSE
         A+DW+NMIN+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSE
Subjt:  SAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSE

Query:  DPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANH
        DPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYSSWNGVKMHANH
Subjt:  DPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANH

Query:  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGD
        ELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDMVMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF+MGLFE+P+GD
Subjt:  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGD

Query:  YSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFRE
        YSLVNELGSQ HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQGFSGNNGTRGTSILAAIKS VDPSTEVVF E
Subjt:  YSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFRE

Query:  DPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSG
        DPDS+FVKSNDFSYAIVVVGE PYAE+ GDSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVTD LYGD+GFSG
Subjt:  DPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSG

Query:  KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ
        KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GIR T S +A +IVA +AICILQ
Subjt:  KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ

A0A6J1DCA3 uncharacterized protein LOC111018764 isoform X10.0e+0091.63Show/hide
Query:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL
        MLKLKLLW W+QCGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+VRV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVL
Subjt:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL

Query:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP
        SGGGSVPLPDA A+DW+NMIN+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALEVRATGISFAFAPCIAVCRDP
Subjt:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP

Query:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS
        RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYS
Subjt:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS

Query:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF
        SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDMVMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF
Subjt:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF

Query:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM
        +MGLFE+P+GDYSLVNELGSQ HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQGFSGNNGTRGTSILAAIKS 
Subjt:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM

Query:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT
        VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ GDSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVT
Subjt:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT

Query:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ
        D LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GIR T S +A +IVA +AICILQ
Subjt:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MIVATIAICILQ

A0A6J1HX36 uncharacterized protein LOC111467651 isoform X20.0e+0088.68Show/hide
Query:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL
        ML LKL W WK+CG  S  +KMAKIFVQVV ILCLGWWWWA MVDAE LKYKDP QPVSVRVKDLLGRMTLEEKIGQMVQIDRSVAN TVMK+YFIGSVL
Subjt:  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVL

Query:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP
        SGGGSVPLPDA AQDW++MINDFQKGSLSSRLGIPM+YGIDAVHGHN VYNAT+FPHNVGLGATR+P L+RRIGAATALEVRATGIS+ FAPC+AVCRDP
Subjt:  SGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDP

Query:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS
        RWGRCYESYSEDPK+VQ MTEII GLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTT+GINENNTVID+HGLL IHMPAYLDSIIKGVS+VM SYS
Subjt:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS

Query:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF
        SWNGVKMHAN +LIT FLKGTLKFKGFVISDWEGLDRITS PH+NYTYSVQAAI AGIDMVMVPYKY EFIDDL  LVK+NV+PMDRIDDAV RILSVKF
Subjt:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKF

Query:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM
        TMGLFESPLGDYSLVNELGSQAHRDLAR+AVRQSLVLLKNGKNDS PLLPL KKAPKILV GTHADNLGYQCGGWTI WQGFSGNN TRGT+ILAAIKS 
Subjt:  TMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSM

Query:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT
        VDPSTEVVFREDPDSDFVKSN FSYAIVV+GE PYAET GDSTTLTMLDPGPSIIKNVC+SVKCVVVVISGRPIV+EPYISS+DALVAAWLPG+EGLGVT
Subjt:  VDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT

Query:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICILQ
        D LYGD+GFSGKLPRTWFKSVDQLPMN GD HYDPLFP GFGLTT SVKD+V+RSTS G R TPSF+AMIVATIA+C+LQ
Subjt:  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICILQ

SwissProt top hitse value%identityAlignment
A7LXU3 Beta-glucosidase BoGH3B5.7e-7931.39Show/hide
Query:  PNQP-VSVRVKDLLGRMTLEEKIGQMVQIDRSVAN-----------------VTVMKDYFIGSVLSGGGSVPLPDASAQD-WINMINDFQKGSLSSRLGI
        P  P +   +++ L +MTLE+KIGQM +I   V +                  TV+  Y +GS+L    +VPL  A  ++ W   I   Q+ S+   +GI
Subjt:  PNQP-VSVRVKDLLGRMTLEEKIGQMVQIDRSVAN-----------------VTVMKDYFIGSVLSGGGSVPLPDASAQD-WINMINDFQKGSLSSRLGI

Query:  PMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEPPAN
        P +YG+D +HG     + T+FP  + +GAT +  L RR    +A E +A  I + FAP + + RDPRW R +E+Y ED  +  EM    + G QGE P  
Subjt:  PMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEPPAN

Query:  YRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWE
                 G   V AC KH++G G   +G +   + I +  +   H   +L ++ +G  +VM +    NG+  HAN EL+T +LK  L + G +++DW 
Subjt:  YRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWE

Query:  GLDRITSPPH--ANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAV
         ++ + +  H  A    +V+  I AGIDM MVPY+   F D L  LV+   + M+RIDDAV R+L +K+ +GLF+ P  D    ++ GS+    +A +A 
Subjt:  GLDRITSPPH--ANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAV

Query:  RQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRG-TSILAAI-------KSMVDPS-TEVVFRED-------PDS
         +S VLLKN  N    +LP+  K  KIL+ G +A+++    GGW+  WQG   +   +   +I  A+         + +P  T   ++ D       P++
Subjt:  RQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRG-TSILAAI-------KSMVDPS-TEVVFRED-------PDS

Query:  D--FVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVIS-GRPIVIEPYISSIDALVAAWLPGS-EGLGVTDVLYGDYGFSG
        +     +      I  +GE  Y ET G+ T LT+ +   +++K +  + K +V+V++ GRP +I   +    A+V   LP +  G  + ++L GD  FSG
Subjt:  D--FVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVIS-GRPIVIEPYISSIDALVAAWLPGS-EGLGVTDVLYGDYGFSG

Query:  KLPRTW-----------FKSVDQLPMNVGDPHYDPL----FPFGFGLTTESVK
        K+P T+           +K  + +    G+ +YD +    +PFGFGL+  + K
Subjt:  KLPRTW-----------FKSVDQLPMNVGDPHYDPL----FPFGFGLTTESVK

P33363 Periplasmic beta-glucosidase1.8e-5628.15Show/hide
Query:  VKDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYFIGSVLSGGGSVPLPDASA-QDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIF
        V +LL +MT++EKIGQ+  I     N       ++KD  +G++ +   +V   D  A QD +  +         SRL IP+ +  D +HG  TV     F
Subjt:  VKDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYFIGSVLSGGGSVPLPDASA-QDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIF

Query:  PHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPPANYRKGIPYVGGTKKVIACAKHF
        P ++GL ++ +   V+ +G  +A E    G++  +AP + V RDPRWGR  E + ED  +   M + ++  +QG+ PA+             V+   KHF
Subjt:  PHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPPANYRKGIPYVGGTKKVIACAKHF

Query:  VGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGL-DRITSPPHANYTYSVQAA
           G    G   N   +    L + +MP Y   +  G   VM + +S NG    ++  L+   L+    FKG  +SD   + + I     A+   +V+ A
Subjt:  VGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGL-DRITSPPHANYTYSVQAA

Query:  ILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS------LVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRP
        + +GI+M M    Y+++   L  L+KS  + M  +DDA   +L+VK+ MGLF  P           +     S+ HR  ARE  R+SLVLLKN + ++ P
Subjt:  ILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS------LVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRP

Query:  LLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVF-----------------------REDP-------DSDF
        L    KK+  I V G  AD+     G W+      +     +  ++L  IK+ V  + +V++                       + DP       D   
Subjt:  LLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVF-----------------------REDP-------DSDF

Query:  VKSNDFSYAIVVVGETPYAETEGDS-TTLTMLDPGPSIIKNVCDSVK-CVVVVISGRPIVIEPYISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKLP
          +      + VVGE      E  S T +T+      +I  +  + K  V+V+++GRP+ +       DA++  W  G+E G  + DVL+GDY  SGKLP
Subjt:  VKSNDFSYAIVVVGETPYAETEGDS-TTLTMLDPGPSIIKNVCDSVK-CVVVVISGRPIVIEPYISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKLP

Query:  RTWFKSVDQLP-----MNVGDP------------HYD----PLFPFGFGL--TTESVKDLVSRSTSPGIRETPSFLAMIVAT
         ++ +SV Q+P     +N G P            ++D     L+PFG+GL  TT +V D+  + ++P ++      A +  T
Subjt:  RTWFKSVDQLP-----MNVGDP------------HYD----PLFPFGFGL--TTESVKDLVSRSTSPGIRETPSFLAMIVAT

Q23892 Lysosomal beta glucosidase2.7e-7330.64Show/hide
Query:  VKDLLGRMTLEEKIGQMVQID----RSVANVTV--------MKDYFIGSVL----SGGGSVPLPDASAQDWINMINDFQKGSL-SSRLGIPMMYGIDAVH
        V +L+ +M++ EKIGQM Q+D     S   +T+         K Y+IGS L    SGG +  +   ++  W++MIN  Q   +  S   IPM+YG+D+VH
Subjt:  VKDLLGRMTLEEKIGQMVQID----RSVANVTV--------MKDYFIGSVL----SGGGSVPLPDASAQDWINMINDFQKGSL-SSRLGIPMMYGIDAVH

Query:  GHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQG-----EPPANYRKGI
        G N V+ AT+FPHN GL AT +          T+ +  A GI + FAP + +   P W R YE++ EDP +   M    + G QG     + P N     
Subjt:  GHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQG-----EPPANYRKGI

Query:  PYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSII-KGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDR
                 +  AKH+ G    T+G +     I +  L    +P++ ++I   G  T+M +    NGV MH +++ +T  L+G L+F+G  ++DW+ +++
Subjt:  PYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSII-KGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDR

Query:  ITSPPH--ANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPL--GDYSLVNELGSQAHRDLAREAVRQ
        +    H   +   ++  A+ AGIDM MVP   + F   L  +V +  +P  R+D +V RIL++K+ +GLF +P    + ++V+ +G    R+ A     +
Subjt:  ITSPPH--ANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPL--GDYSLVNELGSQAHRDLAREAVRQ

Query:  SLVLLKNGKNDSRPLLPLPKKAPK-ILVAGTHADNLGYQCGGWTIEWQG-FSGNNGTRGTSILAAIKSMVDPSTEVVFRE---------------DPDSD
        S+ LL+N  N    +LPL     K +L+ G  AD++    GGW++ WQG +  +    GTSIL  ++ + + + +   +                D   +
Subjt:  SLVLLKNGKNDSRPLLPLPKKAPK-ILVAGTHADNLGYQCGGWTIEWQG-FSGNNGTRGTSILAAIKSMVDPSTEVVFRE---------------DPDSD

Query:  FVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVV-VVISGRPIVIEP-YISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKL
          +S+D    +VV+GE P AET GD   L+M      +++ + D+ K VV +++  RP ++ P  + S  A++ A+LPGSE G  + ++L G+   SG+L
Subjt:  FVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVV-VVISGRPIVIEP-YISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKL

Query:  PRTWFKSVDQLPMNVGDPHY---------DPLFPFGFGL--TTESVKDLVSRSTSP
        P T+  +      ++G P+Y          PLF FG GL  TT +  +L   +  P
Subjt:  PRTWFKSVDQLPMNVGDPHY---------DPLFPFGFGL--TTESVKDLVSRSTSP

Q56078 Periplasmic beta-glucosidase2.9e-5929.11Show/hide
Query:  VKDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFP
        V DLL +MT++EKIGQ+  I     N       ++KD  +G++ +          + QD I  + D  +    SRL IP+ +  D VHG  TV     FP
Subjt:  VKDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFP

Query:  HNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFV
         ++GL ++ +   VR +G  +A E    G++  +AP + V RDPRWGR  E + ED  +   M E ++  +QG+ PA+             V+   KHF 
Subjt:  HNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFV

Query:  GDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGL-DRITSPPHANYTYSVQAAI
          G    G   N   +    L + +MP Y   +  G   VM + +S NG    ++  L+   L+    FKG  +SD   + + I     A+   +V+ A+
Subjt:  GDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGL-DRITSPPHANYTYSVQAAI

Query:  LAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS------LVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPL
         AG+DM M    Y+++   L  L+KS  + M  +DDA   +L+VK+ MGLF  P           +     S+ HR  ARE  R+S+VLLKN + ++ PL
Subjt:  LAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS------LVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPL

Query:  LPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFRE-----------------------DP-------DSDFV
            KK+  I V G  AD+     G W+      +     +  ++LA I++ V    ++++ +                       DP       D    
Subjt:  LPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFRE-----------------------DP-------DSDFV

Query:  KSNDFSYAIVVVGETPYAETEGDS-TTLTMLDPGPSIIKNVCDSVK-CVVVVISGRPIVIEPYISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKLPR
         +      + VVGE+     E  S T +T+      +I  +  + K  V+V+++GRP+ +       DA++  W  G+E G  + DVL+GDY  SGKLP 
Subjt:  KSNDFSYAIVVVGETPYAETEGDS-TTLTMLDPGPSIIKNVCDSVK-CVVVVISGRPIVIEPYISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKLPR

Query:  TWFKSVDQLP-----MNVGDP------------HYD----PLFPFGFGL--TTESVKDLVSRS
        ++ +SV Q+P     +N G P            ++D    PL+PFG+GL  TT +V D+   S
Subjt:  TWFKSVDQLP-----MNVGDP------------HYD----PLFPFGFGL--TTESVKDLVSRS

T2KMH0 Beta-xylosidase9.1e-5329.31Show/hide
Query:  NQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVY----
        ++ +  +V  L+ +MTL+EKI +M Q                             DA A +               RLGIP M   +A+HG   V     
Subjt:  NQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVY----

Query:  NATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAV-CRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEPPANYRKGIPYVGGTKKVI
        N T++P  V   +T +P L++++ + TA E RA G++  ++P + V   D R+GR  ESY EDP +V  M    I GLQG     + +          VI
Subjt:  NATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAV-CRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEPPANYRKGIPYVGGTKKVI

Query:  ACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIK-GVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYT
        A AKHFVG      GIN   + + +  L  +++P +  ++ + GV +VM  +  +NGV  H N  L+   L+  L F GF++SD   + R+ +  H    
Subjt:  ACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIK-GVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYT

Query:  YSVQAAIL---AGIDMVMVPYKYTEFIDDLTYLVKSNVIP----MDRIDDAVGRILSVKFTMGLFES-PLGDYSLVNELGSQAHRDLAREAVRQSLVLLK
           +AAIL   AG+DM +V  K  E     T ++K  ++     M  ID A  RIL+ K+ +GLF++ P    +   E G+  HR+ A E   +S+++LK
Subjt:  YSVQAAIL---AGIDMVMVPYKYTEFIDDLTYLVKSNVIP----MDRIDDAVGRILSVKFTMGLFES-PLGDYSLVNELGSQAHRDLAREAVRQSLVLLK

Query:  NGKNDSRPLLPLP-KKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAI----------V
        N  N    LLPL   K   + V G +A     + G + +   G+SG       S+L  +K  V    ++ + +  D D      F  AI          +
Subjt:  NGKNDSRPLLPLP-KKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAI----------V

Query:  VVGETPYAETE-GDSTTLTMLDPGPSIIKNVCDSVKCVVVV-ISGRPIVIEPYISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKLPRTWFKSVDQLP
        VVG +     E GD   L +      +++ +  + K V+VV I+GRP+ I     +I +++  W  G   G  V +V++GD    GKL  ++ + V Q+P
Subjt:  VVGETPYAETE-GDSTTLTMLDPGPSIIKNVCDSVKCVVVV-ISGRPIVIEPYISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKLPRTWFKSVDQLP

Query:  MNV---------GDPHY-----DPLFPFGFGLTTESVK
        +           G   Y      PLFPFGFGL+  + K
Subjt:  MNV---------GDPHY-----DPLFPFGFGLTTESVK

Arabidopsis top hitse value%identityAlignment
AT3G47000.1 Glycosyl hydrolase family protein7.5e-21258.55Show/hide
Query:  MVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDA
        +V+     YK+ + PV  RVKDLL RMTL EKIGQM QI+R VA+ +   D+FIGSVL+ GGSVP  DA + DW +MI+ FQ+ +L+SRLGIP++YG DA
Subjt:  MVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDA

Query:  VHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVG
        VHG+N VY AT+FPHN+GLGATRD  LVRRIGAATALEVRA+G+ +AF+PC+AV RDPRWGRCYESY EDP++V EMT ++ GLQG PP  +  G P+V 
Subjt:  VHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVG

Query:  GTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPP
        G   V+AC KHFVGDGGT  GINE NT+     L  IH+P YL  + +GVSTVMASYSSWNG ++HA+  L+T  LK  L FKGF++SDWEGLDR++ P 
Subjt:  GTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPP

Query:  HANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGK
         +NY Y ++ A+ AGIDMVMVP+KY +FI D+T LV+S  IPM RI+DAV RIL VKF  GLF  PL D SL+  +G + HR+LA+EAVR+SLVLLK+GK
Subjt:  HANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGK

Query:  NDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSND-FSYAIVVVGETPYAETEGD
        N  +P LPL + A +ILV GTHAD+LGYQCGGWT  W G SG   T GT++L AIK  V   TEV++ + P  + + S++ FSYAIV VGE PYAET GD
Subjt:  NDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSND-FSYAIVVVGETPYAETEGD

Query:  STTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEP-YISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFG
        ++ L +   G  I+  V + +  +V++ISGRP+V+EP  +   +ALVAAWLPG+EG GV DV++GDY F GKLP +WFK V+ LP++     YDPLFPFG
Subjt:  STTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEP-YISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFG

Query:  FGLTTESV
        FGL ++ V
Subjt:  FGLTTESV

AT5G04885.1 Glycosyl hydrolase family protein1.0e-29372.3Show/hide
Query:  MAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMIN
        M++  V++V +L     W     D EYL YKDP Q VS RV DL GRMTLEEKIGQMVQIDRSVA V +M+DYFIGSVLSGGGS PLP+ASAQ+W++MIN
Subjt:  MAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMIN

Query:  DFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE
        ++QKG+L SRLGIPM+YGIDAVHGHN VYNATIFPHNVGLGATRDP LV+RIGAATA+EVRATGI + FAPCIAVCRDPRWGRCYESYSED K+V++MT+
Subjt:  DFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE

Query:  IIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGT
        +I GLQGEPP+NY+ G+P+VGG  KV ACAKH+VGDGGTT G+NENNTV D HGLLS+HMPAY D++ KGVSTVM SYSSWNG KMHAN ELITG+LKGT
Subjt:  IIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGT

Query:  LKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDW+G+D+I++PPH +YT SV+AAI AGIDMVMVP+ +TEF++DLT LVK+N IP+ RIDDAV RIL VKFTMGLFE+PL DYS  +ELGSQ
Subjt:  LKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSN
        AHRDLAREAVR+SLVLLKNG N + P+LPLP+K  KILVAGTHADNLGYQCGGWTI WQGFSGN  TRGT++L+A+KS VD STEVVFRE+PD++F+KSN
Subjt:  AHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSN

Query:  DFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSV
        +F+YAI+ VGE PYAET GDS  LTMLDPGP+II + C +VKCVVVVISGRP+V+EPY++SIDALVAAWLPG+EG G+TD L+GD+GFSGKLP TWF++ 
Subjt:  DFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSV

Query:  DQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICI
        +QLPM+ GD HYDPLF +G GL TESV  +V+RSTS     T   L  ++ +  +C+
Subjt:  DQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICI

AT5G20940.1 Glycosyl hydrolase family protein8.5e-25669.03Show/hide
Query:  VQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKG
        +Q + +L L     A  V     KYKDP +P+ VR+K+L+  MTLEEKIGQMVQ++R  A   VM+ YF+GSV SGGGSVP P    + W+NM+N+ QK 
Subjt:  VQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKG

Query:  SLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGL
        +LS+RLGIP++YGIDAVHGHNTVYNATIFPHNVGLG TRDPGLV+RIG ATALEVRATGI + FAPCIAVCRDPRWGRCYESYSED KIVQ+MTEIIPGL
Subjt:  SLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGL

Query:  QGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKG
        QG+ P   +KG+P+V G  KV ACAKHFVGDGGT  G+N NNTVI+ +GLL IHMPAY D++ KGV+TVM SYSS NG+KMHAN +LITGFLK  LKF+G
Subjt:  QGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKG

Query:  FVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDL
         VISD+ G+D+I +P  ANY++SV AA  AG+DM M     T+ ID+LT  VK   IPM RIDDAV RIL VKFTMGLFE+P+ D+SL  +LGS+ HR+L
Subjt:  FVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDL

Query:  AREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYA
        AREAVR+SLVLLKNG+N  +PLLPLPKKA KILVAGTHADNLGYQCGGWTI WQG +GNN T GT+ILAA+K  VDP T+V++ ++PD++FVK+ DF YA
Subjt:  AREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYA

Query:  IVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPM
        IV VGE PYAE  GDST LT+ +PGPS I NVC SVKCVVVV+SGRP+V++  IS+IDALVAAWLPG+EG GV DVL+GDYGF+GKL RTWFK+VDQLPM
Subjt:  IVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPM

Query:  NVGDPHYDPLFPFGFGLTTE
        NVGDPHYDPL+PFGFGL T+
Subjt:  NVGDPHYDPLFPFGFGLTTE

AT5G20950.1 Glycosyl hydrolase family protein8.5e-27272.88Show/hide
Query:  LKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNT
        LKYKDP QP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VMK YFIGSVLSGGGSVP   A+ + W+NM+N+ QK SLS+RLGIPM+YGIDAVHGHN 
Subjt:  LKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNT

Query:  VYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVI
        VY ATIFPHNVGLG TRDP LV+RIGAATALEVRATGI +AFAPCIAVCRDPRWGRCYESYSED +IVQ+MTEIIPGLQG+ P   RKG+P+VGG  KV 
Subjt:  VYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVI

Query:  ACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTY
        ACAKHFVGDGGT  GI+ENNTVID  GL  IHMP Y +++ KGV+T+M SYS+WNG++MHAN EL+TGFLK  LKF+GFVISDW+G+DRIT+PPH NY+Y
Subjt:  ACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTY

Query:  SVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPL
        SV A I AGIDM+MVPY YTEFID+++  ++  +IP+ RIDDA+ RIL VKFTMGLFE PL D S  N+LGS+ HR+LAREAVR+SLVLLKNGK  ++PL
Subjt:  SVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPL

Query:  LPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTML
        LPLPKK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+ V P+T+VV+ ++PD++FVKS  F YAIVVVGE PYAE  GD+T LT+ 
Subjt:  LPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTML

Query:  DPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESV
        DPGPSII NVC SVKCVVVV+SGRP+VI+PY+S+IDALVAAWLPG+EG GV D L+GDYGF+GKL RTWFKSV QLPMNVGD HYDPL+PFGFGLTT+  
Subjt:  DPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESV

Query:  K
        K
Subjt:  K

AT5G20950.2 Glycosyl hydrolase family protein8.5e-27272.88Show/hide
Query:  LKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNT
        LKYKDP QP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VMK YFIGSVLSGGGSVP   A+ + W+NM+N+ QK SLS+RLGIPM+YGIDAVHGHN 
Subjt:  LKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNT

Query:  VYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVI
        VY ATIFPHNVGLG TRDP LV+RIGAATALEVRATGI +AFAPCIAVCRDPRWGRCYESYSED +IVQ+MTEIIPGLQG+ P   RKG+P+VGG  KV 
Subjt:  VYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVI

Query:  ACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTY
        ACAKHFVGDGGT  GI+ENNTVID  GL  IHMP Y +++ KGV+T+M SYS+WNG++MHAN EL+TGFLK  LKF+GFVISDW+G+DRIT+PPH NY+Y
Subjt:  ACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTY

Query:  SVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPL
        SV A I AGIDM+MVPY YTEFID+++  ++  +IP+ RIDDA+ RIL VKFTMGLFE PL D S  N+LGS+ HR+LAREAVR+SLVLLKNGK  ++PL
Subjt:  SVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPL

Query:  LPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTML
        LPLPKK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+ V P+T+VV+ ++PD++FVKS  F YAIVVVGE PYAE  GD+T LT+ 
Subjt:  LPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTML

Query:  DPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESV
        DPGPSII NVC SVKCVVVV+SGRP+VI+PY+S+IDALVAAWLPG+EG GV D L+GDYGF+GKL RTWFKSV QLPMNVGD HYDPL+PFGFGLTT+  
Subjt:  DPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESV

Query:  K
        K
Subjt:  K


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATTTCGGCTTTATTTCGTCGCGAGCAATCCATAATAAGCATATGCGAAGTATGCTGCTACGACTGTCACAACCGCCGCAGCCAGACTCGCCGTCACCGGCAACG
GCATCCTTCCGACGGTGAAGCGGCCGGCAAGTGTCGAAGCTCTGATCGCTCTCGGCAATGGAGTTCCAATGCTCACCTTGGGCATTTCTTGATTCTTACCCACCTCCTCT
GTTTTGGATTCCTTCGCCTTCGCGTCTTCTTTTTCCTTTCCCTTTTCTGGGTTTTGATTTTCGATTTCTTTCTTCTCCGTTGGCTTGTGATCGGGGCTGCTGATCATCTT
CTGATCCTGCGTCGTTTTAAGCTCGCCGGAATCTCCTTTTCTGCGGATTTTCCCTTATCGTTATCCGCTGGCTTCACTGAATTAGAATCGCCTTTCGGAAGCGCCTCCTC
CGGTCCCTTAGTTTCAGCCTTCGCCTCCGGCCGGGCTGCGTTTTCCGGCGAGATATCTTCCCTCCCCTTTTCAGGTGTTGGTCGTTCTGGGTCTTTGGGAGGCGGAGCCG
CCGCCGTCCCTTGTCCGATATTTTCCGGGGAGATATCTTCCCTTCCCTTTTCAGGGCTTGGTTGTTCTGGTTCTTTGGGAGCCGCCGCCGCCTCCTCCGCCGTCCCTTGT
CCGCTAGTTTCTTTGGGCATTGTAATGAAGGATGAAAAAACCGAAGATTCGAATGGAACGCTCAAGATTTCCATCGTCTTAGAGTGCTTACCTGCTTCTGCGACTACTCC
GCAGCCGTACGGCGATCGAAGTGTTTGGAAACTTTCTCCTTCTTCGAAATTTCAACGAGATAATTGGAAAATCGCAGGACGCACCAGAGAGCCGATCGGATCGGATCTGT
TTTCCTCTTCTCTTCTTGTTCGCTACCTGTTCGGCCATACCAGAGAGCCGCCGGAAGATCTCCGATTTCAGGATCGTCGATTCATGATGCTTAAATTAAAGCTGTTGTGG
AACTGGAAACAGTGTGGGTTCACCTCTCAGAAAAGGAAGATGGCCAAGATTTTTGTTCAGGTGGTTGCGATTCTGTGCTTGGGTTGGTGGTGGTGGGCAACGATGGTGGA
CGCGGAGTACTTGAAATACAAAGACCCTAACCAACCAGTTTCTGTTCGAGTTAAGGACCTTCTTGGCCGCATGACTCTAGAAGAGAAAATCGGTCAGATGGTTCAGATTG
ACAGGAGCGTTGCCAATGTTACAGTTATGAAAGATTATTTCATTGGAAGTGTGCTAAGTGGCGGTGGAAGTGTGCCGCTTCCAGATGCTAGTGCTCAGGATTGGATTAAC
ATGATTAACGATTTCCAGAAGGGTTCTCTTTCTAGTAGATTGGGCATCCCAATGATGTATGGCATTGATGCTGTTCATGGCCATAACACTGTTTACAATGCTACCATATT
TCCTCATAATGTTGGACTTGGAGCTACCAGAGACCCTGGCCTAGTTCGAAGGATTGGTGCTGCAACAGCACTTGAAGTTAGAGCGACGGGGATTTCTTTTGCCTTTGCTC
CATGCATTGCGGTTTGTAGGGACCCAAGGTGGGGGCGGTGTTATGAAAGCTACAGCGAGGATCCAAAAATTGTGCAAGAAATGACCGAGATTATACCTGGTCTGCAGGGA
GAGCCTCCTGCTAATTATCGGAAGGGGATTCCATATGTTGGTGGAACTAAGAAGGTTATCGCCTGTGCAAAGCACTTTGTAGGAGATGGTGGGACAACTAATGGCATCAA
TGAAAATAATACCGTTATTGACAAGCATGGACTGCTCAGCATTCACATGCCTGCCTATTTAGATTCGATCATCAAGGGCGTTTCAACGGTAATGGCTTCCTATTCAAGTT
GGAATGGTGTAAAGATGCATGCAAACCATGAGCTGATTACTGGCTTCCTCAAGGGCACTCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGAATT
ACTTCTCCGCCACATGCTAATTACACCTACTCTGTCCAAGCTGCAATTTTAGCTGGCATTGACATGGTCATGGTTCCTTACAAGTATACAGAGTTCATTGATGATCTTAC
GTATCTAGTCAAGAGCAATGTCATTCCAATGGATCGCATTGATGATGCTGTTGGGAGAATTTTATCAGTCAAATTTACAATGGGTCTTTTTGAAAGCCCCTTAGGCGATT
ACAGCCTTGTCAATGAACTTGGGAGCCAGGCACATAGAGACTTGGCAAGAGAAGCTGTGAGGCAGTCGCTCGTACTGCTGAAGAATGGGAAAAATGATAGTCGCCCATTG
CTGCCCCTTCCAAAGAAGGCCCCAAAGATCCTGGTTGCTGGCACTCATGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGAATGGCAAGGGTTCAGTGGCAA
CAATGGTACAAGGGGAACTAGCATCCTCGCTGCTATCAAATCAATGGTCGATCCAAGCACAGAAGTTGTATTCCGTGAGGATCCTGACAGTGACTTTGTTAAGTCCAATG
ACTTTTCATACGCCATTGTTGTCGTTGGCGAAACCCCATATGCTGAGACTGAAGGGGATAGTACAACGCTTACCATGTTGGATCCTGGTCCAAGCATCATCAAAAACGTT
TGTGATTCTGTAAAGTGCGTGGTGGTAGTCATTTCTGGAAGGCCAATAGTGATCGAACCATACATTTCATCAATTGATGCTCTTGTAGCAGCTTGGTTACCTGGCTCTGA
AGGCCTAGGAGTCACTGATGTCCTTTATGGAGACTATGGGTTTAGTGGGAAGCTTCCAAGAACATGGTTCAAATCTGTAGATCAGCTACCAATGAATGTTGGAGATCCAC
ACTATGATCCACTTTTTCCTTTTGGGTTCGGACTCACAACTGAATCGGTTAAGGACCTTGTCTCGAGGTCTACATCGCCCGGAATTCGTGAAACTCCATCCTTTCTTGCA
ATGATCGTTGCTACAATTGCCATTTGTATATTGCAGAAGTTACCAGATGACGATGACGATGAAGATGCAGATGATTCCCATTTATGGAACCTGACCAGCCATGAATCGGA
AGCGATCTCAAGTCCAGTTCAGACCAACAAAAACAGACTTACCCCTCGTTTCTCTAAACATTCAGACAGAGAAGTCGCCACTTCTGGGGCTTTATCCTTCTCTATGAAGA
AGCTAACTTTATGGGGTCGTTGCTCCGTTTCTGACTGCGATATCCTCATTTTGTCGTGCAGGGGGGAGAGGAATGGTGATTTCGCGAGGTTGGAGAAATTCACCATTGTG
AGATCAAGATCGGAAACAATCATGAGGTTGGCTGAACCATCTAGTCGATCCATAGATGACGGGTCCCTTCTTCAAATTCAGATTAGGGCAACTCAGGATACGCTGACTTC
ACCGATAAAAGTAGATCTTTTACAATGTTCAAATGGCAGACTGCTGAATTATCAAGCAATCCGTAGGCTTAAATACGACACACAAATGCCAACTTTGAAGTTTGGGATTT
CAATTGAACCGCCTAACCACATGAAGATGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAATTTCGGCTTTATTTCGTCGCGAGCAATCCATAATAAGCATATGCGAAGTATGCTGCTACGACTGTCACAACCGCCGCAGCCAGACTCGCCGTCACCGGCAACG
GCATCCTTCCGACGGTGAAGCGGCCGGCAAGTGTCGAAGCTCTGATCGCTCTCGGCAATGGAGTTCCAATGCTCACCTTGGGCATTTCTTGATTCTTACCCACCTCCTCT
GTTTTGGATTCCTTCGCCTTCGCGTCTTCTTTTTCCTTTCCCTTTTCTGGGTTTTGATTTTCGATTTCTTTCTTCTCCGTTGGCTTGTGATCGGGGCTGCTGATCATCTT
CTGATCCTGCGTCGTTTTAAGCTCGCCGGAATCTCCTTTTCTGCGGATTTTCCCTTATCGTTATCCGCTGGCTTCACTGAATTAGAATCGCCTTTCGGAAGCGCCTCCTC
CGGTCCCTTAGTTTCAGCCTTCGCCTCCGGCCGGGCTGCGTTTTCCGGCGAGATATCTTCCCTCCCCTTTTCAGGTGTTGGTCGTTCTGGGTCTTTGGGAGGCGGAGCCG
CCGCCGTCCCTTGTCCGATATTTTCCGGGGAGATATCTTCCCTTCCCTTTTCAGGGCTTGGTTGTTCTGGTTCTTTGGGAGCCGCCGCCGCCTCCTCCGCCGTCCCTTGT
CCGCTAGTTTCTTTGGGCATTGTAATGAAGGATGAAAAAACCGAAGATTCGAATGGAACGCTCAAGATTTCCATCGTCTTAGAGTGCTTACCTGCTTCTGCGACTACTCC
GCAGCCGTACGGCGATCGAAGTGTTTGGAAACTTTCTCCTTCTTCGAAATTTCAACGAGATAATTGGAAAATCGCAGGACGCACCAGAGAGCCGATCGGATCGGATCTGT
TTTCCTCTTCTCTTCTTGTTCGCTACCTGTTCGGCCATACCAGAGAGCCGCCGGAAGATCTCCGATTTCAGGATCGTCGATTCATGATGCTTAAATTAAAGCTGTTGTGG
AACTGGAAACAGTGTGGGTTCACCTCTCAGAAAAGGAAGATGGCCAAGATTTTTGTTCAGGTGGTTGCGATTCTGTGCTTGGGTTGGTGGTGGTGGGCAACGATGGTGGA
CGCGGAGTACTTGAAATACAAAGACCCTAACCAACCAGTTTCTGTTCGAGTTAAGGACCTTCTTGGCCGCATGACTCTAGAAGAGAAAATCGGTCAGATGGTTCAGATTG
ACAGGAGCGTTGCCAATGTTACAGTTATGAAAGATTATTTCATTGGAAGTGTGCTAAGTGGCGGTGGAAGTGTGCCGCTTCCAGATGCTAGTGCTCAGGATTGGATTAAC
ATGATTAACGATTTCCAGAAGGGTTCTCTTTCTAGTAGATTGGGCATCCCAATGATGTATGGCATTGATGCTGTTCATGGCCATAACACTGTTTACAATGCTACCATATT
TCCTCATAATGTTGGACTTGGAGCTACCAGAGACCCTGGCCTAGTTCGAAGGATTGGTGCTGCAACAGCACTTGAAGTTAGAGCGACGGGGATTTCTTTTGCCTTTGCTC
CATGCATTGCGGTTTGTAGGGACCCAAGGTGGGGGCGGTGTTATGAAAGCTACAGCGAGGATCCAAAAATTGTGCAAGAAATGACCGAGATTATACCTGGTCTGCAGGGA
GAGCCTCCTGCTAATTATCGGAAGGGGATTCCATATGTTGGTGGAACTAAGAAGGTTATCGCCTGTGCAAAGCACTTTGTAGGAGATGGTGGGACAACTAATGGCATCAA
TGAAAATAATACCGTTATTGACAAGCATGGACTGCTCAGCATTCACATGCCTGCCTATTTAGATTCGATCATCAAGGGCGTTTCAACGGTAATGGCTTCCTATTCAAGTT
GGAATGGTGTAAAGATGCATGCAAACCATGAGCTGATTACTGGCTTCCTCAAGGGCACTCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGAATT
ACTTCTCCGCCACATGCTAATTACACCTACTCTGTCCAAGCTGCAATTTTAGCTGGCATTGACATGGTCATGGTTCCTTACAAGTATACAGAGTTCATTGATGATCTTAC
GTATCTAGTCAAGAGCAATGTCATTCCAATGGATCGCATTGATGATGCTGTTGGGAGAATTTTATCAGTCAAATTTACAATGGGTCTTTTTGAAAGCCCCTTAGGCGATT
ACAGCCTTGTCAATGAACTTGGGAGCCAGGCACATAGAGACTTGGCAAGAGAAGCTGTGAGGCAGTCGCTCGTACTGCTGAAGAATGGGAAAAATGATAGTCGCCCATTG
CTGCCCCTTCCAAAGAAGGCCCCAAAGATCCTGGTTGCTGGCACTCATGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGAATGGCAAGGGTTCAGTGGCAA
CAATGGTACAAGGGGAACTAGCATCCTCGCTGCTATCAAATCAATGGTCGATCCAAGCACAGAAGTTGTATTCCGTGAGGATCCTGACAGTGACTTTGTTAAGTCCAATG
ACTTTTCATACGCCATTGTTGTCGTTGGCGAAACCCCATATGCTGAGACTGAAGGGGATAGTACAACGCTTACCATGTTGGATCCTGGTCCAAGCATCATCAAAAACGTT
TGTGATTCTGTAAAGTGCGTGGTGGTAGTCATTTCTGGAAGGCCAATAGTGATCGAACCATACATTTCATCAATTGATGCTCTTGTAGCAGCTTGGTTACCTGGCTCTGA
AGGCCTAGGAGTCACTGATGTCCTTTATGGAGACTATGGGTTTAGTGGGAAGCTTCCAAGAACATGGTTCAAATCTGTAGATCAGCTACCAATGAATGTTGGAGATCCAC
ACTATGATCCACTTTTTCCTTTTGGGTTCGGACTCACAACTGAATCGGTTAAGGACCTTGTCTCGAGGTCTACATCGCCCGGAATTCGTGAAACTCCATCCTTTCTTGCA
ATGATCGTTGCTACAATTGCCATTTGTATATTGCAGAAGTTACCAGATGACGATGACGATGAAGATGCAGATGATTCCCATTTATGGAACCTGACCAGCCATGAATCGGA
AGCGATCTCAAGTCCAGTTCAGACCAACAAAAACAGACTTACCCCTCGTTTCTCTAAACATTCAGACAGAGAAGTCGCCACTTCTGGGGCTTTATCCTTCTCTATGAAGA
AGCTAACTTTATGGGGTCGTTGCTCCGTTTCTGACTGCGATATCCTCATTTTGTCGTGCAGGGGGGAGAGGAATGGTGATTTCGCGAGGTTGGAGAAATTCACCATTGTG
AGATCAAGATCGGAAACAATCATGAGGTTGGCTGAACCATCTAGTCGATCCATAGATGACGGGTCCCTTCTTCAAATTCAGATTAGGGCAACTCAGGATACGCTGACTTC
ACCGATAAAAGTAGATCTTTTACAATGTTCAAATGGCAGACTGCTGAATTATCAAGCAATCCGTAGGCTTAAATACGACACACAAATGCCAACTTTGAAGTTTGGGATTT
CAATTGAACCGCCTAACCACATGAAGATGAACTGA
Protein sequenceShow/hide protein sequence
MQISALFRREQSIISICEVCCYDCHNRRSQTRRHRQRHPSDGEAAGKCRSSDRSRQWSSNAHLGHFLILTHLLCFGFLRLRVFFFLSLFWVLIFDFFLLRWLVIGAADHL
LILRRFKLAGISFSADFPLSLSAGFTELESPFGSASSGPLVSAFASGRAAFSGEISSLPFSGVGRSGSLGGGAAAVPCPIFSGEISSLPFSGLGCSGSLGAAAASSAVPC
PLVSLGIVMKDEKTEDSNGTLKISIVLECLPASATTPQPYGDRSVWKLSPSSKFQRDNWKIAGRTREPIGSDLFSSSLLVRYLFGHTREPPEDLRFQDRRFMMLKLKLLW
NWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWIN
MINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQG
EPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRI
TSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPL
LPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNV
CDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA
MIVATIAICILQKLPDDDDDEDADDSHLWNLTSHESEAISSPVQTNKNRLTPRFSKHSDREVATSGALSFSMKKLTLWGRCSVSDCDILILSCRGERNGDFARLEKFTIV
RSRSETIMRLAEPSSRSIDDGSLLQIQIRATQDTLTSPIKVDLLQCSNGRLLNYQAIRRLKYDTQMPTLKFGISIEPPNHMKMN