; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011838 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011838
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionGlycosyl hydrolase family protein
Genome locationtig00153092:38578..41770
RNA-Seq ExpressionSgr011838
SyntenySgr011838
Gene Ontology termsGO:0009251 - glucan catabolic process (biological process)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domainsIPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR002772 - Glycoside hydrolase family 3 C-terminal domain
IPR017853 - Glycoside hydrolase superfamily
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK19869.1 beta-glucosidase BoGH3B-like [Cucumis melo var. makuwa]1.0e-21567.67Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER NAST +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  -----DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGT
             DPQL+K+IG A+ALE+RATGIPYAFAPC+AVCRDPRWGRCYESY EDPK+VQ MTEII GLQGEIPPNSRKGVP+VAGKEKV ACAKH+VGDGGT
Subjt:  -----DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGT

Query:  TKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDM
        TKGI+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYSSWNG KMHANK L+TDFLKNTLHFQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM
Subjt:  TKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDM

Query:  VGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLL
        + +  N    I GL                  KRILRVKF+MGLFEN +ADLSLVNELGK+EHRELAREAVRKSLVLLKNG+SADKPLLPL KK QK  +
Subjt:  VGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLL

Query:  LVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE-------------
        LVA     ++    G W        T+    +  N L  GTTVL AIKDT+DP TEVIF ENPDK F QS  FSYAIVVVGE+PYAE             
Subjt:  LVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE-------------

Query:  ------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT
                           +GRPVVIQPY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGD HYDPLFPLG+GLTT+PI T
Subjt:  ------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT

XP_008443733.1 PREDICTED: beta-glucosidase BoGH3B-like [Cucumis melo]2.0e-21163.74Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER NAST +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DPQL+K+IG A+ALE+RATGIPYAFAPC+AVCRDPRWGRCYESY EDPK+VQ MTEII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
         GLQGEIPPNSRKGVP+VAGKEKV ACAKH+VGDGGTTKGI+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYSSWNG KMHANK L+TDFLKNTLH
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH
        FQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM+ +  N    I GL                  KRILRVKF+MGLFEN +ADLSLVNELGK+EH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNG+SADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTVL AIKDT+DP TEVIF ENP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
        DK F QS  FSYAIVVVGE+PYAE                                +GRPVVIQPY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT
        SQTWFKTVDQLPMNFGD HYDPLFPLG+GLTT+PI T
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT

XP_011648555.1 uncharacterized protein LOC101211593 [Cucumis sativus]1.7e-20763.94Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MAK  IIL+ LLLIC  E   KAE  +Y+DP Q L  RI DLLGRMTLEEKIGQMVQIER NAST +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DPQL+K+IGVA+A E+RATGIPYAFAPCVAVCRDPRWGRCYESY EDPKIVQ MTEII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
         GLQGEIPPNSRKGVP+VAGKE V ACAKH+VGDGGTTKGI+ENNTVIDRHGLLSIHMPGYY+SIIKGVATIMVSYSSWNGEKMHANKNL+TDFLKNTLH
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH
        FQGFVISDW+ IDRIT PPHANYTYSILAS+TAG+DM+ I  N    I GL                  KRILRVKFVMGLFEN +ADLSLVNELGK+EH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNG+SADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTVL AIKDT+DP TEVIF ENP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAETN-------------------------------GRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
        DK+  QS  FSYAIVVVGE+PYAE N                               GRPVVIQPY+ SIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAETN-------------------------------GRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI
        SQTWFKTVDQLPMNFG+P+YDPLFP G+GLTT+PI
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI

XP_022155346.1 uncharacterized protein LOC111022483 [Momordica charantia]7.5e-21163.15Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MAK+ I LMG+LL+CFSEAL K  YLRY+DPKQPL  RI DLLGRMTLEEKIGQMVQI+RT AS  +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DP+LVK+IGVATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSEDPKIVQ+MTEII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
        SGLQGEIP NSRKGVP+VAG+EKVAACAKHFVGDGGTTKGINENNTVI+RHGLLS HMPGYYNSIIKGV+TIM+SYSSWNG+KMHAN+ LITDFLKNTL 
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH
        F+GFVISDWQGIDRIT+PPHANYTYSI+  VTAGIDM+ +  N    I GL                  KRILRVKF+MGLFEN LAD S +++LGKKEH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNGESADKP+LPLPKKA K  +LVA     ++    G W        T+    +  N L  GTT+L+AIKDT+DP+TEV+F ENP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAET-------------------------------NGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
        D EF +S+KFSYAIVVVGE+PYAET                               +GRPVVIQPY A IDALVAAWLPGTEG GITDVLFGDYGFTGKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAET-------------------------------NGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI
        S+TWFKTVDQLPMN GDPHYDPLFP G+GLTT P+
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI

XP_038905524.1 LOW QUALITY PROTEIN: beta-glucosidase BoGH3B-like [Benincasa hispida]2.7e-20862.36Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MA+VLI L+GLL +CFSE L +AEYL+Y+DPKQPL  RI DLLGRMT EEKIGQMVQIER NA+  +M+K                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DPQLVK+ G+ATALEVRATGIPY FAPC+AVCRDPRWGRCYESY EDPKI+QAM EII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
         GLQG+IPPNSRKGVP+VAGK+ VAACAKHFVGDGGTTKGINENNTVIDRH LLSIHMPGYYNSIIKGVAT+MVSYSS NGEKMHAN+NL+T+FLKNTL+
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH
        F+GFVISDWQGID+IT+PPH+NYTYSI+ASV AG+DM+ +  N    I GL                  KRILRVKF+MGLFEN LADLSL+NELGK+EH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNG+  ++PLLPLPKKA K  +LVA     ++    G W        T+       N L +GT +LAAIKDT+DPET+VIF+ENP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAETN-------------------------------GRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
          EF +SH FSYAIVVVGE PYAETN                               GRPVVIQPYIAS+DALVA WLPGTEGKGITDVLFGDYGF GKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAETN-------------------------------GRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI
        S TWFKTVDQLPMNFGDPHYDPLFP GYGLTT+PI
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI

TrEMBL top hitse value%identityAlignment
A0A0A0LY55 Uncharacterized protein8.4e-20863.94Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MAK  IIL+ LLLIC  E   KAE  +Y+DP Q L  RI DLLGRMTLEEKIGQMVQIER NAST +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DPQL+K+IGVA+A E+RATGIPYAFAPCVAVCRDPRWGRCYESY EDPKIVQ MTEII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
         GLQGEIPPNSRKGVP+VAGKE V ACAKH+VGDGGTTKGI+ENNTVIDRHGLLSIHMPGYY+SIIKGVATIMVSYSSWNGEKMHANKNL+TDFLKNTLH
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH
        FQGFVISDW+ IDRIT PPHANYTYSILAS+TAG+DM+ I  N    I GL                  KRILRVKFVMGLFEN +ADLSLVNELGK+EH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNG+SADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTVL AIKDT+DP TEVIF ENP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAETN-------------------------------GRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
        DK+  QS  FSYAIVVVGE+PYAE N                               GRPVVIQPY+ SIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAETN-------------------------------GRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI
        SQTWFKTVDQLPMNFG+P+YDPLFP G+GLTT+PI
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI

A0A1S3B892 beta-glucosidase BoGH3B-like9.6e-21263.74Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER NAST +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DPQL+K+IG A+ALE+RATGIPYAFAPC+AVCRDPRWGRCYESY EDPK+VQ MTEII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
         GLQGEIPPNSRKGVP+VAGKEKV ACAKH+VGDGGTTKGI+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYSSWNG KMHANK L+TDFLKNTLH
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH
        FQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM+ +  N    I GL                  KRILRVKF+MGLFEN +ADLSLVNELGK+EH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNG+SADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTVL AIKDT+DP TEVIF ENP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
        DK F QS  FSYAIVVVGE+PYAE                                +GRPVVIQPY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT
        SQTWFKTVDQLPMNFGD HYDPLFPLG+GLTT+PI T
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT

A0A5A7T9L3 Beta-glucosidase BoGH3B-like9.6e-21263.74Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER NAST +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DPQL+K+IG A+ALE+RATGIPYAFAPC+AVCRDPRWGRCYESY EDPK+VQ MTEII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
         GLQGEIPPNSRKGVP+VAGKEKV ACAKH+VGDGGTTKGI+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYSSWNG KMHANK L+TDFLKNTLH
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH
        FQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM+ +  N    I GL                  KRILRVKF+MGLFEN +ADLSLVNELGK+EH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNG+SADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTVL AIKDT+DP TEVIF ENP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
        DK F QS  FSYAIVVVGE+PYAE                                +GRPVVIQPY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT
        SQTWFKTVDQLPMNFGD HYDPLFPLG+GLTT+PI T
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT

A0A5D3D8S5 Beta-glucosidase BoGH3B-like4.9e-21667.67Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER NAST +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  -----DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGT
             DPQL+K+IG A+ALE+RATGIPYAFAPC+AVCRDPRWGRCYESY EDPK+VQ MTEII GLQGEIPPNSRKGVP+VAGKEKV ACAKH+VGDGGT
Subjt:  -----DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGT

Query:  TKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDM
        TKGI+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYSSWNG KMHANK L+TDFLKNTLHFQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM
Subjt:  TKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDM

Query:  VGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLL
        + +  N    I GL                  KRILRVKF+MGLFEN +ADLSLVNELGK+EHRELAREAVRKSLVLLKNG+SADKPLLPL KK QK  +
Subjt:  VGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLL

Query:  LVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE-------------
        LVA     ++    G W        T+    +  N L  GTTVL AIKDT+DP TEVIF ENPDK F QS  FSYAIVVVGE+PYAE             
Subjt:  LVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE-------------

Query:  ------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT
                           +GRPVVIQPY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGD HYDPLFPLG+GLTT+PI T
Subjt:  ------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIIT

A0A6J1DRG0 uncharacterized protein LOC1110224833.7e-21163.15Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        MAK+ I LMG+LL+CFSEAL K  YLRY+DPKQPL  RI DLLGRMTLEEKIGQMVQI+RT AS  +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DP+LVK+IGVATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSEDPKIVQ+MTEII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
        SGLQGEIP NSRKGVP+VAG+EKVAACAKHFVGDGGTTKGINENNTVI+RHGLLS HMPGYYNSIIKGV+TIM+SYSSWNG+KMHAN+ LITDFLKNTL 
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH
        F+GFVISDWQGIDRIT+PPHANYTYSI+  VTAGIDM+ +  N    I GL                  KRILRVKF+MGLFEN LAD S +++LGKKEH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNGESADKP+LPLPKKA K  +LVA     ++    G W        T+    +  N L  GTT+L+AIKDT+DP+TEV+F ENP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAET-------------------------------NGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
        D EF +S+KFSYAIVVVGE+PYAET                               +GRPVVIQPY A IDALVAAWLPGTEG GITDVLFGDYGFTGKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAET-------------------------------NGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI
        S+TWFKTVDQLPMN GDPHYDPLFP G+GLTT P+
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI

SwissProt top hitse value%identityAlignment
A7LXU3 Beta-glucosidase BoGH3B2.0e-3325.19Show/hide
Query:  QLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAM-TEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINE
        +L ++    +A E +A  IP+ FAP V + RDPRW R +E+Y ED  +   M    + G QGE         P+  G+  VAAC KH++G G    G + 
Subjt:  QLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAM-TEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINE

Query:  NNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPH--ANYTYSILASVTAGIDMVGIV
          + I R  +   H   +  ++ +G  ++MV+    NG   HAN+ L+T++LK  L++ G +++DW  I+ + T  H  A    ++   + AGIDM  + 
Subjt:  NNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPH--ANYTYSILASVTAGIDMVGIV

Query:  LNIY---------------------GLKRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQ
          +                       + R+LR+K+ +GLF++   D+   ++ G KE   +A +A  +S VLLKN    D  +LP+ K  +  L      
Subjt:  LNIY---------------------GLKRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQ

Query:  TTLDINVAVGL-WNGKDLVATTLLLHNIY---------INFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAET-------
        +   +N      W G          H IY          N +       A+ K+    E     K   +K    + +    I  +GE  Y ET       
Subjt:  TTLDINVAVGL-WNGKDLVATTLLLHNIY---------INFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAET-------

Query:  -------------------------NGRPVVIQPYIASIDALVAAWLPGT-EGKGITDVLFGDYGFTGKLSQTW-----------FKTVDQLPMNFGDPH
                                  GRP +I   +    A+V   LP    G  + ++L GD  F+GK+  T+           +K  + +    G+ +
Subjt:  -------------------------NGRPVVIQPYIASIDALVAAWLPGT-EGKGITDVLFGDYGFTGKLSQTW-----------FKTVDQLPMNFGDPH

Query:  YDPL----FPLGYGLT
        YD +    +P G+GL+
Subjt:  YDPL----FPLGYGLT

P33363 Periplasmic beta-glucosidase1.7e-2725Show/hide
Query:  VKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTE-IISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENN
        VK +G  +A E    G+   +AP V V RDPRWGR  E + ED  +   M + ++  +QG+ P          A +  V    KHF   G    G   N 
Subjt:  VKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTE-IISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENN

Query:  TVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGI-DRITTPPHANYTYSILASVTAGIDM-------
          +    L + +MP Y   +  G   +MV+ +S NG    ++  L+ D L++   F+G  +SD   I + I     A+   ++  ++ +GI+M       
Subjt:  TVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGI-DRITTPPHANYTYSILASVTAGIDM-------

Query:  ------------VGIVLNIYGLKRILRVKFVMGLFENSLADLS------LVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAM
                    V +       + +L VK+ MGLF +  + L       +      + HR+ ARE  R+SLVLLKN        LPL K A   ++    
Subjt:  ------------VGIVLNIYGLKRILRVKFVMGLFENSLADLS------LVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAM

Query:  QTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPD-----------------------------KEFFQSHKFSY
         +  D+   +G W+   +   ++              TVL  IK+ +    +V++ +  +                              E  Q+ K S 
Subjt:  QTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPD-----------------------------KEFFQSHKFSY

Query:  AIV-VVGE---------------YPYAE------------------TNGRPVVIQPYIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTV
         +V VVGE                P ++                   NGRP+ +       DA++  W  GTE G  I DVLFGDY  +GKL  ++ ++V
Subjt:  AIV-VVGE---------------YPYAE------------------TNGRPVVIQPYIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTV

Query:  DQLP-----MNFGDP------------HYD----PLFPLGYGLT
         Q+P     +N G P            ++D     L+P GYGL+
Subjt:  DQLP-----MNFGDP------------HYD----PLFPLGYGLT

Q23892 Lysosomal beta glucosidase1.0e-2926.59Show/hide
Query:  TALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAM-TEIISGLQGEIPPNSRKGVPHVAGKEKVAAC-AKHFVGDGGTTKGINENNTVIDRH
        T+ +  A GIP+ FAP + +   P W R YE++ EDP +   M    + G QG    NS  G  +       A C AKH+ G    T G +     I   
Subjt:  TALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAM-TEIISGLQGEIPPNSRKGVPHVAGKEKVAAC-AKHFVGDGGTTKGINENNTVIDRH

Query:  GLLSIHMPGYYNSII-KGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPH--ANYTYSILASVTAGIDMVGIVLNI----
         L    +P +  +I   G  TIM++    NG  MH +   +T+ L+  L F+G  ++DWQ I+++    H   +   +IL ++ AGIDM  + L++    
Subjt:  GLLSIHMPGYYNSII-KGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPH--ANYTYSILASVTAGIDMVGIVLNI----

Query:  -----------------YGLKRILRVKFVMGLFENSL--ADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDI
                           ++RIL +K+ +GLF N     + ++V+ +G+ + RE A     +S+ LL+N  +    +LPL     K +LL         
Subjt:  -----------------YGLKRILRVKFVMGLFENSL--ADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDI

Query:  NVAVGL---WNG----KDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP-DKEFFQSHKFSYAIVVVGEYPYAETNG------------
        N+  G    W G     +    T +L  +     +   T    I+ TI  E  V   +   D+    +      +VV+GE P AET G            
Subjt:  NVAVGL---WNG----KDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP-DKEFFQSHKFSYAIVVVGEYPYAETNG------------

Query:  --------------------RPVVIQP-YIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGDPHY---------DPLFPLG
                            RP ++ P  + S  A++ A+LPG+E GK I ++L G+   +G+L  T+  T   +    G P+Y          PLF  G
Subjt:  --------------------RPVVIQP-YIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGDPHY---------DPLFPLG

Query:  YGLT
         GL+
Subjt:  YGLT

Q56078 Periplasmic beta-glucosidase7.5e-2825.09Show/hide
Query:  VKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTE-IISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENN
        V+ +G  +A E    G+   +AP V V RDPRWGR  E + ED  +   M E ++  +QG+ P          A +  V    KHF   G    G   N 
Subjt:  VKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTE-IISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENN

Query:  TVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGI-DRITTPPHANYTYSILASVTAGIDM-------
          +    L + +MP Y   +  G   +MV+ +S NG    ++  L+ D L++   F+G  +SD   I + I     A+   ++  ++ AG+DM       
Subjt:  TVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGI-DRITTPPHANYTYSILASVTAGIDM-------

Query:  ------------VGIVLNIYGLKRILRVKFVMGLFENSLADLS------LVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAM
                    V +       + +L VK+ MGLF +  + L       +      + HR+ ARE  R+S+VLLKN        LPL K     ++    
Subjt:  ------------VGIVLNIYGLKRILRVKFVMGLFENSLADLS------LVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAM

Query:  QTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVI---------FKENP-------DKEFFQSHKFSYAIVVVGE------
         +  D+   +G W+   +   ++ +     N +  G  +L A    I  +  ++          K +P       D+    + +    + VVGE      
Subjt:  QTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVI---------FKENP-------DKEFFQSHKFSYAIVVVGE------

Query:  ---------YPYAE------------------TNGRPVVIQPYIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTVDQLP-----MNFGD
                  P ++                   NGRP+ +       DA++  W  GTE G  I DVLFGDY  +GKL  ++ ++V Q+P     +N G 
Subjt:  ---------YPYAE------------------TNGRPVVIQPYIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTVDQLP-----MNFGD

Query:  P------------HYD----PLFPLGYGLT
        P            ++D    PL+P GYGL+
Subjt:  P------------HYD----PLFPLGYGLT

T2KMH0 Beta-xylosidase2.4e-3425.56Show/hide
Query:  LMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTN------------------------ASTTIMKK--------DPQLV
        LMGLLL  F   + +       +  + +  ++  L+ +MTL+EKI +M Q    N                         +TT+  +        +P+L+
Subjt:  LMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTN------------------------ASTTIMKK--------DPQLV

Query:  KKIGVATALEVRATGIPYAFAPCVAV-CRDPRWGRCYESYSEDPKIVQAM-TEIISGLQGEIPPNSRKGVPHVAGKEK-----VAACAKHFVGDGGTTKG
        KK+   TA E RA G+ + ++P + V   D R+GR  ESY EDP +V  M    I GLQG              G+E+     V A AKHFVG     +G
Subjt:  KKIGVATALEVRATGIPYAFAPCVAV-CRDPRWGRCYESYSEDPKIVQAM-TEIISGLQGEIPPNSRKGVPHVAGKEK-----VAACAKHFVGDGGTTKG

Query:  INENNTVIDRHGLLSIHMPGYYNSIIK-GVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPH--ANYTYSILASVTAGIDM
        IN   + +    L  +++P +  ++ + GV ++M  +  +NG   H N  L+ D L++ L F GF++SD   + R+ T      N T + +  + AG+DM
Subjt:  INENNTVIDRHGLLSIHMPGYYNSIIK-GVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPH--ANYTYSILASVTAGIDM

Query:  VGIV----------LNI----------------YGLKRILRVKFVMGLFENSLADLSLVN-ELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKA
          ++           NI                    RIL  K+ +GLF+     +     E G  EHRE A E   KS+++LKN    D  LLPL    
Subjt:  VGIV----------LNI----------------YGLKRILRVKFVMGLFENSLADLSLVN-ELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKA

Query:  QKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAI----------VVVGE---
         K L ++      +          +    T  LL   Y        +VL  +K  +    ++ + +  D + F    F  AI          +VVG    
Subjt:  QKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAI----------VVVGE---

Query:  -------------YPYAE-----------------TNGRPVVIQPYIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNF----
                     Y   +                  NGRP+ I     +I +++  W  G   G  + +V+FGD    GKL+ ++ + V Q+P+ +    
Subjt:  -------------YPYAE-----------------TNGRPVVIQPYIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNF----

Query:  -----GDPHY-----DPLFPLGYGLT
             G   Y      PLFP G+GL+
Subjt:  -----GDPHY-----DPLFPLGYGLT

Arabidopsis top hitse value%identityAlignment
AT3G62710.1 Glycosyl hydrolase family protein2.2e-13145.93Show/hide
Query:  YLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNAS---------------------------------------TTIMKK---------------
        Y++Y+DPK  +  R+ DLL RMTL EK+GQM QI+R N S                                       T  MKK               
Subjt:  YLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNAS---------------------------------------TTIMKK---------------

Query:  -------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTE-IISGLQGEIPPNSRKGVP
                                 DPQLVKKIG  TA EVRATG+  AFAPCVAVCRDPRWGRCYESYSEDP +V  MTE II GLQG          P
Subjt:  -------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTE-IISGLQGEIPPNSRKGVP

Query:  HVAG-KEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRI
        ++A  K  VA CAKHFVGDGGT  GINENNTV D   L  IHMP +  ++ KG+A+IM SYSS NG KMHAN+ +ITD+LKNTL FQGFVISDW GID+I
Subjt:  HVAG-KEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRI

Query:  TTPPHANYTYSILASVTAGIDMVGI----------VLNIY------------GLKRILRVKFVMGLFENSLADLSL-VNELGKKEHRELAREAVRKSLVL
        T    +NYTYSI AS+ AGIDMV +          + N+              ++RILRVKF +GLFENSLAD  L   E G + HRE+ REAVRKS+VL
Subjt:  TTPPHANYTYSILASVTAGIDMVGI----------VLNIY------------GLKRILRVKFVMGLFENSLADLSL-VNELGKKEHRELAREAVRKSLVL

Query:  LKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVG----LW---NGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQS
        LKNG++    ++PLPKK +K  ++VA +   D+    G     W   NG      T   H +     + GTT+L AI+  +DP TEV++ E P+++  + 
Subjt:  LKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVG----LW---NGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQS

Query:  H-KFSYAIVVVGEYPYAET--------------------------------NGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWF
        H   +Y IVVVGE PYAET                                 GRP+VI+PYI  +DAL  AWLPGTEG+G+ DVLFGD+ FTG L +TW 
Subjt:  H-KFSYAIVVVGEYPYAET--------------------------------NGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWF

Query:  KTVDQLPMNFGDPHYDPLFPLGYGLTT
        K V QLPMN GD +YDPL+P GYG+ T
Subjt:  KTVDQLPMNFGDPHYDPLFPLGYGLTT

AT5G04885.1 Glycosyl hydrolase family protein7.4e-16449.13Show/hide
Query:  LIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMK-----------------------------------
        +++ M + + C+ +     EYL Y+DPKQ +  R+ DL GRMTLEEKIGQMVQI+R+ A+  IM+                                   
Subjt:  LIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMK-----------------------------------

Query:  -------------------------------------KDPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISGLQ
                                             +DP LVK+IG ATA+EVRATGIPY FAPC+AVCRDPRWGRCYESYSED K+V+ MT++I GLQ
Subjt:  -------------------------------------KDPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISGLQ

Query:  GEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGF
        GE P N + GVP V G++KVAACAKH+VGDGGTT+G+NENNTV D HGLLS+HMP Y +++ KGV+T+MVSYSSWNGEKMHAN  LIT +LK TL F+GF
Subjt:  GEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGF

Query:  VISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIY----------------------GLKRILRVKFVMGLFENSLADLSLVNELGKKEHRELA
        VISDWQG+D+I+TPPH +YT S+ A++ AGIDMV +  N                         ++RIL VKF MGLFEN LAD S  +ELG + HR+LA
Subjt:  VISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIY----------------------GLKRILRVKFVMGLFENSLADLSLVNELGKKEHRELA

Query:  REAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEF
        REAVRKSLVLLKNG   + P+LPLP+K  K  +LVA     ++    G W        T+       N    GTT+L+A+K  +D  TEV+F+ENPD EF
Subjt:  REAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEF

Query:  FQSHKFSYAIVVVGEYPYAET-------------------------------NGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTW
         +S+ F+YAI+ VGE PYAET                               +GRP+V++PY+ASIDALVAAWLPGTEG+GITD LFGD+GF+GKL  TW
Subjt:  FQSHKFSYAIVVVGEYPYAET-------------------------------NGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTW

Query:  FKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI
        F+  +QLPM++GD HYDPLF  G GL T+ +
Subjt:  FKTVDQLPMNFGDPHYDPLFPLGYGLTTKPI

AT5G20940.1 Glycosyl hydrolase family protein1.7e-16852.69Show/hide
Query:  LIILMGLLLICFSEALTKAEY--LRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK--------------------------------
        L+  +GLLL+C + A  K      +Y+DPK+PL  RI +L+  MTLEEKIGQMVQ+ER NA+T +M+K                                
Subjt:  LIILMGLLLICFSEALTKAEY--LRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK--------------------------------

Query:  ----------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISG
                                                DP LVK+IG ATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSED KIVQ MTEII G
Subjt:  ----------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISG

Query:  LQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQ
        LQG++ P  +KGVP VAGK KVAACAKHFVGDGGT +G+N NNTVI+ +GLL IHMP Y++++ KGVAT+MVSYSS NG KMHANK LIT FLKN L F+
Subjt:  LQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQ

Query:  GFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIYGL----------------------KRILRVKFVMGLFENSLADLSLVNELGKKEHRE
        G VISD+ G+D+I TP  ANY++S+ A+ TAG+DM     N+  L                      KRILRVKF MGLFEN +AD SL  +LG KEHRE
Subjt:  GFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIYGL----------------------KRILRVKFVMGLFENSLADLSLVNELGKKEHRE

Query:  LAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDK
        LAREAVRKSLVLLKNGE+ADKPLLPLPKKA K  +LVA     ++    G W        T+    +  N L +GTT+LAA+K T+DP+T+VI+ +NPD 
Subjt:  LAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDK

Query:  EFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQ
         F ++  F YAIV VGE PYAE                                +GRPVV+Q  I++IDALVAAWLPGTEG+G+ DVLFGDYGFTGKL++
Subjt:  EFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQ

Query:  TWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKP
        TWFKTVDQLPMN GDPHYDPL+P G+GL TKP
Subjt:  TWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKP

AT5G20950.1 Glycosyl hydrolase family protein3.1e-17853.79Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        ++KVL +++   ++  +E       L+Y+DPKQPL ARI DL+ RMTL+EKIGQMVQIER+ A+  +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DP LVK+IG ATALEVRATGIPYAFAPC+AVCRDPRWGRCYESYSED +IVQ MTEII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
         GLQG++ P  RKGVP V GK KVAACAKHFVGDGGT +GI+ENNTVID  GL  IHMPGYYN++ KGVATIMVSYS+WNG +MHANK L+T FLKN L 
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIY----------------------GLKRILRVKFVMGLFENSLADLSLVNELGKKEH
        F+GFVISDWQGIDRITTPPH NY+YS+ A ++AGIDM+ +  N                         LKRILRVKF MGLFE  LADLS  N+LG KEH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIY----------------------GLKRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNG++  KPLLPLPKK+ K  +LVA     ++    G W        T+    +  N   +GTT+LAA+K+T+ P T+V++ +NP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
        D  F +S KF YAIVVVGE PYAE                                +GRPVVIQPY+++IDALVAAWLPGTEG+G+ D LFGDYGFTGKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKP
        ++TWFK+V QLPMN GD HYDPL+P G+GLTTKP
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKP

AT5G20950.2 Glycosyl hydrolase family protein3.1e-17853.79Show/hide
Query:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------
        ++KVL +++   ++  +E       L+Y+DPKQPL ARI DL+ RMTL+EKIGQMVQIER+ A+  +MKK                              
Subjt:  MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKK------------------------------

Query:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII
                                                  DP LVK+IG ATALEVRATGIPYAFAPC+AVCRDPRWGRCYESYSED +IVQ MTEII
Subjt:  ------------------------------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEII

Query:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH
         GLQG++ P  RKGVP V GK KVAACAKHFVGDGGT +GI+ENNTVID  GL  IHMPGYYN++ KGVATIMVSYS+WNG +MHANK L+T FLKN L 
Subjt:  SGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH

Query:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIY----------------------GLKRILRVKFVMGLFENSLADLSLVNELGKKEH
        F+GFVISDWQGIDRITTPPH NY+YS+ A ++AGIDM+ +  N                         LKRILRVKF MGLFE  LADLS  N+LG KEH
Subjt:  FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIY----------------------GLKRILRVKFVMGLFENSLADLSLVNELGKKEH

Query:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP
        RELAREAVRKSLVLLKNG++  KPLLPLPKK+ K  +LVA     ++    G W        T+    +  N   +GTT+LAA+K+T+ P T+V++ +NP
Subjt:  RELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP

Query:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL
        D  F +S KF YAIVVVGE PYAE                                +GRPVVIQPY+++IDALVAAWLPGTEG+G+ D LFGDYGFTGKL
Subjt:  DKEFFQSHKFSYAIVVVGEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKL

Query:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKP
        ++TWFK+V QLPMN GD HYDPL+P G+GLTTKP
Subjt:  SQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAAGTTCTCATCATTTTGATGGGGCTTTTGCTCATTTGTTTCTCTGAAGCATTGACAAAAGCTGAGTACTTGAGATATAGAGATCCAAAACAACCATTAAGAGC
TCGAATCAATGACCTCCTTGGTCGAATGACTCTTGAGGAAAAGATAGGTCAAATGGTGCAAATTGAAAGGACTAATGCTTCAACTACGATTATGAAAAAGGATCCTCAAC
TTGTAAAGAAGATTGGTGTTGCTACTGCACTTGAAGTTAGAGCTACCGGAATTCCTTATGCTTTTGCACCTTGTGTAGCGGTTTGCAGAGATCCACGATGGGGTCGATGT
TATGAAAGCTACAGCGAAGATCCTAAAATTGTTCAAGCAATGACTGAGATCATATCAGGATTACAAGGAGAGATTCCACCAAATTCTCGCAAAGGTGTTCCTCATGTTGC
TGGAAAAGAAAAGGTTGCAGCTTGTGCAAAGCACTTTGTGGGCGATGGTGGAACGACTAAGGGTATTAATGAGAACAACACAGTGATAGATAGACATGGATTACTTAGCA
TTCACATGCCAGGTTACTATAACTCAATCATTAAGGGAGTTGCAACCATTATGGTTTCTTATTCAAGTTGGAATGGAGAGAAGATGCATGCAAACAAAAATCTTATAACC
GACTTTCTTAAGAACACTCTTCATTTTCAGGGTTTTGTAATCTCAGATTGGCAAGGTATTGATAGGATTACCACTCCACCTCATGCTAACTATACGTATTCCATTTTAGC
AAGTGTTACTGCTGGTATTGACATGGTTGGTATTGTGTTGAACATTTACGGGTTGAAGAGAATATTGAGAGTTAAATTTGTCATGGGTTTATTTGAGAACTCATTAGCTG
ACCTAAGCTTGGTTAATGAGCTTGGTAAAAAGGAGCATAGAGAACTCGCTAGAGAAGCTGTAAGGAAATCATTAGTGTTGTTAAAGAATGGAGAATCAGCTGACAAACCA
TTGCTACCACTTCCAAAGAAGGCACAAAAATACTTGTTGCTGGTAGCCATGCAAACAACCTTGGATATCAATGTGGCGGTTGGACTATGGAATGGCAAGGACTTAGTGGC
AACAACCTTACTATTGCACAATATTTATATCAACTTCCTTTTGCTAGGTACAACCGTGCTTGCAGCTATAAAAGATACAATTGATCCTGAAACTGAAGTTATATTTAAGG
AGAATCCAGATAAGGAGTTTTTTCAATCACACAAATTTTCTTATGCCATTGTTGTAGTGGGAGAATATCCATATGCAGAAACCAATGGGCGGCCTGTAGTAATCCAACCT
TATATTGCTTCAATTGATGCACTTGTTGCTGCTTGGCTTCCTGGAACTGAAGGCAAAGGCATTACTGATGTGTTATTTGGGGACTATGGCTTTACCGGCAAGCTTTCGCA
AACATGGTTCAAGACTGTTGATCAGCTGCCAATGAATTTTGGAGATCCACATTACGATCCCCTTTTTCCGCTTGGATATGGTCTTACTACTAAGCCTATCATAACCAAAT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAAAGTTCTCATCATTTTGATGGGGCTTTTGCTCATTTGTTTCTCTGAAGCATTGACAAAAGCTGAGTACTTGAGATATAGAGATCCAAAACAACCATTAAGAGC
TCGAATCAATGACCTCCTTGGTCGAATGACTCTTGAGGAAAAGATAGGTCAAATGGTGCAAATTGAAAGGACTAATGCTTCAACTACGATTATGAAAAAGGATCCTCAAC
TTGTAAAGAAGATTGGTGTTGCTACTGCACTTGAAGTTAGAGCTACCGGAATTCCTTATGCTTTTGCACCTTGTGTAGCGGTTTGCAGAGATCCACGATGGGGTCGATGT
TATGAAAGCTACAGCGAAGATCCTAAAATTGTTCAAGCAATGACTGAGATCATATCAGGATTACAAGGAGAGATTCCACCAAATTCTCGCAAAGGTGTTCCTCATGTTGC
TGGAAAAGAAAAGGTTGCAGCTTGTGCAAAGCACTTTGTGGGCGATGGTGGAACGACTAAGGGTATTAATGAGAACAACACAGTGATAGATAGACATGGATTACTTAGCA
TTCACATGCCAGGTTACTATAACTCAATCATTAAGGGAGTTGCAACCATTATGGTTTCTTATTCAAGTTGGAATGGAGAGAAGATGCATGCAAACAAAAATCTTATAACC
GACTTTCTTAAGAACACTCTTCATTTTCAGGGTTTTGTAATCTCAGATTGGCAAGGTATTGATAGGATTACCACTCCACCTCATGCTAACTATACGTATTCCATTTTAGC
AAGTGTTACTGCTGGTATTGACATGGTTGGTATTGTGTTGAACATTTACGGGTTGAAGAGAATATTGAGAGTTAAATTTGTCATGGGTTTATTTGAGAACTCATTAGCTG
ACCTAAGCTTGGTTAATGAGCTTGGTAAAAAGGAGCATAGAGAACTCGCTAGAGAAGCTGTAAGGAAATCATTAGTGTTGTTAAAGAATGGAGAATCAGCTGACAAACCA
TTGCTACCACTTCCAAAGAAGGCACAAAAATACTTGTTGCTGGTAGCCATGCAAACAACCTTGGATATCAATGTGGCGGTTGGACTATGGAATGGCAAGGACTTAGTGGC
AACAACCTTACTATTGCACAATATTTATATCAACTTCCTTTTGCTAGGTACAACCGTGCTTGCAGCTATAAAAGATACAATTGATCCTGAAACTGAAGTTATATTTAAGG
AGAATCCAGATAAGGAGTTTTTTCAATCACACAAATTTTCTTATGCCATTGTTGTAGTGGGAGAATATCCATATGCAGAAACCAATGGGCGGCCTGTAGTAATCCAACCT
TATATTGCTTCAATTGATGCACTTGTTGCTGCTTGGCTTCCTGGAACTGAAGGCAAAGGCATTACTGATGTGTTATTTGGGGACTATGGCTTTACCGGCAAGCTTTCGCA
AACATGGTTCAAGACTGTTGATCAGCTGCCAATGAATTTTGGAGATCCACATTACGATCCCCTTTTTCCGCTTGGATATGGTCTTACTACTAAGCCTATCATAACCAAAT
GA
Protein sequenceShow/hide protein sequence
MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKKDPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRC
YESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLIT
DFLKNTLHFQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIYGLKRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKP
LLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAETNGRPVVIQP
YIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIITK