CuGenDBv2

Sgr018052 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018052
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: beta-glucosidase BoGH3B-like
Genome location: tig00153092:488545..490404
RNA-Seq Expression: Sgr018052
Synteny: Sgr018052
Gene Ontology terms:
GO:0009251 - glucan catabolic process (biological process)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domains:
IPR002772 - Glycoside hydrolase family 3 C-terminal domain
IPR017853 - Glycoside hydrolase superfamily
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily
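
The genome location field uses a contig:start..end layout. A minimal Python sketch for splitting such a string and computing the genomic span is given here. The function names parse_location and span_length are hypothetical, and the sketch assumes that the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span.

    Assumption: coordinates are 1-based and inclusive.
    """
    return end - start + 1


contig, start, end = parse_location("tig00153092:488545..490404")
print(contig, start, end, span_length(start, end))
```

The genomic span can be longer than the CDS listed under Sequences, since the span may include introns and untranslated regions.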


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057201.1 beta-glucosidase BoGH3B-like [Cucumis melo var. makuwa] (e value: 2.2e-128; % identity: 67.12)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEII GLQGEIP  SRKGVPYVAG++        +VGDGGTTKGINENNT+  RHGLLSIHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQGID+IT+PPHAN+TYSI+A ITAG+DM         FIDGLTYLVK N+I ISRIDD VKRILRVK +MGLFEN LAD S VNE+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        GKK HRELAREAVRKSLVLLKNGESADKP+LPLP K PKILVAGSHANNLG+QCG WT+EWQGL GN LTSGT +L+A KDT+DP+T+V+FKENP+ EF 
Subjt:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        +S+KFSYAIVV+GE+PYAET GDSLNLTIP PGSSTITNVCG +K VVI+IS RPVV+Q YI+SIDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

XP_004141128.1 uncharacterized protein LOC101223112 [Cucumis sativus] (e value: 5.8e-129; % identity: 67.39)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEII GLQGEIP  SRKGVPYVAG++        +VGDGGTTKG+NENNT+  RHGLLSIHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQGID+IT+PPHAN+TYSI+A ITAG+DM         FIDGLTYLVK N+I ISRIDD VKRILRVK VMGLFEN LAD S VNE+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        GKK HRELAREAVRKSLVLLKNGESADKP+LPLP K PKILVAGSHANNLG+QCG WT+EWQGL GN LTSGT +L+A KDT+DP+T+V+FKENP+ EF 
Subjt:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        +S+KFSYAIVV+GEYPYAET GDSLNLTIP PG STITNVCGA+K VVI+IS RPVV+Q YI+SIDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

XP_008464959.1 PREDICTED: beta-glucosidase BoGH3B-like [Cucumis melo] (e value: 2.2e-128; % identity: 67.12)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEII GLQGEIP  SRKGVPYVAG++        +VGDGGTTKGINENNT+  RHGLLSIHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQGID+IT+PPHAN+TYSI+A ITAG+DM         FIDGLTYLVK N+I ISRIDD VKRILRVK +MGLFEN LAD S VNE+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        GKK HRELAREAVRKSLVLLKNGESADKP+LPLP K PKILVAGSHANNLG+QCG WT+EWQGL GN LTSGT +L+A KDT+DP+T+V+FKENP+ EF 
Subjt:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        +S+KFSYAIVV+GE+PYAET GDSLNLTIP PGSSTITNVCG +K VVI+IS RPVV+Q YI+SIDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

XP_038905524.1 LOW QUALITY PROTEIN: beta-glucosidase BoGH3B-like [Benincasa hispida] (e value: 3.4e-129; % identity: 68.38)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        M EII GLQG+IPP SRKGVPYVAGKK        FVGDGGTTKGINENNTVIDRH LLSIHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQGIDKIT+PPH+N+TYSI+AS+ AGVDM         FIDGLTYLVKNN I ISRIDD VKRILRVK +MGLFEN LADLSL+NE+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GK-KHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        GK +HRELAREAVRKSLVLLKNG+  ++PLLPLP KAPKILVAGSHANNLG QCG WTMEWQG SGN LT GT +LAA KDT+DPET+VIF+ENP+ EF 
Subjt:  GK-KHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVA
        +SH FSYAIVV+GE PYAETNGDSLNLTIPHPG  TITNVCG +K VVI+IS RPVVIQ YIAS+DALVA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVA

XP_038905533.1 LOW QUALITY PROTEIN: beta-glucosidase BoGH3B-like [Benincasa hispida] (e value: 2.2e-128; % identity: 68.02)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGK--KGGSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF------------------------------------
        MTEIIPGLQG+IPP SRKGVPYVAGK       FVGDGGTTKGINEN+TVIDRH LLSIHMPG+                                    
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGK--KGGSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF------------------------------------

Query:  --------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEVGK
                IS+WQGID+IT+PPH+N+TYSI+AS+ AGVDM         FIDGLTYLVKNN I ISRIDD VKRILRVK +MGLFEN LADLSL+NE+GK
Subjt:  --------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEVGK

Query:  -KHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFFQS
         +HRELAREAVRKSLVLLKNG+  ++PLLPLP KAPKILVAGSHANNLG QCG WT+EWQGLSGN LT GT +LAA KDT+DPET+VIF+ENP+ EF +S
Subjt:  -KHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFFQS

Query:  HKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        H FSYAIVV+GE  YAETNGDSLNLTIPHPG  TITNVCG +K VVI+IS RPVVIQ YIAS+DALVA+
Subjt:  HKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LFL8 Uncharacterized protein (e value: 2.8e-129; % identity: 67.39)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEII GLQGEIP  SRKGVPYVAG++        +VGDGGTTKG+NENNT+  RHGLLSIHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQGID+IT+PPHAN+TYSI+A ITAG+DM         FIDGLTYLVK N+I ISRIDD VKRILRVK VMGLFEN LAD S VNE+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        GKK HRELAREAVRKSLVLLKNGESADKP+LPLP K PKILVAGSHANNLG+QCG WT+EWQGL GN LTSGT +L+A KDT+DP+T+V+FKENP+ EF 
Subjt:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        +S+KFSYAIVV+GEYPYAET GDSLNLTIP PG STITNVCGA+K VVI+IS RPVV+Q YI+SIDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

A0A1S3CPA1 beta-glucosidase BoGH3B-like (e value: 1.1e-128; % identity: 67.12)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEII GLQGEIP  SRKGVPYVAG++        +VGDGGTTKGINENNT+  RHGLLSIHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQGID+IT+PPHAN+TYSI+A ITAG+DM         FIDGLTYLVK N+I ISRIDD VKRILRVK +MGLFEN LAD S VNE+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        GKK HRELAREAVRKSLVLLKNGESADKP+LPLP K PKILVAGSHANNLG+QCG WT+EWQGL GN LTSGT +L+A KDT+DP+T+V+FKENP+ EF 
Subjt:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        +S+KFSYAIVV+GE+PYAET GDSLNLTIP PGSSTITNVCG +K VVI+IS RPVV+Q YI+SIDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

A0A5A7T9L3 Beta-glucosidase BoGH3B-like (e value: 1.5e-127; % identity: 67.65)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEIIPGLQGEIPP SRKGVPYVAGK+        +VGDGGTTKGI+ENNTVIDRHGLLSIHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQ ID+IT+PPHAN+TYSILAS+TAG+DM         FIDGLTYLV NN I I+RIDD VKRILRVK +MGLFEN +ADLSLVNE+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GK-KHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        GK +HRELAREAVRKSLVLLKNG+SADKPLLPL  K  KILVAGSHA+NLGYQCG WT+EWQGLSGN LTSGT VL A KDT+DP TEVIF ENP+K F 
Subjt:  GK-KHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        QS  FSYAIVV+GE+PYAE  GDSLNLTIP PG STITNVCG +K VV+IIS RPVVIQ Y+ S+DALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

A0A5A7UUM0 Beta-glucosidase BoGH3B-like (e value: 1.1e-128; % identity: 67.12)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEII GLQGEIP  SRKGVPYVAG++        +VGDGGTTKGINENNT+  RHGLLSIHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQGID+IT+PPHAN+TYSI+A ITAG+DM         FIDGLTYLVK N+I ISRIDD VKRILRVK +MGLFEN LAD S VNE+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        GKK HRELAREAVRKSLVLLKNGESADKP+LPLP K PKILVAGSHANNLG+QCG WT+EWQGL GN LTSGT +L+A KDT+DP+T+V+FKENP+ EF 
Subjt:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        +S+KFSYAIVV+GE+PYAET GDSLNLTIP PGSSTITNVCG +K VVI+IS RPVV+Q YI+SIDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

A0A5D3D8S5 Beta-glucosidase BoGH3B-like (e value: 1.5e-127; % identity: 67.65)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEIIPGLQGEIPP SRKGVPYVAGK+        +VGDGGTTKGI+ENNTVIDRHGLLSIHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQ ID+IT+PPHAN+TYSILAS+TAG+DM         FIDGLTYLV NN I I+RIDD VKRILRVK +MGLFEN +ADLSLVNE+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GK-KHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        GK +HRELAREAVRKSLVLLKNG+SADKPLLPL  K  KILVAGSHA+NLGYQCG WT+EWQGLSGN LTSGT VL A KDT+DP TEVIF ENP+K F 
Subjt:  GK-KHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        QS  FSYAIVV+GE+PYAE  GDSLNLTIP PG STITNVCG +K VV+IIS RPVVIQ Y+ S+DALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

SwissProt top hits (e value, % identity, alignment)
A7LXU3 Beta-glucosidase BoGH3B (e value: 2.4e-16; % identity: 27.96)
Query:  ISNWQGIDKITNPPH--ANFTYSILASITAGVDM--------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEVG-KKHRELA
        +++W  I+ +    H  A    ++   I AG+DM        F D L  LV+   +S+ RIDD V R+LR+K  +GLF++   D+   ++ G K+   +A
Subjt:  ISNWQGIDKITNPPH--ANFTYSILASITAGVDM--------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEVG-KKHRELA

Query:  REAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSG--TIVLAA----RKDTIDPETEVIFKENPNKEFFQSH
         +A  +S VLLKN    D  +LP+  K  KIL+ G +AN++    G W+  WQG   ++      TI  A      K+ I  E  V +    N  +++ +
Subjt:  REAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSG--TIVLAA----RKDTIDPETEVIFKENPNKEFFQSH

Query:  K------------FSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISR-RPVVIQRYIASIDALV
        K                I  +GE  Y ET G+  +LT+     + +  +    K +V+++++ RP +I   +    A+V
Subjt:  K------------FSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISR-RPVVIQRYIASIDALV

P33363 Periplasmic beta-glucosidase (e value: 2.2e-06; % identity: 27.22)
Query:  SILASITAGVDMFIDGLTY------LVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSL-------VNEVGKKHRELAREAVRKSLVLLKNGESAD
        ++  ++ +G++M +    Y      L+K+  ++++ +DD  + +L VK  MGLF +  + L          N   + HR+ ARE  R+SLVLLKN     
Subjt:  SILASITAGVDMFIDGLTY------LVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSL-------VNEVGKKHRELAREAVRKSLVLLKNGESAD

Query:  KPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPN
           LPL  K+  I V G  A++     G W+    G++   +T    VL   K+ +    +V++ +  N
Subjt:  KPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPN

Q2UFP8 Probable beta-glucosidase C (e value: 1.6e-04; % identity: 34.82)
Query:  ITAGVDMF-----IDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV--GKKHRELAREAVRKSLVLLKNGESADKPLLPLP--NK
        + AG D F      + +  LV+  IIS  RID  V+R+L+ K V+GLF+N   D      V        L REA R+S  LL N E     ++PL    K
Subjt:  ITAGVDMF-----IDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV--GKKHRELAREAVRKSLVLLKNGESADKPLLPLP--NK

Query:  APKILVAGSHAN
        + K  + G +A+
Subjt:  APKILVAGSHAN

Q56078 Periplasmic beta-glucosidase (e value: 3.8e-06; % identity: 24.45)
Query:  SILASITAGVDMFIDGLTY------LVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSL-------VNEVGKKHRELAREAVRKSLVLLKNGESAD
        ++  ++ AGVDM +    Y      L+K+  ++++ +DD  + +L VK  MGLF +  + L          N   + HR+ ARE  R+S+VLLKN     
Subjt:  SILASITAGVDMFIDGLTY------LVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSL-------VNEVGKKHRELAREAVRKSLVLLKNGESAD

Query:  KPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPN-----------------------------KE
           LPL  K+  I V G  A++     G W+    G++   +T    VLA  ++ +    ++++ +  N                              E
Subjt:  KPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPN-----------------------------KE

Query:  FFQSHKFSYAIV-VLGE-YPYAETNGDSLNLTIPHPGSSTITNVCGAMK-YVVIIISRRPVVIQRYIASIDALV
          Q+ K +  +V V+GE    A       N+TIP      IT +    K  V+++++ RP+ + +     DA++
Subjt:  FFQSHKFSYAIV-VLGE-YPYAETNGDSLNLTIPHPGSSTITNVCGAMK-YVVIIISRRPVVIQRYIASIDALV

Arabidopsis top hits (e value, % identity, alignment)
AT3G47000.1 Glycosyl hydrolase family protein (e value: 5.3e-72; % identity: 43.97)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMP------------------------------------
        MT ++ GLQG  P     G P+VAG+         FVGDGGT KGINE NT+     L  IH+P                                    
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMP------------------------------------

Query:  -------GF-ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
               GF +S+W+G+D+++ P  +N+ Y I  ++ AG+DM         FI  +T LV++  I ++RI+D V+RILRVK V GLF + L D SL+  V
Subjt:  -------GF-ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  G-KKHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        G K+HRELA+EAVRKSLVLLK+G++ADKP LPL   A +ILV G+HA++LGYQCG WT  W GLSG ++T GT +L A K+ +  ETEVI+++ P+KE  
Subjt:  G-KKHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHK-FSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQ-RYIASIDALVAA
         S + FSYAIV +GE PYAET GD+  L IP  G+  +T V   +  +VI+IS RPVV++   +   +ALVAA
Subjt:  QSHK-FSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQ-RYIASIDALVAA

AT5G04885.1 Glycosyl hydrolase family protein (e value: 4.3e-98; % identity: 51.48)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MT++I GLQGE P   + GVP+V G+         +VGDGGTT+G+NENNTV D HGLLS+HMP +                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQG+DKI+ PPH ++T S+ A+I AG+DM         F++ LT LVKNN I ++RIDD V+RIL VK  MGLFEN LAD S  +E+
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        G + HR+LAREAVRKSLVLLKNG   + P+LPLP K  KILVAG+HA+NLGYQCG WT+ WQG SGNK T GT +L+A K  +D  TEV+F+ENP+ EF 
Subjt:  GKK-HRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        +S+ F+YAI+ +GE PYAET GDS  LT+  PG + I++ C A+K VV++IS RP+V++ Y+ASIDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

AT5G20940.1 Glycosyl hydrolase family protein (e value: 1.0e-99; % identity: 54.72)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEIIPGLQG++ PT +KGVP+VAGK         FVGDGGT +G+N NNTVI+ +GLL IHMP +                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDMF---------IDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS++ G+D+I  P  AN+++S+ A+ TAG+DMF         ID LT  VK   I +SRIDD VKRILRVK  MGLFEN +AD SL  ++
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDMF---------IDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  G-KKHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        G K+HRELAREAVRKSLVLLKNGE+ADKPLLPLP KA KILVAG+HA+NLGYQCG WT+ WQGL+GN LT GT +LAA K T+DP+T+VI+ +NP+  F 
Subjt:  G-KKHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        ++  F YAIV +GE PYAE  GDS NLTI  PG STI NVC ++K VV+++S RPVV+Q  I++IDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

AT5G20950.1 Glycosyl hydrolase family protein (e value: 4.7e-105; % identity: 54.99)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEIIPGLQG++ PT RKGVP+V GK         FVGDGGT +GI+ENNTVID  GL  IHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQGID+IT PPH N++YS+ A I+AG+DM         FID ++  ++  +I ISRIDD +KRILRVK  MGLFE  LADLS  N++
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  G-KKHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        G K+HRELAREAVRKSLVLLKNG++  KPLLPLP K+ KILVAG+HA+NLGYQCG WT+ WQGL+GN  T GT +LAA K+T+ P T+V++ +NP+  F 
Subjt:  G-KKHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        +S KF YAIVV+GE PYAE  GD+ NLTI  PG S I NVCG++K VV+++S RPVVIQ Y+++IDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA

AT5G20950.2 Glycosyl hydrolase family protein (e value: 4.7e-105; % identity: 54.99)
Query:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------
        MTEIIPGLQG++ PT RKGVP+V GK         FVGDGGT +GI+ENNTVID  GL  IHMPG+                                  
Subjt:  MTEIIPGLQGEIPPTSRKGVPYVAGKKG----GSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGF----------------------------------

Query:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV
                  IS+WQGID+IT PPH N++YS+ A I+AG+DM         FID ++  ++  +I ISRIDD +KRILRVK  MGLFE  LADLS  N++
Subjt:  ----------ISNWQGIDKITNPPHANFTYSILASITAGVDM---------FIDGLTYLVKNNIISISRIDDVVKRILRVKCVMGLFENRLADLSLVNEV

Query:  G-KKHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF
        G K+HRELAREAVRKSLVLLKNG++  KPLLPLP K+ KILVAG+HA+NLGYQCG WT+ WQGL+GN  T GT +LAA K+T+ P T+V++ +NP+  F 
Subjt:  G-KKHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLAARKDTIDPETEVIFKENPNKEFF

Query:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
        +S KF YAIVV+GE PYAE  GD+ NLTI  PG S I NVCG++K VV+++S RPVVIQ Y+++IDALVAA
Subjt:  QSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
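
In BLAST-style alignment blocks such as those above, the middle row repeats a residue where query and subject are identical, shows "+" for a conservative substitution, and is blank otherwise. As a rough illustration of how a % identity figure relates to such a block, the Python sketch below counts identical columns from a query row and its midline. The function name percent_identity is hypothetical, and the sketch assumes identity is reported as identical columns divided by alignment length. Values computed per block from this page will not necessarily reproduce the reported figures, which cover the whole alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline repeats the query residue.

    Assumption: identity = identical columns / alignment length, gaps included.
    """
    if not query:
        raise ValueError("empty alignment row")
    # Trailing blanks in the midline are often trimmed; pad them back.
    midline = midline.ljust(len(query))
    identical = sum(
        1
        for q, m in zip(query, midline)
        if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# Toy example: three identical columns (A, C, F) out of five.
print(percent_identity("ACDEF", "AC+ F"))  # prints 60.0
```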


Sequences
CDS sequence
ATGATGACTGAAATCATACCAGGTTTACAAGGAGAGATCCCACCTACTTCGCGCAAGGGTGTTCCTTATGTCGCTGGAAAAAAAGGTGGTAGCTTCTTTGTGGGCGATGG
TGGAACAACTAAAGGTATCAATGAGAACAACACGGTGATAGATAGGCATGGATTACTTAGCATTCACATGCCAGGGTTTATCTCAAATTGGCAGGGTATTGATAAGATTA
CCAATCCACCTCATGCTAACTTTACATATTCCATTTTAGCAAGCATTACTGCTGGTGTTGATATGTTCATTGATGGCCTTACCTACTTGGTGAAAAATAATATAATTTCT
ATTAGTCGAATTGATGATGTAGTGAAGAGAATATTGCGAGTAAAATGTGTTATGGGTTTATTTGAGAACCGATTAGCTGACTTAAGCTTGGTTAATGAGGTTGGTAAAAA
GCATAGAGAGCTAGCTAGAGAAGCTGTAAGGAAATCACTAGTATTGTTAAAGAATGGAGAATCGGCTGACAAACCGTTGCTACCCCTTCCAAATAAAGCACCAAAAATAC
TTGTTGCTGGTAGCCATGCAAACAACCTTGGATATCAGTGTGGCGATTGGACTATGGAGTGGCAAGGACTTAGTGGCAACAAGCTTACTAGTGGTACAATTGTGCTTGCA
GCTAGAAAAGATACCATTGATCCTGAAACAGAAGTTATATTTAAGGAGAATCCAAATAAAGAATTTTTCCAATCACACAAATTTTCTTATGCCATTGTTGTACTGGGAGA
ATATCCATATGCCGAAACCAATGGCGATAGCTTGAATTTAACAATTCCCCACCCTGGTTCAAGCACCATCACAAATGTTTGTGGAGCTATGAAATATGTAGTTATAATAA
TCTCAAGGCGGCCTGTAGTAATCCAACGTTATATTGCTTCAATAGATGCACTTGTTGCTGCATAG
mRNA sequence
ATGATGACTGAAATCATACCAGGTTTACAAGGAGAGATCCCACCTACTTCGCGCAAGGGTGTTCCTTATGTCGCTGGAAAAAAAGGTGGTAGCTTCTTTGTGGGCGATGG
TGGAACAACTAAAGGTATCAATGAGAACAACACGGTGATAGATAGGCATGGATTACTTAGCATTCACATGCCAGGGTTTATCTCAAATTGGCAGGGTATTGATAAGATTA
CCAATCCACCTCATGCTAACTTTACATATTCCATTTTAGCAAGCATTACTGCTGGTGTTGATATGTTCATTGATGGCCTTACCTACTTGGTGAAAAATAATATAATTTCT
ATTAGTCGAATTGATGATGTAGTGAAGAGAATATTGCGAGTAAAATGTGTTATGGGTTTATTTGAGAACCGATTAGCTGACTTAAGCTTGGTTAATGAGGTTGGTAAAAA
GCATAGAGAGCTAGCTAGAGAAGCTGTAAGGAAATCACTAGTATTGTTAAAGAATGGAGAATCGGCTGACAAACCGTTGCTACCCCTTCCAAATAAAGCACCAAAAATAC
TTGTTGCTGGTAGCCATGCAAACAACCTTGGATATCAGTGTGGCGATTGGACTATGGAGTGGCAAGGACTTAGTGGCAACAAGCTTACTAGTGGTACAATTGTGCTTGCA
GCTAGAAAAGATACCATTGATCCTGAAACAGAAGTTATATTTAAGGAGAATCCAAATAAAGAATTTTTCCAATCACACAAATTTTCTTATGCCATTGTTGTACTGGGAGA
ATATCCATATGCCGAAACCAATGGCGATAGCTTGAATTTAACAATTCCCCACCCTGGTTCAAGCACCATCACAAATGTTTGTGGAGCTATGAAATATGTAGTTATAATAA
TCTCAAGGCGGCCTGTAGTAATCCAACGTTATATTGCTTCAATAGATGCACTTGTTGCTGCATAG
Protein sequence
MMTEIIPGLQGEIPPTSRKGVPYVAGKKGGSFFVGDGGTTKGINENNTVIDRHGLLSIHMPGFISNWQGIDKITNPPHANFTYSILASITAGVDMFIDGLTYLVKNNIIS
ISRIDDVVKRILRVKCVMGLFENRLADLSLVNEVGKKHRELAREAVRKSLVLLKNGESADKPLLPLPNKAPKILVAGSHANNLGYQCGDWTMEWQGLSGNKLTSGTIVLA
ARKDTIDPETEVIFKENPNKEFFQSHKFSYAIVVLGEYPYAETNGDSLNLTIPHPGSSTITNVCGAMKYVVIIISRRPVVIQRYIASIDALVAA
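
The protein sequence is expected to be the translation of the CDS sequence. The Python sketch below shows one way to check this, assuming the standard genetic code and a CDS that starts in frame. The names CODON_TABLE and translate are hypothetical helpers written for this illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
print(translate("ATGATGACTGAAATCATACCAGGT"))  # prints MMTEIIPG
```

Passing the full CDS text, with line breaks left in, and comparing the result with the protein sequence lines joined together gives a quick consistency check on the record.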