CuGenDBv2

ClCG07G004840 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG07G004840
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Glycosyl hydrolase family protein
Genome location: CG_Chr07:6230239..6231900
RNA-Seq Expression: ClCG07G004840
Synteny: ClCG07G004840
Gene Ontology terms:
  GO:0009251 - glucan catabolic process (biological process)
  GO:0005576 - extracellular region (cellular component)
  GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domains:
  IPR001764 - Glycoside hydrolase, family 3, N-terminal
  IPR017853 - Glycoside hydrolase superfamily
  IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily
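The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span of the gene can be derived as follows; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end, length)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the
    # length is end - start + 1.
    return seqid, start, end, end - start + 1

print(parse_location("CG_Chr07:6230239..6231900"))
# ('CG_Chr07', 6230239, 6231900, 1662)
```

Under that assumption the locus spans 1662 bp, which is longer than the 840-nt CDS listed in the Sequences section.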


Homology
GenBank top hits | e value | % identity | Alignment
XP_004137360.1 uncharacterized protein LOC101204835 [Cucumis sativus] | e value: 3.4e-96 | % identity: 80.7
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        MGFWLLLCCL V TDATYLKYKDPKQPLGARIKDLM   TLEEKIGQMVQIE+ VATPDVMKNYFIGSVLSGGG+                   KG LAT
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+GIPMIYGIDAVHGHNNVYNATIFP  VGLGVTRD ELLRRIG+ATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQ+ EII G QGAI
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDG
        PSNSRK IPFVAGKQ VAAC KHF+GDG
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDG

XP_008453517.1 PREDICTED: beta-glucosidase BoGH3B-like [Cucumis melo] | e value: 1.3e-95 | % identity: 80.26
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        MGFWLLLCCL V TDATYLKYKDPKQPLGARIKDLM   TLEEKIGQMVQIE+ VATPDVMKNYFIGSVLSGGG+                   KG LAT
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+GIPMIYGIDAVHGHNNVYNATIFP  VGLGVTRD ELLRRIG+ATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQ+ EII G QGAI
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDG
        P NSRK IPFVAGKQ VAAC KHF+GDG
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDG

XP_022135118.1 uncharacterized protein LOC111007174 [Momordica charantia] | e value: 7.5e-96 | % identity: 80.7
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        +GFWLLLCCLAV TDATYLKY+DPKQPLGARIKDLM   TLEEKIGQMVQIE+KVATPDVMKNYFIGSVLSGGG+                   KG LAT
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+GIPMIYGIDAVHGHNNVYNATIFP  VGLGVTRD  LLRRIGDATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQM EII G QG I
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDG
        PSNSRK IPFVAGKQ VAAC KHF+GDG
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDG

XP_022933885.1 uncharacterized protein LOC111441165 [Cucurbita moschata] | e value: 1.4e-94 | % identity: 62.78
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLM-ADTLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        +GFWLLLCCLAV TDATYLKY+DPKQPLGARIKDLM   TLEEKIGQMVQIE+KVATPDVMKNYFIGSVLSGGG+                   KG L+T
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLM-ADTLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+ IPMIYGIDA+HGHNN YNATIFP  +GLGVTRD ELLRRIG+ATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQM EII G QGAI
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDGAQPGLRVPKIIIHTRMLTLLGLLDFH-KAWKMTLSRGEEAIKC-----------LERDLTVN-LPDDSKDQI
        P+NSRK IPFV GKQ VAAC KHFLGDG         I  +  ++   GLL  H  A+  ++++G   +               RDL    L +  K + 
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDGAQPGLRVPKIIIHTRMLTLLGLLDFH-KAWKMTLSRGEEAIKC-----------LERDLTVN-LPDDSKDQI

Query:  LVLGDWDNL
         V+ DW  +
Subjt:  LVLGDWDNL

XP_038879149.1 beta-glucosidase BoGH3B-like [Benincasa hispida] | e value: 1.8e-97 | % identity: 82.02
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        MGFWLLLCCLAV TDATYLKYKDPKQPLGARIKDLM   TLEEKIGQMVQIE+KVATPDVMKNYFIGSVLSGGG+                   KG LAT
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+GIPMIYGIDAVHGHNNVYNATIFP  VGLGVTRD ELLRRIGDATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQM EII G QGAI
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDG
        P NSRK IPFVAGKQ VAAC KHF+GDG
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDG

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LV53 Uncharacterized protein | e value: 1.6e-96 | % identity: 80.7
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        MGFWLLLCCL V TDATYLKYKDPKQPLGARIKDLM   TLEEKIGQMVQIE+ VATPDVMKNYFIGSVLSGGG+                   KG LAT
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+GIPMIYGIDAVHGHNNVYNATIFP  VGLGVTRD ELLRRIG+ATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQ+ EII G QGAI
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDG
        PSNSRK IPFVAGKQ VAAC KHF+GDG
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDG

A0A1S3BXL6 beta-glucosidase BoGH3B-like | e value: 6.2e-96 | % identity: 80.26
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        MGFWLLLCCL V TDATYLKYKDPKQPLGARIKDLM   TLEEKIGQMVQIE+ VATPDVMKNYFIGSVLSGGG+                   KG LAT
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+GIPMIYGIDAVHGHNNVYNATIFP  VGLGVTRD ELLRRIG+ATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQ+ EII G QGAI
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDG
        P NSRK IPFVAGKQ VAAC KHF+GDG
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDG

A0A5D3DXL9 Beta-glucosidase BoGH3B-like | e value: 6.2e-96 | % identity: 80.26
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        MGFWLLLCCL V TDATYLKYKDPKQPLGARIKDLM   TLEEKIGQMVQIE+ VATPDVMKNYFIGSVLSGGG+                   KG LAT
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+GIPMIYGIDAVHGHNNVYNATIFP  VGLGVTRD ELLRRIG+ATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQ+ EII G QGAI
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDG
        P NSRK IPFVAGKQ VAAC KHF+GDG
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDG

A0A6J1C0J8 uncharacterized protein LOC111007174 | e value: 3.6e-96 | % identity: 80.7
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        +GFWLLLCCLAV TDATYLKY+DPKQPLGARIKDLM   TLEEKIGQMVQIE+KVATPDVMKNYFIGSVLSGGG+                   KG LAT
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+GIPMIYGIDAVHGHNNVYNATIFP  VGLGVTRD  LLRRIGDATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQM EII G QG I
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDG
        PSNSRK IPFVAGKQ VAAC KHF+GDG
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDG

A0A6J1F630 Beta-glucosidase | e value: 6.8e-95 | % identity: 62.78
Query:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLM-ADTLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT
        +GFWLLLCCLAV TDATYLKY+DPKQPLGARIKDLM   TLEEKIGQMVQIE+KVATPDVMKNYFIGSVLSGGG+                   KG L+T
Subjt:  MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLM-ADTLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLAT

Query:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI
        R+ IPMIYGIDA+HGHNN YNATIFP  +GLGVTRD ELLRRIG+ATALEVRAT IP VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQM EII G QGAI
Subjt:  RIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAI

Query:  PSNSRKEIPFVAGKQNVAACVKHFLGDGAQPGLRVPKIIIHTRMLTLLGLLDFH-KAWKMTLSRGEEAIKC-----------LERDLTVN-LPDDSKDQI
        P+NSRK IPFV GKQ VAAC KHFLGDG         I  +  ++   GLL  H  A+  ++++G   +               RDL    L +  K + 
Subjt:  PSNSRKEIPFVAGKQNVAACVKHFLGDGAQPGLRVPKIIIHTRMLTLLGLLDFH-KAWKMTLSRGEEAIKC-----------LERDLTVN-LPDDSKDQI

Query:  LVLGDWDNL
         V+ DW  +
Subjt:  LVLGDWDNL

SwissProt top hits | e value | % identity | Alignment
A7LXU3 Beta-glucosidase BoGH3B | e value: 7.4e-22 | % identity: 33.5
Query:  TLEEKIGQMVQIEQKVAT-----------------PDVMKNYFIGSVLSGGGNPKGVLATR-----------------IGIPMIYGIDAVHGHNNVYNAT
        TLE+KIGQM +I   V +                   V+  Y +GS+L+    P GV   +                 IGIP IYG+D +HG     + T
Subjt:  TLEEKIGQMVQIEQKVAT-----------------PDVMKNYFIGSVLSGGGNPKGVLATR-----------------IGIPMIYGIDAVHGHNNVYNAT

Query:  IFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQM-IEIIHGSQGAIPSNSRKEIPFVAGKQNVAACVK
        +FP  + +G T + EL RR    +A E +A  IP  FAP + + RDPRW R +++Y ED  +  +M +  + G QG  P+          G+ NVAAC+K
Subjt:  IFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQM-IEIIHGSQGAIPSNSRKEIPFVAGKQNVAACVK

Query:  HFLGDG
        H++G G
Subjt:  HFLGDG

D5EY15 Xylan 1,4-beta-xylosidase | e value: 3.1e-12 | % identity: 26.38
Query:  FWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGNPKGVLATRIGI-PMIYGIDAVHGHNNV
        F  L  C+ +   A  L Y++P      R  DL +  TLEEK   M+ +++  A P                        R+GI    +  +A+HG  N+
Subjt:  FWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGNPKGVLATRIGI-PMIYGIDAVHGHNNV

Query:  YNATIFPDIVGLGVTRDLELLRRIGDATALEVRA---------------TRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQM-IEIIHGSQGAIPSN
         N T FP+ VG+  + +  LL ++ D  + E RA                R   V+ P + + RDPRWGR  ++Y ED  +   M ++++ G QG   + 
Subjt:  YNATIFPDIVGLGVTRDLELLRRIGDATALEVRA---------------TRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQM-IEIIHGSQGAIPSN

Query:  SRKEIPFVAGKQNVAACVKHFLGDGAQPGLRVPKIIIHTRMLTLLGLLDFHKAW
         RK          + AC KH+           P+   HT  LT +   DF + +
Subjt:  SRKEIPFVAGKQNVAACVKHFLGDGAQPGLRVPKIIIHTRMLTLLGLLDFHKAW

P33363 Periplasmic beta-glucosidase | e value: 5.3e-12 | % identity: 27.13
Query:  TLEEKIGQMVQI-----EQKVATPDVMKNYFIGSVLS-------GGGNPKGVLATRIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGD
        T++EKIGQ+  I       K A  +++K+  +G++ +            + +  +R+ IP+ +  D +HG       T+FP  +GL  + +L+ ++ +G 
Subjt:  TLEEKIGQMVQI-----EQKVATPDVMKNYFIGSVLS-------GGGNPKGVLATRIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGD

Query:  ATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIE-IIHGSQGAIPSNSRKEIPFVAGKQNVAACVKHFLGDGAQPG
         +A E     +   +AP + V RDPRWGR  + + ED  +   M + ++   QG  P          A + +V   VKHF   GA  G
Subjt:  ATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIE-IIHGSQGAIPSNSRKEIPFVAGKQNVAACVKHFLGDGAQPG

Q23892 Lysosomal beta glucosidase | e value: 2.0e-14 | % identity: 29.27
Query:  TLEEKIGQMVQIE-QKVATPDVM-----------KNYFIGSVLSG--GGNPKG---------------------VLATRIGIPMIYGIDAVHGHNNVYNA
        ++ EKIGQM Q++   + +P+ +           K Y+IGS L+    G   G                     +  +   IPMIYG+D+VHG N V+ A
Subjt:  TLEEKIGQMVQIE-QKVATPDVM-----------KNYFIGSVLSG--GGNPKG---------------------VLATRIGIPMIYGIDAVHGHNNVYNA

Query:  TIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQM-IEIIHGSQGAIPSNSRKEIPFVAGKQNVAACV
        T+FP   GL  T ++E        T+ +  A  IP VFAP + +   P W R Y+++ ED  +   M    + G QG    N+  + P  A   +     
Subjt:  TIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQM-IEIIHGSQGAIPSNSRKEIPFVAGKQNVAACV

Query:  KHFLG
        KH+ G
Subjt:  KHFLG

Q56078 Periplasmic beta-glucosidase | e value: 6.3e-13 | % identity: 28.79
Query:  ARIKDLMAD-TLEEKIGQMVQI-----EQKVATPDVMKNYFIGSVLS-------GGGNPKGVLATRIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTR
        A + DL+   T++EKIGQ+  I       K A  +++K+  +G++ +            + +  +R+ IP+ +  D VHG       T+FP  +GL  + 
Subjt:  ARIKDLMAD-TLEEKIGQMVQI-----EQKVATPDVMKNYFIGSVLS-------GGGNPKGVLATRIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTR

Query:  DLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIE-IIHGSQGAIPSNSRKEIPFVAGKQNVAACVKHFLGDGAQPG
        +L+ +R +G  +A E     +   +AP + V RDPRWGR  + + ED  +   M E ++   QG  P          A + +V   VKHF   GA  G
Subjt:  DLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIE-IIHGSQGAIPSNSRKEIPFVAGKQNVAACVKHFLGDGAQPG

Arabidopsis top hits | e value | % identity | Alignment
AT3G47000.1 Glycosyl hydrolase family protein | e value: 1.2e-59 | % identity: 53.37
Query:  YKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLATRIGIPMIYGIDAVHGHNNVY
        YK+   P+ AR+KDL++  TL EKIGQM QIE++VA+P    ++FIGSVL+ GG+                   +  LA+R+GIP+IYG DAVHG+NNVY
Subjt:  YKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLATRIGIPMIYGIDAVHGHNNVY

Query:  NATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAIPSNSRKEIPFVAGKQNVAAC
         AT+FP  +GLG TRD +L+RRIG ATALEVRA+ +   F+PC+AV RDPRWGRCY+SY ED ++V +M  ++ G QG  P       PFVAG+ NV AC
Subjt:  NATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAIPSNSRKEIPFVAGKQNVAAC

Query:  VKHFLGDG
        VKHF+GDG
Subjt:  VKHFLGDG

AT5G04885.1 Glycosyl hydrolase family protein | e value: 6.6e-74 | % identity: 61.26
Query:  LCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLATRIGIPM
        +CC     D  YL YKDPKQ +  R+ DL    TLEEKIGQMVQI++ VAT ++M++YFIGSVLSGGG+                   KG L +R+GIPM
Subjt:  LCCLAVNTDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLATRIGIPM

Query:  IYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAIPSNSRK
        IYGIDAVHGHNNVYNATIFP  VGLG TRD +L++RIG ATA+EVRAT IP  FAPCIAVCRDPRWGRCY+SYSEDHK+V+ M ++I G QG  PSN + 
Subjt:  IYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAIPSNSRK

Query:  EIPFVAGKQNVAACVKHFLGDG
         +PFV G+  VAAC KH++GDG
Subjt:  EIPFVAGKQNVAACVKHFLGDG

AT5G20940.1 Glycosyl hydrolase family protein | e value: 4.9e-77 | % identity: 58.91
Query:  MGFWLLLCCLAVN-TDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLA
        +G  LL C +A N       KYKDPK+PLG RIK+LM+  TLEEKIGQMVQ+E+  AT +VM+ YF+GSV SGGG+                   K  L+
Subjt:  MGFWLLLCCLAVN-TDATYLKYKDPKQPLGARIKDLMAD-TLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLA

Query:  TRIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGA
        TR+GIP+IYGIDAVHGHN VYNATIFP  VGLGVTRD  L++RIG+ATALEVRAT I  VFAPCIAVCRDPRWGRCY+SYSEDHKIVQQM EII G QG 
Subjt:  TRIGIPMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGA

Query:  IPSNSRKEIPFVAGKQNVAACVKHFLGDGAQ-PGLRVPKIIIHTRMLTLLGLLDFHKA
        +P+  +K +PFVAGK  VAAC KHF+GDG    G+     +I++  L  + +  +H A
Subjt:  IPSNSRKEIPFVAGKQNVAACVKHFLGDGAQ-PGLRVPKIIIHTRMLTLLGLLDFHKA

AT5G20950.1 Glycosyl hydrolase family protein | e value: 1.6e-80 | % identity: 68.3
Query:  LLLCCLAVNTDATYLKYKDPKQPLGARIKDLM-ADTLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLATRIGI
        +LLCC+    + T LKYKDPKQPLGARI+DLM   TL+EKIGQMVQIE+ VATP+VMK YFIGSVLSGGG+                   K  L+TR+GI
Subjt:  LLLCCLAVNTDATYLKYKDPKQPLGARIKDLM-ADTLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLATRIGI

Query:  PMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAIPSNS
        PMIYGIDAVHGHNNVY ATIFP  VGLGVTRD  L++RIG ATALEVRAT IP  FAPCIAVCRDPRWGRCY+SYSED++IVQQM EII G QG +P+  
Subjt:  PMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAIPSNS

Query:  RKEIPFVAGKQNVAACVKHFLGDG
        RK +PFV GK  VAAC KHF+GDG
Subjt:  RKEIPFVAGKQNVAACVKHFLGDG

AT5G20950.2 Glycosyl hydrolase family protein | e value: 1.6e-80 | % identity: 68.3
Query:  LLLCCLAVNTDATYLKYKDPKQPLGARIKDLM-ADTLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLATRIGI
        +LLCC+    + T LKYKDPKQPLGARI+DLM   TL+EKIGQMVQIE+ VATP+VMK YFIGSVLSGGG+                   K  L+TR+GI
Subjt:  LLLCCLAVNTDATYLKYKDPKQPLGARIKDLM-ADTLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGN------------------PKGVLATRIGI

Query:  PMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAIPSNS
        PMIYGIDAVHGHNNVY ATIFP  VGLGVTRD  L++RIG ATALEVRAT IP  FAPCIAVCRDPRWGRCY+SYSED++IVQQM EII G QG +P+  
Subjt:  PMIYGIDAVHGHNNVYNATIFPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAIPSNS

Query:  RKEIPFVAGKQNVAACVKHFLGDG
        RK +PFV GK  VAAC KHF+GDG
Subjt:  RKEIPFVAGKQNVAACVKHFLGDG


Sequences
CDS sequence
ATGGGGTTTTGGCTGCTGCTTTGCTGCCTGGCCGTTAATACAGATGCAACTTACCTGAAATACAAAGACCCTAAACAGCCATTGGGTGCTAGAATCAAAGATCTT
ATGGCTGATACTTTGGAAGAAAAAATTGGCCAAATGGTTCAGATTGAACAGAAAGTTGCAACCCCAGACGTCATGAAGAACTATTTCATTGGGAGTGTACTAAGC
GGAGGAGGGAATCCAAAAGGGGTCTTAGCCACCCGTATTGGGATCCCTATGATTTATGGGATCGATGCTGTTCATGGTCACAATAATGTGTACAATGCCACTATC
TTTCCTGATATTGTTGGTCTTGGAGTTACCAGGGATTTGGAACTTCTTAGGCGGATTGGGGATGCCACAGCACTTGAAGTCAGAGCAACTAGAATTCCTTGCGTT
TTTGCTCCATGTATAGCGGTGTGCAGAGATCCTAGATGGGGTCGATGCTACAAGAGCTATAGCGAAGATCATAAGATTGTTCAACAAATGATTGAGATTATACAT
GGATCGCAAGGAGCAATTCCTTCTAATTCACGAAAAGAGATTCCTTTTGTTGCGGGAAAACAAAACGTTGCGGCCTGTGTTAAGCACTTCCTAGGAGATGGTGCC
CAACCAGGGTTGAGAGTGCCAAAAATCATAATCCATACCAGAATGCTGACTCTTTTGGGGCTCTTAGACTTCCATAAGGCTTGGAAAATGACCTTATCTAGAGGA
GAAGAAGCAATTAAATGTTTGGAGAGAGATTTGACAGTGAACTTGCCTGATGATTCCAAAGACCAAATCCTTGTATTAGGGGATTGGGATAATCTGTTCTTTTAA
mRNA sequence
ATGGGGTTTTGGCTGCTGCTTTGCTGCCTGGCCGTTAATACAGATGCAACTTACCTGAAATACAAAGACCCTAAACAGCCATTGGGTGCTAGAATCAAAGATCTT
ATGGCTGATACTTTGGAAGAAAAAATTGGCCAAATGGTTCAGATTGAACAGAAAGTTGCAACCCCAGACGTCATGAAGAACTATTTCATTGGGAGTGTACTAAGC
GGAGGAGGGAATCCAAAAGGGGTCTTAGCCACCCGTATTGGGATCCCTATGATTTATGGGATCGATGCTGTTCATGGTCACAATAATGTGTACAATGCCACTATC
TTTCCTGATATTGTTGGTCTTGGAGTTACCAGGGATTTGGAACTTCTTAGGCGGATTGGGGATGCCACAGCACTTGAAGTCAGAGCAACTAGAATTCCTTGCGTT
TTTGCTCCATGTATAGCGGTGTGCAGAGATCCTAGATGGGGTCGATGCTACAAGAGCTATAGCGAAGATCATAAGATTGTTCAACAAATGATTGAGATTATACAT
GGATCGCAAGGAGCAATTCCTTCTAATTCACGAAAAGAGATTCCTTTTGTTGCGGGAAAACAAAACGTTGCGGCCTGTGTTAAGCACTTCCTAGGAGATGGTGCC
CAACCAGGGTTGAGAGTGCCAAAAATCATAATCCATACCAGAATGCTGACTCTTTTGGGGCTCTTAGACTTCCATAAGGCTTGGAAAATGACCTTATCTAGAGGA
GAAGAAGCAATTAAATGTTTGGAGAGAGATTTGACAGTGAACTTGCCTGATGATTCCAAAGACCAAATCCTTGTATTAGGGGATTGGGATAATCTGTTCTTTTAA
Protein sequence
MGFWLLLCCLAVNTDATYLKYKDPKQPLGARIKDLMADTLEEKIGQMVQIEQKVATPDVMKNYFIGSVLSGGGNPKGVLATRIGIPMIYGIDAVHGHNNVYNATI
FPDIVGLGVTRDLELLRRIGDATALEVRATRIPCVFAPCIAVCRDPRWGRCYKSYSEDHKIVQQMIEIIHGSQGAIPSNSRKEIPFVAGKQNVAACVKHFLGDGA
QPGLRVPKIIIHTRMLTLLGLLDFHKAWKMTLSRGEEAIKCLERDLTVNLPDDSKDQILVLGDWDNLFF
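The protein sequence listed above appears to be the translation of the CDS sequence. As a rough sketch, assuming the standard nuclear genetic code applies (the record does not name a translation table), the relationship can be checked like this; `translate` and `CODON_TABLE` are illustrative names only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS (line breaks allowed) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First five codons of the CDS listed above.
print(translate("ATGGGGTTTTGGCTG"))  # MGFWL
```

Passing the full 840-nt CDS to `translate` should give a 279-residue product, matching the length of the protein sequence shown here, since the final codon `TAA` is a stop.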