CuGenDBv2

CSPI02G12690 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G12690
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Glycosyl hydrolase family protein
Genome location: Chr2:12906861..12909125
RNA-Seq Expression: CSPI02G12690
Synteny: CSPI02G12690
Gene Ontology terms:
GO:0005975 - carbohydrate metabolic process (biological process)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domains:
IPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR002772 - Glycoside hydrolase family 3 C-terminal domain
IPR017853 - Glycoside hydrolase superfamily
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily
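
The Genome location field above follows a chromosome:start..end pattern. As a minimal sketch, it could be split into its parts as shown below. The function name parse_location is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption; the record itself does not state its coordinate convention.

```python
import re


def parse_location(text):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr2:12906861..12909125")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2265 bp of Chr2, which is longer than the 1260 nt CDS listed under Sequences.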


Homology
GenBank top hits
KAE8651882.1 hypothetical protein Csa_006396 [Cucumis sativus] (e value: 3.6e-102; % identity: 85.96)
Query:  MRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNNERYSDVLNQRRRREKDFESQVYDGPVCESIGGSSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADK
        MRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNN                             +  SSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADK
Subjt:  MRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNNERYSDVLNQRRRREKDFESQVYDGPVCESIGGSSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADK

Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVCIVVIVSGRPLTRQQYMSQLDALEAAW
        NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVCIVVIVSGRPLTRQQY SQLDALEAAW
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVCIVVIVSGRPLTRQQYMSQLDALEAAW

Query:  LPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG
        LPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG
Subjt:  LPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG

XP_022146225.1 uncharacterized protein LOC111015489 [Momordica charantia] (e value: 2.6e-92; % identity: 41.57)
Query:  VMVVLLCCWAALVAVEEDC----------------------------QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS
        ++V+LLCC AAL + + D                             QM QLD +  TPEI+RDYS+GS            ATAQEWIDMVNSFQQ +LS
Subjt:  VMVVLLCCWAALVAVEEDC----------------------------QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS

Query:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ
        SRLGIP++YGIDAVHGHN VYNATIF +NVGL     R+PEL+RRIG ATAKE               VCRD RWGRCYESYSEDPDIVKEMT+II GLQ
Subjt:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ

Query:  GQTPSSSSKVIPYVVGRQR-------------------------------------------KGCS-------------------LCKAFL---------
        G+  S  SK +PYV GR +                                           KG S                   L   FL         
Subjt:  GQTPSSSSKVIPYVVGRQR-------------------------------------------KGCS-------------------LCKAFL---------

Query:  FCRRWQ--DNMRYQLEHSYQ-------QAWI----VTHSHARIL----SLYNQGCLYNNERYSDVLNQRRRREKDFESQVYDGPVCESIGGSSSVKESTI
            W   D + Y    +Y        QA I    +  +H   +    +L N   +    R  D +  RR     F   +++ P+ +    +    +   
Subjt:  FCRRWQ--DNMRYQLEHSYQ-------QAWI----VTHSHARIL----SLYNQGCLYNNERYSDVLNQRRRREKDFESQVYDGPVCESIGGSSSVKESTI

Query:  DLSSRNSRQQSAAAGQSPGKDS------------------------------------ADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFS
        DL+    R+           D                                     +  N TTGTTIL+AVKKTVDPNTEV+YN++PTTDY KANNFS
Subjt:  DLSSRNSRQQSAAAGQSPGKDS------------------------------------ADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFS

Query:  HVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        + IV VGE P+AE  GDNLNLTI EGGSDTIQ V     C+VVIVSGRPLT   Y+SQLDAL AAWLPGTEGEGVTDVL G+YGFT KL RT FKT ++
Subjt:  HVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

XP_022943425.1 uncharacterized protein LOC111448193 [Cucurbita moschata] (e value: 4.3e-95; % identity: 42.38)
Query:  VMVVLLCCWAALVAVEED----------------------------CQMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS
        ++V+LLCCW AL A  ED                             QM QLD S  TPEI+RDYSIGS            ATAQ WIDMVNSFQQ +LS
Subjt:  VMVVLLCCWAALVAVEED----------------------------CQMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS

Query:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ
        SRLGIPIMYGIDAVHGH  VYNAT+F +NVGL     REPELLRRIGAATA+E               VCRD RWGRCYESYSEDPDIVKEMTDII GLQ
Subjt:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ

Query:  GQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDNMRYQLEHSYQQAWIVTHSHARI---------LSLYNQGCLYNN----------------
        GQ PS  SK +PYV GR  K  +  K F+      R   +N      H      +  + H+ I          S +N   +++N                
Subjt:  GQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDNMRYQLEHSYQQAWIVTHSHARI---------LSLYNQGCLYNN----------------

Query:  ---------ERYSDVLNQ------------------------------------------------RRRREKDFESQVYDGPVCE-----SIGGS-----
                 +R +D  +                                                 RR     F   +++ P+ +      +G       
Subjt:  ---------ERYSDVLNQ------------------------------------------------RRRREKDFESQVYDGPVCE-----SIGGS-----

Query:  --SSVKESTIDLSSRNSRQQSA-----------AAG--------QSPG-----KDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHV
           +V++S + L +  +                 AG        Q  G     K  +  NLTTGTTILEAVKK+VDPNTEV+++++PT DY KANNF++ 
Subjt:  --SSVKESTIDLSSRNSRQQSA-----------AAG--------QSPG-----KDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHV

Query:  IVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        IV VGE P+AE  GDNLNLTIPEGG DTIQ V     C+VV+VSGRPLT   YMSQLDAL AAWLPGTEGEGV DVL G+YGFT KL RT FKT+++
Subjt:  IVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

XP_023512240.1 uncharacterized protein LOC111777028 [Cucurbita pepo subsp. pepo] (e value: 1.8e-93; % identity: 42.21)
Query:  VMVVLLCCWAALVAVEED----------------------------CQMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS
        ++V+LLC W AL A  +D                             QM QLD S  TPEI+RDYSIGS            ATAQ WIDMVNSFQQ +LS
Subjt:  VMVVLLCCWAALVAVEED----------------------------CQMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS

Query:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ
        SRLGIPIMYGIDAVHGH  VYNAT+F +NVGL     REPELLRRIGAATA+E               VCRD RWGRCYESYSEDPDIVKEMTDII GLQ
Subjt:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ

Query:  GQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDNMRYQLEHSYQQAWIVTHSHARI---------LSLYNQGCLYNN----------------
        GQ PS  SK +PYV GR  K  +  K F+      R   +N      H      +  + H+ I          S +N   +++N                
Subjt:  GQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDNMRYQLEHSYQQAWIVTHSHARI---------LSLYNQGCLYNN----------------

Query:  ---------ERYSDV----------------------------------------------LNQRRRREKD-------FESQVYDGPVCESIG-------
                 +R +D                                               +N   RR          FE+ + DG     +G       
Subjt:  ---------ERYSDV----------------------------------------------LNQRRRREKD-------FESQVYDGPVCESIG-------

Query:  -------------GSSSVKESTIDLSSRNSRQQSAAA-----GQSPG------KDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHV
                        +  +  + LS +  +   A A     G   G      K  +  NLTTGTTIL+AVKK+VDPNTEV+++++PT DY KANNF++ 
Subjt:  -------------GSSSVKESTIDLSSRNSRQQSAAA-----GQSPG------KDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHV

Query:  IVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        IV VGE P+AE  GDNLNLTIPEGG DTIQ V     C+VV+VSGRPLT   YMSQLDAL AAWLPGTEGEGV DVL G+YGFT KL RT FKT ++
Subjt:  IVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

XP_038900909.1 beta-glucosidase BoGH3B-like [Benincasa hispida] (e value: 2.4e-98; % identity: 42.24)
Query:  MKVMVVLLCCWAALVAVEEDC----------------------------QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQAT
        MKV ++LLCCW ALVA +ED                             QMAQLD S  TPEI+RDYSIGS            AT QEWIDMVNSFQ+ +
Subjt:  MKVMVVLLCCWAALVAVEEDC----------------------------QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQAT

Query:  LSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAG
        LSSRLGIP++YGIDAVHGHN VYNAT+F +NVGL     REPELLRRIGAATAKE               VCRD RWGRCYESY EDPDIVKEM DII G
Subjt:  LSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAG

Query:  LQGQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDNMRYQLEHSYQQAWIVTHSHARI---------LSLYNQGCLYNNE-------------
        LQGQ PS   K +PYV GR  K  +  K F+      R   +N      H      +  + H+ I          S +N   +++N              
Subjt:  LQGQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDNMRYQLEHSYQQAWIVTHSHARI---------LSLYNQGCLYNNE-------------

Query:  -----------------------------------------RYSDVLNQ-------------------RRRREKDFESQVYDGPVCESIGGSSSVKESTI
                                                  Y++ ++                    RR     F   +++ P+ +    +    +   
Subjt:  -----------------------------------------RYSDVLNQ-------------------RRRREKDFESQVYDGPVCESIGGSSSVKESTI

Query:  DLSSRNSRQQSAAAGQSPGKDS------------------------------------ADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFS
        DL+    R+  A        D                                     +  NLTTGTTILEAVKKTVDPNTE+IYN+N TTDY KANNFS
Subjt:  DLSSRNSRQQSAAAGQSPGKDS------------------------------------ADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFS

Query:  HVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        + IV VGETP+AE  GDNLNLTI EGGSDTIQ V     C+VVIVSGRPLT + +MSQLDAL  +WLPGTEGEGVTDVL G+YGFT KL RT FKT ++
Subjt:  HVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

TrEMBL top hits
A0A0A0LJ56 Glyco_hydro_3_C domain-containing protein (e value: 3.9e-86; % identity: 99.41)
Query:  SSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGG
        SSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGG
Subjt:  SSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGG

Query:  SDTIQKVCIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG
        SDTIQKVCIVVIVSGRPLTRQQY SQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG
Subjt:  SDTIQKVCIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG

A0A1S4E304 LOW QUALITY PROTEIN: lysosomal beta glucosidase-like (e value: 6.0e-87; % identity: 48.87)
Query:  MKVMVVLLCCWAALVAVEEDC----------------------------QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQAT
        MKVMVVLLCCWAALVAV+ED                             QMAQLDSSA TPEIIRDYSIGS            ATAQEWI MVNSFQQAT
Subjt:  MKVMVVLLCCWAALVAVEEDC----------------------------QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQAT

Query:  LSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVG-----LVFSCDREPELLRRIGAATAKEVCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSS
        LSSRLGIPIMYGIDAVHGHNGVYNAT+F +N+G      +     EPELLRRIGAAT KEV R T        Y   P I +       G   +  S+  
Subjt:  LSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVG-----LVFSCDREPELLRRIGAATAKEVCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSS

Query:  KVIPYVVGRQRKGCSLCKAFLFCRRWQDNMRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNNERYSDVLNQRRRREKDFESQVYDGPVCESIGGS----
         V+ +          L  +  F  R+ D + Y +  +      +  +  RIL +     L+ N    D      R   +  SQ +     E++  S    
Subjt:  KVIPYVVGRQRKGCSLCKAFLFCRRWQDNMRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNNERYSDVLNQRRRREKDFESQVYDGPVCESIGGS----

Query:  ---SSVKESTIDLSSR-----------NSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEP
            +  E  + LS +           N+     +      +     NLTTGTTILEAVKKTVDPNTEVIYN+NPTTDY KANNFS+ I  VGETP AE 
Subjt:  ---SSVKESTIDLSSR-----------NSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEP

Query:  KGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        KGDNLNLTI EGGSDTIQ V     CIVVIVSG    RQQYMSQLDAL  AWLPGTEGEGVTDVL GEYGFT KL RT  KT ++
Subjt:  KGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

A0A1U8PKK0 lysosomal beta glucosidase-like (e value: 5.1e-78; % identity: 37.25)
Query:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG
        QM Q+D    TPE++RDY IGS            AT QEW++MVN FQ  +LS+RLGIP++YGIDAVHGHN VY ATIF +N+GL     R+PEL++RIG
Subjt:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG

Query:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQR----------------------------
        +ATA+E               VCRD RWGRC+ESYSEDPDIVKEMT+II GLQG+ P  S K +PYV G+ +                            
Subjt:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQR----------------------------

Query:  --------------------------------------------KGCSLCKAFLFCRRWQ--DNMRYQLEHSYQQAWIVTHSHA---RILSLYN------
                                                    KG    + F+    WQ  D M Y +  +Y  + ++T   A    I+  YN      
Subjt:  --------------------------------------------KGCSLCKAFLFCRRWQ--DNMRYQLEHSYQQAWIVTHSHA---RILSLYN------

Query:  --QGCLYNN----ERYSDVLNQRRRREKDFESQVYDGPVCES-------------------------IGGSSSVKESTIDLSSRNSRQQSAAA-----GQ
           G + N      R  D +  RR     F+  +++ P+ +                          +    +  E  + L  ++S+   AA+     G 
Subjt:  --QGCLYNN----ERYSDVLNQRRRREKDFESQVYDGPVCES-------------------------IGGSSSVKESTIDLSSRNSRQQSAAA-----GQ

Query:  SPG------KDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVS
          G      +  +  NLT+GTTIL+ + + VDP+TE++Y  NP  DY K+NNFS+ IV VGE P+AE  GDNLNLTIP  G  T+  V     C+VV++S
Subjt:  SPG------KDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVS

Query:  GRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        GRPL    ++ Q+DAL AAWLPGTEG+GV DVL G+YGF+ KL RT FKT E+
Subjt:  GRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

A0A6J1CWQ0 uncharacterized protein LOC111015489 (e value: 1.3e-92; % identity: 41.57)
Query:  VMVVLLCCWAALVAVEEDC----------------------------QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS
        ++V+LLCC AAL + + D                             QM QLD +  TPEI+RDYS+GS            ATAQEWIDMVNSFQQ +LS
Subjt:  VMVVLLCCWAALVAVEEDC----------------------------QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS

Query:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ
        SRLGIP++YGIDAVHGHN VYNATIF +NVGL     R+PEL+RRIG ATAKE               VCRD RWGRCYESYSEDPDIVKEMT+II GLQ
Subjt:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ

Query:  GQTPSSSSKVIPYVVGRQR-------------------------------------------KGCS-------------------LCKAFL---------
        G+  S  SK +PYV GR +                                           KG S                   L   FL         
Subjt:  GQTPSSSSKVIPYVVGRQR-------------------------------------------KGCS-------------------LCKAFL---------

Query:  FCRRWQ--DNMRYQLEHSYQ-------QAWI----VTHSHARIL----SLYNQGCLYNNERYSDVLNQRRRREKDFESQVYDGPVCESIGGSSSVKESTI
            W   D + Y    +Y        QA I    +  +H   +    +L N   +    R  D +  RR     F   +++ P+ +    +    +   
Subjt:  FCRRWQ--DNMRYQLEHSYQ-------QAWI----VTHSHARIL----SLYNQGCLYNNERYSDVLNQRRRREKDFESQVYDGPVCESIGGSSSVKESTI

Query:  DLSSRNSRQQSAAAGQSPGKDS------------------------------------ADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFS
        DL+    R+           D                                     +  N TTGTTIL+AVKKTVDPNTEV+YN++PTTDY KANNFS
Subjt:  DLSSRNSRQQSAAAGQSPGKDS------------------------------------ADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFS

Query:  HVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        + IV VGE P+AE  GDNLNLTI EGGSDTIQ V     C+VVIVSGRPLT   Y+SQLDAL AAWLPGTEGEGVTDVL G+YGFT KL RT FKT ++
Subjt:  HVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

A0A6J1FXT0 uncharacterized protein LOC111448193 (e value: 2.1e-95; % identity: 42.38)
Query:  VMVVLLCCWAALVAVEED----------------------------CQMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS
        ++V+LLCCW AL A  ED                             QM QLD S  TPEI+RDYSIGS            ATAQ WIDMVNSFQQ +LS
Subjt:  VMVVLLCCWAALVAVEED----------------------------CQMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLS

Query:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ
        SRLGIPIMYGIDAVHGH  VYNAT+F +NVGL     REPELLRRIGAATA+E               VCRD RWGRCYESYSEDPDIVKEMTDII GLQ
Subjt:  SRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQ

Query:  GQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDNMRYQLEHSYQQAWIVTHSHARI---------LSLYNQGCLYNN----------------
        GQ PS  SK +PYV GR  K  +  K F+      R   +N      H      +  + H+ I          S +N   +++N                
Subjt:  GQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDNMRYQLEHSYQQAWIVTHSHARI---------LSLYNQGCLYNN----------------

Query:  ---------ERYSDVLNQ------------------------------------------------RRRREKDFESQVYDGPVCE-----SIGGS-----
                 +R +D  +                                                 RR     F   +++ P+ +      +G       
Subjt:  ---------ERYSDVLNQ------------------------------------------------RRRREKDFESQVYDGPVCE-----SIGGS-----

Query:  --SSVKESTIDLSSRNSRQQSA-----------AAG--------QSPG-----KDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHV
           +V++S + L +  +                 AG        Q  G     K  +  NLTTGTTILEAVKK+VDPNTEV+++++PT DY KANNF++ 
Subjt:  --SSVKESTIDLSSRNSRQQSA-----------AAG--------QSPG-----KDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHV

Query:  IVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        IV VGE P+AE  GDNLNLTIPEGG DTIQ V     C+VV+VSGRPLT   YMSQLDAL AAWLPGTEGEGV DVL G+YGFT KL RT FKT+++
Subjt:  IVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

SwissProt top hits
A7LXU3 Beta-glucosidase BoGH3B (e value: 4.1e-08; % identity: 28)
Query:  SSATTPEIIRDYSIGS---------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRR---IGAATAKE
        S A    +I  Y +GS            ++W + +   Q+ ++   +GIP +YG+D +HG     + T+F   + +  + +R  EL RR   I A   K 
Subjt:  SSATTPEIIRDYSIGS---------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRR---IGAATAKE

Query:  VC------------RDTRWGRCYESYSEDPDIVKEM-TDIIAGLQGQTPS
         C            RD RW R +E+Y ED  +  EM    + G QG+ P+
Subjt:  VC------------RDTRWGRCYESYSEDPDIVKEM-TDIIAGLQGQTPS

Q23892 Lysosomal beta glucosidase (e value: 1.6e-07; % identity: 31.97)
Query:  WIDMVNSFQQATL-SSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKEV-CRDT-------------------RWGRCYE
        W+DM+N+ Q   +  S   IP++YG+D+VHG N V+ AT+F +N GL  + + E        A TA ++  +DT                    W R YE
Subjt:  WIDMVNSFQQATL-SSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGAATAKEV-CRDT-------------------RWGRCYE

Query:  SYSEDPDIVKEM-TDIIAGLQG
        ++ EDP +   M    + G QG
Subjt:  SYSEDPDIVKEM-TDIIAGLQG

T2KMH0 Beta-xylosidase (e value: 5.6e-05; % identity: 31.34)
Query:  DYSIGSATAQEWID--MVNSFQQATLSSRLGIPIMYGIDAVHGHNGVY----NATIFLNNVGLVFSCDREPELLRRIGAATAKEV---------------
        D  + +  +Q  +D  +    Q A  + RLGIP M   +A+HG   V     N T++   V    +   EPEL++++ + TA+E                
Subjt:  DYSIGSATAQEWID--MVNSFQQATLSSRLGIPIMYGIDAVHGHNGVY----NATIFLNNVGLVFSCDREPELLRRIGAATAKEV---------------

Query:  -CRDTRWGRCYESYSEDPDIVKEM-TDIIAGLQG
           D R+GR  ESY EDP +V  M    I GLQG
Subjt:  -CRDTRWGRCYESYSEDPDIVKEM-TDIIAGLQG

Arabidopsis top hits
AT3G47040.1 Glycosyl hydrolase family protein (e value: 3.1e-51; % identity: 30)
Query:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGL---------------
        QM Q++   TTP +I D  IGS            A   +W DM++ +Q A L+SRLGIPI+YGIDAVHG+N VY ATIF +N+GL               
Subjt:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGL---------------

Query:  --------VFSCDREPELLRRIGAATAKEV---------------CRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQR-----
                V  CDR+ +L+RR+GAATA EV                RD RWGR YESYSEDPDI+ E++ +++GLQG+ P       P++ GR       
Subjt:  --------VFSCDREPELLRRIGAATAKEV---------------CRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQR-----

Query:  ----------KGCSLCKAFLFCRRWQD-------NMRYQ----LEHSYQQAWIVTHSHARILSLYN--------QGCLYNN----ERYSDVLNQRRRR--
                  KG +     +     +        N   Q    +  SY  +W  +  H+    L          +G + ++    ER S+      R   
Subjt:  ----------KGCSLCKAFLFCRRWQD-------NMRYQ----LEHSYQQAWIVTHSHARILSLYN--------QGCLYNN----ERYSDVLNQRRRR--

Query:  --------------------EKD-------------------------------FESQVYDGPVCESIG-------GSSSVKESTIDLSSRNSRQQS---
                             KD                               FE  + D  +  ++G          SV++S + L +  + ++    
Subjt:  --------------------EKD-------------------------------FESQVYDGPVCESIG-------GSSSVKESTIDLSSRNSRQQS---

Query:  --------------------AAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKA-NNFSHVIVAVGETPHAEPKGDNLNLTIPEG
                               G +         +T GTT+L+A+K+ V   TEVIY   P+ +   +   FS+ IVAVGETP+AE  GDN  LTIP  
Subjt:  --------------------AAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKA-NNFSHVIVAVGETPHAEPKGDNLNLTIPEG

Query:  GSDTIQKVC-----IVVIVSGRPLTRQQ-YMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFK
        G+D +  +      +VV+ SGRPL  +   + + +AL AAWLPGTEG+G+TDV+ G+Y F  KL  + FK
Subjt:  GSDTIQKVC-----IVVIVSGRPLTRQQ-YMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFK

AT5G04885.1 Glycosyl hydrolase family protein (e value: 8.1e-68; % identity: 34)
Query:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG
        QM Q+D S  T  I+RDY IGS            A+AQ W+DM+N +Q+  L SRLGIP++YGIDAVHGHN VYNATIF +NVGL     R+P+L++RIG
Subjt:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG

Query:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDN------
        AATA E               VCRD RWGRCYESYSED  +V++MTD+I GLQG+ PS+    +P+V GR  K  +  K ++      R   +N      
Subjt:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDN------

Query:  -------MRYQLEHSYQQAWIV-----------THSHARILSLYNQGCL-----------------------------------------------YNNE
               M    +  Y+    V            H++  +++ Y +G L                                               + N+
Subjt:  -------MRYQLEHSYQQAWIV-----------THSHARILSLYNQGCL-----------------------------------------------YNNE

Query:  RYSDVLNQ-----------RRRREKDFESQVYDGPVCE-----SIGGSS-------SVKESTIDLSSRNSRQQSAAAGQSPGK-----------------
          + V N            RR     F   +++ P+ +      +G  +       +V++S + L + N         +   K                 
Subjt:  RYSDVLNQ-----------RRRREKDFESQVYDGPVCE-----SIGGSS-------SVKESTIDLSSRNSRQQSAAAGQSPGK-----------------

Query:  -------DSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGG----SDTIQKV-CIVVIVSGRP
                S +KN T GTT+L AVK  VD +TEV++  NP  ++ K+NNF++ I+AVGE P+AE  GD+  LT+ + G    S T Q V C+VV++SGRP
Subjt:  -------DSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGG----SDTIQKV-CIVVIVSGRP

Query:  LTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L  + Y++ +DAL AAWLPGTEG+G+TD L G++GF+ KL  T F+  E+
Subjt:  LTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

AT5G20940.1 Glycosyl hydrolase family protein (e value: 1.5e-66; % identity: 35.64)
Query:  QMAQLDSSATTPEIIRDYSIGSATA------------QEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG
        QM Q++    T E+++ Y +GS  +            + W++MVN  Q+  LS+RLGIPI+YGIDAVHGHN VYNATIF +NVGL     R+P L++RIG
Subjt:  QMAQLDSSATTPEIIRDYSIGSATA------------QEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG

Query:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDN------
         ATA E               VCRD RWGRCYESYSED  IV++MT+II GLQG  P +  K +P+V G+  K  +  K F+      R    N      
Subjt:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQRKGCSLCKAFL----FCRRWQDN------

Query:  -----MRYQLEHSYQQAWIVT-------------HSHARILS--LYNQ---------------------GCLYNNERYS-------------------DV
             +     H      + T             H++ ++++  L N+                     G  Y++  Y+                   D 
Subjt:  -----MRYQLEHSYQQAWIVT-------------HSHARILS--LYNQ---------------------GCLYNNERYS-------------------DV

Query:  LNQRRRRE---------------------KDFESQVYDGPVCESIGGS-------SSVKESTIDL-SSRNSRQQSAAAGQSPGK-----DSAD-------
        L  + +R+                       FE+ + D  + + +G          +V++S + L +  N+ +      +   K       AD       
Subjt:  LNQRRRRE---------------------KDFESQVYDGPVCESIGGS-------SSVKESTIDL-SSRNSRQQSAAAGQSPGK-----DSAD-------

Query:  -----------KNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRP
                    NLT GTTIL AVKKTVDP T+VIYN NP T++ KA +F + IVAVGE P+AE  GD+ NLTI E G  TI  V     C+VV+VSGRP
Subjt:  -----------KNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRP

Query:  LTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        +  Q  +S +DAL AAWLPGTEG+GV DVL G+YGFT KL RT FKT ++
Subjt:  LTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

AT5G20950.1 Glycosyl hydrolase family protein (e value: 2.2e-65; % identity: 35.4)
Query:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG
        QM Q++ S  TPE+++ Y IGS            AT + W++MVN  Q+A+LS+RLGIP++YGIDAVHGHN VY ATIF +NVGL     R+P L++RIG
Subjt:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG

Query:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQR----------------------------
        AATA E               VCRD RWGRCYESYSED  IV++MT+II GLQG  P +  K +P+V G+ +                            
Subjt:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQR----------------------------

Query:  ---------------KGCS-------------------LCKAFL---------FCRRWQDNMRY----QLEHSYQQAWIVTHSHARILSLYN--------
                       KG +                   L   FL             WQ   R      L +SY     ++     I+  YN        
Subjt:  ---------------KGCS-------------------LCKAFL---------FCRRWQDNMRY----QLEHSYQQAWIVTHSHARILSLYN--------

Query:  ----QGCLYNNERYSDVLNQRRRREKD---FESQVYDGPVCESIGGS-------SSVKESTIDLSSRNSRQQSAAAGQSPGKDS--------AD------
            Q  L    R  D L +  R +     FE  + D      +G          +V++S + L  +N +  +      P K          AD      
Subjt:  ----QGCLYNNERYSDVLNQRRRREKD---FESQVYDGPVCESIGGS-------SSVKESTIDLSSRNSRQQSAAAGQSPGKDS--------AD------

Query:  ------------KNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGR
                     + T GTTIL AVK TV P T+V+Y+ NP  ++ K+  F + IV VGE P+AE  GD  NLTI + G   I  V     C+VV+VSGR
Subjt:  ------------KNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGR

Query:  PLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKT
        P+  Q Y+S +DAL AAWLPGTEG+GV D L G+YGFT KL RT FK+
Subjt:  PLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKT

AT5G20950.2 Glycosyl hydrolase family protein (e value: 2.2e-65; % identity: 35.4)
Query:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG
        QM Q++ S  TPE+++ Y IGS            AT + W++MVN  Q+A+LS+RLGIP++YGIDAVHGHN VY ATIF +NVGL     R+P L++RIG
Subjt:  QMAQLDSSATTPEIIRDYSIGS------------ATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIG

Query:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQR----------------------------
        AATA E               VCRD RWGRCYESYSED  IV++MT+II GLQG  P +  K +P+V G+ +                            
Subjt:  AATAKE---------------VCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQR----------------------------

Query:  ---------------KGCS-------------------LCKAFL---------FCRRWQDNMRY----QLEHSYQQAWIVTHSHARILSLYN--------
                       KG +                   L   FL             WQ   R      L +SY     ++     I+  YN        
Subjt:  ---------------KGCS-------------------LCKAFL---------FCRRWQDNMRY----QLEHSYQQAWIVTHSHARILSLYN--------

Query:  ----QGCLYNNERYSDVLNQRRRREKD---FESQVYDGPVCESIGGS-------SSVKESTIDLSSRNSRQQSAAAGQSPGKDS--------AD------
            Q  L    R  D L +  R +     FE  + D      +G          +V++S + L  +N +  +      P K          AD      
Subjt:  ----QGCLYNNERYSDVLNQRRRREKD---FESQVYDGPVCESIGGS-------SSVKESTIDLSSRNSRQQSAAAGQSPGKDS--------AD------

Query:  ------------KNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGR
                     + T GTTIL AVK TV P T+V+Y+ NP  ++ K+  F + IV VGE P+AE  GD  NLTI + G   I  V     C+VV+VSGR
Subjt:  ------------KNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGR

Query:  PLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKT
        P+  Q Y+S +DAL AAWLPGTEG+GV D L G+YGFT KL RT FK+
Subjt:  PLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKT


Sequences
CDS sequence
ATGAAGGTGATGGTGGTTTTACTCTGTTGCTGGGCAGCTTTGGTGGCTGTTGAGGAAGACTGTCAAATGGCACAGTTGGATTCTTCGGCCACCACACCGGAAATCATAAG
AGATTACTCCATTGGCAGTGCTACTGCACAGGAGTGGATTGACATGGTGAATTCCTTCCAGCAGGCCACATTATCAAGTAGGCTTGGAATTCCAATAATGTATGGTATTG
ATGCTGTCCATGGTCACAATGGCGTCTACAATGCTACCATCTTCCTCAACAATGTCGGTCTTGTTTTCTCATGTGACAGGGAACCTGAACTTTTAAGAAGGATTGGTGCT
GCTACTGCTAAAGAAGTCTGTAGAGATACGAGATGGGGAAGGTGTTATGAAAGCTACAGTGAAGATCCTGACATTGTCAAAGAAATGACAGATATCATAGCTGGGTTGCA
AGGACAAACCCCATCTAGTTCTTCAAAAGTTATTCCTTATGTTGTTGGAAGGCAAAGAAAAGGTTGCAGCTTGTGCAAAGCATTTCTATTTTGTAGGCGATGGCAGGACA
ACATGCGGTATCAATTAGAACACAGTTATCAGCAAGCATGGATTGTTACGCATTCACACGCCAGGATACTATCACTCTACAATCAAGGGTGTCTCTACAATAATGAACGT
TATTCTGATGTCCTGAATCAACGACGACGCCGTGAGAAGGATTTTGAGAGTCAAGTTTATGATGGGCCTGTTTGTGAATCCATTGGCGGTTCTTCCTCTGTCAAAGAAAG
CACCATAGATCTTAGTAGCCGGAACTCACGCCAACAATCTGCGGCGGCCGGACAATCACCTGGCAAGGACTCAGCGGACAAAAATCTAACAACCGGAACCACCATTCTCG
AGGCAGTGAAGAAAACCGTTGATCCAAACACGGAGGTCATCTACAACATAAATCCAACAACCGATTACTTCAAGGCGAACAACTTCTCGCACGTCATTGTGGCTGTAGGA
GAAACGCCGCACGCCGAGCCCAAAGGCGACAACCTAAACCTAACTATCCCCGAAGGAGGCTCGGACACGATCCAGAAGGTGTGCATCGTTGTCATCGTCTCCGGCCGGCC
TCTGACGAGGCAGCAATACATGTCACAATTGGACGCGCTGGAGGCGGCGTGGCTGCCGGGAACGGAAGGGGAAGGCGTGACGGACGTGCTGTTGGGAGAATATGGGTTCA
CCGAAAAGCTGGAGAGGACGAGGTTCAAGACTCAAGAAAAAGATGGTTAG
mRNA sequence
ATGAAGGTGATGGTGGTTTTACTCTGTTGCTGGGCAGCTTTGGTGGCTGTTGAGGAAGACTGTCAAATGGCACAGTTGGATTCTTCGGCCACCACACCGGAAATCATAAG
AGATTACTCCATTGGCAGTGCTACTGCACAGGAGTGGATTGACATGGTGAATTCCTTCCAGCAGGCCACATTATCAAGTAGGCTTGGAATTCCAATAATGTATGGTATTG
ATGCTGTCCATGGTCACAATGGCGTCTACAATGCTACCATCTTCCTCAACAATGTCGGTCTTGTTTTCTCATGTGACAGGGAACCTGAACTTTTAAGAAGGATTGGTGCT
GCTACTGCTAAAGAAGTCTGTAGAGATACGAGATGGGGAAGGTGTTATGAAAGCTACAGTGAAGATCCTGACATTGTCAAAGAAATGACAGATATCATAGCTGGGTTGCA
AGGACAAACCCCATCTAGTTCTTCAAAAGTTATTCCTTATGTTGTTGGAAGGCAAAGAAAAGGTTGCAGCTTGTGCAAAGCATTTCTATTTTGTAGGCGATGGCAGGACA
ACATGCGGTATCAATTAGAACACAGTTATCAGCAAGCATGGATTGTTACGCATTCACACGCCAGGATACTATCACTCTACAATCAAGGGTGTCTCTACAATAATGAACGT
TATTCTGATGTCCTGAATCAACGACGACGCCGTGAGAAGGATTTTGAGAGTCAAGTTTATGATGGGCCTGTTTGTGAATCCATTGGCGGTTCTTCCTCTGTCAAAGAAAG
CACCATAGATCTTAGTAGCCGGAACTCACGCCAACAATCTGCGGCGGCCGGACAATCACCTGGCAAGGACTCAGCGGACAAAAATCTAACAACCGGAACCACCATTCTCG
AGGCAGTGAAGAAAACCGTTGATCCAAACACGGAGGTCATCTACAACATAAATCCAACAACCGATTACTTCAAGGCGAACAACTTCTCGCACGTCATTGTGGCTGTAGGA
GAAACGCCGCACGCCGAGCCCAAAGGCGACAACCTAAACCTAACTATCCCCGAAGGAGGCTCGGACACGATCCAGAAGGTGTGCATCGTTGTCATCGTCTCCGGCCGGCC
TCTGACGAGGCAGCAATACATGTCACAATTGGACGCGCTGGAGGCGGCGTGGCTGCCGGGAACGGAAGGGGAAGGCGTGACGGACGTGCTGTTGGGAGAATATGGGTTCA
CCGAAAAGCTGGAGAGGACGAGGTTCAAGACTCAAGAAAAAGATGGTTAG
Protein sequence
MKVMVVLLCCWAALVAVEEDCQMAQLDSSATTPEIIRDYSIGSATAQEWIDMVNSFQQATLSSRLGIPIMYGIDAVHGHNGVYNATIFLNNVGLVFSCDREPELLRRIGA
ATAKEVCRDTRWGRCYESYSEDPDIVKEMTDIIAGLQGQTPSSSSKVIPYVVGRQRKGCSLCKAFLFCRRWQDNMRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNNER
YSDVLNQRRRREKDFESQVYDGPVCESIGGSSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVG
ETPHAEPKGDNLNLTIPEGGSDTIQKVCIVVIVSGRPLTRQQYMSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG
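
The protein sequence above should correspond to a translation of the CDS sequence. The sketch below shows one way to check that on short excerpts, assuming the standard genetic code (the record does not name a translation table, though this is the usual choice for a plant nuclear gene). The names CODON_TABLE and translate are hypothetical helpers, not part of CuGenDB.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGAAGGTGATGGTGGTTTTACTCTGTTGC"))
print(translate("AAAGATGGTTAG"))
```

The two excerpts translate to MKVMVVLLCC and KDG, which match the first ten residues and the last three residues of the protein sequence listed in this record.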