CuGenDBv2

Lsi04G001820 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G001820
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Glycosyl hydrolase family protein
Genome location: chr04:1949999..1957801
RNA-Seq Expression: Lsi04G001820
Synteny: Lsi04G001820
Gene Ontology terms:
GO:0009251 - glucan catabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domains:
IPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR017853 - Glycoside hydrolase superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily
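
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the following assumes the coordinates are 1-based and inclusive (a common convention for this notation, but not stated on this page); the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr04:1949999..1957801")
print(chrom, start, end, span)
```

Under that assumption the gene model spans 7,803 bp on chr04, which is much longer than the 842-nt CDS and is consistent with a gene that contains introns.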


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022150694.1 uncharacterized protein LOC111018764 isoform X1 [Momordica charantia] (e-value: 4.2e-160; identity: 82.54%)
Query:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL
        +LKLKL WK ++ G TSQ+ KMA+IFVQVV  LCLGWW WA  VD + LKYKDPKQPVAVRV DLLGRMTLEEKIGQM QIDR VAN TVMK+Y IGSVL
Subjt:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL

Query:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC
        +GGG+  LPDARA+DWVNMINE QKGSLSSRLGIPMMYG+DAVHGHNN YNAT+FPHNVGLGATR       NP LVRRIGAATALEVRATGISFAF+PC
Subjt:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC

Query:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS
        IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPA YRKGIPYVGG++KV+ACAKHFVGDGGTTNGI+E+NTVID HGLLSIHMPAY DSI KG+S
Subjt:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS

Query:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        +VM+SYSSWNGVKMHAN ELITGFLKGTLKFK  +I D
Subjt:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

XP_022150697.1 uncharacterized protein LOC111018764 isoform X2 [Momordica charantia] (e-value: 1.3e-156; identity: 83.49%)
Query:  ESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDA
        E G TSQ+ KMA+IFVQVV  LCLGWW WA  VD + LKYKDPKQPVAVRV DLLGRMTLEEKIGQM QIDR VAN TVMK+Y IGSVL+GGG+  LPDA
Subjt:  ESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDA

Query:  RAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGR
        RA+DWVNMINE QKGSLSSRLGIPMMYG+DAVHGHNN YNAT+FPHNVGLGATR       NP LVRRIGAATALEVRATGISFAF+PCIAVCRDPRWGR
Subjt:  RAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGR

Query:  CYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNG
        CYESYSEDPKIVQEMTEIIPGLQGEPPA YRKGIPYVGG++KV+ACAKHFVGDGGTTNGI+E+NTVID HGLLSIHMPAY DSI KG+S+VM+SYSSWNG
Subjt:  CYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNG

Query:  VKMHANRELITGFLKGTLKFKVCLILD
        VKMHAN ELITGFLKGTLKFK  +I D
Subjt:  VKMHANRELITGFLKGTLKFKVCLILD

XP_022150698.1 uncharacterized protein LOC111018764 isoform X3 [Momordica charantia] (e-value: 4.2e-160; identity: 82.54%)
Query:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL
        +LKLKL WK ++ G TSQ+ KMA+IFVQVV  LCLGWW WA  VD + LKYKDPKQPVAVRV DLLGRMTLEEKIGQM QIDR VAN TVMK+Y IGSVL
Subjt:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL

Query:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC
        +GGG+  LPDARA+DWVNMINE QKGSLSSRLGIPMMYG+DAVHGHNN YNAT+FPHNVGLGATR       NP LVRRIGAATALEVRATGISFAF+PC
Subjt:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC

Query:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS
        IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPA YRKGIPYVGG++KV+ACAKHFVGDGGTTNGI+E+NTVID HGLLSIHMPAY DSI KG+S
Subjt:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS

Query:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        +VM+SYSSWNGVKMHAN ELITGFLKGTLKFK  +I D
Subjt:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

XP_022945501.1 uncharacterized protein LOC111449719 isoform X1 [Cucurbita moschata] (e-value: 9.6e-157; identity: 80.18%)
Query:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL
        +L LKLPWK KE G  S   KMAK FVQVV+ LCLGWW WA MV  +NLKYKDPKQPV+VRVKDLLGRMTLEEKIGQM QIDR VANATVMKNYFIGSVL
Subjt:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL

Query:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC
        +GGG+  LPDARAQDWV+MIN+ QKGSLSSRLGIPM+YG+DAVHGHNN YNAT+FPHNVGLGATR       NP L+RRIGAATALEVRATGIS+ F+PC
Subjt:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC

Query:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS
        +AVCRDPRWGRCYESYSEDPK+VQ MTEII GLQGEPPA +RKGIPYVGG++KV+ACAKHFVGDGGTT+GI+E+NTVID+HGLL IHMPAY+DSI KG+S
Subjt:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS

Query:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        +VMVSYSSWNGVKMHANRELIT FLK TLKFK  +I D
Subjt:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

XP_022968400.1 uncharacterized protein LOC111467651 isoform X2 [Cucurbita maxima] (e-value: 2.1e-159; identity: 81.07%)
Query:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL
        +L LKLPWK KE G+ S   KMAKIFVQVV+ LCLGWW WA MVD +NLKYKDPKQPV+VRVKDLLGRMTLEEKIGQM QIDR VANATVMKNYFIGSVL
Subjt:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL

Query:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC
        +GGG+  LPDARAQDWV+MIN+ QKGSLSSRLGIPM+YG+DAVHGHNN YNAT+FPHNVGLGATR       NP L+RRIGAATALEVRATGIS+ F+PC
Subjt:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC

Query:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS
        +AVCRDPRWGRCYESYSEDPK+VQ MTEII GLQGEPPA YRKGIPYVGG++KV+ACAKHFVGDGGTT+GI+E+NTVID+HGLL IHMPAY+DSI KG+S
Subjt:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS

Query:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        +VMVSYSSWNGVKMHANR+LIT FLKGTLKFK  +I D
Subjt:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1DA47 uncharacterized protein LOC111018764 isoform X3 (e-value: 2.0e-160; identity: 82.54%)
Query:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL
        +LKLKL WK ++ G TSQ+ KMA+IFVQVV  LCLGWW WA  VD + LKYKDPKQPVAVRV DLLGRMTLEEKIGQM QIDR VAN TVMK+Y IGSVL
Subjt:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL

Query:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC
        +GGG+  LPDARA+DWVNMINE QKGSLSSRLGIPMMYG+DAVHGHNN YNAT+FPHNVGLGATR       NP LVRRIGAATALEVRATGISFAF+PC
Subjt:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC

Query:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS
        IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPA YRKGIPYVGG++KV+ACAKHFVGDGGTTNGI+E+NTVID HGLLSIHMPAY DSI KG+S
Subjt:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS

Query:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        +VM+SYSSWNGVKMHAN ELITGFLKGTLKFK  +I D
Subjt:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

A0A6J1DBF5 uncharacterized protein LOC111018764 isoform X2 (e-value: 6.1e-157; identity: 83.49%)
Query:  ESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDA
        E G TSQ+ KMA+IFVQVV  LCLGWW WA  VD + LKYKDPKQPVAVRV DLLGRMTLEEKIGQM QIDR VAN TVMK+Y IGSVL+GGG+  LPDA
Subjt:  ESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDA

Query:  RAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGR
        RA+DWVNMINE QKGSLSSRLGIPMMYG+DAVHGHNN YNAT+FPHNVGLGATR       NP LVRRIGAATALEVRATGISFAF+PCIAVCRDPRWGR
Subjt:  RAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGR

Query:  CYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNG
        CYESYSEDPKIVQEMTEIIPGLQGEPPA YRKGIPYVGG++KV+ACAKHFVGDGGTTNGI+E+NTVID HGLLSIHMPAY DSI KG+S+VM+SYSSWNG
Subjt:  CYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNG

Query:  VKMHANRELITGFLKGTLKFKVCLILD
        VKMHAN ELITGFLKGTLKFK  +I D
Subjt:  VKMHANRELITGFLKGTLKFKVCLILD

A0A6J1DCA3 uncharacterized protein LOC111018764 isoform X1 (e-value: 2.0e-160; identity: 82.54%)
Query:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL
        +LKLKL WK ++ G TSQ+ KMA+IFVQVV  LCLGWW WA  VD + LKYKDPKQPVAVRV DLLGRMTLEEKIGQM QIDR VAN TVMK+Y IGSVL
Subjt:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL

Query:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC
        +GGG+  LPDARA+DWVNMINE QKGSLSSRLGIPMMYG+DAVHGHNN YNAT+FPHNVGLGATR       NP LVRRIGAATALEVRATGISFAF+PC
Subjt:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC

Query:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS
        IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPA YRKGIPYVGG++KV+ACAKHFVGDGGTTNGI+E+NTVID HGLLSIHMPAY DSI KG+S
Subjt:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS

Query:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        +VM+SYSSWNGVKMHAN ELITGFLKGTLKFK  +I D
Subjt:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

A0A6J1G118 uncharacterized protein LOC111449719 isoform X1 (e-value: 4.7e-157; identity: 80.18%)
Query:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL
        +L LKLPWK KE G  S   KMAK FVQVV+ LCLGWW WA MV  +NLKYKDPKQPV+VRVKDLLGRMTLEEKIGQM QIDR VANATVMKNYFIGSVL
Subjt:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL

Query:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC
        +GGG+  LPDARAQDWV+MIN+ QKGSLSSRLGIPM+YG+DAVHGHNN YNAT+FPHNVGLGATR       NP L+RRIGAATALEVRATGIS+ F+PC
Subjt:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC

Query:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS
        +AVCRDPRWGRCYESYSEDPK+VQ MTEII GLQGEPPA +RKGIPYVGG++KV+ACAKHFVGDGGTT+GI+E+NTVID+HGLL IHMPAY+DSI KG+S
Subjt:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS

Query:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        +VMVSYSSWNGVKMHANRELIT FLK TLKFK  +I D
Subjt:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

A0A6J1HX36 uncharacterized protein LOC111467651 isoform X2 (e-value: 1.0e-159; identity: 81.07%)
Query:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL
        +L LKLPWK KE G+ S   KMAKIFVQVV+ LCLGWW WA MVD +NLKYKDPKQPV+VRVKDLLGRMTLEEKIGQM QIDR VANATVMKNYFIGSVL
Subjt:  LLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVL

Query:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC
        +GGG+  LPDARAQDWV+MIN+ QKGSLSSRLGIPM+YG+DAVHGHNN YNAT+FPHNVGLGATR       NP L+RRIGAATALEVRATGIS+ F+PC
Subjt:  TGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPC

Query:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS
        +AVCRDPRWGRCYESYSEDPK+VQ MTEII GLQGEPPA YRKGIPYVGG++KV+ACAKHFVGDGGTT+GI+E+NTVID+HGLL IHMPAY+DSI KG+S
Subjt:  IAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGIS

Query:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        +VMVSYSSWNGVKMHANR+LIT FLKGTLKFK  +I D
Subjt:  TVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

SwissProt top hits (e-value, % identity, alignment)
A7LXU3 Beta-glucosidase BoGH3B (e-value: 2.3e-36; identity: 31.91%)
Query:  PKQP-VAVRVKDLLGRMTLEEKIGQMAQIDRGVAN-----------------ATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIP
        P  P +   +++ L +MTLE+KIGQM +I   V +                  TV+  Y +GS+L      L    + + W   I +IQ+ S+   +GIP
Subjt:  PKQP-VAVRVKDLLGRMTLEEKIGQMAQIDRGVAN-----------------ATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIP

Query:  MMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQ
         +YGVD +HG     + T+FP  + +GAT        N  L RR    +A E +A  I + F+P + + RDPRW R +E+Y ED  +  EM    + G Q
Subjt:  MMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQ

Query:  GEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVC
        GE P +         G   V AC KH++G G   +G D + + I +  +   H   ++ ++ +G  +VMV+    NG+  HANREL+T +LK  L +   
Subjt:  GEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVC

Query:  LILD
        ++ D
Subjt:  LILD

P33363 Periplasmic beta-glucosidase (e-value: 1.9e-22; identity: 28%)
Query:  DNLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMAQIDRGVAN-----ATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYG
        D+L    P  P A    V +LL +MT++EKIGQ+  I  G  N       ++K+  +G++     T    D RA    + + E+      SRL IP+ + 
Subjt:  DNLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMAQIDRGVAN-----ATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYG

Query:  VDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPP
         D +HG       T+FP ++GL +       S N   V+ +G  +A E    G++  ++P + V RDPRWGR  E + ED  +   M + ++  +QG+ P
Subjt:  VDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPP

Query:  AKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        A              V+   KHF   G    G + +   +    L + +MP Y   +  G   VMV+ +S NG    ++  L+   L+    FK   + D
Subjt:  AKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

Q23892 Lysosomal beta glucosidase (e-value: 6.5e-31; identity: 31.08%)
Query:  VKDLLGRMTLEEKIGQMAQID------------RGVANATVMKNYFIGSVL----TGGGTELLPDARAQDWVNMINEIQKGSL-SSRLGIPMMYGVDAVH
        V +L+ +M++ EKIGQM Q+D                 A   K Y+IGS L    +GG    +    +  W++MIN IQ   +  S   IPM+YG+D+VH
Subjt:  VKDLLGRMTLEEKIGQMAQID------------RGVANATVMKNYFIGSVL----TGGGTELLPDARAQDWVNMINEIQKGSL-SSRLGIPMMYGVDAVH

Query:  GHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEPPAKYRK
        G N  + AT+FPHN GL AT        N          T+ +  A GI + F+P + +   P W R YE++ EDP +   M    + G QG        
Subjt:  GHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEPPAKYRK

Query:  GIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIF-KGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
               +   V  AKH+ G    T+G D +   I +  L    +P++ ++I   G  T+M++    NGV MH + + +T  L+G L+F+   + D
Subjt:  GIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIF-KGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

Q56078 Periplasmic beta-glucosidase (e-value: 5.9e-24; identity: 29.15%)
Query:  DNLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVH
        +NL    P  P A    V DLL +MT++EKIGQ+  I  G  N    K      +  G    +      QD   M +++      SRL IP+ +  D VH
Subjt:  DNLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVH

Query:  GHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPPAKYRK
        G       T+FP ++GL +       S N   VR +G  +A E    G++  ++P + V RDPRWGR  E + ED  +   M E ++  +QG+ PA    
Subjt:  GHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPPAKYRK

Query:  GIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
                  V+   KHF   G    G + +   +    L + +MP Y   +  G   VMV+ +S NG    ++  L+   L+    FK   + D
Subjt:  GIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

Q5BCC6 Beta-glucosidase C (e-value: 2.1e-21; identity: 29.26%)
Query:  YKDPKQPVAVRVKDLLGRMTLEEKIGQM--AQIDRGVANATVMKN---YFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHG
        YK+    V  RV+DLL RMTLEEK GQ+   Q+  G  +     N     IG               A      IN IQK +L +RLGIP+    D  H 
Subjt:  YKDPKQPVAVRVKDLLGRMTLEEKIGQM--AQIDRGVANATVMKN---YFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHG

Query:  H----NNAYNATIF---PHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEP
                + A +F   P ++GL A R       +P LVR        E  A GI  A  P + +  +PRW R   ++ E+  +  E+  E I G QGE 
Subjt:  H----NNAYNATIF---PHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEP

Query:  PAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKH-----GLLSIHMPAYIDSIFKGISTVMVSYS-----SWNGVKMHANRELITGFLKG
                    G + V    KHF G G   NG ++S+    K+       +  H+  +  ++  G + +M  YS     +W  V    N+E++T  L+G
Subjt:  PAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKH-----GLLSIHMPAYIDSIFKGISTVMVSYS-----SWNGVKMHANRELITGFLKG

Query:  TLKFKVCLILD
         L F   ++ D
Subjt:  TLKFKVCLILD

Arabidopsis top hits (e-value, % identity, alignment)
AT3G47000.1 Glycosyl hydrolase family protein (e-value: 1.1e-97; identity: 57.09%)
Query:  MVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDA
        +V+  +  YK+   PV  RVKDLL RMTL EKIGQM QI+R VA+ +   ++FIGSVL  GG+    DA++ DW +MI+  Q+ +L+SRLGIP++YG DA
Subjt:  MVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDA

Query:  VHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYR
        VHG+NN Y AT+FPHN+GLGATR       +  LVRRIGAATALEVRA+G+ +AFSPC+AV RDPRWGRCYESY EDP++V EMT ++ GLQG PP ++ 
Subjt:  VHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYR

Query:  KGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
         G P+V G   VVAC KHFVGDGGT  GI+E NT+     L  IH+P Y+  + +G+STVM SYSSWNG ++HA+R L+T  LK  L FK  L+ D
Subjt:  KGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

AT5G04885.1 Glycosyl hydrolase family protein (e-value: 1.2e-128; identity: 69.55%)
Query:  VQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKG
        V V++ +C+  W+     DG+ L YKDPKQ V+ RV DL GRMTLEEKIGQM QIDR VA   +M++YFIGSVL+GGG+  LP+A AQ+WV+MINE QKG
Subjt:  VQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKG

Query:  SLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEM
        +L SRLGIPM+YG+DAVHGHNN YNATIFPHNVGLGATR       +P LV+RIGAATA+EVRATGI + F+PCIAVCRDPRWGRCYESYSED K+V++M
Subjt:  SLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEM

Query:  TEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLK
        T++I GLQGEPP+ Y+ G+P+VGG  KV ACAKH+VGDGGTT G++E+NTV D HGLLS+HMPAY D+++KG+STVMVSYSSWNG KMHAN ELITG+LK
Subjt:  TEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLK

Query:  GTLKFKVCLILD
        GTLKFK  +I D
Subjt:  GTLKFKVCLILD

AT5G20940.1 Glycosyl hydrolase family protein (e-value: 5.3e-113; identity: 66.32%)
Query:  NLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHN
        N KYKDPK+P+ VR+K+L+  MTLEEKIGQM Q++R  A   VM+ YF+GSV +GGG+   P    + WVNM+NE+QK +LS+RLGIP++YG+DAVHGHN
Subjt:  NLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHN

Query:  NAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPY
          YNATIFPHNVGLG TR       +PGLV+RIG ATALEVRATGI + F+PCIAVCRDPRWGRCYESYSED KIVQ+MTEIIPGLQG+ P   +KG+P+
Subjt:  NAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPY

Query:  VGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        V G  KV ACAKHFVGDGGT  G++ +NTVI+ +GLL IHMPAY D++ KG++TVMVSYSS NG+KMHAN++LITGFLK  LKF+  +I D
Subjt:  VGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

AT5G20950.1 Glycosyl hydrolase family protein (e-value: 3.4e-120; identity: 70%)
Query:  LKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNN
        LKYKDPKQP+  R++DL+ RMTL+EKIGQM QI+R VA   VMK YFIGSVL+GGG+     A  + WVNM+NEIQK SLS+RLGIPM+YG+DAVHGHNN
Subjt:  LKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNN

Query:  AYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYV
         Y ATIFPHNVGLG TR       +P LV+RIGAATALEVRATGI +AF+PCIAVCRDPRWGRCYESYSED +IVQ+MTEIIPGLQG+ P K RKG+P+V
Subjt:  AYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYV

Query:  GGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        GG  KV ACAKHFVGDGGT  GIDE+NTVID  GL  IHMP Y +++ KG++T+MVSYS+WNG++MHAN+EL+TGFLK  LKF+  +I D
Subjt:  GGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD

AT5G20950.2 Glycosyl hydrolase family protein (e-value: 3.4e-120; identity: 70%)
Query:  LKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNN
        LKYKDPKQP+  R++DL+ RMTL+EKIGQM QI+R VA   VMK YFIGSVL+GGG+     A  + WVNM+NEIQK SLS+RLGIPM+YG+DAVHGHNN
Subjt:  LKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIGSVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNN

Query:  AYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYV
         Y ATIFPHNVGLG TR       +P LV+RIGAATALEVRATGI +AF+PCIAVCRDPRWGRCYESYSED +IVQ+MTEIIPGLQG+ P K RKG+P+V
Subjt:  AYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYV

Query:  GGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD
        GG  KV ACAKHFVGDGGT  GIDE+NTVID  GL  IHMP Y +++ KG++T+MVSYS+WNG++MHAN+EL+TGFLK  LKF+  +I D
Subjt:  GGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHANRELITGFLKGTLKFKVCLILD


Sequences
CDS sequence
ATGCTTAAATTATTGACTAAAATTTCACATCAATTCGCACTGCTTAAATTAAAACTACCATGGAAATGTAAAGAGAGTGGAGTAACCTCTCAGAGATGGAAGATGGCCAA
GATTTTTGTTCAGGTGGTTATGACTCTGTGCCTGGGATGGTGGTTGTGGGCAGCAATGGTGGACGGGGACAACTTGAAATACAAAGACCCTAAGCAACCAGTTGCTGTTC
GAGTTAAGGACCTTCTTGGCCGAATGACTCTGGAAGAGAAAATTGGTCAGATGGCTCAGATTGACAGGGGCGTTGCCAATGCTACAGTTATGAAAAATTATTTCATCGGA
AGTGTGCTAACTGGTGGTGGAACTGAGCTACTTCCAGATGCTCGTGCTCAAGACTGGGTTAACATGATAAATGAAATCCAGAAAGGTTCTCTTTCTAGCCGATTGGGTAT
ACCGATGATGTATGGTGTTGATGCTGTTCATGGCCATAACAATGCTTACAATGCTACAATATTTCCTCATAATGTTGGACTTGGAGCTACCAGGCAAGTCATTTTCTTCT
CCTTAAACCCTGGCCTAGTTCGAAGGATTGGTGCTGCAACGGCACTTGAAGTTAGAGCGACGGGGATTTCTTTTGCTTTTTCTCCATGCATTGCGGTTTGTAGGGACCCA
AGGTGGGGGCGGTGTTATGAAAGTTACAGTGAAGATCCAAAAATTGTGCAAGAAATGACCGAGATTATACCTGGTTTGCAAGGAGAGCCTCCTGCTAAATATCGGAAGGG
GATTCCATATGTTGGTGGAAGTCAAAAGGTTGTCGCCTGTGCAAAGCACTTTGTTGGAGATGGTGGGACAACTAATGGCATCGATGAGAGCAATACAGTTATTGACAAAC
ATGGACTGCTCAGCATTCACATGCCTGCCTACATTGATTCGATCTTCAAGGGTATTTCGACAGTAATGGTTTCCTATTCTAGTTGGAATGGAGTGAAGATGCATGCAAAC
CGTGAGCTGATAACTGGCTTCCTCAAGGGCACCCTTAAATTTAAGGTATGTCTGATCTTAGATTTGTTTTGA
mRNA sequence
CTTTAATCTTTCTTTGGCTTCTTAATCTGATTTTACCATTAGCACTTTCAGGATGCTTAAATTATTGACTAAAATTTCACATCAATTCGCACTGCTTAAATTAAAACTAC
CATGGAAATGTAAAGAGAGTGGAGTAACCTCTCAGAGATGGAAGATGGCCAAGATTTTTGTTCAGGTGGTTATGACTCTGTGCCTGGGATGGTGGTTGTGGGCAGCAATG
GTGGACGGGGACAACTTGAAATACAAAGACCCTAAGCAACCAGTTGCTGTTCGAGTTAAGGACCTTCTTGGCCGAATGACTCTGGAAGAGAAAATTGGTCAGATGGCTCA
GATTGACAGGGGCGTTGCCAATGCTACAGTTATGAAAAATTATTTCATCGGAAGTGTGCTAACTGGTGGTGGAACTGAGCTACTTCCAGATGCTCGTGCTCAAGACTGGG
TTAACATGATAAATGAAATCCAGAAAGGTTCTCTTTCTAGCCGATTGGGTATACCGATGATGTATGGTGTTGATGCTGTTCATGGCCATAACAATGCTTACAATGCTACA
ATATTTCCTCATAATGTTGGACTTGGAGCTACCAGGCAAGTCATTTTCTTCTCCTTAAACCCTGGCCTAGTTCGAAGGATTGGTGCTGCAACGGCACTTGAAGTTAGAGC
GACGGGGATTTCTTTTGCTTTTTCTCCATGCATTGCGGTTTGTAGGGACCCAAGGTGGGGGCGGTGTTATGAAAGTTACAGTGAAGATCCAAAAATTGTGCAAGAAATGA
CCGAGATTATACCTGGTTTGCAAGGAGAGCCTCCTGCTAAATATCGGAAGGGGATTCCATATGTTGGTGGAAGTCAAAAGGTTGTCGCCTGTGCAAAGCACTTTGTTGGA
GATGGTGGGACAACTAATGGCATCGATGAGAGCAATACAGTTATTGACAAACATGGACTGCTCAGCATTCACATGCCTGCCTACATTGATTCGATCTTCAAGGGTATTTC
GACAGTAATGGTTTCCTATTCTAGTTGGAATGGAGTGAAGATGCATGCAAACCGTGAGCTGATAACTGGCTTCCTCAAGGGCACCCTTAAATTTAAGGTATGTCTGATCT
TAGATTTGTTTTGA
Protein sequence
MLKLLTKISHQFALLKLKLPWKCKESGVTSQRWKMAKIFVQVVMTLCLGWWLWAAMVDGDNLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMAQIDRGVANATVMKNYFIG
SVLTGGGTELLPDARAQDWVNMINEIQKGSLSSRLGIPMMYGVDAVHGHNNAYNATIFPHNVGLGATRQVIFFSLNPGLVRRIGAATALEVRATGISFAFSPCIAVCRDP
RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPAKYRKGIPYVGGSQKVVACAKHFVGDGGTTNGIDESNTVIDKHGLLSIHMPAYIDSIFKGISTVMVSYSSWNGVKMHAN
RELITGFLKGTLKFKVCLILDLF
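
The CDS and protein sequences above should agree under the standard genetic code. The following is a minimal sketch of how that could be checked, assuming the standard nuclear codon table and a CDS that starts in frame; the `translate` helper is illustrative and is not part of CuGenDB. Only the first and last codons of the CDS are shown here; the full sequence can be passed in the same way after joining its lines.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 14 codons and last 4 codons of the CDS listed above.
print(translate("ATGCTTAAATTATTGACTAAAATTTCACATCAATTCGCACTG"))
print(translate("GATTTGTTTTGA"))
```

The two fragments translate to the first 14 residues (MLKLLTKISHQFAL) and the last three residues (DLF) of the protein sequence listed above, with the final TGA read as the stop codon.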