; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G001790 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G001790
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionGlycosyl hydrolase family protein
Genome locationchr04:1930903..1934620
RNA-Seq ExpressionLsi04G001790
SyntenyLsi04G001790
Gene Ontology termsGO:0009251 - glucan catabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domainsIPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR017853 - Glycoside hydrolase superfamily
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135054.1 uncharacterized protein LOC101209588 isoform X2 [Cucumis sativus]2.5e-28692.38Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        MAKIFVQVV ILCLGWLWW TMVDAENLKYKDPK+PV VRVKDLLGRMTL+EKIGQM+QIDRSVANATVMKDYFIGSVLSGGGSVPL DARAEDWVNMIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR      NPDL RRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        VKEMTEIIIGLQGEPPA+YRKG PYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHAN ELIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
         FLKGALKFKGFVISDWEGLDRITSTPHSNYTYS+QA+ILAGIDMVMIPYKYAEFI+D+ FLVK+N IPMDRIDDAV RIL+VKFTMGLFESP+ DYSLV
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG
        NELGSQAHR+LARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNN TRGT+ILA+IKSTVDPSTEVVFREDPD 
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG

Query:  DFVKSNDFSYAIVVIGEAPYAETGG
        DFVKS+DFSYAIVVIGEAPYAETGG
Subjt:  DFVKSNDFSYAIVVIGEAPYAETGG

XP_008446716.1 PREDICTED: beta-glucosidase BoGH3B isoform X1 [Cucumis melo]2.9e-29093.71Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        MAKIFVQVV ILCLGWLWW TMVDAENLKYKDPK+PV VRVKDLLGRMTL+EKIGQM+QIDRSVANATVMKDYFIGS+LSGGGSVPL DARAEDWV+MIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR      NPDLARRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        VKEMTEIIIGLQGEPPA+YRKG PYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHAN ELIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
         FLKGALKFKGFVISDWEGLDRITSTPHSNYTYS+QAAILAGIDMVMIPYKYAEFI+DL FLVKSNVIPMDRIDDAVGRIL+VKFTMGLFESP++DYSLV
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG
        NELGSQAHR+LARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGT+ILAAIKSTVDPSTEVVFREDPD 
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG

Query:  DFVKSNDFSYAIVVIGEAPYAETGG
        DFVKSNDFSYAIVVIGEAPYAETGG
Subjt:  DFVKSNDFSYAIVVIGEAPYAETGG

XP_008446717.1 PREDICTED: beta-glucosidase BoGH3B isoform X2 [Cucumis melo]2.9e-29093.71Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        MAKIFVQVV ILCLGWLWW TMVDAENLKYKDPK+PV VRVKDLLGRMTL+EKIGQM+QIDRSVANATVMKDYFIGS+LSGGGSVPL DARAEDWV+MIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR      NPDLARRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        VKEMTEIIIGLQGEPPA+YRKG PYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHAN ELIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
         FLKGALKFKGFVISDWEGLDRITSTPHSNYTYS+QAAILAGIDMVMIPYKYAEFI+DL FLVKSNVIPMDRIDDAVGRIL+VKFTMGLFESP++DYSLV
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG
        NELGSQAHR+LARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGT+ILAAIKSTVDPSTEVVFREDPD 
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG

Query:  DFVKSNDFSYAIVVIGEAPYAETGG
        DFVKSNDFSYAIVVIGEAPYAETGG
Subjt:  DFVKSNDFSYAIVVIGEAPYAETGG

XP_038892436.1 beta-glucosidase BoGH3B-like isoform X1 [Benincasa hispida]6.9e-29295.25Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        MAKIFVQVV ILCLGWLWWVTMVDAENLKYKDPK+ VAVRVKDLLGRMT++EKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPL DARAEDWVNMIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR      NPDLARRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        VKEMTEIIIGLQGEPPA+YRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHAN ELIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
         FLKG LK+KGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFI+DLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNP-LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPD
        NELGSQAHR+LARDAVRQSLVLLKNGKNDSNP LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGT+ILAAIKSTVDPSTEVVFREDPD
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNP-LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPD

Query:  GDFVKSNDFSYAIVVIGEAPYAETGG
         DFVKSNDFSYAIVVIGEAPYAETGG
Subjt:  GDFVKSNDFSYAIVVIGEAPYAETGG

XP_038892438.1 beta-glucosidase BoGH3B-like isoform X2 [Benincasa hispida]6.9e-29295.25Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        MAKIFVQVV ILCLGWLWWVTMVDAENLKYKDPK+ VAVRVKDLLGRMT++EKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPL DARAEDWVNMIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR      NPDLARRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        VKEMTEIIIGLQGEPPA+YRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHAN ELIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
         FLKG LK+KGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFI+DLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNP-LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPD
        NELGSQAHR+LARDAVRQSLVLLKNGKNDSNP LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGT+ILAAIKSTVDPSTEVVFREDPD
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNP-LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPD

Query:  GDFVKSNDFSYAIVVIGEAPYAETGG
         DFVKSNDFSYAIVVIGEAPYAETGG
Subjt:  GDFVKSNDFSYAIVVIGEAPYAETGG

TrEMBL top hitse value%identityAlignment
A0A0A0KRA3 Uncharacterized protein1.2e-28692.38Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        MAKIFVQVV ILCLGWLWW TMVDAENLKYKDPK+PV VRVKDLLGRMTL+EKIGQM+QIDRSVANATVMKDYFIGSVLSGGGSVPL DARAEDWVNMIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR      NPDL RRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        VKEMTEIIIGLQGEPPA+YRKG PYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHAN ELIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
         FLKGALKFKGFVISDWEGLDRITSTPHSNYTYS+QA+ILAGIDMVMIPYKYAEFI+D+ FLVK+N IPMDRIDDAV RIL+VKFTMGLFESP+ DYSLV
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG
        NELGSQAHR+LARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNN TRGT+ILA+IKSTVDPSTEVVFREDPD 
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG

Query:  DFVKSNDFSYAIVVIGEAPYAETGG
        DFVKS+DFSYAIVVIGEAPYAETGG
Subjt:  DFVKSNDFSYAIVVIGEAPYAETGG

A0A1S3BGE4 beta-glucosidase BoGH3B isoform X11.4e-29093.71Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        MAKIFVQVV ILCLGWLWW TMVDAENLKYKDPK+PV VRVKDLLGRMTL+EKIGQM+QIDRSVANATVMKDYFIGS+LSGGGSVPL DARAEDWV+MIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR      NPDLARRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        VKEMTEIIIGLQGEPPA+YRKG PYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHAN ELIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
         FLKGALKFKGFVISDWEGLDRITSTPHSNYTYS+QAAILAGIDMVMIPYKYAEFI+DL FLVKSNVIPMDRIDDAVGRIL+VKFTMGLFESP++DYSLV
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG
        NELGSQAHR+LARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGT+ILAAIKSTVDPSTEVVFREDPD 
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG

Query:  DFVKSNDFSYAIVVIGEAPYAETGG
        DFVKSNDFSYAIVVIGEAPYAETGG
Subjt:  DFVKSNDFSYAIVVIGEAPYAETGG

A0A1S3BGJ8 beta-glucosidase BoGH3B isoform X21.4e-29093.71Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        MAKIFVQVV ILCLGWLWW TMVDAENLKYKDPK+PV VRVKDLLGRMTL+EKIGQM+QIDRSVANATVMKDYFIGS+LSGGGSVPL DARAEDWV+MIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR      NPDLARRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        VKEMTEIIIGLQGEPPA+YRKG PYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHAN ELIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
         FLKGALKFKGFVISDWEGLDRITSTPHSNYTYS+QAAILAGIDMVMIPYKYAEFI+DL FLVKSNVIPMDRIDDAVGRIL+VKFTMGLFESP++DYSLV
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG
        NELGSQAHR+LARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGT+ILAAIKSTVDPSTEVVFREDPD 
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG

Query:  DFVKSNDFSYAIVVIGEAPYAETGG
        DFVKSNDFSYAIVVIGEAPYAETGG
Subjt:  DFVKSNDFSYAIVVIGEAPYAETGG

A0A5A7SVG7 Beta-glucosidase BoGH3B isoform X13.6e-28685.4Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIG------------------------
        MAKIFVQVV ILCLGWLWW  MVDAENLKYKDPK+PV VRVKDLLGRMTL+EKIGQM+QIDRSVANATVMKDYFIG                        
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIG------------------------

Query:  --------------------------------SVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR
                                        S+LSGGGSVPL DARAEDWV+MINDFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR
Subjt:  --------------------------------SVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATR

Query:  QV-FFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDG
        QV FFSLNPDLARRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+VKEMTEIIIGLQGEPPA+YRKG PYVGGTKKVIACAKHFVGDG
Subjt:  QV-FFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDG

Query:  GTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGI
        GTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHAN ELIT FLKGALKFKGFVISDWEGLDRITSTPHSNYTYS+QAAILAGI
Subjt:  GTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGI

Query:  DMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKI
        DMVMIPYKYAEFI+DL FLVKSNVIPMDRIDDAVGRIL+VKFTMGLFESP++DYSLVNELGSQAHR+LARDAVRQSLVLLKNGKNDS PLLPLSKK+PKI
Subjt:  DMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKI

Query:  LVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKSNDFSYAIVVIGEAPYAETGG
        LVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGT+ILAAIKSTVDPSTEVVFREDPD DFVKSNDFSYAIVVIGEAPYAETGG
Subjt:  LVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKSNDFSYAIVVIGEAPYAETGG

A0A6J1HZJ2 uncharacterized protein LOC111467651 isoform X17.7e-28190.67Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        MAKIFVQVV ILCLGW WW  MVDAENLKYKDPK+PV+VRVKDLLGRMTL+EKIGQM+QIDRSVANATVMK+YFIGSVLSGGGSVPL DARA+DWV+MIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        DFQKGSLSSRLGIPM YGIDAVHGHNNVYNATVFPHNVGLGATR      NPDL RRIGAATALEVRATGISYTFAPC+AVCRDPRWGRCYESYSEDPK+
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        V+ MTEIIIGLQGEPPA+YRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVI RHGLL IHMPAYLDSIIKGVSSVM SYSSWNGVKMHAN +LIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
         FLKG LKFKGFVISDWEGLDRITSTPHSNYTYS+QAAI AGIDMVM+PYKYAEFI+DL  LVK+NV+PMDRIDDAV RILSVKFTMGLFESPL DYSLV
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG
        NELGSQAHR+LARDAVRQSLVLLKNGKNDS+PLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNN TRGT+ILAAIKSTVDPSTEVVFREDPD 
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG

Query:  DFVKSNDFSYAIVVIGEAPYAETGG
        DFVKSN FSYAIVVIGEAPYAETGG
Subjt:  DFVKSNDFSYAIVVIGEAPYAETGG

SwissProt top hitse value%identityAlignment
A7LXU3 Beta-glucosidase BoGH3B7.2e-6634.81Show/hide
Query:  VAVRVKDLLGRMTLQEKIGQMIQIDRSVAN-----------------ATVMKDYFIGSVLSGGGSVPLFDA-RAEDWVNMINDFQKGSLSSRLGIPMFYG
        +   +++ L +MTL++KIGQM +I   V +                  TV+  Y +GS+L    +VPL  A + E W   I   Q+ S+   +GIP  YG
Subjt:  VAVRVKDLLGRMTLQEKIGQMIQIDRSVAN-----------------ATVMKDYFIGSVLSGGGSVPLFDA-RAEDWVNMINDFQKGSLSSRLGIPMFYG

Query:  IDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEM-TEIIIGLQGEPPA
        +D +HG     + T+FP  + +GAT       N +L RR    +A E +A  I +TFAP + + RDPRW R +E+Y ED  V  EM    + G QGE P 
Subjt:  IDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEM-TEIIIGLQGEPPA

Query:  DYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDW
                  G   V AC KH++G G    G +   + ISR  +   H   +L ++ +G  SVM +    NG+  HAN EL+T +LK  L + G +++DW
Subjt:  DYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDW

Query:  EGLDRITSTPHSNYT--YSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRELARDA
          ++ + +  H   T   +++  I AGIDM M+PY+   F + L  LV+   + M+RIDDAV R+L +K+ +GLF+ P  D    ++ GS+    +A  A
Subjt:  EGLDRITSTPHSNYT--YSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRELARDA

Query:  VRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQG
          +S VLLKN  N    +LP++ K  KIL+ G +A+++    GGW+ +WQG
Subjt:  VRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQG

P33363 Periplasmic beta-glucosidase2.4e-4530.45Show/hide
Query:  AENLKYKDPKRPVA--VRVKDLLGRMTLQEKIGQMIQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLFDARA-EDWVNMINDFQKGSLSSRLGIPMF
        A++L    P  P A    V +LL +MT+ EKIGQ+  I     N       ++KD  +G++ +   +V   D RA +D V  +         SRL IP+F
Subjt:  AENLKYKDPKRPVA--VRVKDLLGRMTLQEKIGQMIQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLFDARA-EDWVNMINDFQKGSLSSRLGIPMF

Query:  YGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIII-GLQGEP
        +  D +HG       TVFP ++GL +      S N D  + +G  +A E    G++ T+AP + V RDPRWGR  E + ED  +   M + ++  +QG+ 
Subjt:  YGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIII-GLQGEP

Query:  PADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVIS
        PAD             V+   KHF   G    G   N   +S   L + +MP Y   +  G  +VM + +S NG    ++  L+   L+    FKG  +S
Subjt:  PADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVIS

Query:  DWEGL-DRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYS------LVNELGSQAH
        D   + + I     ++   +++ A+ +GI+M M    Y+++   L  L+KS  + M  +DDA   +L+VK+ MGLF  P S         +     S+ H
Subjt:  DWEGL-DRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYS------LVNELGSQAH

Query:  RELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVF
        R+ AR+  R+SLVLLKN        LPL KK+  I V G  AD+     G W+ A          +  ++L  IK+ V  + +V++
Subjt:  RELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVF

Q23892 Lysosomal beta glucosidase7.7e-6031.05Show/hide
Query:  VKDLLGRMTLQEKIGQMIQIDRS--------VANATVM----KDYFIGSVL----SGGGSVPLFDARAEDWVNMINDFQKGSL-SSRLGIPMFYGIDAVH
        V +L+ +M++ EKIGQM Q+D +          N T +    K Y+IGS L    SGG +  +    +  W++MIN  Q   +  S   IPM YG+D+VH
Subjt:  VKDLLGRMTLQEKIGQMIQIDRS--------VANATVM----KDYFIGSVL----SGGGSVPLFDARAEDWVNMINDFQKGSL-SSRLGIPMFYGIDAVH

Query:  GHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEM-TEIIIGLQGEPPADYRKG
        G N V+ AT+FPHN GL AT       N + A      T+ +  A GI + FAP + +   P W R YE++ EDP V   M    + G QG         
Subjt:  GHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEM-TEIIIGLQGEPPADYRKG

Query:  IPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSII-KGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDWEGLD
                  +  AKH+ G    T G +     I    L    +P++ ++I   G  ++M +    NGV MH +++ +T  L+G L+F+G  ++DW+ ++
Subjt:  IPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSII-KGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDWEGLD

Query:  RITSTPHS--NYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPL--SDYSLVNELGSQAHRELARDAVR
        ++    H+  +   +I  A+ AGIDM M+P   + F   L  +V +  +P  R+D +V RIL++K+ +GLF +P    + ++V+ +G    RE A     
Subjt:  RITSTPHS--NYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPL--SDYSLVNELGSQAHRELARDAVR

Query:  QSLVLLKNGKNDSNPLLPLSKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNGTRGTSILAAIKSTVDPSTEVVFRE---------------DPDG
        +S+ LL+N     N +LPL+    K +L+ G  AD++    GGW++ WQG +  +    GTSIL  ++   + + +   +                D   
Subjt:  QSLVLLKNGKNDSNPLLPLSKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNGTRGTSILAAIKSTVDPSTEVVFRE---------------DPDG

Query:  DFVKSNDFSYAIVVIGEAPYAETGG
        +  +S+D    +VVIGE P AET G
Subjt:  DFVKSNDFSYAIVVIGEAPYAETGG

Q56078 Periplasmic beta-glucosidase5.2e-4830.83Show/hide
Query:  AENLKYKDPKRPVA--VRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSRLGIPMFYGIDAV
        AENL    P  P A    V DLL +MT+ EKIGQ+  I     N        I  ++  G    +F+      +  + D  +    SRL IP+F+  D V
Subjt:  AENLKYKDPKRPVA--VRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSRLGIPMFYGIDAV

Query:  HGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIII-GLQGEPPADYRK
        HG       TVFP ++GL +      S N D  R +G  +A E    G++ T+AP + V RDPRWGR  E + ED  +   M E ++  +QG+ PAD   
Subjt:  HGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIII-GLQGEPPADYRK

Query:  GIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDWEGL-
                  V+   KHF   G    G   N   +S   L + +MP Y   +  G  +VM + +S NG    ++  L+   L+    FKG  +SD   + 
Subjt:  GIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDWEGL-

Query:  DRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYS------LVNELGSQAHRELARD
        + I     ++   +++ A+ AG+DM M    Y+++   L  L+KS  + M  +DDA   +L+VK+ MGLF  P S         +     S+ HR+ AR+
Subjt:  DRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYS------LVNELGSQAHRELARD

Query:  AVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVF
          R+S+VLLKN        LPL KK+  I V G  AD+     G W+ A          +  ++LA I++ V    ++++
Subjt:  AVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVF

T2KMH0 Beta-xylosidase2.0e-3932.32Show/hide
Query:  QKGSLSSRLGIP-MFYGIDAVHGHNNVY----NATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAV-CRDPRWGRCYESYSE
        Q    + RLGIP M YG +A+HG   V     N TV+P  V   +T +      P+L +++ + TA E RA G+++ ++P + V   D R+GR  ESY E
Subjt:  QKGSLSSRLGIP-MFYGIDAVHGHNNVY----NATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAV-CRDPRWGRCYESYSE

Query:  DPKVVKEM-TEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIK-GVSSVMASYSSWNGVKMHA
        DP +V  M    I GLQG     + +          VIA AKHFVG      GIN   + +S   L  +++P +  ++ + GV SVM  +  +NGV  H 
Subjt:  DPKVVKEM-TEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIK-GVSSVMASYSSWNGVKMHA

Query:  NHELITGFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAIL---AGIDMVMIPYKYAEFINDLTFLVKSNVIP----MDRIDDAVGRILSVKFTM
        N  L+   L+  L F GF++SD   + R+  T H       +AAIL   AG+DM ++  K  E     T ++K  ++     M  ID A  RIL+ K+ +
Subjt:  NHELITGFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAIL---AGIDMVMIPYKYAEFINDLTFLVKSNVIP----MDRIDDAVGRILSVKFTM

Query:  GLFES-PLSDYSLVNELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLS-KKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKST
        GLF++ P    +   E G+  HRE A +   +S+++LKN  N    LLPL   K   + V G +A     + G + +   G+SG       S+L  +K  
Subjt:  GLFES-PLSDYSLVNELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLS-KKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKST

Query:  VDPSTEVVFREDPDGDFVKSNDFSYAI
        V    ++ + +  D D      F  AI
Subjt:  VDPSTEVVFREDPDGDFVKSNDFSYAI

Arabidopsis top hitse value%identityAlignment
AT3G47000.1 Glycosyl hydrolase family protein1.1e-17559.41Show/hide
Query:  MVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSRLGIPMFYGIDA
        +V+  +  YK+   PV  RVKDLL RMTL EKIGQM QI+R VA+ +   D+FIGSVL+ GGSVP  DA++ DW +MI+ FQ+ +L+SRLGIP+ YG DA
Subjt:  MVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSRLGIPMFYGIDA

Query:  VHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIIIGLQGEPPADYRK
        VHG+NNVY ATVFPHN+GLGATR      + DL RRIGAATALEVRA+G+ + F+PC+AV RDPRWGRCYESY EDP++V EMT ++ GLQG PP ++  
Subjt:  VHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIIIGLQGEPPADYRK

Query:  GIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDWEGLD
        G P+V G   V+AC KHFVGDGGT  GINE NT+ S   L  IH+P YL  + +GVS+VMASYSSWNG ++HA+  L+T  LK  L FKGF++SDWEGLD
Subjt:  GIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDWEGLD

Query:  RITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRELARDAVRQSLV
        R++    SNY Y I+ A+ AGIDMVM+P+KY +FI D+T LV+S  IPM RI+DAV RIL VKF  GLF  PL+D SL+  +G + HRELA++AVR+SLV
Subjt:  RITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRELARDAVRQSLV

Query:  LLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKSND-FSYAIVVIGEAPY
        LLK+GKN   P LPL + A +ILV GTHAD+LGYQCGGWT  W G SG   T GT++L AIK  V   TEV++ + P  + + S++ FSYAIV +GE PY
Subjt:  LLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKSND-FSYAIVVIGEAPY

Query:  AETGG
        AET G
Subjt:  AETGG

AT5G04885.1 Glycosyl hydrolase family protein1.8e-22971.24Show/hide
Query:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN
        M++  V++V +L    +W     D E L YKDPK+ V+ RV DL GRMTL+EKIGQM+QIDRSVA   +M+DYFIGSVLSGGGS PL +A A++WV+MIN
Subjt:  MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMIN

Query:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV
        ++QKG+L SRLGIPM YGIDAVHGHNNVYNAT+FPHNVGLGATR      +PDL +RIGAATA+EVRATGI YTFAPCIAVCRDPRWGRCYESYSED KV
Subjt:  DFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKV

Query:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT
        V++MT++I+GLQGEPP++Y+ G+P+VGG  KV ACAKH+VGDGGTT G+NENNTV   HGLLS+HMPAY D++ KGVS+VM SYSSWNG KMHAN ELIT
Subjt:  VKEMTEIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELIT

Query:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV
        G+LKG LKFKGFVISDW+G+D+I++ PH++YT S++AAI AGIDMVM+P+ + EF+NDLT LVK+N IP+ RIDDAV RIL VKFTMGLFE+PL+DYS  
Subjt:  GFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLV

Query:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG
        +ELGSQAHR+LAR+AVR+SLVLLKNG N +NP+LPL +K  KILVAGTHADNLGYQCGGWTI WQGFSGN  TRGT++L+A+KS VD STEVVFRE+PD 
Subjt:  NELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDG

Query:  DFVKSNDFSYAIVVIGEAPYAETGG
        +F+KSN+F+YAI+ +GE PYAET G
Subjt:  DFVKSNDFSYAIVVIGEAPYAETGG

AT5G20940.1 Glycosyl hydrolase family protein1.2e-19363.85Show/hide
Query:  VQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKG
        +Q + +L L        V   N KYKDPK P+ VR+K+L+  MTL+EKIGQM+Q++R  A   VM+ YF+GSV SGGGSVP      E WVNM+N+ QK 
Subjt:  VQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKG

Query:  SLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMT
        +LS+RLGIP+ YGIDAVHGHN VYNAT+FPHNVGLG TR      +P L +RIG ATALEVRATGI Y FAPCIAVCRDPRWGRCYESYSED K+V++MT
Subjt:  SLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMT

Query:  EIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKG
        EII GLQG+ P   +KG+P+V G  KV ACAKHFVGDGGT  G+N NNTVI+ +GLL IHMPAY D++ KGV++VM SYSS NG+KMHAN +LITGFLK 
Subjt:  EIIIGLQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKG

Query:  ALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGS
         LKF+G VISD+ G+D+I +   +NY++S+ AA  AG+DM M      + I++LT  VK   IPM RIDDAV RIL VKFTMGLFE+P++D+SL  +LGS
Subjt:  ALKFKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGS

Query:  QAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKS
        + HRELAR+AVR+SLVLLKNG+N   PLLPL KKA KILVAGTHADNLGYQCGGWTI WQG +GNN T GT+ILAA+K TVDP T+V++ ++PD +FVK+
Subjt:  QAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKS

Query:  NDFSYAIVVIGEAPYAETGG
         DF YAIV +GE PYAE  G
Subjt:  NDFSYAIVVIGEAPYAETGG

AT5G20950.1 Glycosyl hydrolase family protein3.2e-21067.18Show/hide
Query:  ILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSR
        +LCL  L  +       LKYKDPK+P+  R++DL+ RMTLQEKIGQM+QI+RSVA   VMK YFIGSVLSGGGSVP   A  E WVNM+N+ QK SLS+R
Subjt:  ILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSR

Query:  LGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIIIG
        LGIPM YGIDAVHGHNNVY AT+FPHNVGLG TR      +P+L +RIGAATALEVRATGI Y FAPCIAVCRDPRWGRCYESYSED ++V++MTEII G
Subjt:  LGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIIIG

Query:  LQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFK
        LQG+ P   RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVI   GL  IHMP Y +++ KGV+++M SYS+WNG++MHAN EL+TGFLK  LKF+
Subjt:  LQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFK

Query:  GFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRE
        GFVISDW+G+DRIT+ PH NY+YS+ A I AGIDM+M+PY Y EFI++++  ++  +IP+ RIDDA+ RIL VKFTMGLFE PL+D S  N+LGS+ HRE
Subjt:  GFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRE

Query:  LARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKSNDFSY
        LAR+AVR+SLVLLKNGK  + PLLPL KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P+T+VV+ ++PD +FVKS  F Y
Subjt:  LARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKSNDFSY

Query:  AIVVIGEAPYAETGG
        AIVV+GE PYAE  G
Subjt:  AIVVIGEAPYAETGG

AT5G20950.2 Glycosyl hydrolase family protein3.2e-21067.18Show/hide
Query:  ILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSR
        +LCL  L  +       LKYKDPK+P+  R++DL+ RMTLQEKIGQM+QI+RSVA   VMK YFIGSVLSGGGSVP   A  E WVNM+N+ QK SLS+R
Subjt:  ILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSR

Query:  LGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIIIG
        LGIPM YGIDAVHGHNNVY AT+FPHNVGLG TR      +P+L +RIGAATALEVRATGI Y FAPCIAVCRDPRWGRCYESYSED ++V++MTEII G
Subjt:  LGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIIIG

Query:  LQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFK
        LQG+ P   RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVI   GL  IHMP Y +++ KGV+++M SYS+WNG++MHAN EL+TGFLK  LKF+
Subjt:  LQGEPPADYRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFK

Query:  GFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRE
        GFVISDW+G+DRIT+ PH NY+YS+ A I AGIDM+M+PY Y EFI++++  ++  +IP+ RIDDA+ RIL VKFTMGLFE PL+D S  N+LGS+ HRE
Subjt:  GFVISDWEGLDRITSTPHSNYTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRE

Query:  LARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKSNDFSY
        LAR+AVR+SLVLLKNGK  + PLLPL KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P+T+VV+ ++PD +FVKS  F Y
Subjt:  LARDAVRQSLVLLKNGKNDSNPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKSNDFSY

Query:  AIVVIGEAPYAETGG
        AIVV+GE PYAE  G
Subjt:  AIVVIGEAPYAETGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAGATTTTTGTTCAGGTGGTTGCGATTCTGTGTTTGGGTTGGTTGTGGTGGGTGACAATGGTGGACGCGGAGAACTTGAAATACAAAGACCCTAAACGACCAGT
TGCTGTTCGAGTTAAGGACCTTCTTGGCCGAATGACTCTGCAAGAGAAAATTGGTCAGATGATTCAGATTGACAGGAGTGTTGCCAATGCTACAGTTATGAAAGATTATT
TCATTGGAAGTGTGCTAAGTGGTGGTGGAAGTGTGCCGCTTTTTGATGCTCGTGCTGAAGATTGGGTTAACATGATAAATGATTTCCAAAAGGGTTCTCTTTCTAGTCGA
TTGGGTATCCCAATGTTTTATGGCATTGATGCTGTTCATGGCCATAACAATGTTTACAATGCTACAGTATTTCCTCATAATGTTGGACTTGGAGCTACAAGGCAAGTATT
TTTTTCTTTAAACCCTGACCTAGCTAGAAGGATTGGTGCTGCAACGGCACTTGAAGTTAGAGCGACAGGGATTTCTTATACCTTTGCTCCATGCATTGCGGTTTGTAGGG
ACCCGAGGTGGGGGCGGTGTTATGAAAGCTACAGTGAGGATCCAAAAGTTGTGAAAGAAATGACCGAGATTATAATTGGTCTGCAAGGAGAGCCTCCTGCTGATTATCGG
AAGGGGATTCCATATGTTGGCGGAACTAAAAAGGTTATTGCCTGTGCAAAGCACTTTGTTGGAGATGGTGGGACAACTCATGGTATCAATGAGAATAATACTGTTATTAG
CAGGCATGGACTGCTCAGCATTCACATGCCTGCCTATTTAGATTCGATCATAAAGGGTGTTTCATCAGTAATGGCTTCTTACTCTAGTTGGAATGGAGTAAAGATGCATG
CAAACCATGAGCTGATTACAGGCTTCCTCAAGGGCGCCCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGAATTACTTCTACACCACATTCTAAT
TACACATACTCTATTCAAGCTGCAATTTTAGCTGGCATTGATATGGTCATGATTCCTTACAAGTATGCAGAGTTCATTAATGATCTTACATTTCTAGTGAAGAGCAATGT
CATCCCAATGGATCGCATTGATGATGCTGTTGGGAGAATTTTATCAGTCAAGTTCACAATGGGTCTTTTTGAAAGCCCCTTGAGTGATTACAGCCTCGTCAATGAGCTTG
GGAGCCAGGCACATAGAGAGTTGGCAAGAGATGCTGTGAGGCAGTCACTTGTACTGCTGAAGAATGGGAAAAATGATAGCAACCCATTGCTACCCCTTTCAAAGAAGGCC
CCAAAGATCCTTGTTGCTGGCACTCACGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATAGCATGGCAAGGGTTCAGTGGCAACAATGGTACAAGGGGAACTAG
CATCCTCGCTGCCATTAAATCGACAGTCGATCCAAGCACAGAGGTTGTATTCCGTGAGGATCCTGATGGTGACTTTGTTAAGTCCAATGACTTCTCATATGCCATAGTTG
TTATTGGTGAAGCCCCATATGCTGAGACTGGAGGGGNATATTGTATCAAAGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAAGATTTTTGTTCAGGTGGTTGCGATTCTGTGTTTGGGTTGGTTGTGGTGGGTGACAATGGTGGACGCGGAGAACTTGAAATACAAAGACCCTAAACGACCAGT
TGCTGTTCGAGTTAAGGACCTTCTTGGCCGAATGACTCTGCAAGAGAAAATTGGTCAGATGATTCAGATTGACAGGAGTGTTGCCAATGCTACAGTTATGAAAGATTATT
TCATTGGAAGTGTGCTAAGTGGTGGTGGAAGTGTGCCGCTTTTTGATGCTCGTGCTGAAGATTGGGTTAACATGATAAATGATTTCCAAAAGGGTTCTCTTTCTAGTCGA
TTGGGTATCCCAATGTTTTATGGCATTGATGCTGTTCATGGCCATAACAATGTTTACAATGCTACAGTATTTCCTCATAATGTTGGACTTGGAGCTACAAGGCAAGTATT
TTTTTCTTTAAACCCTGACCTAGCTAGAAGGATTGGTGCTGCAACGGCACTTGAAGTTAGAGCGACAGGGATTTCTTATACCTTTGCTCCATGCATTGCGGTTTGTAGGG
ACCCGAGGTGGGGGCGGTGTTATGAAAGCTACAGTGAGGATCCAAAAGTTGTGAAAGAAATGACCGAGATTATAATTGGTCTGCAAGGAGAGCCTCCTGCTGATTATCGG
AAGGGGATTCCATATGTTGGCGGAACTAAAAAGGTTATTGCCTGTGCAAAGCACTTTGTTGGAGATGGTGGGACAACTCATGGTATCAATGAGAATAATACTGTTATTAG
CAGGCATGGACTGCTCAGCATTCACATGCCTGCCTATTTAGATTCGATCATAAAGGGTGTTTCATCAGTAATGGCTTCTTACTCTAGTTGGAATGGAGTAAAGATGCATG
CAAACCATGAGCTGATTACAGGCTTCCTCAAGGGCGCCCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGAATTACTTCTACACCACATTCTAAT
TACACATACTCTATTCAAGCTGCAATTTTAGCTGGCATTGATATGGTCATGATTCCTTACAAGTATGCAGAGTTCATTAATGATCTTACATTTCTAGTGAAGAGCAATGT
CATCCCAATGGATCGCATTGATGATGCTGTTGGGAGAATTTTATCAGTCAAGTTCACAATGGGTCTTTTTGAAAGCCCCTTGAGTGATTACAGCCTCGTCAATGAGCTTG
GGAGCCAGGCACATAGAGAGTTGGCAAGAGATGCTGTGAGGCAGTCACTTGTACTGCTGAAGAATGGGAAAAATGATAGCAACCCATTGCTACCCCTTTCAAAGAAGGCC
CCAAAGATCCTTGTTGCTGGCACTCACGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATAGCATGGCAAGGGTTCAGTGGCAACAATGGTACAAGGGGAACTAG
CATCCTCGCTGCCATTAAATCGACAGTCGATCCAAGCACAGAGGTTGTATTCCGTGAGGATCCTGATGGTGACTTTGTTAAGTCCAATGACTTCTCATATGCCATAGTTG
TTATTGGTGAAGCCCCATATGCTGAGACTGGAGGGGNATATTGTATCAAAGAGTAA
Protein sequenceShow/hide protein sequence
MAKIFVQVVAILCLGWLWWVTMVDAENLKYKDPKRPVAVRVKDLLGRMTLQEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLFDARAEDWVNMINDFQKGSLSSR
LGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRQVFFSLNPDLARRIGAATALEVRATGISYTFAPCIAVCRDPRWGRCYESYSEDPKVVKEMTEIIIGLQGEPPADYR
KGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVISRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANHELITGFLKGALKFKGFVISDWEGLDRITSTPHSN
YTYSIQAAILAGIDMVMIPYKYAEFINDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELGSQAHRELARDAVRQSLVLLKNGKNDSNPLLPLSKKA
PKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFREDPDGDFVKSNDFSYAIVVIGEAPYAETGGXYCIKE