; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013897 (gene) of Snake gourd v1 genome

Gene IDTan0013897
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGlycosyl hydrolase family protein
Genome locationLG04:6370037..6377502
RNA-Seq ExpressionTan0013897
SyntenyTan0013897
Gene Ontology termsGO:0009251 - glucan catabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domainsIPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR002772 - Glycoside hydrolase family 3 C-terminal domain
IPR017853 - Glycoside hydrolase superfamily
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012970.1 hypothetical protein SDJN02_25724 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0095.14Show/hide
Query:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKIFVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDP++VQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT
        IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGVKMHANRELIT FLK T
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT

Query:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDLT LVK+NV+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN
         HRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
         FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ
        DQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+ATIAIC+LQ
Subjt:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ

XP_022945501.1 uncharacterized protein LOC111449719 isoform X1 [Cucurbita moschata]0.0e+0093.48Show/hide
Query:  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGS
        LP+ +  C   S GKKMAK FVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGS
Subjt:  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGS

Query:  VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC
        VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC
Subjt:  VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC

Query:  YESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV
        YESYSEDP++VQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGV
Subjt:  YESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV

Query:  KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLF
        KMHANRELIT FLK TLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDLT LVK+NV+ MDRIDDAV RILSVKFTMGLF
Subjt:  KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLF

Query:  ESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPST
        ESPLGDYSLVNELGSQ HRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGT+IL AIKSTVDPST
Subjt:  ESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPST

Query:  EVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG
        EVVFREDPDSDFVKSN FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG
Subjt:  EVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG

Query:  DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ
        DHGFSGKLPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+ATIAIC+LQ
Subjt:  DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ

XP_022968400.1 uncharacterized protein LOC111467651 isoform X2 [Cucurbita maxima]0.0e+0093.33Show/hide
Query:  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGS
        LP+ +  C L S GKKMAKIFVQ+ VILCLGWWWWAIMV AENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGS
Subjt:  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGS

Query:  VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC
        VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC
Subjt:  VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC

Query:  YESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV
        YESYSEDP++VQNMTEIIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGV
Subjt:  YESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV

Query:  KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLF
        KMHANR+LIT FLKGTLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDL  LVK+NV+PMDRIDDAV RILSVKFTMGLF
Subjt:  KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLF

Query:  ESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPST
        ESPLGDYSLVNELGSQ HRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTVDPST
Subjt:  ESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPST

Query:  EVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG
        EVVFREDPDSDFVKSN FSYAIVVIGEAPYAET GDSTTLTMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPY+SSMDALVAAWLPGTEGLGVTDALYG
Subjt:  EVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG

Query:  DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ
        DHGFSGKLPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG +GTPSFIAMI+ATIA+C+LQ
Subjt:  DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ

XP_023541708.1 uncharacterized protein LOC111801784 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0094.16Show/hide
Query:  CELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDAR
        C   S GKKMAKIFVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDAR
Subjt:  CELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDAR

Query:  AQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSED
        AQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSED
Subjt:  AQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSED

Query:  PRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRE
        P++VQNMTEIIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL +HMPAYLDSIIKGVSSVM SYSSWNGVKMHANRE
Subjt:  PRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRE

Query:  LITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDY
        LIT FLKGTLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKY EFIDDLT LVK+NV+ MDRIDDAV RILSVKFTMGLFESPLGDY
Subjt:  LITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDY

Query:  SLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFRED
        SLVNELGSQ HRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV+GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTVDPSTEVVFRED
Subjt:  SLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFRED

Query:  PDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGK
        PDSDFVKSN FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGK
Subjt:  PDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGK

Query:  LPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ
        LPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+ATIAIC+LQ
Subjt:  LPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ

XP_023541709.1 uncharacterized protein LOC111801784 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0094.69Show/hide
Query:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKIFVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDP++VQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT
        IIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL +HMPAYLDSIIKGVSSVM SYSSWNGVKMHANRELIT FLKGT
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT

Query:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKY EFIDDLT LVK+NV+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN
         HRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV+GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
         FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ
        DQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+ATIAIC+LQ
Subjt:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ

TrEMBL top hitse value%identityAlignment
A0A1S3BGE4 beta-glucosidase BoGH3B isoform X10.0e+0091.45Show/hide
Query:  CELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDAR
        C L ++ KKMAKIFVQ+ VILCLGW WWA MV AENLKYKDPKQPV VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGS+LSGGGSVPLPDAR
Subjt:  CELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDAR

Query:  AQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSED
        A+DWVDMINDFQKGSLSSRLGIPM YGIDAVHGHNNVYNATVFPHNVGLGATRNPDL RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSED
Subjt:  AQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSED

Query:  PRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRE
        P+IV+ MTEIIIGLQGEPPAN+RKG PYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRE
Subjt:  PRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRE

Query:  LITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDY
        LIT FLKG LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAILAGIDMVMIPYKYAEFIDDL FLVKSNVIPMDRIDDAVGRIL+VKFTMGLFESP+ DY
Subjt:  LITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDY

Query:  SLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFRED
        SLVNELGSQ HRDLARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNN TRGT+ILAAIKSTVDPSTEVVFRED
Subjt:  SLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFRED

Query:  PDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGK
        PDSDFVKSNDFSYAIVVIGEAPYAET GDSTTLTMLDPGP+I+KNVCD V+CVV++ISGRPIV+EPY+SS+DALVAAWLPGTEG GVTDALYGDHGFSGK
Subjt:  PDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGK

Query:  LPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICIL
        LPRTWFKSVDQLPMNVGD HYDPLFP GFGLTTGSVKD++ARSTSAGI+GTPS IA I+  I +CIL
Subjt:  LPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICIL

A0A6J1G118 uncharacterized protein LOC111449719 isoform X10.0e+0093.48Show/hide
Query:  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGS
        LP+ +  C   S GKKMAK FVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGS
Subjt:  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGS

Query:  VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC
        VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC
Subjt:  VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC

Query:  YESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV
        YESYSEDP++VQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGV
Subjt:  YESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV

Query:  KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLF
        KMHANRELIT FLK TLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDLT LVK+NV+ MDRIDDAV RILSVKFTMGLF
Subjt:  KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLF

Query:  ESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPST
        ESPLGDYSLVNELGSQ HRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGT+IL AIKSTVDPST
Subjt:  ESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPST

Query:  EVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG
        EVVFREDPDSDFVKSN FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG
Subjt:  EVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG

Query:  DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ
        DHGFSGKLPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+ATIAIC+LQ
Subjt:  DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ

A0A6J1G143 uncharacterized protein LOC111449719 isoform X20.0e+0094.69Show/hide
Query:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAK FVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDP++VQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT
        IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGVKMHANRELIT FLK T
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT

Query:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDLT LVK+NV+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN
         HRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGT+IL AIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
         FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ
        DQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+ATIAIC+LQ
Subjt:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ

A0A6J1HX36 uncharacterized protein LOC111467651 isoform X20.0e+0093.33Show/hide
Query:  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGS
        LP+ +  C L S GKKMAKIFVQ+ VILCLGWWWWAIMV AENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGS
Subjt:  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGS

Query:  VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC
        VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC
Subjt:  VPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRC

Query:  YESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV
        YESYSEDP++VQNMTEIIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGV
Subjt:  YESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV

Query:  KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLF
        KMHANR+LIT FLKGTLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDL  LVK+NV+PMDRIDDAV RILSVKFTMGLF
Subjt:  KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLF

Query:  ESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPST
        ESPLGDYSLVNELGSQ HRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTVDPST
Subjt:  ESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPST

Query:  EVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG
        EVVFREDPDSDFVKSN FSYAIVVIGEAPYAET GDSTTLTMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPY+SSMDALVAAWLPGTEGLGVTDALYG
Subjt:  EVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG

Query:  DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ
        DHGFSGKLPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG +GTPSFIAMI+ATIA+C+LQ
Subjt:  DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ

A0A6J1HZJ2 uncharacterized protein LOC111467651 isoform X10.0e+0094.39Show/hide
Query:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKIFVQ+ VILCLGWWWWAIMV AENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDP++VQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT
        IIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGVKMHANR+LIT FLKGT
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT

Query:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDL  LVK+NV+PMDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN
         HRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
         FSYAIVVIGEAPYAET GDSTTLTMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPY+SSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ
        DQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG +GTPSFIAMI+ATIA+C+LQ
Subjt:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ

SwissProt top hitse value%identityAlignment
A7LXU3 Beta-glucosidase BoGH3B4.3e-8232.62Show/hide
Query:  PKQP-VAVRVKDLLGRMTLEEKIGQMVQIDRSVAN-----------------ATVMKDYFIGSVLSGGGSVPLPDARAQD-WVDMINDFQKGSLSSRLGI
        P  P +   +++ L +MTLE+KIGQM +I   V +                  TV+  Y +GS+L    +VPL  A+ ++ W + I   Q+ S+   +GI
Subjt:  PKQP-VAVRVKDLLGRMTLEEKIGQMVQIDRSVAN-----------------ATVMKDYFIGSVLSGGGSVPLPDARAQD-WVDMINDFQKGSLSSRLGI

Query:  PMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNM-TEIIIGLQGEPPAN
        P IYG+D +HG     + T+FP  + +GAT N +L RR    +A E +A  I +TFAP + + RDPRW R +E+Y ED  +   M    + G QGE P  
Subjt:  PMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNM-TEIIIGLQGEPPAN

Query:  FRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWE
                 G   V AC KH++G G    G +   + I R  +   H   +L ++ +G  SVM +    NG+  HANREL+T +LK  L + G +++DW 
Subjt:  FRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWE

Query:  GLDRLTSTPHSNYT--YSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAV
         ++ L +  H   T   +VK  I AGIDM M+PY+   F D L  LV+   + M+RIDDAV R+L +K+ +GLF+ P  D    ++ GS+E   +A  A 
Subjt:  GLDRLTSTPHSNYT--YSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAV

Query:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNN-ATRGTSILAAI-----KSTVDPSTEVVFREDPDSDFVKSN----
         +S VLLKN  N    +LP++ K  KIL+ G +A+++    GGW+ +WQG   +  A    +I  A+     K  +     V +    + ++ + N    
Subjt:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNN-ATRGTSILAAI-----KSTVDPSTEVVFREDPDSDFVKSN----

Query:  --------DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVIS-GRPIVMEPYVSSMDALVAAWLPGT-EGLGVTDALYGDHGFSG
                     I  IGE  Y ET G+ T LT+ +   ++VK +  + K +V+V++ GRP ++   V    A+V   LP    G  + + L GD  FSG
Subjt:  --------DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVIS-GRPIVMEPYVSSMDALVAAWLPGT-EGLGVTDALYGDHGFSG

Query:  KLPRTW-----------FKSVDQLPMNVGDRHYDPL----FPLGFGLTTGSVK
        K+P T+           +K  + +    G+ +YD +    +P GFGL+  + K
Subjt:  KLPRTW-----------FKSVDQLPMNVGDRHYDPL----FPLGFGLTTGSVK

P33363 Periplasmic beta-glucosidase3.0e-5928.82Show/hide
Query:  AENLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIY
        A++L    P  P A    V +LL +MT++EKIGQ+  I     N       ++KD  +G++ +   +V   D RA    D + +       SRL IP+ +
Subjt:  AENLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIY

Query:  GIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIII-GLQGEPPANFRKG
          D +HG       TVFP ++GL ++ N D V+ +G  +A E    G++ T+AP + V RDPRWGR  E + ED  +   M + ++  +QG+ PA+    
Subjt:  GIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIII-GLQGEPPANFRKG

Query:  IPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDR
                 V+   KHF   G    G   N   +    L + +MP Y   +  G  +VM + +S NG    ++  L+   L+    FKG  +SD   +  
Subjt:  IPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDR

Query:  LTS-TPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS------LVNELGSQEHRDLARDA
        L      ++   +V+ A+ +GI+M M    Y+++   L  L+KS  + M  +DDA   +L+VK+ MGLF  P           +     S+ HR  AR+ 
Subjt:  LTS-TPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS------LVNELGSQEHRDLARDA

Query:  VRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVF---------------------
         R+SLVLLKN        LPL KK+  I V G  AD+     G W+ A        A +  ++L  IK+ V  + +V++                     
Subjt:  VRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVF---------------------

Query:  --REDP-------DSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKC-----VVVVISGRPIVMEPYVSSMDALVAAWLPGTE
          + DP       D     +      + V+GEA     E  S T   +   P   +++  ++K      V+V+++GRP+ +       DA++  W  GTE
Subjt:  --REDP-------DSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKC-----VVVVISGRPIVMEPYVSSMDALVAAWLPGTE

Query:  -GLGVTDALYGDHGFSGKLPRTWFKSVDQLP-----MNVG------------DRHYD----PLFPLGFGL--TTGSVKDV
         G  + D L+GD+  SGKLP ++ +SV Q+P     +N G             R++D     L+P G+GL  TT +V DV
Subjt:  -GLGVTDALYGDHGFSGKLPRTWFKSVDQLP-----MNVG------------DRHYD----PLFPLGFGL--TTGSVKDV

Q23892 Lysosomal beta glucosidase1.4e-7230.97Show/hide
Query:  VKDLLGRMTLEEKIGQMVQIDRS--------VANATVM----KDYFIGSVL----SGGGSVPLPDARAQDWVDMINDFQKGSL-SSRLGIPMIYGIDAVH
        V +L+ +M++ EKIGQM Q+D +          N T +    K Y+IGS L    SGG +  +    +  W+DMIN  Q   +  S   IPMIYG+D+VH
Subjt:  VKDLLGRMTLEEKIGQMVQIDRS--------VANATVM----KDYFIGSVL----SGGGSVPLPDARAQDWVDMINDFQKGSL-SSRLGIPMIYGIDAVH

Query:  GHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNM-TEIIIGLQG-----EPPANFRKGI
        G N V+ AT+FPHN GL AT N +        T+ +  A GI + FAP L +   P W R YE++ EDP +   M    + G QG     + P N     
Subjt:  GHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNM-TEIIIGLQG-----EPPANFRKGI

Query:  PYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSII-KGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDR
                 +  AKH+ G    T G +     I    L    +P++ ++I   G  ++M +    NGV MH + + +T  L+G L+F+G  ++DW+ +++
Subjt:  PYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSII-KGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDR

Query:  LTSTPHS--NYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPL--GDYSLVNELGSQEHRDLARDAVRQ
        L    H+  +   ++  A+ AGIDM M+P   + F   L  +V +  +P  R+D +V RIL++K+ +GLF +P    + ++V+ +G  + R+ A     +
Subjt:  LTSTPHS--NYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPL--GDYSLVNELGSQEHRDLARDAVRQ

Query:  SLVLLKNGKNDSDPLLPLSKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNATRGTSILAAIKSTVDPSTEVVFRE---------------DPDSD
        S+ LL+N  N    +LPL+    K +L+ G  AD++    GGW++ WQG +  +    GTSIL  ++   + + +   +                D   +
Subjt:  SLVLLKNGKNDSDPLLPLSKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNATRGTSILAAIKSTVDPSTEVVFRE---------------DPDSD

Query:  FVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVV-VVISGRPIVMEP-YVSSMDALVAAWLPGTE-GLGVTDALYGDHGFSGKL
          +S+D    +VVIGE P AET GD   L+M      +++ + D+ K VV +++  RP ++ P  V S  A++ A+LPG+E G  + + L G+   SG+L
Subjt:  FVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVV-VVISGRPIVMEP-YVSSMDALVAAWLPGTE-GLGVTDALYGDHGFSGKL

Query:  PRTWFKSVDQLPMNVGDRHYD-----PLFPLGFGLT
        P T+  +   + +    ++ +     PLF  G GL+
Subjt:  PRTWFKSVDQLPMNVGDRHYD-----PLFPLGFGLT

Q56078 Periplasmic beta-glucosidase2.2e-6229.64Show/hide
Query:  AENLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLPDAR-AQDWVDMINDFQKGSLSSRLGIPMI
        AENL    P  P A    V DLL +MT++EKIGQ+  I     N       ++KD  +G++ +   +V   D R  QD V  +         SRL IP+ 
Subjt:  AENLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKDYFIGSVLSGGGSVPLPDAR-AQDWVDMINDFQKGSLSSRLGIPMI

Query:  YGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIII-GLQGEPPANFRK
        +  D VHG       TVFP ++GL ++ N D VR +G  +A E    G++ T+AP + V RDPRWGR  E + ED  +   M E ++  +QG+ PA+   
Subjt:  YGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIII-GLQGEPPANFRK

Query:  GIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLD
                  V+   KHF   G    G   N   +    L + +MP Y   +  G  +VM + +S NG    ++  L+   L+    FKG  +SD   + 
Subjt:  GIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLD

Query:  RLTS-TPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS------LVNELGSQEHRDLARD
         L      ++   +V+ A+ AG+DM M    Y+++   L  L+KS  + M  +DDA   +L+VK+ MGLF  P           +     S+ HR  AR+
Subjt:  RLTS-TPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS------LVNELGSQEHRDLARD

Query:  AVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFRE------------------
          R+S+VLLKN        LPL KK+  I V G  AD+     G W+ A        A +  ++LA I++ V    ++++ +                  
Subjt:  AVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFRE------------------

Query:  -----DP-------DSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKC-----VVVVISGRPIVMEPYVSSMDALVAAWLPGT
             DP       D     +      + V+GE+     E  S T   +   P   +++  ++K      V+V+++GRP+ +       DA++  W  GT
Subjt:  -----DP-------DSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKC-----VVVVISGRPIVMEPYVSSMDALVAAWLPGT

Query:  E-GLGVTDALYGDHGFSGKLPRTWFKSVDQLP-----MNVG------------DRHYD----PLFPLGFGL--TTGSVKDVVARS
        E G  + D L+GD+  SGKLP ++ +SV Q+P     +N G             R++D    PL+P G+GL  TT +V DV   S
Subjt:  E-GLGVTDALYGDHGFSGKLPRTWFKSVDQLP-----MNVG------------DRHYD----PLFPLGFGL--TTGSVKDVVARS

T2KMH0 Beta-xylosidase9.3e-5330.43Show/hide
Query:  QKGSLSSRLGIPMIYGIDAVHGHNNVY----NATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAV-CRDPRWGRCYESYSEDPRIVQN
        Q    + RLGIP +   +A+HG   V     N TV+P  V   +T  P+L++++ + TA E RA G+++ ++P L V   D R+GR  ESY EDP +V  
Subjt:  QKGSLSSRLGIPMIYGIDAVHGHNNVY----NATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAV-CRDPRWGRCYESYSEDPRIVQN

Query:  M-TEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIK-GVSSVMASYSSWNGVKMHANRELITG
        M    I GLQG     F +          VIA AKHFVG      GIN   + +    L  +++P +  ++ + GV SVM  +  +NGV  H N  L+  
Subjt:  M-TEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIK-GVSSVMASYSSWNGVKMHANRELITG

Query:  FLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAIL---AGIDMVMIPYKYAEFIDDLTFLVKSNVIP----MDRIDDAVGRILSVKFTMGLFES-P
         L+  L F GF++SD   + RL  T H       +AAIL   AG+DM ++  K  E     T ++K  ++     M  ID A  RIL+ K+ +GLF++ P
Subjt:  FLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAIL---AGIDMVMIPYKYAEFIDDLTFLVKSNVIP----MDRIDDAVGRILSVKFTMGLFES-P

Query:  LGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLS-KKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEV
            +   E G+ EHR+ A +   +S+++LKN  N    LLPL   K   + V G +A     + G + +   G+SG       S+L  +K  V    ++
Subjt:  LGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLS-KKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEV

Query:  VFREDPDSDFVKSNDFSYAI----------VVIGEAPYAETE-GDSTTLTMLDPGPSIVKNVCDSVKCVVVV-ISGRPIVMEPYVSSMDALVAAWLPGTE
         + +  D D      F  AI          +V+G +     E GD   L +      +V+ +  + K V+VV I+GRP+ +     ++ +++  W  G  
Subjt:  VFREDPDSDFVKSNDFSYAI----------VVIGEAPYAETE-GDSTTLTMLDPGPSIVKNVCDSVKCVVVV-ISGRPIVMEPYVSSMDALVAAWLPGTE

Query:  -GLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR--------------HYDPLFPLGFGLTTGSVKDVVAR--STSAGIKGT
         G  V + ++GD    GKL  ++ + V Q+P+   +R                 PLFP GFGL+  + K    +  +TS    GT
Subjt:  -GLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR--------------HYDPLFPLGFGLTTGSVKDVVAR--STSAGIKGT

Arabidopsis top hitse value%identityAlignment
AT3G47000.1 Glycosyl hydrolase family protein5.9e-21258.78Show/hide
Query:  IMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID
        ++V   +  YK+   PV  RVKDLL RMTL EKIGQM QI+R VA+ +   D+FIGSVL+ GGSVP  DA++ DW DMI+ FQ+ +L+SRLGIP+IYG D
Subjt:  IMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID

Query:  AVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYV
        AVHG+NNVY ATVFPHN+GLGATR+ DLVRRIGAATALEVRA+G+ + F+PC+AV RDPRWGRCYESY EDP +V  MT ++ GLQG PP     G P+V
Subjt:  AVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYV

Query:  GGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDRLTST
         G   V+AC KHFVGDGGT  GINE NT+     L  IH+P YL  + +GVS+VMASYSSWNG ++HA+R L+T  LK  L FKGF++SDWEGLDRL+  
Subjt:  GGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDRLTST

Query:  PHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNG
          SNY Y +K A+ AGIDMVM+P+KY +FI D+T LV+S  IPM RI+DAV RIL VKF  GLF  PL D SL+  +G +EHR+LA++AVR+SLVLLK+G
Subjt:  PHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNG

Query:  KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSND-FSYAIVVIGEAPYAETEG
        KN   P LPL + A +ILV GTHAD+LGYQCGGWT  W G SG   T GT++L AIK  V   TEV++ + P  + + S++ FSYAIV +GE PYAET G
Subjt:  KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSND-FSYAIVVIGEAPYAETEG

Query:  DSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYV-SSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPL
        D++ L +   G  IV  V + +  +V++ISGRP+V+EP V    +ALVAAWLPGTEG GV D ++GD+ F GKLP +WFK V+ LP++     YDPLFP 
Subjt:  DSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYV-SSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPL

Query:  GFGLTTGSV
        GFGL +  V
Subjt:  GFGLTTGSV

AT5G04885.1 Glycosyl hydrolase family protein2.7e-28970.93Show/hide
Query:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN
        M++  V++  +L     W       E L YKDPKQ V+ RV DL GRMTLEEKIGQMVQIDRSVA   +M+DYFIGSVLSGGGS PLP+A AQ+WVDMIN
Subjt:  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE
        ++QKG+L SRLGIPMIYGIDAVHGHNNVYNAT+FPHNVGLGATR+PDLV+RIGAATA+EVRATGI YTFAPC+AVCRDPRWGRCYESYSED ++V++MT+
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT
        +I+GLQGEPP+N++ G+P+VGG  KV ACAKH+VGDGGTT G+NENNTV D HGLLS+HMPAY D++ KGVS+VM SYSSWNG KMHAN ELITG+LKGT
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT

Query:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDW+G+D++++ PH++YT SV+AAI AGIDMVM+P+ + EF++DLT LVK+N IP+ RIDDAV RIL VKFTMGLFE+PL DYS  +ELGSQ
Subjt:  LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN
         HRDLAR+AVR+SLVLLKNG N ++P+LPL +K  KILVAGTHADNLGYQCGGWTI WQGFSGN  TRGT++L+A+KS VD STEVVFRE+PD++F+KSN
Subjt:  EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        +F+YAI+ +GE PYAET GDS  LTMLDPGP+I+ + C +VKCVVVVISGRP+VMEPYV+S+DALVAAWLPGTEG G+TDAL+GDHGFSGKLP TWF++ 
Subjt:  DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICI
        +QLPM+ GD HYDPLF  G GL T SV  +VARSTSA    T   +  ++ +  +C+
Subjt:  DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICI

AT5G20940.1 Glycosyl hydrolase family protein2.3e-24868.39Show/hide
Query:  NLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHN
        N KYKDPK+P+ VR+K+L+  MTLEEKIGQMVQ++R  A   VM+ YF+GSV SGGGSVP P    + WV+M+N+ QK +LS+RLGIP+IYGIDAVHGHN
Subjt:  NLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHN

Query:  NVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKV
         VYNAT+FPHNVGLG TR+P LV+RIG ATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSED +IVQ MTEII GLQG+ P   +KG+P+V G  KV
Subjt:  NVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKV

Query:  IACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYT
         ACAKHFVGDGGT  G+N NNTVI+ +GLL IHMPAY D++ KGV++VM SYSS NG+KMHAN++LITGFLK  LKF+G VISD+ G+D++ +   +NY+
Subjt:  IACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYT

Query:  YSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDP
        +SV AA  AG+DM M      + ID+LT  VK   IPM RIDDAV RIL VKFTMGLFE+P+ D+SL  +LGS+EHR+LAR+AVR+SLVLLKNG+N   P
Subjt:  YSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDP

Query:  LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTM
        LLPL KKA KILVAGTHADNLGYQCGGWTI WQG +GNN T GT+ILAA+K TVDP T+V++ ++PD++FVK+ DF YAIV +GE PYAE  GDST LT+
Subjt:  LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTM

Query:  LDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTT
         +PGPS + NVC SVKCVVVV+SGRP+VM+  +S++DALVAAWLPGTEG GV D L+GD+GF+GKL RTWFK+VDQLPMNVGD HYDPL+P GFGL T
Subjt:  LDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTT

AT5G20950.1 Glycosyl hydrolase family protein8.9e-26970.23Show/hide
Query:  ILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR
        +LCL      +      LKYKDPKQP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VMK YFIGSVLSGGGSVP   A  + WV+M+N+ QK SLS+R
Subjt:  ILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR

Query:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPP
        LGIPMIYGIDAVHGHNNVY AT+FPHNVGLG TR+P+LV+RIGAATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSED RIVQ MTEII GLQG+ P
Subjt:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPP

Query:  ANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISD
           RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVID  GL  IHMP Y +++ KGV+++M SYS+WNG++MHAN+EL+TGFLK  LKF+GFVISD
Subjt:  ANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISD

Query:  WEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAV
        W+G+DR+T+ PH NY+YSV A I AGIDM+M+PY Y EFID+++  ++  +IP+ RIDDA+ RIL VKFTMGLFE PL D S  N+LGS+EHR+LAR+AV
Subjt:  WEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAV

Query:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIG
        R+SLVLLKNGK  + PLLPL KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P+T+VV+ ++PD++FVKS  F YAIVV+G
Subjt:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIG

Query:  EAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR
        E PYAE  GD+T LT+ DPGPSI+ NVC SVKCVVVV+SGRP+V++PYVS++DALVAAWLPGTEG GV DAL+GD+GF+GKL RTWFKSV QLPMNVGDR
Subjt:  EAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR

Query:  HYDPLFPLGFGLTTGSVK
        HYDPL+P GFGLTT   K
Subjt:  HYDPLFPLGFGLTTGSVK

AT5G20950.2 Glycosyl hydrolase family protein8.9e-26970.23Show/hide
Query:  ILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR
        +LCL      +      LKYKDPKQP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VMK YFIGSVLSGGGSVP   A  + WV+M+N+ QK SLS+R
Subjt:  ILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR

Query:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPP
        LGIPMIYGIDAVHGHNNVY AT+FPHNVGLG TR+P+LV+RIGAATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSED RIVQ MTEII GLQG+ P
Subjt:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPP

Query:  ANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISD
           RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVID  GL  IHMP Y +++ KGV+++M SYS+WNG++MHAN+EL+TGFLK  LKF+GFVISD
Subjt:  ANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISD

Query:  WEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAV
        W+G+DR+T+ PH NY+YSV A I AGIDM+M+PY Y EFID+++  ++  +IP+ RIDDA+ RIL VKFTMGLFE PL D S  N+LGS+EHR+LAR+AV
Subjt:  WEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAV

Query:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIG
        R+SLVLLKNGK  + PLLPL KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P+T+VV+ ++PD++FVKS  F YAIVV+G
Subjt:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIG

Query:  EAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR
        E PYAE  GD+T LT+ DPGPSI+ NVC SVKCVVVV+SGRP+V++PYVS++DALVAAWLPGTEG GV DAL+GD+GF+GKL RTWFKSV QLPMNVGDR
Subjt:  EAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR

Query:  HYDPLFPLGFGLTTGSVK
        HYDPL+P GFGLTT   K
Subjt:  HYDPLFPLGFGLTTGSVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTTATTTTTAAAGGTACGTGGCCAAATAAGAACAAATCTAAAAATTCAAATATTAAAATTATAATTTTACCTTTTCTTTTCTATTTGTGTGAGTTGGGCTCTCG
AGGAAAGAAGATGGCGAAGATTTTTGTTCAGCTGTTTGTGATTTTGTGCTTGGGTTGGTGGTGGTGGGCAATAATGGTGGGCGCGGAGAACTTGAAGTACAAAGACCCTA
AGCAACCGGTTGCTGTGCGAGTTAAGGACCTTCTTGGCCGAATGACGCTGGAAGAGAAAATTGGTCAGATGGTTCAGATTGACAGAAGCGTTGCCAATGCTACAGTTATG
AAAGATTATTTTATTGGAAGTGTGCTAAGTGGTGGTGGAAGTGTGCCGCTTCCAGATGCTCGTGCTCAAGATTGGGTTGACATGATAAATGATTTCCAGAAGGGTTCTCT
TTCTAGTCGATTGGGTATCCCAATGATTTATGGCATTGATGCTGTTCATGGCCATAACAACGTTTACAATGCTACTGTATTCCCTCATAATGTTGGACTTGGAGCTACCA
GAAACCCTGACCTGGTTCGAAGGATTGGTGCAGCAACTGCACTTGAAGTTAGAGCTACAGGGATTTCTTATACATTTGCTCCATGCCTTGCGGTTTGTAGGGACCCGAGG
TGGGGTCGATGTTATGAAAGCTACAGTGAGGATCCAAGAATTGTGCAAAATATGACTGAGATTATAATTGGTCTGCAAGGAGAGCCTCCTGCAAATTTTCGGAAGGGGAT
TCCATATGTTGGTGGAACTAAAAAGGTTATCGCCTGTGCGAAGCACTTTGTTGGAGATGGTGGGACAACTCATGGCATCAATGAGAATAATACCGTTATTGACAGGCATG
GACTGCTCAGCATTCACATGCCTGCTTATTTAGATTCGATCATCAAGGGTGTTTCATCGGTAATGGCTTCCTATTCTAGTTGGAATGGAGTAAAGATGCACGCAAACCGT
GAGCTGATTACTGGTTTCCTCAAGGGTACCCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGATTGACTTCTACACCACATTCTAATTACACGTA
CTCTGTCAAAGCTGCAATTTTAGCTGGCATTGACATGGTCATGATTCCTTACAAGTATGCAGAGTTCATTGATGATCTTACGTTTCTAGTGAAGAGCAATGTCATCCCAA
TGGATCGTATTGATGATGCTGTTGGGAGAATTTTGTCTGTCAAGTTCACAATGGGTCTTTTTGAAAGCCCTTTGGGTGATTACAGCCTTGTCAATGAGCTTGGGAGCCAG
GAACATAGAGACTTGGCAAGAGATGCTGTGAGGCAGTCACTCGTACTGCTGAAGAATGGGAAAAATGACAGCGATCCGTTGCTACCCCTTTCAAAGAAGGCCCCAAAGAT
CCTTGTTGCTGGCACTCACGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGCATGGCAAGGATTCAGTGGCAACAATGCTACAAGGGGAACTAGCATCCTCG
CTGCCATCAAATCAACAGTTGATCCAAGCACAGAGGTAGTATTCCGTGAGGATCCTGACAGTGATTTTGTTAAGTCCAATGACTTCTCATACGCCATTGTTGTTATTGGG
GAAGCCCCATATGCCGAGACTGAAGGGGATAGTACAACACTTACCATGTTGGATCCTGGTCCAAGCATCGTAAAAAATGTTTGTGATTCTGTAAAGTGTGTGGTGGTTGT
CATTTCTGGAAGGCCAATTGTGATGGAACCATATGTTTCATCAATGGATGCTCTTGTAGCAGCTTGGTTACCTGGTACTGAAGGCCTAGGAGTCACTGATGCCCTTTATG
GAGACCATGGTTTTAGTGGGAAGCTTCCAAGAACTTGGTTTAAATCTGTAGATCAACTGCCAATGAACGTTGGAGATCGACACTATGATCCACTTTTCCCGCTTGGTTTC
GGACTCACAACTGGATCCGTTAAGGACGTTGTCGCGAGGTCAACATCGGCGGGAATTAAAGGAACACCATCCTTCATTGCAATGATCATTGCTACAATTGCCATTTGTAT
ATTGCAGGAGTTTCTTCCCATTCAATTTCATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCTTATTTTTAAAGGTACGTGGCCAAATAAGAACAAATCTAAAAATTCAAATATTAAAATTATAATTTTACCTTTTCTTTTCTATTTGTGTGAGTTGGGCTCTCG
AGGAAAGAAGATGGCGAAGATTTTTGTTCAGCTGTTTGTGATTTTGTGCTTGGGTTGGTGGTGGTGGGCAATAATGGTGGGCGCGGAGAACTTGAAGTACAAAGACCCTA
AGCAACCGGTTGCTGTGCGAGTTAAGGACCTTCTTGGCCGAATGACGCTGGAAGAGAAAATTGGTCAGATGGTTCAGATTGACAGAAGCGTTGCCAATGCTACAGTTATG
AAAGATTATTTTATTGGAAGTGTGCTAAGTGGTGGTGGAAGTGTGCCGCTTCCAGATGCTCGTGCTCAAGATTGGGTTGACATGATAAATGATTTCCAGAAGGGTTCTCT
TTCTAGTCGATTGGGTATCCCAATGATTTATGGCATTGATGCTGTTCATGGCCATAACAACGTTTACAATGCTACTGTATTCCCTCATAATGTTGGACTTGGAGCTACCA
GAAACCCTGACCTGGTTCGAAGGATTGGTGCAGCAACTGCACTTGAAGTTAGAGCTACAGGGATTTCTTATACATTTGCTCCATGCCTTGCGGTTTGTAGGGACCCGAGG
TGGGGTCGATGTTATGAAAGCTACAGTGAGGATCCAAGAATTGTGCAAAATATGACTGAGATTATAATTGGTCTGCAAGGAGAGCCTCCTGCAAATTTTCGGAAGGGGAT
TCCATATGTTGGTGGAACTAAAAAGGTTATCGCCTGTGCGAAGCACTTTGTTGGAGATGGTGGGACAACTCATGGCATCAATGAGAATAATACCGTTATTGACAGGCATG
GACTGCTCAGCATTCACATGCCTGCTTATTTAGATTCGATCATCAAGGGTGTTTCATCGGTAATGGCTTCCTATTCTAGTTGGAATGGAGTAAAGATGCACGCAAACCGT
GAGCTGATTACTGGTTTCCTCAAGGGTACCCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGATTGACTTCTACACCACATTCTAATTACACGTA
CTCTGTCAAAGCTGCAATTTTAGCTGGCATTGACATGGTCATGATTCCTTACAAGTATGCAGAGTTCATTGATGATCTTACGTTTCTAGTGAAGAGCAATGTCATCCCAA
TGGATCGTATTGATGATGCTGTTGGGAGAATTTTGTCTGTCAAGTTCACAATGGGTCTTTTTGAAAGCCCTTTGGGTGATTACAGCCTTGTCAATGAGCTTGGGAGCCAG
GAACATAGAGACTTGGCAAGAGATGCTGTGAGGCAGTCACTCGTACTGCTGAAGAATGGGAAAAATGACAGCGATCCGTTGCTACCCCTTTCAAAGAAGGCCCCAAAGAT
CCTTGTTGCTGGCACTCACGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGCATGGCAAGGATTCAGTGGCAACAATGCTACAAGGGGAACTAGCATCCTCG
CTGCCATCAAATCAACAGTTGATCCAAGCACAGAGGTAGTATTCCGTGAGGATCCTGACAGTGATTTTGTTAAGTCCAATGACTTCTCATACGCCATTGTTGTTATTGGG
GAAGCCCCATATGCCGAGACTGAAGGGGATAGTACAACACTTACCATGTTGGATCCTGGTCCAAGCATCGTAAAAAATGTTTGTGATTCTGTAAAGTGTGTGGTGGTTGT
CATTTCTGGAAGGCCAATTGTGATGGAACCATATGTTTCATCAATGGATGCTCTTGTAGCAGCTTGGTTACCTGGTACTGAAGGCCTAGGAGTCACTGATGCCCTTTATG
GAGACCATGGTTTTAGTGGGAAGCTTCCAAGAACTTGGTTTAAATCTGTAGATCAACTGCCAATGAACGTTGGAGATCGACACTATGATCCACTTTTCCCGCTTGGTTTC
GGACTCACAACTGGATCCGTTAAGGACGTTGTCGCGAGGTCAACATCGGCGGGAATTAAAGGAACACCATCCTTCATTGCAATGATCATTGCTACAATTGCCATTTGTAT
ATTGCAGGAGTTTCTTCCCATTCAATTTCATGAATGA
Protein sequenceShow/hide protein sequence
MDLIFKGTWPNKNKSKNSNIKIIILPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVM
KDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPR
WGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANR
ELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQ
EHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIG
EAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGF
GLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQEFLPIQFHE