; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg19012 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg19012
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionGlycosyl hydrolase family protein
Genome locationCarg_Chr18:9766553..9772078
RNA-Seq ExpressionCarg19012
SyntenyCarg19012
Gene Ontology termsGO:0009251 - glucan catabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domainsIPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR002772 - Glycoside hydrolase family 3 C-terminal domain
IPR017853 - Glycoside hydrolase superfamily
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012970.1 hypothetical protein SDJN02_25724 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
        DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF

XP_022945501.1 uncharacterized protein LOC111449719 isoform X1 [Cucurbita moschata]0.0e+0099.55Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAK FVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTIL AIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
        DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF

XP_022945502.1 uncharacterized protein LOC111449719 isoform X2 [Cucurbita moschata]0.0e+0099.55Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAK FVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTIL AIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
        DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF

XP_023541708.1 uncharacterized protein LOC111801784 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0099.24Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLG+HMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLK T
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKY EFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV+GTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
        DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF

XP_023541709.1 uncharacterized protein LOC111801784 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0099.24Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLG+HMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLK T
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKY EFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV+GTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
        DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF

TrEMBL top hitse value%identityAlignment
A0A1S3BGE4 beta-glucosidase BoGH3B isoform X10.0e+0090.88Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKIFVQVVVILCLGW WWA MV AENLKYKDPKQPV VRVKDLLGRMTLEEKIGQMVQIDRSVANATVMK+YFIGS+LSGGGSVPLPDARA+DWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPM YGIDAVHGHNNVYNATVFPHNVGLGATRNPDL RRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPK+V+ MTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPAN+RKG PYVGGTKKVIACAKHFVGDGGTTHGINENNTVI+RHGLL IHMPAYLDSIIKGVSSVM SYSSWNGVKMHANRELIT FLK  
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAI AGIDMVM+PYKYAEFIDDL  LVK+NV+ MDRIDDAV RIL+VKFTMGLFESP+ DYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNN TRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
         FSYAIVVIGEAPYAET GDSTTLTMLDPGP+IIKNVC+ V+CVV++ISGRPIV+EPY+SS+DALVAAWLPGTEG GVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVL
        DQLPMN GD HYDPLFP GFGLTTGSVKDI+ARSTSAG + TPS IA IV  I +C+L
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVL

A0A6J1G118 uncharacterized protein LOC111449719 isoform X10.0e+0099.55Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAK FVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTIL AIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
        DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF

A0A6J1G143 uncharacterized protein LOC111449719 isoform X20.0e+0099.55Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAK FVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTIL AIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
        DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF

A0A6J1HX36 uncharacterized protein LOC111467651 isoform X20.0e+0098.04Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKIFVQVVVILCLGWWWWAIMV AENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANR+LITRFLK T
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDL LLVKNNVV MDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        GFSYAIVVIGEAPYAET GDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPY+SSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
        DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAG + TPSFIAMIVATIA+CVLQVHF
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF

A0A6J1HZJ2 uncharacterized protein LOC111467651 isoform X10.0e+0098.04Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        MAKIFVQVVVILCLGWWWWAIMV AENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        IIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANR+LITRFLK T
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDL LLVKNNVV MDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
        GFSYAIVVIGEAPYAET GDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPY+SSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF
        DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAG + TPSFIAMIVATIA+CVLQVHF
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQVHF

SwissProt top hitse value%identityAlignment
A7LXU3 Beta-glucosidase BoGH3B4.7e-8332.92Show/hide
Query:  PKQP-VSVRVKDLLGRMTLEEKIGQMVQIDRSVAN-----------------ATVMKNYFIGSVLSGGGSVPLPDARAQD-WVDMINDFQKGSLSSRLGI
        P  P +   +++ L +MTLE+KIGQM +I   V +                  TV+  Y +GS+L    +VPL  A+ ++ W + I   Q+ S+   +GI
Subjt:  PKQP-VSVRVKDLLGRMTLEEKIGQMVQIDRSVAN-----------------ATVMKNYFIGSVLSGGGSVPLPDARAQD-WVDMINDFQKGSLSSRLGI

Query:  PMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNM-TEIIIGLQGEPPAN
        P IYG+D +HG     + T+FP  + +GAT N +L RR    +A E +A  I +TFAP + + RDPRW R +E+Y ED  +   M    + G QGE P  
Subjt:  PMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNM-TEIIIGLQGEPPAN

Query:  FRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWE
                 G   V AC KH++G G    G +   + I R  +   H   +L ++ +G  SVMV+    NG+  HANREL+T +LK  L + G +++DW 
Subjt:  FRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWE

Query:  GLDRITSTPHSNYT--YSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAV
         ++ + +  H   T   +V+  I+AGIDM MVPY+   F D L  LV+   VSM+RIDDAVAR+L +K+ +GLF+ P  D    ++ GS+    +A  A 
Subjt:  GLDRITSTPHSNYT--YSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAV

Query:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNN-ATRGTTILAAI-----KSTVDPSTEVVFREDPDSDFVKSN----
         +S VLLKN  N    +LP++ K  KIL+ G +A+++    GGW+ +WQG   +  A    TI  A+     K  +     V +    + ++ + N    
Subjt:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNN-ATRGTTILAAI-----KSTVDPSTEVVFREDPDSDFVKSN----

Query:  --------GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVIS-GRPIVMEPYVSSMDALVAAWLPGT-EGLGVTDALYGDHGFSG
                     I  IGE  Y ET G+ T LT+ +   +++K +  + K +V+V++ GRP ++   V    A+V   LP    G  + + L GD  FSG
Subjt:  --------GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVIS-GRPIVMEPYVSSMDALVAAWLPGT-EGLGVTDALYGDHGFSG

Query:  KLPRTW-----------FKSVDQLPMNFGDRHYDPL----FPLGFGLTTGSVK
        K+P T+           +K  + +    G+ +YD +    +P GFGL+  + K
Subjt:  KLPRTW-----------FKSVDQLPMNFGDRHYDPL----FPLGFGLTTGSVK

P33363 Periplasmic beta-glucosidase1.4e-5828.68Show/hide
Query:  VKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFP
        V +LL +MT++EKIGQ+  I     N       ++K+  +G++ +   +V   D RA    D + +       SRL IP+ +  D +HG       TVFP
Subjt:  VKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFP

Query:  HNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIII-GLQGEPPANFRKGIPYVGGTKKVIACAKHFV
         ++GL ++ N D ++ +G  +A E    G++ T+AP + V RDPRWGR  E + ED  L   M + ++  +QG+ PA+             V+   KHF 
Subjt:  HNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIII-GLQGEPPANFRKGIPYVGGTKKVIACAKHFV

Query:  GDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGL-DRITSTPHSNYTYSVQAAI
          G    G   N   +    L   +MP Y   +  G  +VMV+ +S NG    ++  L+   L+    FKG  +SD   + + I     ++   +V+ A+
Subjt:  GDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGL-DRITSTPHSNYTYSVQAAI

Query:  SAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYS------LVNELGSQAHRDLARDAVRQSLVLLKNGKNDSDPL
         +GI+M M    Y++++     L+K+  V+M  +DDA   +L+VK+ MGLF  P           +     S+ HR  AR+  R+SLVLLKN        
Subjt:  SAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYS------LVNELGSQAHRDLARDAVRQSLVLLKNGKNDSDPL

Query:  LPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVF-----------------------REDP-------DSDFV
        LPL KK+  I V G  AD+     G W+ A        A +  T+L  IK+ V  + +V++                       + DP       D    
Subjt:  LPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVF-----------------------REDP-------DSDFV

Query:  KSNGFSYAIVVIGEAPYAETEGDS-TTLTMLDPGPSIIKNVCESVK-CVVVVISGRPIVMEPYVSSMDALVAAWLPGTE-GLGVTDALYGDHGFSGKLPR
         +      + V+GEA     E  S T +T+      +I  +  + K  V+V+++GRP+ +       DA++  W  GTE G  + D L+GD+  SGKLP 
Subjt:  KSNGFSYAIVVIGEAPYAETEGDS-TTLTMLDPGPSIIKNVCESVK-CVVVVISGRPIVMEPYVSSMDALVAAWLPGTE-GLGVTDALYGDHGFSGKLPR

Query:  TWFKSVDQLPM-----------------NFGDRHYD----PLFPLGFGL--TTGSVKDI
        ++ +SV Q+P+                  +  R++D     L+P G+GL  TT +V D+
Subjt:  TWFKSVDQLPM-----------------NFGDRHYD----PLFPLGFGL--TTGSVKDI

Q23892 Lysosomal beta glucosidase2.7e-7030.28Show/hide
Query:  VKDLLGRMTLEEKIGQMVQIDRS--------VANATVM----KNYFIGSVL----SGGGSVPLPDARAQDWVDMINDFQKGSL-SSRLGIPMIYGIDAVH
        V +L+ +M++ EKIGQM Q+D +          N T +    K Y+IGS L    SGG +  +    +  W+DMIN  Q   +  S   IPMIYG+D+VH
Subjt:  VKDLLGRMTLEEKIGQMVQIDRS--------VANATVM----KNYFIGSVL----SGGGSVPLPDARAQDWVDMINDFQKGSL-SSRLGIPMIYGIDAVH

Query:  GHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNM-TEIIIGLQG-----EPPANFRKGI
        G N V+ AT+FPHN GL AT N +        T+ +  A GI + FAP L +   P W R YE++ EDP +   M    + G QG     + P N     
Subjt:  GHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNM-TEIIIGLQG-----EPPANFRKGI

Query:  PYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSII-KGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGLDR
                 +  AKH+ G    T G +     I    L    +P++ ++I   G  ++M++    NGV MH + + +T  L+  L+F+G  ++DW+ +++
Subjt:  PYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSII-KGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGLDR

Query:  ITSTPHS--NYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPL--GDYSLVNELGSQAHRDLARDAVRQ
        +    H+  +   ++  A+ AGIDM MVP   + F   L  +V    V   R+D +V RIL++K+ +GLF +P    + ++V+ +G    R+ A     +
Subjt:  ITSTPHS--NYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPL--GDYSLVNELGSQAHRDLARDAVRQ

Query:  SLVLLKNGKNDSDPLLPLSKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNATRGTTILAAIKSTVDPSTEV-------------VFREDPDSDFV
        S+ LL+N  N    +LPL+    K +L+ G  AD++    GGW++ WQG +  +    GT+IL  ++   + + +                +   D    
Subjt:  SLVLLKNGKNDSDPLLPLSKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNATRGTTILAAIKSTVDPSTEV-------------VFREDPDSDFV

Query:  KSNGFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVV-VVISGRPIVMEP-YVSSMDALVAAWLPGTE-GLGVTDALYGDHGFSGKLPR
         +      +VVIGE P AET GD   L+M      +++ + ++ K VV +++  RP ++ P  V S  A++ A+LPG+E G  + + L G+   SG+LP 
Subjt:  KSNGFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVV-VVISGRPIVMEP-YVSSMDALVAAWLPGTE-GLGVTDALYGDHGFSGKLPR

Query:  TWFKSVDQLPMNFGDRHYD-----PLFPLGFGLT
        T+  +   + + +  ++ +     PLF  G GL+
Subjt:  TWFKSVDQLPMNFGDRHYD-----PLFPLGFGLT

Q56078 Periplasmic beta-glucosidase4.6e-6229.33Show/hide
Query:  AENLKYKDPKQPVS--VRVKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKNYFIGSVLSGGGSVPLPDAR-AQDWVDMINDFQKGSLSSRLGIPMI
        AENL    P  P +    V DLL +MT++EKIGQ+  I     N       ++K+  +G++ +   +V   D R  QD V  +         SRL IP+ 
Subjt:  AENLKYKDPKQPVS--VRVKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKNYFIGSVLSGGGSVPLPDAR-AQDWVDMINDFQKGSLSSRLGIPMI

Query:  YGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIII-GLQGEPPANFRK
        +  D VHG       TVFP ++GL ++ N D +R +G  +A E    G++ T+AP + V RDPRWGR  E + ED  L   M E ++  +QG+ PA+   
Subjt:  YGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIII-GLQGEPPANFRK

Query:  GIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGL-
                  V+   KHF   G    G   N   +    L   +MP Y   +  G  +VMV+ +S NG    ++  L+   L+    FKG  +SD   + 
Subjt:  GIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGL-

Query:  DRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYS------LVNELGSQAHRDLARD
        + I     ++   +V+ A+ AG+DM M    Y++++     L+K+  V+M  +DDA   +L+VK+ MGLF  P           +     S+ HR  AR+
Subjt:  DRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYS------LVNELGSQAHRDLARD

Query:  AVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFRE------------------
          R+S+VLLKN        LPL KK+  I V G  AD+     G W+ A        A +  T+LA I++ V    ++++ +                  
Subjt:  AVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFRE------------------

Query:  -----DP-------DSDFVKSNGFSYAIVVIGEAPYAETEGDS-TTLTMLDPGPSIIKNVCESVK-CVVVVISGRPIVMEPYVSSMDALVAAWLPGTE-G
             DP       D     +      + V+GE+     E  S T +T+      +I  +  + K  V+V+++GRP+ +       DA++  W  GTE G
Subjt:  -----DP-------DSDFVKSNGFSYAIVVIGEAPYAETEGDS-TTLTMLDPGPSIIKNVCESVK-CVVVVISGRPIVMEPYVSSMDALVAAWLPGTE-G

Query:  LGVTDALYGDHGFSGKLPRTWFKSVDQLPM-----------------NFGDRHYD----PLFPLGFGL--TTGSVKDIVARS
          + D L+GD+  SGKLP ++ +SV Q+P+                  +  R++D    PL+P G+GL  TT +V D+   S
Subjt:  LGVTDALYGDHGFSGKLPRTWFKSVDQLPM-----------------NFGDRHYD----PLFPLGFGL--TTGSVKDIVARS

T2KMH0 Beta-xylosidase6.6e-5330.05Show/hide
Query:  QKGSLSSRLGIPMIYGIDAVHGHNNVY----NATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAV-CRDPRWGRCYESYSEDPKLVQN
        Q    + RLGIP +   +A+HG   V     N TV+P  V   +T  P+L++++ + TA E RA G+++ ++P L V   D R+GR  ESY EDP LV  
Subjt:  QKGSLSSRLGIPMIYGIDAVHGHNNVY----NATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAV-CRDPRWGRCYESYSEDPKLVQN

Query:  M-TEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIK-GVSSVMVSYSSWNGVKMHANRELITR
        M    I GLQG     F +          VIA AKHFVG      GIN   + +    L  +++P +  ++ + GV SVM  +  +NGV  H N  L+  
Subjt:  M-TEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIK-GVSSVMVSYSSWNGVKMHANRELITR

Query:  FLKSTLKFKGFVISDWEGLDRITSTPH---SNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVS----MDRIDDAVARILSVKFTMGLFES-P
         L+  L F GF++SD   + R+  T H    N T +    + AG+DM +V  K  E     T ++K+ ++     M  ID A +RIL+ K+ +GLF++ P
Subjt:  FLKSTLKFKGFVISDWEGLDRITSTPH---SNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVS----MDRIDDAVARILSVKFTMGLFES-P

Query:  LGDYSLVNELGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLS-KKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEV
            +   E G+  HR+ A +   +S+++LKN  N    LLPL   K   + V G +A     + G + +   G+SG       ++L  +K  V    ++
Subjt:  LGDYSLVNELGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLS-KKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEV

Query:  VFREDPDSDFVKSNGFSYAI----------VVIGEAPYAETE-GDSTTLTMLDPGPSIIKNVCESVKCVVVV-ISGRPIVMEPYVSSMDALVAAWLPGTE
         + +  D D     GF  AI          +V+G +     E GD   L +      +++ + ++ K V+VV I+GRP+ +     ++ +++  W  G  
Subjt:  VFREDPDSDFVKSNGFSYAI----------VVIGEAPYAETE-GDSTTLTMLDPGPSIIKNVCESVKCVVVV-ISGRPIVMEPYVSSMDALVAAWLPGTE

Query:  -GLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDR--------------HYDPLFPLGFGLTTGSVK
         G  V + ++GD    GKL  ++ + V Q+P+ + +R                 PLFP GFGL+  + K
Subjt:  -GLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDR--------------HYDPLFPLGFGLTTGSVK

Arabidopsis top hitse value%identityAlignment
AT3G47000.1 Glycosyl hydrolase family protein6.7e-21057.96Show/hide
Query:  IMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID
        ++V   +  YK+   PV  RVKDLL RMTL EKIGQM QI+R VA+ +   ++FIGSVL+ GGSVP  DA++ DW DMI+ FQ+ +L+SRLGIP+IYG D
Subjt:  IMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID

Query:  AVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANFRKGIPYV
        AVHG+NNVY ATVFPHN+GLGATR+ DL+RRIGAATALEVRA+G+ + F+PC+AV RDPRWGRCYESY EDP+LV  MT ++ GLQG PP     G P+V
Subjt:  AVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANFRKGIPYV

Query:  GGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGLDRITST
         G   V+AC KHFVGDGGT  GINE NT+     L  IH+P YL  + +GVS+VM SYSSWNG ++HA+R L+T  LK  L FKGF++SDWEGLDR++  
Subjt:  GGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGLDRITST

Query:  PHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVRQSLVLLKNG
          SNY Y ++ A++AGIDMVMVP+KY +FI D+T LV++  + M RI+DAV RIL VKF  GLF  PL D SL+  +G + HR+LA++AVR+SLVLLK+G
Subjt:  PHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVRQSLVLLKNG

Query:  KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSD-FVKSNGFSYAIVVIGEAPYAETEG
        KN   P LPL + A +ILV GTHAD+LGYQCGGWT  W G SG   T GTT+L AIK  V   TEV++ + P  +    S GFSYAIV +GE PYAET G
Subjt:  KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSD-FVKSNGFSYAIVVIGEAPYAETEG

Query:  DSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYV-SSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPL
        D++ L +   G  I+  V E +  +V++ISGRP+V+EP V    +ALVAAWLPGTEG GV D ++GD+ F GKLP +WFK V+ LP++     YDPLFP 
Subjt:  DSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYV-SSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPL

Query:  GFGLTTGSV
        GFGL +  V
Subjt:  GFGLTTGSV

AT5G04885.1 Glycosyl hydrolase family protein4.3e-28971.69Show/hide
Query:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN
        M++  V++V +L     W       E L YKDPKQ VS RV DL GRMTLEEKIGQMVQIDRSVA   +M++YFIGSVLSGGGS PLP+A AQ+WVDMIN
Subjt:  MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMIN

Query:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE
        ++QKG+L SRLGIPMIYGIDAVHGHNNVYNAT+FPHNVGLGATR+PDL++RIGAATA+EVRATGI YTFAPC+AVCRDPRWGRCYESYSED K+V++MT+
Subjt:  DFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTE

Query:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST
        +I+GLQGEPP+N++ G+P+VGG  KV ACAKH+VGDGGTT G+NENNTV D HGLL +HMPAY D++ KGVS+VMVSYSSWNG KMHAN ELIT +LK T
Subjt:  IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST

Query:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ
        LKFKGFVISDW+G+D+I++ PH++YT SV+AAI AGIDMVMVP+ + EF++DLT LVKNN + + RIDDAV RIL VKFTMGLFE+PL DYS  +ELGSQ
Subjt:  LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQ

Query:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN
        AHRDLAR+AVR+SLVLLKNG N ++P+LPL +K  KILVAGTHADNLGYQCGGWTI WQGFSGN  TRGTT+L+A+KS VD STEVVFRE+PD++F+KSN
Subjt:  AHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSN

Query:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
         F+YAI+ +GE PYAET GDS  LTMLDPGP+II + C++VKCVVVVISGRP+VMEPYV+S+DALVAAWLPGTEG G+TDAL+GDHGFSGKLP TWF++ 
Subjt:  GFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV

Query:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICV
        +QLPM++GD HYDPLF  G GL T SV  IVARSTSA A  T   +  ++ +  +C+
Subjt:  DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICV

AT5G20940.1 Glycosyl hydrolase family protein6.9e-24768.06Show/hide
Query:  NLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHN
        N KYKDPK+P+ VR+K+L+  MTLEEKIGQMVQ++R  A   VM+ YF+GSV SGGGSVP P    + WV+M+N+ QK +LS+RLGIP+IYGIDAVHGHN
Subjt:  NLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHN

Query:  NVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKV
         VYNAT+FPHNVGLG TR+P L++RIG ATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSED K+VQ MTEII GLQG+ P   +KG+P+V G  KV
Subjt:  NVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKV

Query:  IACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGLDRITSTPHSNYT
         ACAKHFVGDGGT  G+N NNTVI+ +GLLGIHMPAY D++ KGV++VMVSYSS NG+KMHAN++LIT FLK+ LKF+G VISD+ G+D+I +   +NY+
Subjt:  IACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGLDRITSTPHSNYT

Query:  YSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVRQSLVLLKNGKNDSDP
        +SV AA +AG+DM M      + ID+LT  VK   + M RIDDAV RIL VKFTMGLFE+P+ D+SL  +LGS+ HR+LAR+AVR+SLVLLKNG+N   P
Subjt:  YSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVRQSLVLLKNGKNDSDP

Query:  LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETEGDSTTLTM
        LLPL KKA KILVAGTHADNLGYQCGGWTI WQG +GNN T GTTILAA+K TVDP T+V++ ++PD++FVK+  F YAIV +GE PYAE  GDST LT+
Subjt:  LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETEGDSTTLTM

Query:  LDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPLGFGLTT
         +PGPS I NVC SVKCVVVV+SGRP+VM+  +S++DALVAAWLPGTEG GV D L+GD+GF+GKL RTWFK+VDQLPMN GD HYDPL+P GFGL T
Subjt:  LDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPLGFGLTT

AT5G20950.1 Glycosyl hydrolase family protein2.7e-26770.06Show/hide
Query:  ILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR
        +LCL      +      LKYKDPKQP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VMK YFIGSVLSGGGSVP   A  + WV+M+N+ QK SLS+R
Subjt:  ILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR

Query:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPP
        LGIPMIYGIDAVHGHNNVY AT+FPHNVGLG TR+P+L++RIGAATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSED ++VQ MTEII GLQG+ P
Subjt:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPP

Query:  ANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISD
           RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVID  GL GIHMP Y +++ KGV+++MVSYS+WNG++MHAN+EL+T FLK+ LKF+GFVISD
Subjt:  ANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISD

Query:  WEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAV
        W+G+DRIT+ PH NY+YSV A ISAGIDM+MVPY Y EFID+++  ++  ++ + RIDDA+ RIL VKFTMGLFE PL D S  N+LGS+ HR+LAR+AV
Subjt:  WEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAV

Query:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIG
        R+SLVLLKNGK  + PLLPL KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GTTILAA+K+TV P+T+VV+ ++PD++FVKS  F YAIVV+G
Subjt:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIG

Query:  EAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDR
        E PYAE  GD+T LT+ DPGPSII NVC SVKCVVVV+SGRP+V++PYVS++DALVAAWLPGTEG GV DAL+GD+GF+GKL RTWFKSV QLPMN GDR
Subjt:  EAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDR

Query:  HYDPLFPLGFGLTTGSVK
        HYDPL+P GFGLTT   K
Subjt:  HYDPLFPLGFGLTTGSVK

AT5G20950.2 Glycosyl hydrolase family protein2.7e-26770.06Show/hide
Query:  ILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR
        +LCL      +      LKYKDPKQP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VMK YFIGSVLSGGGSVP   A  + WV+M+N+ QK SLS+R
Subjt:  ILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR

Query:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPP
        LGIPMIYGIDAVHGHNNVY AT+FPHNVGLG TR+P+L++RIGAATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSED ++VQ MTEII GLQG+ P
Subjt:  LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPP

Query:  ANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISD
           RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVID  GL GIHMP Y +++ KGV+++MVSYS+WNG++MHAN+EL+T FLK+ LKF+GFVISD
Subjt:  ANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISD

Query:  WEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAV
        W+G+DRIT+ PH NY+YSV A ISAGIDM+MVPY Y EFID+++  ++  ++ + RIDDA+ RIL VKFTMGLFE PL D S  N+LGS+ HR+LAR+AV
Subjt:  WEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAV

Query:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIG
        R+SLVLLKNGK  + PLLPL KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GTTILAA+K+TV P+T+VV+ ++PD++FVKS  F YAIVV+G
Subjt:  RQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIG

Query:  EAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDR
        E PYAE  GD+T LT+ DPGPSII NVC SVKCVVVV+SGRP+V++PYVS++DALVAAWLPGTEG GV DAL+GD+GF+GKL RTWFKSV QLPMN GDR
Subjt:  EAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDR

Query:  HYDPLFPLGFGLTTGSVK
        HYDPL+P GFGLTT   K
Subjt:  HYDPLFPLGFGLTTGSVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAAATTTTTGTTCAGGTGGTTGTGATTCTGTGCTTGGGTTGGTGGTGGTGGGCTATAATGGTGGGCGCTGAGAATTTGAAGTACAAAGACCCTAAGCAGCCAGT
TTCTGTTCGAGTTAAGGATCTTCTTGGCCGAATGACTCTGGAAGAGAAAATTGGTCAGATGGTTCAGATTGACAGGAGCGTTGCCAATGCTACAGTTATGAAAAATTATT
TTATTGGAAGTGTGCTAAGTGGTGGTGGAAGTGTGCCGCTTCCAGATGCTCGTGCTCAAGATTGGGTTGACATGATTAATGATTTCCAGAAGGGTTCTCTTTCTAGTCGA
TTGGGCATCCCAATGATTTATGGCATTGATGCTGTTCACGGCCATAACAACGTTTACAATGCTACAGTATTTCCTCATAATGTTGGACTGGGAGCTACCAGGAACCCTGA
CCTACTTCGAAGGATTGGTGCAGCAACAGCACTAGAAGTTCGAGCCACAGGGATTTCTTATACCTTTGCTCCTTGCCTTGCGGTTTGTAGGGACCCGAGGTGGGGGCGGT
GTTATGAGAGCTACAGTGAGGATCCAAAACTTGTGCAAAATATGACTGAGATTATAATTGGTTTGCAAGGAGAGCCTCCTGCTAATTTCCGGAAGGGGATTCCTTATGTT
GGTGGAACGAAGAAAGTTATCGCCTGTGCAAAGCACTTTGTTGGAGATGGTGGGACAACTCATGGCATCAATGAGAATAACACCGTTATTGACAGGCATGGACTGCTCGG
CATTCACATGCCTGCCTATTTAGATTCGATCATCAAGGGCGTTTCATCGGTAATGGTTTCGTATTCTAGTTGGAATGGAGTAAAGATGCATGCAAACCGTGAGCTCATTA
CTCGCTTTCTCAAGAGTACCCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTCGACAGAATAACTTCTACGCCGCATTCTAATTACACGTACTCTGTCCAA
GCTGCAATTTCAGCTGGCATTGACATGGTCATGGTTCCTTACAAATATGCGGAGTTCATTGATGATCTTACGTTGTTAGTGAAGAACAATGTCGTATCGATGGATCGAAT
TGACGATGCTGTTGCCAGAATTTTATCAGTCAAGTTCACAATGGGACTTTTTGAAAGCCCCTTGGGTGATTACAGCCTTGTCAATGAGCTTGGGAGTCAGGCGCATAGAG
ACTTGGCAAGAGATGCTGTGAGGCAGTCACTCGTACTGCTGAAGAATGGTAAAAATGATAGCGATCCGTTGCTACCCCTTTCAAAGAAGGCCCCAAAGATCCTTGTTGCT
GGTACTCATGCTGATAATTTAGGATATCAATGTGGTGGGTGGACAATTGCATGGCAAGGATTCAGTGGCAACAATGCTACAAGGGGAACTACCATCCTCGCTGCCATTAA
ATCAACAGTTGATCCGAGCACAGAGGTTGTATTCCGTGAGGATCCCGACAGTGACTTTGTTAAGTCCAATGGCTTCTCATACGCCATTGTCGTTATTGGCGAAGCCCCGT
ATGCCGAGACAGAAGGGGATAGTACGACGCTTACCATGTTAGATCCAGGTCCAAGCATCATAAAAAATGTTTGTGAGTCTGTGAAGTGTGTGGTGGTTGTCATTTCTGGG
AGGCCAATTGTGATGGAACCATACGTTTCATCCATGGATGCTCTTGTAGCAGCTTGGTTACCTGGGACTGAAGGCCTGGGAGTCACTGATGCCCTCTATGGAGACCATGG
GTTTAGTGGGAAGCTTCCAAGAACATGGTTTAAATCTGTAGATCAACTGCCAATGAACTTTGGAGATCGACACTACGATCCACTTTTTCCTTTGGGTTTCGGACTCACAA
CTGGATCGGTCAAGGACATCGTTGCGAGGTCGACATCGGCAGGAGCTCAAGCAACACCATCCTTTATTGCAATGATCGTTGCTACAATCGCCATTTGTGTACTACAGGTA
CACTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATATTATCCATTTATGTTTCAATATATAATTGATTTATGATCCGACCATGAACCAAGGGGTTTGGAATTGGAGCTTGTATTTAAGTCCCTTAATCTACACTTGGGATTAA
CAAGGAAGCTCATTAATGGCGGAAGAAGGGTGAGGAATCTGTTCTCTCTATTTTGCTCCCACTTTTTCCAAGAACGTCCATTATCCCACGACTAACCTTTCTCTTTCCCA
CCAACTTACTCTACTTTTACGGATTCACCATTCCCCAGCTTTCTCTCACCATTACAACTGGCTTTTTCCATTGCTTTAACTTAACTTCCTCTTCCGTATTCGGATTTCTG
CCATTGCCACTCTCATTTACGCTTTCAGTGTGGGTTTAACTCTTCGGGAAAGAAGATGGCCAAAATTTTTGTTCAGGTGGTTGTGATTCTGTGCTTGGGTTGGTGGTGGT
GGGCTATAATGGTGGGCGCTGAGAATTTGAAGTACAAAGACCCTAAGCAGCCAGTTTCTGTTCGAGTTAAGGATCTTCTTGGCCGAATGACTCTGGAAGAGAAAATTGGT
CAGATGGTTCAGATTGACAGGAGCGTTGCCAATGCTACAGTTATGAAAAATTATTTTATTGGAAGTGTGCTAAGTGGTGGTGGAAGTGTGCCGCTTCCAGATGCTCGTGC
TCAAGATTGGGTTGACATGATTAATGATTTCCAGAAGGGTTCTCTTTCTAGTCGATTGGGCATCCCAATGATTTATGGCATTGATGCTGTTCACGGCCATAACAACGTTT
ACAATGCTACAGTATTTCCTCATAATGTTGGACTGGGAGCTACCAGGAACCCTGACCTACTTCGAAGGATTGGTGCAGCAACAGCACTAGAAGTTCGAGCCACAGGGATT
TCTTATACCTTTGCTCCTTGCCTTGCGGTTTGTAGGGACCCGAGGTGGGGGCGGTGTTATGAGAGCTACAGTGAGGATCCAAAACTTGTGCAAAATATGACTGAGATTAT
AATTGGTTTGCAAGGAGAGCCTCCTGCTAATTTCCGGAAGGGGATTCCTTATGTTGGTGGAACGAAGAAAGTTATCGCCTGTGCAAAGCACTTTGTTGGAGATGGTGGGA
CAACTCATGGCATCAATGAGAATAACACCGTTATTGACAGGCATGGACTGCTCGGCATTCACATGCCTGCCTATTTAGATTCGATCATCAAGGGCGTTTCATCGGTAATG
GTTTCGTATTCTAGTTGGAATGGAGTAAAGATGCATGCAAACCGTGAGCTCATTACTCGCTTTCTCAAGAGTACCCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGA
GGGTCTCGACAGAATAACTTCTACGCCGCATTCTAATTACACGTACTCTGTCCAAGCTGCAATTTCAGCTGGCATTGACATGGTCATGGTTCCTTACAAATATGCGGAGT
TCATTGATGATCTTACGTTGTTAGTGAAGAACAATGTCGTATCGATGGATCGAATTGACGATGCTGTTGCCAGAATTTTATCAGTCAAGTTCACAATGGGACTTTTTGAA
AGCCCCTTGGGTGATTACAGCCTTGTCAATGAGCTTGGGAGTCAGGCGCATAGAGACTTGGCAAGAGATGCTGTGAGGCAGTCACTCGTACTGCTGAAGAATGGTAAAAA
TGATAGCGATCCGTTGCTACCCCTTTCAAAGAAGGCCCCAAAGATCCTTGTTGCTGGTACTCATGCTGATAATTTAGGATATCAATGTGGTGGGTGGACAATTGCATGGC
AAGGATTCAGTGGCAACAATGCTACAAGGGGAACTACCATCCTCGCTGCCATTAAATCAACAGTTGATCCGAGCACAGAGGTTGTATTCCGTGAGGATCCCGACAGTGAC
TTTGTTAAGTCCAATGGCTTCTCATACGCCATTGTCGTTATTGGCGAAGCCCCGTATGCCGAGACAGAAGGGGATAGTACGACGCTTACCATGTTAGATCCAGGTCCAAG
CATCATAAAAAATGTTTGTGAGTCTGTGAAGTGTGTGGTGGTTGTCATTTCTGGGAGGCCAATTGTGATGGAACCATACGTTTCATCCATGGATGCTCTTGTAGCAGCTT
GGTTACCTGGGACTGAAGGCCTGGGAGTCACTGATGCCCTCTATGGAGACCATGGGTTTAGTGGGAAGCTTCCAAGAACATGGTTTAAATCTGTAGATCAACTGCCAATG
AACTTTGGAGATCGACACTACGATCCACTTTTTCCTTTGGGTTTCGGACTCACAACTGGATCGGTCAAGGACATCGTTGCGAGGTCGACATCGGCAGGAGCTCAAGCAAC
ACCATCCTTTATTGCAATGATCGTTGCTACAATCGCCATTTGTGTACTACAGGTACACTTCTAGTTCTATCTAGTAGGCATTAGTTTGTTGCAAACTTTGGTGGCTATTT
TTAGGAAATTTTGAGCTCTACGAGGGTTACAATATCATCAGCCTCAGATGCTTTGCCATTTATTTTAGCGACTGGTTATTATGGATTTGTAGTAGCAAAGACTTTTCAGT
TCCATTTACATATTAACCTTTATTTTCTTCATCATGGTTCTCCATTGATCGATCCCAGACATGAAATCATGATGTTTTTCATAAAACAAAAAGATATAGACAATGAATAT
ATAGTTTATACCAGGATAAAATTCGATTTGGTTATTCATATATTC
Protein sequenceShow/hide protein sequence
MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSR
LGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANFRKGIPYV
GGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKSTLKFKGFVISDWEGLDRITSTPHSNYTYSVQ
AAISAGIDMVMVPYKYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVA
GTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCESVKCVVVVISG
RPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQV
HF