; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004123 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004123
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionGlycosyl hydrolase family protein
Genome locationscaffold92:207631..212062
RNA-Seq ExpressionMS004123
SyntenyMS004123
Gene Ontology termsGO:0009251 - glucan catabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domainsIPR001764 - Glycoside hydrolase, family 3, N-terminal
IPR002772 - Glycoside hydrolase family 3 C-terminal domain
IPR017853 - Glycoside hydrolase superfamily
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily
IPR036962 - Glycoside hydrolase, family 3, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446717.1 PREDICTED: beta-glucosidase BoGH3B isoform X2 [Cucumis melo]0.0e+0085.4Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MA+IFVQVV ILCLGW WWAT VDAE LKYKDPKQPV VRV DLLGRMTLEEKIGQMVQIDRSVAN TVMKDY IGS+LSGGGSVPLPDARAEDWV+MIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        +FQKGSLSSRLGIPM YGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDL RRIGAATALEVRATGIS+ FAPC+A
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIV+EMTEII GLQGEPPANYRKG PYVGGTKKVIACAKHFVGDGGTT+GINENNTVI+ HGLLSIHMPAY DSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        M SYSSWNGVKMHAN ELIT FLKG LKFKGFVISDWEGLDRITS PHSNYTYSVQAAILAGIDMVM+PYKY EFIDDL  LVKSNVIPMDRIDDA GRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        L+VKF+MGLFE+PM DYSLVNELGSQ HRDLAR+AVRQSLVLLKNGKNDS+P+LPL KK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGT+ILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVF EDPDS+FVKSNDFSYAIVV+GE PYAE+ GDSTTLTMLDPGPN IKNVCD V+CVV+++SGRPIV+EPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        G GVTD LYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGL T SVKD++ARSTS+GIRGT S+IA+I+V A+ +CIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

XP_022150694.1 uncharacterized protein LOC111018764 isoform X1 [Momordica charantia]0.0e+0095.77Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDLVRRIGAATALEVRATGISFAFAPCIA
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLV+SNVIPMDRIDDAAGRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTH DNLGYQCGGWTIAWQGFSGNNGTRGTSILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        GLGVTDCLYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

XP_022150697.1 uncharacterized protein LOC111018764 isoform X2 [Momordica charantia]0.0e+0095.77Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDLVRRIGAATALEVRATGISFAFAPCIA
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLV+SNVIPMDRIDDAAGRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTH DNLGYQCGGWTIAWQGFSGNNGTRGTSILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        GLGVTDCLYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

XP_022150698.1 uncharacterized protein LOC111018764 isoform X3 [Momordica charantia]0.0e+0095.77Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDLVRRIGAATALEVRATGISFAFAPCIA
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLV+SNVIPMDRIDDAAGRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTH DNLGYQCGGWTIAWQGFSGNNGTRGTSILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        GLGVTDCLYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

XP_022150699.1 uncharacterized protein LOC111018764 isoform X4 [Momordica charantia]0.0e+0095.77Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDLVRRIGAATALEVRATGISFAFAPCIA
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLV+SNVIPMDRIDDAAGRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTH DNLGYQCGGWTIAWQGFSGNNGTRGTSILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        GLGVTDCLYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

TrEMBL top hitse value%identityAlignment
A0A1S3BGE4 beta-glucosidase BoGH3B isoform X10.0e+0085.4Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MA+IFVQVV ILCLGW WWAT VDAE LKYKDPKQPV VRV DLLGRMTLEEKIGQMVQIDRSVAN TVMKDY IGS+LSGGGSVPLPDARAEDWV+MIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        +FQKGSLSSRLGIPM YGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDL RRIGAATALEVRATGIS+ FAPC+A
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIV+EMTEII GLQGEPPANYRKG PYVGGTKKVIACAKHFVGDGGTT+GINENNTVI+ HGLLSIHMPAY DSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        M SYSSWNGVKMHAN ELIT FLKG LKFKGFVISDWEGLDRITS PHSNYTYSVQAAILAGIDMVM+PYKY EFIDDL  LVKSNVIPMDRIDDA GRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        L+VKF+MGLFE+PM DYSLVNELGSQ HRDLAR+AVRQSLVLLKNGKNDS+P+LPL KK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGT+ILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVF EDPDS+FVKSNDFSYAIVV+GE PYAE+ GDSTTLTMLDPGPN IKNVCD V+CVV+++SGRPIV+EPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        G GVTD LYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGL T SVKD++ARSTS+GIRGT S+IA+I+V A+ +CIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

A0A6J1DA47 uncharacterized protein LOC111018764 isoform X30.0e+0095.77Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDLVRRIGAATALEVRATGISFAFAPCIA
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLV+SNVIPMDRIDDAAGRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTH DNLGYQCGGWTIAWQGFSGNNGTRGTSILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        GLGVTDCLYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

A0A6J1DAV0 uncharacterized protein LOC111018764 isoform X40.0e+0095.77Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDLVRRIGAATALEVRATGISFAFAPCIA
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLV+SNVIPMDRIDDAAGRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTH DNLGYQCGGWTIAWQGFSGNNGTRGTSILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        GLGVTDCLYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

A0A6J1DBF5 uncharacterized protein LOC111018764 isoform X20.0e+0095.77Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDLVRRIGAATALEVRATGISFAFAPCIA
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLV+SNVIPMDRIDDAAGRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTH DNLGYQCGGWTIAWQGFSGNNGTRGTSILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        GLGVTDCLYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

A0A6J1DCA3 uncharacterized protein LOC111018764 isoform X10.0e+0095.77Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGAT                          RNPDLVRRIGAATALEVRATGISFAFAPCIA
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLV+SNVIPMDRIDDAAGRI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTH DNLGYQCGGWTIAWQGFSGNNGTRGTSILA
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
        GLGVTDCLYGDHGFSGKLPRTWFK VDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICIL

SwissProt top hitse value%identityAlignment
A7LXU3 Beta-glucosidase BoGH3B1.7e-7530.34Show/hide
Query:  PKQP-VAVRVMDLLGRMTLEEKIGQMVQIDRSVAN-----------------VTVMKDYSIGSVLSGGGSVPLPDA-RAEDWVNMINEFQKGSLSSRLGI
        P  P +   + + L +MTLE+KIGQM +I   V +                  TV+  Y +GS+L    +VPL  A + E W   I + Q+ S+   +GI
Subjt:  PKQP-VAVRVMDLLGRMTLEEKIGQMVQIDRSVAN-----------------VTVMKDYSIGSVLSGGGSVPLPDA-RAEDWVNMINEFQKGSLSSRLGI

Query:  PMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYES
        P +YG+D +HG     + T+FP  + +GAT                           N +L RR    +A E +A  I + FAP + + RDPRW R +E+
Subjt:  PMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYES

Query:  YSEDPKIVQEM-TEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKM
        Y ED  +  EM    + G QGE P           G   V AC KH++G G   +G +   + I    +   H   +  ++ +G  SVM++    NG+  
Subjt:  YSEDPKIVQEM-TEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKM

Query:  HANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYT--YSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLF
        HAN EL+T +LK  L + G +++DW  ++ + +  H   T   +V+  I AGIDM MVPY+   F D L  LV+   + M+RIDDA  R+L +K+ +GLF
Subjt:  HANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYT--YSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLF

Query:  ENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRG-TSILAAI-----KS
        ++P  D    ++ GS+E   +A +A  +S VLLKN  N    +LP+  K  KIL+ G +A+++    GGW+ +WQG   +   +   +I  A+     K 
Subjt:  ENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRG-TSILAAI-----KS

Query:  TVDPSTEVVFSEDPDSNFVKSN------------DFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVS-GRPIVMEPYISSIDAL
         +     V ++   + N+ + N                 I  +GE  Y E+ G+ T LT+ +   N +K +  + K +V+V++ GRP ++   +    A+
Subjt:  TVDPSTEVVFSEDPDSNFVKSN------------DFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVS-GRPIVMEPYISSIDAL

Query:  VAAWLPGT-EGLGVTDCLYGDHGFSGKLPRTW-----------FKYVDQLPMNVGDPHYDPL----FPFGFGLGTDSVK
        V   LP    G  + + L GD  FSGK+P T+           +K  + +    G+ +YD +    +PFGFGL   + K
Subjt:  VAAWLPGT-EGLGVTDCLYGDHGFSGKLPRTW-----------FKYVDQLPMNVGDPHYDPL----FPFGFGLGTDSVK

P33363 Periplasmic beta-glucosidase6.9e-5327.75Show/hide
Query:  VMDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYSIGSVLSGGGSVPLPDARA-EDWVNMINEFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVF
        V +LL +MT++EKIGQ+  I     N       ++KD  +G++ +   +V   D RA +D V  +         SRL IP+ +  D +HG       TVF
Subjt:  VMDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYSIGSVLSGGGSVPLPDARA-EDWVNMINEFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVF

Query:  PHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQG
        P ++GL +                          + N D V+ +G  +A E    G++  +AP + V RDPRWGR  E + ED  +   M + ++  +QG
Subjt:  PHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQG

Query:  EPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKMHANHELITGFLKGTLKFKGFV
        + PA+             V+   KHF   G    G   N   +    L + +MP Y   +  G  +VM++ +S NG    ++  L+   L+    FKG  
Subjt:  EPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKMHANHELITGFLKGTLKFKGFV

Query:  ISDWEGL-DRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYS------LVNELGSQ
        +SD   + + I     ++   +V+ A+ +GI+M M    Y+++   L  L+KS  + M  +DDAA  +L+VK+ MGLF +P           +     S+
Subjt:  ISDWEGL-DRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYS------LVNELGSQ

Query:  EHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFSE----------
         HR  ARE  R+SLVLLKN        LPL KK+  I V G  AD+     G W+ A          +  ++L  IK+ V  + +V++++          
Subjt:  EHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFSE----------

Query:  -------------DP-------DSNFVKSNDFSYAIVVVGE-MPYAESVGDSTTLTMLDPGPNTIKNVCDSVKC-----VVVVVSGRPIVMEPYISSIDA
                     DP       D     +      + VVGE    A      T +T+    P + +++  ++K      V+V+++GRP+ +       DA
Subjt:  -------------DP-------DSNFVKSNDFSYAIVVVGE-MPYAESVGDSTTLTMLDPGPNTIKNVCDSVKC-----VVVVVSGRPIVMEPYISSIDA

Query:  LVAAWLPGTE-GLGVTDCLYGDHGFSGKLPRTWFKYVDQLP-----MNVGDP------------HYD----PLFPFGFGLG--TDSVKD--LVARSTSSG
        ++  W  GTE G  + D L+GD+  SGKLP ++ + V Q+P     +N G P            ++D     L+PFG+GL   T +V D  L A +    
Subjt:  LVAAWLPGTE-GLGVTDCLYGDHGFSGKLPRTWFKYVDQLP-----MNVGDP------------HYD----PLFPFGFGLG--TDSVKD--LVARSTSSG

Query:  IRGTASVIAT
         + TASV  T
Subjt:  IRGTASVIAT

Q23892 Lysosomal beta glucosidase4.9e-6729.77Show/hide
Query:  VMDLLGRMTLEEKIGQMVQID----RSVANVTV--------MKDYSIGSVL----SGGGSVPLPDARAEDWVNMINEFQKGSL-SSRLGIPMMYGIDAVH
        V +L+ +M++ EKIGQM Q+D     S   +T+         K Y IGS L    SGG +  +    +  W++MIN  Q   +  S   IPM+YG+D+VH
Subjt:  VMDLLGRMTLEEKIGQMVQID----RSVANVTV--------MKDYSIGSVL----SGGGSVPLPDARAEDWVNMINEFQKGSL-SSRLGIPMMYGIDAVH

Query:  GHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQE
        G N V+ AT+FPHN GL AT  +         ++  T+  +                 T+ +  A GI + FAP + +   P W R YE++ EDP +   
Subjt:  GHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQE

Query:  M-TEIIPGLQG-----EPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSII-KGVSSVMISYSSWNGVKMHANH
        M    + G QG     + P N              +  AKH+ G    T+G +     I    L    +P++ ++I   G  ++MI+    NGV MH ++
Subjt:  M-TEIIPGLQG-----EPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSII-KGVSSVMISYSSWNGVKMHANH

Query:  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHS--NYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLFENPM
        + +T  L+G L+F+G  ++DW+ ++++    H+  +   ++  A+ AGIDM MVP   + F   L  +V +  +P  R+D +  RIL++K+++GLF NP 
Subjt:  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHS--NYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLFENPM

Query:  --GDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNGTRGTSILAAIKSTVDPST
           + ++V+ +G  + R+ A     +S+ LL+N  N    +LPL     K +L+ G  AD++    GGW++ WQG +  +    GTSIL  ++   + + 
Subjt:  --GDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNGTRGTSILAAIKSTVDPST

Query:  EVVF---------------SEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVV-VVVSGRPIVMEP-YISSIDALVA
        +                  S D      +S+D    +VV+GE+P AE+ GD   L+M       ++ + D+ K VV ++V  RP ++ P  + S  A++ 
Subjt:  EVVF---------------SEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVV-VVVSGRPIVMEP-YISSIDALVA

Query:  AWLPGTE-GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHY---------DPLFPFGFGL
        A+LPG+E G  + + L G+   SG+LP T+         ++G P+Y          PLF FG GL
Subjt:  AWLPGTE-GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHY---------DPLFPFGFGL

Q56078 Periplasmic beta-glucosidase5.1e-5628.16Show/hide
Query:  AEYLKYKDPKQPVA--VRVMDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYSIGSVLSGGGSVPLPDAR-AEDWVNMINEFQKGSLSSRLGIPMM
        AE L    P  P A    V DLL +MT++EKIGQ+  I     N       ++KD  +G++ +   +V   D R  +D V  +         SRL IP+ 
Subjt:  AEYLKYKDPKQPVA--VRVMDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYSIGSVLSGGGSVPLPDAR-AEDWVNMINEFQKGSLSSRLGIPMM

Query:  YGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSE
        +  D VHG       TVFP ++GL +                          + N D VR +G  +A E    G++  +AP + V RDPRWGR  E + E
Subjt:  YGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSE

Query:  DPKIVQEMTE-IIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKMHAN
        D  +   M E ++  +QG+ PA+             V+   KHF   G    G   N   +    L + +MP Y   +  G  +VM++ +S NG    ++
Subjt:  DPKIVQEMTE-IIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKMHAN

Query:  HELITGFLKGTLKFKGFVISDWEGL-DRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLFENPM
          L+   L+    FKG  +SD   + + I     ++   +V+ A+ AG+DM M    Y+++   L  L+KS  + M  +DDA   +L+VK+ MGLF +P 
Subjt:  HELITGFLKGTLKFKGFVISDWEGL-DRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLFENPM

Query:  GDYS------LVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDP
                  +     S+ HR  ARE  R+S+VLLKN        LPL KK+  I V G  AD+     G W+ A          +  ++LA I++ V  
Subjt:  GDYS------LVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDP

Query:  STEVVFSE-----------------------DP-------DSNFVKSNDFSYAIVVVGE-MPYAESVGDSTTLTMLDPGPNTIKNVCDSVKC-----VVV
          ++++++                       DP       D     +      + VVGE    A      T +T+    P + +++  ++K      V+V
Subjt:  STEVVFSE-----------------------DP-------DSNFVKSNDFSYAIVVVGE-MPYAESVGDSTTLTMLDPGPNTIKNVCDSVKC-----VVV

Query:  VVSGRPIVMEPYISSIDALVAAWLPGTE-GLGVTDCLYGDHGFSGKLPRTWFKYVDQLP-----MNVGDP------------HYD----PLFPFGFGLG-
        +++GRP+ +       DA++  W  GTE G  + D L+GD+  SGKLP ++ + V Q+P     +N G P            ++D    PL+PFG+GL  
Subjt:  VVSGRPIVMEPYISSIDALVAAWLPGTE-GLGVTDCLYGDHGFSGKLPRTWFKYVDQLP-----MNVGDP------------HYD----PLFPFGFGLG-

Query:  -TDSVKDLVARSTSSGIRG--TASVIAT
         T +V D+   S +    G  TASV  T
Subjt:  -TDSVKDLVARSTSSGIRG--TASVIAT

T2KMH0 Beta-xylosidase1.3e-4628.34Show/hide
Query:  QKGSLSSRLGIPMMYGIDAVHGHNNVY----NATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPC
        Q    + RLGIP M   +A+HG   V     N TV+P  V   +T +                          P+L++++ + TA E RA G++  ++P 
Subjt:  QKGSLSSRLGIPMMYGIDAVHGHNNVY----NATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPC

Query:  IAV-CRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIK-
        + V   D R+GR  ESY EDP +V  M    I GLQG     + +          VIA AKHFVG      GIN   + +    L  +++P +  ++ + 
Subjt:  IAV-CRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIK-

Query:  GVSSVMISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAIL---AGIDMVMVPYKYTEFIDDLTHLVKSNVIP---
        GV SVM  +  +NGV  H N  L+   L+  L F GF++SD   + R+ +  H       +AAIL   AG+DM +V  K  E     T+++K  ++    
Subjt:  GVSSVMISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAIL---AGIDMVMVPYKYTEFIDDLTHLVKSNVIP---

Query:  -MDRIDDAAGRILSVKFSMGLFE-NPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLP-KKAPKILVAGTHADNLGYQCGGWTIAWQG
         M  ID A  RIL+ K+ +GLF+  P    +   E G+ EHR+ A E   +S+++LKN  N    +LPL   K   + V G +A     + G + +   G
Subjt:  -MDRIDDAAGRILSVKFSMGLFE-NPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLP-KKAPKILVAGTHADNLGYQCGGWTIAWQG

Query:  FSGNNGTRGTSILAAIKSTVDPSTEVVFSED-----------PDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVV-V
        +SG       S+L  +K  V    ++ +++            P++     N  +  +VV          GD   L +       ++ +  + K V+VV +
Subjt:  FSGNNGTRGTSILAAIKSTVDPSTEVVFSED-----------PDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVV-V

Query:  SGRPIVMEPYISSIDALVAAWLPGTE-GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNV---------GDPHY-----DPLFPFGFGLGTDSVKDLVAR
        +GRP+ +     +I +++  W  G   G  V + ++GD    GKL  ++ + V Q+P+           G   Y      PLFPFGFGL   + K    +
Subjt:  SGRPIVMEPYISSIDALVAAWLPGTE-GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNV---------GDPHY-----DPLFPFGFGLGTDSVKDLVAR

Query:  --STSSGIRGTASV
          +TS    GT +V
Subjt:  --STSSGIRGTASV

Arabidopsis top hitse value%identityAlignment
AT3G47000.1 Glycosyl hydrolase family protein3.3e-20455.59Show/hide
Query:  YKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKGSLSSRLGIPMMYGIDAVHGHNNVY
        YK+   PV  RV DLL RMTL EKIGQM QI+R VA+ +   D+ IGSVL+ GGSVP  DA++ DW +MI+ FQ+ +L+SRLGIP++YG DAVHG+NNVY
Subjt:  YKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKGSLSSRLGIPMMYGIDAVHGHNNVY

Query:  NATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIP
         ATVFPHN+GLGAT                          R+ DLVRRIGAATALEVRA+G+ +AF+PC+AV RDPRWGRCYESY EDP++V EMT ++ 
Subjt:  NATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIP

Query:  GLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKMHANHELITGFLKGTLKF
        GLQG PP  +  G P+V G   V+AC KHFVGDGGT  GINE NT+  +  L  IH+P Y   + +GVS+VM SYSSWNG ++HA+  L+T  LK  L F
Subjt:  GLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKMHANHELITGFLKGTLKF

Query:  KGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYSLVNELGSQEHR
        KGF++SDWEGLDR++ P  SNY Y ++ A+ AGIDMVMVP+KY +FI D+T LV+S  IPM RI+DA  RIL VKF  GLF +P+ D SL+  +G +EHR
Subjt:  KGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYSLVNELGSQEHR

Query:  DLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFSEDPDSNFVKSND-F
        +LA+EAVR+SLVLLK+GKN  +P LPL + A +ILV GTHAD+LGYQCGGWT  W G SG   T GT++L AIK  V   TEV++ + P    + S++ F
Subjt:  DLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFSEDPDSNFVKSND-F

Query:  SYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEP-YISSIDALVAAWLPGTEGLGVTDCLYGDHGFSGKLPRTWFKYVD
        SYAIV VGE PYAE++GD++ L +   G + +  V + +  +V+++SGRP+V+EP  +   +ALVAAWLPGTEG GV D ++GD+ F GKLP +WFK+V+
Subjt:  SYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEP-YISSIDALVAAWLPGTEGLGVTDCLYGDHGFSGKLPRTWFKYVD

Query:  QLPMNVGDPHYDPLFPFGFGLGTDSV
         LP++     YDPLFPFGFGL +  V
Subjt:  QLPMNVGDPHYDPLFPFGFGLGTDSV

AT5G04885.1 Glycosyl hydrolase family protein8.6e-28568.86Show/hide
Query:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN
        M++  V++V +L     W     D EYL YKDPKQ V+ RV DL GRMTLEEKIGQMVQIDRSVA V +M+DY IGSVLSGGGS PLP+A A++WV+MIN
Subjt:  MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMIN

Query:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA
        E+QKG+L SRLGIPM+YGIDAVHGHNNVYNAT+FPHNVGLGAT                          R+PDLV+RIGAATA+EVRATGI + FAPCIA
Subjt:  EFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIA

Query:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV
        VCRDPRWGRCYESYSED K+V++MT++I GLQGEPP+NY+ G+P+VGG  KV ACAKH+VGDGGTT G+NENNTV D HGLLS+HMPAY D++ KGVS+V
Subjt:  VCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSV

Query:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI
        M+SYSSWNG KMHAN ELITG+LKGTLKFKGFVISDW+G+D+I++PPH++YT SV+AAI AGIDMVMVP+ +TEF++DLT LVK+N IP+ RIDDA  RI
Subjt:  MISYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRI

Query:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA
        L VKF+MGLFENP+ DYS  +ELGSQ HRDLAREAVR+SLVLLKNG N + P+LPLP+K  KILVAGTHADNLGYQCGGWTI WQGFSGN  TRGT++L+
Subjt:  LSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILA

Query:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE
        A+KS VD STEVVF E+PD+ F+KSN+F+YAI+ VGE PYAE+ GDS  LTMLDPGP  I + C +VKCVVVV+SGRP+VMEPY++SIDALVAAWLPGTE
Subjt:  AIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTE

Query:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICI
        G G+TD L+GDHGFSGKLP TWF+  +QLPM+ GD HYDPLF +G GL T+SV  +VARSTS+    T   + T++V+A  +C+
Subjt:  GLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATIIVAALAICI

AT5G20940.1 Glycosyl hydrolase family protein1.5e-24966.05Show/hide
Query:  VQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKG
        +Q + +L L     A  V     KYKDPK+P+ VR+ +L+  MTLEEKIGQMVQ++R  A   VM+ Y +GSV SGGGSVP P    E WVNM+NE QK 
Subjt:  VQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKG

Query:  SLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDP
        +LS+RLGIP++YGIDAVHGHN VYNAT+FPHNVGLG T                          R+P LV+RIG ATALEVRATGI + FAPCIAVCRDP
Subjt:  SLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDP

Query:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYS
        RWGRCYESYSED KIVQ+MTEIIPGLQG+ P   +KG+P+V G  KV ACAKHFVGDGGT  G+N NNTVI+ +GLL IHMPAY+D++ KGV++VM+SYS
Subjt:  RWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYS

Query:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKF
        S NG+KMHAN +LITGFLK  LKF+G VISD+ G+D+I +P  +NY++SV AA  AG+DM M     T+ ID+LT  VK   IPM RIDDA  RIL VKF
Subjt:  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKF

Query:  SMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKST
        +MGLFENP+ D+SL  +LGS+EHR+LAREAVR+SLVLLKNG+N  +P+LPLPKKA KILVAGTHADNLGYQCGGWTI WQG +GNN T GT+ILAA+K T
Subjt:  SMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKST

Query:  VDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVT
        VDP T+V+++++PD+NFVK+ DF YAIV VGE PYAE  GDST LT+ +PGP+TI NVC SVKCVVVVVSGRP+VM+  IS+IDALVAAWLPGTEG GV 
Subjt:  VDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVT

Query:  DCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGT
        D L+GD+GF+GKL RTWFK VDQLPMNVGDPHYDPL+PFGFGL T
Subjt:  DCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGT

AT5G20950.1 Glycosyl hydrolase family protein6.9e-26668.32Show/hide
Query:  ILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKGSLSSR
        +LCL             LKYKDPKQP+  R+ DL+ RMTL+EKIGQMVQI+RSVA   VMK Y IGSVLSGGGSVP   A  E WVNM+NE QK SLS+R
Subjt:  ILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKGSLSSR

Query:  LGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRC
        LGIPM+YGIDAVHGHNNVY AT+FPHNVGLG T                          R+P+LV+RIGAATALEVRATGI +AFAPCIAVCRDPRWGRC
Subjt:  LGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRC

Query:  YESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGV
        YESYSED +IVQ+MTEIIPGLQG+ P   RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVID  GL  IHMP YY+++ KGV+++M+SYS+WNG+
Subjt:  YESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGV

Query:  KMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLF
        +MHAN EL+TGFLK  LKF+GFVISDW+G+DRIT+PPH NY+YSV A I AGIDM+MVPY YTEFID+++  ++  +IP+ RIDDA  RIL VKF+MGLF
Subjt:  KMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLF

Query:  ENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPST
        E P+ D S  N+LGS+EHR+LAREAVR+SLVLLKNGK  ++P+LPLPKK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P+T
Subjt:  ENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPST

Query:  EVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVTDCLYG
        +VV+S++PD+NFVKS  F YAIVVVGE PYAE  GD+T LT+ DPGP+ I NVC SVKCVVVVVSGRP+V++PY+S+IDALVAAWLPGTEG GV D L+G
Subjt:  EVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVTDCLYG

Query:  DHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVK
        D+GF+GKL RTWFK V QLPMNVGD HYDPL+PFGFGL T   K
Subjt:  DHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVK

AT5G20950.2 Glycosyl hydrolase family protein6.9e-26668.32Show/hide
Query:  ILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKGSLSSR
        +LCL             LKYKDPKQP+  R+ DL+ RMTL+EKIGQMVQI+RSVA   VMK Y IGSVLSGGGSVP   A  E WVNM+NE QK SLS+R
Subjt:  ILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKGSLSSR

Query:  LGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRC
        LGIPM+YGIDAVHGHNNVY AT+FPHNVGLG T                          R+P+LV+RIGAATALEVRATGI +AFAPCIAVCRDPRWGRC
Subjt:  LGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRC

Query:  YESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGV
        YESYSED +IVQ+MTEIIPGLQG+ P   RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVID  GL  IHMP YY+++ KGV+++M+SYS+WNG+
Subjt:  YESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGV

Query:  KMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLF
        +MHAN EL+TGFLK  LKF+GFVISDW+G+DRIT+PPH NY+YSV A I AGIDM+MVPY YTEFID+++  ++  +IP+ RIDDA  RIL VKF+MGLF
Subjt:  KMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLF

Query:  ENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPST
        E P+ D S  N+LGS+EHR+LAREAVR+SLVLLKNGK  ++P+LPLPKK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P+T
Subjt:  ENPMGDYSLVNELGSQEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPST

Query:  EVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVTDCLYG
        +VV+S++PD+NFVKS  F YAIVVVGE PYAE  GD+T LT+ DPGP+ I NVC SVKCVVVVVSGRP+V++PY+S+IDALVAAWLPGTEG GV D L+G
Subjt:  EVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVTDCLYG

Query:  DHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVK
        D+GF+GKL RTWFK V QLPMNVGD HYDPL+PFGFGL T   K
Subjt:  DHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCAGATTTTTGTTCAGGTGGTTGCGATTCTGTGCTTGGGTTGGTGGTGGTGGGCTACAACGGTGGACGCGGAGTATTTGAAATACAAAGATCCGAAACAACCAGT
TGCTGTTCGAGTTATGGACCTTCTTGGCCGAATGACTCTGGAGGAGAAAATCGGTCAAATGGTCCAGATTGACAGGAGCGTTGCCAATGTTACAGTTATGAAAGATTATT
CCATTGGAAGTGTGCTAAGTGGCGGGGGAAGTGTGCCGCTTCCTGATGCTCGTGCTGAAGATTGGGTGAACATGATAAATGAATTCCAGAAGGGCTCTCTTTCTAGTCGA
TTGGGCATCCCAATGATGTATGGCATTGATGCTGTTCATGGCCATAACAACGTTTACAATGCTACCGTATTTCCTCATAATGTTGGACTTGGAGCAACCAGGCAAGTGAT
TGTCCTTTCCTTATGCCCTGCACATAAGAACGTCATAACCTCTTTACATTTGTTCTTTTCCCTTAACAGAAATCCTGACCTAGTTCGAAGGATTGGTGCTGCAACAGCAC
TTGAAGTTAGAGCTACGGGGATTTCTTTTGCCTTTGCTCCATGCATTGCGGTATGTAGGGACCCGAGGTGGGGGCGATGTTATGAAAGCTACAGCGAGGATCCAAAAATT
GTGCAGGAGATGACAGAGATTATTCCTGGTCTGCAAGGAGAACCTCCTGCTAATTATCGGAAGGGGATTCCATATGTTGGCGGAACCAAGAAGGTTATCGCCTGTGCAAA
GCACTTTGTGGGAGATGGTGGGACAACTAATGGCATCAATGAGAACAATACAGTTATTGACTGGCATGGACTGCTCAGCATTCACATGCCTGCCTACTATGATTCGATCA
TCAAGGGCGTTTCATCGGTAATGATTTCCTATTCCAGTTGGAATGGGGTAAAGATGCATGCAAACCATGAGCTCATTACTGGCTTCCTCAAGGGCACCCTTAAATTTAAG
GGTTTTGTCATCTCAGATTGGGAGGGCCTGGACAGAATTACTTCTCCGCCACATTCTAACTACACCTACTCTGTCCAAGCTGCAATTTTAGCTGGCATTGACATGGTCAT
GGTTCCTTACAAGTATACAGAATTCATTGATGATCTTACGCATCTAGTGAAGAGCAATGTCATCCCAATGGATCGCATTGATGATGCTGCTGGGAGAATTTTATCAGTCA
AGTTTTCAATGGGTCTTTTTGAGAACCCCATGGGAGATTACAGCCTTGTCAATGAGCTCGGGAGCCAGGAACATAGGGACTTGGCAAGAGAAGCTGTGAGGCAGTCGCTG
GTGCTGCTGAAGAATGGGAAAAACGATAGCCAACCGGTGCTACCCCTTCCAAAGAAGGCCCCTAAGATCCTGGTTGCTGGTACTCATGCTGATAATTTAGGATATCAATG
TGGTGGATGGACAATCGCATGGCAAGGGTTCAGTGGCAACAATGGTACAAGAGGAACCAGCATTCTTGCTGCCATCAAATCAACAGTCGACCCGAGCACAGAGGTCGTAT
TCAGTGAGGATCCCGACAGCAACTTTGTGAAGTCTAATGACTTCTCATATGCCATTGTTGTCGTTGGCGAAATGCCGTATGCCGAGAGTGTAGGAGATAGTACTACGCTC
ACCATGTTGGATCCTGGTCCAAACACCATAAAAAATGTTTGTGATTCTGTAAAGTGTGTGGTGGTAGTCGTTTCTGGAAGGCCAATTGTGATGGAACCATACATTTCATC
TATTGATGCTCTTGTAGCAGCTTGGTTACCAGGCACCGAGGGCCTAGGAGTCACTGATTGCCTCTATGGAGACCATGGGTTCAGTGGGAAGCTTCCAAGAACATGGTTTA
AATACGTTGATCAACTACCAATGAATGTAGGAGATCCACACTATGATCCACTTTTTCCTTTCGGGTTCGGACTCGGAACTGACTCGGTCAAGGACCTCGTCGCAAGGTCT
ACATCGTCGGGAATTCGTGGAACTGCATCCGTTATTGCAACAATCATCGTTGCTGCACTCGCCATTTGTATATTG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCAGATTTTTGTTCAGGTGGTTGCGATTCTGTGCTTGGGTTGGTGGTGGTGGGCTACAACGGTGGACGCGGAGTATTTGAAATACAAAGATCCGAAACAACCAGT
TGCTGTTCGAGTTATGGACCTTCTTGGCCGAATGACTCTGGAGGAGAAAATCGGTCAAATGGTCCAGATTGACAGGAGCGTTGCCAATGTTACAGTTATGAAAGATTATT
CCATTGGAAGTGTGCTAAGTGGCGGGGGAAGTGTGCCGCTTCCTGATGCTCGTGCTGAAGATTGGGTGAACATGATAAATGAATTCCAGAAGGGCTCTCTTTCTAGTCGA
TTGGGCATCCCAATGATGTATGGCATTGATGCTGTTCATGGCCATAACAACGTTTACAATGCTACCGTATTTCCTCATAATGTTGGACTTGGAGCAACCAGGCAAGTGAT
TGTCCTTTCCTTATGCCCTGCACATAAGAACGTCATAACCTCTTTACATTTGTTCTTTTCCCTTAACAGAAATCCTGACCTAGTTCGAAGGATTGGTGCTGCAACAGCAC
TTGAAGTTAGAGCTACGGGGATTTCTTTTGCCTTTGCTCCATGCATTGCGGTATGTAGGGACCCGAGGTGGGGGCGATGTTATGAAAGCTACAGCGAGGATCCAAAAATT
GTGCAGGAGATGACAGAGATTATTCCTGGTCTGCAAGGAGAACCTCCTGCTAATTATCGGAAGGGGATTCCATATGTTGGCGGAACCAAGAAGGTTATCGCCTGTGCAAA
GCACTTTGTGGGAGATGGTGGGACAACTAATGGCATCAATGAGAACAATACAGTTATTGACTGGCATGGACTGCTCAGCATTCACATGCCTGCCTACTATGATTCGATCA
TCAAGGGCGTTTCATCGGTAATGATTTCCTATTCCAGTTGGAATGGGGTAAAGATGCATGCAAACCATGAGCTCATTACTGGCTTCCTCAAGGGCACCCTTAAATTTAAG
GGTTTTGTCATCTCAGATTGGGAGGGCCTGGACAGAATTACTTCTCCGCCACATTCTAACTACACCTACTCTGTCCAAGCTGCAATTTTAGCTGGCATTGACATGGTCAT
GGTTCCTTACAAGTATACAGAATTCATTGATGATCTTACGCATCTAGTGAAGAGCAATGTCATCCCAATGGATCGCATTGATGATGCTGCTGGGAGAATTTTATCAGTCA
AGTTTTCAATGGGTCTTTTTGAGAACCCCATGGGAGATTACAGCCTTGTCAATGAGCTCGGGAGCCAGGAACATAGGGACTTGGCAAGAGAAGCTGTGAGGCAGTCGCTG
GTGCTGCTGAAGAATGGGAAAAACGATAGCCAACCGGTGCTACCCCTTCCAAAGAAGGCCCCTAAGATCCTGGTTGCTGGTACTCATGCTGATAATTTAGGATATCAATG
TGGTGGATGGACAATCGCATGGCAAGGGTTCAGTGGCAACAATGGTACAAGAGGAACCAGCATTCTTGCTGCCATCAAATCAACAGTCGACCCGAGCACAGAGGTCGTAT
TCAGTGAGGATCCCGACAGCAACTTTGTGAAGTCTAATGACTTCTCATATGCCATTGTTGTCGTTGGCGAAATGCCGTATGCCGAGAGTGTAGGAGATAGTACTACGCTC
ACCATGTTGGATCCTGGTCCAAACACCATAAAAAATGTTTGTGATTCTGTAAAGTGTGTGGTGGTAGTCGTTTCTGGAAGGCCAATTGTGATGGAACCATACATTTCATC
TATTGATGCTCTTGTAGCAGCTTGGTTACCAGGCACCGAGGGCCTAGGAGTCACTGATTGCCTCTATGGAGACCATGGGTTCAGTGGGAAGCTTCCAAGAACATGGTTTA
AATACGTTGATCAACTACCAATGAATGTAGGAGATCCACACTATGATCCACTTTTTCCTTTCGGGTTCGGACTCGGAACTGACTCGGTCAAGGACCTCGTCGCAAGGTCT
ACATCGTCGGGAATTCGTGGAACTGCATCCGTTATTGCAACAATCATCGTTGCTGCACTCGCCATTTGTATATTG
Protein sequenceShow/hide protein sequence
MAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKGSLSSR
LGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRQVIVLSLCPAHKNVITSLHLFFSLNRNPDLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKI
VQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKMHANHELITGFLKGTLKFK
GFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFIDDLTHLVKSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAVRQSL
VLLKNGKNDSQPVLPLPKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTSILAAIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTL
TMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVTDCLYGDHGFSGKLPRTWFKYVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARS
TSSGIRGTASVIATIIVAALAICIL