; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026823 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026823
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionalpha-N-acetylglucosaminidase-like
Genome locationtig00153047:1332623..1343450
RNA-Seq ExpressionSgr026823
SyntenySgr026823
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR007781 - Alpha-N-acetylglucosaminidase
IPR024240 - Alpha-N-acetylglucosaminidase, N-terminal
IPR024732 - Alpha-N-acetylglucosaminidase, C-terminal
IPR024733 - Alpha-N-acetylglucosaminidase, tim-barrel domain
IPR029018 - Beta-hexosaminidase-like, domain 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587494.1 Alpha-N-acetylglucosaminidase, partial [Cucurbita argyrosperma subsp. sororia]9.0e-28868.16Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MA  F A+FLI +S F   STS SSTIGV YIS +L+IQDRERAP+ VQVAAARGVLRRLLPSHLSSFDFQI+SK                   D CGGE
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------
        SCF+IRNHRAFRRPG+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFSVPK G LP I+SDEII QRPIPLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------

Query:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                 L G    L   G+               + LD   G      +    +++  GG L +S                       VLPAFSGNI
Subjt:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDA DPLFVEIG+AFIEQQLKEYGRTSH+YNCDTFDENTPPVD+ EYISSLGAAIFGGMQAGDS AV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMF------------------------------
        WLMQGWMFSYDPFWRPQQMKALLHSV LGRLVVLDLYAEVKPIWI SEQFYG PYIWKV+IPF C ILM                               
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMF------------------------------

Query:  RCMLHNFAGNVEMYGTLDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNC
        +CMLHNFAGNVEMYG LDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVK+WL+QYSIRRYG LVPSIQDAWDVLYHTIYNC
Subjt:  RCMLHNFAGNVEMYGTLDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNC

Query:  TDGAYSYCFRIVSVFYWSRYFSCN-ITEGSDRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELF
        TDGAY     ++  F      S + I EGSDR+         LQDA F+RPHLWYPTSEVIRALKLF+A GDQ          L+ + R ALAKYSNELF
Subjt:  TDGAYSYCFRIVSVFYWSRYFSCN-ITEGSDRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELF

Query:  FRVAKAYQLHDAQTVASLSQRFLELVKDMDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK
        FR+ KAYQL D QT  SLSQ+FLELV D+DTL+ACHEGFLLGPWL+SAKQLAQDE+QEKQ+EWNARTQITMWFDNTE +
Subjt:  FRVAKAYQLHDAQTVASLSQRFLELVKDMDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK

XP_008453133.1 PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis melo]2.3e-29170.89Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MAS F + FLI V+ FAA STSRSSTIGV YIS +LEIQDRER PA+VQVAAARGVLRRLLPSHLSSFDFQIVSK                   DKCGGE
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------
        SCF+IRNHRAFR+PG+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFSVPKAGLLPRIQ+DE++ +RPIPLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------

Query:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                 L G    L   G+               + LD   G      +    +++  GG L +S                       VLPAFSGNI
Subjt:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFKQIYPSAKITRLGNWF+VHSDPRWCCTYLLDA DPLFVEIGKAFIEQQ KEYGRTSH+YNCDTFDENTPPVDE EYISSLG+AIFGGMQAGDS+AV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS
        WLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLDLYAEVKPIWI+SEQFYGTPYIW              CMLHNFAGNVEMYG LDSIASGPIEARSS
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS

Query:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNI-TEGS
        PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVK+WL QYS+RRYG LVPSIQDAWDVLYHTIYNCTDGA      ++  F      S  +  EGS
Subjt:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNI-TEGS

Query:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD
        D++G LDSS+  LQDATFDRPHLWYPTS+VI ALKLFI GGDQ          L+ + R ALAKYSNELFFR  KAYQL+DAQT+ASLSQ FLELV D+D
Subjt:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD

Query:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK
        TLLACHEGFLLGPWL+SAKQLAQ EE+EKQ+EWNARTQITMWFDNTE +
Subjt:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK

XP_011658935.1 alpha-N-acetylglucosaminidase [Cucumis sativus]8.7e-29170.36Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MAS F + FLI V+ FAA STSRSSTIGV YIS +LEIQDRER PA+VQVAAARGVLRRLLPSHL SFDFQIVSK                   DKCGGE
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------
        SCF+IRNHRAFR+ G+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFSVPKAGLLPRIQ++E++ QRPIPLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------

Query:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                 L G    L   G+               + LD   G      +    +++  GG L +S                       VLPAFSGNI
Subjt:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFKQIYP+AKITRLGNWF+VHSDPRWCCTYLLDA DPLFVEIGKAFIEQQ KEYGRTSH+YNCDTFDENTPPVD+ EYISSLG+AIFGGMQAGDS+AV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS
        WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWI+SEQFYG PYIW              CMLHNFAGNVEMYG LDSIASGPIEARSS
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS

Query:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVF-YWSRYFSCNITEGS
        PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVK+WL QYS+RRYG LVPSIQDAWDVLYHT+YNCTDGA      ++  F          + EGS
Subjt:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVF-YWSRYFSCNITEGS

Query:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD
        +R+GNLDSS+  LQDATFDRPHLWYPTSEVI ALKLFIAGGDQ          L+ + R ALAKYSNELFFR+ KAYQLHD QT+ASLSQ FLELV D+D
Subjt:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD

Query:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK
        TLLACHEGFLLGPWL+SAKQLA+ EE+EKQ+EWNARTQITMWFDNTE +
Subjt:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK

XP_022135500.1 alpha-N-acetylglucosaminidase-like [Momordica charantia]6.2e-29772.63Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MAS FPAIFLI VS FAA STSR STIGV YIS +LEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSK                   DKCG E
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSC---------
        SCF+IRNHR+FRRPG+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFSVPKAGLLPRIQS+EII QRP+PLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSC---------

Query:  ----------------------KTLLPGGLGKMGKGNR-LDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                              + +      K    N  LD   G      +    +++  GG L +S                       VLPAFSGNI
Subjt:  ----------------------KTLLPGGLGKMGKGNR-LDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDA DPLFVEIGKAFIEQQLKEYGRTSH+YNCDTFDENTPPVD AEYISSLGAAIFGGMQAGDSDAV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS
        WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWI+SEQFYGTPYIW              CMLHNFAGNVEMYG LDSIASGPIEAR+S
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS

Query:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFS-CNITEGS
        PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVK+WL QYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAY     ++  F      S   + EGS
Subjt:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFS-CNITEGS

Query:  --DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD
          DRY N +SS+  L  ATFDRPHLWY TSEVIRALKLFIAG DQ          L+ + R ALAKYSNELFFR+ KAYQL+DAQ +ASLSQ+FLELVKD
Subjt:  --DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD

Query:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTE
        +DTLLACHEGFLLGPWLESAKQLAQDEEQEKQ+EWNARTQITMWFDNTE
Subjt:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTE

XP_038880130.1 alpha-N-acetylglucosaminidase-like [Benincasa hispida]3.2e-29371.37Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MAS F +IFLI VS FAA STSRSSTIGV YIS +LEIQDRERAPA+VQVAAARGVL RLLPSHLSSFDFQIVSK                   DKCGGE
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------
        SCF+IRNHRAFR+PG+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFSVPKAGLLPRIQ+DEI+ QRP+PLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------

Query:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                 L G    L   G+               + LD   G      +    +++  GG L +S                       VLPAFSGNI
Subjt:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFK IYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQ KEYGRTSHIYNCDTFDENTPPVDE EYISSLGAAIFGGMQAGDS+AV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS
        WLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLDLYAEVKP+WI+SEQFYGTPYIW              CMLHNFAGNVEMYG LDSIASGPIEARSS
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS

Query:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNI-TEGS
        PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVK+WL+QYSIRRYG LVPSIQDAWDVLYHTIYNCTDGA      ++  F      S  +  EGS
Subjt:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNI-TEGS

Query:  DRYGNLDSSIAVLQ--DATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD
        +R+GNLDS +  L+  DA FDRPHLWYPTSEV RALKLFIAGGDQ          L+ + R ALAKYSNELFFR+ KAYQL+DAQT+A+LSQ FLELV D
Subjt:  DRYGNLDSSIAVLQ--DATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD

Query:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK
        +DTLLACHEGFLLGPWL+SAKQLAQ EE+EKQ+EWNARTQITMWFDNTE +
Subjt:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK

TrEMBL top hitse value%identityAlignment
A0A1S3BVG2 alpha-N-acetylglucosaminidase-like1.1e-29170.89Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MAS F + FLI V+ FAA STSRSSTIGV YIS +LEIQDRER PA+VQVAAARGVLRRLLPSHLSSFDFQIVSK                   DKCGGE
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------
        SCF+IRNHRAFR+PG+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFSVPKAGLLPRIQ+DE++ +RPIPLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------

Query:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                 L G    L   G+               + LD   G      +    +++  GG L +S                       VLPAFSGNI
Subjt:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFKQIYPSAKITRLGNWF+VHSDPRWCCTYLLDA DPLFVEIGKAFIEQQ KEYGRTSH+YNCDTFDENTPPVDE EYISSLG+AIFGGMQAGDS+AV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS
        WLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLDLYAEVKPIWI+SEQFYGTPYIW              CMLHNFAGNVEMYG LDSIASGPIEARSS
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS

Query:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNI-TEGS
        PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVK+WL QYS+RRYG LVPSIQDAWDVLYHTIYNCTDGA      ++  F      S  +  EGS
Subjt:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNI-TEGS

Query:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD
        D++G LDSS+  LQDATFDRPHLWYPTS+VI ALKLFI GGDQ          L+ + R ALAKYSNELFFR  KAYQL+DAQT+ASLSQ FLELV D+D
Subjt:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD

Query:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK
        TLLACHEGFLLGPWL+SAKQLAQ EE+EKQ+EWNARTQITMWFDNTE +
Subjt:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK

A0A5D3BH46 Alpha-N-acetylglucosaminidase-like9.7e-27267.42Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MAS F + FLI+V+ FAA STSRSSTIGV YIS +LEIQDRERAPA+VQVAAARGVLRRLLPSHLSSFDFQI                      DKCGGE
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------
        SCF+IRNHRAFR+PG+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFSVPKAGLLPRIQ+DE++ +RPIPLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------

Query:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                 L G    L   G+               + LD   G      +    +++  GG L  S                       VLPAFSGNI
Subjt:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFKQIYPSAKITRLGNWF+VHSDPRWCCTYLLDA DPLFVEIGKAFIEQQ KEYG+TSH+YNCDTFDENTPPVDE EYISSLG+AIFGGMQAGDS+AV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS
        WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDL                                   CMLHNFAGNVEMYG LDSIASGPIEARSS
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS

Query:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNI-TEGS
         YSTMVGVGMSMEGIEQNPVVYDLMSEM FQ NKVDVK+WL QYS+RRYG LVPSIQDAWD+LYHTIYNCTDGA      ++  F      S  +  EGS
Subjt:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNI-TEGS

Query:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD
        D++G LDSS+  LQDATFDRPHLWYPTS+VI ALKLFI GGDQ          L+ + R ALAKYSNELFFR  KAYQL+DAQT+ASLSQ FLELV D+D
Subjt:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD

Query:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK
        TLLACHEGFLLGPWL+SAKQLAQ EE+EKQ+EWNARTQITMWFDNTE +
Subjt:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK

A0A6J1C176 alpha-N-acetylglucosaminidase-like3.0e-29772.63Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MAS FPAIFLI VS FAA STSR STIGV YIS +LEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSK                   DKCG E
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSC---------
        SCF+IRNHR+FRRPG+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFSVPKAGLLPRIQS+EII QRP+PLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSC---------

Query:  ----------------------KTLLPGGLGKMGKGNR-LDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                              + +      K    N  LD   G      +    +++  GG L +S                       VLPAFSGNI
Subjt:  ----------------------KTLLPGGLGKMGKGNR-LDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDA DPLFVEIGKAFIEQQLKEYGRTSH+YNCDTFDENTPPVD AEYISSLGAAIFGGMQAGDSDAV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS
        WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWI+SEQFYGTPYIW              CMLHNFAGNVEMYG LDSIASGPIEAR+S
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS

Query:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFS-CNITEGS
        PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVK+WL QYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAY     ++  F      S   + EGS
Subjt:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFS-CNITEGS

Query:  --DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD
          DRY N +SS+  L  ATFDRPHLWY TSEVIRALKLFIAG DQ          L+ + R ALAKYSNELFFR+ KAYQL+DAQ +ASLSQ+FLELVKD
Subjt:  --DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD

Query:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTE
        +DTLLACHEGFLLGPWLESAKQLAQDEEQEKQ+EWNARTQITMWFDNTE
Subjt:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTE

A0A6J1ECY3 alpha-N-acetylglucosaminidase-like2.5e-28369.56Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MA  F A+ LI +S F   STS SSTIG  YIS +L+IQDRERAP+ VQVAAARGVLRRLLPSHLSSFDFQI+SK                   D CGGE
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------
        SCF+IRNHRAFRRPG+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFSVPK G LP IQSDEII +RPIPLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------

Query:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                 L G    L   G+               + LD   G      +    +++  GG L +S                       VLPAFSGNI
Subjt:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDA DPLFVEIG+AFIEQQLKEYGRTSH+YNCDTFDENTPPVD+ EYISSLGAAIFGGMQAGDS AV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS
        WLMQGWMFSYDPFWRPQQMKALLHSV LGRLVVLDLYAEVKPIWI SEQFYG PYIW              CMLHNFAGNVEMYG LDSIASGPIEARSS
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS

Query:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCN-ITEGS
        PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVK+WL+QYSIRRYG LVPSIQDAWDVLYHTIYNCTDGAY     ++  F      S + I EGS
Subjt:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCN-ITEGS

Query:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD
        DR+         LQDA F+RPHLWYPTSEVIRALKLFIA GDQ          L+ + R ALAKYSNELFFR+ KAYQL D QT  SLSQ+FLELV D+D
Subjt:  DRYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMD

Query:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK
        TL+ACHEGFLLGPWL+SAKQLAQDE+QEKQ+EWNARTQITMWFDNTE +
Subjt:  TLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK

A0A6J1I5L2 alpha-N-acetylglucosaminidase-like6.7e-28168.98Show/hide
Query:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE
        MA  F A+FLI +S F   STS SSTIGV YIS +L+IQDRERAP+ VQVAAARGVLRRLLPSHLSSFDFQI+SK                   D CGGE
Subjt:  MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGE

Query:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------
        SCF+IRNHRAFRRPG+PEILIAGVTGVE+LAGLHWYLK WCGAHISWDKTGGSQLFS PK G LP I+SDEII +RPIPLNYYQNAVTSS          
Subjt:  SCFMIRNHRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCK--------

Query:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI
                 L G    L   G+               + LD   G      +    +++  GG L  S                       VLPAFSGNI
Subjt:  -------TLLPG---GLGKMGK--------------GNRLDGSSG------YQYASSIYWAGGYLAES---------------------ISVLPAFSGNI

Query:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV
        PAAFKQIYPSAKITRLGNWFSV SDPRWCCTYLLDA DPLFVEIG+AFIEQQLKEYGRTSH+YNCDTFDENTPPVD+ EYISSLGAAIFGGMQAGDS AV
Subjt:  PAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAV

Query:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS
        WLMQGWMFSYDPFWRPQQMKALLHSV LGRLVVLDLYAEVKPIWI SEQFYG PYIW              CMLHNFAGNVEMYG LDSIASGPIEARSS
Subjt:  WLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSS

Query:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNITEGSD
        PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVK+WL+QYSIRRYG  VPSIQDAWDVLYHTIYNCTDGAY     ++  F        +  EGSD
Subjt:  PYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNITEGSD

Query:  RYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMDT
        R+         LQDA F+RPHLWYPTSEVIRALKLFIA GDQ          L+ + R ALAKYSNELFFR+ KAYQL D  T  SLSQ+FLELV D+DT
Subjt:  RYGNLDSSIAVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQ----------LLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMDT

Query:  LLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK
        L+ACHEGFLLGPWL+SAKQLAQDE+QEKQ+EWNARTQITMWFDNTE +
Subjt:  LLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERK

SwissProt top hitse value%identityAlignment
P54802 Alpha-N-acetylglucosaminidase2.6e-8032.07Show/hide
Query:  GEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSC-----------------------
        G   + + G TGV   AGLH YL+ +CG H++W    GSQL  +P+   LP +   E+    P    YYQN  T S                        
Subjt:  GEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSC-----------------------

Query:  -------------KTLLPGGL-----------------GKMGKGNRLDGS-SGYQYASSIYWAGGYLAESIS-----VLPAFSGNIPAAFKQIYPSAKIT
                     +  L  GL                 G+MG  +  DG      +   +Y     L +  S     VLPAF+G++P A  +++P   +T
Subjt:  -------------KTLLPGGL-----------------GKMGKGNRLDGS-SGYQYASSIYWAGGYLAESIS-----VLPAFSGNIPAAFKQIYPSAKIT

Query:  RLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAVWLMQGWMFSYDP-F
        ++G+W   H +  + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP  E  Y+++   A++  M A D++AVWL+QGW+F + P F
Subjt:  RLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAVWLMQGWMFSYDP-F

Query:  WRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSSPYSTMVGVGMSME
        W P Q++A+L +VP GRL+VLDL+AE +P++  +  F G P+IW              CMLHNF GN  ++G L+++  GP  AR  P STMVG GM+ E
Subjt:  WRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSSPYSTMVGVGMSME

Query:  GIEQNPVVYDLMSEMAFQHNKV-DVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNITEGSDRYGNLDSSIAVL
        GI QN VVY LM+E+ ++ + V D+  W+  ++ RRYG   P    AW +L  ++YNC+  A                       G +R      S  V 
Subjt:  GIEQNPVVYDLMSEMAFQHNKV-DVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNITEGSDRYGNLDSSIAVL

Query:  QDATFDRPHLWYPTSEVIRALKLFIAGGD----------QLLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQR----FLELVKDMDTLLACHEGF
        + +      +WY  S+V  A +L +               LL + R A+ +  + L++  A++  L  ++ +ASL +       EL+  +D +LA    F
Subjt:  QDATFDRPHLWYPTSEVIRALKLFIAGGD----------QLLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQR----FLELVKDMDTLLACHEGF

Query:  LLGPWLESAKQLAQDEEQEKQFEWNARTQITMW
        LLG WLE A+  A  E +   +E N+R Q+T+W
Subjt:  LLGPWLESAKQLAQDEEQEKQFEWNARTQITMW

Q9FNA3 Alpha-N-acetylglucosaminidase4.9e-18046.41Show/hide
Query:  IFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGESCFMIRN
        + ++L+ +F + + S+        I G+L+  D     + VQ +AA+G+L+RLLP+H  SF+ +I+SK                   D CGG SCF+I N
Subjt:  IFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGESCFMIRN

Query:  HRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCKTLLPG----------
        +    R G PEILI G TGVE+ +GLHWYLK  C AH+SWDKTGG Q+ SVP+ G LPRI S  I  +RP+P NYYQN VTSS   +  G          
Subjt:  HRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCKTLLPG----------

Query:  --------------------------GLGKMGKGNRLDGSSGYQYA--SSIYWAGGYLAES---------------------ISVLPAFSGNIPAAFKQI
                                   + K    +   G +   +A   +++  GG L+++                       VLP+FSGN+P+A ++I
Subjt:  --------------------------GLGKMGKGNRLDGSSGYQYA--SSIYWAGGYLAES---------------------ISVLPAFSGNIPAAFKQI

Query:  YPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAVWLMQGWM
        YP A ITRL NW +V  D RWCCTYLL+ +DPLF+EIG+AFI+QQ +EYG  ++IYNCDTF+ENTPP  E EYISSLGAA++  M  G+ +AVWLMQGW+
Subjt:  YPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAVWLMQGWM

Query:  FSYD-PFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSSPYSTMV
        FS D  FW+P Q+KALLHSVP G+++VLDLYAEVKPIW  S QFYGTPYIW              CMLHNF GN+EMYG LDSI+SGP++AR S  STMV
Subjt:  FSYD-PFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSSPYSTMV

Query:  GVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSY-CFRIVSVFYWSRYFSC-NITEGSDRY--
        GVGM MEGIEQNPVVY+L SEMAF+  KVDV++WL  Y+ RRY +    I+ AW++LYHT+YNCTDG   +    IV +  W    S  +  +  D Y  
Subjt:  GVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSY-CFRIVSVFYWSRYFSC-NITEGSDRY--

Query:  --GNLDSSIAVL-QDATFDRP--HLWYPTSEVIRALKLFIAGGDQL----------LVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD
          G  ++   VL QD T D P  HLWY T EVI+ALKLF+  GD L          + + R  L+K +N+++     A+   D  ++  LS++FLEL+KD
Subjt:  --GNLDSSIAVL-QDATFDRP--HLWYPTSEVIRALKLFIAGGDQL----------LVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD

Query:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERKQ
        MD LLA  +  LLG WLESAK+LA++ ++ KQ+EWNARTQ+TMW+D+ +  Q
Subjt:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERKQ

Arabidopsis top hitse value%identityAlignment
AT1G19020.1 unknown protein1.1e-1450.59Show/hide
Query:  NGSYDWADQWDYSDSK-------EIATDNNKKSGGGNSSAKYKQKVGEGLGKTKAVASNGVKKVKEGTSLGLQWIKDKYHKTTHK
        NG+  WADQWD S            A   +  SG  +++ KYK+K+G+GL KTKAVAS+G KK+K G+++G +W+KDKYHKTTHK
Subjt:  NGSYDWADQWDYSDSK-------EIATDNNKKSGGGNSSAKYKQKVGEGLGKTKAVASNGVKKVKEGTSLGLQWIKDKYHKTTHK

AT3G48180.1 unknown protein1.2e-1660.81Show/hide
Query:  WADQWDYSDSKEIATDNNKKSGGGNSSAKYKQKVGEGLGKTKAVASNGVKKVKEGTSLGLQWIKDKYHKTTHKN
        WA+QWD +      T +  +  GG +S+KYK+KVG GLGKTKA AS+G+KKVK GTSLGL W+KDKY+KTT KN
Subjt:  WADQWDYSDSKEIATDNNKKSGGGNSSAKYKQKVGEGLGKTKAVASNGVKKVKEGTSLGLQWIKDKYHKTTHKN

AT5G13690.1 alpha-N-acetylglucosaminidase family / NAGLU family3.5e-18146.41Show/hide
Query:  IFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGESCFMIRN
        + ++L+ +F + + S+        I G+L+  D     + VQ +AA+G+L+RLLP+H  SF+ +I+SK                   D CGG SCF+I N
Subjt:  IFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGESCFMIRN

Query:  HRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCKTLLPG----------
        +    R G PEILI G TGVE+ +GLHWYLK  C AH+SWDKTGG Q+ SVP+ G LPRI S  I  +RP+P NYYQN VTSS   +  G          
Subjt:  HRAFRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCKTLLPG----------

Query:  --------------------------GLGKMGKGNRLDGSSGYQYA--SSIYWAGGYLAES---------------------ISVLPAFSGNIPAAFKQI
                                   + K    +   G +   +A   +++  GG L+++                       VLP+FSGN+P+A ++I
Subjt:  --------------------------GLGKMGKGNRLDGSSGYQYA--SSIYWAGGYLAES---------------------ISVLPAFSGNIPAAFKQI

Query:  YPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAVWLMQGWM
        YP A ITRL NW +V  D RWCCTYLL+ +DPLF+EIG+AFI+QQ +EYG  ++IYNCDTF+ENTPP  E EYISSLGAA++  M  G+ +AVWLMQGW+
Subjt:  YPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIFGGMQAGDSDAVWLMQGWM

Query:  FSYD-PFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSSPYSTMV
        FS D  FW+P Q+KALLHSVP G+++VLDLYAEVKPIW  S QFYGTPYIW              CMLHNF GN+EMYG LDSI+SGP++AR S  STMV
Subjt:  FSYD-PFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARSSPYSTMV

Query:  GVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSY-CFRIVSVFYWSRYFSC-NITEGSDRY--
        GVGM MEGIEQNPVVY+L SEMAF+  KVDV++WL  Y+ RRY +    I+ AW++LYHT+YNCTDG   +    IV +  W    S  +  +  D Y  
Subjt:  GVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSY-CFRIVSVFYWSRYFSC-NITEGSDRY--

Query:  --GNLDSSIAVL-QDATFDRP--HLWYPTSEVIRALKLFIAGGDQL----------LVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD
          G  ++   VL QD T D P  HLWY T EVI+ALKLF+  GD L          + + R  L+K +N+++     A+   D  ++  LS++FLEL+KD
Subjt:  --GNLDSSIAVL-QDATFDRP--HLWYPTSEVIRALKLFIAGGDQL----------LVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKD

Query:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERKQ
        MD LLA  +  LLG WLESAK+LA++ ++ KQ+EWNARTQ+TMW+D+ +  Q
Subjt:  MDTLLACHEGFLLGPWLESAKQLAQDEEQEKQFEWNARTQITMWFDNTERKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCACTTTCCCTGCCATTTTTCTAATCCTCGTTTCCACATTCGCTGCCTTGTCCACTTCTCGTTCCTCAACGATCGGAGTCGCCTACATTTCGGGGATTCTTGA
AATTCAGGATCGCGAGAGGGCGCCTGCGCATGTACAAGTTGCGGCTGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCATCTCTCCAGCTTCGACTTTCAAATTGTCT
CTAAGGTACTTGATTTTACAAGTTTTTATGCATTGTTTGAAAAGATAGAGGACGGCGCTGCGGACAAATGTGGAGGGGAATCTTGCTTTATGATCAGGAACCATCGCGCG
TTCAGGAGACCAGGGGAACCTGAAATTTTAATCGCTGGGGTCACTGGAGTGGAGCTTTTAGCGGGCTTGCACTGGTATCTAAAGCAATGGTGCGGTGCACACATATCTTG
GGATAAAACAGGTGGCTCACAACTATTTTCTGTACCCAAGGCAGGCTTGTTACCTCGTATTCAAAGTGACGAAATTATTTTTCAGAGACCTATTCCCTTGAACTATTATC
AAAATGCAGTTACATCAAGCTGTAAGACTCTTTTGCCTGGTGGACTGGGAAAGATGGGAAAGGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATT
TACTGGGCAGGAGGCTATCTGGCGGAAAGTATTTCAGTTCTGCCAGCCTTTTCGGGAAACATTCCGGCTGCTTTCAAACAAATATATCCATCAGCAAAGATAACACGCTT
AGGAAATTGGTTTTCTGTTCACAGTGACCCTAGATGGTGCTGCACTTACCTACTTGATGCCACGGACCCCTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAAC
TGAAAGAATATGGAAGAACTTCCCATATATACAATTGTGATACCTTTGACGAGAACACCCCACCTGTTGATGAGGCAGAATACATCTCTTCATTAGGTGCAGCTATTTTT
GGAGGAATGCAAGCTGGTGATTCCGATGCTGTCTGGCTAATGCAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAGCAAATGAAGGCCCTTTTACATTCTGT
GCCGCTGGGAAGGCTGGTAGTCCTTGATCTGTATGCTGAAGTGAAACCGATCTGGATAACTTCTGAGCAATTTTATGGCACCCCTTACATCTGGAAAGTCTCTATTCCAT
TCTCTTGCTTGATCTTGATGTTCAGGTGCATGCTGCATAACTTTGCTGGAAATGTTGAGATGTATGGCACTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGT
AGTCCATACTCAACAATGGTTGGGGTAGGAATGTCCATGGAAGGAATAGAACAGAATCCTGTTGTCTATGATCTCATGTCTGAAATGGCTTTTCAACACAACAAAGTTGA
TGTCAAGCAATGGCTTCATCAATATTCAATAAGACGCTATGGTCAATTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAATTGCACCGATG
GTGCCTATTCTTATTGCTTTAGAATAGTCAGTGTCTTCTATTGGTCTCGTTACTTTTCTTGCAATATTACCGAGGGGTCCGACCGTTATGGGAATTTGGACTCAAGCATA
GCTGTCCTCCAGGATGCAACGTTTGACCGACCTCATCTGTGGTATCCTACTTCCGAAGTAATTCGTGCATTAAAGCTTTTCATTGCTGGCGGCGATCAACTTCTGGTAGT
AGCACGTACAGCTCTAGCCAAATACTCGAATGAACTGTTCTTTAGAGTTGCCAAAGCGTATCAGTTACATGATGCCCAAACAGTGGCCAGCTTAAGCCAGCGGTTTCTTG
AACTTGTCAAAGATATGGATACATTATTGGCTTGTCATGAGGGATTTCTTTTGGGACCTTGGTTAGAAAGCGCCAAGCAACTTGCCCAAGATGAAGAGCAGGAAAAACAG
TTTGAGTGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGAGGAAGCAAGTTTGCTTCGTGATTATGGAAACAAGTACTGGAGTGGACTCTTGGGCGAT
TACTATGGTCCTCGAGCTGCAATATACTTCAAGAAACGGCTCGTACGATTGGGCGGACCAGTGGGATTACAGCGACTCGAAGGAGATAGCCACGGATAACAACAAGAAGA
GCGGCGGCGGAAACAGTTCGGCCAAGTACAAGCAGAAGGTCGGAGAAGGGCTTGGGAAGACCAAAGCTGTGGCCTCCAATGGCGTCAAAAAGGTCAAAGAAGGAACCTCT
CTTGGCCTCCAATGGATCAAAGATAAATACCATAAAACCACTCATAAGAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCACTTTCCCTGCCATTTTTCTAATCCTCGTTTCCACATTCGCTGCCTTGTCCACTTCTCGTTCCTCAACGATCGGAGTCGCCTACATTTCGGGGATTCTTGA
AATTCAGGATCGCGAGAGGGCGCCTGCGCATGTACAAGTTGCGGCTGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCATCTCTCCAGCTTCGACTTTCAAATTGTCT
CTAAGGTACTTGATTTTACAAGTTTTTATGCATTGTTTGAAAAGATAGAGGACGGCGCTGCGGACAAATGTGGAGGGGAATCTTGCTTTATGATCAGGAACCATCGCGCG
TTCAGGAGACCAGGGGAACCTGAAATTTTAATCGCTGGGGTCACTGGAGTGGAGCTTTTAGCGGGCTTGCACTGGTATCTAAAGCAATGGTGCGGTGCACACATATCTTG
GGATAAAACAGGTGGCTCACAACTATTTTCTGTACCCAAGGCAGGCTTGTTACCTCGTATTCAAAGTGACGAAATTATTTTTCAGAGACCTATTCCCTTGAACTATTATC
AAAATGCAGTTACATCAAGCTGTAAGACTCTTTTGCCTGGTGGACTGGGAAAGATGGGAAAGGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATT
TACTGGGCAGGAGGCTATCTGGCGGAAAGTATTTCAGTTCTGCCAGCCTTTTCGGGAAACATTCCGGCTGCTTTCAAACAAATATATCCATCAGCAAAGATAACACGCTT
AGGAAATTGGTTTTCTGTTCACAGTGACCCTAGATGGTGCTGCACTTACCTACTTGATGCCACGGACCCCTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAAC
TGAAAGAATATGGAAGAACTTCCCATATATACAATTGTGATACCTTTGACGAGAACACCCCACCTGTTGATGAGGCAGAATACATCTCTTCATTAGGTGCAGCTATTTTT
GGAGGAATGCAAGCTGGTGATTCCGATGCTGTCTGGCTAATGCAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAGCAAATGAAGGCCCTTTTACATTCTGT
GCCGCTGGGAAGGCTGGTAGTCCTTGATCTGTATGCTGAAGTGAAACCGATCTGGATAACTTCTGAGCAATTTTATGGCACCCCTTACATCTGGAAAGTCTCTATTCCAT
TCTCTTGCTTGATCTTGATGTTCAGGTGCATGCTGCATAACTTTGCTGGAAATGTTGAGATGTATGGCACTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGT
AGTCCATACTCAACAATGGTTGGGGTAGGAATGTCCATGGAAGGAATAGAACAGAATCCTGTTGTCTATGATCTCATGTCTGAAATGGCTTTTCAACACAACAAAGTTGA
TGTCAAGCAATGGCTTCATCAATATTCAATAAGACGCTATGGTCAATTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAATTGCACCGATG
GTGCCTATTCTTATTGCTTTAGAATAGTCAGTGTCTTCTATTGGTCTCGTTACTTTTCTTGCAATATTACCGAGGGGTCCGACCGTTATGGGAATTTGGACTCAAGCATA
GCTGTCCTCCAGGATGCAACGTTTGACCGACCTCATCTGTGGTATCCTACTTCCGAAGTAATTCGTGCATTAAAGCTTTTCATTGCTGGCGGCGATCAACTTCTGGTAGT
AGCACGTACAGCTCTAGCCAAATACTCGAATGAACTGTTCTTTAGAGTTGCCAAAGCGTATCAGTTACATGATGCCCAAACAGTGGCCAGCTTAAGCCAGCGGTTTCTTG
AACTTGTCAAAGATATGGATACATTATTGGCTTGTCATGAGGGATTTCTTTTGGGACCTTGGTTAGAAAGCGCCAAGCAACTTGCCCAAGATGAAGAGCAGGAAAAACAG
TTTGAGTGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGAGGAAGCAAGTTTGCTTCGTGATTATGGAAACAAGTACTGGAGTGGACTCTTGGGCGAT
TACTATGGTCCTCGAGCTGCAATATACTTCAAGAAACGGCTCGTACGATTGGGCGGACCAGTGGGATTACAGCGACTCGAAGGAGATAGCCACGGATAACAACAAGAAGA
GCGGCGGCGGAAACAGTTCGGCCAAGTACAAGCAGAAGGTCGGAGAAGGGCTTGGGAAGACCAAAGCTGTGGCCTCCAATGGCGTCAAAAAGGTCAAAGAAGGAACCTCT
CTTGGCCTCCAATGGATCAAAGATAAATACCATAAAACCACTCATAAGAATTAA
Protein sequenceShow/hide protein sequence
MASTFPAIFLILVSTFAALSTSRSSTIGVAYISGILEIQDRERAPAHVQVAAARGVLRRLLPSHLSSFDFQIVSKVLDFTSFYALFEKIEDGAADKCGGESCFMIRNHRA
FRRPGEPEILIAGVTGVELLAGLHWYLKQWCGAHISWDKTGGSQLFSVPKAGLLPRIQSDEIIFQRPIPLNYYQNAVTSSCKTLLPGGLGKMGKGNRLDGSSGYQYASSI
YWAGGYLAESISVLPAFSGNIPAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQLKEYGRTSHIYNCDTFDENTPPVDEAEYISSLGAAIF
GGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWITSEQFYGTPYIWKVSIPFSCLILMFRCMLHNFAGNVEMYGTLDSIASGPIEARS
SPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKQWLHQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYSYCFRIVSVFYWSRYFSCNITEGSDRYGNLDSSI
AVLQDATFDRPHLWYPTSEVIRALKLFIAGGDQLLVVARTALAKYSNELFFRVAKAYQLHDAQTVASLSQRFLELVKDMDTLLACHEGFLLGPWLESAKQLAQDEEQEKQ
FEWNARTQITMWFDNTERKQVCFVIMETSTGVDSWAITMVLELQYTSRNGSYDWADQWDYSDSKEIATDNNKKSGGGNSSAKYKQKVGEGLGKTKAVASNGVKKVKEGTS
LGLQWIKDKYHKTTHKN