; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002831 (gene) of Snake gourd v1 genome

Gene IDTan0002831
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionalpha-N-acetylglucosaminidase
Genome locationLG06:4739656..4758890
RNA-Seq ExpressionTan0002831
SyntenyTan0002831
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR007781 - Alpha-N-acetylglucosaminidase
IPR017853 - Glycoside hydrolase superfamily
IPR024240 - Alpha-N-acetylglucosaminidase, N-terminal
IPR024732 - Alpha-N-acetylglucosaminidase, C-terminal
IPR024733 - Alpha-N-acetylglucosaminidase, tim-barrel domain
IPR029018 - Beta-hexosaminidase-like, domain 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461320.1 PREDICTED: alpha-N-acetylglucosaminidase [Cucumis melo]0.0e+0092.9Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MSNF+SSILVLILI+ PL LS+QE I+AIIHRLDSKTLSPSIQEAAAK LLRRLLPTHVDSF+FQIV +DVC G SCFLISNFKSSSR NGAEILI+GTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLP LKGDGVV+KRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQESI
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        WR+VFRDFNL  KDLDNFFGGPAFLAWARMGNLHGWGGPL++NWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPA L EIFPSA+ITRLGNWNSIDAD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFV+IGEAFIRQQIKEYGDVTDIY+CDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDS FWKPDQMKALLH
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFADVKPIW+TSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYEL+SEMAFRSKKV+VQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSK-PHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTR
        EWLKTYSRCRYGKADHYV+AAW ILYHTIYNCTDGIA+HN DFIVKLPDWDPSSS  L K PHLWYSTQEVINALQLL+N DDNLV+SA YRYDLVDLTR
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSK-PHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTR

Query:  QVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANK
        QVLGKLANEEYLKAVTAFRR+NVK  NLHSKRF+QLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANK
Subjt:  QVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANK

Query:  YWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        YWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWI FSNKWQAASE YPV+A+GNA+AIS+ALY+KYFG
Subjt:  YWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

XP_022150276.1 alpha-N-acetylglucosaminidase [Momordica charantia]0.0e+0092.51Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MSNFN S+LVLIL+VFPL LSE E IKAIIHRLDSK LSPSIQEAAA G+LRRLLPTHV SF+FQIV +DVCGG SCFLISNFKSS R NGAEILIKGTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAH+SWDKTGGVQ+ASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQESI
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        W+SVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGG L+Q+WLDQQL LQKQILSRMRELGMTPVLPSFSGNVPAALAE FPSADITRLGNWNSIDAD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFVKIGEAFIR+QIKEY DVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFA+VKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYE+MSEMAFRSKKVEVQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ
        EWLKTYSRCRYGKADHYV+AAWKILYHTIYNCTDGIADHN DFIVKLPDWDP SS  + KPHLWYSTQ+VINALQLLLNA+++L+NS+ YRYDLVDL RQ
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ

Query:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY
        VLGKLANEEYL AV AF+RK+VK LN+HSKRF+QLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTK NQSKLHDYANKY
Subjt:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY

Query:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWI FSNKWQAASE YPV+AEGN++AISRALY+KYFG
Subjt:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

XP_022960481.1 alpha-N-acetylglucosaminidase [Cucurbita moschata]0.0e+0093.02Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MS FNS ILVLIL VFPL  SEQE IKAIIHRLDSKT SPSIQEAAAKGLLRRLLPTHVDSFKFQIV +DVCGG SCFLISNFK SS +NGAEILI+GTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLL+G+GVVVKRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQESI
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPL+QNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALA+ FPSADITRLGNWNSI+AD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKP+QMKALLH
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFADVKPIW+TSSQFYGTPYVWCMLHNFGGNIEMYG+LDAISSGPVDALASENSTMVGVGMCMEGIEHN VVYELMSEMAFRSKKVEVQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ
        +WLKTYSRCRYGKAD YVEAAW ILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSF    PHLWYSTQEVINALQLLL A DNL NSA YRYDLVDLTRQ
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ

Query:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY
        VLGKLANEEYLKA+++F+RKNV  LN HSKRFVQLIRDIDRLLAS+SNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNT+VNQSKLHDYANKY
Subjt:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY

Query:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        WSGL+EGYYLPRALTYFYY+SKSLRKNESFHLE+WRREWI FSNKWQAASE YPV+AEGN IAISRA Y+KYFG
Subjt:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

XP_038897833.1 alpha-N-acetylglucosaminidase isoform X1 [Benincasa hispida]0.0e+0092.1Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MSNF S ILVLIL+V PL LSEQE I+AIIHRLDSKTL PSIQEAAA+ LLRRLLPTHVDSF+FQIV +DVCGG SCFLISNFKSSSR NGAEI IKGTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQESI
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        WR+VFRDFNLTVK+LDNFFGGPAFLAWARMGNLHGWGGPL++NWLDQQL+LQKQILSRM+ELGMTPVLPSFSGNVPA LAEIFPSADITRLGNWNSIDAD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFV+IGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFADV+PIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKV VQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ
        EWLKTYSRCRYGKADHYV+AAW ILYHTIYNCTDGIADHN DFIVKLPDWDPSSS  L KPHLWYSTQEV NALQLLLNADDNL++ A YRYDLVDLTRQ
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ

Query:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMK-----------QYEWNARTQVTMWYDNTKVN
        VLGKLANEEYLKAVTAF+RKNVK  NLHSKRF+QLIRDIDRLLASNSNFLLGTWLESAKKLATN SEMK           QYEWNARTQVTMWYDNT++N
Subjt:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMK-----------QYEWNARTQVTMWYDNTKVN

Query:  QSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        QSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWI FSNKWQAASE YPV+AEGNA+AIS+ALY+KYFG
Subjt:  QSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

XP_038897835.1 alpha-N-acetylglucosaminidase isoform X2 [Benincasa hispida]0.0e+0093.41Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MSNF S ILVLIL+V PL LSEQE I+AIIHRLDSKTL PSIQEAAA+ LLRRLLPTHVDSF+FQIV +DVCGG SCFLISNFKSSSR NGAEI IKGTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQESI
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        WR+VFRDFNLTVK+LDNFFGGPAFLAWARMGNLHGWGGPL++NWLDQQL+LQKQILSRM+ELGMTPVLPSFSGNVPA LAEIFPSADITRLGNWNSIDAD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFV+IGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFADV+PIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKV VQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ
        EWLKTYSRCRYGKADHYV+AAW ILYHTIYNCTDGIADHN DFIVKLPDWDPSSS  L KPHLWYSTQEV NALQLLLNADDNL++ A YRYDLVDLTRQ
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ

Query:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY
        VLGKLANEEYLKAVTAF+RKNVK  NLHSKRF+QLIRDIDRLLASNSNFLLGTWLESAKKLATN SEMKQYEWNARTQVTMWYDNT++NQSKLHDYANKY
Subjt:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY

Query:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWI FSNKWQAASE YPV+AEGNA+AIS+ALY+KYFG
Subjt:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

TrEMBL top hitse value%identityAlignment
A0A1S3CEF3 alpha-N-acetylglucosaminidase0.0e+0092.9Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MSNF+SSILVLILI+ PL LS+QE I+AIIHRLDSKTLSPSIQEAAAK LLRRLLPTHVDSF+FQIV +DVC G SCFLISNFKSSSR NGAEILI+GTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLP LKGDGVV+KRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQESI
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        WR+VFRDFNL  KDLDNFFGGPAFLAWARMGNLHGWGGPL++NWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPA L EIFPSA+ITRLGNWNSIDAD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFV+IGEAFIRQQIKEYGDVTDIY+CDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDS FWKPDQMKALLH
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFADVKPIW+TSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYEL+SEMAFRSKKV+VQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSK-PHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTR
        EWLKTYSRCRYGKADHYV+AAW ILYHTIYNCTDGIA+HN DFIVKLPDWDPSSS  L K PHLWYSTQEVINALQLL+N DDNLV+SA YRYDLVDLTR
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSK-PHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTR

Query:  QVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANK
        QVLGKLANEEYLKAVTAFRR+NVK  NLHSKRF+QLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANK
Subjt:  QVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANK

Query:  YWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        YWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWI FSNKWQAASE YPV+A+GNA+AIS+ALY+KYFG
Subjt:  YWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

A0A5D3CGM4 Alpha-N-acetylglucosaminidase0.0e+0092.52Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MSNF+ SILVLILI+ PL LS+QE I+AIIHRLDSKTLSPSIQEAAAK LLRRLLPTHVDSF+FQIV +DVC G SCFLISNFKSSSR NGAEIL  GTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLP +KGDGVV+KRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQESI
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        WR+VFRDFNL  KDLDNFFGGPAFLAWARMGNLHGWGGPL++NWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPA L EIFPSA+ITRLGNWNSIDAD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFV+IGEAFIRQQIKEYGDVTDIY+CDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDS FWKPDQMKALL 
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFADVKPIW+TSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKV+VQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSK-PHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTR
        EWLKTYSRCRYGKADHYV+AAW ILYHTIYNCTDGIA+HN DFIVKLPDWDPSSS  L K PHLWYSTQEVINALQLL+N DDNLV+SA YRYDLVDLTR
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSK-PHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTR

Query:  QVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANK
        QVLGKLANEEYLKAVTAFRR+NVK  NLHSKRF+QLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANK
Subjt:  QVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANK

Query:  YWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        YWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWI FSNKWQAASE YPV+A+GNA+AIS+ALY+KYFG
Subjt:  YWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

A0A6J1D9J6 alpha-N-acetylglucosaminidase0.0e+0092.51Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MSNFN S+LVLIL+VFPL LSE E IKAIIHRLDSK LSPSIQEAAA G+LRRLLPTHV SF+FQIV +DVCGG SCFLISNFKSS R NGAEILIKGTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAH+SWDKTGGVQ+ASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQESI
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        W+SVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGG L+Q+WLDQQL LQKQILSRMRELGMTPVLPSFSGNVPAALAE FPSADITRLGNWNSIDAD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFVKIGEAFIR+QIKEY DVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFA+VKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYE+MSEMAFRSKKVEVQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ
        EWLKTYSRCRYGKADHYV+AAWKILYHTIYNCTDGIADHN DFIVKLPDWDP SS  + KPHLWYSTQ+VINALQLLLNA+++L+NS+ YRYDLVDL RQ
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ

Query:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY
        VLGKLANEEYL AV AF+RK+VK LN+HSKRF+QLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTK NQSKLHDYANKY
Subjt:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY

Query:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWI FSNKWQAASE YPV+AEGN++AISRALY+KYFG
Subjt:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

A0A6J1H7Q2 alpha-N-acetylglucosaminidase0.0e+0093.02Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MS FNS ILVLIL VFPL  SEQE IKAIIHRLDSKT SPSIQEAAAKGLLRRLLPTHVDSFKFQIV +DVCGG SCFLISNFK SS +NGAEILI+GTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLL+G+GVVVKRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQESI
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPL+QNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALA+ FPSADITRLGNWNSI+AD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKP+QMKALLH
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFADVKPIW+TSSQFYGTPYVWCMLHNFGGNIEMYG+LDAISSGPVDALASENSTMVGVGMCMEGIEHN VVYELMSEMAFRSKKVEVQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ
        +WLKTYSRCRYGKAD YVEAAW ILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSF    PHLWYSTQEVINALQLLL A DNL NSA YRYDLVDLTRQ
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ

Query:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY
        VLGKLANEEYLKA+++F+RKNV  LN HSKRFVQLIRDIDRLLAS+SNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNT+VNQSKLHDYANKY
Subjt:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY

Query:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        WSGL+EGYYLPRALTYFYY+SKSLRKNESFHLE+WRREWI FSNKWQAASE YPV+AEGN IAISRA Y+KYFG
Subjt:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

A0A6J1KUM7 alpha-N-acetylglucosaminidase0.0e+0092.25Show/hide
Query:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT
        MS FNS ILVLIL VFPL LSEQE IKAIIHRLDSKT SPSIQEAAAKGLLRR LPTHVDSFKFQIV +DVCGG SCFLISNFK SS +N AEILI+GTT
Subjt:  MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTT

Query:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI
        AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLL+G+GVVVKRPVPWNYYQNVVTSSY+YVWWDWERWEKEIDWMALHGINLPLAFTGQES+
Subjt:  AVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESI

Query:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD
        WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPL++NWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAAL + FPSADITRLGNWNSI+AD
Subjt:  WRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDAD

Query:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH
        PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKP+QMKALLH
Subjt:  PSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLH

Query:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ
        SVPFGKMIVLDLFADVKPIW+TSSQFYGTPYVWCMLHNFGGNIEMYG+LDAISSGPVDALASENSTMVGVGMCMEGIEHN VVYELMSEMAFRSKKVEVQ
Subjt:  SVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQ

Query:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ
        +WL TYSRCRYGKAD +VEAAW ILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSF    PHLWY TQEVINALQLLL A DNL NS  YRYDLVDLTRQ
Subjt:  EWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQ

Query:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY
        VLGKLANEEYLKAV++FRRKNV  LN HSKRFVQLIRDIDRLLAS+SNFL+GTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNT+VNQSKLHDYANKY
Subjt:  VLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKY

Query:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG
        WSGLLEGYYLPRALTYFYY+SKSLRKNESFHLE+WRREWI FSNKWQAASE YPV+AEGN+IAISRA Y+KYFG
Subjt:  WSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYFG

SwissProt top hitse value%identityAlignment
P54802 Alpha-N-acetylglucosaminidase3.3e-16041.17Show/hide
Query:  AEILIKGTTAVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLP
        A + ++G+T V   +GL+ YL+ +CG HV+W    G QL  +P+P  LP + G+ +    P  + YYQNV T SY++VWWDW RWE+EIDWMAL+GINL 
Subjt:  AEILIKGTTAVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLP

Query:  LAFTGQESIWRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRL
        LA++GQE+IW+ V+    LT  +++ FF GPAFLAW RMGNLH W GPL  +W  +QL LQ ++L +MR  GMTPVLP+F+G+VP A+  +FP  ++T++
Subjt:  LAFTGQESIWRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRL

Query:  GNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWK
        G+W   +   S  C++LL P DP+F  IG  F+R+ IKE+G    IY  DTFNE  PP+++ SY+++   +VY+AM   D +AVWL+QGWLF     FW 
Subjt:  GNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWK

Query:  PDQMKALLHSVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMA
        P Q++A+L +VP G+++VLDLFA+ +P++  ++ F G P++WCMLHNFGGN  ++G L+A++ GP  A    NSTMVG GM  EGI  N VVY LM+E+ 
Subjt:  PDQMKALLHSVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMA

Query:  FRSKKV-EVQEWLKTYSRCRYGKADHYVEAAWKILYHTIYNCT-DGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAP
        +R   V ++  W+ +++  RYG +     AAW++L  ++YNC+ +    HN   +V+ P    ++S       +WY+  +V  A +LLL +  +L  S  
Subjt:  FRSKKV-EVQEWLKTYSRCRYGKADHYVEAAWKILYHTIYNCT-DGIADHNNDFIVKLPDWDPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAP

Query:  YRYDLVDLTRQVLGKLANEEYLKAVTAFRRKNVKD-LNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKV
        +RYDL+DLTRQ + +L +  Y +A +A+  K +   L        +L+  +D +LAS+S FLLG+WLE A+  A + +E   YE N+R Q+T+W      
Subjt:  YRYDLVDLTRQVLGKLANEEYLKAVTAFRRKNVKD-LNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEMKQYEWNARTQVTMWYDNTKV

Query:  NQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYF
         +  + DYANK  +GL+  YY PR   +   L  S+ +   F    + +        +  + ++YP Q  G+ + +++ ++ KY+
Subjt:  NQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYKKYF

Q9FNA3 Alpha-N-acetylglucosaminidase0.0e+0067.49Show/hide
Query:  MSNFNSSILVLILIVF-PLVLSEQEP-IKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKG
        M +    +LVL++I F    +S+  P I  ++ RLDS   + S+QE+AAKGLL+RLLPTH  SF+ +I+ KD CGG SCF+I N+    R  G EILIKG
Subjt:  MSNFNSSILVLILIVF-PLVLSEQEP-IKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKG

Query:  TTAVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQE
        TT VEI SGL+WYLKY C AHVSWDKTGG+Q+AS+P+PG LP +    + ++RPVPWNYYQNVVTSSY+YVWW WERWE+EIDWMAL GINLPLAFTGQE
Subjt:  TTAVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQE

Query:  SIWRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSID
        +IW+ VF+ FN++ +DLD++FGGPAFLAWARMGNLH WGGPL++NWLD QL LQKQILSRM + GMTPVLPSFSGNVP+AL +I+P A+ITRL NWN++D
Subjt:  SIWRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSID

Query:  ADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKAL
         D   CCTYLLNPSDPLF++IGEAFI+QQ +EYG++T+IYNCDTFNENTPPT++  YISSLGA+VYKAM K +K+AVWLMQGWLF SDS FWKP Q+KAL
Subjt:  ADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKAL

Query:  LHSVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVE
        LHSVPFGKMIVLDL+A+VKPIW  S+QFYGTPY+WCMLHNFGGNIEMYG LD+ISSGPVDA  S+NSTMVGVGMCMEGIE NPVVYEL SEMAFR +KV+
Subjt:  LHSVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVE

Query:  VQEWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSF------------------------------GLSKPHLWYSTQ
        VQ+WLK+Y+R RY K +H +EAAW+ILYHT+YNCTDGIADHN DFIVKLPDWDPSSS                                L K HLWYST+
Subjt:  VQEWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSF------------------------------GLSKPHLWYSTQ

Query:  EVINALQLLLNADDNLVNSAPYRYDLVDLTRQVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEM
        EVI AL+L L A D+L  S  YRYD+VDLTRQVL KLAN+ Y +AVTAF +K++  L   S++F++LI+D+D LLAS+ N LLGTWLESAKKLA N  E 
Subjt:  EVINALQLLLNADDNLVNSAPYRYDLVDLTRQVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEM

Query:  KQYEWNARTQVTMWYDNTKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKW-QAASEKYPVQAEGNAIAISRA
        KQYEWNARTQVTMWYD+  VNQSKLHDYANK+WSGLLE YYLPRA  YF  + KSLR  + F +E WRREWI  S+KW Q++SE YPV+A+G+A+AISR 
Subjt:  KQYEWNARTQVTMWYDNTKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKW-QAASEKYPVQAEGNAIAISRA

Query:  LYKKYF
        L  KYF
Subjt:  LYKKYF

Arabidopsis top hitse value%identityAlignment
AT5G13690.1 alpha-N-acetylglucosaminidase family / NAGLU family0.0e+0067.49Show/hide
Query:  MSNFNSSILVLILIVF-PLVLSEQEP-IKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKG
        M +    +LVL++I F    +S+  P I  ++ RLDS   + S+QE+AAKGLL+RLLPTH  SF+ +I+ KD CGG SCF+I N+    R  G EILIKG
Subjt:  MSNFNSSILVLILIVF-PLVLSEQEP-IKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKG

Query:  TTAVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQE
        TT VEI SGL+WYLKY C AHVSWDKTGG+Q+AS+P+PG LP +    + ++RPVPWNYYQNVVTSSY+YVWW WERWE+EIDWMAL GINLPLAFTGQE
Subjt:  TTAVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQE

Query:  SIWRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSID
        +IW+ VF+ FN++ +DLD++FGGPAFLAWARMGNLH WGGPL++NWLD QL LQKQILSRM + GMTPVLPSFSGNVP+AL +I+P A+ITRL NWN++D
Subjt:  SIWRSVFRDFNLTVKDLDNFFGGPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSID

Query:  ADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKAL
         D   CCTYLLNPSDPLF++IGEAFI+QQ +EYG++T+IYNCDTFNENTPPT++  YISSLGA+VYKAM K +K+AVWLMQGWLF SDS FWKP Q+KAL
Subjt:  ADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKAL

Query:  LHSVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVE
        LHSVPFGKMIVLDL+A+VKPIW  S+QFYGTPY+WCMLHNFGGNIEMYG LD+ISSGPVDA  S+NSTMVGVGMCMEGIE NPVVYEL SEMAFR +KV+
Subjt:  LHSVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVE

Query:  VQEWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSF------------------------------GLSKPHLWYSTQ
        VQ+WLK+Y+R RY K +H +EAAW+ILYHT+YNCTDGIADHN DFIVKLPDWDPSSS                                L K HLWYST+
Subjt:  VQEWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDWDPSSSF------------------------------GLSKPHLWYSTQ

Query:  EVINALQLLLNADDNLVNSAPYRYDLVDLTRQVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEM
        EVI AL+L L A D+L  S  YRYD+VDLTRQVL KLAN+ Y +AVTAF +K++  L   S++F++LI+D+D LLAS+ N LLGTWLESAKKLA N  E 
Subjt:  EVINALQLLLNADDNLVNSAPYRYDLVDLTRQVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKKLATNPSEM

Query:  KQYEWNARTQVTMWYDNTKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKW-QAASEKYPVQAEGNAIAISRA
        KQYEWNARTQVTMWYD+  VNQSKLHDYANK+WSGLLE YYLPRA  YF  + KSLR  + F +E WRREWI  S+KW Q++SE YPV+A+G+A+AISR 
Subjt:  KQYEWNARTQVTMWYDNTKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKW-QAASEKYPVQAEGNAIAISRA

Query:  LYKKYF
        L  KYF
Subjt:  LYKKYF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAATTTCAATTCCTCGATTCTGGTTTTGATTCTCATTGTATTTCCACTTGTTCTATCGGAACAAGAACCAATTAAAGCAATAATCCACCGTTTGGATTCCAAAAC
TTTATCTCCTTCAATTCAGGAAGCTGCAGCAAAGGGTCTTCTCCGGCGATTGCTTCCGACTCACGTAGATAGCTTCAAGTTTCAGATTGTTTATAAGGACGTTTGTGGTG
GAAGAAGCTGCTTCTTGATTAGTAATTTCAAGTCCTCTAGTCGCAATAATGGTGCAGAGATATTGATTAAAGGCACCACGGCAGTTGAAATTACATCCGGCCTTTACTGG
TACTTAAAATATTGGTGTGGTGCTCATGTTTCCTGGGACAAGACTGGTGGAGTTCAATTAGCTTCTATTCCTAAACCAGGATCTCTGCCTCTTTTAAAGGGTGACGGAGT
TGTGGTTAAGCGGCCAGTGCCATGGAACTATTACCAAAATGTTGTTACTTCAAGTTATGCCTATGTTTGGTGGGATTGGGAAAGATGGGAGAAAGAGATAGACTGGATGG
CCCTCCATGGAATTAACCTACCTTTGGCATTCACTGGGCAAGAATCTATTTGGAGAAGTGTTTTCAGGGATTTTAACCTCACCGTCAAAGATTTGGACAATTTCTTTGGT
GGACCTGCTTTCCTTGCCTGGGCTCGCATGGGAAATTTACATGGGTGGGGTGGGCCTTTAACACAAAATTGGTTGGATCAACAATTAGCTTTACAGAAACAGATACTATC
CCGAATGCGAGAGTTGGGGATGACTCCAGTTCTGCCATCATTCTCAGGAAATGTCCCAGCAGCTTTGGCAGAGATATTTCCCTCGGCAGACATAACTAGATTAGGAAACT
GGAACTCAATTGATGCAGATCCTAGTACATGCTGCACATACCTTCTTAATCCTTCGGATCCTCTATTTGTCAAGATTGGGGAGGCTTTTATCAGACAACAAATAAAAGAG
TATGGGGATGTAACAGACATTTACAACTGCGATACATTCAATGAAAATACTCCACCTACTAATGATACTTCATATATTTCATCACTTGGAGCTTCTGTCTATAAAGCTAT
GGTGAAAGCTGATAAGGATGCTGTGTGGCTTATGCAAGGATGGCTCTTCTATTCAGACTCTACTTTTTGGAAGCCTGATCAAATGAAAGCACTACTTCATTCGGTCCCAT
TTGGGAAAATGATTGTTCTTGATCTTTTTGCGGACGTCAAGCCAATTTGGAGAACATCATCTCAATTTTATGGCACGCCCTATGTATGGTGTATGTTGCATAACTTTGGT
GGAAATATAGAAATGTATGGTATATTGGATGCAATCTCTTCAGGTCCAGTTGATGCCCTTGCAAGTGAAAATTCAACAATGGTTGGCGTTGGCATGTGTATGGAAGGAAT
AGAGCATAATCCAGTTGTTTATGAATTGATGTCTGAAATGGCATTTCGCAGCAAAAAAGTTGAAGTCCAGGAGTGGTTGAAGACCTATTCCCGTTGTCGCTATGGTAAAG
CAGATCATTATGTTGAGGCAGCTTGGAAGATTCTTTATCATACAATTTACAATTGCACTGATGGCATTGCGGACCATAACAATGATTTCATTGTCAAACTTCCAGATTGG
GATCCATCTTCAAGCTTTGGTCTGAGCAAGCCACATCTATGGTATTCCACTCAGGAGGTTATCAATGCCTTGCAGCTACTCCTTAATGCAGACGATAATCTCGTCAACAG
CGCTCCATATAGATATGACTTGGTCGACTTAACACGGCAAGTGCTAGGGAAGCTGGCAAATGAAGAATATTTGAAAGCTGTAACTGCATTTCGGCGCAAGAATGTGAAGG
ATCTAAATCTTCATAGCAAGAGATTTGTTCAATTAATAAGAGATATTGACAGACTGCTTGCTTCTAATTCAAATTTTCTGCTTGGAACATGGCTTGAAAGTGCAAAGAAG
TTGGCCACAAATCCATCTGAGATGAAGCAGTATGAATGGAATGCAAGAACACAAGTCACTATGTGGTATGATAACACAAAAGTCAATCAGAGCAAACTTCATGATTATGC
TAATAAGTACTGGAGTGGGCTACTTGAAGGTTACTATCTCCCAAGAGCTTTGACCTATTTTTATTACCTATCAAAAAGCTTGAGAAAAAATGAGAGCTTCCATTTAGAAG
ACTGGAGAAGAGAATGGATTTCATTCTCAAACAAATGGCAAGCAGCTTCAGAAAAATACCCAGTTCAAGCTGAAGGAAATGCAATTGCTATTTCTAGAGCCTTATATAAA
AAGTACTTTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCAATTTCAATTCCTCGATTCTGGTTTTGATTCTCATTGTATTTCCACTTGTTCTATCGGAACAAGAACCAATTAAAGCAATAATCCACCGTTTGGATTCCAAAAC
TTTATCTCCTTCAATTCAGGAAGCTGCAGCAAAGGGTCTTCTCCGGCGATTGCTTCCGACTCACGTAGATAGCTTCAAGTTTCAGATTGTTTATAAGGACGTTTGTGGTG
GAAGAAGCTGCTTCTTGATTAGTAATTTCAAGTCCTCTAGTCGCAATAATGGTGCAGAGATATTGATTAAAGGCACCACGGCAGTTGAAATTACATCCGGCCTTTACTGG
TACTTAAAATATTGGTGTGGTGCTCATGTTTCCTGGGACAAGACTGGTGGAGTTCAATTAGCTTCTATTCCTAAACCAGGATCTCTGCCTCTTTTAAAGGGTGACGGAGT
TGTGGTTAAGCGGCCAGTGCCATGGAACTATTACCAAAATGTTGTTACTTCAAGTTATGCCTATGTTTGGTGGGATTGGGAAAGATGGGAGAAAGAGATAGACTGGATGG
CCCTCCATGGAATTAACCTACCTTTGGCATTCACTGGGCAAGAATCTATTTGGAGAAGTGTTTTCAGGGATTTTAACCTCACCGTCAAAGATTTGGACAATTTCTTTGGT
GGACCTGCTTTCCTTGCCTGGGCTCGCATGGGAAATTTACATGGGTGGGGTGGGCCTTTAACACAAAATTGGTTGGATCAACAATTAGCTTTACAGAAACAGATACTATC
CCGAATGCGAGAGTTGGGGATGACTCCAGTTCTGCCATCATTCTCAGGAAATGTCCCAGCAGCTTTGGCAGAGATATTTCCCTCGGCAGACATAACTAGATTAGGAAACT
GGAACTCAATTGATGCAGATCCTAGTACATGCTGCACATACCTTCTTAATCCTTCGGATCCTCTATTTGTCAAGATTGGGGAGGCTTTTATCAGACAACAAATAAAAGAG
TATGGGGATGTAACAGACATTTACAACTGCGATACATTCAATGAAAATACTCCACCTACTAATGATACTTCATATATTTCATCACTTGGAGCTTCTGTCTATAAAGCTAT
GGTGAAAGCTGATAAGGATGCTGTGTGGCTTATGCAAGGATGGCTCTTCTATTCAGACTCTACTTTTTGGAAGCCTGATCAAATGAAAGCACTACTTCATTCGGTCCCAT
TTGGGAAAATGATTGTTCTTGATCTTTTTGCGGACGTCAAGCCAATTTGGAGAACATCATCTCAATTTTATGGCACGCCCTATGTATGGTGTATGTTGCATAACTTTGGT
GGAAATATAGAAATGTATGGTATATTGGATGCAATCTCTTCAGGTCCAGTTGATGCCCTTGCAAGTGAAAATTCAACAATGGTTGGCGTTGGCATGTGTATGGAAGGAAT
AGAGCATAATCCAGTTGTTTATGAATTGATGTCTGAAATGGCATTTCGCAGCAAAAAAGTTGAAGTCCAGGAGTGGTTGAAGACCTATTCCCGTTGTCGCTATGGTAAAG
CAGATCATTATGTTGAGGCAGCTTGGAAGATTCTTTATCATACAATTTACAATTGCACTGATGGCATTGCGGACCATAACAATGATTTCATTGTCAAACTTCCAGATTGG
GATCCATCTTCAAGCTTTGGTCTGAGCAAGCCACATCTATGGTATTCCACTCAGGAGGTTATCAATGCCTTGCAGCTACTCCTTAATGCAGACGATAATCTCGTCAACAG
CGCTCCATATAGATATGACTTGGTCGACTTAACACGGCAAGTGCTAGGGAAGCTGGCAAATGAAGAATATTTGAAAGCTGTAACTGCATTTCGGCGCAAGAATGTGAAGG
ATCTAAATCTTCATAGCAAGAGATTTGTTCAATTAATAAGAGATATTGACAGACTGCTTGCTTCTAATTCAAATTTTCTGCTTGGAACATGGCTTGAAAGTGCAAAGAAG
TTGGCCACAAATCCATCTGAGATGAAGCAGTATGAATGGAATGCAAGAACACAAGTCACTATGTGGTATGATAACACAAAAGTCAATCAGAGCAAACTTCATGATTATGC
TAATAAGTACTGGAGTGGGCTACTTGAAGGTTACTATCTCCCAAGAGCTTTGACCTATTTTTATTACCTATCAAAAAGCTTGAGAAAAAATGAGAGCTTCCATTTAGAAG
ACTGGAGAAGAGAATGGATTTCATTCTCAAACAAATGGCAAGCAGCTTCAGAAAAATACCCAGTTCAAGCTGAAGGAAATGCAATTGCTATTTCTAGAGCCTTATATAAA
AAGTACTTTGGTTGA
Protein sequenceShow/hide protein sequence
MSNFNSSILVLILIVFPLVLSEQEPIKAIIHRLDSKTLSPSIQEAAAKGLLRRLLPTHVDSFKFQIVYKDVCGGRSCFLISNFKSSSRNNGAEILIKGTTAVEITSGLYW
YLKYWCGAHVSWDKTGGVQLASIPKPGSLPLLKGDGVVVKRPVPWNYYQNVVTSSYAYVWWDWERWEKEIDWMALHGINLPLAFTGQESIWRSVFRDFNLTVKDLDNFFG
GPAFLAWARMGNLHGWGGPLTQNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAALAEIFPSADITRLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKE
YGDVTDIYNCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSTFWKPDQMKALLHSVPFGKMIVLDLFADVKPIWRTSSQFYGTPYVWCMLHNFG
GNIEMYGILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVEVQEWLKTYSRCRYGKADHYVEAAWKILYHTIYNCTDGIADHNNDFIVKLPDW
DPSSSFGLSKPHLWYSTQEVINALQLLLNADDNLVNSAPYRYDLVDLTRQVLGKLANEEYLKAVTAFRRKNVKDLNLHSKRFVQLIRDIDRLLASNSNFLLGTWLESAKK
LATNPSEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWISFSNKWQAASEKYPVQAEGNAIAISRALYK
KYFG