CuGenDBv2

PI0008465 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0008465
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Heparanase-like protein 1
Genome location: chr09:11669843..11672951
RNA-Seq Expression: PI0008465
Synteny: PI0008465
Gene Ontology terms: GO:0016020 - membrane (cellular component)
GO:0004566 - beta-glucuronidase activity (molecular function)
InterPro domains: IPR005199 - Glycoside hydrolase, family 79
IPR017853 - Glycoside hydrolase superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0044757.1 heparanase-like protein 1 [Cucumis melo var. makuwa]  e-value: 1.0e-220  identity: 74.36%
Query:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG
        F P+   G+NVTIGKIVVDGTTKI+ETDENFICFTLDIW HDECSQPNLC WD HAS+LNLDLSLPILNKAV++FKTL+IRVGGTLQDRLIY+IG+GF+G
Subjt:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG

Query:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----VNASQYAK
        NCHPF+AD  LLF+FTEGCLYMERWDDLNKFFNNTGA+VTFGLNALLGKY+TKG+QWEGNWN+ NAEAL+KYTVD  Y INSWEF N     V+A+QYAK
Subjt:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----VNASQYAK

Query:  NLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIY--------IYN------MGAVSNTFRQLKNIIEKHAPWASAWVGEA
        +LLKLRE++DRLY+NSQQKP+IVAPGAFFDDKWY ELVTKTG NVV+ LTHHIY        IY       +  VS TFRQLKNI+EKHAPW+SAWVGEA
Subjt:  NLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIY--------IYN------MGAVSNTFRQLKNIIEKHAPWASAWVGEA

Query:  GGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTM
        GGAYHGG+  ISD FINSFW                  QTL+GGFY++L+AKT++PT DYYGALLFHRLMG   LKV N VS YLRTYAHCSR KSGVTM
Subjt:  GGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTM

Query:  LFINLSNTTKFTIKIENHMNLSLHK-SKPK-NSSPSKNVGT-QREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFI
        LFINLSNTT+FTI +E+ +NLSLHK  KP  +SS + NVGT +REEYHLTP+NG++RSSTVLLNG+ LELTKEGELPDL P+Y+DSNSSI+IA WSI FI
Subjt:  LFINLSNTTKFTIKIENHMNLSLHK-SKPK-NSSPSKNVGT-QREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFI

Query:  VIPDFVAVGCN
        VIPDFVAVGCN
Subjt:  VIPDFVAVGCN

KAA0044764.1 heparanase-like protein 1 [Cucumis melo var. makuwa]  e-value: 1.4e-222  identity: 75.94%
Query:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG
        F P+   GHNVT GKIVVDGTTKI+ETDENFICFTLDIW HDECSQPNLC WD HAS+LNLDLSLPILNKAV++FKTL+IRVGGTLQDRLIY+IG+GF+G
Subjt:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG

Query:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VN
        NCHPF+AD  LLF+FTEGCLYMERWDDLNKFFNNTGA+VTFGLNALLGKY+TKG+QWEGNWN+ NAEAL+KYTVD  Y INSWEFGN           V+
Subjt:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VN

Query:  ASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGAVSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGGS
        A+QYAK+LLKLRE++DRLY+NSQQKP+IVAPGAFFDDKWY ELVTKTG NVV+ LTHH  IYNM  VS TFRQLKNI+EKHAPW+SAWVGEAGGAYHGG+
Subjt:  ASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGAVSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGGS

Query:  PHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLSNT
          ISD FINSFW                  QTL+GGFY++L+AKT++PT DYYGALLFHRLMG   LKV N VS YLRTYAHCSR KSGVTMLFINLSNT
Subjt:  PHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLSNT

Query:  TKFTIKIENHMNLSLHK-SKPK-NSSPSKNVG-TQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAV
        T+FTI +E+ MNLSLHK  KP  +SS + NVG T+REEYHLTP+NG++RSSTVLLNG+ LELTKEGELPDL P+YRDSNSSI+IA WSI FIVIPDFVA+
Subjt:  TKFTIKIENHMNLSLHK-SKPK-NSSPSKNVG-TQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAV

Query:  GCN
        GCN
Subjt:  GCN

KGN53270.1 hypothetical protein Csa_015114 [Cucumis sativus]  e-value: 3.6e-218  identity: 71.73%
Query:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG
        F P+   G NVT+GKIVV+G TKI+ETDENFICFTLDIW HDECSQPNLC WD HAS+LN+DLSLPI+NKAV++FKTL+IRVGGTLQDRLIY+IG+GFKG
Subjt:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG

Query:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VN
        NCHPF+AD  LLFDFTEGCLYMERWDDLN FFNNTGAIVTFGLNALLGKY+T+G+QWEGNWNY NAEAL+KYTVDK Y INSWEFGN           ++
Subjt:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VN

Query:  ASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEKHA
        ASQYAK+LLKLREI+DRLYKNSQQKPLIVAPGAFFDDKWY ELVTKTG  VVS LTHH  IYNMGA                  VSNTF+QLKNI++KHA
Subjt:  ASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEKHA

Query:  PWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAH
        PW+SAWVGEAGGAY GG+  ISD+FINSFW                  QTLIGGFY++L+AKT +PT DYYGALLFHRLMG   LKV N VS+YLRTYAH
Subjt:  PWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAH

Query:  CSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHK-SKPKNSSPS-KNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSIS
        CSR +SG++MLFINLSNTT+F I +++HM LSLHK  KPK+ S S  N+GT REEYHLTP+NG++RSS VLLNG+ L+LT EGELP+L P+Y+DSNSSI+
Subjt:  CSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHK-SKPKNSSPS-KNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSIS

Query:  IANWSITFIVIPDFVAVGCN
        IA WSI F+VIPDFVA+GCN
Subjt:  IANWSITFIVIPDFVAVGCN

XP_004140375.1 heparanase-like protein 1 [Cucumis sativus]  e-value: 2.0e-240  identity: 80%
Query:  FCFHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGF
        F F P+   G NVT GKIVVDGTTKI+ETDENFICFTLDIW HDECSQPNLC WDGHASMLN+DLSLPILNKAV++FKTL+IRVGGTLQDRLIY+IGDGF
Subjt:  FCFHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGF

Query:  KGNCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------
        KGNC+PF+A K LLFDFTEGCLYMERWDDLN FFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNY NAEAL+KYTV+KKYNINSWEFGN           
Subjt:  KGNCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------

Query:  VNASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEK
        V+ASQYAK+LLKLR+IIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVS LTHH  IYNMGA                  VSNTFRQLKNIIEK
Subjt:  VNASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEK

Query:  HAPWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTY
        HAPWASAWVGEAGGAYHGG  HISD FINSFW                  QTL+GG+Y +LR KT+IPT DYYGALLFHRLMGSS LKVDNNVSSYLRTY
Subjt:  HAPWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTY

Query:  AHCSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSIS
        AHCSRG+SGVTMLFINLSNTT+FTI IENHMNLSLHKSKPK+SS SKNVGTQREEYHLTP+NG++RSSTVLLNG+ LELT EGE+PDL PVYRDSNSSIS
Subjt:  AHCSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSIS

Query:  IANWSITFIVIPDFVAVGCN
        I NWSI FIVIPDFVA+GCN
Subjt:  IANWSITFIVIPDFVAVGCN

XP_038877281.1 LOW QUALITY PROTEIN: heparanase-like protein 1 [Benincasa hispida]  e-value: 6.2e-210  identity: 70.84%
Query:  GHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQA
        GHNV  GKIVVD TTKI+ETDENFICFT+DIW HDECSQPNLC WD HAS+LN+DLSLP+LNKAV++FK+L+IR+GGTLQDRLIY++G+GFK +C PFQA
Subjt:  GHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQA

Query:  DKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFG-----------NVNASQYAKN
        D DLLFDF+EGCLYMERWDDLN FFNNTGAI+TFGLNALLGK+NTKG+QWEGNWNY NAEAL++YTVDK Y INSWEFG           +V+ASQYAK+
Subjt:  DKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFG-----------NVNASQYAKN

Query:  LLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEKHAPWASAWV
        L+KLREIIDRLY NSQQKPL+VAPGAFFDDKWY ELV K  SN+V+ LTHH  IYNMG                   VSNTF+QL NII+KHAPWASAWV
Subjt:  LLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEKHAPWASAWV

Query:  GEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSG
        GEAGGAYHGGSPHISDAFINSFW                  QTLIGGFY++L++KTY+PT DYYGALLFHRLMGS  LKVDNNVSSYLRTYAHCSRG+SG
Subjt:  GEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSG

Query:  VTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSK-NVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITF
        VT+LFINLSNTT+FTIKIE+HMN SLH S  +  + S     ++ ++ +L  +NG++RSSTVLLN   LELTKEGELP+  PVY +SNSSI+IA WSI F
Subjt:  VTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSK-NVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITF

Query:  IVIPDFVAVGC
        IVIPDFVA GC
Subjt:  IVIPDFVAVGC

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KTJ9 Uncharacterized protein  e-value: 1.9e-241  identity: 78.12%
Query:  MLHVVGILESTI-YGIPNIPFDFGFCFHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRS
        ML + GILE T+ Y I        F F P+   G NVT GKIVVDGTTKI+ETDENFICFTLDIW HDECSQPNLC WDGHASMLN+DLSLPILNKAV++
Subjt:  MLHVVGILESTI-YGIPNIPFDFGFCFHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRS

Query:  FKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTV
        FKTL+IRVGGTLQDRLIY+IGDGFKGNC+PF+A K LLFDFTEGCLYMERWDDLN FFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNY NAEAL+KYTV
Subjt:  FKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTV

Query:  DKKYNINSWEFGN-----------VNASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA--------
        +KKYNINSWEFGN           V+ASQYAK+LLKLR+IIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVS LTHH  IYNMGA        
Subjt:  DKKYNINSWEFGN-----------VNASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA--------

Query:  ----------VSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGAL
                  VSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGG  HISD FINSFW                  QTL+GG+Y +LR KT+IPT DYYGAL
Subjt:  ----------VSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGAL

Query:  LFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQREEYHLTPENGIIRSSTVLLNGRE
        LFHRLMGSS LKVDNNVSSYLRTYAHCSRG+SGVTMLFINLSNTT+FTI IENHMNLSLHKSKPK+SS SKNVGTQREEYHLTP+NG++RSSTVLLNG+ 
Subjt:  LFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQREEYHLTPENGIIRSSTVLLNGRE

Query:  LELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAVGCN
        LELT EGE+PDL PVYRDSNSSISI NWSI FIVIPDFVA+GCN
Subjt:  LELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAVGCN

A0A0A0KUF1 Uncharacterized protein  e-value: 1.7e-218  identity: 71.73%
Query:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG
        F P+   G NVT+GKIVV+G TKI+ETDENFICFTLDIW HDECSQPNLC WD HAS+LN+DLSLPI+NKAV++FKTL+IRVGGTLQDRLIY+IG+GFKG
Subjt:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG

Query:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VN
        NCHPF+AD  LLFDFTEGCLYMERWDDLN FFNNTGAIVTFGLNALLGKY+T+G+QWEGNWNY NAEAL+KYTVDK Y INSWEFGN           ++
Subjt:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VN

Query:  ASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEKHA
        ASQYAK+LLKLREI+DRLYKNSQQKPLIVAPGAFFDDKWY ELVTKTG  VVS LTHH  IYNMGA                  VSNTF+QLKNI++KHA
Subjt:  ASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEKHA

Query:  PWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAH
        PW+SAWVGEAGGAY GG+  ISD+FINSFW                  QTLIGGFY++L+AKT +PT DYYGALLFHRLMG   LKV N VS+YLRTYAH
Subjt:  PWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAH

Query:  CSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHK-SKPKNSSPS-KNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSIS
        CSR +SG++MLFINLSNTT+F I +++HM LSLHK  KPK+ S S  N+GT REEYHLTP+NG++RSS VLLNG+ L+LT EGELP+L P+Y+DSNSSI+
Subjt:  CSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHK-SKPKNSSPS-KNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSIS

Query:  IANWSITFIVIPDFVAVGCN
        IA WSI F+VIPDFVA+GCN
Subjt:  IANWSITFIVIPDFVAVGCN

A0A5A7TNQ5 Heparanase-like protein 1  e-value: 6.9e-223  identity: 75.94%
Query:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG
        F P+   GHNVT GKIVVDGTTKI+ETDENFICFTLDIW HDECSQPNLC WD HAS+LNLDLSLPILNKAV++FKTL+IRVGGTLQDRLIY+IG+GF+G
Subjt:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG

Query:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VN
        NCHPF+AD  LLF+FTEGCLYMERWDDLNKFFNNTGA+VTFGLNALLGKY+TKG+QWEGNWN+ NAEAL+KYTVD  Y INSWEFGN           V+
Subjt:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VN

Query:  ASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGAVSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGGS
        A+QYAK+LLKLRE++DRLY+NSQQKP+IVAPGAFFDDKWY ELVTKTG NVV+ LTHH  IYNM  VS TFRQLKNI+EKHAPW+SAWVGEAGGAYHGG+
Subjt:  ASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGAVSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGGS

Query:  PHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLSNT
          ISD FINSFW                  QTL+GGFY++L+AKT++PT DYYGALLFHRLMG   LKV N VS YLRTYAHCSR KSGVTMLFINLSNT
Subjt:  PHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLSNT

Query:  TKFTIKIENHMNLSLHK-SKPK-NSSPSKNVG-TQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAV
        T+FTI +E+ MNLSLHK  KP  +SS + NVG T+REEYHLTP+NG++RSSTVLLNG+ LELTKEGELPDL P+YRDSNSSI+IA WSI FIVIPDFVA+
Subjt:  TKFTIKIENHMNLSLHK-SKPK-NSSPSKNVG-TQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAV

Query:  GCN
        GCN
Subjt:  GCN

A0A5A7TSB3 Heparanase-like protein 1  e-value: 4.9e-221  identity: 74.36%
Query:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG
        F P+   G+NVTIGKIVVDGTTKI+ETDENFICFTLDIW HDECSQPNLC WD HAS+LNLDLSLPILNKAV++FKTL+IRVGGTLQDRLIY+IG+GF+G
Subjt:  FHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKG

Query:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----VNASQYAK
        NCHPF+AD  LLF+FTEGCLYMERWDDLNKFFNNTGA+VTFGLNALLGKY+TKG+QWEGNWN+ NAEAL+KYTVD  Y INSWEF N     V+A+QYAK
Subjt:  NCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----VNASQYAK

Query:  NLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIY--------IYN------MGAVSNTFRQLKNIIEKHAPWASAWVGEA
        +LLKLRE++DRLY+NSQQKP+IVAPGAFFDDKWY ELVTKTG NVV+ LTHHIY        IY       +  VS TFRQLKNI+EKHAPW+SAWVGEA
Subjt:  NLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIY--------IYN------MGAVSNTFRQLKNIIEKHAPWASAWVGEA

Query:  GGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTM
        GGAYHGG+  ISD FINSFW                  QTL+GGFY++L+AKT++PT DYYGALLFHRLMG   LKV N VS YLRTYAHCSR KSGVTM
Subjt:  GGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTM

Query:  LFINLSNTTKFTIKIENHMNLSLHK-SKPK-NSSPSKNVGT-QREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFI
        LFINLSNTT+FTI +E+ +NLSLHK  KP  +SS + NVGT +REEYHLTP+NG++RSSTVLLNG+ LELTKEGELPDL P+Y+DSNSSI+IA WSI FI
Subjt:  LFINLSNTTKFTIKIENHMNLSLHK-SKPK-NSSPSKNVGT-QREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFI

Query:  VIPDFVAVGCN
        VIPDFVAVGCN
Subjt:  VIPDFVAVGCN

A0A6J1I2E3 heparanase-like protein 1  e-value: 2.1e-195  identity: 66.15%
Query:  GHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQA
        GH+ T+G IV+DGT  I+ETDENF+C TLDIW HDEC    LC WDGHASMLNLDL+LPILNKAV++FK+++IRVGGTLQD+LIY++G GFKG CHPFQA
Subjt:  GHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQA

Query:  DKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VNASQYAKN
            LFDF+ GCLYMERWDDLN FFNNTGAIVTFGLNALLGK+NTKGIQWEG WNY NAEAL++YTV+K Y INSWEFGN           + A+QYA++
Subjt:  DKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN-----------VNASQYAKN

Query:  LLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEKHAPWASAWV
        LLKLREIIDRLY NSQQKPLIVAPGAFFD+ WYDE V KTG  VV  LTHH  IYNMGA                  VSNTF QL+N+I+K+APWA+AWV
Subjt:  LLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA------------------VSNTFRQLKNIIEKHAPWASAWV

Query:  GEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSG
        GEAGGAY GGSPH+SD FINSFW                  QTLIGGFY +L++ T +PT DYYGALLFHRLMG   LK++N VSS LR+YAHCSRG+SG
Subjt:  GEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSG

Query:  VTMLFINLSNTTKFTIKIENHMNLSLHKSKPK-NSSPSK---NVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWS
        VT+L INLSNTT F I ++N MN+SL KS  + N S SK        REEYHLTP++G++RSSTVLLNG  LE TKEG++P+LVPVYR SNS I IA+WS
Subjt:  VTMLFINLSNTTKFTIKIENHMNLSLHKSKPK-NSSPSK---NVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWS

Query:  ITFIVIPDFVAVGC
        I F+VIPDFV   C
Subjt:  ITFIVIPDFVAVGC

SwissProt top hits (e-value, % identity, alignment)
Q8L608 Heparanase-like protein 2  e-value: 3.4e-126  identity: 43.64%
Query:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC
        P   FG N+    +V+DG+ +I+ETDENFIC TLD W  ++C+  + C W G+AS++NL+L+ P+L KA+++F+TL+IR+GG+LQD++IYD+GD  K  C
Subjt:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC

Query:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ
          F+   D LF F+EGCLYM+RWD++N FFN TGAIVTFGLNAL G+    G  W G+W++ N +  + YTV K Y I+SWEFGN          V+   
Subjt:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ

Query:  YAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWASA
        Y K+L+ L+ +I  +YKNS+ KPL+VAPG FF+++WY EL+  +G  V+  LTHHIY    G                 +S  F  +   I++H PWA+A
Subjt:  YAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWASA

Query:  WVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGK
        WVGEAGGA++ G   +S+ FINSFW                  Q L+GGFY +L  +T++P  DYY ALL+HRLMG   L V    S YLR Y HCS+ +
Subjt:  WVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGK

Query:  SGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQ--------------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYR
        +G+T+L INLS  T FT+ + N + + L     K  S  + + ++              REEYHL+P++G +RS  +LLNG+ L  T  G++P L PV  
Subjt:  SGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQ--------------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYR

Query:  DSNSSISIANWSITFIVIPDFVAVGCN
           S + I   SI+FIV+P F A  C+
Subjt:  DSNSSISIANWSITFIVIPDFVAVGCN

Q9FF10 Heparanase-like protein 1  e-value: 3.9e-130  identity: 43.75%
Query:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC
        P+      +    IV+ G  ++ ETDENF+C TLD W HD+C+  + C W G++S++N+DL+ P+L KA+++FK L+IR+GG+LQD++IYD+G+  K  C
Subjt:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC

Query:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ
         PFQ     LF F++GCL+M+RWD+LN F   TGA+VTFGLNAL G++  +G  W G W++ N +  + YTV K Y I+SWEFGN          V+A  
Subjt:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ

Query:  YAKNLLKLREIIDRLYKNS-QQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWAS
        Y K+L+ L+++I+++YKNS   KP++VAPG F++ +WY +L+  +G +VV  +THHIY    G                 VS TF+ +   I++H PWAS
Subjt:  YAKNLLKLREIIDRLYKNS-QQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWAS

Query:  AWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRG
         WVGE+GGAY+ G  H+SD FI+SFW                  QTL+GGFY +L   T++P  DYY ALL+HRLMG   L V  +    LR YAHCS+G
Subjt:  AWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRG

Query:  KSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSS-------PSKNVGTQ-------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVY
        ++GVT+L INLSN + FT+ + N +N+ L+    K  S       P   +G++       REEYHLTPENG++RS T++LNG+ L+ T  G++P L PV 
Subjt:  KSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSS-------PSKNVGTQ-------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVY

Query:  RDSNSSISIANWSITFIVIPDFVAVGCN
        R  NS +++   S++FIV+P+F A  C+
Subjt:  RDSNSSISIANWSITFIVIPDFVAVGCN

Q9FZP1 Heparanase-like protein 3  e-value: 5.4e-100  identity: 39.49%
Query:  GKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQADKDLLF
        G + V G   +   DE+FIC TLD W  ++C   + C+WD HAS+LNLDL+  IL  A+++F  LKIR+GGTLQD +IY+  D  K  C PF  +  +LF
Subjt:  GKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQADKDLLF

Query:  DFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQYAKNLLKLREI
         +T+GCL M RWD+LN FF  TG  V FGLNAL G+      +  G WNY NAE+ +++T +  Y I+ WE GN          V A+QYA + + LR I
Subjt:  DFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQYAKNLLKLREI

Query:  IDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA----------------VSNTFRQLKNIIEKHAPWASAWVGEAGGAYHG
        ++R+YKN    PL++ PG FF+  W+ E + K   N ++  T HIY    G                  + +FR LKNII+  +  A AWVGE+GGAY+ 
Subjt:  IDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA----------------VSNTFRQLKNIIEKHAPWASAWVGEAGGAYHG

Query:  GSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLS
        G   +S+AF+ SFW                  Q+LIGG Y +L    + P  DYY AL++ +LMG  AL    + +  +R+Y HC+R   G+T+L +NL 
Subjt:  GSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLS

Query:  NTTKFTIKIENHMNLSLHKSKPKNSSP--------SKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIV
        NTT    K+E + + SL  +K   S            N   QREEYHLT ++G + S T+LLNG  L++   G+LP + P++ +S   I+IA +SI F+ 
Subjt:  NTTKFTIKIENHMNLSLHKSKPKNSSP--------SKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIV

Query:  IPDFVAVGC
        + + V   C
Subjt:  IPDFVAVGC

Q9LRC8 Baicalin-beta-D-glucuronidase  e-value: 3.8e-85  identity: 36.58%
Query:  GHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQA
        G   TI KI       +++TDEN++C TLD+W   +C+  N C W G +S LNLDL+  I+  AV+ F  LK+R GGTLQDRL+Y        +   F  
Subjt:  GHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQA

Query:  DKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGK-YNTKGIQWE-----------GNWNYRNAEALVKYTVDKKY-NINSWEFGN-------
        + +L+ DF+  CL ++RWD++N+F   TG+   FGLNAL GK    KGI  +           G W+Y N++ L++Y++ K Y +I  W  GN       
Subjt:  DKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGK-YNTKGIQWE-----------GNWNYRNAEALVKYTVDKKY-NINSWEFGN-------

Query:  ---VNASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG-----------------AVSNTFRQLKNII
           V+   YA +  KL E++  +Y++    PLI+APGA FD +WY E + +T    +   THH+Y    G                 A  + +  L+ I+
Subjt:  ---VNASQYAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG-----------------AVSNTFRQLKNII

Query:  EKHAPWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLR
         +    A AW+GEAGGA++ G   IS+ FIN FW                  QTL GG Y +L+  TYIP  DYY ALL+HRLMGS  LK +   +  + 
Subjt:  EKHAPWASAWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLR

Query:  TYAHCSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSS
         YAHC++  +G+TML +N           E+ + +SL  SK          G++REEYHLTP N  ++S  V LNG  L L   G +P L PV +D++  
Subjt:  TYAHCSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSS

Query:  ISIANWSITFIVIP
        + +A +S  F+ +P
Subjt:  ISIANWSITFIVIP

X4Y2L4 Hyaluronoglucuronidase  e-value: 6.3e-24  identity: 25.54%
Query:  LDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQADKD---------LLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYN
        +D++ P L K +        RVGGT  + L +D+ +  K   +    DK           LF   +  L  E +DDL K    +   + F LNA +    
Subjt:  LDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQADKD---------LLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYN

Query:  TKGIQWEGNWNYRNAEALVKYTVDKKYNIN-SWEFGNVNASQYAKNLL--KLREIIDRLYKNSQQKPL-----IVAPGAFFDDKWYDELVTKTGSNVVST
          G +    W+   AE L KY V K Y  N  WE GN      A NL   ++ E    L+K  ++ P      +V P   +    Y + +     + V+ 
Subjt:  TKGIQWEGNWNYRNAEALVKYTVDKKYNIN-SWEFGNVNASQYAKNLL--KLREIIDRLYKNSQQKPL-----IVAPGAFFDDKWYDELVTKTGSNVVST

Query:  LTHHIYIY--NMGAVS-----NTFRQLKNIIE------KHAPWAS--AWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYN
         T H Y +  N   VS       F++L+ + +      K++P      W+GE    Y+ G+  +SD +++ F                   QT+  G+Y 
Subjt:  LTHHIYIY--NMGAVS-----NTFRQLKNIIE------KHAPWAS--AWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYN

Query:  ILRAKTYIPTLDYYGALLFHRLMGSSALKVD-NNVSSYLRTYAHCSRGKSGVTMLFINLSNTTKFTIKI-ENHMNLSLHKSKPKNSSPSKNVGTQREEYH
        +L   T  P  DY+   + + L+G++  KVD ++ ++  R YA C++  S  T       + T F + + +  + L + +            G +   Y 
Subjt:  ILRAKTYIPTLDYYGALLFHRLMGSSALKVD-NNVSSYLRTYAHCSRGKSGVTMLFINLSNTTKFTIKI-ENHMNLSLHKSKPKNSSPSKNVGTQREEYH

Query:  LTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAVGC
        LTPE G + S  VLLNG+EL+L  + +LP+L     +S +S +++  +  F V+ D     C
Subjt:  LTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAVGC

Arabidopsis top hits (e-value, % identity, alignment)
AT5G07830.1 glucuronidase 2  e-value: 2.7e-131  identity: 43.75%
Query:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC
        P+      +    IV+ G  ++ ETDENF+C TLD W HD+C+  + C W G++S++N+DL+ P+L KA+++FK L+IR+GG+LQD++IYD+G+  K  C
Subjt:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC

Query:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ
         PFQ     LF F++GCL+M+RWD+LN F   TGA+VTFGLNAL G++  +G  W G W++ N +  + YTV K Y I+SWEFGN          V+A  
Subjt:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ

Query:  YAKNLLKLREIIDRLYKNS-QQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWAS
        Y K+L+ L+++I+++YKNS   KP++VAPG F++ +WY +L+  +G +VV  +THHIY    G                 VS TF+ +   I++H PWAS
Subjt:  YAKNLLKLREIIDRLYKNS-QQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWAS

Query:  AWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRG
         WVGE+GGAY+ G  H+SD FI+SFW                  QTL+GGFY +L   T++P  DYY ALL+HRLMG   L V  +    LR YAHCS+G
Subjt:  AWVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRG

Query:  KSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSS-------PSKNVGTQ-------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVY
        ++GVT+L INLSN + FT+ + N +N+ L+    K  S       P   +G++       REEYHLTPENG++RS T++LNG+ L+ T  G++P L PV 
Subjt:  KSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSS-------PSKNVGTQ-------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVY

Query:  RDSNSSISIANWSITFIVIPDFVAVGCN
        R  NS +++   S++FIV+P+F A  C+
Subjt:  RDSNSSISIANWSITFIVIPDFVAVGCN

AT5G34940.1 glucuronidase 3  e-value: 8.4e-72  identity: 37.41%
Query:  MERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQYAKNLLKLREIIDRLYKNS
        M RWD+LN FF  TG  V FGLNAL G+      +  G WNY NAE+ +++T +  Y I+ WE GN          V A+QYA + + LR I++R+YKN 
Subjt:  MERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQYAKNLLKLREIIDRLYKNS

Query:  QQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA----------------VSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGGSPHISDA
           PL++ PG FF+  W+ E + K   N ++  T HIY    G                  + +FR LKNII+  +  A AWVGE+GGAY+ G   +S+A
Subjt:  QQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA----------------VSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGGSPHISDA

Query:  FINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLSNTTKFTIK
        F+ SFW                  Q+LIGG Y +L    + P  DYY AL++ +LMG  AL    + +  +R+Y HC+R   G+T+L +NL NTT    K
Subjt:  FINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLSNTTKFTIK

Query:  IENHMNLSLHKSKPKNSSP--------SKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAVG
        +E + + SL  +K   S            N   QREEYHLT ++G + S T+LLNG  L++   G+LP + P++ +S   I+IA +SI F+ + + V   
Subjt:  IENHMNLSLHKSKPKNSSP--------SKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAVG

Query:  C
        C
Subjt:  C

AT5G34940.2 glucuronidase 3  e-value: 3.9e-101  identity: 39.49%
Query:  GKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQADKDLLF
        G + V G   +   DE+FIC TLD W  ++C   + C+WD HAS+LNLDL+  IL  A+++F  LKIR+GGTLQD +IY+  D  K  C PF  +  +LF
Subjt:  GKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNCHPFQADKDLLF

Query:  DFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQYAKNLLKLREI
         +T+GCL M RWD+LN FF  TG  V FGLNAL G+      +  G WNY NAE+ +++T +  Y I+ WE GN          V A+QYA + + LR I
Subjt:  DFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQYAKNLLKLREI

Query:  IDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA----------------VSNTFRQLKNIIEKHAPWASAWVGEAGGAYHG
        ++R+YKN    PL++ PG FF+  W+ E + K   N ++  T HIY    G                  + +FR LKNII+  +  A AWVGE+GGAY+ 
Subjt:  IDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGA----------------VSNTFRQLKNIIEKHAPWASAWVGEAGGAYHG

Query:  GSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLS
        G   +S+AF+ SFW                  Q+LIGG Y +L    + P  DYY AL++ +LMG  AL    + +  +R+Y HC+R   G+T+L +NL 
Subjt:  GSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLS

Query:  NTTKFTIKIENHMNLSLHKSKPKNSSP--------SKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIV
        NTT    K+E + + SL  +K   S            N   QREEYHLT ++G + S T+LLNG  L++   G+LP + P++ +S   I+IA +SI F+ 
Subjt:  NTTKFTIKIENHMNLSLHKSKPKNSSP--------SKNVGTQREEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIV

Query:  IPDFVAVGC
        + + V   C
Subjt:  IPDFVAVGC

AT5G61250.1 glucuronidase 1 (e value: 2.4e-127, %identity: 43.64)
Query:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC
        P   FG N+    +V+DG+ +I+ETDENFIC TLD W  ++C+  + C W G+AS++NL+L+ P+L KA+++F+TL+IR+GG+LQD++IYD+GD  K  C
Subjt:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC

Query:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ
          F+   D LF F+EGCLYM+RWD++N FFN TGAIVTFGLNAL G+    G  W G+W++ N +  + YTV K Y I+SWEFGN          V+   
Subjt:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ

Query:  YAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWASA
        Y K+L+ L+ +I  +YKNS+ KPL+VAPG FF+++WY EL+  +G  V+  LTHHIY    G                 +S  F  +   I++H PWA+A
Subjt:  YAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWASA

Query:  WVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGK
        WVGEAGGA++ G   +S+ FINSFW                  Q L+GGFY +L  +T++P  DYY ALL+HRLMG   L V    S YLR Y HCS+ +
Subjt:  WVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGK

Query:  SGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQ--------------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYR
        +G+T+L INLS  T FT+ + N + + L     K  S  + + ++              REEYHL+P++G +RS  +LLNG+ L  T  G++P L PV  
Subjt:  SGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQ--------------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYR

Query:  DSNSSISIANWSITFIVIPDFVAVGCN
           S + I   SI+FIV+P F A  C+
Subjt:  DSNSSISIANWSITFIVIPDFVAVGCN

AT5G61250.2 glucuronidase 1 (e value: 2.4e-127, %identity: 43.64)
Query:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC
        P   FG N+    +V+DG+ +I+ETDENFIC TLD W  ++C+  + C W G+AS++NL+L+ P+L KA+++F+TL+IR+GG+LQD++IYD+GD  K  C
Subjt:  PKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGTLQDRLIYDIGDGFKGNC

Query:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ
          F+   D LF F+EGCLYM+RWD++N FFN TGAIVTFGLNAL G+    G  W G+W++ N +  + YTV K Y I+SWEFGN          V+   
Subjt:  HPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGN----------VNASQ

Query:  YAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWASA
        Y K+L+ L+ +I  +YKNS+ KPL+VAPG FF+++WY EL+  +G  V+  LTHHIY    G                 +S  F  +   I++H PWA+A
Subjt:  YAKNLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMG----------------AVSNTFRQLKNIIEKHAPWASA

Query:  WVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGK
        WVGEAGGA++ G   +S+ FINSFW                  Q L+GGFY +L  +T++P  DYY ALL+HRLMG   L V    S YLR Y HCS+ +
Subjt:  WVGEAGGAYHGGSPHISDAFINSFW------------------QTLIGGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGK

Query:  SGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQ--------------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYR
        +G+T+L INLS  T FT+ + N + + L     K  S  + + ++              REEYHL+P++G +RS  +LLNG+ L  T  G++P L PV  
Subjt:  SGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQ--------------REEYHLTPENGIIRSSTVLLNGRELELTKEGELPDLVPVYR

Query:  DSNSSISIANWSITFIVIPDFVAVGCN
           S + I   SI+FIV+P F A  C+
Subjt:  DSNSSISIANWSITFIVIPDFVAVGCN


Sequences
CDS sequence
ATGTTGCATGTGGTAGGAATTTTGGAATCAACCATTTATGGAATACCAAATATTCCTTTTGATTTTGGTTTTTGCTTTCATCCCAAGAACTATTTTGGTCATAATGTTAC
AATAGGAAAAATTGTAGTTGATGGAACTACAAAAATATCGGAAACAGATGAGAATTTCATTTGTTTTACTTTGGACATTTGGTCTCATGATGAGTGTAGTCAACCCAACC
TTTGTGCTTGGGATGGTCATGCATCGATGCTTAATTTGGATCTGTCTCTTCCTATTCTTAACAAAGCTGTTCGATCTTTCAAGACATTAAAAATTAGAGTAGGAGGTACC
TTACAAGACAGGTTGATTTACGATATTGGTGATGGTTTCAAGGGAAATTGTCATCCATTTCAAGCCGACAAAGATTTACTTTTTGACTTTACAGAAGGTTGTTTATACAT
GGAAAGATGGGATGATTTGAACAAATTTTTCAACAATACAGGGGCAATTGTAACTTTTGGCTTAAATGCTCTACTGGGCAAGTACAACACAAAAGGAATACAATGGGAAG
GCAATTGGAACTACAGGAATGCTGAGGCTCTTGTTAAATATACAGTGGACAAGAAGTATAATATAAATTCATGGGAGTTTGGTAACGTTAATGCTTCACAATATGCAAAA
AATCTACTGAAACTTCGAGAAATCATAGATCGTTTGTACAAGAATTCCCAACAAAAACCTTTGATTGTTGCACCTGGTGCATTCTTTGATGACAAATGGTATGATGAACT
TGTTACAAAAACTGGATCAAATGTTGTTAGTACTCTCACTCATCATATATATATATATAACATGGGTGCAGTATCAAACACATTTAGACAACTAAAGAATATAATTGAAA
AGCATGCCCCTTGGGCTTCTGCTTGGGTTGGTGAAGCTGGTGGAGCCTACCATGGTGGCAGTCCTCATATTTCTGATGCATTTATCAATAGTTTTTGGCAAACTTTGATA
GGTGGATTTTACAATATTCTTAGAGCTAAAACTTATATTCCTACCCTAGACTACTATGGTGCACTTCTCTTCCACCGACTTATGGGCTCAAGTGCTCTCAAAGTTGATAA
TAATGTCTCTTCTTATCTTCGCACCTATGCTCATTGCTCGAGAGGAAAATCCGGTGTAACCATGCTTTTCATCAACTTGAGCAATACAACAAAGTTCACAATAAAGATTG
AAAACCATATGAACTTGAGTTTGCACAAAAGCAAACCCAAGAATAGTTCACCATCAAAGAATGTGGGAACACAAAGAGAGGAATATCATTTGACACCAGAAAATGGTATT
ATTAGAAGTTCTACGGTGCTTTTGAATGGAAGAGAATTGGAGCTTACAAAAGAAGGAGAATTGCCAGATCTTGTACCTGTCTATAGAGATAGTAACTCTTCTATAAGTAT
TGCTAATTGGTCCATTACTTTCATTGTCATCCCTGACTTTGTAGCCGTTGGATGCAATTAA
mRNA sequence
ATGTTGCATGTGGTAGGAATTTTGGAATCAACCATTTATGGAATACCAAATATTCCTTTTGATTTTGGTTTTTGCTTTCATCCCAAGAACTATTTTGGTCATAATGTTAC
AATAGGAAAAATTGTAGTTGATGGAACTACAAAAATATCGGAAACAGATGAGAATTTCATTTGTTTTACTTTGGACATTTGGTCTCATGATGAGTGTAGTCAACCCAACC
TTTGTGCTTGGGATGGTCATGCATCGATGCTTAATTTGGATCTGTCTCTTCCTATTCTTAACAAAGCTGTTCGATCTTTCAAGACATTAAAAATTAGAGTAGGAGGTACC
TTACAAGACAGGTTGATTTACGATATTGGTGATGGTTTCAAGGGAAATTGTCATCCATTTCAAGCCGACAAAGATTTACTTTTTGACTTTACAGAAGGTTGTTTATACAT
GGAAAGATGGGATGATTTGAACAAATTTTTCAACAATACAGGGGCAATTGTAACTTTTGGCTTAAATGCTCTACTGGGCAAGTACAACACAAAAGGAATACAATGGGAAG
GCAATTGGAACTACAGGAATGCTGAGGCTCTTGTTAAATATACAGTGGACAAGAAGTATAATATAAATTCATGGGAGTTTGGTAACGTTAATGCTTCACAATATGCAAAA
AATCTACTGAAACTTCGAGAAATCATAGATCGTTTGTACAAGAATTCCCAACAAAAACCTTTGATTGTTGCACCTGGTGCATTCTTTGATGACAAATGGTATGATGAACT
TGTTACAAAAACTGGATCAAATGTTGTTAGTACTCTCACTCATCATATATATATATATAACATGGGTGCAGTATCAAACACATTTAGACAACTAAAGAATATAATTGAAA
AGCATGCCCCTTGGGCTTCTGCTTGGGTTGGTGAAGCTGGTGGAGCCTACCATGGTGGCAGTCCTCATATTTCTGATGCATTTATCAATAGTTTTTGGCAAACTTTGATA
GGTGGATTTTACAATATTCTTAGAGCTAAAACTTATATTCCTACCCTAGACTACTATGGTGCACTTCTCTTCCACCGACTTATGGGCTCAAGTGCTCTCAAAGTTGATAA
TAATGTCTCTTCTTATCTTCGCACCTATGCTCATTGCTCGAGAGGAAAATCCGGTGTAACCATGCTTTTCATCAACTTGAGCAATACAACAAAGTTCACAATAAAGATTG
AAAACCATATGAACTTGAGTTTGCACAAAAGCAAACCCAAGAATAGTTCACCATCAAAGAATGTGGGAACACAAAGAGAGGAATATCATTTGACACCAGAAAATGGTATT
ATTAGAAGTTCTACGGTGCTTTTGAATGGAAGAGAATTGGAGCTTACAAAAGAAGGAGAATTGCCAGATCTTGTACCTGTCTATAGAGATAGTAACTCTTCTATAAGTAT
TGCTAATTGGTCCATTACTTTCATTGTCATCCCTGACTTTGTAGCCGTTGGATGCAATTAA
Protein sequence
MLHVVGILESTIYGIPNIPFDFGFCFHPKNYFGHNVTIGKIVVDGTTKISETDENFICFTLDIWSHDECSQPNLCAWDGHASMLNLDLSLPILNKAVRSFKTLKIRVGGT
LQDRLIYDIGDGFKGNCHPFQADKDLLFDFTEGCLYMERWDDLNKFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYRNAEALVKYTVDKKYNINSWEFGNVNASQYAK
NLLKLREIIDRLYKNSQQKPLIVAPGAFFDDKWYDELVTKTGSNVVSTLTHHIYIYNMGAVSNTFRQLKNIIEKHAPWASAWVGEAGGAYHGGSPHISDAFINSFWQTLI
GGFYNILRAKTYIPTLDYYGALLFHRLMGSSALKVDNNVSSYLRTYAHCSRGKSGVTMLFINLSNTTKFTIKIENHMNLSLHKSKPKNSSPSKNVGTQREEYHLTPENGI
IRSSTVLLNGRELELTKEGELPDLVPVYRDSNSSISIANWSITFIVIPDFVAVGCN