; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0002303 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0002303
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Descriptionheparan-alpha-glucosaminide N-acetyltransferase
Genome locationchr05:28698779..28711696
RNA-Seq ExpressionPay0002303
SyntenyPay0002303
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR012429 - Heparan-alpha-glucosaminide N-acetyltransferase, catalytic domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144775.1 heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Cucumis sativus]8.7e-18897.59Show/hide
Query:  MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV
        MDHGNNSPNEISQPLISMEEIK DST HHPHRLISVDSD LLPKP KSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV
Subjt:  MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV

Query:  GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN
        GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSR+TQSNVQPFNHFSIFKSYFWN
Subjt:  GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN

Query:  WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF
        WLV ACILVVYFALLYGIYVPDWQFTVTDS+SVYYGRNFTVACGVRG+LDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF
Subjt:  WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF

Query:  APFEPEGILSSISAILSTIIGVHFGHVLIHFQ
        APFEPEGILSSISAILSTIIGVHFGHVLIHFQ
Subjt:  APFEPEGILSSISAILSTIIGVHFGHVLIHFQ

XP_008454037.1 PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Cucumis melo]4.9e-19199.7Show/hide
Query:  MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV
        MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV
Subjt:  MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV

Query:  GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN
        GMAIALALKRI NQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN
Subjt:  GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN

Query:  WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF
        WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF
Subjt:  WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF

Query:  APFEPEGILSSISAILSTIIGVHFGHVLIHFQ
        APFEPEGILSSISAILSTIIGVHFGHVLIHFQ
Subjt:  APFEPEGILSSISAILSTIIGVHFGHVLIHFQ

XP_022946485.1 heparan-alpha-glucosaminide N-acetyltransferase [Cucurbita moschata]5.7e-17191.44Show/hide
Query:  NSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA
        NS NEIS+PLISMEEIKSDSTP   HRLISV+SD +L KP KSKR+ASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA
Subjt:  NSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA

Query:  LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAA
        LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVD+RKIRL GILQRIALAYLVVA VEVLSR+TQS+ QPFNHFSIFKSYFWNWLV A
Subjt:  LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAA

Query:  CILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEP
        CIL+VY ALLYG Y+PDWQFTVTDSDSV+YGRNFTVACGVRGSLDPPCNAVGYIDRKVLGI H+YAHPAWRRSEACTENSPYAG FR+NAPSWCFAPFEP
Subjt:  CILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEP

Query:  EGILSSISAILSTIIGVHFGHVLIHFQ
        EGILSSISAILS+IIGVHFGHVLIHFQ
Subjt:  EGILSSISAILSTIIGVHFGHVLIHFQ

XP_023545272.1 heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Cucurbita pepo subsp. pepo]2.2e-17090.83Show/hide
Query:  NSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA
        NS NEIS+PLISMEEIKSDSTP   HRLISV+SD +L KP KSKR+ASLDIFRGLTVALMILVDDAGG+WPMIGHAPWYGCNLADFVMPFFLFIVGMAIA
Subjt:  NSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA

Query:  LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAA
        LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVD+RKIRL GILQRIALAYLVVA VEVLSR+TQ++ QPFNHFSIFKSYFWNWLV A
Subjt:  LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAA

Query:  CILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEP
        CIL+VY ALLYG Y+PDWQFTVTDS+SV+YGRNFTVACGVRGSLDPPCNAVGYIDRKVLGI H+YAHPAWRRSEACTENSPYAG FRDNAPSWCFAPFEP
Subjt:  CILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEP

Query:  EGILSSISAILSTIIGVHFGHVLIHFQ
        EGILSSISAILS+IIGVHFGHVLIHFQ
Subjt:  EGILSSISAILSTIIGVHFGHVLIHFQ

XP_031736604.1 heparan-alpha-glucosaminide N-acetyltransferase isoform X2 [Cucumis sativus]3.3e-17997.48Show/hide
Query:  LISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQ
        LISMEEIK DST HHPHRLISVDSD LLPKP KSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQ
Subjt:  LISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQ

Query:  LMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILVVYFAL
        LMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSR+TQSNVQPFNHFSIFKSYFWNWLV ACILVVYFAL
Subjt:  LMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILVVYFAL

Query:  LYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISA
        LYGIYVPDWQFTVTDS+SVYYGRNFTVACGVRG+LDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISA
Subjt:  LYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISA

Query:  ILSTIIGVHFGHVLIHFQ
        ILSTIIGVHFGHVLIHFQ
Subjt:  ILSTIIGVHFGHVLIHFQ

TrEMBL top hitse value%identityAlignment
A0A0A0LGG4 Uncharacterized protein4.2e-18897.59Show/hide
Query:  MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV
        MDHGNNSPNEISQPLISMEEIK DST HHPHRLISVDSD LLPKP KSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV
Subjt:  MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV

Query:  GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN
        GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSR+TQSNVQPFNHFSIFKSYFWN
Subjt:  GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN

Query:  WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF
        WLV ACILVVYFALLYGIYVPDWQFTVTDS+SVYYGRNFTVACGVRG+LDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF
Subjt:  WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF

Query:  APFEPEGILSSISAILSTIIGVHFGHVLIHFQ
        APFEPEGILSSISAILSTIIGVHFGHVLIHFQ
Subjt:  APFEPEGILSSISAILSTIIGVHFGHVLIHFQ

A0A1S3BXP3 heparan-alpha-glucosaminide N-acetyltransferase2.4e-19199.7Show/hide
Query:  MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV
        MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV
Subjt:  MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIV

Query:  GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN
        GMAIALALKRI NQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN
Subjt:  GMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWN

Query:  WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF
        WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF
Subjt:  WLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCF

Query:  APFEPEGILSSISAILSTIIGVHFGHVLIHFQ
        APFEPEGILSSISAILSTIIGVHFGHVLIHFQ
Subjt:  APFEPEGILSSISAILSTIIGVHFGHVLIHFQ

A0A6J1G3Z7 heparan-alpha-glucosaminide N-acetyltransferase2.7e-17191.44Show/hide
Query:  NSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA
        NS NEIS+PLISMEEIKSDSTP   HRLISV+SD +L KP KSKR+ASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA
Subjt:  NSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA

Query:  LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAA
        LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVD+RKIRL GILQRIALAYLVVA VEVLSR+TQS+ QPFNHFSIFKSYFWNWLV A
Subjt:  LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAA

Query:  CILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEP
        CIL+VY ALLYG Y+PDWQFTVTDSDSV+YGRNFTVACGVRGSLDPPCNAVGYIDRKVLGI H+YAHPAWRRSEACTENSPYAG FR+NAPSWCFAPFEP
Subjt:  CILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEP

Query:  EGILSSISAILSTIIGVHFGHVLIHFQ
        EGILSSISAILS+IIGVHFGHVLIHFQ
Subjt:  EGILSSISAILSTIIGVHFGHVLIHFQ

A0A6J1KAR7 heparan-alpha-glucosaminide N-acetyltransferase isoform X12.3e-17090.83Show/hide
Query:  NSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA
        NS NEIS+PLISMEEIKSDSTP   HRLISV+SD +L KP KSKR+ASLDIFRGLTVALMILVDDAGG+WPMIGHAPWYGCNLADFVMPFFLFIVGMAIA
Subjt:  NSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA

Query:  LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAA
        LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVD+RKIRL GILQRIALAYLVVA VEVLSR+TQ+  +PFNHFSIFKSYFWNWLV A
Subjt:  LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAA

Query:  CILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEP
        CIL+VY ALLYG Y+PDWQFTVTDSDSV+YGRNFTVACGVRGSLDPPCNAVGYIDRKVLGI H+YAHPAWRRSEACTENSPYAG FRDNAPSWCFAPFEP
Subjt:  CILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEP

Query:  EGILSSISAILSTIIGVHFGHVLIHFQ
        EGILSSISAILS+IIGVHFGHVLIHFQ
Subjt:  EGILSSISAILSTIIGVHFGHVLIHFQ

A0A6J1KJL0 heparan-alpha-glucosaminide N-acetyltransferase isoform X25.0e-16591.11Show/hide
Query:  MEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQLMA
        MEEIKSDSTP   HRLISV+SD +L KP KSKR+ASLDIFRGLTVALMILVDDAGG+WPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQLMA
Subjt:  MEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQLMA

Query:  IEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILVVYFALLYG
        IEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVD+RKIRL GILQRIALAYLVVA VEVLSR+TQ+  +PFNHFSIFKSYFWNWLV ACIL+VY ALLYG
Subjt:  IEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILVVYFALLYG

Query:  IYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAILS
         Y+PDWQFTVTDSDSV+YGRNFTVACGVRGSLDPPCNAVGYIDRKVLGI H+YAHPAWRRSEACTENSPYAG FRDNAPSWCFAPFEPEGILSSISAILS
Subjt:  IYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAILS

Query:  TIIGVHFGHVLIHFQ
        +IIGVHFGHVLIHFQ
Subjt:  TIIGVHFGHVLIHFQ

SwissProt top hitse value%identityAlignment
Q3UDW8 Heparan-alpha-glucosaminide N-acetyltransferase3.5e-2228.48Show/hide
Query:  AKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALA----LKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSH
        + + RL  +D FRGL + LM+ V+  GG++    H+ W G  +AD V P+F+FI+G +I L+    L+R  ++L  + K+  R+  L+  G+++      
Subjt:  AKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALA----LKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSH

Query:  APDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVL--SRQTQSNVQPFNHFSI--FKSYFWNWLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGR
         P+     +   K+R+ G+LQR+ + Y VVA +E         S     + FS+    S +  WL    +  ++ AL + + VP          + Y G 
Subjt:  APDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVL--SRQTQSNVQPFNHFSI--FKSYFWNWLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGR

Query:  NFTVACGVRGSLDPPC--NAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAILSTIIGVHFGHVLIHFQ---
              G  G   P C   A GYIDR +LG NHLY HP    S     ++  A              ++PEG+L +I++I+   +GV  G +L++++   
Subjt:  NFTVACGVRGSLDPPC--NAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAILSTIIGVHFGHVLIHFQ---

Query:  RSLFGAVVPGFCIFGL
        +++        CI GL
Subjt:  RSLFGAVVPGFCIFGL

Q68CP4 Heparan-alpha-glucosaminide N-acetyltransferase8.6e-2128.06Show/hide
Query:  NEISQPLISMEEIK-SDSTPHHPHRLISVDSDVLLPKPAKSK------RLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVG
        N IS+ + S E  +  +S    P R   +D DV   +PA  +      RL S+D FRG+ + LM+ V+  GG++    HA W G  +AD V P+F+FI+G
Subjt:  NEISQPLISMEEIK-SDSTPHHPHRLISVDSDVLLPKPAKSK------RLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVG

Query:  MAIALA----LKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQ----SNVQPFNHFSI
         +I L+    L+R  ++   + K+  R+  L+  G+++       P+     +   K+R+ G+LQR+ + Y VVA +E+L  +      ++ +       
Subjt:  MAIALA----LKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQ----SNVQPFNHFSI

Query:  FKSYFWNWLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPC--NAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSF
          S +  WL+   +  ++  L + + VP          + Y G       G  G   P C   A GYIDR +LG +HLY HP    S A   ++  A   
Subjt:  FKSYFWNWLVAACILVVYFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPC--NAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSF

Query:  RDNAPSWCFAPFEPEGILSSISAILSTIIGVHFGHVLIHFQ---RSLFGAVVPGFCIFGL
                   ++PEGIL +I++I+   +GV  G +L++++   + +        CI GL
Subjt:  RDNAPSWCFAPFEPEGILSSISAILSTIIGVHFGHVLIHFQ---RSLFGAVVPGFCIFGL

Arabidopsis top hitse value%identityAlignment
AT5G27730.1 Protein of unknown function (DUF1624)1.2e-12366.56Show/hide
Query:  MEEIKSDSTPHHPHRLISVDSDVLLPKPAKS-----KRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIP
        M EIK + +  H   L+    D       +S      RLASLDIFRGLTVALMILVDDAGG+WPMI HAPW GCNLADFVMPFFLFIVG++IAL+LKRI 
Subjt:  MEEIKSDSTPHHPHRLISVDSDVLLPKPAKS-----KRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIP

Query:  NQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILVVYF
        N+  A +KV  RT KLLFWGLLLQGG+SHAPD+LTYGVDV  +R  GILQRIAL+YLVVA VE+ ++ +         FSIFKSY+W+W+VAA +LV+Y 
Subjt:  NQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILVVYF

Query:  ALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSI
        A LYG YVPDW+F V D DSV YG+  +V+CGVRG L+PPCNAVGY+DR+VLGINH+Y HPAWRRS+ACT++SPY G+ R +APSWC APFEPEGILSSI
Subjt:  ALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSI

Query:  SAILSTIIGVHFGHVLIHFQ
        SAILSTIIGVHFGH+++H +
Subjt:  SAILSTIIGVHFGHVLIHFQ

AT5G47900.1 Protein of unknown function (DUF1624)2.1e-8344.68Show/hide
Query:  LISMEEIK-SDSTPHHPHRLISVDSDVLLPK----PAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALK
        L   E IK +D   H       ++S + + +    P   +RL SLD+FRGLTVA MILVDD GG  P I H+PW G  LADFVMPFFLFIVG+++A A K
Subjt:  LISMEEIK-SDSTPHHPHRLISVDSDVLLPK----PAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALK

Query:  RIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILV
         +  + +A  K  +R+LKLL  GL LQGG+ H  + LTYG+DV KIRL GILQRIA+AYLVVA  E+     + N    +  S+ K Y ++W+VA  I  
Subjt:  RIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILV

Query:  VYFALLYGIYVPDWQFTVTDSD---SVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPE
        +Y +LLYG+YVPDW++ +   D   ++    N  V CGVRG   P CNAVG +DR  LGI HLY  P + R++ C+ N P  G    +APSWC APF+PE
Subjt:  VYFALLYGIYVPDWQFTVTDSD---SVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPE

Query:  GILSSISAILSTIIGVHFGHVLIHF---QRSLFGAVVPGFC---------IFGLYLEAVIVLLFHSNVIVTEAHEG
        G+LSS+ A ++ ++G+H+GH++IHF   ++ L   ++  FC         +FG++L   +  L  S + VT    G
Subjt:  GILSSISAILSTIIGVHFGHVLIHF---QRSLFGAVVPGFC---------IFGLYLEAVIVLLFHSNVIVTEAHEG

AT5G47900.2 Protein of unknown function (DUF1624)4.7e-8348.16Show/hide
Query:  LISMEEIK-SDSTPHHPHRLISVDSDVLLPK----PAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALK
        L   E IK +D   H       ++S + + +    P   +RL SLD+FRGLTVA MILVDD GG  P I H+PW G  LADFVMPFFLFIVG+++A A K
Subjt:  LISMEEIK-SDSTPHHPHRLISVDSDVLLPK----PAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALK

Query:  RIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILV
         +  + +A  K  +R+LKLL  GL LQGG+ H  + LTYG+DV KIRL GILQRIA+AYLVVA  E+     + N    +  S+ K Y ++W+VA  I  
Subjt:  RIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILV

Query:  VYFALLYGIYVPDWQFTVTDSD---SVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPE
        +Y +LLYG+YVPDW++ +   D   ++    N  V CGVRG   P CNAVG +DR  LGI HLY  P + R++ C+ N P  G    +APSWC APF+PE
Subjt:  VYFALLYGIYVPDWQFTVTDSD---SVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPE

Query:  GILSSISAILSTIIGVHFGHVLIHFQ
        G+LSS+ A ++ ++G+H+GH++IHF+
Subjt:  GILSSISAILSTIIGVHFGHVLIHFQ

AT5G47900.4 Protein of unknown function (DUF1624)4.3e-8451.54Show/hide
Query:  PAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPD
        P   +RL SLD+FRGLTVA MILVDD GG  P I H+PW G  LADFVMPFFLFIVG+++A A K +  + +A  K  +R+LKLL  GL LQGG+ H  +
Subjt:  PAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPD

Query:  KLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILVVYFALLYGIYVPDWQFTVTDSD---SVYYGRNFTV
         LTYG+DV KIRL GILQRIA+AYLVVA  E+     + N    +  S+ K Y ++W+VA  I  +Y +LLYG+YVPDW++ +   D   ++    N  V
Subjt:  KLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILVVYFALLYGIYVPDWQFTVTDSD---SVYYGRNFTV

Query:  ACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAILSTIIGVHFGHVLIHFQRS
         CGVRG   P CNAVG +DR  LGI HLY  P + R++ C+ N P  G    +APSWC APF+PEG+LSS+ A ++ ++G+H+GH++IHF+R+
Subjt:  ACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAILSTIIGVHFGHVLIHFQRS

AT5G47900.7 Protein of unknown function (DUF1624)5.6e-8448.17Show/hide
Query:  LISMEEIK-SDSTPHHPHRLISVDSDVLLPK----PAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALK
        L   E IK +D   H       ++S + + +    P   +RL SLD+FRGLTVA MILVDD GG  P I H+PW G  LADFVMPFFLFIVG+++A A K
Subjt:  LISMEEIK-SDSTPHHPHRLISVDSDVLLPK----PAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALK

Query:  RIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILV
         +  + +A  K  +R+LKLL  GL LQGG+ H  + LTYG+DV KIRL GILQRIA+AYLVVA  E+     + N    +  S+ K Y ++W+VA  I  
Subjt:  RIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILV

Query:  VYFALLYGIYVPDWQFTVTDSD---SVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPE
        +Y +LLYG+YVPDW++ +   D   ++    N  V CGVRG   P CNAVG +DR  LGI HLY  P + R++ C+ N P  G    +APSWC APF+PE
Subjt:  VYFALLYGIYVPDWQFTVTDSD---SVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPE

Query:  GILSSISAILSTIIGVHFGHVLIHFQRS
        G+LSS+ A ++ ++G+H+GH++IHF+R+
Subjt:  GILSSISAILSTIIGVHFGHVLIHFQRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCATGGTAACAACAGTCCAAACGAGATTTCTCAACCTCTCATTTCCATGGAAGAAATCAAGTCCGATTCCACTCCTCACCACCCCCACCGCCTTATCTCG
GTGGATTCCGATGTTTTGCTCCCAAAACCGGCGAAATCCAAGCGTCTTGCGTCGCTTGATATCTTCCGAGGTCTCACCGTTGCGTTGATGATTTTGGTTGATGAT
GCCGGGGGAGAATGGCCTATGATTGGTCATGCACCATGGTATGGTTGTAATCTTGCGGATTTTGTGATGCCTTTTTTCTTGTTCATTGTTGGGATGGCCATTGCA
CTCGCCCTGAAGAGAATACCTAACCAACTTATGGCCATTGAAAAGGTCACTCTTCGAACTTTAAAGCTCCTATTTTGGGGCCTTCTATTACAAGGTGGCTATTCG
CATGCGCCAGACAAACTGACTTATGGCGTTGATGTGAGAAAGATAAGGTTATTTGGGATTCTCCAGAGAATTGCTCTTGCATATTTGGTTGTGGCATTTGTTGAA
GTACTTTCAAGACAAACACAATCCAATGTTCAACCATTTAACCATTTCTCTATATTCAAGTCATACTTCTGGAATTGGCTGGTTGCAGCTTGTATTCTTGTAGTA
TACTTTGCTCTCCTTTACGGAATATATGTTCCGGATTGGCAATTTACTGTCACCGACAGCGATAGTGTTTATTATGGAAGAAACTTCACTGTAGCATGTGGTGTC
AGAGGAAGTCTGGATCCTCCATGTAATGCTGTGGGATATATTGACAGAAAAGTGCTGGGAATCAATCACTTGTATGCCCATCCTGCTTGGAGAAGATCTGAAGCT
TGCACCGAGAATTCTCCATATGCAGGATCTTTCCGAGATAATGCTCCATCATGGTGTTTTGCCCCATTTGAACCTGAAGGAATTTTAAGCTCTATATCTGCCATT
CTGTCCACAATTATTGGAGTACATTTCGGGCATGTTCTAATCCATTTTCAGAGATCCCTGTTTGGGGCGGTCGTCCCGGGATTTTGCATCTTCGGTTTATACTTG
GAAGCTGTAATTGTGTTATTATTTCATTCTAATGTCATCGTTACTGAGGCTCACGAAGGTCCCTTTGGTTTCATGCGGCAGTGCATGATGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCATGGTAACAACAGTCCAAACGAGATTTCTCAACCTCTCATTTCCATGGAAGAAATCAAGTCCGATTCCACTCCTCACCACCCCCACCGCCTTATCTCG
GTGGATTCCGATGTTTTGCTCCCAAAACCGGCGAAATCCAAGCGTCTTGCGTCGCTTGATATCTTCCGAGGTCTCACCGTTGCGTTGATGATTTTGGTTGATGAT
GCCGGGGGAGAATGGCCTATGATTGGTCATGCACCATGGTATGGTTGTAATCTTGCGGATTTTGTGATGCCTTTTTTCTTGTTCATTGTTGGGATGGCCATTGCA
CTCGCCCTGAAGAGAATACCTAACCAACTTATGGCCATTGAAAAGGTCACTCTTCGAACTTTAAAGCTCCTATTTTGGGGCCTTCTATTACAAGGTGGCTATTCG
CATGCGCCAGACAAACTGACTTATGGCGTTGATGTGAGAAAGATAAGGTTATTTGGGATTCTCCAGAGAATTGCTCTTGCATATTTGGTTGTGGCATTTGTTGAA
GTACTTTCAAGACAAACACAATCCAATGTTCAACCATTTAACCATTTCTCTATATTCAAGTCATACTTCTGGAATTGGCTGGTTGCAGCTTGTATTCTTGTAGTA
TACTTTGCTCTCCTTTACGGAATATATGTTCCGGATTGGCAATTTACTGTCACCGACAGCGATAGTGTTTATTATGGAAGAAACTTCACTGTAGCATGTGGTGTC
AGAGGAAGTCTGGATCCTCCATGTAATGCTGTGGGATATATTGACAGAAAAGTGCTGGGAATCAATCACTTGTATGCCCATCCTGCTTGGAGAAGATCTGAAGCT
TGCACCGAGAATTCTCCATATGCAGGATCTTTCCGAGATAATGCTCCATCATGGTGTTTTGCCCCATTTGAACCTGAAGGAATTTTAAGCTCTATATCTGCCATT
CTGTCCACAATTATTGGAGTACATTTCGGGCATGTTCTAATCCATTTTCAGAGATCCCTGTTTGGGGCGGTCGTCCCGGGATTTTGCATCTTCGGTTTATACTTG
GAAGCTGTAATTGTGTTATTATTTCATTCTAATGTCATCGTTACTGAGGCTCACGAAGGTCCCTTTGGTTTCATGCGGCAGTGCATGATGTCCTAG
Protein sequenceShow/hide protein sequence
MDHGNNSPNEISQPLISMEEIKSDSTPHHPHRLISVDSDVLLPKPAKSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIA
LALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRQTQSNVQPFNHFSIFKSYFWNWLVAACILVV
YFALLYGIYVPDWQFTVTDSDSVYYGRNFTVACGVRGSLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAI
LSTIIGVHFGHVLIHFQRSLFGAVVPGFCIFGLYLEAVIVLLFHSNVIVTEAHEGPFGFMRQCMMS