CuGenDBv2

Clc02G12680 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G12680
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: heparan-alpha-glucosaminide N-acetyltransferase-like
Genome location: ClcChr02:24561632..24567116
RNA-Seq Expression: Clc02G12680
Synteny: Clc02G12680
Gene Ontology terms:
  GO:0055085 - transmembrane transport (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0015267 - channel activity (molecular function)
  GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR012429 - Heparan-alpha-glucosaminide N-acetyltransferase, catalytic domain
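The genome location field above follows a `sequence:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it and reports the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1. The record itself does not confirm this.
    """
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return seqid, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("ClcChr02:24561632..24567116"))
```

Under the inclusive-coordinate assumption, the gene span on ClcChr02 is 5485 bp.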


Homology
GenBank top hits (e value, % identity, alignment)
KAA0045743.1 heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis melo var. makuwa] | e value: 9.0e-218 | % identity: 89.21
Query:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY
        M DS+PLLKNQQELP S GKAPRVVSLDVFRG SVFMMM VDYGGSFLPII+HSPW GLHLADFVMPWFLFIAG SVALVYKEVKS+  A RNAACR LY
Subjt:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS
        LFLLGVLLQGGYFHGITSLTYGVD+ERI       RIS+GYLIAALCEIWLTR TREEAQHTKSFSWHWC +F +LSLYM LSYGLYVPDWDFKISAPSS
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS

Query:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA
        SLP +GSYVY VNC+LRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWC APFEPEGLLSSLTATVACIIGLQYGHIL+KA
Subjt:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR
        QDHK+RTN WFLLS KILALG+FLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKH+LSIY+LVISNILVIGLQGFYW+
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR

Query:  SPNNNIVHWIVSRVKSQ
        SPNNNIVHWIVS VK++
Subjt:  SPNNNIVHWIVSRVKSQ

KAG6604943.1 Heparan-alpha-glucosaminide N-acetyltransferase, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.6e-219 | % identity: 87.71
Query:  FSPESMADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAA
        F PESMADSQPLLKN+QELP+S  KAPRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAG S+ALVYKEVK +VTATRNAA
Subjt:  FSPESMADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAA

Query:  CRALYLFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKI
         R LYLFLLGVLLQGGYFHGITSLTYGVDM+RI       RISVGYLIAALCEIWLTRCTREEAQ+TKSFSWHWC +FL+LSLY GL YGLYVPDW+FKI
Subjt:  CRALYLFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKI

Query:  SAPSSSLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGH
        S  SSS PPNGSYVYMVNC++RGD+GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQ PETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGH
Subjt:  SAPSSSLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGH

Query:  ILSKAQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQ
        IL+  QDHKSRTN+WF LSLKIL LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIY+LVISNILVIG+Q
Subjt:  ILSKAQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQ

Query:  GFYWRSPNNNIVHWIVSRVKSQS
        GFYW+SP NNIVHWI+SRVK+QS
Subjt:  GFYWRSPNNNIVHWIVSRVKSQS

XP_022948115.1 heparan-alpha-glucosaminide N-acetyltransferase-like [Cucurbita moschata] | e value: 4.0e-218 | % identity: 87.8
Query:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY
        MADSQPLLKN+QELP+S  K PRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAG S+ALVYKEVK +VTAT+NAACR LY
Subjt:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS
        LFLLGVLLQGGYFHGITSLTYGVDM+RI       RISVGYLIAALCEIWLTRCTREEAQ+TKSFSWHWC +FL+LSLYMGL YGLYVPDW+FKIS  SS
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS

Query:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA
        S PPNGSYVYMVNC++RGD+GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQ PETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHIL+  
Subjt:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR
        QDHKSRTN+WF LSLKIL LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIY+LVISNILVIG+QGFYW+
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR

Query:  SPNNNIVHWIVSRVKSQS
        SP NNIVHWI+SRVK+QS
Subjt:  SPNNNIVHWIVSRVKSQS

XP_023533170.1 heparan-alpha-glucosaminide N-acetyltransferase-like [Cucurbita pepo subsp. pepo] | e value: 3.9e-221 | % identity: 88.42
Query:  FSPESMADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAA
        F PESMADSQPLLKN+QELP+S  KAPRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAG S+ALVYKEVK +VTATRNAA
Subjt:  FSPESMADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAA

Query:  CRALYLFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKI
        CR LYLFL+GVLLQGGYFHGITSLTYGVDM+RI       RISVGYLIAALCEIWLTRCTREEAQ+TKSFSWHWC +FL+LSLYMGLSYGLYVPDW+FKI
Subjt:  CRALYLFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKI

Query:  SAPSSSLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGH
        S  SS LPPNGSYVYMVNC++RGD+GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQ PETSPSWCHA FEPEGLLSSLTATVACIIGLQYGH
Subjt:  SAPSSSLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGH

Query:  ILSKAQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQ
        IL+  QDHKSRTNSWF LSLKIL LGIFLVFIGIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIY+LVISNILVIG+Q
Subjt:  ILSKAQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQ

Query:  GFYWRSPNNNIVHWIVSRVKSQS
        GFYW+SP NNIVHWI+SRVK+QS
Subjt:  GFYWRSPNNNIVHWIVSRVKSQS

XP_038900866.1 heparan-alpha-glucosaminide N-acetyltransferase-like [Benincasa hispida] | e value: 1.3e-221 | % identity: 90.19
Query:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY
        MADS+PLL+NQQELPES GKAPRVVSLDVFRG SVFMM+FVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAG SVALVYK+VKS+VTA RNA CR LY
Subjt:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS
        LFLLGVLLQGGYFHGITSLTYGVDMERI       RIS+GYL+AALCEIWLTR  REEAQ TKSFSWHWC +F +LSLYMGLSYGLYVPDWDF+ISA SS
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS

Query:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA
        SLPPNGSYVYMVNC+L+GDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGH L+KA
Subjt:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR
        QDHKSRT SWFLLSLKI+ALGIFLVFIG+PVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIY+LVISNILV GLQGFYW+
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR

Query:  SPNNNIVHWIVSRVKSQS
        SP NNIVHWIVSRVK+QS
Subjt:  SPNNNIVHWIVSRVKSQS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LLN8 DUF1624 domain-containing protein | e value: 3.2e-213 | % identity: 89.43
Query:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY
        MADS+PLLKNQQELP S GKAPRVVSLDVFRG SVFMMM VDYGGSFLPII+HSPW GLHLADFVMPWFLFIAG SVALVYKEV+S+V A RNAACR LY
Subjt:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS
        LFLLGV LQGGYFHGITSLTYGVD+E I       RIS+GYLIAALCEIWLTRCTREEAQHTKSFSWHWC +F +LSLYMGLSYGLYVPDWDFKISAPSS
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS

Query:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA
        SLP +GSYVY VNC+LRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWC APFEPEGLLSSLTATVACIIGLQYGHIL++A
Subjt:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR
        QDHK+RTN WFLLS KILA GIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTC LEWMGKH+LSIY+LVISNILVIGLQGFYW+
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR

Query:  SPNNNIV
        SPNNNIV
Subjt:  SPNNNIV

A0A1S3C640 LOW QUALITY PROTEIN: heparan-alpha-glucosaminide N-acetyltransferase-like | e value: 9.7e-218 | % identity: 88.97
Query:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY
        M DS+PLLKNQQELP S GKAPRVVSLDVFRG SVFMMM VDYGGSFLPII+HSPW GLHLADFVMPWFLFIAG SVALVYKEVKS+  A RNAACR LY
Subjt:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS
        LFLLGVLLQGGYFHGITSLTYGVD+ERI       RIS+GYLIAALCEIWLTR TREEAQHTKSFSWHWC +F +LSLYM LSYGLYVPDWDFKISAPSS
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS

Query:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA
        SLP +GSYVY VNC+LRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWC APFEPEGLLSSLTATVACIIGLQYGHIL+KA
Subjt:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR
        QDHK+RTN WFLLSL+ LALG+FLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKH+LSIY+LVISNILVIGLQGFYW+
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR

Query:  SPNNNIVHWIVSRVKSQ
        SPNNNIVHWIVS VK++
Subjt:  SPNNNIVHWIVSRVKSQ

A0A5D3BKQ6 Heparan-alpha-glucosaminide N-acetyltransferase-like | e value: 4.3e-218 | % identity: 89.21
Query:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY
        M DS+PLLKNQQELP S GKAPRVVSLDVFRG SVFMMM VDYGGSFLPII+HSPW GLHLADFVMPWFLFIAG SVALVYKEVKS+  A RNAACR LY
Subjt:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS
        LFLLGVLLQGGYFHGITSLTYGVD+ERI       RIS+GYLIAALCEIWLTR TREEAQHTKSFSWHWC +F +LSLYM LSYGLYVPDWDFKISAPSS
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS

Query:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA
        SLP +GSYVY VNC+LRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWC APFEPEGLLSSLTATVACIIGLQYGHIL+KA
Subjt:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR
        QDHK+RTN WFLLS KILALG+FLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKH+LSIY+LVISNILVIGLQGFYW+
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR

Query:  SPNNNIVHWIVSRVKSQ
        SPNNNIVHWIVS VK++
Subjt:  SPNNNIVHWIVSRVKSQ

A0A6J1G8U1 heparan-alpha-glucosaminide N-acetyltransferase-like | e value: 1.9e-218 | % identity: 87.8
Query:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY
        MADSQPLLKN+QELP+S  K PRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAG S+ALVYKEVK +VTAT+NAACR LY
Subjt:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS
        LFLLGVLLQGGYFHGITSLTYGVDM+RI       RISVGYLIAALCEIWLTRCTREEAQ+TKSFSWHWC +FL+LSLYMGL YGLYVPDW+FKIS  SS
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS

Query:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA
        S PPNGSYVYMVNC++RGD+GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQ PETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHIL+  
Subjt:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR
        QDHKSRTN+WF LSLKIL LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIY+LVISNILVIG+QGFYW+
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR

Query:  SPNNNIVHWIVSRVKSQS
        SP NNIVHWI+SRVK+QS
Subjt:  SPNNNIVHWIVSRVKSQS

A0A6J1I5L8 heparan-alpha-glucosaminide N-acetyltransferase-like | e value: 1.6e-217 | % identity: 87.56
Query:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY
        MADSQPLL+N+QELP+S  KAPRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAG S+ALVYKEV  +VTAT+NAACR LY
Subjt:  MADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALY

Query:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS
        LFLLGVLLQGGYFHGITSLTYGVDM+RI       RISVGYLIAALCEIWLT C REEAQ+TKSFSWHWC +FL+LSLYMGLSYGLYVPDW+FKIS  SS
Subjt:  LFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSS

Query:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA
        SLPPNGSYVYMVNC++RGD+GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQ PETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHIL+  
Subjt:  SLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKA

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR
        QDHKSRTNSWF LSLKI  LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIY+LVISNILVIG+QGFYW+
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWR

Query:  SPNNNIVHWIVSRVKSQS
        SP NNIVHWI+SRVK+QS
Subjt:  SPNNNIVHWIVSRVKSQS

SwissProt top hits (e value, % identity, alignment)
Q3UDW8 Heparan-alpha-glucosaminide N-acetyltransferase | e value: 6.2e-28 | % identity: 29.34
Query:  APRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVAL----VYKEVKSRVTATRNAACRALYLFLLGVLL-QGGYFHG
        A R+  +D FRG ++ +M+FV+YGG       HS WNGL +AD V PWF+FI G S+ L    + +   S++        R+  L  +GV++    Y  G
Subjt:  APRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVAL----VYKEVKSRVTATRNAACRALYLFLLGVLL-QGGYFHG

Query:  ITSLTYGVDMERI-----RISVGYLIAALCEIWLTR-----CTREEAQHTK---SFSW-HWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSY
          S     D  RI     R+ V Y + A+ E +  +     CT E +  +    + SW  W  +  + S+++ L++ L VP        P+  L P G  
Subjt:  ITSLTYGVDMERI-----RISVGYLIAALCEIWLTR-----CTREEAQHTK---SFSW-HWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSY

Query:  VYMVNCALRGDLG--PAC--NSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKAQDHK
                 GDLG  P C   +AG IDR +LG +HLY  P    L    ++                ++PEG+L ++ + V   +G+Q G IL   +D  
Subjt:  VYMVNCALRGDLG--PAC--NSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKAQDHK

Query:  SRTNSWFLLSLKILAL-GIFLVFIG-----IPVNKSLYTVSYMLITSASAGIIFCALYILVDIHG--------YRRLTCVLEWMGKHALSIY
            + F     IL L  I L  +      IP+NK+L+++SY+   S  A  I   LY +VD+ G        Y  +  +L ++G   L  Y
Subjt:  SRTNSWFLLSLKILAL-GIFLVFIG-----IPVNKSLYTVSYMLITSASAGIIFCALYILVDIHG--------YRRLTCVLEWMGKHALSIY

Q68CP4 Heparan-alpha-glucosaminide N-acetyltransferase | e value: 3.3e-29 | % identity: 29.27
Query:  DSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVAL----VYKEVKSRVTATRNAACRA
        D QP       LP      PR+ S+D FRG ++ +M+FV+YGG       H+ WNGL +AD V PWF+FI G+S+ L    + +   S+       A R+
Subjt:  DSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVAL----VYKEVKSRVTATRNAACRA

Query:  LYLFLLGVLL-QGGYFHGITSLTYGVDMERI-----RISVGYLIAALCEIWLTRCTREEAQHTKS--------FSW-HWCFMFLVLSLYMGLSYGLYVPD
          L  +G+++    Y  G  S     D  RI     R+ V Y + A+ E+   +   E     +S         SW  W  + ++  L++GL++ L VP 
Subjt:  LYLFLLGVLL-QGGYFHGITSLTYGVDMERI-----RISVGYLIAALCEIWLTRCTREEAQHTKS--------FSW-HWCFMFLVLSLYMGLSYGLYVPD

Query:  WDFKISAPSSSLPPNGSYVYMVNCALRGDLG--PAC--NSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVA
               P+  L P G           GD G  P C   +AG IDR +LG  HLY  P    L    ++                ++PEG+L ++ + V 
Subjt:  WDFKISAPSSSLPPNGSYVYMVNCALRGDLG--PAC--NSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVA

Query:  CIIGLQYGHIL----SKAQDHKSRTNSW-FLLSLKILALGIFLVFIG-IPVNKSLYTVSYMLITSASAGIIFCALYILVDIHG--------YRRLTCVLE
          +G+Q G IL    ++ +D   R  +W  +L L  +AL       G IPVNK+L+++SY+   S+ A  I   LY +VD+ G        Y  +  +L 
Subjt:  CIIGLQYGHIL----SKAQDHKSRTNSW-FLLSLKILALGIFLVFIG-IPVNKSLYTVSYMLITSASAGIIFCALYILVDIHG--------YRRLTCVLE

Query:  WMGKHALSIY
        ++G      Y
Subjt:  WMGKHALSIY

Arabidopsis top hits (e value, % identity, alignment)
AT5G27730.1 Protein of unknown function (DUF1624) | e value: 6.9e-91 | % identity: 41.98
Query:  GKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALYLFLLGVLLQGGYFHGITS
        G  PR+ SLD+FRG +V +M+ VD  G   P+IAH+PWNG +LADFVMP+FLFI G S+AL  K + ++  A +    R   L   G+LLQGG+ H    
Subjt:  GKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALYLFLLGVLLQGGYFHGITS

Query:  LTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHT------KSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSYVYMV
        LTYGVD+  +       RI++ YL+ AL EI+ T+ + EE   T      KS+ WHW     VL +Y+   YG YVPDW+F +    S L      +  V
Subjt:  LTYGVDMERI-------RISVGYLIAALCEIWLTRCTREEAQHT------KSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSYVYMV

Query:  NCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISS--SGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKAQDHKSRTNSW
        +C +RG L P CN+ G +DR VLGI+H+Y  P +R  K C   S   G   + +PSWC APFEPEG+LSS++A ++ IIG+ +GHI+   + H +R   W
Subjt:  NCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISS--SGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKAQDHKSRTNSW

Query:  FLLSLKILALGIFLVFIGI-PVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWRSPNNNIVHW
            L +LALG+ L F  + P+NK LY+ SY+ +TS +A ++F +LY LVDI  ++ +   L+W+G +A+ +Y++    IL     G+Y+R P+N +++W
Subjt:  FLLSLKILALGIFLVFIGI-PVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWRSPNNNIVHW

Query:  IVSRV
        I   V
Subjt:  IVSRV

AT5G47900.1 Protein of unknown function (DUF1624) | e value: 1.5e-114 | % identity: 49.49
Query:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALYLFLLGVLLQGGYFHGITSLTYG
        R+VSLDVFRG +V  M+ VD  G  LP I HSPW+G+ LADFVMP+FLFI G S+A  YK +  R  ATR A  R+L L LLG+ LQGG+ HG+ +LTYG
Subjt:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALYLFLLGVLLQGGYFHGITSLTYG

Query:  VDMERI-------RISVGYLIAALCEIWL--TRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSYVYMVNCALRGDL
        +D+E+I       RI++ YL+ ALCEIWL        E    K + +HW   F++ ++Y+ L YGLYVPDW+++I                V C +RG  
Subjt:  VDMERI-------RISVGYLIAALCEIWL--TRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSYVYMVNCALRGDL

Query:  GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKAQDHKSRTNSWFLLSLKIL
        GP CN+ GM+DR  LGI HLY KPVY   K+C+I+  ++G  P  +PSWC APF+PEGLLSSL ATV C++GL YGHI+   +DHK R N W L S  +L
Subjt:  GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKAQDHKSRTNSWFLLSLKIL

Query:  ALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWRSPNNNIVHWI
         LG+ L   G+ +NK LYT+SYM +TS ++G +  A+Y++VD++GY+R + VLEWMG HAL IY+L+  N++ + + GFYW++P NN++H I
Subjt:  ALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWRSPNNNIVHWI

AT5G47900.4 Protein of unknown function (DUF1624) | e value: 2.1e-95 | % identity: 45.02
Query:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALYLFLLGVLLQGGYFHGITSLTYG
        R+VSLDVFRG +V  M+ VD  G  LP I HSPW+G+ LADFVMP+FLFI G S+A  YK +  R  ATR A  R+L L LLG+ LQGG+ HG+ +LTYG
Subjt:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALYLFLLGVLLQGGYFHGITSLTYG

Query:  VDMERI-------RISVGYLIAALCEIWL--TRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSYVYMVNCALRGDL
        +D+E+I       RI++ YL+ ALCEIWL        E    K + +HW   F++ ++Y+ L YGLYVPDW+++I                V C +RG  
Subjt:  VDMERI-------RISVGYLIAALCEIWL--TRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSYVYMVNCALRGDL

Query:  GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHIL-------SKAQDHKSRTNSWF
        GP CN+ GM+DR  LGI HLY KPVY   K+C+I+  ++G  P  +PSWC APF+PEGLLSSL ATV C++GL YGHI+       SK Q +   + S  
Subjt:  GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHIL-------SKAQDHKSRTNSWF

Query:  LLSLKILALGIFLVFIGIPV---NKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWRSPNNNIVH
          S K      F  F+   V    + L+ +   +I     G        LVD++GY+R + VLEWMG HAL IY+L+  N++ + + GFYW++P NN++H
Subjt:  LLSLKILALGIFLVFIGIPV---NKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWRSPNNNIVH

Query:  WI
         I
Subjt:  WI

AT5G47900.6 Protein of unknown function (DUF1624) | e value: 4.8e-92 | % identity: 47.42
Query:  SRVTATRNAACRALYLFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWL--TRCTREEAQHTKSFSWHWCFMFLVLSLYMGLS
        S+  ATR A  R+L L LLG+ LQGG+ HG+ +LTYG+D+E+I       RI++ YL+ ALCEIWL        E    K + +HW   F++ ++Y+ L 
Subjt:  SRVTATRNAACRALYLFLLGVLLQGGYFHGITSLTYGVDMERI-------RISVGYLIAALCEIWL--TRCTREEAQHTKSFSWHWCFMFLVLSLYMGLS

Query:  YGLYVPDWDFKISAPSSSLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSL
        YGLYVPDW+++I                V C +RG  GP CN+ GM+DR  LGI HLY KPVY   K+C+I+  ++G  P  +PSWC APF+PEGLLSSL
Subjt:  YGLYVPDWDFKISAPSSSLPPNGSYVYMVNCALRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSL

Query:  TATVACIIGLQYGHILSKAQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSI
         ATV C++GL YGHI+   +DHK R N W L S  +L LG+ L   G+ +NK LYT+SYM +TS ++G +  A+Y++VD++GY+R + VLEWMG HAL I
Subjt:  TATVACIIGLQYGHILSKAQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSI

Query:  YILVISNILVIGLQGFYWRSPNNNIVHWI
        Y+L+  N++ + + GFYW++P NN++H I
Subjt:  YILVISNILVIGLQGFYWRSPNNNIVHWI

AT5G47900.7 Protein of unknown function (DUF1624) | e value: 1.5e-90 | % identity: 45.24
Query:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALYLFLLGVLLQGGYFHGITSLTYG
        R+VSLDVFRG +V  M+ VD  G  LP I HSPW+G+ LADFVMP+FLFI G S+A  YK +  R  ATR A  R+L L LLG+ LQGG+ HG+ +LTYG
Subjt:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRALYLFLLGVLLQGGYFHGITSLTYG

Query:  VDMERI-------RISVGYLIAALCEIWL--TRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSYVYMVNCALRGDL
        +D+E+I       RI++ YL+ ALCEIWL        E    K + +HW   F++ ++Y+ L YGLYVPDW+++I                V C +RG  
Subjt:  VDMERI-------RISVGYLIAALCEIWL--TRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSYVYMVNCALRGDL

Query:  GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHIL-------SKA-----------
        GP CN+ GM+DR  LGI HLY KPVY   K+C+I+  ++G  P  +PSWC APF+PEGLLSSL ATV C++GL YGHI+       SK            
Subjt:  GPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHIL-------SKA-----------

Query:  -------------------QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILV
                           QDHK R N W L S  +L LG+ L   G+ +NK LYT+SYM +TS ++G +  A+Y++V
Subjt:  -------------------QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILV


Sequences
CDS sequence
ATGACTTTGGGCAACGGCTTGTTCTCGCCGGAATCAATGGCCGATTCTCAACCGCTGCTCAAGAATCAACAGGAGTTGCCGGAGTCCTGTGGCAAGGCTCCACGAGTTGT
CTCACTTGACGTCTTTCGCGGCTTCAGCGTCTTTATGATGATGTTCGTGGACTACGGTGGCTCTTTTTTACCAATTATCGCTCATTCGCCATGGAATGGACTTCATTTGG
CTGATTTTGTGATGCCTTGGTTTCTATTTATTGCGGGAGCTTCGGTTGCACTTGTTTATAAAGAAGTAAAAAGTAGAGTGACCGCTACAAGGAATGCAGCATGCAGGGCC
CTGTACCTCTTTCTCCTGGGAGTTCTTCTTCAAGGTGGTTATTTTCATGGAATAACATCTTTGACATATGGCGTTGATATGGAAAGGATTAGAATATCTGTTGGATACTT
AATTGCTGCACTCTGTGAGATCTGGCTAACTCGTTGCACACGTGAAGAAGCTCAACATACTAAGAGTTTCAGCTGGCATTGGTGTTTCATGTTTCTTGTGCTGTCATTGT
ATATGGGACTGTCGTATGGTTTATACGTTCCGGATTGGGACTTCAAAATATCAGCCCCAAGCTCCTCACTGCCACCAAATGGAAGCTATGTTTACATGGTGAATTGTGCT
CTTCGAGGTGATTTGGGACCTGCTTGTAATTCTGCTGGCATGATTGATCGTTATGTTCTTGGTATTCACCATTTGTATACTAAACCTGTCTACAGAAATCTAAAGGAGTG
CAATATTTCCTCCAGCGGTCAATTCCCTGAGACTTCACCTTCATGGTGTCATGCTCCTTTTGAACCTGAAGGTCTGTTAAGCTCTCTAACAGCTACAGTGGCATGCATAA
TCGGACTTCAGTATGGTCACATTCTCTCCAAAGCACAGGATCACAAAAGTCGTACCAATAGCTGGTTCTTACTCTCGCTTAAGATTTTGGCTCTCGGAATATTCCTCGTC
TTTATAGGTATCCCTGTAAATAAGTCCCTCTACACAGTCAGCTATATGCTGATTACTTCAGCGTCAGCAGGAATAATCTTTTGTGCTTTATATATCTTGGTGGACATCCA
CGGCTATCGGCGCTTGACATGTGTTCTGGAATGGATGGGGAAGCATGCTTTAAGTATTTATATTTTAGTAATCTCTAACATACTCGTTATTGGGCTCCAAGGATTCTACT
GGAGATCTCCTAACAATAACATCGTGCACTGGATTGTTAGTCGTGTCAAATCTCAAAGTTGA
mRNA sequence
ATGACTTTGGGCAACGGCTTGTTCTCGCCGGAATCAATGGCCGATTCTCAACCGCTGCTCAAGAATCAACAGGAGTTGCCGGAGTCCTGTGGCAAGGCTCCACGAGTTGT
CTCACTTGACGTCTTTCGCGGCTTCAGCGTCTTTATGATGATGTTCGTGGACTACGGTGGCTCTTTTTTACCAATTATCGCTCATTCGCCATGGAATGGACTTCATTTGG
CTGATTTTGTGATGCCTTGGTTTCTATTTATTGCGGGAGCTTCGGTTGCACTTGTTTATAAAGAAGTAAAAAGTAGAGTGACCGCTACAAGGAATGCAGCATGCAGGGCC
CTGTACCTCTTTCTCCTGGGAGTTCTTCTTCAAGGTGGTTATTTTCATGGAATAACATCTTTGACATATGGCGTTGATATGGAAAGGATTAGAATATCTGTTGGATACTT
AATTGCTGCACTCTGTGAGATCTGGCTAACTCGTTGCACACGTGAAGAAGCTCAACATACTAAGAGTTTCAGCTGGCATTGGTGTTTCATGTTTCTTGTGCTGTCATTGT
ATATGGGACTGTCGTATGGTTTATACGTTCCGGATTGGGACTTCAAAATATCAGCCCCAAGCTCCTCACTGCCACCAAATGGAAGCTATGTTTACATGGTGAATTGTGCT
CTTCGAGGTGATTTGGGACCTGCTTGTAATTCTGCTGGCATGATTGATCGTTATGTTCTTGGTATTCACCATTTGTATACTAAACCTGTCTACAGAAATCTAAAGGAGTG
CAATATTTCCTCCAGCGGTCAATTCCCTGAGACTTCACCTTCATGGTGTCATGCTCCTTTTGAACCTGAAGGTCTGTTAAGCTCTCTAACAGCTACAGTGGCATGCATAA
TCGGACTTCAGTATGGTCACATTCTCTCCAAAGCACAGGATCACAAAAGTCGTACCAATAGCTGGTTCTTACTCTCGCTTAAGATTTTGGCTCTCGGAATATTCCTCGTC
TTTATAGGTATCCCTGTAAATAAGTCCCTCTACACAGTCAGCTATATGCTGATTACTTCAGCGTCAGCAGGAATAATCTTTTGTGCTTTATATATCTTGGTGGACATCCA
CGGCTATCGGCGCTTGACATGTGTTCTGGAATGGATGGGGAAGCATGCTTTAAGTATTTATATTTTAGTAATCTCTAACATACTCGTTATTGGGCTCCAAGGATTCTACT
GGAGATCTCCTAACAATAACATCGTGCACTGGATTGTTAGTCGTGTCAAATCTCAAAGTTGAAATAATTCGAGTAGAGCTGTGTAGAGTGTTCAGCATTTGGAACTGCTA
ACTTGCTGGGGATAATTCACTTATCGGTAAAATCATCTCCAGCATGTTCGAAATCACACTGCAAATATTGAGAAAAAAATTCAATAAGAACTGTTGCTGGTAAGATCACC
ACTTGATTTTTAGTGTTAGCATGTATGTTATGTATGTTTAACATTGCGGTCATTTGCCTGACACTAAAATTGTAATTGTTGTATTCTATGTGTGCAACTCCTAGAACTGA
GATATTTTTTATTACCCCCCCTGTAGAAATTTGGAGGCAAACCTTTGTTAGCCAACAAGTTTTTGACTTTGGAGATTGAACATTTGATGGATGTTCAAATGTTTATAATG
AATTTATCTATATTTGTG
Protein sequence
MTLGNGLFSPESMADSQPLLKNQQELPESCGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGASVALVYKEVKSRVTATRNAACRA
LYLFLLGVLLQGGYFHGITSLTYGVDMERIRISVGYLIAALCEIWLTRCTREEAQHTKSFSWHWCFMFLVLSLYMGLSYGLYVPDWDFKISAPSSSLPPNGSYVYMVNCA
LRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSKAQDHKSRTNSWFLLSLKILALGIFLV
FIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYILVISNILVIGLQGFYWRSPNNNIVHWIVSRVKSQS
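
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in directly. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons and last five codons of the CDS listed above.
    print(translate("ATGACTTTGGGCAACGGC"))
    print(translate("AAATCTCAAAGTTGA"))
```

Applied to the first and last codons of the CDS, this reproduces the start (`MTLGNG`) and end (`KSQS`) of the listed protein sequence; pasting the full CDS would allow the same comparison over the whole length.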