; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G005000 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G005000
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionheparan-alpha-glucosaminide N-acetyltransferase-like
Genome locationchr10:7155374..7162302
RNA-Seq ExpressionLsi10G005000
SyntenyLsi10G005000
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0015267 - channel activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR012429 - Heparan-alpha-glucosaminide N-acetyltransferase, catalytic domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604943.1 Heparan-alpha-glucosaminide N-acetyltransferase, partial [Cucurbita argyrosperma subsp. sororia]3.3e-20184.21Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        MADSQPLLKN+Q LP+SS KAPRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAGVS+ALVYKEVK KVTATRNAA RGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFLLGVLLQG    G +   Y         +  + RISVGYLIAALCEIWLTRCT EEAQ+TKSFSWHWCI+FLLLSLY GL YGLYVPDW+FKIS  SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
        S PPNGSYVYMVNCS+RGD+GPACNSAGMIDRYVLGIHH   +     LKECNISSSGQ PETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILA  
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHKSRTN+WF LSLKIL LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIYVLVISNILVIG+QGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIVHWIVSRVKAQS
        SP NNIVHWI+SRVKAQS
Subjt:  SPNNNIVHWIVSRVKAQS

XP_022948115.1 heparan-alpha-glucosaminide N-acetyltransferase-like [Cucurbita moschata]3.9e-20284.21Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        MADSQPLLKN+Q LP+SS K PRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAGVS+ALVYKEVK KVTAT+NAACRGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFLLGVLLQG    G +   Y         +  + RISVGYLIAALCEIWLTRCT EEAQ+TKSFSWHWCI+FLLLSLYMGL YGLYVPDW+FKIS  SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
        S PPNGSYVYMVNCS+RGD+GPACNSAGMIDRYVLGIHH   +     LKECNISSSGQ PETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILA  
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHKSRTN+WF LSLKIL LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIYVLVISNILVIG+QGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIVHWIVSRVKAQS
        SP NNIVHWI+SRVKAQS
Subjt:  SPNNNIVHWIVSRVKAQS

XP_022971023.1 heparan-alpha-glucosaminide N-acetyltransferase-like [Cucurbita maxima]3.3e-20183.97Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        MADSQPLL+N+Q LP+SS KAPRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAGVS+ALVYKEV  KVTAT+NAACRGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFLLGVLLQG    G +   Y         +  + RISVGYLIAALCEIWLT C  EEAQ+TKSFSWHWCI+FLLLSLYMGLSYGLYVPDW+FKIS  SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
        SLPPNGSYVYMVNCS+RGD+GPACNSAGMIDRYVLGIHH   +     LKECNISSSGQ PETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILA  
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHKSRTNSWF LSLKI  LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIYVLVISNILVIG+QGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIVHWIVSRVKAQS
        SP NNIVHWI+SRVKAQS
Subjt:  SPNNNIVHWIVSRVKAQS

XP_023533170.1 heparan-alpha-glucosaminide N-acetyltransferase-like [Cucurbita pepo subsp. pepo]7.9e-20384.93Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        MADSQPLLKN+Q LP+SS KAPRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAGVS+ALVYKEVK KVTATRNAACRGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFL+GVLLQG    G +   Y         +  + RISVGYLIAALCEIWLTRCT EEAQ+TKSFSWHWCI+FLLLSLYMGLSYGLYVPDW+FKIS  SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
         LPPNGSYVYMVNCS+RGD+GPACNSAGMIDRYVLGIHH   +     LKECNISSSGQ PETSPSWCHA FEPEGLLSSLTATVACIIGLQYGHILA  
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHKSRTNSWF LSLKIL LGIFLVFIGIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIYVLVISNILVIG+QGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIVHWIVSRVKAQS
        SP NNIVHWI+SRVKAQS
Subjt:  SPNNNIVHWIVSRVKAQS

XP_038900866.1 heparan-alpha-glucosaminide N-acetyltransferase-like [Benincasa hispida]3.8e-20586.12Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        MADS+PLL+NQQ LPESSGKAPRVVSLDVFRG SVFMM+FVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYK+VKSKVTA RNA CRGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFLLGVLLQG    G +   Y         +  + RIS+GYL+AALCEIWLTR   EEAQ TKSFSWHWCI+F LLSLYMGLSYGLYVPDWDF+ISA SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
        SLPPNGSYVYMVNCSL+GDLGPACNSAGMIDRYVLGIHH   +     LKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGH LAK+
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHKSRT SWFLLSLKI+ALGIFLVFIG+PVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILV GLQGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIVHWIVSRVKAQS
        SP NNIVHWIVSRVKAQS
Subjt:  SPNNNIVHWIVSRVKAQS

TrEMBL top hitse value%identityAlignment
A0A0A0LLN8 DUF1624 domain-containing protein2.6e-19685.26Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        MADS+PLLKNQQ LP SSGKAPRVVSLDVFRG SVFMMM VDYGGSFLPII+HSPW GLHLADFVMPWFLFIAGVSVALVYKEV+SKV A RNAACRGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFLLGV LQG    G +   Y         +  + RIS+GYLIAALCEIWLTRCT EEAQHTKSFSWHWCI+F LLSLYMGLSYGLYVPDWDFKISA SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
        SLP +GSYVY VNCSLRGDLGPACNSAGMIDRYVLGIHH   +     LKECNISSSGQFPETSPSWC APFEPEGLLSSLTATVACIIGLQYGHILA++
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHK+RTN WFLLS KILA GIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTC LEWMGKH+LSIYVLVISNILVIGLQGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIV
        SPNNNIV
Subjt:  SPNNNIV

A0A1S3C640 LOW QUALITY PROTEIN: heparan-alpha-glucosaminide N-acetyltransferase-like1.4e-20084.89Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        M DS+PLLKNQQ LP SSGKAPRVVSLDVFRG SVFMMM VDYGGSFLPII+HSPW GLHLADFVMPWFLFIAGVSVALVYKEVKSK  A RNAACRGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFLLGVLLQG    G +   Y         +  + RIS+GYLIAALCEIWLTR T EEAQHTKSFSWHWCI+F LLSLYM LSYGLYVPDWDFKISA SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
        SLP +GSYVY VNCSLRGDLGPACNSAGMIDRYVLGIHH   +     LKECNISSSGQFPETSPSWC APFEPEGLLSSLTATVACIIGLQYGHILAK+
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHK+RTN WFLLSL+ LALG+FLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKH+LSIYVLVISNILVIGLQGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIVHWIVSRVKAQ
        SPNNNIVHWIVS VKA+
Subjt:  SPNNNIVHWIVSRVKAQ

A0A5D3BKQ6 Heparan-alpha-glucosaminide N-acetyltransferase-like6.1e-20185.13Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        M DS+PLLKNQQ LP SSGKAPRVVSLDVFRG SVFMMM VDYGGSFLPII+HSPW GLHLADFVMPWFLFIAGVSVALVYKEVKSK  A RNAACRGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFLLGVLLQG    G +   Y         +  + RIS+GYLIAALCEIWLTR T EEAQHTKSFSWHWCI+F LLSLYM LSYGLYVPDWDFKISA SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
        SLP +GSYVY VNCSLRGDLGPACNSAGMIDRYVLGIHH   +     LKECNISSSGQFPETSPSWC APFEPEGLLSSLTATVACIIGLQYGHILAK+
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHK+RTN WFLLS KILALG+FLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKH+LSIYVLVISNILVIGLQGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIVHWIVSRVKAQ
        SPNNNIVHWIVS VKA+
Subjt:  SPNNNIVHWIVSRVKAQ

A0A6J1G8U1 heparan-alpha-glucosaminide N-acetyltransferase-like1.9e-20284.21Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        MADSQPLLKN+Q LP+SS K PRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAGVS+ALVYKEVK KVTAT+NAACRGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFLLGVLLQG    G +   Y         +  + RISVGYLIAALCEIWLTRCT EEAQ+TKSFSWHWCI+FLLLSLYMGL YGLYVPDW+FKIS  SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
        S PPNGSYVYMVNCS+RGD+GPACNSAGMIDRYVLGIHH   +     LKECNISSSGQ PETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILA  
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHKSRTN+WF LSLKIL LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIYVLVISNILVIG+QGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIVHWIVSRVKAQS
        SP NNIVHWI+SRVKAQS
Subjt:  SPNNNIVHWIVSRVKAQS

A0A6J1I5L8 heparan-alpha-glucosaminide N-acetyltransferase-like1.6e-20183.97Show/hide
Query:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY
        MADSQPLL+N+Q LP+SS KAPRV+SLDVFRG SVFMMMFVDYGGSFLP+IAHSPWNGLHLADFVMPWFLFIAGVS+ALVYKEV  KVTAT+NAACRGLY
Subjt:  MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLY

Query:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS
        LFLLGVLLQG    G +   Y         +  + RISVGYLIAALCEIWLT C  EEAQ+TKSFSWHWCI+FLLLSLYMGLSYGLYVPDW+FKIS  SS
Subjt:  LFLLGVLLQGKVIAGNSCTEYFN-------IHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSS

Query:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS
        SLPPNGSYVYMVNCS+RGD+GPACNSAGMIDRYVLGIHH   +     LKECNISSSGQ PETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILA  
Subjt:  SLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKS

Query:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK
        QDHKSRTNSWF LSLKI  LGIFLVF+GIPVNKSLYTVSYMLITSASAGI+FCALYILVD+HGYR LTCVLEWMGKHALSIYVLVISNILVIG+QGFYWK
Subjt:  QDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWK

Query:  SPNNNIVHWIVSRVKAQS
        SP NNIVHWI+SRVKAQS
Subjt:  SPNNNIVHWIVSRVKAQS

SwissProt top hitse value%identityAlignment
Q3UDW8 Heparan-alpha-glucosaminide N-acetyltransferase9.5e-2627.87Show/hide
Query:  SQPLLKNQQALPES-SGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVAL----VYKEVKSKVTATRNAACRG
        + PL  + Q  PE+    A R+  +D FRG ++ +M+FV+YGG       HS WNGL +AD V PWF+FI G S+ L    + +   SK+        R 
Subjt:  SQPLLKNQQALPES-SGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVAL----VYKEVKSKVTATRNAACRG

Query:  LYLFLLGVLLQGKVIAGNSCTEYFNIH------YIMRISVGYLIAALCEIWLTRCTPEEAQHTKS--------FSW-HWCIMFLLLSLYMGLSYGLYVPD
          L  +GV+    ++  N C    +         + R+ V Y + A+ E +  +  P+      S         SW  W  +  L S+++ L++ L VP 
Subjt:  LYLFLLGVLLQGKVIAGNSCTEYFNIH------YIMRISVGYLIAALCEIWLTRCTPEEAQHTKS--------FSW-HWCIMFLLLSLYMGLSYGLYVPD

Query:  WDFKISAMSSSLPPNGSYVYMVNCSLRGDLG--PAC--NSAGMIDRYVLGIHHFPNEECGLKECNISSSGQFPETSPSW-CHAPFEPEGLLSSLTATVAC
                +  L P G           GDLG  P C   +AG IDR +LG +H                 Q P ++  +     ++PEG+L ++ + V  
Subjt:  WDFKISAMSSSLPPNGSYVYMVNCSLRGDLG--PAC--NSAGMIDRYVLGIHHFPNEECGLKECNISSSGQFPETSPSW-CHAPFEPEGLLSSLTATVAC

Query:  IIGLQYGHILAKSQDHKSRTNSWFLLSLKILAL-GIFLVFIG-----IPVNKSLYTVSYMLITSASAGIIFCALYILVDIHG--------YRRLTCVLEW
         +G+Q G IL   +D      + F     IL L  I L  +      IP+NK+L+++SY+   S  A  I   LY +VD+ G        Y  +  +L +
Subjt:  IIGLQYGHILAKSQDHKSRTNSWFLLSLKILAL-GIFLVFIG-----IPVNKSLYTVSYMLITSASAGIIFCALYILVDIHG--------YRRLTCVLEW

Query:  MGKHALSIY
        +G   L  Y
Subjt:  MGKHALSIY

Q68CP4 Heparan-alpha-glucosaminide N-acetyltransferase2.0e-2828.85Show/hide
Query:  DSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVAL----VYKEVKSKVTATRNAACRG
        D QP      ALP      PR+ S+D FRG ++ +M+FV+YGG       H+ WNGL +AD V PWF+FI G S+ L    + +   SK       A R 
Subjt:  DSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVAL----VYKEVKSKVTATRNAACRG

Query:  LYLFLLGVLLQGKVIAGNSCTEYFNIH------YIMRISVGYLIAALCEIWLTRCTPEEAQHTKS--------FSW-HWCIMFLLLSLYMGLSYGLYVPD
          L  +G++    ++  N C    +         + R+ V Y + A+ E+   +  PE     +S         SW  W ++ +L  L++GL++ L VP 
Subjt:  LYLFLLGVLLQGKVIAGNSCTEYFNIH------YIMRISVGYLIAALCEIWLTRCTPEEAQHTKS--------FSW-HWCIMFLLLSLYMGLSYGLYVPD

Query:  WDFKISAMSSSLPPNGSYVYMVNCSLRGDLG--PAC--NSAGMIDRYVLGIHHFPNEECGLKECNISSSGQFPETSPSW-CHAPFEPEGLLSSLTATVAC
                +  L P G           GD G  P C   +AG IDR +LG  H                 Q P ++  +     ++PEG+L ++ + V  
Subjt:  WDFKISAMSSSLPPNGSYVYMVNCSLRGDLG--PAC--NSAGMIDRYVLGIHHFPNEECGLKECNISSSGQFPETSPSW-CHAPFEPEGLLSSLTATVAC

Query:  IIGLQYGHIL----AKSQDHKSRTNSW-FLLSLKILALGIFLVFIG-IPVNKSLYTVSYMLITSASAGIIFCALYILVDIHG--------YRRLTCVLEW
         +G+Q G IL    A+++D   R  +W  +L L  +AL       G IPVNK+L+++SY+   S+ A  I   LY +VD+ G        Y  +  +L +
Subjt:  IIGLQYGHIL----AKSQDHKSRTNSW-FLLSLKILALGIFLVFIG-IPVNKSLYTVSYMLITSASAGIIFCALYILVDIHG--------YRRLTCVLEW

Query:  MGKHALSIY
        +G      Y
Subjt:  MGKHALSIY

Arabidopsis top hitse value%identityAlignment
AT5G27730.1 Protein of unknown function (DUF1624)1.6e-8139.9Show/hide
Query:  SGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLYLFLLGVLLQG-------
        +G  PR+ SLD+FRG +V +M+ VD  G   P+IAH+PWNG +LADFVMP+FLFI GVS+AL  K + +K  A +    R   L   G+LLQG       
Subjt:  SGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLYLFLLGVLLQG-------

Query:  KVIAGNSCTEYFNIHYIMRISVGYLIAALCEIWLTRCTPEEAQHT------KSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSSSLPPNGSYVYM
        ++  G   T       + RI++ YL+ AL EI+ T+ + EE   T      KS+ WHW +   +L +Y+   YG YVPDW+F +    S L      +  
Subjt:  KVIAGNSCTEYFNIHYIMRISVGYLIAALCEIWLTRCTPEEAQHT------KSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSSSLPPNGSYVYM

Query:  VNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISS--SGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKSQDHKSRTNS
        V+C +RG L P CN+ G +DR VLGI+H  +       K C   S   G   + +PSWC APFEPEG+LSS++A ++ IIG+ +GHI+   + H +R   
Subjt:  VNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEEC--GLKECNISS--SGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKSQDHKSRTNS

Query:  WFLLSLKILALGIFLVFIGI-PVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWKSPNNNIVH
        W    L +LALG+ L F  + P+NK LY+ SY+ +TS +A ++F +LY LVDI  ++ +   L+W+G +A+ +YV+    IL     G+Y++ P+N +++
Subjt:  WFLLSLKILALGIFLVFIGI-PVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWKSPNNNIVH

Query:  WIVSRV
        WI   V
Subjt:  WIVSRV

AT5G47900.1 Protein of unknown function (DUF1624)1.3e-10247.19Show/hide
Query:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLYLFLLGVLLQGKVIAG-NSCTEY
        R+VSLDVFRG +V  M+ VD  G  LP I HSPW+G+ LADFVMP+FLFI GVS+A  YK +  +  ATR A  R L L LLG+ LQG  I G N+ T  
Subjt:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLYLFLLGVLLQGKVIAG-NSCTEY

Query:  FNIHYI------MRISVGYLIAALCEIWL--TRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSSSLPPNGSYVYMVNCSLRGDL
         ++  I       RI++ YL+ ALCEIWL        E    K + +HW + F++ ++Y+ L YGLYVPDW+++I                V C +RG  
Subjt:  FNIHYI------MRISVGYLIAALCEIWL--TRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSSSLPPNGSYVYMVNCSLRGDL

Query:  GPACNSAGMIDRYVLGIHHFPNEE--CGLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKSQDHKSRTNSWFLLSLKIL
        GP CN+ GM+DR  LGI H   +      K+C+I+  ++G  P  +PSWC APF+PEGLLSSL ATV C++GL YGHI+   +DHK R N W L S  +L
Subjt:  GPACNSAGMIDRYVLGIHHFPNEE--CGLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKSQDHKSRTNSWFLLSLKIL

Query:  ALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWKSPNNNIVHWI
         LG+ L   G+ +NK LYT+SYM +TS ++G +  A+Y++VD++GY+R + VLEWMG HAL IYVL+  N++ + + GFYWK+P NN++H I
Subjt:  ALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWKSPNNNIVHWI

AT5G47900.4 Protein of unknown function (DUF1624)6.7e-8342.14Show/hide
Query:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLYLFLLGVLLQGKVIAG-NSCTEY
        R+VSLDVFRG +V  M+ VD  G  LP I HSPW+G+ LADFVMP+FLFI GVS+A  YK +  +  ATR A  R L L LLG+ LQG  I G N+ T  
Subjt:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLYLFLLGVLLQGKVIAG-NSCTEY

Query:  FNIHYI------MRISVGYLIAALCEIWL--TRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSSSLPPNGSYVYMVNCSLRGDL
         ++  I       RI++ YL+ ALCEIWL        E    K + +HW + F++ ++Y+ L YGLYVPDW+++I                V C +RG  
Subjt:  FNIHYI------MRISVGYLIAALCEIWL--TRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSSSLPPNGSYVYMVNCSLRGDL

Query:  GPACNSAGMIDRYVLGIHHFPNEE--CGLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKSQDHKSRTNSW------FL
        GP CN+ GM+DR  LGI H   +      K+C+I+  ++G  P  +PSWC APF+PEGLLSSL ATV C++GL YGHI+   + + S+   +        
Subjt:  GPACNSAGMIDRYVLGIHHFPNEE--CGLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKSQDHKSRTNSW------FL

Query:  LSLKILALGIFLVFIGIPV---NKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWKSPNNNIVHW
         S K      F  F+   V    + L+ +   +I     G        LVD++GY+R + VLEWMG HAL IYVL+  N++ + + GFYWK+P NN++H 
Subjt:  LSLKILALGIFLVFIGIPV---NKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWKSPNNNIVHW

Query:  I
        I
Subjt:  I

AT5G47900.6 Protein of unknown function (DUF1624)5.3e-8044.68Show/hide
Query:  SKVTATRNAACRGLYLFLLGVLLQGKVIAG-NSCTEYFNIHYI------MRISVGYLIAALCEIWL--TRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLS
        S+  ATR A  R L L LLG+ LQG  I G N+ T   ++  I       RI++ YL+ ALCEIWL        E    K + +HW + F++ ++Y+ L 
Subjt:  SKVTATRNAACRGLYLFLLGVLLQGKVIAG-NSCTEYFNIHYI------MRISVGYLIAALCEIWL--TRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLS

Query:  YGLYVPDWDFKISAMSSSLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEE--CGLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSL
        YGLYVPDW+++I                V C +RG  GP CN+ GM+DR  LGI H   +      K+C+I+  ++G  P  +PSWC APF+PEGLLSSL
Subjt:  YGLYVPDWDFKISAMSSSLPPNGSYVYMVNCSLRGDLGPACNSAGMIDRYVLGIHHFPNEE--CGLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSL

Query:  TATVACIIGLQYGHILAKSQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSI
         ATV C++GL YGHI+   +DHK R N W L S  +L LG+ L   G+ +NK LYT+SYM +TS ++G +  A+Y++VD++GY+R + VLEWMG HAL I
Subjt:  TATVACIIGLQYGHILAKSQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSI

Query:  YVLVISNILVIGLQGFYWKSPNNNIVHWI
        YVL+  N++ + + GFYWK+P NN++H I
Subjt:  YVLVISNILVIGLQGFYWKSPNNNIVHWI

AT5G47900.7 Protein of unknown function (DUF1624)1.1e-7741.8Show/hide
Query:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLYLFLLGVLLQGKVIAG-NSCTEY
        R+VSLDVFRG +V  M+ VD  G  LP I HSPW+G+ LADFVMP+FLFI GVS+A  YK +  +  ATR A  R L L LLG+ LQG  I G N+ T  
Subjt:  RVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLYLFLLGVLLQGKVIAG-NSCTEY

Query:  FNIHYI------MRISVGYLIAALCEIWL--TRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSSSLPPNGSYVYMVNCSLRGDL
         ++  I       RI++ YL+ ALCEIWL        E    K + +HW + F++ ++Y+ L YGLYVPDW+++I                V C +RG  
Subjt:  FNIHYI------MRISVGYLIAALCEIWL--TRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSSSLPPNGSYVYMVNCSLRGDL

Query:  GPACNSAGMIDRYVLGIHHFPNEE--CGLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAK-------------------
        GP CN+ GM+DR  LGI H   +      K+C+I+  ++G  P  +PSWC APF+PEGLLSSL ATV C++GL YGHI+                     
Subjt:  GPACNSAGMIDRYVLGIHHFPNEE--CGLKECNIS--SSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAK-------------------

Query:  ------------------SQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILV
                           QDHK R N W L S  +L LG+ L   G+ +NK LYT+SYM +TS ++G +  A+Y++V
Subjt:  ------------------SQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATTCTCAACCGCTGCTCAAGAATCAACAGGCCTTGCCGGAGTCCAGTGGCAAGGCTCCACGAGTTGTCTCACTCGACGTCTTTCGCGGCTTCAGCGTCTTTAT
GATGATGTTCGTGGACTACGGTGGCTCTTTTTTACCAATTATCGCGCATTCACCATGGAATGGACTTCATTTAGCTGATTTTGTGATGCCTTGGTTTCTATTTATTGCGG
GAGTTTCGGTTGCACTTGTTTATAAAGAAGTAAAAAGTAAAGTGACCGCTACAAGGAATGCAGCATGCAGGGGCCTGTACCTCTTTCTACTGGGAGTTCTTCTTCAAGGT
AAAGTTATTGCTGGTAACTCTTGCACCGAATATTTTAATATTCATTACATTATGAGAATATCTGTTGGATACTTGATTGCTGCACTATGTGAGATCTGGCTAACTCGTTG
CACACCTGAAGAAGCTCAACATACTAAGAGTTTCAGCTGGCATTGGTGTATTATGTTTTTGCTGTTGTCATTGTATATGGGACTGTCGTATGGTTTATATGTTCCGGATT
GGGACTTTAAAATATCAGCCATGAGCTCTTCACTTCCACCAAATGGAAGCTATGTTTACATGGTGAATTGTTCTCTTCGAGGTGATTTGGGACCTGCTTGTAATTCTGCT
GGCATGATTGATCGTTATGTTCTTGGTATTCACCATTTCCCTAATGAAGAATGTGGTTTGAAGGAGTGCAATATTTCTTCCAGCGGTCAATTCCCTGAGACTTCACCTTC
ATGGTGTCATGCTCCTTTTGAACCTGAAGGTCTGTTAAGCTCTTTAACAGCTACAGTAGCATGCATAATAGGACTTCAGTATGGTCACATTCTTGCCAAATCACAGGATC
ACAAAAGTCGCACCAATAGCTGGTTCTTACTCTCGCTTAAGATTTTGGCTCTCGGAATATTCCTCGTCTTTATAGGTATCCCTGTAAATAAGTCCCTCTACACGGTCAGC
TATATGTTGATTACTTCAGCGTCGGCAGGAATAATCTTTTGTGCTCTGTATATCTTGGTGGATATCCACGGCTATCGACGGTTGACATGTGTTCTGGAGTGGATGGGGAA
GCATGCTTTAAGTATTTATGTTTTAGTAATCTCTAACATACTAGTTATTGGGCTCCAAGGATTCTACTGGAAATCTCCCAACAATAACATTGTGCACTGGATTGTTAGTC
GTGTTAAAGCTCAAAGTTGA
mRNA sequenceShow/hide mRNA sequence
GCGACGGCGTTTATAGCAGAGCATATTCTTTACCAAAACATGGCTTTACAGATACTCTAGTCAATTTGATCATTTTCCCTGCTGTTCTCGCCGGAATCAATGGCCGATTC
TCAACCGCTGCTCAAGAATCAACAGGCCTTGCCGGAGTCCAGTGGCAAGGCTCCACGAGTTGTCTCACTCGACGTCTTTCGCGGCTTCAGCGTCTTTATGATGATGTTCG
TGGACTACGGTGGCTCTTTTTTACCAATTATCGCGCATTCACCATGGAATGGACTTCATTTAGCTGATTTTGTGATGCCTTGGTTTCTATTTATTGCGGGAGTTTCGGTT
GCACTTGTTTATAAAGAAGTAAAAAGTAAAGTGACCGCTACAAGGAATGCAGCATGCAGGGGCCTGTACCTCTTTCTACTGGGAGTTCTTCTTCAAGGTAAAGTTATTGC
TGGTAACTCTTGCACCGAATATTTTAATATTCATTACATTATGAGAATATCTGTTGGATACTTGATTGCTGCACTATGTGAGATCTGGCTAACTCGTTGCACACCTGAAG
AAGCTCAACATACTAAGAGTTTCAGCTGGCATTGGTGTATTATGTTTTTGCTGTTGTCATTGTATATGGGACTGTCGTATGGTTTATATGTTCCGGATTGGGACTTTAAA
ATATCAGCCATGAGCTCTTCACTTCCACCAAATGGAAGCTATGTTTACATGGTGAATTGTTCTCTTCGAGGTGATTTGGGACCTGCTTGTAATTCTGCTGGCATGATTGA
TCGTTATGTTCTTGGTATTCACCATTTCCCTAATGAAGAATGTGGTTTGAAGGAGTGCAATATTTCTTCCAGCGGTCAATTCCCTGAGACTTCACCTTCATGGTGTCATG
CTCCTTTTGAACCTGAAGGTCTGTTAAGCTCTTTAACAGCTACAGTAGCATGCATAATAGGACTTCAGTATGGTCACATTCTTGCCAAATCACAGGATCACAAAAGTCGC
ACCAATAGCTGGTTCTTACTCTCGCTTAAGATTTTGGCTCTCGGAATATTCCTCGTCTTTATAGGTATCCCTGTAAATAAGTCCCTCTACACGGTCAGCTATATGTTGAT
TACTTCAGCGTCGGCAGGAATAATCTTTTGTGCTCTGTATATCTTGGTGGATATCCACGGCTATCGACGGTTGACATGTGTTCTGGAGTGGATGGGGAAGCATGCTTTAA
GTATTTATGTTTTAGTAATCTCTAACATACTAGTTATTGGGCTCCAAGGATTCTACTGGAAATCTCCCAACAATAACATTGTGCACTGGATTGTTAGTCGTGTTAAAGCT
CAAAGTTGAAATAATTCGAGTAGCTGTGTAGAGTATTCAGTATTTGGAACTGCTAACTTGCTGGGGATAATTCACTTATCAGTAAAATCATCTCCAGCATTATTCGAAAT
CACACTGCAAATATTGAACAAAAAGTTTAGTAAGAAGGTAACAGAACTCTTGCGGTGGATATTTTGGGTTCGGCCATTTGTTGGGGCGGTGGCAGCAGCGGTGTACCATC
AATACGTTCTCAGAGCGGCGGCTGTCAAGGCTCTTCGATCCTTCCGCAGCAACCCTTCCGCCTAACAAACTTCAATCCTTCTCTTTGGTTTTAAAGACTCCTAACAAACA
TTCTATCCCA
Protein sequenceShow/hide protein sequence
MADSQPLLKNQQALPESSGKAPRVVSLDVFRGFSVFMMMFVDYGGSFLPIIAHSPWNGLHLADFVMPWFLFIAGVSVALVYKEVKSKVTATRNAACRGLYLFLLGVLLQG
KVIAGNSCTEYFNIHYIMRISVGYLIAALCEIWLTRCTPEEAQHTKSFSWHWCIMFLLLSLYMGLSYGLYVPDWDFKISAMSSSLPPNGSYVYMVNCSLRGDLGPACNSA
GMIDRYVLGIHHFPNEECGLKECNISSSGQFPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILAKSQDHKSRTNSWFLLSLKILALGIFLVFIGIPVNKSLYTVS
YMLITSASAGIIFCALYILVDIHGYRRLTCVLEWMGKHALSIYVLVISNILVIGLQGFYWKSPNNNIVHWIVSRVKAQS