; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020393 (gene) of Snake gourd v1 genome

Gene IDTan0020393
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionheparan-alpha-glucosaminide N-acetyltransferase
Genome locationLG10:22558675..22565069
RNA-Seq ExpressionTan0020393
SyntenyTan0020393
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR012429 - Heparan-alpha-glucosaminide N-acetyltransferase, catalytic domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651279.1 hypothetical protein Csa_000910 [Cucumis sativus]3.1e-25490.37Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        MAIRKDMG YEPIKGADDCDL N+          T++SVSKHCNQSDEDVEMALR SHSRSPLP+HNANPLT   +SKID+ QFSSS RPI RSSDQ  R
Subjt:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        LVSLDVFRGITVALMIVVDYAGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQL+ AV+LT LYLALSYGLYVPDWEYQVPS T S +AS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPP+APSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILR
        AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVY WRRMNVVMEWMGKHALVIYVLAACNVLPV+LQGFY GQP+NNI R
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILR

XP_004141153.1 heparan-alpha-glucosaminide N-acetyltransferase [Cucumis sativus]5.9e-25890.28Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        MAIRKDMG YEPIKGADDCDL N+          T++SVSKHCNQSDEDVEMALR SHSRSPLP+HNANPLT   +SKID+ QFSSS RPI RSSDQ  R
Subjt:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        LVSLDVFRGITVALMIVVDYAGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQL+ AV+LT LYLALSYGLYVPDWEYQVPS T S +AS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPP+APSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT
        AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVY WRRMNVVMEWMGKHALVIYVLAACNVLPV+LQGFY GQP+NNILRLIG+P+
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT

XP_008465168.1 PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Cucumis melo]1.0e-25790.28Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        MAIRKDMG YEPIKGADDCDL N+          T++SVSKHCNQSDEDVEMALR SHSRSPLP+HNANPLT   +SKID+ QFSSS RPI RSSDQ  R
Subjt:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        LVSLDVFRGITVALMIVVDYAGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQL+VAV+LT LYL LSYGLYVPDWEYQVPS T S +AS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPP+APSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT
        AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVY WRRMNVVMEWMGKHALVIYVLAACNVLPV+LQGFY GQP+NNILRLIG+P+
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT

XP_022983292.1 heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Cucurbita maxima]4.3e-25690.08Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLANDTVI----------SVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        M+IRKDMGKY+PIK   DCDLAN+T +          SVS HCN S EDVEMAL DSHSRSPLPLHNANPLT  A+SK+DDAQFSSSARP+ RSS Q QR
Subjt:  MAIRKDMGKYEPIKGADDCDLANDTVI----------SVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        L SLDVFRGITVALMIVVDY GGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQQIRWMGILQRIAIAYFLAA+CEIWLKGSDYVNSETALRRKYQLQL+VAVILTTLYL LSYGLYVPDWEYQVPSQ+ S MAS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYARSEQCSIN+PDYGPLPPNAPSWCQAPFDPEG+LSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT
        AIGLDFLGMHINKVLYTVSYMSVT GAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY GQP+NNILRLIGI T
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT

XP_038905626.1 heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Benincasa hispida]7.5e-26191.48Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        MAIRKDMG YEPIKG DDCDLAN+          T++SVSKHCNQ+DEDVEMALRDSHSRSPLPLHNANPLT   +SKIDD QFSSSARPI RSS+QRQR
Subjt:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQ+IRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQL+VA +LTTLYL LSYGLYV DWEYQVPS T S +AS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPP+APSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIP
        AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPV+LQGFY GQP+NNILRLIG+P
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIP

TrEMBL top hitse value%identityAlignment
A0A0A0LFP0 DUF1624 domain-containing protein2.9e-25890.28Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        MAIRKDMG YEPIKGADDCDL N+          T++SVSKHCNQSDEDVEMALR SHSRSPLP+HNANPLT   +SKID+ QFSSS RPI RSSDQ  R
Subjt:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        LVSLDVFRGITVALMIVVDYAGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQL+ AV+LT LYLALSYGLYVPDWEYQVPS T S +AS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPP+APSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT
        AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVY WRRMNVVMEWMGKHALVIYVLAACNVLPV+LQGFY GQP+NNILRLIG+P+
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT

A0A1S3CNA5 heparan-alpha-glucosaminide N-acetyltransferase4.9e-25890.28Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        MAIRKDMG YEPIKGADDCDL N+          T++SVSKHCNQSDEDVEMALR SHSRSPLP+HNANPLT   +SKID+ QFSSS RPI RSSDQ  R
Subjt:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        LVSLDVFRGITVALMIVVDYAGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQL+VAV+LT LYL LSYGLYVPDWEYQVPS T S +AS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPP+APSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT
        AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVY WRRMNVVMEWMGKHALVIYVLAACNVLPV+LQGFY GQP+NNILRLIG+P+
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT

A0A5A7T699 Heparan-alpha-glucosaminide N-acetyltransferase4.9e-25890.28Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        MAIRKDMG YEPIKGADDCDL N+          T++SVSKHCNQSDEDVEMALR SHSRSPLP+HNANPLT   +SKID+ QFSSS RPI RSSDQ  R
Subjt:  MAIRKDMGKYEPIKGADDCDLAND----------TVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        LVSLDVFRGITVALMIVVDYAGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQL+VAV+LT LYL LSYGLYVPDWEYQVPS T S +AS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPP+APSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT
        AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVY WRRMNVVMEWMGKHALVIYVLAACNVLPV+LQGFY GQP+NNILRLIG+P+
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT

A0A6J1IYW6 heparan-alpha-glucosaminide N-acetyltransferase-like isoform X12.1e-25690.08Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLANDTVI----------SVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        M+IRKDMGKY+PIK   DCDLAN+T +          SVS HCN S EDVEMAL DSHSRSPLPLHNANPLT  A+SK+DDAQFSSSARP+ RSS Q QR
Subjt:  MAIRKDMGKYEPIKGADDCDLANDTVI----------SVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        L SLDVFRGITVALMIVVDY GGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQQIRWMGILQRIAIAYFLAA+CEIWLKGSDYVNSETALRRKYQLQL+VAVILTTLYL LSYGLYVPDWEYQVPSQ+ S MAS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYARSEQCSIN+PDYGPLPPNAPSWCQAPFDPEG+LSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT
        AIGLDFLGMHINKVLYTVSYMSVT GAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY GQP+NNILRLIGI T
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT

A0A6J1J7D2 heparan-alpha-glucosaminide N-acetyltransferase-like isoform X22.8e-25389.59Show/hide
Query:  MAIRKDMGKYEPIKGADDCDLANDTVI----------SVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR
        M+IRKDMGKY+PIK   DCDLAN+T +          SVS HCN S EDVEMAL DSHSRSPLPLHNANPLT  A+SK+DDAQFSSSARP+ RSS Q QR
Subjt:  MAIRKDMGKYEPIKGADDCDLANDTVI----------SVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQR

Query:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV
        L SLDVFRGITVALMIVVDY GGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG+NNLTYGV
Subjt:  LVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGV

Query:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP
        DIQQIRWMGILQRIAIAYFLAA+CEIWLKGSDYVNSETALRRKYQLQL+VAVILTTLYL LSYGLYVPDWEYQVPSQ+ S MAS KIFSVKCG RGDTGP
Subjt:  DIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGP

Query:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
         CNAVGMIDRKIFGIQHLYKRPIYARSEQCSIN+PDYGPLPPNAPSWCQAPFDPEG+LSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL
Subjt:  GCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVL

Query:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLI
        AIGLDFLGMHINKVLYTVSYMSVT GAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY GQP+NNI+ L+
Subjt:  AIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLI

SwissProt top hitse value%identityAlignment
Q3UDW8 Heparan-alpha-glucosaminide N-acetyltransferase9.2e-2828.72Show/hide
Query:  RSSDQRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG
        RSS  R R V  D FRG+ + LM+ V+Y GG      HS WNGLT+ADLV P+F+FI+G S+ L+   I  RG +  K + + +   FL L   G  +  
Subjt:  RSSDQRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHG

Query:  LNNLTYGVDIQQIRWMGILQRIAIAYFLAALCE--IWLKGSDYVNSETALRRKYQL-----QLIVAVILTTLYLALSYGLYVPDWE--YQVPSQTNSQMA
         N     +   ++R  G+LQR+ + YF+ A+ E   W    D    E++      +     Q +  + L +++LAL++ L VP     Y  P        
Subjt:  LNNLTYGVDIQQIRWMGILQRIAIAYFLAALCE--IWLKGSDYVNSETALRRKYQL-----QLIVAVILTTLYLALSYGLYVPDWE--YQVPSQTNSQMA

Query:  SLKIFSVKCGIRGDTG--PGC--NAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVH
                 G  GD G  P C   A G IDR + G  HLY+ P                          +  +DPEG+L T+ ++V   +G+  G I+V+
Subjt:  SLKIFSVKCGIRGDTG--PGC--NAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVH

Query:  FKDHRDRMLHWIIPSSCLI-VLAIGLDFLGMH-----INKVLYTVSYMSVTAGAAGLLFTGIYLMVDV--------YRWRRMNVVMEWMGKHALVIY
        +KD    +L       C++ +++I L  +  +     INK L+++SY++  +  A  +   +Y +VDV        + +  MN ++ ++G   L  Y
Subjt:  FKDHRDRMLHWIIPSSCLI-VLAIGLDFLGMH-----INKVLYTVSYMSVTAGAAGLLFTGIYLMVDV--------YRWRRMNVVMEWMGKHALVIY

Q68CP4 Heparan-alpha-glucosaminide N-acetyltransferase1.1e-2526.87Show/hide
Query:  RLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIA----TQKAVLRTLKLLFLGLFLQGGFLHGLNN
        RL S+D FRGI + LM+ V+Y GG      H+ WNGLT+ADLV P+F+FI+G S+ L+   I  RG +      K   R+  L+ +G+ +        N 
Subjt:  RLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIA----TQKAVLRTLKLLFLGLFLQGGFLHGLNN

Query:  LTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKG--SDYVNSETALRRKYQL-----QLIVAVILTTLYLALSYGLYVPDWE--YQVPSQTNSQMASLK
            +   ++R  G+LQR+ + YF+ A+ E+       ++  SE +      +     Q ++ ++L  L+L L++ L VP     Y  P           
Subjt:  LTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKG--SDYVNSETALRRKYQL-----QLIVAVILTTLYLALSYGLYVPDWE--YQVPSQTNSQMASLK

Query:  IFSVKCGIRGDTG--PGC--NAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKD
              G  GD G  P C   A G IDR + G  HLY+ P  A      +                   +DPEG+L T+ ++V   +G+  G I++++K 
Subjt:  IFSVKCGIRGDTG--PGC--NAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKD

Query:  HRDRMLHWIIPSSCLI-VLAIGLDFLG-----MHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYV
            +L       C++ ++++ L  +      + +NK L+++SY++  +  A  +   +Y +VDV +         + G +++++YV
Subjt:  HRDRMLHWIIPSSCLI-VLAIGLDFLG-----MHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYV

Arabidopsis top hitse value%identityAlignment
AT5G47900.1 Protein of unknown function (DUF1624)3.2e-16164.63Show/hide
Query:  SARPIHRSSD---QRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGL
        SA  I RSS     ++RLVSLDVFRG+TVA MI+VD  GG++P+INHSPW+G+TLAD VMPFFLFIVGVSLA AYK +  R +AT+KA++R+LKLL LGL
Subjt:  SARPIHRSSD---QRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGL

Query:  FLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQV-PSQTNSQM
        FLQGGF+HGLNNLTYG+D+++IR MGILQRIAIAY + ALCEIWLKG+  V+SE ++ +KY+   +VA ++TT+YL+L YGLYVPDWEYQ+      S +
Subjt:  FLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQV-PSQTNSQM

Query:  ASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKD
         +     VKCG+RG TGPGCNAVGM+DR   GIQHLY++P+YAR++QCSIN P+ GPLPP+APSWCQAPFDPEGLLS++MA VTCLVGLHYGHII+HFKD
Subjt:  ASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKD

Query:  HRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQP
        H+ R+  WI+ S CL++L + L+  GMH+NK LYT+SYM VT+GA+G L + IYLMVDVY ++R ++V+EWMG HAL IYVL ACN++ +++ GFYW  P
Subjt:  HRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYWGQP

Query:  RNNILRLIGI
         NN+L LIGI
Subjt:  RNNILRLIGI

AT5G47900.2 Protein of unknown function (DUF1624)8.1e-12067.56Show/hide
Query:  SARPIHRSSD---QRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGL
        SA  I RSS     ++RLVSLDVFRG+TVA MI+VD  GG++P+INHSPW+G+TLAD VMPFFLFIVGVSLA AYK +  R +AT+KA++R+LKLL LGL
Subjt:  SARPIHRSSD---QRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGL

Query:  FLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQV-PSQTNSQM
        FLQGGF+HGLNNLTYG+D+++IR MGILQRIAIAY + ALCEIWLKG+  V+SE ++ +KY+   +VA ++TT+YL+L YGLYVPDWEYQ+      S +
Subjt:  FLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQV-PSQTNSQM

Query:  ASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFK
         +     VKCG+RG TGPGCNAVGM+DR   GIQHLY++P+YAR++QCSIN P+ GPLPP+APSWCQAPFDPEGLLS++MA VTCLVGLHYGHII+HFK
Subjt:  ASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFK

AT5G47900.4 Protein of unknown function (DUF1624)1.6e-13658.27Show/hide
Query:  SARPIHRSSD---QRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGL
        SA  I RSS     ++RLVSLDVFRG+TVA MI+VD  GG++P+INHSPW+G+TLAD VMPFFLFIVGVSLA AYK +  R +AT+KA++R+LKLL LGL
Subjt:  SARPIHRSSD---QRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGL

Query:  FLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQV-PSQTNSQM
        FLQGGF+HGLNNLTYG+D+++IR MGILQRIAIAY + ALCEIWLKG+  V+SE ++ +KY+   +VA ++TT+YL+L YGLYVPDWEYQ+      S +
Subjt:  FLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQV-PSQTNSQM

Query:  ASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKD
         +     VKCG+RG TGPGCNAVGM+DR   GIQHLY++P+YAR++QCSIN P+ GPLPP+APSWCQAPFDPEGLLS++MA VTCLVGLHYGHII+HFK 
Subjt:  ASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKD

Query:  HRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYL-------MVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQ
        +  +   +  PS  + +      F  M     L +     V +    L   GI++       +VDVY ++R ++V+EWMG HAL IYVL ACN++ +++ 
Subjt:  HRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYL-------MVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQ

Query:  GFYWGQPRNNILRLIGI
        GFYW  P NN+L LIGI
Subjt:  GFYWGQPRNNILRLIGI

AT5G47900.6 Protein of unknown function (DUF1624)4.1e-13262.72Show/hide
Query:  LAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILT
        +++  +PS+ +AT+KA++R+LKLL LGLFLQGGF+HGLNNLTYG+D+++IR MGILQRIAIAY + ALCEIWLKG+  V+SE ++ +KY+   +VA ++T
Subjt:  LAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILT

Query:  TLYLALSYGLYVPDWEYQV-PSQTNSQMASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDP
        T+YL+L YGLYVPDWEYQ+      S + +     VKCG+RG TGPGCNAVGM+DR   GIQHLY++P+YAR++QCSIN P+ GPLPP+APSWCQAPFDP
Subjt:  TLYLALSYGLYVPDWEYQV-PSQTNSQMASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDP

Query:  EGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWM
        EGLLS++MA VTCLVGLHYGHII+HFKDH+ R+  WI+ S CL++L + L+  GMH+NK LYT+SYM VT+GA+G L + IYLMVDVY ++R ++V+EWM
Subjt:  EGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVVMEWM

Query:  GKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGI
        G HAL IYVL ACN++ +++ GFYW  P NN+L LIGI
Subjt:  GKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGI

AT5G47900.7 Protein of unknown function (DUF1624)6.8e-13559.39Show/hide
Query:  SARPIHRSSD---QRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGL
        SA  I RSS     ++RLVSLDVFRG+TVA MI+VD  GG++P+INHSPW+G+TLAD VMPFFLFIVGVSLA AYK +  R +AT+KA++R+LKLL LGL
Subjt:  SARPIHRSSD---QRQRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGL

Query:  FLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQV-PSQTNSQM
        FLQGGF+HGLNNLTYG+D+++IR MGILQRIAIAY + ALCEIWLKG+  V+SE ++ +KY+   +VA ++TT+YL+L YGLYVPDWEYQ+      S +
Subjt:  FLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQV-PSQTNSQM

Query:  ASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFK-
         +     VKCG+RG TGPGCNAVGM+DR   GIQHLY++P+YAR++QCSIN P+ GPLPP+APSWCQAPFDPEGLLS++MA VTCLVGLHYGHII+HFK 
Subjt:  ASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFK-

Query:  ------------------------------------DHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMV
                                            DH+ R+  WI+ S CL++L + L+  GMH+NK LYT+SYM VT+GA+G L + IYLMV
Subjt:  ------------------------------------DHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATCCGTAAAGACATGGGCAAGTACGAGCCTATCAAAGGCGCTGACGACTGTGATCTCGCCAATGACACTGTCATCTCGGTTTCTAAGCACTGTAATCAGTCTGA
TGAAGATGTTGAGATGGCTCTTCGTGATTCCCATTCGAGATCTCCTCTTCCTCTTCACAATGCCAATCCTCTCACCGCTGCTGCTGCTTCTAAAATCGACGACGCCCAAT
TTTCCTCTTCTGCTAGGCCCATTCACCGATCTTCCGATCAACGGCAACGCCTTGTTTCCCTCGATGTATTTCGCGGCATCACGGTTGCGCTAATGATAGTGGTGGACTAT
GCTGGTGGGGTTATGCCTGCAATAAATCATTCACCATGGAATGGGTTAACACTGGCAGATCTTGTGATGCCATTTTTCCTATTCATTGTTGGAGTTTCACTTGCCCTTGC
TTACAAGAAAATTCCAAGCCGAGGCATTGCAACTCAGAAGGCTGTGTTACGGACGTTGAAACTTTTGTTCTTAGGCCTCTTTCTTCAAGGTGGGTTTCTCCATGGCCTAA
ACAATTTAACTTATGGAGTGGATATCCAGCAAATTAGATGGATGGGAATCTTACAGAGAATTGCAATAGCATATTTTCTTGCAGCACTGTGTGAGATATGGCTAAAGGGC
AGTGATTATGTGAATTCAGAAACTGCATTGCGGAGAAAGTATCAATTACAGCTAATTGTGGCCGTCATCCTCACAACGTTATATCTTGCCCTGTCATACGGATTGTACGT
TCCTGATTGGGAGTATCAAGTTCCAAGTCAAACTAATTCCCAAATGGCTTCTCTAAAGATATTTTCTGTGAAATGTGGGATACGTGGTGACACCGGACCTGGCTGCAATG
CTGTGGGAATGATAGATCGTAAGATTTTTGGTATTCAACATCTGTATAAAAGACCTATTTATGCACGGTCTGAGCAATGCAGCATTAATGCACCAGACTATGGTCCATTG
CCTCCTAATGCTCCTTCTTGGTGTCAAGCCCCTTTTGATCCAGAAGGGCTTTTAAGCACAGTGATGGCTGTTGTGACCTGCTTGGTTGGCTTGCATTATGGGCACATCAT
TGTCCATTTCAAAGATCATCGAGACAGAATGCTTCATTGGATCATCCCCTCGTCGTGTCTGATTGTGCTGGCCATTGGCTTAGACTTCTTAGGGATGCATATAAATAAGG
TTCTTTATACAGTTAGTTACATGAGTGTCACTGCTGGTGCAGCCGGTCTTCTCTTCACTGGGATATACTTGATGGTTGATGTGTACAGATGGAGGCGCATGAATGTGGTG
ATGGAGTGGATGGGAAAGCATGCATTGGTGATATACGTTCTCGCTGCCTGCAATGTGCTGCCTGTGGTTCTGCAAGGCTTCTATTGGGGGCAGCCTCGAAACAACATCCT
GAGGCTAATTGGAATTCCAACGTGA
mRNA sequenceShow/hide mRNA sequence
TTCTGCTCTCTGGGTTGGTTCTCTCTCGCATTCAGTTTGGTTCGAACTATAAATAAGCAAGTTGGAAGCAGAGGGGCCAATGGAACTTATTGAAAGAAAGAGAGAAAAGG
TTGCAGAAGGAAAGCAAAGTATTTTCAGGAAACTGGGAACCATTCCATCCCACCTTTGGAGGAAGAGGTTCCCTTCCTTGTAACCCTTCTCCTTTTCCCTTTTGGTTTCC
CCTTCAATCTCTCCATCTCCTTGTTCTTCATCCTCTTCAAACTCACACTCATTTGGGAGGAATTCTTTTGGTGCTTCTGTTTTTGTTTGGGGTTTCTTCCCTTTGATATA
TGGTGTTCTAATGGCAATCCGTAAAGACATGGGCAAGTACGAGCCTATCAAAGGCGCTGACGACTGTGATCTCGCCAATGACACTGTCATCTCGGTTTCTAAGCACTGTA
ATCAGTCTGATGAAGATGTTGAGATGGCTCTTCGTGATTCCCATTCGAGATCTCCTCTTCCTCTTCACAATGCCAATCCTCTCACCGCTGCTGCTGCTTCTAAAATCGAC
GACGCCCAATTTTCCTCTTCTGCTAGGCCCATTCACCGATCTTCCGATCAACGGCAACGCCTTGTTTCCCTCGATGTATTTCGCGGCATCACGGTTGCGCTAATGATAGT
GGTGGACTATGCTGGTGGGGTTATGCCTGCAATAAATCATTCACCATGGAATGGGTTAACACTGGCAGATCTTGTGATGCCATTTTTCCTATTCATTGTTGGAGTTTCAC
TTGCCCTTGCTTACAAGAAAATTCCAAGCCGAGGCATTGCAACTCAGAAGGCTGTGTTACGGACGTTGAAACTTTTGTTCTTAGGCCTCTTTCTTCAAGGTGGGTTTCTC
CATGGCCTAAACAATTTAACTTATGGAGTGGATATCCAGCAAATTAGATGGATGGGAATCTTACAGAGAATTGCAATAGCATATTTTCTTGCAGCACTGTGTGAGATATG
GCTAAAGGGCAGTGATTATGTGAATTCAGAAACTGCATTGCGGAGAAAGTATCAATTACAGCTAATTGTGGCCGTCATCCTCACAACGTTATATCTTGCCCTGTCATACG
GATTGTACGTTCCTGATTGGGAGTATCAAGTTCCAAGTCAAACTAATTCCCAAATGGCTTCTCTAAAGATATTTTCTGTGAAATGTGGGATACGTGGTGACACCGGACCT
GGCTGCAATGCTGTGGGAATGATAGATCGTAAGATTTTTGGTATTCAACATCTGTATAAAAGACCTATTTATGCACGGTCTGAGCAATGCAGCATTAATGCACCAGACTA
TGGTCCATTGCCTCCTAATGCTCCTTCTTGGTGTCAAGCCCCTTTTGATCCAGAAGGGCTTTTAAGCACAGTGATGGCTGTTGTGACCTGCTTGGTTGGCTTGCATTATG
GGCACATCATTGTCCATTTCAAAGATCATCGAGACAGAATGCTTCATTGGATCATCCCCTCGTCGTGTCTGATTGTGCTGGCCATTGGCTTAGACTTCTTAGGGATGCAT
ATAAATAAGGTTCTTTATACAGTTAGTTACATGAGTGTCACTGCTGGTGCAGCCGGTCTTCTCTTCACTGGGATATACTTGATGGTTGATGTGTACAGATGGAGGCGCAT
GAATGTGGTGATGGAGTGGATGGGAAAGCATGCATTGGTGATATACGTTCTCGCTGCCTGCAATGTGCTGCCTGTGGTTCTGCAAGGCTTCTATTGGGGGCAGCCTCGAA
ACAACATCCTGAGGCTAATTGGAATTCCAACGTGAAGGAAAGTAGCCTTTTGAGAGTCCAAATGCTACTTTGGAATATATAGACTAGACCACATAGCCCCATGAGATGTC
CATTGGTGGTTGATTGATATTTCCTATTAAAAGAGGAAGGAATAATAAGATAGAGAGGAGGTATATGCTTGCTGGCTCGATCACTCATTCGGCTCATCCGTTATAGGAGA
GATTAGTTTTAGTTTAGTTTTGTGTAAAATATTTAGAAGGAATGGAATGAAAAGGAAGTTATATATTATTTCGTACTCAACTCAACTCATTGTATACCAAATTCCTACGC
CTGCCCATATATAATTTACAAAAGCAAAACTGCGTGCTAGCTTTAGAAGATGATGATACTGATGACCTTAATCTTTCCTCGATTTATGCAGGAAAGTTATATGATATCAT
TGATTGATTGAATGAATGAATTCTCTTTCCCTTTCAA
Protein sequenceShow/hide protein sequence
MAIRKDMGKYEPIKGADDCDLANDTVISVSKHCNQSDEDVEMALRDSHSRSPLPLHNANPLTAAAASKIDDAQFSSSARPIHRSSDQRQRLVSLDVFRGITVALMIVVDY
AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGLNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKG
SDYVNSETALRRKYQLQLIVAVILTTLYLALSYGLYVPDWEYQVPSQTNSQMASLKIFSVKCGIRGDTGPGCNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPL
PPNAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYRWRRMNVV
MEWMGKHALVIYVLAACNVLPVVLQGFYWGQPRNNILRLIGIPT