; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G00470 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G00470
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationClcChr09:390661..393710
RNA-Seq ExpressionClc09G00470
SyntenyClc09G00470
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141671.1 glycosyltransferase BC10 [Cucumis sativus]8.3e-21893.62Show/hide
Query:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN
        MQ+RVVGIEEGKASATATAIATSTRTNNPIKT PFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNV PVAQSIIRPCLEEPASIERWI PPSSL+HTMN
Subjt:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN

Query:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA
        DAELLWRASF+P VK YPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHE+ YSIYIHPMP+YVADFPPSSVFYGRQIPSK+AEWG MSMCDAERRLLANA
Subjt:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA

Query:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
        LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAP VNL NWRKGSQWFE+NRELAVK+VEDTVYYPIFK+FCKPPCY
Subjt:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY

Query:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        VDEHYFQTMLSIK PHLLANRSFTFVDWSRGGAHPATFGEADIE +FFKKLLESRTCLYNNQPS LCFLFARKFAP ALGRLLNVSS V+GF
Subjt:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

XP_008462376.1 PREDICTED: uncharacterized protein LOC103500748 [Cucumis melo]2.0e-21693.88Show/hide
Query:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN
        MQ+RVVGIEEGKASATATA ATSTRTNNPIKT PFRFLQLFFLFLLFVLGISLASLHTVKYFGGPN  PVA SIIRPC EE ASIERWI PPS L+H MN
Subjt:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN

Query:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA
        DAELLWRASFVP VK YPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSK+AEWGMMSMCDAERRLLANA
Subjt:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA

Query:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
        LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPG IGRGRYNESMAPTVNL NWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
Subjt:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY

Query:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        VDEHYFQTMLSIK PHLLANRSFTFVDWSRGGAHPATFGEADI+ EFFKK+LESRTCLYNNQPS LCFLFARKFAP ALGRLLNVSSEV+GF
Subjt:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

XP_022153218.1 uncharacterized protein LOC111020764 [Momordica charantia]7.5e-20386.99Show/hide
Query:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN
        MQ+RVVGIEEGKASAT    ATSTR NNPIK FPFRFLQLFFLFLLFVLGISL SLHTVKYFGGPNV PVAQSIIRPC EEPASIERWI+PPSSLLH M+
Subjt:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN

Query:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA
        D ELLWRASFVP VKKYPFKRVRK+AFMFLTKGPLP+APLWERF KGHE LYSIY+H MPYYVADFPPSSVF+GRQIPS++AEWG +SMCDAERRLLANA
Subjt:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA

Query:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
        LLD ANEWFILLSESCIPLHNFS+IYHYISRSRYSFM SFDEPGPIGRGRYNESMAP +NL NWRKG QWFEVNRELAVKIVEDTVYYP FK+FC PPCY
Subjt:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY

Query:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        VDEHYFQTMLSIK PHLLANRS T VDWSRGGAHPATFGEADIEGEFF++L +  +CLYN+QPS LC+LFARKFAPNAL RLLN+SSEVMGF
Subjt:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

XP_023000362.1 uncharacterized protein LOC111494618 isoform X1 [Cucurbita maxima]8.3e-20285.79Show/hide
Query:  MQSRVVGIEEGKASATATAIAT-STRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAP-VAQSIIRPCLEEPASIERWIRPPSSLLHT
        MQ+RVVG+EEGKASAT  AIA  STRTN+P+K FPFRFLQLFFLFLL  LGISLASLHTVKYFG PNVAP VA++IIRPCLEEP SIERWIRPPSSLLHT
Subjt:  MQSRVVGIEEGKASATATAIAT-STRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAP-VAQSIIRPCLEEPASIERWIRPPSSLLHT

Query:  MNDAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLA
        MNDAELLWRASFVP VKKYPFKRVRKIAFMFLTKGPLPL+PLWERF KGH+ELYSIYIH +P+YVADFPPSSVFY R+IPSK+AEWG MSMCDAERRLLA
Subjt:  MNDAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLA

Query:  NALLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPP
        NALLD+ NEWFILLSESCIPLHNFS+IYHY+SRSR+SF+S+FDEPG IGRGRYNES+AP VNL NWRKGSQWFEVNRELAVK+VEDTVYYP FK+FCKPP
Subjt:  NALLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPP

Query:  CYVDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        CYVDEHYFQT+LSIK PHL+ANRS TFVDWSRGGAHPA FG+ADI+G+FF KL ESRTC+YNNQPSALCFLFARKF PNALGRLLN+SSE+ GF
Subjt:  CYVDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

XP_038896978.1 glycosyltransferase BC10-like [Benincasa hispida]4.6e-21694.13Show/hide
Query:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN
        MQ+RVVGIEEGK SATA AIA STRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGG NV PVAQSIIRPCLEEP SIERWIRPPSSLLHTMN
Subjt:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN

Query:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA
        D ELLWRASFVP VKKYPFKRVRKIAFMFLTKGPLPLAP WERFLKGHEE YSIYIH MP YVADF PSSVFYGRQIPSK+AEWGMMSMCDAERRLLANA
Subjt:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA

Query:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
        LLD+ANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNL NWRKGSQWFEVNRELAVKIVED VYY  FKEFC PPCY
Subjt:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY

Query:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        VDEHYFQTMLSIK PHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPS LCFLFARKFAPNALGRLLNVSSEVMGF
Subjt:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

TrEMBL top hitse value%identityAlignment
A0A0A0KCR1 Uncharacterized protein4.0e-21893.62Show/hide
Query:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN
        MQ+RVVGIEEGKASATATAIATSTRTNNPIKT PFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNV PVAQSIIRPCLEEPASIERWI PPSSL+HTMN
Subjt:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN

Query:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA
        DAELLWRASF+P VK YPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHE+ YSIYIHPMP+YVADFPPSSVFYGRQIPSK+AEWG MSMCDAERRLLANA
Subjt:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA

Query:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
        LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAP VNL NWRKGSQWFE+NRELAVK+VEDTVYYPIFK+FCKPPCY
Subjt:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY

Query:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        VDEHYFQTMLSIK PHLLANRSFTFVDWSRGGAHPATFGEADIE +FFKKLLESRTCLYNNQPS LCFLFARKFAP ALGRLLNVSS V+GF
Subjt:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

A0A1S3CGU9 uncharacterized protein LOC1035007489.9e-21793.88Show/hide
Query:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN
        MQ+RVVGIEEGKASATATA ATSTRTNNPIKT PFRFLQLFFLFLLFVLGISLASLHTVKYFGGPN  PVA SIIRPC EE ASIERWI PPS L+H MN
Subjt:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN

Query:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA
        DAELLWRASFVP VK YPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSK+AEWGMMSMCDAERRLLANA
Subjt:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA

Query:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
        LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPG IGRGRYNESMAPTVNL NWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
Subjt:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY

Query:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        VDEHYFQTMLSIK PHLLANRSFTFVDWSRGGAHPATFGEADI+ EFFKK+LESRTCLYNNQPS LCFLFARKFAP ALGRLLNVSSEV+GF
Subjt:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

A0A5A7UWG0 Uncharacterized protein9.9e-21793.88Show/hide
Query:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN
        MQ+RVVGIEEGKASATATA ATSTRTNNPIKT PFRFLQLFFLFLLFVLGISLASLHTVKYFGGPN  PVA SIIRPC EE ASIERWI PPS L+H MN
Subjt:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN

Query:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA
        DAELLWRASFVP VK YPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSK+AEWGMMSMCDAERRLLANA
Subjt:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA

Query:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
        LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPG IGRGRYNESMAPTVNL NWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
Subjt:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY

Query:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        VDEHYFQTMLSIK PHLLANRSFTFVDWSRGGAHPATFGEADI+ EFFKK+LESRTCLYNNQPS LCFLFARKFAP ALGRLLNVSSEV+GF
Subjt:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

A0A6J1DGW9 uncharacterized protein LOC1110207643.7e-20386.99Show/hide
Query:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN
        MQ+RVVGIEEGKASAT    ATSTR NNPIK FPFRFLQLFFLFLLFVLGISL SLHTVKYFGGPNV PVAQSIIRPC EEPASIERWI+PPSSLLH M+
Subjt:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMN

Query:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA
        D ELLWRASFVP VKKYPFKRVRK+AFMFLTKGPLP+APLWERF KGHE LYSIY+H MPYYVADFPPSSVF+GRQIPS++AEWG +SMCDAERRLLANA
Subjt:  DAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANA

Query:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY
        LLD ANEWFILLSESCIPLHNFS+IYHYISRSRYSFM SFDEPGPIGRGRYNESMAP +NL NWRKG QWFEVNRELAVKIVEDTVYYP FK+FC PPCY
Subjt:  LLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCY

Query:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        VDEHYFQTMLSIK PHLLANRS T VDWSRGGAHPATFGEADIEGEFF++L +  +CLYN+QPS LC+LFARKFAPNAL RLLN+SSEVMGF
Subjt:  VDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

A0A6J1KME6 uncharacterized protein LOC111494618 isoform X14.0e-20285.79Show/hide
Query:  MQSRVVGIEEGKASATATAIAT-STRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAP-VAQSIIRPCLEEPASIERWIRPPSSLLHT
        MQ+RVVG+EEGKASAT  AIA  STRTN+P+K FPFRFLQLFFLFLL  LGISLASLHTVKYFG PNVAP VA++IIRPCLEEP SIERWIRPPSSLLHT
Subjt:  MQSRVVGIEEGKASATATAIAT-STRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAP-VAQSIIRPCLEEPASIERWIRPPSSLLHT

Query:  MNDAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLA
        MNDAELLWRASFVP VKKYPFKRVRKIAFMFLTKGPLPL+PLWERF KGH+ELYSIYIH +P+YVADFPPSSVFY R+IPSK+AEWG MSMCDAERRLLA
Subjt:  MNDAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLA

Query:  NALLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPP
        NALLD+ NEWFILLSESCIPLHNFS+IYHY+SRSR+SF+S+FDEPG IGRGRYNES+AP VNL NWRKGSQWFEVNRELAVK+VEDTVYYP FK+FCKPP
Subjt:  NALLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPP

Query:  CYVDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        CYVDEHYFQT+LSIK PHL+ANRS TFVDWSRGGAHPA FG+ADI+G+FF KL ESRTC+YNNQPSALCFLFARKF PNALGRLLN+SSE+ GF
Subjt:  CYVDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC106.5e-4035.6Show/hide
Query:  KIAFMFLTKGPLPLAPLWERFLKGHEE-LYSIYIHPMPYYVAD--FPPSSVFYGRQIPSKV-AEWGMMSMCDAERRLLANALLDIANEWFILLSESCIPL
        ++AF+F+ +  LPL  +W+ F +G +E  +SI++H  P +V       S  FY RQ+ + V  +WG  SM +AER LLA+AL D  NE F+ +S+SC+PL
Subjt:  KIAFMFLTKGPLPLAPLWERFLKGHEE-LYSIYIHPMPYYVAD--FPPSSVFYGRQIPSKV-AEWGMMSMCDAERRLLANALLDIANEWFILLSESCIPL

Query:  HNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCK----------------------P
        +NF+  Y YI  S  SF+ SF +      GRYN  M P + + NWRKGSQW  + R+ A  +VED    P F++ C+                       
Subjt:  HNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCK----------------------P

Query:  PCYVDEHYFQTMLSIKA-PHLLANRSFTFVDW--------SRGGAHPATF-----------GEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPN
         C  DEHY QT+L+       L  RS T   W         R G HP T+              DI+  +++       C  N +P A CFLFARKF   
Subjt:  PCYVDEHYFQTMLSIKA-PHLLANRSFTFVDW--------SRGGAHPATF-----------GEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPN

Query:  ALGRLLNVS
        A  +LL++S
Subjt:  ALGRLLNVS

Arabidopsis top hitse value%identityAlignment
AT1G51770.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein5.1e-13360.66Show/hide
Query:  TNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQ-SIIRPCLEEPASIERWIRPPSSLLHTMNDAELLWRASFVPHVKKYPFKRVRK
        TN      P R LQ+  LFL+  LGIS+ S+H +K+     + PVA  +++     E  +++ +IRPPS++ HTMND+ELLWRAS  P    YPF+RV K
Subjt:  TNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQ-SIIRPCLEEPASIERWIRPPSSLLHTMNDAELLWRASFVPHVKKYPFKRVRK

Query:  IAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANALLDIANEWFILLSESCIPLHNFSI
        +AFMFL KGPLP APLWE+F KGHE LYSIY+H +P Y +DF  SSVFY R IPS+   WG MSM +AERRLLANALLDI+NEWF+LLSESCIPL  FS 
Subjt:  IAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANALLDIANEWFILLSESCIPLHNFSI

Query:  IYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCYVDEHYFQTMLSIKAPHLLANRSFT
        IY Y+S SRYSFM + DE GP GRGRY   M P + L+ WRKGSQWFE+NR+LAV+IV+DT YYP FKEFC+PPCYVDEHYF TMLS+K   LLANR+ T
Subjt:  IYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCYVDEHYFQTMLSIKAPHLLANRSFT

Query:  FVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVM
        + DWSRGGAHPATFG+AD+   F KKL  +++CLYN+  S +C+LFARKFAP+AL  LL ++ +++
Subjt:  FVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVM

AT1G51770.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.7e-11554.92Show/hide
Query:  TNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQ-SIIRPCLEEPASIERWIRPPSSLLHTMNDAELLWRASFVPHVKKYPFKRVRK
        TN      P R LQ+  LFL+  LGIS+ S+H +K+     + PVA  +++     E  +++ +IRPPS++ HTMND+ELLWRAS  P    YPF+RV K
Subjt:  TNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQ-SIIRPCLEEPASIERWIRPPSSLLHTMNDAELLWRASFVPHVKKYPFKRVRK

Query:  IAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANALLDIANEWFILLSESCIPLHNFSI
        +AFMFL KGPLP APLWE+F KGHE LYSIY+H +P Y +DF  SSVFY R IPS+   WG MSM +AERRLLANALLDI+NE                 
Subjt:  IAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANALLDIANEWFILLSESCIPLHNFSI

Query:  IYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCYVDEHYFQTMLSIKAPHLLANRSFT
                   FM + DE GP GRGRY   M P + L+ WRKGSQWFE+NR+LAV+IV+DT YYP FKEFC+PPCYVDEHYF TMLS+K   LLANR+ T
Subjt:  IYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCYVDEHYFQTMLSIKAPHLLANRSFT

Query:  FVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVM
        + DWSRGGAHPATFG+AD+   F KKL  +++CLYN+  S +C+LFARKFAP+AL  LL ++ +++
Subjt:  FVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVM

AT3G21310.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.2e-13961.6Show/hide
Query:  VGIEEGKASATATAIATSTRTNNPIKT-FPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMNDAEL
        +G+EEG     A+  A ++R  N +K   P R LQ+F LF + VLGIS+ S+H +KY     +  +A S +    +E  ++E  I+PP +  H+MND+EL
Subjt:  VGIEEGKASATATAIATSTRTNNPIKT-FPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMNDAEL

Query:  LWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANALLDI
        LWRAS  P +  YPFKRV K+AFMFLTKGPLP APLWERF KGHE  YSIY+H +P Y +DFP SSVFY RQIPS+   WG MSMCDAERRLLANALLDI
Subjt:  LWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANALLDI

Query:  ANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCYVDEH
        +NEWF+LLSE+CIPL  F+ +Y Y+SRSRYSFM S DE GP GRGRY+ +M P V+L  WRKGSQWFE+NR LAV IVED VYY  FKEFC+PPCYVDEH
Subjt:  ANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCYVDEH

Query:  YFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        YF TMLSI  P  LANR+ T+ DWSRGGAHPATFG+ADI  +F KKL   + C YN+QPS +C+LFARKFAP+AL  LL ++ +V+GF
Subjt:  YFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

AT5G11730.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.9e-14360.81Show/hide
Query:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLE-EPASIERWIRPPSSLLHTM
        MQ+R+V +EEGK +         T  +   K FP + L L  LFL F + + + S+ T+KY G  +V     S   PC E EP S+ +WI+PP+ L+H M
Subjt:  MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLE-EPASIERWIRPPSSLLHTM

Query:  NDAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLAN
        +D ELLWRASF P  K+YPFKRV K+AFMFLTKGPLPLA LWERFLKGH+ LYS+Y+HP P + A FP SSVF+ RQIPS+VAEWG MSMCDAE+RLLAN
Subjt:  NDAELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLAN

Query:  ALLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPC
        ALLD++NEWF+L+SESCIPL+NF+ IY Y+SRS++SFM +FD+PGP GRGRYN +M P V L  WRKGSQWFEVNR+LA  IV+DT+YYP FKEFC+P C
Subjt:  ALLDIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPC

Query:  YVDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF
        YVDEHYF TML+I+ P +LANRS T+VDWSRGG HPATFG +DI   FF K+ + R C YN + +++C+LFARKFAP+AL  LL+++ +++GF
Subjt:  YVDEHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF

AT5G25970.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.7e-13459.54Show/hide
Query:  SRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMNDA
        SRV+ +EEGK       + TS+RT    K FP++ L L   FL F + +   S+ T+KY+G  +V     S   PC E+   +++WI+P   L+H M+D 
Subjt:  SRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMNDA

Query:  ELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANALL
        ELLW ASF+P  K+YPF RV KIAFMFLT GPLPLAPLWER LKGHE+LYS+YIH      A FP SSVFY R IPS+VAEWG M+MCDAERRLLANALL
Subjt:  ELLWRASFVPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANALL

Query:  DIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCYVD
        DI+NEWF+LLSESCIPL NF+ IY Y+++S +SFM SFD+PG  GRGRY+ +MAP V +  WRKGSQWFE+NRELAV IV+DT+YYP FKEFC+P CYVD
Subjt:  DIANEWFILLSESCIPLHNFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCYVD

Query:  EHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVM
        EHYF TML+I+ P  LANRS T+VDWSRGGAHPATFG  DI  EFF ++L+   C YN   +++C+LFARKF+P+AL  L+ ++ +++
Subjt:  EHYFQTMLSIKAPHLLANRSFTFVDWSRGGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTCCAGAGTGGTGGGGATTGAGGAAGGCAAGGCCTCTGCAACTGCCACTGCCATTGCCACTAGCACTAGAACAAACAACCCAATTAAAACCTTTCCTTTTAGGTT
TCTTCAGCTCTTCTTTCTGTTTCTGCTTTTCGTCCTTGGTATATCACTTGCTAGTTTGCACACTGTTAAGTATTTTGGAGGCCCAAATGTGGCACCTGTTGCTCAGTCCA
TAATTCGACCTTGTCTTGAAGAGCCAGCCAGTATAGAAAGATGGATCAGACCTCCATCGAGTCTTCTGCACACCATGAACGACGCCGAGCTCCTCTGGAGGGCTTCTTTC
GTTCCCCATGTGAAGAAGTATCCATTCAAGAGAGTTCGCAAAATTGCTTTCATGTTCTTGACCAAGGGACCACTGCCCTTGGCTCCTCTTTGGGAGCGGTTCTTGAAGGG
ACATGAGGAACTTTATTCTATATATATTCACCCCATGCCTTATTATGTTGCTGATTTTCCACCCTCATCGGTATTTTATGGACGACAAATCCCCAGTAAGGTTGCAGAAT
GGGGAATGATGAGTATGTGTGATGCTGAGAGGAGACTGCTTGCCAATGCACTTCTTGATATAGCAAATGAATGGTTCATTCTTCTCTCCGAGTCATGCATTCCTCTCCAC
AACTTCAGCATCATTTATCACTATATCTCCCGATCCCGTTACAGTTTCATGAGTTCATTTGATGAACCAGGACCCATTGGTAGGGGACGCTACAACGAAAGTATGGCACC
CACGGTTAACCTCGCCAATTGGCGGAAGGGATCTCAGTGGTTTGAAGTCAATAGAGAACTTGCAGTGAAGATAGTTGAAGACACAGTTTACTACCCTATATTTAAAGAGT
TCTGCAAGCCACCATGTTATGTTGATGAACATTACTTCCAGACTATGTTGAGCATCAAAGCACCTCATCTCCTAGCAAACAGGAGTTTTACATTTGTTGACTGGTCGAGG
GGCGGTGCTCATCCTGCAACGTTCGGGGAGGCAGATATTGAGGGCGAATTCTTCAAGAAACTTCTTGAAAGTAGGACGTGCCTTTACAATAACCAGCCATCAGCACTCTG
TTTCCTATTTGCTAGGAAGTTCGCTCCAAATGCCTTGGGTCGTCTGTTAAATGTATCGTCCGAAGTTATGGGATTTTGA
mRNA sequenceShow/hide mRNA sequence
TTCACTTTGAAGTTTCGAGCGCTTCGCCCTCTCACTCTCGTCTCTCGTACGGCCATCTTCTTTCTTCATTTTGGAGTTAAGCCTTCAACAACCGCTTGTACTGTGAACGA
AAGACTGATCAATCGGATTTTCGATTGTGGCCTGGTTTGGAATGATGGATGAGTGAACCGATCTGGGCTATTTCTCGATTGTTTTCAACTTTAATTGGTGGAAGGTTAGA
GCCCATCCGGCACCGCCCCACATGATTTCATTTCCTTGGATGTGATATATCCGGAAGTGAATCGGGTTGGAAATTGGGGTAAAGATGCAGTCCAGAGTGGTGGGGATTGA
GGAAGGCAAGGCCTCTGCAACTGCCACTGCCATTGCCACTAGCACTAGAACAAACAACCCAATTAAAACCTTTCCTTTTAGGTTTCTTCAGCTCTTCTTTCTGTTTCTGC
TTTTCGTCCTTGGTATATCACTTGCTAGTTTGCACACTGTTAAGTATTTTGGAGGCCCAAATGTGGCACCTGTTGCTCAGTCCATAATTCGACCTTGTCTTGAAGAGCCA
GCCAGTATAGAAAGATGGATCAGACCTCCATCGAGTCTTCTGCACACCATGAACGACGCCGAGCTCCTCTGGAGGGCTTCTTTCGTTCCCCATGTGAAGAAGTATCCATT
CAAGAGAGTTCGCAAAATTGCTTTCATGTTCTTGACCAAGGGACCACTGCCCTTGGCTCCTCTTTGGGAGCGGTTCTTGAAGGGACATGAGGAACTTTATTCTATATATA
TTCACCCCATGCCTTATTATGTTGCTGATTTTCCACCCTCATCGGTATTTTATGGACGACAAATCCCCAGTAAGGTTGCAGAATGGGGAATGATGAGTATGTGTGATGCT
GAGAGGAGACTGCTTGCCAATGCACTTCTTGATATAGCAAATGAATGGTTCATTCTTCTCTCCGAGTCATGCATTCCTCTCCACAACTTCAGCATCATTTATCACTATAT
CTCCCGATCCCGTTACAGTTTCATGAGTTCATTTGATGAACCAGGACCCATTGGTAGGGGACGCTACAACGAAAGTATGGCACCCACGGTTAACCTCGCCAATTGGCGGA
AGGGATCTCAGTGGTTTGAAGTCAATAGAGAACTTGCAGTGAAGATAGTTGAAGACACAGTTTACTACCCTATATTTAAAGAGTTCTGCAAGCCACCATGTTATGTTGAT
GAACATTACTTCCAGACTATGTTGAGCATCAAAGCACCTCATCTCCTAGCAAACAGGAGTTTTACATTTGTTGACTGGTCGAGGGGCGGTGCTCATCCTGCAACGTTCGG
GGAGGCAGATATTGAGGGCGAATTCTTCAAGAAACTTCTTGAAAGTAGGACGTGCCTTTACAATAACCAGCCATCAGCACTCTGTTTCCTATTTGCTAGGAAGTTCGCTC
CAAATGCCTTGGGTCGTCTGTTAAATGTATCGTCCGAAGTTATGGGATTTTGATACAAGTTTGGTAGAACAGCTCATGATATTGGCTTAGTAGATACAATCTGCTGTATT
TTTCACTAGTTCTTACACGGAATTTCAAACATTAGATTTACACTTGTAGATGTATAGAAAATATATACATTTCTCAAACTTTAGGCTCCAGGATGATACATATCTTTGTT
AATTATTAGTCAAACAGCTCGAGGTATTTTCTCTCTGGACTTTGTTTTGTTAGAATGAGATTGCATACCTGCTAATATTGATATTATTAGCAAATATTCTT
Protein sequenceShow/hide protein sequence
MQSRVVGIEEGKASATATAIATSTRTNNPIKTFPFRFLQLFFLFLLFVLGISLASLHTVKYFGGPNVAPVAQSIIRPCLEEPASIERWIRPPSSLLHTMNDAELLWRASF
VPHVKKYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEELYSIYIHPMPYYVADFPPSSVFYGRQIPSKVAEWGMMSMCDAERRLLANALLDIANEWFILLSESCIPLH
NFSIIYHYISRSRYSFMSSFDEPGPIGRGRYNESMAPTVNLANWRKGSQWFEVNRELAVKIVEDTVYYPIFKEFCKPPCYVDEHYFQTMLSIKAPHLLANRSFTFVDWSR
GGAHPATFGEADIEGEFFKKLLESRTCLYNNQPSALCFLFARKFAPNALGRLLNVSSEVMGF