; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G016660 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G016660
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGlcNAc kinase
Genome locationCG_Chr11:29930564..29935163
RNA-Seq ExpressionClCG11G016660
SyntenyClCG11G016660
Gene Ontology termsGO:0046835 - carbohydrate phosphorylation (biological process)
GO:0045127 - N-acetylglucosamine kinase activity (molecular function)
InterPro domainsIPR002731 - ATPase, BadF/BadG/BcrA/BcrD type
IPR043129 - ATPase, nucleotide binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464652.1 PREDICTED: N-acetyl-D-glucosamine kinase-like [Cucumis melo]2.3e-18683.62Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPS+SCP+LARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRI DWLRDIFPCHVNLYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGP+TKLTYSILKTLDLSSPDELIG                WTYADPSWARIAALV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PVV+ACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPG+LP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEGI
Subjt:  SKDYQQEGI

XP_022936045.1 N-acetyl-D-glucosamine kinase-like [Cucurbita moschata]1.2e-18281.91Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPR +SPSM+CP+LARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKS SIRSSVRAVCLAVSGVNHPTDQ+RIL+WLRDIFPCHVNLYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGP+T LTYSILKTLDLSSPDELIG                WTYADPSWARIAALV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PV++ACAEAGDEVA+NILLDSVEELA SVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEGI
Subjt:  SKDYQQEGI

XP_022976806.1 N-acetyl-D-glucosamine kinase-like [Cucurbita maxima]2.6e-18281.66Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCI+LSDPR +SPSM+CP+LARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKS SIRSSVRAVCLAVSGVNHPTDQ+RIL+WLRDIFPCHVNLYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGP+T LTYSILKTLDLSSPDELIG                WTYADPSWARIAALV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PV++ACAEAGDEVA+NILLDSVEELA SVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEGI
Subjt:  SKDYQQEGI

XP_031739612.1 LOW QUALITY PROTEIN: N-acetyl-D-glucosamine kinase [Cucumis sativus]3.3e-18583.13Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRN ELWDFEHEILGGDDIILGIDGGTTSTVCVCI LSDPRVVSPSMSCP+LARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDG GP+TKLTYSILKTLDLSSPDELIG                WTYADPSWARIAALV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PVV+ACAEAGDEVANNILLDSVEELALSV+AVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPG+LP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEGI
Subjt:  SKDYQQEGI

XP_038900118.1 N-acetyl-D-glucosamine kinase-like isoform X1 [Benincasa hispida]3.1e-18382.4Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSD RVVSPSMSCP+LARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKSGSIRSSVRAVCLA+SGVNHPTD+QRILDWLRDIFPCHVNLYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGR+ARAAGAGPILGDWGSGYGIAAQALTAIIRA+DGRGP TKLTYSILKTLDLSSPDELIG                WTYADPSWARIA+LV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PVV+ACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDG EAFPLVMVGGVLEAKRRWDIAK+VINSISKEYPGVLP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEGI
Subjt:  SKDYQQEGI

TrEMBL top hitse value%identityAlignment
A0A1S4E5H4 GlcNAc kinase1.1e-18683.62Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPS+SCP+LARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRI DWLRDIFPCHVNLYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGP+TKLTYSILKTLDLSSPDELIG                WTYADPSWARIAALV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PVV+ACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPG+LP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEGI
Subjt:  SKDYQQEGI

A0A5A7V6Z0 GlcNAc kinase1.1e-18683.62Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPS+SCP+LARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRI DWLRDIFPCHVNLYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGP+TKLTYSILKTLDLSSPDELIG                WTYADPSWARIAALV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PVV+ACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPG+LP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEGI
Subjt:  SKDYQQEGI

A0A6J1DBA3 GlcNAc kinase1.2e-18081.42Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRNGELWDFEHEILGGDD+ILGIDGGTTSTVCV IALS PR VSPSMSCPILARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRIL+WLRDIFPCHV LYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGREARAAGAGP LGDWGSGYGIAAQALTAIIRAHDGRGP TKLTYSILKTL LSSPDELIG                WTYADPSW+RIAALV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PVV+ACAE GDEVANNILL+SVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAK+RWDIAKKVINSISKEYPGVLP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEG+
Subjt:  SKDYQQEGI

A0A6J1F6E8 GlcNAc kinase5.7e-18381.91Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPR +SPSM+CP+LARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKS SIRSSVRAVCLAVSGVNHPTDQ+RIL+WLRDIFPCHVNLYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGP+T LTYSILKTLDLSSPDELIG                WTYADPSWARIAALV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PV++ACAEAGDEVA+NILLDSVEELA SVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEGI
Subjt:  SKDYQQEGI

A0A6J1IGQ9 GlcNAc kinase1.3e-18281.66Show/hide
Query:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS
        MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCI+LSDPR +SPSM+CP+LARVVGGCSNHNSVG                               
Subjt:  MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMS

Query:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT
                    ETAARETLEQVMAEALSKS SIRSSVRAVCLAVSGVNHPTDQ+RIL+WLRDIFPCHVNLYVQNDAVAALASGT+GKLHGCVLIAGTGT
Subjt:  LNGKNIHSLSSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGT

Query:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV
        IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGP+T LTYSILKTLDLSSPDELIG                WTYADPSWARIAALV
Subjt:  IAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALV

Query:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL
        PV++ACAEAGDEVA+NILLDSVEELA SVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLP+WPKVEPALGAALLAWNFL
Subjt:  PVVIACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFL

Query:  SKDYQQEGI
        SKDYQQEGI
Subjt:  SKDYQQEGI

SwissProt top hitse value%identityAlignment
P81799 N-acetyl-D-glucosamine kinase4.0e-0824.02Show/hide
Query:  ETLEQVMAEALSKSG-SIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFP-CHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAA
        E + +++  A  K+G      +R++ L++SG       + +++ LRD FP    + ++  DA  ++A+ T     G VLI+GTG+       DG E+   
Subjt:  ETLEQVMAEALSKSG-SIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFP-CHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAA

Query:  GAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPD--ELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEAGDEV
        G G ++GD GS Y IA QA+              K+ +  +  L+ +  D   +  + +  F + ++   +   Y D   ++ A     +   A+ GD +
Subjt:  GAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPD--ELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEAGDEV

Query:  ANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKK
        +  I   + E L   V AV+  +      G+   P++ VG V ++   W++ K+
Subjt:  ANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKK

Q3SZM9 N-acetyl-D-glucosamine kinase2.4e-0825.78Show/hide
Query:  ETLEQVMAEALSKSG-SIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLY-VQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAA
        E + +++  A  K+G      +R + L++SG +     + +++ LRD FP     Y +  DA  ++A+ T     G VLI+GTG+       DG E+   
Subjt:  ETLEQVMAEALSKSG-SIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLY-VQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAA

Query:  GAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIG----SNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEAGD
        G G ++GD GS Y IA QA+              K+ +  +  L+ +  D  IG    + +  F + ++   +   Y D   +R A     V   A+ GD
Subjt:  GAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIG----SNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEAGD

Query:  EVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKK
         ++  I   + E L   V AV+  +      G+   P++ VG V ++   W++ K+
Subjt:  EVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKK

Q54PM7 N-acetyl-D-glucosamine kinase2.8e-4131.51Show/hide
Query:  DIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMSLNGKNIHSLSSLETAARETL
        +I +GIDGG T T  V +  +             LAR    CSN++SVG                                           + A  E +
Subjt:  DIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMSLNGKNIHSLSSLETAARETL

Query:  EQV---MAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAAGA
        + V   + E ++   +   +V ++CL +SGV+   D+  +  W+ ++    +N  + NDA+ AL+SGT GKL G V+I GTG I+ GF  +G   R+ G 
Subjt:  EQV---MAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAAGA

Query:  GPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADP---SWARIAALVPVVIACAEAGDEVA
        GP+LGD+GSGY I    L  +++A D  GP T LT  +L+ L L+  ++LI                 W Y DP   SW + A L P+    A+ GDE++
Subjt:  GPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADP---SWARIAALVPVVIACAEAGDEVA

Query:  NNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWN
        N IL+D+   L   + +VI++LGL   D +E FPLV  GG +E  R+  ++  +   I + YP    +    +P++GAALLA N
Subjt:  NNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWN

Q97ML3 N-acetylmuramic acid/N-acetylglucosamine kinase2.3e-2733.33Show/hide
Query:  SSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDG
        SS +   +  L++++ E L K G       A+C+  +G +   D+  I D +R +      + V NDA  ALA G + K  G ++I+GTG+I YG  ++G
Subjt:  SSLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDG

Query:  REARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEA
        R AR+ G G I+GD GSGY I  +A+ A +++ D RG  T L   IL  L L S ++LI   YR+ +   +               IA+L  VV +    
Subjt:  REARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEA

Query:  GDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVL-EAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLA
        GD V+  IL ++  EL LSVKAV++ L +      +   L   GGV+      +D  +K +N     YP V  I  K + A GA ++A
Subjt:  GDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVL-EAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLA

Q9UJ70 N-acetyl-D-glucosamine kinase9.0e-0825Show/hide
Query:  ETLEQVMAEALSKSG-SIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLY-VQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAA
        E + +++  A  K+G      +R++ L++SG +     + +++ LRD FP     Y +  DA  ++A+ T     G VLI+GTG+       DG E+   
Subjt:  ETLEQVMAEALSKSG-SIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLY-VQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAA

Query:  GAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIG----SNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEAGD
        G G ++GD GS Y IA QA+              K+ +  +  L+ +  D  IG    + +  F + ++   +   Y D    R A     +   A+ GD
Subjt:  GAGPILGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIG----SNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEAGD

Query:  EVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKK
         ++  I   + E L   + AV+  +      G+   P++ VG V ++   W++ K+
Subjt:  EVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKK

Arabidopsis top hitse value%identityAlignment
AT1G30540.1 Actin-like ATPase superfamily protein4.4e-11959.21Show/hide
Query:  IILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMSLNGKNIHSLSSLETAARETLE
        +ILG+DGG TSTVCVC+         P    PIL R V GC+N NSVG                                           ETAAR++LE
Subjt:  IILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMSLNGKNIHSLSSLETAARETLE

Query:  QVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL
        QV++EAL +SG  +S VR VCL VSGVNHP+DQ++I +W+RD+FP HV +YVQNDA+ ALASGT+GKLHGCVLIAGTG IAYGF EDG+EARA+G GPIL
Subjt:  QVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL

Query:  GDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEAGDEVANNILLDS
        GDWGSGYGIAAQALTA+IRAHDGRGP T LT +ILK L LSSPDELIG                WTYADPSWARIAALVP V++CAEAGDE+++ IL+D+
Subjt:  GDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEAGDEVANNILLDS

Query:  VEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFLS
         E+LALSVKAV+QRLGL G+DG  +FP+VMVGGVL A ++WDI K+V   I++ +PG   I PKVEPA+GAALLA NFLS
Subjt:  VEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGGTGCAGAAATGGCGAACTGTGGGATTTTGAGCACGAGATTCTTGGCGGAGATGATATTATACTTGGAATCGACGGCGGTACCACTTCTACTGTTTGCGTTTG
TATCGCTCTCTCTGATCCTCGAGTTGTTTCTCCTTCAATGTCTTGTCCTATACTCGCTCGTGTTGTTGGTGGCTGCTCAAATCATAATAGTGTTGGCGTTGGGAAATTCC
TGCATGGAAGTATCAAATTTAGGTGTTTGGTGTCCGTGTTGGACTCTATTGTAAGCATACCTGATTCCTTCATCATGAGTTTGAATGGTAAAAATATTCACTCGCTGTCG
TCTCTAGAAACTGCTGCGAGGGAAACACTGGAGCAAGTTATGGCGGAGGCACTTTCAAAGTCTGGTTCAATTCGATCTTCAGTTCGAGCTGTCTGCCTAGCTGTTTCTGG
GGTCAACCATCCAACGGATCAACAAAGAATTTTGGATTGGCTTAGGGATATCTTTCCCTGCCACGTAAACCTATATGTTCAAAATGATGCTGTGGCGGCTCTTGCAAGTG
GCACTTTGGGAAAGCTTCATGGATGCGTTTTAATTGCTGGAACTGGAACAATTGCTTACGGATTCACAGAGGATGGACGAGAAGCTCGAGCTGCTGGTGCAGGACCAATC
TTAGGTGATTGGGGAAGTGGATATGGGATAGCTGCACAGGCGTTAACTGCAATAATTAGGGCTCATGATGGACGTGGTCCTTATACAAAGCTCACTTATAGCATTTTGAA
GACACTTGATCTTTCTTCTCCCGATGAACTAATAGGCTCTAACTATAGAGCTTTCTTGTTAGTAAATAAGTATTTGAGGATCTGGTGGACATATGCGGATCCATCCTGGG
CTCGAATTGCTGCTCTTGTTCCAGTTGTTATAGCATGTGCGGAGGCAGGTGATGAGGTTGCTAACAACATCCTCCTTGATTCAGTTGAAGAGTTGGCTTTGAGTGTGAAA
GCTGTTATTCAAAGACTCGGCCTAGCTGGTGAAGATGGACAAGAAGCTTTTCCGCTTGTTATGGTTGGTGGTGTACTCGAAGCCAAAAGGAGGTGGGATATAGCAAAAAA
GGTCATAAATTCCATATCCAAAGAGTATCCTGGGGTTCTTCCAATTTGGCCTAAGGTGGAACCTGCGCTTGGCGCAGCATTGCTGGCTTGGAATTTTTTGAGTAAGGATT
ATCAACAGGAAGGCATATAG
mRNA sequenceShow/hide mRNA sequence
AATCGATTTCTTTGGCAAATAAATTCAAGAAAAAAAAAAAAAGAAAAAAAAGAACTACGAACAGTTTCAATTCTTCTTCGAATTCTGTCTTCTCCTCCTTGTCGCTTCGT
GTTCGTCCCTATCCCTTGCTCTCGCTATCTCTTACTGTAATTTATTGTAAATTTTCTGTATATATGATCATGAAATCCTGGTGTTTTGACTGCGAGAGCCAATTTTTCTT
GTATTTTTTAACCTTTGATCCAATTCGTCACTGTTTTGCTCTTGATCGAGGCGGCTGGAGCCTGGAATAGAGATGAAGAGGTGCAGAAATGGCGAACTGTGGGATTTTGA
GCACGAGATTCTTGGCGGAGATGATATTATACTTGGAATCGACGGCGGTACCACTTCTACTGTTTGCGTTTGTATCGCTCTCTCTGATCCTCGAGTTGTTTCTCCTTCAA
TGTCTTGTCCTATACTCGCTCGTGTTGTTGGTGGCTGCTCAAATCATAATAGTGTTGGCGTTGGGAAATTCCTGCATGGAAGTATCAAATTTAGGTGTTTGGTGTCCGTG
TTGGACTCTATTGTAAGCATACCTGATTCCTTCATCATGAGTTTGAATGGTAAAAATATTCACTCGCTGTCGTCTCTAGAAACTGCTGCGAGGGAAACACTGGAGCAAGT
TATGGCGGAGGCACTTTCAAAGTCTGGTTCAATTCGATCTTCAGTTCGAGCTGTCTGCCTAGCTGTTTCTGGGGTCAACCATCCAACGGATCAACAAAGAATTTTGGATT
GGCTTAGGGATATCTTTCCCTGCCACGTAAACCTATATGTTCAAAATGATGCTGTGGCGGCTCTTGCAAGTGGCACTTTGGGAAAGCTTCATGGATGCGTTTTAATTGCT
GGAACTGGAACAATTGCTTACGGATTCACAGAGGATGGACGAGAAGCTCGAGCTGCTGGTGCAGGACCAATCTTAGGTGATTGGGGAAGTGGATATGGGATAGCTGCACA
GGCGTTAACTGCAATAATTAGGGCTCATGATGGACGTGGTCCTTATACAAAGCTCACTTATAGCATTTTGAAGACACTTGATCTTTCTTCTCCCGATGAACTAATAGGCT
CTAACTATAGAGCTTTCTTGTTAGTAAATAAGTATTTGAGGATCTGGTGGACATATGCGGATCCATCCTGGGCTCGAATTGCTGCTCTTGTTCCAGTTGTTATAGCATGT
GCGGAGGCAGGTGATGAGGTTGCTAACAACATCCTCCTTGATTCAGTTGAAGAGTTGGCTTTGAGTGTGAAAGCTGTTATTCAAAGACTCGGCCTAGCTGGTGAAGATGG
ACAAGAAGCTTTTCCGCTTGTTATGGTTGGTGGTGTACTCGAAGCCAAAAGGAGGTGGGATATAGCAAAAAAGGTCATAAATTCCATATCCAAAGAGTATCCTGGGGTTC
TTCCAATTTGGCCTAAGGTGGAACCTGCGCTTGGCGCAGCATTGCTGGCTTGGAATTTTTTGAGTAAGGATTATCAACAGGAAGGCATATAGAAGGTATTCATTAAGTTG
ATGAGAAAGAGCCGATGAACATGAACTGTACAGTTTCATTGTCAAACCAAACTTAATTAGCAATGTTGGAAAGATAACGTAGAAACTTGAAGTAGTCTTAGCTTGTATTA
AGGTCTTAAGGTCTTTGTTGTTTTTCATCATTGTTTTAGTTCCTTCATGTTAAGGTAATATATTTTTTGTTTGTATGAAGCACGCATTGTGCTTTTATTGCCATGTAGTT
TTATTTAAGTTATGGTCAGCCTTGTGGATTACGGTGATCCTATCACATGTCTATGTTGTAAGCTTTGTATGGTGGGAAAATCCAGTTGAATGCCAGCATTTTTGGTCTTC
AAAAGTCAAAACATTGTGATAAAGTTGGGAGGGAACACGTGTAAATCAATATTAATTAGGGAGAAAAAAACTTTGCAAATATTTGAATCCAAGATCTCTTGAACCATCTA
CTTTAATACTATGTTAAAC
Protein sequenceShow/hide protein sequence
MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSMSCPILARVVGGCSNHNSVGVGKFLHGSIKFRCLVSVLDSIVSIPDSFIMSLNGKNIHSLS
SLETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRDIFPCHVNLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPI
LGDWGSGYGIAAQALTAIIRAHDGRGPYTKLTYSILKTLDLSSPDELIGSNYRAFLLVNKYLRIWWTYADPSWARIAALVPVVIACAEAGDEVANNILLDSVEELALSVK
AVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVLPIWPKVEPALGAALLAWNFLSKDYQQEGI