; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G10430 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G10430
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGlcNAc kinase
Genome locationClcChr07:25003948..25017596
RNA-Seq ExpressionClc07G10430
SyntenyClc07G10430
Gene Ontology termsGO:0046835 - carbohydrate phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0045127 - N-acetylglucosamine kinase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002731 - ATPase, BadF/BadG/BcrA/BcrD type
IPR010666 - Zinc finger, GRF-type
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043129 - ATPase, nucleotide binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK16416.1 N-acetyl-D-glucosamine kinase-like [Cucumis melo var. makuwa]3.7e-18491.51Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGS----------
        TAIIRAHDGRGPQTKLTNSIL+TLGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGS          
Subjt:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGS----------

Query:  DGKGSFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        +GKGSFPLVMVGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  DGKGSFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

XP_004147744.1 N-acetyl-D-glucosamine kinase [Cucumis sativus]2.4e-18391.83Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE +RE+S GT G   VG VILGIDGGTTST CVC+PFL P SLHLPD LPLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSG D 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLS+SGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+TLGLSS DELIGWTYADQSWARIAALVPAVV+CAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKD +QE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

XP_008451846.1 PREDICTED: N-acetyl-D-glucosamine kinase-like [Cucumis melo]3.6e-18794.37Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+TLGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

XP_023553557.1 N-acetyl-D-glucosamine kinase-like [Cucurbita pepo subsp. pepo]1.3e-18191.85Show/hide
Query:  MTKKHRNGEVSELEREMS---GGTGGVGGVILGIDGGTTSTVCVCVPFLQ-PQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSD
        MTKK+RNGE+ E EREMS   GG GGVGGVILGIDGGTTST+CVCVP L   QSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD
Subjt:  MTKKHRNGEVSELEREMS---GGTGGVGGVILGIDGGTTSTVCVCVPFLQ-PQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSD

Query:  RSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQA
        RSAV+AICLSVSGVNHPTDQQRILNW RD+FPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTG IA+GFTDDGREARAAGAGPILGDWGSGYGISAQA
Subjt:  RSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQA

Query:  LTAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLV
        LTAIIRAHDGRGPQTKLTNSIL+TLGLSS DELIGWTYAD SWARIAALVPAVVSCAEAGDE+ANNILQD+VKELALSV AVVQRLGL GSDGKGSFPLV
Subjt:  LTAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLV

Query:  MVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        MVGGV+EGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDS QE
Subjt:  MVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

XP_038898748.1 N-acetyl-D-glucosamine kinase [Benincasa hispida]1.1e-18895.18Show/hide
Query:  KKHRNGEVSELEREMSGGT---GGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA
        KKHRNGE+SE +REMSGGT   G VG VILGIDGGTTSTVCVCVPFLQPQSLHLPDPLP+LARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSDRSA
Subjt:  KKHRNGEVSELEREMSGGT---GGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA

Query:  VRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTA
        VRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTA
Subjt:  VRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTA

Query:  IIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVG
        IIRAHDGRGPQTKLTN+IL TLGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+AN ILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVG
Subjt:  IIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVG

Query:  GVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        GVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  GVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

TrEMBL top hitse value%identityAlignment
A0A0A0L083 GlcNAc kinase1.2e-18391.83Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE +RE+S GT G   VG VILGIDGGTTST CVC+PFL P SLHLPD LPLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSG D 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLS+SGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+TLGLSS DELIGWTYADQSWARIAALVPAVV+CAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKD +QE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

A0A1S3BRV5 GlcNAc kinase1.7e-18794.37Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+TLGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

A0A5A7V7G8 GlcNAc kinase1.7e-18794.37Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+TLGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

A0A5D3D163 GlcNAc kinase1.8e-18491.51Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGS----------
        TAIIRAHDGRGPQTKLTNSIL+TLGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGS          
Subjt:  TAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGS----------

Query:  DGKGSFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        +GKGSFPLVMVGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  DGKGSFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

A0A6J1JC39 GlcNAc kinase1.9e-18190.3Show/hide
Query:  MTKKHRNGEVSELEREMS---------GGTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALS
        MTKK+RNGE+ E EREMS         GG GGVGGVILGIDGGTTSTVCVCVP L  QSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQV+AEALS
Subjt:  MTKKHRNGEVSELEREMS---------GGTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALS

Query:  KSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYG
        KSGSDRSAV+AICLSVSGVNHPTDQQRILNW RD+FPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTG IA+GFTDDGREARAAGAGPILGDWGSGYG
Subjt:  KSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYG

Query:  ISAQALTAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKG
        ISAQALTAIIRAHDGRGPQT LTNSIL+TLGLSS DELIGWTYAD SWARIAALVPAVVSCAEAGDE+ANNILQD+VKELALSV AVVQRLG  GSDGKG
Subjt:  ISAQALTAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKG

Query:  SFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        SFPLVMVGGV+EGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDS QE
Subjt:  SFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

SwissProt top hitse value%identityAlignment
P81799 N-acetyl-D-glucosamine kinase7.1e-1325.52Show/hide
Query:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP
        G++GG T          + + L L +   +LA  +   +NH  +G     E + +++  A  K+G D    +R++ LS+SG       + ++   RD FP
Subjt:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP

Query:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILETL--GLSS
           + Y +  DAA ++A+ T     G VLI+GTG        DG E+   G G ++GD GS Y I+ QA+  +  + D           + + +      
Subjt:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILETL--GLSS

Query:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE
        PD L  +   Y D   ++ A     +   A+ GD ++  I + + + L   V AV+  +      G+   P++ VG V    K W + +E
Subjt:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE

Q3SZM9 N-acetyl-D-glucosamine kinase2.5e-1326.21Show/hide
Query:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP
        G++GG T          + + L L +   +LA  +   +NH  +G     E + +++  A  K+G D    +R + LS+SG +     + ++   RD FP
Subjt:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP

Query:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILETL--GLSS
           + Y +  DAA ++A+ T     G VLI+GTG        DG E+   G G ++GD GS Y I+ QA+  +  + D           + + +      
Subjt:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILETL--GLSS

Query:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE
        PD L  +   Y D   +R A     V   A+ GD ++  I + + + L   V AV+  +      G+   P++ VG V    K W + +E
Subjt:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE

Q54PM7 N-acetyl-D-glucosamine kinase2.1e-4936.56Show/hide
Query:  VILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQ-------VIAEALSKSGSDRSAVRAICLSVSGVNHPTDQQRI
        + +GIDGG T T  V V     +          LAR  + CSN++SVGE  A+  + +        + E ++   +    V +ICL +SGV+   D+  +
Subjt:  VILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQ-------VIAEALSKSGSDRSAVRAICLSVSGVNHPTDQQRI

Query:  LNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILE
         +W  ++    +   + NDA  AL+SGT G+L G V+I GTGCI+ GF  +G   R+ G GP+LGD+GSGY I    L  +++A D  GP+T LT  +LE
Subjt:  LNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILE

Query:  TLGLSSPDELIGWTY--ADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIAREVIN-
         L L+  ++LI W Y    QSW + A L P     A+ GDEI+N IL D+   L   + +V+++LGL   D +  FPLV  GG +E     GI  ++++ 
Subjt:  TLGLSSPDELIGWTY--ADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIAREVIN-

Query:  CISKDYPGVVPIWPKVEPAIGAALLAWNFLK
         I ++YP    +    +P++GAALLA N  K
Subjt:  CISKDYPGVVPIWPKVEPAIGAALLAWNFLK

Q97ML3 N-acetylmuramic acid/N-acetylglucosamine kinase4.5e-3134.07Show/hide
Query:  ILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIF
        ++GIDGG + T       ++  +L       +L  V  G SN NS  +   +  L+++I E L K G       AIC+  +G +   D+  I +  R + 
Subjt:  ILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIF

Query:  PSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILETLGLSSPD
            K+ V NDA  ALA G   R  G ++I+GTG I YG   +GR AR+ G G I+GD GSGY I  +A+ A +++ D RG +T L   IL+ L L S +
Subjt:  PSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILETLGLSSPD

Query:  ELIGWTY-ADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEG-NKGWGIAREVINCISKDYPGV
        +LI + Y +  +   IA+L   V S    GD ++  IL+++ +EL LSV AVV+ L +          L   GGV+   N  +   R+ +N    +YP V
Subjt:  ELIGWTY-ADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEG-NKGWGIAREVINCISKDYPGV

Query:  VPIWPKVEPAIGAALLA
          I  K + A GA ++A
Subjt:  VPIWPKVEPAIGAALLA

Q9UJ70 N-acetyl-D-glucosamine kinase4.2e-1325.52Show/hide
Query:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP
        G++GG T          + + L + +   +LA  +   +NH  +G     E + +++  A  K+G D    +R++ LS+SG +     + ++   RD FP
Subjt:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP

Query:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILETL--GLSS
           + Y +  DAA ++A+ T     G VLI+GTG        DG E+   G G ++GD GS Y I+ QA+  +  + D           + + +      
Subjt:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILETL--GLSS

Query:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE
        PD L  +   Y D    R A     +   A+ GD ++  I + + + L   + AV+  +      GK   P++ VG V    K W + +E
Subjt:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE

Arabidopsis top hitse value%identityAlignment
AT1G30540.1 Actin-like ATPase superfamily protein3.6e-13769.71Show/hide
Query:  MTKKHRNGEVSELEREMSG----GTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSD
        M   H NG + +LE +  G      G V GVILG+DGG TSTVCVCVPF        PDPLP+L R  AGC+N NSVGETAAR++LEQVI+EAL +SG D
Subjt:  MTKKHRNGEVSELEREMSG----GTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSD

Query:  RSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQA
        +S VR +CL VSGVNHP+DQ++I NW RD+FPSHVK+YV+NDA  ALASGTMG+L GCVLIAGTGCIAYGF +DG+EARA+G GPILGDWGSGYGI+AQA
Subjt:  RSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQA

Query:  LTAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLV
        LTA+IRAHDGRGPQT LT++IL+ LGLSSPDELIGWTYAD SWARIAALVP VVSCAEAGDEI++ IL D+ ++LALSV AVVQRLGLCG DG  SFP+V
Subjt:  LTAIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLV

Query:  MVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFL
        MVGGVL  N+ W I +EV   I++ +PG   I PKVEPA+GAALLA NFL
Subjt:  MVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFL

AT1G54930.1 GRF zinc finger / Zinc knuckle protein5.4e-0841.67Show/hide
Query:  LVTYGSQNRASSSSFSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQATCGFFQWCKE
        L T      A++ S  D+ CPC AG C  +T+KT +N  ++FY CPS    CG+F+WC +
Subjt:  LVTYGSQNRASSSSFSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQATCGFFQWCKE

AT2G17870.1 cold shock domain protein 34.3e-0542.11Show/hide
Query:  GSISANTTAQSEGGGSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKAC-PCG
        GS++    +    GG+CF CG++GH A+DCD   GG SF   G   S  E  C  CG
Subjt:  GSISANTTAQSEGGGSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKAC-PCG

AT3G42860.1 zinc knuckle (CCHC-type) family protein4.0e-4344.33Show/hide
Query:  EGIYIAALKGNPILLSQLSSWKANSGNKGSISANTTAQSEGGGSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRG
        EG Y+AALKG     S+   W+ +  N  S  + + A + GG                 + GGGG       +    EK+CPCG+G+C +LT+NT +N G
Subjt:  EGIYIAALKGNPILLSQLSSWKANSGNKGSISANTTAQSEGGGSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRG

Query:  RKFYKCPVRQENGGCGFFEWCDS--ASVTNLVTYGSQNRASSSSFSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQATCGFFQWCKEPSMATKNQVS
        RKFYKCP R+ENGGCGFF+WCD+  +S T+  T  S    + + F D QCPCGAG C +LTAKTGENVG+QFYRCP ++ +CGFF+WC +  +++    S
Subjt:  RKFYKCPVRQENGGCGFFEWCDS--ASVTNLVTYGSQNRASSSSFSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQATCGFFQWCKEPSMATKNQVS

Query:  ASK
         +K
Subjt:  ASK

AT5G13920.1 GRF zinc finger / Zinc knuckle protein4.9e-1734.48Show/hide
Query:  GSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRGRKFYKCPVRQENGGCGFFEWCDSASVTNLVTYGSQNRASSSS
        G CF+C + GHW  DC           S  D       CPCG G C +  ANT  N GRKFYKCP  Q    C FF+WCD  +         ++     +
Subjt:  GSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRGRKFYKCPVRQENGGCGFFEWCDSASVTNLVTYGSQNRASSSS

Query:  FSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQ--ATCGFFQW
        F+   C CGAG C     +  +  G+ +  C   +    CGFF+W
Subjt:  FSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQ--ATCGFFQW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTCGAAGCTAAGAGGAATTTTGATTTTGATCTCGAAGAAGACAACGATGAAGATTTCCTTTCAGCGGTGGCCGCAGCTGAAGCTGTTGCCCTAGCCACTAAACG
GAGAAAAATCCCCAATCATAACATCGATGCGCCCACAAATGCCGTCGCCGAGGTTTCGTACCAAATTCCCCCTAATGCAGACGCGGAAGGGATCTACATCGCGGCTTTGA
AAGGAAACCCTATTCTCCTCTCTCAACTTTCTAGTTGGAAAGCCAATTCTGGCAACAAAGGATCCATTTCTGCGAATACTACTGCTCAATCGGAAGGCGGTGGTTCGTGT
TTCAAATGCGGGAAATTAGGGCATTGGGCTAGAGACTGCGATGCTCCGGGAGGTGGAGGATCTTTCAGCGCTTCAGGAAATGATACGTCTGTTGCTGAGAAAGCATGCCC
CTGCGGATTGGGAGTTTGTTCCGTACTAACTGCGAATACGGAGAGGAATCGCGGTCGAAAATTTTACAAATGCCCGGTTCGGCAGGAAAATGGTGGCTGCGGATTCTTTG
AGTGGTGTGACAGTGCCTCTGTAACTAATTTAGTGACTTATGGAAGCCAAAATCGTGCATCAAGTTCTTCATTTTCAGACCTTCAGTGCCCCTGTGGTGCTGGTTCTTGC
ATAATTCTAACAGCCAAAACAGGGGAAAATGTTGGGCAGCAATTCTATCGTTGCCCTTCATACCAGGCAACTTGTGGTTTTTTCCAGTGGTGCAAGGAGCCTTCCATGGC
AACCAAGAATCAAGTTAGTGCCTCTAAGGGCATTGGGCTAGAGACTGTGCGCAACCACTTGCATCATCAAATCCTCCTTCAGAGTTTGGCAGAAGTCAATCTTCTTCAGT
TGGAACCTGTTTCAAATGTGGTAAGCCTGGCCATTGGGCCAGGGACTGCTCTAATCGTTAATAATAGTGTAACTGTGCAGGGAAGAAGATACAGAAGCTTGGTGAGGAGT
TTGGCATACGGATCTTCATCAACAGAACGGAGGAGAGTAGAGGTTTGGTTCTGTATTTTTGAAGCGATGACGAAGAAACATCGGAATGGGGAAGTCTCGGAATTGGAGCG
GGAAATGTCCGGCGGAACAGGAGGAGTTGGGGGCGTGATTCTTGGAATTGATGGAGGAACCACCTCCACCGTCTGTGTTTGTGTTCCTTTTCTGCAACCTCAGTCTCTTC
ATCTTCCTGATCCTCTTCCTCTTCTCGCCCGAGTCGAAGCTGGCTGCTCCAACCATAACAGCGTTGGCGAAACTGCTGCAAGGGAAACATTGGAGCAAGTTATCGCTGAG
GCTCTTTCAAAATCAGGTTCAGATAGATCTGCAGTTCGAGCTATTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGATCAGCAAAGGATTTTGAATTGGTTCAGAGA
TATATTTCCTAGCCATGTCAAACTCTATGTTCGAAATGATGCTGCGGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTAGTGGTTGTGTTCTAATTGCTGGCACGGGGT
GTATTGCCTATGGATTTACAGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGTGACTGGGGAAGTGGGTATGGCATATCAGCGCAGGCTTTGACT
GCAATTATAAGAGCTCATGATGGTCGTGGTCCTCAAACAAAGCTCACAAATAGCATTCTTGAGACGCTTGGTCTTTCTTCTCCTGATGAACTCATTGGGTGGACCTACGC
AGATCAATCTTGGGCTCGCATTGCTGCACTTGTTCCTGCTGTTGTGTCATGTGCAGAAGCAGGGGATGAAATTGCAAACAACATTTTGCAAGATTCAGTGAAGGAATTGG
CTCTAAGCGTCACTGCCGTTGTTCAAAGACTTGGATTGTGTGGTTCAGATGGAAAGGGTTCTTTTCCCCTTGTCATGGTTGGTGGAGTTCTCGAAGGAAATAAGGGATGG
GGTATAGCACGAGAAGTTATAAATTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTGGAGCCTGCAATTGGGGCTGCATTGTTAGCCTGGAATTT
CTTGAAAGATTCCCAACAAGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTCGAAGCTAAGAGGAATTTTGATTTTGATCTCGAAGAAGACAACGATGAAGATTTCCTTTCAGCGGTGGCCGCAGCTGAAGCTGTTGCCCTAGCCACTAAACG
GAGAAAAATCCCCAATCATAACATCGATGCGCCCACAAATGCCGTCGCCGAGGTTTCGTACCAAATTCCCCCTAATGCAGACGCGGAAGGGATCTACATCGCGGCTTTGA
AAGGAAACCCTATTCTCCTCTCTCAACTTTCTAGTTGGAAAGCCAATTCTGGCAACAAAGGATCCATTTCTGCGAATACTACTGCTCAATCGGAAGGCGGTGGTTCGTGT
TTCAAATGCGGGAAATTAGGGCATTGGGCTAGAGACTGCGATGCTCCGGGAGGTGGAGGATCTTTCAGCGCTTCAGGAAATGATACGTCTGTTGCTGAGAAAGCATGCCC
CTGCGGATTGGGAGTTTGTTCCGTACTAACTGCGAATACGGAGAGGAATCGCGGTCGAAAATTTTACAAATGCCCGGTTCGGCAGGAAAATGGTGGCTGCGGATTCTTTG
AGTGGTGTGACAGTGCCTCTGTAACTAATTTAGTGACTTATGGAAGCCAAAATCGTGCATCAAGTTCTTCATTTTCAGACCTTCAGTGCCCCTGTGGTGCTGGTTCTTGC
ATAATTCTAACAGCCAAAACAGGGGAAAATGTTGGGCAGCAATTCTATCGTTGCCCTTCATACCAGGCAACTTGTGGTTTTTTCCAGTGGTGCAAGGAGCCTTCCATGGC
AACCAAGAATCAAGTTAGTGCCTCTAAGGGCATTGGGCTAGAGACTGTGCGCAACCACTTGCATCATCAAATCCTCCTTCAGAGTTTGGCAGAAGTCAATCTTCTTCAGT
TGGAACCTGTTTCAAATGTGGTAAGCCTGGCCATTGGGCCAGGGACTGCTCTAATCGTTAATAATAGTGTAACTGTGCAGGGAAGAAGATACAGAAGCTTGGTGAGGAGT
TTGGCATACGGATCTTCATCAACAGAACGGAGGAGAGTAGAGGTTTGGTTCTGTATTTTTGAAGCGATGACGAAGAAACATCGGAATGGGGAAGTCTCGGAATTGGAGCG
GGAAATGTCCGGCGGAACAGGAGGAGTTGGGGGCGTGATTCTTGGAATTGATGGAGGAACCACCTCCACCGTCTGTGTTTGTGTTCCTTTTCTGCAACCTCAGTCTCTTC
ATCTTCCTGATCCTCTTCCTCTTCTCGCCCGAGTCGAAGCTGGCTGCTCCAACCATAACAGCGTTGGCGAAACTGCTGCAAGGGAAACATTGGAGCAAGTTATCGCTGAG
GCTCTTTCAAAATCAGGTTCAGATAGATCTGCAGTTCGAGCTATTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGATCAGCAAAGGATTTTGAATTGGTTCAGAGA
TATATTTCCTAGCCATGTCAAACTCTATGTTCGAAATGATGCTGCGGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTAGTGGTTGTGTTCTAATTGCTGGCACGGGGT
GTATTGCCTATGGATTTACAGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGTGACTGGGGAAGTGGGTATGGCATATCAGCGCAGGCTTTGACT
GCAATTATAAGAGCTCATGATGGTCGTGGTCCTCAAACAAAGCTCACAAATAGCATTCTTGAGACGCTTGGTCTTTCTTCTCCTGATGAACTCATTGGGTGGACCTACGC
AGATCAATCTTGGGCTCGCATTGCTGCACTTGTTCCTGCTGTTGTGTCATGTGCAGAAGCAGGGGATGAAATTGCAAACAACATTTTGCAAGATTCAGTGAAGGAATTGG
CTCTAAGCGTCACTGCCGTTGTTCAAAGACTTGGATTGTGTGGTTCAGATGGAAAGGGTTCTTTTCCCCTTGTCATGGTTGGTGGAGTTCTCGAAGGAAATAAGGGATGG
GGTATAGCACGAGAAGTTATAAATTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTGGAGCCTGCAATTGGGGCTGCATTGTTAGCCTGGAATTT
CTTGAAAGATTCCCAACAAGAATAA
Protein sequenceShow/hide protein sequence
MKFEAKRNFDFDLEEDNDEDFLSAVAAAEAVALATKRRKIPNHNIDAPTNAVAEVSYQIPPNADAEGIYIAALKGNPILLSQLSSWKANSGNKGSISANTTAQSEGGGSC
FKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRGRKFYKCPVRQENGGCGFFEWCDSASVTNLVTYGSQNRASSSSFSDLQCPCGAGSC
IILTAKTGENVGQQFYRCPSYQATCGFFQWCKEPSMATKNQVSASKGIGLETVRNHLHHQILLQSLAEVNLLQLEPVSNVVSLAIGPGTALIVNNSVTVQGRRYRSLVRS
LAYGSSSTERRRVEVWFCIFEAMTKKHRNGEVSELEREMSGGTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAE
ALSKSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALT
AIIRAHDGRGPQTKLTNSILETLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGW
GIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE