; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G010830 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G010830
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGlcNAc kinase
Genome locationCG_Chr07:26732647..26748203
RNA-Seq ExpressionClCG07G010830
SyntenyClCG07G010830
Gene Ontology termsGO:0046835 - carbohydrate phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0045127 - N-acetylglucosamine kinase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002731 - ATPase, BadF/BadG/BcrA/BcrD type
IPR010666 - Zinc finger, GRF-type
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043129 - ATPase, nucleotide binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK16416.1 N-acetyl-D-glucosamine kinase-like [Cucumis melo var. makuwa]1.4e-18391.23Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGS----------
        TAIIRAHDGRGPQTKLTNSIL+ LGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGS          
Subjt:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGS----------

Query:  DGKGSFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        +GKGSFPLVMVGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  DGKGSFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

XP_004147744.1 N-acetyl-D-glucosamine kinase [Cucumis sativus]9.1e-18391.55Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE +RE+S GT G   VG VILGIDGGTTST CVC+PFL P SLHLPD LPLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSG D 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLS+SGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+ LGLSS DELIGWTYADQSWARIAALVPAVV+CAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKD +QE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

XP_008451846.1 PREDICTED: N-acetyl-D-glucosamine kinase-like [Cucumis melo]1.4e-18694.08Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+ LGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

XP_023553557.1 N-acetyl-D-glucosamine kinase-like [Cucurbita pepo subsp. pepo]5.0e-18191.57Show/hide
Query:  MTKKHRNGEVSELEREMS---GGTGGVGGVILGIDGGTTSTVCVCVPFLQ-PQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSD
        MTKK+RNGE+ E EREMS   GG GGVGGVILGIDGGTTST+CVCVP L   QSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD
Subjt:  MTKKHRNGEVSELEREMS---GGTGGVGGVILGIDGGTTSTVCVCVPFLQ-PQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSD

Query:  RSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQA
        RSAV+AICLSVSGVNHPTDQQRILNW RD+FPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTG IA+GFTDDGREARAAGAGPILGDWGSGYGISAQA
Subjt:  RSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQA

Query:  LTAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLV
        LTAIIRAHDGRGPQTKLTNSIL+ LGLSS DELIGWTYAD SWARIAALVPAVVSCAEAGDE+ANNILQD+VKELALSV AVVQRLGL GSDGKGSFPLV
Subjt:  LTAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLV

Query:  MVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        MVGGV+EGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDS QE
Subjt:  MVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

XP_038898748.1 N-acetyl-D-glucosamine kinase [Benincasa hispida]5.5e-18894.9Show/hide
Query:  KKHRNGEVSELEREMSGGT---GGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA
        KKHRNGE+SE +REMSGGT   G VG VILGIDGGTTSTVCVCVPFLQPQSLHLPDPLP+LARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSDRSA
Subjt:  KKHRNGEVSELEREMSGGT---GGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA

Query:  VRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTA
        VRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTA
Subjt:  VRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTA

Query:  IIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVG
        IIRAHDGRGPQTKLTN+IL  LGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+AN ILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVG
Subjt:  IIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVG

Query:  GVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        GVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  GVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

TrEMBL top hitse value%identityAlignment
A0A0A0L083 GlcNAc kinase4.4e-18391.55Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE +RE+S GT G   VG VILGIDGGTTST CVC+PFL P SLHLPD LPLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSG D 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLS+SGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+ LGLSS DELIGWTYADQSWARIAALVPAVV+CAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKD +QE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

A0A1S3BRV5 GlcNAc kinase6.6e-18794.08Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+ LGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

A0A5A7V7G8 GlcNAc kinase6.6e-18794.08Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
        TAIIRAHDGRGPQTKLTNSIL+ LGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM
Subjt:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVM

Query:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        VGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  VGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

A0A5D3D163 GlcNAc kinase6.8e-18491.23Show/hide
Query:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR
        MTKKHRNGE+SE ERE+SGGTGG   VG VILGIDGGTTSTVCVCVPFL P SLHLPD  PLLARVEAGCSNHNSVGETAARETLEQV+AEALSKSGSD 
Subjt:  MTKKHRNGEVSELEREMSGGTGG---VGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDR

Query:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
        SAVRAICLSVSGVNHPTDQQRILNWFRD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTG IAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL
Subjt:  SAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQAL

Query:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGS----------
        TAIIRAHDGRGPQTKLTNSIL+ LGLSS DELIGWTYADQSWARIAALVPAVVSCAEAGDE+ANNILQDSVKELALSVTAVVQRLGLCGS          
Subjt:  TAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGS----------

Query:  DGKGSFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        +GKGSFPLVMVGGVLEGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
Subjt:  DGKGSFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

A0A6J1JC39 GlcNAc kinase7.1e-18190.03Show/hide
Query:  MTKKHRNGEVSELEREMS---------GGTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALS
        MTKK+RNGE+ E EREMS         GG GGVGGVILGIDGGTTSTVCVCVP L  QSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQV+AEALS
Subjt:  MTKKHRNGEVSELEREMS---------GGTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALS

Query:  KSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYG
        KSGSDRSAV+AICLSVSGVNHPTDQQRILNW RD+FPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTG IA+GFTDDGREARAAGAGPILGDWGSGYG
Subjt:  KSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYG

Query:  ISAQALTAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKG
        ISAQALTAIIRAHDGRGPQT LTNSIL+ LGLSS DELIGWTYAD SWARIAALVPAVVSCAEAGDE+ANNILQD+VKELALSV AVVQRLG  GSDGKG
Subjt:  ISAQALTAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKG

Query:  SFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE
        SFPLVMVGGV+EGNKGWGIA+EVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDS QE
Subjt:  SFPLVMVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE

SwissProt top hitse value%identityAlignment
P81799 N-acetyl-D-glucosamine kinase9.3e-1325.52Show/hide
Query:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP
        G++GG T          + + L L +   +LA  +   +NH  +G     E + +++  A  K+G D    +R++ LS+SG       + ++   RD FP
Subjt:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP

Query:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILEML--GLSS
           + Y +  DAA ++A+ T     G VLI+GTG        DG E+   G G ++GD GS Y I+ QA+  +  + D           + + +      
Subjt:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILEML--GLSS

Query:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE
        PD L  +   Y D   ++ A     +   A+ GD ++  I + + + L   V AV+  +      G+   P++ VG V    K W + +E
Subjt:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE

Q3SZM9 N-acetyl-D-glucosamine kinase3.2e-1326.21Show/hide
Query:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP
        G++GG T          + + L L +   +LA  +   +NH  +G     E + +++  A  K+G D    +R + LS+SG +     + ++   RD FP
Subjt:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP

Query:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILEML--GLSS
           + Y +  DAA ++A+ T     G VLI+GTG        DG E+   G G ++GD GS Y I+ QA+  +  + D           + + +      
Subjt:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILEML--GLSS

Query:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE
        PD L  +   Y D   +R A     V   A+ GD ++  I + + + L   V AV+  +      G+   P++ VG V    K W + +E
Subjt:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE

Q54PM7 N-acetyl-D-glucosamine kinase1.6e-4936.56Show/hide
Query:  VILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQ-------VIAEALSKSGSDRSAVRAICLSVSGVNHPTDQQRI
        + +GIDGG T T  V V     +          LAR  + CSN++SVGE  A+  + +        + E ++   +    V +ICL +SGV+   D+  +
Subjt:  VILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQ-------VIAEALSKSGSDRSAVRAICLSVSGVNHPTDQQRI

Query:  LNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILE
         +W  ++    +   + NDA  AL+SGT G+L G V+I GTGCI+ GF  +G   R+ G GP+LGD+GSGY I    L  +++A D  GP+T LT  +LE
Subjt:  LNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILE

Query:  MLGLSSPDELIGWTY--ADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIAREVIN-
         L L+  ++LI W Y    QSW + A L P     A+ GDEI+N IL D+   L   + +V+++LGL   D +  FPLV  GG +E     GI  ++++ 
Subjt:  MLGLSSPDELIGWTY--ADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIAREVIN-

Query:  CISKDYPGVVPIWPKVEPAIGAALLAWNFLK
         I ++YP    +    +P++GAALLA N  K
Subjt:  CISKDYPGVVPIWPKVEPAIGAALLAWNFLK

Q97ML3 N-acetylmuramic acid/N-acetylglucosamine kinase2.6e-3134.07Show/hide
Query:  ILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIF
        ++GIDGG + T       ++  +L       +L  V  G SN NS  +   +  L+++I E L K G       AIC+  +G +   D+  I +  R + 
Subjt:  ILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIF

Query:  PSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILEMLGLSSPD
            K+ V NDA  ALA G   R  G ++I+GTG I YG   +GR AR+ G G I+GD GSGY I  +A+ A +++ D RG +T L   IL+ L L S +
Subjt:  PSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILEMLGLSSPD

Query:  ELIGWTY-ADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEG-NKGWGIAREVINCISKDYPGV
        +LI + Y +  +   IA+L   V S    GD ++  IL+++ +EL LSV AVV+ L +          L   GGV+   N  +   R+ +N    +YP V
Subjt:  ELIGWTY-ADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEG-NKGWGIAREVINCISKDYPGV

Query:  VPIWPKVEPAIGAALLA
          I  K + A GA ++A
Subjt:  VPIWPKVEPAIGAALLA

Q9UJ70 N-acetyl-D-glucosamine kinase5.5e-1325.52Show/hide
Query:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP
        G++GG T          + + L + +   +LA  +   +NH  +G     E + +++  A  K+G D    +R++ LS+SG +     + ++   RD FP
Subjt:  GIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSDRSA-VRAICLSVSGVNHPTDQQRILNWFRDIFP

Query:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILEML--GLSS
           + Y +  DAA ++A+ T     G VLI+GTG        DG E+   G G ++GD GS Y I+ QA+  +  + D           + + +      
Subjt:  SHVKLY-VRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILEML--GLSS

Query:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE
        PD L  +   Y D    R A     +   A+ GD ++  I + + + L   + AV+  +      GK   P++ VG V    K W + +E
Subjt:  PDEL--IGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGWGIARE

Arabidopsis top hitse value%identityAlignment
AT1G30540.1 Actin-like ATPase superfamily protein3.6e-13769.71Show/hide
Query:  MTKKHRNGEVSELEREMSG----GTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSD
        M   H NG + +LE +  G      G V GVILG+DGG TSTVCVCVPF        PDPLP+L R  AGC+N NSVGETAAR++LEQVI+EAL +SG D
Subjt:  MTKKHRNGEVSELEREMSG----GTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAEALSKSGSD

Query:  RSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQA
        +S VR +CL VSGVNHP+DQ++I NW RD+FPSHVK+YV+NDA  ALASGTMG+L GCVLIAGTGCIAYGF +DG+EARA+G GPILGDWGSGYGI+AQA
Subjt:  RSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQA

Query:  LTAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLV
        LTA+IRAHDGRGPQT LT++IL+ LGLSSPDELIGWTYAD SWARIAALVP VVSCAEAGDEI++ IL D+ ++LALSV AVVQRLGLCG DG  SFP+V
Subjt:  LTAIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLV

Query:  MVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFL
        MVGGVL  N+ W I +EV   I++ +PG   I PKVEPA+GAALLA NFL
Subjt:  MVGGVLEGNKGWGIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFL

AT1G54930.1 GRF zinc finger / Zinc knuckle protein5.4e-0841.67Show/hide
Query:  LVTYGSQNRASSSSFSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQATCGFFQWCKE
        L T      A++ S  D+ CPC AG C  +T+KT +N  ++FY CPS    CG+F+WC +
Subjt:  LVTYGSQNRASSSSFSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQATCGFFQWCKE

AT2G17870.1 cold shock domain protein 34.3e-0542.11Show/hide
Query:  GSISANTTAQSEGGGSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKAC-PCG
        GS++    +    GG+CF CG++GH A+DCD   GG SF   G   S  E  C  CG
Subjt:  GSISANTTAQSEGGGSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKAC-PCG

AT3G42860.1 zinc knuckle (CCHC-type) family protein4.0e-4344.33Show/hide
Query:  EGIYIAALKGNPILLSQLSSWKANSGNKGSISANTTAQSEGGGSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRG
        EG Y+AALKG     S+   W+ +  N  S  + + A + GG                 + GGGG       +    EK+CPCG+G+C +LT+NT +N G
Subjt:  EGIYIAALKGNPILLSQLSSWKANSGNKGSISANTTAQSEGGGSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRG

Query:  RKFYKCPVRQENGGCGFFEWCDS--ASVTNLVTYGSQNRASSSSFSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQATCGFFQWCKEPSMATKNQVS
        RKFYKCP R+ENGGCGFF+WCD+  +S T+  T  S    + + F D QCPCGAG C +LTAKTGENVG+QFYRCP ++ +CGFF+WC +  +++    S
Subjt:  RKFYKCPVRQENGGCGFFEWCDS--ASVTNLVTYGSQNRASSSSFSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQATCGFFQWCKEPSMATKNQVS

Query:  ASK
         +K
Subjt:  ASK

AT5G13920.1 GRF zinc finger / Zinc knuckle protein4.9e-1734.48Show/hide
Query:  GSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRGRKFYKCPVRQENGGCGFFEWCDSASVTNLVTYGSQNRASSSS
        G CF+C + GHW  DC           S  D       CPCG G C +  ANT  N GRKFYKCP  Q    C FF+WCD  +         ++     +
Subjt:  GSCFKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRGRKFYKCPVRQENGGCGFFEWCDSASVTNLVTYGSQNRASSSS

Query:  FSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQ--ATCGFFQW
        F+   C CGAG C     +  +  G+ +  C   +    CGFF+W
Subjt:  FSDLQCPCGAGSCIILTAKTGENVGQQFYRCPSYQ--ATCGFFQW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTCGAAGCTAAGAGGAATTTTGATTTTGATCTCGAAGAAGACAACGATGAAGATTTCCTTTCAGCGGTGGCCGCAGCTGAAGCTGTTGCCCTAGCCACTAAACG
GAGAAAAATCCCCAATCATAACATCGATGCGCCCACAAATGCCGTCGCCGAGGTTTCGTACCAAATTCCCCCTAATGCAGACGCGGAAGGGATCTACATCGCGGCTTTGA
AAGGAAACCCTATTCTCCTCTCTCAACTTTCTAGTTGGAAAGCCAATTCTGGCAACAAAGGATCCATTTCTGCGAATACTACTGCTCAATCGGAAGGCGGTGGTTCGTGT
TTCAAATGCGGGAAATTAGGGCATTGGGCTAGAGACTGCGATGCTCCGGGAGGTGGAGGATCTTTCAGCGCTTCAGGAAATGATACGTCTGTTGCTGAGAAAGCATGCCC
CTGCGGATTGGGAGTTTGTTCCGTACTAACTGCGAATACGGAGAGGAATCGCGGTCGAAAATTTTACAAATGCCCGGTTCGGCAGGAAAATGGTGGCTGCGGATTCTTTG
AGTGGTGTGACAGTGCCTCTGTAACTAATTTAGTGACTTATGGAAGCCAAAATCGTGCATCAAGTTCTTCATTTTCAGACCTTCAGTGCCCCTGTGGTGCTGGTTCTTGC
ATAATTCTAACAGCCAAAACAGGGGAAAATGTTGGGCAGCAATTCTATCGTTGCCCTTCATACCAGGCAACTTGTGGTTTTTTCCAGTGGTGCAAGGAGCCTTCCATGGC
AACCAAGAATCAAGTTAGTGCCTCTAAGGGCATTGGGCTAGAGACTGTGCGCAACCACTTGCATCATCAAATCCTCCTTCAGAGTTTGGCAGAAGTCAATCTTCTTCAGT
TGGAACCTGTTTCAAATGTGGTAAGCCTGGCCATTGGGCCAGGGACTGCTCTAATCGTTAATAATAGTGTAACTGTGCAGGGAAGAAGATACAGAAGCTTGGTGAGGAGT
TTGGCATACGGATCTTCATCAACAGAACGGAGGAGAGTAGAGGTTTGGTTCTGTATTTTTGAAGCGATGACGAAGAAACATCGGAATGGGGAAGTCTCGGAATTGGAGCG
GGAAATGTCCGGCGGAACAGGAGGAGTTGGGGGCGTGATTCTTGGAATTGATGGAGGAACCACCTCCACCGTCTGTGTTTGTGTTCCTTTTCTGCAACCTCAGTCTCTTC
ATCTTCCTGATCCTCTTCCTCTTCTCGCCCGAGTCGAAGCTGGCTGCTCCAACCATAACAGCGTTGGCGAAACTGCTGCAAGGGAAACATTGGAGCAAGTTATCGCTGAG
GCTCTTTCAAAATCAGGTTCAGATAGATCTGCAGTTCGAGCTATTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGATCAGCAAAGGATTTTGAATTGGTTCAGAGA
TATATTTCCTAGCCATGTCAAACTCTATGTTCGAAATGATGCTGCGGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTAGTGGTTGTGTTCTAATTGCTGGCACGGGGT
GTATTGCCTATGGATTTACAGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGTGACTGGGGAAGTGGGTATGGCATATCAGCGCAGGCTTTGACT
GCAATTATAAGAGCTCATGATGGTCGTGGTCCTCAAACAAAGCTCACAAATAGCATTCTTGAGATGCTTGGTCTTTCTTCTCCTGATGAACTCATTGGGTGGACCTACGC
AGATCAATCTTGGGCTCGCATTGCTGCACTTGTTCCTGCTGTTGTGTCATGTGCAGAAGCAGGGGATGAAATTGCAAACAACATTTTGCAAGATTCAGTGAAGGAATTGG
CTCTAAGCGTCACTGCCGTTGTTCAAAGACTTGGATTGTGTGGTTCAGATGGAAAGGGTTCTTTTCCCCTTGTCATGGTTGGTGGAGTTCTCGAAGGAAATAAGGGATGG
GGTATAGCACGAGAAGTTATAAATTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTGGAGCCTGCAATTGGGGCTGCATTGTTAGCCTGGAATTT
CTTGAAAGATTCCCAACAAGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTCGAAGCTAAGAGGAATTTTGATTTTGATCTCGAAGAAGACAACGATGAAGATTTCCTTTCAGCGGTGGCCGCAGCTGAAGCTGTTGCCCTAGCCACTAAACG
GAGAAAAATCCCCAATCATAACATCGATGCGCCCACAAATGCCGTCGCCGAGGTTTCGTACCAAATTCCCCCTAATGCAGACGCGGAAGGGATCTACATCGCGGCTTTGA
AAGGAAACCCTATTCTCCTCTCTCAACTTTCTAGTTGGAAAGCCAATTCTGGCAACAAAGGATCCATTTCTGCGAATACTACTGCTCAATCGGAAGGCGGTGGTTCGTGT
TTCAAATGCGGGAAATTAGGGCATTGGGCTAGAGACTGCGATGCTCCGGGAGGTGGAGGATCTTTCAGCGCTTCAGGAAATGATACGTCTGTTGCTGAGAAAGCATGCCC
CTGCGGATTGGGAGTTTGTTCCGTACTAACTGCGAATACGGAGAGGAATCGCGGTCGAAAATTTTACAAATGCCCGGTTCGGCAGGAAAATGGTGGCTGCGGATTCTTTG
AGTGGTGTGACAGTGCCTCTGTAACTAATTTAGTGACTTATGGAAGCCAAAATCGTGCATCAAGTTCTTCATTTTCAGACCTTCAGTGCCCCTGTGGTGCTGGTTCTTGC
ATAATTCTAACAGCCAAAACAGGGGAAAATGTTGGGCAGCAATTCTATCGTTGCCCTTCATACCAGGCAACTTGTGGTTTTTTCCAGTGGTGCAAGGAGCCTTCCATGGC
AACCAAGAATCAAGTTAGTGCCTCTAAGGGCATTGGGCTAGAGACTGTGCGCAACCACTTGCATCATCAAATCCTCCTTCAGAGTTTGGCAGAAGTCAATCTTCTTCAGT
TGGAACCTGTTTCAAATGTGGTAAGCCTGGCCATTGGGCCAGGGACTGCTCTAATCGTTAATAATAGTGTAACTGTGCAGGGAAGAAGATACAGAAGCTTGGTGAGGAGT
TTGGCATACGGATCTTCATCAACAGAACGGAGGAGAGTAGAGGTTTGGTTCTGTATTTTTGAAGCGATGACGAAGAAACATCGGAATGGGGAAGTCTCGGAATTGGAGCG
GGAAATGTCCGGCGGAACAGGAGGAGTTGGGGGCGTGATTCTTGGAATTGATGGAGGAACCACCTCCACCGTCTGTGTTTGTGTTCCTTTTCTGCAACCTCAGTCTCTTC
ATCTTCCTGATCCTCTTCCTCTTCTCGCCCGAGTCGAAGCTGGCTGCTCCAACCATAACAGCGTTGGCGAAACTGCTGCAAGGGAAACATTGGAGCAAGTTATCGCTGAG
GCTCTTTCAAAATCAGGTTCAGATAGATCTGCAGTTCGAGCTATTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGATCAGCAAAGGATTTTGAATTGGTTCAGAGA
TATATTTCCTAGCCATGTCAAACTCTATGTTCGAAATGATGCTGCGGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTAGTGGTTGTGTTCTAATTGCTGGCACGGGGT
GTATTGCCTATGGATTTACAGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGTGACTGGGGAAGTGGGTATGGCATATCAGCGCAGGCTTTGACT
GCAATTATAAGAGCTCATGATGGTCGTGGTCCTCAAACAAAGCTCACAAATAGCATTCTTGAGATGCTTGGTCTTTCTTCTCCTGATGAACTCATTGGGTGGACCTACGC
AGATCAATCTTGGGCTCGCATTGCTGCACTTGTTCCTGCTGTTGTGTCATGTGCAGAAGCAGGGGATGAAATTGCAAACAACATTTTGCAAGATTCAGTGAAGGAATTGG
CTCTAAGCGTCACTGCCGTTGTTCAAAGACTTGGATTGTGTGGTTCAGATGGAAAGGGTTCTTTTCCCCTTGTCATGGTTGGTGGAGTTCTCGAAGGAAATAAGGGATGG
GGTATAGCACGAGAAGTTATAAATTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTGGAGCCTGCAATTGGGGCTGCATTGTTAGCCTGGAATTT
CTTGAAAGATTCCCAACAAGAATAA
Protein sequenceShow/hide protein sequence
MKFEAKRNFDFDLEEDNDEDFLSAVAAAEAVALATKRRKIPNHNIDAPTNAVAEVSYQIPPNADAEGIYIAALKGNPILLSQLSSWKANSGNKGSISANTTAQSEGGGSC
FKCGKLGHWARDCDAPGGGGSFSASGNDTSVAEKACPCGLGVCSVLTANTERNRGRKFYKCPVRQENGGCGFFEWCDSASVTNLVTYGSQNRASSSSFSDLQCPCGAGSC
IILTAKTGENVGQQFYRCPSYQATCGFFQWCKEPSMATKNQVSASKGIGLETVRNHLHHQILLQSLAEVNLLQLEPVSNVVSLAIGPGTALIVNNSVTVQGRRYRSLVRS
LAYGSSSTERRRVEVWFCIFEAMTKKHRNGEVSELEREMSGGTGGVGGVILGIDGGTTSTVCVCVPFLQPQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVIAE
ALSKSGSDRSAVRAICLSVSGVNHPTDQQRILNWFRDIFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGCIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALT
AIIRAHDGRGPQTKLTNSILEMLGLSSPDELIGWTYADQSWARIAALVPAVVSCAEAGDEIANNILQDSVKELALSVTAVVQRLGLCGSDGKGSFPLVMVGGVLEGNKGW
GIAREVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE