; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0006353 (gene) of Chayote v1 genome

Gene IDSed0006353
OrganismSechium edule (Chayote v1)
Descriptiongalactokinase
Genome locationLG12:4592578..4598977
RNA-Seq ExpressionSed0006353
SyntenySed0006353
Gene Ontology termsGO:0006012 - galactose metabolic process (biological process)
GO:0046835 - carbohydrate phosphorylation (biological process)
GO:0005829 - cytosol (cellular component)
GO:0004335 - galactokinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR000705 - Galactokinase
IPR006203 - GHMP kinase, ATP-binding, conserved site
IPR006204 - GHMP kinase N-terminal domain
IPR006206 - Mevalonate/galactokinase
IPR013750 - GHMP kinase, C-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR019539 - Galactokinase, N-terminal domain
IPR019741 - Galactokinase, conserved site
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
NP_001284475.1 galactokinase [Cucumis melo]1.6e-26992.18Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL+PVYGDGSQLEEAR+RFDHLK KFLQVFGH PDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGE  HLL+I+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYSMCTYPADPDQEVDLKNHKWGHYFLC YKGYYEF+KSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAA G NFPKKE+AQLTC+
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDV+LP GGTFVIAHSLAESQKAVTAATNYNNRVVECRLA+IVLGIKLGMKP++AI  VKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFAKERNSSDP LAVKELLKE+PYTAEEIEQITVDNLPS+LG+SPTSLDVLKAAKHFKLYQRA+HVYSEA+RV+AFKDAVSSSLSEEDKLKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDN ALGARLTGAGWGGCAVALVKEAIVPQFI NLKE FY+SRIERG+I K+D+GLYVFASKPSSGAAIFQF
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

XP_022137939.1 galactokinase [Momordica charantia]3.0e-27393.19Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL PVYGDGSQLEEA++RFDHLK KFLQVFGH PDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEA HLLRI+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYS+CTYPADPDQEVDLKNHKWGHYFLC YKGYYE++KSKGQ+VGVPVGLDVLVDGTVPTGSGLSSSAAFVCS+TIAIMAAFG NFPKKE+AQLTCE
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDV+LPAGGTF+IAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKP +A+A+VKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFA+ERNSSDP LAVKELLKEDPYTAEEIEQITV+NLPS+LG+SPTSLDVLKAAKHFKLYQRA+HVYSEAKRV+AFKDAVSSSLSEEDKLKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKE FY+SRI+RGIIDKNDLGLYVFASKPSSGAAIFQF
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

XP_022972025.1 galactokinase-like isoform X1 [Cucurbita maxima]1.3e-26892.18Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL+PVYGDG+QLEEAR+RFD+LK KFLQVFGH+P VFARSPGRVNLIGEHIDYEGYSVLPMAIR DTIVAIRKHDAGEAEHLLRI+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYSMCTYPADP QEVDLKNHKWGHYFLC YKGYYEF+KSKGQDVG PVGLDVLVDGTVPTGSGLSSSAAFVCS+TIAIMAAFG NFPKKELAQLTCE
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDV+LPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFA ERNSSDP LAVKELLKE+PYTAE+IEQITVDNLPS+LG+SPTSLDVLKAAKHFKLYQRA+HVYSEAKRV+AFKDAVSSS+SEE++LKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFIL+LK+KFY+SRI+RGIIDKN +  YVFASKPSSGAAIF F
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

XP_022972026.1 galactokinase-like isoform X2 [Cucurbita maxima]1.3e-26892.18Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL+PVYGDG+QLEEAR+RFD+LK KFLQVFGH+P VFARSPGRVNLIGEHIDYEGYSVLPMAIR DTIVAIRKHDAGEAEHLLRI+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYSMCTYPADP QEVDLKNHKWGHYFLC YKGYYEF+KSKGQDVG PVGLDVLVDGTVPTGSGLSSSAAFVCS+TIAIMAAFG NFPKKELAQLTCE
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDV+LPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFA ERNSSDP LAVKELLKE+PYTAE+IEQITVDNLPS+LG+SPTSLDVLKAAKHFKLYQRA+HVYSEAKRV+AFKDAVSSS+SEE++LKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFIL+LK+KFY+SRI+RGIIDKN +  YVFASKPSSGAAIF F
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

XP_038902198.1 galactokinase [Benincasa hispida]1.0e-26891.98Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL+PVYGDGSQLEEAR+RFDHLK KFLQVFGH PDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGE  HLL+I+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYS+CTYPADPDQEVDLKNHKWGHYFLC YKGYYEF+KSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAA G NFPKKE+AQLTC+
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDV+LP GGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKP++A   VKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFAKERNSSDP LAVKELLKE+PYTAEEIEQITVDNLPS+L +SPTSLDVLKAAKHFKL+QRA+HVYSEA+RV+AFKDAVSSSLSEEDKLKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDN ALGARLTGAGWGGCAVALVKEAIVPQFI +LKE FY+SRIERG+IDKNDLGLYVFASKPSSGAAIFQF
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

TrEMBL top hitse value%identityAlignment
A0A0A0L5E4 Uncharacterized protein1.1e-26891.78Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL+PVYGDGSQLEEAR+RFDHLK KFLQVFGH PDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGE  HLL+I+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYS+CTYPADPDQEVDLKNHKWGHYFLC YKGYYEF+KSKGQDVG+PVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAA G NFPKKE+AQLTC+
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPI ATDV+LP GG+FVIAHSLAESQKAVTAATNYNNRVVECRLA+IVLGIKLGMKP++AI +VKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFAKERNSSDP LAVKELLKE+PYTAEEIEQITVDNLPS+LG+SPTSLDVLKAAKHFKLYQRA+HVYSEA+RV+AFKDAVSSSLSEEDKLKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDN ALGARLTGAGWGGCAVALVKEAIVPQFILNLKE FY+SRIERG+I K+DLGLYVFASKPSSGAAIFQF
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

A0A6J1C8P2 galactokinase1.5e-27393.19Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL PVYGDGSQLEEA++RFDHLK KFLQVFGH PDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEA HLLRI+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYS+CTYPADPDQEVDLKNHKWGHYFLC YKGYYE++KSKGQ+VGVPVGLDVLVDGTVPTGSGLSSSAAFVCS+TIAIMAAFG NFPKKE+AQLTCE
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDV+LPAGGTF+IAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKP +A+A+VKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFA+ERNSSDP LAVKELLKEDPYTAEEIEQITV+NLPS+LG+SPTSLDVLKAAKHFKLYQRA+HVYSEAKRV+AFKDAVSSSLSEEDKLKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKE FY+SRI+RGIIDKNDLGLYVFASKPSSGAAIFQF
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

A0A6J1I3I9 galactokinase-like isoform X26.4e-26992.18Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL+PVYGDG+QLEEAR+RFD+LK KFLQVFGH+P VFARSPGRVNLIGEHIDYEGYSVLPMAIR DTIVAIRKHDAGEAEHLLRI+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYSMCTYPADP QEVDLKNHKWGHYFLC YKGYYEF+KSKGQDVG PVGLDVLVDGTVPTGSGLSSSAAFVCS+TIAIMAAFG NFPKKELAQLTCE
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDV+LPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFA ERNSSDP LAVKELLKE+PYTAE+IEQITVDNLPS+LG+SPTSLDVLKAAKHFKLYQRA+HVYSEAKRV+AFKDAVSSS+SEE++LKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFIL+LK+KFY+SRI+RGIIDKN +  YVFASKPSSGAAIF F
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

A0A6J1I7D3 galactokinase-like isoform X16.4e-26992.18Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL+PVYGDG+QLEEAR+RFD+LK KFLQVFGH+P VFARSPGRVNLIGEHIDYEGYSVLPMAIR DTIVAIRKHDAGEAEHLLRI+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYSMCTYPADP QEVDLKNHKWGHYFLC YKGYYEF+KSKGQDVG PVGLDVLVDGTVPTGSGLSSSAAFVCS+TIAIMAAFG NFPKKELAQLTCE
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDV+LPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFA ERNSSDP LAVKELLKE+PYTAE+IEQITVDNLPS+LG+SPTSLDVLKAAKHFKLYQRA+HVYSEAKRV+AFKDAVSSS+SEE++LKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFIL+LK+KFY+SRI+RGIIDKN +  YVFASKPSSGAAIF F
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

B6V3B9 Galactokinase7.5e-27092.18Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAKHEDLP+PVFSSL+PVYGDGSQLEEAR+RFDHLK KFLQVFGH PDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGE  HLL+I+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKYSMCTYPADPDQEVDLKNHKWGHYFLC YKGYYEF+KSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAA G NFPKKE+AQLTC+
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDV+LP GGTFVIAHSLAESQKAVTAATNYNNRVVECRLA+IVLGIKLGMKP++AI  VKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG C+SFAKERNSSDP LAVKELLKE+PYTAEEIEQITVDNLPS+LG+SPTSLDVLKAAKHFKLYQRA+HVYSEA+RV+AFKDAVSSSLSEEDKLKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF
        LMNDSHYSCSVLYECSCPELEELVKICRDN ALGARLTGAGWGGCAVALVKEAIVPQFI NLKE FY+SRIERG+I K+D+GLYVFASKPSSGAAIFQF
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF

SwissProt top hitse value%identityAlignment
Q01415 N-acetylgalactosamine kinase1.8e-9042.8Show/hide
Query:  LKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISNVNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYK
        LK+ F   FG  P  + R+PGRVN+IGEHIDY GYSVLPMA+ QD ++A+         + L+++N N  Y   +  A+ + ++D     W +YFLC  K
Subjt:  LKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISNVNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYK

Query:  GYYE-FSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHIGTQSGGMDQAISVMAKSGFAELIDFNPI
        G  E F  S         G++ LVDG +P  SGLSSS+A VC + +  +   G N  K ELA++  + ER+IGT+ GGMDQ+IS +A+ G A+LI+F+P+
Subjt:  GYYE-FSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHIGTQSGGMDQAISVMAKSGFAELIDFNPI

Query:  RATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNSSDPGLAVKELLKEDPYTAEE
        RATDVKLP+G  FVIA+S  E  KA T  +++N RV+ECRLAA +L     ++      +V  L +V+ +     +E       L  ++ L  +PY  EE
Subjt:  RATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNSSDPGLAVKELLKEDPYTAEE

Query:  IEQITVDNLPSILGD--SPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKICRDNG
        I +    +L  +     SP + DVL     FKLYQRA HVYSEA RV  FK       + E+ ++ LG+LMN SH SC  +YECSCPEL++LV ICR  G
Subjt:  IEQITVDNLPSILGD--SPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKICRDNG

Query:  ALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAI
        A G+RLTGAGWGGC V++V    +P F+ N+ + +YQ        +K  L    FA+KP  GA +
Subjt:  ALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAI

Q54DN6 Galactokinase4.6e-9138.98Show/hide
Query:  VFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVF-GHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISNVNDKYS--M
        +  SL+ +Y      E  + R++ L + F +++ G  P  + R+PGRVNLIGEH+DY GY VLP A+ QDTIVA+  +       ++ I N N+KY+   
Subjt:  VFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVF-GHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISNVNDKYS--M

Query:  CTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHIGT
               D E+D+K H W +Y L A+KG  +  +  G+   V    ++L  G VP G+G+SSS+A VC ST+AI         K+ELAQL+ + ER++G 
Subjt:  CTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHIGT

Query:  QSGGMDQAISVMAKSGFAELIDFNP-IRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVS
        +SGGMDQ+IS +A+   A+LI+F+P ++  DV+LP G +FVI +SL +S K VT ATNYN RVVECRLAA++L    G+  +    +V+ L DV+ Q   
Subjt:  QSGGMDQAISVMAKSGFAELIDFNP-IRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVS

Query:  FAKERNSSDP---GLAVKELLKEDPYTAEEIEQITVDNLPSILGD-SPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAV--------------SSS
             N   P    L  + L ++  YT EE+  I   ++  ++    P+ + V   ++HF+LY+RA HV++E +RV+ F +                ++S
Subjt:  FAKERNSSDP---GLAVKELLKEDPYTAEEIEQITVDNLPSILGD-SPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAV--------------SSS

Query:  LSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFAS
         +  + +++LG LMN+SH SCS L+ECSC EL+ L KICR+NGALG+RLTGAGWGGC ++LV  + V  F+  +   +Y   +    +   +   Y F +
Subjt:  LSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFAS

Query:  KPSSGAAI
         P  GA I
Subjt:  KPSSGAAI

Q5XIG6 N-acetylgalactosamine kinase3.0e-9042.49Show/hide
Query:  LKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISNVNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYK
        LK+ F   FG  P  + R+PGRVN+IGEHIDY GYSVLPMA+ QD ++A+           L+++N +  Y   +  A+ +  +D     W +YFLC +K
Subjt:  LKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISNVNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYK

Query:  GYYE-FSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHIGTQSGGMDQAISVMAKSGFAELIDFNPI
        G  E F  ++     +P G++ LVDG +P  SGLSSS+A VC + +  +   G    K ELA++  + ER+IGT+ GGMDQ+IS +A+ G A+LI+F+P+
Subjt:  GYYE-FSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHIGTQSGGMDQAISVMAKSGFAELIDFNPI

Query:  RATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNSSDPGLAVKELLKEDPYTAEE
        RATDVKLP+G  FVIA+S  E  KA T  +++N RV+ECRLAA VL    G++      +V  L +V+ +     +E       L  ++ L  +PY+ EE
Subjt:  RATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNSSDPGLAVKELLKEDPYTAEE

Query:  IEQITVDNLPSILGD--SPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKICRDNG
        I +    +L  +     SP +   L     FKLYQRA HVYSEA RV  FK    +  + ++ ++ LG+LMN SH SC  +YECSCPEL++LV ICR  G
Subjt:  IEQITVDNLPSILGD--SPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKICRDNG

Query:  ALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIF
        A G+RLTGAGWGGC V+LV    +  F+ ++ E +YQ  + R   +K+ L    FA+KP  GA +F
Subjt:  ALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIF

Q68FH4 N-acetylgalactosamine kinase2.1e-9142.74Show/hide
Query:  LKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISNVNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYK
        LK+ F   FG  P  + R+PGRVN+IGEHIDY GYSV+PMA+ QD ++A+         H L+++N +  Y   +  A+ +  +D     W +YFLC +K
Subjt:  LKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISNVNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYK

Query:  GYYE-FSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHIGTQSGGMDQAISVMAKSGFAELIDFNPI
        G  E F  SK     +P G++ LVDG +P  SGLSSS+A VC + +  +   G    K ELA++  + ER+IGT+ GGMDQ+IS +A+ G A+LI+F+P+
Subjt:  GYYE-FSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHIGTQSGGMDQAISVMAKSGFAELIDFNPI

Query:  RATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNSSDPGLAVKELLKEDPYTAEE
        RAT+VKLP+G  FVIA+S  E  KA T  +++N RV+ECRLAA VL    G++  +    V  L +V+ +     +E       L  ++ L  +PY+ EE
Subjt:  RATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNSSDPGLAVKELLKEDPYTAEE

Query:  IEQ---ITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKICRDN
        I +   I+++ L + +  +P + D L     FKLYQRA HVYSEA RV  FK       + ++ ++ LG+LMN SH SC  +YECSCPEL++LV ICR  
Subjt:  IEQ---ITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKICRDN

Query:  GALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQ
        GA G+RLTGAGWGGC V+LV   ++  F+ ++ E +YQ    R   +K+ L    FA+KP  GA +F+
Subjt:  GALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQ

Q9SEE5 Galactokinase9.0e-22877.67Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAK E++ VP+F+SLEPVYG+GS L+EA  RFD LK  F  VFG +P +FARSPGRVNLIGEHIDYEGYSVLPMAIRQDTI+AIRK    E +  LRI+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKY+MCTYPADPDQE+DLKNHKWGHYF+CAYKG++E++KSKG ++G PVGLDVLVDG VPTGSGLSSSAAFVCS+TIAIMA FG NF KKELAQLTCE
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAIS+MAK+GFAELIDFNP+RATDVKLP GG+FVIAHSLAESQKAVTAA NYNNRVVECRLA+I+LG+KLGM+P++AI++VKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG CVSFA +R SSDP LAVKE LKE+PYTAEEIE+I  + LPSI+ + PTSL VL AA HFKL+QRAAHVYSEA+RV  FKD V+S+LS+E+KLKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIF
        LMN+SHYSCSVLYECSCPELEELV++C++NGALGARLTGAGWGGCAVALVKE  V QFI  +KEK+Y+ R+E+G++ K D+ LY+FASKPSSGAAIF
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIF

Arabidopsis top hitse value%identityAlignment
AT3G06580.1 Mevalonate/galactokinase family protein6.4e-22977.67Show/hide
Query:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN
        MAK E++ VP+F+SLEPVYG+GS L+EA  RFD LK  F  VFG +P +FARSPGRVNLIGEHIDYEGYSVLPMAIRQDTI+AIRK    E +  LRI+N
Subjt:  MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISN

Query:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE
        VNDKY+MCTYPADPDQE+DLKNHKWGHYF+CAYKG++E++KSKG ++G PVGLDVLVDG VPTGSGLSSSAAFVCS+TIAIMA FG NF KKELAQLTCE
Subjt:  VNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCE

Query:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV
        CERHIGTQSGGMDQAIS+MAK+GFAELIDFNP+RATDVKLP GG+FVIAHSLAESQKAVTAA NYNNRVVECRLA+I+LG+KLGM+P++AI++VKTLSDV
Subjt:  CERHIGTQSGGMDQAISVMAKSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDV

Query:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD
        EG CVSFA +R SSDP LAVKE LKE+PYTAEEIE+I  + LPSI+ + PTSL VL AA HFKL+QRAAHVYSEA+RV  FKD V+S+LS+E+KLKKLGD
Subjt:  EGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGD

Query:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIF
        LMN+SHYSCSVLYECSCPELEELV++C++NGALGARLTGAGWGGCAVALVKE  V QFI  +KEK+Y+ R+E+G++ K D+ LY+FASKPSSGAAIF
Subjt:  LMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGAGWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIF

AT3G10700.1 galacturonic acid kinase1.3e-1124.58Show/hide
Query:  SPGRVNLIGEHIDYEGYSVLPMAIRQ----------DTIVAIRKHDAGEAEHLLRISNVNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKS
        +P R+  +G HID++G +V  M I +          DT V +R     E E   R+  +     +             +   WG Y   A        K+
Subjt:  SPGRVNLIGEHIDYEGYSVLPMAIRQ----------DTIVAIRKHDAGEAEHLLRISNVNDKYSMCTYPADPDQEVDLKNHKWGHYFLCAYKGYYEFSKS

Query:  KGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECER-HIGTQSGGMDQAISVMAKSGF-----AELIDFNPIRAT
          Q +     +  L        SGLSSSAA   +  +A+  A        E  +     E  ++G ++G +DQ+  +++  G       + +D   ++A 
Subjt:  KGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECER-HIGTQSGGMDQAISVMAKSGF-----AELIDFNPIRAT

Query:  DVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQ
        +++ P    F I  + +  ++A+T    YN RV EC+ AA VL    G    +      TL +VE                 AV E  K +         
Subjt:  DVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQ

Query:  ITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKI-CRDNGALGA
                           LK      L +RA H +SE  RV   ++A +S       L++ G L++ S  S    YEC    L +L KI  +  G  GA
Subjt:  ITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKI-CRDNGALGA

Query:  RLTGAGWGGCAVALV
        R +GAG+ GC +A V
Subjt:  RLTGAGWGGCAVALV

AT3G42850.1 Mevalonate/galactokinase family protein1.5e-1524.22Show/hide
Query:  VFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAI-RKHDA-------GEAEHLLRISNVNDKYSMCTYPAD--PDQEVDLKNHKWGHYFLCAY-KGYY
        V AR+PGR++++G   DY G  VL M  R+    A+ R H +        EA H  R + + +  S  +  ++  P  ++DL +         +Y K Y+
Subjt:  VFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAI-RKHDA-------GEAEHLLRISNVNDKYSMCTYPAD--PDQEVDLKNHKWGHYFLCAY-KGYY

Query:  EFSKSKGQ-----------------DVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERH-IGTQSGGMDQAISVM
         FS+   Q                 DV     + +LV  TVP G G+SSSA+   ++  A+ AA G     +++A L  + E + +G   G MDQ  S  
Subjt:  EFSKSKGQ-----------------DVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERH-IGTQSGGMDQAISVM

Query:  AKSGFAELIDFNPIRATD-VKLPA-----GGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNS
         ++     +   P      V++P+     G    I HS+  S                    ++ +G  +G          KT+  +     SFA E NS
Subjt:  AKSGFAELIDFNPIRATD-VKLPA-----GGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNS

Query:  SDPGLAVKELLKEDP----------------YTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKK
         +      EL++ D                 Y ++  + IT +      GD   S+  +     + +     H   E  RV AFK  ++++ SEE  +  
Subjt:  SDPGLAVKELLKEDP----------------YTAEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKK

Query:  LGDLM---NDSHYSCSVLYECS------CPELEELVKICRDNGAL-GARLTGAGWGGCAVALVKEAI-VPQFILNLKEKFYQS
        LG+LM   +DS+ +C +  + +         +E L     +NG L GA++TG G GG    + K ++   + IL +++K+ ++
Subjt:  LGDLM---NDSHYSCSVLYECS------CPELEELVKICRDNGAL-GARLTGAGWGGCAVALVKEAI-VPQFILNLKEKFYQS

AT4G16130.1 arabinose kinase1.1e-1524.69Show/hide
Query:  VFGHAPDVF-ARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAG--------EAEHLLRISNVNDKYSMCTYPAD-----PDQEVDLKNHKWGH
        +F    ++F AR+PGR++++G   DY G  VL M IR+   VA++++  G        +A    +         + +Y ++     P  ++DL +   G 
Subjt:  VFGHAPDVF-ARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAG--------EAEHLLRISNVNDKYSMCTYPAD-----PDQEVDLKNHKWGH

Query:  YFLCAYKGYYEFSKSKGQ---------------DVGVPV--GLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHI-GTQS
          +   K    F++   Q               ++GV     + +LV   VP G G+SSSAA   +S  AI AA G +   ++LA L  + E HI G   
Subjt:  YFLCAYKGYYEFSKSKGQ---------------DVGVPV--GLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHI-GTQS

Query:  GGMDQAISVMAKSGFAELIDFNPIRATD-VKLPAGGTF-----VIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQ
        G MDQ  S   ++     +   P      V++P    F      I HS+  +         Y  R +   +A+ +L       P  + A      ++E +
Subjt:  GGMDQAISVMAKSGFAELIDFNPIRATD-VKLPAGGTF-----VIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQ

Query:  CVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSIL---------GDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDK
         +   +   S D       L    P+     E    D LP I+          D    + V+   + + +   A H   E  RV  FK  ++S+ S+E +
Subjt:  CVSFAKERNSSDPGLAVKELLKEDPYTAEEIEQITVDNLPSIL---------GDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDK

Query:  LKKLGDLMNDSHYSCSV--LYECSCPELEELV------KICRDNGAL-GARLTGAGWGGCAVALVKEAI-VPQFILNLKEKF
        L  LG L+   HYS S   L       L +LV      K   ++G L GA++TG G GG    + + ++   Q IL +++++
Subjt:  LKKLGDLMNDSHYSCSV--LYECSCPELEELV------KICRDNGAL-GARLTGAGWGGCAVALVKEAI-VPQFILNLKEKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAGCACGAGGATCTTCCTGTCCCTGTTTTCTCTTCTCTAGAACCGGTTTATGGCGATGGATCTCAGCTTGAAGAAGCTCGCATTCGATTTGATCATCTCAAGGA
TAAGTTTCTTCAAGTTTTCGGCCATGCTCCTGATGTTTTTGCTCGTTCTCCAGGGAGAGTGAACTTGATCGGAGAGCACATTGACTATGAAGGATACTCGGTGTTGCCGA
TGGCGATTCGGCAAGACACGATCGTCGCGATAAGGAAACATGACGCCGGAGAGGCCGAGCATCTTCTCAGAATTTCTAATGTTAATGATAAATACTCGATGTGTACTTAT
CCTGCTGATCCTGATCAGGAAGTTGACTTGAAGAATCACAAATGGGGACACTATTTCCTTTGCGCGTACAAAGGTTATTACGAATTTTCCAAATCAAAAGGACAAGATGT
TGGCGTTCCAGTTGGACTTGACGTTCTTGTTGATGGAACTGTCCCTACAGGGTCTGGATTATCAAGTTCTGCTGCATTTGTTTGCTCTTCTACCATTGCTATAATGGCTG
CTTTTGGTACCAACTTTCCTAAGAAAGAACTTGCCCAACTCACCTGTGAATGTGAACGGCATATTGGTACACAATCTGGTGGAATGGACCAGGCAATCTCTGTCATGGCC
AAATCTGGGTTTGCAGAGCTGATTGATTTCAATCCCATTCGAGCTACTGATGTGAAACTTCCTGCTGGTGGGACTTTTGTTATAGCCCATTCTCTAGCCGAATCACAGAA
AGCAGTCACTGCTGCCACAAATTATAATAACAGAGTTGTTGAGTGCCGACTTGCTGCTATTGTTCTAGGCATAAAGCTTGGGATGAAACCACAAGATGCAATAGCACAAG
TGAAAACTTTATCTGATGTGGAAGGGCAGTGTGTTTCATTTGCTAAGGAGCGTAATTCTTCAGATCCTGGGCTTGCTGTCAAGGAACTGTTGAAAGAGGATCCCTATACA
GCTGAAGAAATCGAACAAATCACTGTGGACAATCTGCCTTCCATATTAGGCGATTCCCCAACTTCATTAGATGTTCTAAAAGCTGCTAAGCATTTCAAGCTGTATCAGCG
AGCGGCCCACGTGTACTCTGAAGCCAAGCGAGTCTTCGCTTTCAAGGATGCAGTGTCGTCAAGTTTAAGTGAGGAAGACAAGCTTAAGAAGCTGGGTGATCTTATGAATG
ATAGTCACTACAGCTGTAGCGTTCTTTATGAATGCAGCTGTCCTGAACTGGAAGAACTCGTAAAGATATGCCGGGACAACGGTGCTCTCGGGGCCAGGCTAACTGGAGCA
GGATGGGGTGGCTGTGCAGTCGCTCTCGTAAAAGAGGCCATTGTTCCCCAGTTCATTCTTAATCTAAAGGAGAAGTTCTACCAATCGAGGATCGAGAGAGGGATCATCGA
CAAGAATGATCTTGGTCTCTACGTCTTCGCTTCCAAGCCATCGAGTGGAGCAGCCATCTTCCAGTTTTAG
mRNA sequenceShow/hide mRNA sequence
ACAAAATCACACTTACTCGCTGAATATTTTTATAGAATATTTCATTTTCACCAAACTTTGCTCTTCGTTGCGGATTGATTGGAGGATTATCCGAAGACTTTGCCGGAAAT
TCAGATCGGAGACGCTACTTAGTAGGAAGAAGAAGAAGAAGAGTAATGGCGAAGCACGAGGATCTTCCTGTCCCTGTTTTCTCTTCTCTAGAACCGGTTTATGGCGATGG
ATCTCAGCTTGAAGAAGCTCGCATTCGATTTGATCATCTCAAGGATAAGTTTCTTCAAGTTTTCGGCCATGCTCCTGATGTTTTTGCTCGTTCTCCAGGGAGAGTGAACT
TGATCGGAGAGCACATTGACTATGAAGGATACTCGGTGTTGCCGATGGCGATTCGGCAAGACACGATCGTCGCGATAAGGAAACATGACGCCGGAGAGGCCGAGCATCTT
CTCAGAATTTCTAATGTTAATGATAAATACTCGATGTGTACTTATCCTGCTGATCCTGATCAGGAAGTTGACTTGAAGAATCACAAATGGGGACACTATTTCCTTTGCGC
GTACAAAGGTTATTACGAATTTTCCAAATCAAAAGGACAAGATGTTGGCGTTCCAGTTGGACTTGACGTTCTTGTTGATGGAACTGTCCCTACAGGGTCTGGATTATCAA
GTTCTGCTGCATTTGTTTGCTCTTCTACCATTGCTATAATGGCTGCTTTTGGTACCAACTTTCCTAAGAAAGAACTTGCCCAACTCACCTGTGAATGTGAACGGCATATT
GGTACACAATCTGGTGGAATGGACCAGGCAATCTCTGTCATGGCCAAATCTGGGTTTGCAGAGCTGATTGATTTCAATCCCATTCGAGCTACTGATGTGAAACTTCCTGC
TGGTGGGACTTTTGTTATAGCCCATTCTCTAGCCGAATCACAGAAAGCAGTCACTGCTGCCACAAATTATAATAACAGAGTTGTTGAGTGCCGACTTGCTGCTATTGTTC
TAGGCATAAAGCTTGGGATGAAACCACAAGATGCAATAGCACAAGTGAAAACTTTATCTGATGTGGAAGGGCAGTGTGTTTCATTTGCTAAGGAGCGTAATTCTTCAGAT
CCTGGGCTTGCTGTCAAGGAACTGTTGAAAGAGGATCCCTATACAGCTGAAGAAATCGAACAAATCACTGTGGACAATCTGCCTTCCATATTAGGCGATTCCCCAACTTC
ATTAGATGTTCTAAAAGCTGCTAAGCATTTCAAGCTGTATCAGCGAGCGGCCCACGTGTACTCTGAAGCCAAGCGAGTCTTCGCTTTCAAGGATGCAGTGTCGTCAAGTT
TAAGTGAGGAAGACAAGCTTAAGAAGCTGGGTGATCTTATGAATGATAGTCACTACAGCTGTAGCGTTCTTTATGAATGCAGCTGTCCTGAACTGGAAGAACTCGTAAAG
ATATGCCGGGACAACGGTGCTCTCGGGGCCAGGCTAACTGGAGCAGGATGGGGTGGCTGTGCAGTCGCTCTCGTAAAAGAGGCCATTGTTCCCCAGTTCATTCTTAATCT
AAAGGAGAAGTTCTACCAATCGAGGATCGAGAGAGGGATCATCGACAAGAATGATCTTGGTCTCTACGTCTTCGCTTCCAAGCCATCGAGTGGAGCAGCCATCTTCCAGT
TTTAGTTCGAATTCCTTCGGGGTTAGTCCGGAAGGTTTCTTCCATTATTGCAGTTGGAGGAAGTAAGTATATTATATCTCAATTGGATGAAAACTACGTTAACTTGTGCT
TTTAGAATTCTGTGGACATTTTTCATAACAAAGAATTCAGCTCTGAACTCGTTGTTCTTTGCCTATATAATAAAGGAATGTTTAGCCTGCAAAAATGAGCTATATTTTAA
TTCTCAGTTCAATGTTTT
Protein sequenceShow/hide protein sequence
MAKHEDLPVPVFSSLEPVYGDGSQLEEARIRFDHLKDKFLQVFGHAPDVFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRKHDAGEAEHLLRISNVNDKYSMCTY
PADPDQEVDLKNHKWGHYFLCAYKGYYEFSKSKGQDVGVPVGLDVLVDGTVPTGSGLSSSAAFVCSSTIAIMAAFGTNFPKKELAQLTCECERHIGTQSGGMDQAISVMA
KSGFAELIDFNPIRATDVKLPAGGTFVIAHSLAESQKAVTAATNYNNRVVECRLAAIVLGIKLGMKPQDAIAQVKTLSDVEGQCVSFAKERNSSDPGLAVKELLKEDPYT
AEEIEQITVDNLPSILGDSPTSLDVLKAAKHFKLYQRAAHVYSEAKRVFAFKDAVSSSLSEEDKLKKLGDLMNDSHYSCSVLYECSCPELEELVKICRDNGALGARLTGA
GWGGCAVALVKEAIVPQFILNLKEKFYQSRIERGIIDKNDLGLYVFASKPSSGAAIFQF