; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030528 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030528
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCholine kinase
Genome locationtig00154107:1265878..1276865
RNA-Seq ExpressionSgr030528
SyntenySgr030528
Gene Ontology termsGO:0006646 - phosphatidylethanolamine biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0005886 - plasma membrane (cellular component)
GO:0004305 - ethanolamine kinase activity (molecular function)
InterPro domainsIPR011009 - Protein kinase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607620.1 putative ethanolamine kinase, partial [Cucurbita argyrosperma subsp. sororia]5.1e-16381.25Show/hide
Query:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG
        MGA+KI NGS  +  A GD E N E+Y+ S                  ELCKDLF +WS+LD+SRFSVETVSGGITNLLLKV+VKEESG+ VSVTVRLYG
Subjt:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG

Query:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG
        PNTDYVINRDREL AIK+LSAAGFGAKLLGVF NGMVQSFI+ARTLEP DM K KLAAEIAKQLNKFH+V IPGSKEPQLWN+I KFY+KASALQFDDTG
Subjt:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG

Query:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL
        KQSIYDTISF+EIH E+LEIKELT  LN+PVVFAHNDLLSGN+MLNEEEERLY IDFEYGSYSYRG+DIGNHFNEYAGYDC+YS YPSK EQYHFFRHYL
Subjt:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL

Query:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
        +PEKPDEVSQ+DLEAL VESNTFMLASHLYWALWALIQARMSPI+FDYL YFFLRY EYKKQKEKYCS
Subjt:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

XP_008463050.1 PREDICTED: probable ethanolamine kinase isoform X2 [Cucumis melo]2.0e-16281.25Show/hide
Query:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG
        MGAKKI NG   V EA  D   N E YQ S                  ELCKDLFK+WS+LD SRFSVETVSGGITN LLKV+VKEESG  VSVTVRLYG
Subjt:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG

Query:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG
        PNTDYVINRDREL AIK+LSAAGFGAKLLGVF NGMVQSFI+ARTLEP D+ K +LAAEIAKQLNKFH+V IPGS EPQLWN++ KFY+KAS LQFDDTG
Subjt:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG

Query:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL
        KQSIYDTISF+EIH EILEIKELT  LNAPVVFAHNDLLSGNLMLNEEE RLYFIDFEYGSYSYRG+DIGNHFNEYAGYDC+YS YPSK EQYHFFRHYL
Subjt:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL

Query:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
        +PEKPDEVSQ+DLEAL VESNTFMLASHLYWALWALIQARMSPI+FDYL YFFLRYGEYKKQKEKYCS
Subjt:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

XP_022143237.1 probable ethanolamine kinase [Momordica charantia]1.2e-16781.74Show/hide
Query:  MGAKKICNGSEQVEEAGDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGP
        MGAKKICNGS  VEEAGD E   ETY +S                  ELCKDLFK+WS+LDDSRFSV+TVSGGITNLLLKV+VKEESG+DVSVTVRLYGP
Subjt:  MGAKKICNGSEQVEEAGDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGP

Query:  NTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGK
        NTDYVINRDREL AIK+LSAAGFGAKLLGVF NGMVQSFINARTLEP D    K+A EIAKQLNKFHQVN+ G KEPQLWNDIFKFY+KASALQFDDTGK
Subjt:  NTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGK

Query:  QSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLR
        QSIYDTISF+EIH EILEIKELT  LNAP VFAHNDLLSGN+MLNE E RLYFIDFEYGSY+YRGYDIGNHFNEYAGY+C+YS+YPSK+EQYHFFRHYL+
Subjt:  QSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLR

Query:  PEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
        PEKPDEVS++DLE L VESNTFMLASHLYWALWALIQARMSPI FDY+GYFFLRYGEYKKQKE  CS
Subjt:  PEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

XP_022982034.1 probable ethanolamine kinase isoform X1 [Cucurbita maxima]5.1e-16380.98Show/hide
Query:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG
        MGA+KI NGS  +E A GD E N E+Y+ S                  ELCKDLFK+WS+LD+SRFSVETVSGGITNLLLKV+VKEESG+ VSVTVRLYG
Subjt:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG

Query:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG
        PNTDYVINRDREL AIK+LSAAGFGA+LLGVF NGMVQSFI+ARTLEP DM K KLAAEIAKQLNKFH+V IPGSKEPQLWN+I KFY+KASALQFDDTG
Subjt:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG

Query:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL
        KQSIYDTISF+EIH E+LEIKELT  LN+PVVFAHNDLL+GN+MLNEEEERLY IDFEYGSYSYRG+DIGNHFNEYAGYDC+YS YPSK EQYHFFRHYL
Subjt:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL

Query:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
        +PEKPDEVS +DLEAL VESNTFMLASHLYWALWALIQARMSPI+FDYL YFFLRY EYKKQKEKYCS
Subjt:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

XP_023521006.1 probable ethanolamine kinase isoform X1 [Cucurbita pepo subsp. pepo]5.1e-16381.25Show/hide
Query:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG
        MGA+KI NGS  +  A GD E N E+Y+ S                  ELCKDLF +WS+LD+SRFSVETVSGGITNLLLKV+VKEESG+ VSVTVRLYG
Subjt:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG

Query:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG
        PNTDYVINRDREL AIK+LSAAGFGAKLLGVF NGMVQSFI+ARTLEP DM K KLAAEIAKQLNKFH+V IPGSKEPQLWN+I KFY+KASALQFDDTG
Subjt:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG

Query:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL
        KQSIYDTISF+EIH E+LEIKELT  LN+PVVFAHNDLLSGN+MLNEEEERLY IDFEYGSYSYRG+DIGNHFNEYAGYDC+YS YPSK EQYHFFRHYL
Subjt:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL

Query:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
         PEKPDEVSQ+DLEAL VESNTFMLASHLYWALWALIQARMSPI+FDYL YFFLRY EYKKQKEKYCS
Subjt:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

TrEMBL top hitse value%identityAlignment
A0A0A0KDN9 Uncharacterized protein1.2e-16280.98Show/hide
Query:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG
        MGAKKI NG   V EA  D + N E+YQ S                  ELCKDLFK+WS+LD SRFSVETVSGGITN LLKV+VKEESG  VSVTVRLYG
Subjt:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG

Query:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG
        PNTDYVINRDREL AIK+LSAAGFGAKLLGVF NGMVQSFI+ARTLEP D+ K +LAAEIAKQLNKFH+V IPGS EPQLWN+I  FY KAS LQFDDTG
Subjt:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG

Query:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL
        KQSIYDTISF+EIH EILEIKELT  LNAP+VFAHNDLLSGNLMLNEEE RLYFIDFEYGSYSYRG+DIGNHFNEYAGYDC+YS YPSK EQYHFFRHYL
Subjt:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL

Query:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
        +PEKPDEVSQ+DLEAL VESNTFMLASHLYWALWALIQARMSPI+FDYL YFFLRYGEYKKQKEKYCS
Subjt:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

A0A1S3CIA0 probable ethanolamine kinase isoform X29.5e-16381.25Show/hide
Query:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG
        MGAKKI NG   V EA  D   N E YQ S                  ELCKDLFK+WS+LD SRFSVETVSGGITN LLKV+VKEESG  VSVTVRLYG
Subjt:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG

Query:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG
        PNTDYVINRDREL AIK+LSAAGFGAKLLGVF NGMVQSFI+ARTLEP D+ K +LAAEIAKQLNKFH+V IPGS EPQLWN++ KFY+KAS LQFDDTG
Subjt:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG

Query:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL
        KQSIYDTISF+EIH EILEIKELT  LNAPVVFAHNDLLSGNLMLNEEE RLYFIDFEYGSYSYRG+DIGNHFNEYAGYDC+YS YPSK EQYHFFRHYL
Subjt:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL

Query:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
        +PEKPDEVSQ+DLEAL VESNTFMLASHLYWALWALIQARMSPI+FDYL YFFLRYGEYKKQKEKYCS
Subjt:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

A0A5D3D4J8 Putative ethanolamine kinase isoform X29.5e-16381.25Show/hide
Query:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG
        MGAKKI NG   V EA  D   N E YQ S                  ELCKDLFK+WS+LD SRFSVETVSGGITN LLKV+VKEESG  VSVTVRLYG
Subjt:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG

Query:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG
        PNTDYVINRDREL AIK+LSAAGFGAKLLGVF NGMVQSFI+ARTLEP D+ K +LAAEIAKQLNKFH+V IPGS EPQLWN++ KFY+KAS LQFDDTG
Subjt:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG

Query:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL
        KQSIYDTISF+EIH EILEIKELT  LNAPVVFAHNDLLSGNLMLNEEE RLYFIDFEYGSYSYRG+DIGNHFNEYAGYDC+YS YPSK EQYHFFRHYL
Subjt:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL

Query:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
        +PEKPDEVSQ+DLEAL VESNTFMLASHLYWALWALIQARMSPI+FDYL YFFLRYGEYKKQKEKYCS
Subjt:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

A0A6J1CQ77 probable ethanolamine kinase5.7e-16881.74Show/hide
Query:  MGAKKICNGSEQVEEAGDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGP
        MGAKKICNGS  VEEAGD E   ETY +S                  ELCKDLFK+WS+LDDSRFSV+TVSGGITNLLLKV+VKEESG+DVSVTVRLYGP
Subjt:  MGAKKICNGSEQVEEAGDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGP

Query:  NTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGK
        NTDYVINRDREL AIK+LSAAGFGAKLLGVF NGMVQSFINARTLEP D    K+A EIAKQLNKFHQVN+ G KEPQLWNDIFKFY+KASALQFDDTGK
Subjt:  NTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGK

Query:  QSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLR
        QSIYDTISF+EIH EILEIKELT  LNAP VFAHNDLLSGN+MLNE E RLYFIDFEYGSY+YRGYDIGNHFNEYAGY+C+YS+YPSK+EQYHFFRHYL+
Subjt:  QSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLR

Query:  PEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
        PEKPDEVS++DLE L VESNTFMLASHLYWALWALIQARMSPI FDY+GYFFLRYGEYKKQKE  CS
Subjt:  PEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

A0A6J1J3G7 probable ethanolamine kinase isoform X12.5e-16380.98Show/hide
Query:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG
        MGA+KI NGS  +E A GD E N E+Y+ S                  ELCKDLFK+WS+LD+SRFSVETVSGGITNLLLKV+VKEESG+ VSVTVRLYG
Subjt:  MGAKKICNGSEQVEEA-GDRERNVETYQSS------------------ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYG

Query:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG
        PNTDYVINRDREL AIK+LSAAGFGA+LLGVF NGMVQSFI+ARTLEP DM K KLAAEIAKQLNKFH+V IPGSKEPQLWN+I KFY+KASALQFDDTG
Subjt:  PNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTG

Query:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL
        KQSIYDTISF+EIH E+LEIKELT  LN+PVVFAHNDLL+GN+MLNEEEERLY IDFEYGSYSYRG+DIGNHFNEYAGYDC+YS YPSK EQYHFFRHYL
Subjt:  KQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYL

Query:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS
        +PEKPDEVS +DLEAL VESNTFMLASHLYWALWALIQARMSPI+FDYL YFFLRY EYKKQKEKYCS
Subjt:  RPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQKEKYCS

SwissProt top hitse value%identityAlignment
A7MCT6 Ethanolamine kinase 23.0e-4937.33Show/hide
Query:  GITNLLLKVSVKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIP
        GITN LL   V+E+  +   V VR+YG  T+ +++R+ E+   + L A G   KL   F NG+   ++    L P  + + +L   IA ++ K H ++  
Subjt:  GITNLLLKVSVKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIP

Query:  GS-KEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNH
        GS  +P LW+ + +++     L  D+       D    E + +E+  +KE   +L++PVVF HNDLL  N++ + ++ R+ FID+EY  Y+Y+ +DIGNH
Subjt:  GS-KEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNH

Query:  FNEYAGYD-CNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQK
        FNE+AG +  +YSRYP++  Q  + R+YL  +K    S  ++E L  + N F LASH +WALWALIQ + S I+FD+L Y  +R+ +Y K K
Subjt:  FNEYAGYD-CNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQK

O81024 Probable ethanolamine kinase3.4e-14175.56Show/hide
Query:  ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEP
        ELCKDLFK W +LDDS FSVE VSGGITNLLLKVSVKE++  +VSVTVRLYGPNT+YVINR+RE+ AIK+LSAAGFGAKLLG FGNGMVQSFINARTLEP
Subjt:  ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEP

Query:  PDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEE
         DM + K+AA+IA++L KFH+V+IPGSKEPQLW DI KFY+KAS L F++  KQ +++TISFEE+HKEI+E++E T  LNAPVVFAHNDLLSGN MLN+E
Subjt:  PDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEE

Query:  EERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDY
        EE+LY IDFEYGSY+YRG+DIGNHFNEYAGYDC+YS YPSK EQYHF +HYL+P+KPDEVS  ++E++ VE++ + LASHLYWA+WA+IQARMSPI F+Y
Subjt:  EERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDY

Query:  LGYFFLRYGEYKKQK
        LGYFFLRY EYKKQK
Subjt:  LGYFFLRYGEYKKQK

Q869T9 Probable ethanolamine kinase A3.0e-5738.56Show/hide
Query:  DSRFSVETVSGGITNLLLKVSVK--EESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEI
        D   +++ ++GGITN+L  V  K  E+    + V +RLYG  ++ +I+R  EL         G GAK  G+F NG +  FI    L   D++K  +   I
Subjt:  DSRFSVETVSGGITNLLLKVSVK--EESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEI

Query:  AKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYG
        AK++ ++H + +P  K P LW  I K+   A  + +    K   Y +I+ +++ +E   +++   +LN+P+VF HNDLLSGN++ +  +    FIDFEY 
Subjt:  AKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYG

Query:  SYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYK
        +Y++RG ++GNHFNEYAG+  +YS YP+K  Q HF   Y R     E +Q++LE L +ESN F LASHLYW  WA++QA  S I+FDYL Y   R+  Y 
Subjt:  SYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYK

Query:  KQKEKY
        + ++++
Subjt:  KQKEKY

Q9D4V0 Ethanolamine kinase 12.6e-4836.33Show/hide
Query:  DDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIA
        D    +++  + GITN L+   V  ++  DV V VR+YG  T+ +++RD E+ + + L A G   +L   F NG+   FI    L+P  +    +   IA
Subjt:  DDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIA

Query:  KQLNKFHQVNIPGSKEPQ--LWNDIFKFYKK-ASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFE
        +QL K H ++      P+  LW  + K++    +    ++  K+ + +  S + + +E+  +KEL   L +PVV  HNDLL  N++ NE++  + FID+E
Subjt:  KQLNKFHQVNIPGSKEPQ--LWNDIFKFYKK-ASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFE

Query:  YGSYSYRGYDIGNHFNEYAGY-DCNYSRYPSKVEQYHFFRHYLRPEKP-----DEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYF
        Y  Y+Y  YDIGNHFNE+AG  D +YS YP +  Q  + R YL   K       +V+++++E L ++ N F LASH +W LWALIQA+ S I FD+LGY 
Subjt:  YGSYSYRGYDIGNHFNEYAGY-DCNYSRYPSKVEQYHFFRHYLRPEKP-----DEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYF

Query:  FLRYGEYKKQK
         +R+ +Y K K
Subjt:  FLRYGEYKKQK

Q9HBU6 Ethanolamine kinase 13.0e-4935.04Show/hide
Query:  GSEQVEEAGDRERNVETYQSSE----LCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVS--VTVRLYGPNTDYVINRDRELDAIKHLSA
        GS +V +     ++ E ++  E    L + L   W   D    +++  + GITN L+   V    GN +   V VR+YG  T+ +++RD E+ + + L A
Subjt:  GSEQVEEAGDRERNVETYQSSE----LCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVS--VTVRLYGPNTDYVINRDRELDAIKHLSA

Query:  AGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQ--LWNDIFKFYKK-ASALQFDDTGKQSIYDTISFEEIHKEIL
         G   +L   F NG+   FI    L+P  +    +   IA+QL K H ++      P+  LW  + K++    +    +D  K+ + D  S + + +E+ 
Subjt:  AGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQ--LWNDIFKFYKK-ASALQFDDTGKQSIYDTISFEEIHKEIL

Query:  EIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGY-DCNYSRYPSKVEQYHFFRHYLRPEKP-----DEVSQED
         +KE+   L +PVV  HNDLL  N++ NE++  + FID+EY  Y+Y  YDIGNHFNE+AG  D +YS YP +  Q  + R YL   K       EV++++
Subjt:  EIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGY-DCNYSRYPSKVEQYHFFRHYLRPEKP-----DEVSQED

Query:  LEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQK
        +E L ++ N F LASH +W LWALIQA+ S I FD+LGY  +R+ +Y K K
Subjt:  LEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEYKKQK

Arabidopsis top hitse value%identityAlignment
AT1G71697.1 choline kinase 19.1e-4132.15Show/hide
Query:  DDSRFSVETVSGGITNLLLKVSVKEESGNDV--SVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAE
        D  R  V  + G +TN + +++    +G DV   V VR+YG   D   NR  E+   + +S  G+G KLLG F +G ++ FI+ARTL   D+  ++ +  
Subjt:  DDSRFSVETVSGGITNLLLKVSVKEESGNDV--SVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEPPDMTKSKLAAE

Query:  IAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEY
        IA +L +FH++++PG K   LW  +  + K+A  L           D    E +  EI  ++E   R +  + F HNDL  GN+M++E    +  ID+EY
Subjt:  IAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEEEERLYFIDFEY

Query:  GSYSYRGYDIGNHFNEYAG-------YDCNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYF
         S++   YDI NHF E A        +  +Y+ YP + E+  F   YL     +  S +++E L  ++ ++ LA+H++W LW +I   ++ I FDY+ Y 
Subjt:  GSYSYRGYDIGNHFNEYAG-------YDCNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYLGYF

Query:  FLRYGEYKKQK
          R+ +Y  +K
Subjt:  FLRYGEYKKQK

AT1G74320.1 Protein kinase superfamily protein4.1e-4132.55Show/hide
Query:  GDRERNVETYQ------SSELCKDLFKKWSKLDDSR-FSVETVSGGITNLLLKVS-VKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAK
        G  E+NVE  Q        E  + +  +W  + DS+   V  + G +TN + ++     E G    V VR+YG   +   +R+ E+   + +S  G G  
Subjt:  GDRERNVETYQ------SSELCKDLFKKWSKLDDSR-FSVETVSGGITNLLLKVS-VKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAK

Query:  LLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRL
        LLG FGNG ++ F++ARTL   D+   +++  IA ++ +FH + +PG+K+  LW+ +  +      L   +  K    D +   E+   +LE K L D  
Subjt:  LLGVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRL

Query:  NAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAG-------YDCNYSRYPSKVEQYHFFRHYL--RPEKPDEVSQEDLEALDV
        +  + F HNDL  GN+M++EE + +  ID+EY  Y+   YDI NHF E A        +  +YS+YP   E+  F + Y+    EKP +   + L   DV
Subjt:  NAPVVFAHNDLLSGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAG-------YDCNYSRYPSKVEQYHFFRHYL--RPEKPDEVSQEDLEALDV

Query:  ESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEY
        E  T  LASHL W LW +I   ++ I+FDY+ Y   R+ +Y
Subjt:  ESNTFMLASHLYWALWALIQARMSPINFDYLGYFFLRYGEY

AT2G26830.1 Protein kinase superfamily protein2.4e-14275.56Show/hide
Query:  ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEP
        ELCKDLFK W +LDDS FSVE VSGGITNLLLKVSVKE++  +VSVTVRLYGPNT+YVINR+RE+ AIK+LSAAGFGAKLLG FGNGMVQSFINARTLEP
Subjt:  ELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINARTLEP

Query:  PDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEE
         DM + K+AA+IA++L KFH+V+IPGSKEPQLW DI KFY+KAS L F++  KQ +++TISFEE+HKEI+E++E T  LNAPVVFAHNDLLSGN MLN+E
Subjt:  PDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLMLNEE

Query:  EERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDY
        EE+LY IDFEYGSY+YRG+DIGNHFNEYAGYDC+YS YPSK EQYHF +HYL+P+KPDEVS  ++E++ VE++ + LASHLYWA+WA+IQARMSPI F+Y
Subjt:  EERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDY

Query:  LGYFFLRYGEYKKQK
        LGYFFLRY EYKKQK
Subjt:  LGYFFLRYGEYKKQK

AT4G09760.1 Protein kinase superfamily protein2.4e-4132.23Show/hide
Query:  ELCKDLFKKWSKL--DDSRFSVETVSGGITNLLLKVS--VKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINAR
        ++ + L  KW  +  D     V+ + G +TN +  VS   KE +     + VR+YG   +   NRD E+   ++++  G G  LLG F  G V+ FI+AR
Subjt:  ELCKDLFKKWSKL--DDSRFSVETVSGGITNLLLKVS--VKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINAR

Query:  TLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLM
        TL   D+    ++A +A +L +FH ++IPG +   +W+ +  +  +A  L  ++   +   D I  E      + + E        + F HNDL  GN+M
Subjt:  TLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLM

Query:  LNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAG-YDCN------YSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALI
        ++EE   +  ID+EY SY+   YDI NHF E A  Y  N      Y+ YP + E+  F  +YL     +E  +ED+E L  +   + LASHL+W LW +I
Subjt:  LNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAG-YDCN------YSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALI

Query:  QARMSPINFDYLGYFFLRYGEYKKQKEKYCSW
           ++ I FDY+ Y   R+ +Y  +K K  S+
Subjt:  QARMSPINFDYLGYFFLRYGEYKKQKEKYCSW

AT4G09760.2 Protein kinase superfamily protein2.4e-4132.23Show/hide
Query:  ELCKDLFKKWSKL--DDSRFSVETVSGGITNLLLKVS--VKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINAR
        ++ + L  KW  +  D     V+ + G +TN +  VS   KE +     + VR+YG   +   NRD E+   ++++  G G  LLG F  G V+ FI+AR
Subjt:  ELCKDLFKKWSKL--DDSRFSVETVSGGITNLLLKVS--VKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLLGVFGNGMVQSFINAR

Query:  TLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLM
        TL   D+    ++A +A +L +FH ++IPG +   +W+ +  +  +A  L  ++   +   D I  E      + + E        + F HNDL  GN+M
Subjt:  TLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLLSGNLM

Query:  LNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAG-YDCN------YSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALI
        ++EE   +  ID+EY SY+   YDI NHF E A  Y  N      Y+ YP + E+  F  +YL     +E  +ED+E L  +   + LASHL+W LW +I
Subjt:  LNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAG-YDCN------YSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALI

Query:  QARMSPINFDYLGYFFLRYGEYKKQKEKYCSW
           ++ I FDY+ Y   R+ +Y  +K K  S+
Subjt:  QARMSPINFDYLGYFFLRYGEYKKQKEKYCSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGCCAAGAAGATCTGCAATGGCTCCGAGCAGGTGGAAGAAGCCGGAGACAGAGAACGCAACGTCGAGACGTATCAGTCATCCGAGTTGTGCAAAGATCTGTTCAA
GAAGTGGTCGAAGCTGGACGATTCTCGCTTCTCCGTTGAAACGGTCTCTGGTGGAATCACTAATCTACTGCTTAAGGTTTCTGTGAAAGAGGAAAGTGGTAATGACGTTT
CTGTCACCGTCAGACTATATGGACCTAACACGGATTATGTAATTAATCGTGACAGGGAACTGGATGCAATCAAACATCTTTCAGCTGCAGGATTTGGTGCTAAGCTTCTT
GGAGTTTTTGGAAATGGCATGGTGCAGTCATTTATTAATGCACGTACACTAGAACCACCAGATATGACAAAGTCAAAGCTAGCTGCAGAAATTGCCAAACAGCTTAATAA
ATTCCACCAAGTAAATATTCCAGGTTCTAAAGAACCTCAGTTATGGAATGACATTTTCAAATTTTACAAGAAAGCCTCTGCACTTCAATTTGATGATACTGGGAAGCAAA
GCATATATGATACAATTTCATTTGAGGAAATTCATAAGGAAATACTTGAGATTAAGGAACTAACAGACCGCCTTAATGCCCCTGTAGTGTTTGCTCACAATGACCTGCTT
TCTGGGAACCTAATGCTAAATGAGGAAGAAGAGCGGCTCTACTTCATTGATTTCGAGTACGGATCATACAGTTACAGAGGATATGACATCGGCAATCACTTCAATGAATA
TGCAGGCTATGACTGTAACTACAGCCGTTATCCATCCAAGGTTGAGCAGTATCATTTCTTCAGGCATTATTTACGACCTGAAAAACCAGATGAGGTATCTCAAGAAGATC
TCGAAGCTCTGGACGTAGAGTCGAACACATTCATGCTAGCTTCTCACTTGTATTGGGCTTTATGGGCGCTGATACAGGCAAGGATGTCTCCGATCAATTTCGATTACCTC
GGTTACTTCTTCCTGCGTTACGGCGAATACAAAAAGCAGAAAGAAAAATATTGCTCTTGGCGAGATCTTTCCTTGCTGGATCAGGATCGGGTTCTGTACCTGCATAGAAG
ATGCAATTGGGGGGGTAGGATCAGCAATCAGTTTATATTCCATAAAGATGTAGGAAGTTCTTTACTGGTTCTGTTTGATCTCTGCTGTAAACTGAACCAACCATACTCTC
CAAGAGACCATCATGGCTACCCAAAACAAGCAAATAAAAGGATTCAAATCCTTTTACAAGTCCCTGATGGAGCTCGGGGTATACCGGCATTCGATCCTGCTCCGATGGAC
AAAGTACCTGTTTATAAGTATAACTGGTTGGGTAGGAAAGCTAAGAAAGGAACTGATCAGCATGGCGGTCGGAGAGGAATGGAGGGAGGCGGTGGCGGTGGCGGTGGCGA
AGGAATGGGGGCGAGAAAGCTGGTGGTTGAGGGTAGGAAGTCAGTTTCCCATGTGGAAACAAACTTGGCATCAGTGGCTGCATTTCTTCAAGTGAAGGTATTGGTATCAG
ACATGCCTCAGATGATGCAGGTTCAGGCTTTTAGGTCAGCAAGGAGGAGTTATGACAGCTTGGAGAAGTTCAGCTCAAAACATATGGCTTACAATATTAAAAAGGTTTGT
TCTATTGGCAATATTTTTTTTTTTTTCTTTTTCTCTTTTGGGTATTGGGAATTTCATATTGATATGATCTGGTTTGAGGAAGTGCTCAAAAGAAAAGCAGATGACAAGGC
CCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGCCAAGAAGATCTGCAATGGCTCCGAGCAGGTGGAAGAAGCCGGAGACAGAGAACGCAACGTCGAGACGTATCAGTCATCCGAGTTGTGCAAAGATCTGTTCAA
GAAGTGGTCGAAGCTGGACGATTCTCGCTTCTCCGTTGAAACGGTCTCTGGTGGAATCACTAATCTACTGCTTAAGGTTTCTGTGAAAGAGGAAAGTGGTAATGACGTTT
CTGTCACCGTCAGACTATATGGACCTAACACGGATTATGTAATTAATCGTGACAGGGAACTGGATGCAATCAAACATCTTTCAGCTGCAGGATTTGGTGCTAAGCTTCTT
GGAGTTTTTGGAAATGGCATGGTGCAGTCATTTATTAATGCACGTACACTAGAACCACCAGATATGACAAAGTCAAAGCTAGCTGCAGAAATTGCCAAACAGCTTAATAA
ATTCCACCAAGTAAATATTCCAGGTTCTAAAGAACCTCAGTTATGGAATGACATTTTCAAATTTTACAAGAAAGCCTCTGCACTTCAATTTGATGATACTGGGAAGCAAA
GCATATATGATACAATTTCATTTGAGGAAATTCATAAGGAAATACTTGAGATTAAGGAACTAACAGACCGCCTTAATGCCCCTGTAGTGTTTGCTCACAATGACCTGCTT
TCTGGGAACCTAATGCTAAATGAGGAAGAAGAGCGGCTCTACTTCATTGATTTCGAGTACGGATCATACAGTTACAGAGGATATGACATCGGCAATCACTTCAATGAATA
TGCAGGCTATGACTGTAACTACAGCCGTTATCCATCCAAGGTTGAGCAGTATCATTTCTTCAGGCATTATTTACGACCTGAAAAACCAGATGAGGTATCTCAAGAAGATC
TCGAAGCTCTGGACGTAGAGTCGAACACATTCATGCTAGCTTCTCACTTGTATTGGGCTTTATGGGCGCTGATACAGGCAAGGATGTCTCCGATCAATTTCGATTACCTC
GGTTACTTCTTCCTGCGTTACGGCGAATACAAAAAGCAGAAAGAAAAATATTGCTCTTGGCGAGATCTTTCCTTGCTGGATCAGGATCGGGTTCTGTACCTGCATAGAAG
ATGCAATTGGGGGGGTAGGATCAGCAATCAGTTTATATTCCATAAAGATGTAGGAAGTTCTTTACTGGTTCTGTTTGATCTCTGCTGTAAACTGAACCAACCATACTCTC
CAAGAGACCATCATGGCTACCCAAAACAAGCAAATAAAAGGATTCAAATCCTTTTACAAGTCCCTGATGGAGCTCGGGGTATACCGGCATTCGATCCTGCTCCGATGGAC
AAAGTACCTGTTTATAAGTATAACTGGTTGGGTAGGAAAGCTAAGAAAGGAACTGATCAGCATGGCGGTCGGAGAGGAATGGAGGGAGGCGGTGGCGGTGGCGGTGGCGA
AGGAATGGGGGCGAGAAAGCTGGTGGTTGAGGGTAGGAAGTCAGTTTCCCATGTGGAAACAAACTTGGCATCAGTGGCTGCATTTCTTCAAGTGAAGGTATTGGTATCAG
ACATGCCTCAGATGATGCAGGTTCAGGCTTTTAGGTCAGCAAGGAGGAGTTATGACAGCTTGGAGAAGTTCAGCTCAAAACATATGGCTTACAATATTAAAAAGGTTTGT
TCTATTGGCAATATTTTTTTTTTTTTCTTTTTCTCTTTTGGGTATTGGGAATTTCATATTGATATGATCTGGTTTGAGGAAGTGCTCAAAAGAAAAGCAGATGACAAGGC
CCCTTGA
Protein sequenceShow/hide protein sequence
MGAKKICNGSEQVEEAGDRERNVETYQSSELCKDLFKKWSKLDDSRFSVETVSGGITNLLLKVSVKEESGNDVSVTVRLYGPNTDYVINRDRELDAIKHLSAAGFGAKLL
GVFGNGMVQSFINARTLEPPDMTKSKLAAEIAKQLNKFHQVNIPGSKEPQLWNDIFKFYKKASALQFDDTGKQSIYDTISFEEIHKEILEIKELTDRLNAPVVFAHNDLL
SGNLMLNEEEERLYFIDFEYGSYSYRGYDIGNHFNEYAGYDCNYSRYPSKVEQYHFFRHYLRPEKPDEVSQEDLEALDVESNTFMLASHLYWALWALIQARMSPINFDYL
GYFFLRYGEYKKQKEKYCSWRDLSLLDQDRVLYLHRRCNWGGRISNQFIFHKDVGSSLLVLFDLCCKLNQPYSPRDHHGYPKQANKRIQILLQVPDGARGIPAFDPAPMD
KVPVYKYNWLGRKAKKGTDQHGGRRGMEGGGGGGGGEGMGARKLVVEGRKSVSHVETNLASVAAFLQVKVLVSDMPQMMQVQAFRSARRSYDSLEKFSSKHMAYNIKKVC
SIGNIFFFFFFSFGYWEFHIDMIWFEEVLKRKADDKAP