CuGenDBv2

Clc06G15080 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G15080
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: galacturonokinase
Genome location: ClcChr06:26023277..26033930
RNA-Seq Expression: Clc06G15080
Synteny: Clc06G15080
Gene Ontology terms:
GO:0006012 - galactose metabolic process (biological process)
GO:0046396 - D-galacturonate metabolic process (biological process)
GO:0046835 - carbohydrate phosphorylation (biological process)
GO:0005829 - cytosol (cellular component)
GO:0004335 - galactokinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0047912 - galacturonokinase activity (molecular function)
InterPro domains:
IPR000705 - Galactokinase
IPR006204 - GHMP kinase N-terminal domain
IPR006206 - Mevalonate/galactokinase
IPR013750 - GHMP kinase, C-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR019539 - Galactokinase, N-terminal domain
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily


Homology
GenBank top hits    e value    % identity
XP_004149677.1 galacturonokinase [Cucumis sativus]    4.4e-203    87.26
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSKRSKEDVR+VVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQV+LRSAQFKGDVNFRVDEKLYPNHC+NKKEGTN N             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK
                    EHCLSQGIIGYI GSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCK
Subjt:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK

Query:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN
        TKDFKLIRPL MESSLKSE QKEYQILLAFSGLKQALT+NPGYNHRVAECQEAAKILLNASGNSH+EPLLCNV+QEAY+AHK+QLE NLAKRAEHYFSEN
Subjt:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN

Query:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP
        TRVLQGLEAWASG LEDFGKLIA SGRSSIVNYECGAEPL+QLYEILLRAPGVCGARFSGAGFRGCCLA VD  YA EA EFVR EYMKVQPELAAQ+NP
Subjt:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP

Query:  ETAVLICEQGDCAHII
        +TAV+ICE G CAHII
Subjt:  ETAVLICEQGDCAHII

XP_016901439.1 PREDICTED: galacturonokinase [Cucumis melo]    1.5e-203    87.26
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSKRSKEDVR+VVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQV+LRSAQFKGDVNFRVDEKLYPN C+NKKEGTNAN             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK
                    EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPT+NIEYDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCK
Subjt:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK

Query:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN
        TKDFKLIRPL M +SLKSE QKEYQILLAFSGLKQALT+NPGYNHRVAECQEAAKILLNASGNSH+EPLLCNVEQEAY+AHK+QLE NLAKRAEHYFSEN
Subjt:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN

Query:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP
         RVLQGLEAWASG LEDFGKLIAASGRSSIVNYECGAEPL+QLYEILLRAPGVCGARFSGAGFRGCCLAFV+  YAA+A EFVR EYMKVQPELAAQ+NP
Subjt:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP

Query:  ETAVLICEQGDCAHII
        +TAV+ICE GDCAHII
Subjt:  ETAVLICEQGDCAHII

XP_022921351.1 galacturonokinase [Cucurbita moschata]    5.4e-201    86.06
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSK+S E VRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGD QV+LRSAQFKGDVNFRVDEK YPNH NNKKE TNAN             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK
                    EHCLSQGI+GYICGSDGLDSSGLSSSAAVGLAYLLALENANNL ISPTENI+YDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCK
Subjt:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK

Query:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN
        TKDFKLIRPL MESSLKSETQKEYQILLA SGLKQALT+NPGYN+RVAECQEAAKILLNASGNSH+EPLLCNVEQEAYE HK++LETNLAKRAEHYFSEN
Subjt:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN

Query:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP
        TRVLQGLEAWASG LEDFGKL+AASGRSSIVNYECGAEPL+QLYEILL+APGVCGARFSGAGFRGCC+AFVDA+YAAEA +FVRKEY KVQPELAAQ++P
Subjt:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP

Query:  ETAVLICEQGDCAHII
        ETAVLICEQGDCA I+
Subjt:  ETAVLICEQGDCAHII

XP_023516455.1 galacturonokinase [Cucurbita pepo subsp. pepo]    3.2e-201    86.3
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSKRS E VRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGD QV+LRSAQFKGDVNFRVDEK YPNH NNKKE TNAN             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK
                    EHCLSQGI+GYICGSDGLDSSGLSSS  VGLAYLLALENANNL ISPTENI+YDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCK
Subjt:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK

Query:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN
        TKDFKLIRPL MESSLKSETQKEYQILLA SGLKQALT+NPGYN+RVAECQEAAKILLNASGNSHVEPLLCNVEQEAYE HK++LETNLAKRAEHYFSEN
Subjt:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN

Query:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP
        TRVLQG+EAWASG LEDFGKL+AASGRSSIVNYECGAEPL+QLYEILL+APGVCGARFSGAGFRGCC+AFVDA+YAAEA EFVRKEY KVQPELAAQ+NP
Subjt:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP

Query:  ETAVLICEQGDCAHII
        ETAVLICEQGDCA I+
Subjt:  ETAVLICEQGDCAHII

XP_038879629.1 galacturonokinase isoform X1 [Benincasa hispida]    1.1e-204    88.94
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGD QV+LRSAQFKGDVNFRVDE  YPNH  NKKEGTNAN             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK
                    EHCLSQGIIGYI GSD LDSSGLSSSAAVGLAYLLALENANNLTISP+ENIEYDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCK
Subjt:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK

Query:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN
        TKDFKLIRPL MESSLKSETQKEYQILLAFSGLKQALT+NPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQE YEAHK+QLETNLAKRAEHYFSEN
Subjt:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN

Query:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP
        TRVLQGLEAWASG LEDFGKLIAASGRSSIVNYECGAEPL+QLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAV+FV  EY KVQPELAAQMNP
Subjt:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP

Query:  ETAVLICEQGDCAHII
        ETAVLICE GDCAHII
Subjt:  ETAVLICEQGDCAHII

TrEMBL top hits    e value    % identity
A0A0A0LXI8 Uncharacterized protein    2.1e-203    87.26
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSKRSKEDVR+VVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQV+LRSAQFKGDVNFRVDEKLYPNHC+NKKEGTN N             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK
                    EHCLSQGIIGYI GSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCK
Subjt:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK

Query:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN
        TKDFKLIRPL MESSLKSE QKEYQILLAFSGLKQALT+NPGYNHRVAECQEAAKILLNASGNSH+EPLLCNV+QEAY+AHK+QLE NLAKRAEHYFSEN
Subjt:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN

Query:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP
        TRVLQGLEAWASG LEDFGKLIA SGRSSIVNYECGAEPL+QLYEILLRAPGVCGARFSGAGFRGCCLA VD  YA EA EFVR EYMKVQPELAAQ+NP
Subjt:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP

Query:  ETAVLICEQGDCAHII
        +TAV+ICE G CAHII
Subjt:  ETAVLICEQGDCAHII

A0A1S4DZQ3 galacturonokinase    7.3e-204    87.26
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSKRSKEDVR+VVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQV+LRSAQFKGDVNFRVDEKLYPN C+NKKEGTNAN             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK
                    EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPT+NIEYDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCK
Subjt:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK

Query:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN
        TKDFKLIRPL M +SLKSE QKEYQILLAFSGLKQALT+NPGYNHRVAECQEAAKILLNASGNSH+EPLLCNVEQEAY+AHK+QLE NLAKRAEHYFSEN
Subjt:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN

Query:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP
         RVLQGLEAWASG LEDFGKLIAASGRSSIVNYECGAEPL+QLYEILLRAPGVCGARFSGAGFRGCCLAFV+  YAA+A EFVR EYMKVQPELAAQ+NP
Subjt:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP

Query:  ETAVLICEQGDCAHII
        +TAV+ICE GDCAHII
Subjt:  ETAVLICEQGDCAHII

A0A6J1C161 galacturonokinase isoform X1    9.3e-191    84.03
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSKRS EDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGD QV+LRSA+FKGDVNFRVDE  YP+  +NKKEGT  N             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ---EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRP
           EHCLSQGIIGY+CGS+GLDSSGLSSSAAVGLAYLLALE+ANNLTISPTENIEYDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCKTK+F+LIRP
Subjt:  ---EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRP

Query:  LYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRVLQGLEA
        L  ESS KS+T + YQILLA SGL+QALT+NPGYNHRVAECQEAAKILLNASGN  VEPLLCNVE E YEAHK+ LETNLAKRAEHYFSEN RVLQGLEA
Subjt:  LYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRVLQGLEA

Query:  WASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNPETAVLICEQ
        WASG LE+FGKLIAASGRSSIVNYECG+EPL+QLYEILLRAPGV GARFSGAGFRGCCLAFVDA+ AAEA EFVR EY+KVQPELA Q+NPETAV ICE 
Subjt:  WASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNPETAVLICEQ

Query:  GDCAHII
        GDCAHII
Subjt:  GDCAHII

A0A6J1E153 galacturonokinase    2.6e-201    86.06
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSK+S E VRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGD QV+LRSAQFKGDVNFRVDEK YPNH NNKKE TNAN             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK
                    EHCLSQGI+GYICGSDGLDSSGLSSSAAVGLAYLLALENANNL ISPTENI+YDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCK
Subjt:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK

Query:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN
        TKDFKLIRPL MESSLKSETQKEYQILLA SGLKQALT+NPGYN+RVAECQEAAKILLNASGNSH+EPLLCNVEQEAYE HK++LETNLAKRAEHYFSEN
Subjt:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN

Query:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP
        TRVLQGLEAWASG LEDFGKL+AASGRSSIVNYECGAEPL+QLYEILL+APGVCGARFSGAGFRGCC+AFVDA+YAAEA +FVRKEY KVQPELAAQ++P
Subjt:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP

Query:  ETAVLICEQGDCAHII
        ETAVLICEQGDCA I+
Subjt:  ETAVLICEQGDCAHII

A0A6J1JJT8 galacturonokinase    3.4e-201    86.3
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------
        MSKRS EDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGD QV+LRSAQFKGDVNFRVDEK YPNH NNKKE T AN             
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNAN-------------

Query:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK
                    EHCLSQGI+GYICGSDGLDSSGLSSSAAVGLAYLLALENAN+L ISPTENI+YDRLIENGYLGL+NGILDQSAILLSSYGCLLHMNCK
Subjt:  ------------EHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCK

Query:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN
        TKDFKLIRPL MESSLKSETQKEYQILLA SGLKQALT+NPGYN+RVAECQEAAKILLNASGNSH+EPLLCNVEQEAYE HK++LETNLAKRAEHYFSEN
Subjt:  TKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSEN

Query:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP
        TRVLQGLEAWA G LEDFGKL+AASGRSSIVNYECGAEPL+QLYEILL+APGVCGARFSGAGFRGCC+AFVDA+YAAEA EFVRKEY KVQPELAAQ+NP
Subjt:  TRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNP

Query:  ETAVLICEQGDCAHII
        ETAVLICEQGDCA I+
Subjt:  ETAVLICEQGDCAHII

SwissProt top hits    e value    % identity
B1YIH8 Galactokinase    1.1e-28    31.29
Query:  SPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRV-DEKLYPNHCNNKKEGTNANEHCLSQGIIGYICGSDGL------D
        +P RI  +G H D+ GG+V   A+  G         DV     S  F+ D    V  + L P   +          H L +       G D L      +
Subjt:  SPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRV-DEKLYPNHCNNKKEGTNANEHCLSQGIIGYICGSDGL------D

Query:  SSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESSLKSETQKEYQILLAFS
         +GLSSSA++ L   + L+   NL I   + ++Y + +EN Y+G+ +GI+DQ AI +   G  L ++C+T D+    PL +           Y I++  +
Subjt:  SSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESSLKSETQKEYQILLAFS

Query:  GLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCN-VEQEAYEAHKAQLETNLAKRAEHYFSENTRVLQGLEAWASGSLEDFGKLIAASGRSSI
          ++ L  +  YN R +EC+ A   L      + +     N  E  ++E      +  L +RA H  SEN R LQ L+A     LE FG+L+ AS RS  
Subjt:  GLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCN-VEQEAYEAHKAQLETNLAKRAEHYFSENTRVLQGLEAWASGSLEDFGKLIAASGRSSI

Query:  VNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVD
        V+YE   + L  L E     PGV GAR +GAGF GC +A V+
Subjt:  VNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVD

Q88SE8 Galactokinase    1.3e-27    30.47
Query:  RIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQF--KGDVNFRVDEKLY------PNHCNNKKEGTNANEHCLSQGIIGYICG
        R+  SP RI  +G H D+ GG+V   AI  G    + P  D  V + SA     G V F V++  Y       N+     +           G   Y+ G
Subjt:  RIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQF--KGDVNFRVDEKLY------PNHCNNKKEGTNANEHCLSQGIIGYICG

Query:  SDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESSLKSETQKEYQI
        +   D +GLSSSA++ L   + L    NL IS  + ++  +  EN Y+G+ +GI+DQ A+ +      + ++  T D+      Y    L +       +
Subjt:  SDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESSLKSETQKEYQI

Query:  LLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRVLQGLEAWASGSLEDFGKLIAASG
        ++  +  K+   ++  YN R +EC+EA + L        +  L  +   EA  A+    ET L KRA H   EN R ++  +A A   L  FG+L+ AS 
Subjt:  LLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRVLQGLEAWASGSLEDFGKLIAASG

Query:  RSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEY
         S   +YE   + L  L E   + PGV GAR +GAGF GC +A VD +      E V K Y
Subjt:  RSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEY

Q8R8R7 Galactokinase    5.7e-28    28.42
Query:  RSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNANEHCLSQGII------G
        +S  ++R+  SP R+  +G H D+ GG V   A++ G         D +V + S  F   V   +D   Y       KE   AN     +G++      G
Subjt:  RSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNANEHCLSQGII------G

Query:  Y-------ICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESS
        Y       + G +    +GLSSSA++ +   +A+    NL I     ++  +  EN ++G+  GI+DQ A+ +   G  + +   T ++  + PL +E  
Subjt:  Y-------ICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESS

Query:  LKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQL-ETNLAKRAEHYFSENTRVLQGLEAWASGS
                Y+IL+  +  K+ L  +  YN R +EC++A   L  A    +    L  V  E +E +K  + +  L KRA H  +EN RVL  ++A     
Subjt:  LKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQL-ETNLAKRAEHYFSENTRVLQGLEAWASGS

Query:  LEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEY
        +  FGKL+  S  S   ++E   + L  L E  L+  GV G+R +GAGF GC ++ V  +   E +E V + Y
Subjt:  LEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEY

Q8VYG2 Galacturonokinase    6.5e-149    67.8
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNK---------KE----GTNA-
        MS R K +VR+VV+PYRICPLGAHIDHQGG VSAM INKG+LLGFVPSGD QV LRSAQF+G+V FRVDE  +P    NK         KE    GT A 
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNK---------KE----GTNA-

Query:  --------NEHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKD
                ++  L QGIIGY+ GS+GLDSSGLSSSAAVG+AYLLALENAN LT+SPTENIEYDRLIENGYLGL+NGILDQSAILLS+YGCL +M+CKT D
Subjt:  --------NEHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKD

Query:  FKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRV
         +L++          E +K ++ILLAFSGL+QALT+NPGYN RV+ECQEAAK+LL ASGNS +EP LCNVE   YEAHK +L+  LAKRAEHYFSEN RV
Subjt:  FKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRV

Query:  LQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNPETA
        ++G EAWASG+LE+FGKLI+ASG SSI NYECGAEPLIQLY+ILL+APGV GARFSGAGFRGCCLAFVDA  A  A  +V+ EY K QPE A  +N    
Subjt:  LQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNPETA

Query:  VLICEQGDCAHII
        VLICE GD A ++
Subjt:  VLICEQGDCAHII

Q97EZ6 Galactokinase    3.1e-26    27.99
Query:  KRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFK--GDVNFRVDEKL------YPNHCNNKKEGTNANEHCLSQG
        KR  E+V    SP R+  +G H D+ GG+V   A+  G         D +V+  S  F   G + F +D+        + N+     +  N + H +  G
Subjt:  KRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFK--GDVNFRVDEKL------YPNHCNNKKEGTNANEHCLSQG

Query:  IIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESSLKSE
              GS   + SGLSSSA++ +   + L +   L I+  E ++  +  EN ++G+  GI+DQ +I +    C + ++C T          +E S    
Subjt:  IIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESSLKSE

Query:  TQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRVLQGLEAWASGSLEDFG
            Y+I++A +  K+ L  +  YN R +EC+ A K L      + +  L    E E  E      +    +RA H   EN R L+ + +  +  L+ FG
Subjt:  TQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRVLQGLEAWASGSLEDFG

Query:  KLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEY
        KL+  S  S   +YE     L  L  + L + GV G+R +GAGF GC ++ V  +Y  E +E ++ +Y
Subjt:  KLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEY

Arabidopsis top hits    e value    % identity
AT3G06580.1 Mevalonate/galactokinase family protein    1.7e-11    24.65
Query:  SPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPN------HCNNKKEG-------TNANEHCLSQGI-IGYI
        SP R+  +G HID++G +V  MAI +  ++      D Q  LR A    +VN +     YP          N K G          +E+  S+G+ +G  
Subjt:  SPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPN------HCNNKKEG-------TNANEHCLSQGI-IGYI

Query:  CGSDGL------DSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESSLKS
         G D L        SGLSSSAA   +  +A+           E  +     E  ++G ++G +DQ+  +++  G       +  DF  +R     + +K 
Subjt:  CGSDGL------DSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESSLKS

Query:  ETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKIL--------------------------LNASGNSHVEPLLC---NVEQEAYEAHK-------
             + I  + +  ++A+T+   YN+RV EC+ A+ IL                            A      +PLL     +++E Y A +       
Subjt:  ETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKIL--------------------------LNASGNSHVEPLLC---NVEQEAYEAHK-------

Query:  -----------------AQLETNLAKRAEHYFSENTRVLQGLEAWASGSLED------FGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFS
                         A     L +RA H +SE  RV  G +   + +L D       G L+  S  S  V YEC    L +L ++  +  G  GAR +
Subjt:  -----------------AQLETNLAKRAEHYFSENTRVLQGLEAWASGSLED------FGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFS

Query:  GAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPE
        GAG+ GC +A V      + +  V+++Y K + E
Subjt:  GAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPE

AT3G10700.1 galacturonic acid kinase    4.6e-150    67.8
Query:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNK---------KE----GTNA-
        MS R K +VR+VV+PYRICPLGAHIDHQGG VSAM INKG+LLGFVPSGD QV LRSAQF+G+V FRVDE  +P    NK         KE    GT A 
Subjt:  MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNK---------KE----GTNA-

Query:  --------NEHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKD
                ++  L QGIIGY+ GS+GLDSSGLSSSAAVG+AYLLALENAN LT+SPTENIEYDRLIENGYLGL+NGILDQSAILLS+YGCL +M+CKT D
Subjt:  --------NEHCLSQGIIGYICGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKD

Query:  FKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRV
         +L++          E +K ++ILLAFSGL+QALT+NPGYN RV+ECQEAAK+LL ASGNS +EP LCNVE   YEAHK +L+  LAKRAEHYFSEN RV
Subjt:  FKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNHRVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRV

Query:  LQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNPETA
        ++G EAWASG+LE+FGKLI+ASG SSI NYECGAEPLIQLY+ILL+APGV GARFSGAGFRGCCLAFVDA  A  A  +V+ EY K QPE A  +N    
Subjt:  LQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCGARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNPETA

Query:  VLICEQGDCAHII
        VLICE GD A ++
Subjt:  VLICEQGDCAHII


Sequences
CDS sequence
ATGTCCAAGAGAAGCAAGGAGGACGTTCGCATAGTTGTCTCTCCCTATCGCATTTGTCCACTGGGAGCTCATATTGATCATCAGGGTGGGAATGTTTCAGCGATGGCCAT
AAACAAGGGAGTGCTTTTAGGATTTGTTCCTTCTGGCGATGTTCAGGTCATACTCCGTTCAGCTCAGTTCAAAGGAGATGTTAATTTCAGAGTTGATGAAAAGCTGTATC
CAAACCACTGTAATAACAAGAAGGAAGGGACTAATGCAAATGAACATTGTCTTTCTCAGGGTATAATAGGATATATTTGTGGTTCTGATGGACTTGACAGTTCAGGCCTC
AGCTCTTCTGCAGCTGTTGGATTGGCTTACTTGTTAGCGCTGGAAAATGCTAATAATTTAACAATATCTCCCACAGAAAATATTGAATATGACAGGCTAATTGAAAATGG
ATACTTGGGCCTGAAAAATGGCATACTGGACCAATCAGCAATATTGCTTTCAAGCTATGGTTGTCTATTGCACATGAACTGCAAGACTAAGGATTTCAAGCTTATACGCC
CACTATATATGGAAAGCAGTCTAAAATCTGAGACGCAGAAGGAATACCAAATTTTATTAGCATTTTCAGGATTGAAGCAGGCTTTGACAAGTAACCCTGGATATAATCAC
CGCGTTGCAGAATGTCAAGAAGCTGCAAAAATTCTTCTGAATGCGTCTGGCAATTCTCATGTGGAGCCACTCCTTTGTAATGTGGAACAGGAAGCTTATGAAGCTCATAA
GGCCCAGCTAGAAACAAACTTGGCAAAAAGAGCAGAGCATTATTTCTCAGAAAATACGCGGGTTTTACAAGGACTCGAAGCTTGGGCTTCGGGAAGCTTGGAAGACTTTG
GAAAACTCATTGCGGCTTCTGGGCGAAGTTCAATTGTAAACTACGAATGCGGTGCGGAGCCACTTATTCAACTATATGAGATCCTCTTGAGAGCACCTGGAGTATGCGGA
GCGCGGTTCAGTGGTGCTGGCTTTAGAGGTTGTTGTCTCGCTTTTGTAGACGCCAACTATGCTGCTGAAGCTGTAGAATTCGTGCGGAAAGAGTATATGAAGGTGCAGCC
AGAGTTAGCAGCACAGATGAACCCAGAGACAGCCGTGTTGATTTGTGAACAGGGCGACTGTGCTCATATCATTTGA
mRNA sequence
ATGTCCAAGAGAAGCAAGGAGGACGTTCGCATAGTTGTCTCTCCCTATCGCATTTGTCCACTGGGAGCTCATATTGATCATCAGGGTGGGAATGTTTCAGCGATGGCCAT
AAACAAGGGAGTGCTTTTAGGATTTGTTCCTTCTGGCGATGTTCAGGTCATACTCCGTTCAGCTCAGTTCAAAGGAGATGTTAATTTCAGAGTTGATGAAAAGCTGTATC
CAAACCACTGTAATAACAAGAAGGAAGGGACTAATGCAAATGAACATTGTCTTTCTCAGGGTATAATAGGATATATTTGTGGTTCTGATGGACTTGACAGTTCAGGCCTC
AGCTCTTCTGCAGCTGTTGGATTGGCTTACTTGTTAGCGCTGGAAAATGCTAATAATTTAACAATATCTCCCACAGAAAATATTGAATATGACAGGCTAATTGAAAATGG
ATACTTGGGCCTGAAAAATGGCATACTGGACCAATCAGCAATATTGCTTTCAAGCTATGGTTGTCTATTGCACATGAACTGCAAGACTAAGGATTTCAAGCTTATACGCC
CACTATATATGGAAAGCAGTCTAAAATCTGAGACGCAGAAGGAATACCAAATTTTATTAGCATTTTCAGGATTGAAGCAGGCTTTGACAAGTAACCCTGGATATAATCAC
CGCGTTGCAGAATGTCAAGAAGCTGCAAAAATTCTTCTGAATGCGTCTGGCAATTCTCATGTGGAGCCACTCCTTTGTAATGTGGAACAGGAAGCTTATGAAGCTCATAA
GGCCCAGCTAGAAACAAACTTGGCAAAAAGAGCAGAGCATTATTTCTCAGAAAATACGCGGGTTTTACAAGGACTCGAAGCTTGGGCTTCGGGAAGCTTGGAAGACTTTG
GAAAACTCATTGCGGCTTCTGGGCGAAGTTCAATTGTAAACTACGAATGCGGTGCGGAGCCACTTATTCAACTATATGAGATCCTCTTGAGAGCACCTGGAGTATGCGGA
GCGCGGTTCAGTGGTGCTGGCTTTAGAGGTTGTTGTCTCGCTTTTGTAGACGCCAACTATGCTGCTGAAGCTGTAGAATTCGTGCGGAAAGAGTATATGAAGGTGCAGCC
AGAGTTAGCAGCACAGATGAACCCAGAGACAGCCGTGTTGATTTGTGAACAGGGCGACTGTGCTCATATCATTTGA
Protein sequence
MSKRSKEDVRIVVSPYRICPLGAHIDHQGGNVSAMAINKGVLLGFVPSGDVQVILRSAQFKGDVNFRVDEKLYPNHCNNKKEGTNANEHCLSQGIIGYICGSDGLDSSGL
SSSAAVGLAYLLALENANNLTISPTENIEYDRLIENGYLGLKNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLYMESSLKSETQKEYQILLAFSGLKQALTSNPGYNH
RVAECQEAAKILLNASGNSHVEPLLCNVEQEAYEAHKAQLETNLAKRAEHYFSENTRVLQGLEAWASGSLEDFGKLIAASGRSSIVNYECGAEPLIQLYEILLRAPGVCG
ARFSGAGFRGCCLAFVDANYAAEAVEFVRKEYMKVQPELAAQMNPETAVLICEQGDCAHII
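
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal Python sketch and is not part of the CuGenDB record: the helper name translate and the way the codon table is built are illustrative assumptions, and it assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene. Only the first 27 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: this nuclear gene is translated with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Hypothetical helper for illustration; unrecognised codons become 'X'.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 27 and last 12 nucleotides of the CDS sequence in this record.
cds_start = "ATGTCCAAGAGAAGCAAGGAGGACGTT"
cds_end = "CATATCATTTGA"

print(translate(cds_start))  # MSKRSKEDV
print(translate(cds_end))    # HII
```

The two printed fragments match the start (MSKRSKEDV) and end (HII) of the protein sequence. Passing the full CDS to the same function, with its line breaks left in, should reproduce the whole protein, since whitespace is removed before translation.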