; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032217 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032217
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGluconokinase
Genome locationchr11:27574939..27585562
RNA-Seq ExpressionLag0032217
SyntenyLag0032217
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0046177 - D-gluconate catabolic process (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0046316 - gluconokinase activity (molecular function)
InterPro domainsIPR006001 - Carbohydrate kinase, thermoresistant glucokinase
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR031322 - Shikimate kinase/gluconokinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055116.1 thermosensitive gluconokinase [Cucumis melo var. makuwa]2.3e-5880Show/hide
Query:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MG--MSSVVKFVLL
        TIG MLG ++D FTFLDAD FHPISNKEKMSKGIPL DEDR PWLEK+RDTLRENI  +  V++GCSAL+K YREILRS+DPNYE MG  M  VVKFVLL
Subjt:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MG--MSSVVKFVLL

Query:  DAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        DAPAEVIA RLEKRAKEG HFMPS LLKSQLDLLQIDDSEGI+KVDAT +PQAI+
Subjt:  DAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

KAE8647942.1 hypothetical protein Csa_000183 [Cucumis sativus]5.4e-6069.15Show/hide
Query:  MARGGQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYRE
        M     ++I + + G G S      TIG MLG ++D FTFLDAD FHPISNKEKMSKGIPL DEDR PWLEK+RDTLRENI C+S V+LGCSAL+K YRE
Subjt:  MARGGQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYRE

Query:  ILRSADPNYE---MGMSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIMFIATGQSG
        ILRS+DPNYE   + M  VVKFVLLDAPAEVIA RLEKRAKEG HFMPS LLKSQLDLLQI+D+EGI+KVDAT +PQAI+      SG
Subjt:  ILRSADPNYE---MGMSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIMFIATGQSG

XP_004143662.1 gluconokinase isoform X1 [Cucumis sativus]1.2e-5971.11Show/hide
Query:  MARGGQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYRE
        M     ++I + + G G S      TIG MLG ++D FTFLDAD FHPISNKEKMSKGIPL DEDR PWLEK+RDTLRENI C+S V+LGCSAL+K YRE
Subjt:  MARGGQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYRE

Query:  ILRSADPNYE---MGMSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        ILRS+DPNYE   + M  VVKFVLLDAPAEVIA RLEKRAKEG HFMPS LLKSQLDLLQI+D+EGI+KVDAT +PQAI+
Subjt:  ILRSADPNYE---MGMSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

XP_008467300.1 PREDICTED: thermosensitive gluconokinase [Cucumis melo]2.3e-5880Show/hide
Query:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MG--MSSVVKFVLL
        TIG MLG ++D FTFLDAD FHPISNKEKMSKGIPL DEDR PWLEK+RDTLRENI  +  V++GCSAL+K YREILRS+DPNYE MG  M  VVKFVLL
Subjt:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MG--MSSVVKFVLL

Query:  DAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        DAPAEVIA RLEKRAKEG HFMPS LLKSQLDLLQIDDSEGI+KVDAT +PQAI+
Subjt:  DAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

XP_038874456.1 gluconokinase [Benincasa hispida]1.7e-6183.12Show/hide
Query:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYEM--GMSSVVKFVLLD
        TIG MLG ++D FTFLDADDFHPISNKEKMSKGIPL DEDR PWLEK+RDTLRENI CR  V+LGCSAL+K YREILRS+D NYE    M  VVKFVLLD
Subjt:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYEM--GMSSVVKFVLLD

Query:  APAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        APAEVIASRLEKRAKEG HFMPSTLLKSQLDLLQIDDSEGI+KVDATL+PQAI+
Subjt:  APAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

TrEMBL top hitse value%identityAlignment
A0A0A0KN86 Gluconokinase5.9e-6071.11Show/hide
Query:  MARGGQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYRE
        M     ++I + + G G S      TIG MLG ++D FTFLDAD FHPISNKEKMSKGIPL DEDR PWLEK+RDTLRENI C+S V+LGCSAL+K YRE
Subjt:  MARGGQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYRE

Query:  ILRSADPNYE---MGMSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        ILRS+DPNYE   + M  VVKFVLLDAPAEVIA RLEKRAKEG HFMPS LLKSQLDLLQI+D+EGI+KVDAT +PQAI+
Subjt:  ILRSADPNYE---MGMSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

A0A1S3CTF0 Gluconokinase1.1e-5880Show/hide
Query:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MG--MSSVVKFVLL
        TIG MLG ++D FTFLDAD FHPISNKEKMSKGIPL DEDR PWLEK+RDTLRENI  +  V++GCSAL+K YREILRS+DPNYE MG  M  VVKFVLL
Subjt:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MG--MSSVVKFVLL

Query:  DAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        DAPAEVIA RLEKRAKEG HFMPS LLKSQLDLLQIDDSEGI+KVDAT +PQAI+
Subjt:  DAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

A0A5A7ULE7 Gluconokinase1.1e-5880Show/hide
Query:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MG--MSSVVKFVLL
        TIG MLG ++D FTFLDAD FHPISNKEKMSKGIPL DEDR PWLEK+RDTLRENI  +  V++GCSAL+K YREILRS+DPNYE MG  M  VVKFVLL
Subjt:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MG--MSSVVKFVLL

Query:  DAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        DAPAEVIA RLEKRAKEG HFMPS LLKSQLDLLQIDDSEGI+KVDAT +PQAI+
Subjt:  DAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

A0A6J1H380 Gluconokinase9.4e-5879.74Show/hide
Query:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MGMSSVVKFVLLDA
        TIGAML  +M   TFLDADDFHP SNKEKMSKGIPL DEDR PWLEK+RDTLRE +G ++ V+LGCSAL+KQYR+ILRSADPNYE +G+  VVKFVLLDA
Subjt:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MGMSSVVKFVLLDA

Query:  PAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        PAEVIA RLEKRAKEG HFMPSTLL SQLDLLQID SEGIL+VDAT SPQAI+
Subjt:  PAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

A0A6J1JYN8 Gluconokinase1.4e-5881.05Show/hide
Query:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MGMSSVVKFVLLDA
        TIGAML  +M   TFLDADDFHP SNKEKMSKGIPL DEDR PWLEK+R TLRE +G ++ V+LGCSAL+KQYREILRSADPNYE +G+S VVKFVLLDA
Subjt:  TIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYE-MGMSSVVKFVLLDA

Query:  PAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        PAEVIA RLEKRAKEG HFMPSTLL SQLDLLQID SEGIL+VDATLSPQAI+
Subjt:  PAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

SwissProt top hitse value%identityAlignment
B0BML1 Probable gluconokinase3.4e-2040.54Show/hide
Query:  GLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSAD-----PNYEM
        G+S S  +  +G+ L   +  + F DADD+HP+ NKEKMS+G PL D+DR+PWL +L + +         V+L CSALK+ YR  L +        NY+ 
Subjt:  GLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSAD-----PNYEM

Query:  G--MSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQ
           +SS   FV L    E+++ RL +R     HFMP TLL SQ+D L+
Subjt:  G--MSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQ

P39208 Thermosensitive gluconokinase6.8e-2144.27Show/hide
Query:  FLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYEMGMSSVVKFVLLDAPAEVIASRLEKRAKE
        F+D DD HP  N +KMS+GIPL DEDR PWLE+L D             + CS+LKKQYR+ILR   P+        V F+ LD   E I +R+++RA  
Subjt:  FLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYEMGMSSVVKFVLLDAPAEVIASRLEKRAKE

Query:  GKHFMPSTLLKSQLDLLQID--DSEGILKVD
          HFMP  LLKSQ + L+    D + I+++D
Subjt:  GKHFMPSTLLKSQLDLLQID--DSEGILKVD

Q5FQ97 Gluconokinase1.8e-1841.01Show/hide
Query:  FLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYEMGMSSVVKFVLLDAPAEVIASRLEKRAKE
        F + D  HP +N EKMS G PL D DR PWL    D LRE +      +L CSALK+ YRE LR  D          ++FV +D     +A RL++R  E
Subjt:  FLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYEMGMSSVVKFVLLDAPAEVIASRLEKRAKE

Query:  GKHFMPSTLLKSQLDLLQI-DDSEGILKVDATLSPQAIM
        G HFMP++LL SQL  L++  D E +++V     P  ++
Subjt:  GKHFMPSTLLKSQLDLLQI-DDSEGILKVDATLSPQAIM

Q5T6J7 Probable gluconokinase6.8e-2141.32Show/hide
Query:  MARGGQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYRE
        MA  G +L+ + + G G S      T+GA+L + +  + F DADD+HP  N+ KM KGIPL D+DR PWL  L D L  ++    RV+L CSALKK YR+
Subjt:  MARGGQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYRE

Query:  ILRSADPNYEMGMSSVVK----------FVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQ
        IL        +      K           V L    EVI+ RL KR  EG HFMP  LL+SQ + L+
Subjt:  ILRSADPNYEMGMSSVVK----------FVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQ

Q9SLE0 Gluconokinase3.4e-4152Show/hide
Query:  GQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRS
        G+++  + + G G S      TIG MLG ++    FLDADDFH +SN++KM +GI L DEDR PWLEK++++LR+ +     V+L CS+L+KQYREILR 
Subjt:  GQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRS

Query:  ADPNYEMG--MSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        +DP+Y+ G   S  V FVLL+  AEVIA+RL+KRA E +HFMP TLL+SQ DLLQ D+ E I K+   LSP+ I+
Subjt:  ADPNYEMG--MSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

Arabidopsis top hitse value%identityAlignment
AT2G16790.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein2.4e-4252Show/hide
Query:  GQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRS
        G+++  + + G G S      TIG MLG ++    FLDADDFH +SN++KM +GI L DEDR PWLEK++++LR+ +     V+L CS+L+KQYREILR 
Subjt:  GQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRS

Query:  ADPNYEMG--MSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        +DP+Y+ G   S  V FVLL+  AEVIA+RL+KRA E +HFMP TLL+SQ DLLQ D+ E I K+   LSP+ I+
Subjt:  ADPNYEMG--MSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM

AT2G16790.2 P-loop containing nucleoside triphosphate hydrolases superfamily protein2.4e-4252Show/hide
Query:  GQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRS
        G+++  + + G G S      TIG MLG ++    FLDADDFH +SN++KM +GI L DEDR PWLEK++++LR+ +     V+L CS+L+KQYREILR 
Subjt:  GQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDRNPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRS

Query:  ADPNYEMG--MSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM
        +DP+Y+ G   S  V FVLL+  AEVIA+RL+KRA E +HFMP TLL+SQ DLLQ D+ E I K+   LSP+ I+
Subjt:  ADPNYEMG--MSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACAAGGAGCTGACAAGGAAAACCCGCCACGAGCCTGGAAATAGGACAAAAAAGTGGACTCCAGAGGCAAAACAGGCTAAAGGCTTTTCTGACTTAAGCATCGAAGG
CGATGTGGCAAGCACCACACCGGTGTGCAGGTTTTCCTTGTCTTGCAGGCCACGTCCTCCCCCTCAAACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGAATTCGGAG
GTGTTTCAGGACGAACCAGACGAAACCGGGACGGTCAGGGACATCAGGGACCAAAGGGAGGTGGCCAAGCTCGGCCTGCGCAAGTAGGCCGAATGGTCGGCCTCGGCCTC
TTCCTGATTGTCCTAGTCAGCTCCTTGTGCGCAATTTTGGACCACCCCGATGTACGAGGAGCTGACGAGGACAATCGGACAGGAAGATGGACCAAGGAGGCAAAACCGGC
AAGTGAGACGGGCCAAGACCGAAGGGGTCGGGTTTTAGGCCCGAACCCCTGCTCGGCCTCGGCCATGTGCCGAGGCCGACCCTCGGCTCGCTCGCGCGGGCCGAGCCCGT
TCGGTCTCGTCTGGTCCCCACTGCCTCTGGATGCCCCGGTTTCGCCCGAGGGGATCCCGAATTCTATCCCTAAACGCTATTTTACATTCTCCACTCTCTTTCCTCTTGCT
CTTACTTTTCCACTCCCTACCGTTCTGCTTGCTGACTTAAGCATCGGAGCCAGTGTGGCGAGCACCACACCGGTGCGCAGGTTTACTGTCTTGCAGGCCACGTCTTCCCC
TTCATCTACAAATTTACCGTTGATGGCACGTGGAGGTCAGATTCTGATCAATCTCCACCTTCAGGGATATGGACTCTCAGCATCCGAATTCGAACGTACTATTGGTGCGA
TGCTGGGCAACTCCATGGACTTTTTCACTTTTCTTGATGCTGATGATTTTCACCCAATTTCTAACAAGGAAAAAATGTCGAAAGGAATCCCTCTTTTAGACGAAGATCGG
AACCCCTGGCTCGAGAAGCTTCGAGACACCTTGAGAGAGAACATAGGTTGTAGAAGCAGAGTAATTCTTGGCTGCTCAGCTCTAAAAAAACAGTACAGAGAGATTCTGAG
ATCAGCAGATCCAAATTATGAAATGGGAATGAGCAGTGTGGTGAAGTTTGTTCTGTTGGATGCTCCAGCTGAAGTGATTGCTTCCAGATTGGAGAAAAGAGCCAAAGAAG
GGAAGCATTTCATGCCTTCAACTCTTTTGAAATCCCAATTGGATCTGTTGCAGATTGATGATTCTGAAGGCATTTTAAAAGTTGATGCTACTCTTAGCCCTCAAGCAATT
ATGTTTATTGCAACAGGACAATCTGGGGGAAGTGCATCAAAAGATCCCCACTCTCACCTCAAATCATTCCTCAACATCTGCAACAAGTTCATCATTCTAGAAGTTACCTC
GGAACAACTTCAAGTCATATTATTTCTATACTCGCTACGAGATGCTGCTAAGATATGGCTGATAGCGCGCATAGCGAGCTATTTTTCCTGGTGTTCGGGACCTAAAGAAG
AGCAAAGGAAGAGAAAAGAATCAAGTTGGGAGCTTAATAAAAGAGGCAAGCCCGAGAGGCAGCGTCGCGACGCTATCGACAAATTTCCTATAAATACATTTTATTCTTGC
ACGGCCAAGGGGAGTCCATTTGACCCAAGCCCAATAAAAGCCCAAGCCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCCATCCAGCGAGAACTCTATAAATAGAGGGGT
TCTCCAATGGGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTTCAAAGCTCTCAAGCAGAACTAGAGAATTCAGAGAGACTCCACCAAGTCTGAAGA
CCAAAGACTCTCTACAATCCACAAGTCCAAGTGTTGAATCCTTGAAGATCAAAGACTCTTCAGGACATCAACACTTCTTGAAGACTGAAGGCTCCTTCAAGACTAGAAGA
CTTCAAGCTCCAAGAATCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTACAAGGAGCTGACAAGGAAAACCCGCCACGAGCCTGGAAATAGGACAAAAAAGTGGACTCCAGAGGCAAAACAGGCTAAAGGCTTTTCTGACTTAAGCATCGAAGG
CGATGTGGCAAGCACCACACCGGTGTGCAGGTTTTCCTTGTCTTGCAGGCCACGTCCTCCCCCTCAAACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGAATTCGGAG
GTGTTTCAGGACGAACCAGACGAAACCGGGACGGTCAGGGACATCAGGGACCAAAGGGAGGTGGCCAAGCTCGGCCTGCGCAAGTAGGCCGAATGGTCGGCCTCGGCCTC
TTCCTGATTGTCCTAGTCAGCTCCTTGTGCGCAATTTTGGACCACCCCGATGTACGAGGAGCTGACGAGGACAATCGGACAGGAAGATGGACCAAGGAGGCAAAACCGGC
AAGTGAGACGGGCCAAGACCGAAGGGGTCGGGTTTTAGGCCCGAACCCCTGCTCGGCCTCGGCCATGTGCCGAGGCCGACCCTCGGCTCGCTCGCGCGGGCCGAGCCCGT
TCGGTCTCGTCTGGTCCCCACTGCCTCTGGATGCCCCGGTTTCGCCCGAGGGGATCCCGAATTCTATCCCTAAACGCTATTTTACATTCTCCACTCTCTTTCCTCTTGCT
CTTACTTTTCCACTCCCTACCGTTCTGCTTGCTGACTTAAGCATCGGAGCCAGTGTGGCGAGCACCACACCGGTGCGCAGGTTTACTGTCTTGCAGGCCACGTCTTCCCC
TTCATCTACAAATTTACCGTTGATGGCACGTGGAGGTCAGATTCTGATCAATCTCCACCTTCAGGGATATGGACTCTCAGCATCCGAATTCGAACGTACTATTGGTGCGA
TGCTGGGCAACTCCATGGACTTTTTCACTTTTCTTGATGCTGATGATTTTCACCCAATTTCTAACAAGGAAAAAATGTCGAAAGGAATCCCTCTTTTAGACGAAGATCGG
AACCCCTGGCTCGAGAAGCTTCGAGACACCTTGAGAGAGAACATAGGTTGTAGAAGCAGAGTAATTCTTGGCTGCTCAGCTCTAAAAAAACAGTACAGAGAGATTCTGAG
ATCAGCAGATCCAAATTATGAAATGGGAATGAGCAGTGTGGTGAAGTTTGTTCTGTTGGATGCTCCAGCTGAAGTGATTGCTTCCAGATTGGAGAAAAGAGCCAAAGAAG
GGAAGCATTTCATGCCTTCAACTCTTTTGAAATCCCAATTGGATCTGTTGCAGATTGATGATTCTGAAGGCATTTTAAAAGTTGATGCTACTCTTAGCCCTCAAGCAATT
ATGTTTATTGCAACAGGACAATCTGGGGGAAGTGCATCAAAAGATCCCCACTCTCACCTCAAATCATTCCTCAACATCTGCAACAAGTTCATCATTCTAGAAGTTACCTC
GGAACAACTTCAAGTCATATTATTTCTATACTCGCTACGAGATGCTGCTAAGATATGGCTGATAGCGCGCATAGCGAGCTATTTTTCCTGGTGTTCGGGACCTAAAGAAG
AGCAAAGGAAGAGAAAAGAATCAAGTTGGGAGCTTAATAAAAGAGGCAAGCCCGAGAGGCAGCGTCGCGACGCTATCGACAAATTTCCTATAAATACATTTTATTCTTGC
ACGGCCAAGGGGAGTCCATTTGACCCAAGCCCAATAAAAGCCCAAGCCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCCATCCAGCGAGAACTCTATAAATAGAGGGGT
TCTCCAATGGGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTTCAAAGCTCTCAAGCAGAACTAGAGAATTCAGAGAGACTCCACCAAGTCTGAAGA
CCAAAGACTCTCTACAATCCACAAGTCCAAGTGTTGAATCCTTGAAGATCAAAGACTCTTCAGGACATCAACACTTCTTGAAGACTGAAGGCTCCTTCAAGACTAGAAGA
CTTCAAGCTCCAAGAATCCATTGA
Protein sequenceShow/hide protein sequence
MYKELTRKTRHEPGNRTKKWTPEAKQAKGFSDLSIEGDVASTTPVCRFSLSCRPRPPPQTNLPLVAREGQEFGGVSGRTRRNRDGQGHQGPKGGGQARPAQVGRMVGLGL
FLIVLVSSLCAILDHPDVRGADEDNRTGRWTKEAKPASETGQDRRGRVLGPNPCSASAMCRGRPSARSRGPSPFGLVWSPLPLDAPVSPEGIPNSIPKRYFTFSTLFPLA
LTFPLPTVLLADLSIGASVASTTPVRRFTVLQATSSPSSTNLPLMARGGQILINLHLQGYGLSASEFERTIGAMLGNSMDFFTFLDADDFHPISNKEKMSKGIPLLDEDR
NPWLEKLRDTLRENIGCRSRVILGCSALKKQYREILRSADPNYEMGMSSVVKFVLLDAPAEVIASRLEKRAKEGKHFMPSTLLKSQLDLLQIDDSEGILKVDATLSPQAI
MFIATGQSGGSASKDPHSHLKSFLNICNKFIILEVTSEQLQVILFLYSLRDAAKIWLIARIASYFSWCSGPKEEQRKRKESSWELNKRGKPERQRRDAIDKFPINTFYSC
TAKGSPFDPSPIKAQAQVVRPKSHQGPSSENSINRGVLQWGSEILHSHKDKSSEFSKLSSRTREFRETPPSLKTKDSLQSTSPSVESLKIKDSSGHQHFLKTEGSFKTRR
LQAPRIH