; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg035294 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg035294
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein kinase family protein
Genome locationscaffold7:22118519..22130587
RNA-Seq ExpressionSpg035294
SyntenySpg035294
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0004672 - protein kinase activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR001245 - Serine-threonine/tyrosine-protein kinase, catalytic domain
IPR011009 - Protein kinase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAG7890839.1 unnamed protein product [Brassica rapa]7.3e-2345.51Show/hide
Query:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRFVPNDH
        ++C EY  E L     + +Q  + +C  V  G+ +   +A+   SKQEYS++V+ SFRQEVSLMK+L+HPN+LLFMG V+SPQ LCIV+EFLPRF     
Subjt:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRFVPNDH

Query:  SWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTTEEHRQIRLETPCSYGFEHYT
          ++Y ++A +S     LI +      QWK +  TTEEH  I LE   SYG  H+T
Subjt:  SWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTTEEHRQIRLETPCSYGFEHYT

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.6e-2836.17Show/hide
Query:  GVQRRRRRKQKADRIKVIRTDTPSPSTTESEKENSEKGQEIKGSEDQYIEMLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYAN
        G++R  R+  KA + +    +T   +  ++   N+EKG  +  SE                    LP F+   I  H W+QFCA PE     +VREFYAN
Subjt:  GVQRRRRRKQKADRIKVIRTDTPSPSTTESEKENSEKGQEIKGSEDQYIEMLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYAN

Query:  IDDQEEFQVIVRGVAVDWSPGAINSLFNLQD--FPHAGFNGMV-----------VAPSNDQLN-------TAVREV---------GFIKLRLLPTTHDST
        + D  E  V VRGV V WS  AIN++F L D    H+ F   +           VA +  + N       T +R            F+K  LLPTTH  T
Subjt:  IDDQEEFQVIVRGVAVDWSPGAINSLFNLQD--FPHAGFNGMV-----------VAPSNDQLN-------TAVREV---------GFIKLRLLPTTHDST

Query:  VSQDRVLLVFSILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKGIIDTPNLARLQRTQE
        VS+DR+LL+ S+L   SI+VG++I SEI  C  +K G LFFP+ IT LC  AR P   ++  L + G ID   +AR+  TQE
Subjt:  VSQDRVLLVFSILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.0e-2936.46Show/hide
Query:  GVQRRRRRKQKADRIKVIRTDTPSPSTTESEKENSEKGQEIKGSEDQYIEMLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYAN
        G++R  R+  KA + +     T   +  ++   N+EKG  +  SE                    LP F+   I  H W+QFCA PE     +VREFYAN
Subjt:  GVQRRRRRKQKADRIKVIRTDTPSPSTTESEKENSEKGQEIKGSEDQYIEMLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYAN

Query:  IDDQEEFQVIVRGVAVDWSPGAINSLFNLQD--FPHAGFNGMV-----------VAPSNDQLN-------TAVREV---------GFIKLRLLPTTHDST
        + D EE  V VRGV V WS  AIN++F L D    H+ F   +           VA +  + N       T +R            F+K RLLPTTH  T
Subjt:  IDDQEEFQVIVRGVAVDWSPGAINSLFNLQD--FPHAGFNGMV-----------VAPSNDQLN-------TAVREV---------GFIKLRLLPTTHDST

Query:  VSQDRVLLVFSILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKGIIDTPNLARL
        VS+DR+LL+ S+L   SI+VG++I SEI  C  +K G LFFP+ IT LC  AR P   ++  L + G ID   +AR+
Subjt:  VSQDRVLLVFSILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKGIIDTPNLARL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]2.8e-2232.77Show/hide
Query:  EIKGSEDQYIE-------MLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYANIDDQEEFQVIVRGVAVDWSPGAINSLFNLQD-
        E K +E +Y E        ++++F+++     + P F+   I  H W+ FCA PE     +VREFY N+ + ++  V +RGV V  S  AIN++F+L D 
Subjt:  EIKGSEDQYIE-------MLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYANIDDQEEFQVIVRGVAVDWSPGAINSLFNLQD-

Query:  -FPHAGFNGMVVAPSNDQLNTAVREVG---------------------------FIKLRLLPTTHDSTVSQDRVLLVFSILRSLSIDVGKIISSEIYDCW
           H+ F   +  P    +   V  VG                           F+K RLLPTTH  TVS++ V L++S+L   SI+VG++I  EI  C 
Subjt:  -FPHAGFNGMVVAPSNDQLNTAVREVG---------------------------FIKLRLLPTTHDSTVSQDRVLLVFSILRSLSIDVGKIISSEIYDCW

Query:  RKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKG
         +K G LFFP+ IT +C   R P   ++  L + G
Subjt:  RKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKG

VDC77924.1 unnamed protein product [Brassica rapa]7.3e-2345.51Show/hide
Query:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRFVPNDH
        ++C EY  E L     + +Q  + +C  V  G+ +   +A+   SKQEYS++V+ SFRQEVSLMK+L+HPN+LLFMG V+SPQ LCIV+EFLPRF     
Subjt:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRFVPNDH

Query:  SWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTTEEHRQIRLETPCSYGFEHYT
          ++Y ++A +S     LI +      QWK +  TTEEH  I LE   SYG  H+T
Subjt:  SWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTTEEHRQIRLETPCSYGFEHYT

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.3e-2836.17Show/hide
Query:  GVQRRRRRKQKADRIKVIRTDTPSPSTTESEKENSEKGQEIKGSEDQYIEMLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYAN
        G++R  R+  KA + +    +T   +  ++   N+EKG  +  SE                    LP F+   I  H W+QFCA PE     +VREFYAN
Subjt:  GVQRRRRRKQKADRIKVIRTDTPSPSTTESEKENSEKGQEIKGSEDQYIEMLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYAN

Query:  IDDQEEFQVIVRGVAVDWSPGAINSLFNLQD--FPHAGFNGMV-----------VAPSNDQLN-------TAVREV---------GFIKLRLLPTTHDST
        + D  E  V VRGV V WS  AIN++F L D    H+ F   +           VA +  + N       T +R            F+K  LLPTTH  T
Subjt:  IDDQEEFQVIVRGVAVDWSPGAINSLFNLQD--FPHAGFNGMV-----------VAPSNDQLN-------TAVREV---------GFIKLRLLPTTHDST

Query:  VSQDRVLLVFSILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKGIIDTPNLARLQRTQE
        VS+DR+LL+ S+L   SI+VG++I SEI  C  +K G LFFP+ IT LC  AR P   ++  L + G ID   +AR+  TQE
Subjt:  VSQDRVLLVFSILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.9e-2936.46Show/hide
Query:  GVQRRRRRKQKADRIKVIRTDTPSPSTTESEKENSEKGQEIKGSEDQYIEMLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYAN
        G++R  R+  KA + +     T   +  ++   N+EKG  +  SE                    LP F+   I  H W+QFCA PE     +VREFYAN
Subjt:  GVQRRRRRKQKADRIKVIRTDTPSPSTTESEKENSEKGQEIKGSEDQYIEMLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYAN

Query:  IDDQEEFQVIVRGVAVDWSPGAINSLFNLQD--FPHAGFNGMV-----------VAPSNDQLN-------TAVREV---------GFIKLRLLPTTHDST
        + D EE  V VRGV V WS  AIN++F L D    H+ F   +           VA +  + N       T +R            F+K RLLPTTH  T
Subjt:  IDDQEEFQVIVRGVAVDWSPGAINSLFNLQD--FPHAGFNGMV-----------VAPSNDQLN-------TAVREV---------GFIKLRLLPTTHDST

Query:  VSQDRVLLVFSILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKGIIDTPNLARL
        VS+DR+LL+ S+L   SI+VG++I SEI  C  +K G LFFP+ IT LC  AR P   ++  L + G ID   +AR+
Subjt:  VSQDRVLLVFSILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKGIIDTPNLARL

A0A2P5DAQ2 Uncharacterized protein1.3e-2232.77Show/hide
Query:  EIKGSEDQYIE-------MLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYANIDDQEEFQVIVRGVAVDWSPGAINSLFNLQD-
        E K +E +Y E        ++++F+++     + P F+   I  H W+ FCA PE     +VREFY N+ + ++  V +RGV V  S  AIN++F+L D 
Subjt:  EIKGSEDQYIE-------MLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYANIDDQEEFQVIVRGVAVDWSPGAINSLFNLQD-

Query:  -FPHAGFNGMVVAPSNDQLNTAVREVG---------------------------FIKLRLLPTTHDSTVSQDRVLLVFSILRSLSIDVGKIISSEIYDCW
           H+ F   +  P    +   V  VG                           F+K RLLPTTH  TVS++ V L++S+L   SI+VG++I  EI  C 
Subjt:  -FPHAGFNGMVVAPSNDQLNTAVREVG---------------------------FIKLRLLPTTHDSTVSQDRVLLVFSILRSLSIDVGKIISSEIYDCW

Query:  RKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKG
         +K G LFFP+ IT +C   R P   ++  L + G
Subjt:  RKKVGKLFFPNTITMLCSRARVPMSADDVTLMDKG

A0A3P5ZQZ1 Uncharacterized protein3.5e-2345.51Show/hide
Query:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRFVPNDH
        ++C EY  E L     + +Q  + +C  V  G+ +   +A+   SKQEYS++V+ SFRQEVSLMK+L+HPN+LLFMG V+SPQ LCIV+EFLPRF     
Subjt:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRFVPNDH

Query:  SWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTTEEHRQIRLETPCSYGFEHYT
          ++Y ++A +S     LI +      QWK +  TTEEH  I LE   SYG  H+T
Subjt:  SWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTTEEHRQIRLETPCSYGFEHYT

M4FGQ4 Uncharacterized protein4.6e-2346.15Show/hide
Query:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRFVPNDH
        ++C EY  E L     + +Q  + +C  V  G+ +   +A+   SKQEYS++V+ SFRQ VSLMK+LRHPN+LLFMG VTSPQRLCIV+EFLPRF     
Subjt:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRFVPNDH

Query:  SWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTTEEHRQIRLETPCSYGFEHYT
         +++Y ++A  S     LI +      QWK +S TTE H    LE   SYG  H+T
Subjt:  SWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTTEEHRQIRLETPCSYGFEHYT

SwissProt top hitse value%identityAlignment
Q05609 Serine/threonine-protein kinase CTR12.6e-0748.21Show/hide
Query:  LAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR
        +A+    +Q++  + +  F +EV++MK+LRHPNI+LFMG VT P  L IVTE+L R
Subjt:  LAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR

Q54IP4 Dual specificity protein kinase shkB3.8e-0644.64Show/hide
Query:  LAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR
        +AI   ++  + ++ +  F++EVSLM KLR+P++LLFMG  T+P+ L IVTE +P+
Subjt:  LAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR

Q55GU0 Probable serine/threonine-protein kinase DDB_G02675147.6e-0725.22Show/hide
Query:  QEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLP----------RFVPNDHSWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTT
        ++ ++ V+  FR+E++++ +LRHPNI+L M   T+P  LC +TE+LP          + +  +   +  L      G+    + +SG   +  K ++   
Subjt:  QEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLP----------RFVPNDHSWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTT

Query:  EEHRQIRLETPCSYG
        +EH  +++   C +G
Subjt:  EEHRQIRLETPCSYG

Q9C9U5 Probable serine/threonine-protein kinase SIS83.7e-0962.5Show/hide
Query:  QEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR
        Q+ + + +  FR EV +MKKLRHPNI+LFMG VT P  L IVTEFLPR
Subjt:  QEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR

Q9FPR3 Serine/threonine-protein kinase EDR14.0e-0854.17Show/hide
Query:  QEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR
        Q++S   +  FR EV +M++LRHPN++ F+G VT P  L IVTEFLPR
Subjt:  QEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR

Arabidopsis top hitse value%identityAlignment
AT1G67890.1 PAS domain-containing protein tyrosine kinase family protein1.2e-1853.19Show/hide
Query:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR
        ++C +Y  E L    ++ +Q  + +C  V  G+ +   +A+   SKQEYS+++I SF+QEVSLMK+LRHPN+LLFMG V SPQRLCIVTEFLPR
Subjt:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR

AT5G49470.1 PAS domain-containing protein tyrosine kinase family protein1.4e-1955.32Show/hide
Query:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR
        ++C +Y  E L    ++ +Q  + +C  V  G+ +   +A+   SKQEYS+++I SFRQEVSLMK+LRHPN+LLFMG VTSPQRLCIVTEFLPR
Subjt:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR

AT5G49470.2 PAS domain-containing protein tyrosine kinase family protein2.8e-2055.79Show/hide
Query:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRF
        ++C +Y  E L    ++ +Q  + +C  V  G+ +   +A+   SKQEYS+++I SFRQEVSLMK+LRHPN+LLFMG VTSPQRLCIVTEFLPRF
Subjt:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPRF

AT5G49470.3 PAS domain-containing protein tyrosine kinase family protein1.4e-1955.32Show/hide
Query:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR
        ++C +Y  E L    ++ +Q  + +C  V  G+ +   +A+   SKQEYS+++I SFRQEVSLMK+LRHPN+LLFMG VTSPQRLCIVTEFLPR
Subjt:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR

AT5G49470.4 PAS domain-containing protein tyrosine kinase family protein1.4e-1955.32Show/hide
Query:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR
        ++C +Y  E L    ++ +Q  + +C  V  G+ +   +A+   SKQEYS+++I SFRQEVSLMK+LRHPN+LLFMG VTSPQRLCIVTEFLPR
Subjt:  AECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLY--YLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCIVTEFLPR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGGGAGGAAGGAGAGAGATGCTGAGGAAGAAGAAGTGCCTAATACACCTGAGGCACCGAAGACAAAGACAAAGAAGTGGAAGACGCCGGAAGAAAG
AGAGGTTGAGAGAAGAAGACGACAACAGAGGGCTGAAGTCGAAGAAGTTAGTGAACCGATAGTTGAAAATCTGCAAGAAGTGCAGGAGAAACGTGTGGAAGACATACAAG
AAAAAGGTAATGGACAGGAGGTTCAAGAGCAGGAAGTTCAGGCTGATATCATTGTACCGGGAGTTCAGCGTCGCCGCCGTCGGAAGCAGAAGGCTGACCGCATCAAGGTA
ATCAGAACAGACACTCCTTCGCCGTCGACGACTGAGTCTGAAAAAGAAAACTCGGAAAAAGGACAAGAGATAAAGGGGTCAGAGGACCAGTATATAGAGATGCTAAAAAG
GGACTTTCTGTTTGAACGGGGATTTGGTGATGATCTGCCGCATTTCTTGAGGGCTGGAATAGCAAATCACGGATGGAGGCAATTCTGCGCAAAGCCAGAGCCAGTAAATT
CGAATATTGTCCGCGAATTTTATGCGAACATAGATGATCAAGAAGAATTCCAAGTCATAGTTCGAGGGGTAGCCGTTGATTGGAGCCCAGGAGCAATAAATTCTCTCTTC
AACCTCCAAGATTTTCCGCATGCTGGGTTTAATGGAATGGTGGTGGCACCATCTAACGATCAATTAAACACGGCTGTCAGGGAGGTTGGCTTCATCAAGTTGCGCTTGCT
TCCGACAACCCATGATTCAACGGTGTCTCAAGACCGGGTGCTTCTGGTATTTTCTATTCTTCGTTCTTTGAGTATAGATGTTGGGAAAATTATTTCGAGTGAAATTTATG
ATTGCTGGCGAAAGAAGGTAGGAAAACTGTTCTTCCCGAACACAATTACCATGTTGTGTAGCAGGGCTAGGGTTCCCATGAGCGCAGACGATGTCACTCTAATGGATAAG
GGAATAATAGACACGCCGAACTTGGCAAGGCTTCAGAGGACTCAAGAAGCGCGCCAAGGCGTAGTGCTTGGTTTTGCAGAATGCTCAGAATATATTGCGGAGCGACTTGA
GGGAGCAAAATCTGTGATCCAGCAAAGCAGGGAGCAAAACTGCCATTTTGTTTGTGTGGGAGTTTTGTATTACTTAGCCATCCTTGCACAATCTAAACAAGAGTATTCAG
ATGATGTGATTGTCTCCTTCAGACAGGAGGTATCCCTAATGAAAAAGCTTAGGCACCCCAATATTCTTCTCTTTATGGGAGTTGTGACTTCGCCTCAGCGTCTCTGCATT
GTCACAGAATTCCTTCCACGCTTTGTTCCTAACGACCATTCTTGGTTTAATTATTTGATTTATGCTTCCAAAAGTGGCATTCTTCATAGGCTGATTGAAGTGTCTGGTAC
CTATAAAAAACAGTGGAAGTTTGTTTCGTTTACTACAGAGGAACACAGGCAAATTAGATTGGAAACGCCGTGTTCATATGGCTTTGAACATTATACCCAATCTGCTTTAG
CTTTCTCTTCTTTGCGTATTGAGATATCATGTTCTTGCAATTTGGAACAGGGTCTTGCAGGTGCTACTTCAAGATATCTGGGTTTTAGAAGCATGATGACTTTGCTCGAC
CCAATACTCCCACAAGTCGTTCTTGGCCCGGAGAATAGTAGGGAAGACACTAGAGGTGGTCCACACACCGTTCGTGATCAAAGGAATCAAAGGGAGTTGTTGAATCGGTC
TCCAAAAGATCCATGCTTGGCCTCGAGGCGCCCTGGGAGCGTCCCCCTACGGATGGTGTTTGCATGGGTTAATATCAAGGTGAATGGGGAAAGTGTTCATAGTAAGTGGG
AGAAGGAAACATGTCAAACGTATCCTGCGGTCTCCATCATTAGGTTGCACCGTGAGATTCCTATGCGCTGCCTGCGAGTCGCCCTGGGAGCGATCACCCTACGGAGGGCT
TGCGCATGGGATTCGGAACAACGCAAACTCCAGAAATGGATAGGGTTTCCTAGGGCCATTCCCAACATTAGCTCTTCCCTACGGTGGCATTGTTGGGGCCGTCCTCTGCG
ATCTGAAAATGATGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGGGAGGAAGGAGAGAGATGCTGAGGAAGAAGAAGTGCCTAATACACCTGAGGCACCGAAGACAAAGACAAAGAAGTGGAAGACGCCGGAAGAAAG
AGAGGTTGAGAGAAGAAGACGACAACAGAGGGCTGAAGTCGAAGAAGTTAGTGAACCGATAGTTGAAAATCTGCAAGAAGTGCAGGAGAAACGTGTGGAAGACATACAAG
AAAAAGGTAATGGACAGGAGGTTCAAGAGCAGGAAGTTCAGGCTGATATCATTGTACCGGGAGTTCAGCGTCGCCGCCGTCGGAAGCAGAAGGCTGACCGCATCAAGGTA
ATCAGAACAGACACTCCTTCGCCGTCGACGACTGAGTCTGAAAAAGAAAACTCGGAAAAAGGACAAGAGATAAAGGGGTCAGAGGACCAGTATATAGAGATGCTAAAAAG
GGACTTTCTGTTTGAACGGGGATTTGGTGATGATCTGCCGCATTTCTTGAGGGCTGGAATAGCAAATCACGGATGGAGGCAATTCTGCGCAAAGCCAGAGCCAGTAAATT
CGAATATTGTCCGCGAATTTTATGCGAACATAGATGATCAAGAAGAATTCCAAGTCATAGTTCGAGGGGTAGCCGTTGATTGGAGCCCAGGAGCAATAAATTCTCTCTTC
AACCTCCAAGATTTTCCGCATGCTGGGTTTAATGGAATGGTGGTGGCACCATCTAACGATCAATTAAACACGGCTGTCAGGGAGGTTGGCTTCATCAAGTTGCGCTTGCT
TCCGACAACCCATGATTCAACGGTGTCTCAAGACCGGGTGCTTCTGGTATTTTCTATTCTTCGTTCTTTGAGTATAGATGTTGGGAAAATTATTTCGAGTGAAATTTATG
ATTGCTGGCGAAAGAAGGTAGGAAAACTGTTCTTCCCGAACACAATTACCATGTTGTGTAGCAGGGCTAGGGTTCCCATGAGCGCAGACGATGTCACTCTAATGGATAAG
GGAATAATAGACACGCCGAACTTGGCAAGGCTTCAGAGGACTCAAGAAGCGCGCCAAGGCGTAGTGCTTGGTTTTGCAGAATGCTCAGAATATATTGCGGAGCGACTTGA
GGGAGCAAAATCTGTGATCCAGCAAAGCAGGGAGCAAAACTGCCATTTTGTTTGTGTGGGAGTTTTGTATTACTTAGCCATCCTTGCACAATCTAAACAAGAGTATTCAG
ATGATGTGATTGTCTCCTTCAGACAGGAGGTATCCCTAATGAAAAAGCTTAGGCACCCCAATATTCTTCTCTTTATGGGAGTTGTGACTTCGCCTCAGCGTCTCTGCATT
GTCACAGAATTCCTTCCACGCTTTGTTCCTAACGACCATTCTTGGTTTAATTATTTGATTTATGCTTCCAAAAGTGGCATTCTTCATAGGCTGATTGAAGTGTCTGGTAC
CTATAAAAAACAGTGGAAGTTTGTTTCGTTTACTACAGAGGAACACAGGCAAATTAGATTGGAAACGCCGTGTTCATATGGCTTTGAACATTATACCCAATCTGCTTTAG
CTTTCTCTTCTTTGCGTATTGAGATATCATGTTCTTGCAATTTGGAACAGGGTCTTGCAGGTGCTACTTCAAGATATCTGGGTTTTAGAAGCATGATGACTTTGCTCGAC
CCAATACTCCCACAAGTCGTTCTTGGCCCGGAGAATAGTAGGGAAGACACTAGAGGTGGTCCACACACCGTTCGTGATCAAAGGAATCAAAGGGAGTTGTTGAATCGGTC
TCCAAAAGATCCATGCTTGGCCTCGAGGCGCCCTGGGAGCGTCCCCCTACGGATGGTGTTTGCATGGGTTAATATCAAGGTGAATGGGGAAAGTGTTCATAGTAAGTGGG
AGAAGGAAACATGTCAAACGTATCCTGCGGTCTCCATCATTAGGTTGCACCGTGAGATTCCTATGCGCTGCCTGCGAGTCGCCCTGGGAGCGATCACCCTACGGAGGGCT
TGCGCATGGGATTCGGAACAACGCAAACTCCAGAAATGGATAGGGTTTCCTAGGGCCATTCCCAACATTAGCTCTTCCCTACGGTGGCATTGTTGGGGCCGTCCTCTGCG
ATCTGAAAATGATGGGTAA
Protein sequenceShow/hide protein sequence
MAKTRGRKERDAEEEEVPNTPEAPKTKTKKWKTPEEREVERRRRQQRAEVEEVSEPIVENLQEVQEKRVEDIQEKGNGQEVQEQEVQADIIVPGVQRRRRRKQKADRIKV
IRTDTPSPSTTESEKENSEKGQEIKGSEDQYIEMLKRDFLFERGFGDDLPHFLRAGIANHGWRQFCAKPEPVNSNIVREFYANIDDQEEFQVIVRGVAVDWSPGAINSLF
NLQDFPHAGFNGMVVAPSNDQLNTAVREVGFIKLRLLPTTHDSTVSQDRVLLVFSILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCSRARVPMSADDVTLMDK
GIIDTPNLARLQRTQEARQGVVLGFAECSEYIAERLEGAKSVIQQSREQNCHFVCVGVLYYLAILAQSKQEYSDDVIVSFRQEVSLMKKLRHPNILLFMGVVTSPQRLCI
VTEFLPRFVPNDHSWFNYLIYASKSGILHRLIEVSGTYKKQWKFVSFTTEEHRQIRLETPCSYGFEHYTQSALAFSSLRIEISCSCNLEQGLAGATSRYLGFRSMMTLLD
PILPQVVLGPENSREDTRGGPHTVRDQRNQRELLNRSPKDPCLASRRPGSVPLRMVFAWVNIKVNGESVHSKWEKETCQTYPAVSIIRLHREIPMRCLRVALGAITLRRA
CAWDSEQRKLQKWIGFPRAIPNISSSLRWHCWGRPLRSENDG