; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G003080 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G003080
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGlycine hydroxymethyltransferase
Genome locationCG_Chr05:2950557..2958563
RNA-Seq ExpressionClCG05G003080
SyntenyClCG05G003080
Gene Ontology termsGO:0019722 - calcium-mediated signaling (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044281 - small molecule metabolic process (biological process)
GO:1901564 - organonitrogen compound metabolic process (biological process)
GO:1901576 - organic substance biosynthetic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR008801 - Rapid ALkalinization Factor


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CUG90010.1 unnamed protein product [Bodo saltans]6.9e-2933.45Show/hide
Query:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDI
        G RH +  V  G ++V E  LG+ DD          V+L LD  A+ +G  L+P  ++  ++V NVAD+ +V H   V+  DDV+V   G E+V +   +
Subjt:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDI

Query:  IECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWE----------------VIDIDGGEQQSSVSLHLVQ
        +  ++L A H  L+G + V+ GD +  +   E    +L +I++A ++  LA NH++S +   + +                V+D+D G+ Q++    LVQ
Subjt:  IECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWE----------------VIDIDGGEQQSSVSLHLVQ

Query:  PFHTSGCLLRDTDQSLLHL-------VRSITTIVDDQIRTTVGA-PIQGPLSAPPVLLKSLALPGKHGGTISSNGSGSVVLSRENVAGTPTNLSAE
          H S  LL DT+  L HL       V  +TT+++D +   V A  + G   AP VLL  L LPG+HG     +  GSVVL  E+VA  P  LSAE
Subjt:  PFHTSGCLLRDTDQSLLHL-------VRSITTIVDDQIRTTVGA-PIQGPLSAPPVLLKSLALPGKHGGTISSNGSGSVVLSRENVAGTPTNLSAE

KAE8722565.1 Serine hydroxymethyltransferase 2 [Hibiscus syriacus]4.6e-3346.02Show/hide
Query:  DDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIECQNLEAFHQCLKGTNRVNFGDD
        +DA     E PP+ LRLD DAL +   L+  H++FV++VTNVA+N IVLHLP V++HDDVLV   GD+++ L  D+ E QNL++FH+ LK T+ +N  D+
Subjt:  DDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIECQNLEAFHQCLKGTNRVNFGDD

Query:  HTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWEVIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSL
         TS+ LF+G S ALANI++ T+NSNL+ NH+I SSHQTI        G++ ++       PF     +   T +SL
Subjt:  HTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWEVIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSL

KUI61723.1 hypothetical protein VP1G_11265 [Valsa mali var. pyri]5.2e-3738.87Show/hide
Query:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLD-RDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYD
        GA H++ GVA GT+QV +  L + DD   ++ E   V L LD  DAL  G  LEP +V+  +KV NVADN +V HL  V  ++DV  A  GD+++     
Subjt:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLD-RDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYD

Query:  IIECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWE----------------VIDIDGGEQQSSVSLHLV
        ++   +L A H  L+G + V+  ++ T +   +G    L +I+  +N+S+LASNH++  +  T+ E                V+D+DGG++Q +V  H V
Subjt:  IIECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWE----------------VIDIDGGEQQSSVSLHLV

Query:  QPFHTSGCLLRDTDQSLLHL-------VRSITTIVDDQIRTTVGAPIQGP---LSAPPVLLKSLALPGKHGGTISSNGSGSVVLSRENVAGTPTNLSAES
        Q  +T G LL DT   L H+       V  +TT+V+D++     A ++G    L AP VLL  LALPGK G T S +GS  VVL  E+VAG P NLS ES
Subjt:  QPFHTSGCLLRDTDQSLLHL-------VRSITTIVDDQIRTTVGAPIQGP---LSAPPVLLKSLALPGKHGGTISSNGSGSVVLSRENVAGTPTNLSAES

Query:  G
        G
Subjt:  G

RHN42549.1 hypothetical protein MtrunA17_Chr8g0378081 [Medicago truncatula]1.1e-3935.95Show/hide
Query:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDI
        GA HHK  V+S T+QVH+  LG+ +DA V + E+PPV LRLD DALH+   L+  H+NF++KVTNVA+N IV H PHVI+HDDVLV   G+++V +R +I
Subjt:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDI

Query:  IECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWEVIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSL
        I                                                               ++IDI+  EQ+SS SLHL++P +TS  L R+T++S+
Subjt:  IECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWEVIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSL

Query:  LHLV------------------------------------------------RSITTIVDDQIRTTVGAPIQGPLSAPPVLLKSLALPGKHGGTISSNGS
        LHL                                                  SITTI+++QIRTT   P++   S PPVLLKSL LPG++GG ++S+G 
Subjt:  LHLV------------------------------------------------RSITTIVDDQIRTTVGAPIQGPLSAPPVLLKSLALPGKHGGTISSNGS

Query:  GSVVLS
        G VVLS
Subjt:  GSVVLS

VEU33756.1 unnamed protein product [Pseudo-nitzschia multistriata]8.7e-3232.73Show/hide
Query:  KIGECLTEPEMESEISRRVLMMQKKYISYD-TLRRDMVPCSRPGARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHV
        K+G+ L E  +   ++    ++   ++S    L +D+V C   GARH +  V   TSQV +  LG+ D+  V + E   V+L LD DAL  G   E  HV
Subjt:  KIGECLTEPEMESEISRRVLMMQKKYISYD-TLRRDMVPCSRPGARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHV

Query:  NFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNIS
        NFVI+VTNV+D+ IVLHL HV+ H+D LV   G+E++     ++E  +   FH  LKG + VNF D   +S    G ST+L +IS++ ++S L+  H+I 
Subjt:  NFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNIS

Query:  SSHQTIWE----------------VIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSLLHLV-------------------------------------
         +H TI +                V+D+DG EQ+  V  H VQ   T G L  D+  S  +LV                                     
Subjt:  SSHQTIWE----------------VIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSLLHLV-------------------------------------

Query:  ------------RSITTIVDDQIRTT----VGAPIQGPLSAPPVLLKSLALPGK-HGGTISSNGSGSVVLSRENVAGTPTNLSAE
                      +TT++D+ IR+     V  P +G  SA PV L+ L+LPGK  GG I+ NG   V+L  E+VA  P+++ ++
Subjt:  ------------RSITTIVDDQIRTT----VGAPIQGPLSAPPVLLKSLALPGK-HGGTISSNGSGSVVLSRENVAGTPTNLSAE

TrEMBL top hitse value%identityAlignment
A0A194VCT6 Uncharacterized protein2.5e-3738.87Show/hide
Query:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLD-RDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYD
        GA H++ GVA GT+QV +  L + DD   ++ E   V L LD  DAL  G  LEP +V+  +KV NVADN +V HL  V  ++DV  A  GD+++     
Subjt:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLD-RDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYD

Query:  IIECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWE----------------VIDIDGGEQQSSVSLHLV
        ++   +L A H  L+G + V+  ++ T +   +G    L +I+  +N+S+LASNH++  +  T+ E                V+D+DGG++Q +V  H V
Subjt:  IIECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWE----------------VIDIDGGEQQSSVSLHLV

Query:  QPFHTSGCLLRDTDQSLLHL-------VRSITTIVDDQIRTTVGAPIQGP---LSAPPVLLKSLALPGKHGGTISSNGSGSVVLSRENVAGTPTNLSAES
        Q  +T G LL DT   L H+       V  +TT+V+D++     A ++G    L AP VLL  LALPGK G T S +GS  VVL  E+VAG P NLS ES
Subjt:  QPFHTSGCLLRDTDQSLLHL-------VRSITTIVDDQIRTTVGAPIQGP---LSAPPVLLKSLALPGKHGGTISSNGSGSVVLSRENVAGTPTNLSAES

Query:  G
        G
Subjt:  G

A0A396GN29 Uncharacterized protein5.5e-4035.95Show/hide
Query:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDI
        GA HHK  V+S T+QVH+  LG+ +DA V + E+PPV LRLD DALH+   L+  H+NF++KVTNVA+N IV H PHVI+HDDVLV   G+++V +R +I
Subjt:  GARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDI

Query:  IECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWEVIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSL
        I                                                               ++IDI+  EQ+SS SLHL++P +TS  L R+T++S+
Subjt:  IECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWEVIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSL

Query:  LHLV------------------------------------------------RSITTIVDDQIRTTVGAPIQGPLSAPPVLLKSLALPGKHGGTISSNGS
        LHL                                                  SITTI+++QIRTT   P++   S PPVLLKSL LPG++GG ++S+G 
Subjt:  LHLV------------------------------------------------RSITTIVDDQIRTTVGAPIQGPLSAPPVLLKSLALPGKHGGTISSNGS

Query:  GSVVLS
        G VVLS
Subjt:  GSVVLS

A0A448YVC4 Uncharacterized protein4.2e-3232.73Show/hide
Query:  KIGECLTEPEMESEISRRVLMMQKKYISYD-TLRRDMVPCSRPGARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHV
        K+G+ L E  +   ++    ++   ++S    L +D+V C   GARH +  V   TSQV +  LG+ D+  V + E   V+L LD DAL  G   E  HV
Subjt:  KIGECLTEPEMESEISRRVLMMQKKYISYD-TLRRDMVPCSRPGARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHV

Query:  NFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNIS
        NFVI+VTNV+D+ IVLHL HV+ H+D LV   G+E++     ++E  +   FH  LKG + VNF D   +S    G ST+L +IS++ ++S L+  H+I 
Subjt:  NFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNIS

Query:  SSHQTIWE----------------VIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSLLHLV-------------------------------------
         +H TI +                V+D+DG EQ+  V  H VQ   T G L  D+  S  +LV                                     
Subjt:  SSHQTIWE----------------VIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSLLHLV-------------------------------------

Query:  ------------RSITTIVDDQIRTT----VGAPIQGPLSAPPVLLKSLALPGK-HGGTISSNGSGSVVLSRENVAGTPTNLSAE
                      +TT++D+ IR+     V  P +G  SA PV L+ L+LPGK  GG I+ NG   V+L  E+VA  P+++ ++
Subjt:  ------------RSITTIVDDQIRTT----VGAPIQGPLSAPPVLLKSLALPGK-HGGTISSNGSGSVVLSRENVAGTPTNLSAE

A0A6A3C0F8 Glycine hydroxymethyltransferase2.2e-3346.02Show/hide
Query:  DDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIECQNLEAFHQCLKGTNRVNFGDD
        +DA     E PP+ LRLD DAL +   L+  H++FV++VTNVA+N IVLHLP V++HDDVLV   GD+++ L  D+ E QNL++FH+ LK T+ +N  D+
Subjt:  DDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIECQNLEAFHQCLKGTNRVNFGDD

Query:  HTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWEVIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSL
         TS+ LF+G S ALANI++ T+NSNL+ NH+I SSHQTI        G++ ++       PF     +   T +SL
Subjt:  HTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWEVIDIDGGEQQSSVSLHLVQPFHTSGCLLRDTDQSL

A0A7S0CAL3 Hypothetical protein (Fragment)2.4e-3537.97Show/hide
Query:  RHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIE
        RH +  VASG  +V EP LG+ DD    + E  PV LRLD +    G  LEP  VN  ++VT+VAD+ IVLH  HV   DD L A  GDE+   R   + 
Subjt:  RHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIKVTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIE

Query:  CQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWE----------------VIDIDGGEQQSSVSLHLVQPF
          +  A H  L+  +RV+  DD  S+   E    ALA++++A +++NLA NH++      + E                V+D+DG E + ++  HLVQ  
Subjt:  CQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWE----------------VIDIDGGEQQSSVSLHLVQPF

Query:  HTSGCLLRDTDQS-------LLHLVRSITTIVDDQIRTTVGAPIQGPLSAPPVLLKSLALPGKHGGTISSNGSGSVVLSRENVAGTPTNLSAESG
        H SG LLRD   +       L+  V  +TT+V+D +         G L AP V L  L LPG H  T   NGSG  VL  E+VA  P + SAE G
Subjt:  HTSGCLLRDTDQS-------LLHLVRSITTIVDDQIRTTVGAPIQGPLSAPPVLLKSLALPGKHGGTISSNGSGSVVLSRENVAGTPTNLSAESG

SwissProt top hitse value%identityAlignment
Q2HIM9 Protein RALF-like 318.2e-0953.85Show/hide
Query:  MGVCNQKIGECLTEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH--KAGVASGTSQ
        M + N  IGE   E  M +EISRRVLM QK+YI Y+TLRRDMVPC +PGA ++  ++G A+  S+
Subjt:  MGVCNQKIGECLTEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH--KAGVASGTSQ

Q945T0 Rapid alkalinization factor1.4e-0549.09Show/hide
Query:  GVCNQKIGECLTEP---EMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH
        G C   IGEC+ E    E++SE +RR+L   KKYISY  L+++ VPCSR GA ++
Subjt:  GVCNQKIGECLTEP---EMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH

Q9LK37 Protein RALF-like 249.7e-1065.31Show/hide
Query:  NQKIGECLTEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH
        N  IGE   E  M SEISRRV+MM+K+YISY+TLRRDMVPC +PGA ++
Subjt:  NQKIGECLTEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH

Q9MA62 Protein RALF-like 222.9e-0652.83Show/hide
Query:  CNQKIGECLTEP---EMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH
        C   I EC+ E    E +S+ISRR+L  QKKYISY  +RR+ VPCSR GA ++
Subjt:  CNQKIGECLTEP---EMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH

Q9SRY3 Protein RALF-like 11.1e-0551.92Show/hide
Query:  CNQKIGECL--TEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH
        C+  I EC+   E EM+SEI+RR+L    KYISY +L+R+ VPCSR GA ++
Subjt:  CNQKIGECL--TEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH

Arabidopsis top hitse value%identityAlignment
AT1G02900.1 rapid alkalinization factor 17.9e-0751.92Show/hide
Query:  CNQKIGECL--TEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH
        C+  I EC+   E EM+SEI+RR+L    KYISY +L+R+ VPCSR GA ++
Subjt:  CNQKIGECL--TEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH

AT3G05490.1 ralf-like 222.1e-0752.83Show/hide
Query:  CNQKIGECLTEP---EMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH
        C   I EC+ E    E +S+ISRR+L  QKKYISY  +RR+ VPCSR GA ++
Subjt:  CNQKIGECLTEP---EMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH

AT3G23805.1 ralf-like 246.9e-1165.31Show/hide
Query:  NQKIGECLTEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH
        N  IGE   E  M SEISRRV+MM+K+YISY+TLRRDMVPC +PGA ++
Subjt:  NQKIGECLTEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH

AT4G13950.1 ralf-like 315.8e-1053.85Show/hide
Query:  MGVCNQKIGECLTEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH--KAGVASGTSQ
        M + N  IGE   E  M +EISRRVLM QK+YI Y+TLRRDMVPC +PGA ++  ++G A+  S+
Subjt:  MGVCNQKIGECLTEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH--KAGVASGTSQ

AT4G15800.1 ralf-like 333.0e-0652.73Show/hide
Query:  CNQKIGECL-----TEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH
        CN  I EC       E EM+SEI+RR+L    KYISY  LRR+ VPCSR GA ++
Subjt:  CNQKIGECL-----TEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGTTTGCAATCAGAAGATCGGTGAGTGTCTGACTGAGCCAGAAATGGAATCAGAAATCAGTAGGAGAGTTCTGATGATGCAGAAGAAGTATATAAGCTACGATAC
ACTTAGAAGGGACATGGTTCCTTGTTCAAGACCAGGAGCAAGACATCACAAAGCTGGGGTGGCCAGTGGCACATCCCAAGTTCATGAGCCGCCCCTCGGCAAGCACGATG
ATGCCCGAGTTAGTATCAGGGAACACCCACCTGTCAGTTTGAGGCTTGATCGTGATGCGCTTCACTCCGGGGAAAGTCTCGAGCCCGAGCATGTCAATTTCGTTATCAAA
GTGACCAATGTTGCAGACAATTGCATTGTTCTTCATCTTCCTCATGTCATTCACCATGATGATGTCCTTGTTGCCTGTAGTGGTGACGAAAATGTCTGCCTCAGATACGA
TATCATTGAGTGTCAAAACCTGGAAGCCTTCCATCAATGCTTGAAGGGCACAAATCGGGTCAATTTCGGTGACGATCACACGAGCTCCACCTTGTTTGAGGGCAAAAGCA
CAGCCCTTGCCAACATCTCCATAGCCACAAACAACAGCAACCTTGCCAGCAATCATAACATCAGTAGCTCTCATCAAACCATCTGGGAAGTCATTGACATTGATGGCGGG
GAACAGCAAAGTTCCGTTAGCCTGCATCTGGTACAACCTTTTCACACCAGTGGTTGTCTCCTCCGAGACACCGACCAATCTCTCCTTCATCTTGTGCGTAGCATCACCAC
CATCGTCGACGATCAGATCCGGACCACCGTCGGGGCCCCAATCCAAGGACCGCTCAGTGCACCACCAGTACTCCTGAAGAGTCTCGCCCTTCCAGGCAAACACGGCGGCA
CTATCTCGAGCAATGGCAGCGGCAGCGTGGTCTTGAGTCGAGAAAATGTTGCAGGAACACCAACGAACCTCAGCGCCGAGAGCGGCTGTGATGGTCCATATTCGGTTCTA
CAAGCCATCAATCCAGGCATTTCAACCTCAGCGAGATCGATTTCAAGGCGACCGAAATCGGCTTGCGCCATGTCCTTGACCTTGTATTCACGGCCACTGCTGGTTTTGTC
TACCAATAGAGACATTGTGCTGATCAAAATTGAATCGAAGAATAAACAAACAAAAAAGAAATCGAATGAATTCAAACGAAATGAAAATGGAACTTTTGTTTTGATTTTGA
GATTTTCCCTTTTTTCCACTGAGATTGGAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGTTTGCAATCAGAAGATCGGTGAGTGTCTGACTGAGCCAGAAATGGAATCAGAAATCAGTAGGAGAGTTCTGATGATGCAGAAGAAGTATATAAGCTACGATAC
ACTTAGAAGGGACATGGTTCCTTGTTCAAGACCAGGAGCAAGACATCACAAAGCTGGGGTGGCCAGTGGCACATCCCAAGTTCATGAGCCGCCCCTCGGCAAGCACGATG
ATGCCCGAGTTAGTATCAGGGAACACCCACCTGTCAGTTTGAGGCTTGATCGTGATGCGCTTCACTCCGGGGAAAGTCTCGAGCCCGAGCATGTCAATTTCGTTATCAAA
GTGACCAATGTTGCAGACAATTGCATTGTTCTTCATCTTCCTCATGTCATTCACCATGATGATGTCCTTGTTGCCTGTAGTGGTGACGAAAATGTCTGCCTCAGATACGA
TATCATTGAGTGTCAAAACCTGGAAGCCTTCCATCAATGCTTGAAGGGCACAAATCGGGTCAATTTCGGTGACGATCACACGAGCTCCACCTTGTTTGAGGGCAAAAGCA
CAGCCCTTGCCAACATCTCCATAGCCACAAACAACAGCAACCTTGCCAGCAATCATAACATCAGTAGCTCTCATCAAACCATCTGGGAAGTCATTGACATTGATGGCGGG
GAACAGCAAAGTTCCGTTAGCCTGCATCTGGTACAACCTTTTCACACCAGTGGTTGTCTCCTCCGAGACACCGACCAATCTCTCCTTCATCTTGTGCGTAGCATCACCAC
CATCGTCGACGATCAGATCCGGACCACCGTCGGGGCCCCAATCCAAGGACCGCTCAGTGCACCACCAGTACTCCTGAAGAGTCTCGCCCTTCCAGGCAAACACGGCGGCA
CTATCTCGAGCAATGGCAGCGGCAGCGTGGTCTTGAGTCGAGAAAATGTTGCAGGAACACCAACGAACCTCAGCGCCGAGAGCGGCTGTGATGGTCCATATTCGGTTCTA
CAAGCCATCAATCCAGGCATTTCAACCTCAGCGAGATCGATTTCAAGGCGACCGAAATCGGCTTGCGCCATGTCCTTGACCTTGTATTCACGGCCACTGCTGGTTTTGTC
TACCAATAGAGACATTGTGCTGATCAAAATTGAATCGAAGAATAAACAAACAAAAAAGAAATCGAATGAATTCAAACGAAATGAAAATGGAACTTTTGTTTTGATTTTGA
GATTTTCCCTTTTTTCCACTGAGATTGGAACTTGA
Protein sequenceShow/hide protein sequence
MGVCNQKIGECLTEPEMESEISRRVLMMQKKYISYDTLRRDMVPCSRPGARHHKAGVASGTSQVHEPPLGKHDDARVSIREHPPVSLRLDRDALHSGESLEPEHVNFVIK
VTNVADNCIVLHLPHVIHHDDVLVACSGDENVCLRYDIIECQNLEAFHQCLKGTNRVNFGDDHTSSTLFEGKSTALANISIATNNSNLASNHNISSSHQTIWEVIDIDGG
EQQSSVSLHLVQPFHTSGCLLRDTDQSLLHLVRSITTIVDDQIRTTVGAPIQGPLSAPPVLLKSLALPGKHGGTISSNGSGSVVLSRENVAGTPTNLSAESGCDGPYSVL
QAINPGISTSARSISRRPKSACAMSLTLYSRPLLVLSTNRDIVLIKIESKNKQTKKKSNEFKRNENGTFVLILRFSLFSTEIGT