; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh12G006840 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh12G006840
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionENDO3c domain-containing protein
Genome locationCmo_Chr12:4884010..4885212
RNA-Seq ExpressionCmoCh12G006840
SyntenyCmoCh12G006840
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia]3.3e-18399.69Show/hide
Query:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI
        MIELKLGV VSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI
Subjt:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI

Query:  RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVV
        RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVV
Subjt:  RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVV

Query:  KFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQY
        KFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQY
Subjt:  KFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQY

Query:  YETKFGKLSELSSFDYHKISGSTLHL
        YETKFGKLSELSSFDYHKISGSTLHL
Subjt:  YETKFGKLSELSSFDYHKISGSTLHL

XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia]6.9e-11764.78Show/hide
Query:  KMIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDE
        +MI+L LG   S F+LE+AVCNHG FMM PN+WIPSSKTLQRPLRL++S TS+LVSI+Q SS LL +QIHS  S  P D  AILDQV RMLR+TE+DE+ 
Subjt:  KMIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDE

Query:  IRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKM-----RESKKRKRKGNN----ERGNFPNAREVCRMGVEALKNH
        IR FQNLH  AK+IGFGR+FRSP+LFED VKSIL+CN +WRRTL MA +LCE+QAK+      + KKRKRKG      E GNFP A E+CRM V  L+ H
Subjt:  IRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKM-----RESKKRKRKGNN----ERGNFPNAREVCRMGVEALKNH

Query:  CLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQC
         +GYRA Y++  AQ V++G+I+LQ +E+ +S    FPKIKGFGPF TAN+FMCLG Y +LPIDTETIRHLKQVHG Q C  KT  E VK +YD YAP+QC
Subjt:  CLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQC

Query:  LAYWLELVQYYETKFGKLSELSSFDYHKISGSTLH
        LAYW+ELV+YYE++FGKLSEL   DY KISG+T H
Subjt:  LAYWLELVQYYETKFGKLSELSSFDYHKISGSTLH

XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata]3.9e-184100Show/hide
Query:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI
        MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI
Subjt:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI

Query:  RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVV
        RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVV
Subjt:  RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVV

Query:  KFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQY
        KFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQY
Subjt:  KFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQY

Query:  YETKFGKLSELSSFDYHKISGSTLHL
        YETKFGKLSELSSFDYHKISGSTLHL
Subjt:  YETKFGKLSELSSFDYHKISGSTLHL

XP_023521143.1 uncharacterized protein LOC111784765 [Cucurbita pepo subsp. pepo]6.5e-9998.35Show/hide
Query:  MAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCL
        MAEKLC +QAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANY+VKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCL
Subjt:  MAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCL

Query:  GFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGSTLHL
        GFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGSTLHL
Subjt:  GFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGSTLHL

XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida]1.6e-11872.37Show/hide
Query:  MKMIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRS-LPPKDEVAILDQVARMLRLTEKDE
        MK I L LGV VSDF+LEKAVCNHG FMM PNQWIPSSKTLQRPLRLS+S +S+ VSINQ SSSLLT+QIHS  + L P+D+ AILDQV RMLRLTEKDE
Subjt:  MKMIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRS-LPPKDEVAILDQVARMLRLTEKDE

Query:  DEIRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRE--SKKRKRK---GNNERGNFPNAREVCRMGVEALKNHCL
        DE+R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MA +LCE+QAKMR   ++KRKRK      E GNFPNA EVCRMGVE LK HCL
Subjt:  DEIRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRE--SKKRKRK---GNNERGNFPNAREVCRMGVEALKNHCL

Query:  GYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLA
        GYRA Y++ FA+ V+SG+I+LQ       +P+ FPKIKGFGPFATAN+ MCLG Y QLPIDTETIRHLKQVHG Q+C  KTV EDVKQIYD YAP+QCLA
Subjt:  GYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLA

Query:  YWLE
        YWLE
Subjt:  YWLE

TrEMBL top hitse value%identityAlignment
A0A2P5ACW8 DNA glycosylase2.0e-8549.45Show/hide
Query:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQS--SSSLLTLQI--HSP-RSLPPKDEVAILDQVARMLRLTEK
        ++ L LG   S FN+EKAVCNHG FMMAPN+W PS+KTLQRPLRL++  +S+ VSI+ S   S LL +++   SP ++L   D  AIL+QV RMLR+T++
Subjt:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQS--SSSLLTLQI--HSP-RSLPPKDEVAILDQVARMLRLTEK

Query:  DEDEIRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRES-------------------KKRKRKGNNE-----RG
        DE ++R FQ +HP AK+ GFGR+FRSPSLFED VKSIL+CN SW RTL+MAE LC++Q ++ E+                   K+ K K  ++      G
Subjt:  DEDEIRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRES-------------------KKRKRKGNNE-----RG

Query:  NFPNAREVCRMGVEALKNH---CLGYRANYVVKFAQSVESGRIN-LQSLEKPVSSP-------DAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHL
        NFPNARE+  +            LGYRA +++  A+  ESG++N L+  EK                KI+GFGPF  AN+ MC+  Y  +P D+ETIRHL
Subjt:  NFPNAREVCRMGVEALKNH---CLGYRANYVVKFAQSVESGRIN-LQSLEKPVSSP-------DAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHL

Query:  KQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGSTL
        +QVHG + C KKT+ ++VK+IYD YAP+QCLAYW+EL++YYE KFGKLSEL    Y  ISGS L
Subjt:  KQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGSTL

A0A438CJ05 Uncharacterized protein2.8e-8750Show/hide
Query:  IELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQ-SSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI
        + + LG G S FNLE AVCNHG FMMAPN WIPS+KTLQRPLRL++  TS+L SI+   + + + +++H    + P D+  IL  VARMLR++++DE ++
Subjt:  IELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQ-SSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI

Query:  RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKR-----KRKGNNER------GNFPNAREVCRMGVEALKN
        ++F  + P AK   FGRIFRSPS+FED+VKSIL+CN  WRRTL+MA+ LCE+Q +++  K++     + K  N        GNFPN+ E+  +  E LK 
Subjt:  RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKR-----KRKGNNER------GNFPNAREVCRMGVEALKN

Query:  HC-LGYRANYVVKFAQSVESGRINLQSLEKPVSSP------DAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIY
         C LGYRA  +++ A S+E+G + LQ+ EK + +       D   K KGFGPFA ANI MC+G+Y ++P D+ET RH+K++HG +   KK   +DVK+IY
Subjt:  HC-LGYRANYVVKFAQSVESGRINLQSLEKPVSSP------DAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIY

Query:  DTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGS
        D YAP+QCLAYWLEL +YY+++FGKLSEL   +YH I+GS
Subjt:  DTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGS

A0A6A1W9S6 Uncharacterized protein2.4e-9149.73Show/hide
Query:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSS---SLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDE
        +++L+L   V  FN+EKAVCNHG FMMAPN WIPS+KTLQRPLRL+NS  S+LVSI+  +S   + + +Q+H    + P+DE AIL+QVARMLR++E+DE
Subjt:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSS---SLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDE

Query:  DEIRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEV-----------------------------QAKMRESKKRK-----
          +R FQNLHP AK+ GFGR FRSPSLFED +KS+L+CN +W RTL+MA+ LCE+                             QA  ++SK +K     
Subjt:  DEIRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEV-----------------------------QAKMRESKKRK-----

Query:  -------RKGNNER--GNFPNAREVCRMGVEALKNHC-LGYRANYVVKFAQSVESGRINLQSLEKPVSSP-----DAFPKIKGFGPFATANIFMCLGFYH
                KG + R  GNFP+++EV  +    L+NHC LGYRA Y+VK A+ VESG++ L+  +   S+      +   KIKGFGPFA AN+ MC+G+Y 
Subjt:  -------RKGNNER--GNFPNAREVCRMGVEALKNHC-LGYRANYVVKFAQSVESGRINLQSLEKPVSSP-----DAFPKIKGFGPFATANIFMCLGFYH

Query:  QLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGS
         +P+DTET+RHL+QVHG +   K+TV EDVK +YD +AP+Q LAYW EL+++YE KFGKLSEL +  Y  +SGS
Subjt:  QLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGS

A0A6J1DS88 uncharacterized protein LOC1110238223.4e-11764.78Show/hide
Query:  KMIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDE
        +MI+L LG   S F+LE+AVCNHG FMM PN+WIPSSKTLQRPLRL++S TS+LVSI+Q SS LL +QIHS  S  P D  AILDQV RMLR+TE+DE+ 
Subjt:  KMIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDE

Query:  IRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKM-----RESKKRKRKGNN----ERGNFPNAREVCRMGVEALKNH
        IR FQNLH  AK+IGFGR+FRSP+LFED VKSIL+CN +WRRTL MA +LCE+QAK+      + KKRKRKG      E GNFP A E+CRM V  L+ H
Subjt:  IRRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKM-----RESKKRKRKGNN----ERGNFPNAREVCRMGVEALKNH

Query:  CLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQC
         +GYRA Y++  AQ V++G+I+LQ +E+ +S    FPKIKGFGPF TAN+FMCLG Y +LPIDTETIRHLKQVHG Q C  KT  E VK +YD YAP+QC
Subjt:  CLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQC

Query:  LAYWLELVQYYETKFGKLSELSSFDYHKISGSTLH
        LAYW+ELV+YYE++FGKLSEL   DY KISG+T H
Subjt:  LAYWLELVQYYETKFGKLSELSSFDYHKISGSTLH

A0A6J1GJ25 uncharacterized protein LOC1114546591.9e-184100Show/hide
Query:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI
        MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI
Subjt:  MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEI

Query:  RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVV
        RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVV
Subjt:  RRFQNLHPTAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVV

Query:  KFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQY
        KFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQY
Subjt:  KFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQY

Query:  YETKFGKLSELSSFDYHKISGSTLHL
        YETKFGKLSELSSFDYHKISGSTLHL
Subjt:  YETKFGKLSELSSFDYHKISGSTLHL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGATTGAATTGAAATTAGGAGTTGGAGTGAGTGATTTCAACCTTGAGAAAGCTGTGTGTAATCATGGTGCGTTTATGATGGCACCAAACCAATGGATTCCTTC
TTCCAAGACACTCCAACGTCCACTTCGTCTCTCCAATTCAGACACTTCCCTTTTGGTCTCTATCAACCAATCTTCTTCTTCTCTACTTACCCTTCAAATCCACTCTCCTC
GCTCCCTCCCTCCTAAAGATGAAGTGGCTATATTGGATCAAGTGGCTCGGATGTTGCGACTTACAGAGAAAGATGAAGATGAGATTAGAAGATTTCAAAATCTGCACCCC
ACAGCCAAACAGATTGGATTTGGTCGGATTTTTCGATCTCCATCTCTCTTCGAAGATGTGGTCAAGTCCATCCTTATGTGCAATACCTCGTGGAGAAGGACGCTGGAAAT
GGCGGAGAAGCTATGTGAGGTACAAGCCAAAATGAGGGAAAGTAAGAAGAGGAAAAGGAAAGGGAATAATGAAAGAGGCAATTTTCCAAATGCAAGGGAGGTTTGTAGGA
TGGGAGTTGAAGCGTTGAAGAATCATTGCCTTGGTTATAGAGCTAATTACGTGGTTAAATTTGCTCAAAGTGTTGAGAGTGGGAGAATTAACCTCCAATCATTAGAAAAA
CCAGTGTCCTCTCCGGATGCATTCCCTAAAATCAAAGGGTTTGGTCCTTTTGCGACAGCCAATATATTCATGTGCCTTGGATTTTACCACCAACTCCCAATTGATACTGA
AACCATAAGGCACTTAAAACAAGTGCATGGAATCCAATATTGTACCAAGAAGACAGTTGGGGAAGATGTGAAGCAAATTTACGACACCTATGCTCCTTATCAATGCTTGG
CTTATTGGTTGGAGCTTGTCCAGTACTATGAGACCAAATTCGGAAAGCTCAGTGAATTGTCTTCCTTTGACTATCACAAGATTAGTGGCTCCACTCTCCACCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATGATTGAATTGAAATTAGGAGTTGGAGTGAGTGATTTCAACCTTGAGAAAGCTGTGTGTAATCATGGTGCGTTTATGATGGCACCAAACCAATGGATTCCTTC
TTCCAAGACACTCCAACGTCCACTTCGTCTCTCCAATTCAGACACTTCCCTTTTGGTCTCTATCAACCAATCTTCTTCTTCTCTACTTACCCTTCAAATCCACTCTCCTC
GCTCCCTCCCTCCTAAAGATGAAGTGGCTATATTGGATCAAGTGGCTCGGATGTTGCGACTTACAGAGAAAGATGAAGATGAGATTAGAAGATTTCAAAATCTGCACCCC
ACAGCCAAACAGATTGGATTTGGTCGGATTTTTCGATCTCCATCTCTCTTCGAAGATGTGGTCAAGTCCATCCTTATGTGCAATACCTCGTGGAGAAGGACGCTGGAAAT
GGCGGAGAAGCTATGTGAGGTACAAGCCAAAATGAGGGAAAGTAAGAAGAGGAAAAGGAAAGGGAATAATGAAAGAGGCAATTTTCCAAATGCAAGGGAGGTTTGTAGGA
TGGGAGTTGAAGCGTTGAAGAATCATTGCCTTGGTTATAGAGCTAATTACGTGGTTAAATTTGCTCAAAGTGTTGAGAGTGGGAGAATTAACCTCCAATCATTAGAAAAA
CCAGTGTCCTCTCCGGATGCATTCCCTAAAATCAAAGGGTTTGGTCCTTTTGCGACAGCCAATATATTCATGTGCCTTGGATTTTACCACCAACTCCCAATTGATACTGA
AACCATAAGGCACTTAAAACAAGTGCATGGAATCCAATATTGTACCAAGAAGACAGTTGGGGAAGATGTGAAGCAAATTTACGACACCTATGCTCCTTATCAATGCTTGG
CTTATTGGTTGGAGCTTGTCCAGTACTATGAGACCAAATTCGGAAAGCTCAGTGAATTGTCTTCCTTTGACTATCACAAGATTAGTGGCTCCACTCTCCACCTTTGA
Protein sequenceShow/hide protein sequence
MKMIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEIRRFQNLHP
TAKQIGFGRIFRSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKMRESKKRKRKGNNERGNFPNAREVCRMGVEALKNHCLGYRANYVVKFAQSVESGRINLQSLEK
PVSSPDAFPKIKGFGPFATANIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGKLSELSSFDYHKISGSTLHL