; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003371 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003371
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA glycosylase
Genome locationChr08:202248..203583
RNA-Seq ExpressionHG10003371
SyntenyHG10003371
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia]3.1e-13073.48Show/hide
Query:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
        MI L LGV  SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN SSS LLT+QIHS   L P+D+ AILDQV RMLRLTEKDEDE
Subjt:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE

Query:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY
        +R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM  +S+KRKRK     GNFPNA EVCRMGVE LK H LGYRA Y
Subjt:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY

Query:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV
        ++ FAQ V++G INLQ       +P+  PKIKGFGPFATAN+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSLDYNKISGTTLNL
        +YYE+KFGKLSEL S DY+KISG+TL+L
Subjt:  EYYESKFGKLSELCSLDYNKISGTTLNL

XP_021905122.1 uncharacterized protein LOC110820055 isoform X2 [Carica papaya]7.0e-9053.92Show/hide
Query:  LNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH-SSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELR
        L+LG     F+LEKAVCNHG FMM PN W PS KTL+RPLRL SN +SSV+ SI+H S+S  L IQ+H    +S  D+ AIL+QV RMLR+++KDE+ +R
Subjt:  LNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH-SSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELR

Query:  KFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAK----MSNQSRKRKRKVI---------------GNFPNAEEVCRMGV
        +FQ +H  AK  GFGR+FRSP+LFED +KS+LLCN TW RTL MA+ LCELQ +    +S + RK++++                 GNFPNAEE+  +  
Subjt:  KFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAK----MSNQSRKRKRKVI---------------GNFPNAEEVCRMGV

Query:  ELLKKH-NLGYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQC
        +LL++   LGYRA Y+IN AQ V++G ++L N   L KIKGFG F  AN+ MC+GFY+ +P DTET+RHLKQ+HG + C++ T+ +DVK IYDKY+PFQ 
Subjt:  ELLKKH-NLGYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQC

Query:  LAYWLELVEYYESKFGKLSELCSLDYNKISGT
        LAYW EL+ YYESK GKLSEL    Y  ++G+
Subjt:  LAYWLELVEYYESKFGKLSELCSLDYNKISGT

XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia]3.8e-12069.49Show/hide
Query:  KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE
        ++MI LNLG  +S FDLE+AVCNHG FMM PN+WIPSSKTLQRPLRL ++S +SV VSI+  SS LL IQIHSSP  SP D+QAILDQV RMLR+TE+DE
Subjt:  KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE

Query:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VIGNFPNAEEVCRMGVELLK
        + +R FQ+LH KAK++GFGRLFRSPTLFEDA+KSILLCN TW+RTLAMA QLCELQAK+        +KRKRK         GNFP A E+CRM V LL+
Subjt:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VIGNFPNAEEVCRMGVELLK

Query:  KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA
        KH +GYRA YII+ AQRVQNG I+LQ        PKIKGFGPF TAN+ MCLG Y +LPIDTETIRHLKQ+HGRQ CN KT +E VK +YDKYAPFQCLA
Subjt:  KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA

Query:  YWLELVEYYESKFGKLSELCSLDYNKISGTT
        YW+ELVEYYES+FGKLSEL   DY KISGTT
Subjt:  YWLELVEYYESKFGKLSELCSLDYNKISGTT

XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata]2.4e-13073.48Show/hide
Query:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
        MI L LGV  SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN SSS LLT+QIHS   L P+D+ AILDQV RMLRLTEKDEDE
Subjt:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE

Query:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY
        +R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM  +S+KRKRK     GNFPNA EVCRMGVE LK H LGYRA Y
Subjt:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY

Query:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV
        ++ FAQ V++G INLQ       +P+  PKIKGFGPFATAN+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSLDYNKISGTTLNL
        +YYE+KFGKLSEL S DY+KISG+TL+L
Subjt:  EYYESKFGKLSELCSLDYNKISGTTLNL

XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida]8.5e-13686.53Show/hide
Query:  KMIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIH-SSPPLSPQDQQAILDQVVRMLRLTEKDE
        K I LNLGVS SDFDLEKAVCNHGQFMM PNQWIPSSKTLQRPLRL S+S+SSVFVSIN  SS LLTIQIH SS PLSPQDQQAILDQVVRMLRLTEKDE
Subjt:  KMIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIH-SSPPLSPQDQQAILDQVVRMLRLTEKDE

Query:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQ-SRKRKRKV------IGNFPNAEEVCRMGVELLKKHNL
        DELRKFQSLHP+AKQMGFGRLFRSPTLFEDA+KSILLCNTTWKRTLAMA QLCELQAKM  Q +RKRKRK+      IGNFPNAEEVCRMGVELLKKH L
Subjt:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQ-SRKRKRKV------IGNFPNAEEVCRMGVELLKKHNL

Query:  GYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLE
        GYRA YIINFA+ VQ+G I+LQNPN+ PKIKGFGPFATAN+LMCLG YRQLPIDTETIRHLKQ+HGRQFCN KTV+EDVKQIYDKYAPFQCLAYWLE
Subjt:  GYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLE

TrEMBL top hitse value%identityAlignment
A0A2P5ACW8 DNA glycosylase1.6e-8751.23Show/hide
Query:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE
        ++ L LG S S F++EKAVCNHG FMM+PN+W PS+KTLQRPLRL ++  SSV VSI+HS   S LL I++    P   LS  D  AIL+QV RMLR+T+
Subjt:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE

Query:  KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKM---------------SNQSRKRKR-----------KVI
        +DE ++R+FQ +HP+AK+ GFGR+FRSP+LFEDA+KSILLCN +W RTL MAE LC+LQ ++               SN+  KRKR           +++
Subjt:  KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKM---------------SNQSRKRKR-----------KVI

Query:  GNFPNAEEVCRMGVE-LLKKHN--LGYRAGYIINFAQRVQNGTIN---------LQNPNH------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH
        GNFPNA E+  +     L+K+   LGYRA +I++ A+  ++G +N          +  +H      + KI+GFGPF  AN+LMC+  Y  +P D+ETIRH
Subjt:  GNFPNAEEVCRMGVE-LLKKHN--LGYRAGYIINFAQRVQNGTIN---------LQNPNH------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH

Query:  LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTL
        L+Q+HGR+ CNKKT+ ++VK+IYDKYAPFQCLAYW+EL+EYYE KFGKLSEL    Y  ISG+ L
Subjt:  LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTL

A0A2P5FT40 DNA glycosylase1.2e-8751.09Show/hide
Query:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE
        ++ L LG S S F++EKAVCNHG FMM+PN+W PS+KTLQRPLRL ++  SSV VSI+HS   S LL I++    P   LS  D  AIL+QV RMLR+TE
Subjt:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE

Query:  KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQ---------------AKMSNQSRKRKR------------KV
        +DE ++R+FQ +HP+AK+ GFGR+FRSP+LFEDA+KSILLCN +W RTL MAE LC+LQ               +  SN+  KRKR            ++
Subjt:  KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQ---------------AKMSNQSRKRKR------------KV

Query:  IGNFPNAEEVCRM-GVELLKKHN--LGYRAGYIINFAQRVQNGTIN--------LQNPNH-------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIR
        +GNFPNA E+  +     L+K+   LGYRA +I++ A+  ++G +N         +   H       +  I+GFGPF  AN+LMC+  Y  +P D+ETIR
Subjt:  IGNFPNAEEVCRM-GVELLKKHN--LGYRAGYIINFAQRVQNGTIN--------LQNPNH-------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIR

Query:  HLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTL
        HL+Q+H R+ CNKKT+Q++VK+IYDKYAPFQCLAYW+EL+EYYE KFGKLSEL    Y  ISG+ L
Subjt:  HLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTL

A0A6A1W9S6 Uncharacterized protein9.9e-9051.52Show/hide
Query:  FDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSS---FLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHP
        F++EKAVCNHG FMM+PN WIPS+KTLQRPLRL +NS  SV VSI+H +S     + IQ+H +  +SPQD++AIL+QV RMLR++E+DE  LR+FQ+LHP
Subjt:  FDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSS---FLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHP

Query:  KAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSN-------------QSRKR--KRK---------------------------
        +AK+ GFGR FRSP+LFEDAIKS+LLCN TW RTL MA+ LCELQ +++N              SRKR  KRK                           
Subjt:  KAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSN-------------QSRKR--KRK---------------------------

Query:  ---VIGNFPNAEEVCRMGVELLKKH-NLGYRAGYIINFAQRVQNGTINLQ--NPNH----------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH
            +GNFP+++EV  +    L+ H NLGYRA YI+  A++V++G + L+  + +H          L KIKGFGPFA AN++MC+G+Y+ +P+DTET+RH
Subjt:  ---VIGNFPNAEEVCRMGVELLKKH-NLGYRAGYIINFAQRVQNGTINLQ--NPNH----------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH

Query:  LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGT
        L+Q+HGR+   K+TV EDVK +YDK+APFQ LAYW EL+E+YE KFGKLSEL +  Y  +SG+
Subjt:  LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGT

A0A6J1DS88 uncharacterized protein LOC1110238221.9e-12069.49Show/hide
Query:  KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE
        ++MI LNLG  +S FDLE+AVCNHG FMM PN+WIPSSKTLQRPLRL ++S +SV VSI+  SS LL IQIHSSP  SP D+QAILDQV RMLR+TE+DE
Subjt:  KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE

Query:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VIGNFPNAEEVCRMGVELLK
        + +R FQ+LH KAK++GFGRLFRSPTLFEDA+KSILLCN TW+RTLAMA QLCELQAK+        +KRKRK         GNFP A E+CRM V LL+
Subjt:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VIGNFPNAEEVCRMGVELLK

Query:  KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA
        KH +GYRA YII+ AQRVQNG I+LQ        PKIKGFGPF TAN+ MCLG Y +LPIDTETIRHLKQ+HGRQ CN KT +E VK +YDKYAPFQCLA
Subjt:  KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA

Query:  YWLELVEYYESKFGKLSELCSLDYNKISGTT
        YW+ELVEYYES+FGKLSEL   DY KISGTT
Subjt:  YWLELVEYYESKFGKLSELCSLDYNKISGTT

A0A6J1GJ25 uncharacterized protein LOC1114546591.2e-13073.48Show/hide
Query:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
        MI L LGV  SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN SSS LLT+QIHS   L P+D+ AILDQV RMLRLTEKDEDE
Subjt:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE

Query:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY
        +R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM  +S+KRKRK     GNFPNA EVCRMGVE LK H LGYRA Y
Subjt:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY

Query:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV
        ++ FAQ V++G INLQ       +P+  PKIKGFGPFATAN+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSLDYNKISGTTLNL
        +YYE+KFGKLSEL S DY+KISG+TL+L
Subjt:  EYYESKFGKLSELCSLDYNKISGTTLNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGATGATTGTTTTGAATTTGGGAGTGAGTAGTGATTTTGATCTTGAGAAAGCAGTTTGTAACCATGGGCAATTTATGATGTCACCAAACCAATGGATTCCTTC
TTCTAAAACTCTCCAACGTCCACTTCGACTTATTTCTAATTCTAACTCTTCTGTTTTTGTCTCTATCAACCATTCTTCTTCTTTTCTTCTAACCATTCAAATCCACTCTT
CTCCCCCTCTCTCTCCCCAAGATCAACAAGCTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAAGATGAGTTGAGGAAATTTCAAAGTTTGCAT
CCCAAAGCCAAACAGATGGGATTTGGTCGCCTTTTTCGATCTCCCACTCTTTTTGAAGATGCAATCAAGTCCATCCTTCTATGCAATACCACGTGGAAAAGGACACTGGC
AATGGCTGAACAGCTATGTGAGCTCCAAGCCAAAATGAGCAACCAAAGTAGGAAGAGAAAAAGGAAAGTAATTGGGAATTTTCCAAATGCAGAAGAAGTTTGTAGAATGG
GGGTTGAATTGTTGAAGAAGCATAATCTTGGTTACAGAGCTGGTTACATCATTAACTTTGCTCAACGTGTTCAAAATGGCACAATTAATCTCCAAAATCCTAATCATTTA
CCTAAAATCAAAGGCTTTGGACCTTTTGCAACCGCTAATTTACTCATGTGCCTCGGTTTTTACCGCCAACTTCCAATTGATACTGAAACTATAAGGCACTTAAAACAGCT
ACATGGGAGACAATTTTGCAACAAAAAGACAGTACAGGAAGACGTCAAACAAATTTACGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTGGAGT
ATTATGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATAACAAGATCAGTGGCACCACCCTCAACCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGATGATTGTTTTGAATTTGGGAGTGAGTAGTGATTTTGATCTTGAGAAAGCAGTTTGTAACCATGGGCAATTTATGATGTCACCAAACCAATGGATTCCTTC
TTCTAAAACTCTCCAACGTCCACTTCGACTTATTTCTAATTCTAACTCTTCTGTTTTTGTCTCTATCAACCATTCTTCTTCTTTTCTTCTAACCATTCAAATCCACTCTT
CTCCCCCTCTCTCTCCCCAAGATCAACAAGCTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAAGATGAGTTGAGGAAATTTCAAAGTTTGCAT
CCCAAAGCCAAACAGATGGGATTTGGTCGCCTTTTTCGATCTCCCACTCTTTTTGAAGATGCAATCAAGTCCATCCTTCTATGCAATACCACGTGGAAAAGGACACTGGC
AATGGCTGAACAGCTATGTGAGCTCCAAGCCAAAATGAGCAACCAAAGTAGGAAGAGAAAAAGGAAAGTAATTGGGAATTTTCCAAATGCAGAAGAAGTTTGTAGAATGG
GGGTTGAATTGTTGAAGAAGCATAATCTTGGTTACAGAGCTGGTTACATCATTAACTTTGCTCAACGTGTTCAAAATGGCACAATTAATCTCCAAAATCCTAATCATTTA
CCTAAAATCAAAGGCTTTGGACCTTTTGCAACCGCTAATTTACTCATGTGCCTCGGTTTTTACCGCCAACTTCCAATTGATACTGAAACTATAAGGCACTTAAAACAGCT
ACATGGGAGACAATTTTGCAACAAAAAGACAGTACAGGAAGACGTCAAACAAATTTACGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTGGAGT
ATTATGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATAACAAGATCAGTGGCACCACCCTCAACCTTTGA
Protein sequenceShow/hide protein sequence
MKKMIVLNLGVSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLH
PKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRKVIGNFPNAEEVCRMGVELLKKHNLGYRAGYIINFAQRVQNGTINLQNPNHL
PKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTLNL