; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G000040 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G000040
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDNA glycosylase
Genome locationchr08:112925..118068
RNA-Seq ExpressionLsi08G000040
SyntenyLsi08G000040
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia]9.5e-11672.97Show/hide
Query:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
        MI L LGV  SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN SSS LLT+QIHS   L P+D+ AILDQV RMLRLTEKDEDE
Subjt:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE

Query:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRKVS---GNFPNAEEVCRMGVELLKKHNLGYRAGY
        +R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM  +S+KRKRK +   GNFPNA EVCRMGVE LK H LGYRA Y
Subjt:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRKVS---GNFPNAEEVCRMGVELLKKHNLGYRAGY

Query:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW
        ++ FAQ V++G INLQ       +P+  PKIKGFGPFATAN+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYW
Subjt:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW

XP_021905122.1 uncharacterized protein LOC110820055 isoform X2 [Carica papaya]5.3e-8253.59Show/hide
Query:  LNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH-SSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELR
        L+LG     F+LEKAVCNHG FMM PN W PS KTL+RPLRL SN +SSV+ SI+H S+S  L IQ+H    +S  D+ AIL+QV RMLR+++KDE+ +R
Subjt:  LNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH-SSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELR

Query:  KFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAK----MSNQSRKRKRKVS---------------GNFPNAEEVCRMGV
        +FQ +H  AK  GFGR+FRSP+LFED +KS+LLCN TW RTL MA+ LCELQ +    +S + RK++++ +               GNFPNAEE+  +  
Subjt:  KFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAK----MSNQSRKRKRKVS---------------GNFPNAEEVCRMGV

Query:  ELLKKH-NLGYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQC
        +LL++   LGYRA Y+IN AQ V++G ++L N   L KIKGFG F  AN+ MC+GFY+ +P DTET+RHLKQ+HG + C++ T+ +DVK IYDKY+PFQ 
Subjt:  ELLKKH-NLGYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQC

Query:  LAYWYK
        LAYW++
Subjt:  LAYWYK

XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia]2.1e-10768.54Show/hide
Query:  KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE
        ++MI LNLG  +S FDLE+AVCNHG FMM PN+WIPSSKTLQRPLRL ++S +SV VSI+  SS LL IQIHSSP  SP D+QAILDQV RMLR+TE+DE
Subjt:  KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE

Query:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VSGNFPNAEEVCRMGVELLK
        + +R FQ+LH KAK++GFGRLFRSPTLFEDA+KSILLCN TW+RTLAMA QLCELQAK+        +KRKRK         GNFP A E+CRM V LL+
Subjt:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VSGNFPNAEEVCRMGVELLK

Query:  KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA
        KH +GYRA YII+ AQRVQNG I+LQ        PKIKGFGPF TAN+ MCLG Y +LPIDTETIRHLKQ+HGRQ CN KT +E VK +YDKYAPFQCLA
Subjt:  KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA

Query:  YW
        YW
Subjt:  YW

XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata]7.3e-11672.97Show/hide
Query:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
        MI L LGV  SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN SSS LLT+QIHS   L P+D+ AILDQV RMLRLTEKDEDE
Subjt:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE

Query:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRKVS---GNFPNAEEVCRMGVELLKKHNLGYRAGY
        +R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM  +S+KRKRK +   GNFPNA EVCRMGVE LK H LGYRA Y
Subjt:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRKVS---GNFPNAEEVCRMGVELLKKHNLGYRAGY

Query:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW
        ++ FAQ V++G INLQ       +P+  PKIKGFGPFATAN+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYW
Subjt:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW

XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida]3.5e-13486.1Show/hide
Query:  KMIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIH-SSPPLSPQDQQAILDQVVRMLRLTEKDE
        K I LNLGVS SDFDLEKAVCNHGQFMM PNQWIPSSKTLQRPLRL S+S+SSVFVSIN  SS LLTIQIH SS PLSPQDQQAILDQVVRMLRLTEKDE
Subjt:  KMIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIH-SSPPLSPQDQQAILDQVVRMLRLTEKDE

Query:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQ-SRKRKRKVS------GNFPNAEEVCRMGVELLKKHNL
        DELRKFQSLHP+AKQMGFGRLFRSPTLFEDA+KSILLCNTTWKRTLAMA QLCELQAKM  Q +RKRKRK+       GNFPNAEEVCRMGVELLKKH L
Subjt:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQ-SRKRKRKVS------GNFPNAEEVCRMGVELLKKHNL

Query:  GYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW
        GYRA YIINFA+ VQ+G I+LQNPN+ PKIKGFGPFATAN+LMCLG YRQLPIDTETIRHLKQ+HGRQFCN KTV+EDVKQIYDKYAPFQCLAYW
Subjt:  GYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW

TrEMBL top hitse value%identityAlignment
A0A2P5ACW8 DNA glycosylase1.2e-7649.57Show/hide
Query:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE
        ++ L LG S S F++EKAVCNHG FMM+PN+W PS+KTLQRPLRL ++  SSV VSI+HS   S LL I++    P   LS  D  AIL+QV RMLR+T+
Subjt:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE

Query:  KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKM---------------SNQSRKRKR-----------KVS
        +DE ++R+FQ +HP+AK+ GFGR+FRSP+LFEDA+KSILLCN +W RTL MAE LC+LQ ++               SN+  KRKR           ++ 
Subjt:  KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKM---------------SNQSRKRKR-----------KVS

Query:  GNFPNAEEVCRMGVE-LLKKHN--LGYRAGYIINFAQRVQNGTIN---------LQNPNH------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH
        GNFPNA E+  +     L+K+   LGYRA +I++ A+  ++G +N          +  +H      + KI+GFGPF  AN+LMC+  Y  +P D+ETIRH
Subjt:  GNFPNAEEVCRMGVE-LLKKHN--LGYRAGYIINFAQRVQNGTIN---------LQNPNH------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH

Query:  LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW------YKDK
        L+Q+HGR+ CNKKT+ ++VK+IYDKYAPFQCLAYW      Y+DK
Subjt:  LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW------YKDK

A0A6A1W9S6 Uncharacterized protein1.1e-8050.74Show/hide
Query:  FDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSS---FLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHP
        F++EKAVCNHG FMM+PN WIPS+KTLQRPLRL +NS  SV VSI+H +S     + IQ+H +  +SPQD++AIL+QV RMLR++E+DE  LR+FQ+LHP
Subjt:  FDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSS---FLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHP

Query:  KAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSN-------------QSRKR--KRKVS-------------------------
        +AK+ GFGR FRSP+LFEDAIKS+LLCN TW RTL MA+ LCELQ +++N              SRKR  KRK +                         
Subjt:  KAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSN-------------QSRKR--KRKVS-------------------------

Query:  -----GNFPNAEEVCRMGVELLKKH-NLGYRAGYIINFAQRVQNGTINLQ--NPNH----------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH
             GNFP+++EV  +    L+ H NLGYRA YI+  A++V++G + L+  + +H          L KIKGFGPFA AN++MC+G+Y+ +P+DTET+RH
Subjt:  -----GNFPNAEEVCRMGVELLKKH-NLGYRAGYIINFAQRVQNGTINLQ--NPNH----------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH

Query:  LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWYK
        L+Q+HGR+   K+TV EDVK +YDK+APFQ LAYW++
Subjt:  LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWYK

A0A6J1DS88 uncharacterized protein LOC1110238221.0e-10768.54Show/hide
Query:  KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE
        ++MI LNLG  +S FDLE+AVCNHG FMM PN+WIPSSKTLQRPLRL ++S +SV VSI+  SS LL IQIHSSP  SP D+QAILDQV RMLR+TE+DE
Subjt:  KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE

Query:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VSGNFPNAEEVCRMGVELLK
        + +R FQ+LH KAK++GFGRLFRSPTLFEDA+KSILLCN TW+RTLAMA QLCELQAK+        +KRKRK         GNFP A E+CRM V LL+
Subjt:  DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VSGNFPNAEEVCRMGVELLK

Query:  KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA
        KH +GYRA YII+ AQRVQNG I+LQ        PKIKGFGPF TAN+ MCLG Y +LPIDTETIRHLKQ+HGRQ CN KT +E VK +YDKYAPFQCLA
Subjt:  KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA

Query:  YW
        YW
Subjt:  YW

A0A6J1GJ25 uncharacterized protein LOC1114546593.5e-11672.97Show/hide
Query:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
        MI L LGV  SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN SSS LLT+QIHS   L P+D+ AILDQV RMLRLTEKDEDE
Subjt:  MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE

Query:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRKVS---GNFPNAEEVCRMGVELLKKHNLGYRAGY
        +R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM  +S+KRKRK +   GNFPNA EVCRMGVE LK H LGYRA Y
Subjt:  LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRKVS---GNFPNAEEVCRMGVELLKKHNLGYRAGY

Query:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW
        ++ FAQ V++G INLQ       +P+  PKIKGFGPFATAN+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYW
Subjt:  IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYW

A0A6P4BPN5 uncharacterized protein LOC1074341912.5e-7750.47Show/hide
Query:  SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH---SSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSL
        S F+LEKAVCNHG FMM+PN WIPS+KTLQRPLRL S+  +S  VSI+H    S  LL I +HS  P S  D+ AIL QV RMLR++E+DE ++R+FQ  
Subjt:  SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH---SSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSL

Query:  HPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSR---KRKRKVS----------GNFPNAEEVCRMGVELLKKHN--LG
         PKAK  GFGRLFRSP++FEDA+KSILLCN TW ++L MA+ LCELQ +++N  +   KRKR  S          GNFP ++E+  +    L++    LG
Subjt:  HPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSR---KRKRKVS----------GNFPNAEEVCRMGVELLKKHN--LG

Query:  YRAGYIINFAQRVQNGTINLQNP---------------NHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKY
        YRA YI+  A+ V++G ++L+                   L  + GFGP+  AN+ MC+G Y+ +P+DTETIRH++Q+HGR+ C+KKTV++ V++IYDK+
Subjt:  YRAGYIINFAQRVQNGTINLQNP---------------NHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKY

Query:  APFQCLAYW------YKDK
        APFQCLAYW      Y+DK
Subjt:  APFQCLAYW------YKDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAGAGTAGAGTAAGAAAGAAAATAATGAAGAAGATGATTGTTTTGAATTTGGGAGTGAGTAGTGATTTTGATCTTGAGAAAGCAGTTTGTAACCATGGGCAATT
TATGATGTCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTTATTTCTAATTCTAACTCTTCTGTTTTTGTCTCTATCAACCATTCTTCTT
CTTTTCTTCTAACCATTCAAATCCACTCTTCTCCCCCTCTCTCTCCCCAAGATCAACAAGCTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAA
GATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGCCTTTTTCGATCTCCCACTCTTTTTGAAGATGCAATCAAGTCCATCCTTCT
ATGCAATACCACGTGGAAAAGGACACTGGCAATGGCTGAACAGCTATGTGAGCTCCAAGCCAAAATGAGCAACCAAAGTAGGAAGAGAAAAAGGAAAGTAAGTGGGAATT
TTCCAAATGCAGAAGAAGTTTGTAGAATGGGGGTTGAATTGTTGAAGAAGCATAATCTTGGTTACAGAGCTGGTTACATCATTAACTTTGCTCAACGTGTTCAAAATGGC
ACAATTAATCTCCAAAATCCTAATCATTTACCTAAAATCAAAGGCTTTGGACCTTTTGCAACCGCTAATTTACTCATGTGCCTCGGTTTTTACCGCCAACTTCCAATTGA
TACTGAAACTATAAGGCACTTAAAACAGCTACATGGGAGACAATTTTGCAACAAAAAGACAGTACAGGAAGACGTCAAACAAATTTACGACAAGTATGCTCCTTTCCAAT
GCTTGGCCTATTGGTACAAAGATAAGACAATGAATACAGAAGAAGGAAGCAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAGAGTAGAGTAAGAAAGAAAATAATGAAGAAGATGATTGTTTTGAATTTGGGAGTGAGTAGTGATTTTGATCTTGAGAAAGCAGTTTGTAACCATGGGCAATT
TATGATGTCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTTATTTCTAATTCTAACTCTTCTGTTTTTGTCTCTATCAACCATTCTTCTT
CTTTTCTTCTAACCATTCAAATCCACTCTTCTCCCCCTCTCTCTCCCCAAGATCAACAAGCTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAA
GATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGCCTTTTTCGATCTCCCACTCTTTTTGAAGATGCAATCAAGTCCATCCTTCT
ATGCAATACCACGTGGAAAAGGACACTGGCAATGGCTGAACAGCTATGTGAGCTCCAAGCCAAAATGAGCAACCAAAGTAGGAAGAGAAAAAGGAAAGTAAGTGGGAATT
TTCCAAATGCAGAAGAAGTTTGTAGAATGGGGGTTGAATTGTTGAAGAAGCATAATCTTGGTTACAGAGCTGGTTACATCATTAACTTTGCTCAACGTGTTCAAAATGGC
ACAATTAATCTCCAAAATCCTAATCATTTACCTAAAATCAAAGGCTTTGGACCTTTTGCAACCGCTAATTTACTCATGTGCCTCGGTTTTTACCGCCAACTTCCAATTGA
TACTGAAACTATAAGGCACTTAAAACAGCTACATGGGAGACAATTTTGCAACAAAAAGACAGTACAGGAAGACGTCAAACAAATTTACGACAAGTATGCTCCTTTCCAAT
GCTTGGCCTATTGGTACAAAGATAAGACAATGAATACAGAAGAAGGAAGCAGTTAA
Protein sequenceShow/hide protein sequence
MKESRVRKKIMKKMIVLNLGVSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE
DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRKVSGNFPNAEEVCRMGVELLKKHNLGYRAGYIINFAQRVQNG
TINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWYKDKTMNTEEGSS