; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G00010 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G00010
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDNA glycosylase
Genome locationClcChr08:61045..62751
RNA-Seq ExpressionClc08G00010
SyntenyClc08G00010
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia]4.5e-12470.85Show/hide
Query:  MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP
        +SDF+LE+AVCNHG FMM PNQWIPSSKTLQRPLRLSNS++S+ VSINQ+SS LLT+QIH S  +L P+D+  ILDQV RMLRLTEKDEDE+R+FQ+LHP
Subjt:  MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP

Query:  KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ
         AKQ+GFGR+FRSP++FED +KSIL+CNT+W+RTL MA +LCE+QA+M  +++KRKRK     GNFPNA EVCRMGVE LK H LGYRA +++ FAQ V+
Subjt:  KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ

Query:  NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK
        +  I+LQ       +P+ FPKIKGFGPFATAN+ MCLGFY QLPIDTETIRH+KQVHG Q+C  KTV EDVKQIYD YAP+QCLAY LELV+YYE+KFGK
Subjt:  NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK

Query:  LSELCSLDYHKISGATLNL
        LSEL S DYHKISG+TL+L
Subjt:  LSELCSLDYHKISGATLNL

XP_021905122.1 uncharacterized protein LOC110820055 isoform X2 [Carica papaya]3.2e-8553.25Show/hide
Query:  FDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTS-SFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKA
        F+LE+AVCNHG FMMPPN W PS KTL+RPLRLSN +SSV+ SI+  S S  L IQ+H     +S  D+  IL+QV RMLR+++KDE+ +R+FQ +H  A
Subjt:  FDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTS-SFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKA

Query:  KQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---ARMSSQNRKRKRKLI----------------GNFPNAEEVCRMGVELLKKH-NL
        K  GFGR+FRSP++FED +KS+LLCN TW RTL MA  LCELQ    R  S  +++KRK                  GNFPNAEE+  +  +LL++   L
Subjt:  KQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---ARMSSQNRKRKRKLI----------------GNFPNAEEVCRMGVELLKKH-NL

Query:  GYRAGFIINFAQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVE
        GYRA ++IN AQ V++  +DL N  +  KIKGFG F  AN+ MC+GFY+ +P DTET+RH+KQVHG + C+  T+ +DVK IYDKY+PFQ LAY  EL+ 
Subjt:  GYRAGFIINFAQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVE

Query:  YYESKFGKLSELCSLDYHKISGA
        YYESK GKLSEL    Y  ++G+
Subjt:  YYESKFGKLSELCSLDYHKISGA

XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia]4.1e-11769.25Show/hide
Query:  KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSL
        +T S FDLERAVCNHG FMMPPN+WIPSSKTLQRPLRL++S +SV VSI+Q SS LL IQIH SS + SP D+Q ILDQV RMLR+TE+DE+ +R FQ+L
Subjt:  KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSL

Query:  HPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAEEVCRMGVELLKKHNLGYRAG
        H KAK++GFGRLFRSPT+FEDA+KSILLCN TW+RTLAMAGQLCELQA++     +  +KRKRK         GNFP A E+CRM V LL+KH +GYRA 
Subjt:  HPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAEEVCRMGVELLKKHNLGYRAG

Query:  FIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY
        +II+ AQRVQN  IDLQ      +FPKIKGFGPF TAN+ MCLG Y +LPIDTETIRH+KQVHGRQ CN KT  E VK +YDKYAPFQCLAY +ELVEYY
Subjt:  FIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY

Query:  ESKFGKLSELCSLDYHKISGAT
        ES+FGKLSEL   DY KISG T
Subjt:  ESKFGKLSELCSLDYHKISGAT

XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata]4.5e-12470.85Show/hide
Query:  MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP
        +SDF+LE+AVCNHG FMM PNQWIPSSKTLQRPLRLSNS++S+ VSINQ+SS LLT+QIH S  +L P+D+  ILDQV RMLRLTEKDEDE+R+FQ+LHP
Subjt:  MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP

Query:  KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ
         AKQ+GFGR+FRSP++FED +KSIL+CNT+W+RTL MA +LCE+QA+M  +++KRKRK     GNFPNA EVCRMGVE LK H LGYRA +++ FAQ V+
Subjt:  KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ

Query:  NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK
        +  I+LQ       +P+ FPKIKGFGPFATAN+ MCLGFY QLPIDTETIRH+KQVHG Q+C  KTV EDVKQIYD YAP+QCLAY LELV+YYE+KFGK
Subjt:  NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK

Query:  LSELCSLDYHKISGATLNL
        LSEL S DYHKISG+TL+L
Subjt:  LSELCSLDYHKISGATLNL

XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida]1.0e-13687.46Show/hide
Query:  TMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLH
        ++SDFDLE+AVCNHGQFMMPPNQWIPSSKTLQRPLRLS+S+SSVFVSINQ SS LLTIQIHSSS  LSPQDQQ ILDQVVRMLRLTEKDEDELRKFQSLH
Subjt:  TMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLH

Query:  PKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQ-NRKRKRKL------IGNFPNAEEVCRMGVELLKKHNLGYRAGFIINF
        P+AKQMGFGRLFRSPT+FEDALKSILLCNTTWKRTLAMAGQLCELQA+M  Q  RKRKRKL      IGNFPNAEEVCRMGVELLKKH LGYRA +IINF
Subjt:  PKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQ-NRKRKRKL------IGNFPNAEEVCRMGVELLKKHNLGYRAGFIINF

Query:  AQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLE
        A+ VQ+  IDLQNPN FPKIKGFGPFATAN+LMCLG YRQLPIDTETIRH+KQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAY LE
Subjt:  AQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLE

TrEMBL top hitse value%identityAlignment
A0A2P5ACW8 DNA glycosylase3.9e-8148.74Show/hide
Query:  KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLTIQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRK
        ++ S F++E+AVCNHG FMM PN+W PS+KTLQRPLRL++  SSV VSI+ +   S LL I++   S S ALS  D   IL+QV RMLR+T++DE ++R+
Subjt:  KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLTIQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRK

Query:  FQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARM---------------SSQNRKRKR-----------KLIGNFPNAEE
        FQ +HP+AK+ GFGR+FRSP++FEDA+KSILLCN +W RTL MA  LC+LQ  +               S++  KRKR           +++GNFPNA E
Subjt:  FQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARM---------------SSQNRKRKR-----------KLIGNFPNAEE

Query:  VCRMGVE-LLKKHN--LGYRAGFIINFAQRVQNATID------------LQNPNN---FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQ
        +  +     L+K+   LGYRA  I++ A+  ++  ++            L +        KI+GFGPF  AN+LMC+  Y  +P D+ETIRH++QVHGR+
Subjt:  VCRMGVE-LLKKHN--LGYRAGFIINFAQRVQNATID------------LQNPNN---FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQ

Query:  FCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL
         CN KT+ ++VK+IYDKYAPFQCLAY +EL+EYYE KFGKLSEL    Y  ISG+ L
Subjt:  FCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL

A0A2P5FT40 DNA glycosylase1.1e-8048.04Show/hide
Query:  KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLTIQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRK
        ++ S F++E+AVCNHG FMM PN+W PS+KTLQRPLRL++  SSV VSI+ +   S LL I++   S S  LS  D   IL+QV RMLR+TE+DE ++R+
Subjt:  KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLTIQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRK

Query:  FQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKR------------KLIGNFPNAE
        FQ +HP+AK+ GFGR+FRSP++FEDA+KSILLCN +W RTL MA  LC+LQ               +  S+++ KRKR            +++GNFPNA 
Subjt:  FQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKR------------KLIGNFPNAE

Query:  EVCRM-GVELLKKHN--LGYRAGFIINFAQRVQNATID-LQNPNN--------------FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGR
        E+  +     L+K+   LGYRA  I++ A+  ++  ++ L+                     I+GFGPF  AN+LMC+  Y  +P D+ETIRH++QVH R
Subjt:  EVCRM-GVELLKKHN--LGYRAGFIINFAQRVQNATID-LQNPNN--------------FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGR

Query:  QFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL
        + CN KT++++VK+IYDKYAPFQCLAY +EL+EYYE KFGKLSEL    Y  ISG+ L
Subjt:  QFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL

A0A6A1W9S6 Uncharacterized protein6.4e-8447.84Show/hide
Query:  EKTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQ----TSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELR
        E+ +  F++E+AVCNHG FMM PN WIPS+KTLQRPLRL+NS  SV VSI+     T++++L IQ+H +   +SPQD++ IL+QV RMLR++E+DE  LR
Subjt:  EKTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQ----TSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELR

Query:  KFQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKRKL-------------------
        +FQ+LHP+AK+ GFGR FRSP++FEDA+KS+LLCN TW RTL MA  LCELQ               AR  S+ R  KRK                    
Subjt:  KFQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKRKL-------------------

Query:  -----------IGNFPNAEEVCRMGVELLKKH-NLGYRAGFIINFAQRVQNATIDLQNPNN------------FPKIKGFGPFATANLLMCLGFYRQLPI
                   +GNFP+++EV  +    L+ H NLGYRA +I+  A++V++  + L+  ++              KIKGFGPFA AN++MC+G+Y+ +P+
Subjt:  -----------IGNFPNAEEVCRMGVELLKKH-NLGYRAGFIINFAQRVQNATIDLQNPNN------------FPKIKGFGPFATANLLMCLGFYRQLPI

Query:  DTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGA
        DTET+RH++QVHGR+    +TV EDVK +YDK+APFQ LAY  EL+E+YE KFGKLSEL +  Y  +SG+
Subjt:  DTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGA

A0A6J1DS88 uncharacterized protein LOC1110238222.0e-11769.25Show/hide
Query:  KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSL
        +T S FDLERAVCNHG FMMPPN+WIPSSKTLQRPLRL++S +SV VSI+Q SS LL IQIH SS + SP D+Q ILDQV RMLR+TE+DE+ +R FQ+L
Subjt:  KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSL

Query:  HPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAEEVCRMGVELLKKHNLGYRAG
        H KAK++GFGRLFRSPT+FEDA+KSILLCN TW+RTLAMAGQLCELQA++     +  +KRKRK         GNFP A E+CRM V LL+KH +GYRA 
Subjt:  HPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAEEVCRMGVELLKKHNLGYRAG

Query:  FIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY
        +II+ AQRVQN  IDLQ      +FPKIKGFGPF TAN+ MCLG Y +LPIDTETIRH+KQVHGRQ CN KT  E VK +YDKYAPFQCLAY +ELVEYY
Subjt:  FIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY

Query:  ESKFGKLSELCSLDYHKISGAT
        ES+FGKLSEL   DY KISG T
Subjt:  ESKFGKLSELCSLDYHKISGAT

A0A6J1GJ25 uncharacterized protein LOC1114546592.2e-12470.85Show/hide
Query:  MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP
        +SDF+LE+AVCNHG FMM PNQWIPSSKTLQRPLRLSNS++S+ VSINQ+SS LLT+QIH S  +L P+D+  ILDQV RMLRLTEKDEDE+R+FQ+LHP
Subjt:  MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP

Query:  KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ
         AKQ+GFGR+FRSP++FED +KSIL+CNT+W+RTL MA +LCE+QA+M  +++KRKRK     GNFPNA EVCRMGVE LK H LGYRA +++ FAQ V+
Subjt:  KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ

Query:  NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK
        +  I+LQ       +P+ FPKIKGFGPFATAN+ MCLGFY QLPIDTETIRH+KQVHG Q+C  KTV EDVKQIYD YAP+QCLAY LELV+YYE+KFGK
Subjt:  NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK

Query:  LSELCSLDYHKISGATLNL
        LSEL S DYHKISG+TL+L
Subjt:  LSELCSLDYHKISGATLNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCAATCACGCATGCAATTTCAAAATTACAGAGATATGAAGAAGAAGATGATGAAAAGACGATGAGTGATTTCGATCTTGAGAGAGCAGTTTGTAACCATGGGCA
GTTTATGATGCCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCTAATTCAAACTCTTCTGTATTTGTCTCTATCAACCAAACTTCGT
CTTTTCTTCTCACCATTCAAATCCACTCTTCTTCTGCTGCTCTCTCTCCCCAAGATCAACAAACTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGAT
GAGGATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGGCTTTTTCGGTCTCCCACTGTTTTTGAAGATGCACTCAAGTCCATCCT
TCTATGCAATACCACGTGGAAAAGGACACTGGCAATGGCTGGACAGCTGTGTGAGCTCCAAGCCAGAATGAGCAGCCAAAATAGGAAGAGAAAAAGGAAATTAATTGGGA
ATTTTCCAAATGCAGAAGAAGTTTGTAGAATGGGCGTTGAATTGTTGAAGAAGCATAACCTTGGTTACAGAGCTGGTTTCATCATTAACTTTGCTCAACGCGTTCAAAAT
GCCACAATTGATCTCCAAAATCCTAATAATTTCCCTAAAATCAAAGGCTTCGGACCTTTTGCAACCGCTAATCTACTCATGTGCCTCGGATTTTACCGTCAACTTCCTAT
TGATACTGAAACTATAAGGCACATAAAACAGGTACATGGAAGACAATTTTGCAACAATAAGACAGTACGGGAAGATGTCAAACAAATTTACGACAAGTATGCTCCATTCC
AGTGCTTGGCCTATCGGTTGGAGCTTGTGGAATATTACGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATCATAAGATCAGTGGCGCCACCCTCAAC
CTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCAATCACGCATGCAATTTCAAAATTACAGAGATATGAAGAAGAAGATGATGAAAAGACGATGAGTGATTTCGATCTTGAGAGAGCAGTTTGTAACCATGGGCA
GTTTATGATGCCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCTAATTCAAACTCTTCTGTATTTGTCTCTATCAACCAAACTTCGT
CTTTTCTTCTCACCATTCAAATCCACTCTTCTTCTGCTGCTCTCTCTCCCCAAGATCAACAAACTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGAT
GAGGATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGGCTTTTTCGGTCTCCCACTGTTTTTGAAGATGCACTCAAGTCCATCCT
TCTATGCAATACCACGTGGAAAAGGACACTGGCAATGGCTGGACAGCTGTGTGAGCTCCAAGCCAGAATGAGCAGCCAAAATAGGAAGAGAAAAAGGAAATTAATTGGGA
ATTTTCCAAATGCAGAAGAAGTTTGTAGAATGGGCGTTGAATTGTTGAAGAAGCATAACCTTGGTTACAGAGCTGGTTTCATCATTAACTTTGCTCAACGCGTTCAAAAT
GCCACAATTGATCTCCAAAATCCTAATAATTTCCCTAAAATCAAAGGCTTCGGACCTTTTGCAACCGCTAATCTACTCATGTGCCTCGGATTTTACCGTCAACTTCCTAT
TGATACTGAAACTATAAGGCACATAAAACAGGTACATGGAAGACAATTTTGCAACAATAAGACAGTACGGGAAGATGTCAAACAAATTTACGACAAGTATGCTCCATTCC
AGTGCTTGGCCTATCGGTTGGAGCTTGTGGAATATTACGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATCATAAGATCAGTGGCGCCACCCTCAAC
CTTTGA
Protein sequenceShow/hide protein sequence
MKAITHAISKLQRYEEEDDEKTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKD
EDELRKFQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRKLIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQN
ATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATLN
L