; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr007477 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr007477
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionENDO3c domain-containing protein
Genome locationtig00005607:126721..130283
RNA-Seq ExpressionSgr007477
SyntenySgr007477
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4278625.1 unnamed protein product [Prunus armeniaca]1.6e-6145.73Show/hide
Query:  FDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHL-LNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLRKFQILHPKAK
        F+LEKAVCNHG FMM PN+WIPSSKTLQRPLRLADSTTC+ VSI    +   L +++H  + +S  DQ+AIL+QV RMLRI+++DE D+R++Q +HP+AK
Subjt:  FDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHL-LNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLRKFQILHPKAK

Query:  ETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGE------------YEG---GNFPNATELSRMSVSLLKKH-
        E GFGR+FRSPTLFED VK +LLCN TW  TL MA  LCELQ E+S  +   R G+ +  E             EG   GNFP + EL+ +  + L    
Subjt:  ETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGE------------YEG---GNFPNATELSRMSVSLLKKH-

Query:  -LIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAAS-DCDHV---LPKIKGFGPFATANLLM---------------------------------------
         ++GYRA  I+K A+ VE G I L + E+     S + D V   L KIKGFG +A AN LM                                       
Subjt:  -LIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAAS-DCDHV---LPKIKGFGPFATANLLM---------------------------------------

Query:  LELVEYYEGRFGKLSELCWHDYDKISGT
        LEL+++YE +FGKLSEL    Y+ +SG+
Subjt:  LELVEYYEGRFGKLSELCWHDYDKISGT

KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia]3.8e-8753.89Show/hide
Query:  LNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLR
        + L+  +R+  F+LEKAVCNHG FMM PN+WIPSSKTLQRPLRL++S T + VSI+Q+SS LL +Q+HS R+L P D+ AILDQV RMLR+T+KDED++R
Subjt:  LNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLR

Query:  KFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKHLIGYRA
        +FQ LHP AK+ GFGR+FRSP+LFED VKSIL+CNT+WRRTL MA +LCE+QA++ + KKRKRKG       E GNFPNA E+ RM V  LK H +GYRA
Subjt:  KFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKHLIGYRA

Query:  VYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM--------------------------------------------------
         Y++KFAQSVE+GRI+LQ + E+ V++ D     PKIKGFGPFATAN+ M                                                  
Subjt:  VYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM--------------------------------------------------

Query:  --LELVEYYEGRFGKLSELCWHDYDKISGTTVNL
          LELV+YYE +FGKLSEL   DY KISG+T++L
Subjt:  --LELVEYYEGRFGKLSELCWHDYDKISGTTVNL

XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia]2.1e-10162.35Show/hide
Query:  KMLHLNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDE
        +M+ LNL ++    GFDLE+AVCNHG FMMPPNKWIPSSKTLQRPLRLADSTT + VSISQ SSHLLNIQ+HSS + SP D+QAILDQVTRMLRIT++DE
Subjt:  KMLHLNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDE

Query:  DDLRKFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAE-----ISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLL
        +++R FQ LH KAKE GFGRLFRSPTLFEDAVKSILLCN TWRRTLAMAGQLCELQA+     I+  KKRKRKGKG+  E EGGNFP A EL RMSV LL
Subjt:  DDLRKFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAE-----ISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLL

Query:  KKHLIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM-----------------------------------------
        +KH IGYRAVYII  AQ V+NG+IDLQKIE             PKIKGFGPF TAN+ M                                         
Subjt:  KKHLIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM-----------------------------------------

Query:  -----------LELVEYYEGRFGKLSELCWHDYDKISGTT
                   +ELVEYYE RFGKLSEL WHDY KISGTT
Subjt:  -----------LELVEYYEGRFGKLSELCWHDYDKISGTT

XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata]2.5e-8653.59Show/hide
Query:  LNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLR
        + L+  + +  F+LEKAVCNHG FMM PN+WIPSSKTLQRPLRL++S T + VSI+Q+SS LL +Q+HS R+L P D+ AILDQV RMLR+T+KDED++R
Subjt:  LNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLR

Query:  KFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKHLIGYRA
        +FQ LHP AK+ GFGR+FRSP+LFED VKSIL+CNT+WRRTL MA +LCE+QA++ + KKRKRKG       E GNFPNA E+ RM V  LK H +GYRA
Subjt:  KFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKHLIGYRA

Query:  VYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM--------------------------------------------------
         Y++KFAQSVE+GRI+LQ + E+ V++ D     PKIKGFGPFATAN+ M                                                  
Subjt:  VYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM--------------------------------------------------

Query:  --LELVEYYEGRFGKLSELCWHDYDKISGTTVNL
          LELV+YYE +FGKLSEL   DY KISG+T++L
Subjt:  --LELVEYYEGRFGKLSELCWHDYDKISGTTVNL

XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida]5.0e-8768.36Show/hide
Query:  MKMLHLNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRT-LSPADQQAILDQVTRMLRITQK
        MK +HLNL   + +  FDLEKAVCNHG FMMPPN+WIPSSKTLQRPLRL+DS + ++VSI+Q SS LL IQ+HSS T LSP DQQAILDQV RMLR+T+K
Subjt:  MKMLHLNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRT-LSPADQQAILDQVTRMLRITQK

Query:  DEDDLRKFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKH
        DED+LRKFQ LHP+AK+ GFGRLFRSPTLFEDA+KSILLCNTTW+RTLAMAGQLCELQA++ ++  RKRK K    E E GNFPNA E+ RM V LLKKH
Subjt:  DEDDLRKFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKH

Query:  LIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM
         +GYRA YII FA+ V++G+IDLQ             +  PKIKGFGPFATAN+LM
Subjt:  LIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM

TrEMBL top hitse value%identityAlignment
A0A5C7H480 Uncharacterized protein5.1e-6154.3Show/hide
Query:  LHLNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQ--TSSHLLNIQL-HSSRTLSPADQQAILDQVTRMLRITQKD
        LHL L +      F LEKAVCNHG +MM PN W  S+K+LQRPLRLADS T + VSIS    + H L I++ H ++TLS  DQQAI DQV+RMLRIT+KD
Subjt:  LHLNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQ--TSSHLLNIQL-HSSRTLSPADQQAILDQVTRMLRITQKD

Query:  EDDLRKFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKHL
        E D+++FQ LH +AK+ GFGRLFRSP LFED +K ILLCN TW+RTL+MA  LCE QAE+    K  R+ K        GNFP++ EL+ MS   LK   
Subjt:  EDDLRKFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKHL

Query:  -IGYRAVYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM
         +GYRA YI +FA++VE+G IDL K E++  +       L K KGFG F  AN+LM
Subjt:  -IGYRAVYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM

A0A6J1DS88 uncharacterized protein LOC1110238221.0e-10162.35Show/hide
Query:  KMLHLNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDE
        +M+ LNL ++    GFDLE+AVCNHG FMMPPNKWIPSSKTLQRPLRLADSTT + VSISQ SSHLLNIQ+HSS + SP D+QAILDQVTRMLRIT++DE
Subjt:  KMLHLNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDE

Query:  DDLRKFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAE-----ISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLL
        +++R FQ LH KAKE GFGRLFRSPTLFEDAVKSILLCN TWRRTLAMAGQLCELQA+     I+  KKRKRKGKG+  E EGGNFP A EL RMSV LL
Subjt:  DDLRKFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAE-----ISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLL

Query:  KKHLIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM-----------------------------------------
        +KH IGYRAVYII  AQ V+NG+IDLQKIE             PKIKGFGPF TAN+ M                                         
Subjt:  KKHLIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM-----------------------------------------

Query:  -----------LELVEYYEGRFGKLSELCWHDYDKISGTT
                   +ELVEYYE RFGKLSEL WHDY KISGTT
Subjt:  -----------LELVEYYEGRFGKLSELCWHDYDKISGTT

A0A6J1GJ25 uncharacterized protein LOC1114546591.2e-8653.59Show/hide
Query:  LNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLR
        + L+  + +  F+LEKAVCNHG FMM PN+WIPSSKTLQRPLRL++S T + VSI+Q+SS LL +Q+HS R+L P D+ AILDQV RMLR+T+KDED++R
Subjt:  LNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLR

Query:  KFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKHLIGYRA
        +FQ LHP AK+ GFGR+FRSP+LFED VKSIL+CNT+WRRTL MA +LCE+QA++ + KKRKRKG       E GNFPNA E+ RM V  LK H +GYRA
Subjt:  KFQILHPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKHLIGYRA

Query:  VYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM--------------------------------------------------
         Y++KFAQSVE+GRI+LQ + E+ V++ D     PKIKGFGPFATAN+ M                                                  
Subjt:  VYIIKFAQSVENGRIDLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLM--------------------------------------------------

Query:  --LELVEYYEGRFGKLSELCWHDYDKISGTTVNL
          LELV+YYE +FGKLSEL   DY KISG+T++L
Subjt:  --LELVEYYEGRFGKLSELCWHDYDKISGTTVNL

A0A6J5UTX2 Uncharacterized protein7.8e-6245.73Show/hide
Query:  FDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHL-LNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLRKFQILHPKAK
        F+LEKAVCNHG FMM PN+WIPSSKTLQRPLRLADSTTC+ VSI    +   L +++H  + +S  DQ+AIL+QV RMLRI+++DE D+R++Q +HP+AK
Subjt:  FDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHL-LNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLRKFQILHPKAK

Query:  ETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGE------------YEG---GNFPNATELSRMSVSLLKKH-
        E GFGR+FRSPTLFED VK +LLCN TW  TL MA  LCELQ E+S  +   R G+ +  E             EG   GNFP + EL+ +  + L    
Subjt:  ETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGE------------YEG---GNFPNATELSRMSVSLLKKH-

Query:  -LIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAAS-DCDHV---LPKIKGFGPFATANLLM---------------------------------------
         ++GYRA  I+K A+ VE G I L + E+     S + D V   L KIKGFG +A AN LM                                       
Subjt:  -LIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAAS-DCDHV---LPKIKGFGPFATANLLM---------------------------------------

Query:  LELVEYYEGRFGKLSELCWHDYDKISGT
        LEL+++YE +FGKLSEL    Y+ +SG+
Subjt:  LELVEYYEGRFGKLSELCWHDYDKISGT

A0A6J5XDT5 Uncharacterized protein7.8e-6245.73Show/hide
Query:  FDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHL-LNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLRKFQILHPKAK
        F+LEKAVCNHG FMM PN+WIPSSKTLQRPLRLADSTTC+ VSI    +   L +++H  + +S  DQ+AIL+QV RMLRI+++DE D+R++Q +HP+AK
Subjt:  FDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHL-LNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLRKFQILHPKAK

Query:  ETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGE------------YEG---GNFPNATELSRMSVSLLKKH-
        E GFGR+FRSPTLFED VK +LLCN TW  TL MA  LCELQ E+S  +   R G+ +  E             EG   GNFP + EL+ +  + L    
Subjt:  ETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGE------------YEG---GNFPNATELSRMSVSLLKKH-

Query:  -LIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAAS-DCDHV---LPKIKGFGPFATANLLM---------------------------------------
         ++GYRA  I+K A+ VE G I L + E+     S + D V   L KIKGFG +A AN LM                                       
Subjt:  -LIGYRAVYIIKFAQSVENGRIDLQKIEEEAVAAS-DCDHV---LPKIKGFGPFATANLLM---------------------------------------

Query:  LELVEYYEGRFGKLSELCWHDYDKISGT
        LEL+++YE +FGKLSEL    Y+ +SG+
Subjt:  LELVEYYEGRFGKLSELCWHDYDKISGT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATGCTTCATTTGAATTTGGAAGATTCAATGAGAATGAGGGGGTTCGATCTTGAGAAAGCTGTTTGCAATCATGGGCTGTTCATGATGCCTCCAAACAAATGGAT
TCCTTCTTCCAAAACTCTCCAACGTCCACTTCGCCTCGCCGACTCCACGACTTGTATTTGGGTCTCCATTTCTCAAACTTCTTCTCATCTCCTCAACATTCAACTCCACT
CCTCCCGCACTCTCTCCCCTGCTGATCAGCAAGCTATCTTGGACCAAGTCACTCGAATGCTTAGAATAACACAGAAAGATGAAGATGACCTTAGGAAGTTTCAAATTTTG
CATCCCAAGGCCAAGGAGACGGGATTTGGTCGACTTTTTCGATCTCCCACTCTTTTCGAAGATGCAGTCAAGTCCATCCTTCTATGCAATACCACGTGGAGAAGAACATT
GGCAATGGCTGGACAACTGTGCGAGCTCCAAGCGGAAATTAGTAAGAAGAAGAAGAGAAAACGAAAAGGGAAAGGGCAGAGTGGGGAATACGAGGGAGGCAATTTTCCAA
ATGCCACAGAACTTTCCAGAATGAGCGTTTCATTGTTGAAGAAGCATTTAATTGGTTACAGAGCAGTTTACATCATTAAATTTGCTCAAAGCGTTGAAAATGGGAGAATC
GATTTGCAAAAGATTGAAGAAGAAGCAGTGGCTGCATCTGACTGTGACCATGTCCTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAACTGCCAATTTACTCATGTTGGA
GCTTGTTGAGTATTACGAGGGCAGATTTGGGAAGCTAAGTGAATTGTGTTGGCATGATTATGACAAGATTAGTGGCACCACAGTCAACCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGATGCTTCATTTGAATTTGGAAGATTCAATGAGAATGAGGGGGTTCGATCTTGAGAAAGCTGTTTGCAATCATGGGCTGTTCATGATGCCTCCAAACAAATGGAT
TCCTTCTTCCAAAACTCTCCAACGTCCACTTCGCCTCGCCGACTCCACGACTTGTATTTGGGTCTCCATTTCTCAAACTTCTTCTCATCTCCTCAACATTCAACTCCACT
CCTCCCGCACTCTCTCCCCTGCTGATCAGCAAGCTATCTTGGACCAAGTCACTCGAATGCTTAGAATAACACAGAAAGATGAAGATGACCTTAGGAAGTTTCAAATTTTG
CATCCCAAGGCCAAGGAGACGGGATTTGGTCGACTTTTTCGATCTCCCACTCTTTTCGAAGATGCAGTCAAGTCCATCCTTCTATGCAATACCACGTGGAGAAGAACATT
GGCAATGGCTGGACAACTGTGCGAGCTCCAAGCGGAAATTAGTAAGAAGAAGAAGAGAAAACGAAAAGGGAAAGGGCAGAGTGGGGAATACGAGGGAGGCAATTTTCCAA
ATGCCACAGAACTTTCCAGAATGAGCGTTTCATTGTTGAAGAAGCATTTAATTGGTTACAGAGCAGTTTACATCATTAAATTTGCTCAAAGCGTTGAAAATGGGAGAATC
GATTTGCAAAAGATTGAAGAAGAAGCAGTGGCTGCATCTGACTGTGACCATGTCCTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAACTGCCAATTTACTCATGTTGGA
GCTTGTTGAGTATTACGAGGGCAGATTTGGGAAGCTAAGTGAATTGTGTTGGCATGATTATGACAAGATTAGTGGCACCACAGTCAACCTTTGA
Protein sequenceShow/hide protein sequence
MKMLHLNLEDSMRMRGFDLEKAVCNHGLFMMPPNKWIPSSKTLQRPLRLADSTTCIWVSISQTSSHLLNIQLHSSRTLSPADQQAILDQVTRMLRITQKDEDDLRKFQIL
HPKAKETGFGRLFRSPTLFEDAVKSILLCNTTWRRTLAMAGQLCELQAEISKKKKRKRKGKGQSGEYEGGNFPNATELSRMSVSLLKKHLIGYRAVYIIKFAQSVENGRI
DLQKIEEEAVAASDCDHVLPKIKGFGPFATANLLMLELVEYYEGRFGKLSELCWHDYDKISGTTVNL