; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006631 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006631
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDNA glycosylase
Genome locationscaffold7:48324353..48326143
RNA-Seq ExpressionSpg006631
SyntenySpg006631
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia]2.1e-13773.48Show/hide
Query:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
        V +SDF+LEKAVCNHG FMM+PN+WIPSSKTLQRPLRLSNSDTS+ VSINQSSS LL++QIH+ RSLPPKD+ AILDQVARMLR+TEKDED++R+F NLH
Subjt:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDEDDLRKFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSVELLKKHLLGYRADY
        P AK++GFGRIFRSP+LFED VKSIL+CNTSWRRTL MA +LCE+QAKM E+KKRKRKG             GNFPNA E+  M VE LK H LGYRA+Y
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSVELLKKHLLGYRADY

Query:  IINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV
        ++ FA  VE+G+I+LQ  E+ + S +AFPKIKGFGPFA ANI MCLGFYHQ+PIDTETIRHLKQVHG ++C+KKTVGEDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSLDYHKISGTNVNL
        +YYE+KFGKLSEL S DYHKISG+ ++L
Subjt:  EYYESKFGKLSELCSLDYHKISGTNVNL

XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia]5.5e-12569.82Show/hide
Query:  KTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDED
        + I LNL   + S FDLE+AVCNHG FMM PNKWIPSSKTLQRPLRL++S TSV VSI+Q SS LL+IQIH+S S  P D+QAILDQV RMLRITE+DE+
Subjt:  KTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDED

Query:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKM-----SETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSV
        ++R F NLH +AKE+GFGR+FRSPTLFEDAVKSILLCN +WRRTLAMAG+LCELQAK+     ++ KKRKRKGKG     E E  GGNFP A EL  MSV
Subjt:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKM-----SETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSV

Query:  ELLKKHLLGYRADYIINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDK
         LL+KH +GYRA YII+ A  V+NGKIDLQ+ E AL    +FPKIKGFGPF  AN+ MCLG Y ++PIDTETIRHLKQVHGR+ C+ KT  E VK +YDK
Subjt:  ELLKKHLLGYRADYIINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDK

Query:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT
        YAPFQCLAYW+ELVEYYES+FGKLSEL   DY KISGT
Subjt:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT

XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata]1.6e-13773.48Show/hide
Query:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
        V +SDF+LEKAVCNHG FMM+PN+WIPSSKTLQRPLRLSNSDTS+ VSINQSSS LL++QIH+ RSLPPKD+ AILDQVARMLR+TEKDED++R+F NLH
Subjt:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDEDDLRKFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSVELLKKHLLGYRADY
        P AK++GFGRIFRSP+LFED VKSIL+CNTSWRRTL MA +LCE+QAKM E+KKRKRKG             GNFPNA E+  M VE LK H LGYRA+Y
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSVELLKKHLLGYRADY

Query:  IINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV
        ++ FA  VE+G+I+LQ  E+ + S +AFPKIKGFGPFA ANI MCLGFYHQ+PIDTETIRHLKQVHG ++C+KKTVGEDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSLDYHKISGTNVNL
        +YYE+KFGKLSEL S DYHKISG+ ++L
Subjt:  EYYESKFGKLSELCSLDYHKISGTNVNL

XP_034673386.1 uncharacterized protein LOC117904736 [Vitis riparia]1.0e-9152.77Show/hide
Query:  TIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQ-SSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDED
        T+H+ L     S F LE AVCNHG FMM+PN WIPS+KTLQRPLRL++  TS+  SI+   +   + +++H +  + P DQQ IL QVARMLRI+++DE 
Subjt:  TIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQ-SSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDED

Query:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRK---RKGKGRGGQREYEARGGNFPNATELSGMSVEL
        D+++FH + P AK   FGRIFRSP++FED VKSILLCN  WRRTL MA  LCELQ ++   K+++    + K +    E ++  GNFPN+ EL+ +  E 
Subjt:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRK---RKGKGRGGQREYEARGGNFPNATELSGMSVEL

Query:  LKKHL-LGYRADYIINFATCVENGKIDLQRFEEALCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVK
        LKK   LGYRA  I+  AT +ENG++ LQ FE+AL +       +   K +GFGPFA ANI MC+G+Y ++P D+ET RH+K++HGR    KK   +DVK
Subjt:  LKKHL-LGYRADYIINFATCVENGKIDLQRFEEALCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVK

Query:  QIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT
        +IYDKYAPFQCLAYWLEL EYY+S+FGKLSEL   +YH I+G+
Subjt:  QIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT

XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida]2.1e-12476.05Show/hide
Query:  MKTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRS-LPPKDQQAILDQVARMLRITEKD
        MKTIHLNL  VS+SDFDLEKAVCNHG FMM PN+WIPSSKTLQRPLRLS+S +SVFVSINQ SS LL+IQIH+S + L P+DQQAILDQV RMLR+TEKD
Subjt:  MKTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRS-LPPKDQQAILDQVARMLRITEKD

Query:  EDDLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSVELL
        ED+LRKF +LHPRAK+MGFGR+FRSPTLFEDA+KSILLCNT+W+RTLAMAG+LCELQAKM     RKRK K   G++E E   GNFPNA E+  M VELL
Subjt:  EDDLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSVELL

Query:  KKHLLGYRADYIINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAP
        KKH LGYRA YIINFA CV++GKIDLQ       + N FPKIKGFGPFA AN+ MCLG Y Q+PIDTETIRHLKQVHGR+FC+ KTV EDVKQIYDKYAP
Subjt:  KKHLLGYRADYIINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAP

Query:  FQCLAYWLE
        FQCLAYWLE
Subjt:  FQCLAYWLE

TrEMBL top hitse value%identityAlignment
A0A438CJ05 Uncharacterized protein1.6e-9052.48Show/hide
Query:  TIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQ-SSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDED
        T+H+ L     S F+LE AVCNHG FMM+PN WIPS+KTLQRPLRL++  TS+  SI+   +   + +++H +  + P DQ+ IL  VARMLRI+++DE 
Subjt:  TIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQ-SSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDED

Query:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRK---RKGKGRGGQREYEARGGNFPNATELSGMSVEL
        D+++FH + P AK   FGRIFRSP++FED VKSILLCN  WRRTL MA  LCELQ ++   K+++    + K +    E ++  GNFPN+ EL+ +  E 
Subjt:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRK---RKGKGRGGQREYEARGGNFPNATELSGMSVEL

Query:  LKKHL-LGYRADYIINFATCVENGKIDLQRFEEALCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVK
        LKK   LGYRA  I+  AT +ENG++ LQ FE+AL +       +   K KGFGPFA ANI MC+G+Y ++P D+ET RH+K++HGR    KK   +DVK
Subjt:  LKKHL-LGYRADYIINFATCVENGKIDLQRFEEALCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVK

Query:  QIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT
        +IYDKYAPFQCLAYWLEL EYY+S+FGKLSEL   +YH I+G+
Subjt:  QIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT

A0A6A1W9S6 Uncharacterized protein1.6e-9049.87Show/hide
Query:  IHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSS---FLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDE
        + L LE   +  F++EKAVCNHG FMM+PN WIPS+KTLQRPLRL+NS  SV VSI+  +S     + IQ+H +  + P+D++AIL+QVARMLRI+E+DE
Subjt:  IHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSS---FLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDE

Query:  DDLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSE-------TKKRKRKGKGRGGQREYEARG-----------
         +LR+F NLHP AKE GFGR FRSP+LFEDA+KS+LLCN +W RTL MA  LCELQ +++            ++  + RG +R+   R            
Subjt:  DDLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSE-------TKKRKRKGKGRGGQREYEARG-----------

Query:  ----------------GNFPNATELSGMSVELLKKHL-LGYRADYIINFATCVENGKIDLQRFEE---ALCSR--NAFPKIKGFGPFAIANIRMCLGFYH
                        GNFP++ E++ ++   L+ H  LGYRA YI+  A  VE+GK+ L+ F++   A C        KIKGFGPFA AN+ MC+G+Y 
Subjt:  ----------------GNFPNATELSGMSVELLKKHL-LGYRADYIINFATCVENGKIDLQRFEE---ALCSR--NAFPKIKGFGPFAIANIRMCLGFYH

Query:  QVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGTN
         VP+DTET+RHL+QVHGR+   K+TV EDVK +YDK+APFQ LAYW EL+E+YE KFGKLSEL +  Y  +SG++
Subjt:  QVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGTN

A0A6J1DS88 uncharacterized protein LOC1110238222.6e-12569.82Show/hide
Query:  KTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDED
        + I LNL   + S FDLE+AVCNHG FMM PNKWIPSSKTLQRPLRL++S TSV VSI+Q SS LL+IQIH+S S  P D+QAILDQV RMLRITE+DE+
Subjt:  KTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDED

Query:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKM-----SETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSV
        ++R F NLH +AKE+GFGR+FRSPTLFEDAVKSILLCN +WRRTLAMAG+LCELQAK+     ++ KKRKRKGKG     E E  GGNFP A EL  MSV
Subjt:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKM-----SETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSV

Query:  ELLKKHLLGYRADYIINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDK
         LL+KH +GYRA YII+ A  V+NGKIDLQ+ E AL    +FPKIKGFGPF  AN+ MCLG Y ++PIDTETIRHLKQVHGR+ C+ KT  E VK +YDK
Subjt:  ELLKKHLLGYRADYIINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDK

Query:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT
        YAPFQCLAYW+ELVEYYES+FGKLSEL   DY KISGT
Subjt:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT

A0A6J1GJ25 uncharacterized protein LOC1114546597.9e-13873.48Show/hide
Query:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
        V +SDF+LEKAVCNHG FMM+PN+WIPSSKTLQRPLRLSNSDTS+ VSINQSSS LL++QIH+ RSLPPKD+ AILDQVARMLR+TEKDED++R+F NLH
Subjt:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDEDDLRKFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSVELLKKHLLGYRADY
        P AK++GFGRIFRSP+LFED VKSIL+CNTSWRRTL MA +LCE+QAKM E+KKRKRKG             GNFPNA E+  M VE LK H LGYRA+Y
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSVELLKKHLLGYRADY

Query:  IINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV
        ++ FA  VE+G+I+LQ  E+ + S +AFPKIKGFGPFA ANI MCLGFYHQ+PIDTETIRHLKQVHG ++C+KKTVGEDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFATCVENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSLDYHKISGTNVNL
        +YYE+KFGKLSEL S DYHKISG+ ++L
Subjt:  EYYESKFGKLSELCSLDYHKISGTNVNL

A0A6P4BPN5 uncharacterized protein LOC1074341916.8e-8952.38Show/hide
Query:  SDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQ---SSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
        S F+LEKAVCNHG FMM+PN WIPS+KTLQRPLRLS+  TS  VSI+     S  LL I +H+       D+ AIL QV RMLRI+E+DE D+R+F    
Subjt:  SDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQ---SSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDEDDLRKFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQR----EYEARGGNFPNATELSGMSVELLKKH--LL
        P+AK  GFGR+FRSP++FEDAVKSILLCN +W ++L MA  LCELQ +++ T  RK KGK + G+     + E R GNFP + EL+ +    L++   +L
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQR----EYEARGGNFPNATELSGMSVELLKKH--LL

Query:  GYRADYIINFATCVENGKIDLQRFEEALCSR--------NAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDK
        GYRA YI+  A  VE+G++ L+  EE +                + GFGP+  AN+ MC+G Y  VP+DTETIRH++QVHGR+ C KKTV + V++IYDK
Subjt:  GYRADYIINFATCVENGKIDLQRFEEALCSR--------NAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDK

Query:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKIS
        +APFQCLAYW+EL++ YE KFGKLSEL    Y  +S
Subjt:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGACAATTCATTTGAATTTGGAATCAGTTTCAATCAGTGATTTTGATCTTGAGAAAGCAGTTTGCAATCATGGGGTGTTTATGATGTCTCCAAACAAATGGATTCC
TTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCCAATTCAGACACTTCTGTTTTTGTCTCTATAAACCAATCTTCTTCTTTTCTCCTCTCCATTCAAATTCACACTA
GTCGCTCTCTTCCTCCTAAAGATCAACAAGCTATATTGGATCAAGTGGCTCGAATGCTTAGAATTACGGAGAAAGATGAAGATGACCTTAGAAAATTTCATAATTTGCAT
CCGAGAGCCAAAGAGATGGGATTTGGTCGAATTTTTCGATCTCCAACTCTTTTTGAAGATGCAGTAAAGTCCATCCTTTTGTGCAATACCTCGTGGAGAAGAACGTTGGC
AATGGCTGGAGAGCTATGTGAGTTACAAGCCAAAATGAGCGAAACTAAGAAGAGAAAGAGAAAAGGCAAAGGACGAGGAGGGCAAAGGGAATACGAGGCCAGGGGAGGGA
ATTTTCCAAATGCCACAGAACTTTCTGGAATGAGCGTTGAATTGTTGAAGAAGCATTTACTTGGTTATAGAGCTGATTACATCATCAATTTCGCTACATGCGTTGAAAAT
GGCAAAATCGATCTCCAAAGATTTGAAGAAGCACTTTGCTCTCGTAATGCTTTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAATAGCCAATATTCGCATGTGCCTCGG
ATTTTACCATCAAGTTCCAATTGATACCGAGACTATAAGACACTTAAAACAGGTACATGGAAGAGAATTTTGCAGCAAGAAGACAGTAGGGGAAGATGTCAAACAAATTT
ACGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTTGAGTATTATGAGAGCAAATTCGGGAAGCTAAGTGAATTGTGTTCCCTTGATTATCACAAG
ATAAGTGGCACCAACGTCAATCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGACAATTCATTTGAATTTGGAATCAGTTTCAATCAGTGATTTTGATCTTGAGAAAGCAGTTTGCAATCATGGGGTGTTTATGATGTCTCCAAACAAATGGATTCC
TTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCCAATTCAGACACTTCTGTTTTTGTCTCTATAAACCAATCTTCTTCTTTTCTCCTCTCCATTCAAATTCACACTA
GTCGCTCTCTTCCTCCTAAAGATCAACAAGCTATATTGGATCAAGTGGCTCGAATGCTTAGAATTACGGAGAAAGATGAAGATGACCTTAGAAAATTTCATAATTTGCAT
CCGAGAGCCAAAGAGATGGGATTTGGTCGAATTTTTCGATCTCCAACTCTTTTTGAAGATGCAGTAAAGTCCATCCTTTTGTGCAATACCTCGTGGAGAAGAACGTTGGC
AATGGCTGGAGAGCTATGTGAGTTACAAGCCAAAATGAGCGAAACTAAGAAGAGAAAGAGAAAAGGCAAAGGACGAGGAGGGCAAAGGGAATACGAGGCCAGGGGAGGGA
ATTTTCCAAATGCCACAGAACTTTCTGGAATGAGCGTTGAATTGTTGAAGAAGCATTTACTTGGTTATAGAGCTGATTACATCATCAATTTCGCTACATGCGTTGAAAAT
GGCAAAATCGATCTCCAAAGATTTGAAGAAGCACTTTGCTCTCGTAATGCTTTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAATAGCCAATATTCGCATGTGCCTCGG
ATTTTACCATCAAGTTCCAATTGATACCGAGACTATAAGACACTTAAAACAGGTACATGGAAGAGAATTTTGCAGCAAGAAGACAGTAGGGGAAGATGTCAAACAAATTT
ACGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTTGAGTATTATGAGAGCAAATTCGGGAAGCTAAGTGAATTGTGTTCCCTTGATTATCACAAG
ATAAGTGGCACCAACGTCAATCTTTGA
Protein sequenceShow/hide protein sequence
MKTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSDTSVFVSINQSSSFLLSIQIHTSRSLPPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGELCELQAKMSETKKRKRKGKGRGGQREYEARGGNFPNATELSGMSVELLKKHLLGYRADYIINFATCVEN
GKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCSKKTVGEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHK
ISGTNVNL