; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg038383 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg038383
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDNA glycosylase
Genome locationscaffold1:56892913..56894663
RNA-Seq ExpressionSpg038383
SyntenySpg038383
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia]7.6e-13572.56Show/hide
Query:  VSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
        V +SDF+LEKAVCNHG FMM PN+WIPSSKTLQRPLRLSNS+TS+ VSINQSSS LL++QIH+ RSL PKD+ AILDQVARMLR+TEKDED++R+F NLH
Subjt:  VSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDEDDLRKFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSVELLKKHLLGYRADY
        P AK++GFGRIFRSP+LFED VKSIL+CNTSWRRTL MA  LCE+QAKM E++KRKRKG             GNFPNA E+  M VE LK H LGYRA+Y
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSVELLKKHLLGYRADY

Query:  IINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKYAPFQCLAYWLELV
        ++ FA S+E+G+I+LQ  E+ + S +AFPKIKGFGPFA ANI MCLGFYHQ+PIDTETIRHLKQVHG ++C KKTVGEDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSVDYHKISGTNVNL
        +YYE+KFGKLSEL S DYHKISG+ ++L
Subjt:  EYYESKFGKLSELCSVDYHKISGTNVNL

XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia]4.9e-12669.82Show/hide
Query:  KTIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDED
        + I LNL   + S FDLE+AVCNHG FMMPPNKWIPSSKTLQRPLRL++S TSV VSI+Q SS LL+IQIH+  S SP D+QAILDQV RMLRITE+DE+
Subjt:  KTIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDED

Query:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM-----NETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSV
        ++R F NLH +AKE+GFGR+FRSPTLFEDAVKSILLCN +WRRTLAMAG LCELQAK+      + +KRKRKGKG     E E+ GGNFP A EL  MSV
Subjt:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM-----NETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSV

Query:  ELLKKHLLGYRADYIINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDK
         LL+KH +GYRA YII+ A  ++NGKIDLQ+ E AL    +FPKIKGFGPF  AN+ MCLG Y ++PIDTETIRHLKQVHGR+ CN KT  E VK +YDK
Subjt:  ELLKKHLLGYRADYIINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDK

Query:  YAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGT
        YAPFQCLAYW+ELVEYYES+FGKLSEL   DY KISGT
Subjt:  YAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGT

XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata]5.8e-13572.56Show/hide
Query:  VSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
        V +SDF+LEKAVCNHG FMM PN+WIPSSKTLQRPLRLSNS+TS+ VSINQSSS LL++QIH+ RSL PKD+ AILDQVARMLR+TEKDED++R+F NLH
Subjt:  VSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDEDDLRKFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSVELLKKHLLGYRADY
        P AK++GFGRIFRSP+LFED VKSIL+CNTSWRRTL MA  LCE+QAKM E++KRKRKG             GNFPNA E+  M VE LK H LGYRA+Y
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSVELLKKHLLGYRADY

Query:  IINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKYAPFQCLAYWLELV
        ++ FA S+E+G+I+LQ  E+ + S +AFPKIKGFGPFA ANI MCLGFYHQ+PIDTETIRHLKQVHG ++C KKTVGEDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSVDYHKISGTNVNL
        +YYE+KFGKLSEL S DYHKISG+ ++L
Subjt:  EYYESKFGKLSELCSVDYHKISGTNVNL

XP_034673386.1 uncharacterized protein LOC117904736 [Vitis riparia]1.4e-9354.39Show/hide
Query:  TIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ-SSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDED
        T+H+ L     S F LE AVCNHG FMM PN WIPS+KTLQRPLRL++  TS+  SI+   +   + +++H    +SP DQQ IL QVARMLRI+++DE 
Subjt:  TIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ-SSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDED

Query:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM-NETRKRKRKGKGRGGQREYEVRG-GNFPNAIELSGMSVELL
        D+++FH + P AK   FGRIFRSP++FED VKSILLCN  WRRTL MA  LCELQ ++    RKR    + +      EV+  GNFPN++EL+ +  E L
Subjt:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM-NETRKRKRKGKGRGGQREYEVRG-GNFPNAIELSGMSVELL

Query:  KKHL-LGYRADYIINFATSIENGKIDLQRFEEALCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQ
        KK   LGYRA  I+  ATSIENG++ LQ FE+AL +       +   K +GFGPFA ANI MC+G+Y ++P D+ET RH+K++HGR    KK   +DVK+
Subjt:  KKHL-LGYRADYIINFATSIENGKIDLQRFEEALCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQ

Query:  IYDKYAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGT
        IYDKYAPFQCLAYWLEL EYY+S+FGKLSEL   +YH I+G+
Subjt:  IYDKYAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGT

XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida]2.4e-12576.21Show/hide
Query:  MKTIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRS-LSPKDQQAILDQVARMLRITEKD
        MKTIHLNL  VS+SDFDLEKAVCNHG FMMPPN+WIPSSKTLQRPLRLS+S++SVFVSINQ SS LL+IQIH+  + LSP+DQQAILDQV RMLR+TEKD
Subjt:  MKTIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRS-LSPKDQQAILDQVARMLRITEKD

Query:  EDDLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNE--TRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSVE
        ED+LRKF +LHPRAK+MGFGR+FRSPTLFEDA+KSILLCNT+W+RTLAMAG LCELQAKM    TRKRKRK     G++E E+  GNFPNA E+  M VE
Subjt:  EDDLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNE--TRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSVE

Query:  LLKKHLLGYRADYIINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKY
        LLKKH LGYRA YIINFA  +++GKIDLQ       + N FPKIKGFGPFA AN+ MCLG Y Q+PIDTETIRHLKQVHGR+FCN KTV EDVKQIYDKY
Subjt:  LLKKHLLGYRADYIINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKY

Query:  APFQCLAYWLE
        APFQCLAYWLE
Subjt:  APFQCLAYWLE

TrEMBL top hitse value%identityAlignment
A0A438CJ05 Uncharacterized protein2.3e-9254.09Show/hide
Query:  TIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ-SSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDED
        T+H+ L     S F+LE AVCNHG FMM PN WIPS+KTLQRPLRL++  TS+  SI+   +   + +++H    +SP DQ+ IL  VARMLRI+++DE 
Subjt:  TIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ-SSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDED

Query:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM-NETRKRKRKGKGRGGQREYEVRG-GNFPNAIELSGMSVELL
        D+++FH + P AK   FGRIFRSP++FED VKSILLCN  WRRTL MA  LCELQ ++    RKR    + +      EV+  GNFPN++EL+ +  E L
Subjt:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM-NETRKRKRKGKGRGGQREYEVRG-GNFPNAIELSGMSVELL

Query:  KKHL-LGYRADYIINFATSIENGKIDLQRFEEALCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQ
        KK   LGYRA  I+  ATSIENG++ LQ FE+AL +       +   K KGFGPFA ANI MC+G+Y ++P D+ET RH+K++HGR    KK   +DVK+
Subjt:  KKHL-LGYRADYIINFATSIENGKIDLQRFEEALCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQ

Query:  IYDKYAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGT
        IYDKYAPFQCLAYWLEL EYY+S+FGKLSEL   +YH I+G+
Subjt:  IYDKYAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGT

A0A6A1W9S6 Uncharacterized protein1.5e-9150.53Show/hide
Query:  IHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSS---FLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDE
        + L LE   +  F++EKAVCNHG FMM PN WIPS+KTLQRPLRL+NS  SV VSI+  +S     + IQ+H    +SP+D++AIL+QVARMLRI+E+DE
Subjt:  IHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSS---FLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDE

Query:  DDLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM----------NETRKRKRKGKGRGGQREYEVRG--------
         +LR+F NLHP AKE GFGR FRSP+LFEDA+KS+LLCN +W RTL MA  LCELQ ++          N  R+  RK   RG +R+   R         
Subjt:  DDLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM----------NETRKRKRKGKGRGGQREYEVRG--------

Query:  -------------------GNFPNAIELSGMSVELLKKHL-LGYRADYIINFATSIENGKIDLQRFEE---ALCSR--NAFPKIKGFGPFAIANIRMCLG
                           GNFP++ E++ ++   L+ H  LGYRA YI+  A  +E+GK+ L+ F++   A C        KIKGFGPFA AN+ MC+G
Subjt:  -------------------GNFPNAIELSGMSVELLKKHL-LGYRADYIINFATSIENGKIDLQRFEE---ALCSR--NAFPKIKGFGPFAIANIRMCLG

Query:  FYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGTN
        +Y  VP+DTET+RHL+QVHGR+   K+TV EDVK +YDK+APFQ LAYW EL+E+YE KFGKLSEL +  Y  +SG++
Subjt:  FYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGTN

A0A6J1DS88 uncharacterized protein LOC1110238222.4e-12669.82Show/hide
Query:  KTIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDED
        + I LNL   + S FDLE+AVCNHG FMMPPNKWIPSSKTLQRPLRL++S TSV VSI+Q SS LL+IQIH+  S SP D+QAILDQV RMLRITE+DE+
Subjt:  KTIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDED

Query:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM-----NETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSV
        ++R F NLH +AKE+GFGR+FRSPTLFEDAVKSILLCN +WRRTLAMAG LCELQAK+      + +KRKRKGKG     E E+ GGNFP A EL  MSV
Subjt:  DLRKFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKM-----NETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSV

Query:  ELLKKHLLGYRADYIINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDK
         LL+KH +GYRA YII+ A  ++NGKIDLQ+ E AL    +FPKIKGFGPF  AN+ MCLG Y ++PIDTETIRHLKQVHGR+ CN KT  E VK +YDK
Subjt:  ELLKKHLLGYRADYIINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDK

Query:  YAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGT
        YAPFQCLAYW+ELVEYYES+FGKLSEL   DY KISGT
Subjt:  YAPFQCLAYWLELVEYYESKFGKLSELCSVDYHKISGT

A0A6J1GJ25 uncharacterized protein LOC1114546592.8e-13572.56Show/hide
Query:  VSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
        V +SDF+LEKAVCNHG FMM PN+WIPSSKTLQRPLRLSNS+TS+ VSINQSSS LL++QIH+ RSL PKD+ AILDQVARMLR+TEKDED++R+F NLH
Subjt:  VSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDEDDLRKFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSVELLKKHLLGYRADY
        P AK++GFGRIFRSP+LFED VKSIL+CNTSWRRTL MA  LCE+QAKM E++KRKRKG             GNFPNA E+  M VE LK H LGYRA+Y
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSVELLKKHLLGYRADY

Query:  IINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKYAPFQCLAYWLELV
        ++ FA S+E+G+I+LQ  E+ + S +AFPKIKGFGPFA ANI MCLGFYHQ+PIDTETIRHLKQVHG ++C KKTVGEDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFATSIENGKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSVDYHKISGTNVNL
        +YYE+KFGKLSEL S DYHKISG+ ++L
Subjt:  EYYESKFGKLSELCSVDYHKISGTNVNL

A0A6P4BPN5 uncharacterized protein LOC1074341916.1e-9053.13Show/hide
Query:  SDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ---SSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
        S F+LEKAVCNHG FMM PN WIPS+KTLQRPLRLS+  TS  VSI+     S  LL I +H+    S  D+ AIL QV RMLRI+E+DE D+R+F    
Subjt:  SDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ---SSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDEDDLRKFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNETRKRKRKGKGRGGQR---EYEVRGGNFPNAIELSGMSVELLKKH--LLG
        P+AK  GFGR+FRSP++FEDAVKSILLCN +W ++L MA  LCELQ ++  TRK K K K RG      + EVR GNFP + EL+ +    L++   +LG
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNETRKRKRKGKGRGGQR---EYEVRGGNFPNAIELSGMSVELLKKH--LLG

Query:  YRADYIINFATSIENGKIDLQRFEEALCSR--------NAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKY
        YRA YI+  A ++E+G++ L+  EE +                + GFGP+  AN+ MC+G Y  VP+DTETIRH++QVHGR+ C+KKTV + V++IYDK+
Subjt:  YRADYIINFATSIENGKIDLQRFEEALCSR--------NAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKY

Query:  APFQCLAYWLELVEYYESKFGKLSELCSVDYHKIS
        APFQCLAYW+EL++ YE KFGKLSEL    Y  +S
Subjt:  APFQCLAYWLELVEYYESKFGKLSELCSVDYHKIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGACAATTCATTTGAATTTGGAATCAGTTTCAATCAGTGATTTTGATCTTGAGAAAGCAGTTTGCAATCATGGGGTGTTTATGATGCCTCCAAACAAATGGATTCC
TTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCCAATTCAAACACTTCTGTTTTTGTCTCTATAAACCAATCTTCTTCTTTTCTCCTCTCCATTCAAATTCACACTT
GTCGCTCTCTTTCTCCTAAAGATCAACAAGCTATATTGGATCAAGTGGCTCGAATGCTTAGAATTACGGAGAAAGATGAAGATGACCTTAGAAAATTTCATAATTTGCAT
CCGAGAGCCAAAGAGATGGGATTTGGTCGGATTTTTCGATCTCCAACTCTTTTTGAAGATGCAGTGAAGTCCATCCTTCTGTGCAATACCTCGTGGAGAAGGACATTGGC
AATGGCCGGAGGGCTATGTGAGTTACAAGCCAAAATGAACGAAACTAGGAAGAGAAAGAGAAAAGGCAAAGGACGAGGAGGGCAAAGGGAATACGAGGTCAGAGGAGGGA
ATTTTCCAAATGCCATAGAACTTTCTGGAATGAGCGTTGAATTGTTGAAGAAGCATTTACTTGGTTATAGAGCTGATTACATCATCAATTTCGCTACAAGCATTGAAAAT
GGCAAAATCGATCTCCAAAGATTTGAAGAAGCACTTTGCTCTCGTAATGCTTTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAATAGCCAATATTCGCATGTGCCTCGG
ATTTTACCATCAAGTTCCAATTGATACCGAGACTATAAGACACTTAAAACAGGTACATGGAAGAGAATTTTGCAACAAGAAGACAGTAGGGGAAGATGTCAAACAAATTT
ACGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTTGAGTATTATGAAAGCAAATTCGGGAAGCTAAGTGAATTGTGTTCCGTTGATTATCACAAG
ATCAGTGGCACCAACGTTAACCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGACAATTCATTTGAATTTGGAATCAGTTTCAATCAGTGATTTTGATCTTGAGAAAGCAGTTTGCAATCATGGGGTGTTTATGATGCCTCCAAACAAATGGATTCC
TTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCCAATTCAAACACTTCTGTTTTTGTCTCTATAAACCAATCTTCTTCTTTTCTCCTCTCCATTCAAATTCACACTT
GTCGCTCTCTTTCTCCTAAAGATCAACAAGCTATATTGGATCAAGTGGCTCGAATGCTTAGAATTACGGAGAAAGATGAAGATGACCTTAGAAAATTTCATAATTTGCAT
CCGAGAGCCAAAGAGATGGGATTTGGTCGGATTTTTCGATCTCCAACTCTTTTTGAAGATGCAGTGAAGTCCATCCTTCTGTGCAATACCTCGTGGAGAAGGACATTGGC
AATGGCCGGAGGGCTATGTGAGTTACAAGCCAAAATGAACGAAACTAGGAAGAGAAAGAGAAAAGGCAAAGGACGAGGAGGGCAAAGGGAATACGAGGTCAGAGGAGGGA
ATTTTCCAAATGCCATAGAACTTTCTGGAATGAGCGTTGAATTGTTGAAGAAGCATTTACTTGGTTATAGAGCTGATTACATCATCAATTTCGCTACAAGCATTGAAAAT
GGCAAAATCGATCTCCAAAGATTTGAAGAAGCACTTTGCTCTCGTAATGCTTTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAATAGCCAATATTCGCATGTGCCTCGG
ATTTTACCATCAAGTTCCAATTGATACCGAGACTATAAGACACTTAAAACAGGTACATGGAAGAGAATTTTGCAACAAGAAGACAGTAGGGGAAGATGTCAAACAAATTT
ACGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTTGAGTATTATGAAAGCAAATTCGGGAAGCTAAGTGAATTGTGTTCCGTTGATTATCACAAG
ATCAGTGGCACCAACGTTAACCTTTGA
Protein sequenceShow/hide protein sequence
MKTIHLNLESVSISDFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSNTSVFVSINQSSSFLLSIQIHTCRSLSPKDQQAILDQVARMLRITEKDEDDLRKFHNLH
PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAGGLCELQAKMNETRKRKRKGKGRGGQREYEVRGGNFPNAIELSGMSVELLKKHLLGYRADYIINFATSIEN
GKIDLQRFEEALCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGREFCNKKTVGEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSVDYHK
ISGTNVNL