; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010503 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010503
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA glycosylase
Genome locationchr1:164012..165780
RNA-Seq ExpressionLag0010503
SyntenyLag0010503
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia]7.3e-13873.78Show/hide
Query:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDEDDLRNFHNLH
        V +SDF+LEKAVCNHG FMM+PN+WIPSSKTLQRPLRLSNS+TS+ VSINQ+SSSLL++QIH+ RSLPPKD+ AILDQVARMLR+TEKDED++R F NLH
Subjt:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDEDDLRNFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSVELLKKHLLGYRADY
        P AK++GFGRIFRSP+LFED VKSIL+CNTSWRRTL MAE+LCE+QAKM E+KKRKRKG             GNFPNA E+ RM VE LK H LGYRA+Y
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSVELLKKHLLGYRADY

Query:  IINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV
        ++ F+ SVE+G+I+LQ  E+ + S +AFPKIKGFGPFA ANI MCLGFYHQ+PIDTETIRHLKQVHG ++C+KKTVGEDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSLDYHKISGTSVNL
        +YYE+KFGKLSEL S DYHKISG++++L
Subjt:  EYYESKFGKLSELCSLDYHKISGTSVNL

XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia]1.3e-12368.73Show/hide
Query:  KTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDED
        + I LNL   + S FDLE+AVCNHG FMM PNKWIPSSKTLQRPLRL++S TSV VSI+Q SS LL+IQIH+  S  P D+QAILDQV RMLRITE+DE+
Subjt:  KTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDED

Query:  DLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKM-----SETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSV
        ++RNF NLH +AKE+GFGR+FRSPTLFEDAVKSILLCN +WRRTLAMA +LCELQAK+     ++ KKRKRKGKG     E E  GGNFP A EL RMSV
Subjt:  DLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKM-----SETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSV

Query:  ELLKKHLLGYRADYIINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDK
         LL+KH +GYRA YII+ +  V+NGKIDLQ+ E  L    +FPKIKGFGPF  AN+ MCLG Y ++PIDTETIRHLKQVHG++ C+ KT  E VK +YDK
Subjt:  ELLKKHLLGYRADYIINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDK

Query:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGTS
        YAPFQCLAYW+ELVEYYES+FGKLSEL   DY KISGT+
Subjt:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGTS

XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata]5.6e-13873.78Show/hide
Query:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDEDDLRNFHNLH
        V +SDF+LEKAVCNHG FMM+PN+WIPSSKTLQRPLRLSNS+TS+ VSINQ+SSSLL++QIH+ RSLPPKD+ AILDQVARMLR+TEKDED++R F NLH
Subjt:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDEDDLRNFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSVELLKKHLLGYRADY
        P AK++GFGRIFRSP+LFED VKSIL+CNTSWRRTL MAE+LCE+QAKM E+KKRKRKG             GNFPNA E+ RM VE LK H LGYRA+Y
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSVELLKKHLLGYRADY

Query:  IINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV
        ++ F+ SVE+G+I+LQ  E+ + S +AFPKIKGFGPFA ANI MCLGFYHQ+PIDTETIRHLKQVHG ++C+KKTVGEDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSLDYHKISGTSVNL
        +YYE+KFGKLSEL S DYHKISG++++L
Subjt:  EYYESKFGKLSELCSLDYHKISGTSVNL

XP_034673386.1 uncharacterized protein LOC117904736 [Vitis riparia]5.1e-9152.19Show/hide
Query:  TIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ-TSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDED
        T+H+ L     S F LE AVCNHG FMM+PN WIPS+KTLQRPLRL++  TS+  SI+   + + + +++H    + P DQQ IL QVARMLRI+++DE 
Subjt:  TIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ-TSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDED

Query:  DLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRK---RKGKGRGGKREYEARGGNFPNATELSRMSVEL
        D++ FH + P AK   FGRIFRSP++FED VKSILLCN  WRRTL MA+ LCELQ ++   K+++    + K +    E ++  GNFPN+ EL+ +  E 
Subjt:  DLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRK---RKGKGRGGKREYEARGGNFPNATELSRMSVEL

Query:  LKKHL-LGYRADYIINFSTSVENGKIDLQRFEETLCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVK
        LKK   LGYRA  I+  +TS+ENG++ LQ FE+ L +       +   K +GFGPFA ANI MC+G+Y ++P D+ET RH+K++HG+    KK   +DVK
Subjt:  LKKHL-LGYRADYIINFSTSVENGKIDLQRFEETLCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVK

Query:  QIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT
        +IYDKYAPFQCLAYWLEL EYY+S+FGKLSEL   +YH I+G+
Subjt:  QIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT

XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida]2.8e-12174.43Show/hide
Query:  MKTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRS-LPPKDQQAILDQVARMLRITEKD
        MKTIHLNL  VS+SDFDLEKAVCNHG FMM PN+WIPSSKTLQRPLRLS+S++SVFVSINQ SSSLL+IQIH+  + L P+DQQAILDQV RMLR+TEKD
Subjt:  MKTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRS-LPPKDQQAILDQVARMLRITEKD

Query:  EDDLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSVELL
        ED+LR F +LHPRAK+MGFGR+FRSPTLFEDA+KSILLCNT+W+RTLAMA +LCELQAKM     RKRK K      E E   GNFPNA E+ RM VELL
Subjt:  EDDLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSVELL

Query:  KKHLLGYRADYIINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAP
        KKH LGYRA YIINF+  V++GKIDLQ       + N FPKIKGFGPFA AN+ MCLG Y Q+PIDTETIRHLKQVHG++FC+ KTV EDVKQIYDKYAP
Subjt:  KKHLLGYRADYIINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAP

Query:  FQCLAYWLE
        FQCLAYWLE
Subjt:  FQCLAYWLE

TrEMBL top hitse value%identityAlignment
A0A438CJ05 Uncharacterized protein8.0e-9051.9Show/hide
Query:  TIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ-TSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDED
        T+H+ L     S F+LE AVCNHG FMM+PN WIPS+KTLQRPLRL++  TS+  SI+   + + + +++H    + P DQ+ IL  VARMLRI+++DE 
Subjt:  TIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQ-TSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDED

Query:  DLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRK---RKGKGRGGKREYEARGGNFPNATELSRMSVEL
        D++ FH + P AK   FGRIFRSP++FED VKSILLCN  WRRTL MA+ LCELQ ++   K+++    + K +    E ++  GNFPN+ EL+ +  E 
Subjt:  DLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRK---RKGKGRGGKREYEARGGNFPNATELSRMSVEL

Query:  LKKHL-LGYRADYIINFSTSVENGKIDLQRFEETLCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVK
        LKK   LGYRA  I+  +TS+ENG++ LQ FE+ L +       +   K KGFGPFA ANI MC+G+Y ++P D+ET RH+K++HG+    KK   +DVK
Subjt:  LKKHL-LGYRADYIINFSTSVENGKIDLQRFEETLCS------RNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVK

Query:  QIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT
        +IYDKYAPFQCLAYWLEL EYY+S+FGKLSEL   +YH I+G+
Subjt:  QIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT

A0A6A1W9S6 Uncharacterized protein4.7e-9049.47Show/hide
Query:  IHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSS---SLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDE
        + L LE   +  F++EKAVCNHG FMM+PN WIPS+KTLQRPLRL+NS  SV VSI+  +S   + + IQ+H    + P+D++AIL+QVARMLRI+E+DE
Subjt:  IHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSS---SLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDE

Query:  DDLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSE-------TKKRKRKGKGRGGKREYEARG-----------
         +LR F NLHP AKE GFGR FRSP+LFEDA+KS+LLCN +W RTL MA+ LCELQ +++            ++  + RG KR+   R            
Subjt:  DDLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSE-------TKKRKRKGKGRGGKREYEARG-----------

Query:  ----------------GNFPNATELSRMSVELLKKHL-LGYRADYIINFSTSVENGKIDLQRFEE---TLCSR--NAFPKIKGFGPFAIANIRMCLGFYH
                        GNFP++ E++ ++   L+ H  LGYRA YI+  +  VE+GK+ L+ F++     C        KIKGFGPFA AN+ MC+G+Y 
Subjt:  ----------------GNFPNATELSRMSVELLKKHL-LGYRADYIINFSTSVENGKIDLQRFEE---TLCSR--NAFPKIKGFGPFAIANIRMCLGFYH

Query:  QVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT
         VP+DTET+RHL+QVHG++   K+TV EDVK +YDK+APFQ LAYW EL+E+YE KFGKLSEL +  Y  +SG+
Subjt:  QVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGT

A0A6J1DS88 uncharacterized protein LOC1110238226.5e-12468.73Show/hide
Query:  KTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDED
        + I LNL   + S FDLE+AVCNHG FMM PNKWIPSSKTLQRPLRL++S TSV VSI+Q SS LL+IQIH+  S  P D+QAILDQV RMLRITE+DE+
Subjt:  KTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDED

Query:  DLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKM-----SETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSV
        ++RNF NLH +AKE+GFGR+FRSPTLFEDAVKSILLCN +WRRTLAMA +LCELQAK+     ++ KKRKRKGKG     E E  GGNFP A EL RMSV
Subjt:  DLRNFHNLHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKM-----SETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSV

Query:  ELLKKHLLGYRADYIINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDK
         LL+KH +GYRA YII+ +  V+NGKIDLQ+ E  L    +FPKIKGFGPF  AN+ MCLG Y ++PIDTETIRHLKQVHG++ C+ KT  E VK +YDK
Subjt:  ELLKKHLLGYRADYIINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDK

Query:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGTS
        YAPFQCLAYW+ELVEYYES+FGKLSEL   DY KISGT+
Subjt:  YAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKISGTS

A0A6J1GJ25 uncharacterized protein LOC1114546592.7e-13873.78Show/hide
Query:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDEDDLRNFHNLH
        V +SDF+LEKAVCNHG FMM+PN+WIPSSKTLQRPLRLSNS+TS+ VSINQ+SSSLL++QIH+ RSLPPKD+ AILDQVARMLR+TEKDED++R F NLH
Subjt:  VSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDEDDLRNFHNLH

Query:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSVELLKKHLLGYRADY
        P AK++GFGRIFRSP+LFED VKSIL+CNTSWRRTL MAE+LCE+QAKM E+KKRKRKG             GNFPNA E+ RM VE LK H LGYRA+Y
Subjt:  PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSVELLKKHLLGYRADY

Query:  IINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV
        ++ F+ SVE+G+I+LQ  E+ + S +AFPKIKGFGPFA ANI MCLGFYHQ+PIDTETIRHLKQVHG ++C+KKTVGEDVKQIYD YAP+QCLAYWLELV
Subjt:  IINFSTSVENGKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAPFQCLAYWLELV

Query:  EYYESKFGKLSELCSLDYHKISGTSVNL
        +YYE+KFGKLSEL S DYHKISG++++L
Subjt:  EYYESKFGKLSELCSLDYHKISGTSVNL

A0A6P4BPN5 uncharacterized protein LOC1074341916.8e-8952.66Show/hide
Query:  SDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQT---SSSLLSIQIHTCRSLPPK--DQQAILDQVARMLRITEKDEDDLRNFHN
        S F+LEKAVCNHG FMM+PN WIPS+KTLQRPLRLS+  TS  VSI+     S  LL I +H+    PP   D+ AIL QV RMLRI+E+DE D+R F  
Subjt:  SDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQT---SSSLLSIQIHTCRSLPPK--DQQAILDQVARMLRITEKDEDDLRNFHN

Query:  LHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKR----EYEARGGNFPNATELSRMSVELLKKH--
          P+AK  GFGR+FRSP++FEDAVKSILLCN +W ++L MA+ LCELQ +++ T  RK KGK + GK     + E R GNFP + EL+ +    L++   
Subjt:  LHPRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKR----EYEARGGNFPNATELSRMSVELLKKH--

Query:  LLGYRADYIINFSTSVENGKIDLQRFEETLCSR--------NAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIY
        +LGYRA YI+  + +VE+G++ L+  EET+                + GFGP+  AN+ MC+G Y  VP+DTETIRH++QVHG++ C KKTV + V++IY
Subjt:  LLGYRADYIINFSTSVENGKIDLQRFEETLCSR--------NAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIY

Query:  DKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKIS
        DK+APFQCLAYW+EL++ YE KFGKLSEL    Y  +S
Subjt:  DKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHKIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGACAATTCATTTGAATTTGGAATCGGTTTCAATCAGTGATTTTGATCTTGAGAAAGCAGTTTGCAATCATGGGGTGTTTATGATGTCTCCAAACAAATGGATTCC
TTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCCAATTCAAACACTTCTGTTTTTGTCTCTATAAACCAAACTTCTTCTTCTCTCCTCTCCATTCAAATTCACACTT
GTCGCTCTCTTCCTCCTAAAGATCAACAAGCTATATTGGATCAAGTGGCTCGAATGCTTAGAATTACGGAGAAAGATGAAGATGACCTTAGAAATTTTCATAATTTGCAT
CCGAGAGCCAAAGAGATGGGATTTGGTCGAATTTTTCGATCTCCAACTCTTTTTGAAGATGCAGTAAAGTCCATCCTTCTGTGCAATACCTCGTGGAGAAGAACGTTGGC
AATGGCTGAAGAGCTATGTGAGTTACAAGCCAAAATGAGCGAAACTAAGAAGAGAAAGAGAAAAGGCAAAGGACGAGGAGGGAAAAGGGAATACGAGGCCAGGGGAGGGA
ATTTTCCAAATGCCACAGAACTTTCTAGGATGAGCGTTGAATTGTTGAAGAAGCATTTACTTGGTTATAGAGCTGATTACATCATCAATTTCTCTACAAGTGTTGAAAAT
GGCAAAATCGATCTGCAAAGATTTGAAGAAACACTTTGCTCTCGTAATGCTTTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAATAGCCAATATTCGCATGTGCCTCGG
ATTTTACCATCAAGTTCCAATTGATACCGAGACTATAAGACACTTAAAACAGGTACATGGAAAAGAATTTTGCAGCAAGAAGACAGTTGGAGAAGATGTCAAACAAATTT
ATGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTTGAGTATTATGAGAGCAAATTCGGGAAGCTAAGTGAATTGTGTTCCCTTGATTATCACAAG
ATAAGTGGCACCAGCGTCAACCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGACAATTCATTTGAATTTGGAATCGGTTTCAATCAGTGATTTTGATCTTGAGAAAGCAGTTTGCAATCATGGGGTGTTTATGATGTCTCCAAACAAATGGATTCC
TTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCCAATTCAAACACTTCTGTTTTTGTCTCTATAAACCAAACTTCTTCTTCTCTCCTCTCCATTCAAATTCACACTT
GTCGCTCTCTTCCTCCTAAAGATCAACAAGCTATATTGGATCAAGTGGCTCGAATGCTTAGAATTACGGAGAAAGATGAAGATGACCTTAGAAATTTTCATAATTTGCAT
CCGAGAGCCAAAGAGATGGGATTTGGTCGAATTTTTCGATCTCCAACTCTTTTTGAAGATGCAGTAAAGTCCATCCTTCTGTGCAATACCTCGTGGAGAAGAACGTTGGC
AATGGCTGAAGAGCTATGTGAGTTACAAGCCAAAATGAGCGAAACTAAGAAGAGAAAGAGAAAAGGCAAAGGACGAGGAGGGAAAAGGGAATACGAGGCCAGGGGAGGGA
ATTTTCCAAATGCCACAGAACTTTCTAGGATGAGCGTTGAATTGTTGAAGAAGCATTTACTTGGTTATAGAGCTGATTACATCATCAATTTCTCTACAAGTGTTGAAAAT
GGCAAAATCGATCTGCAAAGATTTGAAGAAACACTTTGCTCTCGTAATGCTTTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAATAGCCAATATTCGCATGTGCCTCGG
ATTTTACCATCAAGTTCCAATTGATACCGAGACTATAAGACACTTAAAACAGGTACATGGAAAAGAATTTTGCAGCAAGAAGACAGTTGGAGAAGATGTCAAACAAATTT
ATGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTTGAGTATTATGAGAGCAAATTCGGGAAGCTAAGTGAATTGTGTTCCCTTGATTATCACAAG
ATAAGTGGCACCAGCGTCAACCTTTGA
Protein sequenceShow/hide protein sequence
MKTIHLNLESVSISDFDLEKAVCNHGVFMMSPNKWIPSSKTLQRPLRLSNSNTSVFVSINQTSSSLLSIQIHTCRSLPPKDQQAILDQVARMLRITEKDEDDLRNFHNLH
PRAKEMGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMAEELCELQAKMSETKKRKRKGKGRGGKREYEARGGNFPNATELSRMSVELLKKHLLGYRADYIINFSTSVEN
GKIDLQRFEETLCSRNAFPKIKGFGPFAIANIRMCLGFYHQVPIDTETIRHLKQVHGKEFCSKKTVGEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYHK
ISGTSVNL