CuGenDBv2

CmoCh15G006690 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh15G006690
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: DNA-(apurinic or apyrimidinic site) lyase
Genome location: Cmo_Chr15:3273761..3277119
RNA-Seq Expression: CmoCh15G006690
Synteny: CmoCh15G006690
Gene Ontology terms:
GO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006289 - nucleotide-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0034039 - 8-oxo-7,8-dihydroguanine DNA N-glycosylase activity (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domains:
IPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR012904 - 8-oxoguanine DNA glycosylase, N-terminal
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
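
The "Genome location" field above packs a sequence name and a coordinate range into one string. The following is a minimal sketch of how such a string could be split into its parts. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a string such as 'Cmo_Chr15:3273761..3277119' into its fields."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("Cmo_Chr15:3273761..3277119")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for this gene is 3359 bp.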


Homology
GenBank top hits (E-value, % identity, alignment)
KAG6578949.1 N-glycosylase/DNA lyase OGG1, partial [Cucurbita argyrosperma subsp. sororia]; E-value: 2.6e-222; identity: 98.04%
Query:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
        MPSLSLRHHLMAKRLRPTPPSTPS KPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLT+LVSPASA+SSNWVSLNLTRSDLSLPLTFPTGQTF
Subjt:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF

Query:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCST-SSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQ
        RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCST SSSSSS+AAARLALLDFLNAGISLSAIWEVFSAADPRFD LSRHLEGARVLRQDPLECLIQ
Subjt:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCST-SSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQ

Query:  FLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALP
        FLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIE LTALP
Subjt:  FLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALP

Query:  GVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQR
        GVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQ 
Subjt:  GVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQR

Query:  EKAHTEQDP
        EKAHTEQDP
Subjt:  EKAHTEQDP

KAG7016473.1 N-glycosylase/DNA lyase OGG1, partial [Cucurbita argyrosperma subsp. argyrosperma]; E-value: 4.1e-220; identity: 98.02%
Query:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
        MPSLSLRHHLMAKRLRPTPPSTPS KPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLT+LVSPASA+SSNWVSLNLTRSDLSLPLTFPTGQTF
Subjt:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF

Query:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCST-SSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQ
        RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCST SSSSSS+AAARLALLDFLNAGISLSAIWEVFSAADPRFD LSRHLEGARVLRQDPLECLIQ
Subjt:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCST-SSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQ

Query:  FLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALP
        FLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIE LTALP
Subjt:  FLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALP

Query:  GVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQR
        GVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQ 
Subjt:  GVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQR

Query:  EKAHT
        EKAHT
Subjt:  EKAHT

XP_022939268.1 N-glycosylase/DNA lyase OGG1 isoform X1 [Cucurbita moschata]; E-value: 1.1e-225; identity: 100%
Query:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
        MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
Subjt:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF

Query:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF
        RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF
Subjt:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF

Query:  LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG
        LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG
Subjt:  LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG

Query:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
        VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
Subjt:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE

Query:  KAHT
        KAHT
Subjt:  KAHT

XP_022939269.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Cucurbita moschata]; E-value: 7.1e-228; identity: 100%
Query:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
        MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
Subjt:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF

Query:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF
        RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF
Subjt:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF

Query:  LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG
        LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG
Subjt:  LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG

Query:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
        VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
Subjt:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE

Query:  KAHTEQDP
        KAHTEQDP
Subjt:  KAHTEQDP

XP_022993891.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Cucurbita maxima]; E-value: 6.6e-218; identity: 96.34%
Query:  MPSLSLRHHLMAKRLRPTPPSTPSAK--PSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQ
        MPSLSLRHHLMAKRLRPTPPSTPSAK  PSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDR+KTLT+LVSPASA+SSNWVSLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSLRHHLMAKRLRPTPPSTPSAK--PSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLI
        TFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLH CSTSSSSS+AAAARLALLDFLNAGISL AIWEVFSAADPRFD LS HLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTAL
        QFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDL L+EVIE LTAL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTAL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQ
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLE TKRKKSTKEQ
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQ

Query:  REKAHTEQDP
         EKAHTEQDP
Subjt:  REKAHTEQDP

TrEMBL top hits (E-value, % identity, alignment)
A0A5D3CBS3 DNA-(apurinic or apyrimidinic site) lyase; E-value: 8.2e-190; identity: 86.03%
Query:  MPSLSLRH-HLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQT
        MPSLS +   LM KR +PT PSTPS KPSP     PPSPPTPQL HSKPTTVS+ HSS + NKTLT L SP S +SSNWVSLNLTRSDLSLPLTFPTGQT
Subjt:  MPSLSLRH-HLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQT

Query:  FRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQ
        FRWKQT+PL FTGVVG HLISL HLPNG+VSYCLH  STS+SSS  AAARLALLDFLNAGISLS+IWEVFSAADPRFD L+RHLEGARVLRQDPLECLIQ
Subjt:  FRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQ

Query:  FLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALP
        FLCSSNNNIGRITKMVDYISSLGN+LGN+GGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTV  LK KPGGGAEWLLSLRD  LEEVI  L+ LP
Subjt:  FLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALP

Query:  GVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQR
        GVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALLPA+LENTKRK+STK+QR
Subjt:  GVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQR

Query:  EKAHTEQD
        + AH EQD
Subjt:  EKAHTEQD

A0A6J1FGN4 DNA-(apurinic or apyrimidinic site) lyase; E-value: 3.4e-228; identity: 100%
Query:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
        MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
Subjt:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF

Query:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF
        RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF
Subjt:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF

Query:  LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG
        LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG
Subjt:  LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG

Query:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
        VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
Subjt:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE

Query:  KAHTEQDP
        KAHTEQDP
Subjt:  KAHTEQDP

A0A6J1FL67 DNA-(apurinic or apyrimidinic site) lyase; E-value: 5.5e-226; identity: 100%
Query:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
        MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF
Subjt:  MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTF

Query:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF
        RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF
Subjt:  RWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQF

Query:  LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG
        LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG
Subjt:  LCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPG

Query:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
        VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
Subjt:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE

Query:  KAHT
        KAHT
Subjt:  KAHT

A0A6J1JU60 DNA-(apurinic or apyrimidinic site) lyase; E-value: 3.2e-218; identity: 96.34%
Query:  MPSLSLRHHLMAKRLRPTPPSTPSAK--PSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQ
        MPSLSLRHHLMAKRLRPTPPSTPSAK  PSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDR+KTLT+LVSPASA+SSNWVSLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSLRHHLMAKRLRPTPPSTPSAK--PSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLI
        TFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLH CSTSSSSS+AAAARLALLDFLNAGISL AIWEVFSAADPRFD LS HLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTAL
        QFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDL L+EVIE LTAL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTAL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQ
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLE TKRKKSTKEQ
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQ

Query:  REKAHTEQDP
         EKAHTEQDP
Subjt:  REKAHTEQDP

A0A6J1JZS0 DNA-(apurinic or apyrimidinic site) lyase; E-value: 5.1e-216; identity: 96.31%
Query:  MPSLSLRHHLMAKRLRPTPPSTPSAK--PSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQ
        MPSLSLRHHLMAKRLRPTPPSTPSAK  PSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDR+KTLT+LVSPASA+SSNWVSLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSLRHHLMAKRLRPTPPSTPSAK--PSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLI
        TFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLH CSTSSSSS+AAAARLALLDFLNAGISL AIWEVFSAADPRFD LS HLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTAL
        QFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDL L+EVIE LTAL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTAL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQ
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLE TKRKKSTKEQ
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQ

Query:  REKAHT
         EKAHT
Subjt:  REKAHT

SwissProt top hits (E-value, % identity, alignment)
O08760 N-glycosylase/DNA lyase; E-value: 5.9e-52; identity: 38.15%
Query:  SNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALL-DF
        S+ R++TL       S++ + W S+   RS+L L L   +GQ+FRWK+ SP H++GV+   + +LT     D  YC       S  S      L  L  +
Subjt:  SNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALL-DF

Query:  LNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNHLGNIGGFDFHEFPSLERLS-LVSEAELREA
            +SL+ ++  +++ D  F  +++  +G R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    +H FP+L  L+   +E  LR+ 
Subjt:  LNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNHLGNIGGFDFHEFPSLERLS-LVSEAELREA

Query:  GFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-
        G GYRA+Y+  + K +  + GG A WL  LR    EE  + L  LPGVG KVA C+ L +LD+  A+PVD HVWQIA R     P+ + A+    L N+ 
Subjt:  GFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-

Query:  VAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTK
        +   F + +G YAGWAQ +LF ADL Q      +     KRKK +K
Subjt:  VAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTK

O15527 N-glycosylase/DNA lyase; E-value: 5.9e-52; identity: 37.85%
Query:  WVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARL-ALLDFLNAGISLSAIWEVFSAADPRF
        W S+   RS+L L L  P+GQ+FRW++ SP H++GV+   + +LT     +  +C       S +S      L A+  +    ++L+ ++  + + D  F
Subjt:  WVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARL-ALLDFLNAGISLSAIWEVFSAADPRF

Query:  DFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNHLGNIGGFDFHEFPSLERLSLVS-EAELREAGFGYRAKYIIGTVKELKGKPG
          +++  +G R+LRQDP+ECL  F+CSSNNNI RIT MV+ +  + G  L  +    +H FPSL+ L+    EA LR+ G GYRA+Y+  + + +  + G
Subjt:  DFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNHLGNIGGFDFHEFPSLERLSLVS-EAELREAGFGYRAKYIIGTVKELKGKPG

Query:  GGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLF
        G A WL  LR+ + EE  + L  LPGVG KVA C+ L +LD+  A+PVD H+W IA R     P  + A+  +P+    +   F S +G YAGWAQ +LF
Subjt:  GGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLF

Query:  VADLPQQKALLPASLENTKRKKSTK
         ADL Q +    A     KR+K +K
Subjt:  VADLPQQKALLPASLENTKRKKSTK

O70249 N-glycosylase/DNA lyase; E-value: 5.9e-52; identity: 38.22%
Query:  SSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALL-D
        SS+ R++TLT   SPA      W S+   RS+L L L   +GQ+FRW++ SP H++GV+   + +LT     D  YC                 L  L  
Subjt:  SSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALL-D

Query:  FLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNHLGNIGGFDFHEFPSLERLSLVS-EAELRE
        +    +SL+ ++  +++ D  F  +++  +G R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    +H FP+L  L+    E  LR+
Subjt:  FLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNHLGNIGGFDFHEFPSLERLSLVS-EAELRE

Query:  AGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR
         G GYRA+Y+  + K +  + GG A WL  LR  + EE  + L  LPGVG KVA C+ L +LD+  A+PVD HVWQIA R     P+ +  +    L N+
Subjt:  AGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR

Query:  -VAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKE
         +   F + +G YAGWAQ +LF ADL QQ     +     KRKK +K+
Subjt:  -VAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKE

Q9FNY7 N-glycosylase/DNA lyase OGG1; E-value: 1.7e-120; identity: 60.72%
Query:  RPTPPSTPSAKPSPSPPSLPP-SPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVV
        RP P S PS   +  PP  PP +P   Q  H   T                            W  L LT ++L+LPLTFPTGQTFRWK+T  + ++G +
Subjt:  RPTPPSTPSAKPSPSPPSLPP-SPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVV

Query:  GPHLISLTHLPNGD-VSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITK
        GPHL+SL   P  D VSYC+H CSTS  S     A LALLDFLNA ISL+ +W  FS  DPRF  L+RHL GARVLRQDPLECLIQFLCSSNNNI RITK
Subjt:  GPHLISLTHLPNGD-VSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITK

Query:  MVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFS
        MVD++SSLG HLG+I GF+FH+FPSL+RLS VSE E R+AGFGYRAKYI GTV  L+ KPGGG EWLLSLR + L+E +  L  LPGVGPKVAAC+ALFS
Subjt:  MVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFS

Query:  LDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
        LDQH AIPVDTHVWQIAT YL+P+LAGA+LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LL +  +   +   + E  E
Subjt:  LDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE

Q9V3I8 N-glycosylase/DNA lyase; E-value: 2.5e-42; identity: 34.06%
Query:  LNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRF-DFL
        + L+  +  L  T   GQ+FRW+     + T   G    +   L   +      +  TSS  ++   + L + D+L     L    + + + D  F  FL
Subjt:  LNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRF-DFL

Query:  SRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVD-YISSLGNHLGNIGGFDFHEFPSLERLSLVS----EAELREAGFGYRAKYIIGTVKELKGKPG
        S+ +   R+L Q+P E +  FLCS NNNI RI+ M++ + ++ G  +G+  G D + FP++ R   +      A+LR A FGYRAK+I  T++E++ K  
Subjt:  SRHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVD-YISSLGNHLGNIGGFDFHEFPSLERLSLVS----EAELREAGFGYRAKYIIGTVKELKGKPG

Query:  GGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA
        GG  W +SL+ +  E+  E LT LPG+G KVA C+ L S+    ++PVD H+++IA  Y +P L G + +T K+   V++ F   +GKYAGWAQ +LF A
Subjt:  GGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA

Query:  DLPQQKALLPASLENTKRKKSTK
        DL Q +     + +    KK  K
Subjt:  DLPQQKALLPASLENTKRKKSTK

Arabidopsis top hits (E-value, % identity, alignment)
AT1G21710.1 8-oxoguanine-DNA glycosylase 1; E-value: 1.2e-121; identity: 60.72%
Query:  RPTPPSTPSAKPSPSPPSLPP-SPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVV
        RP P S PS   +  PP  PP +P   Q  H   T                            W  L LT ++L+LPLTFPTGQTFRWK+T  + ++G +
Subjt:  RPTPPSTPSAKPSPSPPSLPP-SPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVV

Query:  GPHLISLTHLPNGD-VSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITK
        GPHL+SL   P  D VSYC+H CSTS  S     A LALLDFLNA ISL+ +W  FS  DPRF  L+RHL GARVLRQDPLECLIQFLCSSNNNI RITK
Subjt:  GPHLISLTHLPNGD-VSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGRITK

Query:  MVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFS
        MVD++SSLG HLG+I GF+FH+FPSL+RLS VSE E R+AGFGYRAKYI GTV  L+ KPGGG EWLLSLR + L+E +  L  LPGVGPKVAAC+ALFS
Subjt:  MVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFS

Query:  LDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE
        LDQH AIPVDTHVWQIAT YL+P+LAGA+LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LL +  +   +   + E  E
Subjt:  LDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQRE

AT3G47830.1 DNA glycosylase superfamily protein; E-value: 3.5e-07; identity: 46.38%
Query:  LRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR
        LR L++EEV   L+   GVGPK  +CV +F+L QH+  PVDTHV++IA     VP+ A    T    NR
Subjt:  LRDLALEEVIEGLTALPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR
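
The identity percentages listed for each hit can be cross-checked against the alignment blocks. The sketch below assumes the usual BLASTP convention for the unlabeled middle line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Identity is then the number of letters divided by the alignment length. The function name `percent_identity` is a hypothetical helper, and the match line must be supplied without its leading indentation and padded with spaces to the full alignment length.

```python
def percent_identity(midline: str) -> float:
    """Percent identity implied by a BLASTP-style match line.

    Letters are counted as identical positions; '+' and spaces are not.
    The alignment length is taken to be the length of the match line.
    """
    if not midline:
        raise ValueError("empty match line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Match line of the AT3G47830.1 alignment, leading indentation removed.
MIDLINE = "LR L++EEV   L+   GVGPK  +CV +F+L QH+  PVDTHV++IA     VP+ A    T    NR"

print(len(MIDLINE), percent_identity(MIDLINE))
```

For this 69-column alignment the sketch counts 32 identical positions, which gives 46.38 and agrees with the value listed for AT3G47830.1.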


Sequences
CDS sequence
ATGCCTTCATTGTCATTGAGACACCATCTAATGGCGAAGAGGCTCAGACCCACCCCACCCTCCACTCCCTCCGCCAAGCCATCGCCATCGCCACCGTCATTGCCG
CCGTCTCCTCCGACCCCTCAACTCTTCCATTCAAAGCCCACCACCGTGTCCCTCCGCCACTCATCCAACGATCGAAACAAAACCCTAACCTACCTCGTATCCCCC
GCATCCGCAGCATCCTCCAACTGGGTCTCTCTAAATCTCACCAGATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACC
AGCCCCCTTCACTTCACCGGCGTTGTTGGGCCTCATCTTATCTCTCTCACCCATCTCCCAAATGGCGACGTTTCATACTGCCTTCACTCTTGTTCTACATCCTCC
TCCTCCTCCTCCGCCGCCGCCGCCAGATTGGCCTTGCTTGATTTCCTTAACGCCGGTATCTCCCTAAGTGCCATTTGGGAGGTCTTCTCGGCGGCTGATCCAAGA
TTCGATTTCTTGTCGCGCCATTTGGAGGGGGCTCGAGTTCTCAGGCAAGACCCACTCGAGTGTTTGATTCAGTTTTTGTGTTCTTCAAATAACAATATTGGAAGA
ATCACCAAAATGGTGGATTACATCTCATCACTTGGGAATCATTTGGGTAATATTGGAGGCTTTGATTTCCATGAATTCCCCTCTTTGGAGAGGCTATCATTGGTC
TCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGTTACAGGGCTAAATACATAATTGGCACTGTGAAAGAACTAAAAGGCAAACCTGGGGGAGGTGCAGAATGGCTT
CTGTCTCTTCGTGATTTGGCTCTCGAAGAAGTGATAGAAGGCCTTACAGCTTTACCGGGCGTGGGTCCAAAGGTAGCAGCTTGTGTTGCTCTCTTCTCTCTCGAT
CAGCACCATGCCATTCCTGTTGACACACACGTCTGGCAGATTGCTACTAGGTACCTTGTCCCTGAGCTTGCTGGTGCACGTCTAACGCCAAAGCTTTGCAATCGT
GTGGCTGAGGCATTTGTCAGCAAGTATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTCTTCGTCGCTGATTTGCCTCAACAGAAGGCCCTCTTACCTGCAAGC
CTTGAGAATACCAAAAGGAAAAAATCTACAAAGGAGCAGAGAGAAAAGGCACATACTGAGCAAGATCCATAG
mRNA sequence
ACGGAAGCGCCACTTGCCAACCCCCCACATGCCTTCATTGTCATTGAGACACCATCTAATGGCGAAGAGGCTCAGACCCACCCCACCCTCCACTCCCTCCGCCAA
GCCATCGCCATCGCCACCGTCATTGCCGCCGTCTCCTCCGACCCCTCAACTCTTCCATTCAAAGCCCACCACCGTGTCCCTCCGCCACTCATCCAACGATCGAAA
CAAAACCCTAACCTACCTCGTATCCCCCGCATCCGCAGCATCCTCCAACTGGGTCTCTCTAAATCTCACCAGATCAGACCTCTCTTTGCCTCTCACTTTCCCCAC
CGGCCAAACCTTCCGCTGGAAACAAACCAGCCCCCTTCACTTCACCGGCGTTGTTGGGCCTCATCTTATCTCTCTCACCCATCTCCCAAATGGCGACGTTTCATA
CTGCCTTCACTCTTGTTCTACATCCTCCTCCTCCTCCTCCGCCGCCGCCGCCAGATTGGCCTTGCTTGATTTCCTTAACGCCGGTATCTCCCTAAGTGCCATTTG
GGAGGTCTTCTCGGCGGCTGATCCAAGATTCGATTTCTTGTCGCGCCATTTGGAGGGGGCTCGAGTTCTCAGGCAAGACCCACTCGAGTGTTTGATTCAGTTTTT
GTGTTCTTCAAATAACAATATTGGAAGAATCACCAAAATGGTGGATTACATCTCATCACTTGGGAATCATTTGGGTAATATTGGAGGCTTTGATTTCCATGAATT
CCCCTCTTTGGAGAGGCTATCATTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGTTACAGGGCTAAATACATAATTGGCACTGTGAAAGAACTAAAAGG
CAAACCTGGGGGAGGTGCAGAATGGCTTCTGTCTCTTCGTGATTTGGCTCTCGAAGAAGTGATAGAAGGCCTTACAGCTTTACCGGGCGTGGGTCCAAAGGTAGC
AGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCACCATGCCATTCCTGTTGACACACACGTCTGGCAGATTGCTACTAGGTACCTTGTCCCTGAGCTTGCTGGTGC
ACGTCTAACGCCAAAGCTTTGCAATCGTGTGGCTGAGGCATTTGTCAGCAAGTATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTCTTCGTCGCTGATTTGCC
TCAACAGAAGGCCCTCTTACCTGCAAGCCTTGAGAATACCAAAAGGAAAAAATCTACAAAGGAGCAGAGAGAAAAGGCACATACTGAGCAAGATCCATAGGTGTT
GGCATAGCTCTTGAAATTTGTTCGGGTTTTCTTTGGCTCGTTGACGTATTCTCCTGGAGGAAGAATTCAATATGACTCCTAAAAGAATGAAATAAGCTGTGATCA
TGGAGATGATTATTGCTAAAGCTATGTATCTATTGAAAGCTACTGTGAATTATCTTGCCATGTGCTTCATTCATGCTAGACATCTTCCATCATTTATGTGCATTG
TGCATCGACCAACAGTGTTTTGGTGACAAACGGACATGGATCGCCTCTAAAGTTCCGATTCGAGATTGGAATCACGCATTTAGCCTTCCCTAGACACGCCTGAAA
CATCACAAAGCTTTCATGAAATTACTCACATGTTCTACAATGGCTCTGGAGTTTGGCGAGTGACATATTCCAACAGCGTAGCTTTGACAGTCACCAGATGGTGTC
CCAAAGCTTGCAAATAAGATTTTGGAGATGTTCTTGTTAGTAGGGCAGCTTAATCGGACCTTAGGCCTTCTACTTTTGTTCTTTGTTCTACTTGCCCTCTGTTTC
TTTGCACTCATCCATGAAGCTACCAGAGGATAATGTGATTCAGACACTTGCCCACATGTTTTGCTAATTGAAACAGAATCCAAAGATATCCCAACTGGGCTTCCT
GTTTCTTCTTCCAAAATGACCAACAGGTTCTCAGTCGGCTTGAGGAAGGAACGTGGAACATTATACCTGGATTCAAAGGAGAGAATATGGAAGGAATTTTCAAAG
GTGAGGTATAATATAACACCAGATATTAAGATCTGAATAAACTGTTTACCATTTCTGTGAAGGCTCCCCTGTTGGGGTGAGGAAGGAGACCCAGTATCGGCCAAT
ACCCCAGCCGTTAACCCAAACTGCACCCTTCCCCATGGAACCAAGGTTCAGTGCAATTGGGTCATCACCAGGAGGTGCATCGAACTGAGTCTGCCCAATTAATGA
CGACAAAGGTCAGTTAATGAACACTGAATTCAGGATAGTTGCTATGCAAATGGTCTAAAAATTGATTGTTGGTAGTACCTTGTACCATGTGAGCGGCTGGGAAGA
GTCTCCTAACCTGCTCCACTGAACATTACTTGACCCATTGTCTAAAAATATTTGCGATTGCTCTCCTGATAGGCCAACCTATGCAAAAATTGTTCAGAAAAACCG
ACAAAAATACATGTACGTATATTTTTTTTTAATATGACATCGGCTTCAAGTGTGTATGAACTCATATTACGGGTTCAAAAATTAAAGAACCTTGTATCCCCAAGG
TTGCGCGGAGAAATCCTCGTCTTGAATCCTCACTCTTCGCAGTCCAGCAATTCGTCTCTCAAGAAATGCCCCAGAATCCTTCACATTATAGCTATTAGAATTCGC
TAAGGTGAAATGCGAAATTAATATGCATCAACCATGGAAGGGAAGGTAAGAAGATTCGTACCGGTAAACCAACCATCACACTGAGCAATGAGATGTTGTTGATGC
CATTTCTCAACGTAATATTATTCTCCAGAGAGAAACCTTTTTCTTTGTAAGTTCCGTGGGCGGAGCCTATAAACATCAATAGTGAAATTCTCGATATTGTGGTTT
GATCGAGC
Protein sequence
MPSLSLRHHLMAKRLRPTPPSTPSAKPSPSPPSLPPSPPTPQLFHSKPTTVSLRHSSNDRNKTLTYLVSPASAASSNWVSLNLTRSDLSLPLTFPTGQTFRWKQT
SPLHFTGVVGPHLISLTHLPNGDVSYCLHSCSTSSSSSSAAAARLALLDFLNAGISLSAIWEVFSAADPRFDFLSRHLEGARVLRQDPLECLIQFLCSSNNNIGR
ITKMVDYISSLGNHLGNIGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVKELKGKPGGGAEWLLSLRDLALEEVIEGLTALPGVGPKVAACVALFSLD
QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPASLENTKRKKSTKEQREKAHTEQDP
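
The CDS and protein sequences above should correspond codon by codon. The following is a minimal sketch of that check, assuming the standard genetic code (a reasonable default for a nuclear gene, though this record does not say which table was used). `CODON_TABLE` and `translate` are hypothetical names defined here, not taken from any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # Codons containing ambiguous bases are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 14 codons and the final line of the CDS listed above.
CDS_START = "ATGCCTTCATTGTCATTGAGACACCATCTAATGGCGAAGAGG"
CDS_END = "CTTGAGAATACCAAAAGGAAAAAATCTACAAAGGAGCAGAGAGAAAAGGCACATACTGAGCAAGATCCATAG"

print(translate(CDS_START))
print(translate(CDS_END))
```

The two fragments translate to `MPSLSLRHHLMAKR` and `LENTKRKKSTKEQREKAHTEQDP*`, matching the start and end of the listed protein sequence, with the trailing `*` corresponding to the terminal TAG stop codon. Passing the full CDS to `translate` and comparing against the protein sequence (minus the `*`) would extend the check to the whole record.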