CuGenDBv2

CsGy6G007210 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy6G007210
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Cytochrome P450
Genome location: Gy14Chr6:6160674..6167813
RNA-Seq Expression: CsGy6G007210
Synteny: CsGy6G007210
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0010268 - brassinosteroid homeostasis (biological process)
GO:0016125 - sterol metabolic process (biological process)
GO:0016132 - brassinosteroid biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000701 - purine-specific mismatch base pair DNA N-glycosylase activity (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
GO:0020037 - heme binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0004497 - monooxygenase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domains:
IPR000445 - Helix-hairpin-helix motif
IPR044298 - Adenine/Thymine-DNA glycosylase
IPR036396 - Cytochrome P450 superfamily
IPR029119 - MutY, C-terminal
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR017972 - Cytochrome P450, conserved site
IPR015797 - NUDIX hydrolase-like domain superfamily
IPR011257 - DNA glycosylase
IPR004036 - Endonuclease III-like, conserved site-2
IPR003265 - HhH-GPD domain
IPR002401 - Cytochrome P450, E-class, group I
IPR001128 - Cytochrome P450
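
The genome location field of this record is a `chromosome:start..end` string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the locus could be parsed and its span computed as below. `Locus` and `parse_location` are hypothetical names introduced for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Locus(chrom, int(start), int(end))


locus = parse_location("Gy14Chr6:6160674..6167813")
print(locus.chrom, locus.start, locus.end, locus.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 7140 bp.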


Homology
GenBank top hits (e-value, % identity, alignment):
KGN46403.2 hypothetical protein Csa_005328 [Cucumis sativus] (e-value: 0.0; identity: 100%)
Query:  MSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGR
        MSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGR
Subjt:  MSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGR

Query:  TVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENI
        TVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENI
Subjt:  TVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENI

Query:  SESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLTLMFKLL
        SESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLTLMFKLL
Subjt:  SESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLTLMFKLL

Query:  AENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPW
        AENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPW
Subjt:  AENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPW

Query:  RWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHIVELFTSSFDTQIDVKLVFPAPISNFLFIYNL
        RWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHIVELFTSSFDTQIDVKLVFPAPISNFLFIYNL
Subjt:  RWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHIVELFTSSFDTQIDVKLVFPAPISNFLFIYNL

Query:  KLWGLLTSNRSSGSGCCSMSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGE
        KLWGLLTSNRSSGSGCCSMSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGE
Subjt:  KLWGLLTSNRSSGSGCCSMSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGE

Query:  PETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIA
        PETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIA
Subjt:  PETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIA

Query:  SIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYP
        SIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYP
Subjt:  SIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYP

Query:  AKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHI
        AKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHI
Subjt:  AKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHI

Query:  FTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        FTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  FTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

NP_001292621.1 cytochrome P450 87A3-like [Cucumis sativus] (e-value: 0.0; identity: 99.79%)
Query:  MELPSVMSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFI
        MELPSVMSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFI
Subjt:  MELPSVMSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFI

Query:  YNQEGRTVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPE
        YNQEGRTVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPE
Subjt:  YNQEGRTVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPE

Query:  KSNENISESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLT
        KSNENISESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLT
Subjt:  KSNENISESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLT

Query:  LMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDP
        LMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDP
Subjt:  LMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDP

Query:  LDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
        LDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIH+
Subjt:  LDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
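
The % identity values listed for each hit can be approximated from an alignment's middle row, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below assumes that convention and assumes identity is counted over all aligned columns; how CuGenDB itself derives the column is not stated on the page, and `percent_identity` is a hypothetical helper.

```python
def percent_identity(query: str, midline: str, subject: str) -> float:
    """Percent of aligned columns whose midline shows an identical residue."""
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("alignment rows must have equal length")
    if not query:
        raise ValueError("empty alignment")
    # A midline letter marks identity; '+' marks a conservative substitution
    # and a space marks a mismatch or gap.
    identical = sum(1 for m in midline if m.isalpha())
    return 100.0 * identical / len(query)


# Toy rows (not taken from this page): 5 identities in 6 columns.
value = percent_identity("MSTPVI", "MSTPV+", "MSTPVL")
print(round(value, 2))
```

For the alignment directly above, the rows span 475 columns and the middle row contains a single `+`, so 474/475 gives about 99.79, in line with the listed value.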

XP_004140565.2 adenine DNA glycosylase isoform X1 [Cucumis sativus] (e-value: 0.0; identity: 100%)
Query:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV

Query:  EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK
        EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

XP_031743605.1 adenine DNA glycosylase isoform X2 [Cucumis sativus] (e-value: 0.0; identity: 99.78%)
Query:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLE VNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV

Query:  EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK
        EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

XP_031743606.1 adenine DNA glycosylase isoform X3 [Cucumis sativus] (e-value: 0.0; identity: 99.35%)
Query:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFE   MIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV

Query:  EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK
        EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

TrEMBL top hits (e-value, % identity, alignment):
A0A0A0KA91 Cytochrome P450 (e-value: 0.0; identity: 99.79%)
Query:  MELPSVMSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFI
        MELPSVMSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFI
Subjt:  MELPSVMSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFI

Query:  YNQEGRTVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPE
        YNQEGRTVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPE
Subjt:  YNQEGRTVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPE

Query:  KSNENISESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLT
        KSNENISESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLT
Subjt:  KSNENISESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLT

Query:  LMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDP
        LMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDP
Subjt:  LMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDP

Query:  LDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
        LDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIH+
Subjt:  LDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI

A0A0A0KC27 Adenine DNA glycosylase (e-value: 0.0; identity: 100%)
Query:  IVELFTSSFDTQIDVKLVFPAPISNFLFIYNLKLWGLLTSNRSSGSGCCSMSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMF
        IVELFTSSFDTQIDVKLVFPAPISNFLFIYNLKLWGLLTSNRSSGSGCCSMSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMF
Subjt:  IVELFTSSFDTQIDVKLVFPAPISNFLFIYNLKLWGLLTSNRSSGSGCCSMSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMF

Query:  SIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGA
        SIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGA
Subjt:  SIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGA

Query:  KMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTP
        KMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTP
Subjt:  KMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTP

Query:  TNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRE
        TNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRE
Subjt:  TNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRE

Query:  SINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTS
        SINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTS
Subjt:  SINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTS

Query:  SSSNCALPRKKQKS
        SSSNCALPRKKQKS
Subjt:  SSSNCALPRKKQKS

A0A1S3CBT2 Adenine DNA glycosylase (e-value: 2.69e-311; identity: 96.12%)
Query:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNEN+E +KK TDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFP+TVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVV

Query:  EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK
        EILESQGT ELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI+SLLSKNFGLE KKNFEIVNREDVGDFIH+FTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSS+  LP KKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

A0A1S3CCK2 cytochrome P450 87A3-like (e-value: 0.0; identity: 92.66%)
Query:  MELPSVMSTPVTGL--VAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINS
        ME+PS+MSTPVTGL  VAA+VVIGLTYLV+RWRNPKCNGVLPPGSMGFPLIGETLQLIA GYTLDLPPFIKKRV KYGPIFRTSLVGRSIVVTADPEINS
Subjt:  MELPSVMSTPVTGL--VAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINS

Query:  FIYNQEGRTVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYD
        FIYNQEGRTVELWYLDSISKVFKQDGE KTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKV  QWSN PSVE+QRGTLTMLYDFNA +MFGYD
Subjt:  FIYNQEGRTVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYD

Query:  PEKSNENISESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGS
        PEKSNENISESLITLADGFMSFPVN+PGTKYNKCLKAQK+LVN FK LVKER QASVAAARGDFLDQALRDIENEQFLTEEFV NLLFG+LFASGS+SGS
Subjt:  PEKSNENISESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGS

Query:  LTLMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHK
        LTLMFKLLAENPSVV+ELTAEHE FLKQRKDP+SPITWEEYKSMTFTLYVIYE+FRLSNAMPFLLRRTTKDVNI+GYTIPAGWTIMVANSALHLNP+THK
Subjt:  LTLMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHK

Query:  DPLDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
        DPLDFNPWRWKD DQYS+SK LQPFGGGTRQCAGADY+RVFMAIFLHTLVTKYSWK VKGGEVSRSPILKFGDGIH+
Subjt:  DPLDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI

A0A5A7T8X3 Adenine DNA glycosylase (e-value: 1.50e-307; identity: 91.62%)
Query:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNEN+E +KK TDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEV-----------
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFP+TVSSLRKIPGIGEYTAGAIASIAFGEV           
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEV-----------

Query:  --------------VPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSS
                      VPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSS
Subjt:  --------------VPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSS

Query:  VLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNRED
        VLVTDYPAKGIK KQRHDYSAVCVVEILESQGT ELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADL+TRRESI+SLLSKNFGLE KKNFEIVNRED
Subjt:  VLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNRED

Query:  VGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        VGDFIH+FTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSS+  LPRKKQKS
Subjt:  VGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

SwissProt top hits (e-value, % identity, alignment):
F4JRF4 Adenine DNA glycosylase (e-value: 2.6e-130; identity: 55.56%)
Query:  DGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ
        + E+ E  E  +   D    ++ + E +      +++E      DIED +FS +  Q IR  LLDWYD ++RDLPWR+   + E E RAY VWVSEIMLQ
Subjt:  DGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ

Query:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAG
        QTRVQTV+++Y RWM KWPT+  L +ASLE                   EVNEMWAGLGYYRRARFL EGAKM+V     FP   SSL K+ GIG+YTAG
Subjt:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAG

Query:  AIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVT
        AIASIAF E VPVVDGNVIRV+ARLKAIS NPKD    +  WK AAQLVD SRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + ++ VT
Subjt:  AIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVT

Query:  DYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSK--NFGLEAKKNFEIVNREDVG
        DYP K IK K RHD+  VCV+EI   +     G   RF+LVKRP++GLLAGLWEFPSV L+ EAD +TRR +IN  L +   F +E KK   IV+RE++G
Subjt:  DYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSK--NFGLEAKKNFEIVNREDVG

Query:  DFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
        +F+HIFTHIR K+YVE LV+ L G    LF+ Q K ++ WKCV + V+ST+GLTS+VRK
Subjt:  DFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK

K7NBR2 Cucurbitadienol 11-hydroxylase (e-value: 1.2e-135; identity: 51.7%)
Query:  MSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGR
        M T V GL A L V    + +++WR+ K NGVLPPG+MG PLIGET+QL     +LD+ PFI+K+V +YGPIF+T L GR +VV+AD E N++I  QEGR
Subjt:  MSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGR

Query:  TVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENI
         VE+WYLD++SK F  D E    A G IHKY+RSITLNHFG+E+L+ + L  I+    +    WS  PSVE++  +  M++  +   MFG D +K + NI
Subjt:  TVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENI

Query:  SESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFAS-GSISGSLTLMFKL
              L  GF+S P+N PGT Y+KCLK  K +    + +V + R A+V     DFL QAL+D E+E+F++EEF+  LLF + FAS  SIS +LTL+ KL
Subjt:  SESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFAS-GSISGSLTLMFKL

Query:  LAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNP
        L E+P VVKEL AEHE   K R DP  PITWEEYKSMTFTL VI E  RL +  P LLR+T KD+ +KGY IP GWTIM+  ++ H +P+ +KDP  FNP
Subjt:  LAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNP

Query:  WRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
        WRWKD D  +I K   PFGGG R CAGA+Y++V++  FLH L TKY W K+ GG ++R+ IL F DG+H+
Subjt:  WRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI

Q7XU38 Cytochrome P450 87A3 (e-value: 3.2e-120; identity: 45.82%)
Query:  LVAAL-VVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVELWY
        L AAL  V+ L    +RW +P+ NG LPPGS+G P+IGETLQ  A   T DL PF+K+R+ +YG IF+TS+VGR +VV+ADPE+N +++ QEG+  E WY
Subjt:  LVAAL-VVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVELWY

Query:  LDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEK-SNENISESLI
         D+ +++F +D     +  G ++KYL+++ L  +G E+LKS LLA+           W++ PSVE++ G  TM++D  A  + GYDP K S  N+ ++  
Subjt:  LDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEK-SNENISESLI

Query:  TLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQ-FLTEEFVSNLLFGVLFASGSISG-SLTLMFKLLAEN
            G +SFP+N+PGT Y++C++ +K  +   + ++KE R A       DF D  ++++  E+  LTE    +L+F +LFAS   +  +LT+  KLL EN
Subjt:  TLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQ-FLTEEFVSNLLFGVLFASGSISG-SLTLMFKLLAEN

Query:  PSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPWRWK
        P VV  L  EHE  ++ RKDP S +TW EYKSMTFT  VI E+ RL+N +P + R+  +DV IKGYTIPAGW IMV   A+HLNP+ ++DPL FNPWRW+
Subjt:  PSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPWRWK

Query:  DHDQYS-ISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
           + +  +K    FGGG R C G D ++V MA F+H+LVTKYSW+ VKGG + R+P L F DG HI
Subjt:  DHDQYS-ISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI

Q8L7D5 Cytochrome P450 708A2 (e-value: 8.6e-81; identity: 35.16%)
Query:  VAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVELWYLD
        V A+  + ++  ++RW NPKCNG LPPGSMG P+IGET          ++ PF+KKR+ KYGP+FRT++ G + VV  +P+I   ++ QE ++    Y +
Subjt:  VAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVELWYLD

Query:  SISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENISESLITLA
        +  K F ++        G IHK+++ I+L H GSE+LK K++ +I R   +     +N  S + +           + IM    P+  +    E+  TL 
Subjt:  SISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENISESLITLA

Query:  DGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKER-----------RQASVAAARGDFLDQALRDIENEQFL-TEEFVSNLLFGVLFASGSISGSLT-L
        D  M+      G+++ +       L++ +K  +  R           R+ +     GDFLD  + + E E  +  EE   NL+F +L  +   + S+T L
Subjt:  DGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKER-----------RQASVAAARGDFLDQALRDIENEQFL-TEEFVSNLLFGVLFASGSISGSLT-L

Query:  MFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYK-SMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDP
          K LAEN   + EL  EH   L+ R    + ++WEEY+  MTFT  VI E  R++N  P + R+   DV IKGYTIPAGW + V   A+H N   +++P
Subjt:  MFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYK-SMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDP

Query:  LDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
        L+FNPWRW+  +  S SKT   FGGG RQC GA++ R+ ++IF+H LVT Y +   +  E  R+P+  F  G+ I
Subjt:  LDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI

Q99P21 Adenine DNA glycosylase (e-value: 1.1e-83; identity: 39.88%)
Query:  KKKPTTERKRRGRSPSKSEAVVDIED-------------------IMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGE--PETRAYGVWVSEIMLQQT
        KK+P   ++RR R+ S S+A     D                   +   + +V   R++LL WYD+ +RDLPWR+L K E   + RAY VWVSE+MLQQT
Subjt:  KKKPTTERKRRGRSPSKSEAVVDIED-------------------IMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGE--PETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVI
        +V TV+ +Y RWM KWP +Q L+ ASLEEVN++W+GLGYY R R L EGA+ +V+E GG  PRT  +L++ +PG+G YTAGAIASIAF +V  VVDGNV+
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVI

Query:  RVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA--------------------------------
        RV+ R++AI  +P    +   +W  A QLVD +RPGDFNQA MELGAT+CTP  P CS CPV   C A                                
Subjt:  RVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA--------------------------------

Query:  LSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKK
         S S  D S+ V ++P K  +   R +YSA CVVE   + G P +      LLV+RPD GLLAGLWEFPSV+L  E     + +++   L +  G     
Subjt:  LSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKK

Query:  NFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAG--KTSSSSNCALPRKKQK
            +  + +G+ IHIF+HI+L   V  L L    +          + + W+   N  +ST     +++K + M E  + G  K S  S    P  ++K
Subjt:  NFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAG--KTSSSSNCALPRKKQK

Arabidopsis top hits (e-value, % identity, alignment):
AT1G12740.1 cytochrome P450, family 87, subfamily A, polypeptide 2 (e-value: 7.1e-123; identity: 44.71%)
Query:  ALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVELWYLDSI
        +L++I +T+ V+ WRNPKC G LPPGSMGFPL+GE++Q      T D+PPFIK+RV KYGPIF+T+LVGR ++V+ D +++ F++NQEGR  + WY D+ 
Subjt:  ALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVELWYLDSI

Query:  SKVF--KQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENISESLITLA
        + +F  K  G +     G ++KYL+++ L  FG + LK K+L  ++   +K    WSN  SVE++  T +M++D  A  +  +DP+KS+EN+  + +   
Subjt:  SKVF--KQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENISESLITLA

Query:  DGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENE-QFLTEEFVSNLLFGVLFAS-GSISGSLTLMFKLLAENPSV
         G +SFP ++PGT Y+KCL+ + + +   + +++ERR+ +      DF D  + +I+ E   LTEE   +L+F +LFAS  + S +LTL  K L+++P V
Subjt:  DGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENE-QFLTEEFVSNLLFGVLFAS-GSISGSLTLMFKLLAENPSV

Query:  VKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPWRWKDHD
        +K LT EHET L+ R+D  S +TWEEYKSMT+T   I E  RL+N +P + R+  +D+  K YTIPAGW +MV   A+HLNP+ +KDPL FNP RW+   
Subjt:  VKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPWRWKDHD

Query:  QYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
          + SK    FGGG R C G D+T++ MA FLH+LVTKY W+++KGG ++R+P L+F +G H+
Subjt:  QYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI

AT1G12740.2 cytochrome P450, family 87, subfamily A, polypeptide 2 (e-value: 1.5e-120; identity: 43.92%)
Query:  ALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVH------KYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVEL
        +L++I +T+ V+ WRNPKC G LPPGSMGFPL+GE++Q      T D+PPFIK+RV       +YGPIF+T+LVGR ++V+ D +++ F++NQEGR  + 
Subjt:  ALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVH------KYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVEL

Query:  WYLDSISKVF--KQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENISE
        WY D+ + +F  K  G +     G ++KYL+++ L  FG + LK K+L  ++   +K    WSN  SVE++  T +M++D  A  +  +DP+KS+EN+  
Subjt:  WYLDSISKVF--KQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENISE

Query:  SLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENE-QFLTEEFVSNLLFGVLFAS-GSISGSLTLMFKLL
        + +    G +SFP ++PGT Y+KCL+ + + +   + +++ERR+ +      DF D  + +I+ E   LTEE   +L+F +LFAS  + S +LTL  K L
Subjt:  SLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENE-QFLTEEFVSNLLFGVLFAS-GSISGSLTLMFKLL

Query:  AENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPW
        +++P V+K LT EHET L+ R+D  S +TWEEYKSMT+T   I E  RL+N +P + R+  +D+  K YTIPAGW +MV   A+HLNP+ +KDPL FNP 
Subjt:  AENPSVVKELTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPW

Query:  RWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
        RW+     + SK    FGGG R C G D+T++ MA FLH+LVTKY W+++KGG ++R+P L+F +G H+
Subjt:  RWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI

AT1G55940.1 cytochrome P450, family 708, subfamily A, polypeptide 1 (e-value: 2.5e-83; identity: 36.51%)
Query:  VAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKY-GPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVELWYL
        V ALVV+ ++  ++RW NP C+G LPPGSMGFP+IGET++        ++ PF+KKR+ K+ G +FRT+++G   +V+ DPE+N  I  QE R   + Y 
Subjt:  VAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKY-GPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVELWYL

Query:  DSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTL--------TMLYDFNAYIMFGYDPE-----
        +++ ++F +D  +    G   H+Y+R I L   G E LK       QR++ ++    S H      +G +         +L      I+    PE     
Subjt:  DSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTL--------TMLYDFNAYIMFGYDPE-----

Query:  -KSNENISESLI----------TLADGFMSFP--VNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVA--AARGDFLDQALRDIENE-QFLTEEFVSNL
         +S  + S  L+           L +G M F   +N+    +   +KA+  ++   K + KERR+ + +  +  GDF++  + ++E E   + EE    L
Subjt:  -KSNENISESLI----------TLADGFMSFP--VNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVA--AARGDFLDQALRDIENE-QFLTEEFVSNL

Query:  LFGVLFASGSISGSLT-LMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKS-MTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWT
        +  +L AS   + ++T L  K +AENP V+ EL  EHET L+ R D +S +TW+EY+S M FT  VI E  RL +  P + R+   DV IKGYTIPAGW 
Subjt:  LFGVLFASGSISGSLT-LMFKLLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYKS-MTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWT

Query:  IMVANSALHLNPQTHKDPLDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI
        ++V  S LH +PQ ++ P +FNPWRW+  +  S SKT   FGGG R CAGA++ R+ MAIFLH LVT Y +  +    + R+P+L+F   I I
Subjt:  IMVANSALHLNPQTHKDPLDFNPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHI

AT1G78490.1 cytochrome P450, family 708, subfamily A, polypeptide 3 | e value: 7.7e-85 | %identity: 37.55
Query:  VTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVEL
        V  L+ ALVV+ +++ ++RW NPKC G LPPGSMGFP+IGETL          +P F+KKR+ +YGP+FRT++ G   VV+ DP++   I+ QE  + EL
Subjt:  VTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEGRTVEL

Query:  WYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQR----YVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENI
         Y D   KVF +D          IHKYL+ IT+   GSE LK  +L ++ +    ++  + +Q S +   E++   L + Y     ++    PE  ++ I
Subjt:  WYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQR----YVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENI

Query:  SESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENE-QFLTEEFVSNLLFGVLFASGSISGSLT-LMFK
                D F SF          K LK+++  +   K ++  R++      + DFL+  L ++E +  F  +    NL+F + FA    + S T L  K
Subjt:  SESLITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENE-QFLTEEFVSNLLFGVLFASGSISGSLT-LMFK

Query:  LLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYK-SMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDF
         ++++P V+ EL  EH+  +  RKD ++ ++WEEY+ +MTFT  V  EV RL+N  P L R+  +DV IKGYTIPAGW + VA SA+H +P  +++P +F
Subjt:  LLAENPSVVKELTAEHETFLKQRKDPKSPITWEEYK-SMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDF

Query:  NPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKF
        NPWRW+  +    SKT   FG G R C GA+++R+ MAIFLH LV  Y +  V+  E+ RSP  ++
Subjt:  NPWRWKDHDQYSISKTLQPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKF

AT4G12740.1 HhH-GPD base excision DNA repair family protein | e value: 1.9e-131 | %identity: 55.56
Query:  DGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ
        + E+ E  E  +   D    ++ + E +      +++E      DIED +FS +  Q IR  LLDWYD ++RDLPWR+   + E E RAY VWVSEIMLQ
Subjt:  DGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ

Query:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAG
        QTRVQTV+++Y RWM KWPT+  L +ASLE                   EVNEMWAGLGYYRRARFL EGAKM+V     FP   SSL K+ GIG+YTAG
Subjt:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAG

Query:  AIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVT
        AIASIAF E VPVVDGNVIRV+ARLKAIS NPKD    +  WK AAQLVD SRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + ++ VT
Subjt:  AIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVT

Query:  DYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSK--NFGLEAKKNFEIVNREDVG
        DYP K IK K RHD+  VCV+EI   +     G   RF+LVKRP++GLLAGLWEFPSV L+ EAD +TRR +IN  L +   F +E KK   IV+RE++G
Subjt:  DYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSK--NFGLEAKKNFEIVNREDVG

Query:  DFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
        +F+HIFTHIR K+YVE LV+ L G    LF+ Q K ++ WKCV + V+ST+GLTS+VRK
Subjt:  DFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK


Sequences
CDS sequence
ATGGAGCTTCCATCAGTAATGTCGACGCCGGTGACCGGGTTAGTCGCCGCACTGGTGGTTATAGGCTTGACCTACTTGGTTCATAGATGGAGAAATCCTAAATGT
AATGGAGTTCTTCCTCCGGGCTCCATGGGCTTCCCTCTCATCGGAGAAACCCTTCAGCTCATTGCTTCCGGTTACACTCTTGACCTTCCTCCCTTCATCAAGAAA
AGAGTTCACAAATATGGACCCATATTCCGGACGAGTTTGGTGGGTCGATCCATTGTGGTAACAGCTGACCCTGAAATCAACAGTTTCATATACAACCAAGAAGGA
AGGACTGTGGAGCTTTGGTATCTAGACTCCATCTCCAAGGTGTTTAAGCAAGACGGAGAGGTTAAAACCACCGCCGGTGGAGCCATCCATAAGTACCTTCGAAGC
ATCACTTTGAACCACTTCGGTTCCGAAAGTCTCAAGTCCAAGCTACTGGCTGATATCCAACGATATGTTGATAAAGTTTTCACCCAATGGTCTAACCACCCCTCC
GTAGAAATGCAACGTGGAACTCTTACAATGTTATACGATTTCAATGCGTATATAATGTTCGGTTATGACCCTGAAAAGTCTAATGAGAATATAAGTGAGAGCTTA
ATCACATTAGCTGATGGCTTCATGTCTTTCCCAGTGAACGTCCCTGGAACTAAGTATAACAAGTGTCTTAAGGCACAAAAGAGGTTGGTCAACACGTTCAAGGCT
CTCGTCAAAGAGAGGCGCCAAGCCTCTGTTGCTGCTGCTCGTGGGGATTTTCTCGATCAAGCCCTTCGCGACATTGAAAACGAACAGTTTCTCACTGAAGAGTTT
GTTTCCAATTTGTTGTTTGGTGTTTTATTTGCCAGTGGCTCTATTTCTGGATCTCTTACCCTGATGTTCAAGTTACTTGCGGAAAATCCATCGGTCGTGAAGGAG
TTGACCGCTGAGCATGAGACATTCTTGAAACAGAGAAAAGATCCAAAATCTCCCATCACATGGGAGGAATACAAGTCAATGACATTTACGCTTTACGTTATATAC
GAAGTTTTTAGATTATCAAACGCAATGCCTTTTCTGTTGCGGAGAACTACAAAAGATGTGAACATAAAAGGATATACAATTCCAGCAGGGTGGACGATAATGGTT
GCCAATTCAGCTCTTCATTTGAACCCTCAAACCCACAAGGATCCCTTGGACTTCAACCCATGGCGCTGGAAGGATCATGACCAATATTCGATTTCGAAGACCTTG
CAGCCTTTTGGAGGAGGAACTCGACAATGCGCCGGAGCTGATTACACCAGGGTTTTCATGGCCATCTTCCTCCACACTCTTGTCACCAAGTATAGTTGGAAAAAG
GTGAAGGGAGGAGAAGTTTCTCGAAGTCCTATTCTCAAATTTGGAGATGGCATTCATATTGTTGAACTATTCACTTCTAGCTTTGATACCCAAATCGATGTCAAA
CTTGTATTTCCCGCCCCAATCAGCAACTTTCTCTTCATCTACAATTTGAAGTTGTGGGGATTACTGACGAGTAATCGGAGTAGTGGGTCGGGCTGTTGCAGTATG
AGCGACGGAGAAAAGAATGAAAACGATGAGTATATGAAGAAAAATACTGACTTTCGTCGGAAAAAGAAACCCACGACGGAACGGAAACGCCGGGGCCGAAGTCCG
TCTAAAAGTGAAGCAGTTGTTGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAAACAATCAGGGCATCGCTATTGGATTGGTACGACCGTAGCCGCAGG
GACCTTCCATGGAGGAGCTTGGACAAAGGGGAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACCAGAGTTCAGACCGTCGTC
CAATTTTACAACCGTTGGATGCTTAAATGGCCTACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGGTATTATAGA
CGAGCTCGTTTTCTTTTTGAGGGTGCAAAGATGATAGTCAAAGAAGGTGGTAGGTTTCCTAGAACAGTTTCTTCCCTGCGAAAAATTCCAGGAATTGGAGAATAC
ACAGCAGGGGCTATTGCCTCTATAGCGTTCGGTGAAGTGGTGCCTGTGGTCGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAACCCA
AAAGACCCAAAGTTGATCAAGCAAGTTTGGAAGGCAGCTGCTCAATTAGTTGATCTTTCCAGGCCTGGGGATTTCAATCAGGCACTCATGGAACTTGGTGCAACT
TTATGCACTCCAACAAACCCAAGCTGCTCAACATGCCCCGTGTTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCATGATAGTTCAGTTCTTGTCACAGATTAT
CCCGCTAAGGGGATAAAGATCAAACAAAGACATGATTACTCTGCTGTATGTGTGGTTGAGATATTGGAAAGTCAGGGTACACCTGAGTTAGGGCAATCTAGTAGA
TTTCTTCTTGTAAAGAGGCCTGATGAAGGTTTGCTTGCTGGTCTATGGGAGTTTCCATCTGTCTCGTTGGATGGAGAAGCTGATTTAAGCACCAGGAGAGAATCC
ATTAATAGCCTCTTGAGCAAAAACTTTGGACTTGAAGCAAAAAAGAATTTTGAAATAGTTAATAGAGAAGATGTTGGAGATTTTATCCATATTTTCACACACATC
CGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTGAAGGTAGCAAGTTGTTTCGGAAACAGGAGAAGAAATCTATATTATGGAAATGTGTAGAG
AACAAGGTTATGTCAACAATGGGCTTGACGTCCAGTGTGAGGAAGGCCTATGCCATGGTCGAGAAATTTCAGGCAGGGAAGACATCTTCTAGTTCTAACTGTGCA
CTACCCAGAAAGAAACAGAAATCTTGA
mRNA sequence
ATGGAGCTTCCATCAGTAATGTCGACGCCGGTGACCGGGTTAGTCGCCGCACTGGTGGTTATAGGCTTGACCTACTTGGTTCATAGATGGAGAAATCCTAAATGT
AATGGAGTTCTTCCTCCGGGCTCCATGGGCTTCCCTCTCATCGGAGAAACCCTTCAGCTCATTGCTTCCGGTTACACTCTTGACCTTCCTCCCTTCATCAAGAAA
AGAGTTCACAAATATGGACCCATATTCCGGACGAGTTTGGTGGGTCGATCCATTGTGGTAACAGCTGACCCTGAAATCAACAGTTTCATATACAACCAAGAAGGA
AGGACTGTGGAGCTTTGGTATCTAGACTCCATCTCCAAGGTGTTTAAGCAAGACGGAGAGGTTAAAACCACCGCCGGTGGAGCCATCCATAAGTACCTTCGAAGC
ATCACTTTGAACCACTTCGGTTCCGAAAGTCTCAAGTCCAAGCTACTGGCTGATATCCAACGATATGTTGATAAAGTTTTCACCCAATGGTCTAACCACCCCTCC
GTAGAAATGCAACGTGGAACTCTTACAATGTTATACGATTTCAATGCGTATATAATGTTCGGTTATGACCCTGAAAAGTCTAATGAGAATATAAGTGAGAGCTTA
ATCACATTAGCTGATGGCTTCATGTCTTTCCCAGTGAACGTCCCTGGAACTAAGTATAACAAGTGTCTTAAGGCACAAAAGAGGTTGGTCAACACGTTCAAGGCT
CTCGTCAAAGAGAGGCGCCAAGCCTCTGTTGCTGCTGCTCGTGGGGATTTTCTCGATCAAGCCCTTCGCGACATTGAAAACGAACAGTTTCTCACTGAAGAGTTT
GTTTCCAATTTGTTGTTTGGTGTTTTATTTGCCAGTGGCTCTATTTCTGGATCTCTTACCCTGATGTTCAAGTTACTTGCGGAAAATCCATCGGTCGTGAAGGAG
TTGACCGCTGAGCATGAGACATTCTTGAAACAGAGAAAAGATCCAAAATCTCCCATCACATGGGAGGAATACAAGTCAATGACATTTACGCTTTACGTTATATAC
GAAGTTTTTAGATTATCAAACGCAATGCCTTTTCTGTTGCGGAGAACTACAAAAGATGTGAACATAAAAGGATATACAATTCCAGCAGGGTGGACGATAATGGTT
GCCAATTCAGCTCTTCATTTGAACCCTCAAACCCACAAGGATCCCTTGGACTTCAACCCATGGCGCTGGAAGGATCATGACCAATATTCGATTTCGAAGACCTTG
CAGCCTTTTGGAGGAGGAACTCGACAATGCGCCGGAGCTGATTACACCAGGGTTTTCATGGCCATCTTCCTCCACACTCTTGTCACCAAGTATAGTTGGAAAAAG
GTGAAGGGAGGAGAAGTTTCTCGAAGTCCTATTCTCAAATTTGGAGATGGCATTCATATTGTTGAACTATTCACTTCTAGCTTTGATACCCAAATCGATGTCAAA
CTTGTATTTCCCGCCCCAATCAGCAACTTTCTCTTCATCTACAATTTGAAGTTGTGGGGATTACTGACGAGTAATCGGAGTAGTGGGTCGGGCTGTTGCAGTATG
AGCGACGGAGAAAAGAATGAAAACGATGAGTATATGAAGAAAAATACTGACTTTCGTCGGAAAAAGAAACCCACGACGGAACGGAAACGCCGGGGCCGAAGTCCG
TCTAAAAGTGAAGCAGTTGTTGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAAACAATCAGGGCATCGCTATTGGATTGGTACGACCGTAGCCGCAGG
GACCTTCCATGGAGGAGCTTGGACAAAGGGGAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACCAGAGTTCAGACCGTCGTC
CAATTTTACAACCGTTGGATGCTTAAATGGCCTACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGGTATTATAGA
CGAGCTCGTTTTCTTTTTGAGGGTGCAAAGATGATAGTCAAAGAAGGTGGTAGGTTTCCTAGAACAGTTTCTTCCCTGCGAAAAATTCCAGGAATTGGAGAATAC
ACAGCAGGGGCTATTGCCTCTATAGCGTTCGGTGAAGTGGTGCCTGTGGTCGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAACCCA
AAAGACCCAAAGTTGATCAAGCAAGTTTGGAAGGCAGCTGCTCAATTAGTTGATCTTTCCAGGCCTGGGGATTTCAATCAGGCACTCATGGAACTTGGTGCAACT
TTATGCACTCCAACAAACCCAAGCTGCTCAACATGCCCCGTGTTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCATGATAGTTCAGTTCTTGTCACAGATTAT
CCCGCTAAGGGGATAAAGATCAAACAAAGACATGATTACTCTGCTGTATGTGTGGTTGAGATATTGGAAAGTCAGGGTACACCTGAGTTAGGGCAATCTAGTAGA
TTTCTTCTTGTAAAGAGGCCTGATGAAGGTTTGCTTGCTGGTCTATGGGAGTTTCCATCTGTCTCGTTGGATGGAGAAGCTGATTTAAGCACCAGGAGAGAATCC
ATTAATAGCCTCTTGAGCAAAAACTTTGGACTTGAAGCAAAAAAGAATTTTGAAATAGTTAATAGAGAAGATGTTGGAGATTTTATCCATATTTTCACACACATC
CGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTGAAGGTAGCAAGTTGTTTCGGAAACAGGAGAAGAAATCTATATTATGGAAATGTGTAGAG
AACAAGGTTATGTCAACAATGGGCTTGACGTCCAGTGTGAGGAAGGCCTATGCCATGGTCGAGAAATTTCAGGCAGGGAAGACATCTTCTAGTTCTAACTGTGCA
CTACCCAGAAAGAAACAGAAATCTTGA
Protein sequence
MELPSVMSTPVTGLVAALVVIGLTYLVHRWRNPKCNGVLPPGSMGFPLIGETLQLIASGYTLDLPPFIKKRVHKYGPIFRTSLVGRSIVVTADPEINSFIYNQEG
RTVELWYLDSISKVFKQDGEVKTTAGGAIHKYLRSITLNHFGSESLKSKLLADIQRYVDKVFTQWSNHPSVEMQRGTLTMLYDFNAYIMFGYDPEKSNENISESL
ITLADGFMSFPVNVPGTKYNKCLKAQKRLVNTFKALVKERRQASVAAARGDFLDQALRDIENEQFLTEEFVSNLLFGVLFASGSISGSLTLMFKLLAENPSVVKE
LTAEHETFLKQRKDPKSPITWEEYKSMTFTLYVIYEVFRLSNAMPFLLRRTTKDVNIKGYTIPAGWTIMVANSALHLNPQTHKDPLDFNPWRWKDHDQYSISKTL
QPFGGGTRQCAGADYTRVFMAIFLHTLVTKYSWKKVKGGEVSRSPILKFGDGIHIVELFTSSFDTQIDVKLVFPAPISNFLFIYNLKLWGLLTSNRSSGSGCCSM
SDGEKNENDEYMKKNTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVV
QFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNP
KDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSR
FLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVE
NKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS