CuGenDBv2

PI0005802 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0005802
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Fe2OG dioxygenase domain-containing protein
Genome location: chr08:14583372..14586424
RNA-Seq Expression: PI0005802
Synteny: PI0005802
Gene Ontology terms:
GO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domains:
IPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
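
The genome location field above packs the chromosome name and the coordinate range into a single string. The following is a minimal Python sketch for splitting it apart. The function names `parse_location` and `span_length` are hypothetical (they are not part of CuGenDB), and the 1-based, inclusive coordinate convention is an assumption, since the page does not state it.

```python
import re


# Hypothetical helper, not a CuGenDB API.
def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumes 1-based, inclusive coordinates (not stated on the page).
    return end - start + 1


chrom, start, end = parse_location("chr08:14583372..14586424")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the listed range corresponds to a genomic span of 3,053 bp.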


Homology
GenBank top hits (e value, % identity, alignment)
KAA0050407.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa] (e value: 1.6e-223; % identity: 89.57)
Query:  SHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELRSLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDL
        S A PDSSC G SCGCG +KEHL + D  SDVIFVGS  V+LNPKERE +SLTPLSV KCDYVE+GS ++GISSNEPKS HYDE LPVSRQNT+R+RIDL
Subjt:  SHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELRSLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDL

Query:  GSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLL
        GSKRDLKS+ARSFQVER+EFLND CQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTD+SL FEPPFDICFPGGGN+K R  WRVKDSGTVKDYRLL
Subjt:  GSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPP+IPP FSFLVK AL DAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTT+GRLGLHQDRDESKESL SGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGG+SRH++HGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKST

Query:  PKFLLFHTGLRPGRLNLTFRKY
        PKFLL+HTGLRPGRLNLTFRKY
Subjt:  PKFLLFHTGLRPGRLNLTFRKY

KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia] (e value: 6.2e-159; % identity: 64.26)
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR
        M  IRT+P    P SN LRRLLF  S+        LLQFQ MDSF SS    ALPDSSC G S  CGGN+E LHN D+NS+VI +G +PV LN K  E  
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR

Query:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTK-RSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVF-FSKRQ
        SL+ LSV KCD  +L S + GI +N P S H DEF PV RQNTK RSRIDLGS+R LK+S  S Q+ER                      NE F F K +
Subjt:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTK-RSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVF-FSKRQ

Query:  SLDIGSKESVVTDNSLSFEPPFDICFP-GGGNMKDRTSWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGY
        S DIGSK S+ T N    E  FDICFP   G  K R SW+ KD  T+K            ++RPGMVLLKHYI   EQ+NIVKT QKLGLGPGGFYQPGY
Subjt:  SLDIGSKESVVTDNSLSFEPPFDICFP-GGGNMKDRTSWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGY

Query:  KDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKE
        KDGAKLRL+MMCLGLDWDPQTR+Y  KRV DGNKPP++PP+F+ LV KALNDAHA IKN  + +N+EDILP+MSPDICI NFY+T+GRLGLHQDRDES+E
Subjt:  KDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKE

Query:  SLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
        SLV GLPVVSFS+GN+AEFLYGD+RDVDKA K+ LESGDVLIFGG+SRHI+HGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  SLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY

TYJ97997.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa] (e value: 1.0e-222; % identity: 89.34)
Query:  SHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELRSLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDL
        S A PDSSC G SCGCG +KEHL + D  SDVIFVGS  V+LNPKERE +SLTPLSV KCDYVE+GS ++GISSNEPKS HYDE LPVSRQNT+R+RIDL
Subjt:  SHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELRSLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDL

Query:  GSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLL
        GSKRDLKS+ARSFQVER+EFLND CQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTD+S  FEPPFDICFPGGGN+K R  WRVKDSGTVKDYRLL
Subjt:  GSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPP+IPP FSFLVK AL DAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTT+GRLGLHQDRDESKESL SGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGG+SRH++HGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKST

Query:  PKFLLFHTGLRPGRLNLTFRKY
        PKFLL+HTGLRPGRLNLTFRKY
Subjt:  PKFLLFHTGLRPGRLNLTFRKY

XP_004149927.1 uncharacterized protein LOC101210053 isoform X1 [Cucumis sativus] (e value: 2.1e-223; % identity: 81.99)
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR
        MFFIRTLPLPPSPSSNQLRRLLFPAS FP +RGF LLQFQ MDSFS+SANSHALPDSSCCG SCGCG +KEHLH+ D +SDVI VGS+PV+LNPKER   
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR

Query:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSL
                                 EPKS +YDE LPV RQNT+RSRIDLGSKRDLKS+ARS+QVER EFLNDSCQEY+SSLPIHFGKKNEVF SK QSL
Subjt:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSL

Query:  DIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC
        D G KESVVTDNSL FEPPFDIC PGGGN+K R  + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKDGAKLRLRMMC
Subjt:  DIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFS
        LGLDWDPQTRRYENKRVVDGNKPP+IPPQF+FLVK+AL DAHAFIKN CNISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFS

Query:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
        VGN AEFLYGDKR+VDKAE VELESGDVLIFGG+SRHI+HGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY

XP_016903166.1 PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo] (e value: 2.9e-249; % identity: 88.98)
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR
        MFFIRTLPLPPSPSSNQLRRLLFPAS FPG RGF LLQFQ MDSFSSSANSHA PDSSC G SCGCG +KEHL + D  SDVIF+GS  V+LNPKERE +
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR

Query:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSL
        SLTPLS  KCDYVE+GS ++GISSNEPKS HYDEFLPVSRQNT+R+RIDLGSKRDLKS+ARSFQVER+EF ND CQEYESSLPIHFGKKNEVFFSKRQSL
Subjt:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSL

Query:  DIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC
        DIGSKESVVTD+SL FEPPFDICFPGGGN+K R  WRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQP YKDGAKLRLRMMC
Subjt:  DIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFS
        LGLDWDPQTRRY+NKRVVDGNKPP+IPP FSFLVK AL DAHAFIKNKCNISNVEDILPSMSPDICIANFYTT+GRLGLHQDRDESKESL SGLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFS

Query:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
        VGNTAEFLYGDKRDV+KAEKVELESGDVLIFGG+SRH++HGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein (e value: 1.0e-223; % identity: 81.99)
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR
        MFFIRTLPLPPSPSSNQLRRLLFPAS FP +RGF LLQFQ MDSFS+SANSHALPDSSCCG SCGCG +KEHLH+ D +SDVI VGS+PV+LNPKER   
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR

Query:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSL
                                 EPKS +YDE LPV RQNT+RSRIDLGSKRDLKS+ARS+QVER EFLNDSCQEY+SSLPIHFGKKNEVF SK QSL
Subjt:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSL

Query:  DIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC
        D G KESVVTDNSL FEPPFDIC PGGGN+K R  + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKDGAKLRLRMMC
Subjt:  DIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFS
        LGLDWDPQTRRYENKRVVDGNKPP+IPPQF+FLVK+AL DAHAFIKN CNISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFS

Query:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
        VGN AEFLYGDKR+VDKAE VELESGDVLIFGG+SRHI+HGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC103502183 (e value: 1.4e-249; % identity: 88.98)
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR
        MFFIRTLPLPPSPSSNQLRRLLFPAS FPG RGF LLQFQ MDSFSSSANSHA PDSSC G SCGCG +KEHL + D  SDVIF+GS  V+LNPKERE +
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR

Query:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSL
        SLTPLS  KCDYVE+GS ++GISSNEPKS HYDEFLPVSRQNT+R+RIDLGSKRDLKS+ARSFQVER+EF ND CQEYESSLPIHFGKKNEVFFSKRQSL
Subjt:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSL

Query:  DIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC
        DIGSKESVVTD+SL FEPPFDICFPGGGN+K R  WRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQP YKDGAKLRLRMMC
Subjt:  DIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFS
        LGLDWDPQTRRY+NKRVVDGNKPP+IPP FSFLVK AL DAHAFIKNKCNISNVEDILPSMSPDICIANFYTT+GRLGLHQDRDESKESL SGLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFS

Query:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
        VGNTAEFLYGDKRDV+KAEKVELESGDVLIFGG+SRH++HGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY

A0A5A7U7Q2 2-oxoglutarate-dependent dioxygenase family protein isoform 1 (e value: 7.7e-224; % identity: 89.57)
Query:  SHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELRSLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDL
        S A PDSSC G SCGCG +KEHL + D  SDVIFVGS  V+LNPKERE +SLTPLSV KCDYVE+GS ++GISSNEPKS HYDE LPVSRQNT+R+RIDL
Subjt:  SHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELRSLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDL

Query:  GSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLL
        GSKRDLKS+ARSFQVER+EFLND CQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTD+SL FEPPFDICFPGGGN+K R  WRVKDSGTVKDYRLL
Subjt:  GSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPP+IPP FSFLVK AL DAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTT+GRLGLHQDRDESKESL SGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGG+SRH++HGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKST

Query:  PKFLLFHTGLRPGRLNLTFRKY
        PKFLL+HTGLRPGRLNLTFRKY
Subjt:  PKFLLFHTGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 1 (e value: 5.0e-223; % identity: 89.34)
Query:  SHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELRSLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDL
        S A PDSSC G SCGCG +KEHL + D  SDVIFVGS  V+LNPKERE +SLTPLSV KCDYVE+GS ++GISSNEPKS HYDE LPVSRQNT+R+RIDL
Subjt:  SHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELRSLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDL

Query:  GSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLL
        GSKRDLKS+ARSFQVER+EFLND CQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTD+S  FEPPFDICFPGGGN+K R  WRVKDSGTVKDYRLL
Subjt:  GSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPP+IPP FSFLVK AL DAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTT+GRLGLHQDRDESKESL SGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGG+SRH++HGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKST

Query:  PKFLLFHTGLRPGRLNLTFRKY
        PKFLL+HTGLRPGRLNLTFRKY
Subjt:  PKFLLFHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC111432318 (e value: 8.7e-159; % identity: 64.05)
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR
        M  IRT+P    P SN LRRLLF  S+        LLQFQ +DSF SS    ALPDSSC G S  CGGN+E LHN D+NS+VI +G +PV LN K  E  
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELR

Query:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTK-RSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVF-FSKRQ
        SL+ LSV KCD  +L S + GI +N P S H DEF PV RQNTK RSRIDLGS+R LK+S  S Q+ER                      NE F F K +
Subjt:  SLTPLSVNKCDYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTK-RSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVF-FSKRQ

Query:  SLDIGSKESVVTDNSLSFEPPFDICFP-GGGNMKDRTSWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGY
        S DIGSK S+ T N    E  FDICFP   G  K R SW+ KD  T+K            ++RPGMVLLKHYI   EQ+NIVKT QKLGLGPGGFYQPGY
Subjt:  SLDIGSKESVVTDNSLSFEPPFDICFP-GGGNMKDRTSWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGY

Query:  KDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKE
        KDGAKLRL+MMCLGLDWDPQTR+Y  KRV DGNKPP++PP+F+ LV KALNDAHA IKN  + +N+EDILP+MSPDICI NFY+T+GRLGLHQDRDES+E
Subjt:  KDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKE

Query:  SLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
        SLV GLPVVSFS+GN+AEFLYGD+RDVDKA K+ LESGDVLIFGG+SRHI+HGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  SLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY

SwissProt top hits (e value, % identity, alignment)
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog (e value: 7.9e-16; % identity: 32.99)
Query:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGR
        P   Y+  Y  G  + + M  LG L W    R  RY ++    G   P++PP        AL D    + +           P   PD C+ N Y    R
Subjt:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGR

Query:  LGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S S+G+TA F  G     D    + L SGDV    G +R  +HGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh1 (e value: 3.5e-16; % identity: 27.5)
Query:  PGMVLLKHYITPPEQINIVKTCQ-----------------KLGLGPGGFYQPGYK-DGAKL------------------RLRMMCLGLDWDPQTRRYENK
        PG+++LK+Y++   Q+ ++K+                   +L LG    ++  Y  DG  +                  +LR + LG  +D  T+ Y   
Subjt:  PGMVLLKHYITPPEQINIVKTCQ-----------------KLGLGPGGFYQPGYK-DGAKL------------------RLRMMCLGLDWDPQTRRYENK

Query:  RVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDV
           D +K P  P      V+K + ++  F+  K               +  I NFY+    L  H   DES+E L   LP++S S+G    +L G +   
Subjt:  RVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDV

Query:  DKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLL
        +K   + L SGDV+I  G SR  +H V  IIP STP +LL
Subjt:  DKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB (e value: 2.8e-13; % identity: 33.53)
Query:  DPQTRRYENKRVVDGNKP-PNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNT
        DPQT           NKP P +P  F  L ++A   A                 P   PD C+ N Y    +L LHQD+DE         P+VS S+G  
Subjt:  DPQTRRYENKRVVDGNKP-PNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNT

Query:  AEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRK
        A F +G  +  D  +++ LE GDV+++GG+SR  YHG+        P    FH      R NLTFR+
Subjt:  AEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog (e value: 7.9e-16; % identity: 32.99)
Query:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGR
        P   Y+  Y  G  + + M  LG L W    R  RY ++    G   P++PP        AL D    + +           P   PD C+ N Y    R
Subjt:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGR

Query:  LGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S S+G+TA F  G     D    + L SGDV    G +R  +HGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB (e value: 2.1e-13; % identity: 38.39)
Query:  SMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTG
        S  PD C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  R  D  +++ LE GD++++GG+SR  YHG+        P    FH  
Subjt:  SMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTG

Query:  LRPGRLNLTFRK
            R NLTFR+
Subjt:  LRPGRLNLTFRK

Arabidopsis top hits (e value, % identity, alignment)
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein (e value: 2.8e-08; % identity: 32.24)
Query:  RLRMMCLGLDWDPQTRRYENKRVVDGNKP-PNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSG
        +LR   LGL +D   R Y      D + P  NIP     L K      HA I     + + E+      P+  I N++     LG H D  E+  S    
Subjt:  RLRMMCLGLDWDPQTRRYENKRVVDGNKP-PNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSG

Query:  LPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSI
         P+VS S+G  A FL G K   D    + L SGDV++  G++R  +HG+  I
Subjt:  LPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein (e value: 6.1e-64; % identity: 51.53)
Query:  LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNK
        ++RPGMVLLK+Y++   Q+ IV  C++LGLG GGFYQPG++DG  L L+MMCLG +WD QTRRY   R +DG+ PP IP +FS LV+KA+ ++ + +   
Subjt:  LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNK

Query:  CNISNVEDILPSMSPDICIANFYTTTGRLGLHQ---------------------DRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGD
         N +   D +P + PDIC+ NFYT+TG+LGLHQ                     D+ ESK+SL  GLP+VSFS+G++AEFLYGD++DVDKA+ + LESGD
Subjt:  CNISNVEDILPSMSPDICIANFYTTTGRLGLHQ---------------------DRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGD

Query:  VLIFGGDSRHIYHGVSSIIPKSTPKFLLF
        VLIFG  SR+++HGV S I K  P  L F
Subjt:  VLIFGGDSRHIYHGVSSIIPKSTPKFLLF

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein (e value: 3.7e-77; % identity: 58.19)
Query:  SGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALND
        SGTV     +RPGMVLLK+Y++  +Q+ IV  C++LGLG GGFYQPGY+D AKL L+MMCLG +WDP+T RY   R  DG+  P IP +F+  V+KA+ +
Subjt:  SGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALND

Query:  AHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYH
        + +   +    +   D +P M PDICI NFY++TGRLGLHQD+DES+ S+  GLPVVSFS+G++AEFLYGD+RD DKAE + LESGDVL+FGG SR ++H
Subjt:  AHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYH

Query:  GVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
        GV SI   + PK LL  T LRPGRLNLTFR+Y
Subjt:  GVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein (e value: 5.0e-82; % identity: 52.96)
Query:  RQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQ
        R   + G K    +   +   PPFDIC     ++ +R    +KD          TV+    ++++RPGMVLLK ++TP  Q++IVKTC++LG+ P GFYQ
Subjt:  RQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQ

Query:  PGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDE
        PGY  G+KL L+MMCLG +WDPQT+  +N  +   +K P IP  F+ LV+KA+ +AHA I  +    + E ILP MSPDICI NFY+ TGRLGLHQDRDE
Subjt:  PGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDE

Query:  SKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
        S+ES+  GLP+VSFS+G++AEFLYG+KRDV++A+ V LESGDVLIFGG+SR I+HGV SIIP S P  LL  + LR GRLNLTFR +
Subjt:  SKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein (e value: 5.0e-82; % identity: 52.96)
Query:  RQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQ
        R   + G K    +   +   PPFDIC     ++ +R    +KD          TV+    ++++RPGMVLLK ++TP  Q++IVKTC++LG+ P GFYQ
Subjt:  RQSLDIGSKESVVTDNSLSFEPPFDICFPGGGNMKDRTSWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQ

Query:  PGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDE
        PGY  G+KL L+MMCLG +WDPQT+  +N  +   +K P IP  F+ LV+KA+ +AHA I  +    + E ILP MSPDICI NFY+ TGRLGLHQDRDE
Subjt:  PGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQFSFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDE

Query:  SKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
        S+ES+  GLP+VSFS+G++AEFLYG+KRDV++A+ V LESGDVLIFGG+SR I+HGV SIIP S P  LL  + LR GRLNLTFR +
Subjt:  SKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYHGVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
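
In the BLAST-style alignments above, the unlabeled middle row shows a letter where the two sequences are identical, a `+` for a conservative substitution, and a blank for a mismatch or gap. As a rough illustration, the Python sketch below counts identities in a single alignment block from the query row and that middle row. The function name `block_identity` is hypothetical, and the numbers it returns describe one block only; the % identity values listed with each hit are computed over the whole alignment, so they will generally differ.

```python
# Hypothetical helper for one alignment block; not part of any BLAST toolkit.
def block_identity(query_row, midline):
    """Return (identical_columns, aligned_columns) for one alignment block."""
    # Gap columns ('-') in the query row still count toward the aligned length.
    aligned = len(query_row)
    # Trailing blanks of the midline may have been stripped; pad to full width.
    midline = midline.ljust(aligned)[:aligned]
    # Letters mark identical residues; '+' and blanks are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, aligned


# Final block of the first GenBank hit shown above.
identical, aligned = block_identity(
    "PKFLLFHTGLRPGRLNLTFRKY",
    "PKFLL+HTGLRPGRLNLTFRKY",
)
print(identical, aligned, round(100.0 * identical / aligned, 2))
```

Summing the two counts over every block of a hit, then dividing, gives an estimate comparable to the per-hit % identity column.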


Sequences
CDS sequence
ATGTTTTTCATCCGTACACTTCCCCTTCCCCCGTCGCCGTCGTCCAATCAACTTCGTCGCCTTTTATTCCCCGCTTCTCAATTTCCCGGCGTGCGTGGCTTTCCTTTGCT
TCAATTTCAACTAATGGATTCGTTTTCCAGTTCAGCAAATAGTCATGCACTACCTGACTCTTCATGTTGCGGTCGTTCTTGTGGTTGTGGGGGAAACAAGGAACATTTGC
ATAACACAGATTATAATTCAGATGTGATATTTGTGGGAAGCGTTCCTGTGTATCTAAATCCCAAGGAACGTGAACTGAGATCTTTAACTCCGTTATCTGTTAATAAATGT
GATTATGTTGAATTGGGAAGTGGTAGGTATGGGATTTCTTCAAATGAACCGAAATCTTCTCATTATGATGAGTTTCTACCTGTTTCAAGACAAAATACTAAAAGAAGCCG
AATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAGTGCAAGATCATTTCAAGTAGAGAGGTATGAATTTTTGAACGATTCTTGTCAGGAGTATGAATCATCTCTTCCTA
TTCATTTTGGGAAGAAAAATGAAGTTTTTTTCTCAAAGCGCCAGTCCCTTGATATCGGTTCCAAGGAATCTGTAGTTACGGACAATTCGCTTTCCTTTGAACCACCATTT
GATATTTGTTTCCCTGGAGGAGGTAATATGAAAGATAGAACTTCTTGGCGAGTTAAAGACAGTGGCACTGTGAAAGATTATAGACTGCTGAGGCCTGGAATGGTTTTACT
GAAGCACTACATTACTCCACCTGAACAGATCAATATAGTGAAAACATGTCAAAAGCTTGGTCTTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAAC
TTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGATATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAAATATACCTCCTCAATTT
TCATTTCTAGTTAAAAAAGCACTTAACGATGCACATGCCTTCATCAAGAACAAATGCAATATAAGTAATGTAGAAGACATTCTTCCATCAATGTCTCCAGACATATGCAT
TGCGAACTTCTACACAACGACTGGAAGATTGGGTCTGCATCAGGATCGCGATGAAAGCAAAGAGAGTCTTGTCAGCGGACTACCGGTCGTTTCCTTTTCTGTAGGCAATA
CAGCTGAATTCTTGTATGGAGACAAAAGAGATGTGGATAAAGCAGAGAAGGTTGAACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGACTCTAGACATATATATCAT
GGAGTATCTTCAATCATACCAAAATCCACACCTAAGTTTTTGCTTTTTCATACTGGTCTGCGCCCCGGTCGTCTAAATCTTACATTTCGAAAGTATTAA
mRNA sequence
TAGCATTTAAAAATGAATATATAGAATTAGTCCAAAAAAAAAGTAAAAATTGCAATAATGTTTAGTTTTATGAAATATAGGTTTCATAATTTCAATGAATATCATCAATT
TAGTCTCTTTATTTCAGCACAAAAAATGATCACACACACAGCTGTGAAAATGGCCGAAATCCTTACTCCTTCCTCTTCCCGGTGCAGTCTCCTCGGCCAACGCGGCGGCG
ACGACCGAAACTTGCGGATTAATTCTACAATATCCTCATTATGTCAATCCCATACTCCTCCGACGCTTCCATTTCCTTCTCCTTTTCCGAATTTTTCTTTCATCCTCAAT
TTCTGAACCCTATAAAGTATCTCTCTACCATCCAATTTTCCCAATGTTTTTCATCCGTACACTTCCCCTTCCCCCGTCGCCGTCGTCCAATCAACTTCGTCGCCTTTTAT
TCCCCGCTTCTCAATTTCCCGGCGTGCGTGGCTTTCCTTTGCTTCAATTTCAACTAATGGATTCGTTTTCCAGTTCAGCAAATAGTCATGCACTACCTGACTCTTCATGT
TGCGGTCGTTCTTGTGGTTGTGGGGGAAACAAGGAACATTTGCATAACACAGATTATAATTCAGATGTGATATTTGTGGGAAGCGTTCCTGTGTATCTAAATCCCAAGGA
ACGTGAACTGAGATCTTTAACTCCGTTATCTGTTAATAAATGTGATTATGTTGAATTGGGAAGTGGTAGGTATGGGATTTCTTCAAATGAACCGAAATCTTCTCATTATG
ATGAGTTTCTACCTGTTTCAAGACAAAATACTAAAAGAAGCCGAATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAGTGCAAGATCATTTCAAGTAGAGAGGTATGAA
TTTTTGAACGATTCTTGTCAGGAGTATGAATCATCTCTTCCTATTCATTTTGGGAAGAAAAATGAAGTTTTTTTCTCAAAGCGCCAGTCCCTTGATATCGGTTCCAAGGA
ATCTGTAGTTACGGACAATTCGCTTTCCTTTGAACCACCATTTGATATTTGTTTCCCTGGAGGAGGTAATATGAAAGATAGAACTTCTTGGCGAGTTAAAGACAGTGGCA
CTGTGAAAGATTATAGACTGCTGAGGCCTGGAATGGTTTTACTGAAGCACTACATTACTCCACCTGAACAGATCAATATAGTGAAAACATGTCAAAAGCTTGGTCTTGGC
CCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGATATGAAAACAAACG
GGTTGTGGATGGTAATAAACCACCAAATATACCTCCTCAATTTTCATTTCTAGTTAAAAAAGCACTTAACGATGCACATGCCTTCATCAAGAACAAATGCAATATAAGTA
ATGTAGAAGACATTCTTCCATCAATGTCTCCAGACATATGCATTGCGAACTTCTACACAACGACTGGAAGATTGGGTCTGCATCAGGATCGCGATGAAAGCAAAGAGAGT
CTTGTCAGCGGACTACCGGTCGTTTCCTTTTCTGTAGGCAATACAGCTGAATTCTTGTATGGAGACAAAAGAGATGTGGATAAAGCAGAGAAGGTTGAACTGGAATCAGG
TGATGTTCTAATTTTTGGTGGCGACTCTAGACATATATATCATGGAGTATCTTCAATCATACCAAAATCCACACCTAAGTTTTTGCTTTTTCATACTGGTCTGCGCCCCG
GTCGTCTAAATCTTACATTTCGAAAGTATTAAAACACTACCTCCTTGTTTATGCTATACATGTGAATGGCTGTTGTTATTCATTAGATGTTCATTTATGGAATCTAGTAA
ATCTATAATGTAATTATTGTCTGTTTCTATTTCATTTACTTTGGATGTTCTTTGTTCAGTACTTTACAGTTAGCGTACGTGAATGATATAAAGTTTTAATTTCAATTTCG
TATTTAATAGGTGATTGTATTTAGTATACTTTATTTTATCTCAAATTCTCAGTACACTGTTAAATATCTCTTTTACATGCTCAATATATTTTTAGTTCTCT
Protein sequence
MFFIRTLPLPPSPSSNQLRRLLFPASQFPGVRGFPLLQFQLMDSFSSSANSHALPDSSCCGRSCGCGGNKEHLHNTDYNSDVIFVGSVPVYLNPKERELRSLTPLSVNKC
DYVELGSGRYGISSNEPKSSHYDEFLPVSRQNTKRSRIDLGSKRDLKSSARSFQVERYEFLNDSCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDNSLSFEPPF
DICFPGGGNMKDRTSWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPNIPPQF
SFLVKKALNDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTTGRLGLHQDRDESKESLVSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGDSRHIYH
GVSSIIPKSTPKFLLFHTGLRPGRLNLTFRKY
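
The protein sequence above should correspond to a translation of the CDS sequence. The Python sketch below shows one way to check that relationship. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for nuclear plant genes but is not stated on this page, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Using this table here is an assumption.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


# Hypothetical helper; raises KeyError on characters other than A, C, G, T.
def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS sequence listed above.
print(translate("ATGTTTTTCATCCGTACA"))
print(translate("CGAAAGTATTAA"))
```

To check the full record, join the CDS lines into a single string, pass it to `translate`, and compare the result with the joined protein lines.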