; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C026046 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C026046
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
Description2-oxoglutarate-dependent dioxygenase family protein isoform 1
Genome locationchr08:27262615..27265273
RNA-Seq ExpressionMELO3C026046
SyntenyMELO3C026046
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050407.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]4.2e-24897.87Show/hide
Query:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDL
        S APPDSSCRGNSCGCGRDKEHLRDRDNCSDVIF+GSFRVHLNPKEREPKSLTPLS KKCDYVEVGSDKFGISSNEPKSYHYDE LPVSRQNTRRNRIDL
Subjt:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDL

Query:  GSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
        GSKRDLKSNARSFQVERHEF NDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
Subjt:  GSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQP YKDGAKLRLRMMCLGLDWDPQTRRY+NKRVVDGNKPPDIPPPFSFLVK+ALKDAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDV+KAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST

Query:  PKFLLYHTGLRPGRLNLTFRKY
        PKFLLYHTGLRPGRLNLTFRKY
Subjt:  PKFLLYHTGLRPGRLNLTFRKY

KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia]3.9e-15362.32Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK
        M  IRT+P    P SN LRRLLF  S         LLQFQRMDSF SS    A PDSSC G+S  CG ++E L +RD+ S+VI +G   V+LN K  E +
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK

Query:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNT-RRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQS
        SL+ LS  KCD  ++ SD+ GI +N P SYH DEF PV RQNT RR+RIDLGS+R LK++  S Q+ER+E F+                     F K +S
Subjt:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNT-RRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQS

Query:  LDIGSKESVVTDHSLPFEPPFDICFP-GGGNVKHRNFWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYK
         DIGSK S+ T +  P E  FDICFP   G  KHR  W+ KD  T+K            ++RPGMVLLKHYI   EQ+NIVKT QKLGLGPGGFYQP YK
Subjt:  LDIGSKESVVTDHSLPFEPPFDICFP-GGGNVKHRNFWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYK

Query:  DGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKES
        DGAKLRL+MMCLGLDWDPQTR+Y  KRV DGNKPPD+PP F+ LV  AL DAHA IKN  + +N+EDILP+MSPDICI NFY+TSGRLGLHQDRDES+ES
Subjt:  DGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKES

Query:  LSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        L  GLPVVSFS+GN+AEFLYGD+RDV+KA K+ LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  LSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

TYJ97997.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]2.7e-24797.63Show/hide
Query:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDL
        S APPDSSCRGNSCGCGRDKEHLRDRDNCSDVIF+GSFRVHLNPKEREPKSLTPLS KKCDYVEVGSDKFGISSNEPKSYHYDE LPVSRQNTRRNRIDL
Subjt:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDL

Query:  GSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
        GSKRDLKSNARSFQVERHEF NDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHS PFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
Subjt:  GSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQP YKDGAKLRLRMMCLGLDWDPQTRRY+NKRVVDGNKPPDIPPPFSFLVK+ALKDAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDV+KAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST

Query:  PKFLLYHTGLRPGRLNLTFRKY
        PKFLLYHTGLRPGRLNLTFRKY
Subjt:  PKFLLYHTGLRPGRLNLTFRKY

XP_004149927.1 uncharacterized protein LOC101210053 isoform X1 [Cucumis sativus]8.4e-22582.84Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFP  RGF LLQFQ MDSFS+SANSHA PDSSC G+SCGCGRDKEHL DRDN SDVI +GS  VHLNPKER   
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK

Query:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL
                                 EPKSY+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF ND CQEY+SSLPIHFGKKNEVF SK QSL
Subjt:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL

Query:  DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC
        D G KESVVTD+SLPFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQP YKDGAKLRLRMMC
Subjt:  DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
        LGLDWDPQTRRY+NKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CNISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFS
Subjt:  LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS

Query:  VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        VGN AEFLYGDKR+V+KAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

XP_016903166.1 PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo]9.8e-282100Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK

Query:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL
        SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL
Subjt:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL

Query:  DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC
        DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC
Subjt:  DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
        LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
Subjt:  LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS

Query:  VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein4.1e-22582.84Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFP  RGF LLQFQ MDSFS+SANSHA PDSSC G+SCGCGRDKEHL DRDN SDVI +GS  VHLNPKER   
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK

Query:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL
                                 EPKSY+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF ND CQEY+SSLPIHFGKKNEVF SK QSL
Subjt:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL

Query:  DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC
        D G KESVVTD+SLPFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQP YKDGAKLRLRMMC
Subjt:  DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
        LGLDWDPQTRRY+NKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CNISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFS
Subjt:  LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS

Query:  VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        VGN AEFLYGDKR+V+KAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC1035021834.8e-282100Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK

Query:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL
        SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL
Subjt:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSL

Query:  DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC
        DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC
Subjt:  DIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
        LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
Subjt:  LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS

Query:  VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

A0A5A7U7Q2 2-oxoglutarate-dependent dioxygenase family protein isoform 12.0e-24897.87Show/hide
Query:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDL
        S APPDSSCRGNSCGCGRDKEHLRDRDNCSDVIF+GSFRVHLNPKEREPKSLTPLS KKCDYVEVGSDKFGISSNEPKSYHYDE LPVSRQNTRRNRIDL
Subjt:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDL

Query:  GSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
        GSKRDLKSNARSFQVERHEF NDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
Subjt:  GSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQP YKDGAKLRLRMMCLGLDWDPQTRRY+NKRVVDGNKPPDIPPPFSFLVK+ALKDAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDV+KAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST

Query:  PKFLLYHTGLRPGRLNLTFRKY
        PKFLLYHTGLRPGRLNLTFRKY
Subjt:  PKFLLYHTGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 11.3e-24797.63Show/hide
Query:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDL
        S APPDSSCRGNSCGCGRDKEHLRDRDNCSDVIF+GSFRVHLNPKEREPKSLTPLS KKCDYVEVGSDKFGISSNEPKSYHYDE LPVSRQNTRRNRIDL
Subjt:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDL

Query:  GSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
        GSKRDLKSNARSFQVERHEF NDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHS PFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
Subjt:  GSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQP YKDGAKLRLRMMCLGLDWDPQTRRY+NKRVVDGNKPPDIPPPFSFLVK+ALKDAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDV+KAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST

Query:  PKFLLYHTGLRPGRLNLTFRKY
        PKFLLYHTGLRPGRLNLTFRKY
Subjt:  PKFLLYHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC1114323187.9e-15261.9Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK
        M  IRT+P    P SN LRRLLF  S         LLQFQR+DSF SS    A PDSSC G+S  CG ++E L +RD+ S+VI +G   V+LN K  E +
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPK

Query:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNT-RRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQS
        SL+ LS  KCD  ++ SD+ GI +N P SYH DEF PV RQNT RR+RIDLGS+R LK++  S Q+ER+E F+                     F K +S
Subjt:  SLTPLSAKKCDYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNT-RRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQS

Query:  LDIGSKESVVTDHSLPFEPPFDICFP-GGGNVKHRNFWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYK
         DIGSK S+ T +  P E  FDICFP   G  K R  W+ KD  T+K            ++RPGMVLLKHYI   EQ+NIVKT QKLGLGPGGFYQP YK
Subjt:  LDIGSKESVVTDHSLPFEPPFDICFP-GGGNVKHRNFWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYK

Query:  DGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKES
        DGAKLRL+MMCLGLDWDPQTR+Y  KRV DGNKPPD+PP F+ LV  AL DAHA IKN  + +N+EDILP+MSPDICI NFY+TSGRLGLHQDRDES+ES
Subjt:  DGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKES

Query:  LSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        L  GLPVVSFS+GN+AEFLYGD+RDV+KA K+ LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  LSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog2.1e-1633.5Show/hide
Query:  PGGFYQPSYKDGAKLRLRMMCLG-LDWDPQTR--RYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGR
        P   Y+ +Y  G  + + M  LG L W    R  RY ++    G   PD+PP        AL D    + +           P   PD C+ N Y    R
Subjt:  PGGFYQPSYKDGAKLRLRMMCLG-LDWDPQTR--RYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGR

Query:  LGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S S+G+TA F  G     +    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh17.1e-1728.33Show/hide
Query:  PGMVLLKHYITPPEQINIVKTCQ-----------------KLGLGPGGFYQPSYK-DGAKL------------------RLRMMCLGLDWDPQTRRYKNK
        PG+++LK+Y++   Q+ ++K+                   +L LG    ++  Y  DG  +                  +LR + LG  +D  T+ Y   
Subjt:  PGMVLLKHYITPPEQINIVKTCQ-----------------KLGLGPGGFYQPSYK-DGAKL------------------RLRMMCLGLDWDPQTRRYKNK

Query:  RVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDV
           D +K P  P      V+  +K++  F+  K               +  I NFY+    L  H   DES+E L+  LP++S S+G    +L G +   
Subjt:  RVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDV

Query:  EKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLL
        EK   + L SGDV+I  G SR  FH V  IIP STP +LL
Subjt:  EKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB6.9e-1234.4Show/hide
Query:  NKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSII
        N C  +      P   PD C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  +  +  +++ LE GDV+++GGESR  +HG+    
Subjt:  NKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSII

Query:  PKSTPKFLLYHTGLRPGRLNLTFRK
            P    +H      R NLTFR+
Subjt:  PKSTPKFLLYHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog2.1e-1633.5Show/hide
Query:  PGGFYQPSYKDGAKLRLRMMCLG-LDWDPQTR--RYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGR
        P   Y+ +Y  G  + + M  LG L W    R  RY ++    G   PD+PP        AL D    + +           P   PD C+ N Y    R
Subjt:  PGGFYQPSYKDGAKLRLRMMCLG-LDWDPQTR--RYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGR

Query:  LGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S S+G+TA F  G     +    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB2.4e-1236.61Show/hide
Query:  SMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTG
        S  PD C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  R  +  +++ LE GD++++GGESR  +HG+        P    +H  
Subjt:  SMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTG

Query:  LRPGRLNLTFRK
            R NLTFR+
Subjt:  LRPGRLNLTFRK

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.8e-0737.35Show/hide
Query:  PDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSI
        P+  I N++     LG H D  E+  S     P+VS S+G  A FL G K   +    + L SGDV++  GE+R  FHG+  I
Subjt:  PDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein5.7e-6251.83Show/hide
Query:  LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNK
        ++RPGMVLLK+Y++   Q+ IV  C++LGLG GGFYQP ++DG  L L+MMCLG +WD QTRRY   R +DG+ PP IP  FS LV+ A+K++ + +   
Subjt:  LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNK

Query:  CNISNVEDILPSMSPDICIANFYTTSGRLGLHQ---------------------DRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGD
         N +   D +P + PDIC+ NFYT++G+LGLHQ                     D+ ESK+SL  GLP+VSFS+G++AEFLYGD++DV+KA+ + LESGD
Subjt:  CNISNVEDILPSMSPDICIANFYTTSGRLGLHQ---------------------DRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGD

Query:  VLIFGGESRHVFHGVSSI
        VLIFG  SR+VFHGV SI
Subjt:  VLIFGGESRHVFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein1.2e-7557.76Show/hide
Query:  SGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKD
        SGTV     +RPGMVLLK+Y++  +Q+ IV  C++LGLG GGFYQP Y+D AKL L+MMCLG +WDP+T RY   R  DG+  P IP  F+  V+ A+K+
Subjt:  SGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKD

Query:  AHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFH
        + +   +    +   D +P M PDICI NFY+++GRLGLHQD+DES+ S+  GLPVVSFS+G++AEFLYGD+RD +KAE + LESGDVL+FGG SR VFH
Subjt:  AHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFH

Query:  GVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        GV SI   + PK LL  T LRPGRLNLTFR+Y
Subjt:  GVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein2.9e-8256.77Show/hide
Query:  PPFDICFPGGGNVKHRNFWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWD
        PPFDIC     +V  RN   +KD          TV+    ++++RPGMVLLK ++TP  Q++IVKTC++LG+ P GFYQP Y  G+KL L+MMCLG +WD
Subjt:  PPFDICFPGGGNVKHRNFWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWD

Query:  PQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAE
        PQT+  KN  +   +K P+IP  F+ LV+ A+++AHA I  +    + E ILP MSPDICI NFY+ +GRLGLHQDRDES+ES++ GLP+VSFS+G++AE
Subjt:  PQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAE

Query:  FLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        FLYG+KRDVE+A+ V LESGDVLIFGGESR +FHGV SIIP S P  LL  + LR GRLNLTFR +
Subjt:  FLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein2.9e-8256.77Show/hide
Query:  PPFDICFPGGGNVKHRNFWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWD
        PPFDIC     +V  RN   +KD          TV+    ++++RPGMVLLK ++TP  Q++IVKTC++LG+ P GFYQP Y  G+KL L+MMCLG +WD
Subjt:  PPFDICFPGGGNVKHRNFWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWD

Query:  PQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAE
        PQT+  KN  +   +K P+IP  F+ LV+ A+++AHA I  +    + E ILP MSPDICI NFY+ +GRLGLHQDRDES+ES++ GLP+VSFS+G++AE
Subjt:  PQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAE

Query:  FLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        FLYG+KRDVE+A+ V LESGDVLIFGGESR +FHGV SIIP S P  LL  + LR GRLNLTFR +
Subjt:  FLYGDKRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTCATCCGTACACTTCCCCTTCCCCCATCGCCCTCCTCCAATCAACTTCGTCGCCTTCTATTCCCCGCTTCTTCATTTCCCGGCGCGCGCGGCTTTAGTTTGCT
TCAATTTCAGCGAATGGATTCGTTTTCCAGTTCCGCGAATAGCCATGCACCACCTGACTCTTCATGTCGTGGTAATTCTTGTGGTTGTGGGAGAGACAAGGAACATTTGC
GTGACAGAGATAATTGTTCAGATGTAATATTTTTGGGAAGCTTTCGTGTGCATCTAAATCCCAAGGAACGTGAACCGAAATCTCTAACTCCGCTTTCTGCTAAGAAATGT
GATTATGTTGAGGTGGGAAGTGATAAGTTTGGGATTTCTTCAAATGAACCGAAATCTTATCATTATGATGAGTTTCTACCTGTTTCTAGGCAAAATACTAGGAGAAACCG
GATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATTTCAAGTAGAGAGGCATGAATTTTTCAATGATTATTGTCAGGAGTATGAATCATCTCTTCCTA
TTCATTTTGGGAAGAAAAATGAAGTTTTTTTCTCAAAGCGCCAGTCCCTTGATATCGGTTCCAAAGAATCTGTAGTTACGGACCATTCGCTTCCCTTTGAACCACCATTT
GATATTTGTTTCCCTGGAGGAGGTAATGTGAAACATAGAAATTTTTGGCGAGTTAAAGACAGTGGCACTGTGAAAGATTATAGACTGCTGAGGCCTGGAATGGTTTTACT
GAAGCACTATATCACTCCACCTGAACAGATCAATATAGTGAAAACTTGTCAAAAGCTTGGTCTTGGGCCAGGGGGATTTTACCAGCCTAGTTATAAAGATGGAGCAAAAC
TTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGATATAAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCCATTT
TCATTTCTAGTTAAAAGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAAATGCAATATAAGCAATGTAGAAGACATTCTTCCATCAATGTCTCCTGACATATGCAT
TGCGAACTTTTACACAACGAGTGGAAGATTGGGTCTGCATCAGGACCGGGATGAAAGCAAAGAGAGTCTTTCCAGCGGACTACCAGTCGTTTCCTTTTCTGTAGGCAATA
CAGCAGAATTCTTGTATGGAGATAAAAGAGATGTGGAAAAAGCAGAGAAGGTTGAACTGGAATCGGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACACGTATTTCAT
GGAGTGTCTTCAATCATCCCAAAATCGACACCTAAGTTTTTGCTTTATCATACTGGTCTGCGTCCTGGTCGTCTAAATCTTACCTTTAGAAAGTATTAA
mRNA sequenceShow/hide mRNA sequence
AGCATAAGAAATAATCACACACACAGCTTTGAAAATGCCCGAAATCCTTACTCCTTCATCTTCCTCTTCCTCTTCCTCTTCCCGGTGCAGTCTCCTCGGCCAACGCGGCT
GCCACGACCGAAACTTGCGGATTACTTCTACAGTATCCTCATTATGTCAACCCCATCCTCCTCCGACCCTTCCACTTCCTTCTTCTTTTCTGATTTTTTCATTCATCCTC
AATTTCTGAACCCTATAAAACCATCTCTAGTATCCAATTTTCTCAATGTTTTTCATCCGTACACTTCCCCTTCCCCCATCGCCCTCCTCCAATCAACTTCGTCGCCTTCT
ATTCCCCGCTTCTTCATTTCCCGGCGCGCGCGGCTTTAGTTTGCTTCAATTTCAGCGAATGGATTCGTTTTCCAGTTCCGCGAATAGCCATGCACCACCTGACTCTTCAT
GTCGTGGTAATTCTTGTGGTTGTGGGAGAGACAAGGAACATTTGCGTGACAGAGATAATTGTTCAGATGTAATATTTTTGGGAAGCTTTCGTGTGCATCTAAATCCCAAG
GAACGTGAACCGAAATCTCTAACTCCGCTTTCTGCTAAGAAATGTGATTATGTTGAGGTGGGAAGTGATAAGTTTGGGATTTCTTCAAATGAACCGAAATCTTATCATTA
TGATGAGTTTCTACCTGTTTCTAGGCAAAATACTAGGAGAAACCGGATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATTTCAAGTAGAGAGGCATG
AATTTTTCAATGATTATTGTCAGGAGTATGAATCATCTCTTCCTATTCATTTTGGGAAGAAAAATGAAGTTTTTTTCTCAAAGCGCCAGTCCCTTGATATCGGTTCCAAA
GAATCTGTAGTTACGGACCATTCGCTTCCCTTTGAACCACCATTTGATATTTGTTTCCCTGGAGGAGGTAATGTGAAACATAGAAATTTTTGGCGAGTTAAAGACAGTGG
CACTGTGAAAGATTATAGACTGCTGAGGCCTGGAATGGTTTTACTGAAGCACTATATCACTCCACCTGAACAGATCAATATAGTGAAAACTTGTCAAAAGCTTGGTCTTG
GGCCAGGGGGATTTTACCAGCCTAGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGATATAAAAACAAA
CGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCCATTTTCATTTCTAGTTAAAAGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAAATGCAATATAAG
CAATGTAGAAGACATTCTTCCATCAATGTCTCCTGACATATGCATTGCGAACTTTTACACAACGAGTGGAAGATTGGGTCTGCATCAGGACCGGGATGAAAGCAAAGAGA
GTCTTTCCAGCGGACTACCAGTCGTTTCCTTTTCTGTAGGCAATACAGCAGAATTCTTGTATGGAGATAAAAGAGATGTGGAAAAAGCAGAGAAGGTTGAACTGGAATCG
GGTGATGTTCTAATTTTTGGTGGCGAATCTAGACACGTATTTCATGGAGTGTCTTCAATCATCCCAAAATCGACACCTAAGTTTTTGCTTTATCATACTGGTCTGCGTCC
TGGTCGTCTAAATCTTACCTTTAGAAAGTATTAAAACACTACCTCCTTGTTTATGCTATACATATGAATGGCTGTTGTTTTTCATTAGATGTTC
Protein sequenceShow/hide protein sequence
MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKC
DYVEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPF
DICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPF
SFLVKSALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEKVELESGDVLIFGGESRHVFH
GVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY