CuGenDBv2

IVF0003010 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0003010
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: 2-oxoglutarate-dependent dioxygenase family protein isoform 1
Genome location: chr08:25296681..25299015
RNA-Seq Expression: IVF0003010
Synteny: IVF0003010
Gene Ontology terms:
GO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domains:
IPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
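
The genome location in this record has the form chromosome:start..end. A minimal Python sketch for splitting it into parts is given here. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not confirmed by this page; `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Span length, ASSUMING 1-based inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr08:25296681..25299015")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2335 positions on chr08.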


Homology
GenBank top hits | e value | % identity | Alignment
KAA0050407.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa] | 3.80e-306 | 96.21
Query:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL
        S APPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL
Subjt:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL

Query:  GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-----------SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
        GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF            +S +TDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
Subjt:  GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-----------SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST

Query:  PKFLLYHTGLRPGRLNLTFRKY
        PKFLLYHTGLRPGRLNLTFRKY
Subjt:  PKFLLYHTGLRPGRLNLTFRKY

KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia] | 1.66e-188 | 62.45
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK
        M  IRT+P    P SN LRRLLF  S         LLQFQRMDSF SSA     PDSSC G+SCG   ++E L +RD+ S+VI +G   V+LN K  E +
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK

Query:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNT-RRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKN--EVFSQSA
        SL+ LSV KCD  ++ SD+ GI +N P SYH DE  PV RQNT RR+RIDLGS+R LK++  S Q+ER+E             P  F K    ++ S+++
Subjt:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNT-RRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKN--EVFSQSA

Query:  ITDHSLPFEPPFDICFPGG-GNVKHRNFWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRM
        +   +LP    FDICFP   G  KHR  W+ KD  T+K            ++RPGMVLLKHYI   EQ+NIVKT QKLGLGPGGFYQPGYKDGAKLRL+M
Subjt:  ITDHSLPFEPPFDICFPGG-GNVKHRNFWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRM

Query:  MCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVS
        MCLGLDWDPQTR+Y  KRV DGNKPPD+PP F+ LV  AL DAHA IKN  + +N+EDILP+MSPDICI NFY+TSGRLGLHQDRDES+ESL  GLPVVS
Subjt:  MCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVS

Query:  FSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        FS+GN+AEFLYGD+RDVDKA K+ LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  FSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

TYJ97997.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa] | 4.42e-305 | 95.97
Query:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL
        S APPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL
Subjt:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL

Query:  GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-----------SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
        GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF            +S +TDHS PFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
Subjt:  GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-----------SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST

Query:  PKFLLYHTGLRPGRLNLTFRKY
        PKFLLYHTGLRPGRLNLTFRKY
Subjt:  PKFLLYHTGLRPGRLNLTFRKY

XP_004149927.1 uncharacterized protein LOC101210053 isoform X1 [Cucumis sativus] | 5.65e-280 | 81.57
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFP  RGF LLQFQ MDSFS+SANSHA PDSSC G+SCGCGRDKEHL DRDN SDVI VGS  VHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK

Query:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-------
        S                            Y+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EFLND CQEY+SSLPIHFGKKNEVF       
Subjt:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-------

Query:  ----SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC
             +S +TD+SLPFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKDGAKLRLRMMC
Subjt:  ----SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
        LGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CNISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS

Query:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        VGN AEFLYGDKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

XP_016903166.1 PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo] | 0.0 | 95.13
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIF+GSFRVHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK

Query:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-------
        SLTPLS KKCDYVEVGSDKFGISSNEPKSYHYDE LPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEF NDYCQEYESSLPIHFGKKNEVF       
Subjt:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-------

Query:  ----SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC
             +S +TDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQP YKDGAKLRLRMMC
Subjt:  ----SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
        LGLDWDPQTRRY+NKRVVDGNKPPDIPPPFSFLVK+ALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS

Query:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        VGNTAEFLYGDKRDV+KAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein | 1.3e-220 | 81.57
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFP  RGF LLQFQ MDSFS+SANSHA PDSSC G+SCGCGRDKEHL DRDN SDVI VGS  VHLNPKER   
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK

Query:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-------
                                 EPKSY+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EFLND CQEY+SSLPIHFGKKNEVF       
Subjt:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-------

Query:  ----SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC
             +S +TD+SLPFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKDGAKLRLRMMC
Subjt:  ----SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
        LGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CNISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS

Query:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        VGN AEFLYGDKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC103502183 | 3.8e-268 | 95.13
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIF+GSFRVHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK

Query:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-------
        SLTPLS KKCDYVEVGSDKFGISSNEPKSYHYDE LPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEF NDYCQEYESSLPIHFGKKNEVF       
Subjt:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-------

Query:  ----SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC
             +S +TDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQP YKDGAKLRLRMMC
Subjt:  ----SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
        LGLDWDPQTRRY+NKRVVDGNKPPDIPPPFSFLVK+ALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFS

Query:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        VGNTAEFLYGDKRDV+KAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

A0A5A7U7Q2 2-oxoglutarate-dependent dioxygenase family protein isoform 1 | 2.5e-243 | 96.21
Query:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL
        S APPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL
Subjt:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL

Query:  GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-----------SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
        GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF            +S +TDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
Subjt:  GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-----------SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST

Query:  PKFLLYHTGLRPGRLNLTFRKY
        PKFLLYHTGLRPGRLNLTFRKY
Subjt:  PKFLLYHTGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 1 | 1.6e-242 | 95.97
Query:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL
        S APPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL
Subjt:  SHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDL

Query:  GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-----------SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
        GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF            +S +TDHS PFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL
Subjt:  GSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVF-----------SQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKDSGTVKDYRLL

Query:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN
        RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN
Subjt:  RPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN

Query:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
        ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST
Subjt:  ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKST

Query:  PKFLLYHTGLRPGRLNLTFRKY
        PKFLLYHTGLRPGRLNLTFRKY
Subjt:  PKFLLYHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC111432318 | 3.2e-150 | 62.03
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK
        M  IRT+P    P SN LRRLLF  S         LLQFQR+DSF SS    A PDSSC G+S  CG ++E L +RD+ S+VI +G   V+LN K  E +
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPK

Query:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNT-RRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKN--EVFSQSA
        SL+ LSV KCD  ++ SD+ GI +N P SYH DE  PV RQNT RR+RIDLGS+R LK++  S Q+ER+E             P  F K    ++ S+++
Subjt:  SLTPLSVKKCDYVEVGSDKFGISSNEPKSYHYDECLPVSRQNT-RRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKN--EVFSQSA

Query:  ITDHSLPFEPPFDICFP-GGGNVKHRNFWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRM
        +   +LP    FDICFP   G  K R  W+ KD  T+K            ++RPGMVLLKHYI   EQ+NIVKT QKLGLGPGGFYQPGYKDGAKLRL+M
Subjt:  ITDHSLPFEPPFDICFP-GGGNVKHRNFWRVKDSGTVKDYR---------LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRM

Query:  MCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVS
        MCLGLDWDPQTR+Y  KRV DGNKPPD+PP F+ LV  AL DAHA IKN  + +N+EDILP+MSPDICI NFY+TSGRLGLHQDRDES+ESL  GLPVVS
Subjt:  MCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVS

Query:  FSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        FS+GN+AEFLYGD+RDVDKA K+ LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  FSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

SwissProt top hits | e value | % identity | Alignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog | 7.0e-17 | 34.01
Query:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGR
        P   Y+  Y  G  + + M  LG L W    R  RY ++    G   PD+PP        AL D    + +           P   PD C+ N Y    R
Subjt:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGR

Query:  LGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S S+G+TA F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh1 | 1.6e-16 | 27.92
Query:  PGMVLLKHYITPPEQINIVKTCQ-----------------KLGLGPGGFYQPGYK-DGAKL------------------RLRMMCLGLDWDPQTRRYENK
        PG+++LK+Y++   Q+ ++K+                   +L LG    ++  Y  DG  +                  +LR + LG  +D  T+ Y   
Subjt:  PGMVLLKHYITPPEQINIVKTCQ-----------------KLGLGPGGFYQPGYK-DGAKL------------------RLRMMCLGLDWDPQTRRYENK

Query:  RVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDV
           D +K P  P      V+  +K++  F+  K               +  I NFY+    L  H   DES+E L+  LP++S S+G    +L G +   
Subjt:  RVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDV

Query:  DKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLL
        +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Subjt:  DKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB | 2.3e-12 | 35.2
Query:  NKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSII
        N C  +      P   PD C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  +  D  +++ LE GDV+++GGESR  +HG+    
Subjt:  NKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSII

Query:  PKSTPKFLLYHTGLRPGRLNLTFRK
            P    +H      R NLTFR+
Subjt:  PKSTPKFLLYHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog | 7.0e-17 | 34.01
Query:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGR
        P   Y+  Y  G  + + M  LG L W    R  RY ++    G   PD+PP        AL D    + +           P   PD C+ N Y    R
Subjt:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGR

Query:  LGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S S+G+TA F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB | 6.1e-13 | 37.5
Query:  SMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTG
        S  PD C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  R  D  +++ LE GD++++GGESR  +HG+        P    +H  
Subjt:  SMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTG

Query:  LRPGRLNLTFRK
            R NLTFR+
Subjt:  LRPGRLNLTFRK

Arabidopsis top hits | e value | % identity | Alignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein | 6.1e-08 | 38.55
Query:  PDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSI
        P+  I N++     LG H D  E+  S     P+VS S+G  A FL G K   D    + L SGDV++  GE+R  FHG+  I
Subjt:  PDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein | 3.0e-63 | 52.75
Query:  LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNK
        ++RPGMVLLK+Y++   Q+ IV  C++LGLG GGFYQPG++DG  L L+MMCLG +WD QTRRY   R +DG+ PP IP  FS LV+ A+K++ + +   
Subjt:  LLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNK

Query:  CNISNVEDILPSMSPDICIANFYTTSGRLGLHQ---------------------DRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGD
         N +   D +P + PDIC+ NFYT++G+LGLHQ                     D+ ESK+SL  GLP+VSFS+G++AEFLYGD++DVDKA+ + LESGD
Subjt:  CNISNVEDILPSMSPDICIANFYTTSGRLGLHQ---------------------DRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGD

Query:  VLIFGGESRHVFHGVSSI
        VLIFG  SR+VFHGV SI
Subjt:  VLIFGGESRHVFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein | 6.2e-77 | 58.62
Query:  SGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKD
        SGTV     +RPGMVLLK+Y++  +Q+ IV  C++LGLG GGFYQPGY+D AKL L+MMCLG +WDP+T RY   R  DG+  P IP  F+  V+ A+K+
Subjt:  SGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKD

Query:  AHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFH
        + +   +    +   D +P M PDICI NFY+++GRLGLHQD+DES+ S+  GLPVVSFS+G++AEFLYGD+RD DKAE + LESGDVL+FGG SR VFH
Subjt:  AHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFH

Query:  GVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        GV SI   + PK LL  T LRPGRLNLTFR+Y
Subjt:  GVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein | 1.3e-82 | 53.52
Query:  KNEVFSQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGY
        +N  F     +   +   PPFDIC     +V  RN   +KD          TV+    ++++RPGMVLLK ++TP  Q++IVKTC++LG+ P GFYQPGY
Subjt:  KNEVFSQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGY

Query:  KDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKE
          G+KL L+MMCLG +WDPQT+  +N  +   +K P+IP  F+ LV+ A+++AHA I  +    + E ILP MSPDICI NFY+ +GRLGLHQDRDES+E
Subjt:  KDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKE

Query:  SLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        S++ GLP+VSFS+G++AEFLYG+KRDV++A+ V LESGDVLIFGGESR +FHGV SIIP S P  LL  + LR GRLNLTFR +
Subjt:  SLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein | 1.3e-82 | 53.52
Query:  KNEVFSQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGY
        +N  F     +   +   PPFDIC     +V  RN   +KD          TV+    ++++RPGMVLLK ++TP  Q++IVKTC++LG+ P GFYQPGY
Subjt:  KNEVFSQSAITDHSLPFEPPFDICFPGGGNVKHRNFWRVKD--------SGTVK---DYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGY

Query:  KDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKE
          G+KL L+MMCLG +WDPQT+  +N  +   +K P+IP  F+ LV+ A+++AHA I  +    + E ILP MSPDICI NFY+ +GRLGLHQDRDES+E
Subjt:  KDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKE

Query:  SLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
        S++ GLP+VSFS+G++AEFLYG+KRDV++A+ V LESGDVLIFGGESR +FHGV SIIP S P  LL  + LR GRLNLTFR +
Subjt:  SLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY
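
The % identity values in the hit tables can be related to the alignment blocks with a short calculation. The Python sketch below assumes the usual BLAST-style convention for the middle row of each block (a letter marks an identical pair, '+' a conservative substitution, a space a mismatch or gap) and counts letters in that row, since the Subjt rows as displayed here appear to repeat the query. `percent_identity` is a hypothetical helper name, and the input is the single-block AT1G11780.1 alignment copied from this record.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent identity for one alignment, from its query row and midline.

    ASSUMPTION: BLAST-style midline, where only identical pairs are
    written as letters. The aligned query row (gaps included) gives
    the alignment length.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / len(query_row), 2)


# Query row and midline of the AT1G11780.1 hit, as shown in this record.
query = ("PDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVE"
         "LESGDVLIFGGESRHVFHGVSSI")
midline = ("P+  I N++     LG H D  E+  S     P+VS S+G  A FL G K   D    + "
           "L SGDV++  GE+R  FHG+  I")

print(percent_identity(query, midline))
```

For this block the midline carries 32 letters over an 83-residue alignment, which gives 38.55 and agrees with the value listed for AT1G11780.1. For hits whose alignment is split over several blocks, the rows would need to be concatenated first.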


Sequences
CDS sequence
ATGTTTTTCATCCGTACACTTCCCCTTCCCCCATCGCCCTCCTCCAATCAACTTCGTCGCCTTCTATTCCCCGCTTCTTCATTTCCCGGCGCGCGCGGCTTTAGTTTGCT
TCAATTTCAGCGAATGGATTCGTTTTCCAGTTCCGCGAATAGCCATGCACCACCTGACTCTTCATGTCGTGGTAATTCTTGTGGTTGTGGAAGAGACAAGGAACATTTGC
GTGACAGAGATAATTGTTCAGATGTAATATTTGTGGGAAGCTTTCGTGTGCATCTAAATCCCAAGGAACGTGAACCGAAATCTCTAACTCCGCTTTCTGTTAAGAAATGT
GATTATGTTGAGGTGGGAAGTGATAAGTTTGGGATTTCTTCAAATGAACCGAAATCTTATCATTATGATGAGTGTCTACCTGTTTCTAGGCAAAATACTAGGAGAAACCG
GATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATTTCAAGTAGAGAGGCATGAATTTTTGAATGATTATTGTCAGGAGTATGAATCATCTCTTCCTA
TTCATTTTGGGAAGAAAAATGAAGTTTTTTCTCAAAGCGCCATTACGGACCATTCGCTTCCCTTTGAACCACCATTTGATATTTGTTTCCCTGGAGGAGGTAATGTGAAA
CATAGAAATTTTTGGCGAGTTAAAGACAGTGGCACTGTGAAAGATTATAGACTGCTGAGGCCTGGAATGGTTTTACTGAAGCACTATATCACTCCACCTGAACAGATCAA
TATAGTGAAAACTTGTCAAAAGCTTGGTCTTGGGCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACT
GGGATCCTCAAACAAGAAGATATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCCATTTTCATTTCTAGTTAAAAATGCACTTAAAGATGCA
CATGCCTTCATCAAGAACAAATGCAATATAAGCAATGTAGAAGACATTCTTCCATCAATGTCTCCTGACATATGCATTGCGAACTTTTACACAACGAGTGGAAGATTGGG
TCTGCATCAGGACCGGGATGAAAGCAAAGAGAGTCTTTCCAGCGGACTACCAGTCGTTTCCTTTTCTGTAGGCAATACAGCAGAATTCTTGTATGGAGATAAAAGAGATG
TGGACAAAGCAGAGAAGGTTGAACTGGAATCGGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACACGTATTTCATGGAGTGTCTTCAATCATCCCAAAATCGACACCT
AAGTTTTTGCTTTATCATACTGGTCTGCGTCCTGGTCGTCTAAATCTTACCTTTAGAAAGTATTAA
mRNA sequence
ATGTTTTTCATCCGTACACTTCCCCTTCCCCCATCGCCCTCCTCCAATCAACTTCGTCGCCTTCTATTCCCCGCTTCTTCATTTCCCGGCGCGCGCGGCTTTAGTTTGCT
TCAATTTCAGCGAATGGATTCGTTTTCCAGTTCCGCGAATAGCCATGCACCACCTGACTCTTCATGTCGTGGTAATTCTTGTGGTTGTGGAAGAGACAAGGAACATTTGC
GTGACAGAGATAATTGTTCAGATGTAATATTTGTGGGAAGCTTTCGTGTGCATCTAAATCCCAAGGAACGTGAACCGAAATCTCTAACTCCGCTTTCTGTTAAGAAATGT
GATTATGTTGAGGTGGGAAGTGATAAGTTTGGGATTTCTTCAAATGAACCGAAATCTTATCATTATGATGAGTGTCTACCTGTTTCTAGGCAAAATACTAGGAGAAACCG
GATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATTTCAAGTAGAGAGGCATGAATTTTTGAATGATTATTGTCAGGAGTATGAATCATCTCTTCCTA
TTCATTTTGGGAAGAAAAATGAAGTTTTTTCTCAAAGCGCCATTACGGACCATTCGCTTCCCTTTGAACCACCATTTGATATTTGTTTCCCTGGAGGAGGTAATGTGAAA
CATAGAAATTTTTGGCGAGTTAAAGACAGTGGCACTGTGAAAGATTATAGACTGCTGAGGCCTGGAATGGTTTTACTGAAGCACTATATCACTCCACCTGAACAGATCAA
TATAGTGAAAACTTGTCAAAAGCTTGGTCTTGGGCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACT
GGGATCCTCAAACAAGAAGATATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCCATTTTCATTTCTAGTTAAAAATGCACTTAAAGATGCA
CATGCCTTCATCAAGAACAAATGCAATATAAGCAATGTAGAAGACATTCTTCCATCAATGTCTCCTGACATATGCATTGCGAACTTTTACACAACGAGTGGAAGATTGGG
TCTGCATCAGGACCGGGATGAAAGCAAAGAGAGTCTTTCCAGCGGACTACCAGTCGTTTCCTTTTCTGTAGGCAATACAGCAGAATTCTTGTATGGAGATAAAAGAGATG
TGGACAAAGCAGAGAAGGTTGAACTGGAATCGGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACACGTATTTCATGGAGTGTCTTCAATCATCCCAAAATCGACACCT
AAGTTTTTGCTTTATCATACTGGTCTGCGTCCTGGTCGTCTAAATCTTACCTTTAGAAAGTATTAA
Protein sequence
MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKC
DYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFLNDYCQEYESSLPIHFGKKNEVFSQSAITDHSLPFEPPFDICFPGGGNVK
HRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDA
HAFIKNKCNISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTP
KFLLYHTGLRPGRLNLTFRKY
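
The protein sequence corresponds to the CDS read codon by codon. A minimal Python sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); `translate` and the table constants are hypothetical names, and the two inputs are the first 18 and last 12 bases of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (ASSUMPTION: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTTTTCATCCGTACA"))  # start of the CDS
print(translate("AGAAAGTATTAA"))        # end of the CDS, including the stop
```

The two fragments translate to MFFIRT and RKY, matching the first and last residues of the protein sequence listed above.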