; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G015690 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G015690
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationGy14Chr4:20401143..20403826
RNA-Seq ExpressionCsGy4G015690
SyntenyCsGy4G015690
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050407.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]5.28e-25082.94Show/hide
Query:  SHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKS----------------------------YNYDESLPVHRQNTRRSRIDL
        S A PDSSC G+S GCGRDKEHL DRDN SDVI VGS  VHLNPKEREPKS                            Y+YDE LPV RQNTRR+RIDL
Subjt:  SHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKS----------------------------YNYDESLPVHRQNTRRSRIDL

Query:  GSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLL
        GSKRDLKSNARS+QVER EFLND CQEY+SSLPIHFGKKNEVF SK QSLD G KESVVTD+SLPFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLL
Subjt:  GSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLL

Query:  RPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN
        RPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CN
Subjt:  RPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN

Query:  ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKST
        ISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL +GLPVVSFSVGNTAEFLYGDKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKST
Subjt:  ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKST

Query:  PKFLLHHTGLRPGRLNLTFRKY
        PKFLL+HTGLRPGRLNLTFRKY
Subjt:  PKFLLHHTGLRPGRLNLTFRKY

TYJ97997.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]6.12e-24982.7Show/hide
Query:  SHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKS----------------------------YNYDESLPVHRQNTRRSRIDL
        S A PDSSC G+S GCGRDKEHL DRDN SDVI VGS  VHLNPKEREPKS                            Y+YDE LPV RQNTRR+RIDL
Subjt:  SHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKS----------------------------YNYDESLPVHRQNTRRSRIDL

Query:  GSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLL
        GSKRDLKSNARS+QVER EFLND CQEY+SSLPIHFGKKNEVF SK QSLD G KESVVTD+S PFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLL
Subjt:  GSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLL

Query:  RPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN
        RPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CN
Subjt:  RPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN

Query:  ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKST
        ISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL +GLPVVSFSVGNTAEFLYGDKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKST
Subjt:  ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKST

Query:  PKFLLHHTGLRPGRLNLTFRKY
        PKFLL+HTGLRPGRLNLTFRKY
Subjt:  PKFLLHHTGLRPGRLNLTFRKY

XP_004149927.1 uncharacterized protein LOC101210053 isoform X1 [Cucumis sativus]0.099.32Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSS GCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK

Query:  SYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGG
        SYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGG
Subjt:  SYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGG

Query:  NVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP
        NVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP
Subjt:  NVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP

Query:  QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDV
        QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLW GLPVVSFSVGN AEFLYGDKRNVDKAEMVELESGDV
Subjt:  QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDV

Query:  LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
Subjt:  LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

XP_016903166.1 PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo]9.37e-28482.84Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFP  RGF LLQFQ MDSFS+SANSHA PDSSC G+S GCGRDKEHL DRDN SDVI +GS  VHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK

Query:  S----------------------------YNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL
        S                            Y+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF ND CQEY+SSLPIHFGKKNEVF SK QSL
Subjt:  S----------------------------YNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL

Query:  DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMC
        D G KESVVTD+SLPFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQP YKDGAKLRLRMMC
Subjt:  DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFS
        LGLDWDPQTRRY+NKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CNISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL +GLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFS

Query:  VGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        VGNTAEFLYGDKR+V+KAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

XP_031739557.1 uncharacterized protein LOC101210053 isoform X2 [Cucumis sativus]3.06e-20899.3Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSS GCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK

Query:  SYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGG
        SYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGG
Subjt:  SYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGG

Query:  NVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRY
        NVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRR+
Subjt:  NVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRY

TrEMBL top hitse value%identityAlignment
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein0.099.32Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSS GCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK

Query:  SYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGG
        SYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGG
Subjt:  SYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGG

Query:  NVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP
        NVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP
Subjt:  NVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP

Query:  QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDV
        QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLW GLPVVSFSVGN AEFLYGDKRNVDKAEMVELESGDV
Subjt:  QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDV

Query:  LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
Subjt:  LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC1035021834.53e-28482.84Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK
        MFFIRTLPLPPSPSSNQLRRLLFPASSFP  RGF LLQFQ MDSFS+SANSHA PDSSC G+S GCGRDKEHL DRDN SDVI +GS  VHLNPKEREPK
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPK

Query:  S----------------------------YNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL
        S                            Y+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF ND CQEY+SSLPIHFGKKNEVF SK QSL
Subjt:  S----------------------------YNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL

Query:  DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMC
        D G KESVVTD+SLPFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQP YKDGAKLRLRMMC
Subjt:  DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMC

Query:  LGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFS
        LGLDWDPQTRRY+NKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CNISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL +GLPVVSFS
Subjt:  LGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFS

Query:  VGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        VGNTAEFLYGDKR+V+KAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  VGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

A0A5A7U7Q2 2-oxoglutarate-dependent dioxygenase family protein isoform 12.56e-25082.94Show/hide
Query:  SHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKS----------------------------YNYDESLPVHRQNTRRSRIDL
        S A PDSSC G+S GCGRDKEHL DRDN SDVI VGS  VHLNPKEREPKS                            Y+YDE LPV RQNTRR+RIDL
Subjt:  SHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKS----------------------------YNYDESLPVHRQNTRRSRIDL

Query:  GSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLL
        GSKRDLKSNARS+QVER EFLND CQEY+SSLPIHFGKKNEVF SK QSLD G KESVVTD+SLPFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLL
Subjt:  GSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLL

Query:  RPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN
        RPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CN
Subjt:  RPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN

Query:  ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKST
        ISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL +GLPVVSFSVGNTAEFLYGDKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKST
Subjt:  ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKST

Query:  PKFLLHHTGLRPGRLNLTFRKY
        PKFLL+HTGLRPGRLNLTFRKY
Subjt:  PKFLLHHTGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 12.96e-24982.7Show/hide
Query:  SHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKS----------------------------YNYDESLPVHRQNTRRSRIDL
        S A PDSSC G+S GCGRDKEHL DRDN SDVI VGS  VHLNPKEREPKS                            Y+YDE LPV RQNTRR+RIDL
Subjt:  SHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKS----------------------------YNYDESLPVHRQNTRRSRIDL

Query:  GSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLL
        GSKRDLKSNARS+QVER EFLND CQEY+SSLPIHFGKKNEVF SK QSLD G KESVVTD+S PFEPPFDIC PGGGNVKHRN + VK+ GTVKDYRLL
Subjt:  GSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLL

Query:  RPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN
        RPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CN
Subjt:  RPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN

Query:  ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKST
        ISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL +GLPVVSFSVGNTAEFLYGDKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKST
Subjt:  ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKST

Query:  PKFLLHHTGLRPGRLNLTFRKY
        PKFLL+HTGLRPGRLNLTFRKY
Subjt:  PKFLLHHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC1114323188.34e-17058.8Show/hide
Query:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKERE--
        M  IRT+P    P SN LRRLLF  S        RLLQFQ +DSF +SA    LPDSSC GSS  CG ++E LH+RD++S+VI +G IPV+LN K  E  
Subjt:  MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKERE--

Query:  --------------------------PKSYNYDESLPVHRQNT-RRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQS
                                  P SY+ DE  PV RQNT RRSRIDLGS+R LK++  S Q+ER E             P  F K         +S
Subjt:  --------------------------PKSYNYDESLPVHRQNT-RRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQS

Query:  LDTGPKESVVTDNSLPFEPPFDICLPGG-GNVKHRNIYVVKEGGTVKDYR---------LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYK
         D G K S+ T N  P E  FDIC P   G  K R  +  K+  T+K            ++RPGMVLLKHYI   EQ+NIVKT Q LG+GPGGFYQPGYK
Subjt:  LDTGPKESVVTDNSLPFEPPFDICLPGG-GNVKHRNIYVVKEGGTVKDYR---------LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYK

Query:  DGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKES
        DGAKLRL+MMCLGLDWDPQTR+Y  KRV DGNKPPD+PP+F  LV +AL DAHA IKNN + +N+E+ILP+MSPDICI NFY+T GRLGLHQDRDES+ES
Subjt:  DGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKES

Query:  LWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        L  GLPVVSFS+GN+AEFLYGD+R+VDKA  + LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  LWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog1.4e-1734.01Show/hide
Query:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGR
        P   Y+  Y  G  + + M  LG L W    R  RY ++    G   PD+PP        AL D    + +           P   PD C+ N Y    R
Subjt:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGR

Query:  LGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S S+G+TA F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh12.6e-1627.08Show/hide
Query:  PGMVLLKHYITPREQINIVKTCQ-----------------NLGIGPGGFYQPGYK-DGAKL------------------RLRMMCLGLDWDPQTRRYENK
        PG+++LK+Y++   Q+ ++K+                    L +G    ++  Y  DG  +                  +LR + LG  +D  T+ Y   
Subjt:  PGMVLLKHYITPREQINIVKTCQ-----------------NLGIGPGGFYQPGYK-DGAKL------------------RLRMMCLGLDWDPQTRRYENK

Query:  RVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNV
           D +K P  P      V++ +K++  F+                  +  I NFY+    L  H   DES+E L   LP++S S+G    +L G +   
Subjt:  RVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNV

Query:  DKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Subjt:  DKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB2.6e-1332.93Show/hide
Query:  DPQTRRYENKRVVDGNKP-PDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNT
        DPQT           NKP P +P  F  L +RA   A                 P   PD C+ N Y    +L LHQD+DE         P+VS S+G  
Subjt:  DPQTRRYENKRVVDGNKP-PDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNT

Query:  AEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK
        A F +G  +  D  + + LE GDV+++GGESR  +HG+  +     P  +         R NLTFR+
Subjt:  AEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog1.4e-1734.01Show/hide
Query:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGR
        P   Y+  Y  G  + + M  LG L W    R  RY ++    G   PD+PP        AL D    + +           P   PD C+ N Y    R
Subjt:  PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGR

Query:  LGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S S+G+TA F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB5.9e-1337.5Show/hide
Query:  SMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTG
        S  PD C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  R  D  + + LE GD++++GGESR  +HG+        P     H  
Subjt:  SMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTG

Query:  LRPGRLNLTFRK
            R NLTFR+
Subjt:  LRPGRLNLTFRK

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein2.0e-0838.55Show/hide
Query:  PDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSI
        P+  I N++     LG H D     E+ W+  P+VS S+G  A FL G K   D    + L SGDV++  GE+R  FHG+  I
Subjt:  PDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein3.7e-6350.92Show/hide
Query:  LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNN
        ++RPGMVLLK+Y++   Q+ IV  C+ LG+G GGFYQPG++DG  L L+MMCLG +WD QTRRY   R +DG+ PP IP +F+ LV++A+K++ + +  N
Subjt:  LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNN

Query:  CNISNVEEILPSMSPDICIANFYTTRGRLGLHQ---------------------DRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGD
         N +   + +P + PDIC+ NFYT+ G+LGLHQ                     D+ ESK+SL  GLP+VSFS+G++AEFLYGD+++VDKA+ + LESGD
Subjt:  CNISNVEEILPSMSPDICIANFYTTRGRLGLHQ---------------------DRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGD

Query:  VLIFGGESRHIFHGVSSI
        VLIFG  SR++FHGV SI
Subjt:  VLIFGGESRHIFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein1.5e-8047.27Show/hide
Query:  LGSKRDLKSNARSY--QVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEP-PFDICLPGGGNVKHRNIYVV--KEGGTV
        + S+ + K  A+ Y   V R+  +  SCQE  SS  +      +V +S ++   + PK     ++S       FDI L   G V   N+ V+  ++    
Subjt:  LGSKRDLKSNARSY--QVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEP-PFDICLPGGGNVKHRNIYVV--KEGGTV

Query:  KDY--RLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAH
        K Y   ++RPGMVLLK+Y++  +Q+ IV  C+ LG+G GGFYQPGY+D AKL L+MMCLG +WDP+T RY   R  DG+  P IP +F   V++A+K++ 
Subjt:  KDY--RLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAH

Query:  AFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGV
        +   +N   +   + +P M PDICI NFY++ GRLGLHQD+DES+ S+  GLPVVSFS+G++AEFLYGD+R+ DKAE + LESGDVL+FGG SR +FHGV
Subjt:  AFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGV

Query:  SSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
         SI   + PK LL  T LRPGRLNLTFR+Y
Subjt:  SSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein1.0e-8156.02Show/hide
Query:  PPFDICLPGGGNVKHRNIYVVKE--------GGTVK---DYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWD
        PPFDIC     +V  RN   +K+          TV+    ++++RPGMVLLK ++TP  Q++IVKTC+ LG+ P GFYQPGY  G+KL L+MMCLG +WD
Subjt:  PPFDICLPGGGNVKHRNIYVVKE--------GGTVK---DYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWD

Query:  PQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAE
        PQT+  +N  +   +K P+IP  F  LV++A+++AHA I       + E ILP MSPDICI NFY+  GRLGLHQDRDES+ES+  GLP+VSFS+G++AE
Subjt:  PQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAE

Query:  FLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        FLYG+KR+V++A+ V LESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GRLNLTFR +
Subjt:  FLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein1.0e-8156.02Show/hide
Query:  PPFDICLPGGGNVKHRNIYVVKE--------GGTVK---DYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWD
        PPFDIC     +V  RN   +K+          TV+    ++++RPGMVLLK ++TP  Q++IVKTC+ LG+ P GFYQPGY  G+KL L+MMCLG +WD
Subjt:  PPFDICLPGGGNVKHRNIYVVKE--------GGTVK---DYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWD

Query:  PQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAE
        PQT+  +N  +   +K P+IP  F  LV++A+++AHA I       + E ILP MSPDICI NFY+  GRLGLHQDRDES+ES+  GLP+VSFS+G++AE
Subjt:  PQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAE

Query:  FLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        FLYG+KR+V++A+ V LESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GRLNLTFR +
Subjt:  FLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTCATCCGTACACTTCCCCTTCCCCCATCGCCCTCCTCCAATCAACTTCGTCGCCTTCTATTCCCTGCTTCTTCATTTCCCTGCCTGCGCGGCTTTCGTTTGCT
TCAATTTCAACCAATGGATTCGTTTTCCACTTCAGCAAATAGCCATGCACTACCTGACTCTTCATGTTGTGGTAGTTCTTATGGTTGTGGGAGAGACAAGGAACATTTGC
ATGACAGAGATAATAGTTCAGATGTCATACATGTGGGAAGCATTCCTGTGCATCTAAATCCCAAGGAACGTGAACCCAAATCTTATAATTATGATGAGTCTCTACCTGTT
CATAGACAAAATACTAGAAGAAGCCGGATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATATCAAGTAGAGAGGCTTGAATTTTTGAACGATTCTTG
TCAGGAGTATAAATCATCTCTTCCTATTCATTTTGGGAAGAAAAATGAAGTGTTTGTCTCAAAGCTCCAGTCCCTTGATACCGGTCCCAAAGAATCTGTAGTTACGGACA
ATTCACTTCCCTTTGAACCACCATTTGATATTTGTTTACCTGGAGGAGGTAATGTGAAACATAGAAATATTTATGTTGTTAAAGAGGGTGGCACTGTGAAAGATTATAGA
CTGTTGAGGCCTGGAATGGTTTTACTGAAGCACTACATCACTCCACGTGAACAGATCAATATAGTGAAAACTTGTCAAAATCTTGGTATTGGCCCAGGGGGATTTTACCA
GCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGGTATGAAAACAAACGGGTTGTGGATGGTAATA
AACCACCAGATATACCTCCTCAATTTACATTTCTTGTTAAACGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAACTGCAATATAAGTAATGTAGAAGAAATTCTT
CCGTCAATGTCTCCAGACATATGCATTGCGAACTTCTACACAACGAGGGGAAGATTGGGTCTGCATCAGGACCGTGATGAAAGCAAAGAGAGTCTTTGGACGGGACTACC
GGTTGTTTCCTTTTCTGTAGGCAATACAGCAGAATTCTTGTATGGAGATAAAAGAAATGTGGATAAAGCAGAGATGGTTGAACTGGAATCAGGTGATGTTCTAATTTTTG
GTGGCGAATCTAGACATATATTCCATGGAGTATCTTCAATCATACCGAAATCGACACCTAAGTTTTTGCTTCATCATACTGGTCTGCGTCCCGGCCGTCTTAATCTTACC
TTTAGAAAGTATTAA
mRNA sequenceShow/hide mRNA sequence
TGACCATCAATTTAGCCTCTTTATTTCAGCACAAGAAATGATCACACACACAGCTTTGAAAATGGCCGAAATCCTGACTCCTTCCTCTTCCTCTTCCTCTTCCCGGTGCA
GTCTCCTCGGCCAACGCGGCGGCCACGACCGAAACTTGCGGATTAATTCTACAATATCCTCATTATGTCATTCCCATCCTCCTCCGACCCTTCCATTTCCTTCTTCTTTT
CCGATTTTATCATCATTCATCCTCAATTTCTGAACCCTATAAAGTATCTCTCTAATATCCAATTTTCCAAATGTTTTTCATCCGTACACTTCCCCTTCCCCCATCGCCCT
CCTCCAATCAACTTCGTCGCCTTCTATTCCCTGCTTCTTCATTTCCCTGCCTGCGCGGCTTTCGTTTGCTTCAATTTCAACCAATGGATTCGTTTTCCACTTCAGCAAAT
AGCCATGCACTACCTGACTCTTCATGTTGTGGTAGTTCTTATGGTTGTGGGAGAGACAAGGAACATTTGCATGACAGAGATAATAGTTCAGATGTCATACATGTGGGAAG
CATTCCTGTGCATCTAAATCCCAAGGAACGTGAACCCAAATCTTATAATTATGATGAGTCTCTACCTGTTCATAGACAAAATACTAGAAGAAGCCGGATAGATTTAGGGT
CCAAAAGAGATTTGAAGAGTAATGCAAGATCATATCAAGTAGAGAGGCTTGAATTTTTGAACGATTCTTGTCAGGAGTATAAATCATCTCTTCCTATTCATTTTGGGAAG
AAAAATGAAGTGTTTGTCTCAAAGCTCCAGTCCCTTGATACCGGTCCCAAAGAATCTGTAGTTACGGACAATTCACTTCCCTTTGAACCACCATTTGATATTTGTTTACC
TGGAGGAGGTAATGTGAAACATAGAAATATTTATGTTGTTAAAGAGGGTGGCACTGTGAAAGATTATAGACTGTTGAGGCCTGGAATGGTTTTACTGAAGCACTACATCA
CTCCACGTGAACAGATCAATATAGTGAAAACTTGTCAAAATCTTGGTATTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATG
ATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGGTATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCAATTTACATTTCTTGTTAA
ACGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAACTGCAATATAAGTAATGTAGAAGAAATTCTTCCGTCAATGTCTCCAGACATATGCATTGCGAACTTCTACA
CAACGAGGGGAAGATTGGGTCTGCATCAGGACCGTGATGAAAGCAAAGAGAGTCTTTGGACGGGACTACCGGTTGTTTCCTTTTCTGTAGGCAATACAGCAGAATTCTTG
TATGGAGATAAAAGAAATGTGGATAAAGCAGAGATGGTTGAACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACATATATTCCATGGAGTATCTTCAAT
CATACCGAAATCGACACCTAAGTTTTTGCTTCATCATACTGGTCTGCGTCCCGGCCGTCTTAATCTTACCTTTAGAAAGTATTAAAACACTACCTCCATGTTTATGCTAT
ACATCTGAATCGGTGTTATTCATTTGATGTTCATTTATGGAATCGTGTAAATCTATAATGTAAGTATTGTCTGTTTCTGTTTCATTTACTTTGGATGTTACTTTGCTCAG
TACTTTTCAGTTGCCGTACGTGAATGATACAAAGTTTTAATTT
Protein sequenceShow/hide protein sequence
MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPV
HRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYR
LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEIL
PSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLT
FRKY