; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G006530 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G006530
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationCG_Chr04:21264913..21267797
RNA-Seq ExpressionClCG04G006530
SyntenyClCG04G006530
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050407.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]7.3e-18477.55Show/hide
Query:  SHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDL
        S A PDSSC G+S  CG DKEH  +RD+ SDVIFVGS  V+LNPKERE KSL+P SV KCD  E+G +K GI +NEPKSYHYDE LPVSRQ+T+R+RIDL
Subjt:  SHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDL

Query:  GFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDC
        G KR LKSN RSFQ+ER E LND     ESSLP HFGKK E F FSK QS+DIGSKESVV D SLP E PF+ICFPG GNVK    W+VK   TVK+   
Subjt:  GFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDC

Query:  FVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKD
              YR+LRPGMVLLKHY+T  EQINIVKTCQ LGLG GGFYQPGYKDGAKLRL MMCLGLDWDPQTR+YENKR  DGNKPP+IPP F+FLVK ALKD
Subjt:  FVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKD

Query:  AHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFH
        AHAFIKNK N SNVEDILPSMSPDICI NFYTTSGRLGLHQDRDESKESL SGLPVVSFSVGN AEFLYGDKRDVDKAEKV LESGDVLIFGG+SRH+FH
Subjt:  AHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFH

Query:  GVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
        GVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  GVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia]3.2e-17166.6Show/hide
Query:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSL
        ML +RT+P S  PWSN  RRL F  S        RLL+FQRMDSF S+    ALPDSSC GSSCGG++E  HNRDHNS+VI +G +PV LN K  E +SL
Subjt:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSL

Query:  SPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTK-RSRIDLGFKRGLKSNVRSFQMERFECLNDESSLPNHFGKKIEAFYFSKHQSVDIGSK
        S  SV KCDDF+L  ++KGIPAN P SYH DEF PV RQ+TK RSRIDLG +R LK++  S QMER                  E F F KH+S DIGSK
Subjt:  SPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTK-RSRIDLGFKRGLKSNVRSFQMERFECLNDESSLPNHFGKKIEAFYFSKHQSVDIGSK

Query:  ESVVADKSLPIEPFNICFP-GRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRL
         S+      PIE F+ICFP  RG  K   SWQ K RDT+K  +   E TN  ++RPGMVLLKHY+   EQ+NIVKT Q LGLG GGFYQPGYKDGAKLRL
Subjt:  ESVVADKSLPIEPFNICFP-GRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRL

Query:  HMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPV
         MMCLGLDWDPQTRKY  KR  DGNKPP++PP+FA LV  AL DAHA IKN  +T+N+EDILP+MSPDICIVNFY+TSGRLGLHQDRDES+ESLV GLPV
Subjt:  HMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPV

Query:  VSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
        VSFS+GN+AEFLYGD+RDVDKA K++LESGDVLIFGG+SRHIFHGVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  VSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

TYJ97997.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]4.7e-18377.31Show/hide
Query:  SHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDL
        S A PDSSC G+S  CG DKEH  +RD+ SDVIFVGS  V+LNPKERE KSL+P SV KCD  E+G +K GI +NEPKSYHYDE LPVSRQ+T+R+RIDL
Subjt:  SHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDL

Query:  GFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDC
        G KR LKSN RSFQ+ER E LND     ESSLP HFGKK E F FSK QS+DIGSKESVV D S P E PF+ICFPG GNVK    W+VK   TVK+   
Subjt:  GFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDC

Query:  FVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKD
              YR+LRPGMVLLKHY+T  EQINIVKTCQ LGLG GGFYQPGYKDGAKLRL MMCLGLDWDPQTR+YENKR  DGNKPP+IPP F+FLVK ALKD
Subjt:  FVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKD

Query:  AHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFH
        AHAFIKNK N SNVEDILPSMSPDICI NFYTTSGRLGLHQDRDESKESL SGLPVVSFSVGN AEFLYGDKRDVDKAEKV LESGDVLIFGG+SRH+FH
Subjt:  AHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFH

Query:  GVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
        GVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  GVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

XP_004149927.1 uncharacterized protein LOC101210053 isoform X1 [Cucumis sativus]3.0e-18571.78Show/hide
Query:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK
        M F+RTLPL  SP SNQ RRL F AS FP  RGFRLL+FQ MDSF+++ANSHALPDSSC GSS  CG DKEH H+RD++SDVI VGS+PV+LNPKER   
Subjt:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK

Query:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDLGFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQS
                                 EPKSY+YDE LPV RQ+T+RSRIDLG KR LKSN RS+Q+ER E LND     +SSLP HFGKK E F  SK QS
Subjt:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDLGFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQS

Query:  VDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKD
        +D G KESVV D SLP E PF+IC PG GNVK    + VK   TVK+         YR+LRPGMVLLKHY+T REQINIVKTCQ LG+G GGFYQPGYKD
Subjt:  VDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKD

Query:  GAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESL
        GAKLRL MMCLGLDWDPQTR+YENKR  DGNKPP+IPPQF FLVK ALKDAHAFIKN  N SNVE+ILPSMSPDICI NFYTT GRLGLHQDRDESKESL
Subjt:  GAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESL

Query:  VSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
          GLPVVSFSVGNAAEFLYGDKR+VDKAE V LESGDVLIFGG+SRHIFHGVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  VSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

XP_016903166.1 PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo]1.4e-20376.14Show/hide
Query:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK
        M F+RTLPL  SP SNQ RRL F AS FPG RGF LL+FQRMDSF+S+ANSHA PDSSC G+S  CG DKEH  +RD+ SDVIF+GS  V+LNPKERE K
Subjt:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK

Query:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDLGFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQS
        SL+P S  KCD  E+G +K GI +NEPKSYHYDEFLPVSRQ+T+R+RIDLG KR LKSN RSFQ+ER E  ND     ESSLP HFGKK E F FSK QS
Subjt:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDLGFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQS

Query:  VDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKD
        +DIGSKESVV D SLP E PF+ICFPG GNVK    W+VK   TVK+         YR+LRPGMVLLKHY+T  EQINIVKTCQ LGLG GGFYQP YKD
Subjt:  VDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKD

Query:  GAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESL
        GAKLRL MMCLGLDWDPQTR+Y+NKR  DGNKPP+IPP F+FLVK+ALKDAHAFIKNK N SNVEDILPSMSPDICI NFYTTSGRLGLHQDRDESKESL
Subjt:  GAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESL

Query:  VSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
         SGLPVVSFSVGN AEFLYGDKRDV+KAEKV LESGDVLIFGG+SRH+FHGVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  VSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein1.4e-18571.78Show/hide
Query:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK
        M F+RTLPL  SP SNQ RRL F AS FP  RGFRLL+FQ MDSF+++ANSHALPDSSC GSS  CG DKEH H+RD++SDVI VGS+PV+LNPKER   
Subjt:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK

Query:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDLGFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQS
                                 EPKSY+YDE LPV RQ+T+RSRIDLG KR LKSN RS+Q+ER E LND     +SSLP HFGKK E F  SK QS
Subjt:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDLGFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQS

Query:  VDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKD
        +D G KESVV D SLP E PF+IC PG GNVK    + VK   TVK+         YR+LRPGMVLLKHY+T REQINIVKTCQ LG+G GGFYQPGYKD
Subjt:  VDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKD

Query:  GAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESL
        GAKLRL MMCLGLDWDPQTR+YENKR  DGNKPP+IPPQF FLVK ALKDAHAFIKN  N SNVE+ILPSMSPDICI NFYTT GRLGLHQDRDESKESL
Subjt:  GAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESL

Query:  VSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
          GLPVVSFSVGNAAEFLYGDKR+VDKAE V LESGDVLIFGG+SRHIFHGVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  VSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC1035021836.8e-20476.14Show/hide
Query:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK
        M F+RTLPL  SP SNQ RRL F AS FPG RGF LL+FQRMDSF+S+ANSHA PDSSC G+S  CG DKEH  +RD+ SDVIF+GS  V+LNPKERE K
Subjt:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK

Query:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDLGFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQS
        SL+P S  KCD  E+G +K GI +NEPKSYHYDEFLPVSRQ+T+R+RIDLG KR LKSN RSFQ+ER E  ND     ESSLP HFGKK E F FSK QS
Subjt:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDLGFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQS

Query:  VDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKD
        +DIGSKESVV D SLP E PF+ICFPG GNVK    W+VK   TVK+         YR+LRPGMVLLKHY+T  EQINIVKTCQ LGLG GGFYQP YKD
Subjt:  VDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKD

Query:  GAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESL
        GAKLRL MMCLGLDWDPQTR+Y+NKR  DGNKPP+IPP F+FLVK+ALKDAHAFIKNK N SNVEDILPSMSPDICI NFYTTSGRLGLHQDRDESKESL
Subjt:  GAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESL

Query:  VSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
         SGLPVVSFSVGN AEFLYGDKRDV+KAEKV LESGDVLIFGG+SRH+FHGVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  VSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

A0A5A7U7Q2 2-oxoglutarate-dependent dioxygenase family protein isoform 13.5e-18477.55Show/hide
Query:  SHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDL
        S A PDSSC G+S  CG DKEH  +RD+ SDVIFVGS  V+LNPKERE KSL+P SV KCD  E+G +K GI +NEPKSYHYDE LPVSRQ+T+R+RIDL
Subjt:  SHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDL

Query:  GFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDC
        G KR LKSN RSFQ+ER E LND     ESSLP HFGKK E F FSK QS+DIGSKESVV D SLP E PF+ICFPG GNVK    W+VK   TVK+   
Subjt:  GFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDC

Query:  FVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKD
              YR+LRPGMVLLKHY+T  EQINIVKTCQ LGLG GGFYQPGYKDGAKLRL MMCLGLDWDPQTR+YENKR  DGNKPP+IPP F+FLVK ALKD
Subjt:  FVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKD

Query:  AHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFH
        AHAFIKNK N SNVEDILPSMSPDICI NFYTTSGRLGLHQDRDESKESL SGLPVVSFSVGN AEFLYGDKRDVDKAEKV LESGDVLIFGG+SRH+FH
Subjt:  AHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFH

Query:  GVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
        GVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  GVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 12.3e-18377.31Show/hide
Query:  SHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDL
        S A PDSSC G+S  CG DKEH  +RD+ SDVIFVGS  V+LNPKERE KSL+P SV KCD  E+G +K GI +NEPKSYHYDE LPVSRQ+T+R+RIDL
Subjt:  SHALPDSSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDL

Query:  GFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDC
        G KR LKSN RSFQ+ER E LND     ESSLP HFGKK E F FSK QS+DIGSKESVV D S P E PF+ICFPG GNVK    W+VK   TVK+   
Subjt:  GFKRGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIE-PFNICFPGRGNVKSGASWQVKGRDTVKENDC

Query:  FVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKD
              YR+LRPGMVLLKHY+T  EQINIVKTCQ LGLG GGFYQPGYKDGAKLRL MMCLGLDWDPQTR+YENKR  DGNKPP+IPP F+FLVK ALKD
Subjt:  FVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKD

Query:  AHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFH
        AHAFIKNK N SNVEDILPSMSPDICI NFYTTSGRLGLHQDRDESKESL SGLPVVSFSVGN AEFLYGDKRDVDKAEKV LESGDVLIFGG+SRH+FH
Subjt:  AHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFH

Query:  GVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
        GVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  GVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC1114323184.5e-17166.39Show/hide
Query:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSL
        ML +RT+P S  PWSN  RRL F  S        RLL+FQR+DSF S+    ALPDSSC GSSCGG++E  HNRDHNS+VI +G +PV LN K  E +SL
Subjt:  MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSL

Query:  SPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTK-RSRIDLGFKRGLKSNVRSFQMERFECLNDESSLPNHFGKKIEAFYFSKHQSVDIGSK
        S  SV KCDDF+L  ++KGIPAN P SYH DEF PV RQ+TK RSRIDLG +R LK++  S QMER                  E F F KH+S DIGSK
Subjt:  SPPSVNKCDDFELGRNKKGIPANEPKSYHYDEFLPVSRQHTK-RSRIDLGFKRGLKSNVRSFQMERFECLNDESSLPNHFGKKIEAFYFSKHQSVDIGSK

Query:  ESVVADKSLPIEPFNICFP-GRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRL
         S+      PIE F+ICFP  RG  K   SWQ K RDT+K  +   E TN  ++RPGMVLLKHY+   EQ+NIVKT Q LGLG GGFYQPGYKDGAKLRL
Subjt:  ESVVADKSLPIEPFNICFP-GRGNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRL

Query:  HMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPV
         MMCLGLDWDPQTRKY  KR  DGNKPP++PP+FA LV  AL DAHA IKN  +T+N+EDILP+MSPDICIVNFY+TSGRLGLHQDRDES+ESLV GLPV
Subjt:  HMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPV

Query:  VSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY
        VSFS+GN+AEFLYGD+RDVDKA K++LESGDVLIFGG+SRHIFHGVSSIIPKS+PKFLL H+GLRPGRLNLTFRKY
Subjt:  VSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog2.7e-1633.68Show/hide
Query:  YQPGYKDGAKLRLHMMCLG-LDWDPQTR--KYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLH
        Y+  Y  G  + + M  LG L W    R  +Y ++    G   P++PP        AL D    + +           P   PD C+VN Y    R+GLH
Subjt:  YQPGYKDGAKLRLHMMCLG-LDWDPQTR--KYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLH

Query:  QDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRP--GRLNLTFRK
        QDRDE+        PV+S S+G+ A F  G     D    + L SGDV    G +R  FHGV  I+P        G S L P  GR+NLT R+
Subjt:  QDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh11.8e-1527.5Show/hide
Query:  PGMVLLKHYMTQREQINIVKTCQ-----------------MLGLGQGGFYQPGYK-DGAKL------------------RLHMMCLGLDWDPQTRKYENK
        PG+++LK+Y++   Q+ ++K+                    L LG    ++  Y  DG  +                  +L  + LG  +D  T++Y   
Subjt:  PGMVLLKHYMTQREQINIVKTCQ-----------------MLGLGQGGFYQPGYK-DGAKL------------------RLHMMCLGLDWDPQTRKYENK

Query:  RATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDV
           D +K P  P      V+  +K++  F+  K               +  IVNFY+    L  H   DES+E L   LP++S S+G    +L G +   
Subjt:  RATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDV

Query:  DKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLL
        +K   + L SGDV+I  G SR  FH V  IIP S+P +LL
Subjt:  DKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB3.7e-1335.4Show/hide
Query:  PSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHS
        P   PD C++N Y    +L LHQD+DE         P+VS S+G  A F +G  +  D  ++++LE GDV+++GG+SR  +HG+  +     P  +    
Subjt:  PSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHS

Query:  GLRPGRLNLTFRK
             R NLTFR+
Subjt:  GLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog2.7e-1633.68Show/hide
Query:  YQPGYKDGAKLRLHMMCLG-LDWDPQTR--KYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLH
        Y+  Y  G  + + M  LG L W    R  +Y ++    G   P++PP        AL D    + +           P   PD C+VN Y    R+GLH
Subjt:  YQPGYKDGAKLRLHMMCLG-LDWDPQTR--KYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLH

Query:  QDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRP--GRLNLTFRK
        QDRDE+        PV+S S+G+ A F  G     D    + L SGDV    G +R  FHGV  I+P        G S L P  GR+NLT R+
Subjt:  QDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB1.7e-1336.61Show/hide
Query:  SMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSG
        S  PD C++N Y    +L LHQD+DE         P+VS S+G  A F +G  R  D  ++++LE GD++++GG+SR  +HG+  +     P  + G   
Subjt:  SMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSG

Query:  LRPGRLNLTFRK
            R NLTFR+
Subjt:  LRPGRLNLTFRK

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.5e-0931.65Show/hide
Query:  KLRLHMMCLGLDWDPQTRKYENKRATDGNKP-PNIPPQFAFLVKA----ALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESK
        KLR   + L  DW        +KR  D + P  NIP     L K     A+ D   F                  P+  IVN++     LG H D  E+ 
Subjt:  KLRLHMMCLGLDWDPQTRKYENKRATDGNKP-PNIPPQFAFLVKA----ALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESK

Query:  ESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSI
         S     P+VS S+G  A FL G K   D    + L SGDV++  G++R  FHG+  I
Subjt:  ESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein1.8e-6351.83Show/hide
Query:  MLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNK
        ++RPGMVLLK+Y++   Q+ IV  C+ LGLG+GGFYQPG++DG  L L MMCLG +WD QTR+Y   R  DG+ PP IP +F+ LV+ A+K++ + +   
Subjt:  MLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNK

Query:  FNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQ---------------------DRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGD
         N +   D +P + PDIC+VNFYT++G+LGLHQ                     D+ ESK+SL  GLP+VSFS+G++AEFLYGD++DVDKA+ ++LESGD
Subjt:  FNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQ---------------------DRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGD

Query:  VLIFGGDSRHIFHGVSSI
        VLIFG  SR++FHGV SI
Subjt:  VLIFGGDSRHIFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein5.4e-7658.04Show/hide
Query:  MLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNK
        ++RPGMVLLK+Y++  +Q+ IV  C+ LGLG+GGFYQPGY+D AKL L MMCLG +WDP+T +Y   R  DG+  P IP +F   V+ A+K++ +   + 
Subjt:  MLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNK

Query:  FNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPK
           +   D +P M PDICIVNFY+++GRLGLHQD+DES+ S+  GLPVVSFS+G++AEFLYGD+RD DKAE + LESGDVL+FGG SR +FHGV SI   
Subjt:  FNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPK

Query:  SSPKFLLGHSGLRPGRLNLTFRKY
        ++PK LL  + LRPGRLNLTFR+Y
Subjt:  SSPKFLLGHSGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein2.1e-8050.16Show/hide
Query:  MERFECLNDESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIEPFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTN-YRMLRPGMVLLKH
        M  F+  N  SS  +   + ++      H++    S++        P  PF+IC        +     +   +T +E    VE +N ++++RPGMVLLK 
Subjt:  MERFECLNDESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIEPFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTN-YRMLRPGMVLLKH

Query:  YMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILP
        ++T   Q++IVKTC+ LG+   GFYQPGY  G+KL L MMCLG +WDPQT KY      D +K P IP  F  LV+ A+++AHA I  +  T + E ILP
Subjt:  YMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILP

Query:  SMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSG
         MSPDICIVNFY+ +GRLGLHQDRDES+ES+  GLP+VSFS+G++AEFLYG+KRDV++A+ V+LESGDVLIFGG+SR IFHGV SIIP S+P  LL  S 
Subjt:  SMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSG

Query:  LRPGRLNLTFRKY
        LR GRLNLTFR +
Subjt:  LRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein2.1e-8050.16Show/hide
Query:  MERFECLNDESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIEPFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTN-YRMLRPGMVLLKH
        M  F+  N  SS  +   + ++      H++    S++        P  PF+IC        +     +   +T +E    VE +N ++++RPGMVLLK 
Subjt:  MERFECLNDESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIEPFNICFPGRGNVKSGASWQVKGRDTVKENDCFVETTN-YRMLRPGMVLLKH

Query:  YMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILP
        ++T   Q++IVKTC+ LG+   GFYQPGY  G+KL L MMCLG +WDPQT KY      D +K P IP  F  LV+ A+++AHA I  +  T + E ILP
Subjt:  YMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPPQFAFLVKAALKDAHAFIKNKFNTSNVEDILP

Query:  SMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSG
         MSPDICIVNFY+ +GRLGLHQDRDES+ES+  GLP+VSFS+G++AEFLYG+KRDV++A+ V+LESGDVLIFGG+SR IFHGV SIIP S+P  LL  S 
Subjt:  SMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSSPKFLLGHSG

Query:  LRPGRLNLTFRKY
        LR GRLNLTFR +
Subjt:  LRPGRLNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTTCGTCCGTACACTTCCCCTTTCGTCATCGCCCTGGTCCAATCAACATCGTCGTCTTTTCTTCCTCGCTTCTCGATTTCCGGGCGAACGCGGTTTTCGATTGCT
CCGATTTCAACGAATGGATTCGTTTGCCAGTACGGCAAATAGCCATGCACTACCTGATTCTTCATGTTCTGGTAGTTCTTGTGGCGGAGACAAGGAACATTCGCATAATA
GAGATCATAATTCAGATGTGATATTTGTGGGAAGCCTTCCTGTGTATCTAAATCCCAAGGAACGTGAACTGAAATCTTTATCTCCACCGTCTGTTAATAAATGTGATGAT
TTTGAGTTGGGAAGAAATAAGAAGGGGATTCCTGCAAATGAACCGAAATCTTACCATTATGATGAGTTTCTACCTGTTTCTAGACAACATACTAAAAGAAGCCGGATAGA
TTTAGGGTTCAAAAGAGGTCTGAAGAGTAATGTAAGATCATTTCAAATGGAGAGGTTTGAATGTTTGAACGATGAATCATCTCTGCCTAATCATTTTGGGAAGAAAATTG
AAGCCTTCTATTTCTCGAAGCACCAGTCTGTTGATATTGGTTCCAAAGAATCTGTAGTTGCGGACAAATCGCTTCCCATTGAACCATTTAATATTTGTTTCCCCGGAAGA
GGTAATGTGAAGTCTGGAGCTTCTTGGCAAGTTAAAGGCAGGGACACTGTGAAAGAAAATGATTGTTTTGTTGAAACTACAAATTATAGAATGCTGAGGCCTGGAATGGT
TTTACTGAAGCACTACATGACTCAACGTGAACAGATCAATATAGTGAAAACTTGTCAAATGCTTGGTCTTGGCCAAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAG
CAAAACTTAGGCTTCATATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAAATATGAAAACAAACGGGCTACCGATGGTAATAAACCACCAAATATACCTCCT
CAATTTGCATTTCTAGTTAAAGCTGCACTTAAAGATGCACATGCCTTTATCAAGAACAAATTCAATACAAGTAACGTAGAAGACATTCTTCCATCAATGTCTCCAGACAT
ATGCATTGTGAACTTCTACACAACTAGTGGAAGACTGGGCCTGCATCAGGATCGTGATGAAAGCAAAGAGAGTCTTGTCAGTGGACTACCGGTCGTTTCCTTTTCTGTGG
GCAATGCAGCAGAATTCTTGTATGGAGATAAAAGAGATGTGGATAAAGCAGAGAAGGTTGTTCTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGACTCTAGACATATA
TTTCATGGAGTATCTTCGATCATACCAAAATCGTCACCTAAGTTTTTGCTTGGTCATTCTGGTCTGCGTCCTGGACGTCTGAATCTTACCTTCAGAAAATATTAA
mRNA sequenceShow/hide mRNA sequence
GCATTAAAAATGGCCGAAATCGTTACTCCTTCCTCTTCCCGGTGCAGTCTCCTCGGCCAACGCGACGGCGACCGAAACTTGCGGACTAAGTCTACGGTATCGTCATTATG
TCAATCCGATCCTCCTCCGACGGTTCCATTTCCTTCTCCTTTTCCAATTTTTGGATTCATCCTCAAATTTTGAAACCTATAAGTATCTCTGTTGCATCCAAAATTACCTA
TGCTGTTCGTCCGTACACTTCCCCTTTCGTCATCGCCCTGGTCCAATCAACATCGTCGTCTTTTCTTCCTCGCTTCTCGATTTCCGGGCGAACGCGGTTTTCGATTGCTC
CGATTTCAACGAATGGATTCGTTTGCCAGTACGGCAAATAGCCATGCACTACCTGATTCTTCATGTTCTGGTAGTTCTTGTGGCGGAGACAAGGAACATTCGCATAATAG
AGATCATAATTCAGATGTGATATTTGTGGGAAGCCTTCCTGTGTATCTAAATCCCAAGGAACGTGAACTGAAATCTTTATCTCCACCGTCTGTTAATAAATGTGATGATT
TTGAGTTGGGAAGAAATAAGAAGGGGATTCCTGCAAATGAACCGAAATCTTACCATTATGATGAGTTTCTACCTGTTTCTAGACAACATACTAAAAGAAGCCGGATAGAT
TTAGGGTTCAAAAGAGGTCTGAAGAGTAATGTAAGATCATTTCAAATGGAGAGGTTTGAATGTTTGAACGATGAATCATCTCTGCCTAATCATTTTGGGAAGAAAATTGA
AGCCTTCTATTTCTCGAAGCACCAGTCTGTTGATATTGGTTCCAAAGAATCTGTAGTTGCGGACAAATCGCTTCCCATTGAACCATTTAATATTTGTTTCCCCGGAAGAG
GTAATGTGAAGTCTGGAGCTTCTTGGCAAGTTAAAGGCAGGGACACTGTGAAAGAAAATGATTGTTTTGTTGAAACTACAAATTATAGAATGCTGAGGCCTGGAATGGTT
TTACTGAAGCACTACATGACTCAACGTGAACAGATCAATATAGTGAAAACTTGTCAAATGCTTGGTCTTGGCCAAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGC
AAAACTTAGGCTTCATATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAAATATGAAAACAAACGGGCTACCGATGGTAATAAACCACCAAATATACCTCCTC
AATTTGCATTTCTAGTTAAAGCTGCACTTAAAGATGCACATGCCTTTATCAAGAACAAATTCAATACAAGTAACGTAGAAGACATTCTTCCATCAATGTCTCCAGACATA
TGCATTGTGAACTTCTACACAACTAGTGGAAGACTGGGCCTGCATCAGGATCGTGATGAAAGCAAAGAGAGTCTTGTCAGTGGACTACCGGTCGTTTCCTTTTCTGTGGG
CAATGCAGCAGAATTCTTGTATGGAGATAAAAGAGATGTGGATAAAGCAGAGAAGGTTGTTCTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGACTCTAGACATATAT
TTCATGGAGTATCTTCGATCATACCAAAATCGTCACCTAAGTTTTTGCTTGGTCATTCTGGTCTGCGTCCTGGACGTCTGAATCTTACCTTCAGAAAATATTAAAACACT
TCAGTGTCTGGTTTCTTTCTTATGGTCTCCATACTTATCTTGTACATGTGAATGGCCGTTGTGCTTATTCATTGGATGATCATTTATATGAAATATTGTGAATCTATATT
GTTTCATTCTGGTTCTTGTACGTTGGATACTTCTGCTCATTACTTG
Protein sequenceShow/hide protein sequence
MLFVRTLPLSSSPWSNQHRRLFFLASRFPGERGFRLLRFQRMDSFASTANSHALPDSSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDD
FELGRNKKGIPANEPKSYHYDEFLPVSRQHTKRSRIDLGFKRGLKSNVRSFQMERFECLNDESSLPNHFGKKIEAFYFSKHQSVDIGSKESVVADKSLPIEPFNICFPGR
GNVKSGASWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQREQINIVKTCQMLGLGQGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRATDGNKPPNIPP
QFAFLVKAALKDAHAFIKNKFNTSNVEDILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSFSVGNAAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHI
FHGVSSIIPKSSPKFLLGHSGLRPGRLNLTFRKY