; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G064720 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G064720
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationCicolChr04:21458497..21460996
RNA-Seq ExpressionCcUC04G064720
SyntenyCcUC04G064720
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050407.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]1.0e-18276.62Show/hide
Query:  SHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDL
        S A PD+SC G+S  CG DKEH  +RD+ SDVIFVGS  V+LNPKERE KSL+P SV KCD  E+G +K GI +NEPKSY Y+E LPVSRQ+T+R+RIDL
Subjt:  SHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDL

Query:  GFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQSVDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDC
        G K  LKSN RSFQ+ER E LND     ESSLP HFGKK E F FSK QS+DIGSKESVV D SLP E PF+ICF GGGNVK  + W+VK   TVK+   
Subjt:  GFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQSVDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDC

Query:  FVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKD
              YR+LRPGMVLLKHY+T  EQINIVKTCQKLGLGPGGFYQPGYKDGAKLRL MMCLGLDWDPQTR+YENKR +DGNKPP+IPP F+ LVK ALKD
Subjt:  FVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKD

Query:  AHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFH
        AHAFIKNKCN SNVEDILPSMSPDIC+ NFYTTSGRLGLHQD DESKESL SGLPVVSFSVGN AEFLYGDKRD+DKAEKV LESGDVLIFGG+SRH+FH
Subjt:  AHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFH

Query:  GVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
        GVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  GVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

TYJ97997.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]6.8e-18276.39Show/hide
Query:  SHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDL
        S A PD+SC G+S  CG DKEH  +RD+ SDVIFVGS  V+LNPKERE KSL+P SV KCD  E+G +K GI +NEPKSY Y+E LPVSRQ+T+R+RIDL
Subjt:  SHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDL

Query:  GFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQSVDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDC
        G K  LKSN RSFQ+ER E LND     ESSLP HFGKK E F FSK QS+DIGSKESVV D S P E PF+ICF GGGNVK  + W+VK   TVK+   
Subjt:  GFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQSVDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDC

Query:  FVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKD
              YR+LRPGMVLLKHY+T  EQINIVKTCQKLGLGPGGFYQPGYKDGAKLRL MMCLGLDWDPQTR+YENKR +DGNKPP+IPP F+ LVK ALKD
Subjt:  FVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKD

Query:  AHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFH
        AHAFIKNKCN SNVEDILPSMSPDIC+ NFYTTSGRLGLHQD DESKESL SGLPVVSFSVGN AEFLYGDKRD+DKAEKV LESGDVLIFGG+SRH+FH
Subjt:  AHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFH

Query:  GVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
        GVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  GVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

XP_004149927.1 uncharacterized protein LOC101210053 isoform X1 [Cucumis sativus]2.4e-18771.58Show/hide
Query:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK
        M F+RTLPL PSP SNQLRRL FPAS FP  RGFRLLQFQ MDSF+++ANSHALPD+SC GSS  CG DKEH H+RD++SDVI VGS+PV+LNPKER   
Subjt:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK

Query:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDLGFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQS
                                 EPKSY Y+E LPV RQ+T+RSRIDLG K  LKSN RS+Q+ER E LND     +SSLP HFGKK E F  SK QS
Subjt:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDLGFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQS

Query:  VDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKD
        +D G KESVV D SLP E PF+IC  GGGNVK  + + VK   TVK+         YR+LRPGMVLLKHY+T  EQINIVKTCQ LG+GPGGFYQPGYKD
Subjt:  VDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKD

Query:  GAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESL
        GAKLRL MMCLGLDWDPQTR+YENKR +DGNKPP+IPPQF  LVK ALKDAHAFIKN CN SNVE+ILPSMSPDIC+ NFYTT GRLGLHQD DESKESL
Subjt:  GAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESL

Query:  VSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
          GLPVVSFSVGNAAEFLYGDKR++DKAE V LESGDVLIFGG+SRHIFHGVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  VSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

XP_016903166.1 PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo]1.2e-20576.14Show/hide
Query:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK
        M F+RTLPL PSP SNQLRRL FPAS FPG RGF LLQFQRMDSF+S+ANSHA PD+SC G+S  CG DKEH  +RD+ SDVIF+GS  V+LNPKERE K
Subjt:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK

Query:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDLGFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQS
        SL+P S  KCD  E+G +K GI +NEPKSY Y+EFLPVSRQ+T+R+RIDLG K  LKSN RSFQ+ER E  ND     ESSLP HFGKK E F FSK QS
Subjt:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDLGFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQS

Query:  VDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKD
        +DIGSKESVV D SLP E PF+ICF GGGNVK  + W+VK   TVK+         YR+LRPGMVLLKHY+T  EQINIVKTCQKLGLGPGGFYQP YKD
Subjt:  VDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKD

Query:  GAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESL
        GAKLRL MMCLGLDWDPQTR+Y+NKR +DGNKPP+IPP F+ LVK+ALKDAHAFIKNKCN SNVEDILPSMSPDIC+ NFYTTSGRLGLHQD DESKESL
Subjt:  GAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESL

Query:  VSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
         SGLPVVSFSVGN AEFLYGDKRD++KAEKV LESGDVLIFGG+SRH+FHGVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  VSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

XP_022144035.1 uncharacterized protein LOC111013827 [Momordica charantia]3.9e-16956.27Show/hide
Query:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSL
        M  +RT+P+  SP SNQL RL F +SRFPG R  RLLQF+RMDS  ++A SH            G   E+SHNR H+SD++ VG +PVYLN K  E +S 
Subjt:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSL

Query:  SPPSVNKCDDFELGRNKKGIPANEPKSY-------------------------------------------------------------------QY---
        SP SVNK DDFELGR +K  PAN P SY                                                                   QY   
Subjt:  SPPSVNKCDDFELGRNKKGIPANEPKSY-------------------------------------------------------------------QY---

Query:  ---------------------------------------NEFLPVSRQHTK-RSRIDLGFKGGLKSNVRSFQMERFECLN-----DESSLPNHFGKKIET
                                               +EF PVSRQ+TK R+R+DLGF+    +N  SFQ+E F  LN     DESS PN FGKK E 
Subjt:  ---------------------------------------NEFLPVSRQHTK-RSRIDLGFKGGLKSNVRSFQMERFECLN-----DESSLPNHFGKKIET

Query:  FYFSKHQSVDIGSKESVVRDKSLPTEPFNIC-FRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGG
        FY  K QS+DIGSK S+V D   P EPF+IC     GN K G+ WQ KGRDTVK  +   E +NYR+LRPGMVLLK+Y+T HEQ+NIVKTCQ+LG+GPGG
Subjt:  FYFSKHQSVDIGSKESVVRDKSLPTEPFNIC-FRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGG

Query:  FYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQD
        FY+PGYKDGAKLRL MMCLGLDWDPQTRKY +KRA+DG+KPP IPP+FA LV  ALKDAHA IKNKCNT NVE ILPSMSPDIC+VNFYTTSGRLGLHQD
Subjt:  FYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQD

Query:  CDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
         DESKESLVSGLPVVS S+G++AEFLYGD+RD+DKAEKV+LESGDVLIFGGDSRH+FHGVSSIIP STPK LL HTGLRPGRLNLTFRKY
Subjt:  CDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein1.2e-18771.58Show/hide
Query:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK
        M F+RTLPL PSP SNQLRRL FPAS FP  RGFRLLQFQ MDSF+++ANSHALPD+SC GSS  CG DKEH H+RD++SDVI VGS+PV+LNPKER   
Subjt:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK

Query:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDLGFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQS
                                 EPKSY Y+E LPV RQ+T+RSRIDLG K  LKSN RS+Q+ER E LND     +SSLP HFGKK E F  SK QS
Subjt:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDLGFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQS

Query:  VDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKD
        +D G KESVV D SLP E PF+IC  GGGNVK  + + VK   TVK+         YR+LRPGMVLLKHY+T  EQINIVKTCQ LG+GPGGFYQPGYKD
Subjt:  VDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKD

Query:  GAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESL
        GAKLRL MMCLGLDWDPQTR+YENKR +DGNKPP+IPPQF  LVK ALKDAHAFIKN CN SNVE+ILPSMSPDIC+ NFYTT GRLGLHQD DESKESL
Subjt:  GAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESL

Query:  VSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
          GLPVVSFSVGNAAEFLYGDKR++DKAE V LESGDVLIFGG+SRHIFHGVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  VSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC1035021835.6e-20676.14Show/hide
Query:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK
        M F+RTLPL PSP SNQLRRL FPAS FPG RGF LLQFQRMDSF+S+ANSHA PD+SC G+S  CG DKEH  +RD+ SDVIF+GS  V+LNPKERE K
Subjt:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELK

Query:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDLGFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQS
        SL+P S  KCD  E+G +K GI +NEPKSY Y+EFLPVSRQ+T+R+RIDLG K  LKSN RSFQ+ER E  ND     ESSLP HFGKK E F FSK QS
Subjt:  SLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDLGFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQS

Query:  VDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKD
        +DIGSKESVV D SLP E PF+ICF GGGNVK  + W+VK   TVK+         YR+LRPGMVLLKHY+T  EQINIVKTCQKLGLGPGGFYQP YKD
Subjt:  VDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKD

Query:  GAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESL
        GAKLRL MMCLGLDWDPQTR+Y+NKR +DGNKPP+IPP F+ LVK+ALKDAHAFIKNKCN SNVEDILPSMSPDIC+ NFYTTSGRLGLHQD DESKESL
Subjt:  GAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESL

Query:  VSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
         SGLPVVSFSVGN AEFLYGDKRD++KAEKV LESGDVLIFGG+SRH+FHGVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  VSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

A0A5A7U7Q2 2-oxoglutarate-dependent dioxygenase family protein isoform 15.1e-18376.62Show/hide
Query:  SHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDL
        S A PD+SC G+S  CG DKEH  +RD+ SDVIFVGS  V+LNPKERE KSL+P SV KCD  E+G +K GI +NEPKSY Y+E LPVSRQ+T+R+RIDL
Subjt:  SHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDL

Query:  GFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQSVDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDC
        G K  LKSN RSFQ+ER E LND     ESSLP HFGKK E F FSK QS+DIGSKESVV D SLP E PF+ICF GGGNVK  + W+VK   TVK+   
Subjt:  GFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQSVDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDC

Query:  FVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKD
              YR+LRPGMVLLKHY+T  EQINIVKTCQKLGLGPGGFYQPGYKDGAKLRL MMCLGLDWDPQTR+YENKR +DGNKPP+IPP F+ LVK ALKD
Subjt:  FVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKD

Query:  AHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFH
        AHAFIKNKCN SNVEDILPSMSPDIC+ NFYTTSGRLGLHQD DESKESL SGLPVVSFSVGN AEFLYGDKRD+DKAEKV LESGDVLIFGG+SRH+FH
Subjt:  AHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFH

Query:  GVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
        GVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  GVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 13.3e-18276.39Show/hide
Query:  SHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDL
        S A PD+SC G+S  CG DKEH  +RD+ SDVIFVGS  V+LNPKERE KSL+P SV KCD  E+G +K GI +NEPKSY Y+E LPVSRQ+T+R+RIDL
Subjt:  SHALPDTSCSGSS--CGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSVNKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDL

Query:  GFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQSVDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDC
        G K  LKSN RSFQ+ER E LND     ESSLP HFGKK E F FSK QS+DIGSKESVV D S P E PF+ICF GGGNVK  + W+VK   TVK+   
Subjt:  GFKGGLKSNVRSFQMERFECLND-----ESSLPNHFGKKIETFYFSKHQSVDIGSKESVVRDKSLPTE-PFNICFRGGGNVKSGSSWQVKGRDTVKENDC

Query:  FVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKD
              YR+LRPGMVLLKHY+T  EQINIVKTCQKLGLGPGGFYQPGYKDGAKLRL MMCLGLDWDPQTR+YENKR +DGNKPP+IPP F+ LVK ALKD
Subjt:  FVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKD

Query:  AHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFH
        AHAFIKNKCN SNVEDILPSMSPDIC+ NFYTTSGRLGLHQD DESKESL SGLPVVSFSVGN AEFLYGDKRD+DKAEKV LESGDVLIFGG+SRH+FH
Subjt:  AHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFH

Query:  GVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
        GVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  GVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

A0A6J1CQI1 uncharacterized protein LOC1110138271.9e-16956.27Show/hide
Query:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSL
        M  +RT+P+  SP SNQL RL F +SRFPG R  RLLQF+RMDS  ++A SH            G   E+SHNR H+SD++ VG +PVYLN K  E +S 
Subjt:  MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSL

Query:  SPPSVNKCDDFELGRNKKGIPANEPKSY-------------------------------------------------------------------QY---
        SP SVNK DDFELGR +K  PAN P SY                                                                   QY   
Subjt:  SPPSVNKCDDFELGRNKKGIPANEPKSY-------------------------------------------------------------------QY---

Query:  ---------------------------------------NEFLPVSRQHTK-RSRIDLGFKGGLKSNVRSFQMERFECLN-----DESSLPNHFGKKIET
                                               +EF PVSRQ+TK R+R+DLGF+    +N  SFQ+E F  LN     DESS PN FGKK E 
Subjt:  ---------------------------------------NEFLPVSRQHTK-RSRIDLGFKGGLKSNVRSFQMERFECLN-----DESSLPNHFGKKIET

Query:  FYFSKHQSVDIGSKESVVRDKSLPTEPFNIC-FRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGG
        FY  K QS+DIGSK S+V D   P EPF+IC     GN K G+ WQ KGRDTVK  +   E +NYR+LRPGMVLLK+Y+T HEQ+NIVKTCQ+LG+GPGG
Subjt:  FYFSKHQSVDIGSKESVVRDKSLPTEPFNIC-FRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGG

Query:  FYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQD
        FY+PGYKDGAKLRL MMCLGLDWDPQTRKY +KRA+DG+KPP IPP+FA LV  ALKDAHA IKNKCNT NVE ILPSMSPDIC+VNFYTTSGRLGLHQD
Subjt:  FYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQD

Query:  CDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY
         DESKESLVSGLPVVS S+G++AEFLYGD+RD+DKAEKV+LESGDVLIFGGDSRH+FHGVSSIIP STPK LL HTGLRPGRLNLTFRKY
Subjt:  CDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog7.4e-1440.71Show/hide
Query:  PSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHT
        P   PD C+VN Y    R+GLHQD DE+        PV+S S+G+ A F  G     D    + L SGDV    G +R  FHGV  I+P S+  V  G  
Subjt:  PSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHT

Query:  GLRPGRLNLTFRK
            GR+NLT R+
Subjt:  GLRPGRLNLTFRK

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB4.8e-1333.6Show/hide
Query:  NKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSII
        N C  +      P   PD C++N Y    +L LHQD DE         P+VS S+G  A F +G  +  D  ++++LE GDV+++GG+SR  +HG+  + 
Subjt:  NKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSII

Query:  PKSTPKVLLGHTGLRPGRLNLTFRK
            P  +         R NLTFR+
Subjt:  PKSTPKVLLGHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog7.4e-1440.71Show/hide
Query:  PSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHT
        P   PD C+VN Y    R+GLHQD DE+        PV+S S+G+ A F  G     D    + L SGDV    G +R  FHGV  I+P S+  V  G  
Subjt:  PSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHT

Query:  GLRPGRLNLTFRK
            GR+NLT R+
Subjt:  GLRPGRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB3.7e-1334.96Show/hide
Query:  CNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPK
        C  + +     S  PD C++N Y    +L LHQD DE         P+VS S+G  A F +G  R  D  ++++LE GD++++GG+SR  +HG+  +   
Subjt:  CNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPK

Query:  STPKVLLGHTGLRPGRLNLTFRK
          P      TG    R NLTFR+
Subjt:  STPKVLLGHTGLRPGRLNLTFRK

Q9SA98 Alpha-ketoglutarate-dependent dioxygenase alkB9.4e-0931.37Show/hide
Query:  KLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVS
        KLR   + L  DW        +KR  D + P N  P   C     L   HA I                 P+  +VN++     LG H D  E+  S   
Subjt:  KLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVS

Query:  GLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSI
          P+VS S+G  A FL G K   D    + L SGDV++  G++R  FHG+  I
Subjt:  GLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSI

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein6.7e-1031.37Show/hide
Query:  KLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVS
        KLR   + L  DW        +KR  D + P N  P   C     L   HA I                 P+  +VN++     LG H D  E+  S   
Subjt:  KLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVS

Query:  GLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSI
          P+VS S+G  A FL G K   D    + L SGDV++  G++R  FHG+  I
Subjt:  GLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein1.0e-6352.29Show/hide
Query:  MLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNK
        ++RPGMVLLK+Y++ + Q+ IV  C++LGLG GGFYQPG++DG  L L MMCLG +WD QTR+Y   R IDG+ PP IP +F+ LV+ A+K++ + +   
Subjt:  MLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNK

Query:  CNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQ---------------------DCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGD
         N +   D +P + PDICVVNFYT++G+LGLHQ                     D  ESK+SL  GLP+VSFS+G++AEFLYGD++D+DKA+ ++LESGD
Subjt:  CNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQ---------------------DCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGD

Query:  VLIFGGDSRHIFHGVSSI
        VLIFG  SR++FHGV SI
Subjt:  VLIFGGDSRHIFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein1.1e-7658.04Show/hide
Query:  MLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNK
        ++RPGMVLLK+Y++ ++Q+ IV  C++LGLG GGFYQPGY+D AKL L MMCLG +WDP+T +Y   R  DG+  P IP +F   V+ A+K++ +   + 
Subjt:  MLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNK

Query:  CNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPK
           +   D +P M PDIC+VNFY+++GRLGLHQD DES+ S+  GLPVVSFS+G++AEFLYGD+RD DKAE + LESGDVL+FGG SR +FHGV SI   
Subjt:  CNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPK

Query:  STPKVLLGHTGLRPGRLNLTFRKY
        + PK LL  T LRPGRLNLTFR+Y
Subjt:  STPKVLLGHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein8.5e-8249.84Show/hide
Query:  MERFECLNDESSLPNHFGKKIETFYFSKHQSVDIGSK-ESVVRDKSLPTEPFNICF----RGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMV
        M  F+  N  SS  +   + ++      H++    S+ +S  R K  P  PF+IC     R   ++K         R+TV+ ++       ++++RPGMV
Subjt:  MERFECLNDESSLPNHFGKKIETFYFSKHQSVDIGSK-ESVVRDKSLPTEPFNICF----RGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMV

Query:  LLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVE
        LLK ++T   Q++IVKTC++LG+ P GFYQPGY  G+KL L MMCLG +WDPQT KY     ID +K P IP  F  LV+ A+++AHA I  +  T + E
Subjt:  LLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVE

Query:  DILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLL
         ILP MSPDIC+VNFY+ +GRLGLHQD DES+ES+  GLP+VSFS+G++AEFLYG+KRD+++A+ V+LESGDVLIFGG+SR IFHGV SIIP S P  LL
Subjt:  DILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLL

Query:  GHTGLRPGRLNLTFRKY
          + LR GRLNLTFR +
Subjt:  GHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein8.5e-8249.84Show/hide
Query:  MERFECLNDESSLPNHFGKKIETFYFSKHQSVDIGSK-ESVVRDKSLPTEPFNICF----RGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMV
        M  F+  N  SS  +   + ++      H++    S+ +S  R K  P  PF+IC     R   ++K         R+TV+ ++       ++++RPGMV
Subjt:  MERFECLNDESSLPNHFGKKIETFYFSKHQSVDIGSK-ESVVRDKSLPTEPFNICF----RGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMV

Query:  LLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVE
        LLK ++T   Q++IVKTC++LG+ P GFYQPGY  G+KL L MMCLG +WDPQT KY     ID +K P IP  F  LV+ A+++AHA I  +  T + E
Subjt:  LLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYENKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVE

Query:  DILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLL
         ILP MSPDIC+VNFY+ +GRLGLHQD DES+ES+  GLP+VSFS+G++AEFLYG+KRD+++A+ V+LESGDVLIFGG+SR IFHGV SIIP S P  LL
Subjt:  DILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLL

Query:  GHTGLRPGRLNLTFRKY
          + LR GRLNLTFR +
Subjt:  GHTGLRPGRLNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTTCGTCCGTACACTTCCCCTTTCGCCATCGCCCTGGTCCAATCAACTTCGTCGTCTTTTCTTCCCCGCTTCTCGATTTCCCGGCGAACGCGGTTTTCGA
TTGCTCCAATTTCAACGAATGGATTCGTTTGCCAGTACGGCAAATAGCCATGCACTACCTGATACTTCATGTTCTGGTAGTTCTTGTGGCGGAGACAAGGAACAT
TCGCATAATAGAGATCATAATTCAGATGTGATATTTGTGGGAAGCCTTCCTGTGTATCTAAATCCTAAGGAACGTGAACTGAAATCTTTATCTCCACCTTCTGTT
AATAAATGTGATGATTTTGAGTTGGGAAGAAATAAGAAGGGGATTCCTGCAAATGAACCGAAATCTTACCAGTATAATGAGTTTCTACCTGTTTCTAGACAACAT
ACTAAAAGAAGCCGGATAGATTTAGGGTTCAAAGGAGGTCTGAAGAGTAATGTAAGATCATTTCAAATGGAGAGGTTTGAATGTTTGAACGATGAATCATCTCTG
CCTAATCATTTTGGGAAGAAAATTGAAACCTTCTATTTCTCGAAGCACCAGTCTGTTGATATTGGTTCCAAAGAATCTGTAGTTCGGGACAAATCGCTTCCCACT
GAACCATTTAATATTTGTTTCCGTGGGGGAGGTAATGTGAAATCGGGATCTTCTTGGCAAGTTAAAGGCAGGGACACTGTGAAAGAAAATGATTGTTTTGTTGAA
ACTACAAATTATAGAATGCTGAGGCCAGGAATGGTTTTACTGAAGCACTACATGACTCAACATGAACAGATCAATATAGTGAAAACTTGTCAAAAGCTTGGTCTT
GGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCATATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAAATATGAA
AACAAACGGGCTATCGATGGTAATAAACCACCAAATATACCTCCTCAATTTGCATGTCTAGTTAAAGCTGCACTTAAAGATGCACATGCCTTTATCAAGAACAAA
TGCAATACAAGTAACGTAGAAGACATTCTTCCATCAATGTCTCCAGACATATGCGTTGTGAACTTCTACACAACTAGTGGAAGACTGGGCCTGCATCAGGATTGT
GATGAAAGCAAAGAGAGTCTTGTCAGCGGACTACCGGTCGTTTCCTTTTCTGTAGGCAATGCAGCAGAATTCTTGTATGGAGATAAAAGAGATTTGGATAAAGCA
GAGAAGGTTGTTCTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGACTCTAGACATATATTTCATGGAGTATCTTCGATCATACCAAAATCGACACCTAAGGTT
TTGCTTGGTCATACTGGTCTGCGTCCTGGACGTCTGAATCTTACCTTCAGAAAATATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGTTCGTCCGTACACTTCCCCTTTCGCCATCGCCCTGGTCCAATCAACTTCGTCGTCTTTTCTTCCCCGCTTCTCGATTTCCCGGCGAACGCGGTTTTCGA
TTGCTCCAATTTCAACGAATGGATTCGTTTGCCAGTACGGCAAATAGCCATGCACTACCTGATACTTCATGTTCTGGTAGTTCTTGTGGCGGAGACAAGGAACAT
TCGCATAATAGAGATCATAATTCAGATGTGATATTTGTGGGAAGCCTTCCTGTGTATCTAAATCCTAAGGAACGTGAACTGAAATCTTTATCTCCACCTTCTGTT
AATAAATGTGATGATTTTGAGTTGGGAAGAAATAAGAAGGGGATTCCTGCAAATGAACCGAAATCTTACCAGTATAATGAGTTTCTACCTGTTTCTAGACAACAT
ACTAAAAGAAGCCGGATAGATTTAGGGTTCAAAGGAGGTCTGAAGAGTAATGTAAGATCATTTCAAATGGAGAGGTTTGAATGTTTGAACGATGAATCATCTCTG
CCTAATCATTTTGGGAAGAAAATTGAAACCTTCTATTTCTCGAAGCACCAGTCTGTTGATATTGGTTCCAAAGAATCTGTAGTTCGGGACAAATCGCTTCCCACT
GAACCATTTAATATTTGTTTCCGTGGGGGAGGTAATGTGAAATCGGGATCTTCTTGGCAAGTTAAAGGCAGGGACACTGTGAAAGAAAATGATTGTTTTGTTGAA
ACTACAAATTATAGAATGCTGAGGCCAGGAATGGTTTTACTGAAGCACTACATGACTCAACATGAACAGATCAATATAGTGAAAACTTGTCAAAAGCTTGGTCTT
GGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCATATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAAATATGAA
AACAAACGGGCTATCGATGGTAATAAACCACCAAATATACCTCCTCAATTTGCATGTCTAGTTAAAGCTGCACTTAAAGATGCACATGCCTTTATCAAGAACAAA
TGCAATACAAGTAACGTAGAAGACATTCTTCCATCAATGTCTCCAGACATATGCGTTGTGAACTTCTACACAACTAGTGGAAGACTGGGCCTGCATCAGGATTGT
GATGAAAGCAAAGAGAGTCTTGTCAGCGGACTACCGGTCGTTTCCTTTTCTGTAGGCAATGCAGCAGAATTCTTGTATGGAGATAAAAGAGATTTGGATAAAGCA
GAGAAGGTTGTTCTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGACTCTAGACATATATTTCATGGAGTATCTTCGATCATACCAAAATCGACACCTAAGGTT
TTGCTTGGTCATACTGGTCTGCGTCCTGGACGTCTGAATCTTACCTTCAGAAAATATTAA
Protein sequenceShow/hide protein sequence
MLFVRTLPLSPSPWSNQLRRLFFPASRFPGERGFRLLQFQRMDSFASTANSHALPDTSCSGSSCGGDKEHSHNRDHNSDVIFVGSLPVYLNPKERELKSLSPPSV
NKCDDFELGRNKKGIPANEPKSYQYNEFLPVSRQHTKRSRIDLGFKGGLKSNVRSFQMERFECLNDESSLPNHFGKKIETFYFSKHQSVDIGSKESVVRDKSLPT
EPFNICFRGGGNVKSGSSWQVKGRDTVKENDCFVETTNYRMLRPGMVLLKHYMTQHEQINIVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPQTRKYE
NKRAIDGNKPPNIPPQFACLVKAALKDAHAFIKNKCNTSNVEDILPSMSPDICVVNFYTTSGRLGLHQDCDESKESLVSGLPVVSFSVGNAAEFLYGDKRDLDKA
EKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKVLLGHTGLRPGRLNLTFRKY