; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0009033 (gene) of Chayote v1 genome

Gene IDSed0009033
OrganismSechium edule (Chayote v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationLG03:23238069..23240753
RNA-Seq ExpressionSed0009033
SyntenySed0009033
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0006281 - DNA repair (biological process)
GO:0006974 - cellular response to DNA damage stimulus (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0070989 - oxidative demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0043412 - macromolecule modification (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0005622 - intracellular (cellular component)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0032451 - demethylase activity (molecular function)
GO:0016706 - 2-oxoglutarate-dependent dioxygenase activity (molecular function)
GO:0008198 - ferrous iron binding (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR004574 - Alkylated DNA repair protein AlkB


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia]2.5e-17070.07Show/hide
Query:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS
        MLLIRT+P S  P  NLL RLL A+SR   FQ MDSF SSA    +PD  C GSSCG N   LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K 
Subjt:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS

Query:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD
        DDF+ R DQKGI  N+PSSY+D  FPPV    +KRR+RIDLG ER LK++T ++ ++ +  P  F K             DI SK+ L T NL P E FD
Subjt:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD

Query:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK
        IC  +RRGK+K    WQ KD    KVMEHA EA N  V+RPGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Subjt:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK

Query:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD
        Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED LP+MSPDICIVNFYSTSGRLGLHQDRDES+ESLV GLPVVSFSLGNSAEFLYGD
Subjt:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD

Query:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
        +RDVD A KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPK LLDHTGLRPGRLNLTFRKY
Subjt:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

KAG7027547.1 alkB, partial [Cucurbita argyrosperma subsp. argyrosperma]4.1e-16070.91Show/hide
Query:  IPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFER
        +PD  C GSSCG N   LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K DDF+ R DQKGI  N+PSSY+D  FPPV    +KRR+RIDLG ER
Subjt:  IPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFER

Query:  SLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVL
         LK++T ++ ++ +  P  F K             DI SK+ + T NL P E FDIC  +RRGK+K    WQ KD    KVMEHA EA N  V+RPGMVL
Subjt:  SLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVL

Query:  LKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVED
        LKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RKY + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED
Subjt:  LKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVED

Query:  TLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLD
         LP+MSPDICIVNFYSTSGRLGLHQDRDES+ESLV GLPVVSFSLGNSAEFLYGD+RDVD A KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPK LLD
Subjt:  TLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLD

Query:  HTGLRPGRLNLTFRKY
        HTGLRPGRLNLTFRKY
Subjt:  HTGLRPGRLNLTFRKY

XP_022924913.1 uncharacterized protein LOC111432318 [Cucurbita moschata]7.4e-17069.85Show/hide
Query:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS
        MLLIRT+P S  P  NLL RLL A+SR   FQ +DSF SSA    +PD  C GSSCG N   LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K 
Subjt:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS

Query:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD
        DDF+ R DQKGI  N+PSSY+D  FPPV    +KRR+RIDLG ER LK++T ++ ++ +  P  F K             DI SK+ L T NL P E FD
Subjt:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD

Query:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK
        IC  +RRGK+K    WQ KD    KVMEHA EA N  V+RPGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Subjt:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK

Query:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD
        Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED LP+MSPDICIVNFYSTSGRLGLHQDRDES+ESLV GLPVVSFSLGNSAEFLYGD
Subjt:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD

Query:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
        +RDVD A KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPK LLDHTGLRPGRLNLTFRKY
Subjt:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

XP_022966314.1 uncharacterized protein LOC111466008 [Cucurbita maxima]1.8e-16066.81Show/hide
Query:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS
        M LIRT+P S  P  NLL +LL A+SR   FQ MDSF SSA    +P+  C GSS G N   LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K 
Subjt:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS

Query:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD
        DDF+ R DQKGI  N+PS Y+D  FPPV    +KRR+RID G ER LK++T ++ +  +  P  F K   +         DI SK+ L T NL P E FD
Subjt:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD

Query:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK
        IC  +RRGK+K    WQ KD    KVMEH  EA N  V+RPGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Subjt:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK

Query:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD
        Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+ N +ED LP+MSPDICIVNFYST GRLGLHQDRDES+ESLV+GLPVVSFSLGNSA FLYGD
Subjt:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD

Query:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
         R+VD A KI+LESGDVLIFGGESRHIFHGVSSIIPKS PK LLDHTG RPG LNLTFRKY
Subjt:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

XP_023517205.1 uncharacterized protein LOC111781040 [Cucurbita pepo subsp. pepo]1.6e-16969.63Show/hide
Query:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS
        M LIRT+P S  P  NLL RLL A+SR   FQ MDSF SSA    +PD  C GSSCG N   LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K 
Subjt:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS

Query:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD
        DDF+ R DQK I  N+PSSY+D  FPPV    +KRR+RIDLG ER LK++T ++ ++ +  P  F K   +         DI SK+ L T NL P E FD
Subjt:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD

Query:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK
        IC  +RRGK+K    WQ KD    KVMEHA EA N  V+RPGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Subjt:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK

Query:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD
        Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED LP+MSPDICIVNFYSTSGRLGLHQDRDES+ESLV+GLPVVSFSLGNSAEFLYGD
Subjt:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD

Query:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
        +RDVD A KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPK LLDHTGLRPGRLNLTFRKY
Subjt:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein4.7e-13857.8Show/hide
Query:  MLLIRTIPVSPSPSLNLLHRLLLAQS-----------RFQPMDSFASSANSREIPDLPCCGSSCGA-----NLHGRDHNSNVIMIGTVPVNLNHKVSKQT
        M  IRT+P+ PSPS N L RLL   S           +FQPMDSF++SANS  +PD  CCGSSCG      +LH RD++S+VI +G++PV+LN K  +  
Subjt:  MLLIRTIPVSPSPSLNLLHRLLLAQS-----------RFQPMDSFASSANSREIPDLPCCGSSCGA-----NLHGRDHNSNVIMIGTVPVNLNHKVSKQT

Query:  SLSRLPVDKSDDFESRRDQKGISPNVPSSY-YD--FPPVSPSKRRNRIDLGFERSLKSNTRTTHVD------------ESSLPNQFGKKNGSYFPYNCWP
                                  P SY YD   P    + RR+RIDLG +R LKSN R+  V+            +SSLP  FGKKN   F      
Subjt:  SLSRLPVDKSDDFESRRDQKGISPNVPSSY-YD--FPPVSPSKRRNRIDLGFERSLKSNTRTTHVD------------ESSLPNQFGKKNGSYFPYNCWP

Query:  VDIDSKSYLFTDNLHPFE-PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDG
        +D   K  + TDN  PFE PFDIC     G  K    + VK+ G V       K+  +LRPGMVLLKHYI+  +Q+++VKTCQ LG+GP GFYQPGYKDG
Subjt:  VDIDSKSYLFTDNLHPFE-PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLV
        AKLRL+MMCLGLDWDPQ R+YE  R VDGNKPP IPP+F  LVK ALK AHA IKNN N +NVE+ LPSMSPDICI NFY+T GRLGLHQDRDESKESL 
Subjt:  AKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLV

Query:  NGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
         GLPVVSFS+GN+AEFLYGD+R+VD AE + LESGDVLIFGGESRHIFHGVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  NGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC1035021831.4e-14560.71Show/hide
Query:  MLLIRTIPVSPSPSLNLLHRLLLAQS-----------RFQPMDSFASSANSREIPDLPCCGSSCGA-----NLHGRDHNSNVIMIGTVPVNLNHKVSKQT
        M  IRT+P+ PSPS N L RLL   S           +FQ MDSF+SSANS   PD  C G+SCG      +L  RD+ S+VI +G+  V+LN K  +  
Subjt:  MLLIRTIPVSPSPSLNLLHRLLLAQS-----------RFQPMDSFASSANSREIPDLPCCGSSCGA-----NLHGRDHNSNVIMIGTVPVNLNHKVSKQT

Query:  SLSRLPVDKSDDFESRRDQKGISPNVPSSYY--DFPPVS-PSKRRNRIDLGFERSLKSNTRTTHVD------------ESSLPNQFGKKNGSYFPYNCWP
        SL+ L   K D  E   D+ GIS N P SY+  +F PVS  + RRNRIDLG +R LKSN R+  V+            ESSLP  FGKKN  +F      
Subjt:  SLSRLPVDKSDDFESRRDQKGISPNVPSSYY--DFPPVS-PSKRRNRIDLGFERSLKSNTRTTHVD------------ESSLPNQFGKKNGSYFPYNCWP

Query:  VDIDSKSYLFTDNLHPFE-PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDG
        +DI SK  + TD+  PFE PFDIC     G  K    W+VKD+G V       K+  +LRPGMVLLKHYI+  +Q+++VKTCQKLGLGP GFYQP YKDG
Subjt:  VDIDSKSYLFTDNLHPFE-PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLV
        AKLRL+MMCLGLDWDPQ R+Y+  R VDGNKPP IPP F+ LVK ALK AHA IKN  N +NVED LPSMSPDICI NFY+TSGRLGLHQDRDESKESL 
Subjt:  AKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLV

Query:  NGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
        +GLPVVSFS+GN+AEFLYGD+RDV+ AEK+ LESGDVLIFGGESRH+FHGVSSIIPKSTPK LL HTGLRPGRLNLTFRKY
Subjt:  NGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

A0A6J1CQI1 uncharacterized protein LOC1110138278.3e-15152.14Show/hide
Query:  MLLIRTIPVSPSPSLNLLHRLLLAQSR-----------FQPMDSFASSANSREIPDLPCCGSSCGANLHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRL
        M +IRT+P+SP    N LHRLL A SR           F+ MDS  +SA S           +   N H R H+S+++M+G +PV LN K  ++ S S  
Subjt:  MLLIRTIPVSPSPSLNLLHRLLLAQSR-----------FQPMDSFASSANSREIPDLPCCGSSCGANLHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRL

Query:  PVDKSDDFESRRDQKGISPNVPSSYYD-------------------------------------------------------------------------
         V+KSDDFE  R++K    NVP+SY+D                                                                         
Subjt:  PVDKSDDFESRRDQKGISPNVPSSYYD-------------------------------------------------------------------------

Query:  --------------------------------------FPPVS--PSKRRNRIDLGFERSLKSNT----------RTTHVDESSLPNQFGKKNGSYFPYN
                                              F PVS   +KRRNR+DLGF+RS  +++            + +DESS PNQFGKKN  ++   
Subjt:  --------------------------------------FPPVS--PSKRRNRIDLGFERSLKSNT----------RTTHVDESSLPNQFGKKNGSYFPYN

Query:  CWPVDIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVK--DNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPG
        C  +DI SK+ L  DNLHPFEPFDIC  +RRG AK G HWQ K  D  KVMEH  EA N  VLRPGMVLLK+YI+LH+QV++VKTCQ+LG+GP GFY+PG
Subjt:  CWPVDIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVK--DNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPG

Query:  YKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESK
        YKDGAKLRLQMMCLGLDWDPQ RKY   RAVDG+KPP+IPP+FA+LV EALK AHALIKN  NT NVE  LPSMSPDICIVNFY+TSGRLGLHQDRDESK
Subjt:  YKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESK

Query:  ESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
        ESLV+GLPVVS SLG+SAEFLYGDRRDVD AEK++LESGDVLIFGG+SRH+FHGVSSIIP STPK LLDHTGLRPGRLNLTFRKY
Subjt:  ESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC1114323183.6e-17069.85Show/hide
Query:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS
        MLLIRT+P S  P  NLL RLL A+SR   FQ +DSF SSA    +PD  C GSSCG N   LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K 
Subjt:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS

Query:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD
        DDF+ R DQKGI  N+PSSY+D  FPPV    +KRR+RIDLG ER LK++T ++ ++ +  P  F K             DI SK+ L T NL P E FD
Subjt:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD

Query:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK
        IC  +RRGK+K    WQ KD    KVMEHA EA N  V+RPGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Subjt:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK

Query:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD
        Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED LP+MSPDICIVNFYSTSGRLGLHQDRDES+ESLV GLPVVSFSLGNSAEFLYGD
Subjt:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD

Query:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
        +RDVD A KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPK LLDHTGLRPGRLNLTFRKY
Subjt:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

A0A6J1HTF0 uncharacterized protein LOC1114660088.8e-16166.81Show/hide
Query:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS
        M LIRT+P S  P  NLL +LL A+SR   FQ MDSF SSA    +P+  C GSS G N   LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K 
Subjt:  MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKS

Query:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD
        DDF+ R DQKGI  N+PS Y+D  FPPV    +KRR+RID G ER LK++T ++ +  +  P  F K   +         DI SK+ L T NL P E FD
Subjt:  DDFESRRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFD

Query:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK
        IC  +RRGK+K    WQ KD    KVMEH  EA N  V+RPGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Subjt:  ICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK

Query:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD
        Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+ N +ED LP+MSPDICIVNFYST GRLGLHQDRDES+ESLV+GLPVVSFSLGNSA FLYGD
Subjt:  YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD

Query:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
         R+VD A KI+LESGDVLIFGGESRHIFHGVSSIIPKS PK LLDHTG RPG LNLTFRKY
Subjt:  RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog1.3e-1533.5Show/hide
Query:  PWGFYQPGYKDGAKLRLQMMCLG-LDWDPQAR--KYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGR
        P+  Y+  Y  G  + + M  LG L W   AR  +Y       G   P +PP        AL     ++ +           P   PD C+VN Y    R
Subjt:  PWGFYQPGYKDGAKLRLQMMCLG-LDWDPQAR--KYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGR

Query:  LGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRP--GRLNLTFRK

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB1.4e-1434.87Show/hide
Query:  NKP-PKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAE
        NKP P +P  F  L + A   A                 P   PD C++N Y+   +L LHQD+DE         P+VS SLG  A F +G  +  D  +
Subjt:  NKP-PKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAE

Query:  KIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRK
        +++LE GDV+++GGESR  +HG+  +     P L +D       R NLTFR+
Subjt:  KIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog1.3e-1533.5Show/hide
Query:  PWGFYQPGYKDGAKLRLQMMCLG-LDWDPQAR--KYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGR
        P+  Y+  Y  G  + + M  LG L W   AR  +Y       G   P +PP        AL     ++ +           P   PD C+VN Y    R
Subjt:  PWGFYQPGYKDGAKLRLQMMCLG-LDWDPQAR--KYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGR

Query:  LGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB1.4e-1440.18Show/hide
Query:  SMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTG
        S  PD C++N Y+   +L LHQD+DE         P+VS SLG  A F +G  R  D  ++I+LE GD++++GGESR  +HG+  +     P      TG
Subjt:  SMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTG

Query:  LRPGRLNLTFRK
            R NLTFR+
Subjt:  LRPGRLNLTFRK

Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB1.2e-0837.78Show/hide
Query:  VNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYG-DRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLD
        VNFYS    +G H D  E +       P++S S G++A FL G + RD+     + + SGD++I GG SR+ +HGV+ I+  S    L+D
Subjt:  VNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYG-DRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLD

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein4.1e-0939.76Show/hide
Query:  PDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSI
        P+  IVN++     LG H D  E+  S     P+VS SLG  A FL G +   D    + L SGDV++  GE+R  FHG+  I
Subjt:  PDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein2.8e-6646.97Show/hide
Query:  PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQAR
        PFDI   K+  + K        +  +  + A +  +  V+RPGMVLLK+Y+S++ QV +V  C++LGLG  GFYQPG++DG  L L+MMCLG +WD Q R
Subjt:  PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQAR

Query:  KYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQ---------------------DRDESKES
        +Y + R +DG+ PP+IP EF+ LV++A+K + +L+  N N     D +P + PDIC+VNFY+++G+LGLHQ                     D+ ESK+S
Subjt:  KYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQ---------------------DRDESKES

Query:  LVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKL
        L  GLP+VSFS+G+SAEFLYGD++DVD A+ ++LESGDVLIFG  SR++FHGV SI     P+L
Subjt:  LVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKL

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein7.6e-8059.05Show/hide
Query:  AVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKC
        A +  + TV+RPGMVLLK+Y+S++ QV +V  C++LGLG  GFYQPGY+D AKL L+MMCLG +WDP+  +Y + R  DG+  P+IP EF   V++A+K 
Subjt:  AVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKC

Query:  AHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFH
        + +L  +N       D +P M PDICIVNFYS++GRLGLHQD+DES+ S+  GLPVVSFS+G+SAEFLYGD+RD D AE + LESGDVL+FGG SR +FH
Subjt:  AHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFH

Query:  GVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
        GV SI   + PK LL  T LRPGRLNLTFR+Y
Subjt:  GVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein1.1e-8359.32Show/hide
Query:  PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKN-NTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQA
        PFDICSS       S   W + D  +     VE  N + V+RPGMVLLK +++   QV +VKTC++LG+ P GFYQPGY  G+KL LQMMCLG +WDPQ 
Subjt:  PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKN-NTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQA

Query:  RKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLY
         KY +N  +D +K P+IP  F VLV++A++ AHALI     T + E  LP MSPDICIVNFYS +GRLGLHQDRDES+ES+  GLP+VSFS+G+SAEFLY
Subjt:  RKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLY

Query:  GDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
        G++RDV+ A+ ++LESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GRLNLTFR +
Subjt:  GDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein1.1e-8359.32Show/hide
Query:  PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKN-NTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQA
        PFDICSS       S   W + D  +     VE  N + V+RPGMVLLK +++   QV +VKTC++LG+ P GFYQPGY  G+KL LQMMCLG +WDPQ 
Subjt:  PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKN-NTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQA

Query:  RKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLY
         KY +N  +D +K P+IP  F VLV++A++ AHALI     T + E  LP MSPDICIVNFYS +GRLGLHQDRDES+ES+  GLP+VSFS+G+SAEFLY
Subjt:  RKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLY

Query:  GDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
        G++RDV+ A+ ++LESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GRLNLTFR +
Subjt:  GDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTTGATCCGTACAATTCCCGTCTCGCCCTCGCCGTCGTTGAATCTTCTTCATCGGCTTTTGCTCGCTCAGTCTCGATTTCAGCCCATGGATTCGTTTGCCAGTTC
AGCAAATTCCCGCGAAATACCTGATCTGCCTTGTTGTGGTAGTTCTTGTGGCGCCAACTTGCATGGTAGAGATCATAATTCAAATGTGATAATGATAGGAACAGTTCCTG
TGAATCTGAATCACAAGGTCAGTAAACAGACATCTTTGTCTCGGTTGCCTGTTGATAAAAGTGATGATTTCGAGTCGAGAAGAGATCAAAAGGGGATTTCTCCAAATGTA
CCCAGTTCTTACTATGATTTTCCACCTGTTTCTCCTTCCAAAAGAAGAAACCGAATCGATTTAGGATTTGAAAGAAGTTTGAAGAGTAATACAAGAACAACTCATGTGGA
TGAATCATCCTTGCCTAATCAATTTGGAAAGAAAAATGGATCGTATTTTCCTTATAACTGCTGGCCTGTGGATATCGATTCCAAAAGTTATCTATTTACTGACAATTTGC
ATCCCTTTGAACCATTTGATATATGTTCTTCAAAAAGAAGAGGTAAAGCAAAATCCGGAGGTCATTGGCAGGTTAAAGACAATGGGAAAGTTATGGAGCATGCTGTAGAA
GCTAAAAATAATACAGTGTTGAGGCCTGGAATGGTTTTATTGAAGCACTACATTAGTCTACATAAACAGGTCAGTTTAGTGAAAACTTGTCAAAAGCTTGGTCTTGGCCC
ATGGGGGTTTTACCAGCCTGGTTATAAAGATGGTGCAAAACTCCGGCTTCAGATGATGTGCCTTGGATTGGATTGGGATCCTCAAGCAAGGAAATATGAACAAAACCGGG
CTGTTGATGGTAATAAACCACCAAAAATACCTCCTGAATTCGCAGTTCTGGTTAAAGAAGCACTTAAATGTGCACATGCCTTGATCAAGAACAACGGCAATACAAATAAC
GTAGAAGACACACTTCCATCAATGTCTCCTGATATATGCATTGTGAATTTCTACTCGACAAGTGGAAGACTGGGTTTGCATCAGGATCGTGATGAAAGCAAAGAGAGTCT
CGTTAACGGACTACCGGTCGTCTCGTTTTCTTTAGGCAATTCAGCAGAATTCTTGTATGGAGATCGAAGAGATGTCGATATTGCAGAGAAGATTGTACTGGAATCAGGCG
ATGTTCTAATATTTGGTGGAGAATCTAGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCAACGCCTAAGTTGTTGCTTGATCATACGGGTCTTCGACCTGGG
CGTCTAAATCTTACCTTTAGAAAGTATTAG
mRNA sequenceShow/hide mRNA sequence
TTAATTTGGTGGAAATTGTTACGCCTCCTCGGCCAACTCGGCGGCGACCGAAACTTGCGGATTAAGCTCCCAAAATCGTATTCCTCCTCCGACGATTCCATTTCCGACTC
TTGGGTTCGTTCCCAAATTTTGAAACCTATAAACAATCTCTGTACAACCCATTATTCCCAATGTTGTTGATCCGTACAATTCCCGTCTCGCCCTCGCCGTCGTTGAATCT
TCTTCATCGGCTTTTGCTCGCTCAGTCTCGATTTCAGCCCATGGATTCGTTTGCCAGTTCAGCAAATTCCCGCGAAATACCTGATCTGCCTTGTTGTGGTAGTTCTTGTG
GCGCCAACTTGCATGGTAGAGATCATAATTCAAATGTGATAATGATAGGAACAGTTCCTGTGAATCTGAATCACAAGGTCAGTAAACAGACATCTTTGTCTCGGTTGCCT
GTTGATAAAAGTGATGATTTCGAGTCGAGAAGAGATCAAAAGGGGATTTCTCCAAATGTACCCAGTTCTTACTATGATTTTCCACCTGTTTCTCCTTCCAAAAGAAGAAA
CCGAATCGATTTAGGATTTGAAAGAAGTTTGAAGAGTAATACAAGAACAACTCATGTGGATGAATCATCCTTGCCTAATCAATTTGGAAAGAAAAATGGATCGTATTTTC
CTTATAACTGCTGGCCTGTGGATATCGATTCCAAAAGTTATCTATTTACTGACAATTTGCATCCCTTTGAACCATTTGATATATGTTCTTCAAAAAGAAGAGGTAAAGCA
AAATCCGGAGGTCATTGGCAGGTTAAAGACAATGGGAAAGTTATGGAGCATGCTGTAGAAGCTAAAAATAATACAGTGTTGAGGCCTGGAATGGTTTTATTGAAGCACTA
CATTAGTCTACATAAACAGGTCAGTTTAGTGAAAACTTGTCAAAAGCTTGGTCTTGGCCCATGGGGGTTTTACCAGCCTGGTTATAAAGATGGTGCAAAACTCCGGCTTC
AGATGATGTGCCTTGGATTGGATTGGGATCCTCAAGCAAGGAAATATGAACAAAACCGGGCTGTTGATGGTAATAAACCACCAAAAATACCTCCTGAATTCGCAGTTCTG
GTTAAAGAAGCACTTAAATGTGCACATGCCTTGATCAAGAACAACGGCAATACAAATAACGTAGAAGACACACTTCCATCAATGTCTCCTGATATATGCATTGTGAATTT
CTACTCGACAAGTGGAAGACTGGGTTTGCATCAGGATCGTGATGAAAGCAAAGAGAGTCTCGTTAACGGACTACCGGTCGTCTCGTTTTCTTTAGGCAATTCAGCAGAAT
TCTTGTATGGAGATCGAAGAGATGTCGATATTGCAGAGAAGATTGTACTGGAATCAGGCGATGTTCTAATATTTGGTGGAGAATCTAGGCATATATTTCATGGAGTATCT
TCAATCATACCTAAATCAACGCCTAAGTTGTTGCTTGATCATACGGGTCTTCGACCTGGGCGTCTAAATCTTACCTTTAGAAAGTATTAGAACATAGATGTGTCTGTTTT
TAGGTCTCGTTTGTCATGTACACATGAAATGGCTGTTGTATTTCTTCATCACATATTTATCAAAATGAAATATCGCAAGTCTAAAATGTAGGCCCCG
Protein sequenceShow/hide protein sequence
MLLIRTIPVSPSPSLNLLHRLLLAQSRFQPMDSFASSANSREIPDLPCCGSSCGANLHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNV
PSSYYDFPPVSPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVE
AKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNN
VEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPG
RLNLTFRKY