CuGenDBv2

MS013880 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS013880
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: 2-oxoglutarate-dependent dioxygenase family protein isoform 1
Genome location: scaffold607:541765..543732
RNA-Seq Expression: MS013880
Synteny: MS013880
Gene Ontology terms:
GO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domains:
IPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
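
The genome location field above uses a `scaffold:start..end` notation. As a minimal sketch, it can be split into its parts as shown below. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold607:541765..543732")
# Assumption: coordinates are 1-based and inclusive at both ends.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the gene spans 1968 bp on scaffold607; with 0-based or half-open coordinates the span would differ by one.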


Homology
GenBank top hits (e value, % identity, alignment)
KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia] | e value: 4.1e-159 | % identity: 69.66
Query:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL
        HNR H+S+++M+GEIPV LNRK  E+ES S  SV K DDF+L  ++K  PAN+P+SYHD  DEF PV RQNTKRR+R+DLG +R   N+TSS Q+E    
Subjt:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL

Query:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY
                          +NEPF  +K +S DIGSKNSL   NL P E FDIC  ERRG +K    WQ K R+T+KVMEH  EATN  V+RPGMVLLK+Y
Subjt:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY

Query:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS
        I LHEQVNIVKT Q+LG+GPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY  KR  DG+KPP++PP+FAILV +AL DAHALIKN  +T N+E ILP+
Subjt:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS

Query:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL
        MSPDICIVNFY+TSGRLGLHQDRDES+ESLV GLPVVS SLG+SAEFLYGD+RDVDKA K+ILESGDVLIFGG+SRH+FHGVSSIIP STPKFLLDHTGL
Subjt:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL

Query:  RPGRLNLTFRKY
        RPGRLNLTFRKY
Subjt:  RPGRLNLTFRKY

KAG7027547.1 alkB, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.1e-159 | % identity: 69.66
Query:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL
        HNR H+S+++M+GEIPV LNRK  E+ES S  SV K DDF+L  ++K  PAN+P+SYHD  DEF PV RQNTKRR+R+DLG +R   N+TSS Q+E    
Subjt:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL

Query:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY
                          +NEPF  +K +S DIGSKNS+   NL P E FDIC  ERRG +KP   WQ K R T+KVMEH  EATN  V+RPGMVLLK+Y
Subjt:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY

Query:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS
        I LHEQVNIVKT Q+LG+GPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY  KR  DG+KPP++PP+FAILV +AL DAHALIKN  +T N+E ILP+
Subjt:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS

Query:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL
        MSPDICIVNFY+TSGRLGLHQDRDES+ESLV GLPVVS SLG+SAEFLYGD+RDVDKA K+ILESGDVLIFGG+SRH+FHGVSSIIP STPKFLLDHTGL
Subjt:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL

Query:  RPGRLNLTFRKY
        RPGRLNLTFRKY
Subjt:  RPGRLNLTFRKY

XP_022144035.1 uncharacterized protein LOC111013827 [Momordica charantia] | e value: 1.3e-229 | % identity: 79.04
Query:  ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDK------------------------------------
        ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDK                                    
Subjt:  ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDK------------------------------------

Query:  -----------------------------------------------------------------------DEFQPVSRQNTKRRNRVDLGFQRSNNTSS
                                                                               DEFQPVSRQNTKRRNRVDLGFQRSNNTSS
Subjt:  -----------------------------------------------------------------------DEFQPVSRQNTKRRNRVDLGFQRSNNTSS

Query:  FQVEGFSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRP
        FQVEGFSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGR+TVKVMEHVAEA+NYRVLRP
Subjt:  FQVEGFSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRP

Query:  GMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTG
        GMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTG
Subjt:  GMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTG

Query:  NVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPK
        NVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPK
Subjt:  NVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPK

Query:  FLLDHTGLRPGRLNLTFRKY
        FLLDHTGLRPGRLNLTFRKY
Subjt:  FLLDHTGLRPGRLNLTFRKY

XP_022924913.1 uncharacterized protein LOC111432318 [Cucurbita moschata] | e value: 3.7e-160 | % identity: 69.9
Query:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL
        HNR H+S+++M+GEIPV LNRK  E+ES S  SV K DDF+L  ++K  PAN+P+SYHD  DEF PV RQNTKRR+R+DLG +R   N+TSS Q+E    
Subjt:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL

Query:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY
                          +NEPF  +K +S DIGSKNSL   NL P E FDIC  ERRG +KP   WQ K R+T+KVMEH  EATN  V+RPGMVLLK+Y
Subjt:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY

Query:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS
        I LHEQVNIVKT Q+LG+GPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY  KR  DG+KPP++PP+FAILV +AL DAHALIKN  +T N+E ILP+
Subjt:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS

Query:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL
        MSPDICIVNFY+TSGRLGLHQDRDES+ESLV GLPVVS SLG+SAEFLYGD+RDVDKA K+ILESGDVLIFGG+SRH+FHGVSSIIP STPKFLLDHTGL
Subjt:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL

Query:  RPGRLNLTFRKY
        RPGRLNLTFRKY
Subjt:  RPGRLNLTFRKY

XP_023517205.1 uncharacterized protein LOC111781040 [Cucurbita pepo subsp. pepo] | e value: 2.0e-161 | % identity: 70.39
Query:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL
        HNR H+S+++M+GEIPV LNRK  E+ES S  SV K DDF+L  ++KR PAN+P+SYHD  DEF PV RQNTKRR+R+DLG +R   N+TSS Q+E    
Subjt:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL

Query:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY
                          +NEPF   K +S DIGSKNSL   NL P E FDIC  ERRG +KP   WQ K R+T+KVMEH  EATN  V+RPGMVLLK+Y
Subjt:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY

Query:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS
        I LHEQVNIVKT Q+LG+GPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY  KR  DG+KPP++PP+FAILV +AL DAHALIKN  +T N+E ILP+
Subjt:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS

Query:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL
        MSPDICIVNFY+TSGRLGLHQDRDES+ESLVSGLPVVS SLG+SAEFLYGD+RDVDKA K+ILESGDVLIFGG+SRH+FHGVSSIIP STPKFLLDHTGL
Subjt:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL

Query:  RPGRLNLTFRKY
        RPGRLNLTFRKY
Subjt:  RPGRLNLTFRKY

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U7Q2 2-oxoglutarate-dependent dioxygenase family protein isoform 1 | e value: 9.6e-146 | % identity: 67.07
Query:  ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEG
        E+  +R + SD++ VG   V+LN K  E +S +P SV K D  E+G ++    +N P SYH   DE  PVSRQNT RRNR+DLG +R   +N  SFQVE 
Subjt:  ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEG

Query:  FSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFE-PFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVL
           LN+  Q  ESS P  FGKKNE F+  K QS+DIGSK S+V D+  PFE PFDIC     GN K    W+ K   TVK         +YR+LRPGMVL
Subjt:  FSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFE-PFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVL

Query:  LKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVES
        LK+YIT  EQ+NIVKTCQ+LG+GPGGFY+PGYKDGAKLRL+MMCLGLDWDPQTR+Y +KR VDG+KPP+IPP F+ LV  ALKDAHA IKNKCN  NVE 
Subjt:  LKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVES

Query:  ILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLD
        ILPSMSPDICI NFYTTSGRLGLHQDRDESKESL SGLPVVS S+G++AEFLYGD+RDVDKAEKV LESGDVLIFGG+SRHVFHGVSSIIP STPKFLL 
Subjt:  ILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLD

Query:  HTGLRPGRLNLTFRKY
        HTGLRPGRLNLTFRKY
Subjt:  HTGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 1 | e value: 7.4e-146 | % identity: 67.07
Query:  ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEG
        E+  +R + SD++ VG   V+LN K  E +S +P SV K D  E+G ++    +N P SYH   DE  PVSRQNT RRNR+DLG +R   +N  SFQVE 
Subjt:  ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEG

Query:  FSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFE-PFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVL
           LN+  Q  ESS P  FGKKNE F+  K QS+DIGSK S+V D+  PFE PFDIC     GN K    W+ K   TVK         +YR+LRPGMVL
Subjt:  FSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFE-PFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVL

Query:  LKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVES
        LK+YIT  EQ+NIVKTCQ+LG+GPGGFY+PGYKDGAKLRL+MMCLGLDWDPQTR+Y +KR VDG+KPP+IPP F+ LV  ALKDAHA IKNKCN  NVE 
Subjt:  LKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVES

Query:  ILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLD
        ILPSMSPDICI NFYTTSGRLGLHQDRDESKESL SGLPVVS S+G++AEFLYGD+RDVDKAEKV LESGDVLIFGG+SRHVFHGVSSIIP STPKFLL 
Subjt:  ILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLD

Query:  HTGLRPGRLNLTFRKY
        HTGLRPGRLNLTFRKY
Subjt:  HTGLRPGRLNLTFRKY

A0A6J1CQI1 uncharacterized protein LOC111013827 | e value: 6.3e-230 | % identity: 79.04
Query:  ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDK------------------------------------
        ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDK                                    
Subjt:  ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDK------------------------------------

Query:  -----------------------------------------------------------------------DEFQPVSRQNTKRRNRVDLGFQRSNNTSS
                                                                               DEFQPVSRQNTKRRNRVDLGFQRSNNTSS
Subjt:  -----------------------------------------------------------------------DEFQPVSRQNTKRRNRVDLGFQRSNNTSS

Query:  FQVEGFSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRP
        FQVEGFSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGR+TVKVMEHVAEA+NYRVLRP
Subjt:  FQVEGFSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRP

Query:  GMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTG
        GMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTG
Subjt:  GMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTG

Query:  NVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPK
        NVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPK
Subjt:  NVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPK

Query:  FLLDHTGLRPGRLNLTFRKY
        FLLDHTGLRPGRLNLTFRKY
Subjt:  FLLDHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC111432318 | e value: 1.8e-160 | % identity: 69.9
Query:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL
        HNR H+S+++M+GEIPV LNRK  E+ES S  SV K DDF+L  ++K  PAN+P+SYHD  DEF PV RQNTKRR+R+DLG +R   N+TSS Q+E    
Subjt:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL

Query:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY
                          +NEPF  +K +S DIGSKNSL   NL P E FDIC  ERRG +KP   WQ K R+T+KVMEH  EATN  V+RPGMVLLK+Y
Subjt:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY

Query:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS
        I LHEQVNIVKT Q+LG+GPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY  KR  DG+KPP++PP+FAILV +AL DAHALIKN  +T N+E ILP+
Subjt:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS

Query:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL
        MSPDICIVNFY+TSGRLGLHQDRDES+ESLV GLPVVS SLG+SAEFLYGD+RDVDKA K+ILESGDVLIFGG+SRH+FHGVSSIIP STPKFLLDHTGL
Subjt:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL

Query:  RPGRLNLTFRKY
        RPGRLNLTFRKY
Subjt:  RPGRLNLTFRKY

A0A6J1HTF0 uncharacterized protein LOC111466008 | e value: 3.0e-155 | % identity: 67.96
Query:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL
        HNR H+S+++M+GEIPV LNRK  E+ES S  SV K DDF+L  ++K  PAN+P+ YHD  DEF PV RQNTKRR+R+D G +R   N+TSS Q+     
Subjt:  HNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQR--SNNTSSFQVEGFSL

Query:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY
                         K+NEPF   K +S DIGSKNSL   NL P E FDIC  ERRG +KP   WQ+K R+T+KVMEHV EATN  V+RPGMVLLK+Y
Subjt:  LNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNY

Query:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS
        I LHEQVNIVKT Q+LG+GPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY  KR  DG+KPP++PP+FAILV +AL DAHALIKN  +   +E ILP+
Subjt:  ITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPS

Query:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL
        MSPDICIVNFY+T GRLGLHQDRDES+ESLVSGLPVVS SLG+SA FLYGD R+VDKA K+ILESGDVLIFGG+SRH+FHGVSSIIP S PKFLLDHTG 
Subjt:  MSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGL

Query:  RPGRLNLTFRKY
        RPG LNLTFRKY
Subjt:  RPGRLNLTFRKY

SwissProt top hits (e value, % identity, alignment)
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog | e value: 1.9e-18 | % identity: 35.03
Query:  PGGFYRPGYKDGAKLRLQMMCLG-LDWDPQTR--KYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGR
        P   YR  Y  G  + + M  LG L W    R  +Y D+    G   P++PP        AL D   ++ +           P   PD C+VN Y    R
Subjt:  PGGFYRPGYKDGAKLRLQMMCLG-LDWDPQTR--KYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGR

Query:  LGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S+SLGD+A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh1 | e value: 2.8e-17 | % identity: 28.75
Query:  PGMVLLKNYITLHEQVNIVKTCQ-----------------ELGVGPGGFYRPGYK-DGAKL------------------RLQMMCLGLDWDPQTRKYGDK
        PG+++LKNY++   Q+ ++K+                   +L +G    +R  Y  DG  +                  +L+ + LG  +D  T++Y D 
Subjt:  PGMVLLKNYITLHEQVNIVKTCQ-----------------ELGVGPGGFYRPGYK-DGAKL------------------RLQMMCLGLDWDPQTRKYGDK

Query:  RAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDV
              K P  P      V + +K++   +  K               +  IVNFY+    L  H   DES+E L   LP++SLS+G    +L G     
Subjt:  RAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDV

Query:  DKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLL
        +K   + L SGDV+I  G SR  FH V  IIPNSTP +LL
Subjt:  DKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB | e value: 1.7e-14 | % identity: 34.4
Query:  NKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSII
        N C      +  P   PD C++N Y    +L LHQD+DE         P+VS+SLG  A F +G  +  D  ++++LE GDV+++GG+SR  +HG+  + 
Subjt:  NKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSII

Query:  PNSTPKFLLDHTGLRPGRLNLTFRK
            P  +         R NLTFR+
Subjt:  PNSTPKFLLDHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog | e value: 1.9e-18 | % identity: 35.03
Query:  PGGFYRPGYKDGAKLRLQMMCLG-LDWDPQTR--KYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGR
        P   YR  Y  G  + + M  LG L W    R  +Y D+    G   P++PP        AL D   ++ +           P   PD C+VN Y    R
Subjt:  PGGFYRPGYKDGAKLRLQMMCLG-LDWDPQTR--KYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGR

Query:  LGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S+SLGD+A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB | e value: 4.0e-16 | % identity: 29.88
Query:  EATNYRVLRPGMVLLKNYI-----TLHEQVNIVKTCQELG--VGPGGFYRPGYKDGAKLRLQMM-CLGLDWDPQTRKYGDKRAVDG---DKP-PEIPPKF
        EA     L PG V+L+ +      +L + +  V +       V PGG+          + + M  C  L W   T ++G   AV     DKP P +P  F
Subjt:  EATNYRVLRPGMVLLKNYI-----TLHEQVNIVKTCQELG--VGPGGFYRPGYKDGAKLRLQMM-CLGLDWDPQTRKYGDKRAVDG---DKP-PEIPPKF

Query:  AILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLI
        A               + C    + +   S  PD C++N Y    +L LHQD+DE         P+VS+SLG  A F +G  R  D  ++++LE GD+++
Subjt:  AILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLI

Query:  FGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRPGRLNLTFRK
        +GG+SR  +HG+  +     P      TG    R NLTFR+
Subjt:  FGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRPGRLNLTFRK

Arabidopsis top hits (e value, % identity, alignment)
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein | e value: 6.4e-09 | % identity: 38.55
Query:  PDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSI
        P+  IVN++     LG H D  E+  S     P+VS+SLG  A FL G +   D    + L SGDV++  G++R  FHG+  I
Subjt:  PDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein | e value: 1.5e-66 | % identity: 48.84
Query:  PFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQ
        PFDI   ++    KP         E  +  +  A+  +  V+RPGMVLLKNY++++ QV IV  C++LG+G GGFY+PG++DG  L L+MMCLG +WD Q
Subjt:  PFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQ

Query:  TRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQ---------------------DRDESK
        TR+YG+ R +DG  PP IP +F+ LV +A+K++ +L+    N       +P + PDIC+VNFYT++G+LGLHQ                     D+ ESK
Subjt:  TRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQ---------------------DRDESK

Query:  ESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSI
        +SL  GLP+VS S+GDSAEFLYGD++DVDKA+ +ILESGDVLIFG  SR+VFHGV SI
Subjt:  ESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein | e value: 1.0e-78 | % identity: 60.27
Query:  VLRPGMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNK
        V+RPGMVLLKNY+++++QV IV  C+ LG+G GGFY+PGY+D AKL L+MMCLG +WDP+T +YG+ R  DG   P IP +F   V +A+K++ +L  + 
Subjt:  VLRPGMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNK

Query:  CNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPN
                 +P M PDICIVNFY+++GRLGLHQD+DES+ S+  GLPVVS S+GDSAEFLYGD+RD DKAE + LESGDVL+FGG SR VFHGV SI  +
Subjt:  CNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPN

Query:  STPKFLLDHTGLRPGRLNLTFRKY
        + PK LL  T LRPGRLNLTFR+Y
Subjt:  STPKFLLDHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein | e value: 1.2e-84 | % identity: 49.71
Query:  KRRNRVDLGFQRSNNTSSFQV-----EGFSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHW--
        +RR R        +N ++F+V        S+++ +S    SS  +   +  +   V+  ++    S++        P  PFDIC      N      W  
Subjt:  KRRNRVDLGFQRSNNTSSFQV-----EGFSLLNNNSQLDESSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHW--

Query:  -QFKGRETVKVMEHVAEATNYRVLRPGMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEI
             RETV+V      +  ++V+RPGMVLLK+++T   QV+IVKTC+ELGV P GFY+PGY  G+KL LQMMCLG +WDPQT KY     +D  K PEI
Subjt:  -QFKGRETVKVMEHVAEATNYRVLRPGMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEI

Query:  PPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESG
        P  F +LV +A+++AHALI  +  T + E ILP MSPDICIVNFY+ +GRLGLHQDRDES+ES+  GLP+VS S+GDSAEFLYG++RDV++A+ VILESG
Subjt:  PPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESG

Query:  DVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRPGRLNLTFRKY
        DVLIFGG+SR +FHGV SIIPNS P  LL+ + LR GRLNLTFR +
Subjt:  DVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein | e value: 9.4e-85 | % identity: 46.12
Query:  YEEESSSPWSVNKSDDFELGRERKRTP--ANVPNSYHDDKD--EFQPVSRQNTKRRNRVDLGFQRSNNTSSFQV-----EGFSLLNNNSQLDESSQPNQF
        YEE+       +K+  F LG     TP  ++   ++   KD    Q       +RR R        +N ++F+V        S+++ +S    SS  +  
Subjt:  YEEESSSPWSVNKSDDFELGRERKRTP--ANVPNSYHDDKD--EFQPVSRQNTKRRNRVDLGFQRSNNTSSFQV-----EGFSLLNNNSQLDESSQPNQF

Query:  GKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHW---QFKGRETVKVMEHVAEATNYRVLRPGMVLLKNYITLHEQVNIVKTC
         +  +   V+  ++    S++        P  PFDIC      N      W       RETV+V      +  ++V+RPGMVLLK+++T   QV+IVKTC
Subjt:  GKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHW---QFKGRETVKVMEHVAEATNYRVLRPGMVLLKNYITLHEQVNIVKTC

Query:  QELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTT
        +ELGV P GFY+PGY  G+KL LQMMCLG +WDPQT KY     +D  K PEIP  F +LV +A+++AHALI  +  T + E ILP MSPDICIVNFY+ 
Subjt:  QELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTT

Query:  SGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRPGRLNLTFRKY
        +GRLGLHQDRDES+ES+  GLP+VS S+GDSAEFLYG++RDV++A+ VILESGDVLIFGG+SR +FHGV SIIPNS P  LL+ + LR GRLNLTFR +
Subjt:  SGRLGLHQDRDESKESLVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRPGRLNLTFRKY
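
In the alignment blocks above, the unlabeled middle line marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch, the identity of a single block can be estimated by counting the letters on that line. The function name `identity_from_midline` is hypothetical, and the database may compute its reported % identity over a different alignment length, so the numbers need not match.

```python
def identity_from_midline(midline: str, aligned_length: int) -> float:
    """Percent of aligned columns whose midline character is a residue letter."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Letters mark identical residues; '+' and spaces are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Last block of the A0A6J1HTF0 alignment: query line and its midline.
query = "RPGRLNLTFRKY"
midline = "RPG LNLTFRKY"
print(round(identity_from_midline(midline, len(query)), 2))
```

For a whole hit, the same count would be summed over every block and divided by the total number of aligned columns.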


Sequences
CDS sequence
GAAAATTCGCATAATAGAGGTCATAGTTCAGATATGGTAATGGTGGGAGAAATTCCTGTGTATCTAAATCGCAAGAGATATGAAGAGGAATCTTCATCTCCGTGGTCTGT
AAATAAAAGCGATGATTTTGAGTTGGGAAGAGAGCGAAAAAGGACTCCTGCAAATGTGCCAAATTCTTACCATGATGATAAGGATGAGTTTCAACCAGTGTCTAGACAAA
ATACTAAAAGAAGAAATCGGGTAGATTTAGGATTCCAGAGATCAAATAATACAAGTTCATTTCAAGTGGAGGGGTTCTCATTGCTGAACAACAACAGTCAGCTGGATGAA
TCATCTCAGCCTAATCAATTTGGGAAGAAAAACGAACCGTTTTATGTCCAGAAATGTCAGTCTATGGATATCGGTTCCAAAAATTCTCTAGTCATGGACAATTTGCATCC
CTTTGAACCATTCGATATCTGTCCCCATGAAAGAAGAGGCAATGCGAAACCCGGAGCTCATTGGCAATTTAAAGGTAGGGAGACTGTGAAAGTAATGGAGCATGTTGCTG
AAGCTACTAATTATAGAGTGCTGAGACCTGGAATGGTTTTACTGAAGAATTACATTACTCTACATGAACAGGTCAATATAGTGAAAACTTGTCAAGAGCTTGGAGTTGGC
CCAGGGGGATTTTACAGGCCTGGTTATAAAGATGGTGCAAAACTTAGGCTTCAGATGATGTGTCTTGGTTTGGACTGGGATCCTCAAACAAGGAAATATGGAGATAAGCG
GGCAGTCGATGGCGATAAACCACCAGAAATACCTCCTAAATTCGCAATTCTAGTTACAGAAGCACTTAAAGATGCACATGCCCTGATCAAGAACAAATGCAATACGGGTA
ACGTAGAAAGCATACTTCCATCAATGTCTCCTGATATCTGCATTGTGAACTTCTATACAACGAGCGGAAGACTGGGTCTGCATCAGGATCGCGATGAAAGCAAAGAGAGT
CTCGTTAGCGGACTACCCGTTGTTTCGTTATCATTAGGCGATTCGGCAGAATTCTTGTATGGAGATCGAAGAGATGTAGATAAAGCAGAGAAGGTTATATTGGAATCAGG
TGATGTTCTGATATTTGGTGGAGATTCTAGGCATGTATTTCATGGAGTATCTTCAATCATACCGAATTCGACACCTAAGTTTTTGCTTGATCATACCGGTCTTCGTCCCG
GGCGTCTAAATCTTACCTTTAGAAAGTAT
mRNA sequence
GAAAATTCGCATAATAGAGGTCATAGTTCAGATATGGTAATGGTGGGAGAAATTCCTGTGTATCTAAATCGCAAGAGATATGAAGAGGAATCTTCATCTCCGTGGTCTGT
AAATAAAAGCGATGATTTTGAGTTGGGAAGAGAGCGAAAAAGGACTCCTGCAAATGTGCCAAATTCTTACCATGATGATAAGGATGAGTTTCAACCAGTGTCTAGACAAA
ATACTAAAAGAAGAAATCGGGTAGATTTAGGATTCCAGAGATCAAATAATACAAGTTCATTTCAAGTGGAGGGGTTCTCATTGCTGAACAACAACAGTCAGCTGGATGAA
TCATCTCAGCCTAATCAATTTGGGAAGAAAAACGAACCGTTTTATGTCCAGAAATGTCAGTCTATGGATATCGGTTCCAAAAATTCTCTAGTCATGGACAATTTGCATCC
CTTTGAACCATTCGATATCTGTCCCCATGAAAGAAGAGGCAATGCGAAACCCGGAGCTCATTGGCAATTTAAAGGTAGGGAGACTGTGAAAGTAATGGAGCATGTTGCTG
AAGCTACTAATTATAGAGTGCTGAGACCTGGAATGGTTTTACTGAAGAATTACATTACTCTACATGAACAGGTCAATATAGTGAAAACTTGTCAAGAGCTTGGAGTTGGC
CCAGGGGGATTTTACAGGCCTGGTTATAAAGATGGTGCAAAACTTAGGCTTCAGATGATGTGTCTTGGTTTGGACTGGGATCCTCAAACAAGGAAATATGGAGATAAGCG
GGCAGTCGATGGCGATAAACCACCAGAAATACCTCCTAAATTCGCAATTCTAGTTACAGAAGCACTTAAAGATGCACATGCCCTGATCAAGAACAAATGCAATACGGGTA
ACGTAGAAAGCATACTTCCATCAATGTCTCCTGATATCTGCATTGTGAACTTCTATACAACGAGCGGAAGACTGGGTCTGCATCAGGATCGCGATGAAAGCAAAGAGAGT
CTCGTTAGCGGACTACCCGTTGTTTCGTTATCATTAGGCGATTCGGCAGAATTCTTGTATGGAGATCGAAGAGATGTAGATAAAGCAGAGAAGGTTATATTGGAATCAGG
TGATGTTCTGATATTTGGTGGAGATTCTAGGCATGTATTTCATGGAGTATCTTCAATCATACCGAATTCGACACCTAAGTTTTTGCTTGATCATACCGGTCTTCGTCCCG
GGCGTCTAAATCTTACCTTTAGAAAGTAT
Protein sequence
ENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPANVPNSYHDDKDEFQPVSRQNTKRRNRVDLGFQRSNNTSSFQVEGFSLLNNNSQLDE
SSQPNQFGKKNEPFYVQKCQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRETVKVMEHVAEATNYRVLRPGMVLLKNYITLHEQVNIVKTCQELGVG
PGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICIVNFYTTSGRLGLHQDRDESKES
LVSGLPVVSLSLGDSAEFLYGDRRDVDKAEKVILESGDVLIFGGDSRHVFHGVSSIIPNSTPKFLLDHTGLRPGRLNLTFRKY
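
The CDS and protein sequences above can be cross-checked by translating the nucleotide sequence codon by codon. The sketch below assumes the standard genetic code and reading frame 1. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 39 nt and last 12 nt of the CDS listed above.
print(translate("GAAAATTCGCATAATAGAGGTCATAGTTCAGATATGGTA"))  # ENSHNRGHSSDMV
print(translate("TTTAGAAAGTAT"))                             # FRKY
```

Both fragments agree with the start (`ENSHNRGHSSDMV...`) and end (`...FRKY`) of the listed protein sequence. Pasting the full CDS into `translate` would allow a complete comparison.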