; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017340 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017340
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationChr03:13298274..13300787
RNA-Seq ExpressionHG10017340
SyntenyHG10017340
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050407.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]3.7e-17774.6Show/hide
Query:  SHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDL
        S A PDSSC G+S  CG DKEHL +RD+ SDVIFVGS  V+LNPKE                      K GI +NEPKSYHYDE L VSR+N +R+RIDL
Subjt:  SHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDL

Query:  GLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGND
        G KR LKSNARSFQVER E LN+ CQ+ ESSLP HFG++ E F F KRQS+DIGSK+S+VTD+SLPFE PF+ICFPGGGNVK   +W+VK S  TVK   
Subjt:  GLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGND

Query:  HLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALK
              +YR+LRPGMVLLKHYITPP+QI +VKTCQKLGLGPGGFYQPGYKDGAKLRL MMCLGLDWDP++R+YENKR VDG KPP+IP  FSFLVK+ALK
Subjt:  HLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALK

Query:  EARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIF
        +A AF KNKCN SNVEDILPSMSPDICI NFY TSGRLGLHQDRDESKESL SGLPVVSFSVGN+AEFLYGDKRDVDKAEKV LESGDVLIFGG+SRH+F
Subjt:  EARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIF

Query:  HGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        HGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  HGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

TYJ97997.1 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa]2.4e-17674.36Show/hide
Query:  SHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDL
        S A PDSSC G+S  CG DKEHL +RD+ SDVIFVGS  V+LNPKE                      K GI +NEPKSYHYDE L VSR+N +R+RIDL
Subjt:  SHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDL

Query:  GLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGND
        G KR LKSNARSFQVER E LN+ CQ+ ESSLP HFG++ E F F KRQS+DIGSK+S+VTD+S PFE PF+ICFPGGGNVK   +W+VK S  TVK   
Subjt:  GLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGND

Query:  HLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALK
              +YR+LRPGMVLLKHYITPP+QI +VKTCQKLGLGPGGFYQPGYKDGAKLRL MMCLGLDWDP++R+YENKR VDG KPP+IP  FSFLVK+ALK
Subjt:  HLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALK

Query:  EARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIF
        +A AF KNKCN SNVEDILPSMSPDICI NFY TSGRLGLHQDRDESKESL SGLPVVSFSVGN+AEFLYGDKRDVDKAEKV LESGDVLIFGG+SRH+F
Subjt:  EARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIF

Query:  HGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        HGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  HGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

XP_004149927.1 uncharacterized protein LOC101210053 isoform X1 [Cucumis sativus]1.1e-18974.68Show/hide
Query:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEHKKG
        M  IRTLP+ PSP SNQL RLLFPAS F   RGFRLLQFQ MDSF++SANSHALPDSSC GSS  CG DKEHLH+RD++SDVI VGS+PV+LNPKE    
Subjt:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEHKKG

Query:  IPANEPKSYHYDESLSVSRKNIKRSRIDLGLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PF
            EPKSY+YDESL V R+N +RSRIDLG KR LKSNARS+QVER E LN+SCQ+ +SSLP HFG++ E F   K QS+D G K+S+VTDNSLPFE PF
Subjt:  IPANEPKSYHYDESLSVSRKNIKRSRIDLGLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PF

Query:  NICFPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESR
        +IC PGGGNVK    + VK    TVK         +YR+LRPGMVLLKHYITP +QI +VKTCQ LG+GPGGFYQPGYKDGAKLRL MMCLGLDWDP++R
Subjt:  NICFPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESR

Query:  KYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYG
        +YENKR VDG KPP+IP QF+FLVK ALK+A AF KN CN SNVE+ILPSMSPDICI NFY T GRLGLHQDRDESKESL  GLPVVSFSVGN+AEFLYG
Subjt:  KYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYG

Query:  DKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        DKR+VDKAE V LESGDVLIFGG+SRHIFHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  DKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

XP_016903166.1 PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo]1.9e-19774.33Show/hide
Query:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---
        M  IRTLP+ PSP SNQL RLLFPAS F G RGF LLQFQRMDSF+SSANSHA PDSSC G+S  CG DKEHL +RD+ SDVIF+GS  V+LNPKE    
Subjt:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---

Query:  ------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDLGLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQS
                          K GI +NEPKSYHYDE L VSR+N +R+RIDLG KR LKSNARSFQVER E  N+ CQ+ ESSLP HFG++ E F F KRQS
Subjt:  ------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDLGLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQS

Query:  VDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYK
        +DIGSK+S+VTD+SLPFE PF+ICFPGGGNVK   +W+VK S  TVK         +YR+LRPGMVLLKHYITPP+QI +VKTCQKLGLGPGGFYQP YK
Subjt:  VDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYK

Query:  DGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKES
        DGAKLRL MMCLGLDWDP++R+Y+NKR VDG KPP+IP  FSFLVKSALK+A AF KNKCN SNVEDILPSMSPDICI NFY TSGRLGLHQDRDESKES
Subjt:  DGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKES

Query:  LVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        L SGLPVVSFSVGN+AEFLYGDKRDV+KAEKV LESGDVLIFGG+SRH+FHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  LVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

XP_022144035.1 uncharacterized protein LOC111013827 [Momordica charantia]6.0e-15954.5Show/hide
Query:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSSCGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH-----
        M +IRT+P+  SP SNQL RLLF +SRF G R  RLLQF+RMDS  +SA SH            G   E+ HNR H+SD++ VG +PVYLN K +     
Subjt:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSSCGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH-----

Query:  ----------------KKGIPANEPKSYHYDESLSVSRKNIK-RSRIDLGLKRGL---------------------------------------------
                        +K  PAN P SYH D+   VSR+N K RSR+DLGL+R +                                             
Subjt:  ----------------KKGIPANEPKSYHYDESLSVSRKNIK-RSRIDLGLKRGL---------------------------------------------

Query:  ------------------------KSNAR--------------------------------------SFQVERFECLNNSCQQDESSLPNHFGREIETFY
                                +SNA+                                      SFQVE F  LNN+ Q DESS PN FG++ E FY
Subjt:  ------------------------KSNAR--------------------------------------SFQVERFECLNNSCQQDESSLPNHFGREIETFY

Query:  FRKRQSVDIGSKKSLVTDNSLPFEPFNIC-FPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGF
         +K QS+DIGSK SLV DN  PFEPF+IC     GN K GA+WQ KG RDTVK  +H+ E +NYRVLRPGMVLLK+YIT  +Q+ +VKTCQ+LG+GPGGF
Subjt:  FRKRQSVDIGSKKSLVTDNSLPFEPFNIC-FPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGF

Query:  YQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDR
        Y+PGYKDGAKLRL MMCLGLDWDP++RKY +KRAVDG KPP IP +F+ LV  ALK+A A  KNKCNT NVE ILPSMSPDICIVNFY TSGRLGLHQDR
Subjt:  YQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDR

Query:  DESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        DESKESLVSGLPVVS S+G+SAEFLYGD+RDVDKAEKV+LESGDVLIFGGDSRH+FHGVSSIIP STPKFLLDHTGLRPGRLNLTFRKY
Subjt:  DESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein5.4e-19074.68Show/hide
Query:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEHKKG
        M  IRTLP+ PSP SNQL RLLFPAS F   RGFRLLQFQ MDSF++SANSHALPDSSC GSS  CG DKEHLH+RD++SDVI VGS+PV+LNPKE    
Subjt:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEHKKG

Query:  IPANEPKSYHYDESLSVSRKNIKRSRIDLGLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PF
            EPKSY+YDESL V R+N +RSRIDLG KR LKSNARS+QVER E LN+SCQ+ +SSLP HFG++ E F   K QS+D G K+S+VTDNSLPFE PF
Subjt:  IPANEPKSYHYDESLSVSRKNIKRSRIDLGLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PF

Query:  NICFPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESR
        +IC PGGGNVK    + VK    TVK         +YR+LRPGMVLLKHYITP +QI +VKTCQ LG+GPGGFYQPGYKDGAKLRL MMCLGLDWDP++R
Subjt:  NICFPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESR

Query:  KYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYG
        +YENKR VDG KPP+IP QF+FLVK ALK+A AF KN CN SNVE+ILPSMSPDICI NFY T GRLGLHQDRDESKESL  GLPVVSFSVGN+AEFLYG
Subjt:  KYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYG

Query:  DKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        DKR+VDKAE V LESGDVLIFGG+SRHIFHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  DKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC1035021839.2e-19874.33Show/hide
Query:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---
        M  IRTLP+ PSP SNQL RLLFPAS F G RGF LLQFQRMDSF+SSANSHA PDSSC G+S  CG DKEHL +RD+ SDVIF+GS  V+LNPKE    
Subjt:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---

Query:  ------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDLGLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQS
                          K GI +NEPKSYHYDE L VSR+N +R+RIDLG KR LKSNARSFQVER E  N+ CQ+ ESSLP HFG++ E F F KRQS
Subjt:  ------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDLGLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQS

Query:  VDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYK
        +DIGSK+S+VTD+SLPFE PF+ICFPGGGNVK   +W+VK S  TVK         +YR+LRPGMVLLKHYITPP+QI +VKTCQKLGLGPGGFYQP YK
Subjt:  VDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYK

Query:  DGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKES
        DGAKLRL MMCLGLDWDP++R+Y+NKR VDG KPP+IP  FSFLVKSALK+A AF KNKCN SNVEDILPSMSPDICI NFY TSGRLGLHQDRDESKES
Subjt:  DGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKES

Query:  LVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        L SGLPVVSFSVGN+AEFLYGDKRDV+KAEKV LESGDVLIFGG+SRH+FHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  LVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A5A7U7Q2 2-oxoglutarate-dependent dioxygenase family protein isoform 11.8e-17774.6Show/hide
Query:  SHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDL
        S A PDSSC G+S  CG DKEHL +RD+ SDVIFVGS  V+LNPKE                      K GI +NEPKSYHYDE L VSR+N +R+RIDL
Subjt:  SHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDL

Query:  GLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGND
        G KR LKSNARSFQVER E LN+ CQ+ ESSLP HFG++ E F F KRQS+DIGSK+S+VTD+SLPFE PF+ICFPGGGNVK   +W+VK S  TVK   
Subjt:  GLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGND

Query:  HLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALK
              +YR+LRPGMVLLKHYITPP+QI +VKTCQKLGLGPGGFYQPGYKDGAKLRL MMCLGLDWDP++R+YENKR VDG KPP+IP  FSFLVK+ALK
Subjt:  HLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALK

Query:  EARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIF
        +A AF KNKCN SNVEDILPSMSPDICI NFY TSGRLGLHQDRDESKESL SGLPVVSFSVGN+AEFLYGDKRDVDKAEKV LESGDVLIFGG+SRH+F
Subjt:  EARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIF

Query:  HGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        HGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  HGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 11.2e-17674.36Show/hide
Query:  SHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDL
        S A PDSSC G+S  CG DKEHL +RD+ SDVIFVGS  V+LNPKE                      K GI +NEPKSYHYDE L VSR+N +R+RIDL
Subjt:  SHALPDSSCYGSS--CGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH---------------------KKGIPANEPKSYHYDESLSVSRKNIKRSRIDL

Query:  GLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGND
        G KR LKSNARSFQVER E LN+ CQ+ ESSLP HFG++ E F F KRQS+DIGSK+S+VTD+S PFE PF+ICFPGGGNVK   +W+VK S  TVK   
Subjt:  GLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFE-PFNICFPGGGNVKLGAYWQVKGSRDTVKGND

Query:  HLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALK
              +YR+LRPGMVLLKHYITPP+QI +VKTCQKLGLGPGGFYQPGYKDGAKLRL MMCLGLDWDP++R+YENKR VDG KPP+IP  FSFLVK+ALK
Subjt:  HLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALK

Query:  EARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIF
        +A AF KNKCN SNVEDILPSMSPDICI NFY TSGRLGLHQDRDESKESL SGLPVVSFSVGN+AEFLYGDKRDVDKAEKV LESGDVLIFGG+SRH+F
Subjt:  EARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIF

Query:  HGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        HGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  HGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A6J1CQI1 uncharacterized protein LOC1110138272.9e-15954.5Show/hide
Query:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSSCGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH-----
        M +IRT+P+  SP SNQL RLLF +SRF G R  RLLQF+RMDS  +SA SH            G   E+ HNR H+SD++ VG +PVYLN K +     
Subjt:  MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSSCGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEH-----

Query:  ----------------KKGIPANEPKSYHYDESLSVSRKNIK-RSRIDLGLKRGL---------------------------------------------
                        +K  PAN P SYH D+   VSR+N K RSR+DLGL+R +                                             
Subjt:  ----------------KKGIPANEPKSYHYDESLSVSRKNIK-RSRIDLGLKRGL---------------------------------------------

Query:  ------------------------KSNAR--------------------------------------SFQVERFECLNNSCQQDESSLPNHFGREIETFY
                                +SNA+                                      SFQVE F  LNN+ Q DESS PN FG++ E FY
Subjt:  ------------------------KSNAR--------------------------------------SFQVERFECLNNSCQQDESSLPNHFGREIETFY

Query:  FRKRQSVDIGSKKSLVTDNSLPFEPFNIC-FPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGF
         +K QS+DIGSK SLV DN  PFEPF+IC     GN K GA+WQ KG RDTVK  +H+ E +NYRVLRPGMVLLK+YIT  +Q+ +VKTCQ+LG+GPGGF
Subjt:  FRKRQSVDIGSKKSLVTDNSLPFEPFNIC-FPGGGNVKLGAYWQVKGSRDTVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGF

Query:  YQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDR
        Y+PGYKDGAKLRL MMCLGLDWDP++RKY +KRAVDG KPP IP +F+ LV  ALK+A A  KNKCNT NVE ILPSMSPDICIVNFY TSGRLGLHQDR
Subjt:  YQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDR

Query:  DESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        DESKESLVSGLPVVS S+G+SAEFLYGD+RDVDKAEKV+LESGDVLIFGGDSRH+FHGVSSIIP STPKFLLDHTGLRPGRLNLTFRKY
Subjt:  DESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog1.9e-1440.87Show/hide
Query:  PSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHT
        P   PD C+VN Y+   R+GLHQDRDE+        PV+S S+G++A F  G     D    + L SGDV    G +R  FHGV  I+P S        +
Subjt:  PSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHT

Query:  GLRP--GRLNLTFRK
         L P  GR+NLT R+
Subjt:  GLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh15.9e-1628.33Show/hide
Query:  PGMVLLKHYITPPQQITLVKTCQ-----------------KLGLGPGGFYQPGYK-DGAKL------------------RLHMMCLGLDWDPESRKYENK
        PG+++LK+Y++   Q+ L+K+                   +L LG    ++  Y  DG  +                  +L  + LG  +D  +++Y   
Subjt:  PGMVLLKHYITPPQQITLVKTCQ-----------------KLGLGPGGFYQPGYK-DGAKL------------------RLHMMCLGLDWDPESRKYENK

Query:  RAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDV
           D  K P  P      V+  +KE+  F   K               +  IVNFY     L  H   DES+E L   LP++S S+G    +L G +   
Subjt:  RAVDGKKPPNIPIQFSFLVKSALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDV

Query:  DKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLL
        +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Subjt:  DKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB5.5e-1433.86Show/hide
Query:  FKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSS
        F N C  +      P   PD C++N Y    +L LHQD+DE         P+VS S+G  A F +G  +  D  ++++LE GDV+++GG+SR  +HG+  
Subjt:  FKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSS

Query:  IIPKSTPKFLLDHTGLRPGRLNLTFRK
        +     P  +         R NLTFR+
Subjt:  IIPKSTPKFLLDHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog1.9e-1440.87Show/hide
Query:  PSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHT
        P   PD C+VN Y+   R+GLHQDRDE+        PV+S S+G++A F  G     D    + L SGDV    G +R  FHGV  I+P S        +
Subjt:  PSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKFLLDHT

Query:  GLRP--GRLNLTFRK
         L P  GR+NLT R+
Subjt:  GLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB3.2e-1434.65Show/hide
Query:  FKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSS
        F + C  + +     S  PD C++N Y    +L LHQD+DE         P+VS S+G  A F +G  R  D  ++++LE GD++++GG+SR  +HG+  
Subjt:  FKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSS

Query:  IIPKSTPKFLLDHTGLRPGRLNLTFRK
        +     P      TG    R NLTFR+
Subjt:  IIPKSTPKFLLDHTGLRPGRLNLTFRK

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.2e-0838.55Show/hide
Query:  PDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSI
        P+  IVN++     LG H D  E+  S     P+VS S+G  A FL G K   D    + L SGDV++  G++R  FHG+  I
Subjt:  PDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein4.6e-6451.83Show/hide
Query:  VLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNK
        V+RPGMVLLK+Y++   Q+ +V  C++LGLG GGFYQPG++DG  L L MMCLG +WD ++R+Y   R +DG  PP IP++FS LV+ A+KE+++     
Subjt:  VLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNK

Query:  CNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQ---------------------DRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGD
         N +   D +P + PDIC+VNFY ++G+LGLHQ                     D+ ESK+SL  GLP+VSFS+G+SAEFLYGD++DVDKA+ ++LESGD
Subjt:  CNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQ---------------------DRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGD

Query:  VLIFGGDSRHIFHGVSSI
        VLIFG  SR++FHGV SI
Subjt:  VLIFGGDSRHIFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein9.4e-7859.38Show/hide
Query:  VLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNK
        V+RPGMVLLK+Y++   Q+ +V  C++LGLG GGFYQPGY+D AKL L MMCLG +WDPE+ +Y   R  DG   P IP +F+  V+ A+KE+++   + 
Subjt:  VLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARAFFKNK

Query:  CNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPK
           +   D +P M PDICIVNFY ++GRLGLHQD+DES+ S+  GLPVVSFS+G+SAEFLYGD+RD DKAE + LESGDVL+FGG SR +FHGV SI   
Subjt:  CNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPK

Query:  STPKFLLDHTGLRPGRLNLTFRKY
        + PK LL  T LRPGRLNLTFR+Y
Subjt:  STPKFLLDHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein5.4e-8160.59Show/hide
Query:  NDHLVEDTN-YRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKS
        N   VE +N ++V+RPGMVLLK ++TP  Q+ +VKTC++LG+ P GFYQPGY  G+KL L MMCLG +WDP++ KY     +D  K P IP+ F+ LV+ 
Subjt:  NDHLVEDTN-YRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKS

Query:  ALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSR
        A++EA A    +  T + E ILP MSPDICIVNFY  +GRLGLHQDRDES+ES+  GLP+VSFS+G+SAEFLYG+KRDV++A+ V+LESGDVLIFGG+SR
Subjt:  ALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSR

Query:  HIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
         IFHGV SIIP S P  LL+ + LR GRLNLTFR +
Subjt:  HIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein5.4e-8160.59Show/hide
Query:  NDHLVEDTN-YRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKS
        N   VE +N ++V+RPGMVLLK ++TP  Q+ +VKTC++LG+ P GFYQPGY  G+KL L MMCLG +WDP++ KY     +D  K P IP+ F+ LV+ 
Subjt:  NDHLVEDTN-YRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKS

Query:  ALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSR
        A++EA A    +  T + E ILP MSPDICIVNFY  +GRLGLHQDRDES+ES+  GLP+VSFS+G+SAEFLYG+KRDV++A+ V+LESGDVLIFGG+SR
Subjt:  ALKEARAFFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSR

Query:  HIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
         IFHGV SIIP S P  LL+ + LR GRLNLTFR +
Subjt:  HIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTTGATCCGTACACTTCCCGTTTCGCCATCGCCGTGGTCCAATCAACTTCCTCGCCTTTTATTCCCCGCTTCTCGATTTCTCGGCGAACGCGGGTTTCGATTGCT
CCAATTTCAACGAATGGATTCGTTTGCCAGTTCGGCAAATAGCCATGCACTACCTGATTCTTCATGTTATGGTAGTTCTTGTGGGGGAGACAAGGAACATTTGCATAACA
GAGATCATAATTCAGATGTGATATTTGTGGGAAGCCTTCCTGTGTATCTAAATCCAAAGGAACATAAGAAGGGGATTCCTGCAAATGAACCAAAATCTTACCACTATGAT
GAGTCTCTATCTGTTTCTAGAAAAAATATTAAAAGAAGCCGGATAGATTTAGGGTTGAAAAGAGGTTTGAAGAGTAATGCAAGATCATTTCAAGTGGAGAGGTTTGAATG
TTTGAACAATTCCTGTCAGCAGGATGAATCATCTCTGCCTAATCATTTTGGGAGGGAAATTGAAACCTTTTATTTCCGGAAGCGCCAGTCTGTTGATATCGGTTCCAAAA
AATCTTTAGTTACAGACAATTCGCTTCCCTTTGAACCGTTTAATATTTGTTTTCCTGGAGGAGGTAACGTGAAACTTGGAGCTTATTGGCAAGTTAAAGGCAGCAGGGAC
ACTGTGAAAGGAAATGATCATTTGGTTGAAGATACAAATTATAGAGTGCTGAGGCCTGGAATGGTTTTACTGAAGCACTACATTACTCCACCTCAACAGATCACTTTAGT
GAAAACTTGTCAAAAGCTTGGTCTTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCATATGATGTGCCTTGGATTGGACTGGGATC
CTGAATCAAGAAAATATGAAAACAAACGGGCTGTTGATGGTAAGAAACCACCAAATATACCTATTCAGTTTTCATTTCTAGTTAAAAGTGCACTTAAAGAAGCACGTGCC
TTTTTCAAGAACAAATGCAATACAAGTAACGTAGAAGACATTCTTCCATCAATGTCTCCAGACATATGCATTGTGAACTTCTACAAAACTAGTGGAAGACTGGGCCTGCA
TCAGGATCGTGATGAAAGCAAAGAGAGTCTCGTCAGTGGACTACCGGTCGTTTCCTTTTCTGTAGGCAATTCAGCAGAATTCTTGTATGGAGATAAACGAGATGTGGATA
AAGCAGAGAAGGTTGTACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGACTCTAGACATATATTTCATGGAGTATCTTCAATCATACCAAAATCGACACCTAAGTTT
TTGCTTGATCATACTGGTTTGCGTCCTGGACGTCTAAATCTTACCTTTAGAAAGTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGTTGATCCGTACACTTCCCGTTTCGCCATCGCCGTGGTCCAATCAACTTCCTCGCCTTTTATTCCCCGCTTCTCGATTTCTCGGCGAACGCGGGTTTCGATTGCT
CCAATTTCAACGAATGGATTCGTTTGCCAGTTCGGCAAATAGCCATGCACTACCTGATTCTTCATGTTATGGTAGTTCTTGTGGGGGAGACAAGGAACATTTGCATAACA
GAGATCATAATTCAGATGTGATATTTGTGGGAAGCCTTCCTGTGTATCTAAATCCAAAGGAACATAAGAAGGGGATTCCTGCAAATGAACCAAAATCTTACCACTATGAT
GAGTCTCTATCTGTTTCTAGAAAAAATATTAAAAGAAGCCGGATAGATTTAGGGTTGAAAAGAGGTTTGAAGAGTAATGCAAGATCATTTCAAGTGGAGAGGTTTGAATG
TTTGAACAATTCCTGTCAGCAGGATGAATCATCTCTGCCTAATCATTTTGGGAGGGAAATTGAAACCTTTTATTTCCGGAAGCGCCAGTCTGTTGATATCGGTTCCAAAA
AATCTTTAGTTACAGACAATTCGCTTCCCTTTGAACCGTTTAATATTTGTTTTCCTGGAGGAGGTAACGTGAAACTTGGAGCTTATTGGCAAGTTAAAGGCAGCAGGGAC
ACTGTGAAAGGAAATGATCATTTGGTTGAAGATACAAATTATAGAGTGCTGAGGCCTGGAATGGTTTTACTGAAGCACTACATTACTCCACCTCAACAGATCACTTTAGT
GAAAACTTGTCAAAAGCTTGGTCTTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCATATGATGTGCCTTGGATTGGACTGGGATC
CTGAATCAAGAAAATATGAAAACAAACGGGCTGTTGATGGTAAGAAACCACCAAATATACCTATTCAGTTTTCATTTCTAGTTAAAAGTGCACTTAAAGAAGCACGTGCC
TTTTTCAAGAACAAATGCAATACAAGTAACGTAGAAGACATTCTTCCATCAATGTCTCCAGACATATGCATTGTGAACTTCTACAAAACTAGTGGAAGACTGGGCCTGCA
TCAGGATCGTGATGAAAGCAAAGAGAGTCTCGTCAGTGGACTACCGGTCGTTTCCTTTTCTGTAGGCAATTCAGCAGAATTCTTGTATGGAGATAAACGAGATGTGGATA
AAGCAGAGAAGGTTGTACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGACTCTAGACATATATTTCATGGAGTATCTTCAATCATACCAAAATCGACACCTAAGTTT
TTGCTTGATCATACTGGTTTGCGTCCTGGACGTCTAAATCTTACCTTTAGAAAGTATTAA
Protein sequenceShow/hide protein sequence
MLLIRTLPVSPSPWSNQLPRLLFPASRFLGERGFRLLQFQRMDSFASSANSHALPDSSCYGSSCGGDKEHLHNRDHNSDVIFVGSLPVYLNPKEHKKGIPANEPKSYHYD
ESLSVSRKNIKRSRIDLGLKRGLKSNARSFQVERFECLNNSCQQDESSLPNHFGREIETFYFRKRQSVDIGSKKSLVTDNSLPFEPFNICFPGGGNVKLGAYWQVKGSRD
TVKGNDHLVEDTNYRVLRPGMVLLKHYITPPQQITLVKTCQKLGLGPGGFYQPGYKDGAKLRLHMMCLGLDWDPESRKYENKRAVDGKKPPNIPIQFSFLVKSALKEARA
FFKNKCNTSNVEDILPSMSPDICIVNFYKTSGRLGLHQDRDESKESLVSGLPVVSFSVGNSAEFLYGDKRDVDKAEKVVLESGDVLIFGGDSRHIFHGVSSIIPKSTPKF
LLDHTGLRPGRLNLTFRKY