; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025557 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025557
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionOxoglutarate/iron-dependent dioxygenase
Genome locationtig00007935:1043644..1045553
RNA-Seq ExpressionSgr025557
SyntenySgr025557
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0006974 - cellular response to DNA damage stimulus (biological process)
GO:0043412 - macromolecule modification (biological process)
GO:0070989 - oxidative demethylation (biological process)
GO:0005622 - intracellular (cellular component)
GO:0016706 - 2-oxoglutarate-dependent dioxygenase activity (molecular function)
GO:0032451 - demethylase activity (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058530.1 Oxoglutarate/iron-dependent dioxygenase [Cucumis melo var. makuwa]1.2e-14861.54Show/hide
Query:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG--------------------FLGSIASDSTSNELS---
        AGG+  SVAGSS+P  SGAFR R+SHQM S N +K   QKQASSSE  QWRPLN  K ASPG                     LGSIAS+S SN++S   
Subjt:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG--------------------FLGSIASDSTSNELS---

Query:  -----------------PSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVN
                          SSAQ VSK+LHSAV+RIQI   TA  G+CS   PYD+RNG D V QEL VQ  LESCAKD+SS+IKL E NNV +  DS+  
Subjt:  -----------------PSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVN

Query:  KPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGL
        KPSV+LD FDICPPK+G V L PSLLA NREKRNEMKR  +GN+G VLRPGMVHLK  ISLRDQ                         +  K+CR+LG+
Subjt:  KPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGL

Query:  GAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG---
        GAGGFYQPGYR+GGKLHLKMMCLGKNWDPD+ TYGDVRPFD TKPP +P EFYQLVEKA+K+SYAI+ KDST KNPERVLP MKP+ICIV F  + G   
Subjt:  GAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG---

Query:  ---------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
                 + +   +PV+S SIGDSAEFLFGD  D DQAEKVTLESGDILIFGGKSRHVFHGV AI  +TAPK LLE TNLRPGRLNLTFRQY
Subjt:  ---------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

XP_022142665.1 uncharacterized protein LOC111012720 [Momordica charantia]1.3e-15866.18Show/hide
Query:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG----------------------FLGSIASDSTSNELSP
        AGG+ PS+ GSS+P +SG FR R+S QMPS N +K++ Q QA SSE  QWRPLN GK ASP                        L S  S+S SNELSP
Subjt:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG----------------------FLGSIASDSTSNELSP

Query:  SSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKENNVSDCKDSE-VNKPSVNLDPFDICPPKSGA
        SSA  VSK+LHSAV+RIQI E T EG SCS  LPYD+ N L AVDQEL VQVPLESCAK+ESSSIK KENNVS+CK SE  NKPSVNLDPFDICPPKSG 
Subjt:  SSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKENNVSDCKDSE-VNKPSVNLDPFDICPPKSGA

Query:  VMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHL
        + L PSLLA N+EKRNE KRTTEGN G VLRPGMV LK GISLRDQ                  V ++      K CR LGLGAGGFYQPGYR+GGKLHL
Subjt:  VMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHL

Query:  KMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPV
        KMMCLGKNWDPDT TYGDVRPFD TKPP IPVEF+QLVEKA+K+SYAIMGKDST K PERVLP M+P+ICIV F  + G            + +   +PV
Subjt:  KMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPV

Query:  VSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
        VS SIGDSAEFL+GD+ D DQAEKVTL+SGDILIFGGKSRHVFHGV  I P+TAPK LLEETNLR GRLNLTFRQY
Subjt:  VSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

XP_022980604.1 uncharacterized protein LOC111479925 isoform X1 [Cucurbita maxima]1.3e-14761.05Show/hide
Query:  QAGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG-------------------------------------
        QAGG+ PSVAGSS+P N GAF+DR+SH  PS NS+  F QKQA  SE  QWRPLN GK +SPG                                     
Subjt:  QAGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG-------------------------------------

Query:  -FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVNK
          LGSIAS S  NE SPSSAQ VSK+LHSAV+RIQI E TAEG  C    PY S    DA  QE MVQ        D++++IKL+E NNVSDCKDS+  K
Subjt:  -FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVNK

Query:  PSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLG
        PS NL+ FDICPPKSG V L PSLL+KNREKRNEMKR  EGN+G VLRPGMVHLK GISL DQ                  V ++      K CR+LG+G
Subjt:  PSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLG

Query:  AGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG----
        AGGFYQPGYR+GGKLHLKMMCLGKNWDPD+ TYGDVRPFD T PP +PVEFY+LVEKA+K+SYA++GKDS TKNPERVLP MKP+ICIV F  + G    
Subjt:  AGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG----

Query:  --------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
                + +   +PVVS SIGDSAEFLFGD+ D DQAEKVTLESGDILIFGGKSRHVFHGV AI  +TAPK LLE TNLRPGRLNLTFRQY
Subjt:  --------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

XP_023528076.1 uncharacterized protein LOC111791104 isoform X1 [Cucurbita pepo subsp. pepo]6.0e-14861.05Show/hide
Query:  QAGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG-------------------------------------
        QAGG+ PS AGSS+P N GAF+DR+SH  PS NS+  F QKQA  SE  QWRPLN GK ASPG                                     
Subjt:  QAGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG-------------------------------------

Query:  -FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVNK
          LGSIAS S  NE SPSSAQ VSK+LHSAV+RIQI E TAEG  C    PY S  G DA  QELMVQ        D++ +IKL+E NNVSDC DS+  K
Subjt:  -FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVNK

Query:  PSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLG
        PS NL+ FDICPPKSG VML PSLL+KNREKRNEMKR  EGN+G VLRPGMVHLK GIS  DQ                  V ++      K CR+LG+G
Subjt:  PSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLG

Query:  AGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG----
        AGGFYQPGYR+GGKLHLKMMCLGKNWDPD+ TYGDVRPFD T PP +PVEFY+LVEKA+K++YA+MGKDS TKNPER LP MKP+ICIV F  + G    
Subjt:  AGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG----

Query:  --------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
                + +   +PVVS SIGDSAEFLFGD  D DQAEKVTLESGDILIFGGKSRHVFHGV AI  +TAPK LLE TNLRPGRLNLTFRQY
Subjt:  --------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

XP_038875730.1 uncharacterized protein LOC120068103 isoform X1 [Benincasa hispida]1.6e-15363.16Show/hide
Query:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG--------------------------------------
        AGG+ PSVAGSS+P N G FRDR+S  M S N  K F QKQASSSE  QWRPLN GK AS                                        
Subjt:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG--------------------------------------

Query:  --FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVN
           LGSIAS+S   E SPSSAQ VSK+LHSAV+RIQI E TA G SCS   P+D+ N  DAV Q+L VQV LESC KDESS+ KL+E NNVS   DS+  
Subjt:  --FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVN

Query:  KPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGL
        KPSVNLDPFDIC PK+G V L PSL AKNREKRNEMKR  EGNSG VLRPGMVHLK GISLRDQ            +I+K+             CR+LG+
Subjt:  KPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGL

Query:  GAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG---
        GAGGFYQPGYR+GGKLHLKMMCLGKNWDPD+ TYGDVRPFD TKPP +P EFYQLVEKA+K SYAIMGKDST KNPERVLP MKP+ICIV F  + G   
Subjt:  GAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG---

Query:  ---------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
                 + +   +PVVS SIGDSAEFLFGDQ D DQAEKVTLESGDILIFGGKSRHVFHGV  I P+TAPK LLE TNLRPGRLNLTFRQY
Subjt:  ---------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

TrEMBL top hitse value%identityAlignment
A0A0A0LC72 Fe2OG dioxygenase domain-containing protein6.5e-14862.66Show/hide
Query:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG--------------------FLGSIASDSTSNELSPSS
        AGG+  SVAGSS+P  SGAFR R+SHQM S N +K   QKQASSSE  QWRPLN GK ASPG                     L SIAS+S   ELS SS
Subjt:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG--------------------FLGSIASDSTSNELSPSS

Query:  AQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKL-KENNVSDCKDSEVNKPSVNLDPFDICPPKSGAVM
        AQ VSK+LHSAV+RI +   TA  GS     PYD+ N  D V QEL VQ  L+SCAKDES +I+L K N+V +  DS+  KPSV+LD FDICPPK+G VM
Subjt:  AQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKL-KENNVSDCKDSEVNKPSVNLDPFDICPPKSGAVM

Query:  LKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHLKM
        L PSLLA NREKRNEM+R  EGN+G VLRPGMVHLK GIS+RDQ                         +  K+CR+LG+GAGGFYQPGYR+GGKLHLKM
Subjt:  LKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHLKM

Query:  MCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPVVS
        MCLGKNWDPD+ TYGD+RPFD TKPP +P EFYQLVEKA+K+SYAIM +DST KNPERVLP MKPDICIV F  + G            + +   +PV+S
Subjt:  MCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPVVS

Query:  LSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
         SIGDSAEFLFGD+ D DQAEKVTLESGDILIFGGKSRHVFHGV AI  +TAPK LLE TNLRPGRLNLTFRQY
Subjt:  LSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

A0A5D3CA69 Oxoglutarate/iron-dependent dioxygenase5.9e-14961.54Show/hide
Query:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG--------------------FLGSIASDSTSNELS---
        AGG+  SVAGSS+P  SGAFR R+SHQM S N +K   QKQASSSE  QWRPLN  K ASPG                     LGSIAS+S SN++S   
Subjt:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG--------------------FLGSIASDSTSNELS---

Query:  -----------------PSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVN
                          SSAQ VSK+LHSAV+RIQI   TA  G+CS   PYD+RNG D V QEL VQ  LESCAKD+SS+IKL E NNV +  DS+  
Subjt:  -----------------PSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVN

Query:  KPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGL
        KPSV+LD FDICPPK+G V L PSLLA NREKRNEMKR  +GN+G VLRPGMVHLK  ISLRDQ                         +  K+CR+LG+
Subjt:  KPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGL

Query:  GAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG---
        GAGGFYQPGYR+GGKLHLKMMCLGKNWDPD+ TYGDVRPFD TKPP +P EFYQLVEKA+K+SYAI+ KDST KNPERVLP MKP+ICIV F  + G   
Subjt:  GAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG---

Query:  ---------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
                 + +   +PV+S SIGDSAEFLFGD  D DQAEKVTLESGDILIFGGKSRHVFHGV AI  +TAPK LLE TNLRPGRLNLTFRQY
Subjt:  ---------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

A0A6J1CLK3 uncharacterized protein LOC1110127206.2e-15966.18Show/hide
Query:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG----------------------FLGSIASDSTSNELSP
        AGG+ PS+ GSS+P +SG FR R+S QMPS N +K++ Q QA SSE  QWRPLN GK ASP                        L S  S+S SNELSP
Subjt:  AGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG----------------------FLGSIASDSTSNELSP

Query:  SSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKENNVSDCKDSE-VNKPSVNLDPFDICPPKSGA
        SSA  VSK+LHSAV+RIQI E T EG SCS  LPYD+ N L AVDQEL VQVPLESCAK+ESSSIK KENNVS+CK SE  NKPSVNLDPFDICPPKSG 
Subjt:  SSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKENNVSDCKDSE-VNKPSVNLDPFDICPPKSGA

Query:  VMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHL
        + L PSLLA N+EKRNE KRTTEGN G VLRPGMV LK GISLRDQ                  V ++      K CR LGLGAGGFYQPGYR+GGKLHL
Subjt:  VMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHL

Query:  KMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPV
        KMMCLGKNWDPDT TYGDVRPFD TKPP IPVEF+QLVEKA+K+SYAIMGKDST K PERVLP M+P+ICIV F  + G            + +   +PV
Subjt:  KMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPV

Query:  VSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
        VS SIGDSAEFL+GD+ D DQAEKVTL+SGDILIFGGKSRHVFHGV  I P+TAPK LLEETNLR GRLNLTFRQY
Subjt:  VSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

A0A6J1GWM4 uncharacterized protein LOC111457830 isoform X11.9e-14761.05Show/hide
Query:  QAGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG-------------------------------------
        QAGG+ PS AGSS+P N GAF+DR+SH  PS NS+  F QKQA  SE  QWRPLN GK ASPG                                     
Subjt:  QAGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG-------------------------------------

Query:  -FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVNK
          LGSIAS S  NE SP SAQ VSK+LHSAV+RIQI E TAEG  C    PYDS    DA  QELMVQ        D++++IKL+E NNVSDCKDS+  K
Subjt:  -FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVNK

Query:  PSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLG
        P  NL+ FDICPPKSG V L PSLL+KNREKRNEMKR  EGN+G VLRPGMVHLK GISL DQ                  V ++      K+CR+LG+G
Subjt:  PSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLG

Query:  AGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG----
        AGGFYQPGYR+GGKLHLKMMCLGKNWDPD+  YGDVRPFD T PP +PVEFY+LVEKA+K+SYA+MGKDS TKNPERVLP MKP+ICIV F  + G    
Subjt:  AGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG----

Query:  --------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
                + +   +PVVS SIGDSAEFLFGD  D DQAEKVTLESGDILIFGGKSRHVFHGV AI  +TAPK LLE TNLRPGRLNLTFRQY
Subjt:  --------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

A0A6J1IZQ9 uncharacterized protein LOC111479925 isoform X16.5e-14861.05Show/hide
Query:  QAGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG-------------------------------------
        QAGG+ PSVAGSS+P N GAF+DR+SH  PS NS+  F QKQA  SE  QWRPLN GK +SPG                                     
Subjt:  QAGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPG-------------------------------------

Query:  -FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVNK
          LGSIAS S  NE SPSSAQ VSK+LHSAV+RIQI E TAEG  C    PY S    DA  QE MVQ        D++++IKL+E NNVSDCKDS+  K
Subjt:  -FLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKE-NNVSDCKDSEVNK

Query:  PSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLG
        PS NL+ FDICPPKSG V L PSLL+KNREKRNEMKR  EGN+G VLRPGMVHLK GISL DQ                  V ++      K CR+LG+G
Subjt:  PSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLG

Query:  AGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG----
        AGGFYQPGYR+GGKLHLKMMCLGKNWDPD+ TYGDVRPFD T PP +PVEFY+LVEKA+K+SYA++GKDS TKNPERVLP MKP+ICIV F  + G    
Subjt:  AGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG----

Query:  --------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
                + +   +PVVS SIGDSAEFLFGD+ D DQAEKVTLESGDILIFGGKSRHVFHGV AI  +TAPK LLE TNLRPGRLNLTFRQY
Subjt:  --------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog5.7e-0828.57Show/hide
Query:  YQPGYRDGGKLHLKMMCLGK-NWDPDTG--TYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTF--------TH
        Y+  Y  G  + + M  LG   W  D     Y D  P  G   P +P         A+ + + ++G            P   PD C+V           H
Subjt:  YQPGYRDGGKLHLKMMCLGK-NWDPDTG--TYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTF--------TH

Query:  KMGDWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRP--GRLNLTFRQ
        +  D    R PV+S+S+GD+A F  G     D    + L SGD+    G +R  FHGV  I P +        ++L P  GR+NLT R+
Subjt:  KMGDWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRP--GRLNLTFRQ

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB7.2e-1130.2Show/hide
Query:  YGDVRPFDGTKPPGIPVEFYQLVEKAMKES-YAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMGDWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVT
        Y  + P      P +P  F+ L ++A   + Y     D+   N  R  PG K  +      H+  D   +R P+VS+S+G  A F FG  +  D  +++ 
Subjt:  YGDVRPFDGTKPPGIPVEFYQLVEKAMKES-YAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMGDWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVT

Query:  LESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQ
        LE GD++++GG+SR  +HG+  ++    P  +         R NLTFRQ
Subjt:  LESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQ

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog5.7e-0828.57Show/hide
Query:  YQPGYRDGGKLHLKMMCLGK-NWDPDTG--TYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTF--------TH
        Y+  Y  G  + + M  LG   W  D     Y D  P  G   P +P         A+ + + ++G            P   PD C+V           H
Subjt:  YQPGYRDGGKLHLKMMCLGK-NWDPDTG--TYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTF--------TH

Query:  KMGDWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRP--GRLNLTFRQ
        +  D    R PV+S+S+GD+A F  G     D    + L SGD+    G +R  FHGV  I P +        ++L P  GR+NLT R+
Subjt:  KMGDWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRP--GRLNLTFRQ

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB3.2e-1132.85Show/hide
Query:  PGIPVEFYQLV-EKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMGDWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGK
        P +P+ F  +  + A+   YA    D+   N  R  PG K  +      H+  D   +R P+VS+S+G  A F FG  R +D  +++ LE GDI+++GG+
Subjt:  PGIPVEFYQLV-EKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMGDWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGK

Query:  SRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQ
        SR  +HG+  ++    P            R NLTFRQ
Subjt:  SRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQ

Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB5.3e-0636.92Show/hide
Query:  PVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETN
        P++S+S G +A FL G +        + + SGDI+I GG+SR+ +HGVA I  ++    L++E +
Subjt:  PVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETN

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.2e-0539.22Show/hide
Query:  PVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAI
        P+VS+S+G  A FL G +   D    + L SGD+++  G++R  FHG+  I
Subjt:  PVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein3.3e-5945.49Show/hide
Query:  PFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQ
        PFDI   K   + LKPS L  NREK    K+  +G SG V+RPGMV LK                  N L I  +V ++       +CR+LGLG GGFYQ
Subjt:  PFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQ

Query:  PGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTF---THKMG-------
        PG++DGG LHLKMMCLGKNWD  T  YG++RP DG+ PP IPVEF QLVEKA+KES +++  +S        +P + PDIC+V F   T K+G       
Subjt:  PGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTF---THKMG-------

Query:  ---DWVFIR--------------------VPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPK
            + F++                    +P+VS SIGDSAEFL+GDQ+D D+A+ + LESGD+LIFG +SR+VFHGV +IR    P+
Subjt:  ---DWVFIR--------------------VPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPK

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein2.7e-7750.3Show/hide
Query:  SCAKDESSSI--KLKENNVSDCKDSEVNKPSVNLD------PFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEG
        SC +  SS++  K++ ++V D K +     + N         FDI   K G ++LKP+LL  +REK    K+  +G SGTV+RPGMV LK  +S+ DQ  
Subjt:  SCAKDESSSI--KLKENNVSDCKDSEVNKPSVNLD------PFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGISLRDQEG

Query:  KLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAI
                  +I+              +CR LGLG GGFYQPGYRD  KLHLKMMCLGKNWDP+T  YG+ RPFDG+  P IP EF Q VEKA+KES ++
Subjt:  KLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAI

Query:  MGKDSTTKNPERVLPGMKPDICIVTF---THKMG---------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAA
           +S        +P M PDICIV F   T ++G         + +   +PVVS SIGDSAEFL+GDQRD D+AE +TLESGD+L+FGG+SR VFHGV +
Subjt:  MGKDSTTKNPERVLPGMKPDICIVTF---THKMG---------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAA

Query:  IRPDTAPKPLLEETNLRPGRLNLTFRQY
        IR DTAPK LL+ET+LRPGRLNLTFRQY
Subjt:  IRPDTAPKPLLEETNLRPGRLNLTFRQY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein9.5e-5939.6Show/hide
Query:  GSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKENNVSDCKDSEVNKPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTT--EGN
        GS +  + +DS N   +        + +       +S  K ++ +    KD           PFDIC     +V+ +     K+    +E  R T    N
Subjt:  GSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKENNVSDCKDSEVNKPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTT--EGN

Query:  SGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGT
           V+RPGMV LK                  + L    +VD++      K CRELG+   GFYQPGY  G KLHL+MMCLG+NWDP T  Y      D +
Subjt:  SGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGT

Query:  KPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKV
        K P IPV F  LVEKA++E++A++ ++S T++ ER+LP M PDICIV F  + G            + +   +P+VS SIGDSAEFL+G++RD ++A+ V
Subjt:  KPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKV

Query:  TLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
         LESGD+LIFGG+SR +FHGV +I P++AP  LL E+ LR GRLNLTFR +
Subjt:  TLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein3.6e-5835.5Show/hide
Query:  SPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPGFLGS------IASDSTSNELSPSSAQYVSKNLHSAVDRIQIGER------
        SP+ S      + H   S   K H A   + +    Q+ PL GG      +LGS        S    N  S   A  + +NL     R +   R      
Subjt:  SPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPGFLGS------IASDSTSNELSPSSAQYVSKNLHSAVDRIQIGER------

Query:  -------TAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKENNVSDCKDSEVNKPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRN
                A  GS +  + +DS N   +        + +       +S  K ++ +    KD           PFDIC     +V+ +     K+    +
Subjt:  -------TAEGGSCSGPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKENNVSDCKDSEVNKPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRN

Query:  EMKRTT--EGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTG
        E  R T    N   V+RPGMV LK                  + L    +VD++      K CRELG+   GFYQPGY  G KLHL+MMCLG+NWDP T 
Subjt:  EMKRTT--EGNSGTVLRPGMVHLKCGISLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTG

Query:  TYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPVVSLSIGDSAEFLFG
         Y      D +K P IPV F  LVEKA++E++A++ ++S T++ ER+LP M PDICIV F  + G            + +   +P+VS SIGDSAEFL+G
Subjt:  TYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGKDSTTKNPERVLPGMKPDICIVTFTHKMG------------DWVFIRVPVVSLSIGDSAEFLFG

Query:  DQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY
        ++RD ++A+ V LESGD+LIFGG+SR +FHGV +I P++AP  LL E+ LR GRLNLTFR +
Subjt:  DQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTFRQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGCTGGTGGTGATGAGCCTTCTGTTGCTGGTTCTTCTAGCCCTAACAATAGTGGAGCTTTTCGAGACAGAAATTCTCACCAGATGCCATCATCGAATTCTAAGAA
GCATTTTGCACAAAAACAGGCCTCGAGTTCTGAACATCGGCAGTGGCGGCCTTTAAATGGTGGGAAAAGTGCATCTCCTGGTTTCTTGGGATCTATTGCATCAGATTCAA
CTAGCAATGAACTCTCGCCATCATCTGCTCAATATGTCTCTAAGAATTTGCATTCTGCTGTAGATAGAATTCAGATTGGAGAACGTACAGCAGAAGGTGGAAGTTGCAGT
GGTCCATTGCCTTATGATAGTAGGAACGGATTGGATGCTGTTGACCAGGAACTCATGGTTCAAGTGCCCTTGGAATCCTGTGCTAAAGATGAGAGTTCCTCTATAAAATT
AAAGGAAAATAATGTTTCTGACTGTAAGGACTCAGAGGTTAATAAGCCTTCTGTGAACCTTGATCCTTTTGATATTTGCCCTCCAAAGTCTGGAGCTGTTATGCTGAAAC
CTTCTTTATTAGCTAAGAACAGGGAAAAGAGGAATGAGATGAAGCGTACCACGGAGGGAAATAGTGGAACTGTATTGAGACCTGGCATGGTTCATCTGAAGTGTGGCATT
TCCCTCAGGGATCAGGAGGGAAAGCTCACTTTCCCAGGACAGAGAAATTGCTTAATCATCAAGCAGAAAGTGGATCTTATTAGGATTGGTGAAAATAGTAAAGAATGTCG
GGAACTTGGTTTAGGCGCTGGAGGTTTTTACCAACCTGGTTACCGTGATGGAGGAAAACTACACTTGAAGATGATGTGCCTTGGTAAAAATTGGGATCCTGACACTGGTA
CATACGGGGATGTCCGTCCATTTGATGGTACAAAACCACCAGGCATTCCAGTTGAATTTTACCAATTGGTTGAAAAGGCAATGAAAGAGTCTTATGCCATAATGGGAAAA
GATTCAACAACCAAAAATCCTGAACGTGTACTTCCAGGGATGAAACCGGACATCTGTATCGTAACTTTTACTCACAAAATGGGCGATTGGGTCTTCATCAGAGTGCCTGT
TGTCTCCTTGTCCATTGGTGACTCTGCAGAATTCCTGTTTGGTGATCAGAGGGACACTGATCAGGCGGAAAAAGTTACTCTGGAATCAGGAGATATTTTGATATTTGGTG
GGAAATCGAGACATGTTTTTCATGGGGTGGCTGCAATTCGTCCAGACACTGCTCCGAAGCCACTATTAGAAGAAACAAATCTTCGTCCAGGGCGCTTGAATCTTACTTTC
CGTCAGTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGGCTGGTGGTGATGAGCCTTCTGTTGCTGGTTCTTCTAGCCCTAACAATAGTGGAGCTTTTCGAGACAGAAATTCTCACCAGATGCCATCATCGAATTCTAAGAA
GCATTTTGCACAAAAACAGGCCTCGAGTTCTGAACATCGGCAGTGGCGGCCTTTAAATGGTGGGAAAAGTGCATCTCCTGGTTTCTTGGGATCTATTGCATCAGATTCAA
CTAGCAATGAACTCTCGCCATCATCTGCTCAATATGTCTCTAAGAATTTGCATTCTGCTGTAGATAGAATTCAGATTGGAGAACGTACAGCAGAAGGTGGAAGTTGCAGT
GGTCCATTGCCTTATGATAGTAGGAACGGATTGGATGCTGTTGACCAGGAACTCATGGTTCAAGTGCCCTTGGAATCCTGTGCTAAAGATGAGAGTTCCTCTATAAAATT
AAAGGAAAATAATGTTTCTGACTGTAAGGACTCAGAGGTTAATAAGCCTTCTGTGAACCTTGATCCTTTTGATATTTGCCCTCCAAAGTCTGGAGCTGTTATGCTGAAAC
CTTCTTTATTAGCTAAGAACAGGGAAAAGAGGAATGAGATGAAGCGTACCACGGAGGGAAATAGTGGAACTGTATTGAGACCTGGCATGGTTCATCTGAAGTGTGGCATT
TCCCTCAGGGATCAGGAGGGAAAGCTCACTTTCCCAGGACAGAGAAATTGCTTAATCATCAAGCAGAAAGTGGATCTTATTAGGATTGGTGAAAATAGTAAAGAATGTCG
GGAACTTGGTTTAGGCGCTGGAGGTTTTTACCAACCTGGTTACCGTGATGGAGGAAAACTACACTTGAAGATGATGTGCCTTGGTAAAAATTGGGATCCTGACACTGGTA
CATACGGGGATGTCCGTCCATTTGATGGTACAAAACCACCAGGCATTCCAGTTGAATTTTACCAATTGGTTGAAAAGGCAATGAAAGAGTCTTATGCCATAATGGGAAAA
GATTCAACAACCAAAAATCCTGAACGTGTACTTCCAGGGATGAAACCGGACATCTGTATCGTAACTTTTACTCACAAAATGGGCGATTGGGTCTTCATCAGAGTGCCTGT
TGTCTCCTTGTCCATTGGTGACTCTGCAGAATTCCTGTTTGGTGATCAGAGGGACACTGATCAGGCGGAAAAAGTTACTCTGGAATCAGGAGATATTTTGATATTTGGTG
GGAAATCGAGACATGTTTTTCATGGGGTGGCTGCAATTCGTCCAGACACTGCTCCGAAGCCACTATTAGAAGAAACAAATCTTCGTCCAGGGCGCTTGAATCTTACTTTC
CGTCAGTACTGA
Protein sequenceShow/hide protein sequence
MQAGGDEPSVAGSSSPNNSGAFRDRNSHQMPSSNSKKHFAQKQASSSEHRQWRPLNGGKSASPGFLGSIASDSTSNELSPSSAQYVSKNLHSAVDRIQIGERTAEGGSCS
GPLPYDSRNGLDAVDQELMVQVPLESCAKDESSSIKLKENNVSDCKDSEVNKPSVNLDPFDICPPKSGAVMLKPSLLAKNREKRNEMKRTTEGNSGTVLRPGMVHLKCGI
SLRDQEGKLTFPGQRNCLIIKQKVDLIRIGENSKECRELGLGAGGFYQPGYRDGGKLHLKMMCLGKNWDPDTGTYGDVRPFDGTKPPGIPVEFYQLVEKAMKESYAIMGK
DSTTKNPERVLPGMKPDICIVTFTHKMGDWVFIRVPVVSLSIGDSAEFLFGDQRDTDQAEKVTLESGDILIFGGKSRHVFHGVAAIRPDTAPKPLLEETNLRPGRLNLTF
RQY