; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026639 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026639
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationtig00153033:2122135..2124531
RNA-Seq ExpressionSgr026639
SyntenySgr026639
Gene Ontology termsGO:0006974 - cellular response to DNA damage stimulus (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia]1.4e-15866.59Show/hide
Query:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL
        MLLIRT+P S    SN LR+LLFA          RLL+FQRMDSFG    S ALPDS  YGSS GG+EE  HNR H S+V+M+G+IP+ L RK +E +SL
Subjt:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL

Query:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY
           SV KCD+F+L  D+KG PAN+ +SYH+DEFPPV RQ TKRRS IDLG +   K+S  +   ++NEPF  +K  S DI S+NS  T NLPP+ S  I 
Subjt:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY

Query:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE
         P                   +K  E  +    G V+RPGMVLLKHYI L EQVNIVKT QKLGLGPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY 
Subjt:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE

Query:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD
          R  DG+KPPD+PPEFAILV KAL DA AL  IKN  +T+N+E ILP MSPDICIVNFY+TSGRLGLHQDRDES++SL+ GLPVVSFS+GNSAEFLYGD
Subjt:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD

Query:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        QRDVDKA K++LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
Subjt:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

KAG7027547.1 alkB, partial [Cucurbita argyrosperma subsp. argyrosperma]8.4e-15662.02Show/hide
Query:  MAEIFTPPASRCRLLGQHGGDRNLRIKATEISLSFPMLLIRTLPFSSSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGG
        MAEI TP  SRC L+ Q G   NLRI+ T   L+       T+  SSSS  +R++  A+    G        +Q +D         ALPDS  YGSS GG
Subjt:  MAEIFTPPASRCRLLGQHGGDRNLRIKATEISLSFPMLLIRTLPFSSSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGG

Query:  DEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSLFPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKK
        +EE  HNR H S+V+M+G+IP+ L RK +E +SL   SV KCD+F+L  D+KG PAN+ +SYH+DEFPPV RQ TKRRS IDLG +   K+S  +   ++
Subjt:  DEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSLFPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKK

Query:  NEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIYVP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLG
        NEPF  +K  S DI S+NS  T NLPP+ S  I  P                   +K  E  +    G V+RPGMVLLKHYI L EQVNIVKT QKLGLG
Subjt:  NEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIYVP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLG

Query:  PGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRL
        PGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY   R  DG+KPPD+PPEFAILV KAL DA AL  IKN  +T+N+E ILP MSPDICIVNFY+TSGRL
Subjt:  PGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRL

Query:  GLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        GLHQDRDES++SL+ GLPVVSFS+GNSAEFLYGDQRDVDKA K++LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
Subjt:  GLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

XP_022924913.1 uncharacterized protein LOC111432318 [Cucurbita moschata]4.0e-15866.38Show/hide
Query:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL
        MLLIRT+P S    SN LR+LLFA          RLL+FQR+DSFG    S ALPDS  YGSS GG+EE  HNR H S+V+M+G+IP+ L RK +E +SL
Subjt:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL

Query:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY
           SV KCD+F+L  D+KG PAN+ +SYH+DEFPPV RQ TKRRS IDLG +   K+S  +   ++NEPF  +K  S DI S+NS  T NLPP+ S  I 
Subjt:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY

Query:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE
         P                   +K  E  +    G V+RPGMVLLKHYI L EQVNIVKT QKLGLGPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY 
Subjt:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE

Query:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD
          R  DG+KPPD+PPEFAILV KAL DA AL  IKN  +T+N+E ILP MSPDICIVNFY+TSGRLGLHQDRDES++SL+ GLPVVSFS+GNSAEFLYGD
Subjt:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD

Query:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        QRDVDKA K++LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
Subjt:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

XP_022966314.1 uncharacterized protein LOC111466008 [Cucurbita maxima]7.4e-15264.43Show/hide
Query:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL
        M LIRT+P S    SN LRQLLFA          RLL+FQRMDSFG    S ALP+S  YGSS GG+EE  HNR H S+V+M+G+IP+ L RK +E +SL
Subjt:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL

Query:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY
           SV KCD+F+L  D+KG PAN+ + YH+DEFPPV RQ TKRRS ID G +   K+S  +   K+NEPF   K  S DI S+NS  T NLPP+ S  I 
Subjt:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY

Query:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE
         P                   +K  E  +    G V+RPGMVLLKHYI L EQVNIVKT QKLGLGPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY 
Subjt:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE

Query:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD
          R  DG+KPPD+PPEFAILV KAL DA AL  IKN  + + +E ILP MSPDICIVNFY+T GRLGLHQDRDES++SL+SGLPVVSFS+GNSA FLYGD
Subjt:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD

Query:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        +R+VDKA K++LESGDVLIFGGESRHIFHGVSSIIPKS PKFLLDHTG RPG LNLTFRKY
Subjt:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

XP_023517205.1 uncharacterized protein LOC111781040 [Cucurbita pepo subsp. pepo]1.5e-15766.38Show/hide
Query:  MLLIRTLPFS--SSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL
        M LIRT+P S    SN LR+LLFA          RLL+FQRMDSFG    S ALPDS  YGSS GG+EE  HNR H S+V+M+G+IP+ L RK +E +SL
Subjt:  MLLIRTLPFS--SSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL

Query:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY
           SV KCD+F+L  D+K  PAN+ +SYH+DEFPPV RQ TKRRS IDLG +   K+S  +   ++NEPF   K  S DI S+NS  T NLPP+ S  I 
Subjt:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY

Query:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE
         P                   +K  E  +    G V+RPGMVLLKHYI L EQVNIVKT QKLGLGPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY 
Subjt:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE

Query:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD
          R  DG+KPPD+PPEFAILV KAL DA AL  IKN  +T+N+E ILP MSPDICIVNFY+TSGRLGLHQDRDES++SL+SGLPVVSFS+GNSAEFLYGD
Subjt:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD

Query:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        QRDVDKA K++LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
Subjt:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein3.8e-13860.92Show/hide
Query:  MLLIRTLPF--SSSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSS--GGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHK
        M  IRTLP   S SSNQLR+LLF A  FP  R FRLL+FQ MDSF  SANSHALPDS   GSS   G D+EH H+R + SDV+ VG IP++L  K  E K
Subjt:  MLLIRTLPF--SSSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSS--GGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHK

Query:  SLFPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD-------------------------KSSLPNQSEKKNEPFYSQKRL
                                    SY+ DE  PV RQ T RRS IDLG                           KSSLP    KKNE F S K  
Subjt:  SLFPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD-------------------------KSSLPNQSEKKNEPFYSQKRL

Query:  SMDISSRNSQVTNNLPPLS--HSIYVP----LKEEEMQNPEPLG-----KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQM
        S+D   + S VT+N  P      I +P    +K   +   +  G     ++LRPGMVLLKHYIT REQ+NIVKTCQ LG+GPGGFY+PGYKDGAKLRL+M
Subjt:  SMDISSRNSQVTNNLPPLS--HSIYVP----LKEEEMQNPEPLG-----KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQM

Query:  MCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPV
        MCLGLDWDPQTR+YEN R VDG+KPPDIPP+F  LV++ALKDA A   IKN CN SNVE ILP MSPDICI NFYTT GRLGLHQDRDESK+SL  GLPV
Subjt:  MCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPV

Query:  VSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        VSFSVGN+AEFLYGD+R+VDKAE V LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  VSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC1035021838.8e-15163.24Show/hide
Query:  MLLIRTLPF--SSSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSS--GGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHK
        M  IRTLP   S SSNQLR+LLF A  FPG R F LL+FQRMDSF  SANSHA PDS   G+S   G D+EH  +R + SDV+ +G   ++L  K  E K
Subjt:  MLLIRTLPF--SSSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSS--GGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHK

Query:  SLFPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLDK-------------------------SSLPNQSEKKNEPFYSQKRL
        SL P S  KCD  E+G D+ G  +N   SYH DEF PVSRQ T RR+ IDLG  +                         SSLP    KKNE F+S KR 
Subjt:  SLFPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLDK-------------------------SSLPNQSEKKNEPFYSQKRL

Query:  SMDISSRNSQVTNNLPPLS--HSIYVP----LKEEEMQNPEPLG-----KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQM
        S+DI S+ S VT++  P      I  P    +K       +  G     ++LRPGMVLLKHYIT  EQ+NIVKTCQKLGLGPGGFY+P YKDGAKLRL+M
Subjt:  SMDISSRNSQVTNNLPPLS--HSIYVP----LKEEEMQNPEPLG-----KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQM

Query:  MCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPV
        MCLGLDWDPQTR+Y+N R VDG+KPPDIPP F+ LV+ ALKDA A   IKN+CN SNVE ILP MSPDICI NFYTTSGRLGLHQDRDESK+SL SGLPV
Subjt:  MCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPV

Query:  VSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        VSFSVGN+AEFLYGD+RDV+KAEKV LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  VSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A6J1CQI1 uncharacterized protein LOC1110138273.6e-15253.73Show/hide
Query:  MLLIRTLPFSSSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSLFP
        M +IRT+P S  SNQL +LLFA+ RFPG RS RLL+F+RMDS   SA SH            G   E+SHNR H SD+VMVG+IP+YL RK  E +S  P
Subjt:  MLLIRTLPFSSSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSLFP

Query:  SSVHKCDNFELGRDRKGTPANLLNSYHE------------------------------------------------------------------------
         SV+K D+FELGR+RK TPAN+ NSYH+                                                                        
Subjt:  SSVHKCDNFELGRDRKGTPANLLNSYHE------------------------------------------------------------------------

Query:  -------------------------------------DEFPPVSRQITKRRSWIDLG-----------------------LDKSSLPNQSEKKNEPFYSQ
                                             DEF PVSRQ TKRR+ +DLG                       LD+SS PNQ  KKNEPFY Q
Subjt:  -------------------------------------DEFPPVSRQITKRRSWIDLG-----------------------LDKSSLPNQSEKKNEPFYSQ

Query:  KRLSMDISSRNSQVTNNLPPLSHSIYVPLKEEEMQNPEPLG----------------------KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFY
        K  SMDI S+NS V +NL P       P   E   N +P                        +VLRPGMVLLK+YITL EQVNIVKTCQ+LG+GPGGFY
Subjt:  KRLSMDISSRNSQVTNNLPPLSHSIYVPLKEEEMQNPEPLG----------------------KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFY

Query:  RPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQD
        RPGYKDGAKLRLQMMCLGLDWDPQTRKY + RAVDGDKPP+IPP+FAILV +ALKDA AL  IKN+CNT NVE ILP MSPDICIVNFYTTSGRLGLHQD
Subjt:  RPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQD

Query:  RDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        RDESK+SL+SGLPVVS S+G+SAEFLYGD+RDVDKAEKV+LESGDVLIFGG+SRH+FHGVSSIIP STPKFLLDHTGLRPGRLNLTFRKY
Subjt:  RDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC1114323182.0e-15866.38Show/hide
Query:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL
        MLLIRT+P S    SN LR+LLFA          RLL+FQR+DSFG    S ALPDS  YGSS GG+EE  HNR H S+V+M+G+IP+ L RK +E +SL
Subjt:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL

Query:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY
           SV KCD+F+L  D+KG PAN+ +SYH+DEFPPV RQ TKRRS IDLG +   K+S  +   ++NEPF  +K  S DI S+NS  T NLPP+ S  I 
Subjt:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY

Query:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE
         P                   +K  E  +    G V+RPGMVLLKHYI L EQVNIVKT QKLGLGPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY 
Subjt:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE

Query:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD
          R  DG+KPPD+PPEFAILV KAL DA AL  IKN  +T+N+E ILP MSPDICIVNFY+TSGRLGLHQDRDES++SL+ GLPVVSFS+GNSAEFLYGD
Subjt:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD

Query:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        QRDVDKA K++LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
Subjt:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A6J1HTF0 uncharacterized protein LOC1114660083.6e-15264.43Show/hide
Query:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL
        M LIRT+P S    SN LRQLLFA          RLL+FQRMDSFG    S ALP+S  YGSS GG+EE  HNR H S+V+M+G+IP+ L RK +E +SL
Subjt:  MLLIRTLPFSSS--SNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEHSHNRYHISDVVMVGKIPLYLKRKVDEHKSL

Query:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY
           SV KCD+F+L  D+KG PAN+ + YH+DEFPPV RQ TKRRS ID G +   K+S  +   K+NEPF   K  S DI S+NS  T NLPP+ S  I 
Subjt:  FPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLD---KSSLPNQSEKKNEPFYSQKRLSMDISSRNSQVTNNLPPL-SHSIY

Query:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE
         P                   +K  E  +    G V+RPGMVLLKHYI L EQVNIVKT QKLGLGPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY 
Subjt:  VP-------------------LKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYE

Query:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD
          R  DG+KPPD+PPEFAILV KAL DA AL  IKN  + + +E ILP MSPDICIVNFY+T GRLGLHQDRDES++SL+SGLPVVSFS+GNSA FLYGD
Subjt:  NNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGD

Query:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        +R+VDKA K++LESGDVLIFGGESRHIFHGVSSIIPKS PKFLLDHTG RPG LNLTFRKY
Subjt:  QRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog1.4e-1533.67Show/hide
Query:  PGGFYRPGYKDGAKLRLQMMCLG-LDWDPQTR--KYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTS
        P   YR  Y  G  + + M  LG L W    R  +Y +     G   PD+PP        AL D   ++               P   PD C+VN Y   
Subjt:  PGGFYRPGYKDGAKLRLQMMCLG-LDWDPQTR--KYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTS

Query:  GRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK
         R+GLHQDRDE+        PV+S S+G++A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  GRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh12.7e-1627.69Show/hide
Query:  PGMVLLKHYITLREQVNIVKTCQ-----------------KLGLGPGGFYRPGYK-DGAKL------------------RLQMMCLGLDWDPQTRKYENN
        PG+++LK+Y++   Q+ ++K+                   +L LG    +R  Y  DG  +                  +L+ + LG  +D  T++Y   
Subjt:  PGMVLLKHYITLREQVNIVKTCQ-----------------KLGLGPGGFYRPGYK-DGAKL------------------RLQMMCLGLDWDPQTRKYENN

Query:  RAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQR
           D  K P  P +    V K +K++   +  K                 +  IVNFY+    L  H   DES++ L   LP++S S+G    +L G + 
Subjt:  RAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQR

Query:  DVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
          +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Subjt:  DVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB5.7e-1434.4Show/hide
Query:  NQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSII
        N C  +      P   PD C++N Y    +L LHQD+DE         P+VS S+G  A F +G  +  D  ++++LE GDV+++GGESR  +HG+  + 
Subjt:  NQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSII

Query:  PKSTPKFLLDHTGLRPGRLNLTFRK
            P  +         R NLTFR+
Subjt:  PKSTPKFLLDHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog1.4e-1533.67Show/hide
Query:  PGGFYRPGYKDGAKLRLQMMCLG-LDWDPQTR--KYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTS
        P   YR  Y  G  + + M  LG L W    R  +Y +     G   PD+PP        AL D   ++               P   PD C+VN Y   
Subjt:  PGGFYRPGYKDGAKLRLQMMCLG-LDWDPQTR--KYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTS

Query:  GRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK
         R+GLHQDRDE+        PV+S S+G++A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  GRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB5.7e-1434.96Show/hide
Query:  CNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPK
        C  + +        PD C++N Y    +L LHQD+DE         P+VS S+G  A F +G  R  D  ++++LE GD++++GGESR  +HG+  +   
Subjt:  CNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSIIPK

Query:  STPKFLLDHTGLRPGRLNLTFRK
          P      TG    R NLTFR+
Subjt:  STPKFLLDHTGLRPGRLNLTFRK

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein2.2e-0838.55Show/hide
Query:  PDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSI
        P+  IVN++     LG H D  E+  S     P+VS S+G  A FL G +   D    + L SGDV++  GE+R  FHG+  I
Subjt:  PDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein1.1e-6553.6Show/hide
Query:  GKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIK
        G V+RPGMVLLK+Y+++  QV IV  C++LGLG GGFY+PG++DG  L L+MMCLG +WD QTR+Y   R +DG  PP IP EF+ LV KA+K++++L+ 
Subjt:  GKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIK

Query:  IKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQ---------------------DRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVL
          +  N +     +P + PDIC+VNFYT++G+LGLHQ                     D+ ESK SL  GLP+VSFS+G+SAEFLYGDQ+DVDKA+ ++L
Subjt:  IKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQ---------------------DRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVL

Query:  ESGDVLIFGGESRHIFHGVSSI
        ESGDVLIFG  SR++FHGV SI
Subjt:  ESGDVLIFGGESRHIFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein1.6e-8060.96Show/hide
Query:  GKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIK
        G V+RPGMVLLK+Y+++ +QV IV  C++LGLG GGFY+PGY+D AKL L+MMCLG +WDP+T +Y   R  DG   P IP EF   V KA+K++++L  
Subjt:  GKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIK

Query:  IKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSS
          ++      E  +PFM PDICIVNFY+++GRLGLHQD+DES++S+  GLPVVSFS+G+SAEFLYGDQRD DKAE + LESGDVL+FGG SR +FHGV S
Subjt:  IKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSS

Query:  IIPKSTPKFLLDHTGLRPGRLNLTFRKY
        I   + PK LL  T LRPGRLNLTFR+Y
Subjt:  IIPKSTPKFLLDHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein9.5e-8163.88Show/hide
Query:  KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKI
        KV+RPGMVLLK ++T   QV+IVKTC++LG+ P GFY+PGY  G+KL LQMMCLG +WDPQT KY  N  +D  K P+IP  F +LV KA+++A AL  I
Subjt:  KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKI

Query:  KNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSI
          +  T + E ILP MSPDICIVNFY+ +GRLGLHQDRDES++S+  GLP+VSFS+G+SAEFLYG++RDV++A+ V+LESGDVLIFGGESR IFHGV SI
Subjt:  KNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSI

Query:  IPKSTPKFLLDHTGLRPGRLNLTFRKY
        IP S P  LL+ + LR GRLNLTFR +
Subjt:  IPKSTPKFLLDHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein9.5e-8163.88Show/hide
Query:  KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKI
        KV+RPGMVLLK ++T   QV+IVKTC++LG+ P GFY+PGY  G+KL LQMMCLG +WDPQT KY  N  +D  K P+IP  F +LV KA+++A AL  I
Subjt:  KVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDIPPEFAILVRKALKDARALIKI

Query:  KNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSI
          +  T + E ILP MSPDICIVNFY+ +GRLGLHQDRDES++S+  GLP+VSFS+G+SAEFLYG++RDV++A+ V+LESGDVLIFGGESR IFHGV SI
Subjt:  KNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGESRHIFHGVSSI

Query:  IPKSTPKFLLDHTGLRPGRLNLTFRKY
        IP S P  LL+ + LR GRLNLTFR +
Subjt:  IPKSTPKFLLDHTGLRPGRLNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTACCGCTTTGAAAATGGCAGAAATATTTACTCCTCCCGCTTCCCGGTGCAGGCTCCTCGGCCAACACGGCGGCGACCGGAACTTGCGGATTAAGGCGACAGAGAT
TTCATTATCATTCCCAATGTTGTTGATCCGTACGCTTCCCTTTTCGTCGTCGTCGAATCAACTTCGTCAGCTTTTATTCGCCGCGCCACGGTTTCCCGGCGAACGCAGTT
TTCGATTGCTGAAATTTCAGCGGATGGATTCCTTTGGCTGTTCTGCAAATAGCCATGCATTGCCTGATTCTCCACGTTATGGTAGTTCTGGTGGCGGCGACGAGGAACAT
TCGCATAATAGATATCATATTTCAGATGTGGTAATGGTGGGAAAGATTCCTCTGTATCTAAAGCGCAAGGTAGATGAACATAAGTCTTTATTTCCGTCGTCTGTACATAA
ATGTGATAATTTTGAATTGGGAAGAGATAGAAAAGGGACTCCTGCAAATTTGCTGAATTCTTACCATGAAGATGAGTTTCCACCAGTTTCTAGACAAATTACTAAAAGAA
GAAGCTGGATAGATTTAGGGTTGGATAAATCATCTTTGCCTAATCAATCTGAGAAGAAAAATGAACCGTTTTATTCTCAGAAACGCCTGTCTATGGATATCAGTTCCAGA
AATTCTCAAGTTACCAACAATTTGCCTCCTTTGAGCCATTCGATATATGTCCCTTTGAAAGAAGAGGAAATGCAAAACCCGGAGCCACTTGGCAAAGTGCTGAGGCCTGG
AATGGTTTTACTCAAGCATTACATTACCCTACGTGAACAGGTCAATATAGTGAAAACTTGTCAAAAGCTTGGTCTTGGCCCAGGGGGATTTTACAGGCCTGGTTATAAAG
ATGGTGCAAAACTTAGGCTTCAGATGATGTGTCTTGGTTTGGACTGGGATCCCCAAACAAGGAAATATGAAAATAACCGGGCAGTTGATGGCGATAAACCACCAGATATA
CCTCCTGAATTTGCAATTCTGGTTAGAAAAGCACTTAAAGATGCACGTGCCCTTATTAAGATCAAGAACCAATGCAATACAAGTAACGTAGAAGGCATACTTCCATTCAT
GTCTCCTGACATATGCATTGTGAACTTCTACACAACGAGTGGAAGACTGGGTCTGCACCAGGATCGCGATGAAAGCAAAGATAGTCTCATTAGCGGATTACCCGTGGTTT
CCTTCTCAGTAGGCAATTCGGCAGAATTCTTGTATGGAGATCAAAGAGATGTAGATAAAGCAGAGAAGGTTGTACTAGAATCAGGTGATGTTCTGATATTTGGTGGAGAA
TCTAGGCATATCTTTCATGGAGTGTCTTCAATCATACCAAAATCTACACCTAAGTTTTTGCTTGATCATACCGGTCTTCGTCCTGGCCGTCTGAATCTTACCTTTAGAAA
GTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTACCGCTTTGAAAATGGCAGAAATATTTACTCCTCCCGCTTCCCGGTGCAGGCTCCTCGGCCAACACGGCGGCGACCGGAACTTGCGGATTAAGGCGACAGAGAT
TTCATTATCATTCCCAATGTTGTTGATCCGTACGCTTCCCTTTTCGTCGTCGTCGAATCAACTTCGTCAGCTTTTATTCGCCGCGCCACGGTTTCCCGGCGAACGCAGTT
TTCGATTGCTGAAATTTCAGCGGATGGATTCCTTTGGCTGTTCTGCAAATAGCCATGCATTGCCTGATTCTCCACGTTATGGTAGTTCTGGTGGCGGCGACGAGGAACAT
TCGCATAATAGATATCATATTTCAGATGTGGTAATGGTGGGAAAGATTCCTCTGTATCTAAAGCGCAAGGTAGATGAACATAAGTCTTTATTTCCGTCGTCTGTACATAA
ATGTGATAATTTTGAATTGGGAAGAGATAGAAAAGGGACTCCTGCAAATTTGCTGAATTCTTACCATGAAGATGAGTTTCCACCAGTTTCTAGACAAATTACTAAAAGAA
GAAGCTGGATAGATTTAGGGTTGGATAAATCATCTTTGCCTAATCAATCTGAGAAGAAAAATGAACCGTTTTATTCTCAGAAACGCCTGTCTATGGATATCAGTTCCAGA
AATTCTCAAGTTACCAACAATTTGCCTCCTTTGAGCCATTCGATATATGTCCCTTTGAAAGAAGAGGAAATGCAAAACCCGGAGCCACTTGGCAAAGTGCTGAGGCCTGG
AATGGTTTTACTCAAGCATTACATTACCCTACGTGAACAGGTCAATATAGTGAAAACTTGTCAAAAGCTTGGTCTTGGCCCAGGGGGATTTTACAGGCCTGGTTATAAAG
ATGGTGCAAAACTTAGGCTTCAGATGATGTGTCTTGGTTTGGACTGGGATCCCCAAACAAGGAAATATGAAAATAACCGGGCAGTTGATGGCGATAAACCACCAGATATA
CCTCCTGAATTTGCAATTCTGGTTAGAAAAGCACTTAAAGATGCACGTGCCCTTATTAAGATCAAGAACCAATGCAATACAAGTAACGTAGAAGGCATACTTCCATTCAT
GTCTCCTGACATATGCATTGTGAACTTCTACACAACGAGTGGAAGACTGGGTCTGCACCAGGATCGCGATGAAAGCAAAGATAGTCTCATTAGCGGATTACCCGTGGTTT
CCTTCTCAGTAGGCAATTCGGCAGAATTCTTGTATGGAGATCAAAGAGATGTAGATAAAGCAGAGAAGGTTGTACTAGAATCAGGTGATGTTCTGATATTTGGTGGAGAA
TCTAGGCATATCTTTCATGGAGTGTCTTCAATCATACCAAAATCTACACCTAAGTTTTTGCTTGATCATACCGGTCTTCGTCCTGGCCGTCTGAATCTTACCTTTAGAAA
GTATTAA
Protein sequenceShow/hide protein sequence
MVTALKMAEIFTPPASRCRLLGQHGGDRNLRIKATEISLSFPMLLIRTLPFSSSSNQLRQLLFAAPRFPGERSFRLLKFQRMDSFGCSANSHALPDSPRYGSSGGGDEEH
SHNRYHISDVVMVGKIPLYLKRKVDEHKSLFPSSVHKCDNFELGRDRKGTPANLLNSYHEDEFPPVSRQITKRRSWIDLGLDKSSLPNQSEKKNEPFYSQKRLSMDISSR
NSQVTNNLPPLSHSIYVPLKEEEMQNPEPLGKVLRPGMVLLKHYITLREQVNIVKTCQKLGLGPGGFYRPGYKDGAKLRLQMMCLGLDWDPQTRKYENNRAVDGDKPPDI
PPEFAILVRKALKDARALIKIKNQCNTSNVEGILPFMSPDICIVNFYTTSGRLGLHQDRDESKDSLISGLPVVSFSVGNSAEFLYGDQRDVDKAEKVVLESGDVLIFGGE
SRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY