; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0001659 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0001659
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionOxoglutarate/iron-dependent dioxygenase
Genome locationchr04:17821386..17823344
RNA-Seq ExpressionIVF0001659
SyntenyIVF0001659
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058530.1 Oxoglutarate/iron-dependent dioxygenase [Cucumis melo var. makuwa]1.79e-30195.49Show/hide
Query:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
        MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
Subjt:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS

Query:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA
        LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN          KPSVDLDTFDICPPKTGGVTLNPSLLA
Subjt:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA

Query:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
        MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG          KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
Subjt:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE

Query:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL
        FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL
Subjt:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL

Query:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
Subjt:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

XP_016903133.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103501985 [Cucumis melo]7.49e-29694.81Show/hide
Query:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
        MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
Subjt:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS

Query:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA
        LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN          KPSVDLDTFDICPPKTGGVTLNPSLLA
Subjt:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA

Query:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
        MNREK NEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG          KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
Subjt:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE

Query:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL
        FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGD SDVDQAEKVTLESGDIL
Subjt:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL

Query:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        IFGGKSRHVFHGVTAIHS TAPKALLEATNLRPGRLNLTFRQY
Subjt:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

XP_022956002.1 uncharacterized protein LOC111457830 isoform X2 [Cucurbita moschata]4.98e-22475.74Show/hide
Query:  SRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLH
        SRN R   GQKQA  SEQWQWRPLNS KDASPGAVDL   HN+ DD++   KQL+GSI+SNSD N +SNTSEQ LG IAS SD  E    SAQNVSKSLH
Subjt:  SRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLH

Query:  SAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLAMN
        SAVERIQI+ PTA    C DSFPYD+   SD  GQEL VQ        D+++TIKL ESNN          KP  +L+ FDICPPK+G VTLNPSLL+ N
Subjt:  SAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLAMN

Query:  REKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFY
        REKRNEMKRAM+GNNG VLRPGMVHLK  ISL DQ KIVKKCRDLGIGAGG          KLHLKMMCLGKNWDPDSS YGDVRPFD+T PPN+P EFY
Subjt:  REKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFY

Query:  QLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIF
        +LVEKAIKDSYA++ KDS  KNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESL+KGLPV+SFSIGDSAEFLFGDWSD+DQAEKVTLESGDILIF
Subjt:  QLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIF

Query:  GGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        GGKSRHVFHGVTAIH NTAPKALLEATNLRPGRLNLTFRQY
Subjt:  GGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

XP_038875730.1 uncharacterized protein LOC120068103 isoform X1 [Benincasa hispida]2.61e-24380.14Show/hide
Query:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
        MSSRN  KP GQKQASSSEQWQWRPLNS KDAS  A+DL  +HNS DD+SNT+KQLLGSIASN+DSN +S +S+Q LG IASNS+C E   SSAQNVSKS
Subjt:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS

Query:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA
        LHSAVERIQI+  TAV  +CSDSFP+DN N SD VGQ+LKVQ  LESC KD+SST KL ESNN          KPSV+LD FDIC PKTG VTLNPSL A
Subjt:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA

Query:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
         NREKRNEMKRAM+GN+GIVLRPGMVHLK  ISLRDQ  IVK+CRDLGIGAGG          KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPP+LPDE
Subjt:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE

Query:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL
        FYQLVEKAIK SYAI+ KDST+KNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDES+ESLDKGLPV+SFSIGDSAEFLFGD SD DQAEKVTLESGDIL
Subjt:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL

Query:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        IFGGKSRHVFHGVT IH NTAPK LLEATNLRPGRLNLTFRQY
Subjt:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

XP_038875734.1 uncharacterized protein LOC120068103 isoform X2 [Benincasa hispida]3.00e-24480.14Show/hide
Query:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
        MSSRN  KP GQKQASSSEQWQWRPLNS KDAS  A+DL  +HNS DD+SNT+KQLLGSIASN+DSN +S +S+Q LG IASNS+C E   SSAQNVSKS
Subjt:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS

Query:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA
        LHSAVERIQI+  TAV  +CSDSFP+DN N SD VGQ+LKVQ  LESC KD+SST KL ESNN          KPSV+LD FDIC PKTG VTLNPSL A
Subjt:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA

Query:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
         NREKRNEMKRAM+GN+GIVLRPGMVHLK  ISLRDQ  IVK+CRDLGIGAGG          KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPP+LPDE
Subjt:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE

Query:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL
        FYQLVEKAIK SYAI+ KDST+KNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDES+ESLDKGLPV+SFSIGDSAEFLFGD SD DQAEKVTLESGDIL
Subjt:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL

Query:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        IFGGKSRHVFHGVT IH NTAPK LLEATNLRPGRLNLTFRQY
Subjt:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

TrEMBL top hitse value%identityAlignment
A0A0A0LC72 Fe2OG dioxygenase domain-containing protein1.3e-20684.2Show/hide
Query:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
        MSSRNFRKPVGQKQASSSEQWQWRPLNS KDASPGAVDL LQHNSTDDMSN NKQLL S                    IASNSDCIEL SSSAQNVSKS
Subjt:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS

Query:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA
        LHSAVERI +QGPTAVCG+  DSFPYDN N SDVVGQELKVQP+L+SCAKD+S TI+LG+SN+          KPSVDLD+FDICPPKTGGV LNPSLLA
Subjt:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA

Query:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
        MNREKRNEM+RAM+GNNGIVLRPGMVHLKG IS+RDQAKIVKKCRDLGIGA          GGKLHLKMMCLGKNWDPDSSTYGD+RPFDDTKPPNLPDE
Subjt:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE

Query:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL
        FYQLVEKAIKDSYAI+A+DSTIKNPERVLPWMKP+ICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGD SDVDQAEKVTLESGDIL
Subjt:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL

Query:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
Subjt:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

A0A1S4E4H2 LOW QUALITY PROTEIN: uncharacterized protein LOC1035019851.1e-23494.81Show/hide
Query:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
        MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
Subjt:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS

Query:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA
        LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN          KPSVDLDTFDICPPKTGGVTLNPSLLA
Subjt:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA

Query:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
        MNREK NEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA          GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
Subjt:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE

Query:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL
        FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGD SDVDQAEKVTLESGDIL
Subjt:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL

Query:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        IFGGKSRHVFHGVTAIHS TAPKALLEATNLRPGRLNLTFRQY
Subjt:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

A0A5D3CA69 Oxoglutarate/iron-dependent dioxygenase7.6e-23995.49Show/hide
Query:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
        MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS
Subjt:  MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKS

Query:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA
        LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN          KPSVDLDTFDICPPKTGGVTLNPSLLA
Subjt:  LHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLA

Query:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
        MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA          GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE
Subjt:  MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDE

Query:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL
        FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL
Subjt:  FYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDIL

Query:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
Subjt:  IFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

A0A6J1GV59 uncharacterized protein LOC111457830 isoform X28.0e-18075.74Show/hide
Query:  SRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLH
        SRN R   GQKQA  SEQWQWRPLNS KDASPGAVDL   HN+ DD++   KQL+GSI+SNSD N +SNTSEQ LG IAS SD  E    SAQNVSKSLH
Subjt:  SRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLH

Query:  SAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLAMN
        SAVERIQI+ PTA    C DSFPYD+   SD  GQEL VQ        D+++TIKL ESNN          KP  +L+ FDICPPK+G VTLNPSLL+ N
Subjt:  SAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLAMN

Query:  REKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFY
        REKRNEMKRAM+GNNG VLRPGMVHLK  ISL DQ KIVKKCRDLGIGA          GGKLHLKMMCLGKNWDPDSS YGDVRPFD+T PPN+P EFY
Subjt:  REKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFY

Query:  QLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIF
        +LVEKAIKDSYA++ KDS  KNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESL+KGLPV+SFSIGDSAEFLFGDWSD+DQAEKVTLESGDILIF
Subjt:  QLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIF

Query:  GGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        GGKSRHVFHGVTAIH NTAPKALLEATNLRPGRLNLTFRQY
Subjt:  GGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

A0A6J1GWM4 uncharacterized protein LOC111457830 isoform X18.0e-18075.74Show/hide
Query:  SRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLH
        SRN R   GQKQA  SEQWQWRPLNS KDASPGAVDL   HN+ DD++   KQL+GSI+SNSD N +SNTSEQ LG IAS SD  E    SAQNVSKSLH
Subjt:  SRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLH

Query:  SAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLAMN
        SAVERIQI+ PTA    C DSFPYD+   SD  GQEL VQ        D+++TIKL ESNN          KP  +L+ FDICPPK+G VTLNPSLL+ N
Subjt:  SAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNN----------KPSVDLDTFDICPPKTGGVTLNPSLLAMN

Query:  REKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFY
        REKRNEMKRAM+GNNG VLRPGMVHLK  ISL DQ KIVKKCRDLGIGA          GGKLHLKMMCLGKNWDPDSS YGDVRPFD+T PPN+P EFY
Subjt:  REKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFY

Query:  QLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIF
        +LVEKAIKDSYA++ KDS  KNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESL+KGLPV+SFSIGDSAEFLFGDWSD+DQAEKVTLESGDILIF
Subjt:  QLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIF

Query:  GGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        GGKSRHVFHGVTAIH NTAPKALLEATNLRPGRLNLTFRQY
Subjt:  GGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog3.3e-1340Show/hide
Query:  PWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEAT
        P   P+ C+VN Y    R+GLHQDRDE+    D   PV+S S+GD+A F  G  +  D    + L SGD+    G +R  FHGV  I        L  ++
Subjt:  PWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEAT

Query:  NLRP--GRLNLTFRQ
        +L P  GR+NLT R+
Subjt:  NLRP--GRLNLTFRQ

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB8.6e-1429.38Show/hide
Query:  YGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGD
        Y  + P  +   P +P  F+ L ++A   +                 P  +P+ C++N Y+   +L LHQD+DE     D   P++S S+G  A F FG 
Subjt:  YGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGD

Query:  WSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQ
            D  +++ LE GD++++GG+SR  +HG+  + +   P  +         R NLTFRQ
Subjt:  WSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQ

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog3.3e-1340Show/hide
Query:  PWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEAT
        P   P+ C+VN Y    R+GLHQDRDE+    D   PV+S S+GD+A F  G  +  D    + L SGD+    G +R  FHGV  I        L  ++
Subjt:  PWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEAT

Query:  NLRP--GRLNLTFRQ
        +L P  GR+NLT R+
Subjt:  NLRP--GRLNLTFRQ

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB1.1e-1331.58Show/hide
Query:  LGKNWDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFS
        LG   D     Y    P  D   P LP  F  +  +A     AI A  ++           +P+ C++N Y+   +L LHQD+DE     D   P++S S
Subjt:  LGKNWDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFS

Query:  IGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQ
        +G  A F FG     D  +++ LE GDI+++GG+SR  +HG+  + +   P            R NLTFRQ
Subjt:  IGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQ

Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB1.1e-0837.08Show/hide
Query:  VNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLE
        VNFYS++  +G H   D++++ ++K  P+IS S G +A FL G  +       + + SGDI+I GG+SR+ +HGV  I  N+    L++
Subjt:  VNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLE

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein8.6e-0930Show/hide
Query:  RPFDDTKP-PNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSD
        R +D + P  N+PD   QL     K   AI   D     PE           IVN++     LG H D  E+    D   P++S S+G  A FL G  S 
Subjt:  RPFDDTKP-PNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSD

Query:  VDQAEKVTLESGDILIFGGKSRHVFHGVTAIHS--NTAPKALLE-----------ATNLRPGRLNLTFRQ
         D    + L SGD+++  G++R  FHG+  I +    A    LE           A  ++  R+N+  RQ
Subjt:  VDQAEKVTLESGDILIFGGKSRHVFHGVTAIHS--NTAPKALLE-----------ATNLRPGRLNLTFRQ

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein2.5e-6447.46Show/hide
Query:  GESNNKPSVDLDTFDICPPKTGGVTLNPSLLAMNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLK
        G  N+        FDI   K   + L PS L +NREK    K+A  G +GIV+RPGMV LK  +S+ +Q  IV KCR LG+G           GG LHLK
Subjt:  GESNNKPSVDLDTFDICPPKTGGVTLNPSLLAMNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGA----------GGKLHLK

Query:  MMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQ----------------
        MMCLGKNWD  +  YG++RP D + PP +P EF QLVEKAIK+S +++A +S        +P + P+IC+VNFY+  G+LGLHQ                
Subjt:  MMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQ----------------

Query:  -----DRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPK
             D+ ES++SL KGLP++SFSIGDSAEFL+GD  DVD+A+ + LESGD+LIFG +SR+VFHGV +I     P+
Subjt:  -----DRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPK

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein2.5e-8548.37Show/hide
Query:  ISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLHSAVERIQIQGPTAVC---GNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNNKP
        I +T +   G  A ++  +  L S + NVS     + E     G    C      +D       + S  V Q+++    L S    KS+    G  N+  
Subjt:  ISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLHSAVERIQIQGPTAVC---GNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNNKP

Query:  SVDLDTFDICPPKTGGVTLNPSLLAMNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKN
              FDI   K  G+ L P+LL ++REK    K+A  G +G V+RPGMV LK  +S+ DQ  IV KCR LG+G GG          KLHLKMMCLGKN
Subjt:  SVDLDTFDICPPKTGGVTLNPSLLAMNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGIGAGG----------KLHLKMMCLGKN

Query:  WDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDS
        WDP++S YG+ RPFD +  P +P EF Q VEKA+K+S ++ A +S        +P+M P+ICIVNFYS  GRLGLHQD+DES+ S+ KGLPV+SFSIGDS
Subjt:  WDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQESLDKGLPVISFSIGDS

Query:  AEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        AEFL+GD  D D+AE +TLESGD+L+FGG+SR VFHGV +I  +TAPKALL+ T+LRPGRLNLTFRQY
Subjt:  AEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein2.2e-6539.43Show/hide
Query:  QLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSS
        Q++  + S S S  I ++S     PI      +   +   +  S+ LH        +    + G+ +    +D+ N S            +      ++S
Subjt:  QLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSS

Query:  TIKLGESNNKPSVDLDTFDICPP--KTGGVTLNPSLLA--MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGI----------G
          K  + + +   D   FDIC    +    ++   +LA   NRE           N   V+RPGMV LK  ++   Q  IVK CR+LG+           
Subjt:  TIKLGESNNKPSVDLDTFDICPP--KTGGVTLNPSLLA--MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGI----------G

Query:  AGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQES
         G KLHL+MMCLG+NWDP +    +     D+K P +P  F  LVEKAI++++A+I ++S  ++ ER+LP M P+ICIVNFYS+ GRLGLHQDRDES+ES
Subjt:  AGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQDRDESQES

Query:  LDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        + +GLP++SFSIGDSAEFL+G+  DV++A+ V LESGD+LIFGG+SR +FHGV +I  N+AP +LL  + LR GRLNLTFR +
Subjt:  LDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein1.7e-6539.9Show/hide
Query:  SNTNKQLLGSI--ASNSDSNQISN-TSEQRLGPIASNSDCIELLSSSAQNVSKSLHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLE
        S T K  LGS   ++   S+Q+ N TS +    +  N  C    +   +  S+ LH        +    + G+ +    +D+ N S            + 
Subjt:  SNTNKQLLGSI--ASNSDSNQISN-TSEQRLGPIASNSDCIELLSSSAQNVSKSLHSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLE

Query:  SCAKDKSSTIKLGESNNKPSVDLDTFDICPP--KTGGVTLNPSLLA--MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGI---
             ++S  K  + + +   D   FDIC    +    ++   +LA   NRE           N   V+RPGMV LK  ++   Q  IVK CR+LG+   
Subjt:  SCAKDKSSTIKLGESNNKPSVDLDTFDICPP--KTGGVTLNPSLLA--MNREKRNEMKRAMDGNNGIVLRPGMVHLKGSISLRDQAKIVKKCRDLGI---

Query:  -------GAGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQ
                 G KLHL+MMCLG+NWDP +    +     D+K P +P  F  LVEKAI++++A+I ++S  ++ ER+LP M P+ICIVNFYS+ GRLGLHQ
Subjt:  -------GAGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGLHQ

Query:  DRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY
        DRDES+ES+ +GLP++SFSIGDSAEFL+G+  DV++A+ V LESGD+LIFGG+SR +FHGV +I  N+AP +LL  + LR GRLNLTFR +
Subjt:  DRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCAAGGAATTTCAGGAAGCCTGTTGGACAAAAACAGGCCTCAAGTTCTGAACAGTGGCAGTGGCGGCCTTTAAATAGCAGGAAAGATGCATCTCCTGGTGCTGT
TGATCTTCACCTGCAACATAATTCAACTGATGATATGAGCAATACCAATAAACAATTATTGGGATCTATTGCATCGAATTCTGATAGTAATCAAATAAGCAATACTTCAG
AGCAACGATTGGGACCTATTGCATCAAATTCTGATTGCATCGAACTCTTGTCCTCTTCTGCTCAAAATGTCTCTAAGAGTTTGCATTCTGCTGTAGAAAGAATTCAGATT
CAGGGACCTACAGCAGTATGTGGAAATTGTAGTGATTCTTTTCCTTATGATAATCGTAACGGATCAGATGTGGTTGGACAGGAACTAAAGGTTCAACCTACACTAGAATC
CTGTGCAAAAGATAAGAGTTCCACCATAAAACTTGGGGAAAGTAATAATAAGCCTTCAGTAGACCTCGATACTTTTGATATATGCCCTCCAAAAACTGGAGGTGTCACAC
TGAATCCTTCTTTATTAGCTATGAACAGAGAAAAAAGAAATGAGATGAAGCGAGCAATGGATGGAAATAATGGAATTGTGTTGAGACCAGGAATGGTTCATCTGAAGGGT
AGCATTTCCCTCAGGGATCAGGCAAAGATAGTAAAAAAATGTCGGGATCTTGGTATTGGAGCTGGAGGAAAACTGCACCTGAAAATGATGTGCCTTGGTAAAAATTGGGA
TCCTGACAGTAGTACATATGGGGATGTTCGTCCATTTGATGATACAAAACCACCTAACCTACCAGATGAATTTTATCAACTGGTTGAAAAGGCAATCAAAGATTCTTATG
CTATTATAGCAAAAGATTCAACAATAAAAAATCCTGAACGCGTACTTCCATGGATGAAACCTAACATCTGTATTGTAAACTTCTACTCACAAAATGGACGATTGGGTCTT
CATCAGGATCGAGATGAAAGTCAAGAAAGTCTTGATAAAGGATTGCCTGTCATCTCCTTCTCCATTGGTGACTCTGCTGAATTCCTATTTGGTGATTGGAGTGATGTTGA
TCAAGCAGAGAAAGTTACTTTGGAATCAGGAGATATCTTGATATTTGGTGGGAAATCAAGGCACGTTTTCCATGGAGTGACTGCAATTCATTCAAACACTGCTCCAAAAG
CACTTTTAGAAGCAACAAATCTTCGTCCAGGTCGCTTAAACCTTACTTTCCGTCAGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCAAGGAATTTCAGGAAGCCTGTTGGACAAAAACAGGCCTCAAGTTCTGAACAGTGGCAGTGGCGGCCTTTAAATAGCAGGAAAGATGCATCTCCTGGTGCTGT
TGATCTTCACCTGCAACATAATTCAACTGATGATATGAGCAATACCAATAAACAATTATTGGGATCTATTGCATCGAATTCTGATAGTAATCAAATAAGCAATACTTCAG
AGCAACGATTGGGACCTATTGCATCAAATTCTGATTGCATCGAACTCTTGTCCTCTTCTGCTCAAAATGTCTCTAAGAGTTTGCATTCTGCTGTAGAAAGAATTCAGATT
CAGGGACCTACAGCAGTATGTGGAAATTGTAGTGATTCTTTTCCTTATGATAATCGTAACGGATCAGATGTGGTTGGACAGGAACTAAAGGTTCAACCTACACTAGAATC
CTGTGCAAAAGATAAGAGTTCCACCATAAAACTTGGGGAAAGTAATAATAAGCCTTCAGTAGACCTCGATACTTTTGATATATGCCCTCCAAAAACTGGAGGTGTCACAC
TGAATCCTTCTTTATTAGCTATGAACAGAGAAAAAAGAAATGAGATGAAGCGAGCAATGGATGGAAATAATGGAATTGTGTTGAGACCAGGAATGGTTCATCTGAAGGGT
AGCATTTCCCTCAGGGATCAGGCAAAGATAGTAAAAAAATGTCGGGATCTTGGTATTGGAGCTGGAGGAAAACTGCACCTGAAAATGATGTGCCTTGGTAAAAATTGGGA
TCCTGACAGTAGTACATATGGGGATGTTCGTCCATTTGATGATACAAAACCACCTAACCTACCAGATGAATTTTATCAACTGGTTGAAAAGGCAATCAAAGATTCTTATG
CTATTATAGCAAAAGATTCAACAATAAAAAATCCTGAACGCGTACTTCCATGGATGAAACCTAACATCTGTATTGTAAACTTCTACTCACAAAATGGACGATTGGGTCTT
CATCAGGATCGAGATGAAAGTCAAGAAAGTCTTGATAAAGGATTGCCTGTCATCTCCTTCTCCATTGGTGACTCTGCTGAATTCCTATTTGGTGATTGGAGTGATGTTGA
TCAAGCAGAGAAAGTTACTTTGGAATCAGGAGATATCTTGATATTTGGTGGGAAATCAAGGCACGTTTTCCATGGAGTGACTGCAATTCATTCAAACACTGCTCCAAAAG
CACTTTTAGAAGCAACAAATCTTCGTCCAGGTCGCTTAAACCTTACTTTCCGTCAGTATTGA
Protein sequenceShow/hide protein sequence
MSSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNKQLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSLHSAVERIQI
QGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGESNNKPSVDLDTFDICPPKTGGVTLNPSLLAMNREKRNEMKRAMDGNNGIVLRPGMVHLKG
SISLRDQAKIVKKCRDLGIGAGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNFYSQNGRLGL
HQDRDESQESLDKGLPVISFSIGDSAEFLFGDWSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY