; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg12666 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg12666
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description2-oxoglutarate-dependent dioxygenase family protein isoform 1
Genome locationCarg_Chr07:7228348..7231741
RNA-Seq ExpressionCarg12666
SyntenyCarg12666
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia]3.0e-23897.36Show/hide
Query:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK
        +Q +D     ALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK
Subjt:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK

Query:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV
        RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNS+ATANLPPIESFDICFPERRGKSK RYSWQSKDR+TMKVMEHADEATNGIVMRPGMV
Subjt:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV

Query:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
        LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
Subjt:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE

Query:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
Subjt:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

Query:  DHTGLRPGRLNLTFRKY
        DHTGLRPGRLNLTFRKY
Subjt:  DHTGLRPGRLNLTFRKY

KAG7027547.1 alkB, partial [Cucurbita argyrosperma subsp. argyrosperma]3.5e-279100Show/hide
Query:  MAEIITPSTSRCSLVAQLGVRWNLRIRLTNPRLATTVVESSSSAFVRRVSIASVSADGFIWQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIM
        MAEIITPSTSRCSLVAQLGVRWNLRIRLTNPRLATTVVESSSSAFVRRVSIASVSADGFIWQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIM
Subjt:  MAEIITPSTSRCSLVAQLGVRWNLRIRLTNPRLATTVVESSSSAFVRRVSIASVSADGFIWQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIM

Query:  IGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGS
        IGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGS
Subjt:  IGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGS

Query:  KNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLR
        KNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLR
Subjt:  KNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLR

Query:  LQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLP
        LQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLP
Subjt:  LQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLP

Query:  VVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        VVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
Subjt:  VVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

XP_022924913.1 uncharacterized protein LOC111432318 [Cucurbita moschata]1.2e-23997.84Show/hide
Query:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK
        +Q VD     ALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK
Subjt:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK

Query:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV
        RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNS+ATANLPPIESFDICFPERRGKSKPRYSWQSKDR+TMKVMEHADEATNGIVMRPGMV
Subjt:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV

Query:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
        LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
Subjt:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE

Query:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
Subjt:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

Query:  DHTGLRPGRLNLTFRKY
        DHTGLRPGRLNLTFRKY
Subjt:  DHTGLRPGRLNLTFRKY

XP_022966314.1 uncharacterized protein LOC111466008 [Cucurbita maxima]1.3e-22592.81Show/hide
Query:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK
        +Q +D     ALP+SSCYGSS GGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPS YHDDEFPPVPRQNTK
Subjt:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK

Query:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV
        RRSRID GSERRLKNSTSSSQM+RNEPFSF KHRS DIGSKNS+ATANLPPIESFDICFPERRGKSKPR SWQ KDR+TMKVMEH DEATNGIVMRPGMV
Subjt:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV

Query:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
        LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGD N IE
Subjt:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE

Query:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        DILPTMSPDICIVNFYST GRLGLHQDRDESRESLV GLPVVSFSLGNSA FLYGD+R+VDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS PKFLL
Subjt:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

Query:  DHTGLRPGRLNLTFRKY
        DHTG RPG LNLTFRKY
Subjt:  DHTGLRPGRLNLTFRKY

XP_023517205.1 uncharacterized protein LOC111781040 [Cucurbita pepo subsp. pepo]3.7e-23696.64Show/hide
Query:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK
        +Q +D     ALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQK IPANIPSSYHDDEFPPVPRQNTK
Subjt:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK

Query:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV
        RRSRIDLGSERRLKNSTSSSQMERNEPFSF KHRS DIGSKNS+ATANLPPIESFDICFPERRGKSKPRYSWQSKDR+TMKVMEHADEATNGIVMRPGMV
Subjt:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV

Query:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
        LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
Subjt:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE

Query:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLV GLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
Subjt:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

Query:  DHTGLRPGRLNLTFRKY
        DHTGLRPGRLNLTFRKY
Subjt:  DHTGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A5A7U7Q2 2-oxoglutarate-dependent dioxygenase family protein isoform 12.6e-14263.57Show/hide
Query:  ALPDSSCYGSS--CGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLG
        A PDSSC G+S  CG ++E L +RD+ S+VI +G   V+LN K  E +SL+ LSV KCD  ++ SD+ GI +N P SYH DE  PV RQNT RR+RIDLG
Subjt:  ALPDSSCYGSS--CGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLG

Query:  SERRLKNSTSSSQMERNEPFS---------------------FKKHRSPDIGSKNSIATANLPPIE-SFDICFPERRGKSKPRYSWQSKDRNTMKVMEHA
        S+R LK++  S Q+ER+E  +                     F K +S DIGSK S+ T +  P E  FDICFP   G  K R  W+ KD  T+K     
Subjt:  SERRLKNSTSSSQMERNEPFS---------------------FKKHRSPDIGSKNSIATANLPPIE-SFDICFPERRGKSKPRYSWQSKDRNTMKVMEHA

Query:  DEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDA
               ++RPGMVLLKHYI   EQ+NIVKT QKLGLGPGGFYQPGYKDGAKLRL+MMCLGLDWDPQTR+Y  KRV DGNKPPD+PP F+ LV  AL DA
Subjt:  DEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDA

Query:  HALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHG
        HA IKN  + +N+EDILP+MSPDICI NFY+TSGRLGLHQDRDES+ESL  GLPVVSFS+GN+AEFLYGD+RDVDKA K+ LESGDVLIFGGESRH+FHG
Subjt:  HALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHG

Query:  VSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        VSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  VSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 11.8e-14363.81Show/hide
Query:  ALPDSSCYGSS--CGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLG
        A PDSSC G+S  CG ++E L +RD+ S+VI +G   V+LN K  E +SL+ LSV KCD  ++ SD+ GI +N P SYH DE  PV RQNT RR+RIDLG
Subjt:  ALPDSSCYGSS--CGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLG

Query:  SERRLKNSTSSSQMERNEPFS---------------------FKKHRSPDIGSKNSIATANLPPIE-SFDICFPERRGKSKPRYSWQSKDRNTMKVMEHA
        S+R LK++  S Q+ER+E  +                     F K +S DIGSK S+ T + PP E  FDICFP   G  K R  W+ KD  T+K     
Subjt:  SERRLKNSTSSSQMERNEPFS---------------------FKKHRSPDIGSKNSIATANLPPIE-SFDICFPERRGKSKPRYSWQSKDRNTMKVMEHA

Query:  DEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDA
               ++RPGMVLLKHYI   EQ+NIVKT QKLGLGPGGFYQPGYKDGAKLRL+MMCLGLDWDPQTR+Y  KRV DGNKPPD+PP F+ LV  AL DA
Subjt:  DEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDA

Query:  HALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHG
        HA IKN  + +N+EDILP+MSPDICI NFY+TSGRLGLHQDRDES+ESL  GLPVVSFS+GN+AEFLYGD+RDVDKA K+ LESGDVLIFGGESRH+FHG
Subjt:  HALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHG

Query:  VSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        VSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  VSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A6J1CQI1 uncharacterized protein LOC1110138272.0e-14754.77Show/hide
Query:  GGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRL---------
        G   E  HNR H+S+++M+GEIPV LNRK  E+ES S  SV K DDF+L  ++K  PAN+P+SYHDD+F PV RQN K RSR+DLG ER +         
Subjt:  GGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRL---------

Query:  --------------------------------------------------------------------------------------------------KN
                                                                                                           N
Subjt:  --------------------------------------------------------------------------------------------------KN

Query:  STSSSQME----------------------RNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGI
        +TSS Q+E                      +NEPF  +K +S DIGSKNS+   NL P E FDIC  ERRG +KP   WQ K R+T+KVMEH  EA+N  
Subjt:  STSSSQME----------------------RNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGI

Query:  VMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNN
        V+RPGMVLLK+YI LHEQVNIVKT Q+LG+GPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY  KR  DG+KPP++PP+FAILV +AL DAHALIKN 
Subjt:  VMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNN

Query:  GDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPK
         +T N+E ILP+MSPDICIVNFY+TSGRLGLHQDRDES+ESLV GLPVVS SLG+SAEFLYGD+RDVDKA K+ILESGDVLIFGG+SRH+FHGVSSIIP 
Subjt:  GDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPK

Query:  STPKFLLDHTGLRPGRLNLTFRKY
        STPKFLLDHTGLRPGRLNLTFRKY
Subjt:  STPKFLLDHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC1114323185.9e-24097.84Show/hide
Query:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK
        +Q VD     ALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK
Subjt:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK

Query:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV
        RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNS+ATANLPPIESFDICFPERRGKSKPRYSWQSKDR+TMKVMEHADEATNGIVMRPGMV
Subjt:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV

Query:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
        LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
Subjt:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE

Query:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
Subjt:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

Query:  DHTGLRPGRLNLTFRKY
        DHTGLRPGRLNLTFRKY
Subjt:  DHTGLRPGRLNLTFRKY

A0A6J1HTF0 uncharacterized protein LOC1114660086.4e-22692.81Show/hide
Query:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK
        +Q +D     ALP+SSCYGSS GGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPS YHDDEFPPVPRQNTK
Subjt:  WQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTK

Query:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV
        RRSRID GSERRLKNSTSSSQM+RNEPFSF KHRS DIGSKNS+ATANLPPIESFDICFPERRGKSKPR SWQ KDR+TMKVMEH DEATNGIVMRPGMV
Subjt:  RRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMV

Query:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE
        LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGD N IE
Subjt:  LLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIE

Query:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        DILPTMSPDICIVNFYST GRLGLHQDRDESRESLV GLPVVSFSLGNSA FLYGD+R+VDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS PKFLL
Subjt:  DILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

Query:  DHTGLRPGRLNLTFRKY
        DHTG RPG LNLTFRKY
Subjt:  DHTGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog4.2e-1735.03Show/hide
Query:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGR
        P   Y+  Y  G  + + M  LG L W    R  +Y  +    G   PD+PP        AL D   ++   GD        P   PD C+VN Y    R
Subjt:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGR

Query:  LGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh11.6e-1627.92Show/hide
Query:  PGMVLLKHYIPLHEQVNIVKTIQ-----------------KLGLGPGGFYQPGYK-DGAKL------------------RLQMMCLGLDWDPQTRKYARK
        PG+++LK+Y+    Q+ ++K+I                  +L LG    ++  Y  DG  +                  +L+ + LG  +D  T++Y   
Subjt:  PGMVLLKHYIPLHEQVNIVKTIQ-----------------KLGLGPGGFYQPGYK-DGAKL------------------RLQMMCLGLDWDPQTRKYARK

Query:  RVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDV
           D +K P  P +    V K + ++   +                  +  IVNFYS    L  H   DES E L   LP++S S+G    +L G +   
Subjt:  RVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDV

Query:  DKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Subjt:  DKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB1.2e-1431.61Show/hide
Query:  CLGLDWDPQTRKYARKRV-ADGNKP-PDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVV
        C  L W    + Y    +    NKP P +P  F  L  +A   A                 P   PD C++N Y+   +L LHQD+DE         P+V
Subjt:  CLGLDWDPQTRKYARKRV-ADGNKP-PDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVV

Query:  SFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRK
        S SLG  A F +G  +  D   +++LE GDV+++GGESR  +HG+  +     P  +         R NLTFR+
Subjt:  SFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog4.2e-1735.03Show/hide
Query:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGR
        P   Y+  Y  G  + + M  LG L W    R  +Y  +    G   PD+PP        AL D   ++   GD        P   PD C+VN Y    R
Subjt:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGR

Query:  LGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  LGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB4.4e-1439.29Show/hide
Query:  TMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTG
        +  PD C++N Y+   +L LHQD+DE         P+VS SLG  A F +G  R  D   +I+LE GD++++GGESR  +HG+  +     P      TG
Subjt:  TMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTG

Query:  LRPGRLNLTFRK
            R NLTFR+
Subjt:  LRPGRLNLTFRK

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein5.7e-0939.76Show/hide
Query:  PDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSI
        P+  IVN++     LG H D  E+  S     P+VS SLG  A FL G +   D    + L SGDV++  GE+R  FHG+  I
Subjt:  PDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein8.6e-6645.12Show/hide
Query:  NSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNI
        +STS   +E +   + K     D G  NS   ++  P   FDI   ++  + KP +   ++++      + A +  +GIV+RPGMVLLK+Y+ ++ QV I
Subjt:  NSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNI

Query:  VKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVN
        V   ++LGLG GGFYQPG++DG  L L+MMCLG +WD QTR+Y   R  DG+ PP +P EF+ LV KA+ ++ +L+  N +     D +P + PDIC+VN
Subjt:  VKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVN

Query:  FYSTSGRLGLHQ---------------------DRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSI
        FY+++G+LGLHQ                     D+ ES++SL  GLP+VSFS+G+SAEFLYGDQ+DVDKA  +ILESGDVLIFG  SR++FHGV SI
Subjt:  FYSTSGRLGLHQ---------------------DRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein8.9e-7950.33Show/hide
Query:  KNSTSSSQMERNEPFSFKKHRSPD--IGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQ
        + S SS+ +++ E  S +  +S     G+ NS   +N      FDI   ++    KP     S+++      + A +  +G V+RPGMVLLK+Y+ +++Q
Subjt:  KNSTSSSQMERNEPFSFKKHRSPD--IGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQ

Query:  VNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDIC
        V IV   ++LGLG GGFYQPGY+D AKL L+MMCLG +WDP+T +Y   R  DG+  P +P EF   V KA+ ++ +L  +N       D +P M PDIC
Subjt:  VNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDIC

Query:  IVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLN
        IVNFYS++GRLGLHQD+DES  S+  GLPVVSFS+G+SAEFLYGDQRD DKA  + LESGDVL+FGG SR +FHGV SI   + PK LL  T LRPGRLN
Subjt:  IVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLN

Query:  LTFRKY
        LTFR+Y
Subjt:  LTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein4.3e-8152.48Show/hide
Query:  NSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNI
        +S+ SSQ +  +    + HR+    S++  +   +     FDIC             W   D    + +E +++     V+RPGMVLLK ++    QV+I
Subjt:  NSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNI

Query:  VKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVN
        VKT ++LG+ P GFYQPGY  G+KL LQMMCLG +WDPQT KY RK     +K P++P  F +LV KA+ +AHALI     T + E ILP MSPDICIVN
Subjt:  VKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVN

Query:  FYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTF
        FYS +GRLGLHQDRDES ES+  GLP+VSFS+G+SAEFLYG++RDV++A  +ILESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GRLNLTF
Subjt:  FYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTF

Query:  RKY
        R +
Subjt:  RKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein4.3e-8152.48Show/hide
Query:  NSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNI
        +S+ SSQ +  +    + HR+    S++  +   +     FDIC             W   D    + +E +++     V+RPGMVLLK ++    QV+I
Subjt:  NSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNI

Query:  VKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVN
        VKT ++LG+ P GFYQPGY  G+KL LQMMCLG +WDPQT KY RK     +K P++P  F +LV KA+ +AHALI     T + E ILP MSPDICIVN
Subjt:  VKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVN

Query:  FYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTF
        FYS +GRLGLHQDRDES ES+  GLP+VSFS+G+SAEFLYG++RDV++A  +ILESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GRLNLTF
Subjt:  FYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTF

Query:  RKY
        R +
Subjt:  RKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAATCATTACTCCTTCCACTTCCCGGTGCAGTCTCGTCGCCCAACTCGGCGTCCGCTGGAACTTGCGGATTAGGCTTACCAATCCCCGCCTCGCTACCACCGT
GGTCGAATCTTCTTCGTCGGCTTTTGTTCGCCGAGTCTCGATTGCTTCAGTTTCAGCGGATGGATTCATTTGGCAGTTCGTTGATTTCGTAATTCTTCAGGCGTTACCTG
ATTCATCATGTTATGGTAGTTCTTGTGGCGGCAATGAGGAATGTTTGCATAATAGAGATCATAATTCGAATGTGATAATGATAGGAGAGATTCCTGTGAATCTAAATCGT
AAAGGAAATGAACAGGAATCCTTGTCTCGGTTGTCTGTTGGTAAATGTGATGATTTCAAGTTGAGAAGCGATCAAAAAGGGATTCCTGCTAATATACCGAGTTCATACCA
TGATGATGAGTTTCCTCCTGTTCCTAGACAAAATACCAAAAGGAGAAGCCGGATAGATTTAGGATCGGAAAGAAGGTTGAAGAACAGTACAAGCTCATCACAAATGGAGA
GGAATGAACCATTTAGTTTCAAGAAACATCGGTCTCCGGATATTGGTTCCAAAAATTCTATTGCCACTGCCAATTTGCCTCCCATTGAATCTTTCGATATATGCTTTCCT
GAAAGAAGAGGTAAATCAAAACCCAGATATTCTTGGCAGTCTAAGGATAGGAACACTATGAAAGTAATGGAGCATGCTGATGAAGCTACAAATGGTATTGTGATGAGGCC
TGGAATGGTTTTACTGAAGCATTACATTCCTCTACATGAACAGGTTAATATTGTGAAAACTATTCAAAAACTTGGTCTTGGCCCAGGGGGATTTTACCAGCCTGGTTATA
AAGATGGTGCAAAACTCCGGCTTCAGATGATGTGTCTTGGATTGGACTGGGATCCTCAGACGAGGAAATATGCTCGTAAACGGGTTGCTGATGGTAATAAACCTCCAGAT
TTACCTCCTGAATTTGCAATTCTGGTTGGGAAAGCACTTAACGATGCACATGCCTTGATCAAGAACAATGGCGACACGAATAACATAGAAGACATACTTCCAACAATGTC
TCCTGACATATGCATTGTGAATTTCTACTCAACGAGTGGAAGACTGGGTCTGCATCAGGATCGTGATGAAAGCAGAGAGAGTCTCGTCGGCGGACTACCTGTTGTTTCCT
TTTCTTTGGGCAATTCAGCAGAGTTCTTGTATGGAGATCAAAGAGATGTCGATAAAGCAGGGAAGATTATACTGGAATCAGGTGATGTTCTAATTTTTGGTGGAGAATCT
AGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCGACGCCTAAGTTTCTGCTTGATCATACTGGTCTTCGTCCTGGGCGTCTAAATCTTACATTTAGAAAGTA
CTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAAATCATTACTCCTTCCACTTCCCGGTGCAGTCTCGTCGCCCAACTCGGCGTCCGCTGGAACTTGCGGATTAGGCTTACCAATCCCCGCCTCGCTACCACCGT
GGTCGAATCTTCTTCGTCGGCTTTTGTTCGCCGAGTCTCGATTGCTTCAGTTTCAGCGGATGGATTCATTTGGCAGTTCGTTGATTTCGTAATTCTTCAGGCGTTACCTG
ATTCATCATGTTATGGTAGTTCTTGTGGCGGCAATGAGGAATGTTTGCATAATAGAGATCATAATTCGAATGTGATAATGATAGGAGAGATTCCTGTGAATCTAAATCGT
AAAGGAAATGAACAGGAATCCTTGTCTCGGTTGTCTGTTGGTAAATGTGATGATTTCAAGTTGAGAAGCGATCAAAAAGGGATTCCTGCTAATATACCGAGTTCATACCA
TGATGATGAGTTTCCTCCTGTTCCTAGACAAAATACCAAAAGGAGAAGCCGGATAGATTTAGGATCGGAAAGAAGGTTGAAGAACAGTACAAGCTCATCACAAATGGAGA
GGAATGAACCATTTAGTTTCAAGAAACATCGGTCTCCGGATATTGGTTCCAAAAATTCTATTGCCACTGCCAATTTGCCTCCCATTGAATCTTTCGATATATGCTTTCCT
GAAAGAAGAGGTAAATCAAAACCCAGATATTCTTGGCAGTCTAAGGATAGGAACACTATGAAAGTAATGGAGCATGCTGATGAAGCTACAAATGGTATTGTGATGAGGCC
TGGAATGGTTTTACTGAAGCATTACATTCCTCTACATGAACAGGTTAATATTGTGAAAACTATTCAAAAACTTGGTCTTGGCCCAGGGGGATTTTACCAGCCTGGTTATA
AAGATGGTGCAAAACTCCGGCTTCAGATGATGTGTCTTGGATTGGACTGGGATCCTCAGACGAGGAAATATGCTCGTAAACGGGTTGCTGATGGTAATAAACCTCCAGAT
TTACCTCCTGAATTTGCAATTCTGGTTGGGAAAGCACTTAACGATGCACATGCCTTGATCAAGAACAATGGCGACACGAATAACATAGAAGACATACTTCCAACAATGTC
TCCTGACATATGCATTGTGAATTTCTACTCAACGAGTGGAAGACTGGGTCTGCATCAGGATCGTGATGAAAGCAGAGAGAGTCTCGTCGGCGGACTACCTGTTGTTTCCT
TTTCTTTGGGCAATTCAGCAGAGTTCTTGTATGGAGATCAAAGAGATGTCGATAAAGCAGGGAAGATTATACTGGAATCAGGTGATGTTCTAATTTTTGGTGGAGAATCT
AGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCGACGCCTAAGTTTCTGCTTGATCATACTGGTCTTCGTCCTGGGCGTCTAAATCTTACATTTAGAAAGTA
CTAAAACACAGCTGTTCCTTGTTTGTCCTGTACAAATGAATAGCTGTTTTATTTATTAATTAGATGATTATCTAAGTGGGATATTATAATTCTAGACTGTAACTGTTCTT
GGTTTCTGATTTGTCTTTAGGATACCTTTTCTGTCTTCCACAATGCCTTTCCCATGAACATCTTGTAGTTATACAAGCTAGTATTCTTGACTTTCGTTAATGTTGTTTCA
TGGGGTGTTGATTTGACCAATGGTAGACGGACAACGGTTTGGATTGAGTATAGAGATACAATAAAAATGATCTAGGGTGTGATCGGTTACTCGTCGATCAGGCCTCGGAG
TCCTTTTTTTTCTCCAATTTACCTCTACAAAAAATTAAAAGACTGAATGCATGTTGATTTCTTTGCCTCCAACGAATAGGGCTCAATAGTT
Protein sequenceShow/hide protein sequence
MAEIITPSTSRCSLVAQLGVRWNLRIRLTNPRLATTVVESSSSAFVRRVSIASVSADGFIWQFVDFVILQALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNR
KGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSIATANLPPIESFDICFP
ERRGKSKPRYSWQSKDRNTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPD
LPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGES
RHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY