; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G012800 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G012800
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationCmo_Chr07:7043393..7046188
RNA-Seq ExpressionCmoCh07G012800
SyntenyCmoCh07G012800
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia]1.5e-22779.92Show/hide
Query:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
        MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQR+DSFGSS    ALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
Subjt:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC

Query:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS
        DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS
Subjt:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS

Query:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG
        K RYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHE                                                        
Subjt:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG

Query:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
                        QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
Subjt:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG

Query:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
        DTNNIEDILPT+                          ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
Subjt:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS

Query:  TPKFLLDHTGLRPGRLNLTFRKY
        TPKFLLDHTGLRPGRLNLTFRKY
Subjt:  TPKFLLDHTGLRPGRLNLTFRKY

KAG7027547.1 alkB, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-20778.03Show/hide
Query:  FGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRR
        F   V  +ALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRR
Subjt:  FGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRR

Query:  SRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKSKPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLL
        SRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNS+ATANLPPIESFDICFPERRGKSKPRYSWQSKDR+TMKVMEHADEATNGIVMRPGMVLL
Subjt:  SRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKSKPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLL

Query:  KHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRGDVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQ
        KHYIPLHE                                                                        QVNIVKTIQKLGLGPGGFYQ
Subjt:  KHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRGDVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQ

Query:  PGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI------------------------
        PGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPT+                        
Subjt:  PGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI------------------------

Query:  --ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
          ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
Subjt:  --ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

XP_022924913.1 uncharacterized protein LOC111432318 [Cucurbita moschata]6.0e-22980.31Show/hide
Query:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
        MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSS    ALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
Subjt:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC

Query:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS
        DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS
Subjt:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS

Query:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG
        KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHE                                                        
Subjt:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG

Query:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
                        QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
Subjt:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG

Query:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
        DTNNIEDILPT+                          ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
Subjt:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS

Query:  TPKFLLDHTGLRPGRLNLTFRKY
        TPKFLLDHTGLRPGRLNLTFRKY
Subjt:  TPKFLLDHTGLRPGRLNLTFRKY

XP_022966314.1 uncharacterized protein LOC111466008 [Cucurbita maxima]1.9e-21476.1Show/hide
Query:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
        M LIRTVPASLPPWSNLLR+LLFAESRLLQFQR+DSFGSS    ALP+SSCYGSS GGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
Subjt:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC

Query:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS
        DDFKLRSDQKGIPANIPS YHDDEFPPVPRQNTKRRSRID GSERRLKNSTSSSQM+RNEPFSF KHRS DIGSKNSLATANLPPIESFDICFPERRGKS
Subjt:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS

Query:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG
        KPR SWQ KDRDTMKVMEH DEATNGIVMRPGMVLLKHYIPLHE                                                        
Subjt:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG

Query:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
                        QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
Subjt:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG

Query:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
        D N IEDILPT+                          ESLV GLPVVSFSLGNSA FLYGD+R+VDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
Subjt:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS

Query:  TPKFLLDHTGLRPGRLNLTFRKY
         PKFLLDHTG RPG LNLTFRKY
Subjt:  TPKFLLDHTGLRPGRLNLTFRKY

XP_023517205.1 uncharacterized protein LOC111781040 [Cucurbita pepo subsp. pepo]3.7e-22378.97Show/hide
Query:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
        M LIRTVPASLPP SNLLRRLLFAESRLLQFQR+DSFGSS    ALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
Subjt:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC

Query:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS
        DDFKLRSDQK IPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSF KHRS DIGSKNSLATANLPPIESFDICFPERRGKS
Subjt:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS

Query:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG
        KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHE                                                        
Subjt:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG

Query:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
                        QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
Subjt:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG

Query:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
        DTNNIEDILPT+                          ESLV GLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
Subjt:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS

Query:  TPKFLLDHTGLRPGRLNLTFRKY
        TPKFLLDHTGLRPGRLNLTFRKY
Subjt:  TPKFLLDHTGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A1S4E4K6 uncharacterized protein LOC1035021832.4e-12249.55Show/hide
Query:  MLLIRTVPASLPPWSNLLRRLLFAES--------RLLQFQRVDSFGSSVSKRALPDSSCYGSS--CGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQE
        M  IRT+P    P SN LRRLLF  S         LLQFQR+DSF SS +  A PDSSC G+S  CG ++E L +RD+ S+VI +G   V+LN K  E +
Subjt:  MLLIRTVPASLPPWSNLLRRLLFAES--------RLLQFQRVDSFGSSVSKRALPDSSCYGSS--CGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQE

Query:  SLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFS---------------------FKKHRS
        SL+ LS  KCD  ++ SD+ GI +N P SYH DEF PV RQNT RR+RIDLGS+R LK++  S Q+ER+E F+                     F K +S
Subjt:  SLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFS---------------------FKKHRS

Query:  PDIGSKNSLATANLPPIE-SFDICFPERRGKSKPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSV
         DIGSK S+ T +  P E  FDICFP   G  K R  W+ KD  T+K            ++RPGMVLLKHYI   E                        
Subjt:  PDIGSKNSLATANLPPIE-SFDICFPERRGKSKPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSV

Query:  NHKLALTMSLSFVHTGHRSYRLNGLVKCSDRGDVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRV
                                                        Q+NIVKT QKLGLGPGGFYQP YKDGAKLRL+MMCLGLDWDPQTR+Y  KRV
Subjt:  NHKLALTMSLSFVHTGHRSYRLNGLVKCSDRGDVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRV

Query:  ADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDK
         DGNKPPD+PP F+ LV  AL DAHA IKN  + +N+EDILP++                          ESL  GLPVVSFS+GN+AEFLYGD+RDV+K
Subjt:  ADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDK

Query:  AGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        A K+ LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  AGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 13.4e-11349.51Show/hide
Query:  VSKRALPDSSCYGSS--CGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSR
        ++  A PDSSC G+S  CG ++E L +RD+ S+VI +G   V+LN K  E +SL+ LSV KCD  ++ SD+ GI +N P SYH DE  PV RQNT RR+R
Subjt:  VSKRALPDSSCYGSS--CGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSR

Query:  IDLGSERRLKNSTSSSQMERNEPFS---------------------FKKHRSPDIGSKNSLATANLPPIE-SFDICFPERRGKSKPRYSWQSKDRDTMKV
        IDLGS+R LK++  S Q+ER+E  +                     F K +S DIGSK S+ T + PP E  FDICFP   G  K R  W+ KD  T+K 
Subjt:  IDLGSERRLKNSTSSSQMERNEPFS---------------------FKKHRSPDIGSKNSLATANLPPIE-SFDICFPERRGKSKPRYSWQSKDRDTMKV

Query:  MEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRGDVGGSLPATLVSFFLL
                   ++RPGMVLLKHYI   E                                                                        
Subjt:  MEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRGDVGGSLPATLVSFFLL

Query:  QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI----
        Q+NIVKT QKLGLGPGGFYQPGYKDGAKLRL+MMCLGLDWDPQTR+Y  KRV DGNKPPD+PP F+ LV  AL DAHA IKN  + +N+EDILP++    
Subjt:  QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI----

Query:  ----------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRL
                              ESL  GLPVVSFS+GN+AEFLYGD+RDVDKA K+ LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL HTGLRPGRL
Subjt:  ----------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRL

Query:  NLTFRKY
        NLTFRKY
Subjt:  NLTFRKY

A0A6J1CQI1 uncharacterized protein LOC1110138271.9e-12444.09Show/hide
Query:  MLLIRTVPASLPPWSNLLRRLLFA--------ESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESL
        M +IRTVP S  P SN L RLLFA         SRLLQF+R+DS  +S             S  G   E  HNR H+S+++M+GEIPV LNRK  E+ES 
Subjt:  MLLIRTVPASLPPWSNLLRRLLFA--------ESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESL

Query:  SRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRL---------------------------------------------
        S  SV K DDF+L  ++K  PAN+P+SYHDD+F PV RQN K RSR+DLG ER +                                             
Subjt:  SRLSVGKCDDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRL---------------------------------------------

Query:  --------------------------------------------------------------KNSTSSSQME----------------------RNEPFS
                                                                       N+TSS Q+E                      +NEPF 
Subjt:  --------------------------------------------------------------KNSTSSSQME----------------------RNEPFS

Query:  FKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKSKPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEE
         +K +S DIGSKNSL   NL P E FDIC  ERRG +KP   WQ K RDT+KVMEH  EA+N  V+RPGMVLLK+YI LHE                   
Subjt:  FKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKSKPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEE

Query:  LLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRGDVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKY
                                                             QVNIVKT Q+LG+GPGGFY+PGYKDGAKLRLQMMCLGLDWDPQTRKY
Subjt:  LLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRGDVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKY

Query:  ARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQ
          KR  DG+KPP++PP+FAILV +AL DAHALIKN  +T N+E ILP++                          ESLV GLPVVS SLG+SAEFLYGD+
Subjt:  ARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQ

Query:  RDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY
        RDVDKA K+ILESGDVLIFGG+SRH+FHGVSSIIP STPKFLLDHTGLRPGRLNLTFRKY
Subjt:  RDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC1114323182.9e-22980.31Show/hide
Query:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
        MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSS    ALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
Subjt:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC

Query:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS
        DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS
Subjt:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS

Query:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG
        KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHE                                                        
Subjt:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG

Query:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
                        QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
Subjt:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG

Query:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
        DTNNIEDILPT+                          ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
Subjt:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS

Query:  TPKFLLDHTGLRPGRLNLTFRKY
        TPKFLLDHTGLRPGRLNLTFRKY
Subjt:  TPKFLLDHTGLRPGRLNLTFRKY

A0A6J1HTF0 uncharacterized protein LOC1114660089.0e-21576.1Show/hide
Query:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
        M LIRTVPASLPPWSNLLR+LLFAESRLLQFQR+DSFGSS    ALP+SSCYGSS GGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC
Subjt:  MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKC

Query:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS
        DDFKLRSDQKGIPANIPS YHDDEFPPVPRQNTKRRSRID GSERRLKNSTSSSQM+RNEPFSF KHRS DIGSKNSLATANLPPIESFDICFPERRGKS
Subjt:  DDFKLRSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKS

Query:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG
        KPR SWQ KDRDTMKVMEH DEATNGIVMRPGMVLLKHYIPLHE                                                        
Subjt:  KPRYSWQSKDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRG

Query:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
                        QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG
Subjt:  DVGGSLPATLVSFFLLQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG

Query:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
        D N IEDILPT+                          ESLV GLPVVSFSLGNSA FLYGD+R+VDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS
Subjt:  DTNNIEDILPTI--------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKS

Query:  TPKFLLDHTGLRPGRLNLTFRKY
         PKFLLDHTG RPG LNLTFRKY
Subjt:  TPKFLLDHTGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog9.2e-0728.74Show/hide
Query:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYARKRVADGNKPPDLPP---EFAILVGKALNDAHALIKNNGDTNNIEDILPTIESLVGGLPVVSF
        P   Y+  Y  G  + + M  LG L W    R  +Y  +    G   PD+PP   +   ++G       + + N         +    +      PV+S 
Subjt:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYARKRVADGNKPPDLPP---EFAILVGKALNDAHALIKNNGDTNNIEDILPTIESLVGGLPVVSF

Query:  SLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK
        SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  SLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh11.8e-1034.25Show/hide
Query:  RLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALND--------AHALIKN---NGDTNNIEDILPTIESLVGGLPVVSFSLGNSAEFLY
        +L+ + LG  +D  T++Y      D +K P  P +    V K + +        A A I N    GDT +   I  + E L   LP++S S+G    +L 
Subjt:  RLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALND--------AHALIKN---NGDTNNIEDILPTIESLVGGLPVVSFSLGNSAEFLY

Query:  GDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        G +   +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Subjt:  GDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB1.2e-0628.78Show/hide
Query:  PQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTIESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFG
        PQ+     +R A     PD  P+ A L+ +    A   +  + D  ++              P+VS SLG  A F +G  +  D   +++LE GDV+++G
Subjt:  PQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTIESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFG

Query:  GESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRK
        GESR  +HG+  +     P  +         R NLTFR+
Subjt:  GESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog9.2e-0728.74Show/hide
Query:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYARKRVADGNKPPDLPP---EFAILVGKALNDAHALIKNNGDTNNIEDILPTIESLVGGLPVVSF
        P   Y+  Y  G  + + M  LG L W    R  +Y  +    G   PD+PP   +   ++G       + + N         +    +      PV+S 
Subjt:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYARKRVADGNKPPDLPP---EFAILVGKALNDAHALIKNNGDTNNIEDILPTIESLVGGLPVVSF

Query:  SLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK
        SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        + L P  GR+NLT R+
Subjt:  SLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB5.4e-0741.56Show/hide
Query:  PVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRK
        P+VS SLG  A F +G  R  D   +I+LE GD++++GGESR  +HG+  +     P      TG    R NLTFR+
Subjt:  PVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRK

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.0e-0545.1Show/hide
Query:  PVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSI
        P+VS SLG  A FL G +   D    + L SGDV++  GE+R  FHG+  I
Subjt:  PVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein4.5e-4144.28Show/hide
Query:  QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI----
        QV IV   ++LGLG GGFYQPG++DG  L L+MMCLG +WD QTR+Y   R  DG+ PP +P EF+ LV KA+ ++ +L+  N +     D +P +    
Subjt:  QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI----

Query:  -------------------------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSS
                                                   +SL  GLP+VSFS+G+SAEFLYGDQ+DVDKA  +ILESGDVLIFG  SR++FHGV S
Subjt:  -------------------------------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSS

Query:  I
        I
Subjt:  I

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein7.4e-5251.21Show/hide
Query:  QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDT----NNIEDILPTI
        QV IV   ++LGLG GGFYQPGY+D AKL L+MMCLG +WDP+T +Y   R  DG+  P +P EF   V KA+ ++ +L  +N       + I  +LP I
Subjt:  QVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDT----NNIEDILPTI

Query:  ----------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRL
                               S+  GLPVVSFS+G+SAEFLYGDQRD DKA  + LESGDVL+FGG SR +FHGV SI   + PK LL  T LRPGRL
Subjt:  ----------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRL

Query:  NLTFRKY
        NLTFR+Y
Subjt:  NLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein7.1e-5553.85Show/hide
Query:  LQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI---
        +QV+IVKT ++LG+ P GFYQPGY  G+KL LQMMCLG +WDPQT KY RK     +K P++P  F +LV KA+ +AHALI     T + E ILP +   
Subjt:  LQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI---

Query:  -----------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGR
                               ES+  GLP+VSFS+G+SAEFLYG++RDV++A  +ILESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GR
Subjt:  -----------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGR

Query:  LNLTFRKY
        LNLTFR +
Subjt:  LNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein7.1e-5553.85Show/hide
Query:  LQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI---
        +QV+IVKT ++LG+ P GFYQPGY  G+KL LQMMCLG +WDPQT KY RK     +K P++P  F +LV KA+ +AHALI     T + E ILP +   
Subjt:  LQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTI---

Query:  -----------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGR
                               ES+  GLP+VSFS+G+SAEFLYG++RDV++A  +ILESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GR
Subjt:  -----------------------ESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGR

Query:  LNLTFRKY
        LNLTFR +
Subjt:  LNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTTGATCCGAACAGTCCCCGCCTCGCTACCACCGTGGTCGAATCTTCTTCGTCGGCTTTTGTTCGCCGAGTCTCGATTGCTTCAGTTTCAGCGGGTGGATTCATT
TGGCAGTTCGGTAAGTAAGCGAGCGTTACCTGATTCATCATGTTATGGTAGTTCTTGTGGCGGCAATGAGGAATGTTTGCATAATAGAGATCATAATTCGAATGTGATAA
TGATAGGAGAGATTCCTGTGAATCTAAATCGTAAAGGAAATGAACAGGAATCCTTGTCTCGGTTGTCTGTTGGTAAATGTGATGATTTCAAGTTGAGAAGCGATCAAAAA
GGGATTCCTGCTAATATACCGAGTTCATACCATGATGATGAGTTTCCTCCTGTTCCTAGACAAAATACCAAAAGGAGAAGCCGGATAGATTTAGGATCGGAAAGAAGGTT
GAAGAACAGTACAAGCTCATCACAAATGGAGAGGAATGAACCATTTAGTTTCAAGAAACATCGGTCTCCGGATATTGGTTCCAAAAATTCTCTTGCCACTGCCAATTTGC
CTCCCATTGAATCTTTCGATATATGCTTTCCTGAAAGAAGAGGTAAATCAAAACCCAGATATTCTTGGCAGTCTAAGGATAGGGACACTATGAAAGTAATGGAGCATGCT
GATGAAGCTACAAATGGTATTGTGATGAGGCCTGGAATGGTTTTACTGAAGCATTACATTCCTCTACATGAACAGGTACTTCCATGCTTAGGTTGTAAATCATTGCAATT
GTTCGTCTTCAACGAAGAACTCCTAGTAAGCGTGAATCATAAGCTCGCGTTGACTATGTCCTTGTCCTTTGTACATACCGGCCATCGCTCCTACCGATTGAATGGTCTGG
TGAAGTGTTCGGATCGCGGCGATGTGGGCGGTTCGCTGCCTGCAACGTTGGTTTCGTTTTTCTTGTTGCAGGTTAATATTGTGAAAACTATTCAAAAACTTGGTCTTGGC
CCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGTGCAAAACTCCGGCTTCAGATGATGTGTCTTGGATTGGACTGGGATCCTCAGACGAGGAAATATGCTCGTAAACG
GGTTGCTGATGGTAATAAACCTCCAGATTTACCTCCTGAATTTGCAATTCTGGTTGGGAAAGCACTTAACGATGCACATGCCTTGATCAAGAACAATGGCGACACGAATA
ACATAGAAGACATACTTCCAACAATAGAGAGTCTGGTCGGCGGACTACCTGTTGTTTCCTTTTCTTTGGGCAATTCAGCAGAGTTCTTGTATGGAGATCAAAGAGATGTC
GATAAAGCAGGGAAGATTATACTGGAATCAGGTGATGTTCTAATTTTTGGTGGAGAATCTAGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCGACGCCTAA
GTTTCTGCTTGATCATACTGGTCTTCGTCCTGGGCGTCTAAATCTTACATTTAGAAAGTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGTTGATCCGAACAGTCCCCGCCTCGCTACCACCGTGGTCGAATCTTCTTCGTCGGCTTTTGTTCGCCGAGTCTCGATTGCTTCAGTTTCAGCGGGTGGATTCATT
TGGCAGTTCGGTAAGTAAGCGAGCGTTACCTGATTCATCATGTTATGGTAGTTCTTGTGGCGGCAATGAGGAATGTTTGCATAATAGAGATCATAATTCGAATGTGATAA
TGATAGGAGAGATTCCTGTGAATCTAAATCGTAAAGGAAATGAACAGGAATCCTTGTCTCGGTTGTCTGTTGGTAAATGTGATGATTTCAAGTTGAGAAGCGATCAAAAA
GGGATTCCTGCTAATATACCGAGTTCATACCATGATGATGAGTTTCCTCCTGTTCCTAGACAAAATACCAAAAGGAGAAGCCGGATAGATTTAGGATCGGAAAGAAGGTT
GAAGAACAGTACAAGCTCATCACAAATGGAGAGGAATGAACCATTTAGTTTCAAGAAACATCGGTCTCCGGATATTGGTTCCAAAAATTCTCTTGCCACTGCCAATTTGC
CTCCCATTGAATCTTTCGATATATGCTTTCCTGAAAGAAGAGGTAAATCAAAACCCAGATATTCTTGGCAGTCTAAGGATAGGGACACTATGAAAGTAATGGAGCATGCT
GATGAAGCTACAAATGGTATTGTGATGAGGCCTGGAATGGTTTTACTGAAGCATTACATTCCTCTACATGAACAGGTACTTCCATGCTTAGGTTGTAAATCATTGCAATT
GTTCGTCTTCAACGAAGAACTCCTAGTAAGCGTGAATCATAAGCTCGCGTTGACTATGTCCTTGTCCTTTGTACATACCGGCCATCGCTCCTACCGATTGAATGGTCTGG
TGAAGTGTTCGGATCGCGGCGATGTGGGCGGTTCGCTGCCTGCAACGTTGGTTTCGTTTTTCTTGTTGCAGGTTAATATTGTGAAAACTATTCAAAAACTTGGTCTTGGC
CCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGTGCAAAACTCCGGCTTCAGATGATGTGTCTTGGATTGGACTGGGATCCTCAGACGAGGAAATATGCTCGTAAACG
GGTTGCTGATGGTAATAAACCTCCAGATTTACCTCCTGAATTTGCAATTCTGGTTGGGAAAGCACTTAACGATGCACATGCCTTGATCAAGAACAATGGCGACACGAATA
ACATAGAAGACATACTTCCAACAATAGAGAGTCTGGTCGGCGGACTACCTGTTGTTTCCTTTTCTTTGGGCAATTCAGCAGAGTTCTTGTATGGAGATCAAAGAGATGTC
GATAAAGCAGGGAAGATTATACTGGAATCAGGTGATGTTCTAATTTTTGGTGGAGAATCTAGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCGACGCCTAA
GTTTCTGCTTGATCATACTGGTCTTCGTCCTGGGCGTCTAAATCTTACATTTAGAAAGTACTAA
Protein sequenceShow/hide protein sequence
MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSVSKRALPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQK
GIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNEPFSFKKHRSPDIGSKNSLATANLPPIESFDICFPERRGKSKPRYSWQSKDRDTMKVMEHA
DEATNGIVMRPGMVLLKHYIPLHEQVLPCLGCKSLQLFVFNEELLVSVNHKLALTMSLSFVHTGHRSYRLNGLVKCSDRGDVGGSLPATLVSFFLLQVNIVKTIQKLGLG
PGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTIESLVGGLPVVSFSLGNSAEFLYGDQRDV
DKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY