CuGenDBv2

CcUC02G034210 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC02G034210
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome location: CicolChr02:29999187..30004003
RNA-Seq Expression: CcUC02G034210
Synteny: CcUC02G034210
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
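
The genome location field packs a sequence name and a coordinate range into one string. The sketch below splits such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which the record does not state, and the names `Locus` and `parse_locus` are made up for this illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Split a location string such as 'CicolChr02:29999187..30004003'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(seqid, start, end)


locus = parse_locus("CicolChr02:29999187..30004003")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the reported length is the genomic span of the gene, not the length of the CDS or mRNA.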


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008456388.1 PREDICTED: probable prolyl 4-hydroxylase 3 isoform X2 [Cucumis melo]; e-value: 7.6e-147; identity: 92.96%
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVSKGKYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HRL S+R TA Q SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGGLSVKPKMGDALLFWSMKPD TLDPTSLHGACPVI GNKWSCTKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

XP_011648735.2 probable prolyl 4-hydroxylase 3 [Cucumis sativus]; e-value: 9.3e-145; identity: 90.85%
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MA+SKGKYIKLQ +KWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HR +S+R+TA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+G+SVDSRVRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGGLSVKPKMGDALLFWSMKPD TLDPTSLHGACPVI GNKWSCTKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

XP_011650099.2 probable prolyl 4-hydroxylase 3 [Cucumis sativus]; e-value: 7.2e-137; identity: 86.27%
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVS  KYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRF SPPETSH    HR +S+R+TA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVD+++GK+V+  VRTSSGMFLNRGQDKI+SNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD+F DE+N+K+ GQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVP WNELS+CGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVI GNKWSCTKW+H
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

XP_016901368.1 PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis melo]; e-value: 1.7e-138; identity: 88.03%
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVS GKYIKLQ KKWSTFQLSKMIMALV  LGF ML AL FFSPPETSH    HRL+S+R+TA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDS++GKSVDS VRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGE +QILHY VGQKYDAHYD+FVDEYN+K  GQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVP WNELSECGK GLS+KPKMGDALLFWSMKPDTTLDPTSLHGACPVI GNKWSCTKW+H
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

XP_038889689.1 probable prolyl 4-hydroxylase 3 [Benincasa hispida]; e-value: 2.1e-149; identity: 92.25%
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MA+SKGKY K+Q KKWSTF+LSKMIMALV  LGF MLLALRFFSPPETSHRNLPH LAS+R++A++ SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        LM  SDVEEGGETVFPAA+GNFSSVP WNELSECGKGGLSVKPKMGDALLFWSMKPD TLDPTSLHGACPVI GNKWSCTKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
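
The % identity column can be related to the alignment blocks through the middle (match) line, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below assumes the usual BLAST convention of identical positions divided by alignment length, gap columns included. The function name `percent_identity` and the toy strings in the usage lines are invented for illustration and are not taken from this record.

```python
from typing import Sequence


def percent_identity(query_rows: Sequence[str], midline_rows: Sequence[str]) -> float:
    """Estimate % identity from wrapped alignment rows.

    query_rows: the residue part of each 'Query:' row (prefix removed).
    midline_rows: the matching middle rows, with the same prefix width removed.
    Trailing spaces lost from a middle row are restored by padding.
    """
    if len(query_rows) != len(midline_rows):
        raise ValueError("need one middle row per query row")
    query = "".join(query_rows)
    midline = "".join(m.ljust(len(q)) for q, m in zip(query_rows, midline_rows))
    if not query or len(midline) != len(query):
        raise ValueError("rows are empty or not aligned column by column")
    # Letters on the middle line mark identities; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Toy input: 5 identical columns out of 6.
print(percent_identity(["MAVSKG"], ["MA+SKG"]))
# Toy wrapped input: the second middle row lost its trailing space.
print(percent_identity(["MAVS", "KG"], ["MA+S", "K"]))
```

The reported values in the table come from the database's own search output, so a figure recomputed this way may differ slightly if the displayed blocks do not cover the whole alignment.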

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3C2P6 probable prolyl 4-hydroxylase 3 isoform X2; e-value: 3.7e-147; identity: 92.96%
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVSKGKYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HRL S+R TA Q SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGGLSVKPKMGDALLFWSMKPD TLDPTSLHGACPVI GNKWSCTKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

A0A1S3C367 probable prolyl 4-hydroxylase 3 isoform X1; e-value: 6.5e-136; identity: 86.39%
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVSKGKYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HRL S+R TA Q SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMHGVLEVLCWGR
        LM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGGLSVKPKMGDALLFWSMKPD TLDPTSLHG           C +        LCWGR
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMHGVLEVLCWGR

A0A1S4DZG7 probable prolyl 4-hydroxylase 3; e-value: 8.3e-139; identity: 88.03%
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVS GKYIKLQ KKWSTFQLSKMIMALV  LGF ML AL FFSPPETSH    HRL+S+R+TA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDS++GKSVDS VRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGE +QILHY VGQKYDAHYD+FVDEYN+K  GQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVP WNELSECGK GLS+KPKMGDALLFWSMKPDTTLDPTSLHGACPVI GNKWSCTKW+H
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

A0A6J1CNS9 probable prolyl 4-hydroxylase 3; e-value: 2.5e-135; identity: 84.62%
Query:  MAVSKGKY--IKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKE
        MAVSKGKY  I    +KWST +LSK+IMALV  LGF MLLALRFFSPPE+S  NLP RLAS+R  A++ S+GLGKRG+QWVE ISWEPRAF+YHNFLSKE
Subjt:  MAVSKGKY--IKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKE

Query:  ECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA
        ECLYLISLAKP M KSTV+DS++GKS+DSRVRTSSGMFL+RGQD+II NIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAH+DYFVDEYNIKKG QRMA
Subjt:  ECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA

Query:  TLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        TLLM  SDVEEGGETVFPAAKGNFSSVP WNELSECGKGGLSVKPKMGDALLFWSMKPD +LDPTSLHGACPVI GNKWSCTKWMH
Subjt:  TLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

A0A6J1FBR3 probable prolyl 4-hydroxylase 3; e-value: 7.7e-137; identity: 83.75%
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVSKGKY+K Q +KWSTF+LSK+IMA +  LG  ML+A RFFSPPE+SH NL HR+AS+++ A+  SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHM KSTVVD+K+GKS+DSRVRTSSGMFL RGQ+KI+SNIEKRIADFTFIP+EHGE LQILHYEVGQKYDAH+DYF DE+NIK+GGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWM
        LM  SDVEEGGETVFPAA+GNFSS+P WNELSECGKGGLSVKPKMGDALLFWSMKPD T+DPTSLHGACPVI GNKWSCTKWM
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWM

SwissProt top hits (e-value, % identity, alignment)
F4JNU8 Probable prolyl 4-hydroxylase 8; e-value: 3.7e-96; identity: 61.05%
Query:  KGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSH-RNLPHRLASLRYTALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEE
        K K ++ +P+K  + Q   +++ ++F++  ++L+ L  FS P T+   ++P  L ++  T +Q  +  G      GD+W+E ISWEPRAFVYHNFL+ EE
Subjt:  KGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSH-RNLPHRLASLRYTALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMAT
        C +LISLAKP M KS VVD K+GKS+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHYEVGQ+Y+ H+DYF DE+N++KGGQR+AT
Subjt:  CLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMAT

Query:  LLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        +LM  SDV+EGGETVFPAAKGN S VP W+ELS+CGK GLSV PK  DALLFWSMKPD +LDP+SLHG CPVI GNKWS TKW H
Subjt:  LLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

F4JZ24 Probable prolyl 4-hydroxylase 10; e-value: 4.6e-102; identity: 67.92%
Query:  SKMIMALVFILGFVMLLALRF---FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD
        S ++ A++ +  FV+L+ L F     P   +  +  + L S+    LQ S     + ++WVE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD
Subjt:  SKMIMALVFILGFVMLLALRF---FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD

Query:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA
         K+GKS DSRVRTSSG FL RG+DK I  IEKRI+DFTFIP+EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+AT+LM  SDVEEGGETVFPAA
Subjt:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA

Query:  KGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWM
        KGN+S+VP WNELSECGKGGLSVKPKMGDALLFWSM PD TLDP+SLHG C VI GNKWS TKW+
Subjt:  KGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWM

Q24JN5 Prolyl 4-hydroxylase 5; e-value: 3.9e-93; identity: 59.72%
Query:  MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFILGFVMLLALRFFSPPETS-HRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLS
        MA    ++++ QP+K    ST   + +I+ LV IL   +LL L   S P  + + +  + L ++   +  SS      G++WVE ISWEPRA VYHNFL+
Subjt:  MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFILGFVMLLALRFFSPPETS-HRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLS

Query:  KEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQR
         EEC +LISLAKP M KSTVVD K+G S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP+E+GEGLQ+LHY+VGQKY+ HYDYF+DE+N K GGQR
Subjt:  KEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQR

Query:  MATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        +AT+LM  SDV++GGETVFPAA+GN S+VP WNELS+CGK GLSV PK  DALLFW+M+PD +LDP+SLHG CPV+ GNKWS TKW H
Subjt:  MATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

Q8L970 Probable prolyl 4-hydroxylase 7; e-value: 4.2e-63; identity: 55.67%
Query:  ISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHY
        +SW PR F+Y  FLS EEC + I LAK  +EKS V D+ SG+SV+S VRTSSGMFL++ QD I+SN+E ++A +TF+P E+GE +QILHYE GQKY+ H+
Subjt:  ISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHY

Query:  DYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTK
        DYF D+ N++ GG R+AT+LM  S+VE+GGETVFP  KG  + +   +  +EC K G +VKP+ GDALLF+++ P+ T D  SLHG+CPV+ G KWS T+
Subjt:  DYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTK

Query:  WMH
        W+H
Subjt:  WMH

Q9LN20 Probable prolyl 4-hydroxylase 3; e-value: 2.9e-112; identity: 70.07%
Query:  VSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRF--FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        ++K ++ + Q +KWST  L   ++ ++F+L  V+L+ L F  FS P  +  + P  L+  R  A + S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC
Subjt:  VSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRF--FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
         YLISLAKPHM KSTVVDS++GKS DSRVRTSSG FL RG+DKII  IEKRIAD+TFIP +HGEGLQ+LHYE GQKY+ HYDYFVDE+N K GGQRMAT+
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        LM  SDVEEGGETVFPAA  NFSSVP +NELSECGK GLSVKP+MGDALLFWSM+PD TLDPTSLHG CPVI GNKWS TKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

Arabidopsis top hits (e-value, % identity, alignment)
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e-value: 2.0e-113; identity: 70.07%
Query:  VSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRF--FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        ++K ++ + Q +KWST  L   ++ ++F+L  V+L+ L F  FS P  +  + P  L+  R  A + S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC
Subjt:  VSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRF--FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
         YLISLAKPHM KSTVVDS++GKS DSRVRTSSG FL RG+DKII  IEKRIAD+TFIP +HGEGLQ+LHYE GQKY+ HYDYFVDE+N K GGQRMAT+
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        LM  SDVEEGGETVFPAA  NFSSVP +NELSECGK GLSVKP+MGDALLFWSM+PD TLDPTSLHG CPVI GNKWS TKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e-value: 2.8e-94; identity: 59.72%
Query:  MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFILGFVMLLALRFFSPPETS-HRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLS
        MA    ++++ QP+K    ST   + +I+ LV IL   +LL L   S P  + + +  + L ++   +  SS      G++WVE ISWEPRA VYHNFL+
Subjt:  MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFILGFVMLLALRFFSPPETS-HRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLS

Query:  KEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQR
         EEC +LISLAKP M KSTVVD K+G S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP+E+GEGLQ+LHY+VGQKY+ HYDYF+DE+N K GGQR
Subjt:  KEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQR

Query:  MATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        +AT+LM  SDV++GGETVFPAA+GN S+VP WNELS+CGK GLSV PK  DALLFW+M+PD +LDP+SLHG CPV+ GNKWS TKW H
Subjt:  MATLLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e-value: 2.7e-97; identity: 61.05%
Query:  KGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSH-RNLPHRLASLRYTALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEE
        K K ++ +P+K  + Q   +++ ++F++  ++L+ L  FS P T+   ++P  L ++  T +Q  +  G      GD+W+E ISWEPRAFVYHNFL+ EE
Subjt:  KGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSH-RNLPHRLASLRYTALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMAT
        C +LISLAKP M KS VVD K+GKS+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHYEVGQ+Y+ H+DYF DE+N++KGGQR+AT
Subjt:  CLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMAT

Query:  LLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH
        +LM  SDV+EGGETVFPAAKGN S VP W+ELS+CGK GLSV PK  DALLFWSMKPD +LDP+SLHG CPVI GNKWS TKW H
Subjt:  LLM--SDVEEGGETVFPAAKGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMH

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e-value: 3.2e-103; identity: 67.92%
Query:  SKMIMALVFILGFVMLLALRF---FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD
        S ++ A++ +  FV+L+ L F     P   +  +  + L S+    LQ S     + ++WVE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD
Subjt:  SKMIMALVFILGFVMLLALRF---FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD

Query:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA
         K+GKS DSRVRTSSG FL RG+DK I  IEKRI+DFTFIP+EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+AT+LM  SDVEEGGETVFPAA
Subjt:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA

Query:  KGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWM
        KGN+S+VP WNELSECGKGGLSVKPKMGDALLFWSM PD TLDP+SLHG C VI GNKWS TKW+
Subjt:  KGNFSSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWM

AT5G66060.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e-value: 7.5e-76; identity: 64.55%
Query:  SKMIMALVFILGFVMLLALRF---FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD
        S ++ A++ +  FV+L+ L F     P   +  +  + L S+    LQ S     + ++WVE ISWEPRA VYHNFL  EEC YLI LAKPHMEKSTVVD
Subjt:  SKMIMALVFILGFVMLLALRF---FSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD

Query:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA
         K+GKS DSRVRTSSG FL RG+DK I  IEKRI+DFTFIP+EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+AT+LM  SDVEEGGETVFPAA
Subjt:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA

Query:  KGNFSSVPQWNELSECGKGG
        KGN+S+VP WNELSECGKGG
Subjt:  KGNFSSVPQWNELSECGKGG


Sequences
CDS sequence
ATGGCGGTATCGAAAGGGAAATACATCAAGTTACAGCCCAAGAAATGGTCCACATTTCAGCTTTCGAAAATGATCATGGCCCTCGTTTTCATACTTGGGTTTGTCATGCT
TCTTGCTCTCCGGTTCTTCTCTCCTCCGGAAACTTCTCATCGGAATCTACCCCACCGTCTTGCTTCCCTCCGATATACAGCCCTTCAAAGTAGTGATGGGTTAGGGAAGA
GAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCTCAC
ATGGAGAAATCAACTGTGGTTGATAGCAAAAGTGGCAAAAGTGTGGATAGCAGGGTGCGCACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGTAA
CATAGAGAAAAGAATAGCAGATTTTACATTCATTCCTATTGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAGTATGATGCTCATTATGATTACT
TTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTCCTCATGTCGGACGTCGAAGAAGGGGGCGAGACGGTGTTCCCAGCTGCGAAAGGAAACTTT
AGCTCTGTGCCACAGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTCTCTGTAAAACCAAAGATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCTGATACTAC
CTTAGACCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGTGGGAACAAATGGTCATGTACAAAGTGGATGCATGGTGTACTGGAAGTACTTTGTTGGGGAAGGAGAG
AAATAGAAGACGACGGCTGA
mRNA sequence
CACAAATCCAATCCCCCAATTACCCACCAAAAATTCTCTCCCACTTTCCTCCATGGCCAAATCTCCCAAACCCCACTTAGTATATATTCATCTTCGTTTCAATTCCAGCA
GCAGTGTTTCAGAATCAGCTTGTTTTGATTCTTTTCTAGCATCTTCGTTCTGTCAATGGCGGTATCGAAAGGGAAATACATCAAGTTACAGCCCAAGAAATGGTCCACAT
TTCAGCTTTCGAAAATGATCATGGCCCTCGTTTTCATACTTGGGTTTGTCATGCTTCTTGCTCTCCGGTTCTTCTCTCCTCCGGAAACTTCTCATCGGAATCTACCCCAC
CGTCTTGCTTCCCTCCGATATACAGCCCTTCAAAGTAGTGATGGGTTAGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTTTATCA
CAATTTCTTGTCCAAGGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCTCACATGGAGAAATCAACTGTGGTTGATAGCAAAAGTGGCAAAAGTGTGGATAGCAGGG
TGCGCACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGTAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCTATTGAGCATGGAGAAGGA
CTTCAAATTCTCCATTATGAAGTTGGGCAGAAGTATGATGCTCATTATGATTACTTTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTCCTCAT
GTCGGACGTCGAAGAAGGGGGCGAGACGGTGTTCCCAGCTGCGAAAGGAAACTTTAGCTCTGTGCCACAGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTCTCTG
TAAAACCAAAGATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCTGATACTACCTTAGACCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGTGGGAACAAATGG
TCATGTACAAAGTGGATGCATGGTGTACTGGAAGTACTTTGTTGGGGAAGGAGAGAAATAGAAGACGACGGCTGATTGTTTATTTATTTTTTTATTATATTTTTGAGCTT
GATTTATAATTCACCCTCTGAAAATTTTCATATTTAGTTGTTAATATATACATAGAGAAGTTCCTTTTTTCTAGGCAGAAG
Protein sequence
MAVSKGKYIKLQPKKWSTFQLSKMIMALVFILGFVMLLALRFFSPPETSHRNLPHRLASLRYTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPH
MEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMSDVEEGGETVFPAAKGNF
SSVPQWNELSECGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVISGNKWSCTKWMHGVLEVLCWGRREIEDDG
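
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is a minimal translator that assumes the standard nuclear genetic code. The record does not say which table was used, and `translate` is a name chosen for this example. Unknown or ambiguous codons are reported as `X`, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line wraps and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and the last five codons of the CDS listed above.
print(translate("ATGGCGGTATCGAAAGGGAAATAC"))
print(translate("GAAGACGACGGCTGA"))
```

Passing the full CDS, with its line wraps left in, to `translate` and comparing the result with the protein sequence gives a quick consistency check on the record.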