; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G042400 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G042400
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationCiama_Chr02:29878366..29882953
RNA-Seq ExpressionCaUC02G042400
SyntenyCaUC02G042400
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456388.1 PREDICTED: probable prolyl 4-hydroxylase 3 isoform X2 [Cucumis melo]6.3e-14992.96Show/hide
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVSKGKYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HRL S+R TA Q SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIP EHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVPWWNELSECG+GGLSVKPKMGDALLFWSMKPD TLDPTSLHGACPVIRGNKWSCTKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

XP_011648735.2 probable prolyl 4-hydroxylase 3 [Cucumis sativus]2.6e-14791.2Show/hide
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MA+SKGKYIKLQ +KWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HR +S+RHTA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+G+SVDSRVRTSSGMFLNRGQDKII NIEKRIADFTFIP EHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVPWWNELSECG+GGLSVKPKMGDALLFWSMKPD TLDPTSLHGACPVIRGNKWSCTKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

XP_011650099.2 probable prolyl 4-hydroxylase 3 [Cucumis sativus]2.0e-13986.62Show/hide
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVS  KYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRF SPPETSH    HR +S+RHTA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVD+++GK+V+  VRTSSGMFLNRGQDKI+SNIEKRIADFTFIP EHGEGLQILHYEVGQKYDAHYD+F DE+N+K+ GQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVPWWNELS+CG+GGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKW+H
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

XP_016901368.1 PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis melo]4.8e-14188.38Show/hide
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVS GKYIKLQ KKWSTFQLSKMIMALV  LGF ML AL FFSPPETSH    HRL+S+RHTA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDS++GKSVDS VRTSSGMFLNRGQDKIISNIEKRIADFTFIP EHGE +QILHY VGQKYDAHYD+FVDEYN+K  GQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVPWWNELSECG+ GLS+KPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKW+H
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

XP_038889689.1 probable prolyl 4-hydroxylase 3 [Benincasa hispida]7.9e-15292.61Show/hide
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MA+SKGKY K+Q KKWSTF+LSKMIMALV  LGF MLLALRFFSPPETSHRNLPH LAS+RH+A++ SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIP EHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        LM  SDVEEGGETVFPAA+GNFSSVPWWNELSECG+GGLSVKPKMGDALLFWSMKPD TLDPTSLHGACPVIRGNKWSCTKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

TrEMBL top hitse value%identityAlignment
A0A1S3C2P6 probable prolyl 4-hydroxylase 3 isoform X23.0e-14992.96Show/hide
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVSKGKYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HRL S+R TA Q SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIP EHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVPWWNELSECG+GGLSVKPKMGDALLFWSMKPD TLDPTSLHGACPVIRGNKWSCTKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

A0A1S3C367 probable prolyl 4-hydroxylase 3 isoform X13.5e-13786.05Show/hide
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVSKGKYIKLQ KKWSTFQLSKMIMALV  LGF ML+ALRFFSPPETSH    HRL S+R TA Q SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDSK+GKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIP EHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMHGVLEVLCWGR
        LM  SDVEEGGETVFPAAKGNFSSVPWWNELSECG+GGLSVKPKMGDALLFWSMKPD TLDPTSLHG           C +        LCWGR
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMHGVLEVLCWGR

A0A1S4DZG7 probable prolyl 4-hydroxylase 32.3e-14188.38Show/hide
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVS GKYIKLQ KKWSTFQLSKMIMALV  LGF ML AL FFSPPETSH    HRL+S+RHTA   SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHMEKSTVVDS++GKSVDS VRTSSGMFLNRGQDKIISNIEKRIADFTFIP EHGE +QILHY VGQKYDAHYD+FVDEYN+K  GQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        LM  SDVEEGGETVFPAAKGNFSSVPWWNELSECG+ GLS+KPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKW+H
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

A0A6J1CNS9 probable prolyl 4-hydroxylase 37.7e-13784.27Show/hide
Query:  MAVSKGKY--IKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKE
        MAVSKGKY  I    +KWST +LSK+IMALV  LGF MLLALRFFSPPE+S  NLP RLAS+R  A++ S+GLGKRG+QWVE ISWEPRAF+YHNFLSKE
Subjt:  MAVSKGKY--IKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKE

Query:  ECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA
        ECLYLISLAKP M KSTV+DS++GKS+DSRVRTSSGMFL+RGQD+II NIEKRIADFTFIP EHGEGLQILHYEVGQKYDAH+DYFVDEYNIKKG QRMA
Subjt:  ECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA

Query:  TLLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        TLLM  SDVEEGGETVFPAAKGNFSSVPWWNELSECG+GGLSVKPKMGDALLFWSMKPD +LDPTSLHGACPVI+GNKWSCTKWMH
Subjt:  TLLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

A0A6J1FBR3 probable prolyl 4-hydroxylase 34.1e-13884.1Show/hide
Query:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        MAVSKGKY+K Q +KWSTF+LSK+IMA +  LG  ML+A RFFSPPE+SH NL HR+AS++H A+  SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
Subjt:  MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
        LYLISLAKPHM KSTVVD+K+GKS+DSRVRTSSGMFL RGQ+KI+SNIEKRIADFTFIP EHGE LQILHYEVGQKYDAH+DYF DE+NIK+GGQRMATL
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWM
        LM  SDVEEGGETVFPAA+GNFSS+P WNELSECG+GGLSVKPKMGDALLFWSMKPD T+DPTSLHGACPVIRGNKWSCTKWM
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWM

SwissProt top hitse value%identityAlignment
F4JNU8 Probable prolyl 4-hydroxylase 82.3e-9861.4Show/hide
Query:  KGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSH-RNLPHRLASLRHTALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEE
        K K ++ +P+K  + Q   +++ ++FV+  ++L+ L  FS P T+   ++P  L ++  T +Q  +  G      GD+W+E ISWEPRAFVYHNFL+ EE
Subjt:  KGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSH-RNLPHRLASLRHTALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMAT
        C +LISLAKP M KS VVD K+GKS+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHYEVGQ+Y+ H+DYF DE+N++KGGQR+AT
Subjt:  CLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMAT

Query:  LLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        +LM  SDV+EGGETVFPAAKGN S VPWW+ELS+CG+ GLSV PK  DALLFWSMKPD +LDP+SLHG CPVI+GNKWS TKW H
Subjt:  LLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

F4JZ24 Probable prolyl 4-hydroxylase 101.1e-10367.92Show/hide
Query:  SKMIMALVFVLGFVMLLALRF---FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD
        S ++ A++ +  FV+L+ L F     P   +  +  + L S+    LQ S     + ++WVE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD
Subjt:  SKMIMALVFVLGFVMLLALRF---FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD

Query:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA
         K+GKS DSRVRTSSG FL RG+DK I  IEKRI+DFTFIP EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+AT+LM  SDVEEGGETVFPAA
Subjt:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA

Query:  KGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWM
        KGN+S+VPWWNELSECG+GGLSVKPKMGDALLFWSM PD TLDP+SLHG C VI+GNKWS TKW+
Subjt:  KGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWM

Q24JN5 Prolyl 4-hydroxylase 51.2e-9459.38Show/hide
Query:  MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFVLGFVMLLALRFFSPPETS-HRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLS
        MA    ++++ QP+K    ST   + +I+ LV +L   +LL L   S P  + + +  + L ++   +  SS      G++WVE ISWEPRA VYHNFL+
Subjt:  MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFVLGFVMLLALRFFSPPETS-HRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLS

Query:  KEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQR
         EEC +LISLAKP M KSTVVD K+G S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP E+GEGLQ+LHY+VGQKY+ HYDYF+DE+N K GGQR
Subjt:  KEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQR

Query:  MATLLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        +AT+LM  SDV++GGETVFPAA+GN S+VPWWNELS+CG+ GLSV PK  DALLFW+M+PD +LDP+SLHG CPV++GNKWS TKW H
Subjt:  MATLLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

Q8L970 Probable prolyl 4-hydroxylase 75.5e-6355.17Show/hide
Query:  ISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHY
        +SW PR F+Y  FLS EEC + I LAK  +EKS V D+ SG+SV+S VRTSSGMFL++ QD I+SN+E ++A +TF+P E+GE +QILHYE GQKY+ H+
Subjt:  ISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHY

Query:  DYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTK
        DYF D+ N++ GG R+AT+LM  S+VE+GGETVFP  KG  + +   +  +EC + G +VKP+ GDALLF+++ P+ T D  SLHG+CPV+ G KWS T+
Subjt:  DYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTK

Query:  WMH
        W+H
Subjt:  WMH

Q9LN20 Probable prolyl 4-hydroxylase 38.0e-11570.42Show/hide
Query:  VSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRF--FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        ++K ++ + Q +KWST  L   ++ ++F+L  V+L+ L F  FS P  +  + P  L+  R  A + S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC
Subjt:  VSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRF--FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
         YLISLAKPHM KSTVVDS++GKS DSRVRTSSG FL RG+DKII  IEKRIAD+TFIP +HGEGLQ+LHYE GQKY+ HYDYFVDE+N K GGQRMAT+
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        LM  SDVEEGGETVFPAA  NFSSVPW+NELSECG+ GLSVKP+MGDALLFWSM+PD TLDPTSLHG CPVIRGNKWS TKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.7e-11670.42Show/hide
Query:  VSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRF--FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        ++K ++ + Q +KWST  L   ++ ++F+L  V+L+ L F  FS P  +  + P  L+  R  A + S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC
Subjt:  VSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRF--FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL
         YLISLAKPHM KSTVVDS++GKS DSRVRTSSG FL RG+DKII  IEKRIAD+TFIP +HGEGLQ+LHYE GQKY+ HYDYFVDE+N K GGQRMAT+
Subjt:  LYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATL

Query:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        LM  SDVEEGGETVFPAA  NFSSVPW+NELSECG+ GLSVKP+MGDALLFWSM+PD TLDPTSLHG CPVIRGNKWS TKWMH
Subjt:  LM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.5e-9659.38Show/hide
Query:  MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFVLGFVMLLALRFFSPPETS-HRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLS
        MA    ++++ QP+K    ST   + +I+ LV +L   +LL L   S P  + + +  + L ++   +  SS      G++WVE ISWEPRA VYHNFL+
Subjt:  MAVSKGKYIKLQPKK---WSTFQLSKMIMALVFVLGFVMLLALRFFSPPETS-HRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLS

Query:  KEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQR
         EEC +LISLAKP M KSTVVD K+G S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP E+GEGLQ+LHY+VGQKY+ HYDYF+DE+N K GGQR
Subjt:  KEECLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQR

Query:  MATLLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        +AT+LM  SDV++GGETVFPAA+GN S+VPWWNELS+CG+ GLSV PK  DALLFW+M+PD +LDP+SLHG CPV++GNKWS TKW H
Subjt:  MATLLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.7e-9961.4Show/hide
Query:  KGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSH-RNLPHRLASLRHTALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEE
        K K ++ +P+K  + Q   +++ ++FV+  ++L+ L  FS P T+   ++P  L ++  T +Q  +  G      GD+W+E ISWEPRAFVYHNFL+ EE
Subjt:  KGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSH-RNLPHRLASLRHTALQSSDGLGK----RGDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMAT
        C +LISLAKP M KS VVD K+GKS+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHYEVGQ+Y+ H+DYF DE+N++KGGQR+AT
Subjt:  CLYLISLAKPHMEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMAT

Query:  LLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH
        +LM  SDV+EGGETVFPAAKGN S VPWW+ELS+CG+ GLSV PK  DALLFWSMKPD +LDP+SLHG CPVI+GNKWS TKW H
Subjt:  LLM--SDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMH

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.7e-10567.92Show/hide
Query:  SKMIMALVFVLGFVMLLALRF---FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD
        S ++ A++ +  FV+L+ L F     P   +  +  + L S+    LQ S     + ++WVE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD
Subjt:  SKMIMALVFVLGFVMLLALRF---FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD

Query:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA
         K+GKS DSRVRTSSG FL RG+DK I  IEKRI+DFTFIP EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+AT+LM  SDVEEGGETVFPAA
Subjt:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA

Query:  KGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWM
        KGN+S+VPWWNELSECG+GGLSVKPKMGDALLFWSM PD TLDP+SLHG C VI+GNKWS TKW+
Subjt:  KGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWM

AT5G66060.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.2e-7764.55Show/hide
Query:  SKMIMALVFVLGFVMLLALRF---FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD
        S ++ A++ +  FV+L+ L F     P   +  +  + L S+    LQ S     + ++WVE ISWEPRA VYHNFL  EEC YLI LAKPHMEKSTVVD
Subjt:  SKMIMALVFVLGFVMLLALRF---FSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD

Query:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA
         K+GKS DSRVRTSSG FL RG+DK I  IEKRI+DFTFIP EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+AT+LM  SDVEEGGETVFPAA
Subjt:  SKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLM--SDVEEGGETVFPAA

Query:  KGNFSSVPWWNELSECGRGG
        KGN+S+VPWWNELSECG+GG
Subjt:  KGNFSSVPWWNELSECGRGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTGTCGAAAGGGAAATACATCAAGTTACAGCCCAAGAAATGGTCCACATTTCAGCTTTCGAAAATGATCATGGCCCTCGTTTTCGTACTTGGGTTTGTCATGCT
TCTTGCTCTCCGATTCTTCTCTCCTCCGGAAACTTCTCATCGGAATCTACCCCACCGTCTCGCTTCCCTCCGCCATACAGCCCTTCAAAGTAGTGATGGGTTAGGGAAGA
GAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCTCAC
ATGGAGAAATCAACTGTGGTTGATAGCAAAAGTGGCAAAAGTGTGGATAGCAGGGTGCGCACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGTAA
CATAGAGAAAAGAATAGCAGATTTTACATTCATTCCTACAGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAGTATGATGCTCATTATGATTACT
TTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTCCTCATGTCGGACGTCGAAGAAGGGGGCGAGACGGTGTTCCCGGCTGCGAAAGGAAACTTT
AGCTCTGTGCCATGGTGGAATGAACTGTCTGAATGTGGTAGAGGTGGACTCTCTGTAAAACCAAAGATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCTGATACTAC
CTTAGACCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGAGGGAACAAATGGTCATGTACAAAGTGGATGCATGGTGTACTGGAAGTACTTTGTTGGGGAAGGAGAG
AAATAGAAGACGACGGCTGA
mRNA sequenceShow/hide mRNA sequence
CACAAATCCAATCCCCCAATTACCCACCAAAAATTCTCTCCCACTTTCCTCCATGGCCAAATCTCCCAAACCCCACTTAGTATATATTCATCTTCGTTTCAATTCCAGCA
GCAGTGTTTCAGAATCAGCTTGTTTTGATTCTTTTCTAGCATCTTCGTTCTGTCAATGGCGGTGTCGAAAGGGAAATACATCAAGTTACAGCCCAAGAAATGGTCCACAT
TTCAGCTTTCGAAAATGATCATGGCCCTCGTTTTCGTACTTGGGTTTGTCATGCTTCTTGCTCTCCGATTCTTCTCTCCTCCGGAAACTTCTCATCGGAATCTACCCCAC
CGTCTCGCTTCCCTCCGCCATACAGCCCTTCAAAGTAGTGATGGGTTAGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTTTATCA
CAATTTCTTGTCCAAGGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCTCACATGGAGAAATCAACTGTGGTTGATAGCAAAAGTGGCAAAAGTGTGGATAGCAGGG
TGCGCACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGTAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCTACAGAGCATGGAGAAGGA
CTTCAAATTCTCCATTATGAAGTTGGGCAGAAGTATGATGCTCATTATGATTACTTTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTCCTCAT
GTCGGACGTCGAAGAAGGGGGCGAGACGGTGTTCCCGGCTGCGAAAGGAAACTTTAGCTCTGTGCCATGGTGGAATGAACTGTCTGAATGTGGTAGAGGTGGACTCTCTG
TAAAACCAAAGATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCTGATACTACCTTAGACCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGAGGGAACAAATGG
TCATGTACAAAGTGGATGCATGGTGTACTGGAAGTACTTTGTTGGGGAAGGAGAGAAATAGAAGACGACGGCTGATTGTTTATTTATTTATTTTTATTATATTTTTTGAG
CTTGATTTATAATTCACCCTCTGAAAATTTTCATATTTAGTTGTTAATATATAATATAGAGAAGTTCCTTTTTTCTAGGCAGAAG
Protein sequenceShow/hide protein sequence
MAVSKGKYIKLQPKKWSTFQLSKMIMALVFVLGFVMLLALRFFSPPETSHRNLPHRLASLRHTALQSSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPH
MEKSTVVDSKSGKSVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPTEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMSDVEEGGETVFPAAKGNF
SSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWMHGVLEVLCWGRREIEDDG