; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G03790 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G03790
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionprolyl 4-hydroxylase 1
Genome locationChr4:2348323..2359451
RNA-Seq ExpressionCSPI04G03790
SyntenyCSPI04G03790
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0000137 - Golgi cis cisterna (cellular component)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152082.1 prolyl 4-hydroxylase 1 [Cucumis sativus]3.6e-165100Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP
        RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP

XP_008453925.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo]1.5e-16096.55Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MVS+QMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHK QYDSQ QLPRG PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        +ECDYLKGIAL RLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISVYSQ+PVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP
        RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDA+LFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP

XP_016901567.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X1 [Cucumis melo]4.6e-15790.61Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPR-------------------GFPNWINDKEAEILRLGYVK
        MVS+QMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHK QYDSQ QLPR                   G PNWINDKEAEILRLGYVK
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPR-------------------GFPNWINDKEAEILRLGYVK

Query:  PEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF
        PEVVSWSPRIIVLHNFLST+ECDYLKGIAL RLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISVYSQ+PVENGELIQVLRYEKNQF
Subjt:  PEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF

Query:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKW
        YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDA+LFWSMGLDGQSDP SIHGGCEVLSGEKWSATKW
Subjt:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKW

Query:  MRQKSTLVP
        MRQKSTLVP
Subjt:  MRQKSTLVP

XP_022137963.1 prolyl 4-hydroxylase 1 [Momordica charantia]9.1e-15390Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M S+ MRIVFGLLTFVT+GMIIGAL QLAF+RRLEDS GTEFL AGRLHK QYD   QLPRGFPNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        +ECDYL+ +AL RLE+STVVDTKTGKGVKSDFRTSSGMFLSH EKN+PM+QAIEKRISVYSQ+P+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP
        R+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP

XP_038904320.1 prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida]8.8e-15694.48Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M S+ MRIVFGLLTFVTVGMIIGALLQLAF+RRLEDSIGTEFL AGRLHK QYDSQ QL RG PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        +ECDYLK IAL RLEISTVVDTKTGKGVKSDFRTSSGMFLSH EKN+PMVQAIEKRISVYSQ+PVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP
        RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP

TrEMBL top hitse value%identityAlignment
A0A0A0KU17 Fe2OG dioxygenase domain-containing protein1.7e-165100Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP
        RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP

A0A1S3BXE6 prolyl 4-hydroxylase 1 isoform X37.5e-16196.55Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MVS+QMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHK QYDSQ QLPRG PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        +ECDYLKGIAL RLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISVYSQ+PVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP
        RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDA+LFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP

A0A1S4DZZ9 prolyl 4-hydroxylase 1 isoform X12.3e-15790.61Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPR-------------------GFPNWINDKEAEILRLGYVK
        MVS+QMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHK QYDSQ QLPR                   G PNWINDKEAEILRLGYVK
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPR-------------------GFPNWINDKEAEILRLGYVK

Query:  PEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF
        PEVVSWSPRIIVLHNFLST+ECDYLKGIAL RLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISVYSQ+PVENGELIQVLRYEKNQF
Subjt:  PEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF

Query:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKW
        YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDA+LFWSMGLDGQSDP SIHGGCEVLSGEKWSATKW
Subjt:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKW

Query:  MRQKSTLVP
        MRQKSTLVP
Subjt:  MRQKSTLVP

A0A6J1CBS4 prolyl 4-hydroxylase 14.4e-15390Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M S+ MRIVFGLLTFVT+GMIIGAL QLAF+RRLEDS GTEFL AGRLHK QYD   QLPRGFPNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        +ECDYL+ +AL RLE+STVVDTKTGKGVKSDFRTSSGMFLSH EKN+PM+QAIEKRISVYSQ+P+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP
        R+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP

A0A6J1GWV8 prolyl 4-hydroxylase 1 isoform X17.8e-15089.62Show/hide
Query:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M S+ MRI FGLLTFVTVGMIIGAL QLAF+RRLEDS G EFLPAGRLHK QYDSQHQLPRG PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS 
Subjt:  MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        +ECDYLK IAL  LEISTVVDTKTGKGVKSDFRTSSGMFL H +K FPMVQAIEKRISVYSQ+P+ENGE IQVLRYEKNQFYKPHHDYFSDT+NL  GGQ
Subjt:  KECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLV
        RIAT+LMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP +GDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTLV
Subjt:  RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLV

SwissProt top hitse value%identityAlignment
F4JNU8 Probable prolyl 4-hydroxylase 84.5e-5453.85Show/hide
Query:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH-HEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF
        EV+SW PR  V HNFL+ +EC++L  +A   +  S VVD KTGK + S  RTSSG FL+  H++   +V+ IE RIS ++ +P ENGE +QVL YE  Q 
Subjt:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH-HEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF

Query:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG
        Y+PHHDYF D FN+++GGQRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DP S+HGGC V+ G
Subjt:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG

Query:  EKWSATKW
         KWS+TKW
Subjt:  EKWSATKW

F4JZ24 Probable prolyl 4-hydroxylase 104.1e-5552.83Show/hide
Query:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFY
        E++SW PR  V HNFL+ +EC YL  +A   +E STVVD KTGK   S  RTSSG FL+        ++ IEKRIS ++ +PVE+GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVL
        +PH+DYF D +N + GGQRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DP S+HGGC V+
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVL

Query:  SGEKWSATKWMR
         G KWS+TKW+R
Subjt:  SGEKWSATKWMR

Q24JN5 Prolyl 4-hydroxylase 58.5e-5353.37Show/hide
Query:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH-HEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF
        EV+SW PR +V HNFL+ +EC++L  +A   +  STVVD KTG    S  RTSSG FL   H++   +V+ IEKRIS ++ +PVENGE +QVL Y+  Q 
Subjt:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH-HEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF

Query:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG
        Y+PH+DYF D FN K GGQRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DP S+HGGC V+ G
Subjt:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG

Query:  EKWSATKW
         KWS+TKW
Subjt:  EKWSATKW

Q9LN20 Probable prolyl 4-hydroxylase 32.4e-5554.07Show/hide
Query:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFY
        EV+SW PR  V HNFLS +EC+YL  +A   +  STVVD++TGK   S  RTSSG FL        +++ IEKRI+ Y+ +P ++GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG
        +PH+DYF D FN K GGQR+ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG

Query:  EKWSATKWM
         KWS+TKWM
Subjt:  EKWSATKWM

Q9ZW86 Prolyl 4-hydroxylase 11.1e-12174.82Show/hide
Query:  MRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDY
        M+IVFGLLTFVTVGM+IG+LLQLAF+ RLEDS GT F P+ R  + Q     +  R    W NDK+AE+LR+G VKPEVVSWSPRIIVLH+FLS +EC+Y
Subjt:  MRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDY

Query:  LKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM
        LK IA  RL++STVVD KTGKGVKSD RTSSGMFL+H E+++P++QAIEKRI+V+SQVP ENGELIQVLRYE  QFYKPHHDYF+DTFNLKRGGQR+ATM
Subjt:  LKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM

Query:  LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKST
        LMYL++++EGGETYFP AG G+C+CGGK + G+SVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVLSGEKWSATKWMRQK+T
Subjt:  LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKST

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.7e-5654.07Show/hide
Query:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFY
        EV+SW PR  V HNFLS +EC+YL  +A   +  STVVD++TGK   S  RTSSG FL        +++ IEKRI+ Y+ +P ++GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG
        +PH+DYF D FN K GGQR+ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG

Query:  EKWSATKWM
         KWS+TKWM
Subjt:  EKWSATKWM

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.0e-5453.37Show/hide
Query:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH-HEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF
        EV+SW PR +V HNFL+ +EC++L  +A   +  STVVD KTG    S  RTSSG FL   H++   +V+ IEKRIS ++ +PVENGE +QVL Y+  Q 
Subjt:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH-HEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF

Query:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG
        Y+PH+DYF D FN K GGQRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DP S+HGGC V+ G
Subjt:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG

Query:  EKWSATKW
         KWS+TKW
Subjt:  EKWSATKW

AT2G43080.1 P4H isoform 18.0e-12374.82Show/hide
Query:  MRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDY
        M+IVFGLLTFVTVGM+IG+LLQLAF+ RLEDS GT F P+ R  + Q     +  R    W NDK+AE+LR+G VKPEVVSWSPRIIVLH+FLS +EC+Y
Subjt:  MRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDY

Query:  LKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM
        LK IA  RL++STVVD KTGKGVKSD RTSSGMFL+H E+++P++QAIEKRI+V+SQVP ENGELIQVLRYE  QFYKPHHDYF+DTFNLKRGGQR+ATM
Subjt:  LKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATM

Query:  LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKST
        LMYL++++EGGETYFP AG G+C+CGGK + G+SVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVLSGEKWSATKWMRQK+T
Subjt:  LMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKST

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.2e-5553.85Show/hide
Query:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH-HEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF
        EV+SW PR  V HNFL+ +EC++L  +A   +  S VVD KTGK + S  RTSSG FL+  H++   +V+ IE RIS ++ +P ENGE +QVL YE  Q 
Subjt:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH-HEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQF

Query:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG
        Y+PHHDYF D FN+++GGQRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DP S+HGGC V+ G
Subjt:  YKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSG

Query:  EKWSATKW
         KWS+TKW
Subjt:  EKWSATKW

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.9e-5652.83Show/hide
Query:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFY
        E++SW PR  V HNFL+ +EC YL  +A   +E STVVD KTGK   S  RTSSG FL+        ++ IEKRIS ++ +PVE+GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVL
        +PH+DYF D +N + GGQRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DP S+HGGC V+
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVL

Query:  SGEKWSATKWMR
         G KWS+TKW+R
Subjt:  SGEKWSATKWMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCCTCTCAGATGAGGATTGTCTTCGGTCTCTTGACATTTGTCACCGTCGGCATGATCATCGGTGCTTTGTTACAACTAGCATTTTTAAGAAGGCTGGAGGACTC
TATTGGCACGGAGTTTCTACCTGCTGGAAGGTTACATAAAGCTCAGTATGATAGCCAACATCAATTGCCCCGAGGCTTTCCTAATTGGATTAATGACAAAGAAGCAGAAA
TTCTTCGTCTTGGCTATGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCATAGTATTGCATAATTTTTTGAGCACGAAGGAGTGTGACTACCTTAAGGGAATAGCA
CTTGCTCGCCTTGAAATTTCCACTGTCGTGGATACGAAAACCGGGAAGGGCGTTAAGAGTGATTTTAGAACGAGCTCTGGAATGTTTTTAAGTCATCATGAGAAAAACTT
TCCAATGGTCCAGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAGTTCCAGTAGAAAATGGAGAGCTAATTCAAGTGTTAAGGTACGAGAAGAATCAATTTTACAAGC
CTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGGATTGCAACTATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACTTACTTT
CCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTCAAACCAGCCAAAGGAGATGCAGTACTTTTCTGGAGCATGGGGTTAGATGG
GCAATCAGATCCAAAGAGCATTCATGGAGGGTGTGAAGTACTGTCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAA
mRNA sequenceShow/hide mRNA sequence
CTCCCATATTAAATCTAGCAATCAAAATGGAAACCTAAACTAAAACTCAACCTCCCTTACATTATTTCAATAGCTTGAATTCCCTAATAAAACACAAACATTACAACCAA
GAAATCTAAAGGATCTCAAATCACAATCACTTTTTCTGCTTAGTTACGGAAGAAATCATAGTGAACGAGGATGATGAAGAACAACAACAACCAGTAGAAGCAAATCATCT
GCACTGGGTCATCCACACTTCAAACACTATAACGGTTTCTTCTCAAGAGTTTGGAGTTAAAATTTTGTTTAGGCAGCTATGGTTTCCTCTCAGATGAGGATTGTCTTCGG
TCTCTTGACATTTGTCACCGTCGGCATGATCATCGGTGCTTTGTTACAACTAGCATTTTTAAGAAGGCTGGAGGACTCTATTGGCACGGAGTTTCTACCTGCTGGAAGGT
TACATAAAGCTCAGTATGATAGCCAACATCAATTGCCCCGAGGCTTTCCTAATTGGATTAATGACAAAGAAGCAGAAATTCTTCGTCTTGGCTATGTTAAACCAGAAGTA
GTAAGCTGGTCACCACGAATCATAGTATTGCATAATTTTTTGAGCACGAAGGAGTGTGACTACCTTAAGGGAATAGCACTTGCTCGCCTTGAAATTTCCACTGTCGTGGA
TACGAAAACCGGGAAGGGCGTTAAGAGTGATTTTAGAACGAGCTCTGGAATGTTTTTAAGTCATCATGAGAAAAACTTTCCAATGGTCCAGGCAATTGAAAAAAGAATTT
CTGTCTATTCTCAAGTTCCAGTAGAAAATGGAGAGCTAATTCAAGTGTTAAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAAC
TTGAAGCGTGGTGGTCAGCGGATTGCAACTATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACTTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGG
TGGGAAGACCGTTCCAGGACTGTCAGTCAAACCAGCCAAAGGAGATGCAGTACTTTTCTGGAGCATGGGGTTAGATGGGCAATCAGATCCAAAGAGCATTCATGGAGGGT
GTGAAGTACTGTCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAATTCATACTTTCTAGTTTCATTTGTATTGTATATCACAAT
TGAATATTTTGTTACATATATCAACTAATAAATTTATATATATAGAGAGAGAGAGAGAAAAAATGGAGAGGCTAATTTAGATAGCGTCTTAACATAATTAATCACACTTA
GAATGAATCAATTTATTTAATGAATAATACAGTAGAAGCAACATCTGATGTTTCTTTGTTCT
Protein sequenceShow/hide protein sequence
MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDYLKGIA
LARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYF
PKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP