; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001835 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001835
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprolyl 4-hydroxylase 1
Genome locationChr11:887376..905641
RNA-Seq ExpressionHG10001835
SyntenyHG10001835
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0000137 - Golgi cis cisterna (cellular component)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453926.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X6 [Cucumis melo]7.3e-11072.45Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAE--RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGML
        M SA MRIVFGLLTFVTVGMIIG +   + +GR+ +  +D   +  +    +  + +AE  R+  V    V   P   +                     
Subjt:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAE--RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGML

Query:  RLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLK
         L  EECDYLK IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEK+QFYKPHHDYFSDTFNLK
Subjt:  RLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLK

Query:  RGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        RGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  RGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_008453928.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X7 [Cucumis melo]1.1e-10873.38Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLR
        M SA MRIVFGLLTFVTVGMIIG  +      R+ D  G + +  G      +  + Q  R         GL         ++ +E           +LR
Subjt:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLR

Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR
        L Y ECDYLK IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEK+QFYKPHHDYFSDTFNLKR
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR

Query:  GGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        GGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  GGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_016901569.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X4 [Cucumis melo]3.6e-10973.22Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLD--FGGM
        M SA MRIVFGLLTFVTVGMIIG  +      R+ D  G + +  G      +  + Q  RV +        +  F    G +    G   +++     +
Subjt:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLD--FGGM

Query:  LRLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL
        LRL Y ECDYLK IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEK+QFYKPHHDYFSDTFNL
Subjt:  LRLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL

Query:  KRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        KRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  KRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_022137963.1 prolyl 4-hydroxylase 1 [Momordica charantia]1.8e-10872.45Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFC-RFLDFGGML
        MASAPMRIVFGLLTFVT+GMIIG         R+ D  G + +S G      +  + Q  R +   +     + L    G V      +  R +      
Subjt:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFC-RFLDFGGML

Query:  RLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLK
         L  EECDYL+A+ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPM+QAIEKRISVYSQIP+ENGELIQVLRYEK+QFYKPHHDYFSDTFNLK
Subjt:  RLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLK

Query:  RGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        RGGQR+ATMLMYLS+NVEGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  RGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_038904320.1 prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida]6.6e-11171.43Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIG-----------ED---VSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAE--RVWLVVVGKVGLDPLFEIFGGVDGR
        MASAPMRIVFGLLTFVTVGMIIG           ED      + +GR+ +  +D   +  +    +  + +AE  R+  V    V   P   +       
Subjt:  MASAPMRIVFGLLTFVTVGMIIG-----------ED---VSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAE--RVWLVVVGKVGLDPLFEIFGGVDGR

Query:  EDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFY
                       L  EECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEK+QFY
Subjt:  EDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM
        KPHHDYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWM
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM

Query:  RQKSTLVP
        RQKSTLVP
Subjt:  RQKSTLVP

TrEMBL top hitse value%identityAlignment
A0A1S3BY76 prolyl 4-hydroxylase 1 isoform X63.5e-11072.45Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAE--RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGML
        M SA MRIVFGLLTFVTVGMIIG +   + +GR+ +  +D   +  +    +  + +AE  R+  V    V   P   +                     
Subjt:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAE--RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGML

Query:  RLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLK
         L  EECDYLK IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEK+QFYKPHHDYFSDTFNLK
Subjt:  RLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLK

Query:  RGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        RGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  RGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A1S3BYM5 prolyl 4-hydroxylase 1 isoform X75.1e-10973.38Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLR
        M SA MRIVFGLLTFVTVGMIIG  +      R+ D  G + +  G      +  + Q  R         GL         ++ +E           +LR
Subjt:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLR

Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR
        L Y ECDYLK IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEK+QFYKPHHDYFSDTFNLKR
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR

Query:  GGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        GGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  GGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A1S4E011 prolyl 4-hydroxylase 1 isoform X21.1e-10871.05Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCM---------SKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEI--FGGVDGREDGFC
        M SA MRIVFGLLTFVTVGMIIG +   + +GR+ +  +D           ++   K +G    +     W+        D   EI   G V      + 
Subjt:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCM---------SKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEI--FGGVDGREDGFC

Query:  -RFLDFGGMLRLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHH
         R +       L  EECDYLK IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEK+QFYKPHH
Subjt:  -RFLDFGGMLRLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHH

Query:  DYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKS
        DYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKS
Subjt:  DYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKS

Query:  TLVP
        TLVP
Subjt:  TLVP

A0A1S4E033 prolyl 4-hydroxylase 1 isoform X41.8e-10973.22Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLD--FGGM
        M SA MRIVFGLLTFVTVGMIIG  +      R+ D  G + +  G      +  + Q  RV +        +  F    G +    G   +++     +
Subjt:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLD--FGGM

Query:  LRLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL
        LRL Y ECDYLK IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEK+QFYKPHHDYFSDTFNL
Subjt:  LRLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL

Query:  KRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        KRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  KRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A6J1CBS4 prolyl 4-hydroxylase 18.7e-10972.45Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFC-RFLDFGGML
        MASAPMRIVFGLLTFVT+GMIIG         R+ D  G + +S G      +  + Q  R +   +     + L    G V      +  R +      
Subjt:  MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFC-RFLDFGGML

Query:  RLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLK
         L  EECDYL+A+ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPM+QAIEKRISVYSQIP+ENGELIQVLRYEK+QFYKPHHDYFSDTFNLK
Subjt:  RLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLK

Query:  RGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        RGGQR+ATMLMYLS+NVEGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  RGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

SwissProt top hitse value%identityAlignment
F4JNU8 Probable prolyl 4-hydroxylase 82.8e-4855.15Show/hide
Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL
        L  EEC++L ++A P +  S VVD KTGK + S  RTSSG FL+  H E    +V+ IE RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL

Query:  KRGGQRIATMLMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW
        ++GGQRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DPSS+HGGC V+ G KWS+TKW
Subjt:  KRGGQRIATMLMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW

F4JZ24 Probable prolyl 4-hydroxylase 106.8e-5054.31Show/hide
Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR
        L  EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+        ++ IEKRIS ++ IPVE+GE +QVL YE  Q Y+PH+DYF D +N + 
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR

Query:  GGQRIATMLMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMR
        GGQRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DPSS+HGGC V+ G KWS+TKW+R
Subjt:  GGQRIATMLMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMR

Q24JN5 Prolyl 4-hydroxylase 59.1e-4754.64Show/hide
Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL
        L  EEC++L ++A P +  STVVD KTG    S  RTSSG FL   H E    +V+ IEKRIS ++ IPVENGE +QVL Y+  Q Y+PH+DYF D FN 
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL

Query:  KRGGQRIATMLMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW
        K GGQRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DPSS+HGGC V+ G KWS+TKW
Subjt:  KRGGQRIATMLMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW

Q9LN20 Probable prolyl 4-hydroxylase 31.5e-4954.12Show/hide
Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR
        L  EEC+YL ++A P +  STVVD++TGK   S  RTSSG FL        +++ IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K 
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR

Query:  GGQRIATMLMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM
        GGQR+ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+ G KWS+TKWM
Subjt:  GGQRIATMLMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM

Q9ZW86 Prolyl 4-hydroxylase 14.0e-9563.45Show/hide
Query:  MRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEI--FGGVDGREDGFCRFL----DFGGMLR
        M+IVFGLLTFVTVGM+IG  +      R++    D    G       GL  Q  R +L  V +   D   E+   G V      +   +    DF     
Subjt:  MRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEI--FGGVDGREDGFCRFL----DFGGMLR

Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR
        L  EEC+YLKAIA PRL++STVVD KTGKGVKSD RTSSGMFL+H E++YP++QAIEKRI+V+SQ+P ENGELIQVLRYE  QFYKPHHDYF+DTFNLKR
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR

Query:  GGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST
        GGQR+ATMLMYL+++VEGGETYFP AG G+C+CGGK + G+SVKP+KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQK+T
Subjt:  GGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-5054.12Show/hide
Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR
        L  EEC+YL ++A P +  STVVD++TGK   S  RTSSG FL        +++ IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K 
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR

Query:  GGQRIATMLMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM
        GGQR+ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+ G KWS+TKWM
Subjt:  GGQRIATMLMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.5e-4854.64Show/hide
Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL
        L  EEC++L ++A P +  STVVD KTG    S  RTSSG FL   H E    +V+ IEKRIS ++ IPVENGE +QVL Y+  Q Y+PH+DYF D FN 
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL

Query:  KRGGQRIATMLMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW
        K GGQRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DPSS+HGGC V+ G KWS+TKW
Subjt:  KRGGQRIATMLMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW

AT2G43080.1 P4H isoform 12.9e-9663.45Show/hide
Query:  MRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEI--FGGVDGREDGFCRFL----DFGGMLR
        M+IVFGLLTFVTVGM+IG  +      R++    D    G       GL  Q  R +L  V +   D   E+   G V      +   +    DF     
Subjt:  MRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEI--FGGVDGREDGFCRFL----DFGGMLR

Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR
        L  EEC+YLKAIA PRL++STVVD KTGKGVKSD RTSSGMFL+H E++YP++QAIEKRI+V+SQ+P ENGELIQVLRYE  QFYKPHHDYF+DTFNLKR
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR

Query:  GGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST
        GGQR+ATMLMYL+++VEGGETYFP AG G+C+CGGK + G+SVKP+KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQK+T
Subjt:  GGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.0e-4955.15Show/hide
Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL
        L  EEC++L ++A P +  S VVD KTGK + S  RTSSG FL+  H E    +V+ IE RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNL

Query:  KRGGQRIATMLMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW
        ++GGQRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DPSS+HGGC V+ G KWS+TKW
Subjt:  KRGGQRIATMLMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.8e-5154.31Show/hide
Query:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR
        L  EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+        ++ IEKRIS ++ IPVE+GE +QVL YE  Q Y+PH+DYF D +N + 
Subjt:  LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKR

Query:  GGQRIATMLMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMR
        GGQRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DPSS+HGGC V+ G KWS+TKW+R
Subjt:  GGQRIATMLMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCCGCTCCGATGAGGATTGTCTTCGGTCTTCTAACCTTCGTCACCGTCGGCATGATCATCGGAGAGGATGTAAGTAAAGTCAGGTCAGGTAGAATGGACAGGGG
CGGGTTTGATTGTATGTCCAAGGGTATAAAAAAGTCAATGGGCTTTGGGCTGGAAGATCAGGCAGAGAGAGTTTGGTTGGTGGTTGTGGGCAAGGTGGGCCTTGATCCAT
TGTTTGAGATTTTTGGAGGAGTGGATGGGCGGGAAGATGGGTTTTGTAGGTTCTTGGACTTTGGAGGGATGTTGAGGTTAAAGTACGAGGAGTGCGACTACCTTAAGGCA
ATAGCACTTCCTCGCCTTGAAATTTCCACGGTCGTGGATACAAAAACTGGGAAGGGAGTTAAGAGTGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTCATCAAGAGAA
AAATTATCCAATGGTCCAGGCAATTGAAAAAAGAATTTCCGTTTATTCTCAAATACCAGTAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTACGAGAAGAGTCAATTTT
ACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGAATAGCAACTATGCTTATGTACCTAAGTGAAAACGTTGAAGGAGGAGAAACC
TACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGCTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTTAAACCATCCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGGCTT
AGATGGACAATCAGATCCAAGTAGCATTCATGGAGGGTGTGAAGTACTGGCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGACAAAAGAGTACTCTGGTACCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCCGCTCCGATGAGGATTGTCTTCGGTCTTCTAACCTTCGTCACCGTCGGCATGATCATCGGAGAGGATGTAAGTAAAGTCAGGTCAGGTAGAATGGACAGGGG
CGGGTTTGATTGTATGTCCAAGGGTATAAAAAAGTCAATGGGCTTTGGGCTGGAAGATCAGGCAGAGAGAGTTTGGTTGGTGGTTGTGGGCAAGGTGGGCCTTGATCCAT
TGTTTGAGATTTTTGGAGGAGTGGATGGGCGGGAAGATGGGTTTTGTAGGTTCTTGGACTTTGGAGGGATGTTGAGGTTAAAGTACGAGGAGTGCGACTACCTTAAGGCA
ATAGCACTTCCTCGCCTTGAAATTTCCACGGTCGTGGATACAAAAACTGGGAAGGGAGTTAAGAGTGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTCATCAAGAGAA
AAATTATCCAATGGTCCAGGCAATTGAAAAAAGAATTTCCGTTTATTCTCAAATACCAGTAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTACGAGAAGAGTCAATTTT
ACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGAATAGCAACTATGCTTATGTACCTAAGTGAAAACGTTGAAGGAGGAGAAACC
TACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGCTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTTAAACCATCCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGGCTT
AGATGGACAATCAGATCCAAGTAGCATTCATGGAGGGTGTGAAGTACTGGCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGACAAAAGAGTACTCTGGTACCATAA
Protein sequenceShow/hide protein sequence
MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKA
IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGET
YFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP