CuGenDBv2

Lsi07G000850 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi07G000850
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: prolyl 4-hydroxylase 1
Genome location: chr07:926208..945306
RNA-Seq Expression: Lsi07G000850
Synteny: Lsi07G000850
Gene Ontology terms:
GO:0110165 - cellular anatomical structure (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
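
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the locus span computed. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record does not confirm this convention.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

# Location string copied from this record
chrom, start, end, span = parse_location("chr07:926208..945306")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 19,099 bp on chr07.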


Homology
GenBank top hits (e value, %identity, alignment)
XP_008453925.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo] (e value: 1.2e-78; %identity: 49.62)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG
        M SA MRIVFGLLTFVTVGMIIGALLQLAF+RRLEDSI                       GLPNWINDKEAEILRLGY               VK  V 
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG

Query:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK
              W  R                                                   ++V+                                 L 
Subjt:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK

Query:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG
         EECDYLK IALPRLEIST        GVKSDFRTSSGMFLSH EKNYPMVQ                                         FNLKRGG
Subjt:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        QRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_016901570.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X5 [Cucumis melo] (e value: 5.6e-79; %identity: 50)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI------------------------------------------GLPNWINDKEAEILRLGYEE
        M SA MRIVFGLLTFVTVGMIIGALLQLAF+RRLEDSI                                          GLPNWINDKEAEILRLGY  
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI------------------------------------------GLPNWINDKEAEILRLGYEE

Query:  GANLCAIPNQFALVKVGVGFCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVD
                     VK  V       W  R                                                   ++V+                
Subjt:  GANLCAIPNQFALVKVGVGFCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVD

Query:  GREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-------------------FNLKRGGQRI
                         L  EECDYLK IALPRLEIST        GVKSDFRTSSGMFLSH EKNYPMVQ                   FNLKRGGQRI
Subjt:  GREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-------------------FNLKRGGQRI

Query:  ATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        ATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  ATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_022137963.1 prolyl 4-hydroxylase 1 [Momordica charantia] (e value: 8.0e-78; %identity: 48.59)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG
        MASAPMRIVFGLLTFVT+GMIIGAL QLAFIRRLEDS                        G PNWIND+EAEILRLGY               VK  V 
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG

Query:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK
              W  R                                                   ++V+                                 L 
Subjt:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK

Query:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG
         EECDYL+A+ALPRLE+ST        GVKSDFRTSSGMFLSHQEKNYPM+Q                                         FNLKRGG
Subjt:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        QR+ATMLMYLS+NVEGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_023539759.1 prolyl 4-hydroxylase 1-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.1e-77; %identity: 49.49)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG
        MAS  MRIVFGLLTFVTVGMIIGAL QLAFIRRLEDSI                       GLPNWINDKEAEILRLGY               VK  V 
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG

Query:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK
              W  R      ++ S             +RG   C    +                ++   ++G+                              
Subjt:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK

Query:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG
          ECDYLKAIALPRLEIST        G+KSDFRTSSGMFLSHQE+NYPMVQ                                         FNLKRGG
Subjt:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLV
        QRIATMLMYL++NVEGGETYFPKAGSG CSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL GEKWSATKWMRQKSTL+
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLV

XP_038904320.1 prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida] (e value: 2.0e-81; %identity: 51.15)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG
        MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI                       GLPNWINDKEAEILRLGY               VK  V 
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG

Query:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK
              W  R                                                   ++V+                                 L 
Subjt:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK

Query:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG
         EECDYLKAIALPRLEIST        GVKSDFRTSSGMFLSHQEKNYPMVQ                                         FNLKRGG
Subjt:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        QRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

TrEMBL top hits (e value, %identity, alignment)
A0A1S3BXE6 prolyl 4-hydroxylase 1 isoform X3 (e value: 6.0e-79; %identity: 49.62)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG
        M SA MRIVFGLLTFVTVGMIIGALLQLAF+RRLEDSI                       GLPNWINDKEAEILRLGY               VK  V 
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG

Query:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK
              W  R                                                   ++V+                                 L 
Subjt:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK

Query:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG
         EECDYLK IALPRLEIST        GVKSDFRTSSGMFLSH EKNYPMVQ                                         FNLKRGG
Subjt:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        QRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A1S4E0S2 prolyl 4-hydroxylase 1 isoform X5 (e value: 2.7e-79; %identity: 50)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI------------------------------------------GLPNWINDKEAEILRLGYEE
        M SA MRIVFGLLTFVTVGMIIGALLQLAF+RRLEDSI                                          GLPNWINDKEAEILRLGY  
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI------------------------------------------GLPNWINDKEAEILRLGYEE

Query:  GANLCAIPNQFALVKVGVGFCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVD
                     VK  V       W  R                                                   ++V+                
Subjt:  GANLCAIPNQFALVKVGVGFCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVD

Query:  GREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-------------------FNLKRGGQRI
                         L  EECDYLK IALPRLEIST        GVKSDFRTSSGMFLSH EKNYPMVQ                   FNLKRGGQRI
Subjt:  GREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-------------------FNLKRGGQRI

Query:  ATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        ATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  ATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A6J1CBS4 prolyl 4-hydroxylase 1 (e value: 3.9e-78; %identity: 48.59)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG
        MASAPMRIVFGLLTFVT+GMIIGAL QLAFIRRLEDS                        G PNWIND+EAEILRLGY               VK  V 
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG

Query:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK
              W  R                                                   ++V+                                 L 
Subjt:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK

Query:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG
         EECDYL+A+ALPRLE+ST        GVKSDFRTSSGMFLSHQEKNYPM+Q                                         FNLKRGG
Subjt:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
        QR+ATMLMYLS+NVEGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A6J1EXD7 prolyl 4-hydroxylase 1-like isoform X1 (e value: 5.1e-78; %identity: 49.49)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG
        MAS  MRIVFGLLTFVTVGMIIGAL QLAFIRRLEDSI                       GLPNWINDKEAEILRLGY               VK  V 
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG

Query:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK
              W  R      ++ S             +RG   C    +                ++   ++G+                              
Subjt:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK

Query:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG
          ECDYLKAIALPRLEIST        G+KSDFRTSSGMFLSHQE+NYPMVQ                                         FNLKRGG
Subjt:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLV
        QRIATMLMYL++NVEGGETYFPKAGSG CSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL GEKWSATKWMRQKSTL+
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLV

A0A6J1I5D6 prolyl 4-hydroxylase 1-like isoform X1 (e value: 4.3e-77; %identity: 48.97)
Query:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG
        MAS  MRIVFGLLTFVTVGMIIGAL QLAFIRRLEDSI                       GLPNWINDKE+EILRLGY               VK  V 
Subjt:  MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSI-----------------------GLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVG

Query:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK
              W  R      ++ S             +RG   C    +                ++   ++G+                              
Subjt:  FCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLK

Query:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG
          ECDYLKAIALPRLEIST        G+KSDFRTSSGMFLSHQE++YPMVQ                                         FNLKRGG
Subjt:  YEECDYLKAIALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLV
        QRIATMLMYL++NVEGGETYFPKAGSG CSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL GEKWSATKWMRQKSTL+
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLV

SwissProt top hits (e value, %identity, alignment)
F4JNU8 Probable prolyl 4-hydroxylase 8 (e value: 5.3e-24; %identity: 40)
Query:  LKYEECDYLKAIALPRL------EISTG--VKSDFRTSSGMFLS--HQE-------------------------------------KNYPMVQFNLKRGG
        L  EEC++L ++A P +      ++ TG  + S  RTSSG FL+  H E                                      +Y   +FN+++GG
Subjt:  LKYEECDYLKAIALPRL------EISTG--VKSDFRTSSGMFLS--HQE-------------------------------------KNYPMVQFNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW
        QRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DPSS+HGGC V+ G KWS+TKW
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW

F4JZ24 Probable prolyl 4-hydroxylase 10 (e value: 5.7e-26; %identity: 41.03)
Query:  LKYEECDYLKAIALPRLEISTGV--------KSDFRTSSGMFLS-------------------------------HQE--------KNYPMVQFNLKRGG
        L  EEC YL  +A P +E ST V         S  RTSSG FL+                               H E         +Y M ++N + GG
Subjt:  LKYEECDYLKAIALPRLEISTGV--------KSDFRTSSGMFLS-------------------------------HQE--------KNYPMVQFNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMR
        QRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DPSS+HGGC V+ G KWS+TKW+R
Subjt:  QRIATMLMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMR

Q24JN5 Prolyl 4-hydroxylase 5 (e value: 1.0e-22; %identity: 40)
Query:  LKYEECDYLKAIALPRLEISTGV--------KSDFRTSSGMFL--SHQE-------------------------------------KNYPMVQFNLKRGG
        L  EEC++L ++A P +  ST V         S  RTSSG FL   H E                                      +Y + +FN K GG
Subjt:  LKYEECDYLKAIALPRLEISTGV--------KSDFRTSSGMFL--SHQE-------------------------------------KNYPMVQFNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW
        QRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DPSS+HGGC V+ G KWS+TKW
Subjt:  QRIATMLMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW

Q9LN20 Probable prolyl 4-hydroxylase 3 (e value: 2.0e-26; %identity: 41.67)
Query:  LKYEECDYLKAIALPRLEISTGVKSD--------FRTSSGMFLS-------------------------------HQEK--------NYPMVQFNLKRGG
        L  EEC+YL ++A P +  ST V S+         RTSSG FL                                H E         +Y + +FN K GG
Subjt:  LKYEECDYLKAIALPRLEISTGVKSD--------FRTSSGMFLS-------------------------------HQEK--------NYPMVQFNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM
        QR+ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+ G KWS+TKWM
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM

Q9ZW86 Prolyl 4-hydroxylase 1 (e value: 9.9e-63; %identity: 41.69)
Query:  MRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSIG-------------------LPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVGFCHHTGWGS
        M+IVFGLLTFVTVGM+IG+LLQLAFI RLEDS G                   +  W NDK+AE+LR+G                VK  V       W  
Subjt:  MRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSIG-------------------LPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVGFCHHTGWGS

Query:  RKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKA
        R                                                   ++V+                          DF     L  EEC+YLKA
Subjt:  RKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKA

Query:  IALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGGQRIATMLMY
        IA PRL++ST        GVKSD RTSSGMFL+H E++YP++Q                                         FNLKRGGQR+ATMLMY
Subjt:  IALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGGQRIATMLMY

Query:  LSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST
        L+++VEGGETYFP AG G+C+CGGK + G+SVKP+KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQK+T
Subjt:  LSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST

Arabidopsis top hits (e value, %identity, alignment)
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.4e-27; %identity: 41.67)
Query:  LKYEECDYLKAIALPRLEISTGVKSD--------FRTSSGMFLS-------------------------------HQEK--------NYPMVQFNLKRGG
        L  EEC+YL ++A P +  ST V S+         RTSSG FL                                H E         +Y + +FN K GG
Subjt:  LKYEECDYLKAIALPRLEISTGVKSD--------FRTSSGMFLS-------------------------------HQEK--------NYPMVQFNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM
        QR+ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+ G KWS+TKWM
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 7.1e-24; %identity: 40)
Query:  LKYEECDYLKAIALPRLEISTGV--------KSDFRTSSGMFL--SHQE-------------------------------------KNYPMVQFNLKRGG
        L  EEC++L ++A P +  ST V         S  RTSSG FL   H E                                      +Y + +FN K GG
Subjt:  LKYEECDYLKAIALPRLEISTGV--------KSDFRTSSGMFL--SHQE-------------------------------------KNYPMVQFNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW
        QRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DPSS+HGGC V+ G KWS+TKW
Subjt:  QRIATMLMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW

AT2G43080.1 P4H isoform 1 (e value: 7.0e-64; %identity: 41.69)
Query:  MRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSIG-------------------LPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVGFCHHTGWGS
        M+IVFGLLTFVTVGM+IG+LLQLAFI RLEDS G                   +  W NDK+AE+LR+G                VK  V       W  
Subjt:  MRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSIG-------------------LPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVGFCHHTGWGS

Query:  RKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKA
        R                                                   ++V+                          DF     L  EEC+YLKA
Subjt:  RKRERVGYIQSQGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKA

Query:  IALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGGQRIATMLMY
        IA PRL++ST        GVKSD RTSSGMFL+H E++YP++Q                                         FNLKRGGQR+ATMLMY
Subjt:  IALPRLEIST--------GVKSDFRTSSGMFLSHQEKNYPMVQ-----------------------------------------FNLKRGGQRIATMLMY

Query:  LSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST
        L+++VEGGETYFP AG G+C+CGGK + G+SVKP+KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQK+T
Subjt:  LSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 3.8e-25; %identity: 40)
Query:  LKYEECDYLKAIALPRL------EISTG--VKSDFRTSSGMFLS--HQE-------------------------------------KNYPMVQFNLKRGG
        L  EEC++L ++A P +      ++ TG  + S  RTSSG FL+  H E                                      +Y   +FN+++GG
Subjt:  LKYEECDYLKAIALPRL------EISTG--VKSDFRTSSGMFLS--HQE-------------------------------------KNYPMVQFNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW
        QRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DPSS+HGGC V+ G KWS+TKW
Subjt:  QRIATMLMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKW

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 4.0e-27; %identity: 41.03)
Query:  LKYEECDYLKAIALPRLEISTGV--------KSDFRTSSGMFLS-------------------------------HQE--------KNYPMVQFNLKRGG
        L  EEC YL  +A P +E ST V         S  RTSSG FL+                               H E         +Y M ++N + GG
Subjt:  LKYEECDYLKAIALPRLEISTGV--------KSDFRTSSGMFLS-------------------------------HQE--------KNYPMVQFNLKRGG

Query:  QRIATMLMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMR
        QRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DPSS+HGGC V+ G KWS+TKW+R
Subjt:  QRIATMLMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMR


Sequences
CDS sequence
ATGGCCTCCGCTCCGATGAGGATTGTCTTCGGTCTTCTAACCTTCGTCACCGTCGGCATGATCATCGGTGCTTTGTTGCAACTAGCATTTATAAGAAGGCTGGAGGACTC
TATTGGCCTTCCTAATTGGATTAATGACAAAGAAGCAGAAATTCTTCGTCTTGGCTACGAGGAAGGTGCCAATCTCTGTGCCATACCAAATCAATTTGCTTTGGTGAAGG
TTGGTGTGGGGTTTTGTCACCACACAGGGTGGGGTAGCAGAAAGAGGGAAAGGGTTGGATACATTCAAAGTCAAGGAGAGGATGTAAGTAAAGTCAGGTCAGGTAGAATG
GACAGGGGCGGGTTTGATTGTATGTCCAAGGGTATAAAAAAGTCAATGGGCTTTGGGCTGGAAGATCAGGCAGAGAGAGTTTGGTTGGTGGTTGTGGGCAAGGTGGGCCT
TGATCCATTGTTTGAGATTTTTGGAGGAGTGGATGGGCGGGAAGATGGGTTTTGTAGGTTCTTGGACTTTGGAGGGATGTTGAGGTTAAAGTACGAGGAGTGCGACTACC
TTAAGGCAATAGCACTTCCTCGCCTTGAAATTTCCACGGGAGTTAAGAGTGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTCATCAAGAGAAAAATTATCCAATGGTC
CAGTTTAACTTGAAGCGTGGTGGTCAGCGAATAGCAACTATGCTTATGTACCTAAGTGAAAACGTTGAAGGAGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTG
TAGCTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTTAAACCATCCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGGCTTAGATGGACAATCAGATCCAAGTAGCATTC
ATGGAGGGTGTGAAGTACTGGCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGACAAAAGAGTACTCTGGTACCATAA
mRNA sequence
AAAAAAATGAAGAACTAAAACACAAACACAACTTCCTCTCATTTTTTCAAAGCTAATATTTCCTTAAACACAAAATTAGAAGAAAGAAATCAAAGGGATCTGAAAACGCA
ATCACTTTTTCTTCACTGTTACGAAAGAAATCACAGTGAACGGGAAAGAAGAAGAACAAGAAAAAGCAGTAGGGAATCCTCTGCAGTGGGTCATCTCATTGGGTCCCTGA
GCTGATTTCTACCTTCCACACTTCAAAGACTAACGGCTTCTCCTTCTTCTCCTCATTTTGGAGTCAAAATTTCGTTTAGGCAGCTATGGCCTCCGCTCCGATGAGGATTG
TCTTCGGTCTTCTAACCTTCGTCACCGTCGGCATGATCATCGGTGCTTTGTTGCAACTAGCATTTATAAGAAGGCTGGAGGACTCTATTGGCCTTCCTAATTGGATTAAT
GACAAAGAAGCAGAAATTCTTCGTCTTGGCTACGAGGAAGGTGCCAATCTCTGTGCCATACCAAATCAATTTGCTTTGGTGAAGGTTGGTGTGGGGTTTTGTCACCACAC
AGGGTGGGGTAGCAGAAAGAGGGAAAGGGTTGGATACATTCAAAGTCAAGGAGAGGATGTAAGTAAAGTCAGGTCAGGTAGAATGGACAGGGGCGGGTTTGATTGTATGT
CCAAGGGTATAAAAAAGTCAATGGGCTTTGGGCTGGAAGATCAGGCAGAGAGAGTTTGGTTGGTGGTTGTGGGCAAGGTGGGCCTTGATCCATTGTTTGAGATTTTTGGA
GGAGTGGATGGGCGGGAAGATGGGTTTTGTAGGTTCTTGGACTTTGGAGGGATGTTGAGGTTAAAGTACGAGGAGTGCGACTACCTTAAGGCAATAGCACTTCCTCGCCT
TGAAATTTCCACGGGAGTTAAGAGTGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTCATCAAGAGAAAAATTATCCAATGGTCCAGTTTAACTTGAAGCGTGGTGGTC
AGCGAATAGCAACTATGCTTATGTACCTAAGTGAAAACGTTGAAGGAGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGCTGTGGTGGGAAGACCGTTCCA
GGACTGTCAGTTAAACCATCCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGGCTTAGATGGACAATCAGATCCAAGTAGCATTCATGGAGGGTGTGAAGTACTGGCAGG
GGAAAAATGGTCTGCCACAAAATGGATGAGACAAAAGAGTACTCTGGTACCATAATTCAAACTTTCTAGTTCCATTGTATTGTATATCAGCATTGAATATTTTGTTACAT
ATCAACTAATAAATCTATAGAGAGAGAAACAAAGAAAAATGGAGAGAGCCTAATTTAGATAGCATCTTAACATAATTAATAACACTTACAGTGAATCAATTTATATAATG
AATAATATAATACAGCAGCATCTGATGTTATTTTCCCCTCACTCTGTCATTTCAG
Protein sequence
MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSIGLPNWINDKEAEILRLGYEEGANLCAIPNQFALVKVGVGFCHHTGWGSRKRERVGYIQSQGEDVSKVRSGRM
DRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEISTGVKSDFRTSSGMFLSHQEKNYPMV
QFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
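
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical and not part of the database. It is run here only on the first 21 and last 12 nucleotides of the CDS as listed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into a protein string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# Start and end of the CDS listed in this record
print(translate("ATGGCCTCCGCTCCGATGAGG"))  # MASAPMR
print(translate("CTGGTACCATAA"))           # LVP
```

The two excerpts translate to the first seven and last three residues of the protein sequence listed above. Passing the full CDS, with its line breaks joined, would allow the same comparison over the whole protein.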