; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0767 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0767
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprolyl 4-hydroxylase 1
Genome locationMC05:6279527..6295603
RNA-Seq ExpressionMC05g0767
SyntenyMC05g0767
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0000137 - Golgi cis cisterna (cellular component)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453925.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo]3.39e-19992.07Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M SA MRIVFGLLTFVT+GMIIGAL QLAF+RRLEDS GTEFL AGRLHKTQYD  RQLPRG PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        EECDYL+ +ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSH EKNYPM+QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
        R+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP KGDA+LFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP

XP_022137963.1 prolyl 4-hydroxylase 1 [Momordica charantia]5.29e-212100Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
        RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP

XP_022932580.1 prolyl 4-hydroxylase 1-like isoform X2 [Cucurbita moschata]1.51e-19692.39Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MAS  MRIVFGLLTFVT+GMIIGALFQLAFIRRLEDS GTEFLSAGRLHKTQYDG RQ  +G PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLS+
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        EECDYL+A+ALPRLE+STVVDTKTGKG+KSDFRTSSGMFLSHQE+NYPM+QAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLV
        R+ATMLMYL+DNVEGGETYFPKAGSG CSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL GEKWSATKWMRQKSTL+
Subjt:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLV

XP_023539760.1 prolyl 4-hydroxylase 1-like isoform X2 [Cucurbita pepo subsp. pepo]1.29e-19592.04Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MAS  MRIVFGLLTFVT+GMIIGALFQLAFIRRLEDS GTEFLSAGRLHKTQYDG RQ  +G PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLS+
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        EECDYL+A+ALPRLE+STVVDTKTGKG+KSDFRTSSGMFLSHQE+NYPM+QAIEKRISVYSQIP ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLV
        R+ATMLMYL+DNVEGGETYFPKAGSG CSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL GEKWSATKWMRQKSTL+
Subjt:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLV

XP_038904320.1 prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida]2.13e-20294.14Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MASAPMRIVFGLLTFVT+GMIIGAL QLAFIRRLEDS GTEFLSAGRLHKTQYD  RQL RG PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        EECDYL+A+ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPM+QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
        R+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP

TrEMBL top hitse value%identityAlignment
A0A0A0KU17 Fe2OG dioxygenase domain-containing protein4.32e-19590Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M S+ MRIVFGLLTFVT+GMIIGAL QLAF+RRLEDS GTEFL AGRLHK QYD   QLPRGFPNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        +ECDYL+ +AL RLE+STVVDTKTGKGVKSDFRTSSGMFLSH EKN+PM+QAIEKRISVYSQ+P+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
        R+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP

A0A1S3BXE6 prolyl 4-hydroxylase 1 isoform X31.64e-19992.07Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M SA MRIVFGLLTFVT+GMIIGAL QLAF+RRLEDS GTEFL AGRLHKTQYD  RQLPRG PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        EECDYL+ +ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSH EKNYPM+QAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
        R+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP KGDA+LFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP

A0A1S4DZZ9 prolyl 4-hydroxylase 1 isoform X11.23e-19486.41Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPR-------------------GFPNWINDREAEILRLGYVK
        M SA MRIVFGLLTFVT+GMIIGAL QLAF+RRLEDS GTEFL AGRLHKTQYD  RQLPR                   G PNWIND+EAEILRLGYVK
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPR-------------------GFPNWINDREAEILRLGYVK

Query:  PEVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQF
        PEVVSWSPRIIVLHNFLSTEECDYL+ +ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSH EKNYPM+QAIEKRISVYSQIP+ENGELIQVLRYEKNQF
Subjt:  PEVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQF

Query:  YKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW
        YKPHHDYFSDTFNLKRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP KGDA+LFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW
Subjt:  YKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW

Query:  MRQKSTLVP
        MRQKSTLVP
Subjt:  MRQKSTLVP

A0A6J1CBS4 prolyl 4-hydroxylase 12.56e-212100Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
        RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
Subjt:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP

A0A6J1F2K0 prolyl 4-hydroxylase 1-like isoform X27.33e-19792.39Show/hide
Query:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MAS  MRIVFGLLTFVT+GMIIGALFQLAFIRRLEDS GTEFLSAGRLHKTQYDG RQ  +G PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLS+
Subjt:  MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
        EECDYL+A+ALPRLE+STVVDTKTGKG+KSDFRTSSGMFLSHQE+NYPM+QAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Subjt:  EECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ

Query:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLV
        R+ATMLMYL+DNVEGGETYFPKAGSG CSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL GEKWSATKWMRQKSTL+
Subjt:  RVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLV

SwissProt top hitse value%identityAlignment
F4JNU8 Probable prolyl 4-hydroxylase 88.2e-5655.02Show/hide
Query:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ
        EV+SW PR  V HNFL+ EEC++L ++A P +  S VVD KTGK + S  RTSSG FL+  H E    +++ IE RIS ++ IP ENGE +QVL YE  Q
Subjt:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ

Query:  FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLS
         Y+PHHDYF D FN+++GGQR+AT+LMYLSD  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DP+S+HGGC V+ 
Subjt:  FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLS

Query:  GEKWSATKW
        G KWS+TKW
Subjt:  GEKWSATKW

F4JZ24 Probable prolyl 4-hydroxylase 103.3e-5754.25Show/hide
Query:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        E++SW PR  V HNFL+ EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+        I+ IEKRIS ++ IP+E+GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL
        +PH+DYF D +N + GGQR+AT+LMYLSD  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+
Subjt:  KPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL

Query:  SGEKWSATKWMR
         G KWS+TKW+R
Subjt:  SGEKWSATKWMR

Q24JN5 Prolyl 4-hydroxylase 51.2e-5454.07Show/hide
Query:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ
        EV+SW PR +V HNFL+ EEC++L ++A P +  STVVD KTG    S  RTSSG FL   H E    +++ IEKRIS ++ IP+ENGE +QVL Y+  Q
Subjt:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ

Query:  FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLS
         Y+PH+DYF D FN K GGQR+AT+LMYLSD  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DP+S+HGGC V+ 
Subjt:  FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLS

Query:  GEKWSATKW
        G KWS+TKW
Subjt:  GEKWSATKW

Q9LN20 Probable prolyl 4-hydroxylase 38.7e-5846.91Show/hide
Query:  TLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEV
        TL +++  LF L  +  +  ++G   L       +  D        F     +R   + + G    EV+SW PR  V HNFLS EEC+YL ++A P +  
Subjt:  TLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEV

Query:  STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGG
        STVVD++TGK   S  RTSSG FL        +I+ IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+ATMLMYLSD  EGG
Subjt:  STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGG

Query:  ETYFPKAGSGECS---------CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWM
        ET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G KWS+TKWM
Subjt:  ETYFPKAGSGECS---------CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWM

Q9ZW86 Prolyl 4-hydroxylase 11.8e-12476.95Show/hide
Query:  MRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDY
        M+IVFGLLTFVT+GM+IG+L QLAFI RLEDSYGT F S   L   +    R L R    W ND++AE+LR+G VKPEVVSWSPRIIVLH+FLS EEC+Y
Subjt:  MRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDY

Query:  LRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATM
        L+A+A PRL+VSTVVD KTGKGVKSD RTSSGMFL+H E++YP+IQAIEKRI+V+SQ+P ENGELIQVLRYE  QFYKPHHDYF+DTFNLKRGGQRVATM
Subjt:  LRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATM

Query:  LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKST
        LMYL+D+VEGGETYFP AG G+C+CGGK + G+SVKP KGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQK+T
Subjt:  LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKST

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.2e-5946.91Show/hide
Query:  TLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEV
        TL +++  LF L  +  +  ++G   L       +  D        F     +R   + + G    EV+SW PR  V HNFLS EEC+YL ++A P +  
Subjt:  TLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEV

Query:  STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGG
        STVVD++TGK   S  RTSSG FL        +I+ IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+ATMLMYLSD  EGG
Subjt:  STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGG

Query:  ETYFPKAGSGECS---------CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWM
        ET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G KWS+TKWM
Subjt:  ETYFPKAGSGECS---------CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWM

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.4e-5654.07Show/hide
Query:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ
        EV+SW PR +V HNFL+ EEC++L ++A P +  STVVD KTG    S  RTSSG FL   H E    +++ IEKRIS ++ IP+ENGE +QVL Y+  Q
Subjt:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ

Query:  FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLS
         Y+PH+DYF D FN K GGQR+AT+LMYLSD  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DP+S+HGGC V+ 
Subjt:  FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLS

Query:  GEKWSATKW
        G KWS+TKW
Subjt:  GEKWSATKW

AT2G43080.1 P4H isoform 11.3e-12576.95Show/hide
Query:  MRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDY
        M+IVFGLLTFVT+GM+IG+L QLAFI RLEDSYGT F S   L   +    R L R    W ND++AE+LR+G VKPEVVSWSPRIIVLH+FLS EEC+Y
Subjt:  MRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDY

Query:  LRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATM
        L+A+A PRL+VSTVVD KTGKGVKSD RTSSGMFL+H E++YP+IQAIEKRI+V+SQ+P ENGELIQVLRYE  QFYKPHHDYF+DTFNLKRGGQRVATM
Subjt:  LRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATM

Query:  LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKST
        LMYL+D+VEGGETYFP AG G+C+CGGK + G+SVKP KGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQK+T
Subjt:  LMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKST

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.8e-5755.02Show/hide
Query:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ
        EV+SW PR  V HNFL+ EEC++L ++A P +  S VVD KTGK + S  RTSSG FL+  H E    +++ IE RIS ++ IP ENGE +QVL YE  Q
Subjt:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQ

Query:  FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLS
         Y+PHHDYF D FN+++GGQR+AT+LMYLSD  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DP+S+HGGC V+ 
Subjt:  FYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLS

Query:  GEKWSATKW
        G KWS+TKW
Subjt:  GEKWSATKW

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.4e-5854.25Show/hide
Query:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        E++SW PR  V HNFL+ EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+        I+ IEKRIS ++ IP+E+GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSTEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL
        +PH+DYF D +N + GGQR+AT+LMYLSD  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+
Subjt:  KPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL

Query:  SGEKWSATKWMR
         G KWS+TKW+R
Subjt:  SGEKWSATKWMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCCGCTCCGATGAGGATTGTATTCGGTCTGCTCACCTTCGTCACCCTCGGCATGATCATTGGTGCTTTGTTCCAATTAGCATTTATAAGGAGGCTGGAGGACTC
TTATGGCACGGAGTTTCTATCTGCTGGAAGGTTACATAAAACTCAGTATGATGGCGATCGTCAATTACCCCGAGGCTTTCCTAATTGGATTAATGACAGAGAAGCAGAAA
TTCTTCGTCTTGGCTATGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCATTGTATTGCATAATTTTTTGAGCACAGAGGAGTGCGACTATCTTAGGGCAGTAGCA
CTTCCTCGCCTTGAAGTTTCCACGGTTGTGGATACAAAAACAGGGAAGGGAGTTAAGAGTGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTCATCAAGAGAAAAATTA
TCCAATGATCCAGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATACCAATAGAAAATGGAGAGCTCATTCAAGTTTTAAGGTACGAGAAGAATCAATTTTACAAGC
CTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGAGTAGCAACCATGCTTATGTATCTAAGTGACAACGTTGAAGGTGGAGAAACCTACTTT
CCGAAGGCTGGTTCTGGGGAGTGTAGTTGTGGTGGGAAAACTGTCCCAGGGTTGTCCGTTAAACCAGTCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGGATTAGATGG
ACAGTCGGATCCTAATAGCATTCATGGAGGGTGTGAAGTACTGTCTGGCGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACTAACCGCAAAACAAAATCAAAGGGATCGCAAAATTCTTTTTTTCTTCACTGTTACGAAAGAAATCAAAGTAAACGCGAAGCAAAAAAGCGGAATCGAATTG
ATCTTCTTCTGCAGTGGGTCATCTCTGAGCGCGAGCGGTTTCTCGTTGGGTCCCTGAGCTGATTTCTGTTCCTTCTTGGCAGTTTTCGCTCCAAGGAGTCGTTTCAGGCA
CCCATGGCCTCCGCTCCGATGAGGATTGTATTCGGTCTGCTCACCTTCGTCACCCTCGGCATGATCATTGGTGCTTTGTTCCAATTAGCATTTATAAGGAGGCTGGAGGA
CTCTTATGGCACGGAGTTTCTATCTGCTGGAAGGTTACATAAAACTCAGTATGATGGCGATCGTCAATTACCCCGAGGCTTTCCTAATTGGATTAATGACAGAGAAGCAG
AAATTCTTCGTCTTGGCTATGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCATTGTATTGCATAATTTTTTGAGCACAGAGGAGTGCGACTATCTTAGGGCAGTA
GCACTTCCTCGCCTTGAAGTTTCCACGGTTGTGGATACAAAAACAGGGAAGGGAGTTAAGAGTGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTCATCAAGAGAAAAA
TTATCCAATGATCCAGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATACCAATAGAAAATGGAGAGCTCATTCAAGTTTTAAGGTACGAGAAGAATCAATTTTACA
AGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGAGTAGCAACCATGCTTATGTATCTAAGTGACAACGTTGAAGGTGGAGAAACCTAC
TTTCCGAAGGCTGGTTCTGGGGAGTGTAGTTGTGGTGGGAAAACTGTCCCAGGGTTGTCCGTTAAACCAGTCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGGATTAGA
TGGACAGTCGGATCCTAATAGCATTCATGGAGGGTGTGAAGTACTGTCTGGCGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAGTTCA
TACTTATTATAGAGTTCCACTGCATACGGCATTTAATATTTTGTTACATAACAAAGTAATAAATTTAGAGAGAGAAAGAGAAAGAGAGAAGCTAGTTTAGATAGTGTTTA
ACATATTAACAATACTCACTGTGAATCAATTTATTTAATGAATAATACAGCATCTGGTGTTATCCTCCCCCAGTGTGTGTATCTGCTGAAGTTATAGATTTTTTCTTTTC
TAAGAACCAAGATCAGACTATTTAAACGATGTTATTGTGTACGTTGAGAGTATTGATTACTAGAGTTCAACAAATATAGAGGTGAAGATTGAACCTTTGACTTCAAGGAT
GGTAGTAGATGTCTTATCCACTGAACTATGTTTGAATTGACGATTAGTACTTACATGTATTCTAACTGAAAATGAG
Protein sequenceShow/hide protein sequence
MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLSTEECDYLRAVA
LPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYF
PKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP