; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G015400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G015400
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprolyl 4-hydroxylase 1
Genome locationCmo_Chr14:12321100..12324628
RNA-Seq ExpressionCmoCh14G015400
SyntenyCmoCh14G015400
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0000137 - Golgi cis cisterna (cellular component)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582084.1 Prolyl 4-hydroxylase 1, partial [Cucurbita argyrosperma subsp. sororia]1.4e-5499.05Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
        YNLMHGGQRIAT+LMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY

Query:  QLKTF
        QLKTF
Subjt:  QLKTF

KAG7018500.1 Prolyl 4-hydroxylase 1 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-5499.05Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
        YNLMHGGQRIAT+LMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY

Query:  QLKTF
        QLKTF
Subjt:  QLKTF

XP_022956089.1 prolyl 4-hydroxylase 1 isoform X1 [Cucurbita moschata]6.1e-55100Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
        YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY

Query:  QLKTF
        QLKTF
Subjt:  QLKTF

XP_022956091.1 prolyl 4-hydroxylase 1 isoform X2 [Cucurbita moschata]6.1e-55100Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
        YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY

Query:  QLKTF
        QLKTF
Subjt:  QLKTF

XP_023527986.1 prolyl 4-hydroxylase 1 [Cucurbita pepo subsp. pepo]1.4e-5499.05Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
        YNLMHGGQRIAT+LMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY

Query:  QLKTF
        QLKTF
Subjt:  QLKTF

TrEMBL top hitse value%identityAlignment
A0A5A7TMC8 Prolyl 4-hydroxylase 1 isoform X77.8e-4890.91Show/hide
Query:  VQYNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLV
        +Q+NL  GGQRIAT+LMYLSEN+EGGETYFPKAGSGECSCGGKTVPGLSVKP +GDA+LFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLV
Subjt:  VQYNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLV

A0A6J1GVM2 prolyl 4-hydroxylase 1 isoform X23.0e-55100Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
        YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY

Query:  QLKTF
        QLKTF
Subjt:  QLKTF

A0A6J1GWV8 prolyl 4-hydroxylase 1 isoform X13.0e-55100Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
        YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRRY

Query:  QLKTF
        QLKTF
Subjt:  QLKTF

A0A6J1IS22 prolyl 4-hydroxylase 1 isoform X22.2e-5097.98Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRR
        YNLM GGQRIAT+LMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRR
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRR

A0A6J1IUM8 prolyl 4-hydroxylase 1 isoform X12.2e-5097.98Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRR
        YNLM GGQRIAT+LMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRR
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVRR

SwissProt top hitse value%identityAlignment
F4JNU8 Probable prolyl 4-hydroxylase 82.7e-2154.08Show/hide
Query:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW
        ++N+  GGQRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P + DA+LFWSM  D   DP+S+HGGC V+ G KWS+TKW
Subjt:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW

F4JZ24 Probable prolyl 4-hydroxylase 107.6e-2454.37Show/hide
Query:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATK
        +YN  +GGQRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+ G KWS+TK
Subjt:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATK

Query:  WMR
        W+R
Subjt:  WMR

Q24JN5 Prolyl 4-hydroxylase 51.8e-2052.04Show/hide
Query:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW
        ++N  +GGQRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P + DA+LFW+M  D   DP+S+HGGC V+ G KWS+TKW
Subjt:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW

Q9LN20 Probable prolyl 4-hydroxylase 36.4e-2355Show/hide
Query:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWM
        ++N  +GGQR+AT+LMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G KWS+TKWM
Subjt:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWM

Q9ZW86 Prolyl 4-hydroxylase 12.1e-4278.95Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKST
        +NL  GGQR+AT+LMYL+++VEGGETYFP AG G+C+CGGK + G+SVKP +GDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQK+T
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKST

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.6e-2455Show/hide
Query:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWM
        ++N  +GGQR+AT+LMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G KWS+TKWM
Subjt:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWM

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.2e-2152.04Show/hide
Query:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW
        ++N  +GGQRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P + DA+LFW+M  D   DP+S+HGGC V+ G KWS+TKW
Subjt:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW

AT2G43080.1 P4H isoform 11.5e-4378.95Show/hide
Query:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKST
        +NL  GGQR+AT+LMYL+++VEGGETYFP AG G+C+CGGK + G+SVKP +GDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQK+T
Subjt:  YNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKST

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.9e-2254.08Show/hide
Query:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW
        ++N+  GGQRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P + DA+LFWSM  D   DP+S+HGGC V+ G KWS+TKW
Subjt:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKW

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.4e-2554.37Show/hide
Query:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATK
        +YN  +GGQRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+ G KWS+TK
Subjt:  QYNLMHGGQRIATILMYLSENVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATK

Query:  WMR
        W+R
Subjt:  WMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGTGCTTCAGTTTTAAGGCTAACATTTCAAGCAGTACAGTATAACTTGATGCATGGTGGTCAGCGAATTGCAACCATTCTTATGTATCTAAGTGAAAACGTTGA
AGGTGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGGCGGGAAGACCGTTCCAGGACTGTCAGTTAAACCAGTCAGAGGAGATGCAGTGCTTTTCT
GGAGTATGGGGTTGGATGGACAGTCGGATCCAAATAGCATTCATGGAGGTTGTGAAGTGCTGTCAGGCGAAAAATGGTCTGCAACAAAATGGATGAGGCAAAAGAGTACT
CTGGTACGAAGGTACCAACTCAAAACCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTGTGCTTCAGTTTTAAGGCTAACATTTCAAGCAGTACAGTATAACTTGATGCATGGTGGTCAGCGAATTGCAACCATTCTTATGTATCTAAGTGAAAACGTTGA
AGGTGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGGCGGGAAGACCGTTCCAGGACTGTCAGTTAAACCAGTCAGAGGAGATGCAGTGCTTTTCT
GGAGTATGGGGTTGGATGGACAGTCGGATCCAAATAGCATTCATGGAGGTTGTGAAGTGCTGTCAGGCGAAAAATGGTCTGCAACAAAATGGATGAGGCAAAAGAGTACT
CTGGTACGAAGGTACCAACTCAAAACCTTTTAGTTCCATTGTGTTGTTTATCGGCATTGAATATTATGTTACATATCAAGTAATAAATCTATAGAAAGAGAAAGAAAGAA
AGAAAAGAAAGGGAGAGAGCCTAATTTAGATAGGGTCATAACATCATGAATAACACACAGTGAATCAATTTATTTAATGAATAATACAACAGCATCTAATGTTACCGAAG
CTACTGTTTCTTTTTTGTGACAAATGTGGGAATTCTCTAAATACCCTTCTAATTTGTATTATATCTATTTTTGTGCATGAATATGACATTTCTTGAACAACTATTAAGGC
ATGATTTAGAAGTTTTTATTGTGTTCATTGAAATTGTATATACTATGGTCATATT
Protein sequenceShow/hide protein sequence
MVCASVLRLTFQAVQYNLMHGGQRIATILMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPVRGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKST
LVRRYQLKTF