; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001955 (gene) of Snake gourd v1 genome

Gene IDTan0001955
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprolyl 4-hydroxylase 1
Genome locationLG10:13097423..13129305
RNA-Seq ExpressionTan0001955
SyntenyTan0001955
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0000137 - Golgi cis cisterna (cellular component)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453926.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X6 [Cucumis melo]6.8e-13990.49Show/hide
Query:  MIDFCMNHVFAGTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKG
        ++ F    +  GTEFL AGRLH+TQYD QRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS+EECDYLK IALPRLEISTVVDTKTGKG
Subjt:  MIDFCMNHVFAGTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKG

Query:  VKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGE
        VKSDFRTSSGMFLS  EKNYPMVQAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYL++N+EGGETYFPKAGSGE
Subjt:  VKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGE

Query:  CSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP
        CSCGGKTVPGLSVKP KGDA+LFWSMGLDGQSDPNSIHGGCEVL GEKWSATKW+RQKSTLVP
Subjt:  CSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP

XP_022137963.1 prolyl 4-hydroxylase 1 [Momordica charantia]6.1e-14094.44Show/hide
Query:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM
        GTEFLSAGRLH+TQYDG RQLPRG PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLS+EECDYL+A+ALPRLE+STVVDTKTGKGVKSDFRTSSGM
Subjt:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM

Query:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL
        FLS QEKNYPM+QAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR+ATMLMYL+DNVEGGETYFPKAGSGECSCGGKTVPGL
Subjt:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP
        SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL GEKWSATKW+RQKSTLVP
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP

XP_022932580.1 prolyl 4-hydroxylase 1-like isoform X2 [Cucurbita moschata]2.3e-13996.02Show/hide
Query:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM
        GTEFLSAGRLH+TQYDGQRQ  +GLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKG+KSDFRTSSGM
Subjt:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM

Query:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL
        FLS QE+NYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSG CSCGGKTVPGL
Subjt:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLV
        SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKW+RQKSTL+
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLV

XP_023539760.1 prolyl 4-hydroxylase 1-like isoform X2 [Cucurbita pepo subsp. pepo]1.2e-13895.62Show/hide
Query:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM
        GTEFLSAGRLH+TQYDGQRQ  +GLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKG+KSDFRTSSGM
Subjt:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM

Query:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL
        FLS QE+NYPMVQAIEKRISVYSQIP ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSG CSCGGKTVPGL
Subjt:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLV
        SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKW+RQKSTL+
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLV

XP_038904320.1 prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida]2.3e-13995.24Show/hide
Query:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM
        GTEFLSAGRLH+TQYD QRQL RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS+EECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM
Subjt:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM

Query:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL
        FLS QEKNYPMVQAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYL++N+EGGETYFPKAGSGECSCGGKTVPGL
Subjt:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP
        SVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVL GEKWSATKW+RQKSTLVP
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP

TrEMBL top hitse value%identityAlignment
A0A1S3BXE6 prolyl 4-hydroxylase 1 isoform X37.3e-13994.05Show/hide
Query:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM
        GTEFL AGRLH+TQYD QRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS+EECDYLK IALPRLEISTVVDTKTGKGVKSDFRTSSGM
Subjt:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM

Query:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL
        FLS  EKNYPMVQAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYL++N+EGGETYFPKAGSGECSCGGKTVPGL
Subjt:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP
        SVKP KGDA+LFWSMGLDGQSDPNSIHGGCEVL GEKWSATKW+RQKSTLVP
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP

A0A1S3BY76 prolyl 4-hydroxylase 1 isoform X63.3e-13990.49Show/hide
Query:  MIDFCMNHVFAGTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKG
        ++ F    +  GTEFL AGRLH+TQYD QRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS+EECDYLK IALPRLEISTVVDTKTGKG
Subjt:  MIDFCMNHVFAGTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKG

Query:  VKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGE
        VKSDFRTSSGMFLS  EKNYPMVQAIEKRISVYSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYL++N+EGGETYFPKAGSGE
Subjt:  VKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGE

Query:  CSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP
        CSCGGKTVPGLSVKP KGDA+LFWSMGLDGQSDPNSIHGGCEVL GEKWSATKW+RQKSTLVP
Subjt:  CSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP

A0A6J1CBS4 prolyl 4-hydroxylase 13.0e-14094.44Show/hide
Query:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM
        GTEFLSAGRLH+TQYDG RQLPRG PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLS+EECDYL+A+ALPRLE+STVVDTKTGKGVKSDFRTSSGM
Subjt:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM

Query:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL
        FLS QEKNYPM+QAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR+ATMLMYL+DNVEGGETYFPKAGSGECSCGGKTVPGL
Subjt:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP
        SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL GEKWSATKW+RQKSTLVP
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVP

A0A6J1F2K0 prolyl 4-hydroxylase 1-like isoform X21.1e-13996.02Show/hide
Query:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM
        GTEFLSAGRLH+TQYDGQRQ  +GLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKG+KSDFRTSSGM
Subjt:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM

Query:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL
        FLS QE+NYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSG CSCGGKTVPGL
Subjt:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLV
        SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKW+RQKSTL+
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLV

A0A6J1I482 prolyl 4-hydroxylase 1-like isoform X24.0e-13794.42Show/hide
Query:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM
        GTEFLSAGRL +TQYDGQRQ  +GLPNWINDKE+EILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKG+KSDFRTSSGM
Subjt:  GTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGM

Query:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL
        FLS QE++YPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQ YKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSG CSCGGKTVPGL
Subjt:  FLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLV
        SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKW+RQKSTL+
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLV

SwissProt top hitse value%identityAlignment
F4JNU8 Probable prolyl 4-hydroxylase 82.2e-5555.07Show/hide
Query:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        EV+SW PR  V HNFL++EEC++L ++A P +  S VVD KTGK + S  RTSSG FL+       +V+ IE RIS ++ IP ENGE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGE
        +PHHDYF D FN+++GGQRIAT+LMYL+D  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DP+S+HGGC V+ G 
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGE

Query:  KWSATKW
        KWS+TKW
Subjt:  KWSATKW

F4JZ24 Probable prolyl 4-hydroxylase 105.2e-5754.25Show/hide
Query:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        E++SW PR  V HNFL+ EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+        ++ IEKRIS ++ IP+E+GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL
        +PH+DYF D +N + GGQRIAT+LMYL+D  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL

Query:  GGEKWSATKWLR
         G KWS+TKWLR
Subjt:  GGEKWSATKWLR

Q24JN5 Prolyl 4-hydroxylase 52.4e-5454.11Show/hide
Query:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        EV+SW PR +V HNFL++EEC++L ++A P +  STVVD KTG    S  RTSSG FL  +  +  +V+ IEKRIS ++ IP+ENGE +QVL Y+  Q Y
Subjt:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGE
        +PH+DYF D FN K GGQRIAT+LMYL+D  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DP+S+HGGC V+ G 
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGE

Query:  KWSATKW
        KWS+TKW
Subjt:  KWSATKW

Q9LN20 Probable prolyl 4-hydroxylase 31.2e-5655.02Show/hide
Query:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        EV+SW PR  V HNFLS EEC+YL ++A P +  STVVD++TGK   S  RTSSG FL  +     +++ IEKRI+ Y+ IP ++GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGG
        +PH+DYF D FN K GGQR+ATMLMYL+D  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGG

Query:  EKWSATKWL
         KWS+TKW+
Subjt:  EKWSATKWL

Q9ZW86 Prolyl 4-hydroxylase 15.8e-10979.74Show/hide
Query:  RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVY
        R +  W NDK+AE+LR+G VKPEVVSWSPRIIVLH+FLS EEC+YLKAIA PRL++STVVD KTGKGVKSD RTSSGMFL+  E++YP++QAIEKRI+V+
Subjt:  RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVY

Query:  SQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSD
        SQ+P ENGELIQVLRYE  QFYKPHHDYF+DTFNLKRGGQR+ATMLMYLTD+VEGGETYFP AG G+C+CGGK + G+SVKP KGDAVLFWSMGLDGQSD
Subjt:  SQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSD

Query:  PNSIHGGCEVLGGEKWSATKWLRQKST
        P SIHGGCEVL GEKWSATKW+RQK+T
Subjt:  PNSIHGGCEVLGGEKWSATKWLRQKST

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.2e-5855.02Show/hide
Query:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        EV+SW PR  V HNFLS EEC+YL ++A P +  STVVD++TGK   S  RTSSG FL  +     +++ IEKRI+ Y+ IP ++GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGG
        +PH+DYF D FN K GGQR+ATMLMYL+D  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECS---------CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGG

Query:  EKWSATKWL
         KWS+TKW+
Subjt:  EKWSATKWL

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.7e-5554.11Show/hide
Query:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        EV+SW PR +V HNFL++EEC++L ++A P +  STVVD KTG    S  RTSSG FL  +  +  +V+ IEKRIS ++ IP+ENGE +QVL Y+  Q Y
Subjt:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGE
        +PH+DYF D FN K GGQRIAT+LMYL+D  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D   DP+S+HGGC V+ G 
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGS--------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGE

Query:  KWSATKW
        KWS+TKW
Subjt:  KWSATKW

AT2G43080.1 P4H isoform 14.1e-11079.74Show/hide
Query:  RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVY
        R +  W NDK+AE+LR+G VKPEVVSWSPRIIVLH+FLS EEC+YLKAIA PRL++STVVD KTGKGVKSD RTSSGMFL+  E++YP++QAIEKRI+V+
Subjt:  RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVY

Query:  SQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSD
        SQ+P ENGELIQVLRYE  QFYKPHHDYF+DTFNLKRGGQR+ATMLMYLTD+VEGGETYFP AG G+C+CGGK + G+SVKP KGDAVLFWSMGLDGQSD
Subjt:  SQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSD

Query:  PNSIHGGCEVLGGEKWSATKWLRQKST
        P SIHGGCEVL GEKWSATKW+RQK+T
Subjt:  PNSIHGGCEVLGGEKWSATKWLRQKST

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.5e-5655.07Show/hide
Query:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        EV+SW PR  V HNFL++EEC++L ++A P +  S VVD KTGK + S  RTSSG FL+       +V+ IE RIS ++ IP ENGE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGE
        +PHHDYF D FN+++GGQRIAT+LMYL+D  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D   DP+S+HGGC V+ G 
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGE

Query:  KWSATKW
        KWS+TKW
Subjt:  KWSATKW

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.7e-5854.25Show/hide
Query:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY
        E++SW PR  V HNFL+ EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+        ++ IEKRIS ++ IP+E+GE +QVL YE  Q Y
Subjt:  EVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFY

Query:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL
        +PH+DYF D +N + GGQRIAT+LMYL+D  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM  D   DP+S+HGGC V+
Subjt:  KPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVL

Query:  GGEKWSATKWLR
         G KWS+TKWLR
Subjt:  GGEKWSATKWLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGACTTTTGTATGAACCATGTCTTTGCAGGCACGGAGTTTCTATCAGCTGGAAGGTTACATCAAACTCAGTATGATGGCCAACGTCAATTACCCCGAGGCCTTCC
TAATTGGATTAACGACAAAGAAGCAGAAATTCTTCGTCTTGGCTACGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCATTGTATTACATAATTTTTTGAGCTCAG
AGGAGTGCGACTACCTGAAGGCAATTGCACTTCCTCGCCTTGAAATTTCCACGGTTGTGGATACAAAAACAGGGAAGGGAGTTAAGAGCGATTTCAGAACGAGCTCTGGA
ATGTTTTTAAGTACTCAAGAGAAAAATTATCCAATGGTCCAGGCAATCGAAAAAAGAATTTCTGTCTATTCTCAAATACCGATAGAAAATGGAGAGCTTATTCAAGTATT
AAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGAATAGCAACCATGCTTATGTACCTAACTG
ACAACGTTGAAGGTGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGCGAGTGTAGTTGTGGTGGGAAAACTGTCCCAGGGTTGTCGGTTAAACCAGTCAAAGGAGATGCA
GTGCTTTTCTGGAGCATGGGATTGGATGGACAGTCGGATCCTAATAGCATTCATGGAGGCTGTGAAGTACTGGGAGGCGAAAAATGGTCTGCCACAAAATGGCTGAGGCA
AAAGAGTACTCTGGTACCAACAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATAAATCTAGCCAAAAACGAAAACGAATCACAAAAACACAACTTTCACAAAAGCTCTGTTACCAAACAAACCCAAAATTAGAAGAAAGAAATCAAAGGCATCTCAAAATC
ACAATCACTATTTCTTCACTGTTACAAAAGAAACCAAAGTAAACGCCAAAAAAAAAAAGAAAGCACAAACGAATCCTCCGCCAGTGCAAGGCGGTTTCTCATTGATTTAA
CGGCTTCTCCTTCCTCTCCTGATTTCTTGCTACTTTTCGCTTCAACGACTCAAAGTTTTCCTTTCCCTTAGGCTGCCATGGCCTCCGCTCCGATGAGGATTGTCTTCGGT
CTCCTAACCTTCGTCACCGTCGGCATGATCATCGGTGCTTTGTTTCAATTAGCATTTATAAGAAGGCTGGAGGACTCTATTGGTATTTATTTGGAAGAATGACTGAACGG
TGTAGACACTTCATTTAGCCAAGAGCTCTTGACGTATCTATGCAAAACAATATCATCTATCTTTTCTGAAGGCATGACATTTGCTGGGTGCTGGCCACATCCTTATGCTT
ACGAGGCACGATTTATTTTAGTATTTTCAGAATGATTGACTTTTGTATGAACCATGTCTTTGCAGGCACGGAGTTTCTATCAGCTGGAAGGTTACATCAAACTCAGTATG
ATGGCCAACGTCAATTACCCCGAGGCCTTCCTAATTGGATTAACGACAAAGAAGCAGAAATTCTTCGTCTTGGCTACGTTAAACCAGAAGTAGTAAGCTGGTCACCACGA
ATCATTGTATTACATAATTTTTTGAGCTCAGAGGAGTGCGACTACCTGAAGGCAATTGCACTTCCTCGCCTTGAAATTTCCACGGTTGTGGATACAAAAACAGGGAAGGG
AGTTAAGAGCGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTACTCAAGAGAAAAATTATCCAATGGTCCAGGCAATCGAAAAAAGAATTTCTGTCTATTCTCAAATAC
CGATAGAAAATGGAGAGCTTATTCAAGTATTAAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAG
CGAATAGCAACCATGCTTATGTACCTAACTGACAACGTTGAAGGTGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGCGAGTGTAGTTGTGGTGGGAAAACTGTCCCAGG
GTTGTCGGTTAAACCAGTCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGGATTGGATGGACAGTCGGATCCTAATAGCATTCATGGAGGCTGTGAAGTACTGGGAGGCG
AAAAATGGTCTGCCACAAAATGGCTGAGGCAAAAGAGTACTCTGGTACCAACAGTTTAA
Protein sequenceShow/hide protein sequence
MIDFCMNHVFAGTEFLSAGRLHQTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSSEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSG
MFLSTQEKNYPMVQAIEKRISVYSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDA
VLFWSMGLDGQSDPNSIHGGCEVLGGEKWSATKWLRQKSTLVPTV