; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010756 (gene) of Chayote v1 genome

Gene IDSed0010756
OrganismSechium edule (Chayote v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationLG06:5276451..5286522
RNA-Seq ExpressionSed0010756
SyntenySed0010756
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588394.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia]1.1e-7953.25Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET
        MDSRF LAFSLCFLCSFPL  R  NRLPKL++  T T+ S +       S KIDPTRV+QLSSQPRAFLYKGFLSA +C HI+++     E+S+V DD T
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET

Query:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK
        GAS +S DRTS+GMFL  AQ                                                                               K
Subjt:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK

Query:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-
        + EE+  DL DC+  GYGVKPKKGDALLFFSLH N+T D TS+HGSCPVIEGEKWSATKWIHMLP  E+WRNPDCVDE+EHCSVWA AGECEKNP YMV 
Subjt:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-

Query:  ---GSKGELGYCRESCKVCSSPS
           GSK ELGYCR SCK CS PS
Subjt:  ---GSKGELGYCRESCKVCSSPS

XP_022931100.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata]6.6e-8553.87Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET
        MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+QLSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD T
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET

Query:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK
        GAS +S DRTS+GMFL  AQ                                                                               K
Subjt:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK

Query:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-
        + EE+  DL DC+  GYGVKPKKGDALLFFSLH N+T D TS+HGSCPVIEGEKWSATKWIHMLP  E+WRNPDCVDE+EHCS WA AGECEKNP YMV 
Subjt:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-

Query:  ---GSKGELGYCRESCKVCSSPS
           GSK ELGYCR SCK CS PS
Subjt:  ---GSKGELGYCRESCKVCSSPS

XP_022971148.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima]1.5e-8454.49Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET
        MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+QLSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD T
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET

Query:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK
        GAS +S DRTS+GMFL  AQ                                                                               K
Subjt:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK

Query:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-
        + EE K DLSDC+  GYGVKPKKGDALLFFSLH N+T D TS+HGSCPVIEGEKWSATKWIHMLP  E+WRNPDCVDE+EHCS WA AGECEKNP YMV 
Subjt:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-

Query:  ---GSKGELGYCRESCKVCSSPS
           GSK ELGYCR SCK CS PS
Subjt:  ---GSKGELGYCRESCKVCSSPS

XP_023530715.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo]3.0e-8553.87Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET
        MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+QLSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD T
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET

Query:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK
        GAS +S DRTS+GMFL  AQ                                                                               K
Subjt:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK

Query:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-
        + EE+  DLSDC+  GYGVKPKKGDALLFFSLH N+T D TS+HGSCPVIEGEKWSATKWIHMLP  E+WRNPDCVDE+EHCS WA AGECEKNP YMV 
Subjt:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-

Query:  ---GSKGELGYCRESCKVCSSPS
           GSK +LGYCR SCK CS PS
Subjt:  ---GSKGELGYCRESCKVCSSPS

XP_038905408.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]2.1e-7850.78Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVS----GKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET
        M SRF LAFSLCFLC FP   R+ NRLPKL++ +    +S +   +      IDPTRVI+LSS+PRAFLYKGFLS  +C H+INLA+  +++S+VA  ET
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVS----GKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET

Query:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK
        G SV S++RTS+GMFL  AQ                                                                              +K
Subjt:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK

Query:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVG
        LSE+++ DLSDCAK+GYGVKPK GDALLFFSL+ N+TPD+TS+HGSCPVIEGEKWSATKWIHMLP  E+WRNP CVDE+  C  WANAGECEKNP YM+G
Subjt:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVG

Query:  SKGELGYCRESCKVCSSPS
        SK ELG+CR SCKVCS PS
Subjt:  SKGELGYCRESCKVCSSPS

TrEMBL top hitse value%identityAlignment
A0A1S3B814 Procollagen-proline 4-dioxygenase8.2e-7348.59Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSG----KIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET
        M S F LAFS+ FL   PL   + NR PK+++ +    +S +   +G     IDPTRVIQLSS+PRAFLYKGFLS  +C H+I+LA+  + +S+VA   T
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSG----KIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET

Query:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK
        G SV S++RTS+GMFL  AQ                                                                              VK
Subjt:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK

Query:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVG
        LSEE+K DLS+CAK+GYGV+PK GDALLFFS++ N+TPD+TS+HGSCPVIEGEKWSATKWIHMLP  EVWRNP CVDE++HCS WA AGEC+KNP YM+G
Subjt:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVG

Query:  SKGELGYCRESCKVCSSPS
        SK ELG+CR SCKVCS  S
Subjt:  SKGELGYCRESCKVCSSPS

A0A6J1DTY4 Procollagen-proline 4-dioxygenase1.3e-7549.69Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE
        MDSR  LAFSLCFLC FPL  R+TN +P+L++      + ++     G  S  IDP+RV QLSSQPRAF+YKGFLSA +C+H+INLA+D +E+S+VADD 
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE

Query:  TGASVASEDRTSSGMFL---------------------------------------------------DIAQ---------------------------V
        TG SV S +RTS+GMFL                                                   ++AQ                           V
Subjt:  TGASVASEDRTSSGMFL---------------------------------------------------DIAQ---------------------------V

Query:  KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV
        KLS  +K +LSDCAK+GY VKPK GDALLFFSLHAN T DS+S+HGSCPVI+GEKWSATKWIHML   E+WR+PDCVD S  C+ WA+ GEC KNP YM+
Subjt:  KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV

Query:  GSKGELGYCRESCKVCSS
        GSK ELGYCR+SC  CSS
Subjt:  GSKGELGYCRESCKVCSS

A0A6J1DX45 Procollagen-proline 4-dioxygenase1.3e-7549.69Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE
        MDSR  LAFSLCFLC FPL  R+TN +P+L++      + ++     G  S  IDP+RV QLSSQPRAF+YKGFLSA +C+H+INLA+D +E+S+VADD 
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE

Query:  TGASVASEDRTSSGMFL---------------------------------------------------DIAQ---------------------------V
        TG SV S +RTS+GMFL                                                   ++AQ                           V
Subjt:  TGASVASEDRTSSGMFL---------------------------------------------------DIAQ---------------------------V

Query:  KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV
        KLS  +K +LSDCAK+GY VKPK GDALLFFSLHAN T DS+S+HGSCPVI+GEKWSATKWIHML   E+WR+PDCVD S  C+ WA+ GEC KNP YM+
Subjt:  KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV

Query:  GSKGELGYCRESCKVCSS
        GSK ELGYCR+SC  CSS
Subjt:  GSKGELGYCRESCKVCSS

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase3.2e-8553.87Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET
        MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+QLSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD T
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET

Query:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK
        GAS +S DRTS+GMFL  AQ                                                                               K
Subjt:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK

Query:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-
        + EE+  DL DC+  GYGVKPKKGDALLFFSLH N+T D TS+HGSCPVIEGEKWSATKWIHMLP  E+WRNPDCVDE+EHCS WA AGECEKNP YMV 
Subjt:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-

Query:  ---GSKGELGYCRESCKVCSSPS
           GSK ELGYCR SCK CS PS
Subjt:  ---GSKGELGYCRESCKVCSSPS

A0A6J1I5Z9 Procollagen-proline 4-dioxygenase7.1e-8554.49Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET
        MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+QLSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD T
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDET

Query:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK
        GAS +S DRTS+GMFL  AQ                                                                               K
Subjt:  GASVASEDRTSSGMFLDIAQ------------------------------------------------------------------------------VK

Query:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-
        + EE K DLSDC+  GYGVKPKKGDALLFFSLH N+T D TS+HGSCPVIEGEKWSATKWIHMLP  E+WRNPDCVDE+EHCS WA AGECEKNP YMV 
Subjt:  LSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV-

Query:  ---GSKGELGYCRESCKVCSSPS
           GSK ELGYCR SCK CS PS
Subjt:  ---GSKGELGYCRESCKVCSSPS

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 67.6e-5242.31Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSM-VADDETGAS
        MDS++ LAFSL  L  F                      S +   S  +DPTR+ QLS  PRAFLYKGFLS  +CDH+I LA+  +EKSM VAD ++G S
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSM-VADDETGAS

Query:  VASEDRTSSGMFLD------IAQVK--------LSEE-------------QKND----------------------------------------------
          SE RTSSGMFL       +A V+        L EE             QK D                                              
Subjt:  VASEDRTSSGMFLD------IAQVK--------LSEE-------------QKND----------------------------------------------

Query:  -----LSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKG
              S CAK GY VKP+KGDALLFF+LH N T D  S HGSCPVIEGEKWSAT+WIH+  + +  +   CVD+ E C  WA+AGECEKNP YMVGS+ 
Subjt:  -----LSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKG

Query:  ELGYCRESCKVC
         LG+CR+SCK C
Subjt:  ELGYCRESCKVC

F4JAU3 Prolyl 4-hydroxylase 22.0e-4438.52Show/hide
Query:  SGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLD--------------------------------
        S  I+P++V Q+SS+PRAF+Y+GFL+  +CDH+I+LA++++++S VAD++ G S  S+ RTSSG F+                                 
Subjt:  SGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLD--------------------------------

Query:  --------------------------IAQVKL-----------------------SEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGS
                                  IA V L                         E K+DLSDCAK G  VKPKKG+ALLFF+L  +  PD  S HG 
Subjt:  --------------------------IAQVKL-----------------------SEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGS

Query:  CPVIEGEKWSATKWIHMLPYTEV-WRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC
        CPVIEGEKWSATKWIH+  + ++   + +C D +E C  WA  GEC KNP YMVG+    G CR SCK C
Subjt:  CPVIEGEKWSATKWIHMLPYTEV-WRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC

Q8GXT7 Probable prolyl 4-hydroxylase 121.8e-2933.2Show/hide
Query:  ITKKS---AVGFVSGK--IDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADD------------------------ETGASVASEDRT
        IT KS      +V G   +DPTRV+QLS  PR FLY+GFLS  +CDH+I+L ++  E   V  D                        E G S+     T
Subjt:  ITKKS---AVGFVSGK--IDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADD------------------------ETGASVASEDRT

Query:  S--SGMFLDI---------------------------AQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATK
        S  SG  LD                             ++     +    + C + G  ++P KG+A+LFF+   N + D  S H  CPV++GE   ATK
Subjt:  S--SGMFLDI---------------------------AQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATK

Query:  WIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC
         I+      +  + +C DE E+C  WA  GEC+KNP YM+GS    G CR+SC  C
Subjt:  WIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC

Q8L970 Probable prolyl 4-hydroxylase 73.8e-5942.63Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE
        MDSR  LAFSLCFL + PL+  A NR    +  S+ T+  +V        S   DPTRV QLS  PR FLY+GFLS  +CDH I LA+  +EKSMVAD++
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE

Query:  TGASVASEDRTSSGMFLD----------------------------------------------------------IAQV--------------------
        +G SV SE RTSSGMFL                                                           IA V                    
Subjt:  TGASVASEDRTSSGMFLD----------------------------------------------------------IAQV--------------------

Query:  KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVW-RNPDCVDESEHCSVWANAGECEKNPSYM
        K ++ + +  ++CAK GY VKP+KGDALLFF+LH N T DS S HGSCPV+EGEKWSAT+WIH+  +   + +   C+DE+  C  WA AGEC+KNP+YM
Subjt:  KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVW-RNPDCVDESEHCSVWANAGECEKNPSYM

Query:  VGSKGELGYCRESCKVCSS
        VGS  + GYCR+SCK CSS
Subjt:  VGSKGELGYCRESCKVCSS

Q8LAN3 Probable prolyl 4-hydroxylase 46.3e-4638.2Show/hide
Query:  IDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFL------------------------------------
        ++P++V Q+SS+PRAF+Y+GFL+  +CDH+++LA+  +++S VAD+++G S  SE RTSSG F+                                    
Subjt:  IDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFL------------------------------------

Query:  ---------------------------------------------DIAQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPV
                                                     +I   ++  E K DLSDCAK G  VKP+KGDALLFF+LH +  PD  S HG CPV
Subjt:  ---------------------------------------------DIAQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPV

Query:  IEGEKWSATKWIHMLPYTE-VWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC
        IEGEKWSATKWIH+  +   V  + +C D +E C  WA  GEC KNP YMVG+    GYCR SCK C
Subjt:  IEGEKWSATKWIHMLPYTE-VWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.4e-4538.52Show/hide
Query:  SGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLD--------------------------------
        S  I+P++V Q+SS+PRAF+Y+GFL+  +CDH+I+LA++++++S VAD++ G S  S+ RTSSG F+                                 
Subjt:  SGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLD--------------------------------

Query:  --------------------------IAQVKL-----------------------SEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGS
                                  IA V L                         E K+DLSDCAK G  VKPKKG+ALLFF+L  +  PD  S HG 
Subjt:  --------------------------IAQVKL-----------------------SEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGS

Query:  CPVIEGEKWSATKWIHMLPYTEV-WRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC
        CPVIEGEKWSATKWIH+  + ++   + +C D +E C  WA  GEC KNP YMVG+    G CR SCK C
Subjt:  CPVIEGEKWSATKWIHMLPYTEV-WRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase2.7e-6042.63Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE
        MDSR  LAFSLCFL + PL+  A NR    +  S+ T+  +V        S   DPTRV QLS  PR FLY+GFLS  +CDH I LA+  +EKSMVAD++
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE

Query:  TGASVASEDRTSSGMFLD----------------------------------------------------------IAQV--------------------
        +G SV SE RTSSGMFL                                                           IA V                    
Subjt:  TGASVASEDRTSSGMFLD----------------------------------------------------------IAQV--------------------

Query:  KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVW-RNPDCVDESEHCSVWANAGECEKNPSYM
        K ++ + +  ++CAK GY VKP+KGDALLFF+LH N T DS S HGSCPV+EGEKWSAT+WIH+  +   + +   C+DE+  C  WA AGEC+KNP+YM
Subjt:  KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVW-RNPDCVDESEHCSVWANAGECEKNPSYM

Query:  VGSKGELGYCRESCKVCSS
        VGS  + GYCR+SCK CSS
Subjt:  VGSKGELGYCRESCKVCSS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase4.0e-5640.37Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE
        MDSR  LAFSLCFL + PL+  A NR    +  S+ T+  +V        S   DPTRV QLS  PR FLY+GFLS  +CDH I LA+  +EKSMVAD++
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDE

Query:  TGASVASED-----RTSSGM---------------------------------------------------FLDIAQVKL--------------------
        +G SV SED     R SS                                                     F D A ++L                    
Subjt:  TGASVASED-----RTSSGM---------------------------------------------------FLDIAQVKL--------------------

Query:  ----------SEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVW-RNPDCVDESEHCSVWANAGE
                  ++ + +  ++CAK GY VKP+KGDALLFF+LH N T DS S HGSCPV+EGEKWSAT+WIH+  +   + +   C+DE+  C  WA AGE
Subjt:  ----------SEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVW-RNPDCVDESEHCSVWANAGE

Query:  CEKNPSYMVGSKGELGYCRESCKVCSS
        C+KNP+YMVGS  + GYCR+SCK CSS
Subjt:  CEKNPSYMVGSKGELGYCRESCKVCSS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase5.4e-5342.31Show/hide
Query:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSM-VADDETGAS
        MDS++ LAFSL  L  F                      S +   S  +DPTR+ QLS  PRAFLYKGFLS  +CDH+I LA+  +EKSM VAD ++G S
Subjt:  MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSM-VADDETGAS

Query:  VASEDRTSSGMFLD------IAQVK--------LSEE-------------QKND----------------------------------------------
          SE RTSSGMFL       +A V+        L EE             QK D                                              
Subjt:  VASEDRTSSGMFLD------IAQVK--------LSEE-------------QKND----------------------------------------------

Query:  -----LSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKG
              S CAK GY VKP+KGDALLFF+LH N T D  S HGSCPVIEGEKWSAT+WIH+  + +  +   CVD+ E C  WA+AGECEKNP YMVGS+ 
Subjt:  -----LSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKG

Query:  ELGYCRESCKVC
         LG+CR+SCK C
Subjt:  ELGYCRESCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.5e-4738.2Show/hide
Query:  IDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFL------------------------------------
        ++P++V Q+SS+PRAF+Y+GFL+  +CDH+++LA+  +++S VAD+++G S  SE RTSSG F+                                    
Subjt:  IDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFL------------------------------------

Query:  ---------------------------------------------DIAQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPV
                                                     +I   ++  E K DLSDCAK G  VKP+KGDALLFF+LH +  PD  S HG CPV
Subjt:  ---------------------------------------------DIAQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPV

Query:  IEGEKWSATKWIHMLPYTE-VWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC
        IEGEKWSATKWIH+  +   V  + +C D +E C  WA  GEC KNP YMVG+    GYCR SCK C
Subjt:  IEGEKWSATKWIHMLPYTE-VWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGATTTTCTCTCGCATTTTCCCTCTGTTTCCTCTGCTCATTTCCTCTCCTGTTTCGCGCCACCAATCGCTTGCCCAAATTGGTCATAACCAGCACCATCAC
GAAAAAATCTGCCGTTGGCTTCGTCTCCGGTAAAATCGATCCCACTCGTGTAATTCAGCTTTCATCGCAACCCAGGGCTTTCTTGTATAAGGGATTTTTGTCTGCAGCGC
AGTGCGATCATATTATCAATTTGGCTAGGGATCATATGGAGAAATCAATGGTGGCTGATGACGAAACGGGTGCGAGTGTTGCGAGTGAAGATCGGACGAGTAGCGGCATG
TTTCTTGATATAGCTCAGGTTAAACTATCCGAGGAGCAGAAGAATGACTTGTCCGATTGTGCTAAGATCGGCTACGGAGTAAAACCAAAGAAGGGTGATGCTTTACTGTT
CTTCAGTCTCCATGCCAATTTGACACCAGACTCGACCAGCTTTCACGGAAGCTGTCCGGTGATAGAGGGCGAGAAGTGGTCTGCAACCAAATGGATTCACATGCTTCCAT
ACACTGAGGTTTGGAGGAATCCAGATTGTGTGGATGAGAGTGAGCACTGTAGTGTGTGGGCAAATGCAGGTGAGTGTGAAAAGAATCCAAGCTATATGGTGGGCTCAAAG
GGTGAGCTTGGATATTGTAGAGAGAGTTGCAAAGTGTGTTCTTCCCCCTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTCGATTTTCTCTCGCATTTTCCCTCTGTTTCCTCTGCTCATTTCCTCTCCTGTTTCGCGCCACCAATCGCTTGCCCAAATTGGTCATAACCAGCACCATCAC
GAAAAAATCTGCCGTTGGCTTCGTCTCCGGTAAAATCGATCCCACTCGTGTAATTCAGCTTTCATCGCAACCCAGGGCTTTCTTGTATAAGGGATTTTTGTCTGCAGCGC
AGTGCGATCATATTATCAATTTGGCTAGGGATCATATGGAGAAATCAATGGTGGCTGATGACGAAACGGGTGCGAGTGTTGCGAGTGAAGATCGGACGAGTAGCGGCATG
TTTCTTGATATAGCTCAGGTTAAACTATCCGAGGAGCAGAAGAATGACTTGTCCGATTGTGCTAAGATCGGCTACGGAGTAAAACCAAAGAAGGGTGATGCTTTACTGTT
CTTCAGTCTCCATGCCAATTTGACACCAGACTCGACCAGCTTTCACGGAAGCTGTCCGGTGATAGAGGGCGAGAAGTGGTCTGCAACCAAATGGATTCACATGCTTCCAT
ACACTGAGGTTTGGAGGAATCCAGATTGTGTGGATGAGAGTGAGCACTGTAGTGTGTGGGCAAATGCAGGTGAGTGTGAAAAGAATCCAAGCTATATGGTGGGCTCAAAG
GGTGAGCTTGGATATTGTAGAGAGAGTTGCAAAGTGTGTTCTTCCCCCTCATAG
Protein sequenceShow/hide protein sequence
MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGM
FLDIAQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVGSK
GELGYCRESCKVCSSPS