; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0023363 (gene) of Chayote v1 genome

Gene IDSed0023363
OrganismSechium edule (Chayote v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationLG05:3364056..3372712
RNA-Seq ExpressionSed0023363
SyntenySed0023363
Gene Ontology termsGO:0016042 - lipid catabolic process (biological process)
GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR002921 - Fungal lipase-like domain
IPR003582 - ShKT domain
IPR005592 - Mono-/di-acylglycerol lipase, N-terminal
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR029058 - Alpha/Beta hydrolase fold
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4271435.1 unnamed protein product [Prunus armeniaca]1.7e-18448.95Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSI+CG P+IECV CL C RW WKRCLHTAGHDSE WG ATAEEF+P+PR+CRYILAVYEDD+R PLW P GGYGI PDWL++KKTY DT+G+APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGK KFDGGYVHNGLLKAA WVLD E E L+DLV+KYP+YTLTF GHSLGSGVAA+LT+VV Q+ ++L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------
        IDR+R+R YAIAPARC+SLNLAVRYAD+INSVVLQ +                                                               
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------

Query:  -----MDSRLNFLLLFTTA---------------------------------------------------------------------------------
             +D R   ++L   A                                                                                 
Subjt:  -----MDSRLNFLLLFTTA---------------------------------------------------------------------------------

Query:  -------FSFSTCLAQSNLIS-GRKGLRDPLAN---IVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTS
                SFS+       ++  RK LR   AN    +   +S HS  IDPSR VQ+SWRPRVFLY+GFLSD EC+HL+SLA   E+         GNT+
Subjt:  -------FSFSTCLAQSNLIS-GRKGLRDPLAN---IVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTS

Query:  TVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFW
        T+      S +  LN  D+I++RIE RI+ WT LP + S  LQ+ R   EEAE     FGN S +  SEPL+ATV+LY+S+   GGE+LFPES+ +S  W
Subjt:  TVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFW

Query:  SDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSP
        SD  K ++IL+P KGNAILFF++  NASPDKSS H+R PVL+GE+W ATKF Y +    G  K + +S  ++C DED++CP WA+IGEC+RN VFM+GSP
Subjt:  SDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSP

Query:  DYYGTCRKSCNAC
        DYYGTCRKSCN C
Subjt:  DYYGTCRKSCNAC

CAB4301873.1 unnamed protein product [Prunus armeniaca]7.7e-18548.95Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSI+CG P+IECV CL C RW WKRCLHTAGHDSE WG ATAEEF+P+PR+CRYILAVYEDD+R PLW P GGYGI PDWL++KKTY DT+G+APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGK KFDGGYVHNGLLKAA WVLD E E L+DLV+KYP+YTLTF GHSLGSGVAA+LT+VV Q+ ++L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------
        IDR+R+R YAIAPARC+SLNLAVRYAD+INSVVLQ +                                                               
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------

Query:  -----MDSRLNFLLLFTTA---------------------------------------------------------------------------------
             +D R   ++L   A                                                                                 
Subjt:  -----MDSRLNFLLLFTTA---------------------------------------------------------------------------------

Query:  -------FSFSTCLAQSNLIS-GRKGLRDPLAN---IVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTS
                SFS+       ++  RK LR   AN    +   +S HS  IDPSR VQ+SWRPRVFLY+GFLSD EC+HL+SLA   E+         GNT+
Subjt:  -------FSFSTCLAQSNLIS-GRKGLRDPLAN---IVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTS

Query:  TVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFW
        T+      S +  LN  D+I++RIE RI+ WT LP + S  LQ+ R   EEAE     FGN S +  SEPL+ATV+LY+S+   GGE+LFPES+ +S  W
Subjt:  TVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFW

Query:  SDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSP
        SD  K ++IL+P KGNAILFF++  NASPDKSS H+R PVL+GE+W ATKF Y +    G  K + +S  ++C DED++CP WA+IGEC+RN VFM+GSP
Subjt:  SDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSP

Query:  DYYGTCRKSCNAC
        DYYGTCRKSCN C
Subjt:  DYYGTCRKSCNAC

KAF4351179.1 hypothetical protein F8388_024210 [Cannabis sativa]6.3e-17145.23Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSIICG+P++ECV CL CARW WKRCLHTAGHDSE+WG ATAEEF+P+PR+C YILAVYEDD+RHPLW P  GYGI+PDWL  KK+Y DT G+APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG+ KFDGGYVHNGLLKAA  VL  E++TL+ LV KYP+YTLTFAGHSLGSGVA +L ++  QN  +L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQP----------------------------------------------------------------
        IDRRRIRCYAIAPARCMSLNLAVRYAD+INSVVLQ                                                                 
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQP----------------------------------------------------------------

Query:  ----------------SMDSRLNFLLLFTTAFS-------------------------------------------------------------------
                         +D R   ++L   A S                                                                   
Subjt:  ----------------SMDSRLNFLLLFTTAFS-------------------------------------------------------------------

Query:  -----------------------------------------FSTC-LAQSN-----------------LISGRKGLRD---PLANIVALSYSNHSGTIDP
                                                 FS+  LA SN                   S RK LRD       ++    S HS  IDP
Subjt:  -----------------------------------------FSTC-LAQSN-----------------LISGRKGLRD---PLANIVALSYSNHSGTIDP

Query:  SRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEE
        SRVVQ+SWRPRVFLYEGFLSD EC+HLISL    +D        SGN  T+  + + SS       DD+++RIE RI+ WT LP +    LQI RY  E+
Subjt:  SRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEE

Query:  AEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRK---KNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVA
        +E  +  FGN S +  S+PL+ATVVLYLSD+ +GG+++FP+SK K   WSD  K    ++I++P KGNAILFF+++ N++ D SS H R PVL+GE+W A
Subjt:  AEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRK---KNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVA

Query:  TKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
         KFF ++ T       +  S+  DC DED++C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  TKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

KAF4353598.1 hypothetical protein F8388_017773 [Cannabis sativa]6.7e-17346.19Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSIICG+P++ECV CL CARW WKRCLHTAGHDSE+WG ATAEEF+P+PR+C YILAVYEDD+RHPLW P  GYGI+PDWL  KK+Y DT G+APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG+ KFDGGYVHNGLLKAA  VL  E++TL+ LV KYP+YTLTFAGHSLGSGVA +L ++  QN  +L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------
        IDRRRIRCYAIAPARCMSLNLAVRYAD+INSVVLQ +                                                               
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------

Query:  -----MDSRLNFLLLFTTAFS-------------------------------------------------------------------------------
             +D R   ++L   A S                                                                               
Subjt:  -----MDSRLNFLLLFTTAFS-------------------------------------------------------------------------------

Query:  -----------------------------FSTC-LAQSNL-------------ISGRKGLRD---PLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYE
                                     FS+  LA SN              +S RK LRD       ++    S HS  IDPSRVVQ+SWRPRVFLYE
Subjt:  -----------------------------FSTC-LAQSNL-------------ISGRKGLRD---PLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYE

Query:  GFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSS
        GFLSD EC+HLIS     ED        SGN  T+  + + SS       DD+++RIE RI+ WT LP +    LQI RY  E++E  +  FGN S +  
Subjt:  GFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSS

Query:  SEPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRK---KNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKH
        S+PL+ATVVLYLSD+ +GG+++FP+SK K   WSD  K    ++I++P KGNAILFF+++ N++ D SS H R PVL+GE+W A KFF ++ T       
Subjt:  SEPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRK---KNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKH

Query:  TVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +  S+ +DC DED++C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  TVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

RXH95088.1 hypothetical protein DVH24_024772 [Malus domestica]2.6e-16944.41Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSI+C  P++ECV CL C RW WKRCLHTAGHDSE WG +TAEEF+P+PR+CRYILAVYEDD+R PLW P GGYGI+PDWL++KKTY DT G APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH H DIVLA RGLN+A+ESDYAVL+DNKLG+ KFDGGYVHNGLLK+A WV+D E E L+DLV+ YP+YTLTFAGHSLGSGVAA+LT+VV +N ++L  
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------
        IDR+R+R YAIAPARCMSLNLAVRYAD+INSVVLQ                                                                 
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------

Query:  -------MDSRLNFLLL----------------------------------------------------FTTAFSFSTCLA-------------------
               +D R   ++L                                                    +  A   +  LA                   
Subjt:  -------MDSRLNFLLL----------------------------------------------------FTTAFSFSTCLA-------------------

Query:  ----------------QSNLISGRKGLRDPLANIVAL-----------------------------------SYSNHSGTIDPSRVVQVSWRPRVFLYEG
                        +S   + R+G    +A++ ++                                    +S HS  IDPSRVVQ+SW+PR      
Subjt:  ----------------QSNLISGRKGLRDPLANIVAL-----------------------------------SYSNHSGTIDPSRVVQVSWRPRVFLYEG

Query:  FLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSS
          SD EC+HL+SLAL  EDK        GNT+T+    + S    L+  D++++RIE RI+ WT LP + S  +Q+  +  EE +  +  FGN S +  +
Subjt:  FLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSS

Query:  EPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVES
        EPL+ATV+LYLS+   GGE+LFPES+  S   SD R+ ++ILRPVKGNAILFF++H NASPDKSS HTR PVL+GE+W ATKF + +       K + +S
Subjt:  EPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVES

Query:  NENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN
          ++C DED++CP+WA++GEC+RN VFM+GSPDYYGTCRKSCN
Subjt:  NENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN

TrEMBL top hitse value%identityAlignment
A0A498JHB5 Procollagen-proline 4-dioxygenase1.3e-16944.41Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSI+C  P++ECV CL C RW WKRCLHTAGHDSE WG +TAEEF+P+PR+CRYILAVYEDD+R PLW P GGYGI+PDWL++KKTY DT G APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH H DIVLA RGLN+A+ESDYAVL+DNKLG+ KFDGGYVHNGLLK+A WV+D E E L+DLV+ YP+YTLTFAGHSLGSGVAA+LT+VV +N ++L  
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------
        IDR+R+R YAIAPARCMSLNLAVRYAD+INSVVLQ                                                                 
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------

Query:  -------MDSRLNFLLL----------------------------------------------------FTTAFSFSTCLA-------------------
               +D R   ++L                                                    +  A   +  LA                   
Subjt:  -------MDSRLNFLLL----------------------------------------------------FTTAFSFSTCLA-------------------

Query:  ----------------QSNLISGRKGLRDPLANIVAL-----------------------------------SYSNHSGTIDPSRVVQVSWRPRVFLYEG
                        +S   + R+G    +A++ ++                                    +S HS  IDPSRVVQ+SW+PR      
Subjt:  ----------------QSNLISGRKGLRDPLANIVAL-----------------------------------SYSNHSGTIDPSRVVQVSWRPRVFLYEG

Query:  FLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSS
          SD EC+HL+SLAL  EDK        GNT+T+    + S    L+  D++++RIE RI+ WT LP + S  +Q+  +  EE +  +  FGN S +  +
Subjt:  FLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSS

Query:  EPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVES
        EPL+ATV+LYLS+   GGE+LFPES+  S   SD R+ ++ILRPVKGNAILFF++H NASPDKSS HTR PVL+GE+W ATKF + +       K + +S
Subjt:  EPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVES

Query:  NENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN
          ++C DED++CP+WA++GEC+RN VFM+GSPDYYGTCRKSCN
Subjt:  NENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN

A0A6J5U8N9 Procollagen-proline 4-dioxygenase8.3e-18548.95Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSI+CG P+IECV CL C RW WKRCLHTAGHDSE WG ATAEEF+P+PR+CRYILAVYEDD+R PLW P GGYGI PDWL++KKTY DT+G+APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGK KFDGGYVHNGLLKAA WVLD E E L+DLV+KYP+YTLTF GHSLGSGVAA+LT+VV Q+ ++L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------
        IDR+R+R YAIAPARC+SLNLAVRYAD+INSVVLQ +                                                               
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------

Query:  -----MDSRLNFLLLFTTA---------------------------------------------------------------------------------
             +D R   ++L   A                                                                                 
Subjt:  -----MDSRLNFLLLFTTA---------------------------------------------------------------------------------

Query:  -------FSFSTCLAQSNLIS-GRKGLRDPLAN---IVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTS
                SFS+       ++  RK LR   AN    +   +S HS  IDPSR VQ+SWRPRVFLY+GFLSD EC+HL+SLA   E+         GNT+
Subjt:  -------FSFSTCLAQSNLIS-GRKGLRDPLAN---IVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTS

Query:  TVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFW
        T+      S +  LN  D+I++RIE RI+ WT LP + S  LQ+ R   EEAE     FGN S +  SEPL+ATV+LY+S+   GGE+LFPES+ +S  W
Subjt:  TVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFW

Query:  SDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSP
        SD  K ++IL+P KGNAILFF++  NASPDKSS H+R PVL+GE+W ATKF Y +    G  K + +S  ++C DED++CP WA+IGEC+RN VFM+GSP
Subjt:  SDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSP

Query:  DYYGTCRKSCNAC
        DYYGTCRKSCN C
Subjt:  DYYGTCRKSCNAC

A0A6J5WND9 Procollagen-proline 4-dioxygenase3.7e-18548.95Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSI+CG P+IECV CL C RW WKRCLHTAGHDSE WG ATAEEF+P+PR+CRYILAVYEDD+R PLW P GGYGI PDWL++KKTY DT+G+APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGK KFDGGYVHNGLLKAA WVLD E E L+DLV+KYP+YTLTF GHSLGSGVAA+LT+VV Q+ ++L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------
        IDR+R+R YAIAPARC+SLNLAVRYAD+INSVVLQ +                                                               
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------

Query:  -----MDSRLNFLLLFTTA---------------------------------------------------------------------------------
             +D R   ++L   A                                                                                 
Subjt:  -----MDSRLNFLLLFTTA---------------------------------------------------------------------------------

Query:  -------FSFSTCLAQSNLIS-GRKGLRDPLAN---IVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTS
                SFS+       ++  RK LR   AN    +   +S HS  IDPSR VQ+SWRPRVFLY+GFLSD EC+HL+SLA   E+         GNT+
Subjt:  -------FSFSTCLAQSNLIS-GRKGLRDPLAN---IVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTS

Query:  TVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFW
        T+      S +  LN  D+I++RIE RI+ WT LP + S  LQ+ R   EEAE     FGN S +  SEPL+ATV+LY+S+   GGE+LFPES+ +S  W
Subjt:  TVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFW

Query:  SDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSP
        SD  K ++IL+P KGNAILFF++  NASPDKSS H+R PVL+GE+W ATKF Y +    G  K + +S  ++C DED++CP WA+IGEC+RN VFM+GSP
Subjt:  SDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSP

Query:  DYYGTCRKSCNAC
        DYYGTCRKSCN C
Subjt:  DYYGTCRKSCNAC

A0A7J6E0F0 Procollagen-proline 4-dioxygenase3.1e-17145.23Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSIICG+P++ECV CL CARW WKRCLHTAGHDSE+WG ATAEEF+P+PR+C YILAVYEDD+RHPLW P  GYGI+PDWL  KK+Y DT G+APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG+ KFDGGYVHNGLLKAA  VL  E++TL+ LV KYP+YTLTFAGHSLGSGVA +L ++  QN  +L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQP----------------------------------------------------------------
        IDRRRIRCYAIAPARCMSLNLAVRYAD+INSVVLQ                                                                 
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQP----------------------------------------------------------------

Query:  ----------------SMDSRLNFLLLFTTAFS-------------------------------------------------------------------
                         +D R   ++L   A S                                                                   
Subjt:  ----------------SMDSRLNFLLLFTTAFS-------------------------------------------------------------------

Query:  -----------------------------------------FSTC-LAQSN-----------------LISGRKGLRD---PLANIVALSYSNHSGTIDP
                                                 FS+  LA SN                   S RK LRD       ++    S HS  IDP
Subjt:  -----------------------------------------FSTC-LAQSN-----------------LISGRKGLRD---PLANIVALSYSNHSGTIDP

Query:  SRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEE
        SRVVQ+SWRPRVFLYEGFLSD EC+HLISL    +D        SGN  T+  + + SS       DD+++RIE RI+ WT LP +    LQI RY  E+
Subjt:  SRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEE

Query:  AEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRK---KNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVA
        +E  +  FGN S +  S+PL+ATVVLYLSD+ +GG+++FP+SK K   WSD  K    ++I++P KGNAILFF+++ N++ D SS H R PVL+GE+W A
Subjt:  AEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRK---KNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVA

Query:  TKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
         KFF ++ T       +  S+  DC DED++C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  TKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

A0A7J6E5B7 Procollagen-proline 4-dioxygenase3.3e-17346.19Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSIICG+P++ECV CL CARW WKRCLHTAGHDSE+WG ATAEEF+P+PR+C YILAVYEDD+RHPLW P  GYGI+PDWL  KK+Y DT G+APPYILY
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG+ KFDGGYVHNGLLKAA  VL  E++TL+ LV KYP+YTLTFAGHSLGSGVA +L ++  QN  +L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------
        IDRRRIRCYAIAPARCMSLNLAVRYAD+INSVVLQ +                                                               
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPS---------------------------------------------------------------

Query:  -----MDSRLNFLLLFTTAFS-------------------------------------------------------------------------------
             +D R   ++L   A S                                                                               
Subjt:  -----MDSRLNFLLLFTTAFS-------------------------------------------------------------------------------

Query:  -----------------------------FSTC-LAQSNL-------------ISGRKGLRD---PLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYE
                                     FS+  LA SN              +S RK LRD       ++    S HS  IDPSRVVQ+SWRPRVFLYE
Subjt:  -----------------------------FSTC-LAQSNL-------------ISGRKGLRD---PLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYE

Query:  GFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSS
        GFLSD EC+HLIS     ED        SGN  T+  + + SS       DD+++RIE RI+ WT LP +    LQI RY  E++E  +  FGN S +  
Subjt:  GFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSS

Query:  SEPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRK---KNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKH
        S+PL+ATVVLYLSD+ +GG+++FP+SK K   WSD  K    ++I++P KGNAILFF+++ N++ D SS H R PVL+GE+W A KFF ++ T       
Subjt:  SEPLMATVVLYLSDSASGGEMLFPESKAKSTFWSDRRK---KNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKH

Query:  TVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +  S+ +DC DED++C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  TVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 64.0e-5138.93Show/hide
Query:  SNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKP---SRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYS
        S+ S ++DP+R+ Q+SW PR FLY+GFLSD EC+HLI LA    +K    + +++G    S V T   +S   +    DDI+A +E ++A WT LP +  
Subjt:  SNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKP---SRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYS

Query:  MPLQIMRYRGEEA--EHKYIFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKS-----TFWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSS
          LQI+ Y   +    H   F +  A+      +ATV++YLS+   GGE +FP  K K+       WS   K+   ++P KG+A+LFF++HLN + D +S
Subjt:  MPLQIMRYRGEEA--EHKYIFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKS-----TFWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSS

Query:  YHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
         H   PV++GE W AT++ ++R  + G  K         C+D+  SC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  YHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 23.8e-4637.14Show/hide
Query:  SNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPL
        S+ S  I+PS+V QVS +PR F+YEGFL+D+EC+HLISLA  +  + S +       S V     +S   I    D I++ IE++++ WT LP +    L
Subjt:  SNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPL

Query:  QIMRY-RGEEAE-HKYIFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKS--------TFWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSS
        Q++RY  G++ + H   F +   ++     +ATV+LYLS+   GGE +FP+++  S           SD  KK   ++P KGNA+LFF++  +A PD  S
Subjt:  QIMRY-RGEEAE-HKYIFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKS--------TFWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSS

Query:  YHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
         H   PV++GE W ATK+ ++       +   + +++ +C D + SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  YHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 122.8e-5743.31Show/hide
Query:  FLLLFTTAFSFSTCLAQSNLISGRKGLRD----PLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNT
        FL+L  T  S S           RK LRD      ++    SY   S  +DP+RV+Q+SW PRVFLY GFLS+ EC+HLISL   + +  S ++A  G T
Subjt:  FLLLFTTAFSFSTCLAQSNLISGRKGLRD----PLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNT

Query:  STVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTF
                          D ++A IE +++ WT LP +    +++  Y  E++  K   FG   +    E L+ATVVLYLS++  GGE+LFP S+ K   
Subjt:  STVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTF

Query:  WSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGS
         +   +  NILRPVKGNAILFF+  LNAS D  S H R PV+ GEL VATK  Y +       K        +C DED +C +WA +GEC++N V+MIGS
Subjt:  WSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 75.5e-5337.61Show/hide
Query:  MDSRLNFLLLFTTAFSFSTCL---AQSNLISGRKGLRDPLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINA
        MDSR+   L F+  F F+  L   A +  ++     RD   +++ +  S  S   DP+RV Q+SW PRVFLYEGFLSD EC+H I LA    +K    + 
Subjt:  MDSRLNFLLLFTTAFSFSTCL---AQSNLISGRKGLRDPLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINA

Query:  GSGNTSTVLTEWLNSSRAILNS-TDDIIARIENRIAVWTLLPIDYSMPLQIMRY-RGEEAE-HKYIFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPE
         SG   +V +E   SS   L+   DDI++ +E ++A WT LP +    +QI+ Y  G++ E H   F + + +      +ATV++YLS+   GGE +FP 
Subjt:  GSGNTSTVLTEWLNSSRAILNS-TDDIIARIENRIAVWTLLPIDYSMPLQIMRY-RGEEAE-HKYIFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPE

Query:  SKAKST-----FWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIG
         K K+T      W++  K+   ++P KG+A+LFF++H NA+ D +S H   PV++GE W AT++ +++       K      ++ C+DE+ SC +WA  G
Subjt:  SKAKST-----FWSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIG

Query:  ECERNAVFMIGSPDYYGTCRKSCNACS
        EC++N  +M+GS   +G CRKSC ACS
Subjt:  ECERNAVFMIGSPDYYGTCRKSCNACS

Q8LAN3 Probable prolyl 4-hydroxylase 44.0e-4335.46Show/hide
Query:  SNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNT--STVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSM
        S+ S  ++PS+V QVS +PR F+YEGFL+++EC+H++SLA  S  + +  +  SG +  S V T   +S   I    D I++ IE++I+ WT LP +   
Subjt:  SNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNT--STVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSM

Query:  PLQIMRY-RGEEAE-HKYIFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKS--------TFWSDRRKKNNILRPVKGNAILFFSVHLNASPDK
         +Q++RY  G++ + H   F +   +      MAT+++YLS+   GGE +FP+++  S           SD  K+   ++P KG+A+LFF++H +A PD 
Subjt:  PLQIMRY-RGEEAE-HKYIFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKS--------TFWSDRRKKNNILRPVKGNAILFFSVHLNASPDK

Query:  SSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
         S H   PV++GE W ATK+ ++       +   + +   +C D + SC +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  SSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Arabidopsis top hitse value%identityAlignment
AT3G49050.1 alpha/beta-Hydrolases superfamily protein9.5e-10967.14Show/hide
Query:  MSIICG-VPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYIL
        MSI+CG  P++ECV CLGCARW +KRCL+TAGHDSE WG AT +EF+P+PR CRYILAVYEDDIR+PLW P  GYGI+PDWLL+KKTY DT+GRAP YIL
Subjt:  MSIICG-VPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYIL

Query:  YLDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLE
        YLDH H DIV+AIRGLN+AKESDYA+LLDNKLG+ KFDGGYVHNGL+K+AG+VLD E + L++LVKKYP YTLTFAGHSLGSGVA ML L+V ++ E+L 
Subjt:  YLDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLE

Query:  HIDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPSMDSRLNFLL--LFTTAFSFSTCLA----QSNLISGRKGLRDP
        +IDR+R+RC+AIAPARCMSLNLAVRYAD+INSV+LQ     R    L  +F + F     L     +   +  +K L+DP
Subjt:  HIDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPSMDSRLNFLL--LFTTAFSFSTCLA----QSNLISGRKGLRDP

AT4G00500.1 alpha/beta-Hydrolases superfamily protein6.4e-9765.96Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSI+C VP++ECV CLGC  W+WK+CL++AGH+SE+WG AT++EF+PIPR+CR ILAVYE+++  P+WAP  GYGIDP+ +++KK Y  T GR  PY++Y
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDHE+GD+VLAIRGLN+AKE DYAVLLDNKLG+ KFDGGYVHNGLLKAA WV + E+  LR+L++  P Y+LTF GHSLG+GV ++L L V QN  +L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQ
        I+R+RIRC+AIAP RCMSL+LAV YAD+INSVVLQ
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQ

AT4G00500.2 alpha/beta-Hydrolases superfamily protein6.4e-9765.96Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY
        MSI+C VP++ECV CLGC  W+WK+CL++AGH+SE+WG AT++EF+PIPR+CR ILAVYE+++  P+WAP  GYGIDP+ +++KK Y  T GR  PY++Y
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILY

Query:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH
        LDHE+GD+VLAIRGLN+AKE DYAVLLDNKLG+ KFDGGYVHNGLLKAA WV + E+  LR+L++  P Y+LTF GHSLG+GV ++L L V QN  +L +
Subjt:  LDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEH

Query:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQ
        I+R+RIRC+AIAP RCMSL+LAV YAD+INSVVLQ
Subjt:  IDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQ

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase2.0e-5843.31Show/hide
Query:  FLLLFTTAFSFSTCLAQSNLISGRKGLRD----PLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNT
        FL+L  T  S S           RK LRD      ++    SY   S  +DP+RV+Q+SW PRVFLY GFLS+ EC+HLISL   + +  S ++A  G T
Subjt:  FLLLFTTAFSFSTCLAQSNLISGRKGLRD----PLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPSRINAGSGNT

Query:  STVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTF
                          D ++A IE +++ WT LP +    +++  Y  E++  K   FG   +    E L+ATVVLYLS++  GGE+LFP S+ K   
Subjt:  STVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKY-IFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTF

Query:  WSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGS
         +   +  NILRPVKGNAILFF+  LNAS D  S H R PV+ GEL VATK  Y +       K        +C DED +C +WA +GEC++N V+MIGS
Subjt:  WSDRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

AT5G37710.1 alpha/beta-Hydrolases superfamily protein1.2e-7952.48Show/hide
Query:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAP-VGGYGIDPDWLLVKKTYRDTRGRAPPYIL
        MS+ CG   +ECV C+G +RW WKRC H    DS  W  AT EEF+PIPR+ R ILAVYE D+R+P  +P +G + ++P+W++ + T+  T+GR+PPYI+
Subjt:  MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAP-VGGYGIDPDWLLVKKTYRDTRGRAPPYIL

Query:  YLDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETL-RDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKL
        Y+DH+H +IVLAIRGLN+AKESDY +LLDNKLG+    GGYVH GLLK+A WVL+ E+ETL R   +   +Y L FAGHSLGSGVAA++ ++V      +
Subjt:  YLDHEHGDIVLAIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETL-RDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKL

Query:  EHIDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPSMDSRLNFLL--LFTTAFS-----FSTCLAQSNLISGRKGLRDP
          I R ++RC+A+APARCMSLNLAV+YAD+I+SV+LQ     R    L  +F + F      F  CL  + +  GRK LRDP
Subjt:  EHIDRRRIRCYAIAPARCMSLNLAVRYADIINSVVLQPSMDSRLNFLL--LFTTAFS-----FSTCLAQSNLISGRKGLRDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTATCATTTGTGGCGTACCTATAATCGAGTGTGTATGTTGCCTGGGATGTGCTCGTTGGGTCTGGAAACGCTGTCTCCACACAGCTGGTCATGACAGTGAGCATTG
GGGCTTTGCCACTGCCGAGGAGTTCAAGCCTATTCCTCGAGTTTGTCGATATATCCTAGCTGTGTATGAAGATGATATACGACACCCCCTTTGGGCACCGGTTGGTGGTT
ATGGAATCGATCCAGATTGGTTACTCGTGAAGAAAACATACAGAGATACACGAGGGCGGGCTCCCCCGTATATTTTATATCTCGATCACGAACATGGCGATATTGTTCTT
GCCATCAGGGGACTTAATATGGCCAAGGAGAGTGATTATGCAGTATTATTGGATAACAAGCTGGGAAAGATGAAATTTGATGGTGGATATGTTCACAATGGGCTTCTGAA
GGCAGCTGGGTGGGTTTTGGACACTGAGAATGAAACTTTACGGGATTTGGTGAAGAAGTATCCGGATTATACTTTGACGTTTGCAGGTCATTCTCTTGGCTCCGGAGTAG
CAGCCATGTTGACTTTGGTAGTAGCACAGAACAGCGAAAAATTGGAACATATCGATCGGAGGCGGATAAGGTGCTATGCTATTGCTCCTGCCAGGTGCATGTCCCTAAAT
TTGGCTGTTAGATATGCAGATATCATCAACTCTGTTGTTCTTCAGCCATCCATGGATTCTCGTCTCAACTTCTTGCTTCTTTTCACGACCGCATTTTCATTCTCAACCTG
CCTCGCACAAAGCAATTTGATCAGTGGCCGAAAGGGTTTACGGGATCCATTGGCGAACATTGTAGCTTTGAGCTACTCAAATCATTCTGGAACAATCGATCCTTCAAGAG
TTGTCCAAGTTTCTTGGCGACCAAGGGTTTTCTTGTATGAAGGTTTTCTCTCAGATGTGGAGTGTGAACACCTTATTTCTTTGGCTTTGAATTCGGAAGATAAACCTTCT
CGGATCAATGCTGGTTCCGGGAACACGAGCACTGTCTTGACCGAATGGCTAAACAGTTCAAGAGCTATTTTGAACTCAACAGATGATATCATTGCAAGGATTGAAAATCG
AATTGCAGTGTGGACTCTTCTCCCCATAGATTATAGCATGCCTTTACAGATTATGCGATACCGGGGTGAAGAAGCAGAGCATAAATACATTTTTGGCAACGGATCCGCAA
TGTCGTCGAGTGAGCCTTTGATGGCCACGGTAGTTTTGTATCTCTCAGATTCTGCTAGCGGTGGCGAGATGCTCTTTCCTGAATCAAAGGCAAAGAGCACATTTTGGTCA
GATCGAAGAAAGAAAAACAACATTCTGAGACCGGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTTCATCTCAATGCTTCTCCAGACAAGAGTAGCTACCATACTCGATC
TCCGGTACTCGACGGGGAATTGTGGGTCGCTACAAAATTCTTCTACTTAAGACAAACCACCACTGGGAGTACTAAACACACAGTTGAATCCAATGAAAATGACTGCATTG
ATGAAGATAACAGCTGCCCCCAATGGGCGGCCATTGGCGAATGCGAACGGAACGCTGTTTTCATGATCGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAAT
GCATGTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTATCATTTGTGGCGTACCTATAATCGAGTGTGTATGTTGCCTGGGATGTGCTCGTTGGGTCTGGAAACGCTGTCTCCACACAGCTGGTCATGACAGTGAGCATTG
GGGCTTTGCCACTGCCGAGGAGTTCAAGCCTATTCCTCGAGTTTGTCGATATATCCTAGCTGTGTATGAAGATGATATACGACACCCCCTTTGGGCACCGGTTGGTGGTT
ATGGAATCGATCCAGATTGGTTACTCGTGAAGAAAACATACAGAGATACACGAGGGCGGGCTCCCCCGTATATTTTATATCTCGATCACGAACATGGCGATATTGTTCTT
GCCATCAGGGGACTTAATATGGCCAAGGAGAGTGATTATGCAGTATTATTGGATAACAAGCTGGGAAAGATGAAATTTGATGGTGGATATGTTCACAATGGGCTTCTGAA
GGCAGCTGGGTGGGTTTTGGACACTGAGAATGAAACTTTACGGGATTTGGTGAAGAAGTATCCGGATTATACTTTGACGTTTGCAGGTCATTCTCTTGGCTCCGGAGTAG
CAGCCATGTTGACTTTGGTAGTAGCACAGAACAGCGAAAAATTGGAACATATCGATCGGAGGCGGATAAGGTGCTATGCTATTGCTCCTGCCAGGTGCATGTCCCTAAAT
TTGGCTGTTAGATATGCAGATATCATCAACTCTGTTGTTCTTCAGCCATCCATGGATTCTCGTCTCAACTTCTTGCTTCTTTTCACGACCGCATTTTCATTCTCAACCTG
CCTCGCACAAAGCAATTTGATCAGTGGCCGAAAGGGTTTACGGGATCCATTGGCGAACATTGTAGCTTTGAGCTACTCAAATCATTCTGGAACAATCGATCCTTCAAGAG
TTGTCCAAGTTTCTTGGCGACCAAGGGTTTTCTTGTATGAAGGTTTTCTCTCAGATGTGGAGTGTGAACACCTTATTTCTTTGGCTTTGAATTCGGAAGATAAACCTTCT
CGGATCAATGCTGGTTCCGGGAACACGAGCACTGTCTTGACCGAATGGCTAAACAGTTCAAGAGCTATTTTGAACTCAACAGATGATATCATTGCAAGGATTGAAAATCG
AATTGCAGTGTGGACTCTTCTCCCCATAGATTATAGCATGCCTTTACAGATTATGCGATACCGGGGTGAAGAAGCAGAGCATAAATACATTTTTGGCAACGGATCCGCAA
TGTCGTCGAGTGAGCCTTTGATGGCCACGGTAGTTTTGTATCTCTCAGATTCTGCTAGCGGTGGCGAGATGCTCTTTCCTGAATCAAAGGCAAAGAGCACATTTTGGTCA
GATCGAAGAAAGAAAAACAACATTCTGAGACCGGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTTCATCTCAATGCTTCTCCAGACAAGAGTAGCTACCATACTCGATC
TCCGGTACTCGACGGGGAATTGTGGGTCGCTACAAAATTCTTCTACTTAAGACAAACCACCACTGGGAGTACTAAACACACAGTTGAATCCAATGAAAATGACTGCATTG
ATGAAGATAACAGCTGCCCCCAATGGGCGGCCATTGGCGAATGCGAACGGAACGCTGTTTTCATGATCGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAAT
GCATGTTCATGA
Protein sequenceShow/hide protein sequence
MSIICGVPIIECVCCLGCARWVWKRCLHTAGHDSEHWGFATAEEFKPIPRVCRYILAVYEDDIRHPLWAPVGGYGIDPDWLLVKKTYRDTRGRAPPYILYLDHEHGDIVL
AIRGLNMAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDTENETLRDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVAQNSEKLEHIDRRRIRCYAIAPARCMSLN
LAVRYADIINSVVLQPSMDSRLNFLLLFTTAFSFSTCLAQSNLISGRKGLRDPLANIVALSYSNHSGTIDPSRVVQVSWRPRVFLYEGFLSDVECEHLISLALNSEDKPS
RINAGSGNTSTVLTEWLNSSRAILNSTDDIIARIENRIAVWTLLPIDYSMPLQIMRYRGEEAEHKYIFGNGSAMSSSEPLMATVVLYLSDSASGGEMLFPESKAKSTFWS
DRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPVLDGELWVATKFFYLRQTTTGSTKHTVESNENDCIDEDNSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN
ACS