; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS017764 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS017764
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationscaffold373:2326000..2334399
RNA-Seq ExpressionMS017764
SyntenyMS017764
Gene Ontology termsGO:0016042 - lipid catabolic process (biological process)
GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR002921 - Fungal lipase-like domain
IPR003582 - ShKT domain
IPR005592 - Mono-/di-acylglycerol lipase, N-terminal
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4271435.1 unnamed protein product [Prunus armeniaca]3.2e-18850.07Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RWA KRC H+A HDSETWG ATA+EF P+PR+CRYILAVYEDD++QPLWEP GGYGI PDWL++KKTY+DT+G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGKKKFDGGYVHNGLLKAA WVLD E E LKDLV KYP+YTLTF GHSLGSGVAA+LT+VVVQ+RD+L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ--------------------VNC------------------------------------------
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ                    + C                                          
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ--------------------VNC------------------------------------------

Query:  ------------LFLSYNLIS----------GRKGL-----RDQLIE-------------------------------SVPLSYSN--------------
                    + LS N  S           ++ L     +DQ++E                               +VP +YS               
Subjt:  ------------LFLSYNLIS----------GRKGL-----RDQLIE-------------------------------SVPLSYSN--------------

Query:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV
                                                    HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+SLA   E+       D GNT 
Subjt:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV

Query:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD
          ++  S    LN  D+I++RIE RI+ WTFLPK+ S  LQ+ + G EEAE     FGN+S +  SEPL+ATV+LY+S+   GGE+ FPES+++S  WSD
Subjt:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD

Query:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          K ++IL+P KGNA+L F++  NASPDKSS H+R P+L+GE+W ATKF Y + I G K  +D    +C DED +CP WA+IGEC+RN VFM+GSPDYYG
Subjt:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCNAC
        TCRKSCN C
Subjt:  TCRKSCNAC

CAB4301873.1 unnamed protein product [Prunus armeniaca]6.4e-18950.21Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RWA KRC H+A HDSETWG ATA+EF P+PR+CRYILAVYEDD++QPLWEP GGYGI PDWL++KKTY+DT+G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGKKKFDGGYVHNGLLKAA WVLD E E LKDLV KYP+YTLTF GHSLGSGVAA+LT+VVVQ+RD+L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ--------------------VNC------------------------------------------
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ                    + C                                          
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ--------------------VNC------------------------------------------

Query:  ------------LFLSYNLIS----------GRKGL-----RDQLIE-------------------------------SVPLSYSN--------------
                    + LS N  S           ++ L     +DQ++E                               +VP +YS               
Subjt:  ------------LFLSYNLIS----------GRKGL-----RDQLIE-------------------------------SVPLSYSN--------------

Query:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV
                                                    HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+SLA   E+       D GNT 
Subjt:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV

Query:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD
          ++ KS    LN  D+I++RIE RI+ WTFLPK+ S  LQ+ + G EEAE     FGN+S +  SEPL+ATV+LY+S+   GGE+ FPES+++S  WSD
Subjt:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD

Query:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          K ++IL+P KGNA+L F++  NASPDKSS H+R P+L+GE+W ATKF Y + I G K   D    +C DED +CP WA+IGEC+RN VFM+GSPDYYG
Subjt:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCNAC
        TCRKSCN C
Subjt:  TCRKSCNAC

KAF4351179.1 hypothetical protein F8388_024210 [Cannabis sativa]1.3e-17045.03Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSIICGIP+LECV CL CARWA KRC H+A HDSE WG ATA+EF P+PR+C YILAVYEDD++ PLWEP  GYGINPDWL  KK+Y+DT G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG++KFDGGYVHNGLLKAA  VL  E++ LK LV KYP+YTLTFAGHSLGSGVA +L ++ VQNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------------------------------------------------------
        IDR+RIRCYAIAPARCMSLNLAVRYADVINSVVLQ                                                                 
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------------------------------------------------------

Query:  --------------------------VNC-----------------------------------------------------------------------
                                  ++C                                                                       
Subjt:  --------------------------VNC-----------------------------------------------------------------------

Query:  --------------------------------------------------------------LFLSYNLISGRKGLRDQLIES---VPLSYSNHSGRIDP
                                                                      L+LS    S RK LRD+  +    +    S HS RIDP
Subjt:  --------------------------------------------------------------LFLSYNLISGRKGLRDQLIES---VPLSYSNHSGRIDP

Query:  SRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE
        SRVVQ+SWRPRVFLY+GFLSDEECDHLISL +  +D        SGNT+  K++KSS       DD+++RIE RI+ WTFLPK+    LQI +Y  E++E
Subjt:  SRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE

Query:  HKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRK---KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATK
          +  FGN S +  S+PL+ATVVLYLSD+ +GG++ FP+SKVK   WSD  K    ++I++P KGNA+L F+++ N++ D SSSH R P+L+GE+W A K
Subjt:  HKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRK---KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATK

Query:  FFYLRPITGNKHTDEPD----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        FF ++ IT  +     +     DC DED +C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  FFYLRPITGNKHTDEPD----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

KAF4353598.1 hypothetical protein F8388_017773 [Cannabis sativa]9.3e-17246.26Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSIICGIP+LECV CL CARWA KRC H+A HDSE WG ATA+EF P+PR+C YILAVYEDD++ PLWEP  GYGINPDWL  KK+Y+DT G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG++KFDGGYVHNGLLKAA  VL  E++ LK LV KYP+YTLTFAGHSLGSGVA +L ++ VQNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------VNCL--------------------------------------------
        IDR+RIRCYAIAPARCMSLNLAVRYADVINSVVLQ                 + CL                                            
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------VNCL--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------FLSYNL------------------ISGRKGLRDQLIES---VPLSYSNHSGRIDPSRVVQVSWRPRVFLYK
                                     F S NL                  +S RK LRD+  +    +    S HS RIDPSRVVQ+SWRPRVFLY+
Subjt:  -----------------------------FLSYNL------------------ISGRKGLRDQLIES---VPLSYSNHSGRIDPSRVVQVSWRPRVFLYK

Query:  GFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSE
        GFLSDEECDHLIS  +  ED        SGNT+  K++KSS       DD+++RIE RI+ WTFLPK+    LQI +Y  E++E  +  FGN S +  S+
Subjt:  GFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSE

Query:  PLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRK---KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEP
        PL+ATVVLYLSD+ +GG++ FP+SKVK   WSD  K    ++I++P KGNA+L F+++ N++ D SSSH R P+L+GE+W A KFF ++ IT  +     
Subjt:  PLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRK---KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEP

Query:  D----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +     DC DED +C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  D----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

RXH95088.1 hypothetical protein DVH24_024772 [Malus domestica]1.3e-17646.01Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSI+C  P+LECV CL C RWA KRC H+A HDSETWG +TA+EF P+PR+CRYILAVYEDD++ PLWEP GGYGINPDWL++KKTY+DT G APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDHNH DIVLA RGLN+A+ESDYAVL+DNKLG++KFDGGYVHNGLLK+A WV+D E EILKDLV  YP+YTLTFAGHSLGSGVAA+LT+VVV+NRD+L +
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-------------------VNCL------------------------------------------
        IDRKR+R YAIAPARCMSLNLAVRYADVINSVVLQ                   + CL                                          
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-------------------VNCL------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------FLSYNLISGRKGLR-DQLIES--VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG
                                                      F S +    RK LR +Q I+   +   +S HS RIDPSRVVQ+SW+PR      
Subjt:  ----------------------------------------------FLSYNLISGRKGLR-DQLIES--VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG

Query:  FLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEP
          SDEECDHL+SLA   EDK      + GNT   +++KS    L+  D++++RIE RI+ WTFLPK+ S  +Q+  +G EE +  +  FGN+S +  +EP
Subjt:  FLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEP

Query:  LMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGD
        L+ATV+LYLS+   GGE+ FPES++ S+  SD R+ ++ILRPVKGNA+L F++H NASPDKSS HTR P+L+GE+W ATKF + + I G K  +D    +
Subjt:  LMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGD

Query:  CNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN
        C DED +CP+WA++GEC+RN VFM+GSPDYYGTCRKSCN
Subjt:  CNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN

TrEMBL top hitse value%identityAlignment
A0A498JHB5 Procollagen-proline 4-dioxygenase6.1e-17746.01Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSI+C  P+LECV CL C RWA KRC H+A HDSETWG +TA+EF P+PR+CRYILAVYEDD++ PLWEP GGYGINPDWL++KKTY+DT G APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDHNH DIVLA RGLN+A+ESDYAVL+DNKLG++KFDGGYVHNGLLK+A WV+D E EILKDLV  YP+YTLTFAGHSLGSGVAA+LT+VVV+NRD+L +
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-------------------VNCL------------------------------------------
        IDRKR+R YAIAPARCMSLNLAVRYADVINSVVLQ                   + CL                                          
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-------------------VNCL------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------FLSYNLISGRKGLR-DQLIES--VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG
                                                      F S +    RK LR +Q I+   +   +S HS RIDPSRVVQ+SW+PR      
Subjt:  ----------------------------------------------FLSYNLISGRKGLR-DQLIES--VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG

Query:  FLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEP
          SDEECDHL+SLA   EDK      + GNT   +++KS    L+  D++++RIE RI+ WTFLPK+ S  +Q+  +G EE +  +  FGN+S +  +EP
Subjt:  FLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEP

Query:  LMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGD
        L+ATV+LYLS+   GGE+ FPES++ S+  SD R+ ++ILRPVKGNA+L F++H NASPDKSS HTR P+L+GE+W ATKF + + I G K  +D    +
Subjt:  LMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGD

Query:  CNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN
        C DED +CP+WA++GEC+RN VFM+GSPDYYGTCRKSCN
Subjt:  CNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN

A0A6J5U8N9 Procollagen-proline 4-dioxygenase1.5e-18850.07Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RWA KRC H+A HDSETWG ATA+EF P+PR+CRYILAVYEDD++QPLWEP GGYGI PDWL++KKTY+DT+G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGKKKFDGGYVHNGLLKAA WVLD E E LKDLV KYP+YTLTF GHSLGSGVAA+LT+VVVQ+RD+L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ--------------------VNC------------------------------------------
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ                    + C                                          
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ--------------------VNC------------------------------------------

Query:  ------------LFLSYNLIS----------GRKGL-----RDQLIE-------------------------------SVPLSYSN--------------
                    + LS N  S           ++ L     +DQ++E                               +VP +YS               
Subjt:  ------------LFLSYNLIS----------GRKGL-----RDQLIE-------------------------------SVPLSYSN--------------

Query:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV
                                                    HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+SLA   E+       D GNT 
Subjt:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV

Query:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD
          ++  S    LN  D+I++RIE RI+ WTFLPK+ S  LQ+ + G EEAE     FGN+S +  SEPL+ATV+LY+S+   GGE+ FPES+++S  WSD
Subjt:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD

Query:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          K ++IL+P KGNA+L F++  NASPDKSS H+R P+L+GE+W ATKF Y + I G K  +D    +C DED +CP WA+IGEC+RN VFM+GSPDYYG
Subjt:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCNAC
        TCRKSCN C
Subjt:  TCRKSCNAC

A0A6J5WND9 Procollagen-proline 4-dioxygenase3.1e-18950.21Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RWA KRC H+A HDSETWG ATA+EF P+PR+CRYILAVYEDD++QPLWEP GGYGI PDWL++KKTY+DT+G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGKKKFDGGYVHNGLLKAA WVLD E E LKDLV KYP+YTLTF GHSLGSGVAA+LT+VVVQ+RD+L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ--------------------VNC------------------------------------------
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ                    + C                                          
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ--------------------VNC------------------------------------------

Query:  ------------LFLSYNLIS----------GRKGL-----RDQLIE-------------------------------SVPLSYSN--------------
                    + LS N  S           ++ L     +DQ++E                               +VP +YS               
Subjt:  ------------LFLSYNLIS----------GRKGL-----RDQLIE-------------------------------SVPLSYSN--------------

Query:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV
                                                    HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+SLA   E+       D GNT 
Subjt:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV

Query:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD
          ++ KS    LN  D+I++RIE RI+ WTFLPK+ S  LQ+ + G EEAE     FGN+S +  SEPL+ATV+LY+S+   GGE+ FPES+++S  WSD
Subjt:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD

Query:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          K ++IL+P KGNA+L F++  NASPDKSS H+R P+L+GE+W ATKF Y + I G K   D    +C DED +CP WA+IGEC+RN VFM+GSPDYYG
Subjt:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCNAC
        TCRKSCN C
Subjt:  TCRKSCNAC

A0A7J6E0F0 Procollagen-proline 4-dioxygenase6.5e-17145.03Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSIICGIP+LECV CL CARWA KRC H+A HDSE WG ATA+EF P+PR+C YILAVYEDD++ PLWEP  GYGINPDWL  KK+Y+DT G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG++KFDGGYVHNGLLKAA  VL  E++ LK LV KYP+YTLTFAGHSLGSGVA +L ++ VQNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------------------------------------------------------
        IDR+RIRCYAIAPARCMSLNLAVRYADVINSVVLQ                                                                 
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------------------------------------------------------

Query:  --------------------------VNC-----------------------------------------------------------------------
                                  ++C                                                                       
Subjt:  --------------------------VNC-----------------------------------------------------------------------

Query:  --------------------------------------------------------------LFLSYNLISGRKGLRDQLIES---VPLSYSNHSGRIDP
                                                                      L+LS    S RK LRD+  +    +    S HS RIDP
Subjt:  --------------------------------------------------------------LFLSYNLISGRKGLRDQLIES---VPLSYSNHSGRIDP

Query:  SRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE
        SRVVQ+SWRPRVFLY+GFLSDEECDHLISL +  +D        SGNT+  K++KSS       DD+++RIE RI+ WTFLPK+    LQI +Y  E++E
Subjt:  SRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE

Query:  HKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRK---KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATK
          +  FGN S +  S+PL+ATVVLYLSD+ +GG++ FP+SKVK   WSD  K    ++I++P KGNA+L F+++ N++ D SSSH R P+L+GE+W A K
Subjt:  HKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRK---KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATK

Query:  FFYLRPITGNKHTDEPD----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        FF ++ IT  +     +     DC DED +C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  FFYLRPITGNKHTDEPD----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

A0A7J6E5B7 Procollagen-proline 4-dioxygenase4.5e-17246.26Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSIICGIP+LECV CL CARWA KRC H+A HDSE WG ATA+EF P+PR+C YILAVYEDD++ PLWEP  GYGINPDWL  KK+Y+DT G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG++KFDGGYVHNGLLKAA  VL  E++ LK LV KYP+YTLTFAGHSLGSGVA +L ++ VQNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------VNCL--------------------------------------------
        IDR+RIRCYAIAPARCMSLNLAVRYADVINSVVLQ                 + CL                                            
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------VNCL--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------FLSYNL------------------ISGRKGLRDQLIES---VPLSYSNHSGRIDPSRVVQVSWRPRVFLYK
                                     F S NL                  +S RK LRD+  +    +    S HS RIDPSRVVQ+SWRPRVFLY+
Subjt:  -----------------------------FLSYNL------------------ISGRKGLRDQLIES---VPLSYSNHSGRIDPSRVVQVSWRPRVFLYK

Query:  GFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSE
        GFLSDEECDHLIS  +  ED        SGNT+  K++KSS       DD+++RIE RI+ WTFLPK+    LQI +Y  E++E  +  FGN S +  S+
Subjt:  GFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSE

Query:  PLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRK---KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEP
        PL+ATVVLYLSD+ +GG++ FP+SKVK   WSD  K    ++I++P KGNA+L F+++ N++ D SSSH R P+L+GE+W A KFF ++ IT  +     
Subjt:  PLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRK---KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEP

Query:  D----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +     DC DED +C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  D----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.6e-5441.61Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPS-GNSTDSGNTVPTKILKSSGAIL-NTTDDIIARIENRIAVWTFLPKDYSMPL
        S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    +K       DSG +  +++  SSG  L    DDI+A +E ++A WTFLP++    L
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPS-GNSTDSGNTVPTKILKSSGAIL-NTTDDIIARIENRIAVWTFLPKDYSMPL

Query:  QILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESK-----VKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHT
        QIL Y  G +   H   F ++ A+      +ATV++YLS+   GGE  FP  K     +K   WS   K+   ++P KG+A+L F++HLN + D +S H 
Subjt:  QILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESK-----VKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHT

Query:  RSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
          P+++GE W AT++ ++R     K        C D+ +SC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  RSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 22.5e-5038.25Show/hide
Query:  LIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNT-TDDIIARIENRIAVWTFL
        L++S     S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +  + +    D+G +  + +  SSG  ++   D I++ IE++++ WTFL
Subjt:  LIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNT-TDDIIARIENRIAVWTFL

Query:  PKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRKKNNILRPVKGNAVLIFSVHL
        PK+    LQ+L+Y  G +   H   F ++  +      +ATV+LYLS+   GGE  FP+++  SR          SD  KK   ++P KGNA+L F++  
Subjt:  PKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRKKNNILRPVKGNAVLIFSVHL

Query:  NASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +A PD  S H   P+++GE W ATK+ +   +         DG+C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  NASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 121.3e-6245.45Show/hide
Query:  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARI
        RK LRD+ I S       SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S ++   G T                D ++A I
Subjt:  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARI

Query:  ENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVH
        E +++ WTFLP +    +++  Y  E++  K   FG   + +  E L+ATVVLYLS++  GGE+ FP S++K +  +   +  NILRPVKGNA+L F+  
Subjt:  ENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVH

Query:  LNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        LNAS D  S+H R P++ GEL +ATK  Y +     +   E  G+C+DED++C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Subjt:  LNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 76.3e-5438.85Show/hide
Query:  VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILN-TTDDIIARIENRIAVWTFLPKDY
        + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA    +K      DSG +V +++  SSG  L+   DDI++ +E ++A WTFLP++ 
Subjt:  VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILN-TTDDIIARIENRIAVWTFLPKDY

Query:  SMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFP-----ESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKS
           +QIL Y  G +   H   F +++ +      +ATV++YLS+   GGE  FP      +++K   W++  K+   ++P KG+A+L F++H NA+ D +
Subjt:  SMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFP-----ESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKS

Query:  SSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        S H   P+++GE W AT++ +++     +        C DE+ SC +WA  GEC++N  +M+GS   +G CRKSC AC
Subjt:  SSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 43.2e-5036.49Show/hide
Query:  LIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNT-TDDIIARIENRIAVWTFL
        L++S     S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA +S  + +    DSG +  +++  SSG  ++   D I++ IE++I+ WTFL
Subjt:  LIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNT-TDDIIARIENRIAVWTFL

Query:  PKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRKKNNILRPVKGNAVLIFSVHL
        PK+    +Q+L+Y  G +   H   F ++  ++     MAT+++YLS+   GGE  FP++++ SR          SD  K+   ++P KG+A+L F++H 
Subjt:  PKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRKKNNILRPVKGNAVLIFSVHL

Query:  NASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +A PD  S H   P+++GE W ATK+ +   +        P G+C D ++SC +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  NASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Arabidopsis top hitse value%identityAlignment
AT3G49050.1 alpha/beta-Hydrolases superfamily protein4.1e-10977.12Show/hide
Query:  MSIICG-IPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYIL
        MSI+CG  P+LECV CLGCARW  KRC ++A HDSE WG AT DEF P+PR CRYILAVYEDDI+ PLWEP  GYGINPDWLL+KKTY+DT+GRAP YIL
Subjt:  MSIICG-IPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYIL

Query:  YLDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLE
        YLDH H DIV+AIRGLN+AKESDYA+LLDNKLG++KFDGGYVHNGL+K+AG+VLD E ++LK+LV KYP YTLTFAGHSLGSGVA ML L+VV++ ++L 
Subjt:  YLDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLE

Query:  NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ
        NIDRKR+RC+AIAPARCMSLNLAVRYADVINSV+LQ
Subjt:  NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ

AT4G00500.1 alpha/beta-Hydrolases superfamily protein1.7e-9465.96Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSI+C +P+LECV CLGC  W  K+C +SA H+SE WG AT+DEF PIPRICR ILAVYE+++  P+W P  GYGI+P+ +++KK Y  T GR  PY++Y
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH +GD+VLAIRGLN+AKE DYAVLLDNKLG+ KFDGGYVHNGLLKAA WV + E+ +L++L+   P Y+LTF GHSLG+GV ++L L V+QNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ
        I+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQ
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ

AT4G00500.2 alpha/beta-Hydrolases superfamily protein1.7e-9465.96Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY
        MSI+C +P+LECV CLGC  W  K+C +SA H+SE WG AT+DEF PIPRICR ILAVYE+++  P+W P  GYGI+P+ +++KK Y  T GR  PY++Y
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH +GD+VLAIRGLN+AKE DYAVLLDNKLG+ KFDGGYVHNGLLKAA WV + E+ +L++L+   P Y+LTF GHSLG+GV ++L L V+QNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ
        I+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQ
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase9.0e-6445.45Show/hide
Query:  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARI
        RK LRD+ I S       SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S ++   G T                D ++A I
Subjt:  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARI

Query:  ENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVH
        E +++ WTFLP +    +++  Y  E++  K   FG   + +  E L+ATVVLYLS++  GGE+ FP S++K +  +   +  NILRPVKGNA+L F+  
Subjt:  ENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVH

Query:  LNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        LNAS D  S+H R P++ GEL +ATK  Y +     +   E  G+C+DED++C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Subjt:  LNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT5G37710.1 alpha/beta-Hydrolases superfamily protein4.6e-7657.81Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPA-GGYGINPDWLLIKKTYKDTRGRAPPYIL
        MS+ CG   LECV C+G +RWA KRC H    DS TW  AT +EF PIPRI R ILAVYE D++ P   P+ G + +NP+W++ + T++ T+GR+PPYI+
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPA-GGYGINPDWLLIKKTYKDTRGRAPPYIL

Query:  YLDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEIL-KDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKL
        Y+DH+H +IVLAIRGLN+AKESDY +LLDNKLG+K   GGYVH GLLK+A WVL+ E+E L +       +Y L FAGHSLGSGVAA++ ++VV     +
Subjt:  YLDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEIL-KDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKL

Query:  ENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ
         +I R ++RC+A+APARCMSLNLAV+YADVI+SV+LQ
Subjt:  ENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTATCATATGTGGCATTCCTATTCTTGAGTGTGTATGCTGTCTGGGATGTGCTCGTTGGGCCTGTAAACGCTGTTTTCACTCAGCCGTTCATGACAGTGAAACTTG
GGGCTTTGCCACCGCTGATGAGTTCGGGCCTATTCCCCGAATTTGTCGGTATATCCTAGCTGTGTATGAAGATGATATTCAACAACCCCTTTGGGAACCGGCTGGTGGTT
ATGGAATCAATCCAGATTGGTTGCTCATTAAAAAAACATATAAGGATACTCGAGGACGTGCGCCTCCGTATATTTTATACCTTGATCACAATCATGGGGACATTGTTCTT
GCCATCAGGGGACTTAATATGGCAAAGGAGAGTGATTATGCAGTTTTATTGGACAACAAGCTGGGGAAGAAGAAATTTGACGGTGGATATGTTCACAATGGGCTTCTGAA
GGCAGCTGGGTGGGTTTTGGACACTGAGAACGAAATTTTAAAGGATTTGGTGAGCAAATATCCGGATTATACATTGACGTTTGCAGGGCATTCCCTTGGCTCCGGAGTAG
CAGCCATGTTAACTCTGGTAGTAGTTCAGAATCGCGATAAATTGGAAAATATTGATCGGAAGAGGATAAGGTGCTATGCGATTGCTCCTGCCAGGTGCATGTCCCTAAAT
TTGGCTGTTAGATATGCAGATGTGATCAACTCTGTTGTTCTTCAGGTAAATTGTTTATTTCTTTCATACAATTTGATTAGTGGGCGCAAGGGTTTAAGGGACCAATTGAT
CGAAAGTGTACCTTTGAGCTACTCTAATCATTCTGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGATTTCTCTCAG
ATGAGGAGTGTGATCACCTTATTTCTTTGGCTACAAGTTCAGAAGATAAACCTTCTGGGAACAGTACTGACTCTGGGAACACTGTCCCAACCAAAATTCTAAAGAGTTCA
GGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGATCGAGAATCGAATTGCTGTGTGGACTTTTCTTCCAAAAGATTATAGCATGCCTTTGCAGATTTTGCAATA
TGGGGGTGAAGAAGCAGAGCATAAGTACGTTTTTGGTAACAGATCTGCAATGTTGTCCAGTGAGCCTTTGATGGCCACAGTAGTTCTGTATCTCTCAGATTCTGCTAGCG
GTGGCGAGATGCGCTTTCCTGAATCAAAGGTAAAGAGCAGATTTTGGTCAGACCGGAGAAAGAAAAACAACATTCTGAGACCAGTGAAAGGCAATGCAGTTCTTATTTTC
TCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTCCCATACCCGATCTCCGATACTCGATGGGGAATTGTGGATTGCAACAAAATTCTTCTACTTAAGACCAATCAC
TGGGAATAAACACACAGACGAACCTGATGGAGACTGTAATGATGAAGATAAAAGCTGCCCCCAATGGGCTGCCATTGGCGAATGCGAACGAAACGCTGTTTTCATGATTG
GTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAACGCATGT
mRNA sequenceShow/hide mRNA sequence
ATGTCTATCATATGTGGCATTCCTATTCTTGAGTGTGTATGCTGTCTGGGATGTGCTCGTTGGGCCTGTAAACGCTGTTTTCACTCAGCCGTTCATGACAGTGAAACTTG
GGGCTTTGCCACCGCTGATGAGTTCGGGCCTATTCCCCGAATTTGTCGGTATATCCTAGCTGTGTATGAAGATGATATTCAACAACCCCTTTGGGAACCGGCTGGTGGTT
ATGGAATCAATCCAGATTGGTTGCTCATTAAAAAAACATATAAGGATACTCGAGGACGTGCGCCTCCGTATATTTTATACCTTGATCACAATCATGGGGACATTGTTCTT
GCCATCAGGGGACTTAATATGGCAAAGGAGAGTGATTATGCAGTTTTATTGGACAACAAGCTGGGGAAGAAGAAATTTGACGGTGGATATGTTCACAATGGGCTTCTGAA
GGCAGCTGGGTGGGTTTTGGACACTGAGAACGAAATTTTAAAGGATTTGGTGAGCAAATATCCGGATTATACATTGACGTTTGCAGGGCATTCCCTTGGCTCCGGAGTAG
CAGCCATGTTAACTCTGGTAGTAGTTCAGAATCGCGATAAATTGGAAAATATTGATCGGAAGAGGATAAGGTGCTATGCGATTGCTCCTGCCAGGTGCATGTCCCTAAAT
TTGGCTGTTAGATATGCAGATGTGATCAACTCTGTTGTTCTTCAGGTAAATTGTTTATTTCTTTCATACAATTTGATTAGTGGGCGCAAGGGTTTAAGGGACCAATTGAT
CGAAAGTGTACCTTTGAGCTACTCTAATCATTCTGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGATTTCTCTCAG
ATGAGGAGTGTGATCACCTTATTTCTTTGGCTACAAGTTCAGAAGATAAACCTTCTGGGAACAGTACTGACTCTGGGAACACTGTCCCAACCAAAATTCTAAAGAGTTCA
GGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGATCGAGAATCGAATTGCTGTGTGGACTTTTCTTCCAAAAGATTATAGCATGCCTTTGCAGATTTTGCAATA
TGGGGGTGAAGAAGCAGAGCATAAGTACGTTTTTGGTAACAGATCTGCAATGTTGTCCAGTGAGCCTTTGATGGCCACAGTAGTTCTGTATCTCTCAGATTCTGCTAGCG
GTGGCGAGATGCGCTTTCCTGAATCAAAGGTAAAGAGCAGATTTTGGTCAGACCGGAGAAAGAAAAACAACATTCTGAGACCAGTGAAAGGCAATGCAGTTCTTATTTTC
TCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTCCCATACCCGATCTCCGATACTCGATGGGGAATTGTGGATTGCAACAAAATTCTTCTACTTAAGACCAATCAC
TGGGAATAAACACACAGACGAACCTGATGGAGACTGTAATGATGAAGATAAAAGCTGCCCCCAATGGGCTGCCATTGGCGAATGCGAACGAAACGCTGTTTTCATGATTG
GTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAACGCATGT
Protein sequenceShow/hide protein sequence
MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKTYKDTRGRAPPYILYLDHNHGDIVL
AIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLENIDRKRIRCYAIAPARCMSLN
LAVRYADVINSVVLQVNCLFLSYNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSS
GAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIF
SVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC