; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g1898 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g1898
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationMC06:26398570..26407926
RNA-Seq ExpressionMC06g1898
SyntenyMC06g1898
Gene Ontology termsGO:0016042 - lipid catabolic process (biological process)
GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR002921 - Fungal lipase-like domain
IPR003582 - ShKT domain
IPR005592 - Mono-/di-acylglycerol lipase, N-terminal
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4271435.1 unnamed protein product [Prunus armeniaca]1.53e-23150.78Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RWA KRC H+A HDSETWG ATA+EF P+PR+CRYILAVYEDD++QPLWEP GGYGI PDWL++KK+Y+DT+G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGKKKFDGGYVHNGLLKAA WVLD E E LKDLV KYP+YTLTF GHSLGSGVAA+LT+VVVQ+RD+L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLL--------------------------------------------LL
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ +            LP LL                                            ++
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLL--------------------------------------------LL

Query:  ATAISF--------LSCLAQSNLI------SGRKGLR-----DQLIE-------------------------------SVPLSYSN--------------
         TA+          LSC A S+          ++ L+     DQ++E                               +VP +YS               
Subjt:  ATAISF--------LSCLAQSNLI------SGRKGLR-----DQLIE-------------------------------SVPLSYSN--------------

Query:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV
                                                    HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+SLA   E+       D GNT 
Subjt:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV

Query:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD
          ++  S    LN  D+I++RIE RI+ WTFLPK+ S  LQ+ + G EEAE     FGN+S +  SEPL+ATV+LY+S+   GGE+ FPES+++S  WSD
Subjt:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD

Query:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          K ++IL+P KGNA+L F++  NASPDKSS H+R P+L+GE+W ATKF Y + I G K  +D    +C DED +CP WA+IGEC+RN VFM+GSPDYYG
Subjt:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCNAC
        TCRKSCN C
Subjt:  TCRKSCNAC

CAB4301873.1 unnamed protein product [Prunus armeniaca]2.92e-23250.92Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RWA KRC H+A HDSETWG ATA+EF P+PR+CRYILAVYEDD++QPLWEP GGYGI PDWL++KK+Y+DT+G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGKKKFDGGYVHNGLLKAA WVLD E E LKDLV KYP+YTLTF GHSLGSGVAA+LT+VVVQ+RD+L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLL--------------------------------------------LL
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ +            LP LL                                            ++
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLL--------------------------------------------LL

Query:  ATAISF--------LSCLAQSNLI------SGRKGLR-----DQLIE-------------------------------SVPLSYSN--------------
         TA+          LSC A S+          ++ L+     DQ++E                               +VP +YS               
Subjt:  ATAISF--------LSCLAQSNLI------SGRKGLR-----DQLIE-------------------------------SVPLSYSN--------------

Query:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV
                                                    HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+SLA   E+       D GNT 
Subjt:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV

Query:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD
          ++ KS    LN  D+I++RIE RI+ WTFLPK+ S  LQ+ + G EEAE     FGN+S +  SEPL+ATV+LY+S+   GGE+ FPES+++S  WSD
Subjt:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD

Query:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHT-DEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          K ++IL+P KGNA+L F++  NASPDKSS H+R P+L+GE+W ATKF Y + I G K + D    +C DED +CP WA+IGEC+RN VFM+GSPDYYG
Subjt:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHT-DEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCNAC
        TCRKSCN C
Subjt:  TCRKSCNAC

KAF4351179.1 hypothetical protein F8388_024210 [Cannabis sativa]3.64e-20846.34Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSIICGIP+LECV CL CARWA KRC H+A HDSE WG ATA+EF P+PR+C YILAVYEDD++ PLWEP  GYGINPDWL  KKSY+DT G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG++KFDGGYVHNGLLKAA  VL  E++ LK LV KYP+YTLTFAGHSLGSGVA +L ++ VQNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------------------------------------------------------
        IDR+RIRCYAIAPARCMSLNLAVRYADVINSVVLQ                                                                 
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------------------------------------------------------

Query:  ---------PSMDSRLPV------LLLLATAIS-------------------------------------------------------------------
                 P + + +PV      ++L   A S                                                                   
Subjt:  ---------PSMDSRLPV------LLLLATAIS-------------------------------------------------------------------

Query:  ------------------------FLSCLAQSNLI-----------------------------------SGRKGLRDQLIES---VPLSYSNHSGRIDP
                                FL   + S++I                                   S RK LRD+  +    +    S HS RIDP
Subjt:  ------------------------FLSCLAQSNLI-----------------------------------SGRKGLRDQLIES---VPLSYSNHSGRIDP

Query:  SRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE
        SRVVQ+SWRPRVFLY+GFLSDEECDHLISL TS ED  SGN      T+  K++KSS       DD+++RIE RI+ WTFLPK+    LQI +Y  E++E
Subjt:  SRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE

Query:  HKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKN---NILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATK
          +  FGN S +  S+PL+ATVVLYLSD+ +GG++ FP+SKVK   WSD  K +   +I++P KGNA+L F+++ N++ D SSSH R P+L+GE+W A K
Subjt:  HKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKN---NILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATK

Query:  FFYLRPITGNKHTDEPD----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        FF ++ IT  +     +     DC DED +C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  FFYLRPITGNKHTDEPD----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

KAF4353598.1 hypothetical protein F8388_017773 [Cannabis sativa]8.24e-21147.33Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSIICGIP+LECV CL CARWA KRC H+A HDSE WG ATA+EF P+PR+C YILAVYEDD++ PLWEP  GYGINPDWL  KKSY+DT G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG++KFDGGYVHNGLLKAA  VL  E++ LK LV KYP+YTLTFAGHSLGSGVA +L ++ VQNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLLLLA-------------------------------------------
        IDR+RIRCYAIAPARCMSLNLAVRYADVINSVVLQ +            LP +L L                                            
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLLLLA-------------------------------------------

Query:  -TAISF--------LSC-----------------------------------------------------------------------------------
         TA+          LSC                                                                                   
Subjt:  -TAISF--------LSC-----------------------------------------------------------------------------------

Query:  ----------------------------------LAQSNL-------------ISGRKGLRDQLIES---VPLSYSNHSGRIDPSRVVQVSWRPRVFLYK
                                          LA SN              +S RK LRD+  +    +    S HS RIDPSRVVQ+SWRPRVFLY+
Subjt:  ----------------------------------LAQSNL-------------ISGRKGLRDQLIES---VPLSYSNHSGRIDPSRVVQVSWRPRVFLYK

Query:  GFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSE
        GFLSDEECDHLIS  +  ED        SGNT+  K++KSS       DD+++RIE RI+ WTFLPK+    LQI +Y  E++E  +  FGN S +  S+
Subjt:  GFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSE

Query:  PLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKN---NILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEP
        PL+ATVVLYLSD+ +GG++ FP+SKVK   WSD  K +   +I++P KGNA+L F+++ N++ D SSSH R P+L+GE+W A KFF ++ IT  +     
Subjt:  PLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKN---NILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEP

Query:  D----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +     DC DED +C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  D----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

XP_022159842.1 probable prolyl 4-hydroxylase 12 [Momordica charantia]2.88e-222100Show/hide
Query:  MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSG
        MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSG
Subjt:  MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSG

Query:  NTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFW
        NTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFW
Subjt:  NTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFW

Query:  SDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY
        SDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY
Subjt:  SDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY

Query:  GTCRKSCNAC
        GTCRKSCNAC
Subjt:  GTCRKSCNAC

TrEMBL top hitse value%identityAlignment
A0A6J1E0X9 Procollagen-proline 4-dioxygenase1.39e-222100Show/hide
Query:  MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSG
        MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSG
Subjt:  MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSG

Query:  NTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFW
        NTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFW
Subjt:  NTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFW

Query:  SDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY
        SDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY
Subjt:  SDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY

Query:  GTCRKSCNAC
        GTCRKSCNAC
Subjt:  GTCRKSCNAC

A0A6J5U8N9 Procollagen-proline 4-dioxygenase7.39e-23250.78Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RWA KRC H+A HDSETWG ATA+EF P+PR+CRYILAVYEDD++QPLWEP GGYGI PDWL++KK+Y+DT+G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGKKKFDGGYVHNGLLKAA WVLD E E LKDLV KYP+YTLTF GHSLGSGVAA+LT+VVVQ+RD+L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLL--------------------------------------------LL
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ +            LP LL                                            ++
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLL--------------------------------------------LL

Query:  ATAISF--------LSCLAQSNLI------SGRKGLR-----DQLIE-------------------------------SVPLSYSN--------------
         TA+          LSC A S+          ++ L+     DQ++E                               +VP +YS               
Subjt:  ATAISF--------LSCLAQSNLI------SGRKGLR-----DQLIE-------------------------------SVPLSYSN--------------

Query:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV
                                                    HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+SLA   E+       D GNT 
Subjt:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV

Query:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD
          ++  S    LN  D+I++RIE RI+ WTFLPK+ S  LQ+ + G EEAE     FGN+S +  SEPL+ATV+LY+S+   GGE+ FPES+++S  WSD
Subjt:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD

Query:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          K ++IL+P KGNA+L F++  NASPDKSS H+R P+L+GE+W ATKF Y + I G K  +D    +C DED +CP WA+IGEC+RN VFM+GSPDYYG
Subjt:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNK-HTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCNAC
        TCRKSCN C
Subjt:  TCRKSCNAC

A0A6J5WND9 Procollagen-proline 4-dioxygenase1.42e-23250.92Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RWA KRC H+A HDSETWG ATA+EF P+PR+CRYILAVYEDD++QPLWEP GGYGI PDWL++KK+Y+DT+G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+A+ESDYAVL+DNKLGKKKFDGGYVHNGLLKAA WVLD E E LKDLV KYP+YTLTF GHSLGSGVAA+LT+VVVQ+RD+L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLL--------------------------------------------LL
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ +            LP LL                                            ++
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLL--------------------------------------------LL

Query:  ATAISF--------LSCLAQSNLI------SGRKGLR-----DQLIE-------------------------------SVPLSYSN--------------
         TA+          LSC A S+          ++ L+     DQ++E                               +VP +YS               
Subjt:  ATAISF--------LSCLAQSNLI------SGRKGLR-----DQLIE-------------------------------SVPLSYSN--------------

Query:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV
                                                    HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+SLA   E+       D GNT 
Subjt:  --------------------------------------------HSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTV

Query:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD
          ++ KS    LN  D+I++RIE RI+ WTFLPK+ S  LQ+ + G EEAE     FGN+S +  SEPL+ATV+LY+S+   GGE+ FPES+++S  WSD
Subjt:  PTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSD

Query:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHT-DEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          K ++IL+P KGNA+L F++  NASPDKSS H+R P+L+GE+W ATKF Y + I G K + D    +C DED +CP WA+IGEC+RN VFM+GSPDYYG
Subjt:  RRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHT-DEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCNAC
        TCRKSCN C
Subjt:  TCRKSCNAC

A0A7J6E0F0 Procollagen-proline 4-dioxygenase1.76e-20846.34Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSIICGIP+LECV CL CARWA KRC H+A HDSE WG ATA+EF P+PR+C YILAVYEDD++ PLWEP  GYGINPDWL  KKSY+DT G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG++KFDGGYVHNGLLKAA  VL  E++ LK LV KYP+YTLTFAGHSLGSGVA +L ++ VQNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------------------------------------------------------
        IDR+RIRCYAIAPARCMSLNLAVRYADVINSVVLQ                                                                 
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ-----------------------------------------------------------------

Query:  ---------PSMDSRLPV------LLLLATAIS-------------------------------------------------------------------
                 P + + +PV      ++L   A S                                                                   
Subjt:  ---------PSMDSRLPV------LLLLATAIS-------------------------------------------------------------------

Query:  ------------------------FLSCLAQSNLI-----------------------------------SGRKGLRDQLIES---VPLSYSNHSGRIDP
                                FL   + S++I                                   S RK LRD+  +    +    S HS RIDP
Subjt:  ------------------------FLSCLAQSNLI-----------------------------------SGRKGLRDQLIES---VPLSYSNHSGRIDP

Query:  SRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE
        SRVVQ+SWRPRVFLY+GFLSDEECDHLISL TS ED  SGN      T+  K++KSS       DD+++RIE RI+ WTFLPK+    LQI +Y  E++E
Subjt:  SRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE

Query:  HKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKN---NILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATK
          +  FGN S +  S+PL+ATVVLYLSD+ +GG++ FP+SKVK   WSD  K +   +I++P KGNA+L F+++ N++ D SSSH R P+L+GE+W A K
Subjt:  HKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKN---NILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATK

Query:  FFYLRPITGNKHTDEPD----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        FF ++ IT  +     +     DC DED +C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  FFYLRPITGNKHTDEPD----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

A0A7J6E5B7 Procollagen-proline 4-dioxygenase3.99e-21147.33Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSIICGIP+LECV CL CARWA KRC H+A HDSE WG ATA+EF P+PR+C YILAVYEDD++ PLWEP  GYGINPDWL  KKSY+DT G+APPYILY
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH+H DIVLA RGLN+AKESDYAVLLDNKLG++KFDGGYVHNGLLKAA  VL  E++ LK LV KYP+YTLTFAGHSLGSGVA +L ++ VQNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLLLLA-------------------------------------------
        IDR+RIRCYAIAPARCMSLNLAVRYADVINSVVLQ +            LP +L L                                            
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSR--------LPVLLLLA-------------------------------------------

Query:  -TAISF--------LSC-----------------------------------------------------------------------------------
         TA+          LSC                                                                                   
Subjt:  -TAISF--------LSC-----------------------------------------------------------------------------------

Query:  ----------------------------------LAQSNL-------------ISGRKGLRDQLIES---VPLSYSNHSGRIDPSRVVQVSWRPRVFLYK
                                          LA SN              +S RK LRD+  +    +    S HS RIDPSRVVQ+SWRPRVFLY+
Subjt:  ----------------------------------LAQSNL-------------ISGRKGLRDQLIES---VPLSYSNHSGRIDPSRVVQVSWRPRVFLYK

Query:  GFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSE
        GFLSDEECDHLIS  +  ED        SGNT+  K++KSS       DD+++RIE RI+ WTFLPK+    LQI +Y  E++E  +  FGN S +  S+
Subjt:  GFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSE

Query:  PLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKN---NILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEP
        PL+ATVVLYLSD+ +GG++ FP+SKVK   WSD  K +   +I++P KGNA+L F+++ N++ D SSSH R P+L+GE+W A KFF ++ IT  +     
Subjt:  PLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKN---NILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEP

Query:  D----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +     DC DED +C  WAA+GEC++NAVFMIGS DYYGTCRKSCNAC
Subjt:  D----GDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.7e-5441.61Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPS-GNSTDSGNTVPTKILKSSGAIL-NTTDDIIARIENRIAVWTFLPKDYSMPL
        S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    +K       DSG +  +++  SSG  L    DDI+A +E ++A WTFLP++    L
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPS-GNSTDSGNTVPTKILKSSGAIL-NTTDDIIARIENRIAVWTFLPKDYSMPL

Query:  QILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESK-----VKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHT
        QIL Y  G +   H   F ++ A+      +ATV++YLS+   GGE  FP  K     +K   WS   K+   ++P KG+A+L F++HLN + D +S H 
Subjt:  QILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESK-----VKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHT

Query:  RSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
          P+++GE W AT++ ++R     K        C D+ +SC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  RSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 23.3e-5038.25Show/hide
Query:  LIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNT-TDDIIARIENRIAVWTFL
        L++S     S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +  + +    D+G +  + +  SSG  ++   D I++ IE++++ WTFL
Subjt:  LIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNT-TDDIIARIENRIAVWTFL

Query:  PKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRKKNNILRPVKGNAVLIFSVHL
        PK+    LQ+L+Y  G +   H   F ++  +      +ATV+LYLS+   GGE  FP+++  SR          SD  KK   ++P KGNA+L F++  
Subjt:  PKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRKKNNILRPVKGNAVLIFSVHL

Query:  NASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +A PD  S H   P+++GE W ATK+ +   +         DG+C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  NASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 121.7e-6245.45Show/hide
Query:  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARI
        RK LRD+ I S       SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S ++   G T                D ++A I
Subjt:  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARI

Query:  ENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVH
        E +++ WTFLP +    +++  Y  E++  K   FG   + +  E L+ATVVLYLS++  GGE+ FP S++K +  +   +  NILRPVKGNA+L F+  
Subjt:  ENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVH

Query:  LNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        LNAS D  S+H R P++ GEL +ATK  Y +     +   E  G+C+DED++C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Subjt:  LNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 71.3e-5437.15Show/hide
Query:  MDSRLPVLLLLATAISFLSCL-----AQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGN
        MDSR    + LA ++ FL  L     A +  ++     RD  +  + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA    +K    
Subjt:  MDSRLPVLLLLATAISFLSCL-----AQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGN

Query:  STDSGNTVPTKILKSSGAILN-TTDDIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFP-
          DSG +V +++  SSG  L+   DDI++ +E ++A WTFLP++    +QIL Y  G +   H   F +++ +      +ATV++YLS+   GGE  FP 
Subjt:  STDSGNTVPTKILKSSGAILN-TTDDIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFP-

Query:  ----ESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECE
             +++K   W++  K+   ++P KG+A+L F++H NA+ D +S H   P+++GE W AT++ +++     +        C DE+ SC +WA  GEC+
Subjt:  ----ESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECE

Query:  RNAVFMIGSPDYYGTCRKSCNAC
        +N  +M+GS   +G CRKSC AC
Subjt:  RNAVFMIGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 43.3e-5035.22Show/hide
Query:  LAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNT-TD
        +A+  L+     +   L++S     S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA +S  + +    DSG +  +++  SSG  ++   D
Subjt:  LAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNT-TD

Query:  DIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRKKNNI
         I++ IE++I+ WTFLPK+    +Q+L+Y  G +   H   F ++  ++     MAT+++YLS+   GGE  FP++++ SR          SD  K+   
Subjt:  DIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRKKNNI

Query:  LRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNA
        ++P KG+A+L F++H +A PD  S H   P+++GE W ATK+ +   +        P G+C D ++SC +WA +GEC +N  +M+G+ +  G CR+SC A
Subjt:  LRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNA

Query:  C
        C
Subjt:  C

Arabidopsis top hitse value%identityAlignment
AT3G49050.1 alpha/beta-Hydrolases superfamily protein1.2e-10876.69Show/hide
Query:  MSIICG-IPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYIL
        MSI+CG  P+LECV CLGCARW  KRC ++A HDSE WG AT DEF P+PR CRYILAVYEDDI+ PLWEP  GYGINPDWLL+KK+Y+DT+GRAP YIL
Subjt:  MSIICG-IPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYIL

Query:  YLDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLE
        YLDH H DIV+AIRGLN+AKESDYA+LLDNKLG++KFDGGYVHNGL+K+AG+VLD E ++LK+LV KYP YTLTFAGHSLGSGVA ML L+VV++ ++L 
Subjt:  YLDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLE

Query:  NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ
        NIDRKR+RC+AIAPARCMSLNLAVRYADVINSV+LQ
Subjt:  NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQ

AT4G00500.1 alpha/beta-Hydrolases superfamily protein3.9e-9458.91Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSI+C +P+LECV CLGC  W  K+C +SA H+SE WG AT+DEF PIPRICR ILAVYE+++  P+W P  GYGI+P+ +++KK Y  T GR  PY++Y
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH +GD+VLAIRGLN+AKE DYAVLLDNKLG+ KFDGGYVHNGLLKAA WV + E+ +L++L+   P Y+LTF GHSLG+GV ++L L V+QNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSRLPVLL-------LLATAISFLSCLAQSNLISGRK
        I+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQ     R    L       +    +  L+CL  +     RK
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSRLPVLL-------LLATAISFLSCLAQSNLISGRK

AT4G00500.2 alpha/beta-Hydrolases superfamily protein3.9e-9458.91Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY
        MSI+C +P+LECV CLGC  W  K+C +SA H+SE WG AT+DEF PIPRICR ILAVYE+++  P+W P  GYGI+P+ +++KK Y  T GR  PY++Y
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILY

Query:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN
        LDH +GD+VLAIRGLN+AKE DYAVLLDNKLG+ KFDGGYVHNGLLKAA WV + E+ +L++L+   P Y+LTF GHSLG+GV ++L L V+QNR +L N
Subjt:  LDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSRLPVLL-------LLATAISFLSCLAQSNLISGRK
        I+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQ     R    L       +    +  L+CL  +     RK
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSRLPVLL-------LLATAISFLSCLAQSNLISGRK

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase1.2e-6345.45Show/hide
Query:  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARI
        RK LRD+ I S       SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S ++   G T                D ++A I
Subjt:  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARI

Query:  ENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVH
        E +++ WTFLP +    +++  Y  E++  K   FG   + +  E L+ATVVLYLS++  GGE+ FP S++K +  +   +  NILRPVKGNA+L F+  
Subjt:  ENRIAVWTFLPKDYSMPLQILQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVH

Query:  LNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        LNAS D  S+H R P++ GEL +ATK  Y +     +   E  G+C+DED++C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Subjt:  LNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT5G37710.1 alpha/beta-Hydrolases superfamily protein1.6e-7652.67Show/hide
Query:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPA-GGYGINPDWLLIKKSYKDTRGRAPPYIL
        MS+ CG   LECV C+G +RWA KRC H    DS TW  AT +EF PIPRI R ILAVYE D++ P   P+ G + +NP+W++ + +++ T+GR+PPYI+
Subjt:  MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPA-GGYGINPDWLLIKKSYKDTRGRAPPYIL

Query:  YLDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEIL-KDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKL
        Y+DH+H +IVLAIRGLN+AKESDY +LLDNKLG+K   GGYVH GLLK+A WVL+ E+E L +       +Y L FAGHSLGSGVAA++ ++VV     +
Subjt:  YLDHNHGDIVLAIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEIL-KDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKL

Query:  ENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSRLPVLL-------LLATAISFLSCLAQSNLISGRKGLRD
         +I R ++RC+A+APARCMSLNLAV+YADVI+SV+LQ     R    L            + FL CL  + +  GRK LRD
Subjt:  ENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQPSMDSRLPVLL-------LLATAISFLSCLAQSNLISGRKGLRD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTATCATATGTGGCATTCCTATTCTTGAGTGTGTATGCTGTCTGGGATGTGCTCGTTGGGCCTGTAAACGCTGTTTTCACTCAGCCGTTCATGACAGTGAAACTTG
GGGCTTTGCCACCGCTGATGAGTTCGGGCCTATTCCCCGAATTTGTCGATATATCCTAGCTGTGTATGAAGATGATATTCAACAACCCCTTTGGGAACCGGCTGGTGGTT
ATGGAATCAATCCAGATTGGTTGCTCATTAAAAAATCATATAAGGATACTCGAGGACGTGCGCCTCCGTATATTTTATACCTTGATCACAATCATGGGGACATTGTTCTT
GCCATCAGGGGACTTAATATGGCAAAGGAGAGTGATTATGCAGTTTTATTGGACAACAAGCTGGGGAAGAAGAAATTTGATGGTGGATATGTTCACAATGGGCTTCTGAA
GGCAGCTGGGTGGGTTTTGGACACTGAGAACGAAATTTTAAAGGATTTGGTGAGCAAATATCCGGATTATACATTGACGTTTGCAGGGCATTCCCTTGGCTCCGGAGTAG
CAGCCATGTTAACTCTGGTAGTAGTTCAGAATCGCGATAAATTGGAAAATATCGATCGGAAGAGGATAAGGTGCTATGCGATTGCTCCTGCCAGGTGCATGTCCCTAAAT
TTGGCTGTTAGATATGCAGATGTGATCAACTCTGTTGTTCTTCAGCCATCCATGGATTCTCGTCTTCCCGTCTTACTTCTTTTAGCGACTGCAATTTCGTTCTTAAGCTG
CCTTGCACAAAGCAATTTGATTAGTGGGCGCAAGGGTTTAAGGGACCAATTGATCGAAAGTGTACCTTTGAGCTACTCTAATCATTCTGGAAGAATCGACCCATCAAGAG
TTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGATTTCTCTCAGATGAGGAGTGTGATCACCTTATTTCTTTGGCTACAAGTTCAGAAGATAAACCTTCT
GGGAACAGTACTGACTCTGGGAACACTGTCCCAACCAAAATTCTAAAGAGTTCAGGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGATCGAGAATCGAATTGC
TGTGTGGACTTTTCTTCCAAAAGATTATAGCATGCCTTTGCAGATTTTGCAATATGGGGGTGAAGAAGCAGAGCATAAGTACGTTTTTGGTAACAGATCTGCAATGTTGT
CCAGTGAGCCTTTGATGGCCACAGTAGTTCTGTATCTCTCAGATTCTGCTAGCGGTGGCGAGATGCGCTTTCCTGAATCAAAGGTAAAGAGCAGATTTTGGTCAGACCGG
AGAAAGAAAAACAACATTCTGAGACCAGTGAAAGGCAATGCAGTTCTTATTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTCCCATACCCGATCTCCGAT
ACTCGATGGGGAATTGTGGATTGCAACAAAATTCTTCTACTTAAGACCAATCACTGGGAATAAACACACAGACGAACCTGATGGAGACTGTAATGATGAAGATAAAAGCT
GCCCCCAATGGGCTGCCATTGGCGAATGCGAACGAAACGCTGTTTTCATGATTGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAACGCATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATTGCTCTCGATCGAGCTTACAGCTGAGTTCTGAGGCATTGACTCAGTTTCTCCGTTTCAATTAATTAATATTTAGAAATCCAGCGATTGAAAGCTCCTTTTCAATCACT
AAAGCCCTGTTATCAGTTTAGGAATATTCTTGTATAATTCTGATTCTCAAGTTCTGACCGGAAGTGAGAATTTGTGTGTGCATATTATTTTCACTTCCCTTGCTTCTGTT
CTCTGCTCTTGAAGGAAAATCTGTATGATTTACTGTTCTTTTGAGCTTAGACCGTAGGCATCAATGTCTATCATATGTGGCATTCCTATTCTTGAGTGTGTATGCTGTCT
GGGATGTGCTCGTTGGGCCTGTAAACGCTGTTTTCACTCAGCCGTTCATGACAGTGAAACTTGGGGCTTTGCCACCGCTGATGAGTTCGGGCCTATTCCCCGAATTTGTC
GATATATCCTAGCTGTGTATGAAGATGATATTCAACAACCCCTTTGGGAACCGGCTGGTGGTTATGGAATCAATCCAGATTGGTTGCTCATTAAAAAATCATATAAGGAT
ACTCGAGGACGTGCGCCTCCGTATATTTTATACCTTGATCACAATCATGGGGACATTGTTCTTGCCATCAGGGGACTTAATATGGCAAAGGAGAGTGATTATGCAGTTTT
ATTGGACAACAAGCTGGGGAAGAAGAAATTTGATGGTGGATATGTTCACAATGGGCTTCTGAAGGCAGCTGGGTGGGTTTTGGACACTGAGAACGAAATTTTAAAGGATT
TGGTGAGCAAATATCCGGATTATACATTGACGTTTGCAGGGCATTCCCTTGGCTCCGGAGTAGCAGCCATGTTAACTCTGGTAGTAGTTCAGAATCGCGATAAATTGGAA
AATATCGATCGGAAGAGGATAAGGTGCTATGCGATTGCTCCTGCCAGGTGCATGTCCCTAAATTTGGCTGTTAGATATGCAGATGTGATCAACTCTGTTGTTCTTCAGCC
ATCCATGGATTCTCGTCTTCCCGTCTTACTTCTTTTAGCGACTGCAATTTCGTTCTTAAGCTGCCTTGCACAAAGCAATTTGATTAGTGGGCGCAAGGGTTTAAGGGACC
AATTGATCGAAAGTGTACCTTTGAGCTACTCTAATCATTCTGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGATTT
CTCTCAGATGAGGAGTGTGATCACCTTATTTCTTTGGCTACAAGTTCAGAAGATAAACCTTCTGGGAACAGTACTGACTCTGGGAACACTGTCCCAACCAAAATTCTAAA
GAGTTCAGGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGATCGAGAATCGAATTGCTGTGTGGACTTTTCTTCCAAAAGATTATAGCATGCCTTTGCAGATTT
TGCAATATGGGGGTGAAGAAGCAGAGCATAAGTACGTTTTTGGTAACAGATCTGCAATGTTGTCCAGTGAGCCTTTGATGGCCACAGTAGTTCTGTATCTCTCAGATTCT
GCTAGCGGTGGCGAGATGCGCTTTCCTGAATCAAAGGTAAAGAGCAGATTTTGGTCAGACCGGAGAAAGAAAAACAACATTCTGAGACCAGTGAAAGGCAATGCAGTTCT
TATTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTCCCATACCCGATCTCCGATACTCGATGGGGAATTGTGGATTGCAACAAAATTCTTCTACTTAAGAC
CAATCACTGGGAATAAACACACAGACGAACCTGATGGAGACTGTAATGATGAAGATAAAAGCTGCCCCCAATGGGCTGCCATTGGCGAATGCGAACGAAACGCTGTTTTC
ATGATTGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAACGCATGTTGATGAATAACCAACCAACCGTTCAAGTAAAAATTCCTCTCTTCCTAATTTGAGCA
AGTATTTGTTAATTTTTTCTGTTGTCACACAAAATTAGAGTTAAATGAGTAATTTGATTTGTGATCGGCCTATTTTAGAATCTTGGTAACTTTTGGAGGATTTTGGTTCT
TTGAAAATCCAAATTTAGAGCTCAAATACTTCTCCATATGAGTCTATTATTTGTTTAGATATGGGATTGTATTAATATATCTCTTCTGCAACTGTAAGGTTCAGATATGA
TATTGTAGCATGC
Protein sequenceShow/hide protein sequence
MSIICGIPILECVCCLGCARWACKRCFHSAVHDSETWGFATADEFGPIPRICRYILAVYEDDIQQPLWEPAGGYGINPDWLLIKKSYKDTRGRAPPYILYLDHNHGDIVL
AIRGLNMAKESDYAVLLDNKLGKKKFDGGYVHNGLLKAAGWVLDTENEILKDLVSKYPDYTLTFAGHSLGSGVAAMLTLVVVQNRDKLENIDRKRIRCYAIAPARCMSLN
LAVRYADVINSVVLQPSMDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPS
GNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDR
RKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC