; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0019270 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0019270
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr09:3469497..3472020
RNA-Seq ExpressionPI0019270
SyntenyPI0019270
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043468.1 putative prolyl 4-hydroxylase 12 [Cucumis melo var. makuwa]1.3e-14685.85Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSEN
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRDQLVDRPL+YSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLASNS+DNPS +SAGS N
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSEN

Query:  S--------SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW
        +        SGVILNTTDDIIARIENRIAVWTLLPKDH MPFQIMQYR EEAKHKYFYGNRSAMSSSSEPLMATVVLYLS SA GGEMLFPESKVKSKFW
Subjt:  S--------SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY
        SGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PI NGELWVATKF YLRPPTGN              +  CPQWAAIGECERNAVFM+GSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_004152378.1 probable prolyl 4-hydroxylase 12 [Cucumis sativus]1.3e-14685.21Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE-
        MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRD+LVDRPL+YSN+SGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPS +SAGS  
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE-

Query:  -------NSSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW
               NSSGVILNTTDDI+ARIENR+A+WTLLPKDHSMPFQIMQYR EEAKHKYFYGNRSAM  SSEPLMATVVLYLS SA GGE+LFPESKVKSKFW
Subjt:  -------NSSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY
        SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPI +GELWVATKF YL PP GN              +  CPQWAAIGECERNAVFM+GSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_008436994.1 PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo]2.0e-14786.5Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSEN
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRDQLVDRPL+YSN S RIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPS +SAGS N
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSEN

Query:  S--------SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW
        +        SGVILNTTDDIIARIENRIAVWTLLPKDH MPFQIMQYR EEAKHKYFYGNRSAMSSSSEPLMATVVLYLS SA GGEMLFPESKVKSKFW
Subjt:  S--------SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY
        SGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PI NGELWVATKF YLRPPTGN              +  CPQWAAIGECERNAVFM+GSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_022159842.1 probable prolyl 4-hydroxylase 12 [Momordica charantia]1.5e-13179.1Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE
        MDSRL  LLLLATA SF +CLAQSNLISGRKGLRDQL++  PL+YSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA++SED PSG+S  S 
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE

Query:  N--------SSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKF
        N        SSG ILNTTDDIIARIENRIAVWT LPKD+SMP QI+QY  EEA+HKY +GNRSAM  SSEPLMATVVLYLS SA GGEM FPESKVKS+F
Subjt:  N--------SSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKF

Query:  WSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNNTPSNP---------ICPQWAAIGECERNAVFMIGSPDY
        WS RRKKNN LRPVKGNA+L FSVHLNASPDKSS H RSPIL+GELW+ATKFFYLRP TGN     P          CPQWAAIGECERNAVFMIGSPDY
Subjt:  WSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNNTPSNP---------ICPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_038906497.1 probable prolyl 4-hydroxylase 12 [Benincasa hispida]2.2e-14686.17Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE-
        MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPL+YSNHSGRIDPSRVVQVSW+PRVFLYKGFLSDEECDHLISLASNSEDNPSG+SAGS  
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE-

Query:  -------NSSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW
               NSSGVILNT+DDIIARIEN+IAVWT LPKDH MPFQIMQYR EEA+HKYFYGN SAM SSSEPLMATVVLYLS SA GGEMLFPESKVKSKFW
Subjt:  -------NSSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY
        S RRKKNNFLRPVKGNAILFFSVHLNASPDKSSYH RSPILNGELWVATKFFYLRP TGN              +  CPQWAAIGECERN VFMIGSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

TrEMBL top hitse value%identityAlignment
A0A0A0KPE4 Procollagen-proline 4-dioxygenase2.1e-13483.74Show/hide
Query:  QSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE--------NSSGVILNTTDDIIA
        +SNLISGRKGLRD+LVDRPL+YSN+SGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPS +SAGS         NSSGVILNTTDDI+A
Subjt:  QSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE--------NSSGVILNTTDDIIA

Query:  RIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFS
        RIENR+A+WTLLPKDHSMPFQIMQYR EEAKHKYFYGNRSAM  SSEPLMATVVLYLS SA GGE+LFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFS
Subjt:  RIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFS

Query:  VHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        VHLNASPDKSSYHIRSPI +GELWVATKF YL PP GN              +  CPQWAAIGECERNAVFM+GSPDYYGTCRKSCNAC
Subjt:  VHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

A0A1S3AT39 Procollagen-proline 4-dioxygenase9.5e-14886.5Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSEN
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRDQLVDRPL+YSN S RIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPS +SAGS N
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSEN

Query:  S--------SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW
        +        SGVILNTTDDIIARIENRIAVWTLLPKDH MPFQIMQYR EEAKHKYFYGNRSAMSSSSEPLMATVVLYLS SA GGEMLFPESKVKSKFW
Subjt:  S--------SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY
        SGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PI NGELWVATKF YLRPPTGN              +  CPQWAAIGECERNAVFM+GSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A5A7TKX1 Procollagen-proline 4-dioxygenase6.2e-14785.85Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSEN
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRDQLVDRPL+YSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLASNS+DNPS +SAGS N
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSEN

Query:  S--------SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW
        +        SGVILNTTDDIIARIENRIAVWTLLPKDH MPFQIMQYR EEAKHKYFYGNRSAMSSSSEPLMATVVLYLS SA GGEMLFPESKVKSKFW
Subjt:  S--------SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY
        SGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PI NGELWVATKF YLRPPTGN              +  CPQWAAIGECERNAVFM+GSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNN----------TPSNPICPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A6J1E0X9 Procollagen-proline 4-dioxygenase7.3e-13279.1Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE
        MDSRL  LLLLATA SF +CLAQSNLISGRKGLRDQL++  PL+YSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA++SED PSG+S  S 
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE

Query:  N--------SSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKF
        N        SSG ILNTTDDIIARIENRIAVWT LPKD+SMP QI+QY  EEA+HKY +GNRSAM  SSEPLMATVVLYLS SA GGEM FPESKVKS+F
Subjt:  N--------SSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKF

Query:  WSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNNTPSNP---------ICPQWAAIGECERNAVFMIGSPDY
        WS RRKKNN LRPVKGNA+L FSVHLNASPDKSS H RSPIL+GELW+ATKFFYLRP TGN     P          CPQWAAIGECERNAVFMIGSPDY
Subjt:  WSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRPPTGNNTPSNP---------ICPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A6J1E2P0 Procollagen-proline 4-dioxygenase5.4e-12775.87Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE
        MDSRL FLLLLA AFSFS+CLAQSN ISGRKGLRDQ+V+   L+YSNHS RIDPSRVVQ+SW+PR FLYKGFLSDEECDHLI+LASNSED PS ++AGS 
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSE

Query:  N--------SSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAK-HKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSK
        N        +SG ILNTTDDII RIENRIAVWT LPKDHSMPFQIM+Y  EEA  HKYF+GNRSAM  SSEPLMATVVLYLS SA GGE+LFP SKVK +
Subjt:  N--------SSGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAK-HKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSK

Query:  FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRP-PTGNN-----------TPSNPICPQWAAIGECERNAVFMIG
        FWS RRKKNNFLRPVKGNA+LFFSVHLNASPDKS YH R+PIL+G+LWVATKFFY+RP  TGN               +  CP+WAAIGEC+RNAVFMIG
Subjt:  FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRP-PTGNN-----------TPSNPICPQWAAIGECERNAVFMIG

Query:  SPDYYGTCRKSCNAC
        SPDYYGTCRKSCNAC
Subjt:  SPDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 62.0e-4940.37Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-------EDNPSGDSAGSE--NSSGVIL-NTTDDIIARIENRIAVWTLLPKDHSMPF
        S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA           D  SG+S  SE   SSG+ L    DDI+A +E ++A WT LP+++    
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-------EDNPSGDSAGSE--NSSGVIL-NTTDDIIARIENRIAVWTLLPKDHSMPF

Query:  QIMQY---RSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESK-----VKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY
        QI+ Y   +  +    YFY  ++         +ATV++YLS    GGE +FP  K     +K   WS   K+   ++P KG+A+LFF++HLN + D +S 
Subjt:  QIMQY---RSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESK-----VKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRSPILNGELWVATKFFYLRPPTGNN---TPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        H   P++ GE W AT++ ++R            +  C +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  HIRSPILNGELWVATKFFYLRPPTGNN---TPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 28.6e-4538.1Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS------EDNPSGDSAGSE--NSSGVILNT-TDDIIARIENRIAVWTLLPKDHSMPFQ
        S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +       DN +G+S  S+   SSG  ++   D I++ IE++++ WT LPK++    Q
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS------EDNPSGDSAGSE--NSSGVILNT-TDDIIARIENRIAVWTLLPKDHSMPFQ

Query:  IMQYRSEEAKHKYF-YGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY
        +++Y   +    +F Y +     +     +ATV+LYLS    GGE +FP+++  S+          S   KK   ++P KGNA+LFF++  +A PD  S 
Subjt:  IMQYRSEEAKHKYF-YGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRSPILNGELWVATKFFY------LRPPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        H   P++ GE W ATK+ +      +    GN T  N  C +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  HIRSPILNGELWVATKFFY------LRPPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 122.2e-6144.78Show/hide
Query:  FLLLLATAFSFSTCLAQSNLISGRKGLRDQLV-----DRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSENS
        FL+L+ T  S S           RK LRD+ +     D   +Y   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S D+ G    
Subjt:  FLLLLATAFSFSTCLAQSNLISGRKGLRDQLV-----DRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSENS

Query:  SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFWSGRRKKNNF
                D ++A IE +++ WT LP ++    ++  Y SE++  K  Y      S   E L+ATVVLYLS +  GGE+LFP S++K K  +   +  N 
Subjt:  SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFWSGRRKKNNF

Query:  LRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLR-----PPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        LRPVKGNAILFF+  LNAS D  S H+R P++ GEL VATK  Y +       +G  +  +  C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Subjt:  LRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLR-----PPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 72.6e-4937.54Show/hide
Query:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP
        MDSR+   L  +  F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA      S   DN 
Subjt:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP

Query:  SGDSAGSE--NSSGVILN-TTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYF-YGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFP----
        SG+S  SE   SSG+ L+   DDI++ +E ++A WT LP+++    QI+ Y + +    +F Y +  A        +ATV++YLS    GGE +FP    
Subjt:  SGDSAGSE--NSSGVILN-TTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYF-YGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFP----

Query:  -ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRP------PTGNNTPSNPICPQWAAIGECERNAVFM
          +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P++ GE W AT++ +++               N  C +WA  GEC++N  +M
Subjt:  -ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRP------PTGNNTPSNPICPQWAAIGECERNAVFM

Query:  IGSPDYYGTCRKSCNAC
        +GS   +G CRKSC AC
Subjt:  IGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 41.0e-4537.45Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS------EDNPSGDSAGSE--NSSGVILNT-TDDIIARIENRIAVWTLLPKDHSMPFQ
        S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA  S       DN SG+S  SE   SSG  ++   D I++ IE++I+ WT LPK++    Q
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS------EDNPSGDSAGSE--NSSGVILNT-TDDIIARIENRIAVWTLLPKDHSMPFQ

Query:  IMQY---RSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKS
        +++Y   +  +A   YF+   + +       MAT+++YLS    GGE +FP++++ S+          S   K+   ++P KG+A+LFF++H +A PD  
Subjt:  IMQY---RSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKS

Query:  SYHIRSPILNGELWVATKFFY------LRPPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        S H   P++ GE W ATK+ +      +  P+GN T  N  C +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  SYHIRSPILNGELWVATKFFY------LRPPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Arabidopsis top hitse value%identityAlignment
AT3G28480.1 Oxoglutarate/iron-dependent oxygenase1.8e-5037.54Show/hide
Query:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP
        MDSR+   L  +  F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA      S   DN 
Subjt:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP

Query:  SGDSAGSE--NSSGVILN-TTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYF-YGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFP----
        SG+S  SE   SSG+ L+   DDI++ +E ++A WT LP+++    QI+ Y + +    +F Y +  A        +ATV++YLS    GGE +FP    
Subjt:  SGDSAGSE--NSSGVILN-TTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYF-YGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFP----

Query:  -ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRP------PTGNNTPSNPICPQWAAIGECERNAVFM
          +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P++ GE W AT++ +++               N  C +WA  GEC++N  +M
Subjt:  -ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRP------PTGNNTPSNPICPQWAAIGECERNAVFM

Query:  IGSPDYYGTCRKSCNAC
        +GS   +G CRKSC AC
Subjt:  IGSPDYYGTCRKSCNAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase6.9e-5036Show/hide
Query:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP
        MDSR+   L  +  F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA      S   DN 
Subjt:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP

Query:  SGDSAGSENSSGVILNTT-----------DDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYF-YGNRSAMSSSSEPLMATVVLYLSGSAHGGE
        SG+S  SE+S  V+  ++           DDI++ +E ++A WT LP+++    QI+ Y + +    +F Y +  A        +ATV++YLS    GGE
Subjt:  SGDSAGSENSSGVILNTT-----------DDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYF-YGNRSAMSSSSEPLMATVVLYLSGSAHGGE

Query:  MLFP-----ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRP------PTGNNTPSNPICPQWAAIGE
         +FP      +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P++ GE W AT++ +++               N  C +WA  GE
Subjt:  MLFP-----ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLRP------PTGNNTPSNPICPQWAAIGE

Query:  CERNAVFMIGSPDYYGTCRKSCNAC
        C++N  +M+GS   +G CRKSC AC
Subjt:  CERNAVFMIGSPDYYGTCRKSCNAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.4e-5040.37Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-------EDNPSGDSAGSE--NSSGVIL-NTTDDIIARIENRIAVWTLLPKDHSMPF
        S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA           D  SG+S  SE   SSG+ L    DDI+A +E ++A WT LP+++    
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-------EDNPSGDSAGSE--NSSGVIL-NTTDDIIARIENRIAVWTLLPKDHSMPF

Query:  QIMQY---RSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESK-----VKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY
        QI+ Y   +  +    YFY  ++         +ATV++YLS    GGE +FP  K     +K   WS   K+   ++P KG+A+LFF++HLN + D +S 
Subjt:  QIMQY---RSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESK-----VKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRSPILNGELWVATKFFYLRPPTGNN---TPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        H   P++ GE W AT++ ++R            +  C +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  HIRSPILNGELWVATKFFYLRPPTGNN---TPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase1.6e-6244.78Show/hide
Query:  FLLLLATAFSFSTCLAQSNLISGRKGLRDQLV-----DRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSENS
        FL+L+ T  S S           RK LRD+ +     D   +Y   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S D+ G    
Subjt:  FLLLLATAFSFSTCLAQSNLISGRKGLRDQLV-----DRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSENS

Query:  SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFWSGRRKKNNF
                D ++A IE +++ WT LP ++    ++  Y SE++  K  Y      S   E L+ATVVLYLS +  GGE+LFP S++K K  +   +  N 
Subjt:  SGVILNTTDDIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFWSGRRKKNNF

Query:  LRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLR-----PPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        LRPVKGNAILFF+  LNAS D  S H+R P++ GEL VATK  Y +       +G  +  +  C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Subjt:  LRPVKGNAILFFSVHLNASPDKSSYHIRSPILNGELWVATKFFYLR-----PPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.2e-4737.45Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS------EDNPSGDSAGSE--NSSGVILNT-TDDIIARIENRIAVWTLLPKDHSMPFQ
        S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA  S       DN SG+S  SE   SSG  ++   D I++ IE++I+ WT LPK++    Q
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS------EDNPSGDSAGSE--NSSGVILNT-TDDIIARIENRIAVWTLLPKDHSMPFQ

Query:  IMQY---RSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKS
        +++Y   +  +A   YF+   + +       MAT+++YLS    GGE +FP++++ S+          S   K+   ++P KG+A+LFF++H +A PD  
Subjt:  IMQY---RSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKS

Query:  SYHIRSPILNGELWVATKFFY------LRPPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        S H   P++ GE W ATK+ +      +  P+GN T  N  C +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  SYHIRSPILNGELWVATKFFY------LRPPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGTCTTAACTTTTTGCTTCTGTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCACAAAGCAATTTGATTAGTGGCCGTAAGGGTTTAAGGGACCAATT
GGTTGACAGACCTTTGAACTACTCAAATCATTCTGGTAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTCTTCTTGTATAAAGGTTTTCTCTCAG
ATGAAGAGTGTGATCATCTTATTTCTTTGGCTTCAAATTCGGAAGACAATCCTTCTGGGGATAGTGCTGGTTCGGAGAACAGTTCAGGAGTCATTTTAAACACAACAGAT
GATATCATTGCAAGAATTGAAAATCGAATTGCAGTATGGACTTTGCTCCCAAAGGATCATAGCATGCCTTTTCAGATCATGCAATACAGGAGTGAAGAAGCAAAGCATAA
GTACTTTTATGGCAACAGATCTGCAATGTCGTCATCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGGTTCTGCTCATGGTGGTGAAATGCTGTTTCCAG
AATCAAAGGTAAAGAGCAAATTTTGGTCAGGCCGAAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTGCATCTTAATGCATCT
CCAGACAAGAGTAGCTACCACATTCGATCCCCAATACTCAATGGGGAATTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCACCCACAGGGAATAACACACCATCGAA
TCCGATCTGCCCTCAATGGGCTGCCATTGGTGAATGTGAACGAAACGCTGTGTTCATGATTGGTTCTCCAGATTACTATGGTACATGTAGAAAAAGCTGCAATGCATGTT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTCGTCTTAACTTTTTGCTTCTGTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCACAAAGCAATTTGATTAGTGGCCGTAAGGGTTTAAGGGACCAATT
GGTTGACAGACCTTTGAACTACTCAAATCATTCTGGTAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTCTTCTTGTATAAAGGTTTTCTCTCAG
ATGAAGAGTGTGATCATCTTATTTCTTTGGCTTCAAATTCGGAAGACAATCCTTCTGGGGATAGTGCTGGTTCGGAGAACAGTTCAGGAGTCATTTTAAACACAACAGAT
GATATCATTGCAAGAATTGAAAATCGAATTGCAGTATGGACTTTGCTCCCAAAGGATCATAGCATGCCTTTTCAGATCATGCAATACAGGAGTGAAGAAGCAAAGCATAA
GTACTTTTATGGCAACAGATCTGCAATGTCGTCATCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGGTTCTGCTCATGGTGGTGAAATGCTGTTTCCAG
AATCAAAGGTAAAGAGCAAATTTTGGTCAGGCCGAAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTGCATCTTAATGCATCT
CCAGACAAGAGTAGCTACCACATTCGATCCCCAATACTCAATGGGGAATTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCACCCACAGGGAATAACACACCATCGAA
TCCGATCTGCCCTCAATGGGCTGCCATTGGTGAATGTGAACGAAACGCTGTGTTCATGATTGGTTCTCCAGATTACTATGGTACATGTAGAAAAAGCTGCAATGCATGTT
GA
Protein sequenceShow/hide protein sequence
MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLNYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGDSAGSENSSGVILNTTD
DIIARIENRIAVWTLLPKDHSMPFQIMQYRSEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSGSAHGGEMLFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNAS
PDKSSYHIRSPILNGELWVATKFFYLRPPTGNNTPSNPICPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC