; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021518 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021518
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationtig00153754:479616..482524
RNA-Seq ExpressionSgr021518
SyntenySgr021518
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043468.1 putative prolyl 4-hydroxylase 12 [Cucumis melo var. makuwa]2.5e-11176.49Show/hide
Query:  WKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQ
        + N+ ++  PS    ++ + RVFLYKGFLSD+ECDHLISLA++S+D PSRN+      VS +LL  SG ILNTTD+IIARIENRIAVWT LPKD+ MP Q
Subjt:  WKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQ

Query:  IMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDG
        IMQY GEEA+HKY +GNRS M SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWS  RKK+N L+PVKGNAILFFSVHLNASPDKSS H R PI +G
Subjt:  IMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDG

Query:  ELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        ELWVATKF YLRP TGNKHT++SN D CIDED+SCPQWAAIGECERNAVFM+GSPDYYGTCRKSCNAC
Subjt:  ELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

KAG6579383.1 putative prolyl 4-hydroxylase 12, partial [Cucurbita argyrosperma subsp. sororia]8.7e-11273.61Show/hide
Query:  GFKGSIDEQCTFGLLKSFWKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIE
        G KG  D+    G L   + N   +  PS    ++ + R FLYKGFLSDEECDHLI+LA++SEDKPSRNN      VS K L +SGAILNTTD+IIARIE
Subjt:  GFKGSIDEQCTFGLLKSFWKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIE

Query:  NRIAVWTFLPKDYSMPLQIMQYGGEEAE-HKYVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHL
        NRIAVWTFLPKD+SMP QIMQYGGEEA  HKY FGNRS M SSEPLMATVVLYLSDSASGGE+LFP SKVK +FWSD RKK N L+PVKGNA+LFFSVHL
Subjt:  NRIAVWTFLPKDYSMPLQIMQYGGEEAE-HKYVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHL

Query:  NASPDKSSSHTRCPILDGELWVATKFFYLRP-ITGNKHTVES--NGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        NASPDKS  H+R PILDG+LWVATKFFY+RP  TGN+H VES  + DCIDEDESCP+WAAIGEC+RNAVFMIGSPDYYGTCRKSCNAC
Subjt:  NASPDKSSSHTRCPILDGELWVATKFFYLRP-ITGNKHTVES--NGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

XP_008436994.1 PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo]3.9e-11277.24Show/hide
Query:  WKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQ
        + N+ ++  PS    ++ + RVFLYKGFLSDEECDHLISLA++SED PSRN+      VS +LL  SG ILNTTD+IIARIENRIAVWT LPKD+ MP Q
Subjt:  WKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQ

Query:  IMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDG
        IMQY GEEA+HKY +GNRS M SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWS  RKK+N L+PVKGNAILFFSVHLNASPDKSS H R PI +G
Subjt:  IMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDG

Query:  ELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        ELWVATKF YLRP TGNKHT++SN D CIDED+SCPQWAAIGECERNAVFM+GSPDYYGTCRKSCNAC
Subjt:  ELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

XP_022159842.1 probable prolyl 4-hydroxylase 12 [Momordica charantia]3.5e-12187.8Show/hide
Query:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKYVFGNRST
        RVFLYKGFLSDEECDHLISLA SSEDKPS N+      V  K+LKSSGAILNTTD+IIARIENRIAVWTFLPKDYSMPLQI+QYGGEEAEHKYVFGNRS 
Subjt:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKYVFGNRST

Query:  MSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTV
        M SSEPLMATVVLYLSDSASGGEM FPESKVKS+FWSD RKK NIL+PVKGNA+L FSVHLNASPDKSSSHTR PILDGELW+ATKFFYLRPITGNKHT 
Subjt:  MSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTV

Query:  ESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        E +GDC DED+SCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
Subjt:  ESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

XP_038906497.1 probable prolyl 4-hydroxylase 12 [Benincasa hispida]1.6e-11383.81Show/hide
Query:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKYVFGNRST
        RVFLYKGFLSDEECDHLISLA++SED PS N+      VS KLL SSG ILNT+D+IIARIEN+IAVWTFLPKD+ MP QIMQY GEEAEHKY +GN S 
Subjt:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKYVFGNRST

Query:  MSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTV
        MSSSEPLMATVVLYLSDSA GGEMLFPESKVKSKFWSD RKK N L+PVKGNAILFFSVHLNASPDKSS HTR PIL+GELWVATKFFYLRP TGNK TV
Subjt:  MSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTV

Query:  ESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        ES+ D CIDED+SCPQWAAIGECERN VFMIGSPDYYGTCRKSCNAC
Subjt:  ESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

TrEMBL top hitse value%identityAlignment
A0A0A0KPE4 Procollagen-proline 4-dioxygenase3.0e-11072.79Show/hide
Query:  GFKGSIDEQCTFGLLKSFWKNRPIKSCPSLLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRI
        G KG  D      L  S +  R   S    ++ + RVFLYKGFLSDEECDHLISLA++SED PSRN+      VS +LL SSG ILNTTD+I+ARIENR+
Subjt:  GFKGSIDEQCTFGLLKSFWKNRPIKSCPSLLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRI

Query:  AVWTFLPKDYSMPLQIMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNAS
        A+WT LPKD+SMP QIMQY GEEA+HKY +GNRS M  SSEPLMATVVLYLSDSASGGE+LFPESKVKSKFWS  RKK N L+PVKGNAILFFSVHLNAS
Subjt:  AVWTFLPKDYSMPLQIMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNAS

Query:  PDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        PDKSS H R PI DGELWVATKF YL P  GNKHT++S+ D C DED+SCPQWAAIGECERNAVFM+GSPDYYGTCRKSCNAC
Subjt:  PDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

A0A1S3AT39 Procollagen-proline 4-dioxygenase1.9e-11277.24Show/hide
Query:  WKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQ
        + N+ ++  PS    ++ + RVFLYKGFLSDEECDHLISLA++SED PSRN+      VS +LL  SG ILNTTD+IIARIENRIAVWT LPKD+ MP Q
Subjt:  WKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQ

Query:  IMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDG
        IMQY GEEA+HKY +GNRS M SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWS  RKK+N L+PVKGNAILFFSVHLNASPDKSS H R PI +G
Subjt:  IMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDG

Query:  ELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        ELWVATKF YLRP TGNKHT++SN D CIDED+SCPQWAAIGECERNAVFM+GSPDYYGTCRKSCNAC
Subjt:  ELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

A0A5A7TKX1 Procollagen-proline 4-dioxygenase1.2e-11176.49Show/hide
Query:  WKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQ
        + N+ ++  PS    ++ + RVFLYKGFLSD+ECDHLISLA++S+D PSRN+      VS +LL  SG ILNTTD+IIARIENRIAVWT LPKD+ MP Q
Subjt:  WKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQ

Query:  IMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDG
        IMQY GEEA+HKY +GNRS M SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWS  RKK+N L+PVKGNAILFFSVHLNASPDKSS H R PI +G
Subjt:  IMQYGGEEAEHKYVFGNRSTM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDG

Query:  ELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        ELWVATKF YLRP TGNKHT++SN D CIDED+SCPQWAAIGECERNAVFM+GSPDYYGTCRKSCNAC
Subjt:  ELWVATKFFYLRPITGNKHTVESNGD-CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

A0A6J1E0X9 Procollagen-proline 4-dioxygenase1.7e-12187.8Show/hide
Query:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKYVFGNRST
        RVFLYKGFLSDEECDHLISLA SSEDKPS N+      V  K+LKSSGAILNTTD+IIARIENRIAVWTFLPKDYSMPLQI+QYGGEEAEHKYVFGNRS 
Subjt:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKYVFGNRST

Query:  MSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTV
        M SSEPLMATVVLYLSDSASGGEM FPESKVKS+FWSD RKK NIL+PVKGNA+L FSVHLNASPDKSSSHTR PILDGELW+ATKFFYLRPITGNKHT 
Subjt:  MSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTV

Query:  ESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        E +GDC DED+SCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
Subjt:  ESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

A0A6J1E2P0 Procollagen-proline 4-dioxygenase3.6e-11172.92Show/hide
Query:  GFKGSIDEQCTFGLLKSFWKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIE
        G KG  D+    G L   + N   +  PS    ++ + R FLYKGFLSDEECDHLI+LA++SEDKPSRNN      VS K L +SGAILNTTD+II RIE
Subjt:  GFKGSIDEQCTFGLLKSFWKNRPIKSCPS---LLATKVRVFLYKGFLSDEECDHLISLAASSEDKPSRNN------VSIKLLKSSGAILNTTDNIIARIE

Query:  NRIAVWTFLPKDYSMPLQIMQYGGEEAE-HKYVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHL
        NRIAVWTFLPKD+SMP QIM+YGGEEA  HKY FGNRS M SSEPLMATVVLYLSDSASGGE+LFP SKVK +FWSD RKK N L+PVKGNA+LFFSVHL
Subjt:  NRIAVWTFLPKDYSMPLQIMQYGGEEAE-HKYVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHL

Query:  NASPDKSSSHTRCPILDGELWVATKFFYLRP-ITGNKHTVES--NGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        NASPDKS  H+R PILDG+LWVATKFFY+RP  TGN+H VES  + DCIDEDESCP+WAAIGEC+RNAVFMIGSPDYYGTCRKSCNAC
Subjt:  NASPDKSSSHTRCPILDGELWVATKFFYLRP-ITGNKHTVES--NGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.2e-5041.96Show/hide
Query:  RVFLYKGFLSDEECDHLISLAASSEDK-------PSRNNVSIKLLKSSGAIL-NTTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHKYVFG
        R FLYKGFLSDEECDHLI LA    +K        S  +   ++  SSG  L    D+I+A +E ++A WTFLP++    LQI+ Y  G +   H   F 
Subjt:  RVFLYKGFLSDEECDHLISLAASSEDK-------PSRNNVSIKLLKSSGAIL-NTTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHKYVFG

Query:  NRSTMSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLR
        ++  +      +ATV++YLS+   GGE +FP  K     +K   WS C K+   +KP KG+A+LFF++HLN + D +S H  CP+++GE W AT++ ++R
Subjt:  NRSTMSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLR

Query:  PITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
           G K  V     C+D+ ESC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  PITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 23.5e-4738.17Show/hide
Query:  LATKVRVFLYKGFLSDEECDHLISLA-------ASSEDKPSRNNVSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHK
        +++K R F+Y+GFL+D ECDHLISLA       A +++    + VS     S   I    D I++ IE++++ WTFLPK+    LQ+++Y  G +   H 
Subjt:  LATKVRVFLYKGFLSDEECDHLISLA-------ASSEDKPSRNNVSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHK

Query:  YVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVA
          F ++  ++     +ATV+LYLS+   GGE +FP+++  S+          SDC KK   +KP KGNA+LFF++  +A PD  S H  CP+++GE W A
Subjt:  YVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVA

Query:  TKFFYLRPITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        TK+ +   +      +  +G+C D +ESC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  TKFFYLRPITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 121.4e-5948.55Show/hide
Query:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNNVSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKY-VFGNRSTMSSSE
        RVFLY+GFLS+EECDHLISL         +    +  + + G      D ++A IE +++ WTFLP +    +++  Y  E++  K   FG   +    E
Subjt:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNNVSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKY-VFGNRSTMSSSE

Query:  PLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTVESNGD
         L+ATVVLYLS++  GGE+LFP S++K K  + C +  NIL+PVKGNAILFF+  LNAS D  S+H RCP++ GEL VATK  Y +     +  +E +G+
Subjt:  PLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTVESNGD

Query:  CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        C DEDE+C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Subjt:  CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 73.7e-4938.58Show/hide
Query:  RVFLYKGFLSDEECDHLISLAASS------EDKPSRNNVSIKLLKSSGAILN-TTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHKYVFGN
        RVFLY+GFLSDEECDH I LA          D  S  +V  ++  SSG  L+   D+I++ +E ++A WTFLP++    +QI+ Y  G +   H   F +
Subjt:  RVFLYKGFLSDEECDHLISLAASS------EDKPSRNNVSIKLLKSSGAILN-TTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHKYVFGN

Query:  RSTMSSSEPLMATVVLYLSDSASGGEMLFP-----ESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRP
        ++ +      +ATV++YLS+   GGE +FP      +++K   W++C K+   +KP KG+A+LFF++H NA+ D +S H  CP+++GE W AT++ +++ 
Subjt:  RSTMSSSEPLMATVVLYLSDSASGGEMLFP-----ESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRP

Query:  ITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
            +        C+DE+ SC +WA  GEC++N  +M+GS   +G CRKSC AC
Subjt:  ITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 41.0e-4636.64Show/hide
Query:  LATKVRVFLYKGFLSDEECDHLISLAASS------EDKPSRNNVSIKLLKSSGAILNT-TDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHK
        +++K R F+Y+GFL++ ECDH++SLA +S       D  S  +   ++  SSG  ++   D I++ IE++I+ WTFLPK+    +Q+++Y  G +   H 
Subjt:  LATKVRVFLYKGFLSDEECDHLISLAASS------EDKPSRNNVSIKLLKSSGAILNT-TDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHK

Query:  YVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVA
          F ++  +      MAT+++YLS+   GGE +FP++++ S+          SDC K+   +KP KG+A+LFF++H +A PD  S H  CP+++GE W A
Subjt:  YVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVA

Query:  TKFFYLRPITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        TK+ +   +      V  +G+C D +ESC +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  TKFFYLRPITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 22.5e-4838.17Show/hide
Query:  LATKVRVFLYKGFLSDEECDHLISLA-------ASSEDKPSRNNVSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHK
        +++K R F+Y+GFL+D ECDHLISLA       A +++    + VS     S   I    D I++ IE++++ WTFLPK+    LQ+++Y  G +   H 
Subjt:  LATKVRVFLYKGFLSDEECDHLISLA-------ASSEDKPSRNNVSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHK

Query:  YVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVA
          F ++  ++     +ATV+LYLS+   GGE +FP+++  S+          SDC KK   +KP KGNA+LFF++  +A PD  S H  CP+++GE W A
Subjt:  YVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVA

Query:  TKFFYLRPITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        TK+ +   +      +  +G+C D +ESC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  TKFFYLRPITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase2.7e-5038.58Show/hide
Query:  RVFLYKGFLSDEECDHLISLAASS------EDKPSRNNVSIKLLKSSGAILN-TTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHKYVFGN
        RVFLY+GFLSDEECDH I LA          D  S  +V  ++  SSG  L+   D+I++ +E ++A WTFLP++    +QI+ Y  G +   H   F +
Subjt:  RVFLYKGFLSDEECDHLISLAASS------EDKPSRNNVSIKLLKSSGAILN-TTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHKYVFGN

Query:  RSTMSSSEPLMATVVLYLSDSASGGEMLFP-----ESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRP
        ++ +      +ATV++YLS+   GGE +FP      +++K   W++C K+   +KP KG+A+LFF++H NA+ D +S H  CP+++GE W AT++ +++ 
Subjt:  RSTMSSSEPLMATVVLYLSDSASGGEMLFP-----ESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRP

Query:  ITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
            +        C+DE+ SC +WA  GEC++N  +M+GS   +G CRKSC AC
Subjt:  ITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase2.9e-4938.02Show/hide
Query:  RVFLYKGFLSDEECDHLISLAA------------SSEDKPSRNNVSIKLLKSSGAILN----TTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEE
        RVFLY+GFLSDEECDH I LA             S E   S ++VS+ + +SS  I N      D+I++ +E ++A WTFLP++    +QI+ Y  G + 
Subjt:  RVFLYKGFLSDEECDHLISLAA------------SSEDKPSRNNVSIKLLKSSGAILN----TTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEE

Query:  AEHKYVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFP-----ESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWV
          H   F +++ +      +ATV++YLS+   GGE +FP      +++K   W++C K+   +KP KG+A+LFF++H NA+ D +S H  CP+++GE W 
Subjt:  AEHKYVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFP-----ESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWV

Query:  ATKFFYLRPITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        AT++ +++     +        C+DE+ SC +WA  GEC++N  +M+GS   +G CRKSC AC
Subjt:  ATKFFYLRPITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase8.3e-5241.96Show/hide
Query:  RVFLYKGFLSDEECDHLISLAASSEDK-------PSRNNVSIKLLKSSGAIL-NTTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHKYVFG
        R FLYKGFLSDEECDHLI LA    +K        S  +   ++  SSG  L    D+I+A +E ++A WTFLP++    LQI+ Y  G +   H   F 
Subjt:  RVFLYKGFLSDEECDHLISLAASSEDK-------PSRNNVSIKLLKSSGAIL-NTTDNIIARIENRIAVWTFLPKDYSMPLQIMQY--GGEEAEHKYVFG

Query:  NRSTMSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLR
        ++  +      +ATV++YLS+   GGE +FP  K     +K   WS C K+   +KP KG+A+LFF++HLN + D +S H  CP+++GE W AT++ ++R
Subjt:  NRSTMSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLR

Query:  PITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
           G K  V     C+D+ ESC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  PITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase9.7e-6148.55Show/hide
Query:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNNVSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKY-VFGNRSTMSSSE
        RVFLY+GFLS+EECDHLISL         +    +  + + G      D ++A IE +++ WTFLP +    +++  Y  E++  K   FG   +    E
Subjt:  RVFLYKGFLSDEECDHLISLAASSEDKPSRNNVSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKY-VFGNRSTMSSSE

Query:  PLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTVESNGD
         L+ATVVLYLS++  GGE+LFP S++K K  + C +  NIL+PVKGNAILFF+  LNAS D  S+H RCP++ GEL VATK  Y +     +  +E +G+
Subjt:  PLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCRKKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTVESNGD

Query:  CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        C DEDE+C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Subjt:  CIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCCAGATTCATCGAACTACAGAAGAAATCATGGATTTCTACAGCAGCAGCCATCCATGGATTCTCGTCTTCACGTTTTCCTTCTTCTAGCGACTGCATTTTCAT
TCTCGAGCTGCCTTGCACAAAGCAATTTGATCAGTGGCCGGAAGGGTTTAAGGGATCAATTGATGAACAGTGTACCTTTGGGCTACTCAAATCATTCTGGAAGAATCGAC
CCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGTCAGGGTTTTCTTGTACAAAGGTTTTCTTTCAGATGAGGAGTGTGATCACCTTATTTCTTTGGCTGCAAGTTCA
GAAGATAAGCCTTCTCGGAACAATGTCTCAATTAAATTGCTGAAGAGTTCAGGAGCCATTTTAAACACAACAGATAACATCATTGCAAGGATTGAAAATCGAATTGCCGT
ATGGACTTTTCTCCCGAAAGATTACAGCATGCCTTTGCAGATTATGCAGTACGGGGGTGAAGAAGCAGAGCATAAGTATGTTTTTGGTAACAGGTCCACTATGTCGTCCA
GTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTTTCAGATTCGGCTAGCGGAGGCGAGATGCTCTTTCCTGAATCAAAGGTAAAGAGCAAATTTTGGTCAGACTGTAGA
AAGAAAAGAAACATTCTGAAACCAGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTCCCATACCCGATGTCCGATACT
CGATGGGGAATTGTGGGTTGCAACAAAATTCTTCTACTTAAGACCGATCACTGGGAATAAACACACAGTCGAATCCAATGGAGACTGCATTGATGAAGATGAAAGCTGCC
CCCAATGGGCTGCCATTGGCGAATGCGAACGAAATGCTGTTTTCATGATCGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAACGCATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGATCCAGATTCATCGAACTACAGAAGAAATCATGGATTTCTACAGCAGCAGCCATCCATGGATTCTCGTCTTCACGTTTTCCTTCTTCTAGCGACTGCATTTTCAT
TCTCGAGCTGCCTTGCACAAAGCAATTTGATCAGTGGCCGGAAGGGTTTAAGGGATCAATTGATGAACAGTGTACCTTTGGGCTACTCAAATCATTCTGGAAGAATCGAC
CCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGTCAGGGTTTTCTTGTACAAAGGTTTTCTTTCAGATGAGGAGTGTGATCACCTTATTTCTTTGGCTGCAAGTTCA
GAAGATAAGCCTTCTCGGAACAATGTCTCAATTAAATTGCTGAAGAGTTCAGGAGCCATTTTAAACACAACAGATAACATCATTGCAAGGATTGAAAATCGAATTGCCGT
ATGGACTTTTCTCCCGAAAGATTACAGCATGCCTTTGCAGATTATGCAGTACGGGGGTGAAGAAGCAGAGCATAAGTATGTTTTTGGTAACAGGTCCACTATGTCGTCCA
GTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTTTCAGATTCGGCTAGCGGAGGCGAGATGCTCTTTCCTGAATCAAAGGTAAAGAGCAAATTTTGGTCAGACTGTAGA
AAGAAAAGAAACATTCTGAAACCAGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTCCCATACCCGATGTCCGATACT
CGATGGGGAATTGTGGGTTGCAACAAAATTCTTCTACTTAAGACCGATCACTGGGAATAAACACACAGTCGAATCCAATGGAGACTGCATTGATGAAGATGAAAGCTGCC
CCCAATGGGCTGCCATTGGCGAATGCGAACGAAATGCTGTTTTCATGATCGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAACGCATGTTGA
Protein sequenceShow/hide protein sequence
MGSRFIELQKKSWISTAAAIHGFSSSRFPSSSDCIFILELPCTKQFDQWPEGFKGSIDEQCTFGLLKSFWKNRPIKSCPSLLATKVRVFLYKGFLSDEECDHLISLAASS
EDKPSRNNVSIKLLKSSGAILNTTDNIIARIENRIAVWTFLPKDYSMPLQIMQYGGEEAEHKYVFGNRSTMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSDCR
KKRNILKPVKGNAILFFSVHLNASPDKSSSHTRCPILDGELWVATKFFYLRPITGNKHTVESNGDCIDEDESCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC