; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g16970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g16970
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr8:12969470..12982702
RNA-Seq ExpressionMoc08g16970
SyntenyMoc08g16970
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157663.1 probable prolyl 4-hydroxylase 7 isoform X1 [Momordica charantia]1.5e-142100Show/hide
Query:  GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF
        GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF
Subjt:  GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF

Query:  LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG
        LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG
Subjt:  LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG

Query:  TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
        TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
Subjt:  TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

XP_022157664.1 probable prolyl 4-hydroxylase 7 isoform X2 [Momordica charantia]1.5e-142100Show/hide
Query:  GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF
        GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF
Subjt:  GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF

Query:  LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG
        LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG
Subjt:  LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG

Query:  TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
        TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
Subjt:  TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

XP_022931100.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata]9.3e-10876Show/hide
Query:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV
        S+IRMK  GSSI IDP+RV QLSSQPRAF+YKGFLSAEEC+HLI+LAKD LE+SLV DD+TG S +S +RTSTGMFL K QD IVAGIE++IAAWTFLPV
Subjt:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV

Query:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD
        DNGEP+Q+LRYENGQ+Y PHFDFFQDPVN+A GGHRIATVLMYLSNVE GGETVFP+S  K+   E K+L DC+  GY VKPK GDALLFFSLH N TTD
Subjt:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD

Query:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
         +SYHGSCPVI+GEKWSATKWIHML  DEIWR+PDCVD +  C+AWA  G
Subjt:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

XP_023530715.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo]1.9e-10876.4Show/hide
Query:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV
        S+IRMK  GSSI IDP+RV QLSSQPRAF+YKGFLSAEEC+HLI+LAKD LE+SLV DD+TG S +S +RTSTGMFL K QD IVAGIE++IAAWTFLPV
Subjt:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV

Query:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD
        DNGEP+Q+LRYENGQ+Y PHFDFFQDPVN+A GGHRIATVLMYLSNVE GGETVFP+S  K+   E K+LSDC+  GY VKPK GDALLFFSLH N TTD
Subjt:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD

Query:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
         +SYHGSCPVI+GEKWSATKWIHML  DEIWR+PDCVD +  C+AWA  G
Subjt:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

XP_038905410.1 probable prolyl 4-hydroxylase 7 isoform X2 [Benincasa hispida]2.7e-10775.6Show/hide
Query:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV
        S+IRMKT GS ++IDP+RV +LSS+PRAF+YKGFLS +EC+HLINLAK KL++SLVA + TGESVTS+ERTSTGMFL + QD+IVA IESRIAAWTFLP+
Subjt:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV

Query:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD
        DNGEP+Q+LRYENGQKY+PHFDFFQDPVN+A GGHRIAT+LMYLS+VE+GGETVFPNS +KLS +E+ +LSDCAK+GY VKPKMGDALLFFSL+ N T D
Subjt:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD

Query:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
        ++SYHGSCPVI+GEKWSATKWIHML   EIWR+P CVD +V C AWA+ G
Subjt:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

TrEMBL top hitse value%identityAlignment
A0A5D3D1X2 Procollagen-proline 4-dioxygenase5.0e-10776Show/hide
Query:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV
        S+IRMKTGGS+I+IDP+RV QLSS+PRAF+YKGFLS EEC+HLI+LAK KL +SLVA   TGESVTS+ERTSTGMFL K QDKIVA IESRIAAWTFLP+
Subjt:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV

Query:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD
        DNGEP+Q+LRYENGQKY+PHFDFFQDP N+A GGHRIAT+LMYLS+VE+GGETVFPNS VKLS  EK +LS+CAK+GY V+PK+GDALLFFS++ N T D
Subjt:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD

Query:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
        ++SYHGSCPVI+GEKWSATKWIHML  DE+WR+P CVD +  C+AWA  G
Subjt:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

A0A6J1DTY4 Procollagen-proline 4-dioxygenase7.3e-143100Show/hide
Query:  GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF
        GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF
Subjt:  GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF

Query:  LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG
        LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG
Subjt:  LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG

Query:  TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
        TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
Subjt:  TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

A0A6J1DX45 Procollagen-proline 4-dioxygenase7.3e-143100Show/hide
Query:  GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF
        GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF
Subjt:  GRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTF

Query:  LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG
        LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG
Subjt:  LPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANG

Query:  TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
        TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
Subjt:  TTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase4.5e-10876Show/hide
Query:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV
        S+IRMK  GSSI IDP+RV QLSSQPRAF+YKGFLSAEEC+HLI+LAKD LE+SLV DD+TG S +S +RTSTGMFL K QD IVAGIE++IAAWTFLPV
Subjt:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV

Query:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD
        DNGEP+Q+LRYENGQ+Y PHFDFFQDPVN+A GGHRIATVLMYLSNVE GGETVFP+S  K+   E K+L DC+  GY VKPK GDALLFFSLH N TTD
Subjt:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD

Query:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
         +SYHGSCPVI+GEKWSATKWIHML  DEIWR+PDCVD +  C+AWA  G
Subjt:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

A0A6J1I5Z9 Procollagen-proline 4-dioxygenase3.8e-10776Show/hide
Query:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV
        S+IRMK  GSSI IDP+RV QLSSQPRAF+YKGFLSAEEC+HLI+LAKD LE+SLV DD+TG S +S +RTSTGMFL K QD IVAGIE++IAAWTFLPV
Subjt:  SLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV

Query:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD
        DNGEP+Q+LRYENGQ+Y PHFDFFQDPVN+A GGHRIATVL+YLSNVE GGETVFP+S  K+   E K+LSDC+  GY VKPK GDALLFFSLH N TTD
Subjt:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD

Query:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
         +SYHGSCPVI+GEKWSATKWIHML  DEIWR+PDCVD +  C+AWA  G
Subjt:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 65.7e-8462.4Show/hide
Query:  LIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDV-TGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV
        LI  +    S S+DP+R+TQLS  PRAF+YKGFLS EEC+HLI LAK KLE+S+V  DV +GES  S  RTS+GMFL K QD IVA +E+++AAWTFLP 
Subjt:  LIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDV-TGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV

Query:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD
        +NGE +Q+L YENGQKYDPHFD+F D   +  GGHRIATVLMYLSNV +GGETVFPN   K    +    S CAK GYAVKP+ GDALLFF+LH NGTTD
Subjt:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD

Query:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
         +S HGSCPVI+GEKWSAT+WIH+ S  +  +   CVD   SC  WA  G
Subjt:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

F4JAU3 Prolyl 4-hydroxylase 22.1e-7859.34Show/hide
Query:  IDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVLRYEN
        I+PS+V Q+SS+PRAFVY+GFL+  EC+HLI+LAK+ L+ S VAD+  GES  S  RTS+G F+ KG+D IV+GIE +++ WTFLP +NGE +QVLRYE+
Subjt:  IDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVLRYEN

Query:  GQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHV---KLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHGSCPV
        GQKYD HFD+F D VN+A+GGHRIATVL+YLSNV +GGETVFP++     +  +  K +LSDCAK G AVKPK G+ALLFF+L  +   D  S HG CPV
Subjt:  GQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHV---KLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHGSCPV

Query:  IKGEKWSATKWIHMLSADEI-WRSPDCVDISVSCAAWASTG
        I+GEKWSATKWIH+ S D+I     +C D++ SC  WA  G
Subjt:  IKGEKWSATKWIHMLSADEI-WRSPDCVDISVSCAAWASTG

Q8L970 Probable prolyl 4-hydroxylase 71.5e-8962.3Show/hide
Query:  GSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLP
        GS+I+MKT  SS   DP+RVTQLS  PR F+Y+GFLS EEC+H I LAK KLE+S+VAD+ +GESV S  RTS+GMFL K QD IV+ +E+++AAWTFLP
Subjt:  GSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLP

Query:  VDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTT
         +NGE MQ+L YENGQKY+PHFD+F D  N+  GGHRIATVLMYLSNVE+GGETVFP    K +  +    ++CAK GYAVKP+ GDALLFF+LH N TT
Subjt:  VDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTT

Query:  DSSSYHGSCPVIKGEKWSATKWIHMLSADEIW-RSPDCVDISVSCAAWASTG
        DS+S HGSCPV++GEKWSAT+WIH+ S +  + +   C+D +VSC  WA  G
Subjt:  DSSSYHGSCPVIKGEKWSATKWIHMLSADEIW-RSPDCVDISVSCAAWASTG

Q8LAN3 Probable prolyl 4-hydroxylase 42.2e-8058.37Show/hide
Query:  SSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVL
        SS+ ++PS+V Q+SS+PRAFVY+GFL+  EC+H+++LAK  L+ S VAD+ +GES  S  RTS+G F+ KG+D IV+GIE +I+ WTFLP +NGE +QVL
Subjt:  SSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVL

Query:  RYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHV---KLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHG
        RYE+GQKYD HFD+F D VN+ +GGHR+AT+LMYLSNV +GGETVFP++ +   ++ +  K++LSDCAK G AVKP+ GDALLFF+LH +   D  S HG
Subjt:  RYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHV---KLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHG

Query:  SCPVIKGEKWSATKWIHMLSADEI-WRSPDCVDISVSCAAWASTG
         CPVI+GEKWSATKWIH+ S D I   S +C D++ SC  WA  G
Subjt:  SCPVIKGEKWSATKWIHMLSADEI-WRSPDCVDISVSCAAWASTG

Q9LN20 Probable prolyl 4-hydroxylase 31.7e-6458.33Show/hide
Query:  LSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVLRYENGQKYDPHF
        LS +PRAFVY  FLS EECE+LI+LAK  + +S V D  TG+S  SR RTS+G FL +G+DKI+  IE RIA +TF+P D+GE +QVL YE GQKY+PH+
Subjt:  LSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVLRYENGQKYDPHF

Query:  DFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSARE-KKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHGSCPVIKGEKWSATK
        D+F D  N   GG R+AT+LMYLS+VEEGGETVFP +++  S+     ELS+C K G +VKP+MGDALLF+S+  + T D +S HG CPVI+G KWS+TK
Subjt:  DFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSARE-KKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHGSCPVIKGEKWSATK

Query:  WIHM
        W+H+
Subjt:  WIHM

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.5e-7959.34Show/hide
Query:  IDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVLRYEN
        I+PS+V Q+SS+PRAFVY+GFL+  EC+HLI+LAK+ L+ S VAD+  GES  S  RTS+G F+ KG+D IV+GIE +++ WTFLP +NGE +QVLRYE+
Subjt:  IDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVLRYEN

Query:  GQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHV---KLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHGSCPV
        GQKYD HFD+F D VN+A+GGHRIATVL+YLSNV +GGETVFP++     +  +  K +LSDCAK G AVKPK G+ALLFF+L  +   D  S HG CPV
Subjt:  GQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHV---KLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHGSCPV

Query:  IKGEKWSATKWIHMLSADEI-WRSPDCVDISVSCAAWASTG
        I+GEKWSATKWIH+ S D+I     +C D++ SC  WA  G
Subjt:  IKGEKWSATKWIHMLSADEI-WRSPDCVDISVSCAAWASTG

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase1.1e-9062.3Show/hide
Query:  GSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLP
        GS+I+MKT  SS   DP+RVTQLS  PR F+Y+GFLS EEC+H I LAK KLE+S+VAD+ +GESV S  RTS+GMFL K QD IV+ +E+++AAWTFLP
Subjt:  GSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLP

Query:  VDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTT
         +NGE MQ+L YENGQKY+PHFD+F D  N+  GGHRIATVLMYLSNVE+GGETVFP    K +  +    ++CAK GYAVKP+ GDALLFF+LH N TT
Subjt:  VDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTT

Query:  DSSSYHGSCPVIKGEKWSATKWIHMLSADEIW-RSPDCVDISVSCAAWASTG
        DS+S HGSCPV++GEKWSAT+WIH+ S +  + +   C+D +VSC  WA  G
Subjt:  DSSSYHGSCPVIKGEKWSATKWIHMLSADEIW-RSPDCVDISVSCAAWASTG

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.0e-8357.69Show/hide
Query:  GSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTS----TGMFLVKGQ----DKIVAGIESR
        GS+I+MKT  SS   DP+RVTQLS  PR F+Y+GFLS EEC+H I LAK KLE+S+VAD+ +GESV S +  S    +  F+        D IV+ +E++
Subjt:  GSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTS----TGMFLVKGQ----DKIVAGIESR

Query:  IAAWTFLPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFF
        +AAWTFLP +NGE MQ+L YENGQKY+PHFD+F D  N+  GGHRIATVLMYLSNVE+GGETVFP    K +  +    ++CAK GYAVKP+ GDALLFF
Subjt:  IAAWTFLPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFF

Query:  SLHANGTTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIW-RSPDCVDISVSCAAWASTG
        +LH N TTDS+S HGSCPV++GEKWSAT+WIH+ S +  + +   C+D +VSC  WA  G
Subjt:  SLHANGTTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIW-RSPDCVDISVSCAAWASTG

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase4.0e-8562.4Show/hide
Query:  LIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDV-TGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV
        LI  +    S S+DP+R+TQLS  PRAF+YKGFLS EEC+HLI LAK KLE+S+V  DV +GES  S  RTS+GMFL K QD IVA +E+++AAWTFLP 
Subjt:  LIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDV-TGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPV

Query:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD
        +NGE +Q+L YENGQKYDPHFD+F D   +  GGHRIATVLMYLSNV +GGETVFPN   K    +    S CAK GYAVKP+ GDALLFF+LH NGTTD
Subjt:  DNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD

Query:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG
         +S HGSCPVI+GEKWSAT+WIH+ S  +  +   CVD   SC  WA  G
Subjt:  SSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTG

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.6e-8158.37Show/hide
Query:  SSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVL
        SS+ ++PS+V Q+SS+PRAFVY+GFL+  EC+H+++LAK  L+ S VAD+ +GES  S  RTS+G F+ KG+D IV+GIE +I+ WTFLP +NGE +QVL
Subjt:  SSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVL

Query:  RYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHV---KLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHG
        RYE+GQKYD HFD+F D VN+ +GGHR+AT+LMYLSNV +GGETVFP++ +   ++ +  K++LSDCAK G AVKP+ GDALLFF+LH +   D  S HG
Subjt:  RYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHV---KLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTDSSSYHG

Query:  SCPVIKGEKWSATKWIHMLSADEI-WRSPDCVDISVSCAAWASTG
         CPVI+GEKWSATKWIH+ S D I   S +C D++ SC  WA  G
Subjt:  SCPVIKGEKWSATKWIHMLSADEI-WRSPDCVDISVSCAAWASTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTAGACTTCTGTTCTGTCCTCCGATAAATTGTGTGACGAATGTCCTTGCAAGTTCTTTGAAGGAGGAGATTGACTTCCTCTTCAATTGTTGGAACCATATGCGGGC
TGAACCAGTCAACGTGAAGGAGAACACCTTACATCTGATGACTTCTGTCACCTCCTCGTTCATGCTTGCTACATTGCAATCTCTTGTCAACAAGGGTGTGCTTGAGATCT
GGTCAGCTAACCTGGTTGCTCTTGCTCCTACCGCACGCCACAAACTTCTGTTGTTGCGTGGACTTCTTCAGGCCCAAGCAATTCTCACTTCACTTTTCTCCTCGAATATG
TCAAAGTTGATAGGATATCTTCTCTTCTCAAATAGCTTGGATTTTGGAGATAACACCATCGTTGGCCAGACGAAAGCTTCCAAAGTAACTCTTGATCCAGCAAAGTTACC
TGAGTCGATGTCAACTGAAGAAAAGGAGACAATGGAAGAGATCGCTTATGGGACAATCATTCTGAATCTTAGTGATAGTGTCCTACGACAAGTCATTGATCTCGAAACTG
CATACAAGGAAGTTCATAAATCAACTACTAAAGCAGTTGAAAAATCCAAAATTGAGGTGGAGTTACCAACTGAAATAAAGGATAGCAGACATCAAGTTGGTCTTGGTCAT
GATGAAACTGAAGTAGTGGAAGGGGAAGAAACTCTACATATAGGTGAAACTTCTATGCAACAATCAGATTTGGCTTACTACTCACTTGCTAGATACAGACAGAGAAAGAA
AATTCATCCACCAAAGAGGGGACGAGGATCTCTTATTAGGATGAAAACGGGCGGTTCCTCCATTTCAATCGATCCCAGTCGTGTCACTCAGCTCTCATCGCAACCCAGGG
CTTTCGTATATAAGGGATTTCTGTCTGCAGAGGAGTGTGAGCATCTTATCAATTTGGCGAAGGATAAGCTTGAGGAATCGTTGGTGGCTGACGACGTGACGGGTGAGAGT
GTTACAAGTCGAGAACGGACGAGTACTGGCATGTTTCTTGTCAAGGGTCAGGACAAAATAGTTGCTGGCATCGAGTCCAGGATTGCTGCATGGACCTTTCTTCCCGTCGA
TAATGGGGAGCCTATGCAAGTACTGAGATACGAGAACGGTCAGAAATATGATCCACATTTTGATTTTTTTCAAGACCCAGTTAATATGGCCCAAGGTGGTCACCGGATTG
CCACAGTCTTGATGTATTTGTCCAATGTTGAAGAGGGTGGAGAAACAGTCTTTCCCAATTCTCATGTTAAATTATCCGCACGGGAGAAGAAGGAACTGTCTGATTGTGCT
AAGCTTGGGTATGCAGTAAAACCAAAGATGGGCGATGCTTTGCTGTTCTTCAGTCTCCACGCAAATGGGACAACGGACTCAAGCAGCTACCACGGGAGCTGCCCAGTGAT
AAAGGGCGAGAAATGGTCCGCGACGAAATGGATTCACATGCTTTCAGCCGACGAGATTTGGAGGAGTCCAGATTGTGTGGATATAAGTGTGAGCTGCGCTGCATGGGCAA
GTACAGGTGGTCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTAGACTTCTGTTCTGTCCTCCGATAAATTGTGTGACGAATGTCCTTGCAAGTTCTTTGAAGGAGGAGATTGACTTCCTCTTCAATTGTTGGAACCATATGCGGGC
TGAACCAGTCAACGTGAAGGAGAACACCTTACATCTGATGACTTCTGTCACCTCCTCGTTCATGCTTGCTACATTGCAATCTCTTGTCAACAAGGGTGTGCTTGAGATCT
GGTCAGCTAACCTGGTTGCTCTTGCTCCTACCGCACGCCACAAACTTCTGTTGTTGCGTGGACTTCTTCAGGCCCAAGCAATTCTCACTTCACTTTTCTCCTCGAATATG
TCAAAGTTGATAGGATATCTTCTCTTCTCAAATAGCTTGGATTTTGGAGATAACACCATCGTTGGCCAGACGAAAGCTTCCAAAGTAACTCTTGATCCAGCAAAGTTACC
TGAGTCGATGTCAACTGAAGAAAAGGAGACAATGGAAGAGATCGCTTATGGGACAATCATTCTGAATCTTAGTGATAGTGTCCTACGACAAGTCATTGATCTCGAAACTG
CATACAAGGAAGTTCATAAATCAACTACTAAAGCAGTTGAAAAATCCAAAATTGAGGTGGAGTTACCAACTGAAATAAAGGATAGCAGACATCAAGTTGGTCTTGGTCAT
GATGAAACTGAAGTAGTGGAAGGGGAAGAAACTCTACATATAGGTGAAACTTCTATGCAACAATCAGATTTGGCTTACTACTCACTTGCTAGATACAGACAGAGAAAGAA
AATTCATCCACCAAAGAGGGGACGAGGATCTCTTATTAGGATGAAAACGGGCGGTTCCTCCATTTCAATCGATCCCAGTCGTGTCACTCAGCTCTCATCGCAACCCAGGG
CTTTCGTATATAAGGGATTTCTGTCTGCAGAGGAGTGTGAGCATCTTATCAATTTGGCGAAGGATAAGCTTGAGGAATCGTTGGTGGCTGACGACGTGACGGGTGAGAGT
GTTACAAGTCGAGAACGGACGAGTACTGGCATGTTTCTTGTCAAGGGTCAGGACAAAATAGTTGCTGGCATCGAGTCCAGGATTGCTGCATGGACCTTTCTTCCCGTCGA
TAATGGGGAGCCTATGCAAGTACTGAGATACGAGAACGGTCAGAAATATGATCCACATTTTGATTTTTTTCAAGACCCAGTTAATATGGCCCAAGGTGGTCACCGGATTG
CCACAGTCTTGATGTATTTGTCCAATGTTGAAGAGGGTGGAGAAACAGTCTTTCCCAATTCTCATGTTAAATTATCCGCACGGGAGAAGAAGGAACTGTCTGATTGTGCT
AAGCTTGGGTATGCAGTAAAACCAAAGATGGGCGATGCTTTGCTGTTCTTCAGTCTCCACGCAAATGGGACAACGGACTCAAGCAGCTACCACGGGAGCTGCCCAGTGAT
AAAGGGCGAGAAATGGTCCGCGACGAAATGGATTCACATGCTTTCAGCCGACGAGATTTGGAGGAGTCCAGATTGTGTGGATATAAGTGTGAGCTGCGCTGCATGGGCAA
GTACAGGTGGTCTCTGA
Protein sequenceShow/hide protein sequence
MGRLLFCPPINCVTNVLASSLKEEIDFLFNCWNHMRAEPVNVKENTLHLMTSVTSSFMLATLQSLVNKGVLEIWSANLVALAPTARHKLLLLRGLLQAQAILTSLFSSNM
SKLIGYLLFSNSLDFGDNTIVGQTKASKVTLDPAKLPESMSTEEKETMEEIAYGTIILNLSDSVLRQVIDLETAYKEVHKSTTKAVEKSKIEVELPTEIKDSRHQVGLGH
DETEVVEGEETLHIGETSMQQSDLAYYSLARYRQRKKIHPPKRGRGSLIRMKTGGSSISIDPSRVTQLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGES
VTSRERTSTGMFLVKGQDKIVAGIESRIAAWTFLPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATVLMYLSNVEEGGETVFPNSHVKLSAREKKELSDCA
KLGYAVKPKMGDALLFFSLHANGTTDSSSYHGSCPVIKGEKWSATKWIHMLSADEIWRSPDCVDISVSCAAWASTGGL