CuGenDBv2

Lag0027836 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0027836
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: chr8:5726023..5728987
RNA-Seq Expression: Lag0027836
Synteny: Lag0027836
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
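
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the names `Location` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval parsed from a 'chrom:start..end' string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location such as 'chr8:5726023..5728987'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr8:5726023..5728987")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2965 bp on chr8, which is longer than the 903 nt CDS listed further down the record.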


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004147455.1 probable prolyl 4-hydroxylase 7 [Cucumis sativus] | e-value: 8.8e-135 | % identity: 77.7
Query:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        S SANR PKL+LHN ++ +SVIRMKT GS+++IDP+RV QL  +PRAFLYKGFLSAEEC HLIN AK KL QSLV    TG SV S+ERTSTGMFLH+AQ
Subjt:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        DEIVA IESRIAAWTFLP+DNGEPIQ+LRYENGQKYEPH+DFFQDP NIA GGHRIATILMYLS+VEKGGETVFP+SPVKLSE+E+ +LS+C K GYGV+
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCS
        PK GDALLFFS++PN TPDTTS+HGSCPVI+GEKWSATKWIHML ++E WRN ACVDE+++C AWAKAGECEKNP YM+GSK++LG+CR SCKVCS
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCS

XP_008443446.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo] | e-value: 2.6e-134 | % identity: 76.92
Query:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        S SANR PK+LLHN +M +SVIRMKT GS+I+IDP+RV QL  +PRAFLYKGFLS EEC HLI+LAK KL+QSLV    TG SV S+ERTSTGMFL +AQ
Subjt:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        D+IVA IESRIAAWTFLP+DNGEPIQ+LRYENGQKYEPH+DFFQDP NIA GGHRIATILMYLSDVEKGGETVFP+SPVKLSE+E+ +LS+CAK GYGV+
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSAPS
        PK GDALLFFS++PN TPD TS+HGSCPVI+GEKWSATKWIHML ++E+WRN ACVDE+++C AWAKAGEC+KNP YM+GSK++LG+CR SCKVCS  S
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSAPS

XP_022931100.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata] | e-value: 2.0e-134 | % identity: 75.58
Query:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        +RSANRLPKLLL +     SVIRMK DGSSI IDP+RV QL  QPRAFLYKGFLSAEEC HLI+LAK+ L+QSLV DD TGAS +S +RTSTGMFL++AQ
Subjt:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        D+IVAGIE++IAAWTFLP+DNGEPIQ+LRYENGQ+Y PH+DFFQDPVN+A+GGHRIAT+LMYLS+VE+GGETVFPDSP K+ E+E ++L DC+  GYGVK
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMV----GSKDDLGYCRKSCKVCS
        PKKGDALLFFSLHPN T D TS+HGSCPVI+GEKWSATKWIHML ++EIWRN  CVDE+E+C AWAKAGECEKNP YMV    GSK++LGYCR SCK CS
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMV----GSKDDLGYCRKSCKVCS

Query:  APS
         PS
Subjt:  APS

XP_023530715.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo] | e-value: 1.4e-135 | % identity: 76.24
Query:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        +RSANRLPKLLL +     SVIRMK DGSSI IDP+RV QL  QPRAFLYKGFLSAEEC HLI+LAK+ L+QSLV DD TGAS +S +RTSTGMFL++AQ
Subjt:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        D+IVAGIE++IAAWTFLP+DNGEPIQ+LRYENGQ+Y PH+DFFQDPVN+A+GGHRIAT+LMYLS+VE+GGETVFPDSP K+ E+E ++LSDC+  GYGVK
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMV----GSKDDLGYCRKSCKVCS
        PKKGDALLFFSLHPN T D TS+HGSCPVI+GEKWSATKWIHML ++EIWRN  CVDE+E+C AWAKAGECEKNP YMV    GSK+DLGYCR SCK CS
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMV----GSKDDLGYCRKSCKVCS

Query:  APS
         PS
Subjt:  APS

XP_038905408.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida] | e-value: 1.8e-140 | % identity: 81.61
Query:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        SRSANRLPKLLLHN NM QSVIRMKT GS ++IDP+RV +L  +PRAFLYKGFLS +EC HLINLAK KLQQSLV  + TG SV S+ERTSTGMFL RAQ
Subjt:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        DEIVA IESRIAAWTFLPIDNGEPIQ+LRYENGQKYEPH+DFFQDPVNIA GGHRIATILMYLSDVEKGGETVFP+SP+KLSEQER +LSDCAK GYGVK
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSAPS
        PK GDALLFFSL+PN TPD TS+HGSCPVI+GEKWSATKWIHML + EIWRN ACVDE+  C+AWA AGECEKNP YM+GSK++LG+CR SCKVCS PS
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSAPS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LG32 Procollagen-proline 4-dioxygenase | e-value: 4.3e-135 | % identity: 77.7
Query:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        S SANR PKL+LHN ++ +SVIRMKT GS+++IDP+RV QL  +PRAFLYKGFLSAEEC HLIN AK KL QSLV    TG SV S+ERTSTGMFLH+AQ
Subjt:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        DEIVA IESRIAAWTFLP+DNGEPIQ+LRYENGQKYEPH+DFFQDP NIA GGHRIATILMYLS+VEKGGETVFP+SPVKLSE+E+ +LS+C K GYGV+
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCS
        PK GDALLFFS++PN TPDTTS+HGSCPVI+GEKWSATKWIHML ++E WRN ACVDE+++C AWAKAGECEKNP YM+GSK++LG+CR SCKVCS
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCS

A0A1S3B814 Procollagen-proline 4-dioxygenase | e-value: 1.2e-134 | % identity: 76.92
Query:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        S SANR PK+LLHN +M +SVIRMKT GS+I+IDP+RV QL  +PRAFLYKGFLS EEC HLI+LAK KL+QSLV    TG SV S+ERTSTGMFL +AQ
Subjt:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        D+IVA IESRIAAWTFLP+DNGEPIQ+LRYENGQKYEPH+DFFQDP NIA GGHRIATILMYLSDVEKGGETVFP+SPVKLSE+E+ +LS+CAK GYGV+
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSAPS
        PK GDALLFFS++PN TPD TS+HGSCPVI+GEKWSATKWIHML ++E+WRN ACVDE+++C AWAKAGEC+KNP YM+GSK++LG+CR SCKVCS  S
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSAPS

A0A6J1DX45 Procollagen-proline 4-dioxygenase | e-value: 1.4e-130 | % identity: 75.76
Query:  RSANRLPKLLLHNKNMGQ-SVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        RS N +P+LL+   NMG+ S+IRMKT GSSISIDPSRVTQL  QPRAF+YKGFLSAEEC HLINLAK+KL++SLV DD TG SV S ERTSTGMFL + Q
Subjt:  RSANRLPKLLLHNKNMGQ-SVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        D+IVAGIESRIAAWTFLP+DNGEP+QVLRYENGQKY+PH+DFFQDPVN+A GGHRIAT+LMYLS+VE+GGETVFP+S VKLS +E++ LSDCAK GY VK
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSA
        PK GDALLFFSLH N T D++S+HGSCPVIKGEKWSATKWIHMLS +EIWR+  CVD S  C AWA  GEC KNP YM+GSK +LGYCRKSC  CS+
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSA

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase | e-value: 9.5e-135 | % identity: 75.58
Query:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        +RSANRLPKLLL +     SVIRMK DGSSI IDP+RV QL  QPRAFLYKGFLSAEEC HLI+LAK+ L+QSLV DD TGAS +S +RTSTGMFL++AQ
Subjt:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        D+IVAGIE++IAAWTFLP+DNGEPIQ+LRYENGQ+Y PH+DFFQDPVN+A+GGHRIAT+LMYLS+VE+GGETVFPDSP K+ E+E ++L DC+  GYGVK
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMV----GSKDDLGYCRKSCKVCS
        PKKGDALLFFSLHPN T D TS+HGSCPVI+GEKWSATKWIHML ++EIWRN  CVDE+E+C AWAKAGECEKNP YMV    GSK++LGYCR SCK CS
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMV----GSKDDLGYCRKSCKVCS

Query:  APS
         PS
Subjt:  APS

A0A6J1I5Z9 Procollagen-proline 4-dioxygenase | e-value: 1.4e-133 | % identity: 75.58
Query:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ
        +RSANRLPKLLL +     SVIRMK DGSSI IDP+RV QL  QPRAFLYKGFLSAEEC HLI+LAK+ L+QSLV DD TGAS +S +RTSTGMFL++AQ
Subjt:  SRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQ

Query:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK
        D+IVAGIE++IAAWTFLP+DNGEP+Q+LRYENGQ+Y PH+DFFQDPVN+A+GGHRIAT+L+YLS+VE+GGETVFPDSP K+ E E ++LSDC+  GYGVK
Subjt:  DEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVK

Query:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMV----GSKDDLGYCRKSCKVCS
        PKKGDALLFFSLHPN T D TS+HGSCPVI+GEKWSATKWIHML L+EIWRN  CVDE+E+C AWAKAGECEKNP YMV    GSK++LGYCR SCK CS
Subjt:  PKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMV----GSKDDLGYCRKSCKVCS

Query:  APS
         PS
Subjt:  APS

SwissProt top hits (e-value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e-value: 1.5e-97 | % identity: 62.55
Query:  SISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDA-TGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVL
        S S+DP+R+TQL   PRAFLYKGFLS EEC HLI LAK KL++S+V  D  +G S  SE RTS+GMFL + QD+IVA +E+++AAWTFLP +NGE +Q+L
Subjt:  SISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDA-TGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVL

Query:  RYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHGSCP
         YENGQKY+PH+D+F D   +  GGHRIAT+LMYLS+V KGGETVFP+   K  + + ++ S CAK GY VKP+KGDALLFF+LH N T D  S HGSCP
Subjt:  RYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHGSCP

Query:  VIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC
        VI+GEKWSAT+WIH+ S  +  +   CVD+ E CQ WA AGECEKNP YMVGS+  LG+CRKSCK C
Subjt:  VIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC

F4JAU3 Prolyl 4-hydroxylase 2 | e-value: 3.8e-88 | % identity: 57.09
Query:  QSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLP
        QS   + +  SSI I+PS+V Q+  +PRAF+Y+GFL+  EC HLI+LAK  LQ+S V D+  G S  S+ RTS+G F+ + +D IV+GIE +++ WTFLP
Subjt:  QSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLP

Query:  IDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPD----SPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHP
         +NGE +QVLRYE+GQKY+ H+D+F D VNIA GGHRIAT+L+YLS+V KGGETVFPD    S   LSE  +++LSDCAK G  VKPKKG+ALLFF+L  
Subjt:  IDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPD----SPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHP

Query:  NATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKA-CVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC
        +A PD  S HG CPVI+GEKWSATKWIH+ S ++I  +   C D +E C+ WA  GEC KNP YMVG+ +  G CR+SCK C
Subjt:  NATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKA-CVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC

Q8L970 Probable prolyl 4-hydroxylase 7 | e-value: 4.0e-106 | % identity: 62.72
Query:  SVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPI
        SVI+MKT  SS   DP+RVTQL   PR FLY+GFLS EEC H I LAK KL++S+V D+ +G SV SE RTS+GMFL + QD+IV+ +E+++AAWTFLP 
Subjt:  SVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPI

Query:  DNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPD
        +NGE +Q+L YENGQKYEPH+D+F D  N+  GGHRIAT+LMYLS+VEKGGETVFP    K ++ + ++ ++CAK GY VKP+KGDALLFF+LHPNAT D
Subjt:  DNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPD

Query:  TTSFHGSCPVIKGEKWSATKWIHMLSLNEIW-RNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSA
        + S HGSCPV++GEKWSAT+WIH+ S    + +   C+DE+  C+ WAKAGEC+KNPTYMVGS  D GYCRKSCK CS+
Subjt:  TTSFHGSCPVIKGEKWSATKWIHMLSLNEIW-RNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSA

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e-value: 2.5e-92 | % identity: 57.93
Query:  SSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVL
        SS+ ++PS+V Q+  +PRAF+Y+GFL+  EC H+++LAK  L++S V D+ +G S  SE RTS+G F+ + +D IV+GIE +I+ WTFLP +NGE IQVL
Subjt:  SSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVL

Query:  RYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPV---KLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHG
        RYE+GQKY+ H+D+F D VNI  GGHR+ATILMYLS+V KGGETVFPD+ +   ++  + +E+LSDCAK G  VKP+KGDALLFF+LHP+A PD  S HG
Subjt:  RYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPV---KLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHG

Query:  SCPVIKGEKWSATKWIHMLSLNEIWRNKA-CVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC
         CPVI+GEKWSATKWIH+ S + I      C D +E C+ WA  GEC KNP YMVG+ +  GYCR+SCK C
Subjt:  SCPVIKGEKWSATKWIHMLSLNEIWRNKA-CVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC

Q9LN20 Probable prolyl 4-hydroxylase 3 | e-value: 8.4e-64 | % identity: 58.21
Query:  QPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFF
        +PRAF+Y  FLS EEC +LI+LAK  + +S V D  TG S  S  RTS+G FL R +D+I+  IE RIA +TF+P D+GE +QVL YE GQKYEPHYD+F
Subjt:  QPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFF

Query:  QDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQEREN-LSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIH
         D  N  +GG R+AT+LMYLSDVE+GGETVFP + +  S     N LS+C K G  VKP+ GDALLF+S+ P+AT D TS HG CPVI+G KWS+TKW+H
Subjt:  QDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQEREN-LSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHGSCPVIKGEKWSATKWIH

Query:  M
        +
Subjt:  M

Arabidopsis top hits (e-value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e-value: 2.7e-89 | % identity: 57.09
Query:  QSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLP
        QS   + +  SSI I+PS+V Q+  +PRAF+Y+GFL+  EC HLI+LAK  LQ+S V D+  G S  S+ RTS+G F+ + +D IV+GIE +++ WTFLP
Subjt:  QSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLP

Query:  IDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPD----SPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHP
         +NGE +QVLRYE+GQKY+ H+D+F D VNIA GGHRIAT+L+YLS+V KGGETVFPD    S   LSE  +++LSDCAK G  VKPKKG+ALLFF+L  
Subjt:  IDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPD----SPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHP

Query:  NATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKA-CVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC
        +A PD  S HG CPVI+GEKWSATKWIH+ S ++I  +   C D +E C+ WA  GEC KNP YMVG+ +  G CR+SCK C
Subjt:  NATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKA-CVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e-value: 2.8e-107 | % identity: 62.72
Query:  SVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPI
        SVI+MKT  SS   DP+RVTQL   PR FLY+GFLS EEC H I LAK KL++S+V D+ +G SV SE RTS+GMFL + QD+IV+ +E+++AAWTFLP 
Subjt:  SVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPI

Query:  DNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPD
        +NGE +Q+L YENGQKYEPH+D+F D  N+  GGHRIAT+LMYLS+VEKGGETVFP    K ++ + ++ ++CAK GY VKP+KGDALLFF+LHPNAT D
Subjt:  DNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPD

Query:  TTSFHGSCPVIKGEKWSATKWIHMLSLNEIW-RNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSA
        + S HGSCPV++GEKWSAT+WIH+ S    + +   C+DE+  C+ WAKAGEC+KNPTYMVGS  D GYCRKSCK CS+
Subjt:  TTSFHGSCPVIKGEKWSATKWIHMLSLNEIW-RNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSA

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e-value: 2.0e-100 | % identity: 58.89
Query:  SVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTS----TGMFLHRAQ----DEIVAGIESRI
        SVI+MKT  SS   DP+RVTQL   PR FLY+GFLS EEC H I LAK KL++S+V D+ +G SV SE+  S    +  F+        D+IV+ +E+++
Subjt:  SVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTS----TGMFLHRAQ----DEIVAGIESRI

Query:  AAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFS
        AAWTFLP +NGE +Q+L YENGQKYEPH+D+F D  N+  GGHRIAT+LMYLS+VEKGGETVFP    K ++ + ++ ++CAK GY VKP+KGDALLFF+
Subjt:  AAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFS

Query:  LHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIW-RNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSA
        LHPNAT D+ S HGSCPV++GEKWSAT+WIH+ S    + +   C+DE+  C+ WAKAGEC+KNPTYMVGS  D GYCRKSCK CS+
Subjt:  LHPNATPDTTSFHGSCPVIKGEKWSATKWIHMLSLNEIW-RNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSA

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e-value: 1.1e-98 | % identity: 62.55
Query:  SISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDA-TGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVL
        S S+DP+R+TQL   PRAFLYKGFLS EEC HLI LAK KL++S+V  D  +G S  SE RTS+GMFL + QD+IVA +E+++AAWTFLP +NGE +Q+L
Subjt:  SISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDA-TGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVL

Query:  RYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHGSCP
         YENGQKY+PH+D+F D   +  GGHRIAT+LMYLS+V KGGETVFP+   K  + + ++ S CAK GY VKP+KGDALLFF+LH N T D  S HGSCP
Subjt:  RYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHGSCP

Query:  VIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC
        VI+GEKWSAT+WIH+ S  +  +   CVD+ E CQ WA AGECEKNP YMVGS+  LG+CRKSCK C
Subjt:  VIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e-value: 1.8e-93 | % identity: 57.93
Query:  SSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVL
        SS+ ++PS+V Q+  +PRAF+Y+GFL+  EC H+++LAK  L++S V D+ +G S  SE RTS+G F+ + +D IV+GIE +I+ WTFLP +NGE IQVL
Subjt:  SSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIESRIAAWTFLPIDNGEPIQVL

Query:  RYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPV---KLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHG
        RYE+GQKY+ H+D+F D VNI  GGHR+ATILMYLS+V KGGETVFPD+ +   ++  + +E+LSDCAK G  VKP+KGDALLFF+LHP+A PD  S HG
Subjt:  RYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPV---KLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPDTTSFHG

Query:  SCPVIKGEKWSATKWIHMLSLNEIWRNKA-CVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC
         CPVI+GEKWSATKWIH+ S + I      C D +E C+ WA  GEC KNP YMVG+ +  GYCR+SCK C
Subjt:  SCPVIKGEKWSATKWIHMLSLNEIWRNKA-CVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVC


Sequences
CDS sequence
ATGTCTCGCTCCGCCAATCGCTTGCCGAAATTGCTCTTACACAACAAGAACATGGGACAATCTGTTATTAGGATGAAAACGGACGGCTCCTCCATTTCAATCGATCCCAG
TCGCGTTACTCAGCTTTTATTACAACCTAGGGCTTTCTTATATAAGGGATTTTTGTCTGCAGAGGAGTGCCATCATCTTATCAATTTGGCGAAGAATAAGCTACAGCAAT
CGTTGGTGACCGATGATGCAACGGGGGCGAGTGTTGCGAGTGAAGAACGGACGAGTACTGGCATGTTTCTTCATAGGGCTCAGGATGAAATAGTTGCTGGCATAGAGTCA
AGGATTGCTGCGTGGACCTTCCTTCCCATTGATAATGGGGAGCCTATTCAAGTATTAAGGTATGAGAACGGTCAAAAATACGAGCCACATTATGATTTTTTTCAAGACCC
AGTTAATATAGCTAGTGGTGGTCATCGAATCGCCACAATCTTGATGTATTTGTCTGATGTTGAAAAGGGTGGAGAAACAGTCTTTCCTGATTCTCCGGTTAAATTATCCG
AGCAGGAGAGGGAAAACTTGTCCGATTGTGCTAAGAATGGTTACGGAGTAAAACCAAAGAAGGGTGATGCTTTACTGTTCTTCAGTCTCCATCCAAATGCGACGCCAGAC
ACGACCAGCTTTCATGGGAGCTGCCCGGTGATAAAGGGCGAGAAGTGGTCTGCAACAAAATGGATTCACATGCTTTCACTCAATGAAATTTGGAGGAATAAAGCTTGTGT
GGATGAGAGTGAGTACTGTCAGGCATGGGCAAAGGCAGGTGAGTGTGAAAAGAATCCTACTTATATGGTGGGTTCTAAGGATGATCTTGGATATTGTAGGAAGAGTTGCA
AAGTGTGCTCTGCCCCCTCTTAA
mRNA sequence
ATGTCTCGCTCCGCCAATCGCTTGCCGAAATTGCTCTTACACAACAAGAACATGGGACAATCTGTTATTAGGATGAAAACGGACGGCTCCTCCATTTCAATCGATCCCAG
TCGCGTTACTCAGCTTTTATTACAACCTAGGGCTTTCTTATATAAGGGATTTTTGTCTGCAGAGGAGTGCCATCATCTTATCAATTTGGCGAAGAATAAGCTACAGCAAT
CGTTGGTGACCGATGATGCAACGGGGGCGAGTGTTGCGAGTGAAGAACGGACGAGTACTGGCATGTTTCTTCATAGGGCTCAGGATGAAATAGTTGCTGGCATAGAGTCA
AGGATTGCTGCGTGGACCTTCCTTCCCATTGATAATGGGGAGCCTATTCAAGTATTAAGGTATGAGAACGGTCAAAAATACGAGCCACATTATGATTTTTTTCAAGACCC
AGTTAATATAGCTAGTGGTGGTCATCGAATCGCCACAATCTTGATGTATTTGTCTGATGTTGAAAAGGGTGGAGAAACAGTCTTTCCTGATTCTCCGGTTAAATTATCCG
AGCAGGAGAGGGAAAACTTGTCCGATTGTGCTAAGAATGGTTACGGAGTAAAACCAAAGAAGGGTGATGCTTTACTGTTCTTCAGTCTCCATCCAAATGCGACGCCAGAC
ACGACCAGCTTTCATGGGAGCTGCCCGGTGATAAAGGGCGAGAAGTGGTCTGCAACAAAATGGATTCACATGCTTTCACTCAATGAAATTTGGAGGAATAAAGCTTGTGT
GGATGAGAGTGAGTACTGTCAGGCATGGGCAAAGGCAGGTGAGTGTGAAAAGAATCCTACTTATATGGTGGGTTCTAAGGATGATCTTGGATATTGTAGGAAGAGTTGCA
AAGTGTGCTCTGCCCCCTCTTAA
Protein sequence
MSRSANRLPKLLLHNKNMGQSVIRMKTDGSSISIDPSRVTQLLLQPRAFLYKGFLSAEECHHLINLAKNKLQQSLVTDDATGASVASEERTSTGMFLHRAQDEIVAGIES
RIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDFFQDPVNIASGGHRIATILMYLSDVEKGGETVFPDSPVKLSEQERENLSDCAKNGYGVKPKKGDALLFFSLHPNATPD
TTSFHGSCPVIKGEKWSATKWIHMLSLNEIWRNKACVDESEYCQAWAKAGECEKNPTYMVGSKDDLGYCRKSCKVCSAPS
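
The protein sequence should correspond to the CDS sequence translated under the standard genetic code. The sketch below illustrates that relationship on two short excerpts taken from the record: the first 30 nt and the last 24 nt of the CDS. The function `translate` and the variable names are hypothetical helpers written for this illustration, not part of CuGenDBv2, and the use of the standard nuclear code is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the CDS above (both are in frame).
cds_start = "ATGTCTCGCTCCGCCAATCGCTTGCCGAAA"  # first 30 nt
cds_end = "AAAGTGTGCTCTGCCCCCTCTTAA"          # last 24 nt, including the TAA stop

print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to `MSRSANRLPK` and `KVCSAPS`, which match the beginning and end of the protein sequence listed above. Passing the full 903 nt CDS to the same function would be the natural way to check the complete 300-residue protein.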