CuGenDBv2

Tan0021437 (gene) of Snake gourd v1 genome

Gene ID: Tan0021437
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: LG06:78640148..78643314
RNA-Seq Expression: Tan0021437
Synteny: Tan0021437
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6575033.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia] | e-value 1.1e-161 | identity 92.67% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        MS+FR+L F+FLI I+SVVRES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLECDHL+SIARSELKRSEVADN+SG SKLSTVRTSSGMFISKS
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KDPIV+GIEDKI+AWTFLPKENGEDIQVLRYEHGQ+YESHYDYFVDKVNIA GGHRLATVLMYLS+VTKGGETVFPLAEKSP RRASETDEDL+ECAR+G
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+CTDLNE+CERWAALGECTKNPEYMVGSPELPGYC RSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

XP_022930331.1 probable prolyl 4-hydroxylase 4 isoform X4 [Cucurbita moschata] | e-value 1.0e-162 | identity 94% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        MSRFRS+ FIFLISIASVVRES CS A SAS+TVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFI KS
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KD IV+GIEDKI+AWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP+AEKSPHRRASETDEDLS+CARKG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDT SLHGGCPVLEGEKWSATKWIHVDSFSKNL N+GNCTDLNE+CERWAALGECTKNPEYMVGSPELPGYC RSCR C
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

XP_023000081.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima] | e-value 4.5e-163 | identity 94.33% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        MS+FRSL FIFLISIASVVRES CS A SAS+TVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFI KS
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KD IV+GIEDKI+AWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP+AEKSPHRRASETDEDLSECARKG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDT SLHGGCPVLEGEKWSATKWIHVDSFSKNL N+GNCTDLNE+CERWAALGECTKNPEYMVGSPELPGYC RSCR C
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

XP_023514355.1 probable prolyl 4-hydroxylase 4 isoform X1 [Cucurbita pepo subsp. pepo] | e-value 5.8e-163 | identity 94.33% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        MSRFRSL FIFLISIASVVRES CS A SAS+TVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFI KS
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KD IV+GIEDKI+AWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP+AEKSPHRRASETDEDLS+CARKG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDT SLHGGCPVLEGEKWSATKWIHVDSFSKNL N+GNCTDLNE+CERWAALGECTKNPEYMVGSPELPGYC RSCR C
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

XP_038874583.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida] | e-value 1.5e-163 | identity 93.67% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        M +FR+L FIFLI I+ VVRES CSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHL+SIARSELKRSEVADN+SGKSKLSTVRTSSGMFISKS
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KDPIV+GIEDKISAWTFLPKENGEDIQVLRYEHG++YESHYDYFVDKVNIAWGGHRLATVLMYLS+VTKGGETVFPLAEKS HRRA ETDEDLSECARKG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNE+CERWAALGECTKNPEYMVGSPE+PGYC RSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3C816 Procollagen-proline 4-dioxygenase | e-value 7.0e-162 | identity 92% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        M +FR+L F FLI I+S VRES CSYAGSAS+TVDPS+VKQISWKPRAFVYEGFLTDLECDHL+SIARSELKRSEVADN+SGKSKLSTVRTSSGMFISK+
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KDPIV+GIEDKISAWTFLPKENGEDIQVLRYEHGQ+YESHYDYFVDKVNIAWGGHRLATVLMYLS+VTKGGETVFPLAEKS HRRA ETDEDLSECA+KG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTDLNE+CERWAALGECTKNPEYMVGSPE+PGYC RSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

A0A5A7SVW6 Procollagen-proline 4-dioxygenase | e-value 7.0e-162 | identity 92% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        M +FR+L F FLI I+S VRES CSYAGSAS+TVDPS+VKQISWKPRAFVYEGFLTDLECDHL+SIARSELKRSEVADN+SGKSKLSTVRTSSGMFISK+
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KDPIV+GIEDKISAWTFLPKENGEDIQVLRYEHGQ+YESHYDYFVDKVNIAWGGHRLATVLMYLS+VTKGGETVFPLAEKS HRRA ETDEDLSECA+KG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTDLNE+CERWAALGECTKNPEYMVGSPE+PGYC RSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

A0A6J1EQM4 Procollagen-proline 4-dioxygenase | e-value 4.8e-163 | identity 94% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        MSRFRS+ FIFLISIASVVRES CS A SAS+TVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFI KS
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KD IV+GIEDKI+AWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP+AEKSPHRRASETDEDLS+CARKG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDT SLHGGCPVLEGEKWSATKWIHVDSFSKNL N+GNCTDLNE+CERWAALGECTKNPEYMVGSPELPGYC RSCR C
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

A0A6J1H545 Procollagen-proline 4-dioxygenase | e-value 7.0e-162 | identity 92.67% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        MS+FR+L F+FLI I+SVVRES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLECDHL+SIARSELKRSEVADN+SG SKLSTVRTSSGMFISKS
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KDPIV+GIEDKI+AWTFLPKENGEDIQVLRYEHGQ+YESHYDYFVDKVNIA GGHRLATVLMYLS+VTKGGETVFPLAEKSP RRASETDEDLSECAR+G
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+C+DLNE+CERWAALGECTKNPEYMVGSPELPGYC RSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

A0A6J1KCK1 Procollagen-proline 4-dioxygenase | e-value 2.2e-163 | identity 94.33% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        MS+FRSL FIFLISIASVVRES CS A SAS+TVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFI KS
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KD IV+GIEDKI+AWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP+AEKSPHRRASETDEDLSECARKG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDT SLHGGCPVLEGEKWSATKWIHVDSFSKNL N+GNCTDLNE+CERWAALGECTKNPEYMVGSPELPGYC RSCR C
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

SwissProt top hits (e-value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e-value 5.2e-90 | identity 57.55% | alignment:
Query:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIEDKISAWTFLPKENG
        S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHLI +A+ +L++S  VAD +SG+S+ S VRTSSGMF++K +D IV  +E K++AWTFLP+ENG
Subjt:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIEDKISAWTFLPKENG

Query:  EDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-LAEKSPHRRASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIP
        E +Q+L YE+GQ+Y+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP    K+P  +    D+  S+CA++G AVKP+KGDALLFF+L  N   
Subjt:  EDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-LAEKSPHRRASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIP

Query:  DTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        D NSLHG CPV+EGEKWSAT+WIHV SF K       C D +E+C+ WA  GEC KNP YMVGS    G+C +SC+ C
Subjt:  DTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

F4JAU3 Prolyl 4-hydroxylase 2 | e-value 1.2e-123 | identity 70.33% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        MSR   L F   ++I  V+ +S+     S SS ++PSKVKQ+S KPRAFVYEGFLTDLECDHLIS+A+  L+RS VADN++G+S++S VRTSSG FISK 
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KDPIV+GIEDK+S WTFLPKENGED+QVLRYEHGQ+Y++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++   R  SE  +DLS+CA+KG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + GNCTD+NE+CERWA LGEC KNPEYMVG+PE+PG C RSC+ C
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

F4JNU8 Probable prolyl 4-hydroxylase 8 | e-value 5.8e-65 | identity 56.13% | alignment:
Query:  ISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHY
        ISW+PRAFVY  FLT+ EC+HLIS+A+  + +S+V D ++GKS  S VRTSSG F+++  D IV  IE++IS +TF+P ENGE +QVL YE GQRYE H+
Subjt:  ISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHY

Query:  DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETD--EDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKW
        DYF D+ N+  GG R+ATVLMYLSDV +GGETVFP A+ +     S+    ++LS+C ++G++V PKK DALLF+S++P+A  D +SLHGGCPV++G KW
Subjt:  DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETD--EDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKW

Query:  SATKWIHVDSFS
        S+TKW HV  ++
Subjt:  SATKWIHVDSFS

Q8L970 Probable prolyl 4-hydroxylase 7 | e-value 8.4e-96 | identity 54.55% | alignment:
Query:  SLFFIFLISIAS------VVRESNC-------SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTS
        SL F+F + + S      + R SN            ++S   DP++V Q+SW PR F+YEGFL+D ECDH I +A+ +L++S VADN+SG+S  S VRTS
Subjt:  SLFFIFLISIAS------VVRESNC-------SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTS

Query:  SGMFISKSKDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDED
        SGMF+SK +D IV+ +E K++AWTFLP+ENGE +Q+L YE+GQ+YE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFP+ +    +     D+ 
Subjt:  SGMFISKSKDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDED

Query:  LSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGY
         +ECA++G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N +CE+WA  GEC KNP YMVGS +  GY
Subjt:  LSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGY

Query:  CTRSCRIC
        C +SC+ C
Subjt:  CTRSCRIC

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e-value 1.7e-125 | identity 72.85% | alignment:
Query:  IFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIE
        I   +I SV+ +S+ S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH++S+A++ LKRS VADN+SG+SK S VRTSSG FISK KDPIV+GIE
Subjt:  IFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIE

Query:  DKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKGIAVKPKKGD
        DKIS WTFLPKENGEDIQVLRYEHGQ+Y++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE    R  SE  EDLS+CA++GIAVKP+KGD
Subjt:  DKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKGIAVKPKKGD

Query:  ALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        ALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD+NE+CERWA LGECTKNPEYMVG+ ELPGYC RSC+ C
Subjt:  ALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

Arabidopsis top hits (e-value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e-value 8.8e-125 | identity 70.33% | alignment:
Query:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS
        MSR   L F   ++I  V+ +S+     S SS ++PSKVKQ+S KPRAFVYEGFLTDLECDHLIS+A+  L+RS VADN++G+S++S VRTSSG FISK 
Subjt:  MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKS

Query:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG
        KDPIV+GIEDK+S WTFLPKENGED+QVLRYEHGQ+Y++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++   R  SE  +DLS+CA+KG
Subjt:  KDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        IAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + GNCTD+NE+CERWA LGEC KNPEYMVG+PE+PG C RSC+ C
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e-value 5.9e-97 | identity 54.55% | alignment:
Query:  SLFFIFLISIAS------VVRESNC-------SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTS
        SL F+F + + S      + R SN            ++S   DP++V Q+SW PR F+YEGFL+D ECDH I +A+ +L++S VADN+SG+S  S VRTS
Subjt:  SLFFIFLISIAS------VVRESNC-------SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTS

Query:  SGMFISKSKDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDED
        SGMF+SK +D IV+ +E K++AWTFLP+ENGE +Q+L YE+GQ+YE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFP+ +    +     D+ 
Subjt:  SGMFISKSKDPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDED

Query:  LSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGY
         +ECA++G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N +CE+WA  GEC KNP YMVGS +  GY
Subjt:  LSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGY

Query:  CTRSCRIC
        C +SC+ C
Subjt:  CTRSCRIC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e-value 3.2e-90 | identity 51.27% | alignment:
Query:  SLFFIFLISIAS------VVRESNC-------SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKS-----KLS
        SL F+F + + S      + R SN            ++S   DP++V Q+SW PR F+YEGFL+D ECDH I +A+ +L++S VADN+SG+S      +S
Subjt:  SLFFIFLISIAS------VVRESNC-------SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKS-----KLS

Query:  TVRTSSGMFISKSK---DPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHR
         VR SS    +      D IV+ +E K++AWTFLP+ENGE +Q+L YE+GQ+YE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFP+ +    +
Subjt:  TVRTSSGMFISKSK---DPIVTGIEDKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHR

Query:  RASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMV
             D+  +ECA++G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N +CE+WA  GEC KNP YMV
Subjt:  RASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMV

Query:  GSPELPGYCTRSCRIC
        GS +  GYC +SC+ C
Subjt:  GSPELPGYCTRSCRIC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e-value 3.7e-91 | identity 57.55% | alignment:
Query:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIEDKISAWTFLPKENG
        S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHLI +A+ +L++S  VAD +SG+S+ S VRTSSGMF++K +D IV  +E K++AWTFLP+ENG
Subjt:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIEDKISAWTFLPKENG

Query:  EDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-LAEKSPHRRASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIP
        E +Q+L YE+GQ+Y+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP    K+P  +    D+  S+CA++G AVKP+KGDALLFF+L  N   
Subjt:  EDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-LAEKSPHRRASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIP

Query:  DTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        D NSLHG CPV+EGEKWSAT+WIHV SF K       C D +E+C+ WA  GEC KNP YMVGS    G+C +SC+ C
Subjt:  DTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e-value 1.2e-126 | identity 72.85% | alignment:
Query:  IFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIE
        I   +I SV+ +S+ S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH++S+A++ LKRS VADN+SG+SK S VRTSSG FISK KDPIV+GIE
Subjt:  IFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIE

Query:  DKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKGIAVKPKKGD
        DKIS WTFLPKENGEDIQVLRYEHGQ+Y++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE    R  SE  EDLS+CA++GIAVKP+KGD
Subjt:  DKISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKGIAVKPKKGD

Query:  ALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
        ALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD+NE+CERWA LGECTKNPEYMVG+ ELPGYC RSC+ C
Subjt:  ALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
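
The % identity values listed for the hits above can be related to the alignment midlines (the unlabeled line between Query and Subjt). The following is a minimal sketch, assuming the usual BLAST midline convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `midline_identity` and its parameters are hypothetical, not part of CuGenDB, and the aligned length is passed explicitly because trailing spaces in a copied midline may have been stripped.

```python
def midline_identity(midline_blocks, aligned_length):
    """Percent identity estimated from BLAST-style midline strings.

    midline_blocks: midline strings, one per alignment block.
    aligned_length: total number of aligned columns (assumed known,
                    e.g. the summed length of the Query blocks).
    """
    # Only letters count as identical positions; '+' and ' ' do not.
    matches = sum(ch.isalpha() for block in midline_blocks for ch in block)
    return round(100.0 * matches / aligned_length, 2)


# Toy example using the first five columns of the first GenBank hit:
# Query "MSRFR" against midline "MS+FR" has 4 identical positions out of 5.
print(midline_identity(["MS+FR"], 5))  # 80.0
```

Applied to all midline blocks of one hit, this should approximate the reported % identity, provided the midlines were copied without loss.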


Sequences
CDS sequence
ATGTCCAGATTTCGCTCTCTGTTCTTTATCTTCTTGATTTCAATTGCATCGGTTGTTCGAGAATCCAACTGTTCGTATGCTGGTTCGGCTAGCTCCACCGTAGATCCTAG
TAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTAGAATGCGACCACCTGATTTCGATAGCGAGATCCGAGCTAAAGAGAT
CTGAGGTTGCTGATAATGAGTCAGGAAAAAGCAAGCTCAGTACTGTTCGGACGAGCTCAGGAATGTTCATTTCTAAGAGCAAGGATCCTATTGTTACTGGAATAGAGGAC
AAAATTTCTGCGTGGACTTTTCTTCCGAAAGAAAATGGGGAAGACATTCAGGTATTGAGATATGAGCATGGGCAGAGATATGAATCACATTATGATTACTTTGTTGACAA
GGTGAATATTGCCTGGGGAGGACATCGTTTGGCTACTGTTCTCATGTATCTCTCCGACGTGACCAAGGGCGGTGAAACAGTTTTCCCCTTGGCAGAGAAATCTCCCCACC
GTAGGGCTTCTGAAACAGACGAGGATCTCTCAGAGTGTGCAAGGAAAGGAATTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCT
ATCCCTGATACCAACAGTTTGCATGGAGGTTGCCCTGTCTTAGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGTAAAAACTTAGGAAACAT
TGGGAACTGTACTGATCTAAATGAAAATTGTGAGAGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTCGGATCTCCAGAACTTCCTGGATACTGTA
CGAGGAGTTGCAGGATCTGTTGA
mRNA sequence
ATGTCCAGATTTCGCTCTCTGTTCTTTATCTTCTTGATTTCAATTGCATCGGTTGTTCGAGAATCCAACTGTTCGTATGCTGGTTCGGCTAGCTCCACCGTAGATCCTAG
TAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTAGAATGCGACCACCTGATTTCGATAGCGAGATCCGAGCTAAAGAGAT
CTGAGGTTGCTGATAATGAGTCAGGAAAAAGCAAGCTCAGTACTGTTCGGACGAGCTCAGGAATGTTCATTTCTAAGAGCAAGGATCCTATTGTTACTGGAATAGAGGAC
AAAATTTCTGCGTGGACTTTTCTTCCGAAAGAAAATGGGGAAGACATTCAGGTATTGAGATATGAGCATGGGCAGAGATATGAATCACATTATGATTACTTTGTTGACAA
GGTGAATATTGCCTGGGGAGGACATCGTTTGGCTACTGTTCTCATGTATCTCTCCGACGTGACCAAGGGCGGTGAAACAGTTTTCCCCTTGGCAGAGAAATCTCCCCACC
GTAGGGCTTCTGAAACAGACGAGGATCTCTCAGAGTGTGCAAGGAAAGGAATTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCT
ATCCCTGATACCAACAGTTTGCATGGAGGTTGCCCTGTCTTAGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGTAAAAACTTAGGAAACAT
TGGGAACTGTACTGATCTAAATGAAAATTGTGAGAGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTCGGATCTCCAGAACTTCCTGGATACTGTA
CGAGGAGTTGCAGGATCTGTTGA
Protein sequence
MSRFRSLFFIFLISIASVVRESNCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVTGIED
KISAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRASETDEDLSECARKGIAVKPKKGDALLFFSLEPNA
IPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNENCERWAALGECTKNPEYMVGSPELPGYCTRSCRIC
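
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code applies to this gene; the names `CODON_TABLE` and `translate` are illustrative only. The short inputs are the first 24 and last 27 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (third base varies fastest), with '*' for stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCCAGATTTCGCTCTCTGTTC"))     # MSRFRSLF
print(translate("TGTACGAGGAGTTGCAGGATCTGTTGA"))  # CTRSCRIC
```

Both fragments agree with the start (MSRFRSLF) and end (CTRSCRIC) of the protein sequence listed above; passing the full CDS to `translate` would allow the same comparison over the whole sequence.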