; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0014121 (gene) of Chayote v1 genome

Gene IDSed0014121
OrganismSechium edule (Chayote v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationLG11:34014487..34019575
RNA-Seq ExpressionSed0014121
SyntenySed0014121
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]1.9e-13584.48Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPES----HLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESG
        MDSR FL FSLCFLSV TAFARLPE+    H +   TGSVLR   D SPL+FDPTRVTQLSWQPRAFLYK  LSD ECDHLI LAKDKLEKSMVADNESG
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPES----HLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAE
        KSVSSEVRTSSGMFL+KAQD+IVAG+EARI+AWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE  E
Subjt:  KSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAE

Query:  SQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        SQEKDDSWSDC+RKGYAVKAQKGDALLFFSLH DATTD RSLHGSCPVIEGEKWSATKWIHVRSFEK  R  + + CVDE+E+CP+WA+R
Subjt:  SQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus]2.1e-13483.22Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS
        MDSR FL FSLCFLSV TAFARLPE+  H   +GSVLR   D SPL+FDPTRVTQLSWQPRAFLYK  LSD ECDHLI LAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK
        SEVRTSSGMFL+KAQDE+VAG+EARI+AWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE  ESQ K
Subjt:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        D+SWSDC+RKGYAVKAQKGDALLFFSL+ DATTD RSLHGSCPVI GEKWSATKWIHVRSFEK T   + +GCVDE+E+C +WA++
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

XP_022134044.1 probable prolyl 4-hydroxylase 7 [Momordica charantia]3.3e-13583.57Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS
        MDS RFL+FSLCFL V TA ARLP+   H  ++GSVLR  G+ SPL+FDPTRVTQLSWQPRAFLYK  LSD ECDHLI LAKDKLEKSMVADN SGKSVS
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK
        SEVRTSSGMFL KAQDEIVA +EARI+AWTFLP ENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSE  ESQEK
Subjt:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        DDSWSDCARKGYAVKA+KGDALLFFSLH DATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK TR      CVDE+E+C SWA+R
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

XP_022993651.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima]1.6e-13486.71Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS
        MDSRRFL FSL FLSV T FARLPE+  H  L+GSVL    D   L+FDPTRVTQLSWQPRAFLYK  L+D ECDHLI LAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK
        SEVRTSSGMFL+KAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS   ESQEK
Subjt:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        DDSWSDCARKGYAVKAQKGDALLFFSLH DATTD RSLHGSCPVIEGEKWSATKWIHVRSF+KATRT +S+ CVDE+++CPSWA+R
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]3.6e-13484.62Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS
        MDSRRFL F LCFLSV T FARLPE       +GSV+R   D SPLVFDPTRVTQLSW+PRAFLYK  LSD ECDHLI LAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK
        SEVRTSSGMFL+KAQDEIVA IEARISAWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE  ESQEK
Subjt:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        D+SWSDCARKGYAVKA+KGDALLFFSL PDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEKATR  + + CVDE+E+C  WA+R
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

TrEMBL top hitse value%identityAlignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase1.0e-13483.22Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS
        MDSR FL FSLCFLSV TAFARLPE+  H   +GSVLR   D SPL+FDPTRVTQLSWQPRAFLYK  LSD ECDHLI LAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK
        SEVRTSSGMFL+KAQDE+VAG+EARI+AWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE  ESQ K
Subjt:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        D+SWSDC+RKGYAVKAQKGDALLFFSL+ DATTD RSLHGSCPVI GEKWSATKWIHVRSFEK T   + +GCVDE+E+C +WA++
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

A0A1S3C8G4 Procollagen-proline 4-dioxygenase9.2e-13684.48Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPES----HLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESG
        MDSR FL FSLCFLSV TAFARLPE+    H +   TGSVLR   D SPL+FDPTRVTQLSWQPRAFLYK  LSD ECDHLI LAKDKLEKSMVADNESG
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPES----HLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAE
        KSVSSEVRTSSGMFL+KAQD+IVAG+EARI+AWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE  E
Subjt:  KSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAE

Query:  SQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        SQEKDDSWSDC+RKGYAVKAQKGDALLFFSLH DATTD RSLHGSCPVIEGEKWSATKWIHVRSFEK  R  + + CVDE+E+CP+WA+R
Subjt:  SQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

A0A6J1BXN9 Procollagen-proline 4-dioxygenase1.6e-13583.57Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS
        MDS RFL+FSLCFL V TA ARLP+   H  ++GSVLR  G+ SPL+FDPTRVTQLSWQPRAFLYK  LSD ECDHLI LAKDKLEKSMVADN SGKSVS
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK
        SEVRTSSGMFL KAQDEIVA +EARI+AWTFLP ENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSE  ESQEK
Subjt:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        DDSWSDCARKGYAVKA+KGDALLFFSLH DATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK TR      CVDE+E+C SWA+R
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

A0A6J1FJ93 Procollagen-proline 4-dioxygenase2.3e-13486.36Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS
        MDSRRFL FSL FLSV T FARLPE+  H  L+GSVL    D   L+FDPTRVTQLSWQPRAFLYK  L+D ECDHLI LAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK
        SEVRTSSGMFL+KAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS   ESQEK
Subjt:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        DDSWSDCARKGYAVKAQKGDALLFFSLH DATTD RSLHGSCPVIEGEKWSATKWIHVRSF+KATR  +S+ CVDE+++CPSWA+R
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

A0A6J1JWX0 Procollagen-proline 4-dioxygenase7.8e-13586.71Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS
        MDSRRFL FSL FLSV T FARLPE+  H  L+GSVL    D   L+FDPTRVTQLSWQPRAFLYK  L+D ECDHLI LAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK
        SEVRTSSGMFL+KAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS   ESQEK
Subjt:  SEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER
        DDSWSDCARKGYAVKAQKGDALLFFSLH DATTD RSLHGSCPVIEGEKWSATKWIHVRSF+KATRT +S+ CVDE+++CPSWA+R
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAER

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 62.4e-9663.64Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSM-VADNESGKSV
        MDS+ FL FSL  L + +  +    S                      DPTR+TQLSW PRAFLYK  LSD ECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSM-VADNESGKSV

Query:  SSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQE
         SEVRTSSGMFL K QD+IVA +EA+++AWTFLP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN +    Q 
Subjt:  SSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQE

Query:  KDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE
        KDDSWS CA++GYAVK +KGDALLFF+LH + TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K         CVD+ ESC  WA+
Subjt:  KDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE

F4JAU3 Prolyl 4-hydroxylase 21.4e-8061.57Show/hide
Query:  VFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYE
        + +P++V Q+S +PRAF+Y+  L+D+ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+GIE ++S WTFLP ENGE +Q+L YE
Subjt:  VFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYE

Query:  NGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN----SELAESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSC
        +GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP+    S  + S+ KDD  SDCA+KG AVK +KG+ALLFF+L  DA  D  SLHG C
Subjt:  NGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN----SELAESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSC

Query:  PVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWA
        PVIEGEKWSATKWIHV SF+K   TH    C D +ESC  WA
Subjt:  PVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWA

Q8L970 Probable prolyl 4-hydroxylase 75.8e-11167.71Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHL---HNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGK
        MDSR FL FSLCFL  L   +  P   L    N   GSV++     S   FDPTRVTQLSW PR FLY+  LSD ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHL---HNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGK

Query:  SVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAES
        SV SEVRTSSGMFL K QD+IV+ +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  +   +
Subjt:  SVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAES

Query:  QEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE
        Q KDDSW++CA++GYAVK +KGDALLFF+LHP+ATTD+ SLHGSCPV+EGEKWSAT+WIHV+SFE+A   +   GC+DE+ SC  WA+
Subjt:  QEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE

Q8LAN3 Probable prolyl 4-hydroxylase 43.3e-8260.25Show/hide
Query:  SPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQIL
        S +  +P++V Q+S +PRAF+Y+  L+++ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+GIE +IS WTFLP ENGE IQ+L
Subjt:  SPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQIL

Query:  HYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQ---EKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHG
         YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E+   +   E  +  SDCA++G AVK +KGDALLFF+LHPDA  D  SLHG
Subjt:  HYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQ---EKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHG

Query:  SCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWA
         CPVIEGEKWSATKWIHV SF++      S  C D +ESC  WA
Subjt:  SCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWA

Q9LN20 Probable prolyl 4-hydroxylase 31.2e-6355.29Show/hide
Query:  LSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHF
        LSW+PRAF+Y   LS  EC++LI LAK  + KS V D+E+GKS  S VRTSSG FL++ +D+I+  IE RI+ +TF+P ++GE +Q+LHYE GQKYEPH+
Subjt:  LSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHF

Query:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEL-AESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATK
        D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP + +   S    +  S+C +KG +VK + GDALLF+S+ PDAT D  SLHG CPVI G KWS+TK
Subjt:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEL-AESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATK

Query:  WIHVRSFE
        W+HV  ++
Subjt:  WIHVRSFE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 29.9e-8261.57Show/hide
Query:  VFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYE
        + +P++V Q+S +PRAF+Y+  L+D+ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+GIE ++S WTFLP ENGE +Q+L YE
Subjt:  VFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYE

Query:  NGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN----SELAESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSC
        +GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP+    S  + S+ KDD  SDCA+KG AVK +KG+ALLFF+L  DA  D  SLHG C
Subjt:  NGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN----SELAESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSC

Query:  PVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWA
        PVIEGEKWSATKWIHV SF+K   TH    C D +ESC  WA
Subjt:  PVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWA

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase4.1e-11267.71Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHL---HNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGK
        MDSR FL FSLCFL  L   +  P   L    N   GSV++     S   FDPTRVTQLSW PR FLY+  LSD ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHL---HNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGK

Query:  SVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAES
        SV SEVRTSSGMFL K QD+IV+ +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  +   +
Subjt:  SVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAES

Query:  QEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE
        Q KDDSW++CA++GYAVK +KGDALLFF+LHP+ATTD+ SLHGSCPV+EGEKWSAT+WIHV+SFE+A   +   GC+DE+ SC  WA+
Subjt:  QEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase5.4e-10463.51Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHL---HNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGK
        MDSR FL FSLCFL  L   +  P   L    N   GSV++     S   FDPTRVTQLSW PR FLY+  LSD ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHL---HNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGK

Query:  SVSSE-----VRTSSGMFLQKAQ---DEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF
        SV SE     VR SS           D+IV+ +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+F
Subjt:  SVSSE-----VRTSSGMFLQKAQ---DEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF

Query:  PNSELAESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE
        P  +   +Q KDDSW++CA++GYAVK +KGDALLFF+LHP+ATTD+ SLHGSCPV+EGEKWSAT+WIHV+SFE+A   +   GC+DE+ SC  WA+
Subjt:  PNSELAESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.7e-9763.64Show/hide
Query:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSM-VADNESGKSV
        MDS+ FL FSL  L + +  +    S                      DPTR+TQLSW PRAFLYK  LSD ECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSM-VADNESGKSV

Query:  SSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQE
         SEVRTSSGMFL K QD+IVA +EA+++AWTFLP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN +    Q 
Subjt:  SSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQE

Query:  KDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE
        KDDSWS CA++GYAVK +KGDALLFF+LH + TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K         CVD+ ESC  WA+
Subjt:  KDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAE

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.4e-8360.25Show/hide
Query:  SPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQIL
        S +  +P++V Q+S +PRAF+Y+  L+++ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+GIE +IS WTFLP ENGE IQ+L
Subjt:  SPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQKAQDEIVAGIEARISAWTFLPVENGESIQIL

Query:  HYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQ---EKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHG
         YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E+   +   E  +  SDCA++G AVK +KGDALLFF+LHPDA  D  SLHG
Subjt:  HYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQ---EKDDSWSDCARKGYAVKAQKGDALLFFSLHPDATTDTRSLHG

Query:  SCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWA
         CPVIEGEKWSATKWIHV SF++      S  C D +ESC  WA
Subjt:  SCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGTCGGTTTCTCACATTTTCCCTTTGCTTTCTCTCTGTGTTGACCGCCTTCGCTCGCCTGCCTGAATCGCATTTGCACAACGCATTAACTGGATCTGTGCT
TCGGTTCAATGGAGATATATCTCCGCTCGTCTTCGATCCAACTCGAGTCACACAGCTCTCATGGCAACCCAGGGCGTTTTTGTATAAACGACTTTTATCTGATGTGGAAT
GTGATCATCTAATCGTTCTGGCTAAGGATAAGTTAGAGAAGTCGATGGTGGCCGATAATGAGTCGGGTAAGAGTGTAAGTAGTGAAGTTCGGACGAGTTCTGGCATGTTC
CTTCAGAAGGCTCAGGATGAAATTGTTGCTGGCATTGAGGCCAGAATTTCTGCATGGACATTCCTTCCAGTAGAAAATGGAGAGTCTATTCAAATTCTGCACTATGAGAA
TGGTCAAAAGTACGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGTCACCGAATAGCCACAGTGTTGATGTATTTGTCCAATGTCGAAAAGG
GTGGAGAAACCATCTTTCCAAATTCCGAGTTAGCAGAATCTCAAGAGAAGGATGACAGTTGGTCTGATTGTGCTCGAAAGGGTTATGCAGTTAAAGCACAGAAGGGTGAT
GCATTACTTTTCTTCAGCCTCCATCCCGATGCAACGACAGATACGAGAAGCTTGCATGGAAGTTGTCCTGTGATCGAGGGCGAGAAATGGTCTGCAACGAAGTGGATTCA
TGTGAGATCCTTCGAGAAGGCGACTCGCACACACGCAAGTAAGGGCTGCGTGGACGAGGATGAAAGCTGCCCTTCATGGGCGGAAAGGGCATCCTCCACAGTTGCATTTG
AAATGGTCCAAACACGAAAACGATTGCAAAAATGGTACCATCCAATGGCTGAAGAATGGCATCATAGCAGCGAGGGCGAACTTCCAAGATAG
mRNA sequenceShow/hide mRNA sequence
GTTGTTGAGTATTGACAGTTGAAATCTTCGCAATAAATTCAATTTCCCTTCTCGTACACGAAAAAGGACCCCTCAATTACTTCAATTTTCATTTCTCTTTCTCCGATTTG
ACATCGGAGAAACGAGATTCAGCTCATGGATTCCCGTCGGTTTCTCACATTTTCCCTTTGCTTTCTCTCTGTGTTGACCGCCTTCGCTCGCCTGCCTGAATCGCATTTGC
ACAACGCATTAACTGGATCTGTGCTTCGGTTCAATGGAGATATATCTCCGCTCGTCTTCGATCCAACTCGAGTCACACAGCTCTCATGGCAACCCAGGGCGTTTTTGTAT
AAACGACTTTTATCTGATGTGGAATGTGATCATCTAATCGTTCTGGCTAAGGATAAGTTAGAGAAGTCGATGGTGGCCGATAATGAGTCGGGTAAGAGTGTAAGTAGTGA
AGTTCGGACGAGTTCTGGCATGTTCCTTCAGAAGGCTCAGGATGAAATTGTTGCTGGCATTGAGGCCAGAATTTCTGCATGGACATTCCTTCCAGTAGAAAATGGAGAGT
CTATTCAAATTCTGCACTATGAGAATGGTCAAAAGTACGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGTCACCGAATAGCCACAGTGTTG
ATGTATTTGTCCAATGTCGAAAAGGGTGGAGAAACCATCTTTCCAAATTCCGAGTTAGCAGAATCTCAAGAGAAGGATGACAGTTGGTCTGATTGTGCTCGAAAGGGTTA
TGCAGTTAAAGCACAGAAGGGTGATGCATTACTTTTCTTCAGCCTCCATCCCGATGCAACGACAGATACGAGAAGCTTGCATGGAAGTTGTCCTGTGATCGAGGGCGAGA
AATGGTCTGCAACGAAGTGGATTCATGTGAGATCCTTCGAGAAGGCGACTCGCACACACGCAAGTAAGGGCTGCGTGGACGAGGATGAAAGCTGCCCTTCATGGGCGGAA
AGGGCATCCTCCACAGTTGCATTTGAAATGGTCCAAACACGAAAACGATTGCAAAAATGGTACCATCCAATGGCTGAAGAATGGCATCATAGCAGCGAGGGCGAACTTCC
AAGATAGTAATCACTTCAAAGTTTTCAGTTTCAACATCGAGAGTATATACAAGGAACTCATTATCTTTTTCTTTTCCAGTCCAATAGACAACTCCATCCATATAAACACC
ATGAGACTGAAGAGTTATAGGAGATGGCAGGGTAGCAGAGGTCTATTGATTATCGTCCCTACCAAACCTCAAAACCTTCAACTCGGAATCGGAGAGTCTGGAAGAACTCA
TGGGTCATTGGATTGAAAATTCCCTCGCAGTAACCGTCAATTCCTTCAGTATCAAGCGATTTGTCAATAAACAAGAGGCCATTGCAGTGATTGTAAAAGCTCATAAGGTT
GATTCCAGGTTCGGGAAAGGTACATGAGGCAACAAAGCTTAAGGTATCAATGTCGAAACAGTGAATCTTGGGAGTGGGAAAATCAGGAAAAGGGCCATGAGTAGCAAACA
GGATAGCTTTTGGAAGCTGGATTTTAGGACAGATCTACTGCATAACTTAAAACTAGAGAGTTCCATGATCTACAAACTAACCTGCAGCCGGGCAGATTGGAGATCGGAAC
TCTGGAGAAGATGAGCGGAGCAATATGCTACAGAAGTGGAAGCCCCCAATCACTACAAGTTTCCTGCTTCTTCTTCATAATCACTGCTGAGCGAGCGGCGTAGGGGGGTT
TGGGC
Protein sequenceShow/hide protein sequence
MDSRRFLTFSLCFLSVLTAFARLPESHLHNALTGSVLRFNGDISPLVFDPTRVTQLSWQPRAFLYKRLLSDVECDHLIVLAKDKLEKSMVADNESGKSVSSEVRTSSGMF
LQKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELAESQEKDDSWSDCARKGYAVKAQKGD
ALLFFSLHPDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKATRTHASKGCVDEDESCPSWAERASSTVAFEMVQTRKRLQKWYHPMAEEWHHSSEGELPR