CuGenDBv2

Sed0026869 (gene) of Chayote v1 genome

Gene ID: Sed0026869
Organism: Sechium edule (Chayote v1)
Description: Procollagen-proline 3-dioxygenase
Genome location: LG06:2043781..2049438
RNA-Seq Expression: Sed0026869
Synteny: Sed0026869
Gene Ontology terms:
GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
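
The genome location LG06:2043781..2049438 packs a sequence ID and a coordinate range into one string. As a minimal sketch, the snippet below splits such a string and computes the locus span. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG06:2043781..2049438")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

If the coordinates were 0-based and half-open instead, the span would be `end - start` without the added 1.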


Homology
GenBank top hits (e value, % identity, alignment)
XP_022155191.1 uncharacterized protein LOC111022327 [Momordica charantia]; e value: 1.5e-190; % identity: 81.36
Query:  MGDESEN--NRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE EN   RRRRLIL NFL+ EEC+ELEFIHKSCCTVGYRP+VFSTTLLHLVA+NSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDESEN--NRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  CIGWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSY VDF GGLFHF+DGEPK+ISP CGDC MYTAD  NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  CIGWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKEL
        HLH+RFP+SC+P PPSC MYWFSPE+DPNFKFG ++CWARLHALGYDIYFPR++ L +YP LFS  V+LVR  ++FFQEF ++LHALQVVQFM WKGKEL
Subjt:  HLHNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKEL

Query:  DSTSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG
        DST+FKG+SSY   LSP+ N GV YFKSEFSK+  LA+SVFS  S D KEKQ  LGWAKLA A AAWEDYASNLR +LL+SL HWRTNQS+Y VSLG
Subjt:  DSTSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata]; e value: 5.3e-204; % identity: 86.08
Query:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI
        MGDE+E N+RRRL L NFL+LEEC+ELEFIHKSCCTVGYRP VFSTTLLHLV SNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLNSY VDF GGLFHF+DGEPKTISP CGDC MYTAD LNVHSVDE+T+GERLTLTLWFTRDSSHDEDAKL+SLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS
        H+R PDSCLPQPPSC MYWFSP+DDPNFKFGF+ICWARLHALGY IYFP++HSL EYPDLFSQ+V+LVRGN++F Q+FDS+LHALQVVQF+YWKGKELDS
Subjt:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS

Query:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG
        T+ K DSSY E LSP+RNVGVDYFKSEFSKDDALAESVF + S D KEKQHRLGWAKLAA AAAWEDYASNLRR+LL+S  HWRT+QSIYSV  G
Subjt:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG

XP_022989994.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita maxima]; e value: 3.8e-202; % identity: 85.57
Query:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI
        MGDE+E N+R RLIL NFL+LEEC+ELEFIHKSCCTVGYRP VFSTTLLHLV SNSA LIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLNSY VDF GGLFHF+DGEPKTISP CGDC MYTAD LNVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS
        H+R PDSCLPQPPSC MYWFSP+DDPNFKFGF+ICWARLHALGY IYFP++HSL EYPDLFSQ+V+LVRGN++F Q+FDS+LHALQVVQF+YWKGKELDS
Subjt:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS

Query:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG
        T  K DSSY E LSP+RNVGVD+FKSEFSKDDALAESVF + S D KEKQHRLGWAKLAA A AWEDYASNLRR+LL+S  HWRT+QSIYSV  G
Subjt:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo]; e value: 1.4e-204; % identity: 86.33
Query:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI
        MGDE+E N+RRRLIL NFL+LEEC+ELEFIHKSCCTVGYRP VFSTTLLHLV SNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLNSY VDF GGLFHF+DGEPKTISP CGDC MYTAD LNVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS
        H+R PDSCLPQPPSC MYWFSP+DDPNFKFGF+ICWARLHALGY IYFP++HSL EYPDLFSQ+V+LVRGN++F Q+FDS+LHALQVVQF+YWKGKELDS
Subjt:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS

Query:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG
        T+ K DSSY E LSP+RNVGVDYFKSEFSKD+ALAESVF + S D KEKQHRLGWAKLAA AAAWEDYASNLRR+LL+S  HWRT+QSIYSV  G
Subjt:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]; e value: 3.0e-191; % identity: 82.07
Query:  MGD--ESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGD  ES   RRRRLIL NFL+ EEC+ELEFIHKSCCTVGYRPNVFSTTLLHLVA+NSAHLIMPFV IRERLKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGD--ESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  CIGWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQR+F+AVCYLNSY V+FGGGLFHF+DGEP+TISP CGDC MYTAD  NVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  CIGWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKEL
        HLH+R PDS LPQPPSC MYWFS EDDPNFK GF+ICWARLHALGYDIYF  +HS  EYPDLFS++V+LV+GN++FFQEF+++LH LQVVQF+ WKGKEL
Subjt:  HLHNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKEL

Query:  DSTSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSL
        DST+ K DSSY E LSP+RNVGV YFKSEFSKDD LAESVFS  + DGKE QH LGW KLAAAAAAWEDYAS LRR+LL SL++WR +QSIYSVSL
Subjt:  DSTSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SSL8 Procollagen-proline 3-dioxygenase; e value: 1.6e-185; % identity: 79.75
Query:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI
        M D +E+ +RRRLIL NFLS EEC+ELEFIHKSCCTVGYRPNV STTLLHLVA+NSAHLI+PFV IRE+LKEKAEEFFGC YELFVEFTGLISWTRGA I
Subjt:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLNSY V+FGGGLFHF+DGEP+TISP  GDC MYTAD  NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS
        H+RF +SCLPQPPSC MYWFSPE+DPNFKFGF+ICWARLHALGYDIYFP +H   EYPDLFSQ+V+LV G+++FFQ+F+++LH LQVVQF+ WKGKELD+
Subjt:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS

Query:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKL-AAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSL
        T+   DS Y E LSP+RNVGV YFKSEFSK+D LAESVFS  +  GKE QH LGW KL  AAAAAWEDYAS LRR+LL S +HWR  QSIYSVSL
Subjt:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKL-AAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSL

A0A5D3CRE9 Procollagen-proline 3-dioxygenase; e value: 3.1e-186; % identity: 80
Query:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI
        M D +E+ +RRRLIL NFLS EEC+ELEFIHKSCCTVGYRPNV STTLLHLVA+NSAHLI+PFV IRE+LKEKAEEFFGC YELFVEFTGLISWTRGA I
Subjt:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLNSY V+FGGGLFHF+DGEP+TISP  GDC MY AD  NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS
        H+RFP+SCLPQPPSC MYWFSPEDDPNFKFGF+ICWARLHALGYDIYFP +H   EYPDLFSQ+V+LV G+++FFQ+F+++LH LQVVQF+ WKGKELD+
Subjt:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS

Query:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKL-AAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSL
        T+   DS Y E LSP+RNVGV YFKSEFSK+D LAESVFS  +  GKE QH LGW KL  AAAAAWEDYAS LRR+LL S +HWR  QSIYSVSL
Subjt:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKL-AAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSL

A0A6J1DQY8 Procollagen-proline 3-dioxygenase; e value: 7.2e-191; % identity: 81.36
Query:  MGDESEN--NRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE EN   RRRRLIL NFL+ EEC+ELEFIHKSCCTVGYRP+VFSTTLLHLVA+NSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDESEN--NRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  CIGWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSY VDF GGLFHF+DGEPK+ISP CGDC MYTAD  NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  CIGWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKEL
        HLH+RFP+SC+P PPSC MYWFSPE+DPNFKFG ++CWARLHALGYDIYFPR++ L +YP LFS  V+LVR  ++FFQEF ++LHALQVVQFM WKGKEL
Subjt:  HLHNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKEL

Query:  DSTSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG
        DST+FKG+SSY   LSP+ N GV YFKSEFSK+  LA+SVFS  S D KEKQ  LGWAKLA A AAWEDYASNLR +LL+SL HWRTNQS+Y VSLG
Subjt:  DSTSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG

A0A6J1FRP8 Procollagen-proline 3-dioxygenase; e value: 2.6e-204; % identity: 86.08
Query:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI
        MGDE+E N+RRRL L NFL+LEEC+ELEFIHKSCCTVGYRP VFSTTLLHLV SNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLNSY VDF GGLFHF+DGEPKTISP CGDC MYTAD LNVHSVDE+T+GERLTLTLWFTRDSSHDEDAKL+SLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS
        H+R PDSCLPQPPSC MYWFSP+DDPNFKFGF+ICWARLHALGY IYFP++HSL EYPDLFSQ+V+LVRGN++F Q+FDS+LHALQVVQF+YWKGKELDS
Subjt:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS

Query:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG
        T+ K DSSY E LSP+RNVGVDYFKSEFSKDDALAESVF + S D KEKQHRLGWAKLAA AAAWEDYASNLRR+LL+S  HWRT+QSIYSV  G
Subjt:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG

A0A6J1JQV6 Procollagen-proline 3-dioxygenase; e value: 1.8e-202; % identity: 85.57
Query:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI
        MGDE+E N+R RLIL NFL+LEEC+ELEFIHKSCCTVGYRP VFSTTLLHLV SNSA LIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACI

Query:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLNSY VDF GGLFHF+DGEPKTISP CGDC MYTAD LNVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS
        H+R PDSCLPQPPSC MYWFSP+DDPNFKFGF+ICWARLHALGY IYFP++HSL EYPDLFSQ+V+LVRGN++F Q+FDS+LHALQVVQF+YWKGKELDS
Subjt:  HNRFPDSCLPQPPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDS

Query:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG
        T  K DSSY E LSP+RNVGVD+FKSEFSKDDALAESVF + S D KEKQHRLGWAKLAA A AWEDYASNLRR+LL+S  HWRT+QSIYSV  G
Subjt:  TSFKGDSSYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLG

SwissProt top hits (e value, % identity, alignment)
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3; e value: 1.0e-08; % identity: 32.94
Query:  WHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRD-GEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSH
        WH   ++      ++T++ YL+ Y  DFGGG F F D G  +T+ P  G  + +T+   N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFTAVCYLNSYRVDFGGGLFHFRD-GEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSH

Arabidopsis top hits (e value, % identity, alignment)
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e value: 1.4e-117; % identity: 54.95
Query:  RLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACIGWHSDDNRPYL
        RLIL NFLS  ECKELE IHKS  T+GYRPNVFSTTL HL+A+NS HLI+PFVSIRERLKEK EE FGCEYELF+EFTGLISW +GA IGWHSDDNR YL
Subjt:  RLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACIGWHSDDNRPYL

Query:  KQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHNRFPDSCLPQ
        KQR+F AVCYLNSY  DF GGLF F+ GEP T++PS GD  MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP 
Subjt:  KQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHNRFPDSCLPQ

Query:  PPSCIMYWFSP-EDDPNFKFGFNICWARLHALGYDIY-FPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDSTSFKGDS-S
        P S  MYWF P +D  N   GF++C ARLH LG+D++    E    +  +     ++L +G ++  ++F ++LHALQVVQF +WK  EL +++ + D+  
Subjt:  PPSCIMYWFSP-EDDPNFKFGFNICWARLHALGYDIY-FPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDSTSFKGDS-S

Query:  YVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSV
         V+ +S  +   ++  KS F  D+ L  + F + S  G++++  L    +A A  +WE+Y+  L ++LL SL  W+T Q+I+ V
Subjt:  YVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e value: 1.8e-93; % identity: 47.52
Query:  RLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACIGWHSDDNRPYL
        RLIL NFLS  ECKELE IHKS  T+GYRPNVFSTTL HL+A+NS HLI+PFVSIRERLKEK EE FGCEYELF+EFTGLISW +GA IGWHSDDNR YL
Subjt:  RLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACIGWHSDDNRPYL

Query:  KQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHNRFPDSCLPQ
        KQR+F +                    GEP T++PS GD  MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ              
Subjt:  KQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHNRFPDSCLPQ

Query:  PPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIY-FPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDSTSFKGDS-SY
                            F++C ARLH LG+D++    E    +  +     ++L +G ++  ++F ++LHALQVVQF +WK  EL +++ + D+   
Subjt:  PPSCIMYWFSPEDDPNFKFGFNICWARLHALGYDIY-FPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDSTSFKGDS-SY

Query:  VEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSV
        V+ +S  +   ++  KS F  D+ L  + F + S  G++++  L    +A A  +WE+Y+  L ++LL SL  W+T Q+I+ V
Subjt:  VEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e value: 1.7e-104; % identity: 51.04
Query:  RLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACIGWHSDDNRPYL
        RLIL NFLS  ECKELE IHKS  T+GYRPNVFSTTL HL+A+NS HLI+PFVSIRERLKEK EE FGCEYELF+EFTGLISW +GA IGWHSDDNR YL
Subjt:  RLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACIGWHSDDNRPYL

Query:  KQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHNRFPDSCLPQ
        KQR+F +                    GEP T++PS GD  MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP 
Subjt:  KQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHNRFPDSCLPQ

Query:  PPSCIMYWFSP-EDDPNFKFGFNICWARLHALGYDIY-FPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDSTSFKGDS-S
        P S  MYWF P +D  N   GF++C ARLH LG+D++    E    +  +     ++L +G ++  ++F ++LHALQVVQF +WK  EL +++ + D+  
Subjt:  PPSCIMYWFSP-EDDPNFKFGFNICWARLHALGYDIY-FPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDSTSFKGDS-S

Query:  YVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSV
         V+ +S  +   ++  KS F  D+ L  + F + S  G++++  L    +A A  +WE+Y+  L ++LL SL  W+T Q+I+ V
Subjt:  YVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e value: 1.4e-48; % identity: 42.39
Query:  MYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHNRFPDSCLPQPPSCIMYWFSP-EDDPNFKFGFNICWARLHALGYDIY-FPR
        MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S  MYWF P +D  N   GF++C ARLH LG+D++    
Subjt:  MYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHNRFPDSCLPQPPSCIMYWFSP-EDDPNFKFGFNICWARLHALGYDIY-FPR

Query:  EHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDSTSFKGDS-SYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEK
        E    +  +     ++L +G ++  ++F ++LHALQVVQF +WK  EL +++ + D+   V+ +S  +   ++  KS F  D+ L  + F + S  G+++
Subjt:  EHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDSTSFKGDS-SYVEDLSPERNVGVDYFKSEFSKDDALAESVFSFGSGDGKEK

Query:  QHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSV
        +  L    +A A  +WE+Y+  L ++LL SL  W+T Q+I+ V
Subjt:  QHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSV


Sequences
CDS sequence
ATGGGAGACGAATCGGAGAACAATCGGCGGCGGCGTCTGATTCTGGGAAATTTCTTATCCTTGGAAGAATGCAAGGAACTGGAATTCATCCATAAGAGCTGCTGTACGGT
GGGTTATAGACCAAATGTATTTTCCACCACTCTTTTGCATCTTGTTGCTTCTAATTCTGCTCATTTGATCATGCCCTTTGTTTCAATTAGAGAGAGGTTGAAGGAGAAAG
CGGAGGAGTTTTTTGGTTGTGAGTATGAACTGTTCGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCATGCATTGGATGGCATAGTGACGATAACCGGCCCTAT
CTAAAACAACGTGAATTTACAGCAGTGTGTTACTTGAATAGTTACAGAGTAGATTTTGGCGGTGGGCTGTTTCACTTTCGGGACGGGGAACCAAAAACTATCTCACCTTC
TTGTGGAGATTGTGCAATGTACACGGCTGACGGCCTCAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGACTTACACTGACATTGTGGTTCACCCGTGATAGTT
CCCATGACGAAGATGCAAAACTTCTTTCGCTTCTTTCACAAAGCCATTTACACAATCGTTTTCCTGACTCGTGCTTACCACAGCCTCCATCCTGCATCATGTATTGGTTT
TCACCTGAAGACGATCCAAATTTCAAGTTTGGGTTTAATATATGTTGGGCGAGACTGCATGCGCTTGGATACGACATTTATTTTCCTCGGGAGCATAGTTTGTTAGAGTA
TCCAGATTTATTCTCACAGAATGTAAGATTAGTACGTGGAAATGAGATGTTTTTTCAGGAGTTCGATAGCGTTTTGCATGCACTTCAGGTAGTGCAGTTTATGTATTGGA
AAGGCAAAGAATTGGATTCTACTAGCTTCAAGGGAGATTCAAGCTATGTAGAAGATTTATCTCCAGAAAGGAATGTGGGAGTTGATTACTTTAAATCCGAGTTCTCGAAG
GATGATGCACTGGCCGAGTCCGTCTTCTCGTTTGGTAGTGGTGATGGCAAGGAGAAGCAGCACAGGTTGGGGTGGGCTAAGCTTGCTGCAGCAGCAGCAGCTTGGGAAGA
TTATGCTTCCAATTTAAGGAGAAAACTTCTTCAGAGCTTAACTCATTGGAGAACCAATCAATCCATATACAGTGTTTCACTTGGTGGTTGA
mRNA sequence
ATTTTTAAAAAAATGACGCTGGATATTGGAGCTTCTATTTTGCGTTAATGGAAGAAGGGCAACAATTTTAGGAACCCTAGAATCAGTATAGTCGAACGATATTTTACACA
AAAATCGTTTGATCTTTGTAGATATCGAAGCAATAATTTGAATAATCAAGAGAAATCTATTCAGATTATTACCATCGAAACGGAGAATTGGGCGAAAATGGGAGACGAAT
CGGAGAACAATCGGCGGCGGCGTCTGATTCTGGGAAATTTCTTATCCTTGGAAGAATGCAAGGAACTGGAATTCATCCATAAGAGCTGCTGTACGGTGGGTTATAGACCA
AATGTATTTTCCACCACTCTTTTGCATCTTGTTGCTTCTAATTCTGCTCATTTGATCATGCCCTTTGTTTCAATTAGAGAGAGGTTGAAGGAGAAAGCGGAGGAGTTTTT
TGGTTGTGAGTATGAACTGTTCGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCATGCATTGGATGGCATAGTGACGATAACCGGCCCTATCTAAAACAACGTG
AATTTACAGCAGTGTGTTACTTGAATAGTTACAGAGTAGATTTTGGCGGTGGGCTGTTTCACTTTCGGGACGGGGAACCAAAAACTATCTCACCTTCTTGTGGAGATTGT
GCAATGTACACGGCTGACGGCCTCAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGACTTACACTGACATTGTGGTTCACCCGTGATAGTTCCCATGACGAAGA
TGCAAAACTTCTTTCGCTTCTTTCACAAAGCCATTTACACAATCGTTTTCCTGACTCGTGCTTACCACAGCCTCCATCCTGCATCATGTATTGGTTTTCACCTGAAGACG
ATCCAAATTTCAAGTTTGGGTTTAATATATGTTGGGCGAGACTGCATGCGCTTGGATACGACATTTATTTTCCTCGGGAGCATAGTTTGTTAGAGTATCCAGATTTATTC
TCACAGAATGTAAGATTAGTACGTGGAAATGAGATGTTTTTTCAGGAGTTCGATAGCGTTTTGCATGCACTTCAGGTAGTGCAGTTTATGTATTGGAAAGGCAAAGAATT
GGATTCTACTAGCTTCAAGGGAGATTCAAGCTATGTAGAAGATTTATCTCCAGAAAGGAATGTGGGAGTTGATTACTTTAAATCCGAGTTCTCGAAGGATGATGCACTGG
CCGAGTCCGTCTTCTCGTTTGGTAGTGGTGATGGCAAGGAGAAGCAGCACAGGTTGGGGTGGGCTAAGCTTGCTGCAGCAGCAGCAGCTTGGGAAGATTATGCTTCCAAT
TTAAGGAGAAAACTTCTTCAGAGCTTAACTCATTGGAGAACCAATCAATCCATATACAGTGTTTCACTTGGTGGTTGAACCTTCCACTTGTGGGAAAGTTACAATCCCAA
AAGCTAAAAGTAGCTGAGCTTTAAGGTTATTTCTGGGCCTTTGTATAGCTTAACTAGATTATACTTGTCCAGTCATACCCTCCTCAACGACTAGGTCGAGGTCTTTTGTT
TGATTCCTCCTTTTTCGCATCGACTAAACAGCCTTAAGAATCTTATATCTTTAAGCCGAGTTAAGTGGATTTATGGTGATGGATGTATCAAACATTTTGTAGATTAACCT
GGTAGTTAAGTATGATATATGATTACCGAATGAGTCGGTGGCTTAATCACTTTCTATTCCACGATCACGATTTCGATCCCAAGAATCGGTATTTGTAATCTTCTTCCCCC
AACTTGTACTAAAAAAAGGTATGATATTTGATTCAATCTATGTGAGCTTATGAGTCGCTTCCGAGATGCAAGTGGCACAAGATAGACTCAACACCATTGTTGCTCACGCT
CACCAGACCTATCTAGAGGTTACAGATTTGAATCTTCAGATGAGTTTTAAGAAAATCTTTTACGTGTAAGGCTCGGC
Protein sequence
MGDESENNRRRRLILGNFLSLEECKELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGACIGWHSDDNRPY
LKQREFTAVCYLNSYRVDFGGGLFHFRDGEPKTISPSCGDCAMYTADGLNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHNRFPDSCLPQPPSCIMYWF
SPEDDPNFKFGFNICWARLHALGYDIYFPREHSLLEYPDLFSQNVRLVRGNEMFFQEFDSVLHALQVVQFMYWKGKELDSTSFKGDSSYVEDLSPERNVGVDYFKSEFSK
DDALAESVFSFGSGDGKEKQHRLGWAKLAAAAAAWEDYASNLRRKLLQSLTHWRTNQSIYSVSLGG
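
The CDS and protein sequences listed for Sed0026869 can be cross-checked by translating the CDS. The sketch below does this for the first and last few codons only, assuming the standard genetic code (the page does not say which translation table was used). `translate`, `cds_start`, and `cds_end` are illustrative names, and the two fragments are copied from the start and end of the CDS sequence shown on this page.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (an assumption;
# the page does not state which translation table applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 13 codons and last 5 codons of the CDS sequence listed above.
cds_start = "ATGGGAGACGAATCGGAGAACAATCGGCGGCGGCGTCTG"
cds_end = "TCACTTGGTGGTTGA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end; '*' is the stop codon
```

Passing the full CDS, joined into a single string without line breaks, to `translate` would allow the same comparison over the entire protein sequence.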