; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0003101 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0003101
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationchr10:9432683..9447070
RNA-Seq ExpressionPI0003101
SyntenyPI0003101
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa]1.2e-22494.71Show/hide
Query:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        M D AESRQRRRLILENFLSREECRELEFIH+SCCTVGYRPNV STTLLHLVATNSA LIIPFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKLLSLLSQ+PL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS
        TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SFSHWRNCQSIYSVSLDS
Subjt:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus]1.3e-22394.19Show/hide
Query:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        M D AESRQRRRLILENFLSREECRELEFIH+SC TVGYRPNVFSTTLLHLVATNSA LIIPFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSS DEDAKLLSLLSQ+PL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL +LGYD+YFPGDH FSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS
        TNL EDSSYAEYLSPKRNVGVSYFKSEFSKN GLAESVF SA SDG ENQ WLGWDKLV AAAAWE YASILRRELL SFSHWRNCQSIYSVSLDS
Subjt:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]6.0e-22494.46Show/hide
Query:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        M D AESRQRRRLILENFLSREECRELEFIH+SCCTVGYRPNV STTLLHLVATNSA LIIPFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKLLSLLSQ+PL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS
        TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SFSHWRNCQSIYSVSLDS
Subjt:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS

XP_008456833.1 PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo]9.6e-19886.4Show/hide
Query:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        M D AESRQRRRLILENFLSREECRELEFIH+SCCTVGYRPNV STTLLHLVATNSA LIIPFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL
        GWHSDDNRPYLKQREFS                                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKLLSLLSQ+PL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS
        TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SFSHWRNCQSIYSVSLDS
Subjt:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]1.5e-21189.95Show/hide
Query:  MGDEAES--RQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGA
        MGDE ES  R+RRRLILENFL+REECRELEFIH+SCCTVGYRPNVFSTTLLHLVATNSA LI+PFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGA
Subjt:  MGDEAES--RQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQN
        SIGWHSDDNRPYLKQR+FSAVCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTADSDNVHSVDEITNGERLTLTLW TRDSS DED+KLLSLLSQ+
Subjt:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQN

Query:  PLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL
         LHDR PDS LPQPPSCNMYWFS EDDPNFK GFDICWARLH+LGYDIYF GDH FSEYPDLFS+DVQLV G+K+FFQ+FENILHLLQVVQFLCWKGKEL
Subjt:  PLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL

Query:  DSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS
        DSTN+KEDSSYAEYLSPKRNVGVSYFKSEFSK+  LAESVF SATSDG ENQHWLGWDKL  AAAAWEDYASILRRELL S S+WRN QSIYSVSL S
Subjt:  DSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS

TrEMBL top hitse value%identityAlignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase6.7e-22190.75Show/hide
Query:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLIS-------
        M D AESRQRRRLILENFLSREECRELEFIH+SC TVGYRPNVFSTTLLHLVATNSA LIIPFVPIRE+LKEKAEEFFGCHYELFVEFTGLIS       
Subjt:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLIS-------

Query:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQ
                WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSS 
Subjt:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQ

Query:  DEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLL
        DEDAKLLSLLSQ+PLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL +LGYD+YFPGDH FSEYPDLF QDVQLVWGDKIFFQKFENILHLL
Subjt:  DEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLL

Query:  QVVQFLCWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRN
        QVVQFLCWKGKELDSTNL EDSSYAEYLSPKRNVGVSYFKSEFSKN GLAESVF SA SDG ENQ WLGWDKLV AAAAWE YASILRRELL SFSHWRN
Subjt:  QVVQFLCWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRN

Query:  CQSIYSVSLDS
        CQSIYSVSLDS
Subjt:  CQSIYSVSLDS

A0A1S3C486 Procollagen-proline 3-dioxygenase2.9e-22494.46Show/hide
Query:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        M D AESRQRRRLILENFLSREECRELEFIH+SCCTVGYRPNV STTLLHLVATNSA LIIPFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKLLSLLSQ+PL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS
        TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SFSHWRNCQSIYSVSLDS
Subjt:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS

A0A1S3C4U3 uncharacterized protein LOC103496668 isoform X24.7e-19886.4Show/hide
Query:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        M D AESRQRRRLILENFLSREECRELEFIH+SCCTVGYRPNV STTLLHLVATNSA LIIPFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL
        GWHSDDNRPYLKQREFS                                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKLLSLLSQ+PL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS
        TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SFSHWRNCQSIYSVSLDS
Subjt:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase2.9e-22494.46Show/hide
Query:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        M D AESRQRRRLILENFLSREECRELEFIH+SCCTVGYRPNV STTLLHLVATNSA LIIPFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKLLSLLSQ+PL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS
        TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SFSHWRNCQSIYSVSLDS
Subjt:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase5.9e-22594.71Show/hide
Query:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        M D AESRQRRRLILENFLSREECRELEFIH+SCCTVGYRPNV STTLLHLVATNSA LIIPFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKLLSLLSQ+PL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS
        TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SFSHWRNCQSIYSVSLDS
Subjt:  TNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS

SwissProt top hitse value%identityAlignment
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 37.3e-0732.5Show/hide
Query:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFT
        WH   ++      +++++ YL+ Y  +FGGG F F D G   T+ P  G    +T+ S+N+H V++++ G R  +T+ FT
Subjt:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFT

Q8CG71 Prolyl 3-hydroxylase 27.1e-1026.85Show/hide
Query:  RRLILENFLSREECRELEFIHRSCCTVG---------YRPN---VFSTTLLHL-------VATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS+E+CREL  +      VG         + PN     +T L  L       V   SA+L   F  I E+ ++  E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHRSCCTVG---------YRPN---VFSTTLLHL-------VATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADSDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADSDNVHSVDE

Query:  ITNGERLTLTLWFTRD
        +T G+R  + LWFT D
Subjt:  ITNGERLTLTLWFTRD

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-11453.71Show/hide
Query:  RQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IH+S  T+GYRPNVFSTTL HL+ATNS  LIIPFV IRERLKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDS
        R YLKQR+F+AVCYLNSY  +F GGLF FQ GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSS DED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKE
        CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N++ 
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKE

Query:  DS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  +  L  + F   +  G + +  L    + +A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLD

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.2e-9046.41Show/hide
Query:  RQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IH+S  T+GYRPNVFSTTL HL+ATNS  LIIPFV IRERLKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDS
        R YLKQR+F++                    GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSS DED+KLLS LSQ          
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDS

Query:  CLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKED
               C                FD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N++ D
Subjt:  CLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKED

Query:  S-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLD
        +    + +S  +   ++  KS F  +  L  + F   +  G + +  L    + +A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  S-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLD

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.7e-10249.87Show/hide
Query:  RQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IH+S  T+GYRPNVFSTTL HL+ATNS  LIIPFV IRERLKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDS
        R YLKQR+F++                    GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSS DED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKE
        CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N++ 
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKE

Query:  DS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  +  L  + F   +  G + +  L    + +A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLD

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.7e-4842.68Show/hide
Query:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPG
        MYTAD  N+HSVDE+T+GERLTL LWF+RDSS DED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G
Subjt:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPG

Query:  DHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNEN
        +   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N++ D+    + +S  +   ++  KS F  +  L  + F   +  G + 
Subjt:  DHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNEN

Query:  QHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLD
        +  L    + +A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  QHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGATGAAGCTGAGAGCAGACAGCGGCGGCGTCTGATTCTCGAAAATTTCTTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAGGAGCTGCTGTACGGT
GGGGTATAGACCGAACGTCTTCTCCACCACTTTGTTGCATCTTGTTGCCACTAATTCTGCTCAATTGATCATCCCTTTTGTTCCGATTAGAGAGAGGTTGAAGGAGAAAG
CCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTAT
CTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGGCTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTT
TTATGGAGATTGTGTGATGTACACGGCTGACAGTGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCT
CCCAGGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAACCCTTTACACGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCATCCTGTAATATGTATTGGTTT
TCACCAGAAGATGATCCAAATTTTAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCATTCGCTTGGATACGACATCTATTTTCCTGGGGACCATGGTTTTTCAGAGTA
TCCAGATTTATTCTCACAGGATGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATCTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGA
AAGGCAAAGAGCTGGATTCTACCAACCTCAAGGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAG
AATATTGGGTTGGCTGAATCGGTCTTCTTATCTGCTACTTCCGATGGCAACGAGAACCAACACTGGTTGGGGTGGGATAAGCTTGTTGTTGCTGCAGCAGCTTGGGAAGA
TTATGCTTCCATTTTAAGGAGAGAACTCCTTCGGAGCTTCAGCCATTGGAGAAATTGTCAATCCATATACAGTGTTTCACTTGATAGCTGA
mRNA sequenceShow/hide mRNA sequence
TTTTGCCTTAAAAATCTGCTTTCTTTCGAAGCAAACTTATTCTATTTTTGTGCAATTTCCCTAAGTTATTTTGGACAATTTTCCCACGTCTCGCTGAAACGGTAGGGACG
GAGAACTGGACGCCAAAATGGGAGATGAAGCTGAGAGCAGACAGCGGCGGCGTCTGATTCTCGAAAATTTCTTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCAT
AGGAGCTGCTGTACGGTGGGGTATAGACCGAACGTCTTCTCCACCACTTTGTTGCATCTTGTTGCCACTAATTCTGCTCAATTGATCATCCCTTTTGTTCCGATTAGAGA
GAGGTTGAAGGAGAAAGCCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTG
ATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGGCTGTTTCACTTTCAGGATGGGGAACCA
GAAACCATCTCACCTTTTTATGGAGATTGTGTGATGTACACGGCTGACAGTGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATG
GTTCACCCGTGATAGCTCCCAGGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAACCCTTTACACGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCATCCT
GTAATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTTAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCATTCGCTTGGATACGACATCTATTTTCCTGGGGAC
CATGGTTTTTCAGAGTATCCAGATTTATTCTCACAGGATGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATCTTGCATTTGCTTCAGGTAGT
GCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAAGGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTA
AATCTGAGTTTTCGAAGAATATTGGGTTGGCTGAATCGGTCTTCTTATCTGCTACTTCCGATGGCAACGAGAACCAACACTGGTTGGGGTGGGATAAGCTTGTTGTTGCT
GCAGCAGCTTGGGAAGATTATGCTTCCATTTTAAGGAGAGAACTCCTTCGGAGCTTCAGCCATTGGAGAAATTGTCAATCCATATACAGTGTTTCACTTGATAGCTGAGT
ATCCCAAGTACTTAAAGTAGCAGCAGCTTCAAGGTTAGTTCAGGGCCTTTGTATAGCTTAACTTTTACAAGATCAATAACTCAATAAAGTTATGTTCTGTAACAATTTTG
TTTTTCTATTTTAGAAATTTGTGTTTGTTTCATCTCAAATTTAAAACCAAGTAAAATTTTGAAAACTAAAAATAGTAGATTTTAAAA
Protein sequenceShow/hide protein sequence
MGDEAESRQRRRLILENFLSREECRELEFIHRSCCTVGYRPNVFSTTLLHLVATNSAQLIIPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPY
LKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWF
SPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSK
NIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFSHWRNCQSIYSVSLDS