CuGenDBv2

Tan0018526 (gene) of Snake gourd v1 genome

Gene ID: Tan0018526
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Procollagen-proline 3-dioxygenase
Genome location: LG04:35983434..36017130
RNA-Seq Expression: Tan0018526
Synteny: Tan0018526
Gene Ontology terms:
GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology
GenBank top hits | e-value | % identity | Alignment
XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus] | 1.5e-182 | 80.05
Query:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKT---------AHSSIFTIVSFLVIWTRGASI
        M D AES QRRRLILENF++REECRELEFIHKSC TVGYRPNVFSTTLLHLVA+NSAHLI+PFV IR K           H  +F   + L+ WTRGASI
Subjt:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKT---------AHSSIFTIVSFLVIWTRGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLN+YGV FGGGLFHFQDGEP+TISPF GDCVMYTAD+ NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS
        HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL  LGY++YFP DH  SEYPDLF QDVQLV G+KIFFQ F++ILH LQVVQFLCWKG ELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS

Query:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        TN   DSSYAE LSPKRNVGV YFKSEFSK++ L ESVFSSA+SDGKE Q  LGW KL AA  AWE YAS LRRELL S +HWR  QSIYSVSL S
Subjt:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata] | 5.9e-192 | 83.59
Query:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI
        MGDEAE NQRRRL LENF+T EECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSAHLIMPFVSIR +    +         +F   + L+ WTRGA I
Subjt:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLN+YGV F GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKL+SLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARLH LGY IYFPQDHSLSEYPDLFSQDVQLVRGNKIF Q FDSILHALQVVQFL WKG ELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS

Query:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        TN K DSSYAE LSPKRNVGVDYFKSEFSKD+AL ESVF  ASSD KEKQH LGW KLAA   AWEDYASNLRRELL+S  HWRTSQSIYSV  GS
Subjt:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

XP_022989994.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita maxima] | 4.2e-190 | 83.08
Query:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI
        MGDEAE NQR RLILENF+T EECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSA LIMPFVSIR +    +         +F   + L+ WTRGA I
Subjt:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLN+YGV F GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARLH LGY IYFPQDHSLSEYPDLFSQDVQLVRGNKIF Q FDSILHALQVVQFL WKG ELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS

Query:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        T+ K DSSYAE LSPKRNVGVD+FKSEFSKD+AL ESVF  ASSD KEKQH LGW KLAA   AWEDYASNLRRELL+S  HWRTSQSIYSV  GS
Subjt:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo] | 1.1e-193 | 84.34
Query:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI
        MGDEAE NQRRRLILENF+T EECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSAHLIMPFVSIR +    +         +F   + L+ WTRGA I
Subjt:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLN+YGV F GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARLH LGY IYFPQDHSLSEYPDLFSQDVQLVRGNKIF Q FDSILHALQVVQFL WKG ELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS

Query:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        TN K DSSYAE LSPKRNVGVDYFKSEFSKDNAL ESVF  ASSD KEKQH LGW KLAA   AWEDYASNLRRELL+S  HWRTSQSIYSV  GS
Subjt:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida] | 9.8e-187 | 81.16
Query:  MGDEAES--NQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKT---------AHSSIFTIVSFLVIWTRGA
        MGDE ES   +RRRLILENF+TREECRELEFIHKSCCTVGYRPNVFSTTLLHLVA+NSAHLIMPFV IR +           H  +F   + L+ WTRGA
Subjt:  MGDEAES--NQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKT---------AHSSIFTIVSFLVIWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+F+AVCYLN+YGV FGGGLFHFQDGEP+TISPFCGDCVMYTADS NVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTEL
        HLHDR PDS LPQPPSCNMYWFS EDDPNFK GFDICWARLH LGY+IYF  DHS SEYPDLFS+DVQLV+GNK+FFQ+F++ILH LQVVQFLCWKG EL
Subjt:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTEL

Query:  DSTNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        DSTN K DSSYAE LSPKRNVGV YFKSEFSKD+ L ESVFSSA+SDGKE QH LGW KLAAA  AWEDYAS LRRELL SL++WR SQSIYSVSL S
Subjt:  DSTNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

TrEMBL top hits | e-value | % identity | Alignment
A0A5A7SSL8 Procollagen-proline 3-dioxygenase | 3.5e-182 | 79.85
Query:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKT---------AHSSIFTIVSFLVIWTRGASI
        M D AES QRRRLILENF++REECRELEFIHKSCCTVGYRPNV STTLLHLVA+NSAHLI+PFV IR K           H  +F   + L+ WTRGASI
Subjt:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKT---------AHSSIFTIVSFLVIWTRGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLN+YGV FGGGLFHFQDGEP+TISPF GDCVMYTADS NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLH LGY+IYFP DH  SEYPDLFSQDVQLV G+KIFFQ F++ILH LQVVQFLCWKG ELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS

Query:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKL-AAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        TN   DS YAE LSPKRNVGV YFKSEFSK++ L ESVFSSA+S GKE QH LGW KL  AA  AWEDYAS LRRELL S +HWR  QSIYSVSL S
Subjt:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKL-AAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase | 7.1e-183 | 80.1
Query:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKT---------AHSSIFTIVSFLVIWTRGASI
        M D AES QRRRLILENF++REECRELEFIHKSCCTVGYRPNV STTLLHLVA+NSAHLI+PFV IR K           H  +F   + L+ WTRGASI
Subjt:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKT---------AHSSIFTIVSFLVIWTRGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREF+AVCYLN+YGV FGGGLFHFQDGEP+TISPF GDCVMY ADS NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLH LGY+IYFP DH  SEYPDLFSQDVQLV G+KIFFQ F++ILH LQVVQFLCWKG ELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS

Query:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKL-AAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        TN   DS YAE LSPKRNVGV YFKSEFSK++ L ESVFSSA+S GKE QH LGW KL  AA  AWEDYAS LRRELL S +HWR  QSIYSVSL S
Subjt:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKL-AAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

A0A6J1DQY8 Procollagen-proline 3-dioxygenase | 9.2e-183 | 78.64
Query:  MGDEAESNQ--RRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGA
        MGDE E+ Q  RRRLILENF+TREECRELEFIHKSCCTVGYRP+VFSTTLLHLVA+NSAHLIMPFV IR +    +         +F   + L+ WTRGA
Subjt:  MGDEAESNQ--RRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREFTAVCYLN+YGV F GGLFHFQDGEPK+ISPFCGDCVMYTADS NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTEL
        HLHDRFP+SC+P PPSCNMYWFSPE+DPNFKFG D+CWARLH LGY+IYFP+D+ LS+YP LFS  VQLVR  KIFFQ+F +ILHALQVVQF+CWKG EL
Subjt:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTEL

Query:  DSTNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        DSTNFKG+SSYA  LSPK N GV YFKSEFSK+  L +SVFSSASSD KEKQ  LGW KLA A  AWEDYASNLR ELL+SL HWRT+QS+Y VSLGS
Subjt:  DSTNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

A0A6J1FRP8 Procollagen-proline 3-dioxygenase | 2.9e-192 | 83.59
Query:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI
        MGDEAE NQRRRL LENF+T EECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSAHLIMPFVSIR +    +         +F   + L+ WTRGA I
Subjt:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLN+YGV F GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKL+SLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARLH LGY IYFPQDHSLSEYPDLFSQDVQLVRGNKIF Q FDSILHALQVVQFL WKG ELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS

Query:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        TN K DSSYAE LSPKRNVGVDYFKSEFSKD+AL ESVF  ASSD KEKQH LGW KLAA   AWEDYASNLRRELL+S  HWRTSQSIYSV  GS
Subjt:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

A0A6J1JQV6 Procollagen-proline 3-dioxygenase | 2.1e-190 | 83.08
Query:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI
        MGDEAE NQR RLILENF+T EECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSA LIMPFVSIR +    +         +F   + L+ WTRGA I
Subjt:  MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSS---------IFTIVSFLVIWTRGASI

Query:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFTAVCYLN+YGV F GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
Subjt:  GWHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARLH LGY IYFPQDHSLSEYPDLFSQDVQLVRGNKIF Q FDSILHALQVVQFL WKG ELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDS

Query:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
        T+ K DSSYAE LSPKRNVGVD+FKSEFSKD+AL ESVF  ASSD KEKQH LGW KLAA   AWEDYASNLRRELL+S  HWRTSQSIYSV  GS
Subjt:  TNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS

SwissProt top hits | e-value | % identity | Alignment
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 | 2.9e-08 | 32.94
Query:  WHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQD-GEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSH
        WH   ++      ++T++ YL++Y   FGGG F F D G  +T+ P  G    +T+ S N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFTAVCYLNNYGVGFGGGLFHFQD-GEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSH
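The % identity column can be checked against an alignment's middle line, assuming the usual BLAST convention that a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below is illustrative only; `percent_identity` is a hypothetical helper, not part of the database. For the Q5XGE0 hit above, the middle line appears to contain 28 letters over 85 aligned columns, which is consistent with the listed 32.94.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block.

    query   -- the aligned query string (gaps included), used for the column count
    midline -- the line between Query and Subjt; letters mark identical residues
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    # Only letters count as identities; '+' and spaces do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / columns, 2)


# Toy block: 3 identical positions out of 5 aligned columns.
print(percent_identity("WHSDD", "WH +D"))
```

The column count is taken from the query line rather than the middle line because trailing spaces in the middle line are easily lost when a page is copied as text.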

Arabidopsis top hits | e-value | % identity | Alignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 7.6e-105 | 51.43
Query:  RLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHS---------SIFTIVSFLVIWTRGASIGWHSDDNRPYL
        RLIL NF++  EC+ELE IHKS  T+GYRPNVFSTTL HL+A+NS HLI+PFVSIR +              +F   + L+ W +GASIGWHSDDNR YL
Subjt:  RLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHS---------SIFTIVSFLVIWTRGASIGWHSDDNRPYL

Query:  KQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQ
        KQR+F AVCYLN+Y   F GGLF FQ GEP T++P  GD +MYTAD RN+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP 
Subjt:  KQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQ

Query:  PPSCNMYWFSP-EDDPNFKFGFDICWARLHVLGYNIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDSTNFKGDS-
        P S NMYWF P +D  N   GFD+C ARLH+LG++++  Q  DHS      L    +QL +G K+  + F +ILHALQVVQF  WK +EL ++N + D+ 
Subjt:  PPSCNMYWFSP-EDDPNFKFGFDICWARLHVLGYNIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDSTNFKGDS-

Query:  SYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSV
           + +S  +   ++  KS F  D  LV + F  + S G++++  L    +A A T+WE+Y+  L +ELL SL  W+T Q+I+ V
Subjt:  SYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 5.9e-81 | 44.01
Query:  RLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHS---------SIFTIVSFLVIWTRGASIGWHSDDNRPYL
        RLIL NF++  EC+ELE IHKS  T+GYRPNVFSTTL HL+A+NS HLI+PFVSIR +              +F   + L+ W +GASIGWHSDDNR YL
Subjt:  RLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHS---------SIFTIVSFLVIWTRGASIGWHSDDNRPYL

Query:  KQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQ
        KQR+F +                    GEP T++P  GD +MYTAD RN+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ              
Subjt:  KQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQ

Query:  PPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDSTNFKGDS-S
                            FD+C ARLH+LG++++  Q  DHS      L    +QL +G K+  + F +ILHALQVVQF  WK +EL ++N + D+  
Subjt:  PPSCNMYWFSPEDDPNFKFGFDICWARLHVLGYNIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDSTNFKGDS-S

Query:  YAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSV
          + +S  +   ++  KS F  D  LV + F  + S G++++  L    +A A T+WE+Y+  L +ELL SL  W+T Q+I+ V
Subjt:  YAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 5.1e-93 | 47.79
Query:  RLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHS---------SIFTIVSFLVIWTRGASIGWHSDDNRPYL
        RLIL NF++  EC+ELE IHKS  T+GYRPNVFSTTL HL+A+NS HLI+PFVSIR +              +F   + L+ W +GASIGWHSDDNR YL
Subjt:  RLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHS---------SIFTIVSFLVIWTRGASIGWHSDDNRPYL

Query:  KQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQ
        KQR+F +                    GEP T++P  GD +MYTAD RN+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP 
Subjt:  KQREFTAVCYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQ

Query:  PPSCNMYWFSP-EDDPNFKFGFDICWARLHVLGYNIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDSTNFKGDS-
        P S NMYWF P +D  N   GFD+C ARLH+LG++++  Q  DHS      L    +QL +G K+  + F +ILHALQVVQF  WK +EL ++N + D+ 
Subjt:  PPSCNMYWFSP-EDDPNFKFGFDICWARLHVLGYNIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDSTNFKGDS-

Query:  SYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSV
           + +S  +   ++  KS F  D  LV + F  + S G++++  L    +A A T+WE+Y+  L +ELL SL  W+T Q+I+ V
Subjt:  SYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 6.1e-54 | 47.13
Query:  MYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHVLGYNIYFPQ-
        MYTAD RN+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH+LG++++  Q 
Subjt:  MYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHVLGYNIYFPQ-

Query:  -DHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDSTNFKGDS-SYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKE
         DHS      L    +QL +G K+  + F +ILHALQVVQF  WK +EL ++N + D+    + +S  +   ++  KS F  D  LV + F  + S G++
Subjt:  -DHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDSTNFKGDS-SYAEDLSPKRNVGVDYFKSEFSKDNALVESVFSSASSDGKE

Query:  KQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSV
        ++  L    +A A T+WE+Y+  L +ELL SL  W+T Q+I+ V
Subjt:  KQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSV


Sequences
CDS sequence
ATGGGAGACGAAGCGGAGAGCAATCAGCGGCGGCGTCTGATTCTGGAAAATTTCATAACCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGT
GGGTTATAGACCAAATGTCTTCTCCACTACTCTTTTGCATCTTGTTGCCTCTAATTCTGCTCATTTAATCATGCCTTTTGTTTCGATTAGAGGTAAAACAGCCCATTCCT
CGATTTTCACCATTGTTTCCTTTTTGGTAATCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGACGATAACCGGCCGTATCTAAAACAACGTGAATTTACAGCAGTG
TGTTACTTGAATAATTATGGAGTTGGTTTTGGAGGTGGGCTATTTCATTTTCAGGACGGGGAACCAAAAACTATCTCGCCTTTTTGTGGAGATTGTGTGATGTACACGGC
CGATAGTCGCAATGTTCATTCCGTTGATGAGATAACCAATGGAGAGAGACTTACACTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTCTTT
CGCTTCTTTCACAAAGCCATTTACACGATCGATTTCCCGACTCCTGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTCAAG
TTCGGTTTTGATATATGTTGGGCGAGACTGCATGTGCTTGGATACAACATTTATTTTCCTCAGGACCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGATGTACA
ATTAGTACGGGGAAATAAGATATTCTTTCAGGATTTTGACAGCATTTTGCATGCGCTTCAGGTAGTGCAGTTTCTGTGTTGGAAAGGCACAGAACTGGATTCTACTAACT
TCAAGGGTGATTCAAGCTATGCAGAAGATTTATCTCCAAAGAGGAACGTGGGAGTCGATTACTTTAAATCTGAGTTCTCAAAGGACAATGCACTGGTCGAGTCAGTCTTC
TCATCTGCTAGTTCTGATGGCAAGGAGAAGCAACACGGGTTGGGGTGGGTTAAGCTTGCTGCAGCAACAACAGCTTGGGAAGATTATGCTTCCAATTTAAGGAGAGAACT
CCTTCAGAGCTTGACCCATTGGAGAACCAGTCAATCCATATACAGTGTTTCACTTGGTAGTTGA
mRNA sequence
ATGGGAGACGAAGCGGAGAGCAATCAGCGGCGGCGTCTGATTCTGGAAAATTTCATAACCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGT
GGGTTATAGACCAAATGTCTTCTCCACTACTCTTTTGCATCTTGTTGCCTCTAATTCTGCTCATTTAATCATGCCTTTTGTTTCGATTAGAGGTAAAACAGCCCATTCCT
CGATTTTCACCATTGTTTCCTTTTTGGTAATCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGACGATAACCGGCCGTATCTAAAACAACGTGAATTTACAGCAGTG
TGTTACTTGAATAATTATGGAGTTGGTTTTGGAGGTGGGCTATTTCATTTTCAGGACGGGGAACCAAAAACTATCTCGCCTTTTTGTGGAGATTGTGTGATGTACACGGC
CGATAGTCGCAATGTTCATTCCGTTGATGAGATAACCAATGGAGAGAGACTTACACTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTCTTT
CGCTTCTTTCACAAAGCCATTTACACGATCGATTTCCCGACTCCTGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTCAAG
TTCGGTTTTGATATATGTTGGGCGAGACTGCATGTGCTTGGATACAACATTTATTTTCCTCAGGACCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGATGTACA
ATTAGTACGGGGAAATAAGATATTCTTTCAGGATTTTGACAGCATTTTGCATGCGCTTCAGGTAGTGCAGTTTCTGTGTTGGAAAGGCACAGAACTGGATTCTACTAACT
TCAAGGGTGATTCAAGCTATGCAGAAGATTTATCTCCAAAGAGGAACGTGGGAGTCGATTACTTTAAATCTGAGTTCTCAAAGGACAATGCACTGGTCGAGTCAGTCTTC
TCATCTGCTAGTTCTGATGGCAAGGAGAAGCAACACGGGTTGGGGTGGGTTAAGCTTGCTGCAGCAACAACAGCTTGGGAAGATTATGCTTCCAATTTAAGGAGAGAACT
CCTTCAGAGCTTGACCCATTGGAGAACCAGTCAATCCATATACAGTGTTTCACTTGGTAGTTGA
Protein sequence
MGDEAESNQRRRLILENFITREECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVSIRGKTAHSSIFTIVSFLVIWTRGASIGWHSDDNRPYLKQREFTAV
CYLNNYGVGFGGGLFHFQDGEPKTISPFCGDCVMYTADSRNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
FGFDICWARLHVLGYNIYFPQDHSLSEYPDLFSQDVQLVRGNKIFFQDFDSILHALQVVQFLCWKGTELDSTNFKGDSSYAEDLSPKRNVGVDYFKSEFSKDNALVESVF
SSASSDGKEKQHGLGWVKLAAATTAWEDYASNLRRELLQSLTHWRTSQSIYSVSLGS
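
The protein sequence should correspond to a translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are names introduced here for illustration. The example translates only the first 18 nucleotides and the last 9 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGGAGACGAAGCGGAG"))  # first 18 nt of the CDS
print(translate("GGTAGTTGA"))           # last 9 nt, ending in the TGA stop codon
```

Passing the full CDS (with its line breaks left in) to `translate` and comparing the result with the protein sequence above would complete the check.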