CuGenDBv2

Moc09g28070 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g28070
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Procollagen-proline 3-dioxygenase
Genome location: chr9:21086647..21101144
RNA-Seq Expression: Moc09g28070
Synteny: Moc09g28070
Gene Ontology terms:
GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
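
The genome location of this record follows a chromosome:start..end pattern. As a minimal sketch, the string can be split into its parts and the span of the locus computed. The function name parse_location is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr9:21086647..21101144")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 14498 bp, which is much longer than the coding sequence listed further down and would be consistent with a gene model that contains introns.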


Homology
GenBank top hits (e value, % identity, alignment)
XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo] (e value: 4.8e-189; % identity: 81.95)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        M D  E+RQ  RRRLILENFL+REECRELEFIHKSCCTVGYRP+V STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP++ISPF GDCVMYTADS NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
         LHDRF NSC+P PPSCNMYWFSPEEDPNFKFG D+CWARLHALGYDIYFP D+D S+YP LFS  VQLV   KIFFQ+F NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        D+TN   +S YA YLSPK N GVSYFKSEFSKN  LA+SVFSSA+S  KE Q WLGW KL   A AAWEDYAS LR ELL S  HWR  QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

XP_022155191.1 uncharacterized protein LOC111022327 [Momordica charantia] (e value: 1.5e-238; % identity: 100)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata] (e value: 9.7e-190; % identity: 81.91)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE E  Q  RRRL LENFLT EECRELEFIHKSCCTVGYRP VFSTTLLHLV +NSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSYGVDF GGLFHFQDGEPK+ISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRD+SHDEDAKL+SLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDR P+SC+P PPSCNMYWFSP++DPNFKFG D+CWARLHALGY IYFP+D+ LS+YP LFS  VQLVR  KIF Q+F +ILHALQVVQF+ WKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        DSTN K +SSYA  LSPK N GV YFKSEFSK+  LA+SVF  ASSD KEKQ  LGWAKLA   AAWEDYASNLR ELLRS  HWRT+QS+Y V  GS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo] (e value: 8.8e-191; % identity: 82.41)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE E  Q  RRRLILENFLT EECRELEFIHKSCCTVGYRP VFSTTLLHLV +NSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSYGVDF GGLFHFQDGEPK+ISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDR P+SC+P PPSCNMYWFSP++DPNFKFG D+CWARLHALGY IYFP+D+ LS+YP LFS  VQLVR  KIF Q+F +ILHALQVVQF+ WKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        DSTN K +SSYA  LSPK N GV YFKSEFSK+  LA+SVF  ASSD KEKQ  LGWAKLA   AAWEDYASNLR ELLRS  HWRT+QS+Y V  GS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida] (e value: 2.6e-195; % identity: 83.17)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDEVE+R++RRRRLILENFLTREECRELEFIHKSCCTVGYRP+VFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+F+AVCYLNSYGV+F GGLFHFQDGEP++ISPFCGDCVMYTADS NVHSVDEITNGERLTLTLW TRD+SHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDR P+S +P PPSCNMYWFS E+DPNFK G D+CWARLHALGYDIYF  D+  S+YP LFS  VQLV+  K+FFQEF NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        DSTN K +SSYA YLSPK N GVSYFKSEFSK+ VLA+SVFSSA+SD KE Q WLGW KLA A AAWEDYAS LR ELL SL +WR +QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C486 Procollagen-proline 3-dioxygenase (e value: 2.3e-189; % identity: 81.95)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        M D  E+RQ  RRRLILENFL+REECRELEFIHKSCCTVGYRP+V STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP++ISPF GDCVMYTADS NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
         LHDRF NSC+P PPSCNMYWFSPEEDPNFKFG D+CWARLHALGYDIYFP D+D S+YP LFS  VQLV   KIFFQ+F NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        D+TN   +S YA YLSPK N GVSYFKSEFSKN  LA+SVFSSA+S  KE Q WLGW KL   A AAWEDYAS LR ELL S  HWR  QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase (e value: 2.3e-189; % identity: 81.95)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        M D  E+RQ  RRRLILENFL+REECRELEFIHKSCCTVGYRP+V STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP++ISPF GDCVMYTADS NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
         LHDRF NSC+P PPSCNMYWFSPEEDPNFKFG D+CWARLHALGYDIYFP D+D S+YP LFS  VQLV   KIFFQ+F NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        D+TN   +S YA YLSPK N GVSYFKSEFSKN  LA+SVFSSA+S  KE Q WLGW KL   A AAWEDYAS LR ELL S  HWR  QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase (e value: 3.1e-189; % identity: 81.7)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        M D  E+RQ  RRRLILENFL+REECRELEFIHKSCCTVGYRP+V STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP++ISPF GDCVMY ADS NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
         LHDRFPNSC+P PPSCNMYWFSPE+DPNFKFG D+CWARLHALGYDIYFP D+D S+YP LFS  VQLV   KIFFQ+F NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        D+TN   +S YA YLSPK N GVSYFKSEFSKN  LA+SVFSSA+S  KE Q WLGW KL   A AAWEDYAS LR ELL S  HWR  QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

A0A6J1DQY8 Procollagen-proline 3-dioxygenase (e value: 7.2e-239; % identity: 100)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

A0A6J1FRP8 Procollagen-proline 3-dioxygenase (e value: 4.7e-190; % identity: 81.91)
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE E  Q  RRRL LENFLT EECRELEFIHKSCCTVGYRP VFSTTLLHLV +NSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSYGVDF GGLFHFQDGEPK+ISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRD+SHDEDAKL+SLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDR P+SC+P PPSCNMYWFSP++DPNFKFG D+CWARLHALGY IYFP+D+ LS+YP LFS  VQLVR  KIF Q+F +ILHALQVVQF+ WKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
        DSTN K +SSYA  LSPK N GV YFKSEFSK+  LA+SVF  ASSD KEKQ  LGWAKLA   AAWEDYASNLR ELLRS  HWRT+QS+Y V  GS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS

SwissProt top hits (e value, % identity, alignment)
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 (e value: 2.5e-07; % identity: 31.76)
Query:  WHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQD-GEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSH
        WH   ++      ++T++ YL+ Y  DF GG F F D G  +++ P  G    +T+ S N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQD-GEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSH

Arabidopsis top hits (e value, % identity, alignment)
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 9.0e-117; % identity: 55.53)
Query:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD
        ++   RLIL NFL+  EC+ELE IHKS  T+GYRP+VFSTTL HL+ATNS HLI+PFV IRERLKEK EE FGCEYELF+EFTGLISW +GASIGWHSDD
Subjt:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD

Query:  NRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN
        NR YLKQR+F AVCYLNSY  DF GGLF FQ GEP +++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RD+SHDED+KLLS LSQ   H+    
Subjt:  NRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN

Query:  SCIPLPPSCNMYWFSPEED-PNFKFGIDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFK
         C+PLP S NMYWF P +D  N   G DVC ARLH LG+D++  +  D S D  +     +QL +  K+  ++F NILHALQVVQF  WK  EL ++N +
Subjt:  SCIPLPPSCNMYWFSPEED-PNFKFGIDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFK

Query:  GNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRV
         ++   V  +S      ++  KS F  +  L  + F  + S E  K   L    +A AV +WE+Y+  L  ELL SLP W+T Q++++V
Subjt:  GNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.1e-90; % identity: 47.68)
Query:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD
        ++   RLIL NFL+  EC+ELE IHKS  T+GYRP+VFSTTL HL+ATNS HLI+PFV IRERLKEK EE FGCEYELF+EFTGLISW +GASIGWHSDD
Subjt:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD

Query:  NRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN
        NR YLKQR+F +                    GEP +++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RD+SHDED+KLLS LSQ         
Subjt:  NRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN

Query:  SCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKG
                                  DVC ARLH LG+D++  +  D S D  +     +QL +  K+  ++F NILHALQVVQF  WK  EL ++N + 
Subjt:  SCIPLPPSCNMYWFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKG

Query:  NSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRV
        ++   V  +S      ++  KS F  +  L  + F  + S E  K   L    +A AV +WE+Y+  L  ELL SLP W+T Q++++V
Subjt:  NSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.9e-103; % identity: 51.41)
Query:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD
        ++   RLIL NFL+  EC+ELE IHKS  T+GYRP+VFSTTL HL+ATNS HLI+PFV IRERLKEK EE FGCEYELF+EFTGLISW +GASIGWHSDD
Subjt:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD

Query:  NRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN
        NR YLKQR+F +                    GEP +++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RD+SHDED+KLLS LSQ   H+    
Subjt:  NRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN

Query:  SCIPLPPSCNMYWFSPEED-PNFKFGIDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFK
         C+PLP S NMYWF P +D  N   G DVC ARLH LG+D++  +  D S D  +     +QL +  K+  ++F NILHALQVVQF  WK  EL ++N +
Subjt:  SCIPLPPSCNMYWFSPEED-PNFKFGIDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFK

Query:  GNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRV
         ++   V  +S      ++  KS F  +  L  + F  + S E  K   L    +A AV +WE+Y+  L  ELL SLP W+T Q++++V
Subjt:  GNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 2.1e-49; % identity: 45.68)
Query:  MYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPNSCIPLPPSCNMYWFSPEED-PNFKFGIDVCWARLHALGYDIYFPRD
        MYTAD  N+HSVDE+T+GERLTL LWF+RD+SHDED+KLLS LSQ   H+     C+PLP S NMYWF P +D  N   G DVC ARLH LG+D++  + 
Subjt:  MYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPNSCIPLPPSCNMYWFSPEED-PNFKFGIDVCWARLHALGYDIYFPRD

Query:  YDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKGNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEK
         D S D  +     +QL +  K+  ++F NILHALQVVQF  WK  EL ++N + ++   V  +S      ++  KS F  +  L  + F  + S E  K
Subjt:  YDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKGNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEK

Query:  QPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRV
           L    +A AV +WE+Y+  L  ELL SLP W+T Q++++V
Subjt:  QPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRV
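
The alignments above use the usual BLAST-style layout: in the middle line a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. As a rough sketch, the identical and positive columns of a single Query/midline block can be counted as below. The helper name identity_stats is hypothetical. The % identity values reported in this record cover each whole alignment, so a per-block count will not necessarily reproduce them.

```python
def identity_stats(query, midline):
    """Count identical and positive columns in one BLAST-style block.

    Returns (identical, positives, columns), where positives include
    the identical columns plus those marked '+'.
    """
    # Scraped text may lose trailing spaces, so pad the midline
    # to the query length before counting.
    midline = midline.ljust(len(query))[:len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# Toy example with made-up residues, not taken from the record.
ident, pos, length = identity_stats("MKTAYIAK", "MK AY+AK")
print(f"{ident}/{length} identical, {pos}/{length} positive")
```

Summing the three counts over every block of one hit and dividing identical by columns gives a percent identity that can be compared with the reported value.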


Sequences
CDS sequence
ATGGGAGACGAAGTGGAGAACAGGCAGCAGCGGCGGCGGCGTCTGATCCTCGAAAATTTCTTAACCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTG
TACGGTGGGTTATAGACCCAGCGTCTTCTCCACCACACTTTTGCATCTTGTAGCCACTAATTCTGCTCATTTGATCATGCCTTTCGTTCCGATCAGAGAGAGGTTGAAGG
AGAAGGCGGAGGAATTCTTTGGCTGTGAATATGAACTCTTCGTCGAGTTCACTGGCTTGATCAGCTGGACGAGAGGGGCAAGCATTGGATGGCATAGTGATGATAACAGG
CCCTATTTAAAACAACGTGAATTTACAGCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTCAGAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAATCTATCTC
TCCTTTTTGTGGAGATTGTGTGATGTACACAGCCGACAGCCACAACGTTCATTCTGTAGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGCG
ATAATTCCCATGATGAAGATGCAAAACTACTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTCCCAACTCATGTATACCTCTGCCACCGTCCTGTAATATGTAT
TGGTTTTCACCCGAAGAAGATCCAAATTTCAAGTTCGGTATTGATGTATGTTGGGCGAGACTGCATGCGCTCGGATACGACATTTATTTTCCTCGGGACTATGATTTGTC
AGATTATCCAAAATTATTCTCAGACTACGTGCAATTAGTACGGAAAAAGAAGATATTCTTTCAGGAATTTTATAACATTTTGCATGCGCTTCAGGTAGTGCAGTTTATGT
GTTGGAAAGGCAAAGAACTAGATTCTACTAACTTCAAGGGGAATTCAAGCTATGCAGTATATTTATCTCCAAAGGGGAATGCGGGAGTCAGTTACTTTAAGTCCGAGTTT
TCGAAGAATGCTGTACTAGCCAAGTCAGTCTTCTCGTCTGCTAGTTCCGATGAAAAGGAGAAGCAACCCTGGTTGGGGTGGGCTAAGCTTGCTACTGCTGTAGCAGCTTG
GGAAGATTATGCTTCCAATTTAAGGACAGAACTCCTTAGGAGCTTGCCCCATTGGAGAACCAATCAATCCATGTACAGAGTTTCACTCGGTAGTTGA
mRNA sequence
ATGGGAGACGAAGTGGAGAACAGGCAGCAGCGGCGGCGGCGTCTGATCCTCGAAAATTTCTTAACCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTG
TACGGTGGGTTATAGACCCAGCGTCTTCTCCACCACACTTTTGCATCTTGTAGCCACTAATTCTGCTCATTTGATCATGCCTTTCGTTCCGATCAGAGAGAGGTTGAAGG
AGAAGGCGGAGGAATTCTTTGGCTGTGAATATGAACTCTTCGTCGAGTTCACTGGCTTGATCAGCTGGACGAGAGGGGCAAGCATTGGATGGCATAGTGATGATAACAGG
CCCTATTTAAAACAACGTGAATTTACAGCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTCAGAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAATCTATCTC
TCCTTTTTGTGGAGATTGTGTGATGTACACAGCCGACAGCCACAACGTTCATTCTGTAGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGCG
ATAATTCCCATGATGAAGATGCAAAACTACTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTCCCAACTCATGTATACCTCTGCCACCGTCCTGTAATATGTAT
TGGTTTTCACCCGAAGAAGATCCAAATTTCAAGTTCGGTATTGATGTATGTTGGGCGAGACTGCATGCGCTCGGATACGACATTTATTTTCCTCGGGACTATGATTTGTC
AGATTATCCAAAATTATTCTCAGACTACGTGCAATTAGTACGGAAAAAGAAGATATTCTTTCAGGAATTTTATAACATTTTGCATGCGCTTCAGGTAGTGCAGTTTATGT
GTTGGAAAGGCAAAGAACTAGATTCTACTAACTTCAAGGGGAATTCAAGCTATGCAGTATATTTATCTCCAAAGGGGAATGCGGGAGTCAGTTACTTTAAGTCCGAGTTT
TCGAAGAATGCTGTACTAGCCAAGTCAGTCTTCTCGTCTGCTAGTTCCGATGAAAAGGAGAAGCAACCCTGGTTGGGGTGGGCTAAGCTTGCTACTGCTGTAGCAGCTTG
GGAAGATTATGCTTCCAATTTAAGGACAGAACTCCTTAGGAGCTTGCCCCATTGGAGAACCAATCAATCCATGTACAGAGTTTCACTCGGTAGTTGA
Protein sequence
MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNR
PYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPNSCIPLPPSCNMY
WFSPEEDPNFKFGIDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKGNSSYAVYLSPKGNAGVSYFKSEF
SKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWRTNQSMYRVSLGS
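
The protein sequence should be the translation of the CDS sequence listed above it. The sketch below shows one way to check this, assuming the standard nuclear genetic code (NCBI translation table 1); the record does not name a translation table, and the function name translate is hypothetical. Only the first 30 bases of the CDS are used here to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS, stopping at the first stop codon."""
    # Remove line breaks and spaces left over from the wrapped listing.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGGAGACGAAGTGGAGAACAGGCAGCAG"))  # -> MGDEVENRQQ
```

Passing the full CDS listing, joined across its wrapped lines, should give a string that can be compared directly with the protein sequence. Ambiguous bases such as N are not handled and would raise a KeyError.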