; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008886 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008886
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationscaffold629:336642..351112
RNA-Seq ExpressionMS008886
SyntenyMS008886
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]1.8e-18881.45Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        M D  E+RQ  RRRLILENFL+REECRELEFIHKSCCTVGYRP+V STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+ +CYLNSYGV+F GGLFHFQDGEP++ISPF GDCVMYTADS NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
         LHDRF NSC+P PPSCNMYWFSPEEDPNFKFGFD+CWARLHALGYDIYFP D+D S+YP LFS  VQLV   KIFFQ+F NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        D+TN   +S YA YLSPK N GVSYFKSEFSKN  LA+SVFSSA+S  KE Q WLGW KL   A AAWEDYAS LR ELL S  HW+  QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

XP_022155191.1 uncharacterized protein LOC111022327 [Momordica charantia]8.1e-23798.99Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREFT +CYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFG DVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHW+TNQSMYRVSLGS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata]3.7e-18981.41Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE E  Q  RRRL LENFLT EECRELEFIHKSCCTVGYRP VFSTTLLHLV +NSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFT +CYLNSYGVDF GGLFHFQDGEPK+ISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRD+SHDEDAKL+SLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDR P+SC+P PPSCNMYWFSP++DPNFKFGFD+CWARLHALGY IYFP+D+ LS+YP LFS  VQLVR  KIF Q+F +ILHALQVVQF+ WKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        DSTN K +SSYA  LSPK N GV YFKSEFSK+  LA+SVF  ASSD KEKQ  LGWAKLA   AAWEDYASNLR ELLRS  HW+T+QS+Y V  GS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo]3.3e-19081.91Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE E  Q  RRRLILENFLT EECRELEFIHKSCCTVGYRP VFSTTLLHLV +NSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFT +CYLNSYGVDF GGLFHFQDGEPK+ISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDR P+SC+P PPSCNMYWFSP++DPNFKFGFD+CWARLHALGY IYFP+D+ LS+YP LFS  VQLVR  KIF Q+F +ILHALQVVQF+ WKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        DSTN K +SSYA  LSPK N GV YFKSEFSK+  LA+SVF  ASSD KEKQ  LGWAKLA   AAWEDYASNLR ELLRS  HW+T+QS+Y V  GS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]1.0e-19482.66Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDEVE+R++RRRRLILENFLTREECRELEFIHKSCCTVGYRP+VFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+F+ +CYLNSYGV+F GGLFHFQDGEP++ISPFCGDCVMYTADS NVHSVDEITNGERLTLTLW TRD+SHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDR P+S +P PPSCNMYWFS E+DPNFK GFD+CWARLHALGYDIYF  D+  S+YP LFS  VQLV+  K+FFQEF NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        DSTN K +SSYA YLSPK N GVSYFKSEFSK+ VLA+SVFSSA+SD KE Q WLGW KLA A AAWEDYAS LR ELL SL +W+ +QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

TrEMBL top hitse value%identityAlignment
A0A1S3C486 Procollagen-proline 3-dioxygenase8.9e-18981.45Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        M D  E+RQ  RRRLILENFL+REECRELEFIHKSCCTVGYRP+V STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+ +CYLNSYGV+F GGLFHFQDGEP++ISPF GDCVMYTADS NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
         LHDRF NSC+P PPSCNMYWFSPEEDPNFKFGFD+CWARLHALGYDIYFP D+D S+YP LFS  VQLV   KIFFQ+F NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        D+TN   +S YA YLSPK N GVSYFKSEFSKN  LA+SVFSSA+S  KE Q WLGW KL   A AAWEDYAS LR ELL S  HW+  QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase8.9e-18981.45Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        M D  E+RQ  RRRLILENFL+REECRELEFIHKSCCTVGYRP+V STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+ +CYLNSYGV+F GGLFHFQDGEP++ISPF GDCVMYTADS NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
         LHDRF NSC+P PPSCNMYWFSPEEDPNFKFGFD+CWARLHALGYDIYFP D+D S+YP LFS  VQLV   KIFFQ+F NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        D+TN   +S YA YLSPK N GVSYFKSEFSKN  LA+SVFSSA+S  KE Q WLGW KL   A AAWEDYAS LR ELL S  HW+  QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase1.2e-18881.2Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        M D  E+RQ  RRRLILENFL+REECRELEFIHKSCCTVGYRP+V STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+ +CYLNSYGV+F GGLFHFQDGEP++ISPF GDCVMY ADS NVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
         LHDRFPNSC+P PPSCNMYWFSPE+DPNFKFGFD+CWARLHALGYDIYFP D+D S+YP LFS  VQLV   KIFFQ+F NILH LQVVQF+CWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        D+TN   +S YA YLSPK N GVSYFKSEFSKN  LA+SVFSSA+S  KE Q WLGW KL   A AAWEDYAS LR ELL S  HW+  QS+Y VSL S
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKL-ATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

A0A6J1DQY8 Procollagen-proline 3-dioxygenase3.9e-23798.99Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREFT +CYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFG DVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHW+TNQSMYRVSLGS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

A0A6J1FRP8 Procollagen-proline 3-dioxygenase1.8e-18981.41Show/hide
Query:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE E  Q  RRRL LENFLT EECRELEFIHKSCCTVGYRP VFSTTLLHLV +NSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFT +CYLNSYGVDF GGLFHFQDGEPK+ISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRD+SHDEDAKL+SLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQS

Query:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL
        HLHDR P+SC+P PPSCNMYWFSP++DPNFKFGFD+CWARLHALGY IYFP+D+ LS+YP LFS  VQLVR  KIF Q+F +ILHALQVVQF+ WKGKEL
Subjt:  HLHDRFPNSCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKEL

Query:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS
        DSTN K +SSYA  LSPK N GV YFKSEFSK+  LA+SVF  ASSD KEKQ  LGWAKLA   AAWEDYASNLR ELLRS  HW+T+QS+Y V  GS
Subjt:  DSTNFKGNSSYAVYLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS

SwissProt top hitse value%identityAlignment
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 31.9e-0732.94Show/hide
Query:  WHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQD-GEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSH
        WH   ++      ++T L YL+ Y  DF GG F F D G  +++ P  G    +T+ S N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFTVLCYLNSYGVDFRGGLFHFQD-GEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSH

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.9e-11755.53Show/hide
Query:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD
        ++   RLIL NFL+  EC+ELE IHKS  T+GYRP+VFSTTL HL+ATNS HLI+PFV IRERLKEK EE FGCEYELF+EFTGLISW +GASIGWHSDD
Subjt:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD

Query:  NRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN
        NR YLKQR+F  +CYLNSY  DF GGLF FQ GEP +++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RD+SHDED+KLLS LSQ   H+    
Subjt:  NRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN

Query:  SCIPLPPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFK
         C+PLP S NMYWF P +D  N   GFDVC ARLH LG+D++  +  D S D  +     +QL +  K+  ++F NILHALQVVQF  WK  EL ++N +
Subjt:  SCIPLPPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFK

Query:  GNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRV
         ++   V  +S      ++  KS F  +  L  + F  + S E  K   L    +A AV +WE+Y+  L  ELL SLP WKT Q++++V
Subjt:  GNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.7e-9148.2Show/hide
Query:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD
        ++   RLIL NFL+  EC+ELE IHKS  T+GYRP+VFSTTL HL+ATNS HLI+PFV IRERLKEK EE FGCEYELF+EFTGLISW +GASIGWHSDD
Subjt:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD

Query:  NRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN
        NR YLKQR+F                      GEP +++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RD+SHDED+KLLS LSQ         
Subjt:  NRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN

Query:  SCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKG
                                 FDVC ARLH LG+D++  +  D S D  +     +QL +  K+  ++F NILHALQVVQF  WK  EL ++N + 
Subjt:  SCIPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKG

Query:  NSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRV
        ++   V  +S      ++  KS F  +  L  + F  + S E  K   L    +A AV +WE+Y+  L  ELL SLP WKT Q++++V
Subjt:  NSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.0e-10451.93Show/hide
Query:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD
        ++   RLIL NFL+  EC+ELE IHKS  T+GYRP+VFSTTL HL+ATNS HLI+PFV IRERLKEK EE FGCEYELF+EFTGLISW +GASIGWHSDD
Subjt:  QQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDD

Query:  NRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN
        NR YLKQR+F                      GEP +++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RD+SHDED+KLLS LSQ   H+    
Subjt:  NRPYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPN

Query:  SCIPLPPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFK
         C+PLP S NMYWF P +D  N   GFDVC ARLH LG+D++  +  D S D  +     +QL +  K+  ++F NILHALQVVQF  WK  EL ++N +
Subjt:  SCIPLPPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPRDYDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFK

Query:  GNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRV
         ++   V  +S      ++  KS F  +  L  + F  + S E  K   L    +A AV +WE+Y+  L  ELL SLP WKT Q++++V
Subjt:  GNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.5e-5046.5Show/hide
Query:  MYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPNSCIPLPPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPRD
        MYTAD  N+HSVDE+T+GERLTL LWF+RD+SHDED+KLLS LSQ   H+     C+PLP S NMYWF P +D  N   GFDVC ARLH LG+D++  + 
Subjt:  MYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPNSCIPLPPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPRD

Query:  YDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKGNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEK
         D S D  +     +QL +  K+  ++F NILHALQVVQF  WK  EL ++N + ++   V  +S      ++  KS F  +  L  + F  + S E  K
Subjt:  YDLS-DYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKGNSSYAV-YLSPKGNAGVSYFKSEFSKNAVLAKSVFSSASSDEKEK

Query:  QPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRV
           L    +A AV +WE+Y+  L  ELL SLP WKT Q++++V
Subjt:  QPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACGAAGTGGAGAACAGGCAGCAGCGGCGGCGGCGTCTGATCCTCGAAAATTTCTTAACCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTG
TACGGTGGGTTATAGACCCAGCGTCTTCTCCACCACACTTTTGCATCTTGTAGCCACTAATTCTGCTCATTTGATCATGCCTTTCGTTCCGATCAGAGAGAGGTTGAAGG
AGAAGGCGGAGGAATTCTTTGGCTGTGAATATGAACTCTTCGTCGAGTTCACTGGCTTGATCAGCTGGACGAGAGGGGCAAGCATTGGATGGCATAGTGATGATAACAGG
CCCTATTTAAAACAACGTGAATTTACAGTATTGTGTTACTTGAATAGTTATGGAGTAGATTTCAGAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAATCTATCTC
TCCTTTTTGTGGAGATTGTGTGATGTACACAGCCGACAGCCACAACGTTCATTCTGTAGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGCG
ATAATTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTCCCAACTCATGTATACCTCTGCCACCGTCCTGTAATATGTAT
TGGTTTTCACCCGAAGAAGATCCAAATTTCAAGTTCGGTTTTGATGTATGTTGGGCGAGACTGCATGCGCTCGGATACGACATTTATTTTCCTCGGGACTATGATTTGTC
AGATTATCCAAAATTATTCTCAGACTACGTGCAATTAGTACGGAAAAAGAAGATATTCTTTCAGGAATTTTATAACATTTTGCATGCGCTTCAGGTAGTGCAGTTTATGT
GTTGGAAAGGCAAAGAACTCGATTCTACTAACTTCAAGGGGAATTCAAGCTATGCAGTATATTTATCTCCAAAGGGGAATGCGGGAGTCAGTTACTTTAAGTCCGAGTTT
TCGAAGAATGCTGTACTAGCCAAGTCAGTCTTCTCGTCTGCTAGTTCCGATGAAAAGGAGAAGCAACCCTGGTTGGGGTGGGCTAAGCTTGCTACTGCTGTAGCAGCTTG
GGAAGATTATGCTTCCAATTTAAGGACAGAACTCCTTAGGAGCTTGCCCCATTGGAAAACCAATCAATCCATGTACAGAGTTTCACTCGGTAGT
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACGAAGTGGAGAACAGGCAGCAGCGGCGGCGGCGTCTGATCCTCGAAAATTTCTTAACCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTG
TACGGTGGGTTATAGACCCAGCGTCTTCTCCACCACACTTTTGCATCTTGTAGCCACTAATTCTGCTCATTTGATCATGCCTTTCGTTCCGATCAGAGAGAGGTTGAAGG
AGAAGGCGGAGGAATTCTTTGGCTGTGAATATGAACTCTTCGTCGAGTTCACTGGCTTGATCAGCTGGACGAGAGGGGCAAGCATTGGATGGCATAGTGATGATAACAGG
CCCTATTTAAAACAACGTGAATTTACAGTATTGTGTTACTTGAATAGTTATGGAGTAGATTTCAGAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAATCTATCTC
TCCTTTTTGTGGAGATTGTGTGATGTACACAGCCGACAGCCACAACGTTCATTCTGTAGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGCG
ATAATTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTCCCAACTCATGTATACCTCTGCCACCGTCCTGTAATATGTAT
TGGTTTTCACCCGAAGAAGATCCAAATTTCAAGTTCGGTTTTGATGTATGTTGGGCGAGACTGCATGCGCTCGGATACGACATTTATTTTCCTCGGGACTATGATTTGTC
AGATTATCCAAAATTATTCTCAGACTACGTGCAATTAGTACGGAAAAAGAAGATATTCTTTCAGGAATTTTATAACATTTTGCATGCGCTTCAGGTAGTGCAGTTTATGT
GTTGGAAAGGCAAAGAACTCGATTCTACTAACTTCAAGGGGAATTCAAGCTATGCAGTATATTTATCTCCAAAGGGGAATGCGGGAGTCAGTTACTTTAAGTCCGAGTTT
TCGAAGAATGCTGTACTAGCCAAGTCAGTCTTCTCGTCTGCTAGTTCCGATGAAAAGGAGAAGCAACCCTGGTTGGGGTGGGCTAAGCTTGCTACTGCTGTAGCAGCTTG
GGAAGATTATGCTTCCAATTTAAGGACAGAACTCCTTAGGAGCTTGCCCCATTGGAAAACCAATCAATCCATGTACAGAGTTTCACTCGGTAGT
Protein sequenceShow/hide protein sequence
MGDEVENRQQRRRRLILENFLTREECRELEFIHKSCCTVGYRPSVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNR
PYLKQREFTVLCYLNSYGVDFRGGLFHFQDGEPKSISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDNSHDEDAKLLSLLSQSHLHDRFPNSCIPLPPSCNMY
WFSPEEDPNFKFGFDVCWARLHALGYDIYFPRDYDLSDYPKLFSDYVQLVRKKKIFFQEFYNILHALQVVQFMCWKGKELDSTNFKGNSSYAVYLSPKGNAGVSYFKSEF
SKNAVLAKSVFSSASSDEKEKQPWLGWAKLATAVAAWEDYASNLRTELLRSLPHWKTNQSMYRVSLGS