; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018352 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018352
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationtig00153197:334511..338277
RNA-Seq ExpressionSgr018352
SyntenySgr018352
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155191.1 uncharacterized protein LOC111022327 [Momordica charantia]6.1e-20084.42Show/hide
Query:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA
        MGD+  +RQQ+ RRLILENFLTREEC ELEFIHKSCCTVGYRP V STTLLHLVA NSAHLIMPFVPIRERLKEK EEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPK++SPFCGDCVMY+ADSHNVHSVDEITNGERLTLTLWF+RD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS

Query:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL
        HLHD FP+SC+PLPPSCNMYWFSPEEDPNFKFG DVCWARLHALGYDIYFP+D  LSDY  LF+ YVQLV  K+IF  +F N+LHALQVVQF+CWKGKEL
Subjt:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL

Query:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS
        DSTNFKG++SYAV LSPKGN  VSYFKSEFSK+ VLA+S+FS ASS  +EKQ  LGW+KL  A A WEDYASNLR ELLRSL HWRTNQS+Y V L S
Subjt:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata]8.8e-19182.23Show/hide
Query:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA
        MGD+A   Q+  RRL LENFLT EEC ELEFIHKSCCTVGYRPYV STTLLHLV +NSAHLIMPFV IRERLKEK EEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSYGVDF GGLFHFQDGEPKT+SP CGDCVMY+ADS NVHSVDE+T+GERLTLTLWF+RDSSHDEDAKL+SLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS

Query:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL
        HLHD  PDSCLP PPSCNMYWFSP++DPNFKFGFD+CWARLHALGY IYFPQD SLS+Y  LF+Q VQLV G +IF  KF ++LHALQVVQFL WKGKEL
Subjt:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL

Query:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV
        DSTN K D+SYA  LSPK NV V YFKSEFSKDD LAES+F +ASS  +EKQ RLGW+KL A AA WEDYASNLRRELLRS +HWRT+QSIYSV
Subjt:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV

XP_022989994.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita maxima]1.4e-18881.47Show/hide
Query:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA
        MGD+A   Q+   RLILENFLT EEC ELEFIHKSCCTVGYRPYV STTLLHLV +NSA LIMPFV IRERLKEK EEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSYGVDF GGLFHFQDGEPKT+SP CGDCVMY+ADS NVHSVDE+T+GERLTLTLWF+RDSSHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS

Query:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL
        HLHD  PDSCLP PPSCNMYWFSP++DPNFKFGFD+CWARLHALGY IYFPQD SLS+Y  LF+Q VQLV G +IF  KF ++LHALQVVQFL WKGKEL
Subjt:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL

Query:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV
        DST+ K D+SYA  LSPK NV V +FKSEFSKDD LAES+F +ASS  +EKQ RLGW+KL A A  WEDYASNLRRELLRS +HWRT+QSIYSV
Subjt:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo]5.2e-19182.49Show/hide
Query:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA
        MGD+A   Q+  RRLILENFLT EEC ELEFIHKSCCTVGYRPYV STTLLHLV +NSAHLIMPFV IRERLKEK EEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSYGVDF GGLFHFQDGEPKT+SP CGDCVMY+ADS NVHSVDE+T+GERLTLTLWF+RDSSHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS

Query:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL
        HLHD  PDSCLP PPSCNMYWFSP++DPNFKFGFD+CWARLHALGY IYFPQD SLS+Y  LF+Q VQLV G +IF  KF ++LHALQVVQFL WKGKEL
Subjt:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL

Query:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV
        DSTN K D+SYA  LSPK NV V YFKSEFSKD+ LAES+F +ASS  +EKQ RLGW+KL A AA WEDYASNLRRELLRS  HWRT+QSIYSV
Subjt:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]2.0e-18780.65Show/hide
Query:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA
        MGD+  SR+++ RRLILENFLTREEC ELEFIHKSCCTVGYRP V STTLLHLVA NSAHLIMPFVPIRERLKEK EEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+F+AVCYLNSYGV+F GGLFHFQDGEP+T+SPFCGDCVMY+ADS NVHSVDEITNGERLTLTLW +RDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS

Query:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL
        HLHD  PDS LP PPSCNMYWFS E+DPNFK GFD+CWARLHALGYDIYF  D S S+Y  LF++ VQLV G ++F  +F N+LH LQVVQFLCWKGKEL
Subjt:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL

Query:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS
        DSTN K D+SYA  LSPK NV VSYFKSEFSKDDVLAES+FS A+S G+E Q  LGW KL AAAA WEDYAS LRRELL SLS+WR +QSIYSV L S
Subjt:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS

TrEMBL top hitse value%identityAlignment
A0A5A7SSL8 Procollagen-proline 3-dioxygenase7.8e-18580.96Show/hide
Query:  GSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWH
        G+  +Q RRLILENFL+REEC ELEFIHKSCCTVGYRP VLSTTLLHLVA NSAHLI+PFVPIRE+LKEK EEFFGC YELFVEFTGLISWTRGASIGWH
Subjt:  GSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWH

Query:  SDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDC
        SDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP+T+SPF GDCVMY+ADS NVHSVDEITNGERLTLTLWF+RDSSHDEDAKLLSLLSQS LHD 
Subjt:  SDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDC

Query:  FPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNF
        F +SCLP PPSCNMYWFSPEEDPNFKFGFD+CWARLHALGYDIYFP D   S+Y  LF+Q VQLV G +IF  KF N+LH LQVVQFLCWKGKELD+TN 
Subjt:  FPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNF

Query:  KGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKL-TAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS
          D+ YA  LSPK NV VSYFKSEFSK+D LAES+FS A+SGG+E Q  LGW KL  AAAA WEDYAS LRRELL S SHWR  QSIYSV L S
Subjt:  KGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKL-TAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase3.5e-18580.96Show/hide
Query:  GSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWH
        G+  +Q RRLILENFL+REEC ELEFIHKSCCTVGYRP VLSTTLLHLVA NSAHLI+PFVPIRE+LKEK EEFFGC YELFVEFTGLISWTRGASIGWH
Subjt:  GSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWH

Query:  SDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDC
        SDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP+T+SPF GDCVMY ADS NVHSVDEITNGERLTLTLWF+RDSSHDEDAKLLSLLSQS LHD 
Subjt:  SDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDC

Query:  FPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNF
        FP+SCLP PPSCNMYWFSPE+DPNFKFGFD+CWARLHALGYDIYFP D   S+Y  LF+Q VQLV G +IF  KF N+LH LQVVQFLCWKGKELD+TN 
Subjt:  FPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNF

Query:  KGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKL-TAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS
          D+ YA  LSPK NV VSYFKSEFSK+D LAES+FS A+SGG+E Q  LGW KL  AAAA WEDYAS LRRELL S SHWR  QSIYSV L S
Subjt:  KGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKL-TAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS

A0A6J1DQY8 Procollagen-proline 3-dioxygenase2.9e-20084.42Show/hide
Query:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA
        MGD+  +RQQ+ RRLILENFLTREEC ELEFIHKSCCTVGYRP V STTLLHLVA NSAHLIMPFVPIRERLKEK EEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPK++SPFCGDCVMY+ADSHNVHSVDEITNGERLTLTLWF+RD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS

Query:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL
        HLHD FP+SC+PLPPSCNMYWFSPEEDPNFKFG DVCWARLHALGYDIYFP+D  LSDY  LF+ YVQLV  K+IF  +F N+LHALQVVQF+CWKGKEL
Subjt:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL

Query:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS
        DSTNFKG++SYAV LSPKGN  VSYFKSEFSK+ VLA+S+FS ASS  +EKQ  LGW+KL  A A WEDYASNLR ELLRSL HWRTNQS+Y V L S
Subjt:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS

A0A6J1FRP8 Procollagen-proline 3-dioxygenase4.3e-19182.23Show/hide
Query:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA
        MGD+A   Q+  RRL LENFLT EEC ELEFIHKSCCTVGYRPYV STTLLHLV +NSAHLIMPFV IRERLKEK EEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSYGVDF GGLFHFQDGEPKT+SP CGDCVMY+ADS NVHSVDE+T+GERLTLTLWF+RDSSHDEDAKL+SLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS

Query:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL
        HLHD  PDSCLP PPSCNMYWFSP++DPNFKFGFD+CWARLHALGY IYFPQD SLS+Y  LF+Q VQLV G +IF  KF ++LHALQVVQFL WKGKEL
Subjt:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL

Query:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV
        DSTN K D+SYA  LSPK NV V YFKSEFSKDD LAES+F +ASS  +EKQ RLGW+KL A AA WEDYASNLRRELLRS +HWRT+QSIYSV
Subjt:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV

A0A6J1JQV6 Procollagen-proline 3-dioxygenase6.8e-18981.47Show/hide
Query:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA
        MGD+A   Q+   RLILENFLT EEC ELEFIHKSCCTVGYRPYV STTLLHLV +NSA LIMPFV IRERLKEK EEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS
         IGWHSDDNRPYLKQREFTAVCYLNSYGVDF GGLFHFQDGEPKT+SP CGDCVMY+ADS NVHSVDE+T+GERLTLTLWF+RDSSHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQS

Query:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL
        HLHD  PDSCLP PPSCNMYWFSP++DPNFKFGFD+CWARLHALGY IYFPQD SLS+Y  LF+Q VQLV G +IF  KF ++LHALQVVQFL WKGKEL
Subjt:  HLHDCFPDSCLPLPPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKEL

Query:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV
        DST+ K D+SYA  LSPK NV V +FKSEFSKDD LAES+F +ASS  +EKQ RLGW+KL A A  WEDYASNLRRELLRS +HWRT+QSIYSV
Subjt:  DSTNFKGDASYAVDLSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV

SwissProt top hitse value%identityAlignment
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 33.3e-0731.76Show/hide
Query:  WHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQD-GEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSH
        WH   ++      ++T++ YL+ Y  DF GG F F D G  +TV P  G    +++ S N+H V++++ G R  +T+ F+ +  H
Subjt:  WHSDDNRPYLKQREFTAVCYLNSYGVDFRGGLFHFQD-GEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSH

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.9e-11957.4Show/hide
Query:  RLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNRPYL
        RLIL NFL+  EC ELE IHKS  T+GYRP V STTL HL+A NS HLI+PFV IRERLKEK+EE FGCEYELF+EFTGLISW +GASIGWHSDDNR YL
Subjt:  RLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNRPYL

Query:  KQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDCFPDSCLPL
        KQR+F AVCYLNSY  DF GGLF FQ GEP TV+P  GD +MY+AD  N+HSVDE+T+GERLTL LWFSRDSSHDED+KLLS LSQ   H+     CLPL
Subjt:  KQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDCFPDSCLPL

Query:  PPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPQ--DSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNFKGDAS
        P S NMYWF P +D  N   GFDVC ARLH LG+D++  Q  D S    E L    +QL  G ++   KF N+LHALQVVQF  WK  EL ++N + D  
Subjt:  PPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPQ--DSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNFKGDAS

Query:  YAVD-LSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV
          V  +S      ++  KS F  D+ L  + F ++ S GE+++  L  + +  A   WE+Y+  L +ELL SL  W+T Q+I+ V
Subjt:  YAVD-LSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.0e-9349.22Show/hide
Query:  RLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNRPYL
        RLIL NFL+  EC ELE IHKS  T+GYRP V STTL HL+A NS HLI+PFV IRERLKEK+EE FGCEYELF+EFTGLISW +GASIGWHSDDNR YL
Subjt:  RLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNRPYL

Query:  KQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDCFPDSCLPL
        KQR+F +                    GEP TV+P  GD +MY+AD  N+HSVDE+T+GERLTL LWFSRDSSHDED+KLLS LSQ              
Subjt:  KQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDCFPDSCLPL

Query:  PPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQ--DSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNFKGDASY
                            FDVC ARLH LG+D++  Q  D S    E L    +QL  G ++   KF N+LHALQVVQF  WK  EL ++N + D   
Subjt:  PPSCNMYWFSPEEDPNFKFGFDVCWARLHALGYDIYFPQ--DSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNFKGDASY

Query:  AVD-LSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV
         V  +S      ++  KS F  D+ L  + F ++ S GE+++  L  + +  A   WE+Y+  L +ELL SL  W+T Q+I+ V
Subjt:  AVD-LSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.2e-10653.25Show/hide
Query:  RLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNRPYL
        RLIL NFL+  EC ELE IHKS  T+GYRP V STTL HL+A NS HLI+PFV IRERLKEK+EE FGCEYELF+EFTGLISW +GASIGWHSDDNR YL
Subjt:  RLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNRPYL

Query:  KQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDCFPDSCLPL
        KQR+F +                    GEP TV+P  GD +MY+AD  N+HSVDE+T+GERLTL LWFSRDSSHDED+KLLS LSQ   H+     CLPL
Subjt:  KQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDCFPDSCLPL

Query:  PPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPQ--DSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNFKGDAS
        P S NMYWF P +D  N   GFDVC ARLH LG+D++  Q  D S    E L    +QL  G ++   KF N+LHALQVVQF  WK  EL ++N + D  
Subjt:  PPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPQ--DSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNFKGDAS

Query:  YAVD-LSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV
          V  +S      ++  KS F  D+ L  + F ++ S GE+++  L  + +  A   WE+Y+  L +ELL SL  W+T Q+I+ V
Subjt:  YAVD-LSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.2e-5247.54Show/hide
Query:  MYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDCFPDSCLPLPPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPQ-
        MY+AD  N+HSVDE+T+GERLTL LWFSRDSSHDED+KLLS LSQ   H+     CLPLP S NMYWF P +D  N   GFDVC ARLH LG+D++  Q 
Subjt:  MYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDCFPDSCLPLPPSCNMYWFSPEED-PNFKFGFDVCWARLHALGYDIYFPQ-

Query:  -DSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNFKGDASYAVD-LSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEE
         D S    E L    +QL  G ++   KF N+LHALQVVQF  WK  EL ++N + D    V  +S      ++  KS F  D+ L  + F ++ S GE+
Subjt:  -DSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNFKGDASYAVD-LSPKGNVAVSYFKSEFSKDDVLAESIFSFASSGGEE

Query:  KQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV
        ++  L  + +  A   WE+Y+  L +ELL SL  W+T Q+I+ V
Subjt:  KQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACAAAGCGGGGAGCAGGCAGCAGCAGCTGCGCCGTCTGATCCTCGAAAATTTCTTAACCCGCGAAGAATGTATGGAGCTGGAGTTTATCCATAAGAGCTGCTG
CACAGTGGGTTATAGACCCTACGTCTTATCCACCACTCTTTTGCATCTCGTTGCTGCTAATTCTGCTCATTTAATCATGCCTTTTGTTCCGATCAGAGAGAGGTTGAAGG
AGAAGGTGGAGGAATTCTTTGGCTGTGAATATGAACTCTTTGTCGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACAGG
CCCTATTTAAAACAACGTGAATTTACAGCAGTCTGTTACTTGAATAGTTATGGAGTTGATTTTAGAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAAACTGTCTC
GCCTTTTTGTGGAGATTGTGTGATGTACTCAGCCGACAGCCACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTAACACTGACATTATGGTTCAGCCGTG
ATAGTTCCCATGATGAGGATGCAAAACTTCTTTCCCTTCTCTCACAAAGCCATTTACATGATTGTTTTCCTGACTCGTGTCTACCTCTCCCACCATCCTGTAATATGTAT
TGGTTTTCACCAGAAGAAGATCCAAATTTCAAGTTCGGTTTTGATGTATGTTGGGCGAGACTGCATGCTCTCGGATACGACATTTATTTTCCTCAGGACTCTAGTTTGTC
AGATTACGAACATTTATTCACACAGTACGTGCAATTAGTATGCGGAAAGGAGATATTCTGTTTGAAATTTTGCAACGTTTTGCATGCACTTCAGGTAGTGCAGTTTCTGT
GTTGGAAAGGCAAAGAACTGGATTCTACTAACTTCAAGGGAGATGCAAGCTATGCGGTAGATTTATCTCCAAAGGGGAATGTGGCAGTCAGTTACTTTAAATCCGAGTTT
TCGAAGGACGATGTACTGGCCGAGTCAATCTTCTCGTTTGCTAGTTCTGGTGGTGAGGAGAAGCAACGCAGGTTGGGGTGGTCTAAGCTTACTGCTGCAGCTGCAGGTTG
GGAAGATTATGCTTCCAATTTAAGGAGAGAACTCCTTAGGAGCTTGTCCCATTGGAGAACCAATCAATCCATATACAGTGTTTTACTTTGTAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACAAAGCGGGGAGCAGGCAGCAGCAGCTGCGCCGTCTGATCCTCGAAAATTTCTTAACCCGCGAAGAATGTATGGAGCTGGAGTTTATCCATAAGAGCTGCTG
CACAGTGGGTTATAGACCCTACGTCTTATCCACCACTCTTTTGCATCTCGTTGCTGCTAATTCTGCTCATTTAATCATGCCTTTTGTTCCGATCAGAGAGAGGTTGAAGG
AGAAGGTGGAGGAATTCTTTGGCTGTGAATATGAACTCTTTGTCGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACAGG
CCCTATTTAAAACAACGTGAATTTACAGCAGTCTGTTACTTGAATAGTTATGGAGTTGATTTTAGAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAAACTGTCTC
GCCTTTTTGTGGAGATTGTGTGATGTACTCAGCCGACAGCCACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTAACACTGACATTATGGTTCAGCCGTG
ATAGTTCCCATGATGAGGATGCAAAACTTCTTTCCCTTCTCTCACAAAGCCATTTACATGATTGTTTTCCTGACTCGTGTCTACCTCTCCCACCATCCTGTAATATGTAT
TGGTTTTCACCAGAAGAAGATCCAAATTTCAAGTTCGGTTTTGATGTATGTTGGGCGAGACTGCATGCTCTCGGATACGACATTTATTTTCCTCAGGACTCTAGTTTGTC
AGATTACGAACATTTATTCACACAGTACGTGCAATTAGTATGCGGAAAGGAGATATTCTGTTTGAAATTTTGCAACGTTTTGCATGCACTTCAGGTAGTGCAGTTTCTGT
GTTGGAAAGGCAAAGAACTGGATTCTACTAACTTCAAGGGAGATGCAAGCTATGCGGTAGATTTATCTCCAAAGGGGAATGTGGCAGTCAGTTACTTTAAATCCGAGTTT
TCGAAGGACGATGTACTGGCCGAGTCAATCTTCTCGTTTGCTAGTTCTGGTGGTGAGGAGAAGCAACGCAGGTTGGGGTGGTCTAAGCTTACTGCTGCAGCTGCAGGTTG
GGAAGATTATGCTTCCAATTTAAGGAGAGAACTCCTTAGGAGCTTGTCCCATTGGAGAACCAATCAATCCATATACAGTGTTTTACTTTGTAGTTGA
Protein sequenceShow/hide protein sequence
MGDKAGSRQQQLRRLILENFLTREECMELEFIHKSCCTVGYRPYVLSTTLLHLVAANSAHLIMPFVPIRERLKEKVEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNR
PYLKQREFTAVCYLNSYGVDFRGGLFHFQDGEPKTVSPFCGDCVMYSADSHNVHSVDEITNGERLTLTLWFSRDSSHDEDAKLLSLLSQSHLHDCFPDSCLPLPPSCNMY
WFSPEEDPNFKFGFDVCWARLHALGYDIYFPQDSSLSDYEHLFTQYVQLVCGKEIFCLKFCNVLHALQVVQFLCWKGKELDSTNFKGDASYAVDLSPKGNVAVSYFKSEF
SKDDVLAESIFSFASSGGEEKQRRLGWSKLTAAAAGWEDYASNLRRELLRSLSHWRTNQSIYSVLLCS