CuGenDBv2

Lag0041443 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041443
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Procollagen-proline 3-dioxygenase
Genome location: chr13:17925432..17937515
RNA-Seq Expression: Lag0041443
Synteny: Lag0041443
Gene Ontology terms:
GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
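The genome location field can be split into chromosome, start, and end for downstream use. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr13:17925432..17937515")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # chr13 17925432 17937515 12084
```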


Homology
GenBank top hits (e value, % identity, alignment)
XP_022155191.1 uncharacterized protein LOC111022327 [Momordica charantia] (e value 4.8e-205, 87.19% identity)
Query:  MGDEAVSRQ--RRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE  +RQ  RRRLILENFLT EECRELEFIHKSCCTVGYRP+VFSTTLLHLVA+NSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEAVSRQ--RRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+AVCYLNSYGVDF GGLFHFQDGEPK+ISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  DLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKEL
         LHDRFP+SC+P PPSCNMYWFSPE+DPNFKFG DVCWARLHALGY+IYFP D+ LS+YP LFS  VQLVR+KKIFFQEF +ILHALQVVQF+CWKGKEL
Subjt:  DLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKEL

Query:  DSTNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        DSTNFKG+SSYA YLSPK N GVSYFKSEFSK+  LA+SVFSSASSD KEKQ W GWAKLA A AAWED+ASNLR ELLRSL HWR +QS+Y VSL S
Subjt:  DSTNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata] (e value 2.8e-205, 88.89% identity)
Query:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI
        MGDEA   QRRRL LENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL
        GWHSDDNRPYLKQREF+AVCYLNSYGVDF GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKL+SLLSQS L
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVVQFL WKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS

Query:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        TN K DSSYAE LSPKRNVGV YFKSEFSKDDALAESVF  ASSD KEKQH  GWAKLAA AAAWED+ASNLRRELLRS +HWR SQSIYSV   S
Subjt:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

XP_022989994.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita maxima] (e value 5.8e-203, 88.13% identity)
Query:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI
        MGDEA   QR RLILENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSA LIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL
        GWHSDDNRPYLKQREF+AVCYLNSYGVDF GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVVQFL WKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS

Query:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        T+ K DSSYAE LSPKRNVGV +FKSEFSKDDALAESVF  ASSD KEKQH  GWAKLAA A AWED+ASNLRRELLRS +HWR SQSIYSV   S
Subjt:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo] (e value 1.6e-205, 89.14% identity)
Query:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI
        MGDEA   QRRRLILENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL
        GWHSDDNRPYLKQREF+AVCYLNSYGVDF GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVVQFL WKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS

Query:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        TN K DSSYAE LSPKRNVGV YFKSEFSKD+ALAESVF  ASSD KEKQH  GWAKLAA AAAWED+ASNLRRELLRS  HWR SQSIYSV   S
Subjt:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida] (e value 1.9e-206, 88.44% identity)
Query:  MGDEAVS--RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE  S  R+RRRLILENFLT EECRELEFIHKSCCTVGYRPNVFSTTLLHLVA+NSAHLIMPFVPIRERLKEKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MGDEAVS--RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+FSAVCYLNSYGV+FGGGLFHFQDGEP+TISPFCGDCVMYTADS NVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  DLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKEL
         LHDR PDS LPQPPSCNMYWFS EDDPNFK GFD+CWARLHALGY+IYF  DHS SEYPDLFS++VQLV+  K+FFQEF++ILH LQVVQFLCWKGKEL
Subjt:  DLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKEL

Query:  DSTNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        DSTN K DSSYAEYLSPKRNVGVSYFKSEFSKDD LAESVFSSA+SDGKE QHW GW KLAAAAAAWED+AS LRRELL SLS+WR SQSIYSVSL+S
Subjt:  DSTNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SSL8 Procollagen-proline 3-dioxygenase (e value 1.8e-202, 86.9% identity)
Query:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI
        M D A SRQRRRLILENFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVA+NSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL
        GWHSDDNRPYLKQREFSAVCYLNSYGV+FGGGLFHFQDGEP+TISPF GDCVMYTADS NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFD+CWARLHALGY+IYFP DH  SEYPDLFSQ+VQLV   KIFFQ+F++ILH LQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS

Query:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKL-AAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        TN   DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSA+S GKE QHW GW KL  AAAAAWED+AS LRRELL S SHWR  QSIYSVSL S
Subjt:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKL-AAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase (e value 3.7e-203, 87.15% identity)
Query:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI
        M D A SRQRRRLILENFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVA+NSAHLI+PFVPIRE+LKEKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL
        GWHSDDNRPYLKQREFSAVCYLNSYGV+FGGGLFHFQDGEP+TISPF GDCVMY ADS NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFD+CWARLHALGY+IYFP DH  SEYPDLFSQ+VQLV   KIFFQ+F++ILH LQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS

Query:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKL-AAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        TN   DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSA+S GKE QHW GW KL  AAAAAWED+AS LRRELL S SHWR  QSIYSVSL S
Subjt:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKL-AAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

A0A6J1DQY8 Procollagen-proline 3-dioxygenase (e value 2.3e-205, 87.19% identity)
Query:  MGDEAVSRQ--RRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
        MGDE  +RQ  RRRLILENFLT EECRELEFIHKSCCTVGYRP+VFSTTLLHLVA+NSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA
Subjt:  MGDEAVSRQ--RRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQREF+AVCYLNSYGVDF GGLFHFQDGEPK+ISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  DLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKEL
         LHDRFP+SC+P PPSCNMYWFSPE+DPNFKFG DVCWARLHALGY+IYFP D+ LS+YP LFS  VQLVR+KKIFFQEF +ILHALQVVQF+CWKGKEL
Subjt:  DLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKEL

Query:  DSTNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        DSTNFKG+SSYA YLSPK N GVSYFKSEFSK+  LA+SVFSSASSD KEKQ W GWAKLA A AAWED+ASNLR ELLRSL HWR +QS+Y VSL S
Subjt:  DSTNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

A0A6J1FRP8 Procollagen-proline 3-dioxygenase (e value 1.4e-205, 88.89% identity)
Query:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI
        MGDEA   QRRRL LENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL
        GWHSDDNRPYLKQREF+AVCYLNSYGVDF GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKL+SLLSQS L
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVVQFL WKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS

Query:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        TN K DSSYAE LSPKRNVGV YFKSEFSKDDALAESVF  ASSD KEKQH  GWAKLAA AAAWED+ASNLRRELLRS +HWR SQSIYSV   S
Subjt:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

A0A6J1JQV6 Procollagen-proline 3-dioxygenase (e value 2.8e-203, 88.13% identity)
Query:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI
        MGDEA   QR RLILENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNSA LIMPFV IRERLKEKAEEFFGCEYELFVEFTGLISWTRGA I
Subjt:  MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL
        GWHSDDNRPYLKQREF+AVCYLNSYGVDF GGLFHFQDGEPKTISP CGDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS
        HDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVVQFL WKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDS

Query:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
        T+ K DSSYAE LSPKRNVGV +FKSEFSKDDALAESVF  ASSD KEKQH  GWAKLAA A AWED+ASNLRRELLRS +HWR SQSIYSV   S
Subjt:  TNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS

SwissProt top hits (e value, % identity, alignment)
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 (e value 2.3e-08, 32.94% identity)
Query:  WHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQD-GEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSH
        WH   ++      +++++ YL+ Y  DFGGG F F D G  +T+ P  G    +T+ S N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQD-GEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSH
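The % identity column can be recomputed from an alignment block. The sketch below assumes the usual BLAST-style convention: the middle line carries a letter at identical positions, `+` at conservative substitutions, and a space elsewhere, and identity is taken over all alignment columns, including gap columns. The function name `percent_identity` is hypothetical. Applied to the Q5XGE0 block above, it gives 28 identities over 85 columns.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent identity from a query row and its alignment midline.

    Assumes letters in the midline mark identical residues, and that the
    alignment length equals the length of the query row (gaps included).
    """
    identities = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identities / len(query_row), 2)


# Query row and midline copied from the Q5XGE0 alignment in this record.
query = ("WHSDDNRPYLKQREFSAVCYLNSYGVDFGGGLFHFQD-GEPKTISPFCGDCVMYTADSHN"
         "VHSVDEITNGERLTLTLWFTRDSSH")
midline = ("WH   ++      +++++ YL+ Y  DFGGG F F D G  +T+ P  G    +T+ S N"
           "+H V++++ G R  +T+ FT +  H")

print(percent_identity(query, midline))  # 32.94
```

The alignment length is taken from the query row, not the midline, so trailing whitespace lost from the midline during copying does not change the result.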

Arabidopsis top hits (e value, % identity, alignment)
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value 9.9e-116, 55.01% identity)
Query:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+A+NS HLI+PFV IRERLKEK EE FGCEYELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDS
        R YLKQR+F+AVCYLNSY  DF GGLF FQ GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFK
        CLP P S NMYWF P +D  N   GFDVC ARLH LG++++     DHS      L    +QL +  K+  ++F +ILHALQVVQF  WK  EL ++N +
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFK

Query:  GDS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSV
         D+    + +S  +   ++  KS F  D+ L  + F  + S G++++       +A A  +WE+++  L +ELL SL  W+  Q+I+ V
Subjt:  GDS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value 1.4e-90, 47.16% identity)
Query:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+A+NS HLI+PFV IRERLKEK EE FGCEYELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDS
        R YLKQR+F++                    GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ          
Subjt:  RPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDS

Query:  CLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKG
                                FDVC ARLH LG++++     DHS      L    +QL +  K+  ++F +ILHALQVVQF  WK  EL ++N + 
Subjt:  CLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKG

Query:  DS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSV
        D+    + +S  +   ++  KS F  D+ L  + F  + S G++++       +A A  +WE+++  L +ELL SL  W+  Q+I+ V
Subjt:  DS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value 1.2e-102, 50.9% identity)
Query:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+A+NS HLI+PFV IRERLKEK EE FGCEYELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDS
        R YLKQR+F++                    GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFK
        CLP P S NMYWF P +D  N   GFDVC ARLH LG++++     DHS      L    +QL +  K+  ++F +ILHALQVVQF  WK  EL ++N +
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFK

Query:  GDS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSV
         D+    + +S  +   ++  KS F  D+ L  + F  + S G++++       +A A  +WE+++  L +ELL SL  W+  Q+I+ V
Subjt:  GDS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value 2.6e-47, 44.26% identity)
Query:  MYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDVCWARLHALGYNIYF--P
        MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFDVC ARLH LG++++    
Subjt:  MYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDVCWARLHALGYNIYF--P

Query:  CDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKE
         DHS      L    +QL +  K+  ++F +ILHALQVVQF  WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F  + S G++
Subjt:  CDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSSASSDGKE

Query:  KQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSV
        ++       +A A  +WE+++  L +ELL SL  W+  Q+I+ V
Subjt:  KQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSV


Sequences
CDS sequence
ATGGGAGACGAAGCGGTGAGCAGGCAGCGGCGGCGTCTGATTCTCGAAAATTTCCTGACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGTTGCTGTACGGT
GGGTTATAGACCAAATGTCTTCTCCACCACTCTTTTGCATCTTGTCGCCTCTAATTCTGCTCACTTGATCATGCCTTTTGTTCCGATCAGAGAGAGGTTGAAGGAGAAAG
CGGAGGAATTCTTTGGCTGTGAATATGAACTCTTCGTTGAGTTCACTGGCTTGATAAGCTGGACCAGGGGTGCAAGCATTGGATGGCATAGTGACGATAACCGGCCCTAT
CTGAAACAACGTGAATTTTCAGCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTTGGAGGTGGGCTGTTTCACTTTCAGGATGGGGAACCAAAAACCATCTCGCCTTT
TTGTGGAGATTGTGTGATGTACACGGCCGACAGCCACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGTT
CCCATGATGAAGATGCCAAACTGCTTTCGCTTCTTTCACAAAGCGATTTACATGATCGTTTTCCTGACTCGTGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTT
TCACCAGAAGACGATCCAAATTTCAAGTTCGGTTTTGATGTCTGTTGGGCGAGACTGCACGCACTTGGATACAACATTTATTTTCCTTGTGATCATAGTTTGTCAGAGTA
TCCAGATTTATTCTCACAGGAAGTACAATTAGTACGGGAAAAGAAGATATTCTTTCAGGAATTTGATAGCATTTTGCATGCACTTCAGGTAGTGCAATTCCTGTGTTGGA
AAGGCAAAGAACTGGATTCTACTAACTTCAAGGGGGATTCAAGCTATGCAGAATATTTATCTCCAAAGAGGAATGTGGGTGTCAGTTACTTTAAATCCGAGTTTTCGAAG
GACGATGCACTGGCCGAGTCAGTCTTCTCGTCTGCTAGTTCTGATGGCAAGGAGAAGCAACACTGGTTCGGGTGGGCTAAGCTTGCTGCTGCTGCAGCAGCTTGGGAAGA
TCATGCTTCCAACCTAAGGAGAGAACTCCTTCGGAGCTTGAGCCATTGGAGAGCCAGTCAATCCATATACAGTGTTTCACTTGCTAGTTGA
mRNA sequence
ATGGGAGACGAAGCGGTGAGCAGGCAGCGGCGGCGTCTGATTCTCGAAAATTTCCTGACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGTTGCTGTACGGT
GGGTTATAGACCAAATGTCTTCTCCACCACTCTTTTGCATCTTGTCGCCTCTAATTCTGCTCACTTGATCATGCCTTTTGTTCCGATCAGAGAGAGGTTGAAGGAGAAAG
CGGAGGAATTCTTTGGCTGTGAATATGAACTCTTCGTTGAGTTCACTGGCTTGATAAGCTGGACCAGGGGTGCAAGCATTGGATGGCATAGTGACGATAACCGGCCCTAT
CTGAAACAACGTGAATTTTCAGCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTTGGAGGTGGGCTGTTTCACTTTCAGGATGGGGAACCAAAAACCATCTCGCCTTT
TTGTGGAGATTGTGTGATGTACACGGCCGACAGCCACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGTT
CCCATGATGAAGATGCCAAACTGCTTTCGCTTCTTTCACAAAGCGATTTACATGATCGTTTTCCTGACTCGTGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTT
TCACCAGAAGACGATCCAAATTTCAAGTTCGGTTTTGATGTCTGTTGGGCGAGACTGCACGCACTTGGATACAACATTTATTTTCCTTGTGATCATAGTTTGTCAGAGTA
TCCAGATTTATTCTCACAGGAAGTACAATTAGTACGGGAAAAGAAGATATTCTTTCAGGAATTTGATAGCATTTTGCATGCACTTCAGGTAGTGCAATTCCTGTGTTGGA
AAGGCAAAGAACTGGATTCTACTAACTTCAAGGGGGATTCAAGCTATGCAGAATATTTATCTCCAAAGAGGAATGTGGGTGTCAGTTACTTTAAATCCGAGTTTTCGAAG
GACGATGCACTGGCCGAGTCAGTCTTCTCGTCTGCTAGTTCTGATGGCAAGGAGAAGCAACACTGGTTCGGGTGGGCTAAGCTTGCTGCTGCTGCAGCAGCTTGGGAAGA
TCATGCTTCCAACCTAAGGAGAGAACTCCTTCGGAGCTTGAGCCATTGGAGAGCCAGTCAATCCATATACAGTGTTTCACTTGCTAGTTGA
Protein sequence
MGDEAVSRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVASNSAHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLISWTRGASIGWHSDDNRPY
LKQREFSAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWF
SPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFSK
DDALAESVFSSASSDGKEKQHWFGWAKLAAAAAAWEDHASNLRRELLRSLSHWRASQSIYSVSLAS
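The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base; the names `CODON_TABLE` and `translate` are hypothetical. Only the first 36 and last 9 nucleotides of the CDS are used here to keep the example short; the full sequence can be passed in the same way.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    # Codons with ambiguous bases are reported as 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


first = translate("ATGGGAGACGAAGCGGTGAGCAGGCAGCGGCGGCGT")  # CDS start
last = translate("GCTAGTTGA")                              # CDS end
print(first)  # MGDEAVSRQRRR
print(last)   # AS*
```

Both fragments agree with the protein sequence above, which begins MGDEAVSRQRRR and ends in AS before the stop codon.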