CuGenDBv2

HG10006842 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10006842
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Procollagen-proline 3-dioxygenase
Genome location: Chr07:22483824..22488176
RNA-Seq Expression: HG10006842
Synteny: HG10006842
Gene Ontology terms:
GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains: IPR039575 - Prolyl 3-hydroxylase


Homology
GenBank top hits (e value, % identity, alignment)
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa]  e value: 8.0e-186  % identity: 82.37
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS        G  F G +                DCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN +SIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus]  e value: 3.0e-185  % identity: 82.07
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS        G  F G +                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRFPDSCLPQPPSCNMYWFSPEDDPNF FGFDICWARL ALGYD+YFPGDH FSEYPDLF Q VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
        TN+  DSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS   DGKENQ WLGW KL AAAAAWE YASILRRELLGS SHWRN +SIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]  e value: 4.0e-185  % identity: 82.12
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS        G  F G +                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN +SIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS

XP_008456833.1 PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo]  e value: 2.1e-186  % identity: 85.9
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSVRGPIFPGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFS
        GWHSDDNRPYLKQREFS           DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDRF +SCLPQPPSCNMYWFS
Subjt:  GWHSDDNRPYLKQREFSVRGPIFPGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFS

Query:  PEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGV
        PE+DPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+TN+  DS YAEYLSPKRNVGV
Subjt:  PEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGV

Query:  SYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
        SYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN +SIYS SL S
Subjt:  SYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]  e value: 1.8e-190  % identity: 84.92
Query:  MEDEVES--RQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGA
        M DEVES  R+R+RLIL NFLT EECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLI PFVPIRERL+EKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MEDEVES--RQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+FS        G  F G +                DCVMYTADSDNVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKEL
        HLHDR PDS LPQPPSCNMYWFS EDDPNF  GFDICWARLHALGYDIYF GDHSFSEYPDLFS+ VQLV+GNKLFFQEF NILHLLQVVQFLCWKGKEL
Subjt:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKEL

Query:  DSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
        DSTNIK DSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSS T DGKENQHWLGW KLAAAAAAWEDYASILRRELLGSLS+WRNS+SIYS SL S
Subjt:  DSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KMN7 Procollagen-proline 3-dioxygenase  e value: 1.5e-182  % identity: 79.08
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLIS-------
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLIS       
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLIS-------

Query:  --------WTRGASIGWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSH
                WTRGASIGWHSDDNRPYLKQREFS        G  F G +                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSH
Subjt:  --------WTRGASIGWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSH

Query:  DEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLL
        DEDAKLLSLLSQS LHDRFPDSCLPQPPSCNMYWFSPEDDPNF FGFDICWARL ALGYD+YFPGDH FSEYPDLF Q VQLV G+K+FFQ+F NILHLL
Subjt:  DEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLL

Query:  QVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRN
        QVVQFLCWKGKELDSTN+  DSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS   DGKENQ WLGW KL AAAAAWE YASILRRELLGS SHWRN
Subjt:  QVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRN

Query:  SKSIYSASLVS
         +SIYS SL S
Subjt:  SKSIYSASLVS

A0A1S3C486 Procollagen-proline 3-dioxygenase  e value: 1.9e-185  % identity: 82.12
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS        G  F G +                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN +SIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS

A0A1S3C4U3 uncharacterized protein LOC103496668 isoform X2  e value: 1.0e-186  % identity: 85.9
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSVRGPIFPGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFS
        GWHSDDNRPYLKQREFS           DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDRF +SCLPQPPSCNMYWFS
Subjt:  GWHSDDNRPYLKQREFSVRGPIFPGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFS

Query:  PEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGV
        PE+DPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+TN+  DS YAEYLSPKRNVGV
Subjt:  PEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGV

Query:  SYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
        SYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN +SIYS SL S
Subjt:  SYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase  e value: 1.9e-185  % identity: 82.12
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS        G  F G +                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN +SIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase  e value: 3.9e-186  % identity: 82.37
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS        G  F G +                DCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFSV------RGPIFPGKV---------------NDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN +SIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein  e value: 1.1e-100  % identity: 49.36
Query:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS
        +  ++  RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI PFV IRERL+EK EE FGC+YELF+EFTGLISW +GASIGWHS
Subjt:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS

Query:  DDNRPYLKQREFSV-----------RGPIF----------PGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRF
        DDNR YLKQR+F+             G +F               D +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H   
Subjt:  DDNRPYLKQREFSV-----------RGPIF----------PGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRF

Query:  PDSCLPQPPSCNMYWFSP-EDDPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTN
         + CLP P S NMYWF P +D  N N GFD+C ARLH LG+D++   G+   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N
Subjt:  PDSCLPQPPSCNMYWFSP-EDDPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTN

Query:  IKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIY
        ++ D+    + +S  +   ++  KS F  D+ L  + F  +   G++ +  L    +A A  +WE+Y+  L +ELL SL  W+  ++I+
Subjt:  IKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIY

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein  e value: 4.7e-91  % identity: 47.55
Query:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS
        +  ++  RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI PFV IRERL+EK EE FGC+YELF+EFTGLISW +GASIGWHS
Subjt:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS

Query:  DDNRPYLKQREFSVRGPI-FPGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPED
        DDNR YLKQR+F+   P+       D +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ                           
Subjt:  DDNRPYLKQREFSVRGPI-FPGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPED

Query:  DPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGVS
               FD+C ARLH LG+D++   G+   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N++ D+    + +S  +   ++
Subjt:  DPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGVS

Query:  YFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIY
          KS F  D+ L  + F  +   G++ +  L    +A A  +WE+Y+  L +ELL SL  W+  ++I+
Subjt:  YFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIY

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein  e value: 1.4e-103  % identity: 51.76
Query:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS
        +  ++  RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI PFV IRERL+EK EE FGC+YELF+EFTGLISW +GASIGWHS
Subjt:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS

Query:  DDNRPYLKQREFSVRGPI-FPGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-E
        DDNR YLKQR+F+   P+       D +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +
Subjt:  DDNRPYLKQREFSVRGPI-FPGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-E

Query:  DDPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGV
        D  N N GFD+C ARLH LG+D++   G+   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N++ D+    + +S  +   +
Subjt:  DDPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGV

Query:  SYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIY
        +  KS F  D+ L  + F  +   G++ +  L    +A A  +WE+Y+  L +ELL SL  W+  ++I+
Subjt:  SYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIY

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein  e value: 1.1e-50  % identity: 44.4
Query:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFNFGFDICWARLHALGYDIY-FPG
        MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N N GFD+C ARLH LG+D++   G
Subjt:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFNFGFDICWARLHALGYDIY-FPG

Query:  DHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKEN
        +   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N++ D+    + +S  +   ++  KS F  D+ L  + F  +   G++ 
Subjt:  DHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKEN

Query:  QHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIY
        +  L    +A A  +WE+Y+  L +ELL SL  W+  ++I+
Subjt:  QHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIY


Sequences
CDS sequence
ATGGAAGACGAAGTGGAGAGCAGGCAGCGGCAGCGTCTGATTCTGGGAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATTCATAAGAGCTGCTGTACGGT
GGGTTATAGACCAAACGTCTTCTCCACCACTCTTTTGCATCTTGTTGCCACTAATTCTGCTCATTTGATCACGCCTTTTGTTCCGATTAGAGAGAGGTTGAGGGAGAAAG
CGGAGGAATTCTTTGGCTGTGATTATGAACTCTTTGTCGAATTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGACGATAACCGGCCCTAC
CTAAAACAACGTGAATTTTCAGTACGTGGACCAATCTTTCCTGGGAAAGTCAACGATTGTGTGATGTACACGGCCGACAGCGACAATGTTCATTCTGTTGATGAGATAAC
CAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTC
CTGACTCGTGCCTACCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTCAACTTCGGTTTTGATATATGTTGGGCGAGACTGCATGCG
CTTGGATATGACATCTATTTTCCTGGGGATCATAGTTTTTCAGAGTATCCAGATTTATTCTCACAGGGCGTACAATTAGTACGGGGAAATAAGTTATTCTTTCAGGAATT
TGTGAACATCTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAACTGGATTCTACTAACATCAAGGGGGATTCAAGCTATGCAGAATATTTATCCC
CAAAGAGGAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGGACGATGTATTGGCCGAATCGGTCTTCTCATCTACTACTTATGATGGCAAGGAGAACCAACAC
TGGTTGGGGTGGGCCAAGCTTGCTGCTGCTGCAGCAGCTTGGGAAGATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGGAGCTTGAGCCATTGGAGAAACAGTAAATC
TATATACAGTGCTTCACTTGTTAGCTGA
mRNA sequence
ATGGAAGACGAAGTGGAGAGCAGGCAGCGGCAGCGTCTGATTCTGGGAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATTCATAAGAGCTGCTGTACGGT
GGGTTATAGACCAAACGTCTTCTCCACCACTCTTTTGCATCTTGTTGCCACTAATTCTGCTCATTTGATCACGCCTTTTGTTCCGATTAGAGAGAGGTTGAGGGAGAAAG
CGGAGGAATTCTTTGGCTGTGATTATGAACTCTTTGTCGAATTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGACGATAACCGGCCCTAC
CTAAAACAACGTGAATTTTCAGTACGTGGACCAATCTTTCCTGGGAAAGTCAACGATTGTGTGATGTACACGGCCGACAGCGACAATGTTCATTCTGTTGATGAGATAAC
CAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTC
CTGACTCGTGCCTACCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTCAACTTCGGTTTTGATATATGTTGGGCGAGACTGCATGCG
CTTGGATATGACATCTATTTTCCTGGGGATCATAGTTTTTCAGAGTATCCAGATTTATTCTCACAGGGCGTACAATTAGTACGGGGAAATAAGTTATTCTTTCAGGAATT
TGTGAACATCTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAACTGGATTCTACTAACATCAAGGGGGATTCAAGCTATGCAGAATATTTATCCC
CAAAGAGGAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGGACGATGTATTGGCCGAATCGGTCTTCTCATCTACTACTTATGATGGCAAGGAGAACCAACAC
TGGTTGGGGTGGGCCAAGCTTGCTGCTGCTGCAGCAGCTTGGGAAGATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGGAGCTTGAGCCATTGGAGAAACAGTAAATC
TATATACAGTGCTTCACTTGTTAGCTGA
Protein sequence
MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHSDDNRPY
LKQREFSVRGPIFPGKVNDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHA
LGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQH
WLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSKSIYSASLVS
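
The CDS and protein sequences listed in this record are related by translation. The following is a minimal sketch of that step, assuming the standard nuclear genetic code; the function name `translate` and the table construction are illustrative and are not part of CuGenDB. It reads the CDS codon by codon and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# (Assumption: the gene uses the standard nuclear code.)
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon."""
    # Drop line breaks and other whitespace so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed in this record.
print(translate("ATGGAAGACGAAGTGGAGAGC"))  # MEDEVES
```

Passing the full CDS from this record to `translate` should, under the same assumption, produce the listed protein sequence; the final codons GTT AGC TGA give V, S and a stop.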