; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G001290 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G001290
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationchr11:1449358..1454427
RNA-Seq ExpressionLsi11G001290
SyntenyLsi11G001290
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR039575 - Prolyl 3-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa]8.6e-18581.86Show/hide
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS                                DCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN QSIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus]3.3e-18481.57Show/hide
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS                                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRFPDSCLPQPPSCNMYWFSPEDDPNF FGFDICWARL ALGYD+YFPGDH FSEYPDLF Q VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS
        TN+  DSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS   DGKENQ WLGW KL AAAAAWE YASILRRELLGS SHWRN QSIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]4.3e-18481.61Show/hide
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS                                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN QSIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS

XP_008456833.1 PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo]4.4e-18988.77Show/hide
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGF
        GWHSDDNRPYLKQREFSDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDRF +SCLPQPPSCNMYWFSPE+DPNF FGF
Subjt:  GWHSDDNRPYLKQREFSDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGF

Query:  DICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDD
        DICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D
Subjt:  DICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDD

Query:  VLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS
         LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN QSIYS SL S
Subjt:  VLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]2.0e-18984.42Show/hide
Query:  MEDEVES--RQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGA
        M DEVES  R+R+RLIL NFLT EECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLI PFVPIRERL+EKAEEFFGC YELFVEFTGLISWTRGA
Subjt:  MEDEVES--RQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+FS                                DCVMYTADSDNVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKEL
        HLHDR PDS LPQPPSCNMYWFS EDDPNF  GFDICWARLHALGYDIYF GDHSFSEYPDLFS+ VQLV+GNKLFFQEF NILHLLQVVQFLCWKGKEL
Subjt:  HLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKEL

Query:  DSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS
        DSTNIK DSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSS T DGKENQHWLGW KLAAAAAAWEDYASILRRELLGSLS+WRNSQSIYS SL S
Subjt:  DSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS

TrEMBL top hitse value%identityAlignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase1.6e-18178.59Show/hide
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLIS-------
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLIS       
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLIS-------

Query:  --------WTRGASIGWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSH
                WTRGASIGWHSDDNRPYLKQREFS                                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSH
Subjt:  --------WTRGASIGWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSH

Query:  DEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLL
        DEDAKLLSLLSQS LHDRFPDSCLPQPPSCNMYWFSPEDDPNF FGFDICWARL ALGYD+YFPGDH FSEYPDLF Q VQLV G+K+FFQ+F NILHLL
Subjt:  DEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLL

Query:  QVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRN
        QVVQFLCWKGKELDSTN+  DSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS   DGKENQ WLGW KL AAAAAWE YASILRRELLGS SHWRN
Subjt:  QVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRN

Query:  SQSIYSASLVS
         QSIYS SL S
Subjt:  SQSIYSASLVS

A0A1S3C486 Procollagen-proline 3-dioxygenase2.1e-18481.61Show/hide
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS                                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN QSIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS

A0A1S3C4U3 uncharacterized protein LOC103496668 isoform X22.1e-18988.77Show/hide
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGF
        GWHSDDNRPYLKQREFSDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDRF +SCLPQPPSCNMYWFSPE+DPNF FGF
Subjt:  GWHSDDNRPYLKQREFSDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGF

Query:  DICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDD
        DICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D
Subjt:  DICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDD

Query:  VLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS
         LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN QSIYS SL S
Subjt:  VLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase2.1e-18481.61Show/hide
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS                                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN QSIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase4.2e-18581.86Show/hide
Query:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI
        M D  ESRQR+RLIL NFL+ EECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI PFVPIRE+L+EKAEEFFGC YELFVEFTGLISWTRGASI
Subjt:  MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL
        GWHSDDNRPYLKQREFS                                DCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS L
Subjt:  GWHSDDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNF FGFDICWARLHALGYDIYFPGDH FSEYPDLFSQ VQLV G+K+FFQ+F NILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDS

Query:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS
        TN+  DS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSS T  GKENQHWLGW KL  AAAAAWEDYASILRRELLGS SHWRN QSIYS SL S
Subjt:  TNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKL-AAAAAAWEDYASILRRELLGSLSHWRNSQSIYSASLVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.3e-10049.1Show/hide
Query:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS
        +  ++  RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI PFV IRERL+EK EE FGC+YELF+EFTGLISW +GASIGWHS
Subjt:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS

Query:  DDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRF
        DDNR YLKQR+F+                                D +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H   
Subjt:  DDNRPYLKQREFS--------------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRF

Query:  PDSCLPQPPSCNMYWFSP-EDDPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTN
         + CLP P S NMYWF P +D  N N GFD+C ARLH LG+D++   G+   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N
Subjt:  PDSCLPQPPSCNMYWFSP-EDDPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTN

Query:  IKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIY
        ++ D+    + +S  +   ++  KS F  D+ L  + F  +   G++ +  L    +A A  +WE+Y+  L +ELL SL  W+  Q+I+
Subjt:  IKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIY

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-8947.55Show/hide
Query:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS
        +  ++  RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI PFV IRERL+EK EE FGC+YELF+EFTGLISW +GASIGWHS
Subjt:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS

Query:  DDNRPYLKQREFS------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPED
        DDNR YLKQR+F+            D +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ                           
Subjt:  DDNRPYLKQREFS------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPED

Query:  DPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGVS
               FD+C ARLH LG+D++   G+   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N++ D+    + +S  +   ++
Subjt:  DPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGVS

Query:  YFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIY
          KS F  D+ L  + F  +   G++ +  L    +A A  +WE+Y+  L +ELL SL  W+  Q+I+
Subjt:  YFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIY

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.6e-10251.76Show/hide
Query:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS
        +  ++  RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI PFV IRERL+EK EE FGC+YELF+EFTGLISW +GASIGWHS
Subjt:  VESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHS

Query:  DDNRPYLKQREFS------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-E
        DDNR YLKQR+F+            D +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +
Subjt:  DDNRPYLKQREFS------------DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-E

Query:  DDPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGV
        D  N N GFD+C ARLH LG+D++   G+   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N++ D+    + +S  +   +
Subjt:  DDPNFNFGFDICWARLHALGYDIY-FPGDHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGV

Query:  SYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIY
        +  KS F  D+ L  + F  +   G++ +  L    +A A  +WE+Y+  L +ELL SL  W+  Q+I+
Subjt:  SYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIY

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.6e-5144.81Show/hide
Query:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFNFGFDICWARLHALGYDIY-FPG
        MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N N GFD+C ARLH LG+D++   G
Subjt:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFNFGFDICWARLHALGYDIY-FPG

Query:  DHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKEN
        +   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N++ D+    + +S  +   ++  KS F  D+ L  + F  +   G++ 
Subjt:  DHSFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKEN

Query:  QHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIY
        +  L    +A A  +WE+Y+  L +ELL SL  W+  Q+I+
Subjt:  QHWLGWAKLAAAAAAWEDYASILRRELLGSLSHWRNSQSIY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACGAAGTGGAGAGCAGGCAGCGGCAGCGTCTGATTCTGGGAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATTCATAAGAGCTGCTGTACGGT
GGGTTATAGACCAAACGTCTTCTCCACCACTCTTTTGCATCTTGTTGCCACTAATTCTGCTCATTTGATCACGCCTTTTGTTCCGATTAGAGAGAGGTTGAGGGAGAAAG
CGGAGGAATTCTTTGGCTGTGATTATGAACTCTTTGTCGAATTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGACGATAACCGGCCCTAC
CTAAAACAACGTGAATTTTCAGATTGTGTGATGTACACGGCCGACAGCGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTT
CACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTCCTGACTCGTGCCTACCTCAGCCTCCGTCCTGTA
ATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTCAACTTCGGTTTTGATATATGTTGGGCGAGACTGCATGCGCTTGGATATGACATCTATTTTCCTGGGGATCAT
AGTTTTTCAGAGTATCCAGATTTATTCTCACAGGGCGTACAATTAGTACGGGGAAATAAGTTATTCTTTCAGGAATTTGTGAACATCTTGCATTTGCTTCAGGTAGTGCA
GTTCCTGTGTTGGAAAGGCAAAGAACTGGATTCTACTAACATCAAGGGGGATTCAAGCTATGCAGAATATTTATCCCCAAAGAGGAATGTGGGAGTCAGTTACTTTAAAT
CTGAGTTTTCGAAGGACGATGTATTGGCCGAATCGGTCTTCTCATCTACTACTTATGATGGCAAGGAGAACCAACACTGGTTGGGGTGGGCCAAGCTTGCTGCTGCTGCA
GCAGCTTGGGAAGATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGGAGCTTGAGCCATTGGAGAAACAGTCAATCTATATACAGTGCTTCACTTGTTAGCTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAAAAAGGGCGCGCATAAACTTCTCGCAGATGAAGAACAAGATCACTTTCGTACCGAGAAGAGCTTCATCAATGGCGCTGGGTATTGGAGCTCCTACGCGCT
ACATCGATATTGGATATTTCGTGTGCTTGTAGCTGGGATATCACTTTTGACGCTAGAATTTGGATAGTCAAGGGCATTTTGTTCGATTACTGCCATTGAAACGAGAGAGA
CGGAGAATTGGACGAAAATGGAAGACGAAGTGGAGAGCAGGCAGCGGCAGCGTCTGATTCTGGGAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATTCAT
AAGAGCTGCTGTACGGTGGGTTATAGACCAAACGTCTTCTCCACCACTCTTTTGCATCTTGTTGCCACTAATTCTGCTCATTTGATCACGCCTTTTGTTCCGATTAGAGA
GAGGTTGAGGGAGAAAGCGGAGGAATTCTTTGGCTGTGATTATGAACTCTTTGTCGAATTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTG
ACGATAACCGGCCCTACCTAAAACAACGTGAATTTTCAGATTGTGTGATGTACACGGCCGACAGCGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTT
ACACTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACATGATCGTTTTCCTGACTCGTGCCTACC
TCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGATGATCCAAATTTCAACTTCGGTTTTGATATATGTTGGGCGAGACTGCATGCGCTTGGATATGACATCT
ATTTTCCTGGGGATCATAGTTTTTCAGAGTATCCAGATTTATTCTCACAGGGCGTACAATTAGTACGGGGAAATAAGTTATTCTTTCAGGAATTTGTGAACATCTTGCAT
TTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAACTGGATTCTACTAACATCAAGGGGGATTCAAGCTATGCAGAATATTTATCCCCAAAGAGGAATGTGGG
AGTCAGTTACTTTAAATCTGAGTTTTCGAAGGACGATGTATTGGCCGAATCGGTCTTCTCATCTACTACTTATGATGGCAAGGAGAACCAACACTGGTTGGGGTGGGCCA
AGCTTGCTGCTGCTGCAGCAGCTTGGGAAGATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGGAGCTTGAGCCATTGGAGAAACAGTCAATCTATATACAGTGCTTCA
CTTGTTAGCTGAATCTCCCACTTGTGGGAAAGTGGCATCCCAAACTGAGCCAGTATTTCTATTTTTAGTTTTTATTCTTTTAATAACTTTCAAAGTCAATCCTAGATCTT
TAGATTTTAGTTTCCCTTTAGTCATGTAATTAGGTGTGACTTAGTATATAAGCTTTTATAGATGCCTAGGAACAAGGTTGTTGAGGATAAATAAATTATTGGTATTGTGC
ATGAATGCTATCAGAACTTTATAGAGAGAAATTTTGTTTGTATTTAGTTTACTGTATTAATCATGCCAACCTCATGTAATAGATGAAAGACAAAAAAAAATTATAATAAA
TTTTAAATTTTGATTG
Protein sequenceShow/hide protein sequence
MEDEVESRQRQRLILGNFLTLEECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLITPFVPIRERLREKAEEFFGCDYELFVEFTGLISWTRGASIGWHSDDNRPY
LKQREFSDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRFPDSCLPQPPSCNMYWFSPEDDPNFNFGFDICWARLHALGYDIYFPGDH
SFSEYPDLFSQGVQLVRGNKLFFQEFVNILHLLQVVQFLCWKGKELDSTNIKGDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSTTYDGKENQHWLGWAKLAAAA
AAWEDYASILRRELLGSLSHWRNSQSIYSASLVS