; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0005346 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0005346
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationchr12:11302352..11310162
RNA-Seq ExpressionPI0005346
SyntenyPI0005346
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa]2.0e-17294.12Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
        LSLLSQ+PLHDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY
        CWKGKELD+TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SF+HWRNCQSIY
Subjt:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY

Query:  SVSLDS
        SVSLDS
Subjt:  SVSLDS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus]7.8e-17293.44Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSS DEDAKL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
        LSLLSQ+PLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL +LGYD+YFPGDH FSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFL
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYS
        CWKGKELDSTNL EDSSYAEYLSPKRNVGVSYFKSEFSKN GLAESVF SA SDG ENQ WLGWDKLV AAAAWE YASILRRELL SF+HWRNCQSIYS
Subjt:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYS

Query:  VSLDS
        VSLDS
Subjt:  VSLDS

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]1.0e-17193.79Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
        LSLLSQ+PLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY
        CWKGKELD+TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SF+HWRNCQSIY
Subjt:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY

Query:  SVSLDS
        SVSLDS
Subjt:  SVSLDS

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo]1.1e-14682.95Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + WTRGA IGWHSDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP+TISP  GDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSS DEDAKL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
        LSLLSQ+ LHDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARLH+LGY IYFP DH  SEYPDLFSQDVQLV G+KIF QKF++ILH LQVVQFL
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYS
         WKGKELDSTN KEDSSYAE LSPKRNVGV YFKSEFSK+  LAESVFL A+SD  E QH LGW KL   AAAWEDYAS LRRELLRSF HWR  QSIYS
Subjt:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYS

Query:  VSLDS
        V   S
Subjt:  VSLDS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]1.5e-15988.52Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + WTRGASIGWHSDDNRPYLKQR+FSAVCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTADSDNVHSVDEITNGERLTLTLW TRDSS DED+KL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
        LSLLSQ+ LHDR PDS LPQPPSCNMYWFS EDDPNFK GFDICWARLH+LGYDIYF GDH FSEYPDLFS+DVQLV G+K+FFQ+FENILHLLQVVQFL
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYS
        CWKGKELDSTN+KEDSSYAEYLSPKRNVGVSYFKSEFSK+  LAESVF SATSDG ENQHWLGWDKL  AAAAWEDYASILRRELL S ++WRN QSIYS
Subjt:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYS

Query:  VSLDS
        VSL S
Subjt:  VSLDS

TrEMBL top hitse value%identityAlignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase2.3e-17792.21Show/hide
Query:  LIEFTFQMTLQPSSSNLGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTL
        LI    +  LQPSSSNLGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTL
Subjt:  LIEFTFQMTLQPSSSNLGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTL

Query:  TLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFF
        TLWFTRDSS DEDAKLLSLLSQ+PLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL +LGYD+YFPGDH FSEYPDLF QDVQLVWGDKIFF
Subjt:  TLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFF

Query:  QKFENILHLLQVVQFLCWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRE
        QKFENILHLLQVVQFLCWKGKELDSTNL EDSSYAEYLSPKRNVGVSYFKSEFSKN GLAESVF SA SDG ENQ WLGWDKLV AAAAWE YASILRRE
Subjt:  QKFENILHLLQVVQFLCWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRE

Query:  LLRSFNHWRNCQSIYSVSLDS
        LL SF+HWRNCQSIYSVSLDS
Subjt:  LLRSFNHWRNCQSIYSVSLDS

A0A1S3C486 Procollagen-proline 3-dioxygenase4.9e-17293.79Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
        LSLLSQ+PLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY
        CWKGKELD+TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SF+HWRNCQSIY
Subjt:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY

Query:  SVSLDS
        SVSLDS
Subjt:  SVSLDS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase4.9e-17293.79Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
        LSLLSQ+PLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY
        CWKGKELD+TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SF+HWRNCQSIY
Subjt:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY

Query:  SVSLDS
        SVSLDS
Subjt:  SVSLDS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase9.9e-17394.12Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSS DEDAKL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
        LSLLSQ+PLHDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLH+LGYDIYFPGDH FSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY
        CWKGKELD+TNL EDS YAEYLSPKRNVGVSYFKSEFSKN GLAESVF SATS G ENQHWLGWDKLVV AAAAWEDYASILRRELL SF+HWRNCQSIY
Subjt:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVV-AAAAWEDYASILRRELLRSFNHWRNCQSIY

Query:  SVSLDS
        SVSLDS
Subjt:  SVSLDS

A0A6J1FRP8 Procollagen-proline 3-dioxygenase9.3e-14782.62Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + WTRGA IGWHSDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP+TISP  GDCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSS DEDAKL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL
        +SLLSQ+ LHDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARLH+LGY IYFP DH  SEYPDLFSQDVQLV G+KIF QKF++ILH LQVVQFL
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYS
         WKGKELDSTN KEDSSYAE LSPKRNVGV YFKSEFSK+  LAESVFL A+SD  E QH LGW KL   AAAWEDYAS LRRELLRSF HWR  QSIYS
Subjt:  CWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYS

Query:  VSLDS
        V   S
Subjt:  VSLDS

SwissProt top hitse value%identityAlignment
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 38.2e-0732.5Show/hide
Query:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFT
        WH   ++      +++++ YL+ Y  +FGGG F F D G   T+ P  G    +T+ S+N+H V++++ G R  +T+ FT
Subjt:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFT

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.9e-7647.56Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + W +GASIGWHSDDNR YLKQR+F+AVCYLNSY  +F GGLF FQ GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSS DED+KL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQ
        LS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQ
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQ

Query:  FLCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQS
        F  WK  EL ++N++ D+    + +S  +   ++  KS F  +  L  + F   +  G + +  L    + +A  +WE+Y+  L +ELL S   W+  Q+
Subjt:  FLCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQS

Query:  IYSVSLD
        I+ V  D
Subjt:  IYSVSLD

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-5138.24Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + W +GASIGWHSDDNR YLKQR+F++                    GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSS DED+KL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQF
        LS LSQ                 C                FD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQF

Query:  LCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSI
          WK  EL ++N++ D+    + +S  +   ++  KS F  +  L  + F   +  G + +  L    + +A  +WE+Y+  L +ELL S   W+  Q+I
Subjt:  LCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSI

Query:  YSVSLD
        + V  D
Subjt:  YSVSLD

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.3e-6342.67Show/hide
Query:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL
        + W +GASIGWHSDDNR YLKQR+F++                    GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSS DED+KL
Subjt:  LGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKL

Query:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQ
        LS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQ
Subjt:  LSLLSQNPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPGDHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQ

Query:  FLCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQS
        F  WK  EL ++N++ D+    + +S  +   ++  KS F  +  L  + F   +  G + +  L    + +A  +WE+Y+  L +ELL S   W+  Q+
Subjt:  FLCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQS

Query:  IYSVSLD
        I+ V  D
Subjt:  IYSVSLD

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.4e-4842.68Show/hide
Query:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPG
        MYTAD  N+HSVDE+T+GERLTL LWF+RDSS DED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G
Subjt:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHSLGYDIY-FPG

Query:  DHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNEN
        +   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N++ D+    + +S  +   ++  KS F  +  L  + F   +  G + 
Subjt:  DHGFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLKEDS-SYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNEN

Query:  QHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYSVSLD
        +  L    + +A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  QHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQSIYSVSLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTATTTGTAGAATGTAGATTTTCTGTGGCCAACTACCACTGCTTGGGCTTCTTGATCGAGTTTACATTCCAAATGACACTGCAACCTTCATCATCCAACTTAGGCTG
GACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAG
GTGGGCTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTGATGTACACGGCTGACAGTGACAATGTTCATTCTGTTGATGAGATA
ACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCCCAGGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAACCCTTTACATGATCGTTT
TCCTGACTCGTGCCTCCCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTTAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCATT
CGCTTGGATACGACATCTATTTTCCTGGGGACCATGGTTTTTCAGAGTATCCAGATTTATTCTCACAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAA
TTTGAGAACATCTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAAGGAGGATTCGAGCTATGCAGAATATTTATC
CCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAATATTGGGTTGGCTGAATCGGTCTTCTTATCTGCTACTTCCGATGGCAACGAGAACCAAC
ACTGGTTGGGGTGGGATAAGCTTGTTGTTGCTGCAGCAGCTTGGGAAGATTATGCTTCCATTTTAAGGAGAGAACTCCTTCGGAGCTTCAACCATTGGAGAAATTGTCAA
TCCATATACAGTGTTTCACTTGATAGCTGA
mRNA sequenceShow/hide mRNA sequence
CCCTAAGTTATTTTGGACAATTTTCCCACGTCTCGCTGAAACGGTAGGGACGGAGAACTGGACGCCAAAATGGGAGATGAAGCTGAGAGCAGACAGCGGCGGCGTCTGAT
TCTCGAAAATTTCTTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGTGGGGTATAGACCGAACGTCTTCTCCACCACTTTGTTGCATC
TTGTTGCCACTAATTCTGCTCAATTGATCATCCCTTTTGTTCCGATTAGAGAGAGGTTGAAGGAGAAAGCCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAG
TTCACTGGCTTGATCAGGTTCTTCTTCCTCATGTGCTAAATGCAGAAATGGGTTGTTGTCTCTTTTTACATATAATTCCAATTTTTGAAGAATGAACATTGGGAGTAATT
GATGCCGACCTGTGAGAAAATAAATTAAAAGGATAGCAATACGTTGTTTTTTTTTTTGTCTCTGGGAGTTTGAGGGTTGTGGTGTAGTCGCCTTTTAGAAATTGAAGTGC
ACTACTTATTAGTGTCCATAACTTGTATGGTTAACATGTCAAGTAATACAAGTAGTTTAGATCCTGTATCAATTTGAATTGAGCCTTAAACAAGTGTGCCATGGGCATTT
TTCTCCACAAAGAGAAAAACCTTAAAATTTCTCACTTGGTGATAAAATGATGATCTTGGTAAACTCCAAAGTTGATCTTCTTGGTAAACTGCAAGGTTAAAAAAACCTAT
AAACTCAACCAAAAAAAAAAAAGGTAACTATTTGAAATATAACTGTAAATCATTAGTACTTGGCAAGTATGAGCAAGCTATGTGAACTTTTCCCCCCAAACGTAAGAGAC
AATCAAGTACTCCCACTTCGACTAAGTAGTAACACTCACCATTTCTTCCAACTCAAAGACTTGTTCCCAAAGAGGATGGATTCTATAACATTTGTTAAGGCTCCTAAGAA
GCTTATTCAAGCTCTTCCTCGACATTATACTCGAAAAGGTGCTCCCTCCCTTGCATCCAAGTTATCAAATGTTCATAATGATCTTATTGATGATGCTTGTGTACGAGTGG
AAGTCCCTAATTTGGATTTGTCTAAGTCCAATCACAATTCTTCTACTCTTATTCCCCCTTCACAAAATTCCAAGCACCAATGAATATGGAGTGTTTCTCAACAAGTTAGC
TCGTATTGTCACTCAATTCAAATTGATAGAGAAGATGATTTAGTTGTAACTGTTGGCAAGTGAAGAGCTAGAGCTAAATTCTTTGAATTCAAACTTAGAGAATTCTCCTC
TTGAGGAGAACTTTGGAGAGGATCTTGTTATCTCTTTTGATTGTTGAGTTGAAAAGAAGGTTAGTGAGGCAAATTTAGTCGCATTTCTCCTCTTTCTCCATCTCAAATAC
CCTCCAAGTTCTCCTCCCTAGTCGAAACTTGTGGCCTTCAATTTTGCAAAGCATCCAACCTTTCATCCAAGTTGGTAATTTGATTGTTCTATTTATCAGTATGGAGATCA
TTTCATGATTTATTTGGGTAGTTGGAGCAATTTCTTTTAATCTATAAGGAGTTATATGGTATTTGGTTATTCTTGCAAAATTACTTTTCACTCATTTAAAGATTTCAGAA
GTTTTGGGACGTTTGTTTGCTTGTTGGGCTTGCTTTGAGATAATCTTGGGGAAAGCTTTAGTTGCCGCCCGTGGAGTTTGGGAAAGTTTATTTAAATAATGTGGATCTAT
TGACAGTTTGGGTGGGTTTATAGCTCGTTGCGTTTTGAGAAGGATTTTGATAATCTTGAAGATTTTGATTTTAATGCCCGAAGGAGGGTCAGAAGTCTAATGTGGTTTTT
GGCTTTTTCAGAATTCAAGTTGTCCTAGAAGAAGGCTCATTTGATGTCAAGTTTTTATCTTCTCTTTTTTTTTTTTTAATTTGTTATCTGGTCTAGTTTTTAATCTCTTC
TTGTACTGGAAAGCTTCTTATACCTTTATGTTTAGTATCCTCTTTTGATTATGTGGCTTAAAAGATTCAAACTTTCTGCCAAACTAAAGACTTGAACTTTGACCTTGGCA
ATTAACTTGGTCTTTTTCATTTTCATGCTTCAATTCACCTAGGCGACTTCATCATCTTATTAGAATGTTTTTCAATTGGGGTCTGGGTAGCGACACATTGAAATCAACTA
TCTCTCTTAGACTTCCCAATTGTTTTTGGTTCATAGATCTAGATCTGACTGATTGAAATTGGTTGTCGTTGCTACTGGTTTAACTATACACCTACTAAAACCGTTTGGAA
CTACGCATCCATTGTCTCATTGATCCTTTGACAATATTTAAATGCTATTTGTAGAATGTAGATTTTCTGTGGCCAACTACCACTGCTTGGGCTTCTTGATCGAGTTTACA
TTCCAAATGACACTGCAACCTTCATCATCCAACTTAGGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTC
TGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGGCTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTGATGT
ACACGGCTGACAGTGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCCCAGGATGAAGATGCAAAA
CTTCTTTCCCTTCTTTCACAAAACCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAA
TTTTAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCATTCGCTTGGATACGACATCTATTTTCCTGGGGACCATGGTTTTTCAGAGTATCCAGATTTATTCTCACAGG
ACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATCTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCT
ACCAACCTCAAGGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAATATTGGGTTGGCTGAATC
GGTCTTCTTATCTGCTACTTCCGATGGCAACGAGAACCAACACTGGTTGGGGTGGGATAAGCTTGTTGTTGCTGCAGCAGCTTGGGAAGATTATGCTTCCATTTTAAGGA
GAGAACTCCTTCGGAGCTTCAACCATTGGAGAAATTGTCAATCCATATACAGTGTTTCACTTGATAGCTGAGTATCCCAAGTACTTAAAGTAGCAGCAGCTTCAAGGTGA
TCGTCGAAGTTGATCATCAGTTATTGACGATGGCGGTGAGAGGTGGCAAAAGGTTGGCTAAAAGAGGTCGTCTAAAAACAATGGCCGAAGGTTGGCCAACAGTGGTCACT
AGAAAATGGTGGTGGAAGAGTTTTCGATGGCGCTTGTTGAAAAACGTGATCGGAGATTGGCCGAAAATGGTTGTCGGAGATAATGGTCAGAGGTTGACTGGTGGCTATCG
CCGTGGACAATTTGGTCAATGATAGTTAGAGGAGGTGGTGATCGAAATTTTTCATACAAATTAGAAGAGATTTTATTAACCACTTCAAATTTACTTTTTAAAAAATTGAT
TTTAAGTGTTTATTCCAAACCAACTTATTTTATAAACTCATTTCTCAATATCCATTTTAGGTGGTTGCCAAACACATGAATTTTTTACAAAACAACTTATTTTTTAATTT
AATCACTTGAAAAGGTATTTCAAACACATTCAAAATCTTCGGAAATAGTTTCCAAAGTATAGGGACTATT
Protein sequenceShow/hide protein sequence
MLFVECRFSVANYHCLGFLIEFTFQMTLQPSSSNLGWTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEI
TNGERLTLTLWFTRDSSQDEDAKLLSLLSQNPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHSLGYDIYFPGDHGFSEYPDLFSQDVQLVWGDKIFFQK
FENILHLLQVVQFLCWKGKELDSTNLKEDSSYAEYLSPKRNVGVSYFKSEFSKNIGLAESVFLSATSDGNENQHWLGWDKLVVAAAAWEDYASILRRELLRSFNHWRNCQ
SIYSVSLDS