CuGenDBv2

CSPI05G12520 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G12520
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Procollagen-proline 3-dioxygenase
Genome location: Chr5:11787099..11789939
RNA-Seq Expression: CSPI05G12520
Synteny: CSPI05G12520
Gene Ontology terms: GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains: IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology
GenBank top hits (e value, % identity, alignment)
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa] (e value: 4.8e-173; % identity: 95.39)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQSPLHDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV
        KGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV

Query:  SLDS
        SLDS
Subjt:  SLDS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus] (e value: 3.3e-182; % identity: 100)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS
        KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS

Query:  LDS
        LDS
Subjt:  LDS

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo] (e value: 2.4e-172; % identity: 95.07)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV
        KGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV

Query:  SLDS
        SLDS
Subjt:  SLDS

XP_008456833.1 PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo] (e value: 3.9e-146; % identity: 84.54)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQREFS                                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV
        KGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV

Query:  SLDS
        SLDS
Subjt:  SLDS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida] (e value: 4.0e-159; % identity: 89.44)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQR+FSAVCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTAD+DNVHSVDEITNGERLTLTLW TRDSSHDED+KLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQS LHDR PDS LPQPPSCNMYWFS EDDPNFK GFDICWARL ALGYD+YF GDH FSEYPDLF +DVQLV G+K+FFQ+FENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS
        KGKELDSTN+ EDSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSA SDGKENQ WLGWDKL AAAAAWE YASILRRELLGS S+WRN QSIYSVS
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS

Query:  LDS
        L S
Subjt:  LDS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KMN7 Procollagen-proline 3-dioxygenase (e value: 1.6e-182; % identity: 100)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS
        KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVS

Query:  LDS
        LDS
Subjt:  LDS

A0A1S3C486 Procollagen-proline 3-dioxygenase (e value: 1.2e-172; % identity: 95.07)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV
        KGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV

Query:  SLDS
        SLDS
Subjt:  SLDS

A0A1S3C4U3 uncharacterized protein LOC103496668 isoform X2 (e value: 1.9e-146; % identity: 84.54)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQREFS                                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV
        KGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV

Query:  SLDS
        SLDS
Subjt:  SLDS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase (e value: 1.2e-172; % identity: 95.07)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV
        KGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV

Query:  SLDS
        SLDS
Subjt:  SLDS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase (e value: 2.3e-173; % identity: 95.39)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW
        LLSQSPLHDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCW
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCW

Query:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV
        KGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSV
Subjt:  KGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKL-VAAAAAWEHYASILRRELLGSFSHWRNCQSIYSV

Query:  SLDS
        SLDS
Subjt:  SLDS

SwissProt top hits (e value, % identity, alignment)
Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 (e value: 3.3e-07; % identity: 30.59)
Query:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH
        WH   ++      +++++ YL+ Y  +FGGG F F D G   T+ P  G    +T+ ++N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH

Arabidopsis top hits (e value, % identity, alignment)
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 6.9e-77; % identity: 48.2)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        W +GASIGWHSDDNR YLKQR+F+AVCYLNSY  +F GGLF FQ GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFL
         LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF 
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIY
         WK  EL ++N+  D+    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+
Subjt:  CWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIY

Query:  SVSLD
         V  D
Subjt:  SVSLD

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 3.5e-52; % identity: 38.49)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        W +GASIGWHSDDNR YLKQR+F++                    GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLC
         LSQ                                  FD+C ARL  LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLC

Query:  WKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYS
        WK  EL ++N+  D+    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ 
Subjt:  WKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYS

Query:  VSLD
        V  D
Subjt:  VSLD

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 3.0e-64; % identity: 43.28)
Query:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS
        W +GASIGWHSDDNR YLKQR+F++                    GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS
Subjt:  WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFL
         LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF 
Subjt:  LLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFL

Query:  CWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIY
         WK  EL ++N+  D+    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+
Subjt:  CWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIY

Query:  SVSLD
         V  D
Subjt:  SVSLD

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 6.1e-49; % identity: 43.09)
Query:  MYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPG
        MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARL  LG+D++   G
Subjt:  MYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPG

Query:  DHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKEN
        +   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  D+    + +S  +   ++  KS F  ++ L  + F  + S G++ 
Subjt:  DHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKEN

Query:  QQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  QQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
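
The unlabelled middle line of each alignment block above follows the usual BLAST-style convention: a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. As a minimal sketch of how a % identity figure can be derived from such a midline, the hypothetical helper below (percent_identity is not part of CuGenDB) counts the letters and divides by the alignment length. It assumes the full alignment is available; the values listed above may have been computed by the database over a different span.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percentage of alignment columns whose midline character is a letter.

    Assumption: letters in the midline mark exact matches, while '+' and
    spaces mark non-identical columns (BLAST-style pairwise output).
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy midline, not taken from the hits above: 3 identical columns out of 7.
toy_midline = "WH +  L"
print(round(percent_identity(toy_midline, len(toy_midline)), 2))
```

The alignment length is passed explicitly because trailing spaces in a midline are easily lost when text is copied, which would otherwise shrink the denominator.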


Sequences
CDS sequence
TGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGG
AGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGA
TAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGT
TTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTCAAGTTTGGTTTTGATATATGCTGGGCGAGACTGCG
TGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGA
AATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTA
TCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCA
GCAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCC
AATCCATATACAGTGTTTCACTTGATAGCTGA
mRNA sequence
CTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTG
GAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAG
ATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCG
TTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTCAAGTTTGGTTTTGATATATGCTGGGCGAGACTGC
GTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAG
AAATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTT
ATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACC
AGCAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGC
CAATCCATATACAGTGTTTCACTTGATAGCTGAATAGCCCAAATACTTAAACTAGCAGTAGCTTCGAGG
Protein sequence
WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDR
FPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYL
SPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS