; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G16185 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G16185
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationctg2231:461221..473042
RNA-Seq ExpressionCucsat.G16185
SyntenyCucsat.G16185
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa]1.20e-28995.71Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYP+LF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLD
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus]2.20e-30399.75Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYP+LFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]9.89e-28995.45Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYP+LF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLD
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

XP_008456833.1 PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo]1.17e-25487.37Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFS                                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYP+LF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLD
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]2.14e-26589.39Show/hide
Query:  MVDGAESRQRRR--LILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGA
        M D  ESR+RRR  LILENFL+REECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGA
Subjt:  MVDGAESRQRRR--LILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+FSAVCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTAD+DNVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  PLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL
         LHDR PDS LPQPPSCNMYWFS EDDPNFK GFDICWARL ALGYD+YF GDH FSEYP+LF +DVQLV G+K+FFQ+FENILHLLQVVQFLCWKGKEL
Subjt:  PLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL

Query:  DSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSL
        DSTN+ EDSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSA SDGKENQ WLGWDKL AAAAAWE YASILRRELLGS S+WRN QSIYSVSL
Subjt:  DSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSL

TrEMBL top hitse value%identityAlignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase1.74e-29996.1Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS-------
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS       
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS-------

Query:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH
                WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH
Subjt:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH

Query:  DEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLL
        DEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYP+LFFQDVQLVWGDKIFFQKFENILHLL
Subjt:  DEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLL

Query:  QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRN
        QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRN
Subjt:  QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRN

Query:  CQSIYSVSLD
        CQSIYSVSLD
Subjt:  CQSIYSVSLD

A0A1S3C486 Procollagen-proline 3-dioxygenase4.79e-28995.45Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYP+LF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLD
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

A0A1S3C4U3 uncharacterized protein LOC103496668 isoform X25.66e-25587.37Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFS                                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYP+LF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLD
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

A0A5A7SSL8 Procollagen-proline 3-dioxygenase4.79e-28995.45Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYP+LF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLD
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

A0A5D3CRE9 Procollagen-proline 3-dioxygenase5.83e-29095.71Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYP+LF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLD
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

SwissProt top hitse value%identityAlignment
Q4KLM6 Prolyl 3-hydroxylase 24.2e-1027.15Show/hide
Query:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK ++  E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 34.3e-0730.59Show/hide
Query:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH
        WH   ++      +++++ YL+ Y  +FGGG F F D G   T+ P  G    +T+ ++N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH

Q8CG71 Prolyl 3-hydroxylase 22.4e-1027.15Show/hide
Query:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS+E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK ++  E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Q8IVL5 Prolyl 3-hydroxylase 27.1e-1027.15Show/hide
Query:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK +   E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.0e-11754.73Show/hide
Query:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS ST+GYRPNVFSTTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS
        R YLKQR+F+AVCYLNSY  +F GGLF FQ GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE
        CLP P S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  E     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE

Query:  DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.5e-9247.18Show/hide
Query:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS ST+GYRPNVFSTTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS
        R YLKQR+F++                    GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ          
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS

Query:  CLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSED
                                FD+C ARL  LG+D++   G+   ++  E     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  D
Subjt:  CLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSED

Query:  S-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        +    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  S-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.3e-10450.9Show/hide
Query:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS ST+GYRPNVFSTTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS
        R YLKQR+F++                    GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE
        CLP P S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  E     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE

Query:  DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.7e-4943.5Show/hide
Query:  MYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPG
        MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARL  LG+D++   G
Subjt:  MYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPG

Query:  DHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKEN
        +   ++  E     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  D+    + +S  +   ++  KS F  ++ L  + F  + S G++ 
Subjt:  DHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKEN

Query:  QQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  QQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGATGGAGCTGAGAGCAGGCAGCGGCGGCGTCTGATTCTTGAAAATTTCCTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTCTACGGT
GGGGTATAGACCAAACGTCTTTTCCACCACTCTGTTGCATCTTGTTGCCACCAATTCCGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGTTGAAGGAGAAAG
CCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTAT
CTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTT
TTATGGAGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCT
CTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTT
TCACCAGAAGACGATCCAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTA
TCCAGAGTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGA
AAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAG
AACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAGCAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACA
TTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGATGGAGCTGAGAGCAGGCAGCGGCGGCGTCTGATTCTTGAAAATTTCCTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTCTACGGT
GGGGTATAGACCAAACGTCTTTTCCACCACTCTGTTGCATCTTGTTGCCACCAATTCCGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGTTGAAGGAGAAAG
CCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTAT
CTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTT
TTATGGAGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCT
CTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTT
TCACCAGAAGACGATCCAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTA
TCCAGAGTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGA
AAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAG
AACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAGCAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACA
TTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATGGCTGA
Protein sequenceShow/hide protein sequence
MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPY
LKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWF
SPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPELFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSK
NDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDG