; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy5G009980 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy5G009980
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationGy14Chr5:8862185..8873909
RNA-Seq ExpressionCsGy5G009980
SyntenyCsGy5G009980
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa]7.56e-29195.97Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus]1.38e-304100Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]6.21e-29095.72Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_008456833.1 PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo]7.35e-25687.66Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFS                                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]1.34e-26689.45Show/hide
Query:  MVDGAESRQRRR--LILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGA
        M D  ESR+RRR  LILENFL+REECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGA
Subjt:  MVDGAESRQRRR--LILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+FSAVCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTAD+DNVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  PLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL
         LHDR PDS LPQPPSCNMYWFS EDDPNFK GFDICWARL ALGYD+YF GDH FSEYPDLF +DVQLV G+K+FFQ+FENILHLLQVVQFLCWKGKEL
Subjt:  PLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL

Query:  DSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
        DSTN+ EDSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSA SDGKENQ WLGWDKL AAAAAWE YASILRRELLGS S+WRN QSIYSVSL S
Subjt:  DSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS

TrEMBL top hitse value%identityAlignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase1.09e-30096.35Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS-------
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS       
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS-------

Query:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH
                WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH
Subjt:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH

Query:  DEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLL
        DEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLL
Subjt:  DEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLL

Query:  QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRN
        QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRN
Subjt:  QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRN

Query:  CQSIYSVSLDS
        CQSIYSVSLDS
Subjt:  CQSIYSVSLDS

A0A1S3C486 Procollagen-proline 3-dioxygenase3.01e-29095.72Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS

A0A1S3C4U3 uncharacterized protein LOC103496668 isoform X23.56e-25687.66Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFS                                DCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase3.01e-29095.72Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase3.66e-29195.97Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS

Query:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLV-AAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS

SwissProt top hitse value%identityAlignment
Q4KLM6 Prolyl 3-hydroxylase 24.2e-1027.15Show/hide
Query:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK ++  E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 34.3e-0730.59Show/hide
Query:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH
        WH   ++      +++++ YL+ Y  +FGGG F F D G   T+ P  G    +T+ ++N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSH

Q8CG71 Prolyl 3-hydroxylase 22.4e-1027.15Show/hide
Query:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS+E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK ++  E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Q8IVL5 Prolyl 3-hydroxylase 27.1e-1027.15Show/hide
Query:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK +   E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCSTVG---------YRPN---VFSTTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADNDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.8e-11754.48Show/hide
Query:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS ST+GYRPNVFSTTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS
        R YLKQR+F+AVCYLNSY  +F GGLF FQ GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE
        CLP P S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE

Query:  DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.4e-9246.92Show/hide
Query:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS ST+GYRPNVFSTTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS
        R YLKQR+F++                    GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ          
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS

Query:  CLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSED
                                FD+C ARL  LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  D
Subjt:  CLPQPPSCNMYWFSPEDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSED

Query:  S-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        +    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  S-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.0e-10450.64Show/hide
Query:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS ST+GYRPNVFSTTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS
        R YLKQR+F++                    GEP T++P  GD +MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + 
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE
        CLP P S NMYWF P +D  N   GFD+C ARL  LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSE

Query:  DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  ++ L  + F  + S G++ +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.0e-4943.09Show/hide
Query:  MYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPG
        MYTAD+ N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARL  LG+D++   G
Subjt:  MYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLRALGYDLY-FPG

Query:  DHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKEN
        +   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL ++N+  D+    + +S  +   ++  KS F  ++ L  + F  + S G++ 
Subjt:  DHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDS-SYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKEN

Query:  QQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD
        +  L    +  A  +WE Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  QQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGATGGAGCTGAGAGCAGGCAGCGGCGGCGTCTGATTCTTGAAAATTTCCTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTCTACGGT
GGGGTATAGACCAAACGTCTTTTCCACCACTCTGTTGCATCTTGTTGCCACCAATTCCGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGTTGAAGGAGAAAG
CCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTAT
CTAAAACAACGTGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTT
TTATGGAGATTGTGTGATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCT
CTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTT
TCACCAGAAGATGATCCAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTA
TCCAGATTTATTCTTTCAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGA
AAGGCAAAGAGCTGGATTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAG
AACGATGGGTTGGCCGAATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAACAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACA
TTATGCTTCCATTTTAAGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATAGCTGA
mRNA sequenceShow/hide mRNA sequence
AGGATATTGAAAAAGAAAAGAAAGGAAGGTTTTAATGAAAGGGAGCAGTGGCATGTTTCTTTTCAAAATAGTATTTTCTTCTATAGAAGAACTTTTGTTCGTATTTATAG
ATGAAATGAAATGGAATGAAAAATTATGTAAAAGTAATGGATGAGTTTACCATAAATTAAAGTTTTTCAAATTTGTTGGTAACTCCAATTCTTCATTTCCAATCAAGCTA
AGTCCAATCCTTCATTTCAAGTTTCTTAGGTTATTTTGGACCATTTTCCCACCCGTCTCGCTGAAACGGGAGAGACGGAGAACTGGACGCCAAAATGGTAGATGGAGCTG
AGAGCAGGCAGCGGCGGCGTCTGATTCTTGAAAATTTCCTAAGCCGCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTCTACGGTGGGGTATAGACCAAAC
GTCTTTTCCACCACTCTGTTGCATCTTGTTGCCACCAATTCCGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGTTGAAGGAGAAAGCCGAGGAATTCTTTGG
GTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGGCCCTATCTAAAACAACGTGAAT
TTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTG
ATGTACACGGCTGACAATGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGC
AAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTGACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGATGATC
CAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCGTGCGCTTGGATACGACCTCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTTT
CAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATTTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGA
TTCTACCAACCTCAGTGAGGATTCGAGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCCG
AATCGGTCTTCTCATCTGCTGCATCTGATGGCAAGGAGAACCAACAATGGTTGGGGTGGGATAAGCTTGTTGCTGCAGCAGCAGCTTGGGAACATTATGCTTCCATTTTA
AGGAGAGAACTCCTTGGAAGCTTCAGCCATTGGAGGAATTGCCAATCCATATACAGTGTTTCACTTGATAGCTGAATATCCCAAATACTTAAACTAGCAGTAGCTTCGAG
GTTAGTTCTGGGCCTTTGTATAGCTTAACTTTTACAATATCAAATAAAGTTCCGTTTGGTAACAATTTTGTTTTTTGTTTTGGAAATTTATGTTTGTTTCATTCCAAATT
TCCAATTATAGTTTTCAATTTTGTTAATTAAACACTAGGGTCCTGTTTGGTAATCATTTTGTTTTTTGTTTTTGAAAAATTAAGTCTATTGACACACTACTTAAAGAACC
Protein sequenceShow/hide protein sequence
MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPY
LKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWF
SPEDDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSK
NDGLAESVFSSAASDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS