CuGenDBv2

MELO3C020028 (gene) of Melon (DHL92) v4 genome

Gene ID: MELO3C020028
Organism: Cucumis melo DHL92 (Melon (DHL92) v4)
Description: Procollagen-proline 3-dioxygenase
Genome location: chr10:10249462..10256093
RNA-Seq Expression: MELO3C020028
Synteny: MELO3C020028
Gene Ontology terms:
GO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology
GenBank top hits (e value, % identity, alignment)
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa] (e value 2.5e-238, 99.24% identity)
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRF NSCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus] (e value 2.0e-227, 95.72% identity)
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo] (e value 6.0e-240, 100% identity)
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_008456833.1 PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo] (e value 9.6e-214, 91.94% identity)
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFS                                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida] (e value 1.4e-209, 89.47% identity)
Query:  MVDGAES--RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGA
        M D  ES  R+RRRLILENFL+REECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGA
Subjt:  MVDGAES--RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+FSAVCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTADSDNVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  PLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL
         LHDR  +S LPQPPSCNMYWFS E+DPNFK GFDICWARLHALGYDIYF GDH FSEYPDLFS+DVQLV G+K+FFQ+FENILHLLQVVQFLCWKGKEL
Subjt:  PLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL

Query:  DTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        D+TN+ EDS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSATS GKENQHWLGWDKL  AAAAAWEDYASILRRELLGS S+WRN QSIYSVSL S
Subjt:  DTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KMN7 Procollagen-proline 3-dioxygenase (e value 1.0e-224, 92.23% identity)
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS-------
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS       
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS-------

Query:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSH
                WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTAD+DNVHSVDEITNGERLTLTLWFTRDSSH
Subjt:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSH

Query:  DEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLL
        DEDAKLLSLLSQSPLHDRF +SCLPQPPSCNMYWFSPE+DPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLL
Subjt:  DEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLL

Query:  QVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWR
        QVVQFLCWKGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWR
Subjt:  QVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWR

Query:  NCQSIYSVSLDS
        NCQSIYSVSLDS
Subjt:  NCQSIYSVSLDS

A0A1S3C486 Procollagen-proline 3-dioxygenase (e value 2.9e-240, 100% identity)
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

A0A1S3C4U3 uncharacterized protein LOC103496668 isoform X2 (e value 4.7e-214, 91.94% identity)
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFS                                DCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase (e value 2.9e-240, 100% identity)
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase (e value 1.2e-238, 99.24% identity)
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRF NSCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

SwissProt top hits (e value, % identity, alignment)
Q4KLM6 Prolyl 3-hydroxylase 2 (e value 3.2e-10, 27.15% identity)
Query:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPNVL---STTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK ++  E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPNVL---STTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADSDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADSDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 (e value 1.9e-07, 31.76% identity)
Query:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSH
        WH   ++      +++++ YL+ Y  +FGGG F F D G   T+ P  G    +T+ S+N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSH

Q8CG71 Prolyl 3-hydroxylase 2 (e value 1.9e-10, 27.15% identity)
Query:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPNVL---STTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS+E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK ++  E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPNVL---STTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADSDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADSDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Q8IVL5 Prolyl 3-hydroxylase 2 (e value 5.4e-10, 26.24% identity)
Query:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPN----------VLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS E+CREL  +      VG         + PN           L +     V   SA L   F  I EK +   E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPN----------VLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADSDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + +++  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYTADSDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Arabidopsis top hits (e value, % identity, alignment)
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value 1.2e-116, 54.31% identity)
Query:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS  T+GYRPNV STTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNS
        R YLKQR+F+AVCYLNSY  +F GGLF FQ GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H+     
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNS

Query:  CLPQPPSCNMYWFSPEED-PNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNE
        CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL T+N+  
Subjt:  CLPQPPSCNMYWFSPEED-PNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNE

Query:  DSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  ++ L  + F  + SG   K++    G    +  A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value 9.9e-92, 46.82% identity)
Query:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS  T+GYRPNV STTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNS
        R YLKQR+F++                    GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ          
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNS

Query:  CLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNED
                                FD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL T+N+  D
Subjt:  CLPQPPSCNMYWFSPEEDPNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNED

Query:  SC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD
        +    + +S  +   ++  KS F  ++ L  + F  + SG   K++    G    +  A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  SC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value 5.1e-104, 50.51% identity)
Query:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS  T+GYRPNV STTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNS
        R YLKQR+F++                    GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H+     
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNS

Query:  CLPQPPSCNMYWFSPEED-PNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNE
        CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL T+N+  
Subjt:  CLPQPPSCNMYWFSPEED-PNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNE

Query:  DSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  ++ L  + F  + SG   K++    G    +  A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value 9.4e-50, 43.78% identity)
Query:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEED-PNFKFGFDICWARLHALGYDIY-FPG
        MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H+     CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G
Subjt:  MYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEED-PNFKFGFDICWARLHALGYDIY-FPG

Query:  DHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GK
        +   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL T+N+  D+    + +S  +   ++  KS F  ++ L  + F  + SG   K
Subjt:  DHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GK

Query:  ENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD
        ++    G    +  A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  ENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD
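
The % identity values listed with each hit relate to the middle line of the alignment blocks. The following is a minimal sketch, assuming the usual BLAST display convention: the middle line repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank for a mismatch or gap. The function name percent_identity is hypothetical and is not part of CuGenDB. The values reported on this page are computed over the full alignment, so a single block or fragment will not reproduce them.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percentage of alignment columns whose midline character is a residue letter.

    aligned_length is taken from the Query line, because trailing blanks of the
    midline are often lost when the page is copied as text.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Letters mark identical columns; '+' and ' ' are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# First ten columns of the third block of the TYK14443.1 alignment.
query = "HDRFSNSCLP"
midline = "HDRF NSCLP"
print(round(percent_identity(midline, len(query)), 2))
```

Passing the length separately, rather than using len(midline), keeps the result stable when whitespace at the end of the middle line has been stripped.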


Sequences
CDS sequence
ATGGTAGATGGAGCTGAGAGCAGGCAGCGCCGGCGTCTGATTCTTGAAAATTTCTTAAGCCGTGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGT
GGGGTATAGACCAAACGTCCTTTCCACCACTCTTTTGCATCTTGTTGCCACTAATTCTGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGCTGAAGGAGAAAG
CCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGACCCTAT
CTAAAACAACGCGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTT
TTATGGAGATTGTGTGATGTACACGGCTGACAGTGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACACTATGGTTCACCCGTGATAGCT
CTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTTCTAACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTT
TCACCAGAAGAGGATCCAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCATGCGCTTGGATACGACATCTATTTTCCTGGGGACCATGATTTTTCAGAGTA
TCCAGATTTATTCTCACAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATCTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGA
AAGGCAAAGAGCTGGATACTACCAACCTCAATGAGGATTCGTGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAG
AACGATGGGTTGGCTGAATCGGTCTTCTCATCTGCTACATCTGGTGGCAAGGAGAACCAACACTGGTTGGGGTGGGATAAGCTTGTTGTTGCTGCAGCAGCAGCTTGGGA
AGATTATGCTTCCATTTTACGTAGAGAACTCCTTGGGAGCTTCAGCCATTGGAGGAATTGTCAATCCATATACAGTGTTTCACTTGATAGCTGA
mRNA sequence
GACCACTTTTAAAATGTGTGCTCCTTGTACCTTCCATTGGTGAAACGTATCCTGGGTTATTTTGGACCATTTTCCCACCCGTCTTACTGAAACGGGAGAGACGGAGAACT
GAGACGCCAAAATGGTAGATGGAGCTGAGAGCAGGCAGCGCCGGCGTCTGATTCTTGAAAATTTCTTAAGCCGTGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGC
TGCTGTACGGTGGGGTATAGACCAAACGTCCTTTCCACCACTCTTTTGCATCTTGTTGCCACTAATTCTGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGCT
GAAGGAGAAAGCCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATA
ACCGACCCTATCTAAAACAACGCGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACC
ATCTCACCTTTTTATGGAGATTGTGTGATGTACACGGCTGACAGTGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACACTATGGTTCAC
CCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTTCTAACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATA
TGTATTGGTTTTCACCAGAAGAGGATCCAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCATGCGCTTGGATACGACATCTATTTTCCTGGGGACCATGAT
TTTTCAGAGTATCCAGATTTATTCTCACAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATCTTGCATTTGCTTCAGGTAGTGCAGTT
CCTGTGTTGGAAAGGCAAAGAGCTGGATACTACCAACCTCAATGAGGATTCGTGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTG
AGTTTTCGAAGAACGATGGGTTGGCTGAATCGGTCTTCTCATCTGCTACATCTGGTGGCAAGGAGAACCAACACTGGTTGGGGTGGGATAAGCTTGTTGTTGCTGCAGCA
GCAGCTTGGGAAGATTATGCTTCCATTTTACGTAGAGAACTCCTTGGGAGCTTCAGCCATTGGAGGAATTGTCAATCCATATACAGTGTTTCACTTGATAGCTGA
Protein sequence
MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPY
LKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWF
SPEEDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSK
NDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
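
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name translate and the choice to stop at the first stop codon are illustrative assumptions, not something stated on this page. The two fragments used are the first 18 and the last 21 nucleotides of the CDS shown above.

```python
# Standard genetic code, built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Start and end of the MELO3C020028 CDS, compared with the protein sequence.
print(translate("ATGGTAGATGGAGCTGAG"))       # start of the CDS
print(translate("AGTGTTTCACTTGATAGCTGA"))    # end of the CDS, including TGA
```

Joining the eleven CDS lines into one string and passing it to translate should give the full protein for comparison with the protein sequence block.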