; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0022204 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0022204
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationchr10:10004783..10011599
RNA-Seq ExpressionIVF0022204
SyntenyIVF0022204
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK14443.1 prolyl 3-hydroxylase 1 [Cucumis melo var. makuwa]1.50e-306100Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus]2.09e-29095.97Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELD+
Subjt:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKLV AAAAAWE YASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_008456831.1 PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]8.32e-30499.24Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRF NSCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_008456833.1 PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo]1.41e-26991.18Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFS                                DCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRF NSCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]6.67e-26789.72Show/hide
Query:  MVDGAESRQRRR--LILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGA
        M D  ESR+RRR  LILENFL+REECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAHLI+PFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGA
Subjt:  MVDGAESRQRRR--LILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGA

Query:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS
        SIGWHSDDNRPYLKQR+FSAVCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMY ADSDNVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS
Subjt:  SIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS

Query:  PLHDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL
         LHDR P+S LPQPPSCNMYWFS EDDPNFK GFDICWARLHALGYDIYF GDH FSEYPDLFS+DVQLV G+K+FFQ+FENILHLLQVVQFLCWKGKEL
Subjt:  PLHDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL

Query:  DTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        D+TN+ EDS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSATS GKENQHWLGWDKL  AAAAAWEDYASILRRELLGS S+WRN QSIYSVSL S
Subjt:  DTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

TrEMBL top hitse value%identityAlignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase1.5e-22592.48Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS-------
        MVDGAESRQRRRLILENFLSREECRELEFIHKSC TVGYRPNV STTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS       
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLIS-------

Query:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSH
                WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY AD+DNVHSVDEITNGERLTLTLWFTRDSSH
Subjt:  --------WTRGASIGWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSH

Query:  DEDAKLLSLLSQSPLHDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLL
        DEDAKLLSLLSQSPLHDRFP+SCLPQPPSCNMYWFSPEDDPNFKFGFDICWARL ALGYD+YFPGDHDFSEYPDLF QDVQLVWGDKIFFQKFENILHLL
Subjt:  DEDAKLLSLLSQSPLHDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLL

Query:  QVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWR
        QVVQFLCWKGKELD+TNL+EDS YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSA S GKENQ WLGWDKL VAAAAAWE YASILRRELLGSFSHWR
Subjt:  QVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWR

Query:  NCQSIYSVSLDS
        NCQSIYSVSLDS
Subjt:  NCQSIYSVSLDS

A0A1S3C486 Procollagen-proline 3-dioxygenase7.1e-23999.24Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRF NSCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

A0A1S3C4U3 uncharacterized protein LOC103496668 isoform X21.5e-21291.18Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFS                                DCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRF NSCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

A0A5A7SSL8 Procollagen-proline 3-dioxygenase7.1e-23999.24Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMY ADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRF NSCLPQPPSCNMYWFSPE+DPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase5.8e-241100Show/hide
Query:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
        MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI
Subjt:  MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASI

Query:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
        GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL
Subjt:  GWHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPL

Query:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
        HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT
Subjt:  HDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT

Query:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
        TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS
Subjt:  TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS

SwissProt top hitse value%identityAlignment
Q4KLM6 Prolyl 3-hydroxylase 24.2e-1027.15Show/hide
Query:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPNVL---STTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK ++  E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPNVL---STTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYRADSDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + + +  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYRADSDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 37.4e-0730.59Show/hide
Query:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSH
        WH   ++      +++++ YL+ Y  +FGGG F F D G   T+ P  G    + + S+N+H V++++ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSH

Q8CG71 Prolyl 3-hydroxylase 22.4e-1027.15Show/hide
Query:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPNVL---STTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS+E+CREL  +      VG         + PN     +T L  L       V   SA L   F  I EK ++  E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPNVL---STTLLHL-------VATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYRADSDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + + +  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYRADSDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Q8IVL5 Prolyl 3-hydroxylase 27.1e-1026.24Show/hide
Query:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPN----------VLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL
        +R++L+N LS E+CREL  +      VG         + PN           L +     V   SA L   F  I EK +   E +F  +  L+  +T +
Subjt:  RRLILENFLSREECRELEFIHKSCCTVG---------YRPN----------VLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGL

Query:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYRADSDNVHSVDE
        +  T            S   H+D+              P    R++SA+ Y+N    +F GG F F + + +T    I P  G  + + +  +N H V  
Subjt:  ISWT---------RGASIGWHSDD------------NRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPET----ISPFYGDCVMYRADSDNVHSVDE

Query:  ITNGERLTLTLWFTRDSSHDE
        +T G+R  + LWFT D  + E
Subjt:  ITNGERLTLTLWFTRDSSHDE

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.8e-11654.06Show/hide
Query:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS  T+GYRPNV STTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNS
        R YLKQR+F+AVCYLNSY  +F GGLF FQ GEP T++P  GD +MY AD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H+     
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNE
        CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL T+N+  
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNE

Query:  DSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  ++ L  + F  + SG   K++    G    +  A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.9e-9146.56Show/hide
Query:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS  T+GYRPNV STTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNS
        R YLKQR+F++                    GEP T++P  GD +MY AD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ          
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNS

Query:  CLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNED
                                FD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL T+N+  D
Subjt:  CLPQPPSCNMYWFSPEDDPNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNED

Query:  SC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD
        +    + +S  +   ++  KS F  ++ L  + F  + SG   K++    G    +  A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  SC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.9e-10350.25Show/hide
Query:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN
        ++  RLIL NFLS  EC+ELE IHKS  T+GYRPNV STTL HL+ATNS HLIIPFV IRE+LKEK EE FGC YELF+EFTGLISW +GASIGWHSDDN
Subjt:  RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDN

Query:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNS
        R YLKQR+F++                    GEP T++P  GD +MY AD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H+     
Subjt:  RPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNS

Query:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNE
        CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G+   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL T+N+  
Subjt:  CLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHALGYDIY-FPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNE

Query:  DSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD
        D+    + +S  +   ++  KS F  ++ L  + F  + SG   K++    G    +  A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  DSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.7e-4943.37Show/hide
Query:  MYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHALGYDIY-FPG
        MY AD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H+     CLP P S NMYWF P +D  N   GFD+C ARLH LG+D++   G
Subjt:  MYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNSCLPQPPSCNMYWFSP-EDDPNFKFGFDICWARLHALGYDIY-FPG

Query:  DHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GK
        +   ++  +     +QL  G K+  +KF NILH LQVVQF  WK  EL T+N+  D+    + +S  +   ++  KS F  ++ L  + F  + SG   K
Subjt:  DHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSC-YAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSG--GK

Query:  ENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD
        ++    G    +  A  +WE+Y+  L +ELL S   W+  Q+I+ V  D
Subjt:  ENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGATGGAGCTGAGAGCAGGCAGCGCCGGCGTCTGATTCTTGAAAATTTCTTAAGCCGTGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGT
GGGGTATAGACCAAACGTCCTTTCCACCACTCTTTTGCATCTTGTTGCCACTAATTCTGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGTTGAAGGAGAAAG
CCGAGGAATTCTTTGGGTGTCATTATGAGCTATTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGACCCTAT
CTAAAACAACGCGAATTTTCTGCAGTGTGTTACTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTT
TTATGGAGATTGTGTGATGTACAGGGCTGACAGTGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACACTATGGTTCACCCGTGATAGCT
CTCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCCTTTACATGATCGTTTTCCTAACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTT
TCACCAGAAGACGATCCAAATTTCAAGTTCGGTTTTGATATATGCTGGGCGAGACTGCATGCGCTTGGATACGACATCTATTTTCCTGGGGACCATGATTTTTCAGAGTA
TCCAGATTTATTCTCACAGGACGTACAATTAGTGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATCTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGA
AAGGCAAAGAGCTGGATACTACCAACCTCAATGAGGATTCGTGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAG
AACGATGGGTTGGCTGAATCGGTCTTCTCATCTGCTACATCTGGTGGCAAGGAGAACCAACACTGGTTGGGGTGGGATAAGCTTGTTGTTGCTGCAGCAGCAGCTTGGGA
AGATTATGCTTCCATTTTACGTAGAGAACTCCTTGGGAGCTTCAGCCATTGGAGGAATTGTCAATCCATATACAGTGTTTCACTTGATAGCTGA
mRNA sequenceShow/hide mRNA sequence
AACGTATCCTGGGTTATTTTGGACCATTTTCCCACCCGTCTTACTGAAACGGGAGAGACGGAGAGCTGAGACGCCAAAATGGTAGATGGAGCTGAGAGCAGGCAGCGCCG
GCGTCTGATTCTTGAAAATTTCTTAAGCCGTGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGTGGGGTATAGACCAAACGTCCTTTCCACCACTC
TTTTGCATCTTGTTGCCACTAATTCTGCTCATTTGATCATCCCTTTTGTTCCGATTAGAGAGAAGTTGAAGGAGAAAGCCGAGGAATTCTTTGGGTGTCATTATGAGCTA
TTTGTTGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACCGACCCTATCTAAAACAACGCGAATTTTCTGCAGTGTGTTA
CTTGAATAGTTATGGAGTAGAATTTGGAGGTGGACTGTTTCACTTTCAGGATGGGGAACCAGAAACCATCTCACCTTTTTATGGAGATTGTGTGATGTACAGGGCTGACA
GTGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACACTATGGTTCACCCGTGATAGCTCTCATGATGAAGATGCAAAACTTCTTTCCCTT
CTTTCACAAAGCCCTTTACATGATCGTTTTCCTAACTCGTGCCTCCCTCAGCCTCCGTCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTCAAGTTCGG
TTTTGATATATGCTGGGCGAGACTGCATGCGCTTGGATACGACATCTATTTTCCTGGGGACCATGATTTTTCAGAGTATCCAGATTTATTCTCACAGGACGTACAATTAG
TGTGGGGAGATAAGATATTCTTTCAGAAATTTGAGAACATCTTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAGCTGGATACTACCAACCTCAAT
GAGGATTCGTGCTATGCAGAATATTTATCCCCAAAGAGAAATGTGGGAGTCAGTTACTTTAAATCTGAGTTTTCGAAGAACGATGGGTTGGCTGAATCGGTCTTCTCATC
TGCTACATCTGGTGGCAAGGAGAACCAACACTGGTTGGGGTGGGATAAGCTTGTTGTTGCTGCAGCAGCAGCTTGGGAAGATTATGCTTCCATTTTACGTAGAGAACTCC
TTGGGAGCTTCAGCCATTGGAGGAATTGTCAATCCATATACAGTGTTTCACTTGATAGCTGAATATCCCAAATACTTAAAGTAGTAGAGCTTCAAGGTTAGTTTTGGGCC
TTTGTATAGCTTAATCTCCGTTCGGTAACAATTTTGTTTTTTGTTTTAGAAATTTATGTTTGTTTCATCCCAAATTTCCAATTATGGTTTTCAATTTTGTTATTTACACT
CTAGGATCCCATTTGTAATCGTTTTGTTTTTGAAAAATTAAGTCCGTTGACACCACTACTTATCTAATTTTTACATCAATCCTCG
Protein sequenceShow/hide protein sequence
MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPY
LKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPNSCLPQPPSCNMYWF
SPEDDPNFKFGFDICWARLHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSK
NDGLAESVFSSATSGGKENQHWLGWDKLVVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS