; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G17320 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G17320
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationClcChr01:30042444..30047555
RNA-Seq ExpressionClc01G17320
SyntenyClc01G17320
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0080147 - root hair cell development (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016020 - membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141886.1 probable prolyl 4-hydroxylase 9 [Cucumis sativus]1.1e-14980.42Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSG+SNWSLRSKLGLPALIFVLCLFCFLAGFFGS+LLSQDVDDDRPR RLLQS SD +EFDLMS GENGDDSISSIPFQVLSWRPRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QCQSIVN+AKPKLRPSTLALRKGET ESTKG+RTS                                         SGVFFSASEDESGTLGVIEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
        ATM+PR HGEA+NILRYEIGQKYNSHYDAF P+EYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG+Y++Q C+GLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD
        TIDPTSLHGSCPVIKGQKWVATKWIRDQ+Q+D
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD

XP_008440351.1 PREDICTED: probable prolyl 4-hydroxylase 9 [Cucumis melo]2.4e-14981.33Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK KSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPR RLLQS SDG+EFDLMS GENGD SISSIPFQVLSWRPRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QCQSIVNMAKPKLRPSTLALRKGET E+TKGIRTS                                         SGVFFSASEDESG LGVIEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
        ATM+PR HGEA+NILRYEIGQKYNSHYDAF P+EYGPQ+SQRVASFLLYLTDVEEGGETMFPFENG NMDG+Y+YQ CVGLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD
        TIDPTSLHGSCPVIKGQKWVATKWIRDQ QDD
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD

XP_022977335.1 probable prolyl 4-hydroxylase 9 [Cucurbita maxima]1.2e-14879.82Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSGKS+ SLRSKLGLPA+IFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSV+ GSEFDLM  GENGDDSIS IPFQ+LSW+PRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QC++IVNMA+P+LRPS+LALRKGETEE+TKGIRTS                                         SGVFFSASEDESGTLG IEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
        ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNP+EYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG YDYQ+C GLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD
        TIDPTSLHGSCPV+KGQKWVATKWIRDQ+Q+D
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD

XP_023543701.1 probable prolyl 4-hydroxylase 9 [Cucurbita pepo subsp. pepo]6.9e-14980.12Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK KSGKS+ SLRSKLGLPA+IFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSV+ GSEFDLM  GENGDDSIS IPFQVLSW+PRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QC++IVNMA+P+LRPS+LALRKGETEE+TKGIRTS                                         SGVFFSASEDESGTLG IEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
        ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNP+EYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG YDYQ+C GLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD
        TIDPTSLHGSCPV+KGQKWVATKWIRDQ+QDD
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD

XP_038881540.1 probable prolyl 4-hydroxylase 9 [Benincasa hispida]3.6e-15382.23Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDV+DDRP PRLLQSVSDGSEFDLM+PGENGDDSISSIPFQVLSWRPRALFFPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QC SIVNMA+PKLRPSTLALRKGETEE+TKGIRTS                                         SG FF ASEDESGTLGVIEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
        ATM+PR HGEAFNILRYEIGQKYNSHYDAFNP+EYGPQ+SQRVASFLLYLTDV+EGGETMFPFENG NMDG YD+QRC+GLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD
        TIDPTSLHGSCPV+KGQKWVATKWIRDQIQDD
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD

TrEMBL top hitse value%identityAlignment
A0A0A0KGG4 Fe2OG dioxygenase domain-containing protein5.2e-15080.42Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSG+SNWSLRSKLGLPALIFVLCLFCFLAGFFGS+LLSQDVDDDRPR RLLQS SD +EFDLMS GENGDDSISSIPFQVLSWRPRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QCQSIVN+AKPKLRPSTLALRKGET ESTKG+RTS                                         SGVFFSASEDESGTLGVIEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
        ATM+PR HGEA+NILRYEIGQKYNSHYDAF P+EYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG+Y++Q C+GLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD
        TIDPTSLHGSCPVIKGQKWVATKWIRDQ+Q+D
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD

A0A1S3B0H7 probable prolyl 4-hydroxylase 91.2e-14981.33Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK KSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPR RLLQS SDG+EFDLMS GENGD SISSIPFQVLSWRPRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QCQSIVNMAKPKLRPSTLALRKGET E+TKGIRTS                                         SGVFFSASEDESG LGVIEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
        ATM+PR HGEA+NILRYEIGQKYNSHYDAF P+EYGPQ+SQRVASFLLYLTDVEEGGETMFPFENG NMDG+Y+YQ CVGLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD
        TIDPTSLHGSCPVIKGQKWVATKWIRDQ QDD
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD

A0A6J1GEZ7 probable prolyl 4-hydroxylase 91.8e-14779.22Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK KSGKS  SLRSKLGLPA+IFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSV+ GS+FDLM  GENGDDSIS IPFQVLSW+PRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QC++IVNMA+P+LRPS+LALRKGETEE+TKGIRTS                                         SGVFFSASEDESGTLG IEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
         TMLPRMHGEAFNILRYEIGQKYNSHYDAFNP+EYGPQ+SQRVASFLLYLTDVE+GGETMFPFENGLNMDG YDYQ+C GLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD
        TIDPTSLHGSCPV+KGQKWVATKWIRDQ+QDD
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD

A0A6J1HHX9 probable prolyl 4-hydroxylase 93.3e-14477.95Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK  +GKSNWS RSKLGLP  IF+LCLFCFLAGFFGSSLLSQDVDDDRP PRLLQSVSD SEFDLM+ GE GDDSISSIPFQ+LSWRPRAL FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QCQSIVNMA+P L+PS LALRKGET+ESTKG RTS                                         SGVF SASEDESGTL  IEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
        ATMLPR HGEAFN+LRYEIGQKYN+HYDAFNP+EYGPQRSQRVASFLLYLTDVEEGGETMFPF+NGLNMDG+Y YQRC+GLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQD
        TIDPTSLHGSCPVIKGQKWVATKWIRDQI+D
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQD

A0A6J1IM09 probable prolyl 4-hydroxylase 95.7e-14979.82Show/hide
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSGKS+ SLRSKLGLPA+IFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSV+ GSEFDLM  GENGDDSIS IPFQ+LSW+PRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR
        QC++IVNMA+P+LRPS+LALRKGETEE+TKGIRTS                                         SGVFFSASEDESGTLG IEEKIAR
Subjt:  QCQSIVNMAKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIAR

Query:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG
        ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNP+EYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG YDYQ+C GLKVKPRQGDGLLFYSVFPNG
Subjt:  ATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNG

Query:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD
        TIDPTSLHGSCPV+KGQKWVATKWIRDQ+Q+D
Subjt:  TIDPTSLHGSCPVIKGQKWVATKWIRDQIQDD

SwissProt top hitse value%identityAlignment
F4ILF8 Prolyl 4-hydroxylase 133.6e-8450.15Show/hide
Query:  NWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNM
        ++    KL  P +    C F  + GF   +L SQ +      P   +SV+D  E D +  G     S+S+IPF  LSW PR  + P FAT +QC+++++M
Subjt:  NWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNM

Query:  AKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMH
        AKPKL+PSTLALRKGET E+T+  R+  +                                               EDESG L  IEEKIA AT  P+ +
Subjt:  AKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMH

Query:  GEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLH
         E+FNILRY++GQKY+SHYDAF+  EYGP  SQRV +FLL+L+ VEEGGETMFPFENG NM+G YDY++CVGLKVKPRQGD + FY++FPNGTID TSLH
Subjt:  GEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLH

Query:  GSCPVIKGQKWVATKWIRDQIQD
        GSCPVIKG+KWVATKWIRDQ  D
Subjt:  GSCPVIKGQKWVATKWIRDQIQD

F4J0A8 Probable prolyl 4-hydroxylase 67.5e-3736.64Show/hide
Query:  ISSIPFQV-------LSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLA--LRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCA
        ISS  F V       LSW PRA  +  F + E+C  ++ +AK KL  S +   +  GE+E+S   +RTS                               
Subjt:  ISSIPFQV-------LSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLA--LRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCA

Query:  ILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENG
                  SG+F +  +D+   +  +E K+A  T LP  +GEA  IL YE GQKY+ H+D F   +       R+A+ L+YL++V +GGET+FP   G
Subjt:  ILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENG

Query:  LN---MDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWI
              D S+      G  VKPR+GD LLF+++  NGT DP SLHGSCPVI+G+KW AT+WI
Subjt:  LN---MDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWI

F4JZ24 Probable prolyl 4-hydroxylase 103.2e-4037.45Show/hide
Query:  DDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLALRKGETEESTKG-IRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFS
        DDS +    +++SW PRA  +  F T E+C+ ++ +AKP +  ST+   K  T +ST   +RTS                                    
Subjt:  DDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLALRKGETEESTKG-IRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFS

Query:  GNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDG
             SG F +   D+  T+  IE++I+  T +P  HGE   +L YEIGQKY  HYD F          QR+A+ L+YL+DVEEGGET+FP   G     
Subjt:  GNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDG

Query:  SY--DYQRC--VGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR
         +  +   C   GL VKP+ GD LLF+S+ P+ T+DP+SLHG C VIKG KW +TKW+R
Subjt:  SY--DYQRC--VGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR

Q8VZJ7 Probable prolyl 4-hydroxylase 97.3e-10961.99Show/hide
Query:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDG-SEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKP
        R KLGL A + V C  CFL GF+GS+LLSQ+V   +PR R+L  V +G  E   M  G  G++SI SIPFQVLSWRPRA++FP FATAEQCQ+I+  AK 
Subjt:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDG-SEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKP

Query:  KLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEA
         L+PS LALRKGET E+TKG RTS                                         SG F SASE+ +G L  +E KIARATM+PR HGE+
Subjt:  KLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEA

Query:  FNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSC
        FNILRYE+GQKY+SHYD FNPTEYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG NM   YDY++C+GLKVKPR+GDGLLFYSVFPNGTID TSLHGSC
Subjt:  FNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSC

Query:  PVIKGQKWVATKWIRDQIQDD
        PV KG+KWVATKWIRDQ Q++
Subjt:  PVIKGQKWVATKWIRDQIQDD

Q9LN20 Probable prolyl 4-hydroxylase 32.7e-3933.76Show/hide
Query:  LIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLA-
        ++F+L +   +   FG  + S  +++D   P  L      +       G+ GD        +VLSW PRA  +  F + E+C+ ++++AKP +  ST+  
Subjt:  LIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLA-

Query:  LRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEI
           G++++S   +RTS                                         SG F     D+   +  IE++IA  T +P  HGE   +L YE 
Subjt:  LRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEI

Query:  GQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRC-----VGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVI
        GQKY  HYD F          QR+A+ L+YL+DVEEGGET+FP  N +N      Y         GL VKPR GD LLF+S+ P+ T+DPTSLHG CPVI
Subjt:  GQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRC-----VGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVI

Query:  KGQKWVATKWI
        +G KW +TKW+
Subjt:  KGQKWVATKWI

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.9e-4033.76Show/hide
Query:  LIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLA-
        ++F+L +   +   FG  + S  +++D   P  L      +       G+ GD        +VLSW PRA  +  F + E+C+ ++++AKP +  ST+  
Subjt:  LIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLA-

Query:  LRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEI
           G++++S   +RTS                                         SG F     D+   +  IE++IA  T +P  HGE   +L YE 
Subjt:  LRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEI

Query:  GQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRC-----VGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVI
        GQKY  HYD F          QR+A+ L+YL+DVEEGGET+FP  N +N      Y         GL VKPR GD LLF+S+ P+ T+DPTSLHG CPVI
Subjt:  GQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRC-----VGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVI

Query:  KGQKWVATKWI
        +G KW +TKW+
Subjt:  KGQKWVATKWI

AT2G23096.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.6e-8550.15Show/hide
Query:  NWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNM
        ++    KL  P +    C F  + GF   +L SQ +      P   +SV+D  E D +  G     S+S+IPF  LSW PR  + P FAT +QC+++++M
Subjt:  NWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNM

Query:  AKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMH
        AKPKL+PSTLALRKGET E+T+  R+  +                                               EDESG L  IEEKIA AT  P+ +
Subjt:  AKPKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMH

Query:  GEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLH
         E+FNILRY++GQKY+SHYDAF+  EYGP  SQRV +FLL+L+ VEEGGETMFPFENG NM+G YDY++CVGLKVKPRQGD + FY++FPNGTID TSLH
Subjt:  GEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLH

Query:  GSCPVIKGQKWVATKWIRDQIQD
        GSCPVIKG+KWVATKWIRDQ  D
Subjt:  GSCPVIKGQKWVATKWIRDQIQD

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase5.3e-3836.64Show/hide
Query:  ISSIPFQV-------LSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLA--LRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCA
        ISS  F V       LSW PRA  +  F + E+C  ++ +AK KL  S +   +  GE+E+S   +RTS                               
Subjt:  ISSIPFQV-------LSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLA--LRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCA

Query:  ILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENG
                  SG+F +  +D+   +  +E K+A  T LP  +GEA  IL YE GQKY+ H+D F   +       R+A+ L+YL++V +GGET+FP   G
Subjt:  ILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENG

Query:  LN---MDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWI
              D S+      G  VKPR+GD LLF+++  NGT DP SLHGSCPVI+G+KW AT+WI
Subjt:  LN---MDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWI

AT4G33910.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.2e-11061.99Show/hide
Query:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDG-SEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKP
        R KLGL A + V C  CFL GF+GS+LLSQ+V   +PR R+L  V +G  E   M  G  G++SI SIPFQVLSWRPRA++FP FATAEQCQ+I+  AK 
Subjt:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDG-SEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKP

Query:  KLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEA
         L+PS LALRKGET E+TKG RTS                                         SG F SASE+ +G L  +E KIARATM+PR HGE+
Subjt:  KLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEA

Query:  FNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSC
        FNILRYE+GQKY+SHYD FNPTEYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG NM   YDY++C+GLKVKPR+GDGLLFYSVFPNGTID TSLHGSC
Subjt:  FNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSC

Query:  PVIKGQKWVATKWIRDQIQDD
        PV KG+KWVATKWIRDQ Q++
Subjt:  PVIKGQKWVATKWIRDQIQDD

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.3e-4137.45Show/hide
Query:  DDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLALRKGETEESTKG-IRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFS
        DDS +    +++SW PRA  +  F T E+C+ ++ +AKP +  ST+   K  T +ST   +RTS                                    
Subjt:  DDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAKPKLRPSTLALRKGETEESTKG-IRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFS

Query:  GNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDG
             SG F +   D+  T+  IE++I+  T +P  HGE   +L YEIGQKY  HYD F          QR+A+ L+YL+DVEEGGET+FP   G     
Subjt:  GNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDG

Query:  SY--DYQRC--VGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR
         +  +   C   GL VKP+ GD LLF+S+ P+ T+DP+SLHG C VIKG KW +TKW+R
Subjt:  SY--DYQRC--VGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGCAAAAGTGGAAAAAGTAATTGGAGCTTGAGATCGAAGCTAGGTTTGCCGGCACTTATCTTCGTTTTATGCCTTTTTTGTTTCCTCGCCGGATTCTTCGGTTC
TTCTCTTCTCTCTCAGGATGTAGATGACGATAGGCCGAGGCCAAGGTTGCTTCAATCGGTCAGCGATGGTAGCGAGTTCGATTTGATGTCTCCGGGAGAAAACGGCGACG
ATTCCATTTCGTCGATTCCTTTCCAGGTTTTGAGCTGGCGACCTCGCGCCCTTTTTTTCCCCAAGTTTGCAACTGCGGAGCAATGCCAGAGCATAGTTAATATGGCGAAA
CCTAAGCTTAGACCGTCTACCTTGGCTCTACGTAAGGGAGAAACCGAAGAGAGCACGAAAGGAATCCGAACAAGCATGGAGATAATGCTCGACAAGGTGATTCAAGGAGT
TAATCCGGTCAGACTATGGCGGTTGGGGATTTTGAATGAGGATGTGAAAGTATGTGCTATACTCGCATTCTCTGGGAACGACATTCTTTCTGGAGTGTTCTTTAGTGCTT
CAGAAGATGAAAGTGGCACTCTGGGTGTAATCGAGGAAAAAATTGCAAGGGCAACTATGCTTCCAAGGATGCATGGAGAGGCATTTAATATCTTGCGTTATGAGATTGGG
CAGAAGTATAATTCTCATTATGACGCCTTCAATCCTACTGAATATGGGCCACAGAGGAGCCAAAGGGTGGCTTCCTTCTTGTTGTACTTGACTGATGTTGAAGAAGGTGG
AGAAACCATGTTTCCATTTGAGAATGGCTTGAATATGGATGGAAGCTATGATTACCAAAGATGTGTTGGTTTGAAAGTGAAGCCACGTCAAGGTGATGGACTTCTGTTTT
ATTCTGTTTTCCCAAATGGTACAATTGATCCGACATCACTTCATGGAAGCTGTCCTGTGATTAAAGGGCAGAAATGGGTCGCTACGAAGTGGATCAGAGATCAAATACAG
GACGATTAA
mRNA sequenceShow/hide mRNA sequence
CGCGTCTGAAATCGGAAAATAGGAGCAAATAAGGAAGAGATCGGTGATTCCATCAAAGGAGAGAAAGATGAAAGGCAAAAGTGGAAAAAGTAATTGGAGCTTGAGATCGA
AGCTAGGTTTGCCGGCACTTATCTTCGTTTTATGCCTTTTTTGTTTCCTCGCCGGATTCTTCGGTTCTTCTCTTCTCTCTCAGGATGTAGATGACGATAGGCCGAGGCCA
AGGTTGCTTCAATCGGTCAGCGATGGTAGCGAGTTCGATTTGATGTCTCCGGGAGAAAACGGCGACGATTCCATTTCGTCGATTCCTTTCCAGGTTTTGAGCTGGCGACC
TCGCGCCCTTTTTTTCCCCAAGTTTGCAACTGCGGAGCAATGCCAGAGCATAGTTAATATGGCGAAACCTAAGCTTAGACCGTCTACCTTGGCTCTACGTAAGGGAGAAA
CCGAAGAGAGCACGAAAGGAATCCGAACAAGCATGGAGATAATGCTCGACAAGGTGATTCAAGGAGTTAATCCGGTCAGACTATGGCGGTTGGGGATTTTGAATGAGGAT
GTGAAAGTATGTGCTATACTCGCATTCTCTGGGAACGACATTCTTTCTGGAGTGTTCTTTAGTGCTTCAGAAGATGAAAGTGGCACTCTGGGTGTAATCGAGGAAAAAAT
TGCAAGGGCAACTATGCTTCCAAGGATGCATGGAGAGGCATTTAATATCTTGCGTTATGAGATTGGGCAGAAGTATAATTCTCATTATGACGCCTTCAATCCTACTGAAT
ATGGGCCACAGAGGAGCCAAAGGGTGGCTTCCTTCTTGTTGTACTTGACTGATGTTGAAGAAGGTGGAGAAACCATGTTTCCATTTGAGAATGGCTTGAATATGGATGGA
AGCTATGATTACCAAAGATGTGTTGGTTTGAAAGTGAAGCCACGTCAAGGTGATGGACTTCTGTTTTATTCTGTTTTCCCAAATGGTACAATTGATCCGACATCACTTCA
TGGAAGCTGTCCTGTGATTAAAGGGCAGAAATGGGTCGCTACGAAGTGGATCAGAGATCAAATACAGGACGATTAAGCGTTACTATTTACTACACAGAATGCATAATCCG
TAAATCTCTACCTCATCATTTTCTCTGTATGATAAAATGAGTTCTAATATTATAAAAACTTGATTCACATGCTCTCCTCTTTACCAGATAACATAATAGGCTAGTTATTA
ATGTTGGAATGTTTTTTCCATTCTAGTTTATTGTTTAAAAGGTTTTAATTTAGGCTATTTTGATTTTACATTCTTCAATTTAGTCTCACTGCAGAGTAGGTATTATTGAT
AGACTTAAATGGCAGATACGTGAGAATTTAAAAGGAG
Protein sequenceShow/hide protein sequence
MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMSPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMAK
PKLRPSTLALRKGETEESTKGIRTSMEIMLDKVIQGVNPVRLWRLGILNEDVKVCAILAFSGNDILSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIG
QKYNSHYDAFNPTEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCVGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQIQ
DD