CuGenDBv2

Lsi01G009200 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G009200
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome location: chr01:7432084..7439300
RNA-Seq Expression: Lsi01G009200
Synteny: Lsi01G009200
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0080147 - root hair cell development (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016020 - membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
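
The genome location above packs a sequence name and two coordinates into one string. The following is a minimal Python sketch for splitting such a string and computing the span it covers. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string like 'chr01:7432084..7439300' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr01:7432084..7439300")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption, the gene region spans 7,217 bases on chr01.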


Homology
GenBank top hits    e value    % identity
XP_004141886.1 probable prolyl 4-hydroxylase 9 [Cucumis sativus]    1.2e-144    73.74
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSG+SNWSLRSKLGLPALIFVLCLFCFLAGFFGS+LLSQDVDDDRPR RLLQS SD +EFDLM+ GENGDDSISSIPFQVLSWRPRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QCQSIVN+A+PKLRPSTLALRKGET E+TKG+RT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSGVFFSASEDESGTLGVIEEKIARATM+PR HGEA+NILRYEIGQKYNSHYDAF PSEYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG+Y+
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        +Q CIGLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPV+KGQKWVATKWIRDQ+Q+D
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

XP_008440351.1 PREDICTED: probable prolyl 4-hydroxylase 9 [Cucumis melo]    5.4e-145    74.58
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK KSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPR RLLQS SDG+EFDLM+ GENGD SISSIPFQVLSWRPRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QCQSIVNMA+PKLRPSTLALRKGET ENTKGIRT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSGVFFSASEDESG LGVIEEKIARATM+PR HGEA+NILRYEIGQKYNSHYDAF PSEYGPQ+SQRVASFLLYLTDVEEGGETMFPFENG NMDG+Y+
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        YQ C+GLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPV+KGQKWVATKWIRDQ QDD
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

XP_022977335.1 probable prolyl 4-hydroxylase 9 [Cucurbita maxima]    1.4e-145    74.86
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSGKS+ SLRSKLGLPA+IFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSV+ GSEFDLM  GENGDDSIS IPFQ+LSW+PRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QC++IVNMARP+LRPS+LALRKGETEENTKGIRT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSGVFFSASEDESGTLG IEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG YD
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        YQ+C GLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPVVKGQKWVATKWIRDQ+Q+D
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

XP_023543701.1 probable prolyl 4-hydroxylase 9 [Cucurbita pepo subsp. pepo]    8.3e-146    75.14
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK KSGKS+ SLRSKLGLPA+IFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSV+ GSEFDLM  GENGDDSIS IPFQVLSW+PRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QC++IVNMARP+LRPS+LALRKGETEENTKGIRT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSGVFFSASEDESGTLG IEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG YD
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        YQ+C GLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPVVKGQKWVATKWIRDQ+QDD
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

XP_038881540.1 probable prolyl 4-hydroxylase 9 [Benincasa hispida]    8.5e-151    77.65
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDV+DDRP PRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QC SIVNMARPKLRPSTLALRKGETEENTKGIRT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSG FF ASEDESGTLGVIEEKIARATM+PR HGEAFNILRYEIGQKYNSHYDAFNPSEYGPQ+SQRVASFLLYLTDV+EGGETMFPFENG NMDG YD
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        +QRCIGLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPVVKGQKWVATKWIRDQIQDD
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

TrEMBL top hits    e value    % identity
A0A0A0KGG4 Fe2OG dioxygenase domain-containing protein    5.8e-145    73.74
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSG+SNWSLRSKLGLPALIFVLCLFCFLAGFFGS+LLSQDVDDDRPR RLLQS SD +EFDLM+ GENGDDSISSIPFQVLSWRPRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QCQSIVN+A+PKLRPSTLALRKGET E+TKG+RT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSGVFFSASEDESGTLGVIEEKIARATM+PR HGEA+NILRYEIGQKYNSHYDAF PSEYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG+Y+
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        +Q CIGLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPV+KGQKWVATKWIRDQ+Q+D
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

A0A1S3B0H7 probable prolyl 4-hydroxylase 9    2.6e-145    74.58
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK KSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPR RLLQS SDG+EFDLM+ GENGD SISSIPFQVLSWRPRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QCQSIVNMA+PKLRPSTLALRKGET ENTKGIRT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSGVFFSASEDESG LGVIEEKIARATM+PR HGEA+NILRYEIGQKYNSHYDAF PSEYGPQ+SQRVASFLLYLTDVEEGGETMFPFENG NMDG+Y+
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        YQ C+GLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPV+KGQKWVATKWIRDQ QDD
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

A0A6J1GEZ7 probable prolyl 4-hydroxylase 9    2.2e-144    74.3
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK KSGKS  SLRSKLGLPA+IFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSV+ GS+FDLM  GENGDDSIS IPFQVLSW+PRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QC++IVNMARP+LRPS+LALRKGETEENTKGIRT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSGVFFSASEDESGTLG IEEKIAR TMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQ+SQRVASFLLYLTDVE+GGETMFPFENGLNMDG YD
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        YQ+C GLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPVVKGQKWVATKWIRDQ+QDD
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

A0A6J1HHX9 probable prolyl 4-hydroxylase 9    1.1e-140    72.55
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MK  +GKSNWS RSKLGLP  IF+LCLFCFLAGFFGSSLLSQDVDDDRP PRLLQSVSD SEFDLMT GE GDDSISSIPFQ+LSWRPRAL FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QCQSIVNMARP L+PS LALRKGET+E+TKG RT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSGVF SASEDESGTL  IEEKIARATMLPR HGEAFN+LRYEIGQKYN+HYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPF+NGLNMDG+Y 
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQD
        YQRCIGLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPV+KGQKWVATKWIRDQI+D
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQD

A0A6J1IM09 probable prolyl 4-hydroxylase 9    6.8e-146    74.86
Query:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE
        MKGKSGKS+ SLRSKLGLPA+IFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSV+ GSEFDLM  GENGDDSIS IPFQ+LSW+PRAL+FPKFATAE
Subjt:  MKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAE

Query:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP
        QC++IVNMARP+LRPS+LALRKGETEENTKGIRT                                                                  
Subjt:  QCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVP

Query:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD
         SSGVFFSASEDESGTLG IEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQ+SQRVASFLLYLTDVEEGGETMFPFENGLNMDG YD
Subjt:  SSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYD

Query:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        YQ+C GLKVKPRQGDGLLFYSVFPNGTID TSLHGSCPVVKGQKWVATKWIRDQ+Q+D
Subjt:  YQRCIGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

SwissProt top hits    e value    % identity
F4ILF8 Prolyl 4-hydroxylase 13    4.3e-81    46.42
Query:  NWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNM
        ++    KL  P +    C F  + GF   +L SQ +      P   +SV+D  E D +  G     S+S+IPF  LSW PR  + P FAT +QC+++++M
Subjt:  NWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNM

Query:  ARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFS
        A+PKL+PSTLALRKGET E T+  R+                                                   LH   D                 
Subjt:  ARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFS

Query:  ASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCIGLK
          EDESG L  IEEKIA AT  P+ + E+FNILRY++GQKY+SHYDAF+ +EYGP  SQRV +FLL+L+ VEEGGETMFPFENG NM+G YDY++C+GLK
Subjt:  ASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCIGLK

Query:  VKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQD
        VKPRQGD + FY++FPNGTID+TSLHGSCPV+KG+KWVATKWIRDQ  D
Subjt:  VKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQD

F4JZ24 Probable prolyl 4-hydroxylase 10    7.3e-36    33.1
Query:  DDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFY
        DDS +    +++SW PRA  +  F T E+C+ ++ +A+P +  ST+   K  T ++T                                           
Subjt:  DDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFY

Query:  SGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASF
                  D+R             V +SSG F +   D+  T+  IE++I+  T +P  HGE   +L YEIGQKY  HYD F          QR+A+ 
Subjt:  SGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASF

Query:  LLYLTDVEEGGETMFPFENGLNMDGSY--DYQRC--IGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIR
        L+YL+DVEEGGET+FP   G      +  +   C   GL VKP+ GD LLF+S+ P+ T+D +SLHG C V+KG KW +TKW+R
Subjt:  LLYLTDVEEGGETMFPFENGLNMDGSY--DYQRC--IGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIR

Q24JN5 Prolyl 4-hydroxylase 5    7.5e-33    29.45
Query:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPK
        RS      LI +L +   L G    SL + + +  +     L ++   SE        NG+  +     +V+SW PRA+ +  F T E+C+ ++++A+P 
Subjt:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPK

Query:  LRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASED
        +  ST+       +E T G                                                  KD+R             V +SSG F     D
Subjt:  LRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASED

Query:  ESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGS---YDYQRC--IGL
        E   + VIE++I+  T +P  +GE   +L Y++GQKY  HYD F          QR+A+ L+YL+DV++GGET+FP   G N+       +  +C   GL
Subjt:  ESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGS---YDYQRC--IGL

Query:  KVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKW
         V P++ D LLF+++ P+ ++D +SLHG CPVVKG KW +TKW
Subjt:  KVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKW

Q8VZJ7 Probable prolyl 4-hydroxylase 9    6.7e-106    57.35
Query:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDG-SEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARP
        R KLGL A + V C  CFL GF+GS+LLSQ+V   +PR R+L  V +G  E   M  G  G++SI SIPFQVLSWRPRA++FP FATAEQCQ+I+  A+ 
Subjt:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDG-SEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARP

Query:  KLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASE
         L+PS LALRKGET ENTKG RT                                                                   SSG F SASE
Subjt:  KLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASE

Query:  DESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCIGLKVKP
        + +G L  +E KIARATM+PR HGE+FNILRYE+GQKY+SHYD FNP+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG NM   YDY++CIGLKVKP
Subjt:  DESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCIGLKVKP

Query:  RQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        R+GDGLLFYSVFPNGTID+TSLHGSCPV KG+KWVATKWIRDQ Q++
Subjt:  RQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

Q9LN20 Probable prolyl 4-hydroxylase 3    9.5e-36    30.95
Query:  LIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPKLRPSTLAL
        ++F+L +   +   FG  + S  +++D   P  L      +       G+ GD        +VLSW PRA  +  F + E+C+ ++++A+P +  ST+  
Subjt:  LIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPKLRPSTLAL

Query:  RKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASEDESGTLGVI
           ET ++                                                     KD+R             V +SSG F     D+   +  I
Subjt:  RKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASEDESGTLGVI

Query:  EEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCI-----GLKVKPRQGD
        E++IA  T +P  HGE   +L YE GQKY  HYD F          QR+A+ L+YL+DVEEGGET+FP  N +N      Y         GL VKPR GD
Subjt:  EEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCI-----GLKVKPRQGD

Query:  GLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWI
         LLF+S+ P+ T+D TSLHG CPV++G KW +TKW+
Subjt:  GLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWI

Arabidopsis top hits    e value    % identity
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein    6.7e-37    30.95
Query:  LIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPKLRPSTLAL
        ++F+L +   +   FG  + S  +++D   P  L      +       G+ GD        +VLSW PRA  +  F + E+C+ ++++A+P +  ST+  
Subjt:  LIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPKLRPSTLAL

Query:  RKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASEDESGTLGVI
           ET ++                                                     KD+R             V +SSG F     D+   +  I
Subjt:  RKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASEDESGTLGVI

Query:  EEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCI-----GLKVKPRQGD
        E++IA  T +P  HGE   +L YE GQKY  HYD F          QR+A+ L+YL+DVEEGGET+FP  N +N      Y         GL VKPR GD
Subjt:  EEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCI-----GLKVKPRQGD

Query:  GLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWI
         LLF+S+ P+ T+D TSLHG CPV++G KW +TKW+
Subjt:  GLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWI

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein    5.3e-34    29.45
Query:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPK
        RS      LI +L +   L G    SL + + +  +     L ++   SE        NG+  +     +V+SW PRA+ +  F T E+C+ ++++A+P 
Subjt:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPK

Query:  LRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASED
        +  ST+       +E T G                                                  KD+R             V +SSG F     D
Subjt:  LRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASED

Query:  ESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGS---YDYQRC--IGL
        E   + VIE++I+  T +P  +GE   +L Y++GQKY  HYD F          QR+A+ L+YL+DV++GGET+FP   G N+       +  +C   GL
Subjt:  ESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGS---YDYQRC--IGL

Query:  KVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKW
         V P++ D LLF+++ P+ ++D +SLHG CPVVKG KW +TKW
Subjt:  KVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKW

AT2G23096.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein    3.1e-82    46.42
Query:  NWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNM
        ++    KL  P +    C F  + GF   +L SQ +      P   +SV+D  E D +  G     S+S+IPF  LSW PR  + P FAT +QC+++++M
Subjt:  NWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNM

Query:  ARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFS
        A+PKL+PSTLALRKGET E T+  R+                                                   LH   D                 
Subjt:  ARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFS

Query:  ASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCIGLK
          EDESG L  IEEKIA AT  P+ + E+FNILRY++GQKY+SHYDAF+ +EYGP  SQRV +FLL+L+ VEEGGETMFPFENG NM+G YDY++C+GLK
Subjt:  ASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCIGLK

Query:  VKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQD
        VKPRQGD + FY++FPNGTID+TSLHGSCPV+KG+KWVATKWIRDQ  D
Subjt:  VKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQD

AT4G33910.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein    4.7e-107    57.35
Query:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDG-SEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARP
        R KLGL A + V C  CFL GF+GS+LLSQ+V   +PR R+L  V +G  E   M  G  G++SI SIPFQVLSWRPRA++FP FATAEQCQ+I+  A+ 
Subjt:  RSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDG-SEFDLMTPGENGDDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARP

Query:  KLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASE
         L+PS LALRKGET ENTKG RT                                                                   SSG F SASE
Subjt:  KLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASE

Query:  DESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCIGLKVKP
        + +G L  +E KIARATM+PR HGE+FNILRYE+GQKY+SHYD FNP+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG NM   YDY++CIGLKVKP
Subjt:  DESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCIGLKVKP

Query:  RQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
        R+GDGLLFYSVFPNGTID+TSLHGSCPV KG+KWVATKWIRDQ Q++
Subjt:  RQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein    5.2e-37    33.1
Query:  DDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFY
        DDS +    +++SW PRA  +  F T E+C+ ++ +A+P +  ST+   K  T ++T                                           
Subjt:  DDSISSIPFQVLSWRPRALFFPKFATAEQCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFY

Query:  SGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASF
                  D+R             V +SSG F +   D+  T+  IE++I+  T +P  HGE   +L YEIGQKY  HYD F          QR+A+ 
Subjt:  SGYRDIEFCKDARLHFQFDFLPPSFSVPSSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASF

Query:  LLYLTDVEEGGETMFPFENGLNMDGSY--DYQRC--IGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIR
        L+YL+DVEEGGET+FP   G      +  +   C   GL VKP+ GD LLF+S+ P+ T+D +SLHG C V+KG KW +TKW+R
Subjt:  LLYLTDVEEGGETMFPFENGLNMDGSY--DYQRC--IGLKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIR
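
In the alignments above, the unlabeled line between each Query and Subjt pair is the match line. By the usual BLAST protein convention, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following Python sketch tallies those symbols for one match line. The function name `midline_stats` is hypothetical, and the sketch is not claimed to reproduce the % identity values listed in this record, which may be computed over a different length.

```python
def midline_stats(midline: str) -> dict[str, int]:
    """Count identities and positives in one BLAST-style match line.

    Pass the match line with its alignment-column padding intact:
    stripping trailing spaces would undercount the number of columns.
    """
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    conservative = midline.count("+")                 # '+' = conservative substitution
    return {
        "identities": identities,
        "positives": identities + conservative,
        "columns": len(midline),
    }


# First ten columns of the match line for the Cucumis sativus GenBank hit.
print(midline_stats("MKGKSG+SNW"))
```

Summing these counts over all blocks of a hit and dividing identities by the total number of columns gives one possible identity estimate for the displayed region.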


Sequences
CDS sequence
ATGGATAGGATCTGCCCTAAGGGTAGGGAGCAAATAAGGAAGAGATCGGTGATTCCATCAAAGGAAAGAAAGATGAAAGGCAAAAGCGGAAAAAGTAATTGGAGCTTGAG
ATCGAAGCTAGGTTTGCCGGCACTCATCTTCGTTTTATGCCTTTTTTGTTTCCTCGCCGGATTCTTCGGTTCTTCTCTTCTCTCTCAGGATGTAGATGACGATAGGCCGA
GGCCGAGGTTGCTTCAATCGGTCAGCGATGGTAGCGAGTTCGATTTGATGACTCCGGGAGAAAACGGCGATGATTCCATTTCTTCGATTCCGTTTCAGGTTTTAAGCTGG
CGACCTCGCGCCCTTTTTTTCCCCAAATTTGCAACTGCGGAGCAATGCCAGAGCATAGTTAATATGGCGAGACCGAAGCTTAGACCGTCTACCTTGGCTCTACGTAAGGG
AGAAACCGAAGAGAACACGAAAGGAATCCGAACAAGGATGGAGTTAATGATCGATGAGGTGATTCAAGGAGTTAATCCGGTCAGACTGTGGGGATTGGGGATTTTGAATG
AGGATGAATACACGGATCTTCTAAGTTCTTGGTTTTACTCTGGTTATAGGGATATAGAATTTTGTAAAGATGCTCGATTACATTTTCAATTTGATTTTCTCCCTCCCTCT
TTCTCTGTCCCAAGCAGTTCTGGAGTGTTCTTTAGTGCTTCAGAAGATGAAAGTGGGACTCTGGGTGTAATTGAGGAAAAAATTGCAAGGGCAACTATGCTTCCAAGGAT
GCACGGAGAGGCATTTAATATCTTGCGTTATGAGATTGGGCAGAAGTATAATTCTCATTATGATGCCTTCAATCCTTCTGAATATGGGCCACAGAGGAGCCAAAGGGTGG
CTTCCTTCTTGTTGTACTTGACTGATGTTGAAGAAGGTGGAGAAACCATGTTTCCATTTGAGAATGGCTTGAATATGGATGGAAGCTATGATTACCAAAGATGTATTGGT
TTGAAAGTAAAGCCACGTCAAGGTGATGGACTTCTGTTTTATTCTGTTTTCCCAAATGGTACAATCGATCGGACATCACTTCATGGAAGCTGTCCTGTGGTCAAAGGGCA
GAAATGGGTCGCTACGAAGTGGATCAGAGATCAAATACAGGACGATTAA
mRNA sequence
ATGGATAGGATCTGCCCTAAGGGTAGGGAGCAAATAAGGAAGAGATCGGTGATTCCATCAAAGGAAAGAAAGATGAAAGGCAAAAGCGGAAAAAGTAATTGGAGCTTGAG
ATCGAAGCTAGGTTTGCCGGCACTCATCTTCGTTTTATGCCTTTTTTGTTTCCTCGCCGGATTCTTCGGTTCTTCTCTTCTCTCTCAGGATGTAGATGACGATAGGCCGA
GGCCGAGGTTGCTTCAATCGGTCAGCGATGGTAGCGAGTTCGATTTGATGACTCCGGGAGAAAACGGCGATGATTCCATTTCTTCGATTCCGTTTCAGGTTTTAAGCTGG
CGACCTCGCGCCCTTTTTTTCCCCAAATTTGCAACTGCGGAGCAATGCCAGAGCATAGTTAATATGGCGAGACCGAAGCTTAGACCGTCTACCTTGGCTCTACGTAAGGG
AGAAACCGAAGAGAACACGAAAGGAATCCGAACAAGGATGGAGTTAATGATCGATGAGGTGATTCAAGGAGTTAATCCGGTCAGACTGTGGGGATTGGGGATTTTGAATG
AGGATGAATACACGGATCTTCTAAGTTCTTGGTTTTACTCTGGTTATAGGGATATAGAATTTTGTAAAGATGCTCGATTACATTTTCAATTTGATTTTCTCCCTCCCTCT
TTCTCTGTCCCAAGCAGTTCTGGAGTGTTCTTTAGTGCTTCAGAAGATGAAAGTGGGACTCTGGGTGTAATTGAGGAAAAAATTGCAAGGGCAACTATGCTTCCAAGGAT
GCACGGAGAGGCATTTAATATCTTGCGTTATGAGATTGGGCAGAAGTATAATTCTCATTATGATGCCTTCAATCCTTCTGAATATGGGCCACAGAGGAGCCAAAGGGTGG
CTTCCTTCTTGTTGTACTTGACTGATGTTGAAGAAGGTGGAGAAACCATGTTTCCATTTGAGAATGGCTTGAATATGGATGGAAGCTATGATTACCAAAGATGTATTGGT
TTGAAAGTAAAGCCACGTCAAGGTGATGGACTTCTGTTTTATTCTGTTTTCCCAAATGGTACAATCGATCGGACATCACTTCATGGAAGCTGTCCTGTGGTCAAAGGGCA
GAAATGGGTCGCTACGAAGTGGATCAGAGATCAAATACAGGACGATTAAGCGTTACTATTTACTACACATGATGCATAATCCGTAAATCTCTACCTCATCATCCCCCAGA
AAACCCTCGATCTCCAAGCCCCTTGTATAGGTTGGGCAAGCTTGGATGCCCCTACTCCCAACAAGTTGCTGGTCGGGATCGTGGAACTATCGCGGGTCGCTTGAGTGTCA
AAACCAAGCGGTGGTTGTTTTTCCTTGGCTTATCGAACCGTAGAGAAAGCCTCAATTTGCCGATGCTTTGTATGATAAAATGAATTTTAATATTATAAAAACTTGATTCC
CATGCTCTCCTCTTTACCAGATAACATAATAAGCTAGTTATTAATGTTGGAATGTTGTTCCATTCTAGTTTATTGCTTAAAAGGTTTAGGCAAATGGGTTACTATTTACA
TGGCAGATAGGTGAGAATTTAAAAGGAGGACAAATCATTTTAGAATGAAGTTTTTAATATTAATAGAAATCTGAGTTTCTTATTTT
Protein sequence
MDRICPKGREQIRKRSVIPSKERKMKGKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRPRLLQSVSDGSEFDLMTPGENGDDSISSIPFQVLSW
RPRALFFPKFATAEQCQSIVNMARPKLRPSTLALRKGETEENTKGIRTRMELMIDEVIQGVNPVRLWGLGILNEDEYTDLLSSWFYSGYRDIEFCKDARLHFQFDFLPPS
FSVPSSSGVFFSASEDESGTLGVIEEKIARATMLPRMHGEAFNILRYEIGQKYNSHYDAFNPSEYGPQRSQRVASFLLYLTDVEEGGETMFPFENGLNMDGSYDYQRCIG
LKVKPRQGDGLLFYSVFPNGTIDRTSLHGSCPVVKGQKWVATKWIRDQIQDD
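
The protein sequence above corresponds to the CDS read codon by codon. The following Python sketch shows that relationship using the standard genetic code. The `translate` helper is hypothetical, it assumes the standard nuclear code applies to this gene, and it handles only unambiguous uppercase or lowercase A/C/G/T input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the CDS listed in this record.
print(translate("ATGGATAGGATCTGCCCTAAGGGTAGGGAG"))
```

Applied to the first 30 bases of the CDS, this yields MDRICPKGRE, matching the start of the protein sequence.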