; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G16500 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G16500
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprolyl 4-hydroxylase 1
Genome locationClcChr07:31018949..31038370
RNA-Seq ExpressionClc07G16500
SyntenyClc07G16500
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0012505 - endomembrane system (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453925.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo]1.3e-14167.07Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M SA MRIVFGLLTFVTVGMIIG+LLQLAF+RRLEDSIGTEFL AGRLHK QYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL
        E                                                                                              ECDYL
Subjt:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL

Query:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL
        K IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS  EKNYPMVQ                                AIEKRISVYSQIPVENGELIQVL
Subjt:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL

Query:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE
        RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDA+LFWSMGLDGQSDPNSIHGGCEVL+GE
Subjt:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE

Query:  KWSATKWMRQKSTLVP
        KWSATKWMRQKSTLVP
Subjt:  KWSATKWMRQKSTLVP

XP_022137963.1 prolyl 4-hydroxylase 1 [Momordica charantia]7.4e-14064.9Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MASAPMRIVFGLLTFVT+GMIIG+L QLAFIRRLEDS GTEFLSAGRLHK QYD  RQLPRG PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL
        E                                                                                              ECDYL
Subjt:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL

Query:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL
        +A+ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLS QEKNYPM+Q                                AIEKRISVYSQIP+ENGELIQVL
Subjt:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL

Query:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE
        RYEKNQFYKPHHDYFSDTFNLKRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVL+GE
Subjt:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE

Query:  KWSATKWMRQKSTLVP
        KWSATKWMRQKSTLVP
Subjt:  KWSATKWMRQKSTLVP

XP_022932579.1 prolyl 4-hydroxylase 1-like isoform X1 [Cucurbita moschata]1.6e-13966.51Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MAS  MRIVFGLLTFVTVGMIIG+L QLAFIRRLEDSIGTEFLSAGRLHK QYD QRQ  +GLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS+
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL
        E G+       PP+LI                                             ++P             LG                ECDYL
Subjt:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL

Query:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL
        KAIALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS QE+NYPMVQ                                AIEKRISVYSQIP+ENGELIQVL
Subjt:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL

Query:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE
        RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYL++N+EGGETYFPKAGSG CSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVL GE
Subjt:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE

Query:  KWSATKWMRQKSTLV
        KWSATKWMRQKSTL+
Subjt:  KWSATKWMRQKSTLV

XP_023539759.1 prolyl 4-hydroxylase 1-like isoform X1 [Cucurbita pepo subsp. pepo]6.2e-13966.51Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MAS  MRIVFGLLTFVTVGMIIG+L QLAFIRRLEDSIGTEFLSAGRLHK QYD QRQ  +GLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS+
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL
        E G+       PP+LI                                             ++P             LG                ECDYL
Subjt:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL

Query:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL
        KAIALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS QE+NYPMVQ                                AIEKRISVYSQIP ENGELIQVL
Subjt:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL

Query:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE
        RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYL++N+EGGETYFPKAGSG CSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVL GE
Subjt:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE

Query:  KWSATKWMRQKSTLV
        KWSATKWMRQKSTL+
Subjt:  KWSATKWMRQKSTLV

XP_038904320.1 prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida]4.9e-14468.51Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MASAPMRIVFGLLTFVTVGMIIG+LLQLAFIRRLEDSIGTEFLSAGRLHK QYDSQRQL RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL
        E                                                                                              ECDYL
Subjt:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL

Query:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL
        KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS QEKNYPMVQ                                AIEKRISVYSQIPVENGELIQVL
Subjt:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL

Query:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE
        RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVL+GE
Subjt:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE

Query:  KWSATKWMRQKSTLVP
        KWSATKWMRQKSTLVP
Subjt:  KWSATKWMRQKSTLVP

TrEMBL top hitse value%identityAlignment
A0A0A0KU17 Fe2OG dioxygenase domain-containing protein2.0e-13865.62Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M S+ MRIVFGLLTFVTVGMIIG+LLQLAF+RRLEDSIGTEFL AGRLHK QYDSQ QLPRG PNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL
                                                                                                      KECDYL
Subjt:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL

Query:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL
        K IAL RLEISTVVDTKTGKGVKSDFRTSSGMFLS  EKN+PMVQ                                AIEKRISVYSQ+PVENGELIQVL
Subjt:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL

Query:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE
        RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDP SIHGGCEVL+GE
Subjt:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE

Query:  KWSATKWMRQKSTLVP
        KWSATKWMRQKSTLVP
Subjt:  KWSATKWMRQKSTLVP

A0A1S3BXE6 prolyl 4-hydroxylase 1 isoform X36.5e-14267.07Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        M SA MRIVFGLLTFVTVGMIIG+LLQLAF+RRLEDSIGTEFL AGRLHK QYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL
        E                                                                                              ECDYL
Subjt:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL

Query:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL
        K IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS  EKNYPMVQ                                AIEKRISVYSQIPVENGELIQVL
Subjt:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL

Query:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE
        RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDA+LFWSMGLDGQSDPNSIHGGCEVL+GE
Subjt:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE

Query:  KWSATKWMRQKSTLVP
        KWSATKWMRQKSTLVP
Subjt:  KWSATKWMRQKSTLVP

A0A1S4DZZ9 prolyl 4-hydroxylase 1 isoform X12.0e-13864.14Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPR-------------------GLPNWINDKEAEILRLGYVK
        M SA MRIVFGLLTFVTVGMIIG+LLQLAF+RRLEDSIGTEFL AGRLHK QYDSQRQLPR                   GLPNWINDKEAEILRLGYVK
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPR-------------------GLPNWINDKEAEILRLGYVK

Query:  PEVVSWSPRIIVLHNFLSTEVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFS
        PEVVSWSPRIIVLHNFLSTE                                                                                
Subjt:  PEVVSWSPRIIVLHNFLSTEVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFS

Query:  ASTSSSSTKFSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEK
                      ECDYLK IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS  EKNYPMVQ                                AIEK
Subjt:  ASTSSSSTKFSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEK

Query:  RISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGL
        RISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDA+LFWSMGL
Subjt:  RISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGL

Query:  DGQSDPNSIHGGCEVLAGEKWSATKWMRQKSTLVP
        DGQSDPNSIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  DGQSDPNSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A6J1CBS4 prolyl 4-hydroxylase 13.6e-14064.9Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MASAPMRIVFGLLTFVT+GMIIG+L QLAFIRRLEDS GTEFLSAGRLHK QYD  RQLPRG PNWIND+EAEILRLGYVKPEVVSWSPRIIVLHNFLST
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL
        E                                                                                              ECDYL
Subjt:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL

Query:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL
        +A+ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLS QEKNYPM+Q                                AIEKRISVYSQIP+ENGELIQVL
Subjt:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL

Query:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE
        RYEKNQFYKPHHDYFSDTFNLKRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVL+GE
Subjt:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE

Query:  KWSATKWMRQKSTLVP
        KWSATKWMRQKSTLVP
Subjt:  KWSATKWMRQKSTLVP

A0A6J1EXD7 prolyl 4-hydroxylase 1-like isoform X18.0e-14066.51Show/hide
Query:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST
        MAS  MRIVFGLLTFVTVGMIIG+L QLAFIRRLEDSIGTEFLSAGRLHK QYD QRQ  +GLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS+
Subjt:  MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLST

Query:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL
        E G+       PP+LI                                             ++P             LG                ECDYL
Subjt:  EVGQFHIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYL

Query:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL
        KAIALPRLEISTVVDTKTGKG+KSDFRTSSGMFLS QE+NYPMVQ                                AIEKRISVYSQIP+ENGELIQVL
Subjt:  KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVL

Query:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE
        RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYL++N+EGGETYFPKAGSG CSCGGKTVPGLSVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVL GE
Subjt:  RYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGE

Query:  KWSATKWMRQKSTLV
        KWSATKWMRQKSTL+
Subjt:  KWSATKWMRQKSTLV

SwissProt top hitse value%identityAlignment
F4JNU8 Probable prolyl 4-hydroxylase 88.7e-4345.78Show/hide
Query:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP
        F   +EC++L ++A P +  S VVD KTGK + S  RTSSG FL++                             ++ +E        IE RIS ++ IP
Subjt:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP

Query:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSMGLD
         ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D
Subjt:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSMGLD

Query:  GQSDPNSIHGGCEVLAGEKWSATKW
           DP+S+HGGC V+ G KWS+TKW
Subjt:  GQSDPNSIHGGCEVLAGEKWSATKW

F4JZ24 Probable prolyl 4-hydroxylase 102.5e-4546.52Show/hide
Query:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP
        F  ++EC YL  +A P +E STVVD KTGK   S  RTSSG FL++                            R+K +         IEKRIS ++ IP
Subjt:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP

Query:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFWSM
        VE+GE +QVL YE  Q Y+PH+DYF D +N + GGQRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM
Subjt:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFWSM

Query:  GLDGQSDPNSIHGGCEVLAGEKWSATKWMR
          D   DP+S+HGGC V+ G KWS+TKW+R
Subjt:  GLDGQSDPNSIHGGCEVLAGEKWSATKWMR

Q24JN5 Prolyl 4-hydroxylase 52.8e-4146.22Show/hide
Query:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP
        F   +EC++L ++A P +  STVVD KTG    S  RTSSG FL                              R    E  E+    IEKRIS ++ IP
Subjt:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP

Query:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLD
        VENGE +QVL Y+  Q Y+PH+DYF D FN K GGQRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D
Subjt:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLD

Query:  GQSDPNSIHGGCEVLAGEKWSATKW
           DP+S+HGGC V+ G KWS+TKW
Subjt:  GQSDPNSIHGGCEVLAGEKWSATKW

Q9LN20 Probable prolyl 4-hydroxylase 32.5e-4546.7Show/hide
Query:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP
        F  ++EC+YL ++A P +  STVVD++TGK   S  RTSSG FL +                            R+K ++        IEKRI+ Y+ IP
Subjt:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP

Query:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSMGL
         ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  
Subjt:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSMGL

Query:  DGQSDPNSIHGGCEVLAGEKWSATKWM
        D   DP S+HGGC V+ G KWS+TKWM
Subjt:  DGQSDPNSIHGGCEVLAGEKWSATKWM

Q9ZW86 Prolyl 4-hydroxylase 11.1e-10652.7Show/hide
Query:  MRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEVGQF
        M+IVFGLLTFVTVGM+IGSLLQLAFI RLEDS GT F S   L  ++  + R L R +  W NDK+AE+LR+G VKPEVVSWSPRIIVLH+FLS E    
Subjt:  MRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEVGQF

Query:  HIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYLKAIAL
                                                                                                  EC+YLKAIA 
Subjt:  HIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYLKAIAL

Query:  PRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVLRYEKN
        PRL++STVVD KTGKGVKSD RTSSGMFL+  E++YP++Q                                AIEKRI+V+SQ+P ENGELIQVLRYE  
Subjt:  PRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVLRYEKN

Query:  QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSAT
        QFYKPHHDYF+DTFNLKRGGQR+ATMLMYL++++EGGETYFP AG G+C+CGGK + G+SVKP KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSAT
Subjt:  QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSAT

Query:  KWMRQKST
        KWMRQK+T
Subjt:  KWMRQKST

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.7e-4646.7Show/hide
Query:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP
        F  ++EC+YL ++A P +  STVVD++TGK   S  RTSSG FL +                            R+K ++        IEKRI+ Y+ IP
Subjt:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP

Query:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSMGL
         ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM  
Subjt:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDAVLFWSMGL

Query:  DGQSDPNSIHGGCEVLAGEKWSATKWM
        D   DP S+HGGC V+ G KWS+TKWM
Subjt:  DGQSDPNSIHGGCEVLAGEKWSATKWM

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.0e-4246.22Show/hide
Query:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP
        F   +EC++L ++A P +  STVVD KTG    S  RTSSG FL                              R    E  E+    IEKRIS ++ IP
Subjt:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP

Query:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLD
        VENGE +QVL Y+  Q Y+PH+DYF D FN K GGQRIAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M  D
Subjt:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSMGLD

Query:  GQSDPNSIHGGCEVLAGEKWSATKW
           DP+S+HGGC V+ G KWS+TKW
Subjt:  GQSDPNSIHGGCEVLAGEKWSATKW

AT2G43080.1 P4H isoform 17.9e-10852.7Show/hide
Query:  MRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEVGQF
        M+IVFGLLTFVTVGM+IGSLLQLAFI RLEDS GT F S   L  ++  + R L R +  W NDK+AE+LR+G VKPEVVSWSPRIIVLH+FLS E    
Subjt:  MRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEVGQF

Query:  HIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYLKAIAL
                                                                                                  EC+YLKAIA 
Subjt:  HIGNQTPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYLKAIAL

Query:  PRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVLRYEKN
        PRL++STVVD KTGKGVKSD RTSSGMFL+  E++YP++Q                                AIEKRI+V+SQ+P ENGELIQVLRYE  
Subjt:  PRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVLRYEKN

Query:  QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSAT
        QFYKPHHDYF+DTFNLKRGGQR+ATMLMYL++++EGGETYFP AG G+C+CGGK + G+SVKP KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSAT
Subjt:  QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSAT

Query:  KWMRQKST
        KWMRQK+T
Subjt:  KWMRQKST

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.2e-4445.78Show/hide
Query:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP
        F   +EC++L ++A P +  S VVD KTGK + S  RTSSG FL++                             ++ +E        IE RIS ++ IP
Subjt:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP

Query:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSMGLD
         ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQRIAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM  D
Subjt:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTVPGLSVKPAKGDAVLFWSMGLD

Query:  GQSDPNSIHGGCEVLAGEKWSATKW
           DP+S+HGGC V+ G KWS+TKW
Subjt:  GQSDPNSIHGGCEVLAGEKWSATKW

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.7e-4646.52Show/hide
Query:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP
        F  ++EC YL  +A P +E STVVD KTGK   S  RTSSG FL++                            R+K +         IEKRIS ++ IP
Subjt:  FSRRKECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIP

Query:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFWSM
        VE+GE +QVL YE  Q Y+PH+DYF D +N + GGQRIAT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFWSM
Subjt:  VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTVPGLSVKPAKGDAVLFWSM

Query:  GLDGQSDPNSIHGGCEVLAGEKWSATKWMR
          D   DP+S+HGGC V+ G KWS+TKW+R
Subjt:  GLDGQSDPNSIHGGCEVLAGEKWSATKWMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCCGCTCCGATGAGGATTGTCTTCGGTCTTCTCACCTTCGTCACCGTCGGCATGATCATCGGTTCTTTGTTGCAACTAGCATTTATAAGAAGGCTGGAGGACTC
TATTGGCACGGAGTTTCTATCTGCTGGAAGGTTACATAAAATTCAGTATGATAGCCAACGTCAATTACCCCGAGGCCTTCCTAATTGGATTAACGACAAAGAAGCAGAAA
TTCTTCGTCTTGGCTATGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCATTGTATTGCATAATTTTTTGAGCACAGAGGTTGGACAGTTTCATATTGGGAACCAA
ACTCCACCAGAGCTCATAGCTGTTACCAACAAAGACAATCCCTCAACCGAAGCCATTTCAAACCAAGAGCACAAAGAATGGATAATTGTGGATCAAGCCCTTTTAGCATG
GCTATATGAAGGGATGGACATAATAGTAACACACATGGTACTTTTGAAGCCATGGAGCAATGATACACATCTAGGTGAAGGAGAAGGGGCTCTTGGTTTCTCTGCTTCAA
CTTCATCTTCCTCAACAAAGTTCTCTAGGAGGAAGGAGTGCGACTACCTTAAGGCAATAGCACTTCCTCGCCTTGAAATTTCCACCGTCGTGGATACAAAAACTGGGAAG
GGAGTTAAGAGTGATTTCAGAACAAGCTCTGGAATGTTTTTAAGTCAACAAGAGAAAAATTATCCAATGGTCCAGTGTGTAAGGTATCAATCTCTTCCTTCAGAAGCTCC
TTGTCCTGACCAGCCTCCCAGAAGGAACAAGTTTATGGAGAATCCGGAAATTGGGAATTACGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATACCAGTCGAAAATG
GAGAGCTCATTCAAGTGTTAAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGAATAGCAACT
ATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGCTGTGGTGGGAAGACCGTTCCTGGACTATCAGTTAA
ACCAGCCAAAGGGGATGCAGTGCTTTTCTGGAGCATGGGCTTAGATGGACAATCAGATCCAAATAGCATTCATGGAGGGTGTGAAGTACTGGCAGGGGAAAAATGGTCTG
CCACCAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAA
mRNA sequenceShow/hide mRNA sequence
AACAAGAAAGAAGAAGAACAAGAAAGAAGAAGAACAAGAAAAAGCAGTAGCGAATCCTCTGCAGTGGGTCATCTCATTGGGTCCATGAGCTGATTTCTATCTTCCACACT
CGAAAGACTAACGGCTTCTCATTCTTCACCTGAATTTGGAGTCAAAATTTCGTTTAGGCAGCCATGGCCTCCGCTCCGATGAGGATTGTCTTCGGTCTTCTCACCTTCGT
CACCGTCGGCATGATCATCGGTTCTTTGTTGCAACTAGCATTTATAAGAAGGCTGGAGGACTCTATTGGCACGGAGTTTCTATCTGCTGGAAGGTTACATAAAATTCAGT
ATGATAGCCAACGTCAATTACCCCGAGGCCTTCCTAATTGGATTAACGACAAAGAAGCAGAAATTCTTCGTCTTGGCTATGTTAAACCAGAAGTAGTAAGCTGGTCACCA
CGAATCATTGTATTGCATAATTTTTTGAGCACAGAGGTTGGACAGTTTCATATTGGGAACCAAACTCCACCAGAGCTCATAGCTGTTACCAACAAAGACAATCCCTCAAC
CGAAGCCATTTCAAACCAAGAGCACAAAGAATGGATAATTGTGGATCAAGCCCTTTTAGCATGGCTATATGAAGGGATGGACATAATAGTAACACACATGGTACTTTTGA
AGCCATGGAGCAATGATACACATCTAGGTGAAGGAGAAGGGGCTCTTGGTTTCTCTGCTTCAACTTCATCTTCCTCAACAAAGTTCTCTAGGAGGAAGGAGTGCGACTAC
CTTAAGGCAATAGCACTTCCTCGCCTTGAAATTTCCACCGTCGTGGATACAAAAACTGGGAAGGGAGTTAAGAGTGATTTCAGAACAAGCTCTGGAATGTTTTTAAGTCA
ACAAGAGAAAAATTATCCAATGGTCCAGTGTGTAAGGTATCAATCTCTTCCTTCAGAAGCTCCTTGTCCTGACCAGCCTCCCAGAAGGAACAAGTTTATGGAGAATCCGG
AAATTGGGAATTACGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATACCAGTCGAAAATGGAGAGCTCATTCAAGTGTTAAGGTACGAGAAGAATCAATTTTACAAG
CCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGAATAGCAACTATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACCTACTT
TCCGAAGGCTGGTTCTGGTGAGTGTAGCTGTGGTGGGAAGACCGTTCCTGGACTATCAGTTAAACCAGCCAAAGGGGATGCAGTGCTTTTCTGGAGCATGGGCTTAGATG
GACAATCAGATCCAAATAGCATTCATGGAGGGTGTGAAGTACTGGCAGGGGAAAAATGGTCTGCCACCAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAATTCAAA
CTTTCTAGTTCCATTGTATTGTATATCAGCATTGAATATTTTGTTACATATCAACTAATAAATCTATAGAGAGAGGAAGAAAAAATGGAGAGCCTAATTTAGATAGCATC
TTAACATAATTAATAACACTTAGAGTGAATCAATTTATTTAATGAATAATACAATACAGCAGCATCTGATGTTATTTT
Protein sequenceShow/hide protein sequence
MASAPMRIVFGLLTFVTVGMIIGSLLQLAFIRRLEDSIGTEFLSAGRLHKIQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTEVGQFHIGNQ
TPPELIAVTNKDNPSTEAISNQEHKEWIIVDQALLAWLYEGMDIIVTHMVLLKPWSNDTHLGEGEGALGFSASTSSSSTKFSRRKECDYLKAIALPRLEISTVVDTKTGK
GVKSDFRTSSGMFLSQQEKNYPMVQCVRYQSLPSEAPCPDQPPRRNKFMENPEIGNYAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIAT
MLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQKSTLVP