CuGenDBv2

CmoCh12G011650 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh12G011650
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Prolyl 4-hydroxylase
Genome location: Cmo_Chr12:10541259..10550882
RNA-Seq Expression: CmoCh12G011650
Synteny: CmoCh12G011650
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
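
The genome location in this record is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state its coordinate convention), the span of the gene model could be computed as shown here. The function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Cmo_Chr12:10541259..10550882")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)  # Cmo_Chr12 9624
```

Under a 0-based, half-open convention the span would instead be end - start, so the convention should be confirmed before relying on the number.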


Homology
GenBank top hits (e value, % identity, alignment)
KAG6586330.1 putative prolyl 4-hydroxylase 3, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 97.55)
Query:  MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL
        MAVSKGKYIKFQGRKWSTFKLSKIIMVF+LALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL
Subjt:  MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL
        YLISLAKPYMEKS+VVD KTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL
Subjt:  YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPYQVIVGTSVIDPTK
        MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSL PKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPYQVIVGT VIDPTK
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPYQVIVGTSVIDPTK

Query:  EAGRVIKDDTSVGKLSSPTGGTLMSGLRYSSNIDLGGLDGTTIAYDFTDVAAYRNRGLILLKLIRSLMAVSKGKYVKFQARKWSTFKLSKIIMAFLLALG
        EAGRVIKDDTSVGKLSSPTGGT MSGLRY       GLDGTTIAYDFTDVAAYRNRGLILLKLIRSLMAVSKGKY+KFQARKWSTFKLSKIIMAFLLALG
Subjt:  EAGRVIKDDTSVGKLSSPTGGTLMSGLRYSSNIDLGGLDGTTIAYDFTDVAAYRNRGLILLKLIRSLMAVSKGKYVKFQARKWSTFKLSKIIMAFLLALG

Query:  VSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSG
        VSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKP+M KSTVVDNKTGKSIDSRVRTSSG
Subjt:  VSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSG

Query:  MFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSEC
        MFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSEC
Subjt:  MFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSEC

Query:  GKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        GKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
Subjt:  GKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

XP_022937930.1 probable prolyl 4-hydroxylase 3 [Cucurbita moschata] (e value: 8.8e-164; % identity: 98.95)
Query:  MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL
        MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL
Subjt:  MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL
        YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL
Subjt:  YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPY
        MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMH   Y
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPY

XP_022937931.1 probable prolyl 4-hydroxylase 3 [Cucurbita moschata] (e value: 2.1e-165; % identity: 100)
Query:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
        MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
Subjt:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
        YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
Subjt:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

XP_022965872.1 probable prolyl 4-hydroxylase 3 [Cucurbita maxima] (e value: 1.1e-161; % identity: 97.21)
Query:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
        MAVSKGKY+KFQARKWSTFKLSKIIM FLLALGVSM IAFRFFSPPESSHS LLHR+ASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
Subjt:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
        YLISLAKP+MAKSTVVDNKTGKSIDSRVRTSSGMFL+RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
Subjt:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSM+PDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

XP_023537750.1 probable prolyl 4-hydroxylase 3 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 4.4e-163; % identity: 97.91)
Query:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
        MAVSKGKY+KFQARKWSTFKLSKIIM FLLALGVSMLIAFRFFSPPESSHS LLHR+ASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
Subjt:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
        YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFL+RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
Subjt:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSM+PDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

TrEMBL top hits (e value, % identity, alignment)
A0A6J1FBR3 probable prolyl 4-hydroxylase 3 (e value: 1.0e-165; % identity: 100)
Query:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
        MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
Subjt:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
        YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
Subjt:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

A0A6J1FI70 probable prolyl 4-hydroxylase 3 (e value: 1.9e-156; % identity: 87.18)
Query:  IAYDFTDVAAYRNRGLILLKLIRSLMAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGD
        I    ++VAAYRNRGLILLKL+RSLMAV KGKY+KFQ RKWSTFKLSKI+MA LLALG+SM IAFRFFSP ESSHSNLLHR+ASVQHRAVHSDGLGKR D
Subjt:  IAYDFTDVAAYRNRGLILLKLIRSLMAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGD

Query:  QWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQK
        QWVEFISWEPRAFVYHNFLSKEECLYLISLA P+M KSTVVD KTGK  DSR RTSSGMFL RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQK
Subjt:  QWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQK

Query:  YDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNK
        YDAH+D+ ++EF I++GGQR+ATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFW+++P+NT+DPTSLHGACPVIRGNK
Subjt:  YDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNK

Query:  WSCTKWMRVNEY
        WSCTKWM VNEY
Subjt:  WSCTKWMRVNEY

A0A6J1FI75 probable prolyl 4-hydroxylase 3 (e value: 4.3e-164; % identity: 98.95)
Query:  MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL
        MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL
Subjt:  MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL
        YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL
Subjt:  YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPY
        MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMH   Y
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPY

A0A6J1HLH2 probable prolyl 4-hydroxylase 3 (e value: 5.2e-162; % identity: 97.21)
Query:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
        MAVSKGKY+KFQARKWSTFKLSKIIM FLLALGVSM IAFRFFSPPESSHS LLHR+ASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
Subjt:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
        YLISLAKP+MAKSTVVDNKTGKSIDSRVRTSSGMFL+RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL
Subjt:  YLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSM+PDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

A0A6J1HMV0 probable prolyl 4-hydroxylase 3 (e value: 4.1e-159; % identity: 95.82)
Query:  MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL
        MAV KGKYIKFQGRKWSTFKLSKIIMVF+LALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKR DQWVE ISWEPRAFVYHNFLSKEECL
Subjt:  MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL
        YLISLAKPYMEKSTVVD KTGK++DSRARTSSGMFL RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL
Subjt:  YLISLAKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPY
        MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSL PKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMH   Y
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPY

SwissProt top hits (e value, % identity, alignment)
F4JNU8 Probable prolyl 4-hydroxylase 8 (e value: 5.7e-97; % identity: 60.42)
Query:  KGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHS-----NLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        K K ++ + RK  +F      +  L+   + +L+    FS P ++ +     +L   V ++Q R    D     GD+W+E ISWEPRAFVYHNFL+ EEC
Subjt:  KGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHS-----NLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATL
         +LISLAKP M KS VVD KTGKSIDSRVRTSSG FL RG ++IV  IE RI+DFTFIP E+GE LQ+LHYEVGQ+Y+ HHDYF DEFN+++GGQR+AT+
Subjt:  LYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATL

Query:  LMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        LMYLSDV+EGGETVFPAA+GN S +P W+ELS+CGK GLSV PK  DALLFWSMKPD ++DP+SLHG CPVI+GNKWS TKW  V+EY
Subjt:  LMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

F4JZ24 Probable prolyl 4-hydroxylase 10 (e value: 2.0e-102; % identity: 65.57)
Query:  STFKLSKIIMAFLLALGVSMLIAFRFFS-PPESSHSNLLHRVASVQHRAVHSDGL-GKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKST
        ST   + +IM+  + L   +L+AF   S P  ++ S+  + + S+  + +   G    + ++WVE ISWEPRA VYHNFL+KEEC YLI LAKPHM KST
Subjt:  STFKLSKIIMAFLLALGVSMLIAFRFFS-PPESSHSNLLHRVASVQHRAVHSDGL-GKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKST

Query:  VVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVF
        VVD KTGKS DSRVRTSSG FL RG++K +  IEKRI+DFTFIPVEHGE LQ+LHYE+GQKY+ H+DYF DE+N + GGQR+AT+LMYLSDVEEGGETVF
Subjt:  VVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVF

Query:  PAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        PAA+GN+S++P WNELSECGKGGLSVKPKMGDALLFWSM PD T+DP+SLHG C VI+GNKWS TKW+RV+EY
Subjt:  PAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

Q24JN5 Prolyl 4-hydroxylase 5 (e value: 1.5e-94; % identity: 57.79)
Query:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESS-HSNLLHRVASVQHRAVHSDGLGK-RGDQWVEFISWEPRAFVYHNFLSKEE
        MA    +++++Q RK  +       +  LL + + +L+     S P ++ +S+  + + ++  ++  S G  +  G++WVE ISWEPRA VYHNFL+ EE
Subjt:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESS-HSNLLHRVASVQHRAVHSDGLGK-RGDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMAT
        C +LISLAKP M KSTVVD KTG S DSRVRTSSG FL+RG +++V  IEKRI+DFTFIPVE+GE LQ+LHY+VGQKY+ H+DYF DEFN K GGQR+AT
Subjt:  CLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMAT

Query:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        +LMYLSDV++GGETVFPAA GN S++P WNELS+CGK GLSV PK  DALLFW+M+PD ++DP+SLHG CPV++GNKWS TKW  V+E+
Subjt:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

Q8L970 Probable prolyl 4-hydroxylase 7 (e value: 2.8e-64; % identity: 54.59)
Query:  ISWEPRAFVYHNFLSKEECLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHH
        +SW PR F+Y  FLS EEC + I LAK  + KS V DN +G+S++S VRTSSGMFL + Q+ IVSN+E ++A +TF+P E+GE++QILHYE GQKY+ H 
Subjt:  ISWEPRAFVYHNFLSKEECLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHH

Query:  DYFADEFNIKQGGQRMATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTK
        DYF D+ N++ GG R+AT+LMYLS+VE+GGETVFP  +G  + L   +  +EC K G +VKP+ GDALLF+++ P+ T D  SLHG+CPV+ G KWS T+
Subjt:  DYFADEFNIKQGGQRMATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTK

Query:  WMRVNEY
        W+ V  +
Subjt:  WMRVNEY

Q9LN20 Probable prolyl 4-hydroxylase 3 (e value: 3.9e-114; % identity: 70.24)
Query:  VSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPP----ESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEE
        ++K ++ +FQARKWST  L  + M F+L + + ML+AF  FS P    ESS  +L +   +   R   S+GLGKRGDQW E +SWEPRAFVYHNFLSKEE
Subjt:  VSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPP----ESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMAT
        C YLISLAKPHM KSTVVD++TGKS DSRVRTSSG FL+RG++KI+  IEKRIAD+TFIP +HGE LQ+LHYE GQKY+ H+DYF DEFN K GGQRMAT
Subjt:  CLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMAT

Query:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        +LMYLSDVEEGGETVFPAA  NFSS+P +NELSECGK GLSVKP+MGDALLFWSM+PD T+DPTSLHG CPVIRGNKWS TKWM V EY
Subjt:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

Arabidopsis top hits (e value, % identity, alignment)
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 2.8e-115; % identity: 70.24)
Query:  VSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPP----ESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEE
        ++K ++ +FQARKWST  L  + M F+L + + ML+AF  FS P    ESS  +L +   +   R   S+GLGKRGDQW E +SWEPRAFVYHNFLSKEE
Subjt:  VSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPP----ESSHSNLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMAT
        C YLISLAKPHM KSTVVD++TGKS DSRVRTSSG FL+RG++KI+  IEKRIAD+TFIP +HGE LQ+LHYE GQKY+ H+DYF DEFN K GGQRMAT
Subjt:  CLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMAT

Query:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        +LMYLSDVEEGGETVFPAA  NFSS+P +NELSECGK GLSVKP+MGDALLFWSM+PD T+DPTSLHG CPVIRGNKWS TKWM V EY
Subjt:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.1e-95; % identity: 57.79)
Query:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESS-HSNLLHRVASVQHRAVHSDGLGK-RGDQWVEFISWEPRAFVYHNFLSKEE
        MA    +++++Q RK  +       +  LL + + +L+     S P ++ +S+  + + ++  ++  S G  +  G++WVE ISWEPRA VYHNFL+ EE
Subjt:  MAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESS-HSNLLHRVASVQHRAVHSDGLGK-RGDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMAT
        C +LISLAKP M KSTVVD KTG S DSRVRTSSG FL+RG +++V  IEKRI+DFTFIPVE+GE LQ+LHY+VGQKY+ H+DYF DEFN K GGQR+AT
Subjt:  CLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMAT

Query:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        +LMYLSDV++GGETVFPAA GN S++P WNELS+CGK GLSV PK  DALLFW+M+PD ++DP+SLHG CPV++GNKWS TKW  V+E+
Subjt:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 4.0e-98; % identity: 60.42)
Query:  KGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHS-----NLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC
        K K ++ + RK  +F      +  L+   + +L+    FS P ++ +     +L   V ++Q R    D     GD+W+E ISWEPRAFVYHNFL+ EEC
Subjt:  KGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHS-----NLLHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEEC

Query:  LYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATL
         +LISLAKP M KS VVD KTGKSIDSRVRTSSG FL RG ++IV  IE RI+DFTFIP E+GE LQ+LHYEVGQ+Y+ HHDYF DEFN+++GGQR+AT+
Subjt:  LYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATL

Query:  LMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        LMYLSDV+EGGETVFPAA+GN S +P W+ELS+CGK GLSV PK  DALLFWSMKPD ++DP+SLHG CPVI+GNKWS TKW  V+EY
Subjt:  LMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.4e-103; % identity: 65.57)
Query:  STFKLSKIIMAFLLALGVSMLIAFRFFS-PPESSHSNLLHRVASVQHRAVHSDGL-GKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKST
        ST   + +IM+  + L   +L+AF   S P  ++ S+  + + S+  + +   G    + ++WVE ISWEPRA VYHNFL+KEEC YLI LAKPHM KST
Subjt:  STFKLSKIIMAFLLALGVSMLIAFRFFS-PPESSHSNLLHRVASVQHRAVHSDGL-GKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKST

Query:  VVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVF
        VVD KTGKS DSRVRTSSG FL RG++K +  IEKRI+DFTFIPVEHGE LQ+LHYE+GQKY+ H+DYF DE+N + GGQR+AT+LMYLSDVEEGGETVF
Subjt:  VVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVF

Query:  PAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY
        PAA+GN+S++P WNELSECGKGGLSVKPKMGDALLFWSM PD T+DP+SLHG C VI+GNKWS TKW+RV+EY
Subjt:  PAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTSLHGACPVIRGNKWSCTKWMRVNEY

AT5G66060.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.2e-73; % identity: 61.88)
Query:  STFKLSKIIMAFLLALGVSMLIAFRFFS-PPESSHSNLLHRVASVQHRAVHSDGL-GKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKST
        ST   + +IM+  + L   +L+AF   S P  ++ S+  + + S+  + +   G    + ++WVE ISWEPRA VYHNFL  EEC YLI LAKPHM KST
Subjt:  STFKLSKIIMAFLLALGVSMLIAFRFFS-PPESSHSNLLHRVASVQHRAVHSDGL-GKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKST

Query:  VVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVF
        VVD KTGKS DSRVRTSSG FL RG++K +  IEKRI+DFTFIPVEHGE LQ+LHYE+GQKY+ H+DYF DE+N + GGQR+AT+LMYLSDVEEGGETVF
Subjt:  VVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVF

Query:  PAAEGNFSSLPGWNELSECGKGG
        PAA+GN+S++P WNELSECGKGG
Subjt:  PAAEGNFSSLPGWNELSECGKGG
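
Each alignment block in this section has a Query line, a match line, and a Subjt line. Under the usual BLAST-style convention (assumed here, not stated by the page), the match line repeats the residue at identical positions, shows "+" for a conservative substitution, and is blank otherwise. A minimal sketch of counting identities from a match line follows; `midline_identity` is a hypothetical helper, and the % identity reported by the database may be computed over a different alignment length.

```python
def midline_identity(midline: str, aligned_length: int) -> float:
    """Percent of aligned columns whose match-line character is a letter.

    The aligned length is passed explicitly because trailing blanks on a
    match line are easily lost when the page is copied as text.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length

# First 40 columns of the match line from the KAG6586330.1 alignment.
excerpt = "MAVSKGKYIKFQGRKWSTFKLSKIIMVF+LALGVSMFIAF"
print(midline_identity(excerpt, len(excerpt)))  # 97.5
```

The excerpt has 39 identical columns and one "+" column out of 40, which gives the 97.5 shown for this fragment only.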


Sequences
CDS sequence
ATGGCGGTGTCGAAAGGGAAATACATCAAGTTTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCAAGATAATCATGGTCTTCGTTTTGGCACTTGGGGTTTCC
ATGTTCATCGCCTTCCGATTCTTCTCTCCTCCTGAAAGTTCTCATAGCGAGCTACTCCACCGTCTCGCTTCTGTCCAGCATAGTGCCGTTCATAGTGATGGGTTG
GGGAAGAGAGAGGATCAGTGGGTTGAGATCATTTCATGGGAGCCTAGGGCTTTCGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTA
GCAAAACCTTACATGGAAAAATCAACTGTGGTGGATATTAAAACTGGGAAGAATATAGATAGCAGGGCGCGCACCAGTTCCGGGATGTTTCTGAGAAGAGGGCAG
AACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGAGCACGGAGAAGCACTTCAAATTCTGCACTATGAAGTCGGGCAGAAG
TATGATGCTCACTATGATTTCTTTGCCGATGAGTTCAACATCAAACATGGCGGCCAAAGAATAGCCACTCTTCTCATGTATCTGTCAGACGTCGAAGAAGGGGGT
GAGACAGTGTTCCCAGCAGCCGAAGGGAACTTCAGCTCTTTGCCTGGGTGGAATGAACGGTCTGAATGTGGTAAAGGTGGACTATCTCTAAATCCAAAGATGGGA
GATGCATTATTGTTCTGGAGCATGAGGCCTGATAATACCTTAGATCCTACAAGTATGCATGGTTCTTGCCCTGTCATAAGAGGGAACAAATGGTCATGTACAAAG
TGGATGCATACCCTTCCATATCAGGTTATTGTTGGTACTTCTGTAATCGACCCGACAAAGGAAGCTGGTAGAGTTATTAAAGACGATACATCTGTTGGCAAGTTG
TCCTCACCCACGGGTGGGACATTGATGTCAGGTCTACGCTATAGTTCGAACATTGACTTGGGAGGTCTGGATGGCACGACCATTGCTTATGATTTTACAGATGTA
GCAGCTTATCGGAATCGGGGTTTGATTCTTCTGAAGCTTATTCGTTCTTTAATGGCGGTGTCGAAAGGGAAATACGTCAAGTTTCAGGCCCGGAAATGGTCCACA
TTCAAGCTTTCCAAGATAATCATGGCCTTCCTTTTGGCACTTGGGGTTTCCATGCTCATCGCTTTCCGATTCTTCTCTCCTCCTGAAAGTTCTCATAGCAATCTA
CTACACCGTGTCGCTTCCGTCCAGCATAGAGCCGTTCATAGTGATGGGTTGGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCGTGGGAGCCTAGGGCTTTT
GTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAAAACCTCACATGGCTAAATCAACTGTGGTGGATAATAAAACTGGGAAGAGT
ATAGATAGCAGGGTGCGCACCAGTTCAGGGATGTTTCTGAAAAGAGGGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCC
GTAGAGCATGGAGAAGCACTTCAAATTCTGCACTATGAAGTCGGGCAGAAGTATGATGCTCACCATGATTACTTTGCTGATGAGTTCAACATCAAACAAGGAGGC
CAAAGAATGGCCACCCTTCTCATGTATCTGTCAGACGTCGAAGAAGGGGGCGAGACAGTGTTCCCAGCAGCCGAAGGCAACTTCAGCTCGTTGCCTGGGTGGAAT
GAACTGTCTGAATGTGGTAAAGGTGGACTCTCTGTAAAACCAAAGATGGGTGATGCATTATTGTTCTGGAGCATGAAGCCTGATAATACCGTAGATCCTACAAGT
TTGCATGGTGCTTGCCCTGTCATAAGAGGGAACAAATGGTCATGTACAAAGTGGATGCGTGTTAATGAATACTAA
mRNA sequence
ATGGCGGTGTCGAAAGGGAAATACATCAAGTTTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCAAGATAATCATGGTCTTCGTTTTGGCACTTGGGGTTTCC
ATGTTCATCGCCTTCCGATTCTTCTCTCCTCCTGAAAGTTCTCATAGCGAGCTACTCCACCGTCTCGCTTCTGTCCAGCATAGTGCCGTTCATAGTGATGGGTTG
GGGAAGAGAGAGGATCAGTGGGTTGAGATCATTTCATGGGAGCCTAGGGCTTTCGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTA
GCAAAACCTTACATGGAAAAATCAACTGTGGTGGATATTAAAACTGGGAAGAATATAGATAGCAGGGCGCGCACCAGTTCCGGGATGTTTCTGAGAAGAGGGCAG
AACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGAGCACGGAGAAGCACTTCAAATTCTGCACTATGAAGTCGGGCAGAAG
TATGATGCTCACTATGATTTCTTTGCCGATGAGTTCAACATCAAACATGGCGGCCAAAGAATAGCCACTCTTCTCATGTATCTGTCAGACGTCGAAGAAGGGGGT
GAGACAGTGTTCCCAGCAGCCGAAGGGAACTTCAGCTCTTTGCCTGGGTGGAATGAACGGTCTGAATGTGGTAAAGGTGGACTATCTCTAAATCCAAAGATGGGA
GATGCATTATTGTTCTGGAGCATGAGGCCTGATAATACCTTAGATCCTACAAGTATGCATGGTTCTTGCCCTGTCATAAGAGGGAACAAATGGTCATGTACAAAG
TGGATGCATACCCTTCCATATCAGGTTATTGTTGGTACTTCTGTAATCGACCCGACAAAGGAAGCTGGTAGAGTTATTAAAGACGATACATCTGTTGGCAAGTTG
TCCTCACCCACGGGTGGGACATTGATGTCAGGTCTACGCTATAGTTCGAACATTGACTTGGGAGGTCTGGATGGCACGACCATTGCTTATGATTTTACAGATGTA
GCAGCTTATCGGAATCGGGGTTTGATTCTTCTGAAGCTTATTCGTTCTTTAATGGCGGTGTCGAAAGGGAAATACGTCAAGTTTCAGGCCCGGAAATGGTCCACA
TTCAAGCTTTCCAAGATAATCATGGCCTTCCTTTTGGCACTTGGGGTTTCCATGCTCATCGCTTTCCGATTCTTCTCTCCTCCTGAAAGTTCTCATAGCAATCTA
CTACACCGTGTCGCTTCCGTCCAGCATAGAGCCGTTCATAGTGATGGGTTGGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCGTGGGAGCCTAGGGCTTTT
GTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAAAACCTCACATGGCTAAATCAACTGTGGTGGATAATAAAACTGGGAAGAGT
ATAGATAGCAGGGTGCGCACCAGTTCAGGGATGTTTCTGAAAAGAGGGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCC
GTAGAGCATGGAGAAGCACTTCAAATTCTGCACTATGAAGTCGGGCAGAAGTATGATGCTCACCATGATTACTTTGCTGATGAGTTCAACATCAAACAAGGAGGC
CAAAGAATGGCCACCCTTCTCATGTATCTGTCAGACGTCGAAGAAGGGGGCGAGACAGTGTTCCCAGCAGCCGAAGGCAACTTCAGCTCGTTGCCTGGGTGGAAT
GAACTGTCTGAATGTGGTAAAGGTGGACTCTCTGTAAAACCAAAGATGGGTGATGCATTATTGTTCTGGAGCATGAAGCCTGATAATACCGTAGATCCTACAAGT
TTGCATGGTGCTTGCCCTGTCATAAGAGGGAACAAATGGTCATGTACAAAGTGGATGCGTGTTAATGAATACTAATTCCAAGGGTGGAATGAAGGTATACATTTG
TTAGAGGAGAAGAGAAGAAGACTTGATTCTTATGTTTCAATATTTTTTGTTCTTAGTTCATTACTCACCTTTCTTAAAAATTACATTTTTTTTTAGGAATTACAT
ATTTAGTTGTATAGTTACTTATATACTATACTCAAACAGAAAAGTTGAGTTTTAGTGTACTTCTTGTTCCAAGCCTCGCTTGATTTTGTTTCGATTTGGTCGGTT
GCCAATTTGAGCACAACGTGTCTAATACATTTGTTTGAACTTTCGAGCCAA
Protein sequence
MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASVQHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECLYLISL
AKPYMEKSTVVDIKTGKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFFADEFNIKHGGQRIATLLMYLSDVEEGG
ETVFPAAEGNFSSLPGWNERSECGKGGLSLNPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMHTLPYQVIVGTSVIDPTKEAGRVIKDDTSVGKL
SSPTGGTLMSGLRYSSNIDLGGLDGTTIAYDFTDVAAYRNRGLILLKLIRSLMAVSKGKYVKFQARKWSTFKLSKIIMAFLLALGVSMLIAFRFFSPPESSHSNL
LHRVASVQHRAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMAKSTVVDNKTGKSIDSRVRTSSGMFLKRGQNKIVSNIEKRIADFTFIP
VEHGEALQILHYEVGQKYDAHHDYFADEFNIKQGGQRMATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWSMKPDNTVDPTS
LHGACPVIRGNKWSCTKWMRVNEY
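
The protein sequence listed here corresponds to a translation of the CDS sequence. As a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base, the translation could be reproduced with the standard library alone. The names `CODON_TABLE` and `translate` are illustrative, not part of any database API.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) up to the first stop codon.

    Raises KeyError on codons with ambiguous bases such as N.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE[cds[pos:pos + 3]]
        if amino == "*":  # stop codon ends the protein
            break
        protein.append(amino)
    return "".join(protein)

# First 30 bases of the CDS listed in this record.
print(translate("ATGGCGGTGTCGAAAGGGAAATACATCAAG"))  # MAVSKGKYIK
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the listed protein is a quick consistency check on a record like this one.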