; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G13263 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G13263
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationctg1838:3420973..3424409
RNA-Seq ExpressionCucsat.G13263
SyntenyCucsat.G13263
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053723.1 putative prolyl 4-hydroxylase 7 isoform X2 [Cucumis melo var. makuwa]6.43e-19085.39Show/hide
Query:  ESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPL
        ESVIRMKTGGSA+TIDPTRVIQLSSKPRAFLYKGFLS EECQHLI+ AKGKL QSLVAAGTG+SVTSKERTSTGMFL KAQD+IVARIESRIAAWTFLPL
Subjt:  ESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPL

Query:  -----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLS
                                     DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPVKLSEEEK DLS
Subjt:  -----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLS

Query:  ECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRF
        EC KVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPIDE WRNPACVDENDHC+AWAKAGEC+KNPVYMMGSKNELGFCR 
Subjt:  ECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRF

Query:  SCKVCSPS
        SCKVCSPS
Subjt:  SCKVCSPS

TYK17735.1 putative prolyl 4-hydroxylase 7 isoform X2 [Cucumis melo var. makuwa]1.80e-19694.27Show/hide
Query:  ESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPL
        ESVIRMKTGGSA+TIDPTRVIQLSSKPRAFLYKGFLS EECQHLI+ AKGKL QSLVAAGTG+SVTSKERTSTGMFL KAQD+IVARIESRIAAWTFLPL
Subjt:  ESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPL

Query:  DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPD
        DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPVKLSEEEK DLSEC KVGYGVRPKLGDALLFFSMNPNVTPD
Subjt:  DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPD

Query:  TTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVCSPS
         TSYHGSCPVIEGEKWSATKWIHMLPIDE WRNPACVDENDHC+AWAKAGEC+KNPVYMMGSKNELGFCR SCKVCSPS
Subjt:  TTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVCSPS

XP_004147455.1 probable prolyl 4-hydroxylase 7 [Cucumis sativus]9.06e-239100Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
        MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG

Query:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV
        TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV
Subjt:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV

Query:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM
        KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM
Subjt:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM

Query:  GSKNELGFCRFSCKVCSPS
        GSKNELGFCRFSCKVCSPS
Subjt:  GSKNELGFCRFSCKVCSPS

XP_008443446.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]1.58e-21691.85Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
        MASPF L FSIFFL+L  LP SSLSANRFPK++LHNND+ ESVIRMKTGGSA+TIDPTRVIQLSSKPRAFLYKGFLS EECQHLI+ AKGKL QSLVAAG
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG

Query:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV
        TG+SVTSKERTSTGMFL KAQD+IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPV
Subjt:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV

Query:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM
        KLSEEEK DLSEC KVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPIDE WRNPACVDENDHC+AWAKAGEC+KNPVYMM
Subjt:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM

Query:  GSKNELGFCRFSCKVCSPS
        GSKNELGFCR SCKVCSPS
Subjt:  GSKNELGFCRFSCKVCSPS

XP_038905408.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]3.05e-19884.91Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
        MAS FFL FS+ FL  F  PF S SANR PKL+LHNN++D+SVIRMKT GS +TIDPTRVI+LSSKPRAFLYKGFLS +ECQHLIN AKGKL QSLVAA 
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG

Query:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV
        TG+SVTS+ERTSTGMFL +AQDEIVARIESRIAAWTFLP+DNGEPIQILRYENGQKYEPHFDFFQDP NIAIGGHRIATILMYLS+VEKGGETVFPNSP+
Subjt:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV

Query:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM
        KLSE+E+ADLS+C KVGYGV+PK+GDALLFFS+NPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPI E WRNPACVDEN  C AWA AGECEKNPVYMM
Subjt:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM

Query:  GSKNELGFCRFSCKVCSP
        GSKNELG CR SCKVCSP
Subjt:  GSKNELGFCRFSCKVCSP

TrEMBL top hitse value%identityAlignment
A0A0A0LG32 Procollagen-proline 4-dioxygenase4.39e-239100Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
        MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG

Query:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV
        TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV
Subjt:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV

Query:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM
        KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM
Subjt:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM

Query:  GSKNELGFCRFSCKVCSPS
        GSKNELGFCRFSCKVCSPS
Subjt:  GSKNELGFCRFSCKVCSPS

A0A1S3B814 Procollagen-proline 4-dioxygenase7.63e-21791.85Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
        MASPF L FSIFFL+L  LP SSLSANRFPK++LHNND+ ESVIRMKTGGSA+TIDPTRVIQLSSKPRAFLYKGFLS EECQHLI+ AKGKL QSLVAAG
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG

Query:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV
        TG+SVTSKERTSTGMFL KAQD+IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPV
Subjt:  TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV

Query:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM
        KLSEEEK DLSEC KVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPIDE WRNPACVDENDHC+AWAKAGEC+KNPVYMM
Subjt:  KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMM

Query:  GSKNELGFCRFSCKVCSPS
        GSKNELGFCR SCKVCSPS
Subjt:  GSKNELGFCRFSCKVCSPS

A0A5A7UCT9 Procollagen-proline 4-dioxygenase3.11e-19085.39Show/hide
Query:  ESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPL
        ESVIRMKTGGSA+TIDPTRVIQLSSKPRAFLYKGFLS EECQHLI+ AKGKL QSLVAAGTG+SVTSKERTSTGMFL KAQD+IVARIESRIAAWTFLPL
Subjt:  ESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPL

Query:  -----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLS
                                     DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPVKLSEEEK DLS
Subjt:  -----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLS

Query:  ECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRF
        EC KVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPIDE WRNPACVDENDHC+AWAKAGEC+KNPVYMMGSKNELGFCR 
Subjt:  ECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRF

Query:  SCKVCSPS
        SCKVCSPS
Subjt:  SCKVCSPS

A0A5D3D1X2 Procollagen-proline 4-dioxygenase8.73e-19794.27Show/hide
Query:  ESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPL
        ESVIRMKTGGSA+TIDPTRVIQLSSKPRAFLYKGFLS EECQHLI+ AKGKL QSLVAAGTG+SVTSKERTSTGMFL KAQD+IVARIESRIAAWTFLPL
Subjt:  ESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPL

Query:  DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPD
        DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPVKLSEEEK DLSEC KVGYGVRPKLGDALLFFSMNPNVTPD
Subjt:  DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPD

Query:  TTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVCSPS
         TSYHGSCPVIEGEKWSATKWIHMLPIDE WRNPACVDENDHC+AWAKAGEC+KNPVYMMGSKNELGFCR SCKVCSPS
Subjt:  TTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVCSPS

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase8.80e-16972.76Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
        M S FFL FS+ FL  F  P  + SANR PKL+L +   ++SVIRMK  GS++ IDPTRV+QLSS+PRAFLYKGFLSAEECQHLI+ AK  L QSLV   
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG

Query:  -TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSP
         TG S +S +RTSTGMFL+KAQD+IVA IE++IAAWTFLP+DNGEPIQILRYENGQ+Y PHFDFFQDP N+A GGHRIAT+LMYLSNVE+GGETVFP+SP
Subjt:  -TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSP

Query:  VKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYM
         K+ EEE  DL +C   GYGV+PK GDALLFFS++PNVT D TSYHGSCPVIEGEKWSATKWIHMLP+DE WRNP CVDEN+HC+AWAKAGECEKNP YM
Subjt:  VKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYM

Query:  ----MGSKNELGFCRFSCKVCSP
            +GSK ELG+CR SCK CSP
Subjt:  ----MGSKNELGFCRFSCKVCSP

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.1e-9354.4Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAA-
        M S +FL FS+  L +    FS +S+  F                        ++DPTR+ QLS  PRAFLYKGFLS EEC HLI  AKGKL +S+V A 
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAA-

Query:  -GTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNS
          +G+S  S+ RTS+GMFL K QD+IVA +E+++AAWTFLP +NGE +QIL YENGQKY+PHFD+F D   + +GGHRIAT+LMYLSNV KGGETVFPN 
Subjt:  -GTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNS

Query:  PVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVY
          K  + +    S+C K GY V+P+ GDALLFF+++ N T D  S HGSCPVIEGEKWSAT+WIH+    +  +   CVD+++ C  WA AGECEKNP+Y
Subjt:  PVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVY

Query:  MMGSKNELGFCRFSCKVC
        M+GS+  LGFCR SCK C
Subjt:  MMGSKNELGFCRFSCKVC

F4JAU3 Prolyl 4-hydroxylase 23.3e-8255.06Show/hide
Query:  IDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG-TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYEN
        I+P++V Q+SSKPRAF+Y+GFL+  EC HLI+ AK  L +S VA    G+S  S  RTS+G F+ K +D IV+ IE +++ WTFLP +NGE +Q+LRYE+
Subjt:  IDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG-TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYEN

Query:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV---KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPV
        GQKY+ HFD+F D  NIA GGHRIAT+L+YLSNV KGGETVFP++     +   E K DLS+C K G  V+PK G+ALLFF++  +  PD  S HG CPV
Subjt:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV---KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPV

Query:  IEGEKWSATKWIHMLPIDEFWRNPA-CVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVC
        IEGEKWSATKWIH+   D+   +   C D N+ C  WA  GEC KNP YM+G+    G CR SCK C
Subjt:  IEGEKWSATKWIHMLPIDEFWRNPA-CVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVC

Q8L970 Probable prolyl 4-hydroxylase 74.7e-10558.93Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
        M S  FL FS+   FLF LP  S + NRF  L   +N  D SVI+MKT  S+   DPTRV QLS  PR FLY+GFLS EEC H I  AKGKL +S+VA  
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG

Query:  -TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSP
         +G+SV S+ RTS+GMFL K QD+IV+ +E+++AAWTFLP +NGE +QIL YENGQKYEPHFD+F D  N+ +GGHRIAT+LMYLSNVEKGGETVFP   
Subjt:  -TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSP

Query:  VKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDE-FWRNPACVDENDHCTAWAKAGECEKNPVY
         K ++ +    +EC K GY V+P+ GDALLFF+++PN T D+ S HGSCPV+EGEKWSAT+WIH+   +  F +   C+DEN  C  WAKAGEC+KNP Y
Subjt:  VKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDE-FWRNPACVDENDHCTAWAKAGECEKNPVY

Query:  MMGSKNELGFCRFSCKVCS
        M+GS  + G+CR SCK CS
Subjt:  MMGSKNELGFCRFSCKVCS

Q8LAN3 Probable prolyl 4-hydroxylase 43.2e-8554.74Show/hide
Query:  SAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG-TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQIL
        S++ ++P++V Q+SSKPRAF+Y+GFL+  EC H+++ AK  L +S VA   +G+S  S+ RTS+G F+ K +D IV+ IE +I+ WTFLP +NGE IQ+L
Subjt:  SAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG-TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQIL

Query:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV---KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHG
        RYE+GQKY+ HFD+F D  NI  GGHR+ATILMYLSNV KGGETVFP++ +   ++  E K DLS+C K G  V+P+ GDALLFF+++P+  PD  S HG
Subjt:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV---KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHG

Query:  SCPVIEGEKWSATKWIHMLPIDEFWR----NPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVC
         CPVIEGEKWSATKWIH   +D F R    +  C D N+ C  WA  GEC KNP YM+G+    G+CR SCK C
Subjt:  SCPVIEGEKWSATKWIHMLPIDEFWR----NPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVC

Q9LN20 Probable prolyl 4-hydroxylase 31.8e-5947.71Show/hide
Query:  FSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRM----KTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQS-LVAAGTGQ
        F +F L + LL   +      P     ++ ID S  R     ++ G     D    + LS +PRAF+Y  FLS EEC++LI+ AK  + +S +V + TG+
Subjt:  FSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRM----KTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQS-LVAAGTGQ

Query:  SVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLS
        S  S+ RTS+G FL + +D+I+  IE RIA +TF+P D+GE +Q+L YE GQKYEPH+D+F D  N   GG R+AT+LMYLS+VE+GGETVFP + +  S
Subjt:  SVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLS

Query:  EEE-KADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHM
              +LSECGK G  V+P++GDALLF+SM P+ T D TS HG CPVI G KWS+TKW+H+
Subjt:  EEE-KADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHM

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 22.3e-8355.06Show/hide
Query:  IDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG-TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYEN
        I+P++V Q+SSKPRAF+Y+GFL+  EC HLI+ AK  L +S VA    G+S  S  RTS+G F+ K +D IV+ IE +++ WTFLP +NGE +Q+LRYE+
Subjt:  IDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG-TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYEN

Query:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV---KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPV
        GQKY+ HFD+F D  NIA GGHRIAT+L+YLSNV KGGETVFP++     +   E K DLS+C K G  V+PK G+ALLFF++  +  PD  S HG CPV
Subjt:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV---KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPV

Query:  IEGEKWSATKWIHMLPIDEFWRNPA-CVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVC
        IEGEKWSATKWIH+   D+   +   C D N+ C  WA  GEC KNP YM+G+    G CR SCK C
Subjt:  IEGEKWSATKWIHMLPIDEFWRNPA-CVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase3.3e-10658.93Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
        M S  FL FS+   FLF LP  S + NRF  L   +N  D SVI+MKT  S+   DPTRV QLS  PR FLY+GFLS EEC H I  AKGKL +S+VA  
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG

Query:  -TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSP
         +G+SV S+ RTS+GMFL K QD+IV+ +E+++AAWTFLP +NGE +QIL YENGQKYEPHFD+F D  N+ +GGHRIAT+LMYLSNVEKGGETVFP   
Subjt:  -TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSP

Query:  VKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDE-FWRNPACVDENDHCTAWAKAGECEKNPVY
         K ++ +    +EC K GY V+P+ GDALLFF+++PN T D+ S HGSCPV+EGEKWSAT+WIH+   +  F +   C+DEN  C  WAKAGEC+KNP Y
Subjt:  VKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDE-FWRNPACVDENDHCTAWAKAGECEKNPVY

Query:  MMGSKNELGFCRFSCKVCS
        M+GS  + G+CR SCK CS
Subjt:  MMGSKNELGFCRFSCKVCS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase5.2e-9955.35Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG
        M S  FL FS+   FLF LP  S + NRF  L   +N  D SVI+MKT  S+   DPTRV QLS  PR FLY+GFLS EEC H I  AKGKL +S+VA  
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG

Query:  -TGQSVTSKERTS----TGMFLHKAQ----DEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGG
         +G+SV S++  S    +  F+        D+IV+ +E+++AAWTFLP +NGE +QIL YENGQKYEPHFD+F D  N+ +GGHRIAT+LMYLSNVEKGG
Subjt:  -TGQSVTSKERTS----TGMFLHKAQ----DEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGG

Query:  ETVFPNSPVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDE-FWRNPACVDENDHCTAWAKAG
        ETVFP    K ++ +    +EC K GY V+P+ GDALLFF+++PN T D+ S HGSCPV+EGEKWSAT+WIH+   +  F +   C+DEN  C  WAKAG
Subjt:  ETVFPNSPVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDE-FWRNPACVDENDHCTAWAKAG

Query:  ECEKNPVYMMGSKNELGFCRFSCKVCS
        EC+KNP YM+GS  + G+CR SCK CS
Subjt:  ECEKNPVYMMGSKNELGFCRFSCKVCS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase7.7e-9554.4Show/hide
Query:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAA-
        M S +FL FS+  L +    FS +S+  F                        ++DPTR+ QLS  PRAFLYKGFLS EEC HLI  AKGKL +S+V A 
Subjt:  MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAA-

Query:  -GTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNS
          +G+S  S+ RTS+GMFL K QD+IVA +E+++AAWTFLP +NGE +QIL YENGQKY+PHFD+F D   + +GGHRIAT+LMYLSNV KGGETVFPN 
Subjt:  -GTGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNS

Query:  PVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVY
          K  + +    S+C K GY V+P+ GDALLFF+++ N T D  S HGSCPVIEGEKWSAT+WIH+    +  +   CVD+++ C  WA AGECEKNP+Y
Subjt:  PVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVY

Query:  MMGSKNELGFCRFSCKVC
        M+GS+  LGFCR SCK C
Subjt:  MMGSKNELGFCRFSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.3e-8654.74Show/hide
Query:  SAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG-TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQIL
        S++ ++P++V Q+SSKPRAF+Y+GFL+  EC H+++ AK  L +S VA   +G+S  S+ RTS+G F+ K +D IV+ IE +I+ WTFLP +NGE IQ+L
Subjt:  SAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG-TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQIL

Query:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV---KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHG
        RYE+GQKY+ HFD+F D  NI  GGHR+ATILMYLSNV KGGETVFP++ +   ++  E K DLS+C K G  V+P+ GDALLFF+++P+  PD  S HG
Subjt:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPV---KLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHG

Query:  SCPVIEGEKWSATKWIHMLPIDEFWR----NPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVC
         CPVIEGEKWSATKWIH   +D F R    +  C D N+ C  WA  GEC KNP YM+G+    G+CR SCK C
Subjt:  SCPVIEGEKWSATKWIHMLPIDEFWR----NPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCCATTTTTTCTCCCATTTTCTATCTTTTTCCTTTTCCTTTTCCTTTTACCCTTTTCTTCTCTCTCCGCCAATCGCTTCCCCAAATTGATCTTACACAACAA
CGACATAGATGAGTCTGTTATTAGGATGAAAACGGGTGGTTCCGCCATGACAATCGATCCCACTCGTGTCATTCAGCTTTCATCCAAACCCAGGGCCTTCTTATATAAGG
GATTTTTGTCTGCTGAGGAGTGCCAACATCTTATCAATTCGGCGAAGGGTAAGCTACATCAATCATTGGTGGCGGCTGGAACAGGTCAGAGTGTTACAAGTAAAGAACGA
ACGAGTACTGGCATGTTTCTTCACAAGGCCCAGGATGAAATAGTTGCTCGCATCGAGTCAAGGATTGCTGCATGGACTTTCCTTCCCCTTGATAATGGGGAGCCTATTCA
AATACTAAGGTATGAGAACGGACAGAAATACGAGCCACATTTTGATTTTTTTCAAGACCCAGGCAATATAGCCATTGGAGGTCATCGGATAGCCACAATCTTGATGTATT
TATCCAATGTTGAAAAGGGTGGAGAAACAGTCTTTCCCAATTCTCCGGTTAAATTATCCGAGGAGGAAAAGGCTGACTTGTCTGAGTGCGGTAAGGTTGGCTATGGAGTA
AGACCAAAGTTAGGTGATGCTTTACTGTTCTTCAGTATGAATCCAAATGTGACGCCAGACACGACCAGCTATCACGGAAGCTGCCCAGTGATAGAGGGTGAGAAATGGTC
TGCAACTAAATGGATTCATATGCTTCCAATCGATGAATTTTGGAGGAATCCAGCTTGCGTAGACGAAAATGACCACTGTACTGCGTGGGCAAAAGCAGGTGAATGTGAAA
AGAATCCTGTTTATATGATGGGTTCTAAGAACGAACTTGGATTTTGTAGGTTTAGTTGCAAAGTATGCTCTCCCTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTCCATTTTTTCTCCCATTTTCTATCTTTTTCCTTTTCCTTTTCCTTTTACCCTTTTCTTCTCTCTCCGCCAATCGCTTCCCCAAATTGATCTTACACAACAA
CGACATAGATGAGTCTGTTATTAGGATGAAAACGGGTGGTTCCGCCATGACAATCGATCCCACTCGTGTCATTCAGCTTTCATCCAAACCCAGGGCCTTCTTATATAAGG
GATTTTTGTCTGCTGAGGAGTGCCAACATCTTATCAATTCGGCGAAGGGTAAGCTACATCAATCATTGGTGGCGGCTGGAACAGGTCAGAGTGTTACAAGTAAAGAACGA
ACGAGTACTGGCATGTTTCTTCACAAGGCCCAGGATGAAATAGTTGCTCGCATCGAGTCAAGGATTGCTGCATGGACTTTCCTTCCCCTTGATAATGGGGAGCCTATTCA
AATACTAAGGTATGAGAACGGACAGAAATACGAGCCACATTTTGATTTTTTTCAAGACCCAGGCAATATAGCCATTGGAGGTCATCGGATAGCCACAATCTTGATGTATT
TATCCAATGTTGAAAAGGGTGGAGAAACAGTCTTTCCCAATTCTCCGGTTAAATTATCCGAGGAGGAAAAGGCTGACTTGTCTGAGTGCGGTAAGGTTGGCTATGGAGTA
AGACCAAAGTTAGGTGATGCTTTACTGTTCTTCAGTATGAATCCAAATGTGACGCCAGACACGACCAGCTATCACGGAAGCTGCCCAGTGATAGAGGGTGAGAAATGGTC
TGCAACTAAATGGATTCATATGCTTCCAATCGATGAATTTTGGAGGAATCCAGCTTGCGTAGACGAAAATGACCACTGTACTGCGTGGGCAAAAGCAGGTGAATGTGAAA
AGAATCCTGTTTATATGATGGGTTCTAAGAACGAACTTGGATTTTGTAGGTTTAGTTGCAAAGTATGCTCTCCCTCGTAG
Protein sequenceShow/hide protein sequence
MASPFFLPFSIFFLFLFLLPFSSLSANRFPKLILHNNDIDESVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKER
TSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGV
RPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPACVDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVCSPS