; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0023880 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0023880
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr04:26269673..26272810
RNA-Seq ExpressionIVF0023880
SyntenyIVF0023880
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053723.1 putative prolyl 4-hydroxylase 7 isoform X2 [Cucumis melo var. makuwa]3.48e-20390.65Show/hide
Query:  YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP
        YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP
Subjt:  YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP

Query:  L-----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDL
        L                             DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDL
Subjt:  L-----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDL

Query:  SECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCR
        SECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCR
Subjt:  SECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCR

Query:  LSCKVCSPSS
        LSCKVCSPSS
Subjt:  LSCKVCSPSS

TYK17735.1 putative prolyl 4-hydroxylase 7 isoform X2 [Cucumis melo var. makuwa]9.61e-210100Show/hide
Query:  YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP
        YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP
Subjt:  YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP

Query:  LDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTP
        LDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTP
Subjt:  LDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTP

Query:  DATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVCSPSS
        DATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVCSPSS
Subjt:  DATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVCSPSS

XP_004147455.1 probable prolyl 4-hydroxylase 7 [Cucumis sativus]4.50e-21691.85Show/hide
Query:  MASPFLLAFSIFFLWL--LPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG
        MASPF L FSIFFL+L  LP SSLSANRFPK++LHNND+ ESVIRMKTGGSA+TIDPTRVIQLSSKPRAFLYKGFLS EECQHLI+ AKGKL QSLVAAG
Subjt:  MASPFLLAFSIFFLWL--LPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG

Query:  TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV
        TG+SVTSKERTSTGMFL KAQD+IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPV
Subjt:  TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV

Query:  KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMM
        KLSEEEK DLSEC KVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPIDE WRNPACVDENDHC+AWAKAGEC+KNPVYMM
Subjt:  KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMM

Query:  GSKNELGFCRLSCKVCSPS
        GSKNELGFCR SCKVCSPS
Subjt:  GSKNELGFCRLSCKVCSPS

XP_008443446.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]8.04e-237100Show/hide
Query:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTG
        MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTG
Subjt:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTG

Query:  ESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
        ESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
Subjt:  ESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL

Query:  SEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGS
        SEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGS
Subjt:  SEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGS

Query:  KNELGFCRLSCKVCSPSS
        KNELGFCRLSCKVCSPSS
Subjt:  KNELGFCRLSCKVCSPSS

XP_038905408.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]1.25e-19984.91Show/hide
Query:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTG
        MAS F LAFS+ FL   P  S SANR PK+LLHNN+M +SVIRMKT GS +TIDPTRVI+LSSKPRAFLYKGFLS +ECQHLI+LAKGKL+QSLVAA TG
Subjt:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTG

Query:  ESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
        ESVTS+ERTSTGMFL +AQD+IVARIESRIAAWTFLP+DNGEPIQILRYENGQKYEPHFDFFQDP NIAIGGHRIATILMYLSDVEKGGETVFPNSP+KL
Subjt:  ESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL

Query:  SEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGS
        SE+E+ DLS+CAKVGYGV+PK+GDALLFFS+NPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPI E+WRNPACVDEN  C AWA AGEC+KNPVYMMGS
Subjt:  SEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGS

Query:  KNELGFCRLSCKVCSPSS
        KNELG CR+SCKVCSP S
Subjt:  KNELGFCRLSCKVCSPSS

TrEMBL top hitse value%identityAlignment
A0A0A0LG32 Procollagen-proline 4-dioxygenase1.1e-17091.85Show/hide
Query:  MASPFLLAFSIF--FLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG
        MASPF L FSIF  FL+LLP SSLSANRFPK++LHNND+ ESVIRMKTGGSA+TIDPTRVIQLSSKPRAFLYKGFLS EECQHLI+ AKGKL QSLVAAG
Subjt:  MASPFLLAFSIF--FLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG

Query:  TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV
        TG+SVTSKERTSTGMFL KAQD+IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPV
Subjt:  TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV

Query:  KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMM
        KLSEEEK DLSEC KVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPIDE WRNPACVDENDHC+AWAKAGEC+KNPVYMM
Subjt:  KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMM

Query:  GSKNELGFCRLSCKVCSPS
        GSKNELGFCR SCKVCSPS
Subjt:  GSKNELGFCRLSCKVCSPS

A0A1S3B814 Procollagen-proline 4-dioxygenase2.5e-186100Show/hide
Query:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTG
        MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTG
Subjt:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTG

Query:  ESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
        ESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
Subjt:  ESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL

Query:  SEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGS
        SEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGS
Subjt:  SEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGS

Query:  KNELGFCRLSCKVCSPSS
        KNELGFCRLSCKVCSPSS
Subjt:  KNELGFCRLSCKVCSPSS

A0A5A7UCT9 Procollagen-proline 4-dioxygenase1.1e-16090.65Show/hide
Query:  YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP
        YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP
Subjt:  YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP

Query:  L-----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDL
        L                             DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDL
Subjt:  L-----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDL

Query:  SECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCR
        SECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCR
Subjt:  SECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCR

Query:  LSCKVCSPSS
        LSCKVCSPSS
Subjt:  LSCKVCSPSS

A0A5D3D1X2 Procollagen-proline 4-dioxygenase2.5e-165100Show/hide
Query:  YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP
        YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP
Subjt:  YESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLP

Query:  LDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTP
        LDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTP
Subjt:  LDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTP

Query:  DATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVCSPSS
        DATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVCSPSS
Subjt:  DATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVCSPSS

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase1.3e-13473.37Show/hide
Query:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-T
        M S F LAFS+ FL   PL + SANR PK+LL +    +SVIRMK  GS+I IDPTRV+QLSS+PRAFLYKGFLS EECQHLI LAK  L QSLV    T
Subjt:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-T

Query:  GESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK
        G S +S +RTSTGMFL KAQD IVA IE++IAAWTFLP+DNGEPIQILRYENGQ+Y PHFDFFQDP N+A GGHRIAT+LMYLS+VE+GGETVFP+SP K
Subjt:  GESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYM--
        + EEE  DL +C+  GYGV+PK GDALLFFS++PNVT D TSYHGSCPVIEGEKWSATKWIHMLP+DE+WRNP CVDEN+HCSAWAKAGEC+KNP YM  
Subjt:  LSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYM--

Query:  --MGSKNELGFCRLSCKVCSPSS
          +GSK ELG+CRLSCK CSP S
Subjt:  --MGSKNELGFCRLSCKVCSPSS

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 63.4e-9561.05Show/hide
Query:  AITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAA--GTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQIL
        + ++DPTR+ QLS  PRAFLYKGFLS EEC HLI LAKGKL +S+V A   +GES  S+ RTS+GMFL K QD IVA +E+++AAWTFLP +NGE +QIL
Subjt:  AITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAA--GTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQIL

Query:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCP
         YENGQKY+PHFD+F D   + +GGHRIAT+LMYLS+V KGGETVFPN   K  + +    S+CAK GY V+P+ GDALLFF+++ N T D  S HGSCP
Subjt:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCP

Query:  VIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC
        VIEGEKWSAT+WIH+    +  +   CVD+++ C  WA AGEC+KNP+YM+GS+  LGFCR SCK C
Subjt:  VIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC

F4JAU3 Prolyl 4-hydroxylase 22.3e-8355.81Show/hide
Query:  IDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYEN
        I+P++V Q+SSKPRAF+Y+GFL+  EC HLI LAK  L++S VA    GES  S  RTS+G F+ K +D IV+ IE +++ WTFLP +NGE +Q+LRYE+
Subjt:  IDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYEN

Query:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPV
        GQKY+ HFD+F D  NIA GGHRIAT+L+YLS+V KGGETVFP++     +   E K DLS+CAK G  V+PK G+ALLFF++  +  PD  S HG CPV
Subjt:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPV

Query:  IEGEKWSATKWIHMLPIDEVWRNPA-CVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC
        IEGEKWSATKWIH+   D++  +   C D N+ C  WA  GEC KNP YM+G+    G CR SCK C
Subjt:  IEGEKWSATKWIHMLPIDEVWRNPA-CVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC

Q8L970 Probable prolyl 4-hydroxylase 77.2e-10658.99Show/hide
Query:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-T
        M S   LAFS+ FL+ LPL S + NRF  +   +N    SVI+MKT  S+   DPTRV QLS  PR FLY+GFLS EEC H I LAKGKL +S+VA   +
Subjt:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-T

Query:  GESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK
        GESV S+ RTS+GMFL K QD IV+ +E+++AAWTFLP +NGE +QIL YENGQKYEPHFD+F D  N+ +GGHRIAT+LMYLS+VEKGGETVFP    K
Subjt:  GESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVW-RNPACVDENDHCSAWAKAGECKKNPVYMM
         ++ +    +ECAK GY V+P+ GDALLFF+++PN T D+ S HGSCPV+EGEKWSAT+WIH+   +  + +   C+DEN  C  WAKAGEC+KNP YM+
Subjt:  LSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVW-RNPACVDENDHCSAWAKAGECKKNPVYMM

Query:  GSKNELGFCRLSCKVCS
        GS  + G+CR SCK CS
Subjt:  GSKNELGFCRLSCKVCS

Q8LAN3 Probable prolyl 4-hydroxylase 49.8e-8755.72Show/hide
Query:  SAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQIL
        S++ ++P++V Q+SSKPRAF+Y+GFL+  EC H++ LAK  L++S VA   +GES  S+ RTS+G F+ K +D IV+ IE +I+ WTFLP +NGE IQ+L
Subjt:  SAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQIL

Query:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHG
        RYE+GQKY+ HFD+F D  NI  GGHR+ATILMYLS+V KGGETVFP++ +   ++  E K DLS+CAK G  V+P+ GDALLFF+++P+  PD  S HG
Subjt:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHG

Query:  SCPVIEGEKWSATKWIHMLPIDE-VWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC
         CPVIEGEKWSATKWIH+   D  V  +  C D N+ C  WA  GEC KNP YM+G+    G+CR SCK C
Subjt:  SCPVIEGEKWSATKWIHMLPIDE-VWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC

Q9LN20 Probable prolyl 4-hydroxylase 32.9e-6256.37Show/hide
Query:  LSSKPRAFLYKGFLSYEECQHLIHLAKGKL-RQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHF
        LS +PRAF+Y  FLS EEC++LI LAK  + + ++V + TG+S  S+ RTS+G FLR+ +DKI+  IE RIA +TF+P D+GE +Q+L YE GQKYEPH+
Subjt:  LSSKPRAFLYKGFLSYEECQHLIHLAKGKL-RQSLVAAGTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHF

Query:  DFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEE-KGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATK
        D+F D  N   GG R+AT+LMYLSDVE+GGETVFP + +  S      +LSEC K G  V+P++GDALLF+SM P+ T D TS HG CPVI G KWS+TK
Subjt:  DFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEE-KGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATK

Query:  WIHM
        W+H+
Subjt:  WIHM

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.6e-8455.81Show/hide
Query:  IDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYEN
        I+P++V Q+SSKPRAF+Y+GFL+  EC HLI LAK  L++S VA    GES  S  RTS+G F+ K +D IV+ IE +++ WTFLP +NGE +Q+LRYE+
Subjt:  IDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYEN

Query:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPV
        GQKY+ HFD+F D  NIA GGHRIAT+L+YLS+V KGGETVFP++     +   E K DLS+CAK G  V+PK G+ALLFF++  +  PD  S HG CPV
Subjt:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPV

Query:  IEGEKWSATKWIHMLPIDEVWRNPA-CVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC
        IEGEKWSATKWIH+   D++  +   C D N+ C  WA  GEC KNP YM+G+    G CR SCK C
Subjt:  IEGEKWSATKWIHMLPIDEVWRNPA-CVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase5.1e-10758.99Show/hide
Query:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-T
        M S   LAFS+ FL+ LPL S + NRF  +   +N    SVI+MKT  S+   DPTRV QLS  PR FLY+GFLS EEC H I LAKGKL +S+VA   +
Subjt:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-T

Query:  GESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK
        GESV S+ RTS+GMFL K QD IV+ +E+++AAWTFLP +NGE +QIL YENGQKYEPHFD+F D  N+ +GGHRIAT+LMYLS+VEKGGETVFP    K
Subjt:  GESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVW-RNPACVDENDHCSAWAKAGECKKNPVYMM
         ++ +    +ECAK GY V+P+ GDALLFF+++PN T D+ S HGSCPV+EGEKWSAT+WIH+   +  + +   C+DEN  C  WAKAGEC+KNP YM+
Subjt:  LSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVW-RNPACVDENDHCSAWAKAGECKKNPVYMM

Query:  GSKNELGFCRLSCKVCS
        GS  + G+CR SCK CS
Subjt:  GSKNELGFCRLSCKVCS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase6.1e-10055.38Show/hide
Query:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-T
        M S   LAFS+ FL+ LPL S + NRF  +   +N    SVI+MKT  S+   DPTRV QLS  PR FLY+GFLS EEC H I LAKGKL +S+VA   +
Subjt:  MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-T

Query:  GESVTSKERTS----TGMFLRKAQ----DKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGET
        GESV S++  S    +  F+        D IV+ +E+++AAWTFLP +NGE +QIL YENGQKYEPHFD+F D  N+ +GGHRIAT+LMYLS+VEKGGET
Subjt:  GESVTSKERTS----TGMFLRKAQ----DKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGET

Query:  VFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVW-RNPACVDENDHCSAWAKAGEC
        VFP    K ++ +    +ECAK GY V+P+ GDALLFF+++PN T D+ S HGSCPV+EGEKWSAT+WIH+   +  + +   C+DEN  C  WAKAGEC
Subjt:  VFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVW-RNPACVDENDHCSAWAKAGEC

Query:  KKNPVYMMGSKNELGFCRLSCKVCS
        +KNP YM+GS  + G+CR SCK CS
Subjt:  KKNPVYMMGSKNELGFCRLSCKVCS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase2.4e-9661.05Show/hide
Query:  AITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAA--GTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQIL
        + ++DPTR+ QLS  PRAFLYKGFLS EEC HLI LAKGKL +S+V A   +GES  S+ RTS+GMFL K QD IVA +E+++AAWTFLP +NGE +QIL
Subjt:  AITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAA--GTGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQIL

Query:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCP
         YENGQKY+PHFD+F D   + +GGHRIAT+LMYLS+V KGGETVFPN   K  + +    S+CAK GY V+P+ GDALLFF+++ N T D  S HGSCP
Subjt:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHGSCP

Query:  VIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC
        VIEGEKWSAT+WIH+    +  +   CVD+++ C  WA AGEC+KNP+YM+GS+  LGFCR SCK C
Subjt:  VIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.0e-8855.72Show/hide
Query:  SAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQIL
        S++ ++P++V Q+SSKPRAF+Y+GFL+  EC H++ LAK  L++S VA   +GES  S+ RTS+G F+ K +D IV+ IE +I+ WTFLP +NGE IQ+L
Subjt:  SAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-TGESVTSKERTSTGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQIL

Query:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHG
        RYE+GQKY+ HFD+F D  NI  GGHR+ATILMYLS+V KGGETVFP++ +   ++  E K DLS+CAK G  V+P+ GDALLFF+++P+  PD  S HG
Subjt:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDATSYHG

Query:  SCPVIEGEKWSATKWIHMLPIDE-VWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC
         CPVIEGEKWSATKWIH+   D  V  +  C D N+ C  WA  GEC KNP YM+G+    G+CR SCK C
Subjt:  SCPVIEGEKWSATKWIHMLPIDE-VWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCCATTTCTTCTCGCATTTTCTATCTTTTTCCTTTGGCTTTTACCCCTTTCTTCTCTCTCTGCCAACCGCTTCCCCAAAATGCTCTTACACAACAACGACAT
GTATGAATCTGTTATTAGGATGAAAACGGGTGGTTCCGCCATTACAATCGATCCCACTCGTGTCATTCAGCTTTCATCCAAACCCAGGGCTTTCTTATATAAGGGATTTT
TGTCTTATGAGGAGTGCCAGCATCTTATCCATTTGGCGAAGGGTAAGCTACGTCAATCATTGGTGGCGGCTGGAACAGGTGAGAGTGTTACAAGTAAAGAACGGACGAGT
ACTGGCATGTTTCTTCGCAAGGCCCAGGATAAAATAGTTGCTCGCATTGAGTCAAGGATTGCTGCATGGACTTTCCTTCCCCTTGATAATGGGGAGCCTATTCAGATACT
AAGGTATGAGAACGGACAGAAATATGAGCCACATTTTGATTTTTTTCAAGACCCAGGCAATATAGCGATTGGAGGTCACCGGATAGCCACAATCTTGATGTATTTATCCG
ATGTTGAAAAGGGTGGAGAAACAGTCTTTCCCAATTCTCCGGTTAAATTATCCGAGGAGGAAAAGGGTGACTTGTCTGAATGCGCTAAGGTTGGCTATGGAGTAAGACCA
AAGTTGGGTGATGCTTTACTGTTCTTCAGTATGAATCCAAATGTGACACCAGACGCGACCAGCTATCACGGGAGCTGCCCAGTGATAGAGGGTGAGAAATGGTCTGCAAC
TAAATGGATTCACATGCTTCCAATCGATGAAGTTTGGAGGAATCCAGCTTGTGTAGACGAAAATGACCACTGTAGTGCGTGGGCAAAAGCAGGTGAATGTAAAAAGAATC
CTGTTTATATGATGGGTTCTAAGAATGAACTTGGATTTTGTAGGTTGAGTTGCAAAGTATGCTCTCCTTCCTCATAG
mRNA sequenceShow/hide mRNA sequence
GCTTTTCAGAGAGTCAATGCAAAGCAATTTCCCGCTGGAAGAAACTTCCATTTCCATGGTTATCAACGGCGGGTTTGCAATTGCAGCAAATCCGCTTCTGGGTTCCCTAA
ATTTCTTCATAAATGAGTATGAACCCCAAGAATCGAATTTGTACCTTCCTCTCTCTTCTCCATTTTCGTTCTTTGTTTTGATTTCCATTTTCTACATACATTTTGTTTAC
GTCTCCCTTTCTCTCTTGAATTTCTGGCTTCCGTACCTCCATTTTCGCTATGGCTTCTCCATTTCTTCTCGCATTTTCTATCTTTTTCCTTTGGCTTTTACCCCTTTCTT
CTCTCTCTGCCAACCGCTTCCCCAAAATGCTCTTACACAACAACGACATGTATGAATCTGTTATTAGGATGAAAACGGGTGGTTCCGCCATTACAATCGATCCCACTCGT
GTCATTCAGCTTTCATCCAAACCCAGGGCTTTCTTATATAAGGGATTTTTGTCTTATGAGGAGTGCCAGCATCTTATCCATTTGGCGAAGGGTAAGCTACGTCAATCATT
GGTGGCGGCTGGAACAGGTGAGAGTGTTACAAGTAAAGAACGGACGAGTACTGGCATGTTTCTTCGCAAGGCCCAGGATAAAATAGTTGCTCGCATTGAGTCAAGGATTG
CTGCATGGACTTTCCTTCCCCTTGATAATGGGGAGCCTATTCAGATACTAAGGTATGAGAACGGACAGAAATATGAGCCACATTTTGATTTTTTTCAAGACCCAGGCAAT
ATAGCGATTGGAGGTCACCGGATAGCCACAATCTTGATGTATTTATCCGATGTTGAAAAGGGTGGAGAAACAGTCTTTCCCAATTCTCCGGTTAAATTATCCGAGGAGGA
AAAGGGTGACTTGTCTGAATGCGCTAAGGTTGGCTATGGAGTAAGACCAAAGTTGGGTGATGCTTTACTGTTCTTCAGTATGAATCCAAATGTGACACCAGACGCGACCA
GCTATCACGGGAGCTGCCCAGTGATAGAGGGTGAGAAATGGTCTGCAACTAAATGGATTCACATGCTTCCAATCGATGAAGTTTGGAGGAATCCAGCTTGTGTAGACGAA
AATGACCACTGTAGTGCGTGGGCAAAAGCAGGTGAATGTAAAAAGAATCCTGTTTATATGATGGGTTCTAAGAATGAACTTGGATTTTGTAGGTTGAGTTGCAAAGTATG
CTCTCCTTCCTCATAGAAAAGGAAATGCTTGTTTTTACTTATACACAATTCAGTGGTAGGAGTTTTTTCTTCTTACACATGTACACATGTAAATGACTTTGAGAGGCATC
ACTTAGTATATTTAACTATTGAATCTCTC
Protein sequenceShow/hide protein sequence
MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQLSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAGTGESVTSKERTS
TGMFLRKAQDKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRP
KLGDALLFFSMNPNVTPDATSYHGSCPVIEGEKWSATKWIHMLPIDEVWRNPACVDENDHCSAWAKAGECKKNPVYMMGSKNELGFCRLSCKVCSPSS