CuGenDBv2

Pay0018565 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0018565
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: chr06:4170212..4173416
RNA-Seq Expression: Pay0018565
Synteny: Pay0018565
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
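
The genome location listed for this gene has the form chromosome:start..end. A minimal sketch of parsing such a string and deriving the locus span, assuming the coordinates are 1-based and inclusive (the page does not state the convention, so this is an assumption); the function names are hypothetical and not part of CuGenDB:

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start, end):
    """Locus length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1
```

Under that assumption, chr06:4170212..4173416 spans 3205 bp.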


Homology
GenBank top hits (e value, % identity, alignment)
KAA0049426.1 putative prolyl 4-hydroxylase 4 [Cucumis melo var. makuwa] | e value: 1.1e-169 | % identity: 99.01
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISW   AFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

XP_004134175.2 probable prolyl 4-hydroxylase 4 [Cucumis sativus] | e value: 6.7e-167 | % identity: 96.04
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAE L FNLLFLFTL+IS+LL+RASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSG+SKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFD + RDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

XP_008438765.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] | e value: 2.8e-173 | % identity: 100
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

XP_022137761.1 probable prolyl 4-hydroxylase 4 [Momordica charantia] | e value: 1.6e-160 | % identity: 92.05
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA F   +LLF F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI RDHTNC DE+ SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

XP_038903083.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida] | e value: 2.7e-163 | % identity: 94.39
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA+F  FNLLF F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISW+PRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPD SSLHGGCPVIEGEKWSATKWIHVDSFD I RDHT+C DENPSCERWAEL ECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        AC+
Subjt:  ACS
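
The % identity values in these hit tables can be checked against the alignments. A minimal sketch, assuming the unlabeled middle row of each alignment block is the usual BLAST match line (a letter marks an identical residue, `+` a conservative substitution, a blank a mismatch or gap) and that identity is computed over the full aligned length; the function name is hypothetical:

```python
def percent_identity(query_rows, match_rows):
    """Percent identity from paired query rows and match-line rows.

    Aligned length is taken from the query rows because trailing blanks
    on a match line may have been stripped during extraction.
    """
    aligned = 0
    identical = 0
    for query, match in zip(query_rows, match_rows):
        aligned += len(query)
        # Only letters on the match line count as identical positions.
        identical += sum(1 for ch in match if ch.isalpha())
    if aligned == 0:
        raise ValueError("empty alignment")
    return round(100.0 * identical / aligned, 2)
```

For the KAA0049426.1 hit, the match line has three blank positions over 303 aligned residues, so this gives 300/303, or 99.01, which agrees with the listed value.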

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase | e value: 3.3e-167 | % identity: 96.04
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAE L FNLLFLFTL+IS+LL+RASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSG+SKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFD + RDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

A0A1S3AWU7 Procollagen-proline 4-dioxygenase | e value: 1.4e-173 | % identity: 100
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

A0A5A7U593 Procollagen-proline 4-dioxygenase | e value: 5.4e-170 | % identity: 99.01
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISW   AFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

A0A6J1C7M6 Procollagen-proline 4-dioxygenase | e value: 7.8e-161 | % identity: 92.05
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA F   +LLF F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI RDHTNC DE+ SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

A0A6J1I971 Procollagen-proline 4-dioxygenase | e value: 4.3e-159 | % identity: 91.39
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+ +++IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI  DHT+C D N SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e value: 3.7e-91 | % identity: 60.29
Query:  SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENG
        S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E K+AAWTFLP+ENG
Subjt:  SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENG

Query:  EDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD
        E +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+ +    +   ++    S CAK+G AVKPRKGDALLFF+LH N   D
Subjt:  EDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD

Query:  TSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
         +SLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKAC
Subjt:  TSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 2 | e value: 6.6e-125 | % identity: 74.23
Query:  LFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        L  + I  +L ++S     S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RS+VADN +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  LFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGD
        DK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE   DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I     NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKAC
Subjt:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 7 | e value: 4.6e-102 | % identity: 60.19
Query:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSE
        FL F+L FLFTL +     +  L R+S +  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SGES  SE
Subjt:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSE

Query:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE
        VRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++
Subjt:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE

Query:  TNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSP
           D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKACS
        +  GYCRKSCKACS
Subjt:  ELPGYCRKSCKACS

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e value: 2.6e-129 | % identity: 76.77
Query:  RFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDP
        R  LL  F    S LL ++S S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRS+VADN SGESK SEVRTSSG FI K KDP
Subjt:  RFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDP

Query:  IVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAV
        IVSGIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  +DLSDCAK+GIAV
Subjt:  IVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAV

Query:  KPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KPRKGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I     NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKAC
Subjt:  KPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 3 | e value: 2.4e-66 | % identity: 56.46
Query:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        +SW PRAFVY  FL+  EC++LISLAK  + +S+V D+ +G+SK S VRTSSG F+ + +D I+  IE +IA +TF+P ++GE +QVL YE GQKY+ H+
Subjt:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA
        DYF D+ N   GG RMAT+LMYLSDVE+GGETVFP+A  +    +     +LS+C KKG++VKPR GDALLF+S+ P+A  D +SLHGGCPVI G KWS+
Subjt:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA

Query:  TKWIHVDSF
        TKW+HV  +
Subjt:  TKWIHVDSF

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e value: 4.7e-126 | % identity: 74.23
Query:  LFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        L  + I  +L ++S     S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RS+VADN +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  LFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGD
        DK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE   DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I     NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKAC
Subjt:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e value: 3.3e-103 | % identity: 60.19
Query:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSE
        FL F+L FLFTL +     +  L R+S +  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SGES  SE
Subjt:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSE

Query:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE
        VRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++
Subjt:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE

Query:  TNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSP
           D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKACS
        +  GYCRKSCKACS
Subjt:  ELPGYCRKSCKACS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e value: 1.3e-96 | % identity: 57.45
Query:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGES----
        FL F+L FLFTL +     +  L R+S +  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SGES    
Subjt:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGES----

Query:  -KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEE
          VS VR SS    +      D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP    
Subjt:  -KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEE

Query:  SQRRQASETNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNN
          + +A++   D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  N
Subjt:  SQRRQASETNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNN

Query:  PEYMVGSPELPGYCRKSCKACS
        P YMVGS +  GYCRKSCKACS
Subjt:  PEYMVGSPELPGYCRKSCKACS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e value: 2.6e-92 | % identity: 60.29
Query:  SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENG
        S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E K+AAWTFLP+ENG
Subjt:  SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENG

Query:  EDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD
        E +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+ +    +   ++    S CAK+G AVKPRKGDALLFF+LH N   D
Subjt:  EDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD

Query:  TSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
         +SLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKAC
Subjt:  TSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 1.8e-130 | % identity: 76.77
Query:  RFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDP
        R  LL  F    S LL ++S S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRS+VADN SGESK SEVRTSSG FI K KDP
Subjt:  RFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDP

Query:  IVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAV
        IVSGIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  +DLSDCAK+GIAV
Subjt:  IVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAV

Query:  KPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KPRKGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I     NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKAC
Subjt:  KPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC


Sequences
CDS sequence
ATGGCTGAATTTCTCCGTTTCAATCTACTTTTTCTTTTCACATTAACCATTTCCTACCTTCTCCGGCGAGCTTCAGCCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
CCCTGCAAAAGTCAAACAGATTTCCTGGTCTCCTCGGGCTTTTGTCTATGAAGGTTTTCTCACGGATTTAGAATGCGATCATCTCATTTCCCTTGCTAAAGCGGAGCTGA
AGAGATCCTCTGTTGCGGATAATTTGTCCGGAGAGAGCAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATTCAAGTATTGAGATATGAATATGGGCAGAAGTACGATGCTCACTTTGATTACTTTGC
TGACAAGGTTAACATTGCCCGAGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCCGACGTGGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCAGAGGAATCTC
AAAGACGTCAGGCTTCCGAAACAAACCAAGATCTCTCAGATTGTGCAAAGAAAGGCATAGCAGTAAAACCTCGGAAAGGCGATGCTCTTCTCTTCTTCAGCCTCCATCCA
AATGCTATTCCAGACACAAGTAGTCTACATGGTGGGTGCCCTGTGATTGAAGGTGAGAAATGGTCAGCAACAAAGTGGATTCATGTTGATTCCTTCGACACGATCGCGAG
AGACCATACAAATTGTGGTGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCAGAGTATATGGTGGGATCTCCTGAGCTTCCTGGCT
ACTGCAGGAAAAGTTGCAAGGCATGCTCATAA
mRNA sequence
ACTTTAGAAAATTTGACAGTTAACAGCTTGAAGAAGAGATGGAAATCTGAAAATTTTGTGAAGAAAAAGAAAATTTTTATTGCATTTTTCTTCTTCTTCTTCTTCATTTC
TTCTCTTTCTCTTTTGGTCCGATTCAGTTCCATGGCTGAATTTCTCCGTTTCAATCTACTTTTTCTTTTCACATTAACCATTTCCTACCTTCTCCGGCGAGCTTCAGCCT
CCTATGCAGGTTCCGCTAGCTCAATCGTCAACCCTGCAAAAGTCAAACAGATTTCCTGGTCTCCTCGGGCTTTTGTCTATGAAGGTTTTCTCACGGATTTAGAATGCGAT
CATCTCATTTCCCTTGCTAAAGCGGAGCTGAAGAGATCCTCTGTTGCGGATAATTTGTCCGGAGAGAGCAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATTCA
TAAAGCCAAGGATCCTATTGTTTCTGGTATAGAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATTCAAGTATTGAGATATGAATATGGGC
AGAAGTACGATGCTCACTTTGATTACTTTGCTGACAAGGTTAACATTGCCCGAGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCCGACGTGGAAAAAGGCGGT
GAAACTGTGTTTCCTTCTGCAGAGGAATCTCAAAGACGTCAGGCTTCCGAAACAAACCAAGATCTCTCAGATTGTGCAAAGAAAGGCATAGCAGTAAAACCTCGGAAAGG
CGATGCTCTTCTCTTCTTCAGCCTCCATCCAAATGCTATTCCAGACACAAGTAGTCTACATGGTGGGTGCCCTGTGATTGAAGGTGAGAAATGGTCAGCAACAAAGTGGA
TTCATGTTGATTCCTTCGACACGATCGCGAGAGACCATACAAATTGTGGTGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCAGAG
TATATGGTGGGATCTCCTGAGCTTCCTGGCTACTGCAGGAAAAGTTGCAAGGCATGCTCATAAACTTGGTCCATTCTTTATAATGAGCATTCCCTTGCCTTTTCGTTTTT
GTACCAAAAACAGAAATGTTAGTTTTTCGAGAGCATTTTCATTGAAATGTTTTGTGAGAGTTAGCATTTGTATTGATTACTCAATCATGTAACATTATTTGAACACTAAC
GTAGTTTGCAAAGTTATTTTGGTTTGTATGGATCGTCCAATTATAACTCAGTTAAAGACCTATTTGGATTGATAAGTGCTTAAATATACATTTAGGAG
Protein sequence
MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGI
EDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHP
NAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
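
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS. A minimal sketch, assuming the standard genetic code applies (the page does not state which translation table was used) and that the CDS starts in frame; `translate` is a hypothetical helper, not part of CuGenDB:

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Remove line breaks and spaces so the wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)
```

The first six codons of the CDS, `ATGGCTGAATTTCTCCGT`, translate to `MAEFLR`, which matches the start of the protein sequence. The CDS is 912 nt long, consistent with 303 residues plus a stop codon.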