; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032155 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032155
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationscaffold11:38634771..38637982
RNA-Seq ExpressionSpg032155
SyntenySpg032155
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597483.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia]2.6e-16394.04Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF SCNLLFI S+SI LLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD IV DHT+C D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_008438765.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo]4.5e-16394.04Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+F+L+I  LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFD I RDHTNC DEN SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

XP_022137761.1 probable prolyl 4-hydroxylase 4 [Momordica charantia]8.2e-16594.37Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+F SC+LLF FSLSI LLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH++SLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD I+RDHTNCADE+ASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_022954026.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata]5.9e-16393.71Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF SCNLLFI S+SI LLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD IV DHT+C D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_022973641.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima]1.2e-16394.37Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF SCNLLFI S+SI LLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD IV DHT+C D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

TrEMBL top hitse value%identityAlignment
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase5.4e-16293.05Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+    NLLF+F+LSI  LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRS+VADNLSG+SKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFDM+VRDHTNC DEN SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

A0A1S3AWU7 Procollagen-proline 4-dioxygenase2.2e-16394.04Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+F+L+I  LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFD I RDHTNC DEN SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

A0A6J1C7M6 Procollagen-proline 4-dioxygenase4.0e-16594.37Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+F SC+LLF FSLSI LLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH++SLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD I+RDHTNCADE+ASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1GPQ8 Procollagen-proline 4-dioxygenase2.8e-16393.71Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF SCNLLFI S+SI LLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD IV DHT+C D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1I971 Procollagen-proline 4-dioxygenase5.7e-16494.37Show/hide
Query:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF SCNLLFI S+SI LLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD IV DHT+C D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 69.6e-9258.42Show/hide
Query:  FSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        FSLS+ L+  + S     S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHL+ LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E
Subjt:  FSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGD
         K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+ +    +   ++    S CAK+G AVKPRKGD
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+LH N   D +SLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

F4JAU3 Prolyl 4-hydroxylase 24.6e-12674.15Show/hide
Query:  LLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVS
        LLF+   +I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHL+SLAK  L+RSAVADN +GES+VS+VRTSSG FI K KDPIVS
Subjt:  LLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPR
        GIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE   DLSDCAKKGIAVKP+
Subjt:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I+    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK C
Subjt:  KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

Q8L970 Probable prolyl 4-hydroxylase 75.6e-10058.15Show/hide
Query:  FRSCNLLFIFSLSIF-----LLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSE
        F + +L F+F+L +        L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH + LAK +L++S VADN SGES  SE
Subjt:  FRSCNLLFIFSLSIF-----LLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSE

Query:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE
        VRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++
Subjt:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE

Query:  TNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSP
           D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKVC
        +  GYCRKSCK C
Subjt:  ELPGYCRKSCKVC

Q8LAN3 Probable prolyl 4-hydroxylase 44.7e-13177.82Show/hide
Query:  LFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +IF +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH+VSLAKA LKRSAVADN SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  +DLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD IV    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

Q9LN20 Probable prolyl 4-hydroxylase 34.1e-6655.98Show/hide
Query:  ISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        +SW PRAFVY  FL+  EC++L+SLAK  + +S V D+ +G+SK S VRTSSG F+ + +D I+  IE +IA +TF+P ++GE +QVL YE GQKY+ H+
Subjt:  ISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA
        DYF D+ N   GG RMAT+LMYLSDVE+GGETVFP+A  +    +     +LS+C KKG++VKPR GDALLF+S+ P+A  D +SLHGGCPVI G KWS+
Subjt:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA

Query:  TKWIHVDSF
        TKW+HV  +
Subjt:  TKWIHVDSF

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 23.3e-12774.15Show/hide
Query:  LLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVS
        LLF+   +I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHL+SLAK  L+RSAVADN +GES+VS+VRTSSG FI K KDPIVS
Subjt:  LLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPR
        GIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE   DLSDCAKKGIAVKP+
Subjt:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I+    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK C
Subjt:  KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase4.0e-10158.15Show/hide
Query:  FRSCNLLFIFSLSIF-----LLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSE
        F + +L F+F+L +        L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH + LAK +L++S VADN SGES  SE
Subjt:  FRSCNLLFIFSLSIF-----LLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSE

Query:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE
        VRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++
Subjt:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE

Query:  TNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSP
           D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKVC
        +  GYCRKSCK C
Subjt:  ELPGYCRKSCKVC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.6e-9455.45Show/hide
Query:  FRSCNLLFIFSLSIF-----LLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGES----
        F + +L F+F+L +        L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH + LAK +L++S VADN SGES    
Subjt:  FRSCNLLFIFSLSIF-----LLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGES----

Query:  -KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEE
          VS VR SS    +      D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP    
Subjt:  -KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEE

Query:  SQRRQASETNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNN
          + +A++   D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  N
Subjt:  SQRRQASETNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNN

Query:  PEYMVGSPELPGYCRKSCKVC
        P YMVGS +  GYCRKSCK C
Subjt:  PEYMVGSPELPGYCRKSCKVC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase6.8e-9358.42Show/hide
Query:  FSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        FSLS+ L+  + S     S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHL+ LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E
Subjt:  FSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGD
         K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+ +    +   ++    S CAK+G AVKPRKGD
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+LH N   D +SLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.4e-13277.82Show/hide
Query:  LFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +IF +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH+VSLAKA LKRSAVADN SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  +DLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD IV    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAATTTCGTAGTTGTAATCTACTGTTCATCTTCTCATTATCGATCTTCTTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTTGTGTACGAAGGTTTTCTCACGGACTTGGAATGCGATCATCTCGTCTCGCTTGCTAAAGCGGAGTTGA
AGAGATCTGCTGTTGCGGATAATTTGTCTGGAGAGAGTAAGGTCAGCGAAGTCCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTGCAGCATGGACATTCCTGCCAAAAGAAAATGGGGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTACGATGCACACTTTGATTACTTTGC
TGATAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACTGTTCTTATGTATCTTTCCGACGTAGAAAAAGGCGGTGAAACAGTGTTTCCTTCTGCAGAGGAATCTC
AAAGACGCCAGGCTTCTGAAACAAACAAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTGAAACCACGGAAAGGCGACGCTCTTCTCTTCTTCAGTCTTCATCCA
AATGCTATTCCAGACACGAGTAGTCTGCACGGAGGTTGCCCTGTGATTGAAGGTGAAAAATGGTCAGCAACGAAGTGGATTCATGTCGATTCTTTCGACATGATTGTGAG
AGACCATACCAATTGTGCTGATGAGAATGCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCGGAGTATATGGTGGGATCCCCTGAGCTTCCTGGCT
ACTGCAGGAAAAGTTGTAAGGTGTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAATTTCGTAGTTGTAATCTACTGTTCATCTTCTCATTATCGATCTTCTTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTTGTGTACGAAGGTTTTCTCACGGACTTGGAATGCGATCATCTCGTCTCGCTTGCTAAAGCGGAGTTGA
AGAGATCTGCTGTTGCGGATAATTTGTCTGGAGAGAGTAAGGTCAGCGAAGTCCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTGCAGCATGGACATTCCTGCCAAAAGAAAATGGGGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTACGATGCACACTTTGATTACTTTGC
TGATAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACTGTTCTTATGTATCTTTCCGACGTAGAAAAAGGCGGTGAAACAGTGTTTCCTTCTGCAGAGGAATCTC
AAAGACGCCAGGCTTCTGAAACAAACAAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTGAAACCACGGAAAGGCGACGCTCTTCTCTTCTTCAGTCTTCATCCA
AATGCTATTCCAGACACGAGTAGTCTGCACGGAGGTTGCCCTGTGATTGAAGGTGAAAAATGGTCAGCAACGAAGTGGATTCATGTCGATTCTTTCGACATGATTGTGAG
AGACCATACCAATTGTGCTGATGAGAATGCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCGGAGTATATGGTGGGATCCCCTGAGCTTCCTGGCT
ACTGCAGGAAAAGTTGTAAGGTGTGTTGA
Protein sequenceShow/hide protein sequence
MAKFRSCNLLFIFSLSIFLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGI
EDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGDALLFFSLHP
NAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC