CuGenDBv2

HG10012619 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10012619
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: Chr01:22898613..22901514
RNA-Seq Expression: HG10012619
Synteny: HG10012619
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
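
The genome location field above uses a `sequence:start..end` layout. The following is a minimal sketch of splitting such a string into its parts and computing the span it covers. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings such as "Chr01:22898613..22901514".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("Chr01:22898613..22901514")
print(seqid, start, end, span_length(start, end))
# prints: Chr01 22898613 22901514 2902
```

Under that coordinate assumption the gene spans 2902 positions on Chr01.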


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_004134175.2 probable prolyl 4-hydroxylase 4 [Cucumis sativus] | e value: 7.0e-164 | % identity: 94.06
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+L CFNLLF F+LSIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRS+VADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLS++EKGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDT+SLHGGCPVIEGEKWSATKWI VDSFD +VRDHTNC DENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

XP_008438765.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] | e value: 2.7e-163 | % identity: 94.06
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+   FNLLF F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRS+VADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLS++EKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDT+SLHGGCPVIEGEKWSATKWIHVDSFDTI RDHTNC DENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

XP_022137761.1 probable prolyl 4-hydroxylase 4 [Momordica charantia] | e value: 3.5e-163 | % identity: 93.71
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+ C  +LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAK ELKRSAVADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLSN+EKGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI+RDHTNC DE+ SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

XP_022973641.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima] | e value: 1.5e-161 | % identity: 93.38
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MAK C  NLLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRSAVAD+LSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLSN+EKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTIV DHT+C D N SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

XP_038903083.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida] | e value: 5.3e-164 | % identity: 94.06
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MAK  CFNLLFFFSLSIS LLRRASSSYAGSASSIVNPAKVKQISW+PRAFVYEGFLTDLECDHLISLAK ELKRS+VADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLS++EKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPD +SLHGGCPVIEGEKWSATKWIHVDSFD I+RDHT+C DENPSCERWAEL ECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        AC+
Subjt:  ACS
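
The alignments above follow the usual BLAST-style layout, where the unlabeled middle line shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts identities and positives from one such block, assuming that layout holds here. The helper names `midline_stats` and `percent_identity` are hypothetical, and the query line is used to fix the block length because trailing spaces in the midline may have been stripped.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block.

    Assumes a BLAST-style midline: letter = identity, '+' = positive,
    space = mismatch or gap. The midline is padded to the query length.
    """
    padded = midline.ljust(len(query))
    identities = sum(1 for ch in padded if ch.isalpha())
    positives = identities + padded.count("+")
    return {"length": len(query), "identities": identities, "positives": positives}


def percent_identity(query, midline):
    """Identities as a percentage of the aligned block length."""
    stats = midline_stats(query, midline)
    return 100.0 * stats["identities"] / stats["length"]


# Final three-residue block of the last alignment above: ACS / AC+ / ACS.
print(midline_stats("ACS", "AC+"))
# prints: {'length': 3, 'identities': 2, 'positives': 3}
```

To compare against the reported % identity column, the counts would need to be summed over every block of a hit before dividing by the total aligned length.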

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase | e value: 3.4e-164 | % identity: 94.06
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+L CFNLLF F+LSIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRS+VADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLS++EKGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDT+SLHGGCPVIEGEKWSATKWI VDSFD +VRDHTNC DENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

A0A1S3AWU7 Procollagen-proline 4-dioxygenase | e value: 1.3e-163 | % identity: 94.06
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+   FNLLF F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRS+VADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLS++EKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDT+SLHGGCPVIEGEKWSATKWIHVDSFDTI RDHTNC DENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

A0A6J1C7M6 Procollagen-proline 4-dioxygenase | e value: 1.7e-163 | % identity: 93.71
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+ C  +LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAK ELKRSAVADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLSN+EKGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI+RDHTNC DE+ SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

A0A6J1GPQ8 Procollagen-proline 4-dioxygenase | e value: 3.5e-161 | % identity: 92.72
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MAK C  NLLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRSAVAD+LSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLSN+EKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTIV DHT+C D N SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

A0A6J1I971 Procollagen-proline 4-dioxygenase | e value: 7.0e-162 | % identity: 93.38
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MAK C  NLLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRSAVAD+LSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLSN+EKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTIV DHT+C D N SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

SwissProt top hits (columns: hit, e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e value: 1.9e-92 | % identity: 59.04
Query:  FFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRS-AVADNLSGASKVSEVRTSSGAFIHKAKDPIVSG
        +F + S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SG S+ SEVRTSSG F+ K +D IV+ 
Subjt:  FFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRS-AVADNLSGASKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLSN+ KGGETVFP+    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        GDALLFF+LH N   D NSLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKAC
Subjt:  GDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 2 | e value: 2.1e-126 | % identity: 74.83
Query:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS
        LL F  ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVADN +G S+VS+VRTSSG FI K KDPIVS
Subjt:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR
        GIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLSN+ KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+
Subjt:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I+    NCTD N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKAC
Subjt:  KGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 7 | e value: 3.9e-101 | % identity: 59.49
Query:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRT
        F+L F F+L +     +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SG S  SEVRT
Subjt:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRT

Query:  SSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNE
        SSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSN+EKGGETVFP      + +A++  +
Subjt:  SSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNE

Query:  D-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELP
        D  ++CAK+G AVKPRKGDALLFF+LHPNA  D+NSLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS +  
Subjt:  D-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELP

Query:  GYCRKSCKACS
        GYCRKSCKACS
Subjt:  GYCRKSCKACS

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e value: 4.0e-130 | % identity: 77.89
Query:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS
        LL  F    S+LL ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAK  LKRSAVADN SG SK SEVRTSSG FI K KDPIVS
Subjt:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR
        GIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLSN+ KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPR
Subjt:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD IV    NCTD N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKAC
Subjt:  KGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 3 | e value: 3.5e-65 | % identity: 55.5
Query:  ISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        +SW PRAFVY  FL+  EC++LISLAK  + +S V D+ +G SK S VRTSSG F+ + +D I+  IE +IA +TF+P ++GE +QVL YE GQKY+ H+
Subjt:  ISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSA
        DYF D+ N   GG RMAT+LMYLS++E+GGETVFP+A  +    +     +LS+C KKG++VKPR GDALLF+S+ P+A  D  SLHGGCPVI G KWS+
Subjt:  DYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSA

Query:  TKWIHVDSF
        TKW+HV  +
Subjt:  TKWIHVDSF

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e value: 1.5e-127 | % identity: 74.83
Query:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS
        LL F  ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVADN +G S+VS+VRTSSG FI K KDPIVS
Subjt:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR
        GIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLSN+ KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+
Subjt:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I+    NCTD N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKAC
Subjt:  KGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e value: 2.8e-102 | % identity: 59.49
Query:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRT
        F+L F F+L +     +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SG S  SEVRT
Subjt:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRT

Query:  SSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNE
        SSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSN+EKGGETVFP      + +A++  +
Subjt:  SSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNE

Query:  D-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELP
        D  ++CAK+G AVKPRKGDALLFF+LHPNA  D+NSLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS +  
Subjt:  D-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELP

Query:  GYCRKSCKACS
        GYCRKSCKACS
Subjt:  GYCRKSCKACS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e value: 1.1e-95 | % identity: 56.74
Query:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGAS-----KV
        F+L F F+L +     +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SG S      V
Subjt:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGAS-----KV

Query:  SEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQR
        S VR SS    +      D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSN+EKGGETVFP      +
Subjt:  SEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQR

Query:  RQASETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEY
         +A++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D+NSLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP Y
Subjt:  RQASETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCKACS
        MVGS +  GYCRKSCKACS
Subjt:  MVGSPELPGYCRKSCKACS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e value: 1.4e-93 | % identity: 59.04
Query:  FFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRS-AVADNLSGASKVSEVRTSSGAFIHKAKDPIVSG
        +F + S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SG S+ SEVRTSSG F+ K +D IV+ 
Subjt:  FFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRS-AVADNLSGASKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLSN+ KGGETVFP+    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        GDALLFF+LH N   D NSLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKAC
Subjt:  GDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 2.9e-131 | % identity: 77.89
Query:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS
        LL  F    S+LL ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAK  LKRSAVADN SG SK SEVRTSSG FI K KDPIVS
Subjt:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR
        GIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLSN+ KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPR
Subjt:  GIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD IV    NCTD N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKAC
Subjt:  KGDALLFFSLHPNAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC


Sequences
CDS sequence
ATGGCGAAATTGTGCTGTTTCAATCTACTGTTTTTCTTCTCATTATCGATCTCATTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCTCGGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCCCTCGCTAAAACGGAGTTGA
AGAGATCTGCTGTCGCGGATAATTTGTCCGGAGCGAGCAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGAGAATGGAGAAGACATTCAAGTGTTGAGATATGAATACGGGCAGAAGTACGATGCACACTTTGATTACTTTGC
TGACAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACTGTTCTCATGTATCTTTCCAACATAGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCAGAGGAATCTC
AAAGACGCCAGGCTTCTGAAACAAACGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTTAAACCACGAAAAGGCGATGCTCTTCTCTTCTTCAGTCTCCATCCA
AATGCTATTCCAGACACAAATAGTCTACATGGCGGGTGCCCTGTGATTGAAGGTGAAAAATGGTCCGCAACGAAGTGGATTCATGTCGATTCTTTCGACACGATCGTGAG
AGACCATACGAACTGCACTGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTTGGCGAGTGCACGAATAACCCGGAGTATATGGTCGGATCTCCCGAGCTTCCTGGCT
ACTGCAGGAAAAGTTGTAAGGCATGTTCATAA
mRNA sequence
ATGGCGAAATTGTGCTGTTTCAATCTACTGTTTTTCTTCTCATTATCGATCTCATTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCTCGGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCCCTCGCTAAAACGGAGTTGA
AGAGATCTGCTGTCGCGGATAATTTGTCCGGAGCGAGCAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGAGAATGGAGAAGACATTCAAGTGTTGAGATATGAATACGGGCAGAAGTACGATGCACACTTTGATTACTTTGC
TGACAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACTGTTCTCATGTATCTTTCCAACATAGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCAGAGGAATCTC
AAAGACGCCAGGCTTCTGAAACAAACGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTTAAACCACGAAAAGGCGATGCTCTTCTCTTCTTCAGTCTCCATCCA
AATGCTATTCCAGACACAAATAGTCTACATGGCGGGTGCCCTGTGATTGAAGGTGAAAAATGGTCCGCAACGAAGTGGATTCATGTCGATTCTTTCGACACGATCGTGAG
AGACCATACGAACTGCACTGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTTGGCGAGTGCACGAATAACCCGGAGTATATGGTCGGATCTCCCGAGCTTCCTGGCT
ACTGCAGGAAAAGTTGTAAGGCATGTTCATAA
Protein sequence
MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVSGI
EDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNIEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHP
NAIPDTNSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCTDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
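
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code and a CDS that starts in frame at its first base. `translate` is a hypothetical helper written for this illustration, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the CDS above.
print(translate("ATGGCGAAATTGTGCTGTTTC"))
# prints: MAKLCCF
```

Passing the full CDS text, line breaks included, and comparing the result with the protein sequence gives a quick consistency check on the record.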