CuGenDBv2

PI0026236 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0026236
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: chr11:4559496..4563540
RNA-Seq Expression: PI0026236
Synteny: PI0026236
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
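The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr11:4559496..4563540")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # chr11 4559496 4563540 4045
```

Under that assumption the gene model spans 4045 bp on chr11, which is longer than the mRNA sequence listed further down and is consistent with the locus containing introns.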


Homology
GenBank top hits (e value, % identity, alignment)
KAG6575033.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.1e-161; identity: 93.33%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        M KFRNLLF+FLILISS VRES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDL+ECA++G
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+CTD NESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_004144967.1 probable prolyl 4-hydroxylase 4 [Cucumis sativus] (e value: 1.1e-169; identity: 97.67%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKF NLLFIFLIL SSF+RESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVT+GGETVFPLAEK SHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_008458517.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] (e value: 2.6e-171; identity: 98.33%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKFRNLLF FLILISSFVRESTCSYAGSASATVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_022959148.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata] (e value: 1.4e-161; identity: 93.33%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        M KFRNLLF+FLILISS VRES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDLSECA++G
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+C+D NESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_038874583.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida] (e value: 3.2e-169; identity: 97%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKFRNLLFIFLILIS  VRESTCSYAGSAS+TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHG+KYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKS+HRRAYETDEDLSECA+KG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCQ5 Procollagen-proline 4-dioxygenase (e value: 5.3e-170; identity: 97.67%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKF NLLFIFLIL SSF+RESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVT+GGETVFPLAEK SHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A1S3C816 Procollagen-proline 4-dioxygenase (e value: 1.3e-171; identity: 98.33%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKFRNLLF FLILISSFVRESTCSYAGSASATVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A5A7SVW6 Procollagen-proline 4-dioxygenase (e value: 1.3e-171; identity: 98.33%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKFRNLLF FLILISSFVRESTCSYAGSASATVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A6J1H545 Procollagen-proline 4-dioxygenase (e value: 7.0e-162; identity: 93.33%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        M KFRNLLF+FLILISS VRES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDLSECA++G
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+C+D NESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A6J1L4G1 Procollagen-proline 4-dioxygenase (e value: 1.7e-160; identity: 93%)
Query:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        M KFR LLF+FLILISS VRES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDLSECA++G
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+CTD NESCERWAALGECTKNPEYMVGS E+PGYCRRSCRIC
Subjt:  VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 (e value: 8.7e-93; identity: 58.84%)
Query:  SYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENG
        S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENG
Subjt:  SYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENG

Query:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPD
        E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLSNVTKGGETVFP       +     D+  S+CAK+G AVKP+KGDALLFF+L  N   D
Subjt:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPD

Query:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
         NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

F4JAU3 Prolyl 4-hydroxylase 2 (e value: 1.7e-125; identity: 72.79%)
Query:  LLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVS
        LLF+ ++L+   ++ STC    S S+ ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVS
Subjt:  LLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVS

Query:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPK
        GIEDK+S WTFLPKENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLSNVTKGGETVFP A++ S R   E  +DLS+CAKKG+AVKPK
Subjt:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        KG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + GNCTD NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

Q8L970 Probable prolyl 4-hydroxylase 7 (e value: 1.6e-99; identity: 57.79%)
Query:  NLLFIF-LILISS----FVRESTCSYAGS--------ASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTS
        +L F+F L LISS    F+  S+ +  GS        +S   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTS
Subjt:  NLLFIF-LILISS----FVRESTCSYAGS--------ASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTS

Query:  SGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDED
        SGMF+SK +D IVS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +  + +     D+ 
Subjt:  SGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDED

Query:  LSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGY
         +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D+N SCE+WA  GEC KNP YMVGS +  GY
Subjt:  LSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGY

Query:  CRRSCRIC
        CR+SC+ C
Subjt:  CRRSCRIC

Q8LAN3 Probable prolyl 4-hydroxylase 4 (e value: 6.3e-128; identity: 74.32%)
Query:  RNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPI
        R LL  F  + S  ++ ST S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPI

Query:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVK
        VSGIEDKIS WTFLPKENGEDIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLSNVTKGGETVFP AE  S R   E  EDLS+CAK+G+AVK
Subjt:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVK

Query:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        P+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Subjt:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

Q9LN20 Probable prolyl 4-hydroxylase 3 (e value: 2.2e-64; identity: 54.55%)
Query:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHY
        +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+++GKSK S VRTSSG F+ + +D I+  IE +I+ +TF+P ++GE +QVL YE GQKYE HY
Subjt:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHY

Query:  DYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSA
        DYFVD+ N   GG R+AT+LMYLS+V +GGETVFP A  +     +    +LSEC KKG++VKP+ GDALLF+S+ P+A  D  SLHGGCPV+ G KWS+
Subjt:  DYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSA

Query:  TKWIHVDSF
        TKW+HV  +
Subjt:  TKWIHVDSF

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 (e value: 1.2e-126; identity: 72.79%)
Query:  LLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVS
        LLF+ ++L+   ++ STC    S S+ ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVS
Subjt:  LLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVS

Query:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPK
        GIEDK+S WTFLPKENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLSNVTKGGETVFP A++ S R   E  +DLS+CAKKG+AVKPK
Subjt:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        KG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + GNCTD NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase (e value: 1.2e-100; identity: 57.79%)
Query:  NLLFIF-LILISS----FVRESTCSYAGS--------ASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTS
        +L F+F L LISS    F+  S+ +  GS        +S   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTS
Subjt:  NLLFIF-LILISS----FVRESTCSYAGS--------ASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTS

Query:  SGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDED
        SGMF+SK +D IVS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +  + +     D+ 
Subjt:  SGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDED

Query:  LSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGY
         +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D+N SCE+WA  GEC KNP YMVGS +  GY
Subjt:  LSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGY

Query:  CRRSCRIC
        CR+SC+ C
Subjt:  CRRSCRIC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase (e value: 6.1e-94; identity: 54.43%)
Query:  NLLFIF-LILISS----FVRESTCSYAGS--------ASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKS-----KLS
        +L F+F L LISS    F+  S+ +  GS        +S   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S      +S
Subjt:  NLLFIF-LILISS----FVRESTCSYAGS--------ASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKS-----KLS

Query:  TVRTSSGMFISKNK---DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHR
         VR SS    + +    D IVS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +  + +
Subjt:  TVRTSSGMFISKNK---DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHR

Query:  RAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMV
             D+  +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D+N SCE+WA  GEC KNP YMV
Subjt:  RAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMV

Query:  GSPEMPGYCRRSCRIC
        GS +  GYCR+SC+ C
Subjt:  GSPEMPGYCRRSCRIC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase (e value: 6.1e-94; identity: 58.84%)
Query:  SYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENG
        S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENG
Subjt:  SYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENG

Query:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPD
        E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLSNVTKGGETVFP       +     D+  S+CAK+G AVKP+KGDALLFF+L  N   D
Subjt:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPD

Query:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
         NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 4.5e-129; identity: 74.32%)
Query:  RNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPI
        R LL  F  + S  ++ ST S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPI

Query:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVK
        VSGIEDKIS WTFLPKENGEDIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLSNVTKGGETVFP AE  S R   E  EDLS+CAK+G+AVK
Subjt:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVK

Query:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        P+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Subjt:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
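
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below shows one way to tally those marks for a single alignment block. The function name `midline_stats` is hypothetical, and the percentages reported on the page come from the search tool over the full alignment, so this only approximates how they could be reproduced.

```python
def midline_stats(query: str, midline: str):
    """Return (identities, positives, length) for one alignment block.

    The query line sets the block length, because trailing spaces on the
    match line are easily lost when the page is copied as text.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, length


# First 20 columns of the KAG6575033.1 alignment (query line, match line).
ident, pos, length = midline_stats("MFKFRNLLFIFLILISSFVR",
                                   "M KFRNLLF+FLILISS VR")
print(f"{ident}/{length} identical ({100 * ident / length:.2f}%)")
# 17/20 identical (85.00%)
```

Summing identities and lengths over every block of a hit, then dividing, gives a whole-alignment figure that can be compared with the reported % identity.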


Sequences
CDS sequence
ATGTTCAAATTTCGTAATCTGTTATTCATCTTCTTGATTTTGATCTCATCGTTTGTTCGGGAATCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCAAG
TAAAGTGAAGCAGATTTCATGGAAACCAAGAGCTTTTGTATATGAAGGTTTTCTAACGGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGAT
CTGAGGTTGCTGATAATGATTCAGGAAAGAGCAAGCTCAGTACTGTACGAACGAGTTCAGGAATGTTCATTTCTAAAAACAAGGATCCTATTGTTTCTGGCATAGAGGAC
AAAATTTCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGATATACAGGTATTGAGATATGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAA
GGTGAATATTGCCTGGGGTGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTCCCCTTGGCAGAGAAATCTTCCCACC
GGAGAGCTTATGAAACAGACGAGGACCTCTCAGAGTGCGCAAAGAAAGGAGTTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCT
ATACCAGATACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTGGGAAACAT
TGGGAATTGTACTGATCAAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTC
GGAGGAGCTGCAGGATCTGTTGA
mRNA sequence
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCGCGCGCGCGCGCTCTTTCTAATTTGATCCGATCGAGACTATGTTCAAATTTCGTAATCTGTTATTCATCTTCT
TGATTTTGATCTCATCGTTTGTTCGGGAATCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCAAGTAAAGTGAAGCAGATTTCATGGAAACCAAGAGCT
TTTGTATATGAAGGTTTTCTAACGGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGATTCAGGAAAGAGCAA
GCTCAGTACTGTACGAACGAGTTCAGGAATGTTCATTTCTAAAAACAAGGATCCTATTGTTTCTGGCATAGAGGACAAAATTTCTGCATGGACTTTTCTTCCAAAAGAAA
ATGGGGAGGATATACAGGTATTGAGATATGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGTGGACATCGTTTAGCT
ACAGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTCCCCTTGGCAGAGAAATCTTCCCACCGGAGAGCTTATGAAACAGACGAGGACCTCTCAGA
GTGCGCAAAGAAAGGAGTTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCTATACCAGATACAAACAGTCTCCATGGAGGTTGCC
CTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTGGGAAACATTGGGAATTGTACTGATCAAAATGAAAGTTGTGAG
AGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTCGGAGGAGCTGCAGGATCTGTTGATTTCATCTCTA
ACATTACACCCAAAAATTTCGATACCTTTTGGCGTCATTTGTGGAATGGAAAGCGCTACAACATTGATTGTAAAGCTACGGATGGATGAAGCAGTAGCCGTTGGGTAATT
ATGACCTTTCCTTTTTTAACTGATGATTAATGTATTACTTTATTGTCATTATTTTTCTTGTTTGATTTTTGATATGATTTTCTTATAAGAAAGAATATTGTTGTTTTAAA
AGCTAAAGTTTATATACATTACAGAAAAAAAAAAGAGTAATTTTGCTGATGTGACAATGAAGTGATTAGAAAGAAGTGTTTTTAAATATAGCAAAATTTTACTTTCAATC
TATGTGGTTAAATGTAGCAAAATTTTTCTTTCAATTTGTGATAAGCCAACCATGATAGGCATGATTAGTTTAACAGAAAAAAA
Protein sequence
MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIED
KISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNA
IPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
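
The protein sequence above should correspond to a translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard genetic code (the page does not say which translation table was used). The names `CODON_TABLE` and `translate` are illustrative, not part of any database API.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 15 nucleotides of the CDS listed above.
print(translate("ATGTTCAAATTTCGTAATCTG"))  # MFKFRNL
print(translate("TGCAGGATCTGTTGA"))        # CRIC
```

Both fragments agree with the start (`MFKFRNL`) and end (`CRIC`) of the listed protein. Passing the full CDS, with its line breaks, to `translate` and comparing the result with the joined protein lines extends the check to the whole sequence.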