; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C021380 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C021380
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr11:27620618..27624718
RNA-Seq ExpressionMELO3C021380
SyntenyMELO3C021380
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575033.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia]1.9e-16193.33Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        M KFRNLLF FLILISS VRES+CSYAGSA++TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDL+ECA++G
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+CTDLNESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_004144967.1 probable prolyl 4-hydroxylase 4 [Cucumis sativus]3.8e-17097.33Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKF NLLF FLIL SSF+RESTCSYAGSASATVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVT+GGETVFPLAEK SHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_008458517.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo]7.4e-174100Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_022959148.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata]2.5e-16193.33Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        M KFRNLLF FLILISS VRES+CSYAGSA++TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDLSECA++G
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+C+DLNESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_038874583.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida]9.3e-16996.67Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKFRNLLF FLILIS  VRESTCSYAGSAS+TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHG+KYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKS+HRRAYETDEDLSECA+KG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

TrEMBL top hitse value%identityAlignment
A0A0A0KCQ5 Procollagen-proline 4-dioxygenase1.8e-17097.33Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKF NLLF FLIL SSF+RESTCSYAGSASATVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVT+GGETVFPLAEK SHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A1S3C816 Procollagen-proline 4-dioxygenase3.6e-174100Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A5A7SVW6 Procollagen-proline 4-dioxygenase3.6e-174100Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A6J1H545 Procollagen-proline 4-dioxygenase1.2e-16193.33Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        M KFRNLLF FLILISS VRES+CSYAGSA++TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDLSECA++G
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+C+DLNESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A6J1L4G1 Procollagen-proline 4-dioxygenase2.9e-16093Show/hide
Query:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN
        M KFR LLF FLILISS VRES+CSYAGSA++TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKN

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDLSECA++G
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+CTDLNESCERWAALGECTKNPEYMVGS E+PGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 65.1e-9359.21Show/hide
Query:  SYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENG
        S   S S +VDP+R+ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENG
Subjt:  SYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENG

Query:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPD
        E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLSNVTKGGETVFP       +     D+  S+CAK+G AVKP+KGDALLFF+L  N   D
Subjt:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPD

Query:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
         NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

F4JAU3 Prolyl 4-hydroxylase 22.3e-12572.35Show/hide
Query:  LFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSG
        L  F+ ++   ++ STC    S S+ ++PS+VKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVSG
Subjt:  LFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSG

Query:  IEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKK
        IEDK+S WTFLPKENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLSNVTKGGETVFP A++ S R   E  +DLS+CAKKGIAVKPKK
Subjt:  IEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKK

Query:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        G+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L   GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Subjt:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

Q8L970 Probable prolyl 4-hydroxylase 73.6e-9957.61Show/hide
Query:  FRNLLFFFLILISS----FVRESTCSYAGS--------ASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRT
        F     F L LISS    F+  S+ +  GS        +S   DP+RV Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRT
Subjt:  FRNLLFFFLILISS----FVRESTCSYAGS--------ASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRT

Query:  SSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDE
        SSGMF+SK +D IVS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +  + +     D+
Subjt:  SSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDE

Query:  DLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPG
          +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  G
Subjt:  DLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPG

Query:  YCRRSCRIC
        YCR+SC+ C
Subjt:  YCRRSCRIC

Q8LAN3 Probable prolyl 4-hydroxylase 42.8e-12874.32Show/hide
Query:  RNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPI
        R LL  F  + S  ++ ST S   S+S  V+PS+VKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPI

Query:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVK
        VSGIEDKIS WTFLPKENGEDIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLSNVTKGGETVFP AE  S R   E  EDLS+CAK+GIAVK
Subjt:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVK

Query:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        P+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD+NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Subjt:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

Q9LN20 Probable prolyl 4-hydroxylase 31.7e-6454.55Show/hide
Query:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHY
        +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+++GKSK S VRTSSG F+ + +D I+  IE +I+ +TF+P ++GE +QVL YE GQKYE HY
Subjt:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHY

Query:  DYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSA
        DYFVD+ N   GG R+AT+LMYLS+V +GGETVFP A  +     +    +LSEC KKG++VKP+ GDALLF+S+ P+A  D  SLHGGCPV+ G KWS+
Subjt:  DYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSA

Query:  TKWIHVDSF
        TKW+HV  +
Subjt:  TKWIHVDSF

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.6e-12672.35Show/hide
Query:  LFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSG
        L  F+ ++   ++ STC    S S+ ++PS+VKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVSG
Subjt:  LFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSG

Query:  IEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKK
        IEDK+S WTFLPKENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLSNVTKGGETVFP A++ S R   E  +DLS+CAKKGIAVKPKK
Subjt:  IEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKK

Query:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        G+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L   GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Subjt:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase2.6e-10057.61Show/hide
Query:  FRNLLFFFLILISS----FVRESTCSYAGS--------ASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRT
        F     F L LISS    F+  S+ +  GS        +S   DP+RV Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRT
Subjt:  FRNLLFFFLILISS----FVRESTCSYAGS--------ASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRT

Query:  SSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDE
        SSGMF+SK +D IVS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +  + +     D+
Subjt:  SSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDE

Query:  DLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPG
          +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  G
Subjt:  DLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPG

Query:  YCRRSCRIC
        YCR+SC+ C
Subjt:  YCRRSCRIC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.4e-9354.26Show/hide
Query:  FRNLLFFFLILISS----FVRESTCSYAGS--------ASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKS-----KL
        F     F L LISS    F+  S+ +  GS        +S   DP+RV Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S      +
Subjt:  FRNLLFFFLILISS----FVRESTCSYAGS--------ASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKS-----KL

Query:  STVRTSSGMFISKNK---DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSH
        S VR SS    + +    D IVS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +  + 
Subjt:  STVRTSSGMFISKNK---DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSH

Query:  RRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYM
        +     D+  +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YM
Subjt:  RRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYM

Query:  VGSPEMPGYCRRSCRIC
        VGS +  GYCR+SC+ C
Subjt:  VGSPEMPGYCRRSCRIC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase3.6e-9459.21Show/hide
Query:  SYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENG
        S   S S +VDP+R+ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENG
Subjt:  SYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENG

Query:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPD
        E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLSNVTKGGETVFP       +     D+  S+CAK+G AVKP+KGDALLFF+L  N   D
Subjt:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPD

Query:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
         NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.0e-12974.32Show/hide
Query:  RNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPI
        R LL  F  + S  ++ ST S   S+S  V+PS+VKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPI

Query:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVK
        VSGIEDKIS WTFLPKENGEDIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLSNVTKGGETVFP AE  S R   E  EDLS+CAK+GIAVK
Subjt:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVK

Query:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        P+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD+NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Subjt:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAATTTCGTAATCTGTTATTCTTCTTCTTGATTTTGATCTCATCGTTTGTTCGGGAATCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCTAG
TAGAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTAACGGACCTAGAATGCGATCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGAT
CTGAAGTTGCTGATAATGATTCAGGAAAGAGCAAGCTCAGTACTGTACGAACGAGTTCAGGAATGTTCATTTCTAAAAACAAGGATCCTATTGTTTCTGGCATAGAGGAC
AAAATTTCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGACATTCAGGTATTGAGATATGAGCATGGGCAGAAATATGAGTCGCATTATGATTACTTTGTTGACAA
GGTGAATATTGCCTGGGGTGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTCCCTTTGGCAGAGAAATCTTCCCACC
GGAGAGCTTATGAAACAGACGAGGACCTCTCAGAGTGCGCAAAGAAAGGAATTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCT
ATACCAGACACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGGAGACAT
TGGGAATTGTACTGATCTAAATGAAAGTTGTGAGAGATGGGCCGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTC
GGAGGAGTTGCAGGATCTGCTGA
mRNA sequenceShow/hide mRNA sequence
GACTAATTGACTTTCTATGAGAGGAAAGTAACTAAACCCTCCTCTCTATGAAATTTGGAACTTCCATTCAATTTCCATCGAATTTCCTCCGATAACTCTCTCTCTCTCCC
TCTCCCTCTCGCGCGCGCGCTCTTTCTAGTTTGATCCGATCGAGACTATGTTCAAATTTCGTAATCTGTTATTCTTCTTCTTGATTTTGATCTCATCGTTTGTTCGGGAA
TCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCTAGTAGAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTAACGGACCT
AGAATGCGATCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGATCTGAAGTTGCTGATAATGATTCAGGAAAGAGCAAGCTCAGTACTGTACGAACGAGTTCAGGAA
TGTTCATTTCTAAAAACAAGGATCCTATTGTTTCTGGCATAGAGGACAAAATTTCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGACATTCAGGTATTGAGATAT
GAGCATGGGCAGAAATATGAGTCGCATTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGTGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTAATGTGAC
CAAAGGCGGTGAAACAGTTTTCCCTTTGGCAGAGAAATCTTCCCACCGGAGAGCTTATGAAACAGACGAGGACCTCTCAGAGTGCGCAAAGAAAGGAATTGCAGTGAAAC
CAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCTATACCAGACACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCA
ACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGGAGACATTGGGAATTGTACTGATCTAAATGAAAGTTGTGAGAGATGGGCCGCCTTAGGGGAATGCACCAA
AAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTCGGAGGAGTTGCAGGATCTGCTGATTTCATCTCAACATTACACTCGAAATTTCGATACCTTCTG
CGTCATTTGTGGAATGGAAAGCGCTACATCGATTGTAAAGCTATCGTGGAAGCAGGGATGGATGGGATGAAACAGTAGCCGTTGGGTAATTATGACCTTTCCTTTTTTAA
TTGATGATTATGTATTACTTTATTGTCATTATTTTTCTTATTTGTTTTTTTTATATGATTTTCTTATAAGAAAGAATATTGTTGTTTTAAAAGCTAAAAGTTTATACACA
TTACAGAAGAAAAGAAAATCGAGTAATATTGCTGATGTGACAGTGAAGTGATTAGAGAGTCTAACCAACCTTTCATTGTTTATAATATGAAAAATACAATCCTTCTCTTT
TTTCTTAATGCTTTCACTATTTTTTCTC
Protein sequenceShow/hide protein sequence
MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIED
KISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNA
IPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC