; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G05890 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G05890
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationClcChr06:6025960..6030669
RNA-Seq ExpressionClc06G05890
SyntenyClc06G05890
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575033.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia]8.4e-16294Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        M KF NLLF+ LILIS VVRES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISKS
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLS+VTKGGETVFPLAEKS  RRA+ETD+DL+ECAR+G
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+CTDLNESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_004144967.1 probable prolyl 4-hydroxylase 4 [Cucumis sativus]2.0e-16393.67Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        MFKF NLLFI LIL S  +RESTCSYAGSAS+TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISK+
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLS+VT+GGETVFPLAEK  HRRA ETD+DLSECA+KG
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_008458517.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo]2.4e-16494.67Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        MFKF NLLF  LILIS  VRESTCSYAGSAS+TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISK+
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLS+VTKGGETVFPLAEKS HRRA ETD+DLSECA+KG
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_022959148.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata]1.1e-16194Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        M KF NLLF+ LILIS VVRES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISKS
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLS+VTKGGETVFPLAEKS  RRA+ETD+DLSECAR+G
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+C+DLNESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

XP_038874583.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida]1.0e-16797Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        MFKF NLLFI LILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISKS
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHG+KYESHYDYFVDKVNIAWGGHRLATVLMYLS+VTKGGETVFPLAEKS HRRA ETD+DLSECARKG
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

TrEMBL top hitse value%identityAlignment
A0A0A0KCQ5 Procollagen-proline 4-dioxygenase9.7e-16493.67Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        MFKF NLLFI LIL S  +RESTCSYAGSAS+TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISK+
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLS+VT+GGETVFPLAEK  HRRA ETD+DLSECA+KG
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        +AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A1S3C816 Procollagen-proline 4-dioxygenase1.1e-16494.67Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        MFKF NLLF  LILIS  VRESTCSYAGSAS+TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISK+
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLS+VTKGGETVFPLAEKS HRRA ETD+DLSECA+KG
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A5A7SVW6 Procollagen-proline 4-dioxygenase1.1e-16494.67Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        MFKF NLLF  LILIS  VRESTCSYAGSAS+TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISK+
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLS+VTKGGETVFPLAEKS HRRA ETD+DLSECA+KG
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A6J1H545 Procollagen-proline 4-dioxygenase5.3e-16294Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        M KF NLLF+ LILIS VVRES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISKS
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLS+VTKGGETVFPLAEKS  RRA+ETD+DLSECAR+G
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+C+DLNESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

A0A6J1L4G1 Procollagen-proline 4-dioxygenase1.3e-16093.67Show/hide
Query:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS
        M KF  LLF+ LILIS VVRES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG+SKLSTVRTSSGMFISKS
Subjt:  MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG
        KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLS+VTKGGETVFPLAEKS  RRA+ETD+DLSECAR+G
Subjt:  KDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIG+CTDLNESCERWAALGECTKNPEYMVGS E+PGYCRRSCRIC
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 63.3e-9258.84Show/hide
Query:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENG
        S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSGES+ S VRTSSGMF++K +D IV+ +E K+AAWTFLP+ENG
Subjt:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENG

Query:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPD
        E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP     + +     D   S+CA++G AVKP+KGDALLFF+L  N   D
Subjt:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPD

Query:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
         NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

F4JAU3 Prolyl 4-hydroxylase 21.7e-12572.45Show/hide
Query:  LLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVS
        LLF+ ++L+  +++ STC    S SS ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+GES++S VRTSSG FISK KDPIVS
Subjt:  LLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVS

Query:  GIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPK
        GIEDK++ WTFLPKENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++   R  +E   DLS+CA+KGIAVKPK
Subjt:  GIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        KG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

Q8L970 Probable prolyl 4-hydroxylase 74.7e-9961.03Show/hide
Query:  SASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENGEDIQV
        ++S   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSGES  S VRTSSGMF+SK +D IVS +E K+AAWTFLP+ENGE +Q+
Subjt:  SASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENGEDIQV

Query:  LRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLH
        L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFP+    + +     D   +ECA++G AVKP+KGDALLFF+L PNA  D+NSLH
Subjt:  LRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLH

Query:  GGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        G CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  GGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

Q8LAN3 Probable prolyl 4-hydroxylase 43.5e-12673.38Show/hide
Query:  LFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSG
        L I    I  V+ +S+ S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSGESK S VRTSSG FISK KDPIVSG
Subjt:  LFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKK
        IEDKI+ WTFLPKENGEDIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE    R  +E  +DLS+CA++GIAVKP+K
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKK

Query:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        GDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD+NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Subjt:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

Q9LN20 Probable prolyl 4-hydroxylase 31.3e-6454.55Show/hide
Query:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHY
        +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+++G+SK S VRTSSG F+ + +D I+  IE +IA +TF+P ++GE +QVL YE GQKYE HY
Subjt:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHY

Query:  DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSA
        DYFVD+ N   GG R+AT+LMYLSDV +GGETVFP A  + +  +     +LSEC +KG++VKP+ GDALLF+S+ P+A  D  SLHGGCPV+ G KWS+
Subjt:  DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSA

Query:  TKWIHVDSF
        TKW+HV  +
Subjt:  TKWIHVDSF

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.2e-12672.45Show/hide
Query:  LLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVS
        LLF+ ++L+  +++ STC    S SS ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+GES++S VRTSSG FISK KDPIVS
Subjt:  LLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVS

Query:  GIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPK
        GIEDK++ WTFLPKENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++   R  +E   DLS+CA+KGIAVKPK
Subjt:  GIEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        KG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase3.4e-10061.03Show/hide
Query:  SASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENGEDIQV
        ++S   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSGES  S VRTSSGMF+SK +D IVS +E K+AAWTFLP+ENGE +Q+
Subjt:  SASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENGEDIQV

Query:  LRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLH
        L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFP+    + +     D   +ECA++G AVKP+KGDALLFF+L PNA  D+NSLH
Subjt:  LRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLH

Query:  GGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        G CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  GGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.8e-9357.14Show/hide
Query:  SASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGES-----KLSTVRTSSGMFISKSK---DPIVSGIEDKIAAWTFLPK
        ++S   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSGES      +S VR SS    +      D IVS +E K+AAWTFLP+
Subjt:  SASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGES-----KLSTVRTSSGMFISKSK---DPIVSGIEDKIAAWTFLPK

Query:  ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNA
        ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFP+    + +     D   +ECA++G AVKP+KGDALLFF+L PNA
Subjt:  ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNA

Query:  IPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
          D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  IPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase2.3e-9358.84Show/hide
Query:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENG
        S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSGES+ S VRTSSGMF++K +D IV+ +E K+AAWTFLP+ENG
Subjt:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKENG

Query:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPD
        E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP     + +     D   S+CA++G AVKP+KGDALLFF+L  N   D
Subjt:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNAIPD

Query:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
         NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  TNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.5e-12773.38Show/hide
Query:  LFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSG
        L I    I  V+ +S+ S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSGESK S VRTSSG FISK KDPIVSG
Subjt:  LFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKK
        IEDKI+ WTFLPKENGEDIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE    R  +E  +DLS+CA++GIAVKP+K
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKK

Query:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
        GDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD+NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Subjt:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAAATTTCCTAATCTGTTATTCATCTGCTTGATTTTGATCTCATTGGTTGTTCGGGAATCAACTTGTTCGTATGCTGGTTCGGCTAGCTCCACTGTAGATCCTAG
TAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTTTATGAAGGTTTTCTTACGGACCTAGAATGCGACCACCTGGTTTCTATTGCAAGATCCGAGCTAAAAAGAT
CTGAGGTTGCTGATAACGATTCAGGAGAGAGCAAGCTCAGTACTGTTCGAACGAGTTCGGGAATGTTCATTTCTAAAAGCAAGGATCCTATTGTTTCTGGCATAGAGGAC
AAAATTGCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGATATTCAGGTATTGAGATATGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAA
GGTGAATATTGCCTGGGGAGGACATCGTTTAGCTACAGTCCTCATGTATCTCTCTGATGTGACCAAAGGCGGTGAAACAGTTTTCCCCTTGGCAGAGAAATCTCGCCACC
GGAGGGCTGCTGAAACAGATAAGGATCTCTCAGAGTGTGCAAGGAAAGGAATTGCAGTGAAACCGAAGAAAGGCGACGCCCTTCTTTTCTTTAGTCTTGAACCAAATGCT
ATCCCAGACACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGACTCTTTCAGCAAAAACTTAGGAAACAT
TGGGAATTGTACTGATCTAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGAGAATGCACCAAAAACCCAGAATATATGGTCGGATCTCCAGAAATGCCTGGCTACTGTA
GGCGGAGCTGCAGGATCTGTTGA
mRNA sequenceShow/hide mRNA sequence
TCTCATCTTACTTACAGACCCCACTAGTTGGATTGAAACTCAAAAAAGGACGGATTTCGCATTCTGACGTGGCAATTTCGCACGTGCAGTTGGCATCAATATCTTGCTGT
TCGACGTTGACGATTTGACTTCTGTGAGAGGAAAGAAAGCCTCTGCTCTGTGAAGTTCTGGATTTCCAATTTTCCAGACTTCCATCGAATTTCCTCCGATAATTCTCTCT
CTCTCTCTCGCTCTCTTTCTAATTTGATCCGATCGAAACTATGTTTAAATTTCCTAATCTGTTATTCATCTGCTTGATTTTGATCTCATTGGTTGTTCGGGAATCAACTT
GTTCGTATGCTGGTTCGGCTAGCTCCACTGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTTTATGAAGGTTTTCTTACGGACCTAGAATGC
GACCACCTGGTTTCTATTGCAAGATCCGAGCTAAAAAGATCTGAGGTTGCTGATAACGATTCAGGAGAGAGCAAGCTCAGTACTGTTCGAACGAGTTCGGGAATGTTCAT
TTCTAAAAGCAAGGATCCTATTGTTTCTGGCATAGAGGACAAAATTGCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGATATTCAGGTATTGAGATATGAGCATG
GGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGAGGACATCGTTTAGCTACAGTCCTCATGTATCTCTCTGATGTGACCAAAGGC
GGTGAAACAGTTTTCCCCTTGGCAGAGAAATCTCGCCACCGGAGGGCTGCTGAAACAGATAAGGATCTCTCAGAGTGTGCAAGGAAAGGAATTGCAGTGAAACCGAAGAA
AGGCGACGCCCTTCTTTTCTTTAGTCTTGAACCAAATGCTATCCCAGACACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGT
GGATTCACGTAGACTCTTTCAGCAAAAACTTAGGAAACATTGGGAATTGTACTGATCTAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGAGAATGCACCAAAAACCCA
GAATATATGGTCGGATCTCCAGAAATGCCTGGCTACTGTAGGCGGAGCTGCAGGATCTGTTGATCTCAACAATACACTCGAAATTTCCATACCTTTGGGCGAGCACTACA
ACATTGATTGCGTAGCTATCTCTTGAAGCAAGGATGGATGAGGCAGTAGCCGTTGAGTAATTATGACCTTTCATTTTTAATTGATGAGTATGTATTACTTTATTCTCTTT
ATTTTTCTTATTTGATTTTTGATATGGTTTTCTTAGAAGAAAGAATATTGTAGTTTTAAAGGCTATAGTTTTATATATACATTACAGAAAAATTGAGTAATTTTGCTGAT
GTGACATTAGAGAGTCTATAACCTTTCATTTCAACAGAAATGTCTGGTACAAATGAGTTTTATAATACGAAGAGATAACAATACACTTGATCTATTTTAATTTTTAAATA
CTTTTTCTATGTCGAAAATAATTTGGTTTATGCATCTTTGAAGAGATGGGTTCATAAATGGGTCAACAATGAGCTCATTTGCAAGGAAGTCAATCGACTCTTTCTCTCCC
CCTCAAATCTAGAAAACTTCTTTACCTTTTTTTCTTAATATGATTATCATTTTAATCTTTGTATTTTGGGATTTACTTATTTTAGTCCTTAATATTAATATTCATTTTTG
TCAAATATTTCCAAAAAAAACCCTTATCAACCTTTCTTCTCCATTTTTCA
Protein sequenceShow/hide protein sequence
MFKFPNLLFICLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGESKLSTVRTSSGMFISKSKDPIVSGIED
KIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSRHRRAAETDKDLSECARKGIAVKPKKGDALLFFSLEPNA
IPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC