; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg05473 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg05473
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationCarg_Chr17:1652000..1654883
RNA-Seq ExpressionCarg05473
SyntenyCarg05473
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575033.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia]3.0e-17097.72Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

KAG7013608.1 putative prolyl 4-hydroxylase 4 [Cucurbita argyrosperma subsp. argyrosperma]1.6e-176100Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

XP_022959148.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata]2.5e-16997.07Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDC+DLNESCERWAALGECTKNPEYMVGSPELPGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

XP_023006272.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima]6.2e-16896.74Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFR LLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGS ELPGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

XP_023547984.1 probable prolyl 4-hydroxylase 4 [Cucurbita pepo subsp. pepo]3.1e-16796.42Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        M KFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KD IVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGS ELPGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

TrEMBL top hitse value%identityAlignment
A0A0A0KCQ5 Procollagen-proline 4-dioxygenase6.2e-15889.58Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        M KF NLLF+FLIL SS +RES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKI+AWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVT+GGETVFPLAEK   RRA ETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECA++G+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+CTDLNESCERWAALGECTKNPEYMVGSPE+PGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

A0A1S3C816 Procollagen-proline 4-dioxygenase5.1e-16091.21Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        M KFRNLLF FLILISS VRES+CSYAGSA++TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKI+AWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECA++GIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+CTDLNESCERWAALGECTKNPEYMVGSPE+PGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

A0A5A7SVW6 Procollagen-proline 4-dioxygenase5.1e-16091.21Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        M KFRNLLF FLILISS VRES+CSYAGSA++TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKI+AWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECA++GIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+CTDLNESCERWAALGECTKNPEYMVGSPE+PGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

A0A6J1H545 Procollagen-proline 4-dioxygenase1.2e-16997.07Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDC+DLNESCERWAALGECTKNPEYMVGSPELPGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

A0A6J1L4G1 Procollagen-proline 4-dioxygenase3.0e-16896.74Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFR LLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGS ELPGYC
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 68.3e-9157.54Show/hide
Query:  SYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGI
        S   S + +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K+AAWTFLP+   
Subjt:  SYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGI

Query:  SHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFP-LAEKSPRRRASETDEDLTECARQGIAVKPKKGDALLFFS
            ENGE +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLSNVTKGGETVFP    K+P+ +    D+  ++CA+QG AVKP+KGDALLFF+
Subjt:  SHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFP-LAEKSPRRRASETDEDLTECARQGIAVKPKKGDALLFFS

Query:  LEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC
        L  N   D NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  LEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC

F4JAU3 Prolyl 4-hydroxylase 22.0e-12469.71Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MS+   LLF+ ++L   V+ +SS     S +S ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK 
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDK++ WTFLPK       ENGED+QVLRYEHGQKY++H+DYF DKVNIARGGHR+ATVL+YLSNVTKGGETVFP A++  RR  SE  +DL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        ++CA++GIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + G+CTD+NESCERWA LGEC KNPEYMVG+PE+PG C
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSC+ C
Subjt:  RRSCRIC

Q8L970 Probable prolyl 4-hydroxylase 74.1e-9858.5Show/hide
Query:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIA
        SS  R+ S     ++ S+   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTSSGMF+SK +D IVS +E K+A
Subjt:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIA

Query:  AWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECARQGIAVKPK
        AWTFLP+       ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +    +     D+  TECA+QG AVKP+
Subjt:  AWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECARQGIAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC
        KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC

Q8LAN3 Probable prolyl 4-hydroxylase 41.6e-12672.28Show/hide
Query:  RNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPI
        R  L +    I SV+ +SS S   S++  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPI

Query:  VSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECA
        VSGIEDKI+ WTFLPK       ENGEDIQVLRYEHGQKY++H+DYF DKVNI RGGHR+AT+LMYLSNVTKGGETVFP AE   RR  SE  EDL++CA
Subjt:  VSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECA

Query:  RQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSC
        ++GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   G+CTD+NESCERWA LGECTKNPEYMVG+ ELPGYCRRSC
Subjt:  RQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSC

Query:  RIC
        + C
Subjt:  RIC

Q9LN20 Probable prolyl 4-hydroxylase 34.0e-6151.39Show/hide
Query:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHG
        +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+++G SK S VRTSSG F+ + +D I+  IE +IA +TF+P       +++GE +QVL YE G
Subjt:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHG

Query:  QKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVL
        QKYE HYDYFVD+ N   GG R+AT+LMYLS+V +GGETVFP A  +    +     +L+EC ++G++VKP+ GDALLF+S+ P+A  D  SLHGGCPV+
Subjt:  QKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVL

Query:  EGEKWSATKWIHVDSF
         G KWS+TKW+HV  +
Subjt:  EGEKWSATKWIHVDSF

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.4e-12569.71Show/hide
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MS+   LLF+ ++L   V+ +SS     S +S ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK 
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDK++ WTFLPK       ENGED+QVLRYEHGQKY++H+DYF DKVNIARGGHR+ATVL+YLSNVTKGGETVFP A++  RR  SE  +DL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC
        ++CA++GIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + G+CTD+NESCERWA LGEC KNPEYMVG+PE+PG C
Subjt:  TECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSC+ C
Subjt:  RRSCRIC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase2.9e-9958.5Show/hide
Query:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIA
        SS  R+ S     ++ S+   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTSSGMF+SK +D IVS +E K+A
Subjt:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIA

Query:  AWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECARQGIAVKPK
        AWTFLP+       ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +    +     D+  TECA+QG AVKP+
Subjt:  AWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECARQGIAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC
        KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.5e-9254.97Show/hide
Query:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDS-----KLSTVRTSSGMFISKSK---DPIV
        SS  R+ S     ++ S+   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S      +S VR SS    +      D IV
Subjt:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDS-----KLSTVRTSSGMFISKSK---DPIV

Query:  SGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECAR
        S +E K+AAWTFLP+       ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +    +     D+  TECA+
Subjt:  SGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECAR

Query:  QGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCR
        QG AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+
Subjt:  QGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCR

Query:  IC
         C
Subjt:  IC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase5.9e-9257.54Show/hide
Query:  SYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGI
        S   S + +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K+AAWTFLP+   
Subjt:  SYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGI

Query:  SHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFP-LAEKSPRRRASETDEDLTECARQGIAVKPKKGDALLFFS
            ENGE +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLSNVTKGGETVFP    K+P+ +    D+  ++CA+QG AVKP+KGDALLFF+
Subjt:  SHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFP-LAEKSPRRRASETDEDLTECARQGIAVKPKKGDALLFFS

Query:  LEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC
        L  N   D NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  LEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-12772.28Show/hide
Query:  RNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPI
        R  L +    I SV+ +SS S   S++  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPI

Query:  VSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECA
        VSGIEDKI+ WTFLPK       ENGEDIQVLRYEHGQKY++H+DYF DKVNI RGGHR+AT+LMYLSNVTKGGETVFP AE   RR  SE  EDL++CA
Subjt:  VSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECA

Query:  RQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSC
        ++GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   G+CTD+NESCERWA LGECTKNPEYMVG+ ELPGYCRRSC
Subjt:  RQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSC

Query:  RIC
        + C
Subjt:  RIC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAATTTCGCAATCTGTTATTTCTCTTCTTGATTTTGATCTCATCGGTTGTTCGGGAATCAAGTTGTTCGTATGCTGGTTCGGCTACCTCCACCGTAGATCCTAG
TAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGATTTCTCACGGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGAT
CTGAAGTTGCTGATAATGATTCGGGAGATAGCAAGCTCAGTACTGTTCGAACGAGCTCAGGAATGTTCATTTCTAAGAGCAAGGATCCTATTGTTTCTGGCATCGAGGAC
AAAATTGCTGCATGGACTTTTCTTCCAAAAGCGGGTATTTCCCATTTTTCAGAGAATGGAGAGGACATTCAGGTATTGAGATATGAGCATGGCCAGAAGTATGAATCACA
TTATGATTACTTTGTTGACAAGGTTAATATTGCCCGGGGAGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTTCCCT
TGGCTGAGAAATCTCCCCGCCGGAGGGCTTCTGAAACAGACGAGGATCTCACAGAGTGTGCAAGGCAAGGAATTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTC
TTTAGTCTCGAACCAAATGCAATCCCAGACACCAATAGTCTCCATGGAGGCTGCCCTGTTCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTTGACTCTTT
CAGCAAAAACTTAGGAAACATTGGGGACTGCACGGATCTAAATGAAAGCTGTGAGAGATGGGCTGCCTTAGGGGAGTGCACCAAAAACCCTGAGTATATGGTCGGATCCC
CGGAGCTTCCAGGCTACTGTAGGCGGAGTTGCAGGATCTGTTGA
mRNA sequenceShow/hide mRNA sequence
TTTTCTCCGATCACTCTCTCTCTCTCTCTCTCTAATTTAATCCGATCGAGACTATGTCCAAATTTCGCAATCTGTTATTTCTCTTCTTGATTTTGATCTCATCGGTTGTT
CGGGAATCAAGTTGTTCGTATGCTGGTTCGGCTACCTCCACCGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGATTTCTCAC
GGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGATCTGAAGTTGCTGATAATGATTCGGGAGATAGCAAGCTCAGTACTGTTCGAACGAGCT
CAGGAATGTTCATTTCTAAGAGCAAGGATCCTATTGTTTCTGGCATCGAGGACAAAATTGCTGCATGGACTTTTCTTCCAAAAGCGGGTATTTCCCATTTTTCAGAGAAT
GGAGAGGACATTCAGGTATTGAGATATGAGCATGGCCAGAAGTATGAATCACATTATGATTACTTTGTTGACAAGGTTAATATTGCCCGGGGAGGACATCGTTTAGCTAC
AGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTTCCCTTGGCTGAGAAATCTCCCCGCCGGAGGGCTTCTGAAACAGACGAGGATCTCACAGAGT
GTGCAAGGCAAGGAATTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGTCTCGAACCAAATGCAATCCCAGACACCAATAGTCTCCATGGAGGCTGCCCT
GTTCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTTGACTCTTTCAGCAAAAACTTAGGAAACATTGGGGACTGCACGGATCTAAATGAAAGCTGTGAGAG
ATGGGCTGCCTTAGGGGAGTGCACCAAAAACCCTGAGTATATGGTCGGATCCCCGGAGCTTCCAGGCTACTGTAGGCGGAGTTGCAGGATCTGTTGA
Protein sequenceShow/hide protein sequence
MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIED
KIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLTECARQGIAVKPKKGDALLF
FSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC