; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0010278 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0010278
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr09:20543844..20546314
RNA-Seq ExpressionIVF0010278
SyntenyIVF0010278
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043468.1 putative prolyl 4-hydroxylase 12 [Cucumis melo var. makuwa]3.79e-227100Show/hide
Query:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN
        MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN
Subjt:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN

Query:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
        TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
Subjt:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW

Query:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_004152378.1 probable prolyl 4-hydroxylase 12 [Cucumis sativus]3.68e-20992.28Show/hide
Query:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRD+LVDRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLASNS+DNPSRNSAGSG 
Subjt:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN

Query:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
        TVSTELLN SGVILNTTDDI+ARIENR+A+WTLLPKDH MPFQIMQYRGEEAKHKYFYGNRSAM  SSEPLMATVVLYLSDSASGGE+LFPESKVKSKFW
Subjt:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW

Query:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PIR+GELWVATKFLYL PP GNKHTI S++DGC DEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_008436994.1 PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo]6.28e-22699.36Show/hide
Query:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN
        MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLASNS+DNPSRNSAGSGN
Subjt:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN

Query:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
        TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
Subjt:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW

Query:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_023549812.1 probable prolyl 4-hydroxylase 12 [Cucurbita pepo subsp. pepo]6.94e-17679.68Show/hide
Query:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRP-LSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSG
        MDSRLNFLLLFA AFSFS+CLAQSN +SGRKGLRDQ+V+   LSYSN   RIDPSRVVQ+SW+PRVFLYKGFLSD+ECDHLI+LASNS+D PSR++AGS 
Subjt:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRP-LSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSG

Query:  NTVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAK-HKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK
        NTVST+ L  SG +LNTTDDIIARIENRIAVWT LPKDH MPFQIMQY GEEA  HKYF+GNRSAM SS EPLMATVVLYLSDSASGGE+LFP SKVK +
Subjt:  NTVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAK-HKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK

Query:  FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPP-TGNKHTIDSNIDG-CIDEDKSCPQWAAIGECERNAVFMVG
        FWS RRKK NFLRPVKGNA+LFFSVHLNASPDKS YH R PI +G+LWVATKF Y+RP  TGN+H ++S +D  CIDED+SCP+WAAIGEC+RNAVFM+G
Subjt:  FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPP-TGNKHTIDSNIDG-CIDEDKSCPQWAAIGECERNAVFMVG

Query:  SPDYYGTCRKSCNAC
        SPDYYGTCRKSCNAC
Subjt:  SPDYYGTCRKSCNAC

XP_038906497.1 probable prolyl 4-hydroxylase 12 [Benincasa hispida]6.28e-20390.35Show/hide
Query:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSN S RIDPSRVVQVSW+PRVFLYKGFLSD+ECDHLISLASNS+DNPS NSAGSGN
Subjt:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN

Query:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
        TVST+LLN SGVILNT+DDIIARIEN+IAVWT LPKDHGMPFQIMQYRGEEA+HKYFYGN SAMSSS EPLMATVVLYLSDSA GGEMLFPESKVKSKFW
Subjt:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW

Query:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
        S RRKK NFLRPVKGNAILFFSVHLNASPDKSSYH R PI NGELWVATKF YLRP TGNK T++S++DGCIDEDKSCPQWAAIGECERN VFM+GSPDY
Subjt:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

TrEMBL top hitse value%identityAlignment
A0A0A0KPE4 Procollagen-proline 4-dioxygenase2.1e-15391.7Show/hide
Query:  QSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNTTDDIIA
        +SNLISGRKGLRD+LVDRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLASNS+DNPSRNSAGSG TVSTELLN SGVILNTTDDI+A
Subjt:  QSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNTTDDIIA

Query:  RIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFS
        RIENR+A+WTLLPKDH MPFQIMQYRGEEAKHKYFYGNRSAM  SSEPLMATVVLYLSDSASGGE+LFPESKVKSKFWSGRRKK NFLRPVKGNAILFFS
Subjt:  RIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFS

Query:  VHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        VHLNASPDKSSYHIR PIR+GELWVATKFLYL PP GNKHTI S++DGC DEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
Subjt:  VHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

A0A1S3AT39 Procollagen-proline 4-dioxygenase3.2e-17899.36Show/hide
Query:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN
        MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLASNS+DNPSRNSAGSGN
Subjt:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN

Query:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
        TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
Subjt:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW

Query:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A5A7TKX1 Procollagen-proline 4-dioxygenase3.8e-179100Show/hide
Query:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN
        MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN
Subjt:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGN

Query:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
        TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW
Subjt:  TVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFW

Query:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A6J1E0X9 Procollagen-proline 4-dioxygenase6.6e-13980.77Show/hide
Query:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSG
        MDSRL  LLL ATA SF +CLAQSNLISGRKGLRDQL++  PLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLA++S+D PS NS  SG
Subjt:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSG

Query:  NTVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKF
        NTV T++L  SG ILNTTDDIIARIENRIAVWT LPKD+ MP QI+QY GEEA+HKY +GNRSAM  SSEPLMATVVLYLSDSASGGEM FPESKVKS+F
Subjt:  NTVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKF

Query:  WSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPD
        WS RRKK N LRPVKGNA+L FSVHLNASPDKSS H R PI +GELW+ATKF YLRP TGNKHT + + D C DEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPD

Query:  YYGTCRKSCNAC
        YYGTCRKSCNAC
Subjt:  YYGTCRKSCNAC

A0A6J1E2P0 Procollagen-proline 4-dioxygenase8.6e-13979.37Show/hide
Query:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSG
        MDSRL FLLL A AFSFS+CLAQSN ISGRKGLRDQ+V+   LSYSN S RIDPSRVVQ+SW+PR FLYKGFLSD+ECDHLI+LASNS+D PSRN+AGS 
Subjt:  MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSG

Query:  NTVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAK-HKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK
        NTVST+ L  SG ILNTTDDII RIENRIAVWT LPKDH MPFQIM+Y GEEA  HKYF+GNRSAM  SSEPLMATVVLYLSDSASGGE+LFP SKVK +
Subjt:  NTVSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAK-HKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK

Query:  FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRP-PTGNKHTIDSNI-DGCIDEDKSCPQWAAIGECERNAVFMVG
        FWS RRKK NFLRPVKGNA+LFFSVHLNASPDKS YH R PI +G+LWVATKF Y+RP  TGN+H ++S + D CIDED+SCP+WAAIGEC+RNAVFM+G
Subjt:  FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRP-PTGNKHTIDSNI-DGCIDEDKSCPQWAAIGECERNAVFMVG

Query:  SPDYYGTCRKSCNAC
        SPDYYGTCRKSCNAC
Subjt:  SPDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 62.6e-5240.07Show/hide
Query:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPS-RNSAGSGNTVSTELLNGSGVIL-NTTDDIIARIENRIAVWTLLPKDHGMPF
        S+ S  +DP+R+ Q+SW PR FLYKGFLSD+ECDHLI LA    +         SG +  +E+   SG+ L    DDI+A +E ++A WT LP+++G   
Subjt:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPS-RNSAGSGNTVSTELLNGSGVIL-NTTDDIIARIENRIAVWTLLPKDHGMPF

Query:  QIMQYRG---EEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSY
        QI+ Y      +    YFY  ++         +ATV++YLS+   GGE +FP  K     +K   WS   K+   ++P KG+A+LFF++HLN + D +S 
Subjt:  QIMQYRG---EEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        H   P+  GE W AT+++++R   G K  +      C+D+ +SC +WA  GECE+N ++MVGS    G CRKSC AC
Subjt:  HIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 27.0e-4536.46Show/hide
Query:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNT-TDDIIARIENRIAVWTLLPKDHGMPFQ
        S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +    +     +G +  +++   SG  ++   D I++ IE++++ WT LPK++G   Q
Subjt:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNT-TDDIIARIENRIAVWTLLPKDHGMPFQ

Query:  IMQY-RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSY
        +++Y  G++    + Y +     +     +ATV+LYLS+   GGE +FP+++  S+          S   KK   ++P KGNA+LFF++  +A PD  S 
Subjt:  IMQY-RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        H   P+  GE W ATK++++     +   I ++   C D ++SC +WA +GEC +N  +MVG+P+  G CR+SC AC
Subjt:  HIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 123.7e-6244.19Show/hide
Query:  FLLLFATAFSFSTCLAQSNLISGRKGLRDQLV-----DRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNT
        FL+L  T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW PRVFLY+GFLS++ECDHLISL   + +  S ++ G    
Subjt:  FLLLFATAFSFSTCLAQSNLISGRKGLRDQLV-----DRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNT

Query:  VSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWS
          T+L           D ++A IE +++ WT LP ++G   ++  Y  E++  K  Y      S   E L+ATVVLYLS++  GGE+LFP S++K K  +
Subjt:  VSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWS

Query:  GRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYY
           +  N LRPVKGNAILFF+  LNAS D  S H+R P+  GEL VATK +Y +     K         C DED++C +WA +GEC++N V+M+GSPDYY
Subjt:  GRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYY

Query:  GTCRKSCNAC
        GTCRKSCNAC
Subjt:  GTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 74.8e-5437.38Show/hide
Query:  MDSRLNFLLLFATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAG
        MDSR+   L F+  F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSD+ECDH I LA    +        
Subjt:  MDSRLNFLLLFATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAG

Query:  SGNTVSTELLNGSGVILN-TTDDIIARIENRIAVWTLLPKDHGMPFQIMQY-RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFP----
        SG +V +E+   SG+ L+   DDI++ +E ++A WT LP+++G   QI+ Y  G++ +  + Y +  A        +ATV++YLS+   GGE +FP    
Subjt:  SGNTVSTELLNGSGVILN-TTDDIIARIENRIAVWTLLPKDHGMPFQIMQY-RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFP----

Query:  -ESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERN
          +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P+  GE W AT++++++    +     +   GC+DE+ SC +WA  GEC++N
Subjt:  -ESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERN

Query:  AVFMVGSPDYYGTCRKSCNAC
          +MVGS   +G CRKSC AC
Subjt:  AVFMVGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 42.4e-4535.79Show/hide
Query:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNT-TDDIIARIENRIAVWTLLPKDHGMPFQ
        S+ SV ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA  S    +     SG +  +E+   SG  ++   D I++ IE++I+ WT LPK++G   Q
Subjt:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNT-TDDIIARIENRIAVWTLLPKDHGMPFQ

Query:  IMQY---RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKS
        +++Y   +  +A   YF+   + +       MAT+++YLS+   GGE +FP++++ S+          S   K+   ++P KG+A+LFF++H +A PD  
Subjt:  IMQY---RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKS

Query:  SYHIRYPIRNGELWVATKFLY------LRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        S H   P+  GE W ATK+++      +  P+GN          C D ++SC +WA +GEC +N  +MVG+ +  G CR+SC AC
Subjt:  SYHIRYPIRNGELWVATKFLY------LRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

Arabidopsis top hitse value%identityAlignment
AT3G28480.1 Oxoglutarate/iron-dependent oxygenase3.4e-5537.38Show/hide
Query:  MDSRLNFLLLFATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAG
        MDSR+   L F+  F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSD+ECDH I LA    +        
Subjt:  MDSRLNFLLLFATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAG

Query:  SGNTVSTELLNGSGVILN-TTDDIIARIENRIAVWTLLPKDHGMPFQIMQY-RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFP----
        SG +V +E+   SG+ L+   DDI++ +E ++A WT LP+++G   QI+ Y  G++ +  + Y +  A        +ATV++YLS+   GGE +FP    
Subjt:  SGNTVSTELLNGSGVILN-TTDDIIARIENRIAVWTLLPKDHGMPFQIMQY-RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFP----

Query:  -ESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERN
          +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P+  GE W AT++++++    +     +   GC+DE+ SC +WA  GEC++N
Subjt:  -ESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERN

Query:  AVFMVGSPDYYGTCRKSCNAC
          +MVGS   +G CRKSC AC
Subjt:  AVFMVGSPDYYGTCRKSCNAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase9.3e-5337.58Show/hide
Query:  MDSRLNFLLLFATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLA------SNSKDNP
        MDSR+   L F+  F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSD+ECDH I LA      S   DN 
Subjt:  MDSRLNFLLLFATAFSFSTCL---AQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLA------SNSKDNP

Query:  SRNSAGSGNTVSTELLNGSGVILN----TTDDIIARIENRIAVWTLLPKDHGMPFQIMQY-RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGG
        S  S  S ++VS  +   S  I N      DDI++ +E ++A WT LP+++G   QI+ Y  G++ +  + Y +  A        +ATV++YLS+   GG
Subjt:  SRNSAGSGNTVSTELLNGSGVILN----TTDDIIARIENRIAVWTLLPKDHGMPFQIMQY-RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGG

Query:  EMLFP-----ESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQW
        E +FP      +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P+  GE W AT++++++    +     +   GC+DE+ SC +W
Subjt:  EMLFP-----ESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQW

Query:  AAIGECERNAVFMVGSPDYYGTCRKSCNAC
        A  GEC++N  +MVGS   +G CRKSC AC
Subjt:  AAIGECERNAVFMVGSPDYYGTCRKSCNAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.9e-5340.07Show/hide
Query:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPS-RNSAGSGNTVSTELLNGSGVIL-NTTDDIIARIENRIAVWTLLPKDHGMPF
        S+ S  +DP+R+ Q+SW PR FLYKGFLSD+ECDHLI LA    +         SG +  +E+   SG+ L    DDI+A +E ++A WT LP+++G   
Subjt:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPS-RNSAGSGNTVSTELLNGSGVIL-NTTDDIIARIENRIAVWTLLPKDHGMPF

Query:  QIMQYRG---EEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSY
        QI+ Y      +    YFY  ++         +ATV++YLS+   GGE +FP  K     +K   WS   K+   ++P KG+A+LFF++HLN + D +S 
Subjt:  QIMQYRG---EEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        H   P+  GE W AT+++++R   G K  +      C+D+ +SC +WA  GECE+N ++MVGS    G CRKSC AC
Subjt:  HIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase2.6e-6344.19Show/hide
Query:  FLLLFATAFSFSTCLAQSNLISGRKGLRDQLV-----DRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNT
        FL+L  T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW PRVFLY+GFLS++ECDHLISL   + +  S ++ G    
Subjt:  FLLLFATAFSFSTCLAQSNLISGRKGLRDQLV-----DRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNT

Query:  VSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWS
          T+L           D ++A IE +++ WT LP ++G   ++  Y  E++  K  Y      S   E L+ATVVLYLS++  GGE+LFP S++K K  +
Subjt:  VSTELLNGSGVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWS

Query:  GRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYY
           +  N LRPVKGNAILFF+  LNAS D  S H+R P+  GEL VATK +Y +     K         C DED++C +WA +GEC++N V+M+GSPDYY
Subjt:  GRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYY

Query:  GTCRKSCNAC
        GTCRKSCNAC
Subjt:  GTCRKSCNAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.7e-4635.79Show/hide
Query:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNT-TDDIIARIENRIAVWTLLPKDHGMPFQ
        S+ SV ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA  S    +     SG +  +E+   SG  ++   D I++ IE++I+ WT LPK++G   Q
Subjt:  SNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNT-TDDIIARIENRIAVWTLLPKDHGMPFQ

Query:  IMQY---RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKS
        +++Y   +  +A   YF+   + +       MAT+++YLS+   GGE +FP++++ S+          S   K+   ++P KG+A+LFF++H +A PD  
Subjt:  IMQY---RGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSK--------FWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKS

Query:  SYHIRYPIRNGELWVATKFLY------LRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        S H   P+  GE W ATK+++      +  P+GN          C D ++SC +WA +GEC +N  +MVG+ +  G CR+SC AC
Subjt:  SYHIRYPIRNGELWVATKFLY------LRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGTCTTAACTTTTTGCTTCTGTTTGCGACTGCATTTTCATTCTCAACCTGCCTTGCTCAAAGCAATTTGATTAGTGGCCGTAAGGGTTTAAGGGACCAATT
GGTTGACAGACCTTTGAGCTACTCAAATCAGTCTGTTAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTCTCAG
ATGACGAATGTGATCATCTTATTTCTTTGGCTTCAAATTCGAAAGACAATCCTTCTAGGAATAGTGCTGGTTCGGGGAACACTGTCTCAACCGAATTGCTAAACGGTTCA
GGAGTCATTTTAAACACAACAGATGATATCATTGCAAGAATTGAAAATCGAATTGCAGTATGGACTTTGCTCCCAAAAGATCATGGCATGCCTTTTCAGATAATGCAATA
CAGGGGTGAAGAAGCAAAGCATAAGTACTTTTATGGCAATAGATCTGCAATGTCGTCGTCCAGCGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGATTCTGCTA
GTGGCGGTGAAATGCTGTTTCCAGAATCAAAGGTAAAGAGCAAATTTTGGTCAGGCCGAAGAAAGAAAAAGAACTTTTTGAGACCAGTGAAAGGCAATGCAATTCTTTTT
TTCTCCGTGCATCTTAATGCCTCTCCAGACAAGAGTAGCTACCACATTCGATACCCAATACGCAATGGGGAATTGTGGGTTGCTACAAAATTCTTATACTTAAGACCACC
CACAGGGAATAAACACACTATCGACTCCAATATAGATGGGTGCATTGATGAAGATAAAAGCTGCCCTCAATGGGCCGCCATTGGCGAATGCGAACGAAATGCTGTGTTCA
TGGTCGGTTCTCCAGATTACTATGGTACATGTAGAAAAAGCTGCAATGCATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTCGTCTTAACTTTTTGCTTCTGTTTGCGACTGCATTTTCATTCTCAACCTGCCTTGCTCAAAGCAATTTGATTAGTGGCCGTAAGGGTTTAAGGGACCAATT
GGTTGACAGACCTTTGAGCTACTCAAATCAGTCTGTTAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTCTCAG
ATGACGAATGTGATCATCTTATTTCTTTGGCTTCAAATTCGAAAGACAATCCTTCTAGGAATAGTGCTGGTTCGGGGAACACTGTCTCAACCGAATTGCTAAACGGTTCA
GGAGTCATTTTAAACACAACAGATGATATCATTGCAAGAATTGAAAATCGAATTGCAGTATGGACTTTGCTCCCAAAAGATCATGGCATGCCTTTTCAGATAATGCAATA
CAGGGGTGAAGAAGCAAAGCATAAGTACTTTTATGGCAATAGATCTGCAATGTCGTCGTCCAGCGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGATTCTGCTA
GTGGCGGTGAAATGCTGTTTCCAGAATCAAAGGTAAAGAGCAAATTTTGGTCAGGCCGAAGAAAGAAAAAGAACTTTTTGAGACCAGTGAAAGGCAATGCAATTCTTTTT
TTCTCCGTGCATCTTAATGCCTCTCCAGACAAGAGTAGCTACCACATTCGATACCCAATACGCAATGGGGAATTGTGGGTTGCTACAAAATTCTTATACTTAAGACCACC
CACAGGGAATAAACACACTATCGACTCCAATATAGATGGGTGCATTGATGAAGATAAAAGCTGCCCTCAATGGGCCGCCATTGGCGAATGCGAACGAAATGCTGTGTTCA
TGGTCGGTTCTCCAGATTACTATGGTACATGTAGAAAAAGCTGCAATGCATGTTGA
Protein sequenceShow/hide protein sequence
MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVSWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGS
GVILNTTDDIIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLSDSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILF
FSVHLNASPDKSSYHIRYPIRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC