; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G07280 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G07280
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationChr5:6365811..6368599
RNA-Seq ExpressionCSPI05G07280
SyntenyCSPI05G07280
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043468.1 putative prolyl 4-hydroxylase 12 [Cucumis melo var. makuwa]2.0e-16692.28Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRD+LVDRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLASNS+DNPSRNSAGSG 
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI

Query:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW
        TVSTELLN SGVILNTTDDI+ARIENR+A+WTLLPKDH MPFQIMQYRGEEAKHKYFYGNRSAM  SSEPLMATVVLYLSDSASGGE+LFPESKVKSKFW
Subjt:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PIR+GELWVATKFLYL PP GNKHTI+S++DGC DEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_004152378.1 probable prolyl 4-hydroxylase 12 [Cucumis sativus]4.6e-17999.68Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI
        MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI

Query:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW
        TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW
Subjt:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTI+SDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_008436994.1 PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo]3.1e-16792.93Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRD+LVDRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG 
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI

Query:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW
        TVSTELLN SGVILNTTDDI+ARIENR+A+WTLLPKDH MPFQIMQYRGEEAKHKYFYGNRSAM  SSEPLMATVVLYLSDSASGGE+LFPESKVKSKFW
Subjt:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PIR+GELWVATKFLYL PP GNKHTI+S++DGC DEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_022159842.1 probable prolyl 4-hydroxylase 12 [Momordica charantia]5.4e-14382.05Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR-PLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG
        MDSRL  LLLLATA SF +CLAQSNLISGRKGLRD+L++  PLSYSN+SGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA++SED PS NS  SG
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR-PLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG

Query:  ITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKF
         TV T++L SSG ILNTTDDI+ARIENR+A+WT LPKD+SMP QI+QY GEEA+HKY +GNRSAML SSEPLMATVVLYLSDSASGGE+ FPESKVKS+F
Subjt:  ITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKF

Query:  WSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPD
        WS RRKKNN LRPVKGNA+L FSVHLNASPDKSS H RSPI DGELW+ATKF YL P  GNKHT E D D C DEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPD

Query:  YYGTCRKSCNAC
        YYGTCRKSCNAC
Subjt:  YYGTCRKSCNAC

XP_038906497.1 probable prolyl 4-hydroxylase 12 [Benincasa hispida]1.3e-16089.71Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI
        MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRD+LVDRPLSYSN+SGRIDPSRVVQVSW+PRVFLYKGFLSDEECDHLISLASNSEDNPS NSAGSG 
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI

Query:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW
        TVST+LLNSSGVILNT+DDI+ARIEN++A+WT LPKDH MPFQIMQYRGEEA+HKYFYGN SAM  SSEPLMATVVLYLSDSA GGE+LFPESKVKSKFW
Subjt:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY
        S RRKKNNFLRPVKGNAILFFSVHLNASPDKSSYH RSPI +GELWVATKF YL P  GNK T+ESDVDGC DEDKSCPQWAAIGECERN VFM+GSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

TrEMBL top hitse value%identityAlignment
A0A0A0KPE4 Procollagen-proline 4-dioxygenase5.7e-16799.31Show/hide
Query:  QSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVA
        +SNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVA
Subjt:  QSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVA

Query:  RIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFS
        RIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFS
Subjt:  RIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFS

Query:  VHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        VHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTI+SDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
Subjt:  VHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

A0A1S3AT39 Procollagen-proline 4-dioxygenase1.5e-16792.93Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRD+LVDRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG 
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI

Query:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW
        TVSTELLN SGVILNTTDDI+ARIENR+A+WTLLPKDH MPFQIMQYRGEEAKHKYFYGNRSAM  SSEPLMATVVLYLSDSASGGE+LFPESKVKSKFW
Subjt:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PIR+GELWVATKFLYL PP GNKHTI+S++DGC DEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A5A7TKX1 Procollagen-proline 4-dioxygenase9.8e-16792.28Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI
        MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRD+LVDRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLASNS+DNPSRNSAGSG 
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGI

Query:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW
        TVSTELLN SGVILNTTDDI+ARIENR+A+WTLLPKDH MPFQIMQYRGEEAKHKYFYGNRSAM  SSEPLMATVVLYLSDSASGGE+LFPESKVKSKFW
Subjt:  TVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFW

Query:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY
        SGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PIR+GELWVATKFLYL PP GNKHTI+S++DGC DEDKSCPQWAAIGECERNAVFMVGSPDY
Subjt:  SGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A6J1E0X9 Procollagen-proline 4-dioxygenase2.6e-14382.05Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR-PLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG
        MDSRL  LLLLATA SF +CLAQSNLISGRKGLRD+L++  PLSYSN+SGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA++SED PS NS  SG
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR-PLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG

Query:  ITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKF
         TV T++L SSG ILNTTDDI+ARIENR+A+WT LPKD+SMP QI+QY GEEA+HKY +GNRSAML SSEPLMATVVLYLSDSASGGE+ FPESKVKS+F
Subjt:  ITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKF

Query:  WSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPD
        WS RRKKNN LRPVKGNA+L FSVHLNASPDKSS H RSPI DGELW+ATKF YL P  GNKHT E D D C DEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPD

Query:  YYGTCRKSCNAC
        YYGTCRKSCNAC
Subjt:  YYGTCRKSCNAC

A0A6J1E2P0 Procollagen-proline 4-dioxygenase8.3e-14280.32Show/hide
Query:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR-PLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG
        MDSRL FLLLLA AFSFS+CLAQSN ISGRKGLRD++V+   LSYSN+S RIDPSRVVQ+SW+PR FLYKGFLSDEECDHLI+LASNSED PSRN+AGS 
Subjt:  MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR-PLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG

Query:  ITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAK-HKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK
         TVST+ L +SG ILNTTDDI+ RIENR+A+WT LPKDHSMPFQIM+Y GEEA  HKYF+GNRSAM PSSEPLMATVVLYLSDSASGGEILFP SKVK +
Subjt:  ITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAK-HKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK

Query:  FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPA-GNKHTIESDV-DGCFDEDKSCPQWAAIGECERNAVFMVG
        FWS RRKKNNFLRPVKGNA+LFFSVHLNASPDKS YH R+PI DG+LWVATKF Y+ P A GN+H +ES V D C DED+SCP+WAAIGEC+RNAVFM+G
Subjt:  FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPA-GNKHTIESDV-DGCFDEDKSCPQWAAIGECERNAVFMVG

Query:  SPDYYGTCRKSCNAC
        SPDYYGTCRKSCNAC
Subjt:  SPDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 63.7e-5441.52Show/hide
Query:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-EDNPSRNSAGSGITVSTELLNSSGVIL-NTTDDIVARIENRLAIWTLLPKDHSMPF
        S++S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    E +       SG +  +E+  SSG+ L    DDIVA +E +LA WT LP+++    
Subjt:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-EDNPSRNSAGSGITVSTELLNSSGVIL-NTTDDIVARIENRLAIWTLLPKDHSMPF

Query:  QIMQYRG---EEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESK-----VKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY
        QI+ Y      +    YFY  ++  L      +ATV++YLS+   GGE +FP  K     +K   WS   K+   ++P KG+A+LFF++HLN + D +S 
Subjt:  QIMQYRG---EEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESK-----VKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        H   P+ +GE W AT+++++    G K  +      C D+ +SC +WA  GECE+N ++MVGS    G CRKSC AC
Subjt:  HIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 22.2e-4637.18Show/hide
Query:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQ
        S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +    +     +G +  +++  SSG  ++   D IV+ IE++L+ WT LPK++    Q
Subjt:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQ

Query:  IMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY
        +++Y  G++    + Y +    +      +ATV+LYLS+   GGE +FP+++  S+          S   KK   ++P KGNA+LFF++  +A PD  S 
Subjt:  IMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        H   P+ +GE W ATK++++     +   I +    C D ++SC +WA +GEC +N  +MVG+P+  G CR+SC AC
Subjt:  HIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 129.7e-6344.84Show/hide
Query:  FLLLLATAFSFSTCLAQSNLISGRKGLRDRLV-----DRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGIT
        FL+L+ T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S ++ G    
Subjt:  FLLLLATAFSFSTCLAQSNLISGRKGLRDRLV-----DRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGIT

Query:  VSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWS
          T+L           D +VA IE +++ WT LP ++    ++  Y  E++  K  Y          E L+ATVVLYLS++  GGE+LFP S++K K  +
Subjt:  VSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWS

Query:  GRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYY
           +  N LRPVKGNAILFF+  LNAS D  S H+R P+  GEL VATK +Y    A  +  IE   + C DED++C +WA +GEC++N V+M+GSPDYY
Subjt:  GRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYY

Query:  GTCRKSCNAC
        GTCRKSCNAC
Subjt:  GTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 77.4e-5539.01Show/hide
Query:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAG
        MDSR+   L  +  F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA    +        
Subjt:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAG

Query:  SGITVSTELLNSSGVILN-TTDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFP----
        SG +V +E+  SSG+ L+   DDIV+ +E +LA WT LP+++    QI+ Y  G++ +  + Y +  A L      +ATV++YLS+   GGE +FP    
Subjt:  SGITVSTELLNSSGVILN-TTDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFP----

Query:  -ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLG--PPAGNKHTIESDVDGCFDEDKSCPQWAAIGECE
          +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P+ +GE W AT+++++     A NK +      GC DE+ SC +WA  GEC+
Subjt:  -ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLG--PPAGNKHTIESDVDGCFDEDKSCPQWAAIGECE

Query:  RNAVFMVGSPDYYGTCRKSCNAC
        +N  +MVGS   +G CRKSC AC
Subjt:  RNAVFMVGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 41.8e-4535.34Show/hide
Query:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQ
        S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA  S    +     SG +  +E+  SSG  ++   D IV+ IE++++ WT LPK++    Q
Subjt:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQ

Query:  IMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY
        +++Y  G++    + Y +    +      MAT+++YLS+   GGE +FP++++ S+          S   K+   ++P KG+A+LFF++H +A PD  S 
Subjt:  IMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRSPIRDGELWVATKFLYLG------PPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        H   P+ +GE W ATK++++        P+GN          C D ++SC +WA +GEC +N  +MVG+ +  G CR+SC AC
Subjt:  HIRSPIRDGELWVATKFLYLG------PPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.5e-4737.18Show/hide
Query:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQ
        S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +    +     +G +  +++  SSG  ++   D IV+ IE++L+ WT LPK++    Q
Subjt:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQ

Query:  IMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY
        +++Y  G++    + Y +    +      +ATV+LYLS+   GGE +FP+++  S+          S   KK   ++P KGNA+LFF++  +A PD  S 
Subjt:  IMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        H   P+ +GE W ATK++++     +   I +    C D ++SC +WA +GEC +N  +MVG+P+  G CR+SC AC
Subjt:  HIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase5.3e-5639.01Show/hide
Query:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAG
        MDSR+   L  +  F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA    +        
Subjt:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAG

Query:  SGITVSTELLNSSGVILN-TTDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFP----
        SG +V +E+  SSG+ L+   DDIV+ +E +LA WT LP+++    QI+ Y  G++ +  + Y +  A L      +ATV++YLS+   GGE +FP    
Subjt:  SGITVSTELLNSSGVILN-TTDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFP----

Query:  -ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLG--PPAGNKHTIESDVDGCFDEDKSCPQWAAIGECE
          +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P+ +GE W AT+++++     A NK +      GC DE+ SC +WA  GEC+
Subjt:  -ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLG--PPAGNKHTIESDVDGCFDEDKSCPQWAAIGECE

Query:  RNAVFMVGSPDYYGTCRKSCNAC
        +N  +MVGS   +G CRKSC AC
Subjt:  RNAVFMVGSPDYYGTCRKSCNAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.4e-5339.16Show/hide
Query:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP
        MDSR+   L  +  F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA      S   DN 
Subjt:  MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP

Query:  SRNSAGSGITVSTELLNSSGVILN----TTDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGG
        S  S  S  +VS  +  SS  I N      DDIV+ +E +LA WT LP+++    QI+ Y  G++ +  + Y +  A L      +ATV++YLS+   GG
Subjt:  SRNSAGSGITVSTELLNSSGVILN----TTDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGG

Query:  EILFP-----ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLG--PPAGNKHTIESDVDGCFDEDKSCP
        E +FP      +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P+ +GE W AT+++++     A NK +      GC DE+ SC 
Subjt:  EILFP-----ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLG--PPAGNKHTIESDVDGCFDEDKSCP

Query:  QWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        +WA  GEC++N  +MVGS   +G CRKSC AC
Subjt:  QWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase2.6e-5541.52Show/hide
Query:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-EDNPSRNSAGSGITVSTELLNSSGVIL-NTTDDIVARIENRLAIWTLLPKDHSMPF
        S++S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    E +       SG +  +E+  SSG+ L    DDIVA +E +LA WT LP+++    
Subjt:  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-EDNPSRNSAGSGITVSTELLNSSGVIL-NTTDDIVARIENRLAIWTLLPKDHSMPF

Query:  QIMQYRG---EEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESK-----VKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY
        QI+ Y      +    YFY  ++  L      +ATV++YLS+   GGE +FP  K     +K   WS   K+   ++P KG+A+LFF++HLN + D +S 
Subjt:  QIMQYRG---EEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESK-----VKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSY

Query:  HIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
        H   P+ +GE W AT+++++    G K  +      C D+ +SC +WA  GECE+N ++MVGS    G CRKSC AC
Subjt:  HIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase6.9e-6444.84Show/hide
Query:  FLLLLATAFSFSTCLAQSNLISGRKGLRDRLV-----DRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGIT
        FL+L+ T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S ++ G    
Subjt:  FLLLLATAFSFSTCLAQSNLISGRKGLRDRLV-----DRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGIT

Query:  VSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWS
          T+L           D +VA IE +++ WT LP ++    ++  Y  E++  K  Y          E L+ATVVLYLS++  GGE+LFP S++K K  +
Subjt:  VSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWS

Query:  GRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYY
           +  N LRPVKGNAILFF+  LNAS D  S H+R P+  GEL VATK +Y    A  +  IE   + C DED++C +WA +GEC++N V+M+GSPDYY
Subjt:  GRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYY

Query:  GTCRKSCNAC
        GTCRKSCNAC
Subjt:  GTCRKSCNAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGTCTTAACTTTTTGCTTCTGTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCTCAAAGCAATTTGATTAGTGGCCGTAAGGGTTTAAGGGACCGATT
GGTTGACAGACCTTTGAGCTACTCAAACTATTCTGGTAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTCTCAG
ATGAGGAGTGTGATCATCTTATTTCTTTGGCTTCAAATTCGGAAGACAATCCTTCTAGGAATAGTGCTGGTTCGGGGATCACTGTCTCAACCGAATTGCTAAACAGTTCA
GGGGTCATTTTAAACACAACAGATGATATCGTTGCAAGAATTGAAAATCGACTTGCAATATGGACTTTGCTTCCAAAAGATCATAGCATGCCTTTTCAGATCATGCAATA
CAGGGGTGAAGAAGCAAAGCACAAGTACTTTTATGGCAACAGATCAGCAATGTTGCCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGATTCTGCTA
GTGGCGGTGAAATACTGTTTCCAGAATCAAAGGTAAAGAGCAAATTTTGGTCAGGCCGAAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAATTCTTTTT
TTTTCTGTGCATCTTAATGCCTCTCCAGACAAGAGTAGCTACCACATTCGATCCCCAATACGCGATGGGGAGTTGTGGGTTGCTACAAAATTCTTATACTTAGGACCACC
TGCTGGGAATAAACACACTATCGAATCCGATGTAGATGGGTGCTTTGATGAAGATAAAAGCTGCCCTCAATGGGCTGCCATTGGCGAATGCGAACGAAATGCTGTGTTCA
TGGTTGGTTCTCCAGATTACTATGGTACATGTAGAAAAAGCTGCAATGCATGTTGA
mRNA sequenceShow/hide mRNA sequence
TCACAATAAATAACCGTCTAAAAAACGAGAGAGGAATATATTCAATTTGTCGCAACGGTGAGAAAGTGGAAGTTCAAGCTTCAAGGAAGTGGCGGTGGGATCCAGATTCG
TAGAACTACATAAGACGCCATGGATTTTTACACCACCAGCTTTCTCAAAATCTAATTTTGTCTTCAACTTCGTCTTGTTTCGATCTTCGTTCCACCCATCCATGGATTCT
CGTCTTAACTTTTTGCTTCTGTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCTCAAAGCAATTTGATTAGTGGCCGTAAGGGTTTAAGGGACCGATTGGTTGACAG
ACCTTTGAGCTACTCAAACTATTCTGGTAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTCTCAGATGAGGAGT
GTGATCATCTTATTTCTTTGGCTTCAAATTCGGAAGACAATCCTTCTAGGAATAGTGCTGGTTCGGGGATCACTGTCTCAACCGAATTGCTAAACAGTTCAGGGGTCATT
TTAAACACAACAGATGATATCGTTGCAAGAATTGAAAATCGACTTGCAATATGGACTTTGCTTCCAAAAGATCATAGCATGCCTTTTCAGATCATGCAATACAGGGGTGA
AGAAGCAAAGCACAAGTACTTTTATGGCAACAGATCAGCAATGTTGCCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGATTCTGCTAGTGGCGGTG
AAATACTGTTTCCAGAATCAAAGGTAAAGAGCAAATTTTGGTCAGGCCGAAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAATTCTTTTTTTTTCTGTG
CATCTTAATGCCTCTCCAGACAAGAGTAGCTACCACATTCGATCCCCAATACGCGATGGGGAGTTGTGGGTTGCTACAAAATTCTTATACTTAGGACCACCTGCTGGGAA
TAAACACACTATCGAATCCGATGTAGATGGGTGCTTTGATGAAGATAAAAGCTGCCCTCAATGGGCTGCCATTGGCGAATGCGAACGAAATGCTGTGTTCATGGTTGGTT
CTCCAGATTACTATGGTACATGTAGAAAAAGCTGCAATGCATGTTGATTGATGCACGACCAAATTCAAGTAAAAATTTCTCTCGTCCTGATTTGAGCAACTATTTCTTTA
TTTATTTATTCCTAATTTCATGCATGTACTACACCAAATGATTCTTAGATATGTAGTTGTAATATTGCTCATCTTCTGGGAACATGAGACAATCACGTTCTAGTCCTTAG
GTTTGAGATGTATTTAACTCTTGCTTAACGTTTTTAATTTAATTGCTATATTT
Protein sequenceShow/hide protein sequence
MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSS
GVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILF
FSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIESDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC