; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G004730 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G004730
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr09:4985735..4988667
RNA-Seq ExpressionLsi09G004730
SyntenyLsi09G004730
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043468.1 putative prolyl 4-hydroxylase 12 [Cucumis melo var. makuwa]2.9e-15789.39Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN
        MDSRL+FLLL ATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLI LAS+S+DNPS NSAGS N
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN

Query:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW
        TVST LL+ SGVILNTTDDIIARIE RIA+WTLLPKDH MPFQIMQYRGEEA+HKYFYGNRSAM SSSEPLMATVVLYLSDSA GGEMLFPESKVKSKFW
Subjt:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW

Query:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKK NFL PVKGNAILFFSVHLNASPDKSSYH R PI NGELWVATKF YLRP TGNKHT++S+ DGCIDEDKSCPQWAAIGECERNAVFM+GSPDY
Subjt:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_004152378.1 probable prolyl 4-hydroxylase 12 [Cucumis sativus]1.3e-15789.07Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN
        MDSRL+FLLLLATAFSFSTCLAQSNLISGRKGLRD+L+DRPLSYSN+SGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLI LAS+SEDNPS NSAGS  
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN

Query:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW
        TVST LL+SSGVILNTTDDI+ARIE R+A+WTLLPKDHSMPFQIMQYRGEEA+HKYFYGNRSAM  SSEPLMATVVLYLSDSA GGE+LFPESKVKSKFW
Subjt:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW

Query:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKKNNFL PVKGNAILFFSVHLNASPDKSSYH RSPI +GELWVATKF YL P  GNKHT++SD DGC DEDKSCPQWAAIGECERNAVFM+GSPDY
Subjt:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_008436994.1 PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo]4.5e-15890.03Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN
        MDSRL+FLLL ATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSDEECDHLI LAS+SEDNPS NSAGS N
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN

Query:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW
        TVST LL+ SGVILNTTDDIIARIE RIA+WTLLPKDH MPFQIMQYRGEEA+HKYFYGNRSAM SSSEPLMATVVLYLSDSA GGEMLFPESKVKSKFW
Subjt:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW

Query:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKK NFL PVKGNAILFFSVHLNASPDKSSYH R PI NGELWVATKF YLRP TGNKHT++S+ DGCIDEDKSCPQWAAIGECERNAVFM+GSPDY
Subjt:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_022159842.1 probable prolyl 4-hydroxylase 12 [Momordica charantia]1.4e-14684.89Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDR-PLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR
        MDSRL  LLLLATA SF +CLAQSNLISGRKGLRDQL++  PLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLI LA+SSED PSGNS  S 
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDR-PLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR

Query:  NTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW
        NTV T +L SSG ILNTTDDIIARIE RIA+WT LPKD+SMP QI+QY GEEAEHKY +GNRSAM SSEPLMATVVLYLSDSA GGEM FPESKVKS+FW
Subjt:  NTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW

Query:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKKNN L PVKGNA+L FSVHLNASPDKSS HTRSPIL+GELW+ATKFFYLRP TGNKHT E D D C DEDKSCPQWAAIGECERNAVFMIGSPDY
Subjt:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_038906497.1 probable prolyl 4-hydroxylase 12 [Benincasa hispida]7.6e-16693.55Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN
        MDSRL+FLLLLATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSNHSGRIDPSRVVQVSW+PRVFLYKGFLSDEECDHLI LAS+SEDNPSGNSAGS N
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN

Query:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWS
        TVST LL+SSGVILNT+DDIIARIE +IA+WT LPKDH MPFQIMQYRGEEAEHKYFYGN SAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWS
Subjt:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWS

Query:  SRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
         RRKKNNFL PVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNK TVESD DGCIDEDKSCPQWAAIGECERN VFMIGSPDYY
Subjt:  SRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

Query:  GTCRKSCNAC
        GTCRKSCNAC
Subjt:  GTCRKSCNAC

TrEMBL top hitse value%identityAlignment
A0A0A0KPE4 Procollagen-proline 4-dioxygenase5.5e-14688.24Show/hide
Query:  QSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIA
        +SNLISGRKGLRD+L+DRPLSYSN+SGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLI LAS+SEDNPS NSAGS  TVST LL+SSGVILNTTDDI+A
Subjt:  QSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIA

Query:  RIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFS
        RIE R+A+WTLLPKDHSMPFQIMQYRGEEA+HKYFYGNRSAM  SSEPLMATVVLYLSDSA GGE+LFPESKVKSKFWS RRKKNNFL PVKGNAILFFS
Subjt:  RIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFS

Query:  VHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        VHLNASPDKSSYH RSPI +GELWVATKF YL P  GNKHT++SD DGC DEDKSCPQWAAIGECERNAVFM+GSPDYYGTCRKSCNAC
Subjt:  VHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

A0A1S3AT39 Procollagen-proline 4-dioxygenase2.2e-15890.03Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN
        MDSRL+FLLL ATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSDEECDHLI LAS+SEDNPS NSAGS N
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN

Query:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW
        TVST LL+ SGVILNTTDDIIARIE RIA+WTLLPKDH MPFQIMQYRGEEA+HKYFYGNRSAM SSSEPLMATVVLYLSDSA GGEMLFPESKVKSKFW
Subjt:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW

Query:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKK NFL PVKGNAILFFSVHLNASPDKSSYH R PI NGELWVATKF YLRP TGNKHT++S+ DGCIDEDKSCPQWAAIGECERNAVFM+GSPDY
Subjt:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A5A7TKX1 Procollagen-proline 4-dioxygenase1.4e-15789.39Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN
        MDSRL+FLLL ATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLI LAS+S+DNPS NSAGS N
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN

Query:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW
        TVST LL+ SGVILNTTDDIIARIE RIA+WTLLPKDH MPFQIMQYRGEEA+HKYFYGNRSAM SSSEPLMATVVLYLSDSA GGEMLFPESKVKSKFW
Subjt:  TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW

Query:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKK NFL PVKGNAILFFSVHLNASPDKSSYH R PI NGELWVATKF YLRP TGNKHT++S+ DGCIDEDKSCPQWAAIGECERNAVFM+GSPDY
Subjt:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A6J1E0X9 Procollagen-proline 4-dioxygenase6.5e-14784.89Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDR-PLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR
        MDSRL  LLLLATA SF +CLAQSNLISGRKGLRDQL++  PLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLI LA+SSED PSGNS  S 
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDR-PLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR

Query:  NTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW
        NTV T +L SSG ILNTTDDIIARIE RIA+WT LPKD+SMP QI+QY GEEAEHKY +GNRSAM SSEPLMATVVLYLSDSA GGEM FPESKVKS+FW
Subjt:  NTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFW

Query:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKKNN L PVKGNA+L FSVHLNASPDKSS HTRSPIL+GELW+ATKFFYLRP TGNKHT E D D C DEDKSCPQWAAIGECERNAVFMIGSPDY
Subjt:  SSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A6J1E2P0 Procollagen-proline 4-dioxygenase3.7e-14281.21Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDR-PLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR
        MDSRL FLLLLA AFSFS+CLAQSN ISGRKGLRDQ+++   LSYSNHS RIDPSRVVQ+SW+PR FLYKGFLSDEECDHLI LAS+SED PS N+AGSR
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDR-PLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR

Query:  NTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKF
        NTVST  L +SG ILNTTDDII RIE RIA+WT LPKDHSMPFQIM+Y GEEA  HKYF+GNRSAM SSEPLMATVVLYLSDSA GGE+LFP SKVK +F
Subjt:  NTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKF

Query:  WSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRP-TTGNKHTVESD-EDGCIDEDKSCPQWAAIGECERNAVFMIGS
        WS RRKKNNFL PVKGNA+LFFSVHLNASPDKS YH+R+PIL+G+LWVATKFFY+RP  TGN+H VES  +D CIDED+SCP+WAAIGEC+RNAVFMIGS
Subjt:  WSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRP-TTGNKHTVESD-EDGCIDEDKSCPQWAAIGECERNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 63.1e-5340.58Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS-EDNPSGNSAGSRNTVSTNLLSSSGVIL-NTTDDIIARIETRIALWTLLPKDHSMPF
        S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    E +       S  +  + + +SSG+ L    DDI+A +E ++A WT LP+++    
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS-EDNPSGNSAGSRNTVSTNLLSSSGVIL-NTTDDIIARIETRIALWTLLPKDHSMPF

Query:  QIMQYRG---EEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESK-----VKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYH
        QI+ Y      +    YFY ++ A+      +ATV++YLS+  +GGE +FP  K     +K   WS   K+   + P KG+A+LFF++HLN + D +S H
Subjt:  QIMQYRG---EEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESK-----VKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYH

Query:  TRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
           P++ GE W AT++ ++R + G K  V      C+D+ +SC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  TRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 26.9e-4536.52Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDH
        S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLI LA      S+  DN +G S  S    S+    S G      D I++ IE +++ WT LPK++
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDH

Query:  SMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNNFLTPVKGNAILFFSVHLNASP
            Q+++Y  G++ + H  ++ ++  ++     +ATV+LYLS+  +GGE +FP+++  S+   S          KK   + P KGNA+LFF++  +A P
Subjt:  SMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNNFLTPVKGNAILFFSVHLNASP

Query:  DKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        D  S H   P++ GE W ATK+ ++     +   + + +  C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  DKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 126.9e-6143.73Show/hide
Query:  FLLLLATAFSFSTCLAQSNLISGRKGLRDQLL-----DRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNT
        FL+L+ T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLI L   + +  S ++ G    
Subjt:  FLLLLATAFSFSTCLAQSNLISGRKGLRDQLL-----DRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNT

Query:  VSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWS
                        D ++A IE +++ WT LP ++    ++  Y  E++  K  ++G   +    E L+ATVVLYLS++ +GGE+LFP S++K K  +
Subjt:  VSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWS

Query:  SRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDG-CIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S  +  N L PVKGNAILFF+  LNAS D  S H R P++ GEL VATK  Y       K     +E G C DED++C +WA +GEC++N V+MIGSPDY
Subjt:  SRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDG-CIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 72.0e-5235.74Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSN-LISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR
        MDSR+     L   F+     +  N  ++     RD  + + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA    +        S 
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSN-LISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR

Query:  NTVSTNLLSSSGVILN-TTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFP-----E
         +V + + +SSG+ L+   DDI++ +E ++A WT LP+++    QI+ Y  G++ E H  ++ +++ +      +ATV++YLS+  +GGE +FP      
Subjt:  NTVSTNLLSSSGVILN-TTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFP-----E

Query:  SKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAV
        +++K   W+   K+   + P KG+A+LFF++H NA+ D +S H   P++ GE W AT++ +++    +     + + GC+DE+ SC +WA  GEC++N  
Subjt:  SKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAV

Query:  FMIGSPDYYGTCRKSCNAC
        +M+GS   +G CRKSC AC
Subjt:  FMIGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 44.5e-4434.98Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS------EDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDH
        S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++ LA +S       DN SG S  S    S+    S G      D I++ IE +I+ WT LPK++
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS------EDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDH

Query:  SMPFQIMQY---RGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNNFLTPVKGNAILFFSVHLNAS
            Q+++Y   +  +A   YF+   + +      MAT+++YLS+  +GGE +FP++++ S+   S          K+   + P KG+A+LFF++H +A 
Subjt:  SMPFQIMQY---RGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNNFLTPVKGNAILFFSVHLNAS

Query:  PDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        PD  S H   P++ GE W ATK+ ++     +   + +    C D ++SC +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  PDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 24.9e-4636.52Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDH
        S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLI LA      S+  DN +G S  S    S+    S G      D I++ IE +++ WT LPK++
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDH

Query:  SMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNNFLTPVKGNAILFFSVHLNASP
            Q+++Y  G++ + H  ++ ++  ++     +ATV+LYLS+  +GGE +FP+++  S+   S          KK   + P KGNA+LFF++  +A P
Subjt:  SMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNNFLTPVKGNAILFFSVHLNASP

Query:  DKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        D  S H   P++ GE W ATK+ ++     +   + + +  C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  DKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase1.4e-5335.74Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSN-LISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR
        MDSR+     L   F+     +  N  ++     RD  + + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA    +        S 
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSN-LISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSR

Query:  NTVSTNLLSSSGVILN-TTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFP-----E
         +V + + +SSG+ L+   DDI++ +E ++A WT LP+++    QI+ Y  G++ E H  ++ +++ +      +ATV++YLS+  +GGE +FP      
Subjt:  NTVSTNLLSSSGVILN-TTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFP-----E

Query:  SKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAV
        +++K   W+   K+   + P KG+A+LFF++H NA+ D +S H   P++ GE W AT++ +++    +     + + GC+DE+ SC +WA  GEC++N  
Subjt:  SKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAV

Query:  FMIGSPDYYGTCRKSCNAC
        +M+GS   +G CRKSC AC
Subjt:  FMIGSPDYYGTCRKSCNAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase4.9e-5436.89Show/hide
Query:  MDSRLHFLLLLATAFSFSTCLAQSN-LISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSG
        MDSR+     L   F+     +  N  ++     RD  + + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA      S   DN SG
Subjt:  MDSRLHFLLLLATAFSFSTCLAQSN-LISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSG

Query:  NSAGSRNTVSTNLLSSSGVILN----TTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEM
         S  S ++VS  +  SS  I N      DDI++ +E ++A WT LP+++    QI+ Y  G++ E H  ++ +++ +      +ATV++YLS+  +GGE 
Subjt:  NSAGSRNTVSTNLLSSSGVILN----TTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEM

Query:  LFP-----ESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAA
        +FP      +++K   W+   K+   + P KG+A+LFF++H NA+ D +S H   P++ GE W AT++ +++    +     + + GC+DE+ SC +WA 
Subjt:  LFP-----ESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAA

Query:  IGECERNAVFMIGSPDYYGTCRKSCNAC
         GEC++N  +M+GS   +G CRKSC AC
Subjt:  IGECERNAVFMIGSPDYYGTCRKSCNAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase2.2e-5440.58Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS-EDNPSGNSAGSRNTVSTNLLSSSGVIL-NTTDDIIARIETRIALWTLLPKDHSMPF
        S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    E +       S  +  + + +SSG+ L    DDI+A +E ++A WT LP+++    
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS-EDNPSGNSAGSRNTVSTNLLSSSGVIL-NTTDDIIARIETRIALWTLLPKDHSMPF

Query:  QIMQYRG---EEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESK-----VKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYH
        QI+ Y      +    YFY ++ A+      +ATV++YLS+  +GGE +FP  K     +K   WS   K+   + P KG+A+LFF++HLN + D +S H
Subjt:  QIMQYRG---EEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESK-----VKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYH

Query:  TRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
           P++ GE W AT++ ++R + G K  V      C+D+ +SC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  TRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase4.9e-6243.73Show/hide
Query:  FLLLLATAFSFSTCLAQSNLISGRKGLRDQLL-----DRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNT
        FL+L+ T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLI L   + +  S ++ G    
Subjt:  FLLLLATAFSFSTCLAQSNLISGRKGLRDQLL-----DRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNT

Query:  VSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWS
                        D ++A IE +++ WT LP ++    ++  Y  E++  K  ++G   +    E L+ATVVLYLS++ +GGE+LFP S++K K  +
Subjt:  VSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWS

Query:  SRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDG-CIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S  +  N L PVKGNAILFF+  LNAS D  S H R P++ GEL VATK  Y       K     +E G C DED++C +WA +GEC++N V+MIGSPDY
Subjt:  SRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDG-CIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGTCTCCACTTTTTGCTTCTTTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCACAAAGCAATTTGATTAGTGGCCGGAAGGGTTTAAGGGACCAATT
GCTTGATAGACCTTTAAGCTACTCAAATCATTCAGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGAGACCAAGGGTTTTCTTGTATAAAGGCTTTCTCTCAG
ATGAGGAGTGTGATCATCTTATTTTTTTGGCTTCAAGTTCAGAAGACAATCCATCTGGGAACAGTGCTGGTTCCAGGAACACTGTCTCAACCAACTTGCTAAGCAGTTCA
GGAGTCATTTTAAACACAACAGATGATATAATTGCAAGAATTGAAACTCGAATTGCACTGTGGACTCTTCTCCCAAAAGATCATAGCATGCCTTTTCAGATCATGCAATA
CAGGGGTGAAGAAGCAGAGCATAAGTACTTTTATGGCAACAGATCTGCAATGTCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCGGATTCTGCTCGCG
GTGGCGAGATGCTCTTTCCAGAATCAAAGGTAAAGAGCAAATTTTGGTCAAGCCGGAGAAAGAAAAACAACTTTCTGACACCAGTGAAAGGCAATGCAATTCTTTTTTTC
TCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTACCACACCCGATCCCCAATACTCAATGGGGAATTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCAACCAC
GGGGAATAAACACACAGTTGAATCCGATGAAGACGGGTGCATTGATGAAGATAAAAGCTGTCCTCAATGGGCTGCCATTGGTGAATGCGAACGAAATGCTGTGTTCATGA
TCGGTTCTCCAGATTATTATGGTACATGTAGAAAAAGTTGTAATGCATGTTGA
mRNA sequenceShow/hide mRNA sequence
TTAATTTGTCGCAACCGCGAGGGACTGGAAGTTGAAGAAAGTGGCGGTGGGATCCAGATTCATAGAATTACAGAAGAGACCATGGATTTCTAAACTAGCTCCTTTCTGAA
AATCTATATCCACATTCTCTAATTTTTCAACTTCATCTTGTTTCGATCTTCGTCCCAGCCATCCATGGATTCTCGTCTCCACTTTTTGCTTCTTTTAGCGACTGCATTTT
CATTCTCAACCTGCCTTGCACAAAGCAATTTGATTAGTGGCCGGAAGGGTTTAAGGGACCAATTGCTTGATAGACCTTTAAGCTACTCAAATCATTCAGGAAGAATCGAC
CCATCAAGAGTTGTCCAAGTCTCTTGGAGACCAAGGGTTTTCTTGTATAAAGGCTTTCTCTCAGATGAGGAGTGTGATCATCTTATTTTTTTGGCTTCAAGTTCAGAAGA
CAATCCATCTGGGAACAGTGCTGGTTCCAGGAACACTGTCTCAACCAACTTGCTAAGCAGTTCAGGAGTCATTTTAAACACAACAGATGATATAATTGCAAGAATTGAAA
CTCGAATTGCACTGTGGACTCTTCTCCCAAAAGATCATAGCATGCCTTTTCAGATCATGCAATACAGGGGTGAAGAAGCAGAGCATAAGTACTTTTATGGCAACAGATCT
GCAATGTCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCGGATTCTGCTCGCGGTGGCGAGATGCTCTTTCCAGAATCAAAGGTAAAGAGCAAATTTTG
GTCAAGCCGGAGAAAGAAAAACAACTTTCTGACACCAGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTACCACACCC
GATCCCCAATACTCAATGGGGAATTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCAACCACGGGGAATAAACACACAGTTGAATCCGATGAAGACGGGTGCATTGAT
GAAGATAAAAGCTGTCCTCAATGGGCTGCCATTGGTGAATGCGAACGAAATGCTGTGTTCATGATCGGTTCTCCAGATTATTATGGTACATGTAGAAAAAGTTGTAATGC
ATGTTGAAGCATAACTAAATTCATGTAAAAATTATTCTCGTCGTCCTGATTTGAGTAAGTATTTGTTTATTTTTTCTGATTTCAAACAAGTTTGGATATTGAATTCTATT
GGCATATCTCTGGGTTAGAAGTTACTTCTTGCACCTTTAGAACCATATAGGATAGGAATTCTGCTAACACTGTAAGCCATTGTATTAATGGGATAATTACATTTTAGTAT
TTAGGTTTGAGATATGTTTCTATTT
Protein sequenceShow/hide protein sequence
MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSS
GVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFF
SVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC