; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030678 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030678
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr11:346799..349487
RNA-Seq ExpressionLag0030678
SyntenyLag0030678
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043468.1 putative prolyl 4-hydroxylase 12 [Cucumis melo var. makuwa]1.1e-14885.9Show/hide
Query:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG
        M SRLNFLLL A AFSFS+CL QSNLISGRKGLRDQ VD  PLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLA+NS+D PSRNS+ SG
Subjt:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG

Query:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF
        NTVST+LLN SG ILNTTDDIIARIENRIAVWT LPKDH MPFQIMQYRGE AKHKY +GNRSAM +SSEPLMATVVLYLSDSASGG+MLFPESKVK KF
Subjt:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF

Query:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD
        WSGRRKK N LRPVKGNAILFFSVHLNASPDKSSYH R PI +GELWVATKF YLRP T NKHT++S+ D CIDEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD

Query:  YYGTCRKSCHAC
        YYGTCRKSC+AC
Subjt:  YYGTCRKSCHAC

XP_004152378.1 probable prolyl 4-hydroxylase 12 [Cucumis sativus]4.5e-15085.9Show/hide
Query:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG
        M SRLNFLLL+A AFSFS+CL QSNLISGRKGLRD+ VD  PLSYSN+SGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA+NSED PSRNS+ SG
Subjt:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG

Query:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF
         TVST+LLNSSG ILNTTDDI+ARIENR+A+WT LPKDHSMPFQIMQYRGE AKHKY +GNRSAM  SSEPLMATVVLYLSDSASGG++LFPESKVK KF
Subjt:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF

Query:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD
        WSGRRKKNN LRPVKGNAILFFSVHLNASPDKSSYH RSPI DGELWVATKF YL P   NKHT++SD D C DEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD

Query:  YYGTCRKSCHAC
        YYGTCRKSC+AC
Subjt:  YYGTCRKSCHAC

XP_008436994.1 PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo]1.7e-14986.54Show/hide
Query:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG
        M SRLNFLLL A AFSFS+CL QSNLISGRKGLRDQ VD  PLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA+NSED PSRNS+ SG
Subjt:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG

Query:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF
        NTVST+LLN SG ILNTTDDIIARIENRIAVWT LPKDH MPFQIMQYRGE AKHKY +GNRSAM +SSEPLMATVVLYLSDSASGG+MLFPESKVK KF
Subjt:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF

Query:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD
        WSGRRKK N LRPVKGNAILFFSVHLNASPDKSSYH R PI +GELWVATKF YLRP T NKHT++S+ D CIDEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD

Query:  YYGTCRKSCHAC
        YYGTCRKSC+AC
Subjt:  YYGTCRKSCHAC

XP_022159842.1 probable prolyl 4-hydroxylase 12 [Momordica charantia]7.7e-15086.5Show/hide
Query:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG
        M SRL  LLL+A A SF SCL QSNLISGRKGLRDQ ++SVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA +SEDKPS NS+DSG
Subjt:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG

Query:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFW
        NTV TK+L SSGAILNTTDDIIARIENRIAVWTFLPKD+SMP QI+QY GE A+HKY FGNRSAM SSEPLMATVVLYLSDSASGG+M FPESKVK +FW
Subjt:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFW

Query:  SGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKKNNILRPVKGNA+L FSVHLNASPDKSS HTRSPILDGELW+ATKFFYLRP T NKHT E DG DC DEDKSCPQWAAIGECERNAVFMIGSPDY
Subjt:  SGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCHAC
        YGTCRKSC+AC
Subjt:  YGTCRKSCHAC

XP_038906497.1 probable prolyl 4-hydroxylase 12 [Benincasa hispida]6.7e-15488.42Show/hide
Query:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG
        M SRLNFLLL+A AFSFS+CL QSNLISGRKGLRDQ VD  PLSYSNHSGRIDPSRVVQVSW+PRVFLYKGFLSDEECDHLISLA+NSED PS NS+ SG
Subjt:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG

Query:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFW
        NTVSTKLLNSSG ILNT+DDIIARIEN+IAVWTFLPKDH MPFQIMQYRGE A+HKY +GN SAM+SSEPLMATVVLYLSDSA GG+MLFPESKVK KFW
Subjt:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFW

Query:  SGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKKNN LRPVKGNAILFFSVHLNASPDKSSYHTRSPIL+GELWVATKFFYLRPTT NK TVESD D CIDEDKSCPQWAAIGECERN VFMIGSPDY
Subjt:  SGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCHAC
        YGTCRKSC+AC
Subjt:  YGTCRKSCHAC

TrEMBL top hitse value%identityAlignment
A0A0A0KPE4 Procollagen-proline 4-dioxygenase2.2e-14286.21Show/hide
Query:  QSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTVSTKLLNSSGAILNTTDDII
        +SNLISGRKGLRD+ VD  PLSYSN+SGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA+NSED PSRNS+ SG TVST+LLNSSG ILNTTDDI+
Subjt:  QSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTVSTKLLNSSGAILNTTDDII

Query:  ARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFWSGRRKKNNILRPVKGNAILFF
        ARIENR+A+WT LPKDHSMPFQIMQYRGE AKHKY +GNRSAM  SSEPLMATVVLYLSDSASGG++LFPESKVK KFWSGRRKKNN LRPVKGNAILFF
Subjt:  ARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFWSGRRKKNNILRPVKGNAILFF

Query:  SVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC
        SVHLNASPDKSSYH RSPI DGELWVATKF YL P   NKHT++SD D C DEDKSCPQWAAIGECERNAVFM+GSPDYYGTCRKSC+AC
Subjt:  SVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC

A0A1S3AT39 Procollagen-proline 4-dioxygenase8.3e-15086.54Show/hide
Query:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG
        M SRLNFLLL A AFSFS+CL QSNLISGRKGLRDQ VD  PLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA+NSED PSRNS+ SG
Subjt:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG

Query:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF
        NTVST+LLN SG ILNTTDDIIARIENRIAVWT LPKDH MPFQIMQYRGE AKHKY +GNRSAM +SSEPLMATVVLYLSDSASGG+MLFPESKVK KF
Subjt:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF

Query:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD
        WSGRRKK N LRPVKGNAILFFSVHLNASPDKSSYH R PI +GELWVATKF YLRP T NKHT++S+ D CIDEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD

Query:  YYGTCRKSCHAC
        YYGTCRKSC+AC
Subjt:  YYGTCRKSCHAC

A0A5A7TKX1 Procollagen-proline 4-dioxygenase5.4e-14985.9Show/hide
Query:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG
        M SRLNFLLL A AFSFS+CL QSNLISGRKGLRDQ VD  PLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD+ECDHLISLA+NS+D PSRNS+ SG
Subjt:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG

Query:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF
        NTVST+LLN SG ILNTTDDIIARIENRIAVWT LPKDH MPFQIMQYRGE AKHKY +GNRSAM +SSEPLMATVVLYLSDSASGG+MLFPESKVK KF
Subjt:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAM-ASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF

Query:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD
        WSGRRKK N LRPVKGNAILFFSVHLNASPDKSSYH R PI +GELWVATKF YLRP T NKHT++S+ D CIDEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD

Query:  YYGTCRKSCHAC
        YYGTCRKSC+AC
Subjt:  YYGTCRKSCHAC

A0A6J1E0X9 Procollagen-proline 4-dioxygenase3.7e-15086.5Show/hide
Query:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG
        M SRL  LLL+A A SF SCL QSNLISGRKGLRDQ ++SVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA +SEDKPS NS+DSG
Subjt:  MASRLNFLLLVA-AFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG

Query:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFW
        NTV TK+L SSGAILNTTDDIIARIENRIAVWTFLPKD+SMP QI+QY GE A+HKY FGNRSAM SSEPLMATVVLYLSDSASGG+M FPESKVK +FW
Subjt:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFW

Query:  SGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        S RRKKNNILRPVKGNA+L FSVHLNASPDKSS HTRSPILDGELW+ATKFFYLRP T NKHT E DG DC DEDKSCPQWAAIGECERNAVFMIGSPDY
Subjt:  SGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCHAC
        YGTCRKSC+AC
Subjt:  YGTCRKSCHAC

A0A6J1E2P0 Procollagen-proline 4-dioxygenase7.5e-14382.17Show/hide
Query:  MASRLNFLLLV-AAFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG
        M SRL FLLL+ AAFSFSSCL QSN ISGRKGLRDQ V+S  LSYSNHS RIDPSRVVQ+SW+PR FLYKGFLSDEECDHLI+LA+NSEDKPSRN++ S 
Subjt:  MASRLNFLLLV-AAFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSG

Query:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRG-EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF
        NTVSTK L +SGAILNTTDDII RIENRIAVWTFLPKDHSMPFQIM+Y G E A HKY FGNRSAM SSEPLMATVVLYLSDSASGG++LFP SKVK +F
Subjt:  NTVSTKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRG-EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKF

Query:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRP-TTWNKHTVESD-GDDCIDEDKSCPQWAAIGECERNAVFMIGS
        WS RRKKNN LRPVKGNA+LFFSVHLNASPDKS YH+R+PILDG+LWVATKFFY+RP  T N+H VES   DDCIDED+SCP+WAAIGEC+RNAVFMIGS
Subjt:  WSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRP-TTWNKHTVESD-GDDCIDEDKSCPQWAAIGECERNAVFMIGS

Query:  PDYYGTCRKSCHAC
        PDYYGTCRKSC+AC
Subjt:  PDYYGTCRKSCHAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.7e-5440.36Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSS-DSGNTVSTKLLNSSGAIL-NTTDDIIARIENRIAVWTFLPKDHSMPF
        S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    +K    +  DSG +  +++  SSG  L    DDI+A +E ++A WTFLP+++    
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSS-DSGNTVSTKLLNSSGAIL-NTTDDIIARIENRIAVWTFLPKDHSMPF

Query:  QIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESK-----VKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHT
        QI+ Y    +   H   F ++ A+      +ATV++YLS+   GG+ +FP  K     +K   WS   K+   ++P KG+A+LFF++HLN + D +S H 
Subjt:  QIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESK-----VKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHT

Query:  RSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC
          P+++GE W AT++ ++R     K         C+D+ +SC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  RSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC

F4JAU3 Prolyl 4-hydroxylase 23.3e-4737.18Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTVSTKLLNSSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ
        S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +  + +   +D+G +  + +  SSG  ++   D I++ IE++++ WTFLPK++    Q
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTVSTKLLNSSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ

Query:  IMQYR--GEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESK--------VKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSY
        +++Y    +   H   F ++  +A     +ATV+LYLS+   GG+ +FP+++              S   KK   ++P KGNA+LFF++  +A PD  S 
Subjt:  IMQYR--GEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESK--------VKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSY

Query:  HTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC
        H   P+++GE W ATK+ ++   +++K  +  DG +C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  HTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC

Q8GXT7 Probable prolyl 4-hydroxylase 125.7e-6344.98Show/hide
Query:  FLLLVAAFSFSSCLVQSNLISGRKGLRDQWV----DSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTV
        FL+L+   S SS    S     RK LRD+ +    D    SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S ++   G T 
Subjt:  FLLLVAAFSFSSCLVQSNLISGRKGLRDQWV----DSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTV

Query:  STKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYS-FGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFWSG
                       D ++A IE +++ WTFLP ++    ++  Y  E +  K   FG   +    E L+ATVVLYLS++  GG++LFP S++K K  + 
Subjt:  STKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYS-FGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFWSG

Query:  RRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          +  NILRPVKGNAILFF+  LNAS D  S H R P++ GEL VATK  Y +     +  +E  G +C DED++C +WA +GEC++N V+MIGSPDYYG
Subjt:  RRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCHAC
        TCRKSC+AC
Subjt:  TCRKSCHAC

Q8L970 Probable prolyl 4-hydroxylase 72.8e-5437.15Show/hide
Query:  MASRLNFLLLVAAFSFSSCLVQS---NLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSD
        M SR+ FL     F F+  L+ S     ++     RD  V  + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA    +K     +D
Subjt:  MASRLNFLLLVAAFSFSSCLVQS---NLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSD

Query:  SGNTVSTKLLNSSGAILN-TTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFP----
        SG +V +++  SSG  L+   DDI++ +E ++A WTFLP+++    QI+ Y    +   H   F +++ +      +ATV++YLS+   GG+ +FP    
Subjt:  SGNTVSTKLLNSSGAILN-TTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFP----

Query:  -ESKVKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLR--PTTWNKHTVESDGDDCIDEDKSCPQWAAIGECE
          +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P+++GE W AT++ +++     +NK +       C+DE+ SC +WA  GEC+
Subjt:  -ESKVKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLR--PTTWNKHTVESDGDDCIDEDKSCPQWAAIGECE

Query:  RNAVFMIGSPDYYGTCRKSCHAC
        +N  +M+GS   +G CRKSC AC
Subjt:  RNAVFMIGSPDYYGTCRKSCHAC

Q8LAN3 Probable prolyl 4-hydroxylase 46.3e-4634.3Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTVSTKLLNSSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ
        S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA  S  + +   +DSG +  +++  SSG  ++   D I++ IE++I+ WTFLPK++    Q
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTVSTKLLNSSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ

Query:  IMQYR--GEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKV--------KHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSY
        +++Y    +   H   F ++  +      MAT+++YLS+   GG+ +FP++++          +  S   K+   ++P KG+A+LFF++H +A PD  S 
Subjt:  IMQYR--GEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKV--------KHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSY

Query:  HTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC
        H   P+++GE W ATK+ ++     +   + +   +C D ++SC +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  HTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 22.4e-4837.18Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTVSTKLLNSSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ
        S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +  + +   +D+G +  + +  SSG  ++   D I++ IE++++ WTFLPK++    Q
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTVSTKLLNSSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ

Query:  IMQYR--GEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESK--------VKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSY
        +++Y    +   H   F ++  +A     +ATV+LYLS+   GG+ +FP+++              S   KK   ++P KGNA+LFF++  +A PD  S 
Subjt:  IMQYR--GEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESK--------VKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSY

Query:  HTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC
        H   P+++GE W ATK+ ++   +++K  +  DG +C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  HTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase2.0e-5537.15Show/hide
Query:  MASRLNFLLLVAAFSFSSCLVQS---NLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSD
        M SR+ FL     F F+  L+ S     ++     RD  V  + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA    +K     +D
Subjt:  MASRLNFLLLVAAFSFSSCLVQS---NLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSD

Query:  SGNTVSTKLLNSSGAILN-TTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFP----
        SG +V +++  SSG  L+   DDI++ +E ++A WTFLP+++    QI+ Y    +   H   F +++ +      +ATV++YLS+   GG+ +FP    
Subjt:  SGNTVSTKLLNSSGAILN-TTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFP----

Query:  -ESKVKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLR--PTTWNKHTVESDGDDCIDEDKSCPQWAAIGECE
          +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P+++GE W AT++ +++     +NK +       C+DE+ SC +WA  GEC+
Subjt:  -ESKVKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLR--PTTWNKHTVESDGDDCIDEDKSCPQWAAIGECE

Query:  RNAVFMIGSPDYYGTCRKSCHAC
        +N  +M+GS   +G CRKSC AC
Subjt:  RNAVFMIGSPDYYGTCRKSCHAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase7.1e-5336.25Show/hide
Query:  MASRLNFLLLVAAFSFSSCLVQS---NLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSD
        M SR+ FL     F F+  L+ S     ++     RD  V  + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA    +K     +D
Subjt:  MASRLNFLLLVAAFSFSSCLVQS---NLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSD

Query:  SGNTVSTK-----LLNSSGAILN----TTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGK
        SG +V ++     +  SS  I N      DDI++ +E ++A WTFLP+++    QI+ Y    +   H   F +++ +      +ATV++YLS+   GG+
Subjt:  SGNTVSTK-----LLNSSGAILN----TTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGK

Query:  MLFP-----ESKVKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLR--PTTWNKHTVESDGDDCIDEDKSCPQ
         +FP      +++K   W+   K+   ++P KG+A+LFF++H NA+ D +S H   P+++GE W AT++ +++     +NK +       C+DE+ SC +
Subjt:  MLFP-----ESKVKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLR--PTTWNKHTVESDGDDCIDEDKSCPQ

Query:  WAAIGECERNAVFMIGSPDYYGTCRKSCHAC
        WA  GEC++N  +M+GS   +G CRKSC AC
Subjt:  WAAIGECERNAVFMIGSPDYYGTCRKSCHAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.2e-5540.36Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSS-DSGNTVSTKLLNSSGAIL-NTTDDIIARIENRIAVWTFLPKDHSMPF
        S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    +K    +  DSG +  +++  SSG  L    DDI+A +E ++A WTFLP+++    
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSS-DSGNTVSTKLLNSSGAIL-NTTDDIIARIENRIAVWTFLPKDHSMPF

Query:  QIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESK-----VKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHT
        QI+ Y    +   H   F ++ A+      +ATV++YLS+   GG+ +FP  K     +K   WS   K+   ++P KG+A+LFF++HLN + D +S H 
Subjt:  QIMQYRG--EVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESK-----VKHKFWSGRRKKNNILRPVKGNAILFFSVHLNASPDKSSYHT

Query:  RSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC
          P+++GE W AT++ ++R     K         C+D+ +SC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  RSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHAC

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase4.0e-6444.98Show/hide
Query:  FLLLVAAFSFSSCLVQSNLISGRKGLRDQWV----DSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTV
        FL+L+   S SS    S     RK LRD+ +    D    SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL   + +  S ++   G T 
Subjt:  FLLLVAAFSFSSCLVQSNLISGRKGLRDQWV----DSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTV

Query:  STKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYS-FGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFWSG
                       D ++A IE +++ WTFLP ++    ++  Y  E +  K   FG   +    E L+ATVVLYLS++  GG++LFP S++K K  + 
Subjt:  STKLLNSSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYS-FGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFWSG

Query:  RRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYG
          +  NILRPVKGNAILFF+  LNAS D  S H R P++ GEL VATK  Y +     +  +E  G +C DED++C +WA +GEC++N V+MIGSPDYYG
Subjt:  RRKKNNILRPVKGNAILFFSVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYG

Query:  TCRKSCHAC
        TCRKSC+AC
Subjt:  TCRKSCHAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCGTCTTAACTTTCTGCTCTTAGTGGCTGCATTTTCATTCTCAAGCTGCCTTGTACAAAGCAATTTGATCAGTGGCCGGAAGGGTTTAAGGGACCAATGGGT
TGACAGTGTACCTTTGAGCTACTCAAATCATTCTGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTTTCAG
ATGAGGAGTGTGATCATCTTATTTCTTTGGCTGCAAATTCAGAAGATAAACCCTCTAGGAACAGTTCTGATTCCGGGAACACTGTCTCAACCAAATTGCTGAACAGTTCA
GGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGATTGAAAATCGAATTGCGGTGTGGACTTTTCTCCCAAAAGATCACAGCATGCCTTTTCAGATTATGCAATA
CAGGGGTGAAGTAGCAAAGCATAAGTACAGTTTTGGCAACAGATCTGCAATGGCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGATTCTGCTAGCG
GTGGCAAGATGCTCTTTCCAGAATCAAAGGTAAAGCACAAATTTTGGTCGGGTCGGAGAAAGAAAAACAACATTCTGAGACCAGTGAAAGGGAATGCAATTCTTTTTTTC
TCTGTACATCTTAATGCTTCTCCAGACAAGAGTAGCTACCATACCCGATCTCCAATACTCGATGGGGAGTTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCAACCAC
ATGGAATAAACACACAGTTGAATCCGATGGAGACGACTGCATTGATGAAGATAAAAGCTGCCCCCAATGGGCTGCCATTGGCGAATGCGAACGAAATGCTGTTTTCATGA
TCGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCCATGCATGTCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTCGTCTTAACTTTCTGCTCTTAGTGGCTGCATTTTCATTCTCAAGCTGCCTTGTACAAAGCAATTTGATCAGTGGCCGGAAGGGTTTAAGGGACCAATGGGT
TGACAGTGTACCTTTGAGCTACTCAAATCATTCTGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTTTCAG
ATGAGGAGTGTGATCATCTTATTTCTTTGGCTGCAAATTCAGAAGATAAACCCTCTAGGAACAGTTCTGATTCCGGGAACACTGTCTCAACCAAATTGCTGAACAGTTCA
GGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGATTGAAAATCGAATTGCGGTGTGGACTTTTCTCCCAAAAGATCACAGCATGCCTTTTCAGATTATGCAATA
CAGGGGTGAAGTAGCAAAGCATAAGTACAGTTTTGGCAACAGATCTGCAATGGCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGATTCTGCTAGCG
GTGGCAAGATGCTCTTTCCAGAATCAAAGGTAAAGCACAAATTTTGGTCGGGTCGGAGAAAGAAAAACAACATTCTGAGACCAGTGAAAGGGAATGCAATTCTTTTTTTC
TCTGTACATCTTAATGCTTCTCCAGACAAGAGTAGCTACCATACCCGATCTCCAATACTCGATGGGGAGTTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCAACCAC
ATGGAATAAACACACAGTTGAATCCGATGGAGACGACTGCATTGATGAAGATAAAAGCTGCCCCCAATGGGCTGCCATTGGCGAATGCGAACGAAATGCTGTTTTCATGA
TCGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCCATGCATGTCGATGA
Protein sequenceShow/hide protein sequence
MASRLNFLLLVAAFSFSSCLVQSNLISGRKGLRDQWVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLAANSEDKPSRNSSDSGNTVSTKLLNSS
GAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYRGEVAKHKYSFGNRSAMASSEPLMATVVLYLSDSASGGKMLFPESKVKHKFWSGRRKKNNILRPVKGNAILFF
SVHLNASPDKSSYHTRSPILDGELWVATKFFYLRPTTWNKHTVESDGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCHACR