CuGenDBv2

Sgr004682 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr004682
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: tig00003185:58792..64602
RNA-Seq Expression: Sgr004682
Synteny: Sgr004682
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
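The genome location in this record uses a `contig:start..end` notation. As a minimal sketch, the span could be parsed in Python as shown here. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00003185:58792..64602")
# Span length under the assumed 1-based, inclusive convention.
span_length = end - start + 1
print(contig, start, end, span_length)
```

If the coordinates turn out to be 0-based or half-open, only the `+ 1` in the length calculation would need to change.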


Homology
GenBank top hits | e value | % identity | Alignment
CAD5324401.1 unnamed protein product [Arabidopsis thaliana] | 2.9e-141 | 52.56
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  R+FLAFSLCFL+  P +S + NR    L  + N   GSV ++K   SS   DPTRVTQLS  PR FLY+GFLS EECDH I LAK KLEKSMVAD+ 
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G SV S+ RTSSG FL K QD+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGETVFP  + 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASA----------
        K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA A          
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASA----------

Query:  ---------------------------------------------------GLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIA
                                                           GLNRCGKSCRLRWTNYLRPDL+HDSF+ QEE+LII+ H+AIGSRWS IA
Subjt:  ---------------------------------------------------GLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIA

Query:  KQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNT---PKPLPSSFTT----------TMAKPHQPSTSSSPPAIANPPFPGT
        ++LPGRTDNDVKN+WNTKL+KKL KMGIDP+THKP SQ+L ++  IS   N     +P  +S  T               H+   ++SP    N     T
Subjt:  KQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNT---PKPLPSSFTT----------TMAKPHQPSTSSSPPAIANPPFPGT

Query:  FRPHFFNEP-------TSSCSSTSSSS
           HF++ P       TSSCSS+SSS+
Subjt:  FRPHFFNEP-------TSSCSSTSSSS

GEV60359.1 probable prolyl 4-hydroxylase 6 [Tanacetum cinerariifolium] | 5.5e-140 | 48.12
Query:  LLLDNKNMGGGSVSRIK-------PGGSSMA--VDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKA
        LLL N ++   S+ +I+       P G S     DPTRVTQ+S  PRAFLY+ FL+ +ECDHLI LAKDKLEKSMVAD+ +G S+ S+ RTSSG FL KA
Subjt:  LLLDNKNMGGGSVSRIK-------PGGSSMA--VDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKA

Query:  QDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSE-QEKDDLSDCAKIGYG
        QDE+VAG+ES+I+AWTFLP++NGE +Q+L YENGQKYEPH+DYF D  N A+GGHRIATVLMYLS+V+KGGETVFP S++K S+ + K+D S+CAK GY 
Subjt:  QDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSE-QEKDDLSDCAKIGYG

Query:  VKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPD-CVDENVYCAMWAS------------------------------
        VKPKKGDALLFFSLHPNAT D+ S HGSCPVIEGEKWSATKWIHV++ D      D C DENV C  WA+                              
Subjt:  VKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPD-CVDENVYCAMWAS------------------------------

Query:  ---------------------------------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLI
                                                           AGLNRCGKSCRLRWTNYLRPDL+HDSFTP EE+LI++ HQAIGSRWSLI
Subjt:  ---------------------------------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLI

Query:  AKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSL--PNTPKPL------PSSF-----TTTMAKPHQPSTSSSPPAIANPPFPG
        A++LPGRTDNDVKN+WNTKL+KKLSKMGIDPITHKPF Q+L DYG I+ +  PNT +P       PS F     + TM  P+Q     +P ++    F  
Subjt:  AKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSL--PNTPKPL------PSSF-----TTTMAKPHQPSTSSSPPAIANPPFPG

Query:  TFRPHFFNEPTSSCSSTSSSSFNGG------GGGGDVLFHFAASSSPEQSH-GIMEPFSQSSDGFVCGAMGQRGCEGSSSSSFNSFVDALLEQDFQIKGS
               NE  SS +S+ ++S   G          D L  +       +S  GI+ P  ++ + +    +         +++  SFV+++L +D +++  
Subjt:  TFRPHFFNEPTSSCSSTSSSSFNGG------GGGGDVLFHFAASSSPEQSH-GIMEPFSQSSDGFVCGAMGQRGCEGSSSSSFNSFVDALLEQDFQIKGS

Query:  FPEILEGCFDY
        FP++L+G  +Y
Subjt:  FPEILEGCFDY

KAF3955258.1 hypothetical protein CMV_019505 [Castanea mollissima] | 2.0e-166 | 64.54
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  R FLA SLCFL+ FP LS S  +LPG L DNK    GSV ++K G SS   DP+RVTQLS +PRAFLYKGFLS EECDHLINLA+DKLEKSMVAD+ 
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G S++S+ RTSSG FL K QDE+VA IE++IAAWTFLPI+NGE +QVL Y +G+KYEPH+DYF D  N  +GGHR+ATVLMYLS+V+KGGETVFPNS+ 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDT------IWRNPDCVDENVYCAMWASAGLNRC
        K+S+ + DD+SDCAK GY VKPKKGDALLFFSL+P+AT D+ S HGSCPVIEGEKWSATKWIHV+S +       +    DCVDEN  C +WA AGLNRC
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDT------IWRNPDCVDENVYCAMWASAGLNRC

Query:  GKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----
        GKSCRLRWTNYLR DL+HD FTPQEE+LII LH+AIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDP+THKP+SQIL DYG IS + N     
Subjt:  GKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----

Query:  -----------TPKPLPSSFTTT----------MAKPHQPSTSSSPPAI--ANPPF------PGTFRPHFFNEPTSSCSSTSSSS
                   TPK  PSS  T+          M   + PS  +  P++   +  F        T +PHFFNE TSSCSS+SSSS
Subjt:  -----------TPKPLPSSFTTT----------MAKPHQPSTSSSPPAI--ANPPF------PGTFRPHFFNEPTSSCSSTSSSS

RXH69379.1 hypothetical protein DVH24_037163 [Malus domestica] | 5.8e-158 | 49.05
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  R FLA SLCFL  FP L+ S  R+P LL   K    GSV R++ G SS   DPTRV+QLS +PRAFL+KGFLS EECDHLI +AKDKLEKSMVAD+ 
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G S+ S+ RTSSG FLLKAQDEIVA IE++IAAWTFLP++NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHR+ATVLMYLS+V+KGGETVFPNS+ 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLD-------------------TIW----------
        KLS+ + DD+SDCAK GY VKP KGDALLFFSLHPNAT D SS HGSCPVIEGEKWSATKWIHV+S +                    +W          
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLD-------------------TIW----------

Query:  -----------------RNPDCVDE-NVYCAMWAS------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQA
                         + P C D+ NV   +W +                        AGLNRCGKSCRLRWTNYLRPDL+HD+FTPQEE  II LH+A
Subjt:  -----------------RNPDCVDE-NVYCAMWAS------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQA

Query:  IGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----------------TPKPLPSS------FTTTMAKPH
        IGSRWS IAKQLPGRTDNDVKNYWNTKL+KKLSKMGIDP+THKP+SQIL DYG IS LP                   P+P PSS      +T  M  P+
Subjt:  IGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----------------TPKPLPSS------FTTTMAKPH

Query:  ---QPSTSSSPPAIANPPFPG--------TFRPHFFNEPTSSCSST----------------------------SSSSFNGGGGGGDVLFH--------F
           Q S  +S  +  +  F            +PHF NE TSSCSS+                            SSS FN      + L H        F
Subjt:  ---QPSTSSSPPAIANPPFPG--------TFRPHFFNEPTSSCSST----------------------------SSSSFNGGGGGGDVLFH--------F

Query:  AASSSPEQSHGIMEPFSQSSDGFVCGAMGQ----RGC------------EGSSSSSFNSFVDALLEQDFQIKGSFPEILEGCFDY
        A++   +Q   + +    + +    G  G      GC            + +SSSS +SFV+ +L+QD +++ +FP++L+  FDY
Subjt:  AASSSPEQSHGIMEPFSQSSDGFVCGAMGQ----RGC------------EGSSSSSFNSFVDALLEQDFQIKGSFPEILEGCFDY

RXH89486.1 hypothetical protein DVH24_031843 [Malus domestica] | 1.9e-153 | 51.85
Query:  FLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASV
        FLA SLCFL     L+ S   +PG L + K    GSV R + G SS   DPTRV+QL+ +PRAFL+KGFLS EECDHLI +AK+KLEKSMVAD+ +G S+
Subjt:  FLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASV

Query:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ
         S+ RTSSG FLLKAQDE+VA IE++IAAWTFLPI+NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHRIATVLMYLS+V+KGGETVFPNS+ KLS+ 
Subjt:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ

Query:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWA--------------
        + DD SDCAK GY VKP KGDAL+FFSLHPN T D SS HGSCPVIEGEKWSATKWIHV+S +           C DEN  C +WA              
Subjt:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWA--------------

Query:  -----------------------------------------------------SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRW
                                                              AGLNRCGKSCRLRWTNYLRPDL+HD+FTPQEE+ II LH+AIGSRW
Subjt:  -----------------------------------------------------SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRW

Query:  SLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTTMAKPHQPSTSSSPPAIANPPFPG-TFRPHF--
        S IAK+LPGRTDNDVKNYWNTKL+KKLSKMGIDP+THKP+SQIL DYG IS LP+T    P SS          P  +SS   I   P+   T  P+   
Subjt:  SLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTTMAKPHQPSTSSSPPAIANPPFPG-TFRPHF--

Query:  --FNEPTSSC-------SSTSSSSFN-------GGGGGGDVL-----FHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCE----GSSSSSFNSFVD
          F++PT++C        + SSSSFN             D L      H    SSPE     ++  S + +G  C  +GQR  +     +SS+S  SFV+
Subjt:  --FNEPTSSC-------SSTSSSSFN-------GGGGGGDVL-----FHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCE----GSSSSSFNSFVD

Query:  ALLEQDFQIKGSFPEILEGCFDY
         +L+QD +++ +FP++L+  FDY
Subjt:  ALLEQDFQIKGSFPEILEGCFDY

TrEMBL top hits | e value | % identity | Alignment
A0A498HEC5 Procollagen-proline 4-dioxygenase | 2.8e-158 | 49.05
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  R FLA SLCFL  FP L+ S  R+P LL   K    GSV R++ G SS   DPTRV+QLS +PRAFL+KGFLS EECDHLI +AKDKLEKSMVAD+ 
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G S+ S+ RTSSG FLLKAQDEIVA IE++IAAWTFLP++NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHR+ATVLMYLS+V+KGGETVFPNS+ 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLD-------------------TIW----------
        KLS+ + DD+SDCAK GY VKP KGDALLFFSLHPNAT D SS HGSCPVIEGEKWSATKWIHV+S +                    +W          
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLD-------------------TIW----------

Query:  -----------------RNPDCVDE-NVYCAMWAS------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQA
                         + P C D+ NV   +W +                        AGLNRCGKSCRLRWTNYLRPDL+HD+FTPQEE  II LH+A
Subjt:  -----------------RNPDCVDE-NVYCAMWAS------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQA

Query:  IGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----------------TPKPLPSS------FTTTMAKPH
        IGSRWS IAKQLPGRTDNDVKNYWNTKL+KKLSKMGIDP+THKP+SQIL DYG IS LP                   P+P PSS      +T  M  P+
Subjt:  IGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----------------TPKPLPSS------FTTTMAKPH

Query:  ---QPSTSSSPPAIANPPFPG--------TFRPHFFNEPTSSCSST----------------------------SSSSFNGGGGGGDVLFH--------F
           Q S  +S  +  +  F            +PHF NE TSSCSS+                            SSS FN      + L H        F
Subjt:  ---QPSTSSSPPAIANPPFPG--------TFRPHFFNEPTSSCSST----------------------------SSSSFNGGGGGGDVLFH--------F

Query:  AASSSPEQSHGIMEPFSQSSDGFVCGAMGQ----RGC------------EGSSSSSFNSFVDALLEQDFQIKGSFPEILEGCFDY
        A++   +Q   + +    + +    G  G      GC            + +SSSS +SFV+ +L+QD +++ +FP++L+  FDY
Subjt:  AASSSPEQSHGIMEPFSQSSDGFVCGAMGQ----RGC------------EGSSSSSFNSFVDALLEQDFQIKGSFPEILEGCFDY

A0A498J2T3 Procollagen-proline 4-dioxygenase | 9.4e-154 | 51.85
Query:  FLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASV
        FLA SLCFL     L+ S   +PG L + K    GSV R + G SS   DPTRV+QL+ +PRAFL+KGFLS EECDHLI +AK+KLEKSMVAD+ +G S+
Subjt:  FLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASV

Query:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ
         S+ RTSSG FLLKAQDE+VA IE++IAAWTFLPI+NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHRIATVLMYLS+V+KGGETVFPNS+ KLS+ 
Subjt:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ

Query:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWA--------------
        + DD SDCAK GY VKP KGDAL+FFSLHPN T D SS HGSCPVIEGEKWSATKWIHV+S +           C DEN  C +WA              
Subjt:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWA--------------

Query:  -----------------------------------------------------SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRW
                                                              AGLNRCGKSCRLRWTNYLRPDL+HD+FTPQEE+ II LH+AIGSRW
Subjt:  -----------------------------------------------------SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRW

Query:  SLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTTMAKPHQPSTSSSPPAIANPPFPG-TFRPHF--
        S IAK+LPGRTDNDVKNYWNTKL+KKLSKMGIDP+THKP+SQIL DYG IS LP+T    P SS          P  +SS   I   P+   T  P+   
Subjt:  SLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTTMAKPHQPSTSSSPPAIANPPFPG-TFRPHF--

Query:  --FNEPTSSC-------SSTSSSSFN-------GGGGGGDVL-----FHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCE----GSSSSSFNSFVD
          F++PT++C        + SSSSFN             D L      H    SSPE     ++  S + +G  C  +GQR  +     +SS+S  SFV+
Subjt:  --FNEPTSSC-------SSTSSSSFN-------GGGGGGDVL-----FHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCE----GSSSSSFNSFVD

Query:  ALLEQDFQIKGSFPEILEGCFDY
         +L+QD +++ +FP++L+  FDY
Subjt:  ALLEQDFQIKGSFPEILEGCFDY

A0A5C7HPP5 Uncharacterized protein | 9.5e-138 | 54.58
Query:  FLAFSLC-FLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGAS
        FLA SLC FL FFP LS S  ++PG L D +    GSVSR+K    S+  +PTRVTQLS  PRAFLYKGFLS EECDHLI+LAKDKLEKSMVAD+ +G S
Subjt:  FLAFSLC-FLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGAS

Query:  VVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSE
        V S+ RTSSG FL KAQDE+VA IE +IAAWTFLPI+NGE IQ+L YENGQKYEPH+DYF D VN  +GGHR+ TVLMYLS V+KGGET+FPNS+ K+S+
Subjt:  VVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSE

Query:  QEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDT--IW-------------------------------
         + +  S+CA+ GY VK +KGDALLF+SLHP+AT D  S HGSCPVIEGEKWSATKWIH  +  T  IW                               
Subjt:  QEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDT--IW-------------------------------

Query:  -----------------------------------------------------------RNPDCVDE-NVYCAMWA------------------------
                                                                     P C D+ NV   +W                         
Subjt:  -----------------------------------------------------------RNPDCVDE-NVYCAMWA------------------------

Query:  SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSL
         AGLNRCGKSCRLRWTNYLRPDL+HD FTP+EE+ II LH+AIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKL K+GIDPITHKPFSQI  D+G IS L
Subjt:  SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSL

Query:  PN
         N
Subjt:  PN

A0A5N5GEN8 Prolyl 4-hydroxylase subunit alpha-1-like | 5.5e-138 | 51
Query:  LAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVV
        LA SLCFL  FP L+ S  R+PG  LD K    GSV R++ G SS   DPTRV+QLS +PRAFLYKGFLS EECDHLI +AKDKLEKSMVAD+ +G S+ 
Subjt:  LAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVV

Query:  SKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQE
        S+ RTSSG FLLKAQDE+VA IE++IAAWTFLP++NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHRIATVLMYLS+V+KGGETVFPNS+ KLS+ +
Subjt:  SKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQE

Query:  KDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWASAGLNRCGKSCRLRW
         DD SDCA+ GY VKP KGDAL+FFSLHPNAT D +S HGSCPVIEGEKWSATKWIHV+S +           C DEN  C +WA A             
Subjt:  KDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWASAGLNRCGKSCRLRW

Query:  TNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTT
              DL+HD+FTPQEE+ II LH+AI S           RTDNDVKNYWNTKL+KK SKMGIDP+THKP+SQIL DYG IS LP+T    P SSF   
Subjt:  TNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTT

Query:  MAKPHQPSTSSS-------PPAIANPPFPG------------------------TFRPHFFNEPTSSCSST----------------------------S
              P  +SS          I NP   G                          +PH  NE TSSCSS+                            S
Subjt:  MAKPHQPSTSSS-------PPAIANPPFPG------------------------TFRPHFFNEPTSSCSST----------------------------S

Query:  SSSFNGGGGGGDVLFHFAASSSPEQS-HGIMEPFSQ--------SSDGFV-CGAMGQRGC----EGSSSSSFNSFVDALLEQDFQIKGSFPEILEGCFDY
        SSSFN         F  A     EQ  HG+M   S+        SSD    C  +GQR        +SS+S  SFV+ +L+QD +++ +FP++L+  FDY
Subjt:  SSSFNGGGGGGDVLFHFAASSSPEQS-HGIMEPFSQ--------SSDGFV-CGAMGQRGC----EGSSSSSFNSFVDALLEQDFQIKGSFPEILEGCFDY

A0A7G2ERG3 Procollagen-proline 4-dioxygenase | 1.4e-141 | 52.56
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  R+FLAFSLCFL+  P +S + NR    L  + N   GSV ++K   SS   DPTRVTQLS  PR FLY+GFLS EECDH I LAK KLEKSMVAD+ 
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G SV S+ RTSSG FL K QD+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGETVFP  + 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASA----------
        K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA A          
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASA----------

Query:  ---------------------------------------------------GLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIA
                                                           GLNRCGKSCRLRWTNYLRPDL+HDSF+ QEE+LII+ H+AIGSRWS IA
Subjt:  ---------------------------------------------------GLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIA

Query:  KQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNT---PKPLPSSFTT----------TMAKPHQPSTSSSPPAIANPPFPGT
        ++LPGRTDNDVKN+WNTKL+KKL KMGIDP+THKP SQ+L ++  IS   N     +P  +S  T               H+   ++SP    N     T
Subjt:  KQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNT---PKPLPSSFTT----------TMAKPHQPSTSSSPPAIANPPFPGT

Query:  FRPHFFNEP-------TSSCSSTSSSS
           HF++ P       TSSCSS+SSS+
Subjt:  FRPHFFNEP-------TSSCSSTSSSS

SwissProt top hits | e value | % identity | Alignment
F4J0A8 Probable prolyl 4-hydroxylase 6 | 5.6e-87 | 55.24
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  + FLAFSL  L  F  +S                             S +VDPTR+TQLS  PRAFLYKGFLS EECDHLI LAK KLEKSMV  DV
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  -TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQ
         +G S  S+ RTSSG FL K QD+IVA +E+K+AAWTFLP +NGE +Q+L YENGQKY+PH+DYF D   + +GGHRIATVLMYLS+V KGGETVFPN +
Subjt:  -TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQ

Query:  VKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAG---------
         K  + + D  S CAK GY VKP+KGDALLFF+LH N T D +S HGSCPVIEGEKWSAT+WIHV+S     +   CVD++  C  WA AG         
Subjt:  VKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAG---------

Query:  ------LNRCGKSCR
              L  C KSC+
Subjt:  ------LNRCGKSCR

F4JAU3 Prolyl 4-hydroxylase 2 | 7.6e-84 | 61.35
Query:  SMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLR
        S  ++P++V Q+SS+PRAF+Y+GFL+  ECDHLI+LAK+ L++S VAD+  G S VS  RTSSGTF+ K +D IV+GIE K++ WTFLP +NGE +QVLR
Subjt:  SMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLR

Query:  YENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGS
        YE+GQKY+ H+DYF D VNIA GGHRIATVL+YLS+V KGGETVFP++Q    +   + KDDLSDCAK G  VKPKKG+ALLFF+L  +A PD  S HG 
Subjt:  YENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGS

Query:  CPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAGLNRCGKS
        CPVIEGEKWSATKWIHV S D I   + +C D N  C  WA  G   CGK+
Subjt:  CPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAGLNRCGKS

Q8L970 Probable prolyl 4-hydroxylase 7 | 8.1e-102 | 62.54
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  R+FLAFSLCFL+  P +S + NR    L  + N   GSV ++K   SS   DPTRVTQLS  PR FLY+GFLS EECDH I LAK KLEKSMVAD+ 
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G SV S+ RTSSG FL K QD+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGETVFP  + 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG
        K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA AG
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG

Q8LAN3 Probable prolyl 4-hydroxylase 4 | 2.0e-84 | 61.22
Query:  SSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVL
        SS+ V+P++V Q+SS+PRAF+Y+GFL+  ECDH+++LAK  L++S VAD+ +G S  S+ RTSSGTF+ K +D IV+GIE KI+ WTFLP +NGE IQVL
Subjt:  SSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVL

Query:  RYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHG
        RYE+GQKY+ H+DYF D VNI  GGHR+AT+LMYLS+V KGGETVFP++++   ++  + K+DLSDCAK G  VKP+KGDALLFF+LHP+A PD  S HG
Subjt:  RYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHG

Query:  SCPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAG
         CPVIEGEKWSATKWIHV S D I   + +C D N  C  WA  G
Subjt:  SCPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAG

Q9LN20 Probable prolyl 4-hydroxylase 3 | 4.3e-63 | 57.84
Query:  LSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHY
        LS +PRAF+Y  FLS EEC++LI+LAK  + KS V D  TG S  S+ RTSSGTFL + +D+I+  IE +IA +TF+P D+GE +QVL YE GQKYEPHY
Subjt:  LSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHY

Query:  DYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQE-KDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATK
        DYF+D  N   GG R+AT+LMYLS V++GGETVFP + +  S     ++LS+C K G  VKP+ GDALLF+S+ P+AT D +S HG CPVI G KWS+TK
Subjt:  DYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQE-KDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATK

Query:  WIHV
        W+HV
Subjt:  WIHV

Arabidopsis top hits | e value | % identity | Alignment
AT3G06300.1 P4H isoform 2 | 5.4e-85 | 61.35
Query:  SMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLR
        S  ++P++V Q+SS+PRAF+Y+GFL+  ECDHLI+LAK+ L++S VAD+  G S VS  RTSSGTF+ K +D IV+GIE K++ WTFLP +NGE +QVLR
Subjt:  SMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLR

Query:  YENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGS
        YE+GQKY+ H+DYF D VNIA GGHRIATVL+YLS+V KGGETVFP++Q    +   + KDDLSDCAK G  VKPKKG+ALLFF+L  +A PD  S HG 
Subjt:  YENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGS

Query:  CPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAGLNRCGKS
        CPVIEGEKWSATKWIHV S D I   + +C D N  C  WA  G   CGK+
Subjt:  CPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAGLNRCGKS

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | 5.7e-103 | 62.54
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  R+FLAFSLCFL+  P +S + NR    L  + N   GSV ++K   SS   DPTRVTQLS  PR FLY+GFLS EECDH I LAK KLEKSMVAD+ 
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G SV S+ RTSSG FL K QD+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGETVFP  + 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG
        K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA AG
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | 6.1e-97 | 58.86
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  R+FLAFSLCFL+  P +S + NR    L  + N   GSV ++K   SS   DPTRVTQLS  PR FLY+GFLS EECDH I LAK KLEKSMVAD+ 
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  TGASVVSKERTS----SGTFLLKAQ----DEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGE
        +G SV S++  S    S +F+        D+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGE
Subjt:  TGASVVSKERTS----SGTFLLKAQ----DEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGE

Query:  TVFPNSQVKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG
        TVFP  + K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA AG
Subjt:  TVFPNSQVKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | 4.0e-88 | 55.24
Query:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV
        M  + FLAFSL  L  F  +S                             S +VDPTR+TQLS  PRAFLYKGFLS EECDHLI LAK KLEKSMV  DV
Subjt:  MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDV

Query:  -TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQ
         +G S  S+ RTSSG FL K QD+IVA +E+K+AAWTFLP +NGE +Q+L YENGQKY+PH+DYF D   + +GGHRIATVLMYLS+V KGGETVFPN +
Subjt:  -TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQ

Query:  VKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAG---------
         K  + + D  S CAK GY VKP+KGDALLFF+LH N T D +S HGSCPVIEGEKWSAT+WIHV+S     +   CVD++  C  WA AG         
Subjt:  VKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAG---------

Query:  ------LNRCGKSCR
              L  C KSC+
Subjt:  ------LNRCGKSCR

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 1.4e-85 | 61.22
Query:  SSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVL
        SS+ V+P++V Q+SS+PRAF+Y+GFL+  ECDH+++LAK  L++S VAD+ +G S  S+ RTSSGTF+ K +D IV+GIE KI+ WTFLP +NGE IQVL
Subjt:  SSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVL

Query:  RYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHG
        RYE+GQKY+ H+DYF D VNI  GGHR+AT+LMYLS+V KGGETVFP++++   ++  + K+DLSDCAK G  VKP+KGDALLFF+LHP+A PD  S HG
Subjt:  RYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHG

Query:  SCPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAG
         CPVIEGEKWSATKWIHV S D I   + +C D N  C  WA  G
Subjt:  SCPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAG


Sequences
CDS sequence
ATGGGCTGTCGGCTTTTTCTCGCATTTTCCCTCTGTTTCCTCTACTTCTTCCCTCCCCTTTCTCGCTCTGCCAATCGCTTGCCGGGATTGCTCCTAGACAACAAGAACAT
GGGAGGAGGATCTGTTAGTAGGATTAAACCAGGTGGTTCCTCCATGGCAGTTGATCCCACTCGTGTCACTCAGCTTTCATCGCAACCCAGGGCTTTCTTATATAAGGGAT
TTTTGTCTGCAGAGGAGTGCGATCATCTTATCAATTTGGCGAAGGATAAGCTTGAGAAATCAATGGTGGCCGATGATGTAACGGGTGCGAGTGTTGTGAGTAAAGAACGA
ACGAGTAGCGGCACGTTCCTTCTCAAGGCTCAGGACGAAATAGTTGCAGGCATCGAGTCCAAGATTGCTGCATGGACCTTCCTTCCCATTGATAATGGGGAGCCTATTCA
AGTACTGAGGTACGAGAACGGTCAGAAATATGAGCCACATTACGATTATTTTCTAGACCCAGTTAATATAGCTGTTGGCGGTCACCGGATCGCCACAGTCTTGATGTATT
TGTCCCATGTCAAAAAGGGTGGAGAAACTGTCTTTCCCAATTCTCAGGTTAAATTATCCGAGCAGGAGAAGGATGACTTGTCCGATTGTGCTAAGATTGGTTATGGAGTA
AAACCAAAGAAGGGTGATGCATTGCTGTTCTTCAGTCTGCATCCAAATGCGACGCCAGACTCGAGCAGCTATCATGGGAGCTGCCCGGTGATAGAGGGCGAGAAGTGGTC
TGCGACGAAATGGATTCACGTGCAATCACTCGATACGATTTGGAGGAATCCAGATTGTGTGGATGAGAATGTGTACTGCGCTATGTGGGCTAGTGCAGGGCTGAATAGAT
GTGGGAAGAGTTGCAGGCTGAGATGGACTAATTACCTCAGGCCTGATCTCAGGCATGACAGCTTCACTCCTCAAGAAGAAGACCTCATTATCAAACTCCATCAAGCCATT
GGAAGCAGGTGGTCTCTGATAGCAAAGCAACTTCCTGGAAGAACAGACAATGATGTGAAGAATTACTGGAACACAAAGCTGAGGAAGAAGCTTTCAAAGATGGGAATTGA
TCCAATAACTCACAAGCCATTCTCTCAGATCCTCTTCGATTATGGAACCATAAGCAGCCTCCCCAACACCCCAAAACCACTTCCCAGCTCCTTCACCACAACAATGGCGA
AACCCCACCAGCCATCTACTTCTTCTTCACCTCCCGCCATAGCCAACCCTCCATTTCCAGGGACCTTTCGACCACACTTTTTCAATGAACCAACCTCCTCCTGCTCATCC
ACTTCATCTTCTTCCTTCAATGGCGGCGGCGGCGGCGGCGACGTTCTCTTCCATTTCGCTGCGTCTTCTTCGCCGGAGCAGAGCCATGGAATAATGGAGCCGTTTTCGCA
GAGCTCAGATGGGTTTGTTTGTGGAGCCATGGGCCAGCGGGGATGTGAAGGTTCTTCTAGTTCGTCTTTCAATTCGTTTGTGGATGCTCTGTTGGAGCAGGATTTCCAGA
TTAAGGGTTCGTTTCCGGAGATTTTGGAGGGGTGTTTTGATTACTGA
mRNA sequence
ATGGGCTGTCGGCTTTTTCTCGCATTTTCCCTCTGTTTCCTCTACTTCTTCCCTCCCCTTTCTCGCTCTGCCAATCGCTTGCCGGGATTGCTCCTAGACAACAAGAACAT
GGGAGGAGGATCTGTTAGTAGGATTAAACCAGGTGGTTCCTCCATGGCAGTTGATCCCACTCGTGTCACTCAGCTTTCATCGCAACCCAGGGCTTTCTTATATAAGGGAT
TTTTGTCTGCAGAGGAGTGCGATCATCTTATCAATTTGGCGAAGGATAAGCTTGAGAAATCAATGGTGGCCGATGATGTAACGGGTGCGAGTGTTGTGAGTAAAGAACGA
ACGAGTAGCGGCACGTTCCTTCTCAAGGCTCAGGACGAAATAGTTGCAGGCATCGAGTCCAAGATTGCTGCATGGACCTTCCTTCCCATTGATAATGGGGAGCCTATTCA
AGTACTGAGGTACGAGAACGGTCAGAAATATGAGCCACATTACGATTATTTTCTAGACCCAGTTAATATAGCTGTTGGCGGTCACCGGATCGCCACAGTCTTGATGTATT
TGTCCCATGTCAAAAAGGGTGGAGAAACTGTCTTTCCCAATTCTCAGGTTAAATTATCCGAGCAGGAGAAGGATGACTTGTCCGATTGTGCTAAGATTGGTTATGGAGTA
AAACCAAAGAAGGGTGATGCATTGCTGTTCTTCAGTCTGCATCCAAATGCGACGCCAGACTCGAGCAGCTATCATGGGAGCTGCCCGGTGATAGAGGGCGAGAAGTGGTC
TGCGACGAAATGGATTCACGTGCAATCACTCGATACGATTTGGAGGAATCCAGATTGTGTGGATGAGAATGTGTACTGCGCTATGTGGGCTAGTGCAGGGCTGAATAGAT
GTGGGAAGAGTTGCAGGCTGAGATGGACTAATTACCTCAGGCCTGATCTCAGGCATGACAGCTTCACTCCTCAAGAAGAAGACCTCATTATCAAACTCCATCAAGCCATT
GGAAGCAGGTGGTCTCTGATAGCAAAGCAACTTCCTGGAAGAACAGACAATGATGTGAAGAATTACTGGAACACAAAGCTGAGGAAGAAGCTTTCAAAGATGGGAATTGA
TCCAATAACTCACAAGCCATTCTCTCAGATCCTCTTCGATTATGGAACCATAAGCAGCCTCCCCAACACCCCAAAACCACTTCCCAGCTCCTTCACCACAACAATGGCGA
AACCCCACCAGCCATCTACTTCTTCTTCACCTCCCGCCATAGCCAACCCTCCATTTCCAGGGACCTTTCGACCACACTTTTTCAATGAACCAACCTCCTCCTGCTCATCC
ACTTCATCTTCTTCCTTCAATGGCGGCGGCGGCGGCGGCGACGTTCTCTTCCATTTCGCTGCGTCTTCTTCGCCGGAGCAGAGCCATGGAATAATGGAGCCGTTTTCGCA
GAGCTCAGATGGGTTTGTTTGTGGAGCCATGGGCCAGCGGGGATGTGAAGGTTCTTCTAGTTCGTCTTTCAATTCGTTTGTGGATGCTCTGTTGGAGCAGGATTTCCAGA
TTAAGGGTTCGTTTCCGGAGATTTTGGAGGGGTGTTTTGATTACTGA
Protein sequence
MGCRLFLAFSLCFLYFFPPLSRSANRLPGLLLDNKNMGGGSVSRIKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEECDHLINLAKDKLEKSMVADDVTGASVVSKER
TSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQEKDDLSDCAKIGYGV
KPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAI
GSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLPSSFTTTMAKPHQPSTSSSPPAIANPPFPGTFRPHFFNEPTSSCSS
TSSSSFNGGGGGGDVLFHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCEGSSSSSFNSFVDALLEQDFQIKGSFPEILEGCFDY
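The CDS and protein sequences in this record can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB. Only the first 24 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 bases of the CDS sequence listed in this record.
cds_start = "ATGGGCTGTCGGCTTTTTCTCGCA"
print(translate(cds_start))
```

The printed peptide should agree with the first eight residues of the protein sequence in this record (MGCRLFLA). Passing the full CDS, joined into a single string without line breaks, would allow the same comparison over the whole protein.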