CuGenDBv2

Sgr021452 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021452
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: tig00153699:464746..470573
RNA-Seq Expression: Sgr021452
Synteny: Sgr021452
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
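The genome location recorded above for this gene, tig00153699:464746..470573, uses a `contig:start..end` layout. The following is a minimal sketch of how such a string might be split into its parts. The names `GenomeLocation` and `parse_location` are hypothetical and not part of CuGenDB, and the span length assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# contig name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'tig00153699:464746..470573'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return GenomeLocation(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00153699:464746..470573")
print(loc.contig, loc.start, loc.end, loc.length)
# prints: tig00153699 464746 470573 5828
```

Under the inclusive-coordinate assumption the gene spans 5828 bp on contig tig00153699.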


Homology
GenBank top hits (e value, % identity, alignment)
CAD5324401.1 unnamed protein product [Arabidopsis thaliana]; e value: 1.8e-138; % identity: 52.37
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  R+FLAFSLCFL   P  S + NR    L  + N   GSV +MK   SS   DPTRVTQLS  PR FLY+GFLS EE DH I LAK KLE SMVAD+ 
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G SV S+ RTSSG FL K QD+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGETVFP  + 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASA----------
        K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA A          
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASA----------

Query:  ---------------------------------------------------GLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIA
                                                           GLNRCGKSCRLRWTNYLRPDL+HDSF+ QEE+LII+ H+AIGSRWS IA
Subjt:  ---------------------------------------------------GLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIA

Query:  KQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNT---PKPLPSSFTT----------TMAKPHQPSTSSSPPAIANPPFPGT
        ++LPGRTDNDVKN+WNTKL+KKL KMGIDP+THKP SQ+L ++  IS   N     +P  +S  T               H+   ++SP    N     T
Subjt:  KQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNT---PKPLPSSFTT----------TMAKPHQPSTSSSPPAIANPPFPGT

Query:  FRPHFFNEP-------TSSCSSTSSSS
           HF++ P       TSSCSS+SSS+
Subjt:  FRPHFFNEP-------TSSCSSTSSSS

GEV60359.1 probable prolyl 4-hydroxylase 6 [Tanacetum cinerariifolium]; e value: 1.5e-137; % identity: 47.79
Query:  LLLDNKNMGGGSVSRMK-------PGGSSMA--VDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKA
        LLL N ++   S+ +++       P G S     DPTRVTQ+S  PRAFLY+ FL+ +E DHLI LAKDKLE SMVAD+ +G S+ S+ RTSSG FL KA
Subjt:  LLLDNKNMGGGSVSRMK-------PGGSSMA--VDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKA

Query:  QDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSE-QEKDDLSDCAKIGYG
        QDE+VAG+ES+I+AWTFLP++NGE +Q+L YENGQKYEPH+DYF D  N A+GGHRIATVLMYLS+V+KGGETVFP S++K S+ + K+D S+CAK GY 
Subjt:  QDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSE-QEKDDLSDCAKIGYG

Query:  VKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPD-CVDENVYCAMWAS------------------------------
        VKPKKGDALLFFSLHPNAT D+ S HGSCPVIEGEKWSATKWIHV++ D      D C DENV C  WA+                              
Subjt:  VKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPD-CVDENVYCAMWAS------------------------------

Query:  ---------------------------------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLI
                                                           AGLNRCGKSCRLRWTNYLRPDL+HDSFTP EE+LI++ HQAIGSRWSLI
Subjt:  ---------------------------------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLI

Query:  AKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSL--PNTPKPL------PSSF-----TTTMAKPHQPSTSSSPPAIANPPFPG
        A++LPGRTDNDVKN+WNTKL+KKLSKMGIDPITHKPF Q+L DYG I+ +  PNT +P       PS F     + TM  P+Q     +P ++    F  
Subjt:  AKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSL--PNTPKPL------PSSF-----TTTMAKPHQPSTSSSPPAIANPPFPG

Query:  TFRPHFFNEPTSSCSSTSSSSFNGG------GGGGDVLFHFAASSSPEQSH-GIMEPFSQSSDGFVCGAMGQRGCEGSSSSSFNSFVDALLEQDFEIKGS
               NE  SS +S+ ++S   G          D L  +       +S  GI+ P  ++ + +    +         +++  SFV+++L +D E++  
Subjt:  TFRPHFFNEPTSSCSSTSSSSFNGG------GGGGDVLFHFAASSSPEQSH-GIMEPFSQSSDGFVCGAMGQRGCEGSSSSSFNSFVDALLEQDFEIKGS

Query:  FPEILEGCFDY
        FP++L+G  +Y
Subjt:  FPEILEGCFDY

KAF3955258.1 hypothetical protein CMV_019505 [Castanea mollissima]; e value: 6.0e-163; % identity: 63.92
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  R FLA SLCFL  FP  S S  +LPG L DNK    GSV ++K G SS   DP+RVTQLS +PRAFLYKGFLS EE DHLINLA+DKLE SMVAD+ 
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G S++S+ RTSSG FL K QDE+VA IE++IAAWTFLPI+NGE +QVL Y +G+KYEPH+DYF D  N  +GGHR+ATVLMYLS+V+KGGETVFPNS+ 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDT------IWRNPDCVDENVYCAMWASAGLNRC
        K+S+ + DD+SDCAK GY VKPKKGDALLFFSL+P+AT D+ S HGSCPVIEGEKWSATKWIHV+S +       +    DCVDEN  C +WA AGLNRC
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDT------IWRNPDCVDENVYCAMWASAGLNRC

Query:  GKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----
        GKSCRLRWTNYLR DL+HD FTPQEE+LII LH+AIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDP+THKP+SQIL DYG IS + N     
Subjt:  GKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----

Query:  -----------TPKPLPSSFTTT----------MAKPHQPSTSSSPPAI--ANPPF------PGTFRPHFFNEPTSSCSSTSSSS
                   TPK  PSS  T+          M   + PS  +  P++   +  F        T +PHFFNE TSSCSS+SSSS
Subjt:  -----------TPKPLPSSFTTT----------MAKPHQPSTSSSPPAI--ANPPF------PGTFRPHFFNEPTSSCSSTSSSS

RXH69379.1 hypothetical protein DVH24_037163 [Malus domestica]; e value: 1.4e-156; % identity: 48.91
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  R FLA SLCFLC FP  + S  R+P LL   K    GSV R++ G SS   DPTRV+QLS +PRAFL+KGFLS EE DHLI +AKDKLE SMVAD+ 
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G S+ S+ RTSSG FLLKAQDEIVA IE++IAAWTFLP++NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHR+ATVLMYLS+V+KGGETVFPNS+ 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLD-------------------TIW----------
        KLS+ + DD+SDCAK GY VKP KGDALLFFSLHPNAT D SS HGSCPVIEGEKWSATKWIHV+S +                    +W          
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLD-------------------TIW----------

Query:  -----------------RNPDCVDE-NVYCAMWAS------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQA
                         + P C D+ NV   +W +                        AGLNRCGKSCRLRWTNYLRPDL+HD+FTPQEE  II LH+A
Subjt:  -----------------RNPDCVDE-NVYCAMWAS------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQA

Query:  IGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----------------TPKPLPSS------FTTTMAKPH
        IGSRWS IAKQLPGRTDNDVKNYWNTKL+KKLSKMGIDP+THKP+SQIL DYG IS LP                   P+P PSS      +T  M  P+
Subjt:  IGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----------------TPKPLPSS------FTTTMAKPH

Query:  ---QPSTSSSPPAIANPPFPG--------TFRPHFFNEPTSSCSST----------------------------SSSSFNGGGGGGDVLFH--------F
           Q S  +S  +  +  F            +PHF NE TSSCSS+                            SSS FN      + L H        F
Subjt:  ---QPSTSSSPPAIANPPFPG--------TFRPHFFNEPTSSCSST----------------------------SSSSFNGGGGGGDVLFH--------F

Query:  AASSSPEQSHGIMEPFSQSSDGFVCGAMGQ----RGC------------EGSSSSSFNSFVDALLEQDFEIKGSFPEILEGCFDY
        A++   +Q   + +    + +    G  G      GC            + +SSSS +SFV+ +L+QD E++ +FP++L+  FDY
Subjt:  AASSSPEQSHGIMEPFSQSSDGFVCGAMGQ----RGC------------EGSSSSSFNSFVDALLEQDFEIKGSFPEILEGCFDY

RXH89486.1 hypothetical protein DVH24_031843 [Malus domestica]; e value: 3.7e-152; % identity: 51.69
Query:  FLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASV
        FLA SLCFLC     + S   +PG L + K    GSV R + G SS   DPTRV+QL+ +PRAFL+KGFLS EE DHLI +AK+KLE SMVAD+ +G S+
Subjt:  FLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASV

Query:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ
         S+ RTSSG FLLKAQDE+VA IE++IAAWTFLPI+NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHRIATVLMYLS+V+KGGETVFPNS+ KLS+ 
Subjt:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ

Query:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWA--------------
        + DD SDCAK GY VKP KGDAL+FFSLHPN T D SS HGSCPVIEGEKWSATKWIHV+S +           C DEN  C +WA              
Subjt:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWA--------------

Query:  -----------------------------------------------------SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRW
                                                              AGLNRCGKSCRLRWTNYLRPDL+HD+FTPQEE+ II LH+AIGSRW
Subjt:  -----------------------------------------------------SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRW

Query:  SLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTTMAKPHQPSTSSSPPAIANPPFPG-TFRPHF--
        S IAK+LPGRTDNDVKNYWNTKL+KKLSKMGIDP+THKP+SQIL DYG IS LP+T    P SS          P  +SS   I   P+   T  P+   
Subjt:  SLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTTMAKPHQPSTSSSPPAIANPPFPG-TFRPHF--

Query:  --FNEPTSSC-------SSTSSSSFN-------GGGGGGDVL-----FHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCE----GSSSSSFNSFVD
          F++PT++C        + SSSSFN             D L      H    SSPE     ++  S + +G  C  +GQR  +     +SS+S  SFV+
Subjt:  --FNEPTSSC-------SSTSSSSFN-------GGGGGGDVL-----FHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCE----GSSSSSFNSFVD

Query:  ALLEQDFEIKGSFPEILEGCFDY
         +L+QD E++ +FP++L+  FDY
Subjt:  ALLEQDFEIKGSFPEILEGCFDY

TrEMBL top hits (e value, % identity, alignment)
A0A498HEC5 Procollagen-proline 4-dioxygenase; e value: 7.0e-157; % identity: 48.91
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  R FLA SLCFLC FP  + S  R+P LL   K    GSV R++ G SS   DPTRV+QLS +PRAFL+KGFLS EE DHLI +AKDKLE SMVAD+ 
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G S+ S+ RTSSG FLLKAQDEIVA IE++IAAWTFLP++NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHR+ATVLMYLS+V+KGGETVFPNS+ 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLD-------------------TIW----------
        KLS+ + DD+SDCAK GY VKP KGDALLFFSLHPNAT D SS HGSCPVIEGEKWSATKWIHV+S +                    +W          
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLD-------------------TIW----------

Query:  -----------------RNPDCVDE-NVYCAMWAS------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQA
                         + P C D+ NV   +W +                        AGLNRCGKSCRLRWTNYLRPDL+HD+FTPQEE  II LH+A
Subjt:  -----------------RNPDCVDE-NVYCAMWAS------------------------AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQA

Query:  IGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----------------TPKPLPSS------FTTTMAKPH
        IGSRWS IAKQLPGRTDNDVKNYWNTKL+KKLSKMGIDP+THKP+SQIL DYG IS LP                   P+P PSS      +T  M  P+
Subjt:  IGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPN-----------------TPKPLPSS------FTTTMAKPH

Query:  ---QPSTSSSPPAIANPPFPG--------TFRPHFFNEPTSSCSST----------------------------SSSSFNGGGGGGDVLFH--------F
           Q S  +S  +  +  F            +PHF NE TSSCSS+                            SSS FN      + L H        F
Subjt:  ---QPSTSSSPPAIANPPFPG--------TFRPHFFNEPTSSCSST----------------------------SSSSFNGGGGGGDVLFH--------F

Query:  AASSSPEQSHGIMEPFSQSSDGFVCGAMGQ----RGC------------EGSSSSSFNSFVDALLEQDFEIKGSFPEILEGCFDY
        A++   +Q   + +    + +    G  G      GC            + +SSSS +SFV+ +L+QD E++ +FP++L+  FDY
Subjt:  AASSSPEQSHGIMEPFSQSSDGFVCGAMGQ----RGC------------EGSSSSSFNSFVDALLEQDFEIKGSFPEILEGCFDY

A0A498J2T3 Procollagen-proline 4-dioxygenase; e value: 1.8e-152; % identity: 51.69
Query:  FLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASV
        FLA SLCFLC     + S   +PG L + K    GSV R + G SS   DPTRV+QL+ +PRAFL+KGFLS EE DHLI +AK+KLE SMVAD+ +G S+
Subjt:  FLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASV

Query:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ
         S+ RTSSG FLLKAQDE+VA IE++IAAWTFLPI+NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHRIATVLMYLS+V+KGGETVFPNS+ KLS+ 
Subjt:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ

Query:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWA--------------
        + DD SDCAK GY VKP KGDAL+FFSLHPN T D SS HGSCPVIEGEKWSATKWIHV+S +           C DEN  C +WA              
Subjt:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWA--------------

Query:  -----------------------------------------------------SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRW
                                                              AGLNRCGKSCRLRWTNYLRPDL+HD+FTPQEE+ II LH+AIGSRW
Subjt:  -----------------------------------------------------SAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRW

Query:  SLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTTMAKPHQPSTSSSPPAIANPPFPG-TFRPHF--
        S IAK+LPGRTDNDVKNYWNTKL+KKLSKMGIDP+THKP+SQIL DYG IS LP+T    P SS          P  +SS   I   P+   T  P+   
Subjt:  SLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTTMAKPHQPSTSSSPPAIANPPFPG-TFRPHF--

Query:  --FNEPTSSC-------SSTSSSSFN-------GGGGGGDVL-----FHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCE----GSSSSSFNSFVD
          F++PT++C        + SSSSFN             D L      H    SSPE     ++  S + +G  C  +GQR  +     +SS+S  SFV+
Subjt:  --FNEPTSSC-------SSTSSSSFN-------GGGGGGDVL-----FHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCE----GSSSSSFNSFVD

Query:  ALLEQDFEIKGSFPEILEGCFDY
         +L+QD E++ +FP++L+  FDY
Subjt:  ALLEQDFEIKGSFPEILEGCFDY

A0A5C7HPP5 Uncharacterized protein; e value: 5.7e-135; % identity: 53.49
Query:  FLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASV
        FLA SLCF   F P   S+ ++PG L D +    GSVSR+K    S+  +PTRVTQLS  PRAFLYKGFLS EE DHLI+LAKDKLE SMVAD+ +G SV
Subjt:  FLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASV

Query:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ
         S+ RTSSG FL KAQDE+VA IE +IAAWTFLPI+NGE IQ+L YENGQKYEPH+DYF D VN  +GGHR+ TVLMYLS V+KGGET+FPNS+ K+S+ 
Subjt:  VSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQ

Query:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDT--IW--------------------------------
        + +  S+CA+ GY VK +KGDALLF+SLHP+AT D  S HGSCPVIEGEKWSATKWIH  +  T  IW                                
Subjt:  EKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDT--IW--------------------------------

Query:  ----------------------------------------------------------RNPDCVDE-NVYCAMWA------------------------S
                                                                    P C D+ NV   +W                          
Subjt:  ----------------------------------------------------------RNPDCVDE-NVYCAMWA------------------------S

Query:  AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLP
        AGLNRCGKSCRLRWTNYLRPDL+HD FTP+EE+ II LH+AIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKL K+GIDPITHKPFSQI  D+G IS L 
Subjt:  AGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLP

Query:  N
        N
Subjt:  N

A0A5N5GEN8 Prolyl 4-hydroxylase subunit alpha-1-like; e value: 1.0e-136; % identity: 50.83
Query:  LAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVV
        LA SLCFLC FP  + S  R+PG  LD K    GSV R++ G SS   DPTRV+QLS +PRAFLYKGFLS EE DHLI +AKDKLE SMVAD+ +G S+ 
Subjt:  LAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVV

Query:  SKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQE
        S+ RTSSG FLLKAQDE+VA IE++IAAWTFLP++NGE IQ+L YE+GQKYEPH+DYF D  N  +GGHRIATVLMYLS+V+KGGETVFPNS+ KLS+ +
Subjt:  SKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQE

Query:  KDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWASAGLNRCGKSCRLRW
         DD SDCA+ GY VKP KGDAL+FFSLHPNAT D +S HGSCPVIEGEKWSATKWIHV+S +           C DEN  C +WA A             
Subjt:  KDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW----RNPDCVDENVYCAMWASAGLNRCGKSCRLRW

Query:  TNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTT
              DL+HD+FTPQEE+ II LH+AI S           RTDNDVKNYWNTKL+KK SKMGIDP+THKP+SQIL DYG IS LP+T    P SSF   
Subjt:  TNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLP-SSFTTT

Query:  MAKPHQPSTSSS-------PPAIANPPFPG------------------------TFRPHFFNEPTSSCSST----------------------------S
              P  +SS          I NP   G                          +PH  NE TSSCSS+                            S
Subjt:  MAKPHQPSTSSS-------PPAIANPPFPG------------------------TFRPHFFNEPTSSCSST----------------------------S

Query:  SSSFNGGGGGGDVLFHFAASSSPEQS-HGIMEPFSQ--------SSDGFV-CGAMGQRGC----EGSSSSSFNSFVDALLEQDFEIKGSFPEILEGCFDY
        SSSFN         F  A     EQ  HG+M   S+        SSD    C  +GQR        +SS+S  SFV+ +L+QD E++ +FP++L+  FDY
Subjt:  SSSFNGGGGGGDVLFHFAASSSPEQS-HGIMEPFSQ--------SSDGFV-CGAMGQRGC----EGSSSSSFNSFVDALLEQDFEIKGSFPEILEGCFDY

A0A7G2ERG3 Procollagen-proline 4-dioxygenase; e value: 8.6e-139; % identity: 52.37
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  R+FLAFSLCFL   P  S + NR    L  + N   GSV +MK   SS   DPTRVTQLS  PR FLY+GFLS EE DH I LAK KLE SMVAD+ 
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G SV S+ RTSSG FL K QD+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGETVFP  + 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASA----------
        K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA A          
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASA----------

Query:  ---------------------------------------------------GLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIA
                                                           GLNRCGKSCRLRWTNYLRPDL+HDSF+ QEE+LII+ H+AIGSRWS IA
Subjt:  ---------------------------------------------------GLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAIGSRWSLIA

Query:  KQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNT---PKPLPSSFTT----------TMAKPHQPSTSSSPPAIANPPFPGT
        ++LPGRTDNDVKN+WNTKL+KKL KMGIDP+THKP SQ+L ++  IS   N     +P  +S  T               H+   ++SP    N     T
Subjt:  KQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNT---PKPLPSSFTT----------TMAKPHQPSTSSSPPAIANPPFPGT

Query:  FRPHFFNEP-------TSSCSSTSSSS
           HF++ P       TSSCSS+SSS+
Subjt:  FRPHFFNEP-------TSSCSSTSSSS

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6; e value: 2.0e-84; % identity: 54.6
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  + FLAFSL  L  F   S                             S +VDPTR+TQLS  PRAFLYKGFLS EE DHLI LAK KLE SMV  DV
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  -TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQ
         +G S  S+ RTSSG FL K QD+IVA +E+K+AAWTFLP +NGE +Q+L YENGQKY+PH+DYF D   + +GGHRIATVLMYLS+V KGGETVFPN +
Subjt:  -TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQ

Query:  VKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAG---------
         K  + + D  S CAK GY VKP+KGDALLFF+LH N T D +S HGSCPVIEGEKWSAT+WIHV+S     +   CVD++  C  WA AG         
Subjt:  VKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAG---------

Query:  ------LNRCGKSCR
              L  C KSC+
Subjt:  ------LNRCGKSCR

F4JAU3 Prolyl 4-hydroxylase 2; e value: 5.4e-82; % identity: 60.96
Query:  SMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLR
        S  ++P++V Q+SS+PRAF+Y+GFL+  E DHLI+LAK+ L+ S VAD+  G S VS  RTSSGTF+ K +D IV+GIE K++ WTFLP +NGE +QVLR
Subjt:  SMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLR

Query:  YENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGS
        YE+GQKY+ H+DYF D VNIA GGHRIATVL+YLS+V KGGETVFP++Q    +   + KDDLSDCAK G  VKPKKG+ALLFF+L  +A PD  S HG 
Subjt:  YENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGS

Query:  CPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAGLNRCGKS
        CPVIEGEKWSATKWIHV S D I   + +C D N  C  WA  G   CGK+
Subjt:  CPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAGLNRCGKS

Q8L970 Probable prolyl 4-hydroxylase 7; e value: 4.9e-99; % identity: 62.2
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  R+FLAFSLCFL   P  S + NR    L  + N   GSV +MK   SS   DPTRVTQLS  PR FLY+GFLS EE DH I LAK KLE SMVAD+ 
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G SV S+ RTSSG FL K QD+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGETVFP  + 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG
        K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA AG
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG

Q8LAN3 Probable prolyl 4-hydroxylase 4; e value: 1.9e-82; % identity: 60.82
Query:  SSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVL
        SS+ V+P++V Q+SS+PRAF+Y+GFL+  E DH+++LAK  L+ S VAD+ +G S  S+ RTSSGTF+ K +D IV+GIE KI+ WTFLP +NGE IQVL
Subjt:  SSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVL

Query:  RYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHG
        RYE+GQKY+ H+DYF D VNI  GGHR+AT+LMYLS+V KGGETVFP++++   ++  + K+DLSDCAK G  VKP+KGDALLFF+LHP+A PD  S HG
Subjt:  RYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHG

Query:  SCPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAG
         CPVIEGEKWSATKWIHV S D I   + +C D N  C  WA  G
Subjt:  SCPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAG

Q9LN20 Probable prolyl 4-hydroxylase 3; e value: 6.9e-61; % identity: 56.86
Query:  LSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHY
        LS +PRAF+Y  FLS EE ++LI+LAK  +  S V D  TG S  S+ RTSSGTFL + +D+I+  IE +IA +TF+P D+GE +QVL YE GQKYEPHY
Subjt:  LSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHY

Query:  DYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQE-KDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATK
        DYF+D  N   GG R+AT+LMYLS V++GGETVFP + +  S     ++LS+C K G  VKP+ GDALLF+S+ P+AT D +S HG CPVI G KWS+TK
Subjt:  DYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQE-KDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATK

Query:  WIHV
        W+HV
Subjt:  WIHV

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2; e value: 3.9e-83; % identity: 60.96
Query:  SMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLR
        S  ++P++V Q+SS+PRAF+Y+GFL+  E DHLI+LAK+ L+ S VAD+  G S VS  RTSSGTF+ K +D IV+GIE K++ WTFLP +NGE +QVLR
Subjt:  SMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLR

Query:  YENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGS
        YE+GQKY+ H+DYF D VNIA GGHRIATVL+YLS+V KGGETVFP++Q    +   + KDDLSDCAK G  VKPKKG+ALLFF+L  +A PD  S HG 
Subjt:  YENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGS

Query:  CPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAGLNRCGKS
        CPVIEGEKWSATKWIHV S D I   + +C D N  C  WA  G   CGK+
Subjt:  CPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAGLNRCGKS

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase; e value: 3.5e-100; % identity: 62.2
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  R+FLAFSLCFL   P  S + NR    L  + N   GSV +MK   SS   DPTRVTQLS  PR FLY+GFLS EE DH I LAK KLE SMVAD+ 
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV
        +G SV S+ RTSSG FL K QD+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGETVFP  + 
Subjt:  TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV

Query:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG
        K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA AG
Subjt:  KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase; e value: 3.7e-94; % identity: 58.53
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  R+FLAFSLCFL   P  S + NR    L  + N   GSV +MK   SS   DPTRVTQLS  PR FLY+GFLS EE DH I LAK KLE SMVAD+ 
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  TGASVVSKERTS----SGTFLLKAQ----DEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGE
        +G SV S++  S    S +F+        D+IV+ +E+K+AAWTFLP +NGE +Q+L YENGQKYEPH+DYF D  N+ +GGHRIATVLMYLS+V+KGGE
Subjt:  TGASVVSKERTS----SGTFLLKAQ----DEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGE

Query:  TVFPNSQVKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG
        TVFP  + K ++ + D  ++CAK GY VKP+KGDALLFF+LHPNAT DS+S HGSCPV+EGEKWSAT+WIHV+S +  + +   C+DENV C  WA AG
Subjt:  TVFPNSQVKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIW-RNPDCVDENVYCAMWASAG

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase; e value: 1.4e-85; % identity: 54.6
Query:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV
        M  + FLAFSL  L  F   S                             S +VDPTR+TQLS  PRAFLYKGFLS EE DHLI LAK KLE SMV  DV
Subjt:  MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDV

Query:  -TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQ
         +G S  S+ RTSSG FL K QD+IVA +E+K+AAWTFLP +NGE +Q+L YENGQKY+PH+DYF D   + +GGHRIATVLMYLS+V KGGETVFPN +
Subjt:  -TGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQ

Query:  VKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAG---------
         K  + + D  S CAK GY VKP+KGDALLFF+LH N T D +S HGSCPVIEGEKWSAT+WIHV+S     +   CVD++  C  WA AG         
Subjt:  VKLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAG---------

Query:  ------LNRCGKSCR
              L  C KSC+
Subjt:  ------LNRCGKSCR

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e value: 1.3e-83; % identity: 60.82
Query:  SSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVL
        SS+ V+P++V Q+SS+PRAF+Y+GFL+  E DH+++LAK  L+ S VAD+ +G S  S+ RTSSGTF+ K +D IV+GIE KI+ WTFLP +NGE IQVL
Subjt:  SSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKERTSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVL

Query:  RYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHG
        RYE+GQKY+ H+DYF D VNI  GGHR+AT+LMYLS+V KGGETVFP++++   ++  + K+DLSDCAK G  VKP+KGDALLFF+LHP+A PD  S HG
Subjt:  RYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQV---KLSEQEKDDLSDCAKIGYGVKPKKGDALLFFSLHPNATPDSSSYHG

Query:  SCPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAG
         CPVIEGEKWSATKWIHV S D I   + +C D N  C  WA  G
Subjt:  SCPVIEGEKWSATKWIHVQSLDTI-WRNPDCVDENVYCAMWASAG


Sequences
CDS sequence
ATGGGCTGTCGGCTTTTTCTCGCATTTTCCCTCTGTTTCCTCTGCTTCTTCCCTCCCTTTTCTCGCTCTGCCAATCGCTTGCCGGGATTGCTCCTAGACAACAAGAACAT
GGGAGGAGGGTCTGTTAGTAGGATGAAACCAGGTGGTTCCTCCATGGCAGTCGATCCCACTCGTGTCACTCAGCTTTCATCGCAACCCAGGGCTTTCTTATATAAGGGAT
TTTTGTCTGCAGAGGAGAGCGATCATCTTATCAATTTGGCGAAGGATAAGCTTGAGATATCAATGGTGGCCGATGATGTAACGGGTGCGAGTGTTGTGAGTAAAGAACGA
ACGAGTAGCGGCACGTTTCTTCTCAAGGCTCAGGACGAAATAGTTGCAGGCATCGAGTCCAAGATTGCTGCATGGACCTTCCTTCCCATCGATAATGGGGAGCCTATTCA
AGTACTGAGGTACGAGAACGGTCAGAAATATGAGCCACATTACGATTATTTTCTAGACCCAGTTAATATAGCTGTTGGCGGTCACCGGATCGCCACAGTCTTGATGTATT
TGTCCCATGTCAAAAAGGGTGGAGAAACTGTCTTTCCCAATTCTCAGGTTAAATTATCCGAGCAGGAGAAGGATGACTTGTCCGATTGTGCTAAGATTGGTTATGGAGTA
AAACCAAAGAAGGGTGATGCATTGCTGTTCTTCAGTCTGCATCCAAATGCGACGCCAGACTCGAGCAGCTATCATGGGAGCTGCCCGGTGATAGAGGGCGAGAAGTGGTC
TGCGACGAAATGGATTCACGTGCAATCACTCGATACGATTTGGAGGAATCCAGATTGTGTGGATGAGAATGTGTACTGCGCTATGTGGGCTAGTGCAGGGCTGAATAGAT
GTGGGAAGAGTTGCAGGCTGAGATGGACTAATTACCTCAGGCCTGATCTCAGGCATGACAGCTTCACTCCTCAAGAAGAAGACCTCATTATCAAACTCCATCAAGCCATT
GGAAGCAGGTGGTCTCTGATAGCAAAGCAACTTCCTGGAAGAACAGACAATGATGTGAAGAATTACTGGAACACAAAGCTGAGGAAGAAGCTTTCAAAGATGGGAATTGA
TCCAATAACTCACAAGCCATTCTCTCAGATCCTCTTCGACTATGGAACCATAAGCAGCCTCCCCAACACCCCAAAACCACTTCCCAGCTCCTTCACCACAACAATGGCGA
AACCCCACCAGCCATCTACTTCTTCTTCACCTCCCGCCATAGCCAACCCTCCATTTCCAGGGACCTTTCGACCACACTTTTTCAATGAACCAACCTCCTCCTGCTCATCC
ACTTCATCTTCTTCCTTCAATGGCGGCGGCGGCGGCGGCGACGTTCTCTTCCATTTCGCTGCGTCTTCTTCGCCGGAGCAGAGCCATGGAATAATGGAGCCGTTTTCGCA
GAGCTCAGATGGGTTTGTTTGTGGAGCCATGGGCCAACGGGGATGTGAAGGTTCTTCTAGTTCGTCTTTCAATTCGTTTGTGGATGCTCTGTTGGAGCAGGATTTCGAGA
TTAAGGGTTCGTTTCCGGAGATTTTGGAGGGGTGTTTTGATTACTGA
mRNA sequence
ATGGGCTGTCGGCTTTTTCTCGCATTTTCCCTCTGTTTCCTCTGCTTCTTCCCTCCCTTTTCTCGCTCTGCCAATCGCTTGCCGGGATTGCTCCTAGACAACAAGAACAT
GGGAGGAGGGTCTGTTAGTAGGATGAAACCAGGTGGTTCCTCCATGGCAGTCGATCCCACTCGTGTCACTCAGCTTTCATCGCAACCCAGGGCTTTCTTATATAAGGGAT
TTTTGTCTGCAGAGGAGAGCGATCATCTTATCAATTTGGCGAAGGATAAGCTTGAGATATCAATGGTGGCCGATGATGTAACGGGTGCGAGTGTTGTGAGTAAAGAACGA
ACGAGTAGCGGCACGTTTCTTCTCAAGGCTCAGGACGAAATAGTTGCAGGCATCGAGTCCAAGATTGCTGCATGGACCTTCCTTCCCATCGATAATGGGGAGCCTATTCA
AGTACTGAGGTACGAGAACGGTCAGAAATATGAGCCACATTACGATTATTTTCTAGACCCAGTTAATATAGCTGTTGGCGGTCACCGGATCGCCACAGTCTTGATGTATT
TGTCCCATGTCAAAAAGGGTGGAGAAACTGTCTTTCCCAATTCTCAGGTTAAATTATCCGAGCAGGAGAAGGATGACTTGTCCGATTGTGCTAAGATTGGTTATGGAGTA
AAACCAAAGAAGGGTGATGCATTGCTGTTCTTCAGTCTGCATCCAAATGCGACGCCAGACTCGAGCAGCTATCATGGGAGCTGCCCGGTGATAGAGGGCGAGAAGTGGTC
TGCGACGAAATGGATTCACGTGCAATCACTCGATACGATTTGGAGGAATCCAGATTGTGTGGATGAGAATGTGTACTGCGCTATGTGGGCTAGTGCAGGGCTGAATAGAT
GTGGGAAGAGTTGCAGGCTGAGATGGACTAATTACCTCAGGCCTGATCTCAGGCATGACAGCTTCACTCCTCAAGAAGAAGACCTCATTATCAAACTCCATCAAGCCATT
GGAAGCAGGTGGTCTCTGATAGCAAAGCAACTTCCTGGAAGAACAGACAATGATGTGAAGAATTACTGGAACACAAAGCTGAGGAAGAAGCTTTCAAAGATGGGAATTGA
TCCAATAACTCACAAGCCATTCTCTCAGATCCTCTTCGACTATGGAACCATAAGCAGCCTCCCCAACACCCCAAAACCACTTCCCAGCTCCTTCACCACAACAATGGCGA
AACCCCACCAGCCATCTACTTCTTCTTCACCTCCCGCCATAGCCAACCCTCCATTTCCAGGGACCTTTCGACCACACTTTTTCAATGAACCAACCTCCTCCTGCTCATCC
ACTTCATCTTCTTCCTTCAATGGCGGCGGCGGCGGCGGCGACGTTCTCTTCCATTTCGCTGCGTCTTCTTCGCCGGAGCAGAGCCATGGAATAATGGAGCCGTTTTCGCA
GAGCTCAGATGGGTTTGTTTGTGGAGCCATGGGCCAACGGGGATGTGAAGGTTCTTCTAGTTCGTCTTTCAATTCGTTTGTGGATGCTCTGTTGGAGCAGGATTTCGAGA
TTAAGGGTTCGTTTCCGGAGATTTTGGAGGGGTGTTTTGATTACTGA
Protein sequence
MGCRLFLAFSLCFLCFFPPFSRSANRLPGLLLDNKNMGGGSVSRMKPGGSSMAVDPTRVTQLSSQPRAFLYKGFLSAEESDHLINLAKDKLEISMVADDVTGASVVSKER
TSSGTFLLKAQDEIVAGIESKIAAWTFLPIDNGEPIQVLRYENGQKYEPHYDYFLDPVNIAVGGHRIATVLMYLSHVKKGGETVFPNSQVKLSEQEKDDLSDCAKIGYGV
KPKKGDALLFFSLHPNATPDSSSYHGSCPVIEGEKWSATKWIHVQSLDTIWRNPDCVDENVYCAMWASAGLNRCGKSCRLRWTNYLRPDLRHDSFTPQEEDLIIKLHQAI
GSRWSLIAKQLPGRTDNDVKNYWNTKLRKKLSKMGIDPITHKPFSQILFDYGTISSLPNTPKPLPSSFTTTMAKPHQPSTSSSPPAIANPPFPGTFRPHFFNEPTSSCSS
TSSSSFNGGGGGGDVLFHFAASSSPEQSHGIMEPFSQSSDGFVCGAMGQRGCEGSSSSSFNSFVDALLEQDFEIKGSFPEILEGCFDY