; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G011490 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G011490
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationCmo_Chr15:8023707..8032693
RNA-Seq ExpressionCmoCh15G011490
SyntenyCmoCh15G011490
Gene Ontology termsGO:0016042 - lipid catabolic process (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016020 - membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR002921 - Fungal lipase-like domain
IPR003582 - ShKT domain
IPR005592 - Mono-/di-acylglycerol lipase, N-terminal
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4271435.1 unnamed protein product [Prunus armeniaca]6.5e-28159.9Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RW WKRCLHTAGHDSETWG ATA+EFEP+PR+CRYILAVYEDD+R+PLWEP GGYGI PDWLILKKTY+DT+G+APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDHDHADIVLA RGLNLA+ESDYAVL+DNKLGK KFDGGYVHNGLLKAA WVLD+E E LKDLV+KYP+YTLTF GHSLGSGVAA+LT+VVVQ+ ++L N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ        TPLEDIFKSLFCLPCLLC+ C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFR G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LMLE +++MEIPP+QKMERQ TLA+EHTEEY+AALQRAVTLAVPHAY+ S YGTF  
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS
            E+++EE S  SSG SS                                                                  FS++  S      S
Subjt:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS

Query:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDK--
                                         RK LR +  N     H  +S HS RIDPSR VQ+SW+PR FLY+GFLSDEECDHL++LA   E+   
Subjt:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDK--

Query:  PSRNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILF
           ++ G+ NT+  +   +    LN  D+I+ RIE RI+ WTFLPK++S   Q+ + G EEA  +  FFGN+S +  SEPL+ATV+LY+S+   GGEILF
Subjt:  PSRNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILF

Query:  PVSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECK
        P S+++   WSD  K ++ L+P KGNA+LFF++  NASPDKS  HSR P+L+G++W ATKF Y +  A G E         +C DED++CP WA+IGEC+
Subjt:  PVSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECK

Query:  RNAVFMIGSPDYYGTCRKSCNAC
        RN VFM+GSPDYYGTCRKSCN C
Subjt:  RNAVFMIGSPDYYGTCRKSCNAC

CAB4301873.1 unnamed protein product [Prunus armeniaca]5.0e-28159.93Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RW WKRCLHTAGHDSETWG ATA+EFEP+PR+CRYILAVYEDD+R+PLWEP GGYGI PDWLILKKTY+DT+G+APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDHDHADIVLA RGLNLA+ESDYAVL+DNKLGK KFDGGYVHNGLLKAA WVLD+E E LKDLV+KYP+YTLTF GHSLGSGVAA+LT+VVVQ+ ++L N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ        TPLEDIFKSLFCLPCLLC+ C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFR G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LMLE +++MEIPP+QKMERQ TLA+EHTEEY+AALQRAVTLAVPHAY+ S YGTF  
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS
            E+++EE S  SSG SS                                                                  FS++  S      S
Subjt:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS

Query:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS
                                         RK LR +  N     H  +S HS RIDPSR VQ+SW+PR FLY+GFLSDEECDHL++LA   E+   
Subjt:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS

Query:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV
               NT + +   +    LN  D+I+ RIE RI+ WTFLPK++S   Q+ + G EEA  +  FFGN+S +  SEPL+ATV+LY+S+   GGEILFP 
Subjt:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV

Query:  SKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRN
        S+++   WSD  K ++ L+P KGNA+LFF++  NASPDKS  HSR P+L+G++W ATKF Y +  A G E         +C DED++CP WA+IGEC+RN
Subjt:  SKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRN

Query:  AVFMIGSPDYYGTCRKSCNAC
         VFM+GSPDYYGTCRKSCN C
Subjt:  AVFMIGSPDYYGTCRKSCNAC

PON44192.1 Mono-/di-acylglycerol lipase [Trema orientale]1.9e-27760.46Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+CG+P+LECV CL CARW WKRCLHTAGHDSETWG ATA+EFEP+PRIC YILAVYEDD+R PLWEP  GYGINPDWL+LK+TY+DT+G+APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDHDHADIVLA RGLNLAKESDYAVLLDNKLGK KFDGGYVHNGLLKAAGWVL +E++ LKDLV+KYP+YTLTFAGHSLGSGVAA+LT+V VQN +KL N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCL C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFR G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALELMLE + +MEIP +Q+MERQ TLA+E +EEYKAALQRAVTLAVPHAY+ S YGTF +
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKR--KETWDELIERLYDKDDSRHD-KNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLF
          EG          SS GSSRK   KETWDELIERL+DKDDS H    E+ Y+  K      +   G HGF+        + ++ +    S   S     
Subjt:  TVEGEEEKEEESPASSGGSSRKR--KETWDELIERLYDKDDSRHD-KNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLF

Query:  GSSSHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS
         S+  P +       L   A F F + +    +    +GL   + N G                                           A     K S
Subjt:  GSSSHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS

Query:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV
        R+   S +T+  + L +SG      DDI+  IE RI+ WTFLPK++    Q++ Y  E++  +  +FGN S +   +PL+ATVVLYLS+   GG+ILFP 
Subjt:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV

Query:  SKVKRRFWSDRRKKN-NFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR
        S+VK + WSD  K + N  RP+KGNA+LFF+++ N + D S  H+R P+L+G++W ATKFF ++  A G + ++ES  +++C D+DE+CP WAA+GEC+R
Subjt:  SKVKRRFWSDRRKKN-NFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR

Query:  NAVFMIGSPDYYGTCRKSCNAC
        N VFM+GSPDYYGTCR+SCNAC
Subjt:  NAVFMIGSPDYYGTCRKSCNAC

PON51727.1 Mono-/di-acylglycerol lipase [Parasponia andersonii]2.2e-27359.61Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+CG+P+LECV CL CARW WKRCLHTAGHDSETWG ATA+EFEP+PRIC YILAVYEDD+R+PLWEP  GYGINPDWL+LK+TY+DT+G+APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDHDHADIVLA RGLNLAKESDYAVLLDNKLGK KFDGGYVHNGLLKAAGWVL +E++ LKDLV++YP+YTLTFAGHSLGSGVAA+LT+V VQN +KL N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCL C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFR G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALEL+ E + +MEIP +Q+MERQ TLA+E +EEYKAALQRAVTLAVPHAY+ S YGTF +
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKR--KETWDELIERLYDKDDSRH-DKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLF
          EG          SS  SSRK   KETWDELIERL+DKDDS H    E+ Y+  K          G H F+        + +  +    S   S     
Subjt:  TVEGEEEKEEESPASSGGSSRKR--KETWDELIERLYDKDDSRH-DKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLF

Query:  GSSSHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS
         S+  P ++      L   A F F + +    +    +GL   + N G                                           A     K S
Subjt:  GSSSHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS

Query:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV
        R+   S +T+    L +SG      DD++  IE RI+ WTFLPK++    Q++ Y  E++  +  +FGN S +  S+PL+ATVVLYLS+   GG+ILFP 
Subjt:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV

Query:  SKVKRRFWSDRRKKNNFL-RPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR
        S+VK + WSD  K +  + RP+KGNA+LFF+++ N + D S  H+R P+++G++W ATKFF ++  A G + ++ES   ++C D+DE+CP WAA+GEC+R
Subjt:  SKVKRRFWSDRRKKNNFL-RPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR

Query:  NAVFMIGSPDYYGTCRKSCNAC
        N VFM+GSPDYYGTCR+SCNAC
Subjt:  NAVFMIGSPDYYGTCRKSCNAC

RXH95088.1 hypothetical protein DVH24_024772 [Malus domestica]3.0e-27860.17Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+C  P+LECV CL C RW WKRCLHTAGHDSETWG +TA+EFEP+PR+CRYILAVYEDD+R PLWEP GGYGINPDWLILKKTY+DT G APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDH+HADIVLA RGLNLA+ESDYAVL+DNKLG+ KFDGGYVHNGLLK+A WV+D+E E LKDLV+ YP+YTLTFAGHSLGSGVAA+LT+VVV+N ++L +
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKR+R YAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIF     LPC+LCL C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFRCG
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LML+ + +MEIP +Q+MERQ TLA+EHTEEYKAALQRAVTLAVPHAY+ SPYGTF  
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS
            E+++E+ S  SSG                            ES +   KKS S + + RG                                 GS+
Subjt:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS

Query:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNN
        S  S+ S +  LL + ++F  SS       +   + +++ +++ GH   S HS RIDPSRVVQ+SWQPR        SDEECDHL++LA   EDK     
Subjt:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNN

Query:  AGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKV
            NT + + + +    L+  D+++ RIE RI+ WTFLPK++S   Q+  +G EE   +  +FGN+S +  +EPL+ATV+LYLS+   GGEILFP S++
Subjt:  AGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKV

Query:  KRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVF
          +  SD R+ ++ LRPVKGNA+LFF++H NASPDKS  H+R P+L+G++W ATKF + + A  G + + +SG   +C DED++CP+WA++GEC+RN VF
Subjt:  KRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVF

Query:  MIGSPDYYGTCRKSCN
        M+GSPDYYGTCRKSCN
Subjt:  MIGSPDYYGTCRKSCN

TrEMBL top hitse value%identityAlignment
A0A2P5B5Y0 Procollagen-proline 4-dioxygenase9.4e-27860.46Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+CG+P+LECV CL CARW WKRCLHTAGHDSETWG ATA+EFEP+PRIC YILAVYEDD+R PLWEP  GYGINPDWL+LK+TY+DT+G+APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDHDHADIVLA RGLNLAKESDYAVLLDNKLGK KFDGGYVHNGLLKAAGWVL +E++ LKDLV+KYP+YTLTFAGHSLGSGVAA+LT+V VQN +KL N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCL C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFR G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALELMLE + +MEIP +Q+MERQ TLA+E +EEYKAALQRAVTLAVPHAY+ S YGTF +
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKR--KETWDELIERLYDKDDSRHD-KNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLF
          EG          SS GSSRK   KETWDELIERL+DKDDS H    E+ Y+  K      +   G HGF+        + ++ +    S   S     
Subjt:  TVEGEEEKEEESPASSGGSSRKR--KETWDELIERLYDKDDSRHD-KNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLF

Query:  GSSSHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS
         S+  P +       L   A F F + +    +    +GL   + N G                                           A     K S
Subjt:  GSSSHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS

Query:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV
        R+   S +T+  + L +SG      DDI+  IE RI+ WTFLPK++    Q++ Y  E++  +  +FGN S +   +PL+ATVVLYLS+   GG+ILFP 
Subjt:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV

Query:  SKVKRRFWSDRRKKN-NFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR
        S+VK + WSD  K + N  RP+KGNA+LFF+++ N + D S  H+R P+L+G++W ATKFF ++  A G + ++ES  +++C D+DE+CP WAA+GEC+R
Subjt:  SKVKRRFWSDRRKKN-NFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR

Query:  NAVFMIGSPDYYGTCRKSCNAC
        N VFM+GSPDYYGTCR+SCNAC
Subjt:  NAVFMIGSPDYYGTCRKSCNAC

A0A2P5BSE2 Procollagen-proline 4-dioxygenase1.1e-27359.61Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+CG+P+LECV CL CARW WKRCLHTAGHDSETWG ATA+EFEP+PRIC YILAVYEDD+R+PLWEP  GYGINPDWL+LK+TY+DT+G+APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDHDHADIVLA RGLNLAKESDYAVLLDNKLGK KFDGGYVHNGLLKAAGWVL +E++ LKDLV++YP+YTLTFAGHSLGSGVAA+LT+V VQN +KL N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCL C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFR G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALEL+ E + +MEIP +Q+MERQ TLA+E +EEYKAALQRAVTLAVPHAY+ S YGTF +
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKR--KETWDELIERLYDKDDSRH-DKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLF
          EG          SS  SSRK   KETWDELIERL+DKDDS H    E+ Y+  K          G H F+        + +  +    S   S     
Subjt:  TVEGEEEKEEESPASSGGSSRKR--KETWDELIERLYDKDDSRH-DKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLF

Query:  GSSSHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS
         S+  P ++      L   A F F + +    +    +GL   + N G                                           A     K S
Subjt:  GSSSHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS

Query:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV
        R+   S +T+    L +SG      DD++  IE RI+ WTFLPK++    Q++ Y  E++  +  +FGN S +  S+PL+ATVVLYLS+   GG+ILFP 
Subjt:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV

Query:  SKVKRRFWSDRRKKNNFL-RPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR
        S+VK + WSD  K +  + RP+KGNA+LFF+++ N + D S  H+R P+++G++W ATKFF ++  A G + ++ES   ++C D+DE+CP WAA+GEC+R
Subjt:  SKVKRRFWSDRRKKNNFL-RPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR

Query:  NAVFMIGSPDYYGTCRKSCNAC
        N VFM+GSPDYYGTCR+SCNAC
Subjt:  NAVFMIGSPDYYGTCRKSCNAC

A0A498JHB5 Procollagen-proline 4-dioxygenase1.5e-27860.17Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+C  P+LECV CL C RW WKRCLHTAGHDSETWG +TA+EFEP+PR+CRYILAVYEDD+R PLWEP GGYGINPDWLILKKTY+DT G APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDH+HADIVLA RGLNLA+ESDYAVL+DNKLG+ KFDGGYVHNGLLK+A WV+D+E E LKDLV+ YP+YTLTFAGHSLGSGVAA+LT+VVV+N ++L +
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKR+R YAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIF     LPC+LCL C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFRCG
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LML+ + +MEIP +Q+MERQ TLA+EHTEEYKAALQRAVTLAVPHAY+ SPYGTF  
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS
            E+++E+ S  SSG                            ES +   KKS S + + RG                                 GS+
Subjt:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS

Query:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNN
        S  S+ S +  LL + ++F  SS       +   + +++ +++ GH   S HS RIDPSRVVQ+SWQPR        SDEECDHL++LA   EDK     
Subjt:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNN

Query:  AGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKV
            NT + + + +    L+  D+++ RIE RI+ WTFLPK++S   Q+  +G EE   +  +FGN+S +  +EPL+ATV+LYLS+   GGEILFP S++
Subjt:  AGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKV

Query:  KRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVF
          +  SD R+ ++ LRPVKGNA+LFF++H NASPDKS  H+R P+L+G++W ATKF + + A  G + + +SG   +C DED++CP+WA++GEC+RN VF
Subjt:  KRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVF

Query:  MIGSPDYYGTCRKSCN
        M+GSPDYYGTCRKSCN
Subjt:  MIGSPDYYGTCRKSCN

A0A6J5U8N9 Procollagen-proline 4-dioxygenase3.1e-28159.9Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RW WKRCLHTAGHDSETWG ATA+EFEP+PR+CRYILAVYEDD+R+PLWEP GGYGI PDWLILKKTY+DT+G+APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDHDHADIVLA RGLNLA+ESDYAVL+DNKLGK KFDGGYVHNGLLKAA WVLD+E E LKDLV+KYP+YTLTF GHSLGSGVAA+LT+VVVQ+ ++L N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ        TPLEDIFKSLFCLPCLLC+ C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFR G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LMLE +++MEIPP+QKMERQ TLA+EHTEEY+AALQRAVTLAVPHAY+ S YGTF  
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS
            E+++EE S  SSG SS                                                                  FS++  S      S
Subjt:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS

Query:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDK--
                                         RK LR +  N     H  +S HS RIDPSR VQ+SW+PR FLY+GFLSDEECDHL++LA   E+   
Subjt:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDK--

Query:  PSRNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILF
           ++ G+ NT+  +   +    LN  D+I+ RIE RI+ WTFLPK++S   Q+ + G EEA  +  FFGN+S +  SEPL+ATV+LY+S+   GGEILF
Subjt:  PSRNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILF

Query:  PVSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECK
        P S+++   WSD  K ++ L+P KGNA+LFF++  NASPDKS  HSR P+L+G++W ATKF Y +  A G E         +C DED++CP WA+IGEC+
Subjt:  PVSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECK

Query:  RNAVFMIGSPDYYGTCRKSCNAC
        RN VFM+GSPDYYGTCRKSCN C
Subjt:  RNAVFMIGSPDYYGTCRKSCNAC

A0A6J5WND9 Procollagen-proline 4-dioxygenase2.4e-28159.93Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+CG P++ECV CL C RW WKRCLHTAGHDSETWG ATA+EFEP+PR+CRYILAVYEDD+R+PLWEP GGYGI PDWLILKKTY+DT+G+APPYILY
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDHDHADIVLA RGLNLA+ESDYAVL+DNKLGK KFDGGYVHNGLLKAA WVLD+E E LKDLV+KYP+YTLTF GHSLGSGVAA+LT+VVVQ+ ++L N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ        TPLEDIFKSLFCLPCLLC+ C+RDTCI E+KMLKDPRRLY PGRLYHIVERKPFR G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LMLE +++MEIPP+QKMERQ TLA+EHTEEY+AALQRAVTLAVPHAY+ S YGTF  
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS
            E+++EE S  SSG SS                                                                  FS++  S      S
Subjt:  TVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSS

Query:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS
                                         RK LR +  N     H  +S HS RIDPSR VQ+SW+PR FLY+GFLSDEECDHL++LA   E+   
Subjt:  SHPSMDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS

Query:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV
               NT + +   +    LN  D+I+ RIE RI+ WTFLPK++S   Q+ + G EEA  +  FFGN+S +  SEPL+ATV+LY+S+   GGEILFP 
Subjt:  RNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV

Query:  SKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRN
        S+++   WSD  K ++ L+P KGNA+LFF++  NASPDKS  HSR P+L+G++W ATKF Y +  A G E         +C DED++CP WA+IGEC+RN
Subjt:  SKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRN

Query:  AVFMIGSPDYYGTCRKSCNAC
         VFM+GSPDYYGTCRKSCN C
Subjt:  AVFMIGSPDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 62.8e-5338.99Show/hide
Query:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS-RNNAGSRNTVSTKFLGNSGAIL-NTTDDIIGRIENRIAVWTFLPKDHSMPF
        S+ S  +DP+R+ Q+SW PRAFLYKGFLSDEECDHLI LA    +K     +  S  +  ++   +SG  L    DDI+  +E ++A WTFLP+++    
Subjt:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS-RNNAGSRNTVSTKFLGNSGAIL-NTTDDIIGRIENRIAVWTFLPKDHSMPF

Query:  QIMKY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-----VSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHS
        QI+ Y  G++   H  +F ++ A+      +ATV++YLS+   GGE +FP       ++K   WS   K+   ++P KG+A+LFF++HLN + D +  H 
Subjt:  QIMKY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-----VSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHS

Query:  RTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC
          P+++G+ W AT++ ++R  + G +  V       C+D+ ESC +WA  GEC++N ++M+GS    G CRKSC AC
Subjt:  RTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 26.1e-4836.11Show/hide
Query:  MVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALA-SNSEDKPSRNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFL
        ++ S     S+ S  I+PS+V Q+S +PRAF+Y+GFL+D ECDHLI+LA  N +     +N    + VS     +   I    D I+  IE++++ WTFL
Subjt:  MVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALA-SNSEDKPSRNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIENRIAVWTFL

Query:  PKDHSMPFQIMKY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-VSKVKRRFWSDRR-------KKNNFLRPVKGNAVLFFSVHL
        PK++    Q+++Y  G++   H  +F ++  +      +ATV+LYLS+   GGE +FP   +  RR  S+ +       KK   ++P KGNA+LFF++  
Subjt:  PKDHSMPFQIMKY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-VSKVKRRFWSDRR-------KKNNFLRPVKGNAVLFFSVHL

Query:  NASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC
        +A PD    H   P+++G+ W ATK+ ++        H      D +C D +ESC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  NASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 127.4e-6242.95Show/hide
Query:  FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNS----GHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNT
        FL+L+    S S       S   RK LRD+ + S       SY   S+ +DP+RV+Q+SW PR FLY+GFLS+EECDHLI+L   + +  S +  G    
Subjt:  FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNS----GHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNT

Query:  VSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWS
                        D ++  IE +++ WTFLP ++    ++  Y  E++     +FG   +    E L+ATVVLYLS++  GGE+LFP S++K +  +
Subjt:  VSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWS

Query:  DRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPD
           +  N LRPVKGNA+LFF+  LNAS D    H R P++ G+L VATK  Y +  A       ESG   +C DEDE+C +WA +GECK+N V+MIGSPD
Subjt:  DRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPD

Query:  YYGTCRKSCNAC
        YYGTCRKSCNAC
Subjt:  YYGTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 74.1e-5236.02Show/hide
Query:  MDSRLTFLLLLAAAFSFSSCLAQSNSISGR-KGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGS
        MDSR+     L   F+     +  N    R    RD  V     S S+     DP+RV Q+SW PR FLY+GFLSDEECDH I LA    +K    +  S
Subjt:  MDSRLTFLLLLAAAFSFSSCLAQSNSISGR-KGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGS

Query:  RNTVSTKFLGNSGAILN-TTDDIIGRIENRIAVWTFLPKDHSMPFQIMKY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV----
          +V ++   +SG  L+   DDI+  +E ++A WTFLP+++    QI+ Y  G++   H  +F +++ +      +ATV++YLS+   GGE +FP+    
Subjt:  RNTVSTKFLGNSGAILN-TTDDIIGRIENRIAVWTFLPKDHSMPFQIMKY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV----

Query:  -SKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR
         +++K   W++  K+   ++P KG+A+LFF++H NA+ D +  H   P+++G+ W AT++ +++      E A        C+DE+ SC KWA  GEC++
Subjt:  -SKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKR

Query:  NAVFMIGSPDYYGTCRKSCNAC
        N  +M+GS   +G CRKSC AC
Subjt:  NAVFMIGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 42.2e-4534.41Show/hide
Query:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNT-TDDIIGRIENRIAVWTFLPKDHSMPFQ
        S+ S  ++PS+V Q+S +PRAF+Y+GFL++ ECDH+++LA  S  + +  +  S  +  ++   +SG  ++   D I+  IE++I+ WTFLPK++    Q
Subjt:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNT-TDDIIGRIENRIAVWTFLPKDHSMPFQ

Query:  IMKY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRR--------FWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCY
        +++Y  G++   H  +F ++  +      MAT+++YLS+   GGE +FP +++  R          SD  K+   ++P KG+A+LFF++H +A PD    
Subjt:  IMKY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRR--------FWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCY

Query:  HSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC
        H   P+++G+ W ATK+ ++    + +     SG   +C D +ESC +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  HSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC

Arabidopsis top hitse value%identityAlignment
AT3G49050.1 alpha/beta-Hydrolases superfamily protein3.3e-19872.9Show/hide
Query:  MSIICG-VPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYIL
        MSI+CG  P+LECV CLGCARW +KRCL+TAGHDSE WG AT DEFEP+PR CRYILAVYEDDIR PLWEP  GYGINPDWL+LKKTY+DT+GRAP YIL
Subjt:  MSIICG-VPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYIL

Query:  YLDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLE
        YLDH H DIV+AIRGLNLAKESDYA+LLDNKLG+ KFDGGYVHNGL+K+AG+VLD E + LK+LVKKYP YTLTFAGHSLGSGVA ML L+VV++ E+L 
Subjt:  YLDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLE

Query:  NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRC
        NIDRKR+RC+AIAPARCMSLNLAVRYADVINSV+LQDDFLPRTATPLEDIFKS+FCLPCLLC+ C++DTC+ E KMLKDPRRLY PGR+YHIVERKP R 
Subjt:  NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRC

Query:  GRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLS-PYGTF
        GR+PPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL LM+E+ K MEIP +Q+MERQ +LAREH  EY+AAL+RAVTL VPHA +++  YGTF
Subjt:  GRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLS-PYGTF

Query:  SETVEGE--------EEKEEESPA-------SSGGSS--------RKRKETWDELIERLYDKDDS
         +T E E        EE+EE++ +       SS  SS        R R+ +WDELIE L+++D+S
Subjt:  SETVEGE--------EEKEEESPA-------SSGGSS--------RKRKETWDELIERLYDKDDS

AT4G00500.1 alpha/beta-Hydrolases superfamily protein5.1e-16762.84Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+C VP+LECV CLGC  W+WK+CL++AGH+SE WG AT+DEFEPIPRICR ILAVYE+++  P+W P  GYGI+P+ +ILKK Y  T GR  PY++Y
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDH++ D+VLAIRGLNLAKE DYAVLLDNKLG+ KFDGGYVHNGLLKAA WV + E+  L++L++  P Y+LTF GHSLG+GV ++L L V+QN  +L N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        I+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQDDFLPRT T LE++FKS+ CLPCLLCL CL+DT   E++ LKD RRLY PGRLYHIV RKP R G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        R+PPVV+TAVPVDGRFE IVLSCNAT+DHAIIWIE+E++ AL+LM+E+++VM+IP +QK+ RQ ++  +H EEY+AA+ +A +L +P + + S YGTF +
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPAS-SGGSSRKRKETWDELIERLYD-KDDSRH
        T EGE         S SG S +  +  WD+ I+  +   D+S H
Subjt:  TVEGEEEKEEESPAS-SGGSSRKRKETWDELIERLYD-KDDSRH

AT4G00500.2 alpha/beta-Hydrolases superfamily protein5.1e-16762.84Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY
        MSI+C VP+LECV CLGC  W+WK+CL++AGH+SE WG AT+DEFEPIPRICR ILAVYE+++  P+W P  GYGI+P+ +ILKK Y  T GR  PY++Y
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILY

Query:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN
        LDH++ D+VLAIRGLNLAKE DYAVLLDNKLG+ KFDGGYVHNGLLKAA WV + E+  L++L++  P Y+LTF GHSLG+GV ++L L V+QN  +L N
Subjt:  LDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG
        I+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQDDFLPRT T LE++FKS+ CLPCLLCL CL+DT   E++ LKD RRLY PGRLYHIV RKP R G
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCG

Query:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE
        R+PPVV+TAVPVDGRFE IVLSCNAT+DHAIIWIE+E++ AL+LM+E+++VM+IP +QK+ RQ ++  +H EEY+AA+ +A +L +P + + S YGTF +
Subjt:  RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSE

Query:  TVEGEEEKEEESPAS-SGGSSRKRKETWDELIERLYD-KDDSRH
        T EGE         S SG S +  +  WD+ I+  +   D+S H
Subjt:  TVEGEEEKEEESPAS-SGGSSRKRKETWDELIERLYD-KDDSRH

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase5.3e-6342.95Show/hide
Query:  FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNS----GHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNT
        FL+L+    S S       S   RK LRD+ + S       SY   S+ +DP+RV+Q+SW PR FLY+GFLS+EECDHLI+L   + +  S +  G    
Subjt:  FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNS----GHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNT

Query:  VSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWS
                        D ++  IE +++ WTFLP ++    ++  Y  E++     +FG   +    E L+ATVVLYLS++  GGE+LFP S++K +  +
Subjt:  VSTKFLGNSGAILNTTDDIIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWS

Query:  DRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPD
           +  N LRPVKGNA+LFF+  LNAS D    H R P++ G+L VATK  Y +  A       ESG   +C DEDE+C +WA +GECK+N V+MIGSPD
Subjt:  DRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPD

Query:  YYGTCRKSCNAC
        YYGTCRKSCNAC
Subjt:  YYGTCRKSCNAC

AT5G37710.1 alpha/beta-Hydrolases superfamily protein5.1e-14357.53Show/hide
Query:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEP-VGGYGINPDWLILKKTYKDTRGRAPPYIL
        MS+ CG   LECV C+G +RW WKRC H    DS TW  AT +EFEPIPRI R ILAVYE D+R P   P +G + +NP+W+I + T++ T+GR+PPYI+
Subjt:  MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEP-VGGYGINPDWLILKKTYKDTRGRAPPYIL

Query:  YLDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETL-KDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKL
        Y+DHDH +IVLAIRGLNLAKESDY +LLDNKLG+    GGYVH GLLK+A WVL+ E+ETL +   +   +Y L FAGHSLGSGVAA++ ++VV     +
Subjt:  YLDHDHADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETL-KDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKL

Query:  ENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFR
         +I R ++RC+A+APARCMSLNLAV+YADVI+SV+LQDDFLPRTATPLEDIFKS+FCLPCLL L CLRDT I E + L+DPRRLY PGR+YHIVERK   
Subjt:  ENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFR

Query:  CGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLE---DNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPY
          RFPP V+TA+PVDGRFEHIVLS NATSDHAI+WIE+EA+ AL+++ E   +  V   P +++MER +TL +EH    K AL+RAV+L +PHA      
Subjt:  CGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMLE---DNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPY

Query:  GTFSETVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDS
              V   EE+EE +   +    + +K+ WDE++++L+ + +S
Subjt:  GTFSETVEGEEEKEEESPASSGGSSRKRKETWDELIERLYDKDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTATCATATGTGGTGTACCTATCCTTGAGTGCGTATGCTGTCTGGGATGTGCACGTTGGGTGTGGAAACGCTGTCTCCACACAGCTGGTCATGACAGTGAA
ACTTGGGGCTTTGCCACGGCCGATGAGTTTGAACCTATTCCCCGAATTTGTCGATATATCCTAGCTGTGTATGAAGATGATATTCGAAAACCTCTTTGGGAACCA
GTTGGTGGGTATGGAATCAATCCAGATTGGTTAATCTTGAAGAAGACATACAAAGATACACGAGGGCGGGCTCCTCCATATATTTTATATCTCGATCATGATCAT
GCCGATATTGTTCTTGCCATCAGGGGATTGAATTTGGCAAAGGAGAGTGATTATGCAGTTTTATTGGACAACAAGCTGGGGAAGATGAAATTTGATGGTGGATAT
GTTCACAATGGGCTTCTGAAGGCAGCTGGATGGGTTTTGGACAGTGAGAATGAAACTTTGAAGGATTTGGTGAAGAAATATCCGGATTATACCTTGACGTTTGCT
GGGCATTCTCTTGGCTCTGGAGTAGCAGCCATGCTAACTTTGGTAGTAGTACAGAATCACGAAAAATTGGAAAATATTGATCGGAAGCGGATAAGGTGCTATGCT
ATCGCTCCTGCTAGGTGTATGTCCCTGAATTTGGCTGTTAGATATGCAGATGTCATCAACTCTGTTGTTCTTCAGGATGACTTTTTACCCAGGACAGCCACACCC
TTGGAAGACATTTTTAAGTCACTCTTCTGTTTGCCATGCCTTCTATGCCTGGGGTGCCTGCGGGATACATGCATATCGGAGGACAAGATGCTTAAAGATCCGAGA
AGACTTTACACACCCGGTCGACTCTATCACATTGTTGAGCGAAAGCCCTTCAGGTGTGGAAGATTTCCGCCAGTTGTGAAGACGGCCGTCCCGGTGGATGGGAGG
TTTGAACACATAGTTCTTTCTTGTAATGCAACTTCTGACCATGCTATCATTTGGATTGAGAAAGAAGCCAAATGGGCCCTCGAATTAATGCTGGAGGATAATAAG
GTCATGGAGATACCACCCCAACAAAAGATGGAGAGGCAAAATACTTTAGCCAGAGAGCACACTGAAGAGTACAAGGCTGCATTGCAGCGGGCCGTGACGTTAGCC
GTGCCACATGCATACACACTTTCCCCGTATGGAACCTTCAGCGAAACAGTTGAAGGTGAAGAAGAAAAAGAAGAAGAATCACCAGCATCGAGTGGAGGGTCATCG
AGGAAGAGGAAAGAAACTTGGGATGAACTGATCGAGCGTCTTTACGACAAGGATGACTCAAGACACGACAAGAATGAGTCGAGATACACCGTGCTGAAGAAATCA
TTCAGTAATTCATGGAAGTACAGAGGAGACCATGGATTTGTTCAGCAGCAGCTTCCTGAAAATCTGTATCCACAACTTCTAATTCTTCATACCTTTTCAACTTCC
CCTTCCTCGTCTCTTCTTTTTGGATCTTCGTCTCACCCATCCATGGATTCTCGTCTCACCTTTTTGCTTCTTTTAGCGGCCGCATTTTCATTCTCGAGCTGCCTT
GCACAAAGCAATTCGATTAGTGGCCGTAAGGGTTTAAGGGACCAAATGGTTAACAGTGGACATTTGAGCTACTCAAATCATTCTGAAAGAATCGACCCATCACGA
GTTGTCCAAATCTCTTGGCAACCAAGGGCCTTCTTGTATAAAGGCTTTCTCTCAGATGAGGAGTGTGATCACCTTATTGCTTTGGCTTCAAATTCTGAAGATAAA
CCTTCTAGGAACAATGCTGGTTCCAGGAACACTGTCTCAACCAAATTTCTAGGCAATTCAGGAGCTATTTTAAACACAACAGATGATATCATTGGCAGGATTGAA
AATAGAATTGCGGTGTGGACTTTTCTCCCAAAAGATCATAGCATGCCTTTCCAAATTATGAAATACGGGGGTGAAGAAGCAGCAGGGCATAAGTACTTTTTTGGC
AACAGATCTGCAATGCCATCCAGTGAACCGCTGATGGCCACGGTAGTTTTGTATCTATCAGATTCGGCTAGCGGTGGCGAGATTCTGTTCCCAGTATCAAAGGTA
AAGAGAAGATTTTGGTCAGACCGGAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAGTTCTTTTTTTCTCTGTTCATCTTAATGCTTCTCCAGAC
AAGAGTTGCTACCATTCCCGAACTCCAATACTCGATGGGAAATTGTGGGTTGCTACAAAATTCTTCTACATAAGGCCAGCAGCCACTGGGAATGAACACGCAGTT
GAATCCGGTGTAGACGACGACTGCATTGATGAAGATGAAAGCTGCCCCAAATGGGCTGCCATCGGCGAATGCAAACGAAACGCAGTGTTCATGATCGGTTCTCCA
GATTACTATGGCACATGTAGAAAAAGCTGCAACGCATGTGGATGA
mRNA sequenceShow/hide mRNA sequence
ATTCACAATTTTCCCCCAACAAATTCTCTCCCTCTCAAGAATCCAATCCACCATGTTCTTCTCCAGATCCAATTTTCCCCCAAAATGCACACAATCTCCCAAATT
CCTATACCCAAAATCAGACGTTGACTTCCAGGTACAATCTGTTCTTGTCTAACTTTAGATTTATATGCATTTCGGATTTGAGTTTCTACGAAGGGTTCTCGAGGG
AGTTTATAGCTAAGTCCACTGCATTCACTCGGTTTCCTCTGTCTCTTTTGTTAATTTGAAGAAATCCAGCGTTTAATTCCACGTTTTTCCATATTTGAAACTCTG
ATTCTTAGTTAAGAATGTTCTTGTATGATTCTGATTCTCGAATTCTTGACCGGGAAGTGTGAATTTGTCCCTATCTTCTTCGTTTTCACTTCCGTTGCTTCTGTT
TTTTGCTCTTGATGATCGAAATTTGAAGCCTAGATCTTCCGTTGTTTTGGTGTTCATCCCTAAATTAGTTTAGCTCCATGGGATTCAGTGATTGGAAGCTACTGG
AAGGTTCATATGGTATTGGTTGATGAATGAATTCCCGCATTCAAGTATTTGTGTTCTTCTCTTGTTATTTGATTCTTTGGAGAGAAGTTTAAGATTTATTTTCTT
GTGAAGTATGCTTTCTAGTTCGCCCAAAATGATCGATCGAAATATTTTGAAGTTCTTCGGTGCCCCATAATCATCTACCACCTGTTATCAACTCCAAGACTTGTA
TCAATAGAGCTTGTAGTTTCCTTGCATCAAATTTCTTTATGTTTCTTCATAACTCGGGCATTTACTAGAAAAATCTGTATGATCTATCATTCCTTTGAGCTTAGA
TCATAGGCAACAATGTCTATCATATGTGGTGTACCTATCCTTGAGTGCGTATGCTGTCTGGGATGTGCACGTTGGGTGTGGAAACGCTGTCTCCACACAGCTGGT
CATGACAGTGAAACTTGGGGCTTTGCCACGGCCGATGAGTTTGAACCTATTCCCCGAATTTGTCGATATATCCTAGCTGTGTATGAAGATGATATTCGAAAACCT
CTTTGGGAACCAGTTGGTGGGTATGGAATCAATCCAGATTGGTTAATCTTGAAGAAGACATACAAAGATACACGAGGGCGGGCTCCTCCATATATTTTATATCTC
GATCATGATCATGCCGATATTGTTCTTGCCATCAGGGGATTGAATTTGGCAAAGGAGAGTGATTATGCAGTTTTATTGGACAACAAGCTGGGGAAGATGAAATTT
GATGGTGGATATGTTCACAATGGGCTTCTGAAGGCAGCTGGATGGGTTTTGGACAGTGAGAATGAAACTTTGAAGGATTTGGTGAAGAAATATCCGGATTATACC
TTGACGTTTGCTGGGCATTCTCTTGGCTCTGGAGTAGCAGCCATGCTAACTTTGGTAGTAGTACAGAATCACGAAAAATTGGAAAATATTGATCGGAAGCGGATA
AGGTGCTATGCTATCGCTCCTGCTAGGTGTATGTCCCTGAATTTGGCTGTTAGATATGCAGATGTCATCAACTCTGTTGTTCTTCAGGATGACTTTTTACCCAGG
ACAGCCACACCCTTGGAAGACATTTTTAAGTCACTCTTCTGTTTGCCATGCCTTCTATGCCTGGGGTGCCTGCGGGATACATGCATATCGGAGGACAAGATGCTT
AAAGATCCGAGAAGACTTTACACACCCGGTCGACTCTATCACATTGTTGAGCGAAAGCCCTTCAGGTGTGGAAGATTTCCGCCAGTTGTGAAGACGGCCGTCCCG
GTGGATGGGAGGTTTGAACACATAGTTCTTTCTTGTAATGCAACTTCTGACCATGCTATCATTTGGATTGAGAAAGAAGCCAAATGGGCCCTCGAATTAATGCTG
GAGGATAATAAGGTCATGGAGATACCACCCCAACAAAAGATGGAGAGGCAAAATACTTTAGCCAGAGAGCACACTGAAGAGTACAAGGCTGCATTGCAGCGGGCC
GTGACGTTAGCCGTGCCACATGCATACACACTTTCCCCGTATGGAACCTTCAGCGAAACAGTTGAAGGTGAAGAAGAAAAAGAAGAAGAATCACCAGCATCGAGT
GGAGGGTCATCGAGGAAGAGGAAAGAAACTTGGGATGAACTGATCGAGCGTCTTTACGACAAGGATGACTCAAGACACGACAAGAATGAGTCGAGATACACCGTG
CTGAAGAAATCATTCAGTAATTCATGGAAGTACAGAGGAGACCATGGATTTGTTCAGCAGCAGCTTCCTGAAAATCTGTATCCACAACTTCTAATTCTTCATACC
TTTTCAACTTCCCCTTCCTCGTCTCTTCTTTTTGGATCTTCGTCTCACCCATCCATGGATTCTCGTCTCACCTTTTTGCTTCTTTTAGCGGCCGCATTTTCATTC
TCGAGCTGCCTTGCACAAAGCAATTCGATTAGTGGCCGTAAGGGTTTAAGGGACCAAATGGTTAACAGTGGACATTTGAGCTACTCAAATCATTCTGAAAGAATC
GACCCATCACGAGTTGTCCAAATCTCTTGGCAACCAAGGGCCTTCTTGTATAAAGGCTTTCTCTCAGATGAGGAGTGTGATCACCTTATTGCTTTGGCTTCAAAT
TCTGAAGATAAACCTTCTAGGAACAATGCTGGTTCCAGGAACACTGTCTCAACCAAATTTCTAGGCAATTCAGGAGCTATTTTAAACACAACAGATGATATCATT
GGCAGGATTGAAAATAGAATTGCGGTGTGGACTTTTCTCCCAAAAGATCATAGCATGCCTTTCCAAATTATGAAATACGGGGGTGAAGAAGCAGCAGGGCATAAG
TACTTTTTTGGCAACAGATCTGCAATGCCATCCAGTGAACCGCTGATGGCCACGGTAGTTTTGTATCTATCAGATTCGGCTAGCGGTGGCGAGATTCTGTTCCCA
GTATCAAAGGTAAAGAGAAGATTTTGGTCAGACCGGAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAGTTCTTTTTTTCTCTGTTCATCTTAAT
GCTTCTCCAGACAAGAGTTGCTACCATTCCCGAACTCCAATACTCGATGGGAAATTGTGGGTTGCTACAAAATTCTTCTACATAAGGCCAGCAGCCACTGGGAAT
GAACACGCAGTTGAATCCGGTGTAGACGACGACTGCATTGATGAAGATGAAAGCTGCCCCAAATGGGCTGCCATCGGCGAATGCAAACGAAACGCAGTGTTCATG
ATCGGTTCTCCAGATTACTATGGCACATGTAGAAAAAGCTGCAACGCATGTGGATGAATAACCAAGTAATTCTTTACCCTTACTGATTTGAGCAACTATTTGTTT
ATTTTTTCCTATTTTCTTTTCTCCTTTTGTTATTTATTCATTTCATCACATCTGTAACGGAAAGATGGAACCTGGAATCTTTTAAAGGAATTAGAAAGTGCATTA
GCTATTGAACCTGGAA
Protein sequenceShow/hide protein sequence
MSIICGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATADEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLILKKTYKDTRGRAPPYILYLDHDH
ADIVLAIRGLNLAKESDYAVLLDNKLGKMKFDGGYVHNGLLKAAGWVLDSENETLKDLVKKYPDYTLTFAGHSLGSGVAAMLTLVVVQNHEKLENIDRKRIRCYA
IAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFCLPCLLCLGCLRDTCISEDKMLKDPRRLYTPGRLYHIVERKPFRCGRFPPVVKTAVPVDGR
FEHIVLSCNATSDHAIIWIEKEAKWALELMLEDNKVMEIPPQQKMERQNTLAREHTEEYKAALQRAVTLAVPHAYTLSPYGTFSETVEGEEEKEEESPASSGGSS
RKRKETWDELIERLYDKDDSRHDKNESRYTVLKKSFSNSWKYRGDHGFVQQQLPENLYPQLLILHTFSTSPSSSLLFGSSSHPSMDSRLTFLLLLAAAFSFSSCL
AQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNTTDDIIGRIE
NRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPD
KSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNACG