; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G07140 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G07140
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationClcChr05:5252401..5269663
RNA-Seq ExpressionClc05G07140
SyntenyClc05G07140
Gene Ontology termsGO:0000160 - phosphorelay signal transduction system (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0000166 - nucleotide binding (molecular function)
InterPro domainsIPR045054 - Prolyl 4-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR036641 - HPT domain superfamily
IPR008207 - Signal transduction histidine kinase, phosphotransfer (Hpt) domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR003582 - ShKT domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAV63367.1 Hpt domain-containing protein/2OG-FeII_Oxy_3 domain-containing protein [Cephalotus follicularis]1.1e-16065.77Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        A+VYEGFLT LECDHLISLAK+ELKRSAVADNLSG+SK+SEVRTSSG FI K KDPIV GIEDKI+ WTFLPKENGEDIQVLRYE GQKY+ HFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNIARGGHR+ATVL+YL++V KGGETVFPSAE S RR+ S T+ DLS+C +KG+AVKPR+GDALLFFSLHPNA+PD SSLH GCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK---
        +SFD  +    NC D N SCE+WA LGECT N EYMVGSPELPGYCR+SCK        + KP  +          TL +++   I       R  +   
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK---

Query:  -MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACID
         MEVG MQR+ + Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFF+D+E LLN L  A+ QPSVDF ++D HVHQLKGSSSSI A R+KNA + 
Subjt:  -MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACID

Query:  FRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGS
        FR+ CE+Q+ + C RCLQQ++QE+Y  +  L  L+ LE++I+ AGGS
Subjt:  FRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGS

KAF4393790.1 hypothetical protein G4B88_007776 [Cannabis sativa]9.3e-16567.92Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AF+YEGFLTDLECDHLISLAK+ELKRSAVAD+ SGES++SEVRTSSG FI KAKDPIV+GIEDKI+ WTFLPKENGEDIQVLRYE GQKY+ H+DYFADK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNI RGGHR+ATVLMYL++V KGGETVFP A E+ R + S T ED S+CAKKG+AVK R+GDALLFFSL P AIPDT SLH GCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKMEV
        +SFD  V     C D N SCERWA LGECT N EYMVGSPELPGYCR+SCK    H             +H   FK                     MEV
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKMEV

Query:  GLMQRQWVQYIKGLLGEGT-LDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS
        G MQRQWV Y K L  E   LDSQ+LQLLQLQDESNP FV EVV+LFF+DTE+LLN L  A+ Q  VDFK++D HVHQLKGSSSSIGA RVKN C+ FR+
Subjt:  GLMQRQWVQYIKGLLGEGT-LDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS

Query:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIP-MDLGF
         CE+Q+ D C RCLQQV+QE+Y VK KL  L+ LE++I+ AGGSIP M+LGF
Subjt:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIP-MDLGF

RXH72084.1 hypothetical protein DVH24_025585 [Malus domestica]5.1e-16366.44Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEG LTD E DHLIS+AK+ELKRSAVADNLSG+SK+SEVRTSSG FI KAKDPIV+GIEDK+A WTFLPKENGEDIQVLRYE GQKY  H+DYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEE-SQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIH
        VNIARGGHR+ATVLMYL++V KGGETVFP AE+   RR+A+E    LS+CAKKGIAVKPR+GDALLFFSL P+A+PD +SLH GCPVIEGEKWSATKWIH
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEE-SQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIH

Query:  VNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKME
        V+SFD  +    NC D N SCERWA LGECT N EYMVG+P+LPGYCR+SCK      +  +    +F                   S  +   +   ME
Subjt:  VNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKME

Query:  VGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS
        VG MQRQWV Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFFED+E+LLN L  A+ QP+VDFK++D HVHQ KGSSSSIGA RVKNACI FR+
Subjt:  VGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS

Query:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
         CE+Q+ + C RC+QQV+ E+Y VK KL  L+A+E++I+ AGGSIPM
Subjt:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

RXH72084.1 hypothetical protein DVH24_025585 [Malus domestica]6.5e-1740.76Show/hide
Query:  SSLPPSRYSSIQSQVSDVGSSLAEDHDSDRDSI----------------------------------------RPRIPPGLLLASQTSTPRLTPHGSPPS
        S+LPPSR SS+QS   + GS   ED D++   +                                         P I   +LLASQTSTPRLTP GSPP 
Subjt:  SSLPPSRYSSIQSQVSDVGSSLAEDHDSDRDSI----------------------------------------RPRIPPGLLLASQTSTPRLTPHGSPPS

Query:  LSASGSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYARFAAFLANVKDLNSHKQTKE
         SAS SP R+S   S  R+                          TRVDGKEFFRQVRSRLSY +F+AFL NVK+LN +KQTKE
Subjt:  LSASGSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYARFAAFLANVKDLNSHKQTKE

RXH72084.1 hypothetical protein DVH24_025585 [Malus domestica]2.5e-16266.74Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLTD EC+HLIS+AK ELKRS+VADN+SG+SK+S+VRTSSG FI KAKDPIVSGIE+KIA WTFLPKENGE IQVLRYE+GQKYD H+DYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VN+ARGGHR+ATVLMYLS+V KGGETVFPSAEE+    +S + +DLS+CAKKGIAVKPRKGDALLFFSLHP AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK--M
        +SFD +VR   NC D N +CERWA LGECT NPEYMVG+PELPGYCR+SC+       + T P        ++P +    + +          +  K  M
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK--M

Query:  EVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFR
        EVG +QRQ+V+Y   L  EG LDSQ+ QL QLQDESNP FV EVV+LFFED+E LLN L  A+ Q  VDFKK+D +VHQLKGSSSSIGA RVKNAC+ FR
Subjt:  EVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFR

Query:  SACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
        + CE+ + + C  CLQQV+QE+  VK KL  L+ LE++IL AGGS+PM
Subjt:  SACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

RXH80310.1 hypothetical protein DVH24_041457 [Malus domestica]3.5e-19638.5Show/hide
Query:  MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNALLSATADKLVQADEEKESLKKEN
        M  K S +S F LP+ VL+VLP DPFEQLDVARKITS+ALSTRVS LESESS LR KLAEKD ++ADL+ Q+ESL+A LS +ADKL  A++EKE L KE 
Subjt:  MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNALLSATADKLVQADEEKESLKKEN

Query:  ASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVAR----------IQSQPKE--------VSSLPPSRYSSIQSQVSDVGSSLAEDHDSDR
        A L+NTV+KLSRDV+KLEVFRKTLM SL E+ ++ +   +VVA+            S+P +         S+LPPSR SS+QS  S  GS+  ED D+  
Subjt:  ASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVAR----------IQSQPKE--------VSSLPPSRYSSIQSQVSDVGSSLAEDHDSDR

Query:  DSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSAS---------GSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRL
        D+ RPRI   LLLASQTSTPR TP GSPP  SAS         GSP R SMSF+TSR +F++R    SS PSSH+GS S   GRTRVDGKEFFRQVRSRL
Subjt:  DSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSAS---------GSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRL

Query:  SYARFAAFLANVKDLNSHKQTKEVGSLMLVDDTAW-----------------------------------------------------------------
        SY +F AFLANVK+LN HKQTKE+G   + +D  W                                                                 
Subjt:  SYARFAAFLANVKDLNSHKQTKEVGSLMLVDDTAW-----------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHK
                                                        AFVYEG LTD ECDHLIS+AK+ELKRSAVADNLSG+SK+SEVRTSSG FI K
Subjt:  ------------------------------------------------AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHK

Query:  AKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKK
        AKDPIV+GIEDK++ WTFLPKENGEDIQVLRYE GQKY+ H+DYF DKVNIARGGHR+ATVLMYL++V KGGETVFP AE   RR+A+E    LS+CAKK
Subjt:  AKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKK

Query:  GIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        GIAVKP++GDALLFFSL P+A+PD +SLH GCPVIEGEKWSATKWIHV+SFD  +    +C D N SCERWA LGECT N EYMVG+PELPGYCR+SCK+
Subjt:  GIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Query:  LGHHRRNQTKPTSNFFILHTNPFKT-LFLNLIP-NISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDT
        L               I     FK   FL  +  + S  +   +   MEVG MQRQWV Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFFED+
Subjt:  LGHHRRNQTKPTSNFFILHTNPFKT-LFLNLIP-NISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDT

Query:  EELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
        E+LLN L  A+ QP+VDFK++D HVHQ KGSSSSIGA RVKNACI FR+ CE+++ + C RC+QQV+ E+Y VK KL  L+A+E++I+ AGGSIPM
Subjt:  EELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

TrEMBL top hitse value%identityAlignment
A0A1Q3B5T1 Procollagen-proline 4-dioxygenase5.1e-16165.77Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        A+VYEGFLT LECDHLISLAK+ELKRSAVADNLSG+SK+SEVRTSSG FI K KDPIV GIEDKI+ WTFLPKENGEDIQVLRYE GQKY+ HFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNIARGGHR+ATVL+YL++V KGGETVFPSAE S RR+ S T+ DLS+C +KG+AVKPR+GDALLFFSLHPNA+PD SSLH GCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK---
        +SFD  +    NC D N SCE+WA LGECT N EYMVGSPELPGYCR+SCK        + KP  +          TL +++   I       R  +   
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK---

Query:  -MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACID
         MEVG MQR+ + Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFF+D+E LLN L  A+ QPSVDF ++D HVHQLKGSSSSI A R+KNA + 
Subjt:  -MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACID

Query:  FRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGS
        FR+ CE+Q+ + C RCLQQ++QE+Y  +  L  L+ LE++I+ AGGS
Subjt:  FRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGS

A0A498HP60 Procollagen-proline 4-dioxygenase2.5e-16366.44Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEG LTD E DHLIS+AK+ELKRSAVADNLSG+SK+SEVRTSSG FI KAKDPIV+GIEDK+A WTFLPKENGEDIQVLRYE GQKY  H+DYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEE-SQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIH
        VNIARGGHR+ATVLMYL++V KGGETVFP AE+   RR+A+E    LS+CAKKGIAVKPR+GDALLFFSL P+A+PD +SLH GCPVIEGEKWSATKWIH
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEE-SQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIH

Query:  VNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKME
        V+SFD  +    NC D N SCERWA LGECT N EYMVG+P+LPGYCR+SCK      +  +    +F                   S  +   +   ME
Subjt:  VNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKME

Query:  VGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS
        VG MQRQWV Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFFED+E+LLN L  A+ QP+VDFK++D HVHQ KGSSSSIGA RVKNACI FR+
Subjt:  VGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS

Query:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
         CE+Q+ + C RC+QQV+ E+Y VK KL  L+A+E++I+ AGGSIPM
Subjt:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

A0A498HP60 Procollagen-proline 4-dioxygenase3.1e-1740.76Show/hide
Query:  SSLPPSRYSSIQSQVSDVGSSLAEDHDSDRDSI----------------------------------------RPRIPPGLLLASQTSTPRLTPHGSPPS
        S+LPPSR SS+QS   + GS   ED D++   +                                         P I   +LLASQTSTPRLTP GSPP 
Subjt:  SSLPPSRYSSIQSQVSDVGSSLAEDHDSDRDSI----------------------------------------RPRIPPGLLLASQTSTPRLTPHGSPPS

Query:  LSASGSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYARFAAFLANVKDLNSHKQTKE
         SAS SP R+S   S  R+                          TRVDGKEFFRQVRSRLSY +F+AFL NVK+LN +KQTKE
Subjt:  LSASGSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYARFAAFLANVKDLNSHKQTKE

A0A498HP60 Procollagen-proline 4-dioxygenase1.2e-16266.74Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLTD EC+HLIS+AK ELKRS+VADN+SG+SK+S+VRTSSG FI KAKDPIVSGIE+KIA WTFLPKENGE IQVLRYE+GQKYD H+DYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VN+ARGGHR+ATVLMYLS+V KGGETVFPSAEE+    +S + +DLS+CAKKGIAVKPRKGDALLFFSLHP AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK--M
        +SFD +VR   NC D N +CERWA LGECT NPEYMVG+PELPGYCR+SC+       + T P        ++P +    + +          +  K  M
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK--M

Query:  EVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFR
        EVG +QRQ+V+Y   L  EG LDSQ+ QL QLQDESNP FV EVV+LFFED+E LLN L  A+ Q  VDFKK+D +VHQLKGSSSSIGA RVKNAC+ FR
Subjt:  EVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFR

Query:  SACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
        + CE+ + + C  CLQQV+QE+  VK KL  L+ LE++IL AGGS+PM
Subjt:  SACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

A0A498IAV7 Uncharacterized protein1.7e-19638.5Show/hide
Query:  MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNALLSATADKLVQADEEKESLKKEN
        M  K S +S F LP+ VL+VLP DPFEQLDVARKITS+ALSTRVS LESESS LR KLAEKD ++ADL+ Q+ESL+A LS +ADKL  A++EKE L KE 
Subjt:  MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNALLSATADKLVQADEEKESLKKEN

Query:  ASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVAR----------IQSQPKE--------VSSLPPSRYSSIQSQVSDVGSSLAEDHDSDR
        A L+NTV+KLSRDV+KLEVFRKTLM SL E+ ++ +   +VVA+            S+P +         S+LPPSR SS+QS  S  GS+  ED D+  
Subjt:  ASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVAR----------IQSQPKE--------VSSLPPSRYSSIQSQVSDVGSSLAEDHDSDR

Query:  DSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSAS---------GSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRL
        D+ RPRI   LLLASQTSTPR TP GSPP  SAS         GSP R SMSF+TSR +F++R    SS PSSH+GS S   GRTRVDGKEFFRQVRSRL
Subjt:  DSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSAS---------GSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRL

Query:  SYARFAAFLANVKDLNSHKQTKEVGSLMLVDDTAW-----------------------------------------------------------------
        SY +F AFLANVK+LN HKQTKE+G   + +D  W                                                                 
Subjt:  SYARFAAFLANVKDLNSHKQTKEVGSLMLVDDTAW-----------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHK
                                                        AFVYEG LTD ECDHLIS+AK+ELKRSAVADNLSG+SK+SEVRTSSG FI K
Subjt:  ------------------------------------------------AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHK

Query:  AKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKK
        AKDPIV+GIEDK++ WTFLPKENGEDIQVLRYE GQKY+ H+DYF DKVNIARGGHR+ATVLMYL++V KGGETVFP AE   RR+A+E    LS+CAKK
Subjt:  AKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKK

Query:  GIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        GIAVKP++GDALLFFSL P+A+PD +SLH GCPVIEGEKWSATKWIHV+SFD  +    +C D N SCERWA LGECT N EYMVG+PELPGYCR+SCK+
Subjt:  GIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Query:  LGHHRRNQTKPTSNFFILHTNPFKT-LFLNLIP-NISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDT
        L               I     FK   FL  +  + S  +   +   MEVG MQRQWV Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFFED+
Subjt:  LGHHRRNQTKPTSNFFILHTNPFKT-LFLNLIP-NISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDT

Query:  EELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
        E+LLN L  A+ QP+VDFK++D HVHQ KGSSSSIGA RVKNACI FR+ CE+++ + C RC+QQV+ E+Y VK KL  L+A+E++I+ AGGSIPM
Subjt:  EELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

A0A7J6HHC7 Uncharacterized protein4.5e-16567.92Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AF+YEGFLTDLECDHLISLAK+ELKRSAVAD+ SGES++SEVRTSSG FI KAKDPIV+GIEDKI+ WTFLPKENGEDIQVLRYE GQKY+ H+DYFADK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNI RGGHR+ATVLMYL++V KGGETVFP A E+ R + S T ED S+CAKKG+AVK R+GDALLFFSL P AIPDT SLH GCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKMEV
        +SFD  V     C D N SCERWA LGECT N EYMVGSPELPGYCR+SCK    H             +H   FK                     MEV
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKMEV

Query:  GLMQRQWVQYIKGLLGEGT-LDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS
        G MQRQWV Y K L  E   LDSQ+LQLLQLQDESNP FV EVV+LFF+DTE+LLN L  A+ Q  VDFK++D HVHQLKGSSSSIGA RVKN C+ FR+
Subjt:  GLMQRQWVQYIKGLLGEGT-LDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS

Query:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIP-MDLGF
         CE+Q+ D C RCLQQV+QE+Y VK KL  L+ LE++I+ AGGSIP M+LGF
Subjt:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIP-MDLGF

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.3e-8461.72Show/hide
Query:  TAWAFVYEGFLTDLECDHLISLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDY
        T  AF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E K+AAWTFLP+ENGE +Q+L YE GQKYD HFDY
Subjt:  TAWAFVYEGFLTDLECDHLISLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDY

Query:  FADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATK
        F DK  +  GGHR+ATVLMYLSNV KGGETVFP+    + +      +  S CAK+G AVKPRKGDALLFF+LH N   D +SLHG CPVIEGEKWSAT+
Subjt:  FADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATK

Query:  WIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKA
Subjt:  WIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

F4JAU3 Prolyl 4-hydroxylase 24.8e-11678.97Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLTDLECDHLISLAK  L+RSAVADN +GES+VS+VRTSSG FI K KDPIVSGIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNIARGGHR+ATVL+YLSNV KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        +SFD I+    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKA
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Q8L970 Probable prolyl 4-hydroxylase 75.2e-9464.45Show/hide
Query:  TAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF
        T   F+YEGFL+D ECDH I LAK +L++S VADN SGES  SEVRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF
Subjt:  TAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF

Query:  ADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATK
         D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+
Subjt:  ADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATK

Query:  WIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS +  GYCRKSCKA
Subjt:  WIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Q8LAN3 Probable prolyl 4-hydroxylase 43.8e-12182.94Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLT+LECDH++SLAKA LKRSAVADN SGESK SEVRTSSG FI K KDPIVSGIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNI RGGHRMAT+LMYLSNV KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPRKGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        +SFD IV    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKA
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Q93W28 Uncharacterized protein At4g155451.1e-6756.17Show/hide
Query:  QGKGSNASG---FALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNALLSATADKLVQADEEKESLKK
        +G  S  +G   F LPD +LQVLPSDPFEQLDVARKITSIALSTRVS LESESS LR  LAEK++   +L+  +ESL A LS    KL  AD EKE+L +
Subjt:  QGKGSNASG---FALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNALLSATADKLVQADEEKESLKK

Query:  ENASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVARIQSQPKEVSSLPPSRYSSIQSQVSDVGSSLAEDHDSDRDSIRPRIPPGLLLASQ
        ENASLSNTVK+L RDV+KLE FRKTLM+SLQ++ D +    +++A+  +   + +   PSR+SSIQSQ +      A   D++ D+ +P +   L L SQ
Subjt:  ENASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVARIQSQPKEVSSLPPSRYSSIQSQVSDVGSSLAEDHDSDRDSIRPRIPPGLLLASQ

Query:  TSTPRLTPHGSPPSLSASG---------SPMRTSMSFSTSRNIFED-RSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYARFAAFLANVKDL
        T+TPRLTP GSPP LSASG         SP R S+SF+T+R +F+D RSS   S P        S   RTRVDGKEFFRQVRSRLSY +F AFL NVKDL
Subjt:  TSTPRLTPHGSPPSLSASG---------SPMRTSMSFSTSRNIFED-RSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYARFAAFLANVKDL

Query:  NSHKQTKE
        N+HKQT+E
Subjt:  NSHKQTKE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 23.4e-11778.97Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLTDLECDHLISLAK  L+RSAVADN +GES+VS+VRTSSG FI K KDPIVSGIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNIARGGHR+ATVL+YLSNV KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        +SFD I+    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKA
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase3.7e-9564.45Show/hide
Query:  TAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF
        T   F+YEGFL+D ECDH I LAK +L++S VADN SGES  SEVRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF
Subjt:  TAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF

Query:  ADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATK
         D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+
Subjt:  ADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATK

Query:  WIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS +  GYCRKSCKA
Subjt:  WIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.5e-8860.98Show/hide
Query:  TAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGES-----KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQK
        T   F+YEGFL+D ECDH I LAK +L++S VADN SGES      VS VR SS    +      D IVS +E K+AAWTFLP+ENGE +Q+L YE GQK
Subjt:  TAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGES-----KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQK

Query:  YDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIE
        Y+ HFDYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+E
Subjt:  YDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIE

Query:  GEKWSATKWIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        GEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS +  GYCRKSCKA
Subjt:  GEKWSATKWIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase9.1e-8661.72Show/hide
Query:  TAWAFVYEGFLTDLECDHLISLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDY
        T  AF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E K+AAWTFLP+ENGE +Q+L YE GQKYD HFDY
Subjt:  TAWAFVYEGFLTDLECDHLISLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDY

Query:  FADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATK
        F DK  +  GGHR+ATVLMYLSNV KGGETVFP+    + +      +  S CAK+G AVKPRKGDALLFF+LH N   D +SLHG CPVIEGEKWSAT+
Subjt:  FADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATK

Query:  WIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKA
Subjt:  WIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.7e-12282.94Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLT+LECDH++SLAKA LKRSAVADN SGESK SEVRTSSG FI K KDPIVSGIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNI RGGHRMAT+LMYLSNV KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPRKGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        +SFD IV    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKA
Subjt:  NSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGGAAAGGATCTAACGCTTCCGGCTTCGCTCTTCCTGACGGAGTCTTGCAAGTCCTGCCTTCTGATCCTTTTGAACAGCTCGACGTGGCTCGCAAAATCACTTC
CATTGCTCTTTCCACTCGCGTCTCGTTGCTTGAATCAGAGTCCTCTGTTCTCCGTTCTAAACTTGCTGAGAAAGATGAGATCGTTGCTGACCTTCGGTTCCAGATCGAAT
CACTCAACGCTTTGCTCTCCGCGACTGCTGATAAACTCGTCCAGGCAGACGAGGAGAAGGAGAGCTTGAAGAAAGAGAATGCCTCGTTGTCGAACACTGTGAAGAAGCTT
AGTAGAGATGTTGCGAAGTTGGAGGTTTTCAGAAAGACGTTAATGCTATCACTTCAGGAGGAGGGAGATAGCTCTACAGAAGTTCCAGAAGTTGTTGCTAGGATACAGAG
CCAACCAAAAGAGGTTTCATCGTTGCCACCCTCTAGATATTCATCGATTCAGAGCCAGGTTTCTGACGTAGGAAGTTCGTTAGCAGAGGATCATGATTCAGATAGGGATA
GTATACGACCTCGTATTCCGCCTGGCCTCCTGTTAGCGTCCCAGACAAGTACACCTCGGCTTACTCCCCATGGCTCTCCTCCTTCACTGTCAGCATCAGGATCCCCAATG
AGAACATCAATGTCATTTTCAACGTCCAGGAACATTTTTGAAGATAGATCTTCAGAATATTCTTCTGCGCCCTCAAGCCACTATGGCTCGATTTCCTCCAACAAAGGGAG
AACTCGGGTCGATGGGAAGGAGTTTTTCCGACAAGTCAGGAGCCGTTTGTCTTATGCACGGTTTGCTGCATTTTTAGCAAATGTGAAGGATCTAAATTCCCACAAGCAAA
CAAAAGAGGTTGGCTCTCTCATGCTAGTAGATGACACTGCGTGGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCCCTCGCTAAAGCG
GAGTTGAAGAGATCTGCTGTCGCGGATAATTTGTCCGGAGAGAGTAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATCCATAAAGCCAAGGATCCTATTGTTTC
TGGAATAGAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTACGATGCACACTTTGATT
ACTTTGCTGACAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCCAACGTAGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCAGAG
GAATCTCAAAGACGCCAGGCTTCGGAAACAAGTGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTTAAACCACGGAAAGGCGACGCTCTTCTCTTCTTCAGTCT
CCATCCAAATGCTATTCCAGACACAAGTAGTCTACATGGTGGATGCCCTGTGATTGAAGGTGAGAAATGGTCAGCAACAAAGTGGATTCATGTCAATTCTTTTGACACCA
TCGTGAGAGACCATACGAACTGCGCCGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTCGGTGAGTGCACGAATAACCCGGAGTATATGGTCGGATCTCCCGAGCTT
CCTGGCTACTGCAGGAAAAGTTGTAAGGCGCTCGGCCATCATCGCCGAAACCAAACCAAACCCACTTCAAATTTCTTCATTCTTCACACAAATCCCTTCAAAACCCTCTT
TCTTAATCTCATTCCTAACATCTCTACCCTACTTCCTTTAGCTCGTTTCATTAAAATGGAGGTGGGATTGATGCAGAGACAGTGGGTTCAGTATATTAAAGGCTTACTCG
GTGAGGGTACGCTGGATAGTCAGTATTTGCAGCTTTTGCAACTGCAAGATGAGAGTAATCCAACTTTCGTTTCTGAAGTGGTCACTCTTTTCTTTGAAGATACCGAGGAG
CTTCTCAATAAACTGAGAATCGCTATATCACAGCCATCTGTTGACTTCAAAAAGATTGATGATCATGTACACCAGCTGAAGGGCAGCAGTTCCAGCATAGGTGCACTTAG
AGTGAAAAATGCCTGCATTGACTTCCGGAGCGCCTGCGAGCAACAGAGTCCTGACTGGTGTTCAAGATGCCTGCAACAAGTAGAGCAAGAATTCTATGGTGTGAAGGAGA
AGCTCAGTTATTTATATGCACTGGAGAAACGGATTTTGAATGCTGGTGGATCCATTCCCATGGACTTGGGTTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATTCTTCGTCGTTATGCCTACCTACAATCGGAGTCCTAACAATGGCGGATGGCAATTGAAGTTACAAGAAACTTCGTAGAAAGAGAGAGGCTGGAAAATGCAAGGGAAAG
GATCTAACGCTTCCGGCTTCGCTCTTCCTGACGGAGTCTTGCAAGTCCTGCCTTCTGATCCTTTTGAACAGCTCGACGTGGCTCGCAAAATCACTTCCATTGCTCTTTCC
ACTCGCGTCTCGTTGCTTGAATCAGAGTCCTCTGTTCTCCGTTCTAAACTTGCTGAGAAAGATGAGATCGTTGCTGACCTTCGGTTCCAGATCGAATCACTCAACGCTTT
GCTCTCCGCGACTGCTGATAAACTCGTCCAGGCAGACGAGGAGAAGGAGAGCTTGAAGAAAGAGAATGCCTCGTTGTCGAACACTGTGAAGAAGCTTAGTAGAGATGTTG
CGAAGTTGGAGGTTTTCAGAAAGACGTTAATGCTATCACTTCAGGAGGAGGGAGATAGCTCTACAGAAGTTCCAGAAGTTGTTGCTAGGATACAGAGCCAACCAAAAGAG
GTTTCATCGTTGCCACCCTCTAGATATTCATCGATTCAGAGCCAGGTTTCTGACGTAGGAAGTTCGTTAGCAGAGGATCATGATTCAGATAGGGATAGTATACGACCTCG
TATTCCGCCTGGCCTCCTGTTAGCGTCCCAGACAAGTACACCTCGGCTTACTCCCCATGGCTCTCCTCCTTCACTGTCAGCATCAGGATCCCCAATGAGAACATCAATGT
CATTTTCAACGTCCAGGAACATTTTTGAAGATAGATCTTCAGAATATTCTTCTGCGCCCTCAAGCCACTATGGCTCGATTTCCTCCAACAAAGGGAGAACTCGGGTCGAT
GGGAAGGAGTTTTTCCGACAAGTCAGGAGCCGTTTGTCTTATGCACGGTTTGCTGCATTTTTAGCAAATGTGAAGGATCTAAATTCCCACAAGCAAACAAAAGAGGTTGG
CTCTCTCATGCTAGTAGATGACACTGCGTGGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCCCTCGCTAAAGCGGAGTTGAAGAGAT
CTGCTGTCGCGGATAATTTGTCCGGAGAGAGTAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATCCATAAAGCCAAGGATCCTATTGTTTCTGGAATAGAAGAC
AAAATTGCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTACGATGCACACTTTGATTACTTTGCTGACAA
GGTTAATATTGCCCGAGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCCAACGTAGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCAGAGGAATCTCAAAGAC
GCCAGGCTTCGGAAACAAGTGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTTAAACCACGGAAAGGCGACGCTCTTCTCTTCTTCAGTCTCCATCCAAATGCT
ATTCCAGACACAAGTAGTCTACATGGTGGATGCCCTGTGATTGAAGGTGAGAAATGGTCAGCAACAAAGTGGATTCATGTCAATTCTTTTGACACCATCGTGAGAGACCA
TACGAACTGCGCCGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTCGGTGAGTGCACGAATAACCCGGAGTATATGGTCGGATCTCCCGAGCTTCCTGGCTACTGCA
GGAAAAGTTGTAAGGCGCTCGGCCATCATCGCCGAAACCAAACCAAACCCACTTCAAATTTCTTCATTCTTCACACAAATCCCTTCAAAACCCTCTTTCTTAATCTCATT
CCTAACATCTCTACCCTACTTCCTTTAGCTCGTTTCATTAAAATGGAGGTGGGATTGATGCAGAGACAGTGGGTTCAGTATATTAAAGGCTTACTCGGTGAGGGTACGCT
GGATAGTCAGTATTTGCAGCTTTTGCAACTGCAAGATGAGAGTAATCCAACTTTCGTTTCTGAAGTGGTCACTCTTTTCTTTGAAGATACCGAGGAGCTTCTCAATAAAC
TGAGAATCGCTATATCACAGCCATCTGTTGACTTCAAAAAGATTGATGATCATGTACACCAGCTGAAGGGCAGCAGTTCCAGCATAGGTGCACTTAGAGTGAAAAATGCC
TGCATTGACTTCCGGAGCGCCTGCGAGCAACAGAGTCCTGACTGGTGTTCAAGATGCCTGCAACAAGTAGAGCAAGAATTCTATGGTGTGAAGGAGAAGCTCAGTTATTT
ATATGCACTGGAGAAACGGATTTTGAATGCTGGTGGATCCATTCCCATGGACTTGGGTTTCTAAAGTGACAGAATCCATGGAAACCAAGATTTTCCGTTCGTTCCTGTGG
AGTCTCCTTTCGCTTGTTTTCTAGGTTCTTAAATTGACTGTTACTTGTGCTTGTACAAATTCAAAACCTCTGCCTTTTTCACTCAGAACATCAGTGTTTTCTACAGCTGA
TTTTGACTTTCGATTCTTCGTGTCTGTTTAAGCTGAACTGTTGTATTGCATGGAGTCCTAGAAGATTTTCTGGGTGCTTTCAAAGCGGCAAATACTCTCTTATATATCTT
TGTTCCCTTCGTTATTCTGCTATTCAAA
Protein sequenceShow/hide protein sequence
MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNALLSATADKLVQADEEKESLKKENASLSNTVKKL
SRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVARIQSQPKEVSSLPPSRYSSIQSQVSDVGSSLAEDHDSDRDSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSASGSPM
RTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYARFAAFLANVKDLNSHKQTKEVGSLMLVDDTAWAFVYEGFLTDLECDHLISLAKA
ELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAE
ESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVNSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPEL
PGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEE
LLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPMDLGF