; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC05G087680 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC05G087680
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationCicolChr05:5380951..5399340
RNA-Seq ExpressionCcUC05G087680
SyntenyCcUC05G087680
Gene Ontology termsGO:0000160 - phosphorelay signal transduction system (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0000166 - nucleotide binding (molecular function)
InterPro domainsIPR045054 - Prolyl 4-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR036641 - HPT domain superfamily
IPR008207 - Signal transduction histidine kinase, phosphotransfer (Hpt) domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR003582 - ShKT domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAV63367.1 Hpt domain-containing protein/2OG-FeII_Oxy_3 domain-containing protein [Cephalotus follicularis]2.8e-16159.76Show/hide
Query:  SISSNKGRTRVDGKEFFRQVRSRLSYEQ-FAAFLANVKDLNSHKQTKEVGSLMLVDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVS
        S+S +K  T++   +F   +   +  +Q +++F+++   +    + K++ +         A+VYEGFLT LECDHLISLAK+ELKRSAVADNLSG+SK+S
Subjt:  SISSNKGRTRVDGKEFFRQVRSRLSYEQ-FAAFLANVKDLNSHKQTKEVGSLMLVDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVS

Query:  EVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQAS
        EVRTSSG FI K KDPIV GIEDKI+ WTFLPKENGEDIQVLRYE GQKY+ HFDYF DKVNIARGGHR+ATVL+YL++V KGGETVFPSAE S RR+ S
Subjt:  EVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQAS

Query:  ETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSP
         T+ DLS+C +KG+AVKPR+GDALLFFSLHPNA+PD SSLH GCPVIEGEKWSATKWIHVDSFD  +    NC D N SCE+WA LGECT N EYMVGSP
Subjt:  ETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK----MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNP
        ELPGYCR+SCK        + KP  +          TL +++   I       R  +    MEVG MQR+ + Y K L  EG LD Q+LQL QLQDESNP
Subjt:  ELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK----MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNP

Query:  TFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKR
         FV EVV+LFF+D+E LLN L  A+ QPSVDF ++D HVHQLKGSSSSI A R+KNA + FR+ CE+Q+ + C RCLQQ++QE+Y  +  L  L+ LE++
Subjt:  TFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKR

Query:  ILNAGGS
        I+ AGGS
Subjt:  ILNAGGS

KAF4393790.1 hypothetical protein G4B88_007776 [Cannabis sativa]3.8e-16663.99Show/hide
Query:  YEQFAAFLANVKDLNSHKQTKEVGSLMLVDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIA
        +E F+++  +   + +  + K+      + +   AF+YEGFLTDLECDHLISLAK+ELKRSAVAD+ SGES++SEVRTSSG FI KAKDPIV+GIEDKI+
Subjt:  YEQFAAFLANVKDLNSHKQTKEVGSLMLVDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIA

Query:  AWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLF
         WTFLPKENGEDIQVLRYE GQKY+ H+DYFADKVNI RGGHR+ATVLMYL++V KGGETVFP A E+ R + S T ED S+CAKKG+AVK R+GDALLF
Subjt:  AWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLF

Query:  FSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSN
        FSL P AIPDT SLH GCPVIEGEKWSATKWIHVDSFD  V     C D N SCERWA LGECT N EYMVGSPELPGYCR+SCK    H          
Subjt:  FSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSN

Query:  FFILHTNPFKTLFLNLIPNISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGT-LDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPS
           +H   FK                     MEVG MQRQWV Y K L  E   LDSQ+LQLLQLQDESNP FV EVV+LFF+DTE+LLN L  A+ Q  
Subjt:  FFILHTNPFKTLFLNLIPNISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGT-LDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPS

Query:  VDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIP-MDLGF
        VDFK++D HVHQLKGSSSSIGA RVKN C+ FR+ CE+Q+ D C RCLQQV+QE+Y VK KL  L+ LE++I+ AGGSIP M+LGF
Subjt:  VDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIP-MDLGF

RXH72084.1 hypothetical protein DVH24_025585 [Malus domestica]1.3e-16366.67Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEG LTD E DHLIS+AK+ELKRSAVADNLSG+SK+SEVRTSSG FI KAKDPIV+GIEDK+A WTFLPKENGEDIQVLRYE GQKY  H+DYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEE-SQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIH
        VNIARGGHR+ATVLMYL++V KGGETVFP AE+   RR+A+E    LS+CAKKGIAVKPR+GDALLFFSL P+A+PD +SLH GCPVIEGEKWSATKWIH
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEE-SQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIH

Query:  VDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKME
        VDSFD  +    NC D N SCERWA LGECT N EYMVG+P+LPGYCR+SCK      +  +    +F                   S  +   +   ME
Subjt:  VDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKME

Query:  VGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS
        VG MQRQWV Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFFED+E+LLN L  A+ QP+VDFK++D HVHQ KGSSSSIGA RVKNACI FR+
Subjt:  VGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS

Query:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
         CE+Q+ + C RC+QQV+ E+Y VK KL  L+A+E++I+ AGGSIPM
Subjt:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

RXH72084.1 hypothetical protein DVH24_025585 [Malus domestica]5.9e-1841.85Show/hide
Query:  SSLPPSRYSSIQSQVSDIGSSLAEDHDSDRDSI----------------------------------------RPRIPPGLLLASQTSTPRLTPHGSPPS
        S+LPPSR SS+QS   + GS   ED D++   +                                         P I   +LLASQTSTPRLTP GSPP 
Subjt:  SSLPPSRYSSIQSQVSDIGSSLAEDHDSDRDSI----------------------------------------RPRIPPGLLLASQTSTPRLTPHGSPPS

Query:  LSASGSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYEQFAAFLANVKDLNSHKQTKE
         SAS SP R+S   S  R+                          TRVDGKEFFRQVRSRLSYEQF+AFL NVK+LN +KQTKE
Subjt:  LSASGSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYEQFAAFLANVKDLNSHKQTKE

RXH72084.1 hypothetical protein DVH24_025585 [Malus domestica]5.1e-16366.3Show/hide
Query:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        V +   AFVYEGFLTD EC+HLIS+AK ELKRS+VADN+SG+SK+S+VRTSSG FI KAKDPIVSGIE+KIA WTFLPKENGE IQVLRYE+GQKYD H+
Subjt:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA
        DYF DKVN+ARGGHR+ATVLMYLS+V KGGETVFPSAEE+    +S + +DLS+CAKKGIAVKPRKGDALLFFSLHP AIPD  SLHGGCPVIEGEKWSA
Subjt:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA

Query:  TKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLAR
        TKWIHVDSFD +VR   NC D N +CERWA LGECT NPEYMVG+PELPGYCR+SC+       + T P        ++P +    + +          +
Subjt:  TKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLAR

Query:  FIK--MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKN
          K  MEVG +QRQ+V+Y   L  EG LDSQ+ QL QLQDESNP FV EVV+LFFED+E LLN L  A+ Q  VDFKK+D +VHQLKGSSSSIGA RVKN
Subjt:  FIK--MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKN

Query:  ACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
        AC+ FR+ CE+ + + C  CLQQV+QE+  VK KL  L+ LE++IL AGGS+PM
Subjt:  ACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

RXH80310.1 hypothetical protein DVH24_041457 [Malus domestica]1.4e-19738.97Show/hide
Query:  MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNASLSATADKLVQADEEKESLKKEN
        M  K S +S F LP+ VL+VLP DPFEQLDVARKITS+ALSTRVS LESESS LR KLAEKD ++ADL+ Q+ESL+ASLS +ADKL  A++EKE L KE 
Subjt:  MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNASLSATADKLVQADEEKESLKKEN

Query:  ASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVAR----------IQSQPKE--------VSSLPPSRYSSIQSQVSDIGSSLAEDHDSDR
        A L+NTV+KLSRDV+KLEVFRKTLM SL E+ ++ +   +VVA+            S+P +         S+LPPSR SS+QS  S  GS+  ED D+  
Subjt:  ASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVAR----------IQSQPKE--------VSSLPPSRYSSIQSQVSDIGSSLAEDHDSDR

Query:  DSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSAS---------GSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRL
        D+ RPRI   LLLASQTSTPR TP GSPP  SAS         GSP R SMSF+TSR +F++R    SS PSSH+GS S   GRTRVDGKEFFRQVRSRL
Subjt:  DSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSAS---------GSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRL

Query:  SYEQFAAFLANVKDLNSHKQTKEVG---------------------------------------------------------------------------
        SYEQF AFLANVK+LN HKQTKE+G                                                                           
Subjt:  SYEQFAAFLANVKDLNSHKQTKEVG---------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------SLML------------------------------------------------------------
                                            SL++                                                            
Subjt:  ------------------------------------SLML------------------------------------------------------------

Query:  ---------------------------------------VDYTAW---AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHK
                                               V   +W   AFVYEG LTD ECDHLIS+AK+ELKRSAVADNLSG+SK+SEVRTSSG FI K
Subjt:  ---------------------------------------VDYTAW---AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHK

Query:  AKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKK
        AKDPIV+GIEDK++ WTFLPKENGEDIQVLRYE GQKY+ H+DYF DKVNIARGGHR+ATVLMYL++V KGGETVFP AE   RR+A+E    LS+CAKK
Subjt:  AKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKK

Query:  GIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        GIAVKP++GDALLFFSL P+A+PD +SLH GCPVIEGEKWSATKWIHVDSFD  +    +C D N SCERWA LGECT N EYMVG+PELPGYCR+SCK+
Subjt:  GIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Query:  LGHHRRNQTKPTSNFFILHTNPFKT-LFLNLIP-NISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDT
        L               I     FK   FL  +  + S  +   +   MEVG MQRQWV Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFFED+
Subjt:  LGHHRRNQTKPTSNFFILHTNPFKT-LFLNLIP-NISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDT

Query:  EELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
        E+LLN L  A+ QP+VDFK++D HVHQ KGSSSSIGA RVKNACI FR+ CE+++ + C RC+QQV+ E+Y VK KL  L+A+E++I+ AGGSIPM
Subjt:  EELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

TrEMBL top hitse value%identityAlignment
A0A1Q3B5T1 Procollagen-proline 4-dioxygenase1.3e-16159.76Show/hide
Query:  SISSNKGRTRVDGKEFFRQVRSRLSYEQ-FAAFLANVKDLNSHKQTKEVGSLMLVDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVS
        S+S +K  T++   +F   +   +  +Q +++F+++   +    + K++ +         A+VYEGFLT LECDHLISLAK+ELKRSAVADNLSG+SK+S
Subjt:  SISSNKGRTRVDGKEFFRQVRSRLSYEQ-FAAFLANVKDLNSHKQTKEVGSLMLVDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVS

Query:  EVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQAS
        EVRTSSG FI K KDPIV GIEDKI+ WTFLPKENGEDIQVLRYE GQKY+ HFDYF DKVNIARGGHR+ATVL+YL++V KGGETVFPSAE S RR+ S
Subjt:  EVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQAS

Query:  ETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSP
         T+ DLS+C +KG+AVKPR+GDALLFFSLHPNA+PD SSLH GCPVIEGEKWSATKWIHVDSFD  +    NC D N SCE+WA LGECT N EYMVGSP
Subjt:  ETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK----MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNP
        ELPGYCR+SCK        + KP  +          TL +++   I       R  +    MEVG MQR+ + Y K L  EG LD Q+LQL QLQDESNP
Subjt:  ELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIK----MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNP

Query:  TFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKR
         FV EVV+LFF+D+E LLN L  A+ QPSVDF ++D HVHQLKGSSSSI A R+KNA + FR+ CE+Q+ + C RCLQQ++QE+Y  +  L  L+ LE++
Subjt:  TFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKR

Query:  ILNAGGS
        I+ AGGS
Subjt:  ILNAGGS

A0A498HP60 Procollagen-proline 4-dioxygenase6.5e-16466.67Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEG LTD E DHLIS+AK+ELKRSAVADNLSG+SK+SEVRTSSG FI KAKDPIV+GIEDK+A WTFLPKENGEDIQVLRYE GQKY  H+DYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEE-SQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIH
        VNIARGGHR+ATVLMYL++V KGGETVFP AE+   RR+A+E    LS+CAKKGIAVKPR+GDALLFFSL P+A+PD +SLH GCPVIEGEKWSATKWIH
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEE-SQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIH

Query:  VDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKME
        VDSFD  +    NC D N SCERWA LGECT N EYMVG+P+LPGYCR+SCK      +  +    +F                   S  +   +   ME
Subjt:  VDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKME

Query:  VGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS
        VG MQRQWV Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFFED+E+LLN L  A+ QP+VDFK++D HVHQ KGSSSSIGA RVKNACI FR+
Subjt:  VGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRS

Query:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
         CE+Q+ + C RC+QQV+ E+Y VK KL  L+A+E++I+ AGGSIPM
Subjt:  ACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

A0A498HP60 Procollagen-proline 4-dioxygenase2.8e-1841.85Show/hide
Query:  SSLPPSRYSSIQSQVSDIGSSLAEDHDSDRDSI----------------------------------------RPRIPPGLLLASQTSTPRLTPHGSPPS
        S+LPPSR SS+QS   + GS   ED D++   +                                         P I   +LLASQTSTPRLTP GSPP 
Subjt:  SSLPPSRYSSIQSQVSDIGSSLAEDHDSDRDSI----------------------------------------RPRIPPGLLLASQTSTPRLTPHGSPPS

Query:  LSASGSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYEQFAAFLANVKDLNSHKQTKE
         SAS SP R+S   S  R+                          TRVDGKEFFRQVRSRLSYEQF+AFL NVK+LN +KQTKE
Subjt:  LSASGSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYEQFAAFLANVKDLNSHKQTKE

A0A498HP60 Procollagen-proline 4-dioxygenase2.5e-16366.3Show/hide
Query:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        V +   AFVYEGFLTD EC+HLIS+AK ELKRS+VADN+SG+SK+S+VRTSSG FI KAKDPIVSGIE+KIA WTFLPKENGE IQVLRYE+GQKYD H+
Subjt:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA
        DYF DKVN+ARGGHR+ATVLMYLS+V KGGETVFPSAEE+    +S + +DLS+CAKKGIAVKPRKGDALLFFSLHP AIPD  SLHGGCPVIEGEKWSA
Subjt:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA

Query:  TKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLAR
        TKWIHVDSFD +VR   NC D N +CERWA LGECT NPEYMVG+PELPGYCR+SC+       + T P        ++P +    + +          +
Subjt:  TKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLAR

Query:  FIK--MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKN
          K  MEVG +QRQ+V+Y   L  EG LDSQ+ QL QLQDESNP FV EVV+LFFED+E LLN L  A+ Q  VDFKK+D +VHQLKGSSSSIGA RVKN
Subjt:  FIK--MEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKN

Query:  ACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
        AC+ FR+ CE+ + + C  CLQQV+QE+  VK KL  L+ LE++IL AGGS+PM
Subjt:  ACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

A0A498IAV7 Uncharacterized protein6.8e-19838.97Show/hide
Query:  MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNASLSATADKLVQADEEKESLKKEN
        M  K S +S F LP+ VL+VLP DPFEQLDVARKITS+ALSTRVS LESESS LR KLAEKD ++ADL+ Q+ESL+ASLS +ADKL  A++EKE L KE 
Subjt:  MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNASLSATADKLVQADEEKESLKKEN

Query:  ASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVAR----------IQSQPKE--------VSSLPPSRYSSIQSQVSDIGSSLAEDHDSDR
        A L+NTV+KLSRDV+KLEVFRKTLM SL E+ ++ +   +VVA+            S+P +         S+LPPSR SS+QS  S  GS+  ED D+  
Subjt:  ASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVAR----------IQSQPKE--------VSSLPPSRYSSIQSQVSDIGSSLAEDHDSDR

Query:  DSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSAS---------GSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRL
        D+ RPRI   LLLASQTSTPR TP GSPP  SAS         GSP R SMSF+TSR +F++R    SS PSSH+GS S   GRTRVDGKEFFRQVRSRL
Subjt:  DSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSAS---------GSPMRTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRL

Query:  SYEQFAAFLANVKDLNSHKQTKEVG---------------------------------------------------------------------------
        SYEQF AFLANVK+LN HKQTKE+G                                                                           
Subjt:  SYEQFAAFLANVKDLNSHKQTKEVG---------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------SLML------------------------------------------------------------
                                            SL++                                                            
Subjt:  ------------------------------------SLML------------------------------------------------------------

Query:  ---------------------------------------VDYTAW---AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHK
                                               V   +W   AFVYEG LTD ECDHLIS+AK+ELKRSAVADNLSG+SK+SEVRTSSG FI K
Subjt:  ---------------------------------------VDYTAW---AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHK

Query:  AKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKK
        AKDPIV+GIEDK++ WTFLPKENGEDIQVLRYE GQKY+ H+DYF DKVNIARGGHR+ATVLMYL++V KGGETVFP AE   RR+A+E    LS+CAKK
Subjt:  AKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKK

Query:  GIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        GIAVKP++GDALLFFSL P+A+PD +SLH GCPVIEGEKWSATKWIHVDSFD  +    +C D N SCERWA LGECT N EYMVG+PELPGYCR+SCK+
Subjt:  GIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Query:  LGHHRRNQTKPTSNFFILHTNPFKT-LFLNLIP-NISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDT
        L               I     FK   FL  +  + S  +   +   MEVG MQRQWV Y K L  EG LD Q+LQL QLQDESNP FV EVV+LFFED+
Subjt:  LGHHRRNQTKPTSNFFILHTNPFKT-LFLNLIP-NISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDT

Query:  EELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM
        E+LLN L  A+ QP+VDFK++D HVHQ KGSSSSIGA RVKNACI FR+ CE+++ + C RC+QQV+ E+Y VK KL  L+A+E++I+ AGGSIPM
Subjt:  EELLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPM

A0A7J6HHC7 Uncharacterized protein1.8e-16663.99Show/hide
Query:  YEQFAAFLANVKDLNSHKQTKEVGSLMLVDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIA
        +E F+++  +   + +  + K+      + +   AF+YEGFLTDLECDHLISLAK+ELKRSAVAD+ SGES++SEVRTSSG FI KAKDPIV+GIEDKI+
Subjt:  YEQFAAFLANVKDLNSHKQTKEVGSLMLVDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIA

Query:  AWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLF
         WTFLPKENGEDIQVLRYE GQKY+ H+DYFADKVNI RGGHR+ATVLMYL++V KGGETVFP A E+ R + S T ED S+CAKKG+AVK R+GDALLF
Subjt:  AWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLF

Query:  FSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSN
        FSL P AIPDT SLH GCPVIEGEKWSATKWIHVDSFD  V     C D N SCERWA LGECT N EYMVGSPELPGYCR+SCK    H          
Subjt:  FSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKALGHHRRNQTKPTSN

Query:  FFILHTNPFKTLFLNLIPNISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGT-LDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPS
           +H   FK                     MEVG MQRQWV Y K L  E   LDSQ+LQLLQLQDESNP FV EVV+LFF+DTE+LLN L  A+ Q  
Subjt:  FFILHTNPFKTLFLNLIPNISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGT-LDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEELLNKLRIAISQPS

Query:  VDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIP-MDLGF
        VDFK++D HVHQLKGSSSSIGA RVKN C+ FR+ CE+Q+ D C RCLQQV+QE+Y VK KL  L+ LE++I+ AGGSIP M+LGF
Subjt:  VDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIP-MDLGF

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.3e-8461Show/hide
Query:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH
        + +T  AF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E K+AAWTFLP+ENGE +Q+L YE GQKYD H
Subjt:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH

Query:  FDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS
        FDYF DK  +  GGHR+ATVLMYLSNV KGGETVFP+    + +      +  S CAK+G AVKPRKGDALLFF+LH N   D +SLHG CPVIEGEKWS
Subjt:  FDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS

Query:  ATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        AT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKA
Subjt:  ATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

F4JAU3 Prolyl 4-hydroxylase 21.3e-11679.37Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLTDLECDHLISLAK  L+RSAVADN +GES+VS+VRTSSG FI K KDPIVSGIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNIARGGHR+ATVL+YLSNV KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  DSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        DSFD I+    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKA
Subjt:  DSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Q8L970 Probable prolyl 4-hydroxylase 73.1e-9463.71Show/hide
Query:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        + +T   F+YEGFL+D ECDH I LAK +L++S VADN SGES  SEVRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HF
Subjt:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS
        DYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWS
Subjt:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS

Query:  ATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        AT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS +  GYCRKSCKA
Subjt:  ATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Q8LAN3 Probable prolyl 4-hydroxylase 41.0e-12183.33Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLT+LECDH++SLAKA LKRSAVADN SGESK SEVRTSSG FI K KDPIVSGIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNI RGGHRMAT+LMYLSNV KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPRKGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  DSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        DSFD IV    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKA
Subjt:  DSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

Q93W28 Uncharacterized protein At4g155452.6e-6957.14Show/hide
Query:  QGKGSNASG---FALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNASLSATADKLVQADEEKESLKK
        +G  S  +G   F LPD +LQVLPSDPFEQLDVARKITSIALSTRVS LESESS LR  LAEK++   +L+  +ESL ASLS    KL  AD EKE+L +
Subjt:  QGKGSNASG---FALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNASLSATADKLVQADEEKESLKK

Query:  ENASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVARIQSQPKEVSSLPPSRYSSIQSQVSDIGSSLAEDHDSDRDSIRPRIPPGLLLASQ
        ENASLSNTVK+L RDV+KLE FRKTLM+SLQ++ D +    +++A+  +   + +   PSR+SSIQSQ +      A   D++ D+ +P +   L L SQ
Subjt:  ENASLSNTVKKLSRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVARIQSQPKEVSSLPPSRYSSIQSQVSDIGSSLAEDHDSDRDSIRPRIPPGLLLASQ

Query:  TSTPRLTPHGSPPSLSASG---------SPMRTSMSFSTSRNIFED-RSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYEQFAAFLANVKDL
        T+TPRLTP GSPP LSASG         SP R S+SF+T+R +F+D RSS   S P        S   RTRVDGKEFFRQVRSRLSYEQF AFL NVKDL
Subjt:  TSTPRLTPHGSPPSLSASG---------SPMRTSMSFSTSRNIFED-RSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYEQFAAFLANVKDL

Query:  NSHKQTKE
        N+HKQT+E
Subjt:  NSHKQTKE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 29.0e-11879.37Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLTDLECDHLISLAK  L+RSAVADN +GES+VS+VRTSSG FI K KDPIVSGIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNIARGGHR+ATVL+YLSNV KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  DSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        DSFD I+    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKA
Subjt:  DSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase2.2e-9563.71Show/hide
Query:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        + +T   F+YEGFL+D ECDH I LAK +L++S VADN SGES  SEVRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HF
Subjt:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS
        DYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWS
Subjt:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS

Query:  ATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        AT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS +  GYCRKSCKA
Subjt:  ATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase8.8e-8960.3Show/hide
Query:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGES-----KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEY
        + +T   F+YEGFL+D ECDH I LAK +L++S VADN SGES      VS VR SS    +      D IVS +E K+AAWTFLP+ENGE +Q+L YE 
Subjt:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGES-----KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEY

Query:  GQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCP
        GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CP
Subjt:  GQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCP

Query:  VIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        V+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS +  GYCRKSCKA
Subjt:  VIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase9.1e-8661Show/hide
Query:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH
        + +T  AF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E K+AAWTFLP+ENGE +Q+L YE GQKYD H
Subjt:  VDYTAWAFVYEGFLTDLECDHLISLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH

Query:  FDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS
        FDYF DK  +  GGHR+ATVLMYLSNV KGGETVFP+    + +      +  S CAK+G AVKPRKGDALLFF+LH N   D +SLHG CPVIEGEKWS
Subjt:  FDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS

Query:  ATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        AT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKA
Subjt:  ATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.1e-12383.33Show/hide
Query:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK
        AFVYEGFLT+LECDH++SLAKA LKRSAVADN SGESK SEVRTSSG FI K KDPIVSGIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DK
Subjt:  AFVYEGFLTDLECDHLISLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADK

Query:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV
        VNI RGGHRMAT+LMYLSNV KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPRKGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHV
Subjt:  VNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHV

Query:  DSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA
        DSFD IV    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKA
Subjt:  DSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGGAAAGGATCTAACGCTTCCGGCTTCGCTCTTCCTGACGGAGTCTTGCAAGTCCTGCCTTCTGATCCTTTTGAACAGCTCGACGTGGCTCGCAAAATCACTTC
CATAGCTCTTTCCACTCGCGTCTCGTTGCTCGAATCAGAGTCCTCTGTTCTCCGTTCTAAACTTGCTGAGAAAGATGAGATCGTTGCTGACCTTCGGTTCCAGATCGAAT
CACTCAACGCTTCGCTCTCCGCGACTGCTGATAAACTCGTCCAGGCAGACGAGGAGAAGGAGAGCTTGAAGAAAGAGAATGCCTCGTTGTCGAACACTGTGAAGAAGCTT
AGTAGAGATGTTGCGAAGTTGGAGGTTTTCAGAAAGACGTTAATGCTATCACTTCAGGAGGAGGGAGATAGCTCTACAGAAGTTCCAGAAGTTGTTGCTAGGATACAGAG
CCAACCAAAAGAGGTTTCATCGTTGCCACCCTCTAGATATTCATCGATTCAGAGCCAGGTTTCTGACATAGGAAGTTCGTTAGCAGAGGATCATGATTCAGATAGAGATA
GTATACGACCTCGTATTCCGCCTGGCCTCCTGTTAGCGTCCCAGACAAGTACACCTCGGCTTACTCCCCATGGCTCTCCTCCTTCACTGTCAGCATCAGGATCCCCAATG
AGAACATCAATGTCATTTTCAACTTCCAGGAACATTTTTGAAGATAGATCTTCAGAATATTCTTCTGCGCCCTCAAGCCACTATGGCTCGATTTCCTCCAACAAAGGGAG
AACTCGGGTCGATGGGAAGGAGTTTTTCCGACAAGTCAGGAGCCGTTTGTCTTATGAACAGTTTGCTGCATTTTTAGCAAATGTGAAGGATCTAAATTCCCACAAGCAAA
CAAAAGAGGTTGGCTCTCTCATGCTAGTAGATTACACTGCGTGGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCCCTCGCTAAAGCG
GAGTTGAAGAGATCTGCTGTCGCGGATAATTTGTCCGGAGAGAGTAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATCCATAAAGCCAAGGATCCTATTGTTTC
TGGAATAGAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTACGATGCACACTTTGATT
ACTTTGCTGACAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCCAACGTAGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCAGAG
GAATCTCAAAGACGCCAGGCTTCGGAAACAAGTGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTTAAACCACGGAAAGGCGACGCTCTTCTCTTCTTCAGTCT
CCATCCAAATGCTATTCCAGACACAAGTAGTCTACATGGCGGATGCCCTGTGATTGAAGGTGAGAAATGGTCAGCAACAAAGTGGATTCATGTCGATTCTTTCGACACGA
TCGTGAGAGACCATACGAACTGCGCCGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTCGGTGAGTGCACGAATAACCCGGAGTATATGGTCGGATCTCCCGAGCTT
CCTGGCTACTGCAGGAAAAGTTGTAAGGCCCTCGGCCATCATCGCCGAAACCAAACCAAACCCACTTCAAATTTCTTCATTCTTCACACAAATCCCTTCAAAACCCTCTT
TCTTAATCTCATTCCTAACATCTCTACCCTACTTCCTTTAGCTCGTTTCATTAAAATGGAGGTCGGATTGATGCAGAGACAGTGGGTTCAGTATATTAAAGGCTTGCTCG
GTGAGGGTACGCTGGATAGTCAGTATTTACAGCTTTTGCAACTGCAAGATGAGAGTAATCCAACTTTCGTTTCTGAAGTGGTCACTCTTTTCTTTGAAGATACCGAGGAG
CTTCTCAATAAACTGAGAATCGCTATATCACAGCCATCTGTTGACTTCAAAAAGATTGATGATCATGTACACCAGCTGAAGGGCAGCAGTTCCAGCATAGGTGCACTTAG
AGTGAAAAATGCCTGCATTGACTTCCGGAGCGCCTGCGAGCAACAGAGTCCTGACTGGTGTTCAAGATGCCTGCAACAAGTAGAGCAAGAATTCTATGGTGTGAAGGAGA
AGCTCAGTTATTTATATGCACTGGAGAAACGGATTTTGAATGCTGGTGGATCCATTCCCATGGACTTGGGTTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATTCTTCGTCGTTATGCCTACCTACAATCGCAGTCCTAACAATGGCGGATGGCAAATGAAGGCTATGATTTGAAGTTACAAGAAACTACGTAGAAAGAGACAGGCTGGAA
AATGCAAGGGAAAGGATCTAACGCTTCCGGCTTCGCTCTTCCTGACGGAGTCTTGCAAGTCCTGCCTTCTGATCCTTTTGAACAGCTCGACGTGGCTCGCAAAATCACTT
CCATAGCTCTTTCCACTCGCGTCTCGTTGCTCGAATCAGAGTCCTCTGTTCTCCGTTCTAAACTTGCTGAGAAAGATGAGATCGTTGCTGACCTTCGGTTCCAGATCGAA
TCACTCAACGCTTCGCTCTCCGCGACTGCTGATAAACTCGTCCAGGCAGACGAGGAGAAGGAGAGCTTGAAGAAAGAGAATGCCTCGTTGTCGAACACTGTGAAGAAGCT
TAGTAGAGATGTTGCGAAGTTGGAGGTTTTCAGAAAGACGTTAATGCTATCACTTCAGGAGGAGGGAGATAGCTCTACAGAAGTTCCAGAAGTTGTTGCTAGGATACAGA
GCCAACCAAAAGAGGTTTCATCGTTGCCACCCTCTAGATATTCATCGATTCAGAGCCAGGTTTCTGACATAGGAAGTTCGTTAGCAGAGGATCATGATTCAGATAGAGAT
AGTATACGACCTCGTATTCCGCCTGGCCTCCTGTTAGCGTCCCAGACAAGTACACCTCGGCTTACTCCCCATGGCTCTCCTCCTTCACTGTCAGCATCAGGATCCCCAAT
GAGAACATCAATGTCATTTTCAACTTCCAGGAACATTTTTGAAGATAGATCTTCAGAATATTCTTCTGCGCCCTCAAGCCACTATGGCTCGATTTCCTCCAACAAAGGGA
GAACTCGGGTCGATGGGAAGGAGTTTTTCCGACAAGTCAGGAGCCGTTTGTCTTATGAACAGTTTGCTGCATTTTTAGCAAATGTGAAGGATCTAAATTCCCACAAGCAA
ACAAAAGAGGTTGGCTCTCTCATGCTAGTAGATTACACTGCGTGGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCCCTCGCTAAAGC
GGAGTTGAAGAGATCTGCTGTCGCGGATAATTTGTCCGGAGAGAGTAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATCCATAAAGCCAAGGATCCTATTGTTT
CTGGAATAGAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTACGATGCACACTTTGAT
TACTTTGCTGACAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCCAACGTAGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCAGA
GGAATCTCAAAGACGCCAGGCTTCGGAAACAAGTGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTTAAACCACGGAAAGGCGACGCTCTTCTCTTCTTCAGTC
TCCATCCAAATGCTATTCCAGACACAAGTAGTCTACATGGCGGATGCCCTGTGATTGAAGGTGAGAAATGGTCAGCAACAAAGTGGATTCATGTCGATTCTTTCGACACG
ATCGTGAGAGACCATACGAACTGCGCCGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTCGGTGAGTGCACGAATAACCCGGAGTATATGGTCGGATCTCCCGAGCT
TCCTGGCTACTGCAGGAAAAGTTGTAAGGCCCTCGGCCATCATCGCCGAAACCAAACCAAACCCACTTCAAATTTCTTCATTCTTCACACAAATCCCTTCAAAACCCTCT
TTCTTAATCTCATTCCTAACATCTCTACCCTACTTCCTTTAGCTCGTTTCATTAAAATGGAGGTCGGATTGATGCAGAGACAGTGGGTTCAGTATATTAAAGGCTTGCTC
GGTGAGGGTACGCTGGATAGTCAGTATTTACAGCTTTTGCAACTGCAAGATGAGAGTAATCCAACTTTCGTTTCTGAAGTGGTCACTCTTTTCTTTGAAGATACCGAGGA
GCTTCTCAATAAACTGAGAATCGCTATATCACAGCCATCTGTTGACTTCAAAAAGATTGATGATCATGTACACCAGCTGAAGGGCAGCAGTTCCAGCATAGGTGCACTTA
GAGTGAAAAATGCCTGCATTGACTTCCGGAGCGCCTGCGAGCAACAGAGTCCTGACTGGTGTTCAAGATGCCTGCAACAAGTAGAGCAAGAATTCTATGGTGTGAAGGAG
AAGCTCAGTTATTTATATGCACTGGAGAAACGGATTTTGAATGCTGGTGGATCCATTCCCATGGACTTGGGTTTCTAAAGTGACAGAATCCATGGAAACCAAGATTTTCC
ATTCGTTCCTGTGGAGTCTGCTTTCGCTTGTTCTCTAGGTTCTTAAATTGACTGTTACTTGTGCTTGTACAAATTCAAAACCTCTGCCCTTTTCACTCAGAACATCAGTG
TTTTCTACAGCTGATTTTGACTTTCGATTCTTCGTGTCTGTTTAAGCTGAACTGTTGTATTGCATGGAGTCCTAGAAGATTTTCTGGGTGCTTTCAAAGCGGCAAATACT
CTCTTATCTATCTTTGTTCCCTTAGTTATTCTGCTATTCAAA
Protein sequenceShow/hide protein sequence
MQGKGSNASGFALPDGVLQVLPSDPFEQLDVARKITSIALSTRVSLLESESSVLRSKLAEKDEIVADLRFQIESLNASLSATADKLVQADEEKESLKKENASLSNTVKKL
SRDVAKLEVFRKTLMLSLQEEGDSSTEVPEVVARIQSQPKEVSSLPPSRYSSIQSQVSDIGSSLAEDHDSDRDSIRPRIPPGLLLASQTSTPRLTPHGSPPSLSASGSPM
RTSMSFSTSRNIFEDRSSEYSSAPSSHYGSISSNKGRTRVDGKEFFRQVRSRLSYEQFAAFLANVKDLNSHKQTKEVGSLMLVDYTAWAFVYEGFLTDLECDHLISLAKA
ELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAE
ESQRRQASETSEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIVRDHTNCADENPSCERWAELGECTNNPEYMVGSPEL
PGYCRKSCKALGHHRRNQTKPTSNFFILHTNPFKTLFLNLIPNISTLLPLARFIKMEVGLMQRQWVQYIKGLLGEGTLDSQYLQLLQLQDESNPTFVSEVVTLFFEDTEE
LLNKLRIAISQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPDWCSRCLQQVEQEFYGVKEKLSYLYALEKRILNAGGSIPMDLGF