; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg21976 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg21976
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationCarg_Chr04:130459..134462
RNA-Seq ExpressionCarg21976
SyntenyCarg21976
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR039575 - Prolyl 3-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599868.1 Prolyl 3-hydroxylase 1, partial [Cucurbita argyrosperma subsp. sororia]1.2e-16681.07Show/hide
Query:  MPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT----
        MPFVSIRERLKEKAEEFFGCEYELFVEFTGLI                                       SWTRGARIGWHSDDNRPYLKQREFT    
Subjt:  MPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT----

Query:  ----------------------------DCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFS
                                    DCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFS
Subjt:  ----------------------------DCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFS

Query:  PKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGV
        PKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGV
Subjt:  PKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGV

Query:  DYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
        DYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
Subjt:  DYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

KAG7030552.1 hypothetical protein SDJN02_04589 [Cucurbita argyrosperma subsp. argyrosperma]5.7e-238100Show/hide
Query:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL
        MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL
Subjt:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL

Query:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFTDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL
        LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFTDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL
Subjt:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFTDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL

Query:  LSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFL
        LSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFL
Subjt:  LSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFL

Query:  YWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYS
        YWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYS
Subjt:  YWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYS

Query:  VPYGS
        VPYGS
Subjt:  VPYGS

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata]3.9e-20283.07Show/hide
Query:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL
        MKMGDEAEINQRRRL LENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL

Query:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL
                                         SWTRGARIGWHSDDNRPYLKQREFT                                DCVMYTADSL
Subjt:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL

Query:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
        NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL+SLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
Subjt:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD

Query:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
        LFSQDVQLVRGNKIFSQKFDSILHALQVV FLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
Subjt:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA

Query:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
        AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
Subjt:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

XP_022989994.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita maxima]5.2e-19982.15Show/hide
Query:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL
        MKMGDEAEINQR RL LENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSA LIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL

Query:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL
                                         SWTRGARIGWHSDDNRPYLKQREFT                                DCVMYTADSL
Subjt:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL

Query:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
        NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
Subjt:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD

Query:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
        LFSQDVQLVRGNKIFSQKFDSILHALQVV FLYWKGKELDST+SKEDSSYAEGLSPKRNVGVD+FKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
Subjt:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA

Query:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
        AVA AWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
Subjt:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo]1.9e-20182.84Show/hide
Query:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL
        MKMGDEAEINQRRRL LENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL

Query:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL
                                         SWTRGARIGWHSDDNRPYLKQREFT                                DCVMYTADSL
Subjt:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL

Query:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
        NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
Subjt:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD

Query:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
        LFSQDVQLVRGNKIFSQKFDSILHALQVV FLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKD+ALAESVFLYASSDVKEKQHRLGWAKLA
Subjt:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA

Query:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
        AVAAAWEDYASNLRRELLRSF HWRTSQSIYSVPYGS
Subjt:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

TrEMBL top hitse value%identityAlignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase1.1e-16269.66Show/hide
Query:  MGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLM
        M D AE  QRRRL LENFL+ EECRELEFIHKSC TVGYRP VFSTTLLHLV +NSAHLI+PFV IRE+LKEKAEEFFGC YELFVEFTGLI L      
Subjt:  MGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLM

Query:  CLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSLNV
                          H    L    S   WTRGA IGWHSDDNRPYLKQREF+                                DCVMYTAD+ NV
Subjt:  CLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSLNV

Query:  HSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLF
        HSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARL ALGY +YFP DH  SEYPDLF
Subjt:  HSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLF

Query:  SQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAV
         QDVQLV G+KIF QKF++ILH LQVV FL WKGKELDSTN  EDSSYAE LSPKRNVGV YFKSEFSK+D LAESVF  A+SD KE Q  LGW KL A 
Subjt:  SQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAV

Query:  AAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
        AAAWE YAS LRRELL SF+HWR  QSIYSV   S
Subjt:  AAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

A0A1S3C4U3 uncharacterized protein LOC103496668 isoform X29.1e-16574.26Show/hide
Query:  MGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLM
        M D AE  QRRRL LENFL+ EECRELEFIHKSCCTVGYRP V STTLLHLV +NSAHLI+PFV IRE+LKEKAEEFFGC YELFVEFTGLI        
Subjt:  MGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLM

Query:  CLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFTDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLS
                                       SWTRGA IGWHSDDNRPYLKQREF+DCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLS
Subjt:  CLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFTDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLS

Query:  LLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYW
        LLSQS LHDR  +SCLPQPPSCNMYWFSP++DPNFKFGFDICWARLHALGY IYFP DH  SEYPDLFSQDVQLV G+KIF QKF++ILH LQVV FL W
Subjt:  LLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYW

Query:  KGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKL-AAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSV
        KGKELD+TN  EDS YAE LSPKRNVGV YFKSEFSK+D LAESVF  A+S  KE QH LGW KL  A AAAWEDYAS LRRELL SF+HWR  QSIYSV
Subjt:  KGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKL-AAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSV

Query:  PYGS
           S
Subjt:  PYGS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase1.8e-16069.04Show/hide
Query:  MGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLM
        M D AE  QRRRL LENFL+ EECRELEFIHKSCCTVGYRP V STTLLHLV +NSAHLI+PFV IRE+LKEKAEEFFGC YELFVEFTGLI        
Subjt:  MGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLM

Query:  CLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSLNV
                                       SWTRGA IGWHSDDNRPYLKQREF+                                DCVMY ADS NV
Subjt:  CLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSLNV

Query:  HSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLF
        HSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDR P+SCLPQPPSCNMYWFSP+DDPNFKFGFDICWARLHALGY IYFP DH  SEYPDLF
Subjt:  HSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLF

Query:  SQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKL-AA
        SQDVQLV G+KIF QKF++ILH LQVV FL WKGKELD+TN  EDS YAE LSPKRNVGV YFKSEFSK+D LAESVF  A+S  KE QH LGW KL  A
Subjt:  SQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKL-AA

Query:  VAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
         AAAWEDYAS LRRELL SF+HWR  QSIYSV   S
Subjt:  VAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

A0A6J1FRP8 Procollagen-proline 3-dioxygenase1.9e-20283.07Show/hide
Query:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL
        MKMGDEAEINQRRRL LENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL

Query:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL
                                         SWTRGARIGWHSDDNRPYLKQREFT                                DCVMYTADSL
Subjt:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL

Query:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
        NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL+SLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
Subjt:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD

Query:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
        LFSQDVQLVRGNKIFSQKFDSILHALQVV FLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
Subjt:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA

Query:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
        AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
Subjt:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

A0A6J1JQV6 Procollagen-proline 3-dioxygenase2.5e-19982.15Show/hide
Query:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL
        MKMGDEAEINQR RL LENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSA LIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFL

Query:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL
                                         SWTRGARIGWHSDDNRPYLKQREFT                                DCVMYTADSL
Subjt:  LMCLRLKWVLSLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSL

Query:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
        NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
Subjt:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD

Query:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
        LFSQDVQLVRGNKIFSQKFDSILHALQVV FLYWKGKELDST+SKEDSSYAEGLSPKRNVGVD+FKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
Subjt:  LFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA

Query:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
        AVA AWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
Subjt:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.3e-9145.28Show/hide
Query:  RLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLMCLRLKWVLSLF
        RL L NFL+  EC+ELE IHKS  T+GYRP VFSTTL HL+ +NS HLI+PFVSIRERLKEK EE FGCEYELF+EFTGLI                   
Subjt:  RLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLMCLRLKWVLSLF

Query:  AFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSLNVHSVDEVTSGER
                            SW +GA IGWHSDDNR YLKQR+F                                 D +MYTAD  N+HSVDEVT GER
Subjt:  AFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT--------------------------------DCVMYTADSLNVHSVDEVTSGER

Query:  LTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVR
        LTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+ ++  Q  DHS      L    +QL +
Subjt:  LTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVR

Query:  GNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDY
        G K+ ++KF +ILHALQVV F +WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F Y+ S  ++++  L    +A    +WE+Y
Subjt:  GNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDY

Query:  ASNLRRELLRSFAHWRTSQSIYSV
        +  L +ELL S   W+T Q+I+ V
Subjt:  ASNLRRELLRSFAHWRTSQSIYSV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.3e-8243.92Show/hide
Query:  RLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLMCLRLKWVLSLF
        RL L NFL+  EC+ELE IHKS  T+GYRP VFSTTL HL+ +NS HLI+PFVSIRERLKEK EE FGCEYELF+EFTGLI                   
Subjt:  RLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLMCLRLKWVLSLF

Query:  AFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT------------DCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLL
                            SW +GA IGWHSDDNR YLKQR+F             D +MYTAD  N+HSVDEVT GERLTL LWF+RDSSHDED+KLL
Subjt:  AFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT------------DCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLL

Query:  SLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLF
        S LSQ                                  FD+C ARLH LG+ ++  Q  DHS      L    +QL +G K+ ++KF +ILHALQVV F
Subjt:  SLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLF

Query:  LYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSI
         +WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F Y+ S  ++++  L    +A    +WE+Y+  L +ELL S   W+T Q+I
Subjt:  LYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSI

Query:  YSV
        + V
Subjt:  YSV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-9347.52Show/hide
Query:  RLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLMCLRLKWVLSLF
        RL L NFL+  EC+ELE IHKS  T+GYRP VFSTTL HL+ +NS HLI+PFVSIRERLKEK EE FGCEYELF+EFTGLI                   
Subjt:  RLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLMCLRLKWVLSLF

Query:  AFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT------------DCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLL
                            SW +GA IGWHSDDNR YLKQR+F             D +MYTAD  N+HSVDEVT GERLTL LWF+RDSSHDED+KLL
Subjt:  AFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFT------------DCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLL

Query:  SLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVL
        S LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+ ++  Q  DHS      L    +QL +G K+ ++KF +ILHALQVV 
Subjt:  SLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVL

Query:  FLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQS
        F +WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F Y+ S  ++++  L    +A    +WE+Y+  L +ELL S   W+T Q+
Subjt:  FLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQS

Query:  IYSV
        I+ V
Subjt:  IYSV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.1e-5145.49Show/hide
Query:  MYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ-
        MYTAD  N+HSVDEVT GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+ ++  Q 
Subjt:  MYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ-

Query:  -DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKE
         DHS      L    +QL +G K+ ++KF +ILHALQVV F +WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F Y+ S  ++
Subjt:  -DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKE

Query:  KQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSV
        ++  L    +A    +WE+Y+  L +ELL S   W+T Q+I+ V
Subjt:  KQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGGGAGACGAAGCGGAGATCAATCAGCGGCGGCGTCTCACTCTGGAAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTG
TACGGTGGGTTATAGACCATACGTCTTCTCCACCACTCTTTTGCATCTTGTTGTCTCTAATTCTGCTCATTTGATCATGCCTTTTGTTTCGATTAGAGAGCGGTTGAAGG
AGAAAGCGGAGGAGTTCTTTGGCTGTGAATATGAGCTCTTCGTCGAGTTCACTGGCTTGATAAGGTTATTCTTTTTCCTCCTTATGTGTTTAAGGCTTAAATGGGTGTTG
TCTCTTTTTGCTTTCAGAGAGTCTGACGGCCATGTTACTGGAGTACTTTATAATCAGCCCAGTCTGTTCAGCTGGACCAGGGGAGCAAGGATTGGATGGCATAGTGACGA
CAACCGGCCCTATCTAAAACAACGTGAATTTACAGATTGTGTGATGTACACGGCTGACAGCCTCAATGTTCATTCTGTTGATGAGGTAACTAGTGGAGAAAGACTTACGC
TGACATTATGGTTCACCCGTGACAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACACGATCGTCTTCCTGACTCATGCCTACCTCAG
CCTCCATCCTGTAATATGTATTGGTTTTCACCAAAAGACGATCCAAATTTCAAGTTCGGTTTTGATATATGTTGGGCAAGACTGCATGCGCTAGGATACGGCATTTATTT
TCCTCAGGACCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGATGTACAATTAGTACGTGGTAATAAGATCTTCTCTCAGAAGTTTGATAGCATTTTGCATGCAC
TTCAGGTAGTGCTATTTCTATATTGGAAAGGCAAAGAATTGGATTCTACTAACTCCAAGGAAGATTCAAGCTATGCAGAAGGTTTATCTCCAAAGAGAAATGTGGGAGTC
GATTATTTTAAATCCGAGTTCTCAAAGGACGATGCACTGGCGGAGTCAGTCTTCTTGTATGCTAGTTCTGATGTCAAGGAGAAGCAACACCGGTTGGGGTGGGCTAAGCT
TGCTGCAGTAGCAGCAGCTTGGGAAGATTATGCTTCCAATTTAAGGAGAGAACTCCTTCGGAGCTTCGCCCATTGGAGAACCAGTCAATCCATATACAGTGTTCCATATG
GTAGTTGA
mRNA sequenceShow/hide mRNA sequence
GTTTATGAGAGAGGGATACATGGAGAAGCAGCAGTCGCCTCGCCATAAAGGATTTCATTCGATTGTTGTTGCAAATTGGGATATCGCTTTCGGTATATTTTTATGGTCAA
GGGCATTCCATCCGATTACTGCCATTGAAACGAGAGACGGAGAATTGGATGAAAATGGGAGACGAAGCGGAGATCAATCAGCGGCGGCGTCTCACTCTGGAAAATTTCTT
AACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGTGGGTTATAGACCATACGTCTTCTCCACCACTCTTTTGCATCTTGTTGTCTCTAATT
CTGCTCATTTGATCATGCCTTTTGTTTCGATTAGAGAGCGGTTGAAGGAGAAAGCGGAGGAGTTCTTTGGCTGTGAATATGAGCTCTTCGTCGAGTTCACTGGCTTGATA
AGGTTATTCTTTTTCCTCCTTATGTGTTTAAGGCTTAAATGGGTGTTGTCTCTTTTTGCTTTCAGAGAGTCTGACGGCCATGTTACTGGAGTACTTTATAATCAGCCCAG
TCTGTTCAGCTGGACCAGGGGAGCAAGGATTGGATGGCATAGTGACGACAACCGGCCCTATCTAAAACAACGTGAATTTACAGATTGTGTGATGTACACGGCTGACAGCC
TCAATGTTCATTCTGTTGATGAGGTAACTAGTGGAGAAAGACTTACGCTGACATTATGGTTCACCCGTGACAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTT
TCACAAAGCCATTTACACGATCGTCTTCCTGACTCATGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAAAAGACGATCCAAATTTCAAGTTCGGTTT
TGATATATGTTGGGCAAGACTGCATGCGCTAGGATACGGCATTTATTTTCCTCAGGACCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGATGTACAATTAGTAC
GTGGTAATAAGATCTTCTCTCAGAAGTTTGATAGCATTTTGCATGCACTTCAGGTAGTGCTATTTCTATATTGGAAAGGCAAAGAATTGGATTCTACTAACTCCAAGGAA
GATTCAAGCTATGCAGAAGGTTTATCTCCAAAGAGAAATGTGGGAGTCGATTATTTTAAATCCGAGTTCTCAAAGGACGATGCACTGGCGGAGTCAGTCTTCTTGTATGC
TAGTTCTGATGTCAAGGAGAAGCAACACCGGTTGGGGTGGGCTAAGCTTGCTGCAGTAGCAGCAGCTTGGGAAGATTATGCTTCCAATTTAAGGAGAGAACTCCTTCGGA
GCTTCGCCCATTGGAGAACCAGTCAATCCATATACAGTGTTCCATATGGTAGTTGACTCTTCCACTTCTGGGATAGTGGCAGTCTCAAATGCTAAAAGTAGCTGAGCTTC
AAGGTTAGTTATGGGCCTTTGTACAGCTTAACTAGATTTTACTTGTCCCAAGTGTGCAATCCTTTGTGTTATATTCTTACCCTTTTCAACGATTTAGCCATTTTTCACAG
TTTTTAAAGGGGTATCATTTCTGTTTCTAGAAAACTTGGTTCTCGTTTTAGTTTCTTAGTATCCTTCTCA
Protein sequenceShow/hide protein sequence
MKMGDEAEINQRRRLTLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLIRLFFFLLMCLRLKWVL
SLFAFRESDGHVTGVLYNQPSLFSWTRGARIGWHSDDNRPYLKQREFTDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQ
PPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVLFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGV
DYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS