; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007351 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007351
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationscaffold7:19721396..19724338
RNA-Seq ExpressionSpg007351
SyntenySpg007351
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR039575 - Prolyl 3-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155191.1 uncharacterized protein LOC111022327 [Momordica charantia]5.2e-4284.76Show/hide
Query:  MGDEAESRQ--RRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDV
        MGDE E+RQ  RRRLILENFLT EECRELEFIHKSCCTVGYRPSVFSTTLLHLVA+NS HLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLI W    
Subjt:  MGDEAESRQ--RRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDV

Query:  LKNWN
           W+
Subjt:  LKNWN

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata]3.4e-4184.47Show/hide
Query:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK
        MGDEAE  QRRRL LENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNS HLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLI W      
Subjt:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK

Query:  NWN
         W+
Subjt:  NWN

XP_022942570.1 uncharacterized protein LOC111447569 isoform X2 [Cucurbita moschata]3.4e-4184.47Show/hide
Query:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK
        MGDEAE  QRRRL LENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNS HLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLI W      
Subjt:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK

Query:  NWN
         W+
Subjt:  NWN

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo]5.2e-4285.44Show/hide
Query:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK
        MGDEAE  QRRRLILENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNS HLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLI W      
Subjt:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK

Query:  NWN
         W+
Subjt:  NWN

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida]5.7e-4182.86Show/hide
Query:  MGDEAES--RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDV
        MGDE ES  R+RRRLILENFLT EECRELEFIHKSCCTVGYRP+VFSTTLLHLVA+NS HLIMPFVPIRERLKEKAEEFFGC YELFVEFTGLI W    
Subjt:  MGDEAES--RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDV

Query:  LKNWN
           W+
Subjt:  LKNWN

TrEMBL top hitse value%identityAlignment
A0A6J1DQY8 Procollagen-proline 3-dioxygenase2.5e-4284.76Show/hide
Query:  MGDEAESRQ--RRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDV
        MGDE E+RQ  RRRLILENFLT EECRELEFIHKSCCTVGYRPSVFSTTLLHLVA+NS HLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLI W    
Subjt:  MGDEAESRQ--RRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDV

Query:  LKNWN
           W+
Subjt:  LKNWN

A0A6J1FRP8 Procollagen-proline 3-dioxygenase1.6e-4184.47Show/hide
Query:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK
        MGDEAE  QRRRL LENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNS HLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLI W      
Subjt:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK

Query:  NWN
         W+
Subjt:  NWN

A0A6J1FWJ2 uncharacterized protein LOC111447569 isoform X21.6e-4184.47Show/hide
Query:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK
        MGDEAE  QRRRL LENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNS HLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLI W      
Subjt:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK

Query:  NWN
         W+
Subjt:  NWN

A0A6J1JHD9 uncharacterized protein LOC111487019 isoform X21.8e-4083.5Show/hide
Query:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK
        MGDEAE  QR RLILENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNS  LIMPFV IRERLKEKAEEFFGCEYELFVEFTGLI W      
Subjt:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK

Query:  NWN
         W+
Subjt:  NWN

A0A6J1JQV6 Procollagen-proline 3-dioxygenase1.8e-4083.5Show/hide
Query:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK
        MGDEAE  QR RLILENFLTLEECRELEFIHKSCCTVGYRP VFSTTLLHLV SNS  LIMPFV IRERLKEKAEEFFGCEYELFVEFTGLI W      
Subjt:  MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLK

Query:  NWN
         W+
Subjt:  NWN

SwissProt top hitse value%identityAlignment
P55852 Small ubiquitin-related modifier 14.8e-0695.83Show/hide
Query:  EQTPDELEMEDGDEIDAMLHQTGG
        EQTPDEL+MEDGDEIDAMLHQTGG
Subjt:  EQTPDELEMEDGDEIDAMLHQTGG

P55857 Small ubiquitin-related modifier 12.1e-06100Show/hide
Query:  EQTPDELEMEDGDEIDAMLHQTGG
        EQTPDELEMEDGDEIDAMLHQTGG
Subjt:  EQTPDELEMEDGDEIDAMLHQTGG

Q9FLP6 Small ubiquitin-related modifier 24.3e-0796.15Show/hide
Query:  EQTPDELEMEDGDEIDAMLHQTGGGS
        EQTPDELEMEDGDEIDAMLHQTGGG+
Subjt:  EQTPDELEMEDGDEIDAMLHQTGGGS

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.7e-3367.71Show/hide
Query:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLKNWN
        ++  RLIL NFL+  EC+ELE IHKS  T+GYRP+VFSTTL HL+A+NSPHLI+PFV IRERLKEK EE FGCEYELF+EFTGLI W       W+
Subjt:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLKNWN

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.7e-3367.71Show/hide
Query:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLKNWN
        ++  RLIL NFL+  EC+ELE IHKS  T+GYRP+VFSTTL HL+A+NSPHLI+PFV IRERLKEK EE FGCEYELF+EFTGLI W       W+
Subjt:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLKNWN

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.7e-3367.71Show/hide
Query:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLKNWN
        ++  RLIL NFL+  EC+ELE IHKS  T+GYRP+VFSTTL HL+A+NSPHLI+PFV IRERLKEK EE FGCEYELF+EFTGLI W       W+
Subjt:  RQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLKNWN

AT5G55160.1 small ubiquitin-like modifier 23.1e-0896.15Show/hide
Query:  EQTPDELEMEDGDEIDAMLHQTGGGS
        EQTPDELEMEDGDEIDAMLHQTGGG+
Subjt:  EQTPDELEMEDGDEIDAMLHQTGGGS

AT5G55160.2 small ubiquitin-like modifier 23.1e-0896.15Show/hide
Query:  EQTPDELEMEDGDEIDAMLHQTGGGS
        EQTPDELEMEDGDEIDAMLHQTGGG+
Subjt:  EQTPDELEMEDGDEIDAMLHQTGGGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACGAAGCGGAGAGCAGGCAGCGGCGGCGTCTGATTCTCGAAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGTTGCTGTACGGT
GGGTTATAGACCAAGTGTCTTCTCCACCACTCTTTTGCATCTTGTTGCCTCTAATTCTCCTCACTTGATCATGCCTTTTGTTCCGATCAGAGAGAGGTTGAAGGAGAAAG
CGGAGGAATTCTTTGGCTGTGAATATGAACTCTTCGTTGAGTTCACTGGCTTGATAAGGTGGAATGTGGACGTTTTAAAAAACTGGAATGGGTATCGCTTTGTGGAGCAA
ACTCCTGATGAGCTAGAGATGGAGGACGGGGATGAGATTGATGCAATGTTGCACCAGACTGGTGGAGGATCTACTCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACGAAGCGGAGAGCAGGCAGCGGCGGCGTCTGATTCTCGAAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGTTGCTGTACGGT
GGGTTATAGACCAAGTGTCTTCTCCACCACTCTTTTGCATCTTGTTGCCTCTAATTCTCCTCACTTGATCATGCCTTTTGTTCCGATCAGAGAGAGGTTGAAGGAGAAAG
CGGAGGAATTCTTTGGCTGTGAATATGAACTCTTCGTTGAGTTCACTGGCTTGATAAGGTGGAATGTGGACGTTTTAAAAAACTGGAATGGGTATCGCTTTGTGGAGCAA
ACTCCTGATGAGCTAGAGATGGAGGACGGGGATGAGATTGATGCAATGTTGCACCAGACTGGTGGAGGATCTACTCATTGA
Protein sequenceShow/hide protein sequence
MGDEAESRQRRRLILENFLTLEECRELEFIHKSCCTVGYRPSVFSTTLLHLVASNSPHLIMPFVPIRERLKEKAEEFFGCEYELFVEFTGLIRWNVDVLKNWNGYRFVEQ
TPDELEMEDGDEIDAMLHQTGGGSTH