; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G014850 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G014850
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr05:22638924..22640263
RNA-Seq ExpressionLsi05G014850
SyntenyLsi05G014850
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7028942.1 putative prolyl 4-hydroxylase 4 [Cucurbita argyrosperma subsp. argyrosperma]9.0e-5392.68Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MAK C  NLLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRSAVAD+LSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG
        KAKDPIVSGIEDKIAAWTFLPKG
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG

XP_004134175.2 probable prolyl 4-hydroxylase 4 [Cucumis sativus]1.5e-5291.8Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+L CFNLLF F+LSIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRS+VADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPK
        KAKDPIVSGIEDKIAAWTFLPK
Subjt:  KAKDPIVSGIEDKIAAWTFLPK

XP_022137761.1 probable prolyl 4-hydroxylase 4 [Momordica charantia]5.3e-5392.62Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+ C  +LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAK ELKRSAVADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPK
        KAKDPI+SGIEDKIAAWTFLPK
Subjt:  KAKDPIVSGIEDKIAAWTFLPK

XP_023539189.1 probable prolyl 4-hydroxylase 4 [Cucurbita pepo subsp. pepo]4.5e-5292.62Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MAK C  NLLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRSAVAD+LSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPK
        KAKDPIVSGIEDKIAAWTFLPK
Subjt:  KAKDPIVSGIEDKIAAWTFLPK

XP_038903083.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida]8.1e-5494.26Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MAK  CFNLLFFFSLSIS LLRRASSSYAGSASSIVNPAKVKQISW+PRAFVYEGFLTDLECDHLISLAK ELKRS+VADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPK
        KAKDPIVSGIEDKIAAWTFLPK
Subjt:  KAKDPIVSGIEDKIAAWTFLPK

TrEMBL top hitse value%identityAlignment
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase7.4e-5391.8Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+L CFNLLF F+LSIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRS+VADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPK
        KAKDPIVSGIEDKIAAWTFLPK
Subjt:  KAKDPIVSGIEDKIAAWTFLPK

A0A6J1C7M6 Procollagen-proline 4-dioxygenase2.6e-5392.62Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MA+ C  +LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAK ELKRSAVADNLSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPK
        KAKDPI+SGIEDKIAAWTFLPK
Subjt:  KAKDPIVSGIEDKIAAWTFLPK

A0A6J1GPQ8 Procollagen-proline 4-dioxygenase2.2e-5292.62Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MAK C  NLLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRSAVAD+LSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPK
        KAKDPIVSGIEDKIAAWTFLPK
Subjt:  KAKDPIVSGIEDKIAAWTFLPK

A0A6J1GXF3 Procollagen-proline 4-dioxygenase1.1e-5190.98Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        M K C FNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISW+PRAFVYEGFLTDLE DHLIS+AK ELKRSAVADNLSG SKVSE+RTSSGAFI 
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPK
        K+KDPIVSGIEDKIAAWTFLPK
Subjt:  KAKDPIVSGIEDKIAAWTFLPK

A0A6J1I971 Procollagen-proline 4-dioxygenase2.2e-5292.62Show/hide
Query:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH
        MAK C  NLLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAK ELKRSAVAD+LSG SKVSEVRTSSGAFIH
Subjt:  MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPK
        KAKDPIVSGIEDKIAAWTFLPK
Subjt:  KAKDPIVSGIEDKIAAWTFLPK

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 64.1e-2454.87Show/hide
Query:  FFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRS-AVADNLSGASKVSEVRTSSGAFIHKAKDPIVSG
        +F + S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SG S+ SEVRTSSG F+ K +D IV+ 
Subjt:  FFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRS-AVADNLSGASKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPK
        +E K+AAWTFLP+
Subjt:  IEDKIAAWTFLPK

F4JAU3 Prolyl 4-hydroxylase 22.3e-3569.3Show/hide
Query:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS
        LL F  ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVADN +G S+VS+VRTSSG FI K KDPIVS
Subjt:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPK
        GIEDK++ WTFLPK
Subjt:  GIEDKIAAWTFLPK

Q24JN5 Prolyl 4-hydroxylase 53.7e-1753.09Show/hide
Query:  VKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLP
        V+ ISW PRA VY  FLT+ EC+HLISLAK  + +S V D  +G SK S VRTSSG F+ +  D +V  IE +I+ +TF+P
Subjt:  VKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLP

Q8L970 Probable prolyl 4-hydroxylase 74.8e-2551.94Show/hide
Query:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRT
        F+L F F+L +     +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SG S  SEVRT
Subjt:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRT

Query:  SSGAFIHKAKDPIVSGIEDKIAAWTFLPK
        SSG F+ K +D IVS +E K+AAWTFLP+
Subjt:  SSGAFIHKAKDPIVSGIEDKIAAWTFLPK

Q8LAN3 Probable prolyl 4-hydroxylase 45.2e-3570.18Show/hide
Query:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS
        LL  F    S+LL ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAK  LKRSAVADN SG SK SEVRTSSG FI K KDPIVS
Subjt:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPK
        GIEDKI+ WTFLPK
Subjt:  GIEDKIAAWTFLPK

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.6e-3669.3Show/hide
Query:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS
        LL F  ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVADN +G S+VS+VRTSSG FI K KDPIVS
Subjt:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPK
        GIEDK++ WTFLPK
Subjt:  GIEDKIAAWTFLPK

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase3.4e-2651.94Show/hide
Query:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRT
        F+L F F+L +     +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SG S  SEVRT
Subjt:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRT

Query:  SSGAFIHKAKDPIVSGIEDKIAAWTFLPK
        SSG F+ K +D IVS +E K+AAWTFLP+
Subjt:  SSGAFIHKAKDPIVSGIEDKIAAWTFLPK

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.4e-1945.99Show/hide
Query:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGAS-----KV
        F+L F F+L +     +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SG S      V
Subjt:  FNLLFFFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGAS-----KV

Query:  SEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPK
        S VR SS    +      D IVS +E K+AAWTFLP+
Subjt:  SEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPK

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase2.9e-2554.87Show/hide
Query:  FFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRS-AVADNLSGASKVSEVRTSSGAFIHKAKDPIVSG
        +F + S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SG S+ SEVRTSSG F+ K +D IV+ 
Subjt:  FFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRS-AVADNLSGASKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPK
        +E K+AAWTFLP+
Subjt:  IEDKIAAWTFLPK

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.7e-3670.18Show/hide
Query:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS
        LL  F    S+LL ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAK  LKRSAVADN SG SK SEVRTSSG FI K KDPIVS
Subjt:  LLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPK
        GIEDKI+ WTFLPK
Subjt:  GIEDKIAAWTFLPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAATTGTGCTGTTTCAATCTACTGTTTTTCTTCTCATTATCGATCTCATTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCTCGGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCCCTCGCTAAAACGGAGTTGA
AGAGATCTGCTGTCGCGGATAATTTGTCCGGAGCGAGCAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGGTATAATACCTCTAACAGCAGGCGCAAATTTCTTGTCAAATCCCATAGTCTTTAGCAACTTTTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAATTGTGCTGTTTCAATCTACTGTTTTTCTTCTCATTATCGATCTCATTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCTCGGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCCCTCGCTAAAACGGAGTTGA
AGAGATCTGCTGTCGCGGATAATTTGTCCGGAGCGAGCAAAGTCAGCGAGGTCCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGGTATAATACCTCTAACAGCAGGCGCAAATTTCTTGTCAAATCCCATAGTCTTTAGCAACTTTTCCTAG
Protein sequenceShow/hide protein sequence
MAKLCCFNLLFFFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKTELKRSAVADNLSGASKVSEVRTSSGAFIHKAKDPIVSGI
EDKIAAWTFLPKGIIPLTAGANFLSNPIVFSNFS