CuGenDBv2

Carg17382 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg17382
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Procollagen-proline 4-dioxygenase
Genome location: Carg_Chr06:9013375..9016161
RNA-Seq Expression: Carg17382
Synteny: Carg17382
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
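The genome location field above uses a chromosome:start..end form. A minimal Python sketch for splitting it into its parts follows. The function name parse_location is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something this record states.

```python
def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    # rpartition keeps any ':' that might appear inside the chromosome name.
    chrom, _, span = location.rpartition(":")
    start_text, _, end_text = span.partition("..")
    return chrom, int(start_text), int(end_text)


chrom, start, end = parse_location("Carg_Chr06:9013375..9016161")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 2787 bp on Carg_Chr06, which includes any introns and untranslated regions as well as the coding sequence.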


Homology
GenBank top hits | e value | % identity
KAG6597483.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia] | 1.1e-130 | 82.12
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK         RY                            + +YL        T F  + ESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFD IVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

KAG7028942.1 putative prolyl 4-hydroxylase 4 [Cucurbita argyrosperma subsp. argyrosperma] | 1.5e-146 | 100
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGRYICLYLLTYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSA
        KAKDPIVSGIEDKIAAWTFLPKGRYICLYLLTYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSA
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGRYICLYLLTYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSA

Query:  TKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        TKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
Subjt:  TKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

XP_022954026.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata] | 2.5e-130 | 81.79
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK         RY                            + +YL        T F  + ESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFD IVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_022973641.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima] | 1.1e-130 | 82.12
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK         RY                            + +YL        T F  + ESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFD IVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_023539189.1 probable prolyl 4-hydroxylase 4 [Cucurbita pepo subsp. pepo] | 1.1e-130 | 82.12
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK         RY                            + +YL        T F  + ESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFD IVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

TrEMBL top hits | e value | % identity
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase | 3.1e-118 | 74.17
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MA+    NLLF+ ++SIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSG+SKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK         RY                            + +YL        T F  + ESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWI VDSFDM+V DHT+C D N SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

A0A1S3AWU7 Procollagen-proline 4-dioxygenase | 8.2e-119 | 74.5
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+ +++IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK         RY                            + +YL        T F  + ESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD I  DHT+C D N SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

A0A6J1C7M6 Procollagen-proline 4-dioxygenase | 1.9e-123 | 76.82
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MA+FCSC+LLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPK         RY                            + +YL        T F  + ESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNAVPDT SLHGGCPVIEGEKWSATKWIHVDSFD I+ DHT+C D +ASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1GPQ8 Procollagen-proline 4-dioxygenase | 1.2e-130 | 81.79
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK         RY                            + +YL        T F  + ESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFD IVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1I971 Procollagen-proline 4-dioxygenase | 5.5e-131 | 82.12
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK         RY                            + +YL        T F  + ESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFD IVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

SwissProt top hits | e value | % identity
F4J0A8 Probable prolyl 4-hydroxylase 6 | 1.6e-63 | 48.28
Query:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        + L+ S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ 
Subjt:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKGRYICLYLLTY-------------FIKSA--------------ESQRRQASET-------------NEDLSDCAKKGIAVKPRKGDA
        +E K+AAWTFLP+     L +L Y             + K A               S   +  ET             ++  S CAK+G AVKPRKGDA
Subjt:  IEDKIAAWTFLPKGRYICLYLLTY-------------FIKSA--------------ESQRRQASET-------------NEDLSDCAKKGIAVKPRKGDA

Query:  LLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        LLFF+LH N   D  SLHG CPVIEGEKWSAT+WIHV SF         CVD++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  LLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

F4JAU3 Prolyl 4-hydroxylase 2 | 8.5e-89 | 58.76
Query:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        +L ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVAD+ +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRKGD
        DK++ WTFLPK         RY                            + LYL        T F  + E  RR  SE  +DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+L  +A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD I++   +C D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK C
Subjt:  ALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

Q8L970 Probable prolyl 4-hydroxylase 7 | 3.4e-69 | 46.53
Query:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SGES  SEVRTSSG
Subjt:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG

Query:  AFIHKAKDPIVSGIEDKIAAWTFLPKGRYICLYLLTY-------------------------------FIKSAES--------QRRQASETNED-LSDCA
         F+ K +D IVS +E K+AAWTFLP+     + +L Y                               ++ + E          + +A++  +D  ++CA
Subjt:  AFIHKAKDPIVSGIEDKIAAWTFLPKGRYICLYLLTY-------------------------------FIKSAES--------QRRQASETNED-LSDCA

Query:  KKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSC
        K+G AVKPRKGDALLFF+LHPNA  D+ SLHG CPV+EGEKWSAT+WIHV SF+   +  + C+D N SCE+WA+ GEC  NP YMVGS +  GYCRKSC
Subjt:  KKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSC

Query:  KVC
        K C
Subjt:  KVC

Q8LAN3 Probable prolyl 4-hydroxylase 4 | 8.3e-92 | 61.09
Query:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRSAVAD+ SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPK         RY                            I +YL        T F  +    RR  SE  EDLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD IV+   +C D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

Q9LN20 Probable prolyl 4-hydroxylase 3 | 2.0e-37 | 41.06
Query:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKGRYICLYLL------------
        +SW PRAFVY  FL+  EC++LISLAK  + +S V D  +G+SK S VRTSSG F+ + +D I+  IE +IA +TF+P      L +L            
Subjt:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKGRYICLYLL------------

Query:  TYFIKSAESQR---------------RQASET--------------NEDLSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATK
         YF+    ++                 +  ET                +LS+C KKG++VKPR GDALLF+S+ P+A  D  SLHGGCPVI G KWS+TK
Subjt:  TYFIKSAESQR---------------RQASET--------------NEDLSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATK

Query:  WIHVDSF
        W+HV  +
Subjt:  WIHVDSF

Arabidopsis top hits | e value | % identity
AT3G06300.1 P4H isoform 2 | 6.1e-90 | 58.76
Query:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        +L ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVAD+ +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRKGD
        DK++ WTFLPK         RY                            + LYL        T F  + E  RR  SE  +DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+L  +A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD I++   +C D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK C
Subjt:  ALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | 2.4e-70 | 46.53
Query:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SGES  SEVRTSSG
Subjt:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG

Query:  AFIHKAKDPIVSGIEDKIAAWTFLPKGRYICLYLLTY-------------------------------FIKSAES--------QRRQASETNED-LSDCA
         F+ K +D IVS +E K+AAWTFLP+     + +L Y                               ++ + E          + +A++  +D  ++CA
Subjt:  AFIHKAKDPIVSGIEDKIAAWTFLPKGRYICLYLLTY-------------------------------FIKSAES--------QRRQASETNED-LSDCA

Query:  KKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSC
        K+G AVKPRKGDALLFF+LHPNA  D+ SLHG CPV+EGEKWSAT+WIHV SF+   +  + C+D N SCE+WA+ GEC  NP YMVGS +  GYCRKSC
Subjt:  KKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSC

Query:  KVC
        K C
Subjt:  KVC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | 9.8e-64 | 44.05
Query:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGES-----KVSEV
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SGES      VS V
Subjt:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGES-----KVSEV

Query:  RTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKGRYICLYLLTY-------------------------------FIKSAES--------QRRQASETN
        R SS    +      D IVS +E K+AAWTFLP+     + +L Y                               ++ + E          + +A++  
Subjt:  RTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKGRYICLYLLTY-------------------------------FIKSAES--------QRRQASETN

Query:  ED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPEL
        +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D+ SLHG CPV+EGEKWSAT+WIHV SF+   +  + C+D N SCE+WA+ GEC  NP YMVGS + 
Subjt:  ED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPEL

Query:  PGYCRKSCKVC
         GYCRKSCK C
Subjt:  PGYCRKSCKVC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | 1.2e-64 | 48.28
Query:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        + L+ S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ 
Subjt:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKGRYICLYLLTY-------------FIKSA--------------ESQRRQASET-------------NEDLSDCAKKGIAVKPRKGDA
        +E K+AAWTFLP+     L +L Y             + K A               S   +  ET             ++  S CAK+G AVKPRKGDA
Subjt:  IEDKIAAWTFLPKGRYICLYLLTY-------------FIKSA--------------ESQRRQASET-------------NEDLSDCAKKGIAVKPRKGDA

Query:  LLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        LLFF+LH N   D  SLHG CPVIEGEKWSAT+WIHV SF         CVD++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  LLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 5.9e-93 | 61.09
Query:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRSAVAD+ SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPK         RY                            I +YL        T F  +    RR  SE  EDLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKG--------RY----------------------------ICLYLL-------TYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD IV+   +C D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
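
The % identity values in the hit tables can be related to the alignment blocks by counting identical columns. In the unlabeled middle line of each block, a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below counts letters in that line and divides by the number of alignment columns, gaps included. The function name percent_identity and the short example blocks are hypothetical, and the choice of denominator is an assumption about how the site computes the figure. It does seem consistent with the first GenBank hit: a hand count of its displayed blocks gives 248 identical columns out of 302, which rounds to the listed 82.12.

```python
def percent_identity(blocks):
    """Percent identity from (query_line, middle_line) alignment blocks.

    The query line supplies the column count, because trailing spaces
    on the middle line are often lost when text is copied.
    """
    columns = sum(len(query) for query, _ in blocks)
    # Letters on the middle line mark identical residues.
    identical = sum(ch.isalpha() for _, middle in blocks for ch in middle)
    return round(100.0 * identical / columns, 2)


# Hypothetical toy blocks, not taken from a specific hit on this page.
toy_blocks = [
    ("MAKFCSCNLL", "MA+F  CNLL"),
    ("VC", " C"),
]
print(percent_identity(toy_blocks))

# Hand-counted totals for the first GenBank hit (an observation, not site documentation).
print(round(100.0 * 248 / 302, 2))
```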


Sequences
CDS sequence
ATGGCGAAATTTTGTAGTTGCAATCTGCTGTTTATCCTCTCGATATCGATCTCGTTGCTTCTCCGGCGAGCTTCAAGCTCTTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTCGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCGCTTGCTAAAGCGGAGTTGA
AGAGATCTGCTGTTGCGGATCATTTGTCTGGAGAGAGCAAGGTCAGCGAGGTCCGAACTAGCTCTGGGGCATTTATTCATAAAGCCAAGGATCCGATCGTTTCTGGTATT
GAAGACAAAATTGCAGCATGGACATTCCTGCCAAAAGGAAGATATATCTGTCTCTACTTATTGACTTACTTCATCAAATCCGCTGAATCTCAAAGACGGCAGGCTTCTGA
AACAAATGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTGAAACCACGGAAGGGTGATGCTCTTCTCTTCTTCAGTCTTCATCCAAATGCTGTTCCAGACACAA
AAAGTCTGCATGGAGGTTGCCCTGTGATTGAAGGAGAGAAATGGTCAGCAACAAAGTGGATCCATGTCGATTCTTTCGACATGATCGTGAGTGATCATACGAGTTGCGTT
GATAATAATGCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCGGAGTATATGGTGGGATCTCCTGAGCTTCCTGGCTATTGCAGGAAAAGTTGCAA
GGTTTGTTGA
mRNA sequence
AAAGAAACCTTTTATTATTTTATTTTCATCGCTCTCTCTCTTTCTCCAGTGATCCGATTCAGTCAATGGCGAAATTTTGTAGTTGCAATCTGCTGTTTATCCTCTCGATA
TCGATCTCGTTGCTTCTCCGGCGAGCTTCAAGCTCTTATGCAGGTTCCGCTAGCTCAATCGTCAATCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTCGT
GTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCGCTTGCTAAAGCGGAGTTGAAGAGATCTGCTGTTGCGGATCATTTGTCTGGAGAGAGCAAGGTCA
GCGAGGTCCGAACTAGCTCTGGGGCATTTATTCATAAAGCCAAGGATCCGATCGTTTCTGGTATTGAAGACAAAATTGCAGCATGGACATTCCTGCCAAAAGGAAGATAT
ATCTGTCTCTACTTATTGACTTACTTCATCAAATCCGCTGAATCTCAAAGACGGCAGGCTTCTGAAACAAATGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGT
GAAACCACGGAAGGGTGATGCTCTTCTCTTCTTCAGTCTTCATCCAAATGCTGTTCCAGACACAAAAAGTCTGCATGGAGGTTGCCCTGTGATTGAAGGAGAGAAATGGT
CAGCAACAAAGTGGATCCATGTCGATTCTTTCGACATGATCGTGAGTGATCATACGAGTTGCGTTGATAATAATGCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGC
ACGAATAACCCGGAGTATATGGTGGGATCTCCTGAGCTTCCTGGCTATTGCAGGAAAAGTTGCAAGGTTTGTTGA
Protein sequence
MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGI
EDKIAAWTFLPKGRYICLYLLTYFIKSAESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDMIVSDHTSCV
DNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
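
The CDS and protein sequences above can be checked against each other by translating the CDS. The Python sketch below builds the standard genetic code from its usual 64-letter string and translates up to the first stop codon. It assumes the standard nuclear code applies to this gene. The names translate, cds_start and cds_end are hypothetical, and the two fragments are the first and last 27 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    usable_length = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable_length, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGCGAAATTTTGTAGTTGCAATCTG"  # first 27 nt of the CDS above
cds_end = "TGCAGGAAAAGTTGCAAGGTTTGTTGA"    # last 27 nt, ending in the TGA stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MAKFCSCNL and CRKSCKVC, matching the first and last residues of the protein sequence. Passing the full CDS (with line breaks removed) to translate should reproduce the whole protein.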