CuGenDBv2

Carg02267 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg02267
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Procollagen-proline 4-dioxygenase
Genome location: Carg_Chr15:1548260..1551759
RNA-Seq Expression: Carg02267
Synteny: Carg02267
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
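
The genome location listed above uses a `sequence:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the string can be split and the gene span computed as follows. The function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("Carg_Chr15:1548260..1551759")
# Assumes 1-based inclusive coordinates, so both end points are counted.
span = end - start + 1
print(seqid, start, end, span)
```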


Homology
GenBank top hits (e value, % identity, alignment)
KAG6578605.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia]; e value: 6.9e-161; % identity: 99.64
Query:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
        SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
Subjt:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL

Query:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
        PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
Subjt:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT

Query:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGS+GAVGYCRKSCKAC
Subjt:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

KAG7016155.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 2.2e-167; % identity: 100
Query:  MVLDSMFSNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAG
        MVLDSMFSNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAG
Subjt:  MVLDSMFSNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAG

Query:  IEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDAL
        IEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDAL
Subjt:  IEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDAL

Query:  LFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        LFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
Subjt:  LFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

XP_022938573.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata]; e value: 2.4e-161; % identity: 100
Query:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
        SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
Subjt:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL

Query:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
        PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
Subjt:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT

Query:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
Subjt:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

XP_022993651.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima]; e value: 9.0e-161; % identity: 99.64
Query:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
        SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
Subjt:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL

Query:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
        PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
Subjt:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT

Query:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATR SSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
Subjt:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

XP_023549944.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo]; e value: 1.5e-160; % identity: 99.28
Query:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
        SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
Subjt:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL

Query:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
        PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFES+EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
Subjt:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT

Query:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        DKRSLHGSCPVI+GEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
Subjt:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
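
The % identity values reported for these hits can be recomputed from the alignment midline (the unlabeled line between Query and Subjt), where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal sketch that uses only the final block of the KAG6578605.1 alignment; `percent_identity` is a hypothetical helper, not a BLAST or CuGenDB function.

```python
def percent_identity(midline_blocks):
    """Percent of alignment columns whose midline character is a letter."""
    columns = sum(len(block) for block in midline_blocks)
    identical = sum(ch.isalpha() for block in midline_blocks for ch in block)
    return 100.0 * identical / columns

# Midline of the final block of the KAG6578605.1 hit (leading indent removed).
last_block = "DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGS+GAVGYCRKSCKAC"
print(round(percent_identity([last_block]), 2))
```

Passing every midline block of a hit gives the identity over the full alignment. Midlines need to keep their internal and trailing spaces, since each space stands for one alignment column.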

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KS38 Procollagen-proline 4-dioxygenase; e value: 4.5e-142; % identity: 88.26
Query:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
        SGSVL LK DS  LIFDPTRVTQLSWQPRAFLYKGFL+D ECDHLIDLAKDKLEKSMVADN+SGKSVSSEVRTSSGMFLRKAQDE+VAG+EARI+AWT L
Subjt:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL

Query:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAF-ESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDAT
        P ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS F ESQ KD+SWSDC+RKGYAVKAQKGDALLFFSL+LDAT
Subjt:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAF-ESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDAT

Query:  TDKRSLHGSCPVIEGEKWSATKWIHVRSFDKAT-RISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        TD+RSLHGSCPVI GEKWSATKWIHVRSF+K T R+S Q CVDEN+NC +WAK+GEC+KNPTYMVGS GA+GYCRKSCKAC
Subjt:  TDKRSLHGSCPVIEGEKWSATKWIHVRSFDKAT-RISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase; e value: 1.5e-148; % identity: 91.07
Query:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
        +GSVL LK DS  LIFDPTRVTQLSWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD+IVAG+EARI+AWT L
Subjt:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL

Query:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAF-ESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDAT
        P ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS F ESQEKDDSWSDC+RKGYAVKAQKGDALLFFSLHLDAT
Subjt:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAF-ESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDAT

Query:  TDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        TD+RSLHGSCPVIEGEKWSATKWIHVRSF+K  R+S QDCVDEN+NCP+WAKRGEC+KNPTYMVGSEGA+GYCRKSCKAC
Subjt:  TDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase; e value: 4.5e-142; % identity: 88.26
Query:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
        SGSVL LK +   LIFDPTRVTQLSWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADN SGKSVSSEVRTSSGMFL KAQDEIVA +EARI+AWTFL
Subjt:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL

Query:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAF-ESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDAT
        P ENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNS F ESQEKDDSWSDCARKGYAVKA+KGDALLFFSLHLDAT
Subjt:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAF-ESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDAT

Query:  TDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQ-DCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        TD +SLHGSCPVIEGEKWSATKWIHVRSF+K TR S + DCVDEN+NC SWAKRGEC+KNPTYMVGSE A+GYCRKSC+AC
Subjt:  TDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQ-DCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase; e value: 1.1e-161; % identity: 100
Query:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
        SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
Subjt:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL

Query:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
        PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
Subjt:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT

Query:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
Subjt:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

A0A6J1JWX0 Procollagen-proline 4-dioxygenase; e value: 4.4e-161; % identity: 99.64
Query:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
        SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL
Subjt:  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFL

Query:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
        PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT
Subjt:  PVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATT

Query:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATR SSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
Subjt:  DKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6; e value: 2.8e-112; % identity: 72.83
Query:  DPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYEN
        DPTR+TQLSW PRAFLYKGFL+D+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL K QD+IVA +EA+++AWTFLP ENGE++QILHYEN
Subjt:  DPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYEN

Query:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEG
        GQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN      Q KDDSWS CA++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEG
Subjt:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEG

Query:  EKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        EKWSAT+WIHVRSF K   +    CVD++++C  WA  GEC+KNP YMVGSE ++G+CRKSCKAC
Subjt:  EKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 2; e value: 6.8e-95; % identity: 61.9
Query:  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQI
        SP  I +P++V Q+S +PRAF+Y+GFLTD ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+GIE ++S WTFLP ENGE +Q+
Subjt:  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQI

Query:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQ----EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLH
        L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++   S+    E  D  SDCA+KG AVK +KG+ALLFF+L  DA  D  SLH
Subjt:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQ----EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLH

Query:  GSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        G CPVIEGEKWSATKWIHV SFDK       +C D N++C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 7; e value: 1.9e-121; % identity: 72.18
Query:  SNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISA
        SN R GSV+++K  +    FDPTRVTQLSW PR FLY+GFL+D+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL K QD+IV+ +EA+++A
Subjt:  SNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISA

Query:  WTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLH
        WTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP      +Q KDDSW++CA++GYAVK +KGDALLFF+LH
Subjt:  WTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLH

Query:  LDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
         +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SF++A    S  C+DEN +C  WAK GECQKNPTYMVGS+   GYCRKSCKAC
Subjt:  LDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

Q8LAN3 Probable prolyl 4-hydroxylase 4; e value: 6.2e-96; % identity: 61.54
Query:  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQI
        S  +  +P++V Q+S +PRAF+Y+GFLT+ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+GIE +IS WTFLP ENGE IQ+
Subjt:  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQI

Query:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQ----EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLH
        L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++   S+    E  +  SDCA++G AVK +KGDALLFF+LH DA  D  SLH
Subjt:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQ----EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLH

Query:  GSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        G CPVIEGEKWSATKWIHV SFD+     S +C D N++C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 3; e value: 2.5e-65; % identity: 56.52
Query:  LSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHF
        LSW+PRAF+Y  FL+ +EC++LI LAK  + KS V D+E+GKS  S VRTSSG FLR+ +D+I+  IE RI+ +TF+P ++GE +Q+LHYE GQKYEPH+
Subjt:  LSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHF

Query:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFP--NSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATK
        D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP  N  F S    +  S+C +KG +VK + GDALLF+S+  DAT D  SLHG CPVI G KWS+TK
Subjt:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFP--NSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATK

Query:  WIHVRSF
        W+HV  +
Subjt:  WIHVRSF

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2; e value: 4.9e-96; % identity: 61.9
Query:  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQI
        SP  I +P++V Q+S +PRAF+Y+GFLTD ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+GIE ++S WTFLP ENGE +Q+
Subjt:  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQI

Query:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQ----EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLH
        L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++   S+    E  D  SDCA+KG AVK +KG+ALLFF+L  DA  D  SLH
Subjt:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQ----EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLH

Query:  GSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        G CPVIEGEKWSATKWIHV SFDK       +C D N++C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase; e value: 1.4e-122; % identity: 72.18
Query:  SNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISA
        SN R GSV+++K  +    FDPTRVTQLSW PR FLY+GFL+D+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL K QD+IV+ +EA+++A
Subjt:  SNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISA

Query:  WTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLH
        WTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP      +Q KDDSW++CA++GYAVK +KGDALLFF+LH
Subjt:  WTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLH

Query:  LDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
         +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SF++A    S  C+DEN +C  WAK GECQKNPTYMVGS+   GYCRKSCKAC
Subjt:  LDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase; e value: 1.4e-114; % identity: 67.81
Query:  SNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSE-----VRTSSGMFLRKAQ---DEIVA
        SN R GSV+++K  +    FDPTRVTQLSW PR FLY+GFL+D+ECDH I LAK KLEKSMVADN+SG+SV SE     VR SS           D+IV+
Subjt:  SNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSE-----VRTSSGMFLRKAQ---DEIVA

Query:  GIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGD
         +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP      +Q KDDSW++CA++GYAVK +KGD
Subjt:  GIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGD

Query:  ALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        ALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SF++A    S  C+DEN +C  WAK GECQKNPTYMVGS+   GYCRKSCKAC
Subjt:  ALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase; e value: 2.0e-113; % identity: 72.83
Query:  DPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYEN
        DPTR+TQLSW PRAFLYKGFL+D+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL K QD+IVA +EA+++AWTFLP ENGE++QILHYEN
Subjt:  DPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYEN

Query:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEG
        GQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN      Q KDDSWS CA++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEG
Subjt:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPN-SAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEG

Query:  EKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        EKWSAT+WIHVRSF K   +    CVD++++C  WA  GEC+KNP YMVGSE ++G+CRKSCKAC
Subjt:  EKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e value: 4.4e-97; % identity: 61.54
Query:  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQI
        S  +  +P++V Q+S +PRAF+Y+GFLT+ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+GIE +IS WTFLP ENGE IQ+
Subjt:  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQI

Query:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQ----EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLH
        L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++   S+    E  +  SDCA++G AVK +KGDALLFF+LH DA  D  SLH
Subjt:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQ----EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLH

Query:  GSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
        G CPVIEGEKWSATKWIHV SFD+     S +C D N++C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC


Sequences
CDS sequence
ATGGTTTTGGATTCAATGTTTTCGAATATCAGAAGTGGATCTGTGCTTGAATTGAAGAGGGATTCGCCACGGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTG
GCAACCCAGGGCATTTTTGTATAAGGGATTTTTAACTGATCAGGAATGTGATCATCTAATCGATCTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGT
CTGGTAAGAGTGTAAGTAGTGAAGTACGAACGAGTTCTGGCATGTTCCTCCGGAAGGCCCAGGATGAAATTGTTGCTGGCATTGAGGCCAGGATTTCTGCATGGACATTC
CTTCCAGTAGAAAATGGAGAGTCCATTCAAATTCTTCACTACGAAAATGGCCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGCTAGGTGG
CCACCGAATAGCCACAGTCTTGATGTATCTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGCGTTTGAATCTCAAGAAAAGGATGACAGCTGGTCTG
ATTGTGCTCGAAAGGGTTATGCAGTTAAAGCGCAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCCATCTCGATGCAACAACAGATAAAAGAAGCTTGCACGGTAGTTGC
CCTGTGATCGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCATGTGAGATCCTTCGATAAGGCAACTCGGATAAGTAGTCAGGACTGTGTGGATGAGAACAAAAATTG
CCCATCATGGGCAAAAAGGGGTGAGTGCCAAAAGAACCCTACTTATATGGTGGGTTCTGAAGGTGCAGTAGGATACTGTAGGAAGAGTTGCAAAGCGTGTTAA
mRNA sequence
ATGGTTTTGGATTCAATGTTTTCGAATATCAGAAGTGGATCTGTGCTTGAATTGAAGAGGGATTCGCCACGGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTG
GCAACCCAGGGCATTTTTGTATAAGGGATTTTTAACTGATCAGGAATGTGATCATCTAATCGATCTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGT
CTGGTAAGAGTGTAAGTAGTGAAGTACGAACGAGTTCTGGCATGTTCCTCCGGAAGGCCCAGGATGAAATTGTTGCTGGCATTGAGGCCAGGATTTCTGCATGGACATTC
CTTCCAGTAGAAAATGGAGAGTCCATTCAAATTCTTCACTACGAAAATGGCCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGCTAGGTGG
CCACCGAATAGCCACAGTCTTGATGTATCTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGCGTTTGAATCTCAAGAAAAGGATGACAGCTGGTCTG
ATTGTGCTCGAAAGGGTTATGCAGTTAAAGCGCAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCCATCTCGATGCAACAACAGATAAAAGAAGCTTGCACGGTAGTTGC
CCTGTGATCGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCATGTGAGATCCTTCGATAAGGCAACTCGGATAAGTAGTCAGGACTGTGTGGATGAGAACAAAAATTG
CCCATCATGGGCAAAAAGGGGTGAGTGCCAAAAGAACCCTACTTATATGGTGGGTTCTGAAGGTGCAGTAGGATACTGTAGGAAGAGTTGCAAAGCGTGTTAAACTTAAC
CTATAGGAATATGTCCACGTCTCTCTCTCTCTCTCTCTCTCACCCGTTTTGCAGAGCTGAGTGTTGATTCTCTGATGGTTATGTATATAACATCGGGCAGTAACTGGGTA
TACGATACAAGTGGATATTACATATCTTTGATTAAACCTTGTAGTAGCAATTAGCCAAGTGTTTCATTTGGTAATCCAGACTCTGATGAGAAAATTTTCTCTTGATGCTA
TTGGAACTTTACAAGTGATATATTTTTCGCGTCTTACATGAAATAACCTTTTAGCATTTTACAAACGATTCTTTTGTTTGACATTTTAAAAAATTTATAATTTTATTAGA
CATAAAATTGAAACAGGACATTTCTTCTTTTATTTATATTGGCGTCAGGATAGCTGGCCTTGGCCTTGACGAGGGCTTAGAACCTTTGTCAAATTGGGTTAGAGATTTCA
AGTTTGAATCTACAAATGAGGACTTGAGAATGTGAAATCAATTTCCTATTATAAATCTATGGTTAAGCTTTTAGGTAACAATTTATGGTTAAGCTTTTAGGCAAGAAATT
GTTGCGATTATAAATT
Protein sequence
MVLDSMFSNIRSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTF
LPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSC
PVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
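
The protein sequence above can be cross-checked against the CDS by translating it. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codons enumerated in TCAG order map onto AMINO_ACIDS.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGTTTTGGATTCAATGTTTTCGAATATCAGA"))  # first 11 codons of the CDS
print(translate("AAAGCGTGTTAA"))  # last 4 codons of the CDS, including the stop
```

Joining the wrapped CDS lines into one string and passing it to `translate` yields the full-length product for comparison with the listed protein sequence.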