; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039538 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039538
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationscaffold10:43809854..43813416
RNA-Seq ExpressionSpg039538
SyntenySpg039538
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]7.1e-15278.63Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSR FLAFSLC L +   FARLPETR  K  Y                              +GSV+R+K D +PLIFDPTRVTQLSWQPRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SD+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL KAQ        D+IVA +EARIAAWT LPAENGESIQILHYENGQKYEPHFDFFHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDS+S C++KGYAVKA+KGDALLFFSLHLDA+TD +SLHGSCPVIEGEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EK  R S + CVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

XP_022134044.1 probable prolyl 4-hydroxylase 7 [Momordica charantia]6.9e-15579.26Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDS RFL+FSLC LF+    ARLP+ R HKK+                               SGSV+R+KG+P+PLIFDPTRVTQLSWQPRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SDKECDHLIDLAKDKLEKSMVADN SGKSVSSEVRTSSGMFLHKAQ        DEIVAA+EARIAAWTFLPAENGESIQILHYENGQKYEPHFD+FHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQEKDDS+S CA+KGYAVKAKKGDALLFFSLHLDA+TD KSLHGSCPVIEGEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSE-RCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EKPTR S    CVDENENC +WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  EKPTRASSE-RCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

XP_022938573.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata]4.8e-14878.06Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSRRFLAFSL  L +S GFARLPE  THKKL                               SGSV+ +K D   LIFDPTRVTQLSWQPRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        +D+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL KAQ        DEIVA IEARI+AWTFLP ENGESIQILHYENGQKYEPHFDFFHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHRIATVLMYLSNVEKGGETIFPNS F ESQEKDDS+S CA+KGYAVKA+KGDALLFFSLHLDA+TD +SLHGSCPVIEGEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        +K TR SS+ CVDEN+NCP+WAKRGEC+KNPTYMVGSEGA+GYCRKSCKAC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]1.0e-15078.35Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSRRFLAF LC L +  GFARLPE R+ KK                                SGSVIR+K D +PL+FDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL KAQ        DEIVAAIEARI+AWT LPAENGESIQILHYENGQKYEPHFDFFHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKD+S+S CA+KGYAVKA+KGDALLFFSL  DA+TD KSLHGSCPVIEGEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EK TR S + CVDENENC  WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

XP_038889687.1 probable prolyl 4-hydroxylase 7 isoform X2 [Benincasa hispida]1.0e-15078.35Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSRRFLAF LC L +  GFARLPE R+ KK                                SGSVIR+K D +PL+FDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL KAQ        DEIVAAIEARI+AWT LPAENGESIQILHYENGQKYEPHFDFFHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKD+S+S CA+KGYAVKA+KGDALLFFSL  DA+TD KSLHGSCPVIEGEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EK TR S + CVDENENC  WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

TrEMBL top hitse value%identityAlignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase6.7e-14876.99Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSR FLAFSLC L +   FARLPETRTHK+                                SGSV+R+K D +PLIFDPTRVTQLSWQPRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SD ECDHLIDLAKDKLEKSMVADN+SGKSVSSEVRTSSGMFL KAQ        DE+VA +EARIAAWT LPAENGESIQILHYENGQKYEPHFDFFHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ KD+S+S C++KGYAVKA+KGDALLFFSL+LDA+TD +SLHGSCPVI GEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPT-RASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EK T R S + CVDENENC AWAK+GECKKNPTYMVGS GALGYCRKSCKAC
Subjt:  EKPT-RASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase3.4e-15278.63Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSR FLAFSLC L +   FARLPETR  K  Y                              +GSV+R+K D +PLIFDPTRVTQLSWQPRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SD+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL KAQ        D+IVA +EARIAAWT LPAENGESIQILHYENGQKYEPHFDFFHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDS+S C++KGYAVKA+KGDALLFFSLHLDA+TD +SLHGSCPVIEGEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EK  R S + CVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase3.3e-15579.26Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDS RFL+FSLC LF+    ARLP+ R HKK+                               SGSV+R+KG+P+PLIFDPTRVTQLSWQPRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SDKECDHLIDLAKDKLEKSMVADN SGKSVSSEVRTSSGMFLHKAQ        DEIVAA+EARIAAWTFLPAENGESIQILHYENGQKYEPHFD+FHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQEKDDS+S CA+KGYAVKAKKGDALLFFSLHLDA+TD KSLHGSCPVIEGEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSE-RCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EKPTR S    CVDENENC +WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  EKPTRASSE-RCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase2.3e-14878.06Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSRRFLAFSL  L +S GFARLPE  THKKL                               SGSV+ +K D   LIFDPTRVTQLSWQPRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        +D+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL KAQ        DEIVA IEARI+AWTFLP ENGESIQILHYENGQKYEPHFDFFHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHRIATVLMYLSNVEKGGETIFPNS F ESQEKDDS+S CA+KGYAVKA+KGDALLFFSLHLDA+TD +SLHGSCPVIEGEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        +K TR SS+ CVDEN+NCP+WAKRGEC+KNPTYMVGSEGA+GYCRKSCKAC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

A0A6J1JWX0 Procollagen-proline 4-dioxygenase5.1e-14877.78Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSRRFL FSL  L +S GFARLPE  THKKL                               SGSV+ +K D   LIFDPTRVTQLSWQPRAFLYKGFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        +D+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL KAQ        DEIVA IEARI+AWTFLP ENGESIQILHYENGQKYEPHFDFFHDK
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
        VNQELGGHRIATVLMYLSNVEKGGETIFPNS F ESQEKDDS+S CA+KGYAVKA+KGDALLFFSLHLDA+TD +SLHGSCPVIEGEKWSATKWIHVRSF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        +K TR SS+ CVDEN+NCP+WAKRGEC+KNPTYMVGSEGA+GYCRKSCKAC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 65.3e-11071.79Show/hide
Query:  DPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGES
        DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL K Q        D+IVA +EA++AAWTFLP ENGE+
Subjt:  DPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGES

Query:  IQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLH
        +QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q KDDS+S CA++GYAVK +KGDALLFF+LHL+ +TD  SLH
Subjt:  IQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLH

Query:  GSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        GSCPVIEGEKWSAT+WIHVRSF K        CVD++E+C  WA  GEC+KNP YMVGSE +LG+CRKSCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 22.3e-8959.57Show/hide
Query:  IFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGE
        I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +        D IV+ IE +++ WTFLP ENGE
Subjt:  IFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGE

Query:  SIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDT
         +Q+L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  S CA+KG AVK KKG+ALLFF+L  DA  D 
Subjt:  SIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDT

Query:  KSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
         SLHG CPVIEGEKWSATKWIHV SF+K        C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  KSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 71.8e-12164.67Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSR FLAFSLC LF       LP             L+ + P RF   LT         N R GSVI+MK   +   FDPTRVTQLSW PR FLY+GFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SD+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL K Q        D+IV+ +EA++AAWTFLP ENGES+QILHYENGQKYEPHFD+FHD+
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
         N ELGGHRIATVLMYLSNVEKGGET+FP  + K +Q KDDS++ CA++GYAVK +KGDALLFF+LH +A+TD+ SLHGSCPV+EGEKWSAT+WIHV+SF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        E+     S  C+DEN +C  WAK GEC+KNPTYMVGS+   GYCRKSCKAC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

Q8LAN3 Probable prolyl 4-hydroxylase 49.5e-9159.27Show/hide
Query:  DPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESI
        +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +        D IV+ IE +I+ WTFLP ENGE I
Subjt:  DPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESI

Query:  QILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKS
        Q+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  S CA++G AVK +KGDALLFF+LH DA  D  S
Subjt:  QILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKS

Query:  LHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        LHG CPVIEGEKWSATKWIHV SF++    S   C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  LHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 31.2e-6153.7Show/hide
Query:  LSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYEN
        LSW+PRAF+Y  FLS +EC++LI LAK  + KS V D+E+GKS  S VRTSSG FL + +        D+I+  IE RIA +TF+PA++GE +Q+LHYE 
Subjt:  LSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYEN

Query:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIE
        GQKYEPH+D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP +     S    +  S C +KG +VK + GDALLF+S+  DA+ D  SLHG CPVI 
Subjt:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIE

Query:  GEKWSATKWIHVRSFE
        G KWS+TKW+HV  ++
Subjt:  GEKWSATKWIHVRSFE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.7e-9059.57Show/hide
Query:  IFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGE
        I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +        D IV+ IE +++ WTFLP ENGE
Subjt:  IFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGE

Query:  SIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDT
         +Q+L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  S CA+KG AVK KKG+ALLFF+L  DA  D 
Subjt:  SIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDT

Query:  KSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
         SLHG CPVIEGEKWSATKWIHV SF+K        C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  KSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase1.3e-12264.67Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSR FLAFSLC LF       LP             L+ + P RF   LT         N R GSVI+MK   +   FDPTRVTQLSW PR FLY+GFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SD+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL K Q        D+IV+ +EA++AAWTFLP ENGES+QILHYENGQKYEPHFD+FHD+
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
         N ELGGHRIATVLMYLSNVEKGGET+FP  + K +Q KDDS++ CA++GYAVK +KGDALLFF+LH +A+TD+ SLHGSCPV+EGEKWSAT+WIHV+SF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        E+     S  C+DEN +C  WAK GEC+KNPTYMVGS+   GYCRKSCKAC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.3e-11661.69Show/hide
Query:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL
        MDSR FLAFSLC LF       LP             L+ + P RF   LT         N R GSVI+MK   +   FDPTRVTQLSW PR FLY+GFL
Subjt:  MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSE----VRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDF
        SD+ECDH I LAK KLEKSMVADN+SG+SV SE    V   S  F+     ++    +D+IV+ +EA++AAWTFLP ENGES+QILHYENGQKYEPHFD+
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSE----VRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDF

Query:  FHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIH
        FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  + K +Q KDDS++ CA++GYAVK +KGDALLFF+LH +A+TD+ SLHGSCPV+EGEKWSAT+WIH
Subjt:  FHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIH

Query:  VRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        V+SFE+     S  C+DEN +C  WAK GEC+KNPTYMVGS+   GYCRKSCKAC
Subjt:  VRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase3.8e-11171.79Show/hide
Query:  DPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGES
        DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL K Q        D+IVA +EA++AAWTFLP ENGE+
Subjt:  DPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGES

Query:  IQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLH
        +QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q KDDS+S CA++GYAVK +KGDALLFF+LHL+ +TD  SLH
Subjt:  IQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLH

Query:  GSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        GSCPVIEGEKWSAT+WIHVRSF K        CVD++E+C  WA  GEC+KNP YMVGSE +LG+CRKSCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.7e-9259.27Show/hide
Query:  DPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESI
        +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +        D IV+ IE +I+ WTFLP ENGE I
Subjt:  DPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESI

Query:  QILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKS
        Q+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  S CA++G AVK +KGDALLFF+LH DA  D  S
Subjt:  QILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKS

Query:  LHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        LHG CPVIEGEKWSATKWIHV SF++    S   C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  LHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGACGGTTTCTCGCATTTTCTCTTTGCTCTCTGTTCCTGTCTGCGGGCTTCGCTCGCTTGCCGGAAACGCGTACGCACAAGAAATTGTACGATCCTTTCCG
TCTTCTTCTTGCCCTTCCTTTTCGTTTCACCAATCCACTGACTCATGATTTTGGATTCAATTGTTTTCCGAATTTCAGGAGTGGATCTGTAATTCGAATGAAGGGGGATC
CAGCTCCGTTGATTTTCGATCCAACACGAGTCACTCAGCTCTCTTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAATGTGATCATCTAATCGAT
CTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCCGGTAAGAGTGTAAGTAGTGAAGTCCGGACGAGTTCTGGCATGTTCCTTCATAAGGCCCAGTG
TGTAAAATATTGTGGTGTCATGGATGAAATTGTTGCTGCCATTGAGGCCAGGATTGCTGCATGGACATTCCTTCCAGCAGAAAACGGAGAGTCCATTCAAATACTGCACT
ATGAGAATGGTCAAAAGTATGAACCGCATTTTGATTTTTTTCATGACAAGGTGAATCAGGAGTTAGGTGGCCATCGAATAGCTACAGTCTTGATGTATTTATCCAATGTT
GAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGTTCAAAGAATCTCAAGAAAAGGATGACAGTTTTTCTGTTTGTGCTCAAAAGGGTTATGCAGTTAAAGCGAAGAA
GGGTGATGCATTGCTGTTCTTCAGCCTCCATCTCGATGCATCGACAGATACCAAAAGCTTGCACGGTAGTTGCCCTGTGATCGAGGGCGAGAAATGGTCTGCAACCAAGT
GGATTCATGTGAGATCCTTCGAGAAGCCGACTCGTGCAAGTAGTGAGCGTTGCGTGGACGAAAATGAAAATTGCCCTGCGTGGGCCAAAAGGGGTGAGTGCAAGAAGAAC
CCTACTTACATGGTGGGTTCTGAAGGTGCTTTAGGATACTGTAGGAAGAGTTGTAAAGCGTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCCGACGGTTTCTCGCATTTTCTCTTTGCTCTCTGTTCCTGTCTGCGGGCTTCGCTCGCTTGCCGGAAACGCGTACGCACAAGAAATTGTACGATCCTTTCCG
TCTTCTTCTTGCCCTTCCTTTTCGTTTCACCAATCCACTGACTCATGATTTTGGATTCAATTGTTTTCCGAATTTCAGGAGTGGATCTGTAATTCGAATGAAGGGGGATC
CAGCTCCGTTGATTTTCGATCCAACACGAGTCACTCAGCTCTCTTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAATGTGATCATCTAATCGAT
CTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCCGGTAAGAGTGTAAGTAGTGAAGTCCGGACGAGTTCTGGCATGTTCCTTCATAAGGCCCAGTG
TGTAAAATATTGTGGTGTCATGGATGAAATTGTTGCTGCCATTGAGGCCAGGATTGCTGCATGGACATTCCTTCCAGCAGAAAACGGAGAGTCCATTCAAATACTGCACT
ATGAGAATGGTCAAAAGTATGAACCGCATTTTGATTTTTTTCATGACAAGGTGAATCAGGAGTTAGGTGGCCATCGAATAGCTACAGTCTTGATGTATTTATCCAATGTT
GAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGTTCAAAGAATCTCAAGAAAAGGATGACAGTTTTTCTGTTTGTGCTCAAAAGGGTTATGCAGTTAAAGCGAAGAA
GGGTGATGCATTGCTGTTCTTCAGCCTCCATCTCGATGCATCGACAGATACCAAAAGCTTGCACGGTAGTTGCCCTGTGATCGAGGGCGAGAAATGGTCTGCAACCAAGT
GGATTCATGTGAGATCCTTCGAGAAGCCGACTCGTGCAAGTAGTGAGCGTTGCGTGGACGAAAATGAAAATTGCCCTGCGTGGGCCAAAAGGGGTGAGTGCAAGAAGAAC
CCTACTTACATGGTGGGTTCTGAAGGTGCTTTAGGATACTGTAGGAAGAGTTGTAAAGCGTGTTAA
Protein sequenceShow/hide protein sequence
MDSRRFLAFSLCSLFLSAGFARLPETRTHKKLYDPFRLLLALPFRFTNPLTHDFGFNCFPNFRSGSVIRMKGDPAPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLID
LAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHKAQCVKYCGVMDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
EKGGETIFPNSEFKESQEKDDSFSVCAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKN
PTYMVGSEGALGYCRKSCKAC