CuGenDBv2

Lag0030038 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030038
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: chr8:43998116..44001397
RNA-Seq Expression: Lag0030038
Synteny: Lag0030038
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
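
The genome location listed in this record can be split into chromosome, start and end to get the span of the locus. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention). `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into its parts plus the span in bp.

    Assumes 1-based, inclusive coordinates (not confirmed by the record).
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Inclusive coordinates: both endpoints count towards the span.
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr8:43998116..44001397")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the locus spans 3282 bp, which is longer than the 1032-nt CDS and so would be consistent with the gene containing introns.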


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo] | e-value: 3.1e-152 | identity: 79.88%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSR FLA SLC L +  AFARLPETR  K  Y                              +GSV+R+K D +PLIFDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SD+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL  AQD+IVA +EARIAAWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDS+S C++KGYAVKA+KGDALLFFSLHLDA+TD +SLHGSCPVIEGEKWSATKWIHVRSFEK  R S 
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        + CVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
Subjt:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus] | e-value: 4.6e-148 | identity: 78.2%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSR FLA SLC L +  AFARLPETRTHK+                                SGSV+R+K D +PLIFDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SD ECDHLIDLAKDKLEKSMVADN+SGKSVSSEVRTSSGMFL  AQDE+VA +EARIAAWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RAS
        RIATVLMYLSNVEKGGETIFPNSEFKESQ KD+S+S C++KGYAVKA+KGDALLFFSL+LDA+TD +SLHGSCPVI GEKWSATKWIHVRSFEK T R S
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RAS

Query:  SERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
         + CVDENENC AWAK+GECKKNPTYMVGS GALGYCRKSCKAC
Subjt:  SERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

XP_022134044.1 probable prolyl 4-hydroxylase 7 [Momordica charantia] | e-value: 3.0e-155 | identity: 80.52%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDS RFL+ SLC LF+  A ARLP+ R HKK+                               SGSV+R+KG+P+PLIFDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SDKECDHLIDLAKDKLEKSMVADN SGKSVSSEVRTSSGMFLH AQDEIVAA+EARIAAWTFLPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        R+ATVLMYLSNVEKGGETIFPNSEFKESQEKDDS+S CA+KGYAVKAKKGDALLFFSLHLDA+TD KSLHGSCPVIEGEKWSATKWIHVRSFEKPTR S 
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  E-RCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
           CVDENENC +WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  E-RCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida] | e-value: 2.9e-150 | identity: 79.3%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSRRFLA  LC L +   FARLPE R+ KK                                SGSVIR+K D +PL+FDPTRVTQLSW PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL  AQDEIVAAIEARI+AWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        RIATVLMYLSNVEKGGETIFPNSEFKESQEKD+S+S CA+KGYAVKA+KGDALLFFSL  DA+TD KSLHGSCPVIEGEKWSATKWIHVRSFEK TR S 
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        + CVDENENC  WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

XP_038889687.1 probable prolyl 4-hydroxylase 7 isoform X2 [Benincasa hispida] | e-value: 3.8e-150 | identity: 79.3%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSRRFLA  LC L +   FARLPE R+ KK                                SGSVIR+K D +PL+FDPTRVTQLSW PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL  AQDEIVAAIEARI+AWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        RIATVLMYLSNVEKGGETIFPNSEFKESQEKD+S+S CA+KGYAVKA+KGDALLFFSL  DA+TD KSLHGSCPVIEGEKWSATKWIHVRSFEK TR S 
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        + CVDENENC  WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KS38 Procollagen-proline 4-dioxygenase | e-value: 2.3e-148 | identity: 78.2%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSR FLA SLC L +  AFARLPETRTHK+                                SGSV+R+K D +PLIFDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SD ECDHLIDLAKDKLEKSMVADN+SGKSVSSEVRTSSGMFL  AQDE+VA +EARIAAWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RAS
        RIATVLMYLSNVEKGGETIFPNSEFKESQ KD+S+S C++KGYAVKA+KGDALLFFSL+LDA+TD +SLHGSCPVI GEKWSATKWIHVRSFEK T R S
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RAS

Query:  SERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
         + CVDENENC AWAK+GECKKNPTYMVGS GALGYCRKSCKAC
Subjt:  SERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase | e-value: 1.5e-152 | identity: 79.88%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSR FLA SLC L +  AFARLPETR  K  Y                              +GSV+R+K D +PLIFDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SD+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL  AQD+IVA +EARIAAWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDS+S C++KGYAVKA+KGDALLFFSLHLDA+TD +SLHGSCPVIEGEKWSATKWIHVRSFEK  R S 
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        + CVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
Subjt:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase | e-value: 1.5e-155 | identity: 80.52%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDS RFL+ SLC LF+  A ARLP+ R HKK+                               SGSV+R+KG+P+PLIFDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SDKECDHLIDLAKDKLEKSMVADN SGKSVSSEVRTSSGMFLH AQDEIVAA+EARIAAWTFLPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        R+ATVLMYLSNVEKGGETIFPNSEFKESQEKDDS+S CA+KGYAVKAKKGDALLFFSLHLDA+TD KSLHGSCPVIEGEKWSATKWIHVRSFEKPTR S 
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  E-RCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
           CVDENENC +WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  E-RCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase | e-value: 1.5e-147 | identity: 78.72%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSRRFLA SL  L +S  FARLPE  THKKL                               SGSV+ +K D   LIFDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        +D+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL  AQDEIVA IEARI+AWTFLP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        RIATVLMYLSNVEKGGETIFPNS F ESQEKDDS+S CA+KGYAVKA+KGDALLFFSLHLDA+TD +SLHGSCPVIEGEKWSATKWIHVRSF+K TR SS
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        + CVDEN+NCP+WAKRGEC+KNPTYMVGSEGA+GYCRKSCKAC
Subjt:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

A0A6J1JWX0 Procollagen-proline 4-dioxygenase | e-value: 3.3e-147 | identity: 78.43%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSRRFL  SL  L +S  FARLPE  THKKL                               SGSV+ +K D   LIFDPTRVTQLSW+PRAFLYKGFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        +D+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFL  AQDEIVA IEARI+AWTFLP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        RIATVLMYLSNVEKGGETIFPNS F ESQEKDDS+S CA+KGYAVKA+KGDALLFFSLHLDA+TD +SLHGSCPVIEGEKWSATKWIHVRSF+K TR SS
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        + CVDEN+NCP+WAKRGEC+KNPTYMVGSEGA+GYCRKSCKAC
Subjt:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

SwissProt top hits (e-value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e-value: 2.8e-111 | identity: 73.58%
Query:  DPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYEN
        DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL   QD+IVA +EA++AAWTFLP ENGE++QILHYEN
Subjt:  DPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYEN

Query:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEG
        GQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q KDDS+S CA++GYAVK +KGDALLFF+LHL+ +TD  SLHGSCPVIEG
Subjt:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEG

Query:  EKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EKWSAT+WIHVRSF K        CVD++E+C  WA  GEC+KNP YMVGSE +LG+CRKSCKAC
Subjt:  EKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 2 | e-value: 9.3e-91 | identity: 60.97%
Query:  IFDPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYE
        I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+   +D IV+ IE +++ WTFLP ENGE +Q+L YE
Subjt:  IFDPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYE

Query:  NGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCP
        +GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  S CA+KG AVK KKG+ALLFF+L  DA  D  SLHG CP
Subjt:  NGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCP

Query:  VIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        VIEGEKWSATKWIHV SF+K        C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  VIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 7 | e-value: 1.6e-122 | identity: 65.6%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSR FLA SLC LF                     PL+ +   RF   LTR       SN R GSVI+MK   +   FDPTRVTQLSW PR FLY+GFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SD+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL   QD+IV+ +EA++AAWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        RIATVLMYLSNVEKGGET+FP  + K +Q KDDS++ CA++GYAVK +KGDALLFF+LH +A+TD+ SLHGSCPV+EGEKWSAT+WIHV+SFE+     S
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
          C+DEN +C  WAK GEC+KNPTYMVGS+   GYCRKSCKAC
Subjt:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e-value: 4.9e-92 | identity: 60.67%
Query:  DPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENG
        +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+   +D IV+ IE +I+ WTFLP ENGE IQ+L YE+G
Subjt:  DPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENG

Query:  QKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVI
        QKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  S CA++G AVK +KGDALLFF+LH DA  D  SLHG CPVI
Subjt:  QKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVI

Query:  EGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EGEKWSATKWIHV SF++    S   C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  EGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 3 | e-value: 4.8e-63 | identity: 55.77%
Query:  LSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHF
        LSW PRAF+Y  FLS +EC++LI LAK  + KS V D+E+GKS  S VRTSSG FL   +D+I+  IE RIA +TF+PA++GE +Q+LHYE GQKYEPH+
Subjt:  LSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHF

Query:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATK
        D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP +     S    +  S C +KG +VK + GDALLF+S+  DA+ D  SLHG CPVI G KWS+TK
Subjt:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATK

Query:  WIHVRSFE
        W+HV  ++
Subjt:  WIHVRSFE

Arabidopsis top hits (e-value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e-value: 6.6e-92 | identity: 60.97%
Query:  IFDPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYE
        I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+   +D IV+ IE +++ WTFLP ENGE +Q+L YE
Subjt:  IFDPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYE

Query:  NGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCP
        +GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  S CA+KG AVK KKG+ALLFF+L  DA  D  SLHG CP
Subjt:  NGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCP

Query:  VIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        VIEGEKWSATKWIHV SF+K        C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  VIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e-value: 1.1e-123 | identity: 65.6%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSR FLA SLC LF                     PL+ +   RF   LTR       SN R GSVI+MK   +   FDPTRVTQLSW PR FLY+GFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH
        SD+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL   QD+IV+ +EA++AAWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGH
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH

Query:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS
        RIATVLMYLSNVEKGGET+FP  + K +Q KDDS++ CA++GYAVK +KGDALLFF+LH +A+TD+ SLHGSCPV+EGEKWSAT+WIHV+SFE+     S
Subjt:  RIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASS

Query:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
          C+DEN +C  WAK GEC+KNPTYMVGS+   GYCRKSCKAC
Subjt:  ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e-value: 3.5e-117 | identity: 62.68%
Query:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL
        MDSR FLA SLC LF                     PL+ +   RF   LTR       SN R GSVI+MK   +   FDPTRVTQLSW PR FLY+GFL
Subjt:  MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFL

Query:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSE-----VRTSSGMFLHMAQ---DEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK
        SD+ECDH I LAK KLEKSMVADN+SG+SV SE     VR SS    +M     D+IV+ +EA++AAWTFLP ENGES+QILHYENGQKYEPHFD+FHD+
Subjt:  SDKECDHLIDLAKDKLEKSMVADNESGKSVSSE-----VRTSSGMFLHMAQ---DEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDK

Query:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF
         N ELGGHRIATVLMYLSNVEKGGET+FP  + K +Q KDDS++ CA++GYAVK +KGDALLFF+LH +A+TD+ SLHGSCPV+EGEKWSAT+WIHV+SF
Subjt:  VNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        E+     S  C+DEN +C  WAK GEC+KNPTYMVGS+   GYCRKSCKAC
Subjt:  EKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e-value: 2.0e-112 | identity: 73.58%
Query:  DPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYEN
        DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL   QD+IVA +EA++AAWTFLP ENGE++QILHYEN
Subjt:  DPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYEN

Query:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEG
        GQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q KDDS+S CA++GYAVK +KGDALLFF+LHL+ +TD  SLHGSCPVIEG
Subjt:  GQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEG

Query:  EKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EKWSAT+WIHVRSF K        CVD++E+C  WA  GEC+KNP YMVGSE +LG+CRKSCKAC
Subjt:  EKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e-value: 3.5e-93 | identity: 60.67%
Query:  DPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENG
        +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+   +D IV+ IE +I+ WTFLP ENGE IQ+L YE+G
Subjt:  DPTRVTQLSWRPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENG

Query:  QKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVI
        QKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  S CA++G AVK +KGDALLFF+LH DA  D  SLHG CPVI
Subjt:  QKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVI

Query:  EGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        EGEKWSATKWIHV SF++    S   C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  EGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
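
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Because the Subjt lines in this export repeat the query, the Query line and the middle line are the informative pair. The following is a minimal Python sketch that counts identities and positives for one alignment block, using the last block of the first GenBank hit (XP_008458700.1) as input. `midline_stats` is a hypothetical helper written for illustration.

```python
def midline_stats(query, midline):
    """Count identities and positives in one BLAST-style alignment block.

    Returns (identical, positive, block_length). Positives include
    identities, following the usual BLAST definition.
    """
    # Trailing spaces of the midline may have been trimmed; pad to query length.
    midline = midline.ljust(len(query))
    identical = sum(1 for symbol in midline if symbol.isalpha())
    positive = identical + midline.count("+")
    return identical, positive, len(query)


# Last block of the alignment against XP_008458700.1, copied from above.
QUERY = "ERCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC"
MIDLINE = "+ CVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC"

identical, positive, length = midline_stats(QUERY, MIDLINE)
print(f"{identical}/{length} identical, {positive}/{length} positive")
```

The reported % identity for a hit is computed over the whole alignment, so a single block such as this one will generally give a different ratio.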


Sequences
CDS sequence
ATGGATTCCCGACGGTTCCTCGCATTGTCTCTTTGCTCTCTGTTCCTGTCTGCTGCCTTCGCTCGCTTGCCGGAAACGCGTACGCACAAGAAATTGTACGATCCTTTCCC
TCTTCTTCTTGCCCTTCTTTTTCGTTTCGCCAATCCACTGACTCGTGATTTTGGATTCAATTGTTTTTCGAATTTCAGGAGTGGATCTGTGATACGAATGAAGGGGGATC
CAGCTCCGTTGATTTTCGATCCTACAAGAGTCACTCAGCTCTCTTGGCGACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAATGTGATCATCTAATCGAT
CTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCCGGTAAGAGTGTAAGTAGTGAAGTCCGGACGAGTTCTGGCATGTTCCTTCATATGGCCCAGGA
TGAAATTGTTGCTGCCATTGAGGCCAGGATTGCTGCATGGACATTCCTTCCAGCAGAAAACGGAGAGTCCATTCAAATACTGCACTATGAGAATGGTCAAAAGTATGAAC
CGCATTTTGATTTTTTTCATGACAAGGTGAATCAGGAGTTAGGTGGCCATCGAATAGCTACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTT
CCAAATTCAGAGTTCAAAGAATCTCAAGAAAAGGATGACAGTTTTTCTGCTTGTGCTCAAAAGGGTTATGCAGTTAAAGCGAAGAAGGGTGATGCATTGCTGTTCTTCAG
CCTCCATCTCGATGCATCGACAGATACCAAAAGCTTGCACGGTAGTTGCCCTGTGATCGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCATGTGAGATCCTTCGAGA
AGCCGACTCGTGCAAGTAGTGAGCGTTGCGTGGACGAAAATGAAAATTGCCCTGCGTGGGCCAAAAGGGGTGAGTGCAAGAAGAACCCTACTTACATGGTGGGTTCTGAA
GGTGCTTTAGGATACTGTAGGAAGAGTTGTAAAGCGTGTTAA
mRNA sequence
ATGGATTCCCGACGGTTCCTCGCATTGTCTCTTTGCTCTCTGTTCCTGTCTGCTGCCTTCGCTCGCTTGCCGGAAACGCGTACGCACAAGAAATTGTACGATCCTTTCCC
TCTTCTTCTTGCCCTTCTTTTTCGTTTCGCCAATCCACTGACTCGTGATTTTGGATTCAATTGTTTTTCGAATTTCAGGAGTGGATCTGTGATACGAATGAAGGGGGATC
CAGCTCCGTTGATTTTCGATCCTACAAGAGTCACTCAGCTCTCTTGGCGACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAATGTGATCATCTAATCGAT
CTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCCGGTAAGAGTGTAAGTAGTGAAGTCCGGACGAGTTCTGGCATGTTCCTTCATATGGCCCAGGA
TGAAATTGTTGCTGCCATTGAGGCCAGGATTGCTGCATGGACATTCCTTCCAGCAGAAAACGGAGAGTCCATTCAAATACTGCACTATGAGAATGGTCAAAAGTATGAAC
CGCATTTTGATTTTTTTCATGACAAGGTGAATCAGGAGTTAGGTGGCCATCGAATAGCTACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTT
CCAAATTCAGAGTTCAAAGAATCTCAAGAAAAGGATGACAGTTTTTCTGCTTGTGCTCAAAAGGGTTATGCAGTTAAAGCGAAGAAGGGTGATGCATTGCTGTTCTTCAG
CCTCCATCTCGATGCATCGACAGATACCAAAAGCTTGCACGGTAGTTGCCCTGTGATCGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCATGTGAGATCCTTCGAGA
AGCCGACTCGTGCAAGTAGTGAGCGTTGCGTGGACGAAAATGAAAATTGCCCTGCGTGGGCCAAAAGGGGTGAGTGCAAGAAGAACCCTACTTACATGGTGGGTTCTGAA
GGTGCTTTAGGATACTGTAGGAAGAGTTGTAAAGCGTGTTAA
Protein sequence
MDSRRFLALSLCSLFLSAAFARLPETRTHKKLYDPFPLLLALLFRFANPLTRDFGFNCFSNFRSGSVIRMKGDPAPLIFDPTRVTQLSWRPRAFLYKGFLSDKECDHLID
LAKDKLEKSMVADNESGKSVSSEVRTSSGMFLHMAQDEIVAAIEARIAAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF
PNSEFKESQEKDDSFSACAQKGYAVKAKKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASSERCVDENENCPAWAKRGECKKNPTYMVGSE
GALGYCRKSCKAC
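
The protein sequence above should follow from the CDS under the standard genetic code. The following is a minimal Python sketch that checks this for the first seven codons and the last four codons of the CDS, both copied from the record. `translate` is an illustrative helper, and the use of the standard code (NCBI translation table 1) is an assumption about how the record was produced.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGATTCCCGACGGTTCCTC"  # first 21 nt of the CDS
cds_end = "AAAGCGTGTTAA"             # last 12 nt of the CDS, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first seven residues (MDSRRFL) and the last three residues (KAC) of the protein sequence, and the lengths agree as well: 1032 nt of CDS correspond to 343 codons plus one stop codon.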