; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0021262 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0021262
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr09:2569981..2573818
RNA-Seq ExpressionIVF0021262
SyntenyIVF0021262
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039540.1 putative prolyl 4-hydroxylase 7 [Cucumis melo var. makuwa]1.10e-21393.4Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE   
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQ--EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTY
                 +   +    AVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTY
Subjt:  SQ--EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTY

Query:  MVGSEGALGYCRKSCKAC
        MVGSEGALGYCRKSCKAC
Subjt:  MVGSEGALGYCRKSCKAC

TYK15293.1 putative prolyl 4-hydroxylase 7 [Cucumis melo var. makuwa]9.68e-22484.27Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYA-----------------------------------------------------------VKAQKGDALLFFSLHLDATTDERS
        SQEKDDSWSDCSRKGYA                                                           VKAQKGDALLFFSLHLDATTDERS
Subjt:  SQEKDDSWSDCSRKGYA-----------------------------------------------------------VKAQKGDALLFFSLHLDATTDERS

Query:  LHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        LHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
Subjt:  LHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]2.30e-235100Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV
        SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV

Query:  GSEGALGYCRKSCKAC
        GSEGALGYCRKSCKAC
Subjt:  GSEGALGYCRKSCKAC

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus]7.83e-21593.06Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLP-RVSRQDCVDENENCPAWAKRGECKKNPTYM
        SQ KD+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSFEK+  RVSRQ CVDENENC AWAK+GECKKNPTYM
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLP-RVSRQDCVDENENCPAWAKRGECKKNPTYM

Query:  VGSEGALGYCRKSCKAC
        VGS GALGYCRKSCKAC
Subjt:  VGSEGALGYCRKSCKAC

XP_031742194.1 probable prolyl 4-hydroxylase 7 isoform X2 [Cucumis sativus]7.23e-21393.06Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLP-RVSRQDCVDENENCPAWAKRGECKKNPTYM
        SQ KD+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSFEK+  RVSRQ CVDENENC AWAK+GECKKNPTYM
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLP-RVSRQDCVDENENCPAWAKRGECKKNPTYM

Query:  VGSEGALGYCRKSCKAC
        VGS GALGYCRKSCKAC
Subjt:  VGSEGALGYCRKSCKAC

TrEMBL top hitse value%identityAlignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase1.5e-16793.06Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKL-PRVSRQDCVDENENCPAWAKRGECKKNPTYM
        SQ KD+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSFEK+  RVSRQ CVDENENC AWAK+GECKKNPTYM
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKL-PRVSRQDCVDENENCPAWAKRGECKKNPTYM

Query:  VGSEGALGYCRKSCKAC
        VGS GALGYCRKSCKAC
Subjt:  VGSEGALGYCRKSCKAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase3.4e-183100Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV
        SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV

Query:  GSEGALGYCRKSCKAC
        GSEGALGYCRKSCKAC
Subjt:  GSEGALGYCRKSCKAC

A0A5A7T8H4 Procollagen-proline 4-dioxygenase9.9e-16793.4Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE   
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-

Query:  -ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTY
                 +   +    AVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTY
Subjt:  -ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTY

Query:  MVGSEGALGYCRKSCKAC
        MVGSEGALGYCRKSCKAC
Subjt:  MVGSEGALGYCRKSCKAC

A0A5D3CTS4 Procollagen-proline 4-dioxygenase4.4e-17584.27Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGY-----------------------------------------------------------AVKAQKGDALLFFSLHLDATTDERS
        SQEKDDSWSDCSRKGY                                                           AVKAQKGDALLFFSLHLDATTDERS
Subjt:  SQEKDDSWSDCSRKGY-----------------------------------------------------------AVKAQKGDALLFFSLHLDATTDERS

Query:  LHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        LHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
Subjt:  LHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase1.7e-15887.97Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSR FLAFSL FLSV T FARLPET      +K+ +GSVL LK DS  LIFDPTRVTQLSWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQD+IVAG+EARI+AWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS F E
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV
        SQEKDDSWSDC+RKGYAVKAQKGDALLFFSLHLDATTD+RSLHGSCPVIEGEKWSATKWIHVRSF+K  R+S QDCVDEN+NCP+WAKRGEC+KNPTYMV
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV

Query:  GSEGALGYCRKSCKAC
        GSEGA+GYCRKSCKAC
Subjt:  GSEGALGYCRKSCKAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.7e-11566.56Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSM-VADNES
        MDS+ FLAFSL  L +F+                         +  S     DPTR+TQLSW PRAFLYKGFLSDEECDHLI LAK KLEKSM VAD +S
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSM-VADNES

Query:  GKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK
        G+S  SEVRTSSGMFL K QD IVA VEA++AAWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K
Subjt:  GKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK

Query:  ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYM
          Q KDDSWS C+++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K   V    CVD++E+C  WA  GEC+KNP YM
Subjt:  ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYM

Query:  VGSEGALGYCRKSCKAC
        VGSE +LG+CRKSCKAC
Subjt:  VGSEGALGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 23.5e-9260.58Show/hide
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQ
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+G+E +++ WT LP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSL
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  SDC++KG AVK +KG+ALLFF+L  DA  D  SL
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSL

Query:  HGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        HG CPVIEGEKWSATKWIHV SF+K+      +C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 73.9e-12870.25Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSR FLAFSLCFL      +  P  R L  S     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSDEECDH I LAK KLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        +SV SEVRTSSGMFL K QD IV+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  + K 
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV
        +Q KDDSW++C+++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     +  C+DEN +C  WAK GEC+KNPTYMV
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV

Query:  GSEGALGYCRKSCKAC
        GS+   GYCRKSCKAC
Subjt:  GSEGALGYCRKSCKAC

Q8LAN3 Probable prolyl 4-hydroxylase 42.2e-9460.07Show/hide
Query:  SSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQI
        SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+G+E +I+ WT LP ENGE IQ+
Subjt:  SSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQI

Query:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLH
        L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDC+++G AVK +KGDALLFF+LH DA  D  SLH
Subjt:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLH

Query:  GSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        G CPVIEGEKWSATKWIHV SF+++   S  +C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 34.7e-6556.73Show/hide
Query:  LSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHF
        LSW+PRAF+Y  FLS EEC++LI LAK  + KS V D+E+GKS  S VRTSSG FLR+ +DKI+  +E RIA +T +PA++GE +Q+LHYE GQKYEPH+
Subjt:  LSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHF

Query:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATK
        D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP +     S    +  S+C +KG +VK + GDALLF+S+  DAT D  SLHG CPVI G KWS+TK
Subjt:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATK

Query:  WIHVRSFE
        W+HV  ++
Subjt:  WIHVRSFE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 22.5e-9360.58Show/hide
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQ
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+G+E +++ WT LP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSL
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  SDC++KG AVK +KG+ALLFF+L  DA  D  SL
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSL

Query:  HGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        HG CPVIEGEKWSATKWIHV SF+K+      +C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase2.8e-12970.25Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSR FLAFSLCFL      +  P  R L  S     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSDEECDH I LAK KLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        +SV SEVRTSSGMFL K QD IV+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  + K 
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV
        +Q KDDSW++C+++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     +  C+DEN +C  WAK GEC+KNPTYMV
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV

Query:  GSEGALGYCRKSCKAC
        GS+   GYCRKSCKAC
Subjt:  GSEGALGYCRKSCKAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase2.8e-12166.36Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSR FLAFSLCFL      +  P  R L  S     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSDEECDH I LAK KLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSE-----VRTSSGMFLRKAQ---DKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETI
        +SV SE     VR SS           D IV+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+
Subjt:  KSVSSE-----VRTSSGMFLRKAQ---DKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETI

Query:  FPNSEFKESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGEC
        FP  + K +Q KDDSW++C+++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     +  C+DEN +C  WAK GEC
Subjt:  FPNSEFKESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGEC

Query:  KKNPTYMVGSEGALGYCRKSCKAC
        +KNPTYMVGS+   GYCRKSCKAC
Subjt:  KKNPTYMVGSEGALGYCRKSCKAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.2e-11666.56Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSM-VADNES
        MDS+ FLAFSL  L +F+                         +  S     DPTR+TQLSW PRAFLYKGFLSDEECDHLI LAK KLEKSM VAD +S
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSM-VADNES

Query:  GKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK
        G+S  SEVRTSSGMFL K QD IVA VEA++AAWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K
Subjt:  GKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK

Query:  ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYM
          Q KDDSWS C+++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K   V    CVD++E+C  WA  GEC+KNP YM
Subjt:  ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYM

Query:  VGSEGALGYCRKSCKAC
        VGSE +LG+CRKSCKAC
Subjt:  VGSEGALGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.5e-9560.07Show/hide
Query:  SSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQI
        SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+G+E +I+ WT LP ENGE IQ+
Subjt:  SSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQI

Query:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLH
        L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDC+++G AVK +KGDALLFF+LH DA  D  SLH
Subjt:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLH

Query:  GSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC
        G CPVIEGEKWSATKWIHV SF+++   S  +C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGACCATTCCTCGCATTTTCTCTCTGCTTTCTCTCCGTCTTCACCGCCTTCGCTCGCTTGCCGGAAACGCGTATGCTCAAGCACTCCTACAAGCAATCAAC
TGGATCTGTGCTTCGATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCTTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTAT
CTGATGAGGAATGTGATCACCTAATTGATCTGGCTAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAAAGTGTAAGTAGTGAAGTCCGAACGAGT
TCTGGCATGTTCCTTCGGAAGGCCCAGGATAAAATTGTTGCTGGCGTTGAAGCTAGGATAGCTGCATGGACACTTCTTCCAGCAGAAAATGGAGAATCCATTCAAATTCT
TCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGCCACCGCATAGCCACAGTCTTGATGTATTTATCCA
ATGTTGAAAAGGGTGGAGAAACCATCTTTCCTAATTCAGAGTTTAAAGAATCTCAAGAAAAAGATGACAGCTGGTCTGATTGTTCTCGAAAGGGTTATGCAGTTAAAGCG
CAGAAGGGCGATGCGTTGTTGTTCTTCAGCCTACATCTCGACGCAACGACAGATGAAAGAAGTTTGCATGGTAGTTGCCCCGTAATTGAAGGCGAGAAATGGTCTGCAAC
CAAATGGATTCATGTGAGATCCTTTGAGAAGCTACCTCGTGTAAGTAGGCAGGATTGCGTGGACGAGAACGAAAATTGCCCGGCATGGGCAAAAAGGGGAGAGTGCAAAA
AGAACCCTACTTACATGGTGGGTTCTGAAGGTGCTTTAGGATACTGTAGGAAGAGTTGCAAAGCATGCTAA
mRNA sequenceShow/hide mRNA sequence
CCCATTTATTTTTATTTCTTTTTTCTTTTTCCTTTCTCGTAAACGGAAGAACCCTTGAATCGTTGAATTCTTTTTCTTGTTCATTTCATTTTCTCCGATTTGATATCGGA
GAAACAATCATGGATTCTCGACCATTCCTCGCATTTTCTCTCTGCTTTCTCTCCGTCTTCACCGCCTTCGCTCGCTTGCCGGAAACGCGTATGCTCAAGCACTCCTACAA
GCAATCAACTGGATCTGTGCTTCGATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCTTGGCAACCCAGGGCATTTTTGTATAAGG
GATTTTTATCTGATGAGGAATGTGATCACCTAATTGATCTGGCTAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAAAGTGTAAGTAGTGAAGTC
CGAACGAGTTCTGGCATGTTCCTTCGGAAGGCCCAGGATAAAATTGTTGCTGGCGTTGAAGCTAGGATAGCTGCATGGACACTTCTTCCAGCAGAAAATGGAGAATCCAT
TCAAATTCTTCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGCCACCGCATAGCCACAGTCTTGATGT
ATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCTAATTCAGAGTTTAAAGAATCTCAAGAAAAAGATGACAGCTGGTCTGATTGTTCTCGAAAGGGTTATGCA
GTTAAAGCGCAGAAGGGCGATGCGTTGTTGTTCTTCAGCCTACATCTCGACGCAACGACAGATGAAAGAAGTTTGCATGGTAGTTGCCCCGTAATTGAAGGCGAGAAATG
GTCTGCAACCAAATGGATTCATGTGAGATCCTTTGAGAAGCTACCTCGTGTAAGTAGGCAGGATTGCGTGGACGAGAACGAAAATTGCCCGGCATGGGCAAAAAGGGGAG
AGTGCAAAAAGAACCCTACTTACATGGTGGGTTCTGAAGGTGCTTTAGGATACTGTAGGAAGAGTTGCAAAGCATGCTAAAACCCAAACCCTAGGAGGAGGAAGAAGAAG
TAATCGCCATATCTCTCTTTCTTTTTCTGTTTTGGTGAGCTTGAGTGTCGATTTTGTAATGGCTATGTATATAACATTGGGCAGCAACTTGGTATTATAATACTATATAC
AATTACAAGTGGATATTAATTACATCTCTTTCATTAAACCTTGTTGTAGGAATTAACCACAAGAGTTTTCATTTGATAATTAAAAT
Protein sequenceShow/hide protein sequence
MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTS
SGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCSRKGYAVKA
QKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVGSEGALGYCRKSCKAC