CuGenDBv2

CSPI05G28680 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G28680
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: Chr5:27018887..27022765
RNA-Seq Expression: CSPI05G28680
Synteny: CSPI05G28680
Gene Ontology terms:
  GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
  GO:0005789 - endoplasmic reticulum membrane (cellular component)
  GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
  GO:0005506 - iron ion binding (molecular function)
  GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
  IPR003582 - ShKT domain
  IPR005123 - Oxoglutarate/iron-dependent dioxygenase
  IPR006620 - Prolyl 4-hydroxylase, alpha subunit
  IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
  IPR045054 - Prolyl 4-hydroxylase
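
The genome location in this record follows a `chromosome:start..end` pattern. As a minimal sketch, it could be split into its parts and the span length computed as shown here. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr5:27018887..27022765")
print(chrom, start, end, span_length(start, end))
# prints: Chr5 27018887 27022765 3879
```

Under that assumption the gene spans 3879 bp on Chr5. With 0-based, half-open coordinates the length would instead be `end - start`.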


Homology
GenBank top hits (e-value, % identity, alignment)
KAE8648909.1 hypothetical protein Csa_008411 [Cucumis sativus]; e-value: 6.5e-173; identity: 88.92%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPA-----------------------ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPA                       ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPA-----------------------ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSE----------------FKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSF
        LSNVEKGGETIFPNSE                FKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSF
Subjt:  LSNVEKGGETIFPNSE----------------FKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSF

Query:  EKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        EKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
Subjt:  EKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]; e-value: 2.4e-167; identity: 93.06%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG

Query:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYM
        SQ KD+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSFEK+  RVSRQ CVDENENC AWAK+GECKKNPTYM
Subjt:  SQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYM

Query:  VGSGGALGYCRKSCKAC
        VGS GALGYCRKSCKAC
Subjt:  VGSGGALGYCRKSCKAC

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus]; e-value: 5.5e-180; identity: 100%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK
        SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK

Query:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG
        DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG
Subjt:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG

Query:  GALGYCRKSCKAC
        GALGYCRKSCKAC
Subjt:  GALGYCRKSCKAC

XP_031742194.1 probable prolyl 4-hydroxylase 7 isoform X2 [Cucumis sativus]; e-value: 3.9e-178; identity: 99.68%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRTHKQ SGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK
        SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK

Query:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG
        DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG
Subjt:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG

Query:  GALGYCRKSCKAC
        GALGYCRKSCKAC
Subjt:  GALGYCRKSCKAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]; e-value: 9.8e-161; identity: 89.46%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSR FLAF LCFLSVFT FARLPE R+ K+SSGSV+RLKTDSSPL+FDPTRVTQLSW+PRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK
        SEVRTSSGMFLRKAQDE+VA +EARI+AWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ K
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK

Query:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG
        DESWSDC+RKGYAVKA+KGDALLFFSL  DATTD +SLHGSCPVI GEKWSATKWIHVRSFEK T RVSRQ CVDENENC  WAK+GECKKNPTYMVGS 
Subjt:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG

Query:  GALGYCRKSCKAC
         ALGYCRKSC+AC
Subjt:  GALGYCRKSCKAC

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KS38 Procollagen-proline 4-dioxygenase; e-value: 2.7e-180; identity: 100%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK
        SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK

Query:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG
        DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG
Subjt:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG

Query:  GALGYCRKSCKAC
        GALGYCRKSCKAC
Subjt:  GALGYCRKSCKAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase; e-value: 1.2e-167; identity: 93.06%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG

Query:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYM
        SQ KD+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSFEK+  RVSRQ CVDENENC AWAK+GECKKNPTYM
Subjt:  SQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYM

Query:  VGSGGALGYCRKSCKAC
        VGS GALGYCRKSCKAC
Subjt:  VGSGGALGYCRKSCKAC

A0A5D3CTS4 Procollagen-proline 4-dioxygenase; e-value: 1.5e-159; identity: 78.46%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG

Query:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQAKDESWSDCSRKGY-----------------------------------------------------------AVKAQKGDALLFFSLNLDATTDERS
        SQ KD+SWSDCSRKGY                                                           AVKAQKGDALLFFSL+LDATTDERS
Subjt:  SQAKDESWSDCSRKGY-----------------------------------------------------------AVKAQKGDALLFFSLNLDATTDERS

Query:  LHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        LHGSCPVI GEKWSATKWIHVRSFEK+  RVSRQ CVDENENC AWAK+GECKKNPTYMVGS GALGYCRKSCKAC
Subjt:  LHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase; e-value: 4.6e-156; identity: 86.58%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDS  FL+FSLCFL VFTA ARLP+ R HK+ SGSVLRLK + SPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK
        SEVRTSSGMFL KAQDE+VA VEARIAAWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQ K
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK

Query:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG
        D+SWSDC+RKGYAVKA+KGDALLFFSL+LDATTD +SLHGSCPVI GEKWSATKWIHVRSFEK T    R  CVDENENC +WAK+GECKKNPTYMVGS 
Subjt:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG

Query:  GALGYCRKSCKAC
         ALGYCRKSC+AC
Subjt:  GALGYCRKSCKAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase; e-value: 2.8e-153; identity: 86.9%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSR FLAFSL FLSV T FARLPE  THK+ SGSVL LK DS  LIFDPTRVTQLSWQPRAFLYKGFL+D ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK
        SEVRTSSGMFLRKAQDE+VAG+EARI+AWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS F ESQ K
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK

Query:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG
        D+SWSDC+RKGYAVKAQKGDALLFFSL+LDATTD+RSLHGSCPVI GEKWSATKWIHVRSF+K T R+S Q CVDEN+NC +WAK+GEC+KNPTYMVGS 
Subjt:  DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG

Query:  GALGYCRKSCKAC
        GA+GYCRKSCKAC
Subjt:  GALGYCRKSCKAC

SwissProt top hits (e-value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6; e-value: 6.0e-113; identity: 65.29%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSM-VADNDSGKSV
        MDS+ FLAFSL  L +F+                     +  S     DPTR+TQLSW PRAFLYKGFLSD ECDHLI LAK KLEKSM VAD DSG+S 
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSM-VADNDSGKSV

Query:  SSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQA
         SEVRTSSGMFL K QD++VA VEA++AAWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q 
Subjt:  SSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQA

Query:  KDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGS
        KD+SWS C+++GYAVK +KGDALLFF+L+L+ TTD  SLHGSCPVI GEKWSAT+WIHVRSF K      +  CVD++E+C  WA  GEC+KNP YMVGS
Subjt:  KDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGS

Query:  GGALGYCRKSCKAC
          +LG+CRKSCKAC
Subjt:  GGALGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 2; e-value: 1.3e-91; identity: 60.51%
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQ
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADND+G+S  S+VRTSSG F+ K +D +V+G+E +++ WT LP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EFKE---SQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERS
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF     S+ KD+  SDC++KG AVK +KG+ALLFF+L  DA  D  S
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EFKE---SQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERS

Query:  LHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        LHG CPVI GEKWSATKWIHV SF+KI +      C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  LHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 7; e-value: 1.9e-127; identity: 69.3%
Query:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK
        MDSR FLAFSLCFL      +  P    TR+     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSD ECDH I LAK KLEKSMVADNDSG+
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK

Query:  SVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES
        SV SEVRTSSGMFL K QD++V+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  + K +
Subjt:  SVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES

Query:  QAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMV
        Q KD+SW++C+++GYAVK +KGDALLFF+L+ +ATTD  SLHGSCPV+ GEKWSAT+WIHV+SFE+  ++ S  GC+DEN +C  WAK GEC+KNPTYMV
Subjt:  QAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMV

Query:  GSGGALGYCRKSCKAC
        GS    GYCRKSCKAC
Subjt:  GSGGALGYCRKSCKAC

Q8LAN3 Probable prolyl 4-hydroxylase 4; e-value: 3.1e-93; identity: 58.04%
Query:  QSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWT
        QSS S++     SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADNDSG+S  SEVRTSSG F+ K +D +V+G+E +I+ WT
Subjt:  QSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWT

Query:  LLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQA---KDESWSDCSRKGYAVKAQKGDALLFFSL
         LP ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +      E  SDC+++G AVK +KGDALLFF+L
Subjt:  LLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQA---KDESWSDCSRKGYAVKAQKGDALLFFSL

Query:  NLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        + DA  D  SLHG CPVI GEKWSATKWIHV SF++I +      C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  NLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 3; e-value: 2.0e-63; identity: 54.81%
Query:  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHF
        LSW+PRAF+Y  FLS  EC++LI LAK  + KS V D+++GKS  S VRTSSG FLR+ +D+++  +E RIA +T +PA++GE +Q+LHYE GQKYEPH+
Subjt:  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHF

Query:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK-DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATK
        D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP +    S        S+C +KG +VK + GDALLF+S+  DAT D  SLHG CPVI G KWS+TK
Subjt:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAK-DESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATK

Query:  WIHVRSFE
        W+HV  ++
Subjt:  WIHVRSFE

Arabidopsis top hits (e-value, % identity, alignment)
AT3G06300.1 P4H isoform 2; e-value: 9.3e-93; identity: 60.51%
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQ
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADND+G+S  S+VRTSSG F+ K +D +V+G+E +++ WT LP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EFKE---SQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERS
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF     S+ KD+  SDC++KG AVK +KG+ALLFF+L  DA  D  S
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EFKE---SQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERS

Query:  LHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        LHG CPVI GEKWSATKWIHV SF+KI +      C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  LHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase; e-value: 1.4e-128; identity: 69.3%
Query:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK
        MDSR FLAFSLCFL      +  P    TR+     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSD ECDH I LAK KLEKSMVADNDSG+
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK

Query:  SVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES
        SV SEVRTSSGMFL K QD++V+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  + K +
Subjt:  SVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES

Query:  QAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMV
        Q KD+SW++C+++GYAVK +KGDALLFF+L+ +ATTD  SLHGSCPV+ GEKWSAT+WIHV+SFE+  ++ S  GC+DEN +C  WAK GEC+KNPTYMV
Subjt:  QAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMV

Query:  GSGGALGYCRKSCKAC
        GS    GYCRKSCKAC
Subjt:  GSGGALGYCRKSCKAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase; e-value: 1.4e-120; identity: 65.43%
Query:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK
        MDSR FLAFSLCFL      +  P    TR+     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSD ECDH I LAK KLEKSMVADNDSG+
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK

Query:  SVSSE-----VRTSSGMFLRKAQ---DEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF
        SV SE     VR SS           D++V+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+F
Subjt:  SVSSE-----VRTSSGMFLRKAQ---DEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF

Query:  PNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGEC
        P  + K +Q KD+SW++C+++GYAVK +KGDALLFF+L+ +ATTD  SLHGSCPV+ GEKWSAT+WIHV+SFE+  ++ S  GC+DEN +C  WAK GEC
Subjt:  PNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGEC

Query:  KKNPTYMVGSGGALGYCRKSCKAC
        +KNPTYMVGS    GYCRKSCKAC
Subjt:  KKNPTYMVGSGGALGYCRKSCKAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase; e-value: 4.3e-114; identity: 65.29%
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSM-VADNDSGKSV
        MDS+ FLAFSL  L +F+                     +  S     DPTR+TQLSW PRAFLYKGFLSD ECDHLI LAK KLEKSM VAD DSG+S 
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSM-VADNDSGKSV

Query:  SSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQA
         SEVRTSSGMFL K QD++VA VEA++AAWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q 
Subjt:  SSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQA

Query:  KDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGS
        KD+SWS C+++GYAVK +KGDALLFF+L+L+ TTD  SLHGSCPVI GEKWSAT+WIHVRSF K      +  CVD++E+C  WA  GEC+KNP YMVGS
Subjt:  KDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGS

Query:  GGALGYCRKSCKAC
          +LG+CRKSCKAC
Subjt:  GGALGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e-value: 2.2e-94; identity: 58.04%
Query:  QSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWT
        QSS S++     SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADNDSG+S  SEVRTSSG F+ K +D +V+G+E +I+ WT
Subjt:  QSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWT

Query:  LLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQA---KDESWSDCSRKGYAVKAQKGDALLFFSL
         LP ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +      E  SDC+++G AVK +KGDALLFF+L
Subjt:  LLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQA---KDESWSDCSRKGYAVKAQKGDALLFFSL

Query:  NLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        + DA  D  SLHG CPVI GEKWSATKWIHV SF++I +      C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  NLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC


Sequences
CDS sequence
ATGGATTCTCGACCATTCCTCGCATTTTCTCTCTGCTTTCTCTCCGTCTTCACCGCCTTCGCTCGCTTGCCGGAAACGCGTACCCACAAGCAATCAAGTGGATCTGTGCT
TCGATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATGCGGAAT
GTGATCACCTAATTGATCTGGCTAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGATTCTGGTAAAAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTC
CTTCGGAAGGCCCAGGATGAAGTTGTTGCTGGCGTTGAAGCCAGGATAGCTGCATGGACACTCCTTCCAGCAGAAAATGGCGAATCCATTCAAATTCTTCACTATGAGAA
TGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGCCACCGCATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGG
GTGGAGAAACCATCTTTCCTAATTCAGAGTTTAAAGAATCTCAAGCAAAAGATGAGAGCTGGTCTGATTGTTCTCGAAAGGGTTATGCAGTTAAAGCGCAGAAGGGCGAT
GCATTGTTGTTCTTCAGCCTAAATCTCGACGCAACAACAGATGAAAGAAGTTTGCACGGTAGTTGCCCTGTAATTGCAGGCGAGAAATGGTCTGCAACCAAATGGATTCA
TGTGAGATCCTTTGAGAAGATAACTTCTCGTGTTAGTAGACAGGGTTGCGTGGACGAGAACGAAAATTGCCTGGCATGGGCAAAAAAGGGAGAGTGCAAAAAGAACCCTA
CTTACATGGTGGGTTCTGGAGGTGCTTTAGGATACTGTAGGAAGAGCTGCAAAGCATGCTAA
mRNA sequence
CAAATTTATTATAACCGCCCACTGATTCTCATTTTTCATTAATTGTCTTCCTAGTTGAATTTTCATAATTAATTCAATTCAAATCCTTTTATTTTTCCTTTTTCCTTCCT
CGTAAACGAAGAACCCTTCAATCGTTGAATTCTTTTTCTTGTTCATTTCTCCGATTTGACATCGGAGAAACAATCATGGATTCTCGACCATTCCTCGCATTTTCTCTCTG
CTTTCTCTCCGTCTTCACCGCCTTCGCTCGCTTGCCGGAAACGCGTACCCACAAGCAATCAAGTGGATCTGTGCTTCGATTGAAGACGGATTCATCTCCGCTCATTTTCG
ATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATGCGGAATGTGATCACCTAATTGATCTGGCTAAGGATAAATTA
GAGAAGTCAATGGTAGCAGATAATGATTCTGGTAAAAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTCCTTCGGAAGGCCCAGGATGAAGTTGTTGCTGGCGT
TGAAGCCAGGATAGCTGCATGGACACTCCTTCCAGCAGAAAATGGCGAATCCATTCAAATTCTTCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTC
ACGACAAGGTGAATCAGGAGTTAGGTGGCCACCGCATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCTAATTCAGAGTTTAAA
GAATCTCAAGCAAAAGATGAGAGCTGGTCTGATTGTTCTCGAAAGGGTTATGCAGTTAAAGCGCAGAAGGGCGATGCATTGTTGTTCTTCAGCCTAAATCTCGACGCAAC
AACAGATGAAAGAAGTTTGCACGGTAGTTGCCCTGTAATTGCAGGCGAGAAATGGTCTGCAACCAAATGGATTCATGTGAGATCCTTTGAGAAGATAACTTCTCGTGTTA
GTAGACAGGGTTGCGTGGACGAGAACGAAAATTGCCTGGCATGGGCAAAAAAGGGAGAGTGCAAAAAGAACCCTACTTACATGGTGGGTTCTGGAGGTGCTTTAGGATAC
TGTAGGAAGAGCTGCAAAGCATGCTAAAACCCTAGGAGGAGGAAGAAGAAGAAGTAATCCCCACATCTCTCTTTCTTTTTTTCTGTTTTGCTGAGCTTGTGTGTCGATTT
TGTAATGGCTATGTATATAACATTGGGCAGCAACTTGGTATACTATATAATATTACAAGTGGATATTAATTACAGCTTTCATTAAACCTTGTTTTAGCAATTAACCACAA
AAGAGTTATCATTTGATAATTGAATATGCAATGAGAAGTTTTCTCATGTATGATCCTTATTGGCTGCTTGACTTTTATATTCAACTTTACAAACC
Protein sequence
MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMF
LRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGD
ALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
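
The CDS and protein sequences listed in this record can be checked against each other by translating the CDS codon by codon. What follows is a minimal sketch, assuming the standard genetic code applies to this nuclear gene. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling. Only the first ten and last five codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G base order.
# ASSUMPTION: the standard table is the right one for this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Raises KeyError for ambiguous bases such as 'N'.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 15 nt of the CDS shown in this record.
print(translate("ATGGATTCTCGACCATTCCTCGCATTTTCT"))  # prints: MDSRPFLAFS
print(translate("TGCAAAGCATGCTAA"))                 # prints: CKAC*
```

Both fragments agree with the start (`MDSRPFLAFS`) and end (`CKAC`) of the protein sequence above. Joining the CDS lines and passing the whole string to `translate` would extend the check to the full length.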