; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0000110 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0000110
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr09:19491366..19495407
RNA-Seq ExpressionPI0000110
SyntenyPI0000110
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK15293.1 putative prolyl 4-hydroxylase 7 [Cucumis melo var. makuwa]5.9e-16681.33Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----TNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETR    + KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----TNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGY-----------------------------------------------------------AVKAQKGDALLFFSLHLDATTDERS
        SQEKDDSWSDCSRKGY                                                           AVKAQKGDALLFFSLHLDATTDERS
Subjt:  SQEKDDSWSDCSRKGY-----------------------------------------------------------AVKAQKGDALLFFSLHLDATTDERS

Query:  LHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCKAC
        LHGSCPVIEGEKWSATKWIHVRSFEKL  VSRQDC+DENENCPAWAKRGECKKNPTYMVGSE ALGYCRKSCKAC
Subjt:  LHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCKAC

XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]4.5e-17496.52Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----TNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETR    + KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----TNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMV
        SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKL  VSRQDC+DENENCPAWAKRGECKKNPTYMV
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMV

Query:  GSESALGYCRKSCKAC
        GSE ALGYCRKSCKAC
Subjt:  GSESALGYCRKSCKAC

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus]1.3e-16894.25Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRT+KQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ K
Subjt:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLT-HVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE
        D+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSFEK+T  VSRQ C+DENENC AWAK+GECKKNPTYMVGS 
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLT-HVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCKAC
         ALGYCRKSCKAC
Subjt:  SALGYCRKSCKAC

XP_031742194.1 probable prolyl 4-hydroxylase 7 isoform X2 [Cucumis sativus]9.1e-16793.93Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRT+KQ SGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ K
Subjt:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLT-HVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE
        D+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSFEK+T  VSRQ C+DENENC AWAK+GECKKNPTYMVGS 
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLT-HVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCKAC
         ALGYCRKSCKAC
Subjt:  SALGYCRKSCKAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]1.9e-16490.71Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSR FLAF LCFLSVFT FARLPE R+ K+SSGSV+RLKTDSSPL+FDPTRVTQLSW+PRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFLRKAQD+IVA +EARI+AWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSES
        D+SWSDC+RKGYAVKA+KGDALLFFSL  DATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK T VSRQDC+DENENC  WAKRGECKKNPTYMVGSE 
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSES

Query:  ALGYCRKSCKAC
        ALGYCRKSC+AC
Subjt:  ALGYCRKSCKAC

TrEMBL top hitse value%identityAlignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase6.1e-16994.25Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRT+KQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ K
Subjt:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLT-HVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE
        D+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSFEK+T  VSRQ C+DENENC AWAK+GECKKNPTYMVGS 
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLT-HVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCKAC
         ALGYCRKSCKAC
Subjt:  SALGYCRKSCKAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase2.2e-17496.52Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----TNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETR    + KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----TNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMV
        SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKL  VSRQDC+DENENCPAWAKRGECKKNPTYMV
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMV

Query:  GSESALGYCRKSCKAC
        GSE ALGYCRKSCKAC
Subjt:  GSESALGYCRKSCKAC

A0A5D3CTS4 Procollagen-proline 4-dioxygenase2.8e-16681.33Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----TNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
        MDSRPFLAFSLCFLSVFTAFARLPETR    + KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----TNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCSRKGY-----------------------------------------------------------AVKAQKGDALLFFSLHLDATTDERS
        SQEKDDSWSDCSRKGY                                                           AVKAQKGDALLFFSLHLDATTDERS
Subjt:  SQEKDDSWSDCSRKGY-----------------------------------------------------------AVKAQKGDALLFFSLHLDATTDERS

Query:  LHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCKAC
        LHGSCPVIEGEKWSATKWIHVRSFEKL  VSRQDC+DENENCPAWAKRGECKKNPTYMVGSE ALGYCRKSCKAC
Subjt:  LHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCKAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase2.0e-15988.82Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS
        MDS  FL+FSLCFL VFTA ARLP+ R +K+ SGSVLRLK + SPLIFDPTRVTQLSWQPRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADN SGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL KAQD+IVA VEARIAAWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQ-DCMDENENCPAWAKRGECKKNPTYMVGSE
        DDSWSDC+RKGYAVKA+KGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK T  SR+ DC+DENENC +WAKRGECKKNPTYMVGSE
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQ-DCMDENENCPAWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCKAC
        SALGYCRKSC+AC
Subjt:  SALGYCRKSCKAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase3.7e-15888.78Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSR FLAFSL FLSV T FARLPE  T+K+ SGSVL LK DS  LIFDPTRVTQLSWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFLRKAQD+IVAG+EARI+AWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS F ESQEK
Subjt:  SEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSES
        DDSWSDC+RKGYAVKAQKGDALLFFSLHLDATTD+RSLHGSCPVIEGEKWSATKWIHVRSF+K T +S QDC+DEN+NCP+WAKRGEC+KNPTYMVGSE 
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSES

Query:  ALGYCRKSCKAC
        A+GYCRKSCKAC
Subjt:  ALGYCRKSCKAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 69.9e-11667.09Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSM-VADNESGKSV
        MDS+ FLAFSL  L +F+                     +  S     DPTR+TQLSW PRAFLYKGFLSDEECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSM-VADNESGKSV

Query:  SSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQE
         SEVRTSSGMFL K QD IVA VEA++AAWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q 
Subjt:  SSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQE

Query:  KDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE
        KDDSWS C+++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K   V    C+D++E+C  WA  GEC+KNP YMVGSE
Subjt:  KDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCKAC
        ++LG+CRKSCKAC
Subjt:  SALGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 26.9e-9361.45Show/hide
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQ
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+G+E +++ WT LP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSL
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  SDC++KG AVK +KG+ALLFF+L  DA  D  SL
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSL

Query:  HGSCPVIEGEKWSATKWIHVRSFEK-LTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCKAC
        HG CPVIEGEKWSATKWIHV SF+K LTH    +C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFEK-LTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 72.7e-12970.48Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGK
        MDSR FLAFSLCFL      +  P    TR++    GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSDEECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGK

Query:  SVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES
        SV SEVRTSSGMFL K QD IV+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  + K +
Subjt:  SVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES

Query:  QEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVG
        Q KDDSW++C+++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+  +  +  CMDEN +C  WAK GEC+KNPTYMVG
Subjt:  QEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVG

Query:  SESALGYCRKSCKAC
        S+   GYCRKSCKAC
Subjt:  SESALGYCRKSCKAC

Q8LAN3 Probable prolyl 4-hydroxylase 44.3e-9556.62Show/hide
Query:  LSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRK
        +S F  F+ L ++ T+  SS SV            +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K
Subjt:  LSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRK

Query:  AQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCSRK
         +D IV+G+E +I+ WT LP ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDC+++
Subjt:  AQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCSRK

Query:  GYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCK
        G AVK +KGDALLFF+LH DA  D  SLHG CPVIEGEKWSATKWIHV SF+++   S  +C D NE+C  WA  GEC KNP YMVG+    GYCR+SCK
Subjt:  GYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCK

Query:  AC
        AC
Subjt:  AC

Q9LN20 Probable prolyl 4-hydroxylase 36.1e-6556.73Show/hide
Query:  LSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHF
        LSW+PRAF+Y  FLS EEC++LI LAK  + KS V D+E+GKS  S VRTSSG FLR+ +DKI+  +E RIA +T +PA++GE +Q+LHYE GQKYEPH+
Subjt:  LSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHF

Query:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATK
        D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP +     S    +  S+C +KG +VK + GDALLF+S+  DAT D  SLHG CPVI G KWS+TK
Subjt:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATK

Query:  WIHVRSFE
        W+HV  ++
Subjt:  WIHVRSFE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 24.9e-9461.45Show/hide
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQ
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+G+E +++ WT LP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSL
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  SDC++KG AVK +KG+ALLFF+L  DA  D  SL
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSL

Query:  HGSCPVIEGEKWSATKWIHVRSFEK-LTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCKAC
        HG CPVIEGEKWSATKWIHV SF+K LTH    +C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFEK-LTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase1.9e-13070.48Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGK
        MDSR FLAFSLCFL      +  P    TR++    GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSDEECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGK

Query:  SVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES
        SV SEVRTSSGMFL K QD IV+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  + K +
Subjt:  SVSSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES

Query:  QEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVG
        Q KDDSW++C+++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+  +  +  CMDEN +C  WAK GEC+KNPTYMVG
Subjt:  QEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVG

Query:  SESALGYCRKSCKAC
        S+   GYCRKSCKAC
Subjt:  SESALGYCRKSCKAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.9e-12266.56Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGK
        MDSR FLAFSLCFL      +  P    TR++    GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSDEECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGK

Query:  SVSSE-----VRTSSGMFLRKAQ---DKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF
        SV SE     VR SS           D IV+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+F
Subjt:  SVSSE-----VRTSSGMFLRKAQ---DKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF

Query:  PNSEFKESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECK
        P  + K +Q KDDSW++C+++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+  +  +  CMDEN +C  WAK GEC+
Subjt:  PNSEFKESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECK

Query:  KNPTYMVGSESALGYCRKSCKAC
        KNPTYMVGS+   GYCRKSCKAC
Subjt:  KNPTYMVGSESALGYCRKSCKAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase7.0e-11767.09Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSM-VADNESGKSV
        MDS+ FLAFSL  L +F+                     +  S     DPTR+TQLSW PRAFLYKGFLSDEECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSM-VADNESGKSV

Query:  SSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQE
         SEVRTSSGMFL K QD IVA VEA++AAWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q 
Subjt:  SSEVRTSSGMFLRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQE

Query:  KDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE
        KDDSWS C+++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K   V    C+D++E+C  WA  GEC+KNP YMVGSE
Subjt:  KDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCKAC
        ++LG+CRKSCKAC
Subjt:  SALGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.1e-9656.62Show/hide
Query:  LSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRK
        +S F  F+ L ++ T+  SS SV            +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K
Subjt:  LSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRK

Query:  AQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCSRK
         +D IV+G+E +I+ WT LP ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDC+++
Subjt:  AQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCSRK

Query:  GYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCK
        G AVK +KGDALLFF+LH DA  D  SLHG CPVIEGEKWSATKWIHV SF+++   S  +C D NE+C  WA  GEC KNP YMVG+    GYCR+SCK
Subjt:  GYAVKAQKGDALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCK

Query:  AC
        AC
Subjt:  AC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGACCATTCCTCGCATTTTCTCTCTGCTTTCTCTCCGTCTTCACCGCCTTCGCTCGCTTGCCGGAAACGCGTACGAACAAGCAATCAAGTGGATCTGTGCT
TCGATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATGAGGAAT
GTGATCACCTAATTGATCTGGCTAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAAAGTGTAAGTAGTGAAGTCCGTACGAGTTCTGGCATGTTT
CTTCGGAAGGCCCAGGATAAAATTGTTGCTGGCGTTGAAGCCAGGATAGCTGCATGGACACTCCTTCCAGCAGAAAATGGAGAATCCATTCAAATTCTTCACTATGAGAA
TGGTCAAAAATATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGCCACCGCATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGG
GTGGAGAAACCATCTTTCCTAATTCCGAGTTTAAAGAATCTCAAGAAAAAGATGACAGCTGGTCTGATTGTTCTCGAAAGGGTTATGCAGTTAAAGCGCAGAAGGGCGAT
GCATTGTTGTTCTTCAGCCTACATCTCGACGCAACGACAGATGAAAGAAGTTTGCATGGTAGTTGCCCTGTAATTGAAGGCGAGAAATGGTCTGCAACCAAATGGATTCA
TGTGAGATCCTTTGAGAAGCTAACTCATGTAAGCAGGCAGGATTGCATGGACGAGAACGAAAATTGCCCGGCATGGGCGAAAAGGGGAGAGTGCAAAAAGAACCCTACTT
ACATGGTGGGTTCTGAAAGTGCTTTAGGATACTGTAGGAAGAGTTGCAAAGCATGCTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAAGAAAAAAGAAAAAAAGAAAAAGAAAAGAAAGATAATTTGTTTGAATTTTCTTAATTTGCAATTTCCAAAAAACCGCCCACTGATTCTAATCTTTCATTAATTGT
CTTCATAGTTGAATTTTCATAATTAGTTCAAACCCATTTATATATTTTTTTTTTTCTTTTTCCTTTCTTGTAAACGAAAGAACCCTTGAATCGTTGAATTATTTTTCTTG
TTCATTTCTCCGATTTGATATCGGAGAAACAATCATGGATTCTCGACCATTCCTCGCATTTTCTCTCTGCTTTCTCTCCGTCTTCACCGCCTTCGCTCGCTTGCCGGAAA
CGCGTACGAACAAGCAATCAAGTGGATCTGTGCTTCGATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCA
TTTTTGTATAAGGGATTTTTATCTGATGAGGAATGTGATCACCTAATTGATCTGGCTAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAAAGTGT
AAGTAGTGAAGTCCGTACGAGTTCTGGCATGTTTCTTCGGAAGGCCCAGGATAAAATTGTTGCTGGCGTTGAAGCCAGGATAGCTGCATGGACACTCCTTCCAGCAGAAA
ATGGAGAATCCATTCAAATTCTTCACTATGAGAATGGTCAAAAATATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGCCACCGCATAGCC
ACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCTAATTCCGAGTTTAAAGAATCTCAAGAAAAAGATGACAGCTGGTCTGATTGTTCTCG
AAAGGGTTATGCAGTTAAAGCGCAGAAGGGCGATGCATTGTTGTTCTTCAGCCTACATCTCGACGCAACGACAGATGAAAGAAGTTTGCATGGTAGTTGCCCTGTAATTG
AAGGCGAGAAATGGTCTGCAACCAAATGGATTCATGTGAGATCCTTTGAGAAGCTAACTCATGTAAGCAGGCAGGATTGCATGGACGAGAACGAAAATTGCCCGGCATGG
GCGAAAAGGGGAGAGTGCAAAAAGAACCCTACTTACATGGTGGGTTCTGAAAGTGCTTTAGGATACTGTAGGAAGAGTTGCAAAGCATGCTAAAACCCTAGGAGGAAGAA
GAGGGAGAAGAAGAAGAAGAAGTAATCCCCACATCTCTCTTTCTTTTTCTGTTTTGCTGAGCTTGGGTGTCGATTTTGTAATTGGCTATGTATATAACATTGGGCAGCAA
CTTGGTATACTATATACAATTACAAGTGGATATTAATTACATCTCTTTCATTAAACCTTGTTGTAGCAATTAACCACAAGAGTTTCATTTGATAATTTAAATGCAATTAG
AAGTTTTCTCTTGTATGATGCTTATTGGCTGGTTAACTTTTCTATTCAACTTTACAAATTTT
Protein sequenceShow/hide protein sequence
MDSRPFLAFSLCFLSVFTAFARLPETRTNKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMF
LRKAQDKIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCSRKGYAVKAQKGD
ALLFFSLHLDATTDERSLHGSCPVIEGEKWSATKWIHVRSFEKLTHVSRQDCMDENENCPAWAKRGECKKNPTYMVGSESALGYCRKSCKAC