; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G8288 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G8288
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationctg1557:4343986..4347449
RNA-Seq ExpressionCucsat.G8288
SyntenyCucsat.G8288
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016020 - membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648909.1 hypothetical protein Csa_008411 [Cucumis sativus]2.54e-23188.68Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPA-----------------------ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPA                       ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPA-----------------------ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSC
        LSNVEKGGETIFPNSE                   TYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSC
Subjt:  LSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSC

Query:  PVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        PVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
Subjt:  PVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]4.57e-20783.81Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG

Query:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWY
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE   
Subjt:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWY

Query:  GSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSF
                                        FKESQ KD+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSF
Subjt:  GSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSF

Query:  EKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        EK+  RVSRQ CVDENENC AWAK+GECKKNPTYMVGS GALGYCRKSCKAC
Subjt:  EKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus]1.00e-22389.94Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS
        SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE       
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS

Query:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT
                                    FKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT
Subjt:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT

Query:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
Subjt:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

XP_031742194.1 probable prolyl 4-hydroxylase 7 isoform X2 [Cucumis sativus]2.66e-22189.66Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRTHKQS GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS
        SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE       
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS

Query:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT
                                    FKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT
Subjt:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT

Query:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
Subjt:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]1.90e-19880.46Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSR FLAF LCFLSVFT FARLPE R+ K+SSGSV+RLKTDSSPL+FDPTRVTQLSW+PRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS
        SEVRTSSGMFLRKAQDE+VA +EARI+AWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE       
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS

Query:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT
                                    FKESQ KDESWSDC+RKGYAVKA+KGDALLFFSL  DATTD +SLHGSCPVI GEKWSATKWIHVRSFEK T
Subjt:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT

Query:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
         RVSRQ CVDENENC  WAK+GECKKNPTYMVGS  ALGYCRKSC+AC
Subjt:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

TrEMBL top hitse value%identityAlignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase4.86e-22489.94Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS
        SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE       
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS

Query:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT
                                    FKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT
Subjt:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT

Query:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
Subjt:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase2.21e-20783.81Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG

Query:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWY
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE   
Subjt:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWY

Query:  GSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSF
                                        FKESQ KD+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSF
Subjt:  GSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSF

Query:  EKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        EK+  RVSRQ CVDENENC AWAK+GECKKNPTYMVGS GALGYCRKSCKAC
Subjt:  EKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

A0A5A7T8H4 Procollagen-proline 4-dioxygenase8.19e-19280.4Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG

Query:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWY
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE+  
Subjt:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWY

Query:  GSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSF
        G G   S   +     T++ I                              AVKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSF
Subjt:  GSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSF

Query:  EKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        EK+  RVSRQ CVDENENC AWAK+GECKKNPTYMVGS GALGYCRKSCKAC
Subjt:  EKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

A0A5D3CTS4 Procollagen-proline 4-dioxygenase8.81e-19671.78Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG
        MDSRPFLAFSLCFLSVFTAFARLPETR    ++KQS+GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETR----THKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSG

Query:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWY
        KSVSSEVRTSSGMFLRKAQD++VAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE   
Subjt:  KSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWY

Query:  GSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYA------------------------------------------------
                                        FKESQ KD+SWSDCSRKGYA                                                
Subjt:  GSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYA------------------------------------------------

Query:  -----------VKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGA
                   VKAQKGDALLFFSL+LDATTDERSLHGSCPVI GEKWSATKWIHVRSFEK+  RVSRQ CVDENENC AWAK+GECKKNPTYMVGS GA
Subjt:  -----------VKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGA

Query:  LGYCRKSCKAC
        LGYCRKSCKAC
Subjt:  LGYCRKSCKAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase3.38e-19277.87Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS
        MDS  FL+FSLCFL VFTA ARLP+ R HK+ SGSVLRLK + SPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVS

Query:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS
        SEVRTSSGMFL KAQDE+VA VEARIAAWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSE       
Subjt:  SEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGS

Query:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT
                                    FKESQ KD+SWSDC+RKGYAVKA+KGDALLFFSL+LDATTD +SLHGSCPVI GEKWSATKWIHVRSFEK T
Subjt:  ATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT

Query:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
            R  CVDENENC +WAK+GECKKNPTYMVGS  ALGYCRKSC+AC
Subjt:  SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.0e-10859.31Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSM-VADNDSGKSV
        MDS+ FLAFSL  L +F+                     +  S     DPTR+TQLSW PRAFLYKGFLSD ECDHLI LAK KLEKSM VAD DSG+S 
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSM-VADNDSGKSV

Query:  SSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSG
         SEVRTSSGMFL K QD++VA VEA++AAWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN   W G  
Subjt:  SSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSG

Query:  SATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKI
                                      K  Q KD+SWS C+++GYAVK +KGDALLFF+L+L+ TTD  SLHGSCPVI GEKWSAT+WIHVRSF K 
Subjt:  SATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKI

Query:  TSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
             +  CVD++E+C  WA  GEC+KNP YMVGS  +LG+CRKSCKAC
Subjt:  TSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 22.6e-8854.4Show/hide
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQ
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADND+G+S  S+VRTSSG F+ K +D +V+G+E +++ WT LP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSD
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP+++ +    S  S+                           S+ KD+  SD
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSD

Query:  CSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYC
        C++KG AVK +KG+ALLFF+L  DA  D  SLHG CPVI GEKWSATKWIHV SF+KI +      C D NE+C  WA  GEC KNP YMVG+    G C
Subjt:  CSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYC

Query:  RKSCKAC
        R+SCKAC
Subjt:  RKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 71.9e-12362.96Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK
        MDSR FLAFSLCFL      +  P    TR+     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSD ECDH I LAK KLEKSMVADNDSG+
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK

Query:  SVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYG
        SV SEVRTSSGMFL K QD++V+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP   +W G
Subjt:  SVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYG

Query:  SGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFE
                                        K +Q KD+SW++C+++GYAVK +KGDALLFF+L+ +ATTD  SLHGSCPV+ GEKWSAT+WIHV+SFE
Subjt:  SGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFE

Query:  KITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        +  ++ S  GC+DEN +C  WAK GEC+KNPTYMVGS    GYCRKSCKAC
Subjt:  KITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

Q8LAN3 Probable prolyl 4-hydroxylase 41.6e-9053.14Show/hide
Query:  QSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWT
        QSS S++     SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADNDSG+S  SEVRTSSG F+ K +D +V+G+E +I+ WT
Subjt:  QSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWT

Query:  LLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFK
         LP ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E+                         P+   LS    
Subjt:  LLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFK

Query:  ESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTY
              E  SDC+++G AVK +KGDALLFF+L+ DA  D  SLHG CPVI GEKWSATKWIHV SF++I +      C D NE+C  WA  GEC KNP Y
Subjt:  ESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTY

Query:  MVGSGGALGYCRKSCKAC
        MVG+    GYCR+SCKAC
Subjt:  MVGSGGALGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 35.9e-6147.93Show/hide
Query:  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHF
        LSW+PRAF+Y  FLS  EC++LI LAK  + KS V D+++GKS  S VRTSSG FLR+ +D+++  +E RIA +T +PA++GE +Q+LHYE GQKYEPH+
Subjt:  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHF

Query:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGD
        D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP + +     + +SVP +                                S+C +KG +VK + GD
Subjt:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGD

Query:  ALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFE
        ALLF+S+  DAT D  SLHG CPVI G KWS+TKW+HV  ++
Subjt:  ALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.8e-8954.4Show/hide
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQ
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADND+G+S  S+VRTSSG F+ K +D +V+G+E +++ WT LP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSD
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP+++ +    S  S+                           S+ KD+  SD
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSD

Query:  CSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYC
        C++KG AVK +KG+ALLFF+L  DA  D  SLHG CPVI GEKWSATKWIHV SF+KI +      C D NE+C  WA  GEC KNP YMVG+    G C
Subjt:  CSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYC

Query:  RKSCKAC
        R+SCKAC
Subjt:  RKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase1.3e-12462.96Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK
        MDSR FLAFSLCFL      +  P    TR+     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSD ECDH I LAK KLEKSMVADNDSG+
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK

Query:  SVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYG
        SV SEVRTSSGMFL K QD++V+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP   +W G
Subjt:  SVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYG

Query:  SGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFE
                                        K +Q KD+SW++C+++GYAVK +KGDALLFF+L+ +ATTD  SLHGSCPV+ GEKWSAT+WIHV+SFE
Subjt:  SGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFE

Query:  KITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        +  ++ S  GC+DEN +C  WAK GEC+KNPTYMVGS    GYCRKSCKAC
Subjt:  KITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.3e-11659.61Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK
        MDSR FLAFSLCFL      +  P    TR+     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSD ECDH I LAK KLEKSMVADNDSG+
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPE---TRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGK

Query:  SVSSE-----VRTSSGMFLRKAQ---DEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF
        SV SE     VR SS           D++V+ VEA++AAWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+F
Subjt:  SVSSE-----VRTSSGMFLRKAQ---DEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF

Query:  PNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATK
        P   +W G                                K +Q KD+SW++C+++GYAVK +KGDALLFF+L+ +ATTD  SLHGSCPV+ GEKWSAT+
Subjt:  PNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATK

Query:  WIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
        WIHV+SFE+  ++ S  GC+DEN +C  WAK GEC+KNPTYMVGS    GYCRKSCKAC
Subjt:  WIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase7.1e-11059.31Show/hide
Query:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSM-VADNDSGKSV
        MDS+ FLAFSL  L +F+                     +  S     DPTR+TQLSW PRAFLYKGFLSD ECDHLI LAK KLEKSM VAD DSG+S 
Subjt:  MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSM-VADNDSGKSV

Query:  SSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSG
         SEVRTSSGMFL K QD++VA VEA++AAWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN   W G  
Subjt:  SSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSG

Query:  SATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKI
                                      K  Q KD+SWS C+++GYAVK +KGDALLFF+L+L+ TTD  SLHGSCPVI GEKWSAT+WIHVRSF K 
Subjt:  SATSVPLFFLKKQTYILILLPTVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKI

Query:  TSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC
             +  CVD++E+C  WA  GEC+KNP YMVGS  +LG+CRKSCKAC
Subjt:  TSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-9153.14Show/hide
Query:  QSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWT
        QSS S++     SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADNDSG+S  SEVRTSSG F+ K +D +V+G+E +I+ WT
Subjt:  QSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWT

Query:  LLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFK
         LP ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E+                         P+   LS    
Subjt:  LLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLPTVMTLSLQFK

Query:  ESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTY
              E  SDC+++G AVK +KGDALLFF+L+ DA  D  SLHG CPVI GEKWSATKWIHV SF++I +      C D NE+C  WA  GEC KNP Y
Subjt:  ESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTY

Query:  MVGSGGALGYCRKSCKAC
        MVG+    GYCR+SCKAC
Subjt:  MVGSGGALGYCRKSCKAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGACCATTCCTCGCATTTTCTCTCTGCTTTCTCTCCGTCTTCACCGCCTTCGCTCGCTTGCCGGAAACGCGTACCCACAAGCAATCAAGTGGATCTGTGCT
TCGATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATGCGGAAT
GTGATCACCTAATTGATCTGGCTAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGATTCTGGTAAAAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTC
CTTCGGAAGGCCCAGGATGAAGTTGTTGCTGGCGTTGAAGCCAGGATAGCTGCATGGACACTCCTTCCAGCAGAAAATGGCGAATCCATTCAAATTCTTCACTATGAGAA
TGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGCCACCGCATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGG
GTGGAGAAACCATCTTTCCTAATTCAGAGGTATGGTATGGCAGTGGTTCTGCTACTTCAGTACCTTTGTTTTTTTTAAAAAAACAGACGTATATTTTAATTTTGTTGCCA
ACTGTGATGACTTTGTCTCTGCAGTTTAAAGAATCTCAAGCAAAAGATGAGAGCTGGTCTGATTGTTCTCGAAAGGGTTATGCAGTTAAAGCGCAGAAGGGCGATGCATT
GTTGTTCTTCAGCCTAAATCTCGACGCAACAACAGATGAAAGAAGTTTGCACGGTAGTTGCCCTGTAATTGCAGGCGAGAAATGGTCTGCAACCAAATGGATTCATGTGA
GATCCTTTGAGAAGATAACTTCTCGTGTTAGTAGACAGGGTTGCGTGGACGAGAACGAAAATTGCCTGGCATGGGCAAAAAAGGGAGAGTGCAAAAAGAACCCTACTTAC
ATGGTGGGTTCTGGAGGTGCTTTAGGATACTGTAGGAAGAGCTGCAAAGCATGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTCGACCATTCCTCGCATTTTCTCTCTGCTTTCTCTCCGTCTTCACCGCCTTCGCTCGCTTGCCGGAAACGCGTACCCACAAGCAATCAAGTGGATCTGTGCT
TCGATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATGCGGAAT
GTGATCACCTAATTGATCTGGCTAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGATTCTGGTAAAAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTC
CTTCGGAAGGCCCAGGATGAAGTTGTTGCTGGCGTTGAAGCCAGGATAGCTGCATGGACACTCCTTCCAGCAGAAAATGGCGAATCCATTCAAATTCTTCACTATGAGAA
TGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGCCACCGCATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGG
GTGGAGAAACCATCTTTCCTAATTCAGAGGTATGGTATGGCAGTGGTTCTGCTACTTCAGTACCTTTGTTTTTTTTAAAAAAACAGACGTATATTTTAATTTTGTTGCCA
ACTGTGATGACTTTGTCTCTGCAGTTTAAAGAATCTCAAGCAAAAGATGAGAGCTGGTCTGATTGTTCTCGAAAGGGTTATGCAGTTAAAGCGCAGAAGGGCGATGCATT
GTTGTTCTTCAGCCTAAATCTCGACGCAACAACAGATGAAAGAAGTTTGCACGGTAGTTGCCCTGTAATTGCAGGCGAGAAATGGTCTGCAACCAAATGGATTCATGTGA
GATCCTTTGAGAAGATAACTTCTCGTGTTAGTAGACAGGGTTGCGTGGACGAGAACGAAAATTGCCTGGCATGGGCAAAAAAGGGAGAGTGCAAAAAGAACCCTACTTAC
ATGGTGGGTTCTGGAGGTGCTTTAGGATACTGTAGGAAGAGCTGCAAAGCATGCTAA
Protein sequenceShow/hide protein sequence
MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMF
LRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSGSATSVPLFFLKKQTYILILLP
TVMTLSLQFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTY
MVGSGGALGYCRKSCKAC