CuGenDBv2

CmaCh04G027450 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh04G027450
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Procollagen-proline 4-dioxygenase
Genome location: Cma_Chr04:18452684..18456438
RNA-Seq Expression: CmaCh04G027450
Synteny: CmaCh04G027450
Gene Ontology terms:
  GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
  GO:0005789 - endoplasmic reticulum membrane (cellular component)
  GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
  GO:0005506 - iron ion binding (molecular function)
  GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
  IPR003582 - ShKT domain
  IPR005123 - Oxoglutarate/iron-dependent dioxygenase
  IPR006620 - Prolyl 4-hydroxylase, alpha subunit
  IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
  IPR045054 - Prolyl 4-hydroxylase
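
The genome location in this record uses a `seqid:start..end` form. As a minimal sketch, the span of the gene can be derived from it as shown here. The helper name `parse_location` is hypothetical, and the calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("Cma_Chr04:18452684..18456438")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

The span covers the whole gene locus (exons, introns and untranslated regions), so it is expected to be longer than the CDS and mRNA sequences listed further down.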


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602475.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.3e-178 | % identity: 96.46
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLC LFVFTAFARLPQSL+HN+LGGSALRLKG+SSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHL+NLSKGRLERSMVADN+SGKRIS
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        S+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSE KESQEK
Subjt:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS
        DDSWSDCARMGYAVKP+KGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFN+NCPLWAENGECKNNPRYMLGSET+
Subjt:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

KAG7033150.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 9.6e-177 | % identity: 93.5
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNEL------------GGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSRWFLTFSLC LFVFTAFARLPQSLTHN+L            GGSALRL+G+SSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNEL------------GGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADNKSGKRISS+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
Subjt:  MVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSEFKESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECK
        FPHSE KESQEKDDSWSDCARMGYAVKP+KGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFN+NCPLWAENGECK
Subjt:  FPHSEFKESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECK

Query:  NNPRYMLGSETSSGYCRKSCQAC
        NNPRYMLGSET+SGYCRKSCQAC
Subjt:  NNPRYMLGSETSSGYCRKSCQAC

XP_022961014.1 probable prolyl 4-hydroxylase 6 [Cucurbita moschata] | e value: 4.2e-180 | % identity: 97.43
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLC LFVFTAFARLPQSLTHN+LGGSALRL+G+SSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        S+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS
        DDSWSDCARMGYAVKP+KGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFN+NCPLWAENGECKNNPRYMLGSET+
Subjt:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

XP_022990688.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima] | e value: 6.2e-184 | % identity: 100
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS
        DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS
Subjt:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

XP_023523636.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo] | e value: 4.2e-180 | % identity: 96.78
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLC+LFVFTA ARLPQSLTHN+LGGSALRLKG+SSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLS+GRLERSMVADNKSGKR+S
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLP+ENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS
        DDSWSDCARMGYAVKP+KGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFN+NCPLWAENGECKNNPRYMLGSET+
Subjt:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KS38 Procollagen-proline 4-dioxygenase | e value: 1.6e-137 | % identity: 74.76
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSR FL FSLCFL VFTAFARLP++ TH +  GS LRLK +SSPLIFDPTRVTQLSWQPRA LYKGFLSD ECDHLI+L+K +LE+SMVADN SGK +S
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        S+VRTSSG F+ K QDE++A +EARIAAWT LP ENGE IQILHYENG+KYEPHFDFF D+VN+ELGGHR+ATVLMYLSNVEKGGET+FP+SEFKESQ K
Subjt:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFD--NPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSE
        D+SWSDC+R GYAVK +KGDALLFFSLN+D TT+ RS+HGSCPVI GEKWSATKWIHVRSF+     +  +GC+D N+NC  WA+ GECK NP YM+GS 
Subjt:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFD--NPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSE

Query:  TSSGYCRKSCQAC
         + GYCRKSC+AC
Subjt:  TSSGYCRKSCQAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase | e value: 5.6e-138 | % identity: 74.68
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSL----THNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSG
        MDSR FL FSLCFL VFTAFARLP++     ++ +  GS LRLK +SSPLIFDPTRVTQLSWQPRA LYKGFLSD+ECDHLI+L+K +LE+SMVADN+SG
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSL----THNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSG

Query:  KRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKE
        K +SS+VRTSSG F+ K QD+I+A +EARIAAWT LP ENGE IQILHYENG+KYEPHFDFF D+VN+ELGGHR+ATVLMYLSNVEKGGET+FP+SEFKE
Subjt:  KRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKE

Query:  SQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDN-PILPNKGCMDFNQNCPLWAENGECKNNPRYML
        SQEKDDSWSDC+R GYAVK +KGDALLFFSL++D TT+ RS+HGSCPVIEGEKWSATKWIHVRSF+  P +  + C+D N+NCP WA+ GECK NP YM+
Subjt:  SQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDN-PILPNKGCMDFNQNCPLWAENGECKNNPRYML

Query:  GSETSSGYCRKSCQAC
        GSE + GYCRKSC+AC
Subjt:  GSETSSGYCRKSCQAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase | e value: 2.2e-142 | % identity: 77
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDS  FL+FSLCFLFVFTA ARLP    H ++ GS LRLKG  SPLIFDPTRVTQLSWQPRA LYKGFLSDKECDHLI+L+K +LE+SMVADN SGK +S
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        S+VRTSSG F+ K QDEI+AA+EARIAAWTFLP ENGE IQILHYENG+KYEPHFD+F D+VN+ELGGHRVATVLMYLSNVEKGGET+FP+SEFKESQEK
Subjt:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNK--GCMDFNQNCPLWAENGECKNNPRYMLGSE
        DDSWSDCAR GYAVK +KGDALLFFSL++D TT+ +S+HGSCPVIEGEKWSATKWIHVRSF+ P  P++   C+D N+NC  WA+ GECK NP YM+GSE
Subjt:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNK--GCMDFNQNCPLWAENGECKNNPRYMLGSE

Query:  TSSGYCRKSCQAC
        ++ GYCRKSCQAC
Subjt:  TSSGYCRKSCQAC

A0A6J1HCS1 Procollagen-proline 4-dioxygenase | e value: 2.0e-180 | % identity: 97.43
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLC LFVFTAFARLPQSLTHN+LGGSALRL+G+SSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        S+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS
        DDSWSDCARMGYAVKP+KGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFN+NCPLWAENGECKNNPRYMLGSET+
Subjt:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

A0A6J1JU08 Procollagen-proline 4-dioxygenase | e value: 3.0e-184 | % identity: 100
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS
        DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS
Subjt:  DDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETS

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e value: 8.1e-110 | % identity: 61.86
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSM-VADNKSGKRI
        MDS++FL FSL  L +F+  +    S+                     DPTR+TQLSW PRA LYKGFLSD+ECDHLI L+KG+LE+SM VAD  SG+  
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSM-VADNKSGKRI

Query:  SSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQE
         S+VRTSSG F+ K QD+I+A +EA++AAWTFLP ENGE +QILHYENG+KY+PHFD+F D+   ELGGHR+ATVLMYLSNV KGGETVFP+ + K  Q 
Subjt:  SSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQE

Query:  KDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSET
        KDDSWS CA+ GYAVKP KGDALLFF+L+++GTT+P S+HGSCPVIEGEKWSAT+WIHVRSF    L    C+D +++C  WA+ GEC+ NP YM+GSET
Subjt:  KDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSET

Query:  SSGYCRKSCQAC
        S G+CRKSC+AC
Subjt:  SSGYCRKSCQAC

F4JAU3 Prolyl 4-hydroxylase 2 | e value: 9.3e-90 | % identity: 57.14
Query:  NSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        +S   I +P++V Q+S +PRA +Y+GFL+D ECDHLI+L+K  L+RS VADN +G+   S VRTSSGTF+ KG+D I++ IE +++ WTFLP ENGE +Q
Subjt:  NSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHS-EF--KESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+ATVL+YLSNV KGGETVFP + EF  +   E  D  SDCA+ G AVKP+KG+ALLFF+L  D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHS-EF--KESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETSSGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  +  +  C D N++C  WA  GEC  NP YM+G+    G CR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETSSGYCRKSCQAC

Q8L970 Probable prolyl 4-hydroxylase 7 | e value: 6.6e-120 | % identity: 63.06
Query:  MDSRWFLTFSLCFLFVFTAFARLPQ---SLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK
        MDSR FL FSLCFLF     +  P    + + N   GS +++K ++S   FDPTRVTQLSW PR  LY+GFLSD+ECDH I L+KG+LE+SMVADN SG+
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQ---SLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK

Query:  RISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKES
         + S+VRTSSG F+ K QD+I++ +EA++AAWTFLP ENGE +QILHYENG+KYEPHFD+F D+ N ELGGHR+ATVLMYLSNVEKGGETVFP  + K +
Subjt:  RISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKES

Query:  QEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGS
        Q KDDSW++CA+ GYAVKP KGDALLFF+L+ + TT+  S+HGSCPV+EGEKWSAT+WIHV+SF+       GCMD N +C  WA+ GEC+ NP YM+GS
Subjt:  QEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGS

Query:  ETSSGYCRKSCQAC
        +   GYCRKSC+AC
Subjt:  ETSSGYCRKSCQAC

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e value: 2.4e-93 | % identity: 57.51
Query:  NSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        +SS +  +P++V Q+S +PRA +Y+GFL++ ECDH+++L+K  L+RS VADN SG+   S+VRTSSGTF+ KG+D I++ IE +I+ WTFLP ENGE IQ
Subjt:  NSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQ---EKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+AT+LMYLSNV KGGETVFP +E    +   E  +  SDCA+ G AVKP KGDALLFF+L+ D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQ---EKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETSSGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  + P+  C D N++C  WA  GEC  NP YM+G+    GYCR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETSSGYCRKSCQAC

Q9LN20 Probable prolyl 4-hydroxylase 3 | e value: 2.3e-64 | % identity: 54.59
Query:  LSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHF
        LSW+PRA +Y  FLS +EC++LI+L+K  + +S V D+++GK   S+VRTSSGTF+ +G+D+II  IE RIA +TF+P ++GE +Q+LHYE G+KYEPH+
Subjt:  LSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHF

Query:  DFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFK-ESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATK
        D+FVDE N + GG R+AT+LMYLS+VE+GGETVFP +     S    +  S+C + G +VKP  GDALLF+S+  D T +P S+HG CPVI G KWS+TK
Subjt:  DFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFK-ESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATK

Query:  WIHVRSF
        W+HV  +
Subjt:  WIHVRSF

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e value: 6.6e-91 | % identity: 57.14
Query:  NSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        +S   I +P++V Q+S +PRA +Y+GFL+D ECDHLI+L+K  L+RS VADN +G+   S VRTSSGTF+ KG+D I++ IE +++ WTFLP ENGE +Q
Subjt:  NSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHS-EF--KESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+ATVL+YLSNV KGGETVFP + EF  +   E  D  SDCA+ G AVKP+KG+ALLFF+L  D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHS-EF--KESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETSSGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  +  +  C D N++C  WA  GEC  NP YM+G+    G CR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETSSGYCRKSCQAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e value: 4.7e-121 | % identity: 63.06
Query:  MDSRWFLTFSLCFLFVFTAFARLPQ---SLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK
        MDSR FL FSLCFLF     +  P    + + N   GS +++K ++S   FDPTRVTQLSW PR  LY+GFLSD+ECDH I L+KG+LE+SMVADN SG+
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQ---SLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK

Query:  RISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKES
         + S+VRTSSG F+ K QD+I++ +EA++AAWTFLP ENGE +QILHYENG+KYEPHFD+F D+ N ELGGHR+ATVLMYLSNVEKGGETVFP  + K +
Subjt:  RISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKES

Query:  QEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGS
        Q KDDSW++CA+ GYAVKP KGDALLFF+L+ + TT+  S+HGSCPV+EGEKWSAT+WIHV+SF+       GCMD N +C  WA+ GEC+ NP YM+GS
Subjt:  QEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGS

Query:  ETSSGYCRKSCQAC
        +   GYCRKSC+AC
Subjt:  ETSSGYCRKSCQAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e value: 4.2e-114 | % identity: 59.63
Query:  MDSRWFLTFSLCFLFVFTAFARLPQ---SLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK
        MDSR FL FSLCFLF     +  P    + + N   GS +++K ++S   FDPTRVTQLSW PR  LY+GFLSD+ECDH I L+KG+LE+SMVADN SG+
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQ---SLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK

Query:  RISSK----VRTSSGTFVLKGQ----DEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVF
         + S+    V   S +F+        D+I++ +EA++AAWTFLP ENGE +QILHYENG+KYEPHFD+F D+ N ELGGHR+ATVLMYLSNVEKGGETVF
Subjt:  RISSK----VRTSSGTFVLKGQ----DEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVF

Query:  PHSEFKESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKN
        P  + K +Q KDDSW++CA+ GYAVKP KGDALLFF+L+ + TT+  S+HGSCPV+EGEKWSAT+WIHV+SF+       GCMD N +C  WA+ GEC+ 
Subjt:  PHSEFKESQEKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKN

Query:  NPRYMLGSETSSGYCRKSCQAC
        NP YM+GS+   GYCRKSC+AC
Subjt:  NPRYMLGSETSSGYCRKSCQAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e value: 5.7e-111 | % identity: 61.86
Query:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSM-VADNKSGKRI
        MDS++FL FSL  L +F+  +    S+                     DPTR+TQLSW PRA LYKGFLSD+ECDHLI L+KG+LE+SM VAD  SG+  
Subjt:  MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSM-VADNKSGKRI

Query:  SSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQE
         S+VRTSSG F+ K QD+I+A +EA++AAWTFLP ENGE +QILHYENG+KY+PHFD+F D+   ELGGHR+ATVLMYLSNV KGGETVFP+ + K  Q 
Subjt:  SSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQE

Query:  KDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSET
        KDDSWS CA+ GYAVKP KGDALLFF+L+++GTT+P S+HGSCPVIEGEKWSAT+WIHVRSF    L    C+D +++C  WA+ GEC+ NP YM+GSET
Subjt:  KDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSET

Query:  SSGYCRKSCQAC
        S G+CRKSC+AC
Subjt:  SSGYCRKSCQAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 1.7e-94 | % identity: 57.51
Query:  NSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        +SS +  +P++V Q+S +PRA +Y+GFL++ ECDH+++L+K  L+RS VADN SG+   S+VRTSSGTF+ KG+D I++ IE +I+ WTFLP ENGE IQ
Subjt:  NSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQ---EKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+AT+LMYLSNV KGGETVFP +E    +   E  +  SDCA+ G AVKP KGDALLFF+L+ D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQ---EKDDSWSDCARMGYAVKPEKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETSSGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  + P+  C D N++C  WA  GEC  NP YM+G+    GYCR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETSSGYCRKSCQAC


Sequences
CDS sequence
ATGGATTCCCGTTGGTTCCTCACATTTTCCCTTTGCTTTCTGTTCGTCTTCACTGCCTTCGCTCGCTTGCCCCAATCGCTTACGCACAACGAACTAGGCGGATCT
GCACTTCGGTTGAAGGGGAATTCATCTCCGCTGATTTTCGATCCAACTCGAGTCACTCAGCTCTCCTGGCAACCTAGGGCATTGTTATACAAGGGATTTCTATCT
GATAAGGAATGCGATCACCTAATCAATCTGTCAAAGGGAAGGTTAGAGAGGTCGATGGTAGCAGATAATAAGTCCGGTAAGAGAATTAGTAGTAAAGTCCGGACC
AGCTCCGGCACGTTCGTGCTGAAGGGACAGGATGAAATAATTGCTGCCATTGAAGCCAGAATTGCGGCATGGACATTCCTTCCAATAGAAAATGGAGAGCCCATT
CAAATTCTGCACTATGAGAATGGTGAGAAGTATGAACCGCATTTTGATTTTTTTGTGGACGAGGTGAATAAGGAGTTGGGCGGCCACCGAGTAGCCACAGTTTTG
ATGTATTTATCCAATGTTGAGAAGGGTGGAGAGACCGTCTTTCCACATTCAGAGTTTAAAGAGTCTCAAGAAAAGGATGATAGCTGGTCTGATTGTGCTCGAATG
GGTTATGCAGTTAAACCGGAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCAATGTGGATGGAACCACAAATCCGAGAAGCATGCACGGTAGCTGCCCTGTGATT
GAGGGTGAGAAATGGAGTGCAACCAAATGGATTCACGTCAGATCCTTCGATAACCCAATTCTCCCAAACAAGGGCTGCATGGACTTCAACCAAAATTGCCCTTTG
TGGGCCGAAAACGGTGAGTGCAAAAACAACCCCAGGTACATGCTGGGCTCTGAAACTTCTTCAGGATACTGTAGGAAGAGTTGCCAAGCCTGCTAA
mRNA sequence
CAAAATCGTTCCTATATTTGCAGTTTGATTGGTACAATTTCCAAGTTTGAAACGGTTCTGAAAAACAGCCCACCGATTCTCATCCTCATTCATGGTCTTCACGGT
TGAAATTGCGTAATAAATTTCAAATCCGTTTCTCCGACACCAACCAAAGAAGTGTTCAGTGCTTGAATTTGGGTGATGGATTCCCGTTGGTTCCTCACATTTTCC
CTTTGCTTTCTGTTCGTCTTCACTGCCTTCGCTCGCTTGCCCCAATCGCTTACGCACAACGAACTAGGCGGATCTGCACTTCGGTTGAAGGGGAATTCATCTCCG
CTGATTTTCGATCCAACTCGAGTCACTCAGCTCTCCTGGCAACCTAGGGCATTGTTATACAAGGGATTTCTATCTGATAAGGAATGCGATCACCTAATCAATCTG
TCAAAGGGAAGGTTAGAGAGGTCGATGGTAGCAGATAATAAGTCCGGTAAGAGAATTAGTAGTAAAGTCCGGACCAGCTCCGGCACGTTCGTGCTGAAGGGACAG
GATGAAATAATTGCTGCCATTGAAGCCAGAATTGCGGCATGGACATTCCTTCCAATAGAAAATGGAGAGCCCATTCAAATTCTGCACTATGAGAATGGTGAGAAG
TATGAACCGCATTTTGATTTTTTTGTGGACGAGGTGAATAAGGAGTTGGGCGGCCACCGAGTAGCCACAGTTTTGATGTATTTATCCAATGTTGAGAAGGGTGGA
GAGACCGTCTTTCCACATTCAGAGTTTAAAGAGTCTCAAGAAAAGGATGATAGCTGGTCTGATTGTGCTCGAATGGGTTATGCAGTTAAACCGGAGAAGGGTGAT
GCATTGCTGTTCTTCAGCCTCAATGTGGATGGAACCACAAATCCGAGAAGCATGCACGGTAGCTGCCCTGTGATTGAGGGTGAGAAATGGAGTGCAACCAAATGG
ATTCACGTCAGATCCTTCGATAACCCAATTCTCCCAAACAAGGGCTGCATGGACTTCAACCAAAATTGCCCTTTGTGGGCCGAAAACGGTGAGTGCAAAAACAAC
CCCAGGTACATGCTGGGCTCTGAAACTTCTTCAGGATACTGTAGGAAGAGTTGCCAAGCCTGCTAAACTACAAACTACAACAACCCAGTCTCCTTTCTGTGATGG
TTATCATGTATATAACATTGGGCACTAACTAGATATACCTTTGCTTCTTCTGGTCACCATTCTCTTGTCCGAGAGGGTTCCATTATACAAATAAAGTGGGTGGTC
Protein sequence
MDSRWFLTFSLCFLFVFTAFARLPQSLTHNELGGSALRLKGNSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSKVRT
SSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEKDDSWSDCARM
GYAVKPEKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNQNCPLWAENGECKNNPRYMLGSETSSGYCRKSCQAC
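
The CDS and protein sequences above should correspond to each other by translation. The following is a minimal sketch of that check, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and not part of CuGenDB. Only the first 36 and last 12 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 bases and last 12 bases of the CDS listed above.
cds_start = "ATGGATTCCCGTTGGTTCCTCACATTTTCCCTTTGC"
cds_end = "CAAGCCTGCTAA"
print(translate(cds_start))  # expected to match the protein start, MDSRWFLTFSLC
print(translate(cds_end))    # expected to match the protein end, QAC
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the joined protein lines would extend the same check to the whole sequence.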