CuGenDBv2

CmoCh04G028750 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G028750
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: Cmo_Chr04:20446789..20451284
RNA-Seq Expression: CmoCh04G028750
Synteny: CmoCh04G028750
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology
GenBank top hits | e value | % identity | Alignment
KAG6602475.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.9e-181 | % identity: 98.39
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLCVLFVFTAFARLPQSL+HNKLGGSALRL+GSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHL+NLSKGRLERSMVADN+SGKRIS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSE KESQEK
Subjt:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
        DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
Subjt:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

KAG7033150.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.4e-180 | % identity: 95.98
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKL------------GGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKL            GGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKL------------GGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSEFKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
        FPHSE KESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
Subjt:  FPHSEFKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK

Query:  NNPRYMLGSETASGYCRKSCQAC
        NNPRYMLGSETASGYCRKSCQAC
Subjt:  NNPRYMLGSETASGYCRKSCQAC

XP_022961014.1 probable prolyl 4-hydroxylase 6 [Cucurbita moschata] | e value: 1.1e-183 | % identity: 100
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
        DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
Subjt:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

XP_022990688.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima] | e value: 2.4e-180 | % identity: 97.43
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLC LFVFTAFARLPQSLTHN+LGGSALRL+G+SSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        S+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
        DDSWSDCARMGYAVKP+KGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFN+NCPLWAENGECKNNPRYMLGSET+
Subjt:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

XP_023523636.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo] | e value: 8.4e-181 | % identity: 97.75
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLC LFVFTA ARLPQSLTHNKLGGSALRL+GSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLS+GRLERSMVADNKSGKR+S
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        S+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLP+ENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
        DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
Subjt:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase | e value: 9.5e-138 | % identity: 75.4
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSR FL FSLC L VFTAFARLP++ TH +  GS LRL+  SSPLIFDPTRVTQLSWQPRA LYKGFLSD ECDHLI+L+K +LE+SMVADN SGK +S
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        SEVRTSSG F+ K QDE++A +EARIAAWT LP ENGE IQILHYENG+KYEPHFDFF D+VN+ELGGHR+ATVLMYLSNVEKGGET+FP+SEFKESQ K
Subjt:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFD--NPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSE
        D+SWSDC+R GYAVK QKGDALLFFSLN+D TT+ RS+HGSCPVI GEKWSATKWIHVRSF+     +  +GC+D NENC  WA+ GECK NP YM+GS 
Subjt:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFD--NPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSE

Query:  TASGYCRKSCQAC
         A GYCRKSC+AC
Subjt:  TASGYCRKSCQAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase | e value: 3.3e-138 | % identity: 75.32
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSL----THNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSG
        MDSR FL FSLC L VFTAFARLP++     ++ +  GS LRL+  SSPLIFDPTRVTQLSWQPRA LYKGFLSD+ECDHLI+L+K +LE+SMVADN+SG
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSL----THNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSG

Query:  KRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKE
        K +SSEVRTSSG F+ K QD+I+A +EARIAAWT LP ENGE IQILHYENG+KYEPHFDFF D+VN+ELGGHR+ATVLMYLSNVEKGGET+FP+SEFKE
Subjt:  KRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKE

Query:  SQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDN-PILPNKGCMDFNENCPLWAENGECKNNPRYML
        SQEKDDSWSDC+R GYAVK QKGDALLFFSL++D TT+ RS+HGSCPVIEGEKWSATKWIHVRSF+  P +  + C+D NENCP WA+ GECK NP YM+
Subjt:  SQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDN-PILPNKGCMDFNENCPLWAENGECKNNPRYML

Query:  GSETASGYCRKSCQAC
        GSE A GYCRKSC+AC
Subjt:  GSETASGYCRKSCQAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase | e value: 5.8e-143 | % identity: 77.64
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDS  FL+FSLC LFVFTA ARLP    H K+ GS LRL+G  SPLIFDPTRVTQLSWQPRA LYKGFLSDKECDHLI+L+K +LE+SMVADN SGK +S
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        SEVRTSSG F+ K QDEI+AA+EARIAAWTFLP ENGE IQILHYENG+KYEPHFD+F D+VN+ELGGHRVATVLMYLSNVEKGGET+FP+SEFKESQEK
Subjt:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNK--GCMDFNENCPLWAENGECKNNPRYMLGSE
        DDSWSDCAR GYAVK +KGDALLFFSL++D TT+ +S+HGSCPVIEGEKWSATKWIHVRSF+ P  P++   C+D NENC  WA+ GECK NP YM+GSE
Subjt:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNK--GCMDFNENCPLWAENGECKNNPRYMLGSE

Query:  TASGYCRKSCQAC
        +A GYCRKSCQAC
Subjt:  TASGYCRKSCQAC

A0A6J1HCS1 Procollagen-proline 4-dioxygenase | e value: 5.1e-184 | % identity: 100
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
        DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
Subjt:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

A0A6J1JU08 Procollagen-proline 4-dioxygenase | e value: 1.2e-180 | % identity: 97.43
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
        MDSRWFLTFSLC LFVFTAFARLPQSLTHN+LGGSALRL+G+SSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRIS

Query:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
        S+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK
Subjt:  SEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEK

Query:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA
        DDSWSDCARMGYAVKP+KGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFN+NCPLWAENGECKNNPRYMLGSET+
Subjt:  DDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETA

Query:  SGYCRKSCQAC
        SGYCRKSCQAC
Subjt:  SGYCRKSCQAC

SwissProt top hits | e value | % identity | Alignment
F4J0A8 Probable prolyl 4-hydroxylase 6 | e value: 7.3e-111 | % identity: 62.5
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSM-VADNKSGKRI
        MDS++FL FSL +L +F+                       SS     DPTR+TQLSW PRA LYKGFLSD+ECDHLI L+KG+LE+SM VAD  SG+  
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSM-VADNKSGKRI

Query:  SSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQE
         SEVRTSSG F+ K QD+I+A +EA++AAWTFLP ENGE +QILHYENG+KY+PHFD+F D+   ELGGHR+ATVLMYLSNV KGGETVFP+ + K  Q 
Subjt:  SSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQE

Query:  KDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSET
        KDDSWS CA+ GYAVKP+KGDALLFF+L+++GTT+P S+HGSCPVIEGEKWSAT+WIHVRSF    L    C+D +E+C  WA+ GEC+ NP YM+GSET
Subjt:  KDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSET

Query:  ASGYCRKSCQAC
        + G+CRKSC+AC
Subjt:  ASGYCRKSCQAC

F4JAU3 Prolyl 4-hydroxylase 2 | e value: 6.4e-91 | % identity: 57.88
Query:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        SS   I +P++V Q+S +PRA +Y+GFL+D ECDHLI+L+K  L+RS VADN +G+   S+VRTSSGTF+ KG+D I++ IE +++ WTFLP ENGE +Q
Subjt:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHS-EF--KESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+ATVL+YLSNV KGGETVFP + EF  +   E  D  SDCA+ G AVKP+KG+ALLFF+L  D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHS-EF--KESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  +  +  C D NE+C  WA  GEC  NP YM+G+    G CR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC

Q8L970 Probable prolyl 4-hydroxylase 7 | e value: 1.5e-119 | % identity: 63.06
Query:  MDSRWFLTFSLCVLFVFTAFARLPQ---SLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK
        MDSR FL FSLC LF     +  P    + + N   GS ++++ S+S   FDPTRVTQLSW PR  LY+GFLSD+ECDH I L+KG+LE+SMVADN SG+
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQ---SLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK

Query:  RISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKES
         + SEVRTSSG F+ K QD+I++ +EA++AAWTFLP ENGE +QILHYENG+KYEPHFD+F D+ N ELGGHR+ATVLMYLSNVEKGGETVFP  + K +
Subjt:  RISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKES

Query:  QEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGS
        Q KDDSW++CA+ GYAVKP+KGDALLFF+L+ + TT+  S+HGSCPV+EGEKWSAT+WIHV+SF+       GCMD N +C  WA+ GEC+ NP YM+GS
Subjt:  QEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGS

Query:  ETASGYCRKSCQAC
        +   GYCRKSC+AC
Subjt:  ETASGYCRKSCQAC

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e value: 9.6e-95 | % identity: 58.61
Query:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        SSS +  +P++V Q+S +PRA +Y+GFL++ ECDH+++L+K  L+RS VADN SG+   SEVRTSSGTF+ KG+D I++ IE +I+ WTFLP ENGE IQ
Subjt:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQ---EKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+AT+LMYLSNV KGGETVFP +E    +   E  +  SDCA+ G AVKP+KGDALLFF+L+ D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQ---EKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  + P+  C D NE+C  WA  GEC  NP YM+G+    GYCR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC

Q9LN20 Probable prolyl 4-hydroxylase 3 | e value: 2.3e-64 | % identity: 54.59
Query:  LSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHF
        LSW+PRA +Y  FLS +EC++LI+L+K  + +S V D+++GK   S VRTSSGTF+ +G+D+II  IE RIA +TF+P ++GE +Q+LHYE G+KYEPH+
Subjt:  LSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHF

Query:  DFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFK-ESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATK
        D+FVDE N + GG R+AT+LMYLS+VE+GGETVFP +     S    +  S+C + G +VKP+ GDALLF+S+  D T +P S+HG CPVI G KWS+TK
Subjt:  DFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFK-ESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATK

Query:  WIHVRSF
        W+HV  +
Subjt:  WIHVRSF

Arabidopsis top hits | e value | % identity | Alignment
AT3G06300.1 P4H isoform 2 | e value: 4.6e-92 | % identity: 57.88
Query:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        SS   I +P++V Q+S +PRA +Y+GFL+D ECDHLI+L+K  L+RS VADN +G+   S+VRTSSGTF+ KG+D I++ IE +++ WTFLP ENGE +Q
Subjt:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHS-EF--KESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+ATVL+YLSNV KGGETVFP + EF  +   E  D  SDCA+ G AVKP+KG+ALLFF+L  D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHS-EF--KESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  +  +  C D NE+C  WA  GEC  NP YM+G+    G CR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e value: 1.0e-120 | % identity: 63.06
Query:  MDSRWFLTFSLCVLFVFTAFARLPQ---SLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK
        MDSR FL FSLC LF     +  P    + + N   GS ++++ S+S   FDPTRVTQLSW PR  LY+GFLSD+ECDH I L+KG+LE+SMVADN SG+
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQ---SLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK

Query:  RISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKES
         + SEVRTSSG F+ K QD+I++ +EA++AAWTFLP ENGE +QILHYENG+KYEPHFD+F D+ N ELGGHR+ATVLMYLSNVEKGGETVFP  + K +
Subjt:  RISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKES

Query:  QEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGS
        Q KDDSW++CA+ GYAVKP+KGDALLFF+L+ + TT+  S+HGSCPV+EGEKWSAT+WIHV+SF+       GCMD N +C  WA+ GEC+ NP YM+GS
Subjt:  QEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGS

Query:  ETASGYCRKSCQAC
        +   GYCRKSC+AC
Subjt:  ETASGYCRKSCQAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e value: 9.5e-114 | % identity: 59.63
Query:  MDSRWFLTFSLCVLFVFTAFARLPQ---SLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK
        MDSR FL FSLC LF     +  P    + + N   GS ++++ S+S   FDPTRVTQLSW PR  LY+GFLSD+ECDH I L+KG+LE+SMVADN SG+
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQ---SLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGK

Query:  RISSE----VRTSSGTFVLKGQ----DEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVF
         + SE    V   S +F+        D+I++ +EA++AAWTFLP ENGE +QILHYENG+KYEPHFD+F D+ N ELGGHR+ATVLMYLSNVEKGGETVF
Subjt:  RISSE----VRTSSGTFVLKGQ----DEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVF

Query:  PHSEFKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKN
        P  + K +Q KDDSW++CA+ GYAVKP+KGDALLFF+L+ + TT+  S+HGSCPV+EGEKWSAT+WIHV+SF+       GCMD N +C  WA+ GEC+ 
Subjt:  PHSEFKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKN

Query:  NPRYMLGSETASGYCRKSCQAC
        NP YM+GS+   GYCRKSC+AC
Subjt:  NPRYMLGSETASGYCRKSCQAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e value: 5.2e-112 | % identity: 62.5
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSM-VADNKSGKRI
        MDS++FL FSL +L +F+                       SS     DPTR+TQLSW PRA LYKGFLSD+ECDHLI L+KG+LE+SM VAD  SG+  
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSM-VADNKSGKRI

Query:  SSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQE
         SEVRTSSG F+ K QD+I+A +EA++AAWTFLP ENGE +QILHYENG+KY+PHFD+F D+   ELGGHR+ATVLMYLSNV KGGETVFP+ + K  Q 
Subjt:  SSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQE

Query:  KDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSET
        KDDSWS CA+ GYAVKP+KGDALLFF+L+++GTT+P S+HGSCPVIEGEKWSAT+WIHVRSF    L    C+D +E+C  WA+ GEC+ NP YM+GSET
Subjt:  KDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSET

Query:  ASGYCRKSCQAC
        + G+CRKSC+AC
Subjt:  ASGYCRKSCQAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 6.8e-96 | % identity: 58.61
Query:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        SSS +  +P++V Q+S +PRA +Y+GFL++ ECDH+++L+K  L+RS VADN SG+   SEVRTSSGTF+ KG+D I++ IE +I+ WTFLP ENGE IQ
Subjt:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQ---EKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+AT+LMYLSNV KGGETVFP +E    +   E  +  SDCA+ G AVKP+KGDALLFF+L+ D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQ---EKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  + P+  C D NE+C  WA  GEC  NP YM+G+    GYCR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
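The % identity values listed with the hits above can be approximated from the alignments themselves. In the middle row of each alignment block, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below counts letters in the middle rows and divides by the number of alignment columns, taken from the Query rows. The function name `percent_identity` and the short example rows are illustrative assumptions, not part of CuGenDB. Counting by hand, the first GenBank hit appears to have 306 letters across 311 columns, which would agree with its listed 98.39.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity of one alignment given as wrapped rows.

    query_rows   -- the 'Query:' sequence rows (gaps included as '-')
    midline_rows -- the matching middle rows (letter = identical residue)
    """
    # Alignment length comes from the query rows, because trailing
    # spaces in the middle rows are easily lost when text is copied.
    columns = sum(len(row) for row in query_rows)
    if columns == 0:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / columns


# Short made-up rows in the same layout as the alignments above.
query = ["FPHSEFKESQEK", "NKSG"]
midline = ["FPHSE KESQEK", "N+SG"]
print(round(percent_identity(query, midline), 2))  # 87.5
```

Only the Query rows and the middle rows are needed; the Subjt rows are not used by this calculation.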


Sequences
CDS sequence
ATGGATTCCCGTTGGTTCCTCACATTCTCCCTTTGCGTTCTGTTCGTCTTCACTGCCTTCGCTCGCTTGCCCCAATCGCTTACGCACAACAAACTAGGCGGATCTGCACT
TCGGTTGGAGGGGAGTTCATCTCCGCTGATTTTCGATCCAACTCGAGTCACTCAGCTCTCCTGGCAACCTAGGGCATTATTATATAAGGGATTTCTATCTGATAAGGAAT
GCGATCACCTAATCAATCTGTCAAAGGGAAGGTTAGAGAGGTCGATGGTAGCAGATAACAAGTCCGGTAAGAGAATAAGTAGTGAAGTCCGGACCAGCTCCGGCACGTTC
GTGCTGAAGGGGCAGGATGAAATAATTGCTGCCATTGAAGCCAGAATTGCGGCATGGACATTCCTTCCAATAGAAAATGGAGAGCCCATTCAAATTCTGCACTATGAGAA
TGGTGAGAAGTATGAACCGCATTTTGATTTTTTTGTGGACGAGGTGAATAAGGAGTTGGGTGGCCACCGAGTAGCCACAGTTTTGATGTATTTATCCAATGTTGAGAAGG
GTGGAGAGACCGTGTTTCCACATTCAGAGTTTAAAGAGTCTCAAGAAAAGGATGATAGCTGGTCTGATTGTGCTCGAATGGGTTATGCAGTTAAACCGCAGAAGGGTGAT
GCATTGCTGTTCTTCAGCCTCAATGTGGATGGAACCACAAATCCAAGAAGCATGCACGGTAGCTGCCCAGTGATTGAGGGAGAGAAATGGTCTGCAACCAAATGGATTCA
TGTCAGATCCTTTGATAACCCAATTCTCCCAAACAAGGGCTGCATGGACTTCAACGAAAATTGCCCTTTGTGGGCCGAAAACGGTGAGTGCAAAAACAACCCCAGGTACA
TGCTGGGCTCTGAAACTGCTTCAGGGTACTGTAGGAAGAGTTGCCAAGCCTGCTAA
mRNA sequence
TCTGCAATATTCAAAATCGTTCCTATATTTGCTGTGTCATTGGTACAATTTCCAAGTTTGAAACGGTTCTGAAGAACAGCCCACCGATTCCCATCCTCATTCATGGTCTT
CACGGTTGAAATTGCGTAATAAATTTCAAATCCGTTTCTCCGGCACCAACCAAAGAAGTGTTCGGTGATGGATTCCCGTTGGTTCCTCACATTCTCCCTTTGCGTTCTGT
TCGTCTTCACTGCCTTCGCTCGCTTGCCCCAATCGCTTACGCACAACAAACTAGGCGGATCTGCACTTCGGTTGGAGGGGAGTTCATCTCCGCTGATTTTCGATCCAACT
CGAGTCACTCAGCTCTCCTGGCAACCTAGGGCATTATTATATAAGGGATTTCTATCTGATAAGGAATGCGATCACCTAATCAATCTGTCAAAGGGAAGGTTAGAGAGGTC
GATGGTAGCAGATAACAAGTCCGGTAAGAGAATAAGTAGTGAAGTCCGGACCAGCTCCGGCACGTTCGTGCTGAAGGGGCAGGATGAAATAATTGCTGCCATTGAAGCCA
GAATTGCGGCATGGACATTCCTTCCAATAGAAAATGGAGAGCCCATTCAAATTCTGCACTATGAGAATGGTGAGAAGTATGAACCGCATTTTGATTTTTTTGTGGACGAG
GTGAATAAGGAGTTGGGTGGCCACCGAGTAGCCACAGTTTTGATGTATTTATCCAATGTTGAGAAGGGTGGAGAGACCGTGTTTCCACATTCAGAGTTTAAAGAGTCTCA
AGAAAAGGATGATAGCTGGTCTGATTGTGCTCGAATGGGTTATGCAGTTAAACCGCAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCAATGTGGATGGAACCACAAATC
CAAGAAGCATGCACGGTAGCTGCCCAGTGATTGAGGGAGAGAAATGGTCTGCAACCAAATGGATTCATGTCAGATCCTTTGATAACCCAATTCTCCCAAACAAGGGCTGC
ATGGACTTCAACGAAAATTGCCCTTTGTGGGCCGAAAACGGTGAGTGCAAAAACAACCCCAGGTACATGCTGGGCTCTGAAACTGCTTCAGGGTACTGTAGGAAGAGTTG
CCAAGCCTGCTAAACTACAAACTACAACAACCAAGTCTCCATTCTGTCATGGTTATCATGTATATAACATTCGACACTAACTAGATATAGCTTTGCTTCTTCTAGTCGAC
CTTCTCGGTTCCGTTATACAAATAAGGTGGGTGGTCAAATGGATATTATATCTATTTCTTTAAATTGTATCTTTTGCTTCACTCCAAC
Protein sequence
MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTF
VLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSEFKESQEKDDSWSDCARMGYAVKPQKGD
ALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
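
As a consistency check on the sequences above, the CDS can be translated and compared with the listed protein. The Python sketch below is a minimal illustration that assumes the standard genetic code applies to this gene. The function name `translate` and the compact codon-table construction are illustrative choices, not part of CuGenDB. Only the first and last few codons of the CDS are used here.

```python
# Standard genetic code, with codons ordered by base (T, C, A, G)
# at the first, second, and third position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGATTCCCGTTGGTTC"))  # first six codons of the CDS: MDSRWF
print(translate("TGCCAAGCCTGCTAA"))     # last five codons of the CDS: CQAC
```

Passing the full CDS, joined into one string without line breaks, should give a protein that can be compared directly with the protein sequence listed above.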