; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg04403 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg04403
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationCarg_Chr04:20582420..20585615
RNA-Seq ExpressionCarg04403
SyntenyCarg04403
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602475.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia]3.7e-17995.05Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSRWFLTFSLCVLFVFTAFARLPQSL+HNKL            GGSALRL+GSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHL+NLSKGRLERS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADN+SGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
        FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK

Query:  NNPRYMLGSETASGYCRKSCQAC
        NNPRYMLGSETASGYCRKSCQAC
Subjt:  NNPRYMLGSETASGYCRKSCQAC

KAG7033150.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. argyrosperma]9.3e-191100Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
        FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK

Query:  NNPRYMLGSETASGYCRKSCQAC
        NNPRYMLGSETASGYCRKSCQAC
Subjt:  NNPRYMLGSETASGYCRKSCQAC

XP_022961014.1 probable prolyl 4-hydroxylase 6 [Cucurbita moschata]1.9e-18095.98Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKL            GGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
        FPHSE KESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK

Query:  NNPRYMLGSETASGYCRKSCQAC
        NNPRYMLGSETASGYCRKSCQAC
Subjt:  NNPRYMLGSETASGYCRKSCQAC

XP_022990688.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima]4.5e-17793.5Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSRWFLTFSLC LFVFTAFARLPQSLTHN+L            GGSALRL+G+SSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADNKSGKRISS+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
        FPHSE KESQEKDDSWSDCARMGYAVKP+KGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFN+NCPLWAENGECK
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK

Query:  NNPRYMLGSETASGYCRKSCQAC
        NNPRYMLGSET+SGYCRKSCQAC
Subjt:  NNPRYMLGSETASGYCRKSCQAC

XP_023523636.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo]1.5e-17793.81Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSRWFLTFSLC LFVFTA ARLPQSLTHNKL            GGSALRL+GSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLS+GRLERS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADNKSGKR+SS+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLP+ENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
        FPHSE KESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK

Query:  NNPRYMLGSETASGYCRKSCQAC
        NNPRYMLGSETASGYCRKSCQAC
Subjt:  NNPRYMLGSETASGYCRKSCQAC

TrEMBL top hitse value%identityAlignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase1.0e-13472.31Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSR FL FSLC L VFTAFARLP++ TH +              GS LRL+  SSPLIFDPTRVTQLSWQPRA LYKGFLSD ECDHLI+L+K +LE+S
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADN SGK +SSEVRTSSG F+ K QDE++A +EARIAAWT LP ENGE IQILHYENG+KYEPHFDFF D+VN+ELGGHR+ATVLMYLSNVEKGGET+
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFD--NPILPNKGCMDFNENCPLWAENGE
        FP+SE KESQ KD+SWSDC+R GYAVK QKGDALLFFSLN+D TT+ RS+HGSCPVI GEKWSATKWIHVRSF+     +  +GC+D NENC  WA+ GE
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFD--NPILPNKGCMDFNENCPLWAENGE

Query:  CKNNPRYMLGSETASGYCRKSCQAC
        CK NP YM+GS  A GYCRKSC+AC
Subjt:  CKNNPRYMLGSETASGYCRKSCQAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase1.1e-13673.46Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSR FL FSLC L VFTAFARLP++      Y            GS LRL+  SSPLIFDPTRVTQLSWQPRA LYKGFLSD+ECDHLI+L+K +LE+S
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADN+SGK +SSEVRTSSG F+ K QD+I+A +EARIAAWT LP ENGE IQILHYENG+KYEPHFDFF D+VN+ELGGHR+ATVLMYLSNVEKGGET+
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDN-PILPNKGCMDFNENCPLWAENGEC
        FP+SE KESQEKDDSWSDC+R GYAVK QKGDALLFFSL++D TT+ RS+HGSCPVIEGEKWSATKWIHVRSF+  P +  + C+D NENCP WA+ GEC
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDN-PILPNKGCMDFNENCPLWAENGEC

Query:  KNNPRYMLGSETASGYCRKSCQAC
        K NP YM+GSE A GYCRKSC+AC
Subjt:  KNNPRYMLGSETASGYCRKSCQAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase1.1e-13974.46Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDS  FL+FSLC LFVFTA ARLP    H K+             GS LRL+G  SPLIFDPTRVTQLSWQPRA LYKGFLSDKECDHLI+L+K +LE+S
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADN SGK +SSEVRTSSG F+ K QDEI+AA+EARIAAWTFLP ENGE IQILHYENG+KYEPHFD+F D+VN+ELGGHRVATVLMYLSNVEKGGET+
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNK--GCMDFNENCPLWAENGE
        FP+SE KESQEKDDSWSDCAR GYAVK +KGDALLFFSL++D TT+ +S+HGSCPVIEGEKWSATKWIHVRSF+ P  P++   C+D NENC  WA+ GE
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNK--GCMDFNENCPLWAENGE

Query:  CKNNPRYMLGSETASGYCRKSCQAC
        CK NP YM+GSE+A GYCRKSCQAC
Subjt:  CKNNPRYMLGSETASGYCRKSCQAC

A0A6J1HCS1 Procollagen-proline 4-dioxygenase9.4e-18195.98Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKL            GGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
        FPHSE KESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK

Query:  NNPRYMLGSETASGYCRKSCQAC
        NNPRYMLGSETASGYCRKSCQAC
Subjt:  NNPRYMLGSETASGYCRKSCQAC

A0A6J1JU08 Procollagen-proline 4-dioxygenase2.2e-17793.5Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDSRWFLTFSLC LFVFTAFARLPQSLTHN+L            GGSALRL+G+SSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
        MVADNKSGKRISS+VRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV
Subjt:  MVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETV

Query:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK
        FPHSE KESQEKDDSWSDCARMGYAVKP+KGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFN+NCPLWAENGECK
Subjt:  FPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECK

Query:  NNPRYMLGSETASGYCRKSCQAC
        NNPRYMLGSET+SGYCRKSCQAC
Subjt:  NNPRYMLGSETASGYCRKSCQAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 62.4e-10960.19Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDS++FL FSL +L +F+                                   SS     DPTR+TQLSW PRA LYKGFLSD+ECDHLI L+KG+LE+S
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  M-VADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGET
        M VAD  SG+   SEVRTSSG F+ K QD+I+A +EA++AAWTFLP ENGE +QILHYENG+KY+PHFD+F D+   ELGGHR+ATVLMYLSNV KGGET
Subjt:  M-VADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGET

Query:  VFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGEC
        VFP+ + K  Q KDDSWS CA+ GYAVKP+KGDALLFF+L+++GTT+P S+HGSCPVIEGEKWSAT+WIHVRSF    L    C+D +E+C  WA+ GEC
Subjt:  VFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGEC

Query:  KNNPRYMLGSETASGYCRKSCQAC
        + NP YM+GSET+ G+CRKSC+AC
Subjt:  KNNPRYMLGSETASGYCRKSCQAC

F4JAU3 Prolyl 4-hydroxylase 21.0e-9155.33Show/hide
Query:  FPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIE
        F  +LL L   S   +   SS  I +P++V Q+S +PRA +Y+GFL+D ECDHLI+L+K  L+RS VADN +G+   S+VRTSSGTF+ KG+D I++ IE
Subjt:  FPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIE

Query:  ARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSE---LKESQEKDDSWSDCARMGYAVKPQKGD
         +++ WTFLP ENGE +Q+L YE+G+KY+ HFD+F D+VN   GGHR+ATVL+YLSNV KGGETVFP ++    +   E  D  SDCA+ G AVKP+KG+
Subjt:  ARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSE---LKESQEKDDSWSDCARMGYAVKPQKGD

Query:  ALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
        ALLFF+L  D   +P S+HG CPVIEGEKWSATKWIHV SFD  +  +  C D NE+C  WA  GEC  NP YM+G+    G CR+SC+AC
Subjt:  ALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC

Q8L970 Probable prolyl 4-hydroxylase 79.9e-11961.73Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLP-QSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLER
        MDSR FL FSLC LF     +  P + LT +             R GS ++++ S+S   FDPTRVTQLSW PR  LY+GFLSD+ECDH I L+KG+LE+
Subjt:  MDSRWFLTFSLCVLFVFTAFARLP-QSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLER

Query:  SMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGET
        SMVADN SG+ + SEVRTSSG F+ K QD+I++ +EA++AAWTFLP ENGE +QILHYENG+KYEPHFD+F D+ N ELGGHR+ATVLMYLSNVEKGGET
Subjt:  SMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGET

Query:  VFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGEC
        VFP  + K +Q KDDSW++CA+ GYAVKP+KGDALLFF+L+ + TT+  S+HGSCPV+EGEKWSAT+WIHV+SF+       GCMD N +C  WA+ GEC
Subjt:  VFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGEC

Query:  KNNPRYMLGSETASGYCRKSCQAC
        + NP YM+GS+   GYCRKSC+AC
Subjt:  KNNPRYMLGSETASGYCRKSCQAC

Q8LAN3 Probable prolyl 4-hydroxylase 44.5e-9558.61Show/hide
Query:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        SSS +  +P++V Q+S +PRA +Y+GFL++ ECDH+++L+K  L+RS VADN SG+   SEVRTSSGTF+ KG+D I++ IE +I+ WTFLP ENGE IQ
Subjt:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSELKESQ---EKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+AT+LMYLSNV KGGETVFP +E+   +   E  +  SDCA+ G AVKP+KGDALLFF+L+ D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSELKESQ---EKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  + P+  C D NE+C  WA  GEC  NP YM+G+    GYCR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC

Q9LN20 Probable prolyl 4-hydroxylase 31.4e-6454.59Show/hide
Query:  LSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHF
        LSW+PRA +Y  FLS +EC++LI+L+K  + +S V D+++GK   S VRTSSGTF+ +G+D+II  IE RIA +TF+P ++GE +Q+LHYE G+KYEPH+
Subjt:  LSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHF

Query:  DFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSELK-ESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATK
        D+FVDE N + GG R+AT+LMYLS+VE+GGETVFP + +   S    +  S+C + G +VKP+ GDALLF+S+  D T +P S+HG CPVI G KWS+TK
Subjt:  DFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSELK-ESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATK

Query:  WIHVRSF
        W+HV  +
Subjt:  WIHVRSF

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 27.3e-9355.33Show/hide
Query:  FPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIE
        F  +LL L   S   +   SS  I +P++V Q+S +PRA +Y+GFL+D ECDHLI+L+K  L+RS VADN +G+   S+VRTSSGTF+ KG+D I++ IE
Subjt:  FPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIE

Query:  ARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSE---LKESQEKDDSWSDCARMGYAVKPQKGD
         +++ WTFLP ENGE +Q+L YE+G+KY+ HFD+F D+VN   GGHR+ATVL+YLSNV KGGETVFP ++    +   E  D  SDCA+ G AVKP+KG+
Subjt:  ARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSE---LKESQEKDDSWSDCARMGYAVKPQKGD

Query:  ALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
        ALLFF+L  D   +P S+HG CPVIEGEKWSATKWIHV SFD  +  +  C D NE+C  WA  GEC  NP YM+G+    G CR+SC+AC
Subjt:  ALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase7.0e-12061.73Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLP-QSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLER
        MDSR FL FSLC LF     +  P + LT +             R GS ++++ S+S   FDPTRVTQLSW PR  LY+GFLSD+ECDH I L+KG+LE+
Subjt:  MDSRWFLTFSLCVLFVFTAFARLP-QSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLER

Query:  SMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGET
        SMVADN SG+ + SEVRTSSG F+ K QD+I++ +EA++AAWTFLP ENGE +QILHYENG+KYEPHFD+F D+ N ELGGHR+ATVLMYLSNVEKGGET
Subjt:  SMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGET

Query:  VFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGEC
        VFP  + K +Q KDDSW++CA+ GYAVKP+KGDALLFF+L+ + TT+  S+HGSCPV+EGEKWSAT+WIHV+SF+       GCMD N +C  WA+ GEC
Subjt:  VFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGEC

Query:  KNNPRYMLGSETASGYCRKSCQAC
        + NP YM+GS+   GYCRKSC+AC
Subjt:  KNNPRYMLGSETASGYCRKSCQAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase6.4e-11358.43Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLP-QSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLER
        MDSR FL FSLC LF     +  P + LT +             R GS ++++ S+S   FDPTRVTQLSW PR  LY+GFLSD+ECDH I L+KG+LE+
Subjt:  MDSRWFLTFSLCVLFVFTAFARLP-QSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLER

Query:  SMVADNKSGKRISSE----VRTSSGTFVLKGQ----DEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLS
        SMVADN SG+ + SE    V   S +F+        D+I++ +EA++AAWTFLP ENGE +QILHYENG+KYEPHFD+F D+ N ELGGHR+ATVLMYLS
Subjt:  SMVADNKSGKRISSE----VRTSSGTFVLKGQ----DEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLS

Query:  NVEKGGETVFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCP
        NVEKGGETVFP  + K +Q KDDSW++CA+ GYAVKP+KGDALLFF+L+ + TT+  S+HGSCPV+EGEKWSAT+WIHV+SF+       GCMD N +C 
Subjt:  NVEKGGETVFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCP

Query:  LWAENGECKNNPRYMLGSETASGYCRKSCQAC
         WA+ GEC+ NP YM+GS+   GYCRKSC+AC
Subjt:  LWAENGECKNNPRYMLGSETASGYCRKSCQAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.7e-11060.19Show/hide
Query:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS
        MDS++FL FSL +L +F+                                   SS     DPTR+TQLSW PRA LYKGFLSD+ECDHLI L+KG+LE+S
Subjt:  MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERS

Query:  M-VADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGET
        M VAD  SG+   SEVRTSSG F+ K QD+I+A +EA++AAWTFLP ENGE +QILHYENG+KY+PHFD+F D+   ELGGHR+ATVLMYLSNV KGGET
Subjt:  M-VADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGET

Query:  VFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGEC
        VFP+ + K  Q KDDSWS CA+ GYAVKP+KGDALLFF+L+++GTT+P S+HGSCPVIEGEKWSAT+WIHVRSF    L    C+D +E+C  WA+ GEC
Subjt:  VFPHSELKESQEKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGEC

Query:  KNNPRYMLGSETASGYCRKSCQAC
        + NP YM+GSET+ G+CRKSC+AC
Subjt:  KNNPRYMLGSETASGYCRKSCQAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.2e-9658.61Show/hide
Query:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ
        SSS +  +P++V Q+S +PRA +Y+GFL++ ECDH+++L+K  L+RS VADN SG+   SEVRTSSGTF+ KG+D I++ IE +I+ WTFLP ENGE IQ
Subjt:  SSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKRISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQ

Query:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSELKESQ---EKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM
        +L YE+G+KY+ HFD+F D+VN   GGHR+AT+LMYLSNV KGGETVFP +E+   +   E  +  SDCA+ G AVKP+KGDALLFF+L+ D   +P S+
Subjt:  ILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSELKESQ---EKDDSWSDCARMGYAVKPQKGDALLFFSLNVDGTTNPRSM

Query:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC
        HG CPVIEGEKWSATKWIHV SFD  + P+  C D NE+C  WA  GEC  NP YM+G+    GYCR+SC+AC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGTTGGTTCCTCACATTCTCCCTTTGCGTTCTGTTCGTCTTCACTGCCTTCGCTCGCTTGCCCCAATCGCTTACGCACAACAAACTGTATGATCCTTTTCC
TCTTCTTCTTCTTACTCTTCGAGGCGGATCTGCACTTCGGTTGGAGGGGAGTTCATCTCCGCTGATTTTTGATCCAACTCGAGTCACTCAGCTCTCCTGGCAACCTAGGG
CATTATTATATAAGGGATTTCTATCTGATAAGGAATGCGATCACCTAATCAATCTGTCAAAGGGAAGGTTAGAGAGGTCGATGGTAGCAGATAACAAGTCCGGTAAGAGA
ATAAGTAGTGAAGTCCGGACCAGCTCCGGCACGTTCGTGCTGAAGGGGCAGGATGAAATAATTGCTGCCATTGAAGCCAGAATTGCGGCATGGACATTCCTTCCAATAGA
AAATGGAGAGCCCATTCAAATTCTGCACTATGAGAATGGTGAGAAGTATGAACCGCATTTTGATTTTTTTGTGGACGAGGTGAATAAGGAGTTGGGTGGCCACCGAGTAG
CCACAGTTTTGATGTATTTATCCAATGTTGAGAAGGGTGGAGAGACCGTGTTTCCACATTCAGAGTTGAAAGAGTCTCAAGAAAAGGATGATAGCTGGTCTGATTGTGCT
CGAATGGGTTATGCAGTTAAACCGCAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCAATGTGGATGGAACCACAAATCCAAGAAGCATGCACGGTAGCTGCCCAGTGAT
TGAAGGAGAGAAATGGTCTGCAACCAAATGGATTCATGTCAGATCCTTTGATAACCCAATTCTCCCAAACAAGGGCTGCATGGACTTCAACGAAAATTGCCCTTTGTGGG
CCGAAAACGGTGAGTGCAAAAACAACCCCAGGTACATGCTGGGCTCTGAAACTGCTTCAGGATACTGTAGGAAGAGTTGCCAAGCCTGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCCGTTGGTTCCTCACATTCTCCCTTTGCGTTCTGTTCGTCTTCACTGCCTTCGCTCGCTTGCCCCAATCGCTTACGCACAACAAACTGTATGATCCTTTTCC
TCTTCTTCTTCTTACTCTTCGAGGCGGATCTGCACTTCGGTTGGAGGGGAGTTCATCTCCGCTGATTTTTGATCCAACTCGAGTCACTCAGCTCTCCTGGCAACCTAGGG
CATTATTATATAAGGGATTTCTATCTGATAAGGAATGCGATCACCTAATCAATCTGTCAAAGGGAAGGTTAGAGAGGTCGATGGTAGCAGATAACAAGTCCGGTAAGAGA
ATAAGTAGTGAAGTCCGGACCAGCTCCGGCACGTTCGTGCTGAAGGGGCAGGATGAAATAATTGCTGCCATTGAAGCCAGAATTGCGGCATGGACATTCCTTCCAATAGA
AAATGGAGAGCCCATTCAAATTCTGCACTATGAGAATGGTGAGAAGTATGAACCGCATTTTGATTTTTTTGTGGACGAGGTGAATAAGGAGTTGGGTGGCCACCGAGTAG
CCACAGTTTTGATGTATTTATCCAATGTTGAGAAGGGTGGAGAGACCGTGTTTCCACATTCAGAGTTGAAAGAGTCTCAAGAAAAGGATGATAGCTGGTCTGATTGTGCT
CGAATGGGTTATGCAGTTAAACCGCAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCAATGTGGATGGAACCACAAATCCAAGAAGCATGCACGGTAGCTGCCCAGTGAT
TGAAGGAGAGAAATGGTCTGCAACCAAATGGATTCATGTCAGATCCTTTGATAACCCAATTCTCCCAAACAAGGGCTGCATGGACTTCAACGAAAATTGCCCTTTGTGGG
CCGAAAACGGTGAGTGCAAAAACAACCCCAGGTACATGCTGGGCTCTGAAACTGCTTCAGGATACTGTAGGAAGAGTTGCCAAGCCTGCTAAACTACAAACTCCAACAAC
CAAGTCTCCATTCTGTCATGGTTATCATGTACATAACATTCGACACTAACTAGATATAGCTTTGCTTCTTCTAGTCGACCTTCTCGGTTCCGTTATACAAATAAGGTGGG
CGGTCAAATG
Protein sequenceShow/hide protein sequence
MDSRWFLTFSLCVLFVFTAFARLPQSLTHNKLYDPFPLLLLTLRGGSALRLEGSSSPLIFDPTRVTQLSWQPRALLYKGFLSDKECDHLINLSKGRLERSMVADNKSGKR
ISSEVRTSSGTFVLKGQDEIIAAIEARIAAWTFLPIENGEPIQILHYENGEKYEPHFDFFVDEVNKELGGHRVATVLMYLSNVEKGGETVFPHSELKESQEKDDSWSDCA
RMGYAVKPQKGDALLFFSLNVDGTTNPRSMHGSCPVIEGEKWSATKWIHVRSFDNPILPNKGCMDFNENCPLWAENGECKNNPRYMLGSETASGYCRKSCQAC