CuGenDBv2

MS003192 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS003192
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: scaffold234:1050356..1053814
RNA-Seq Expression: MS003192
Synteny: MS003192
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
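
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `Location` and `parse_location` are hypothetical helpers, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written on the gene page (hypothetical helper)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("scaffold234:1050356..1053814")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3459 bp on scaffold234.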


Homology
GenBank top hits (e value, % identity, alignment)
XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo] | e value: 5.0e-157 | % identity: 87.38
Query:  MDSPRFLSFSLCFLFVFTALARLPDMR----AHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSG
        MDS  FL+FSLCFL VFTA ARLP+ R    ++K+ +GSVLRLK + SPLIFDPTRVTQLSWQPRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADN SG
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMR----AHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSG

Query:  KSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFL KAQD+IVA VEARIAAWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYM
        SQEKDDSWSDC+RKGYAVKA+KGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK  R SR+ DCVDENENC +WAKRGECKKNPTYM
Subjt:  SQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYM

Query:  VGSESALGYCRKSCQAC
        VGSE ALGYCRKSC+AC
Subjt:  VGSESALGYCRKSCQAC

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus] | e value: 1.2e-155 | % identity: 86.58
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
        MDS  FL+FSLCFL VFTA ARLP+ R HK+ SGSVLRLK + SPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS

Query:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL KAQDE+VA VEARIAAWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQ K
Subjt:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
        D+SWSDC+RKGYAVKA+KGDALLFFSL+LDATTD +SLHGSCPVI GEKWSATKWIHVRSFEK T    R  CVDENENC +WAK+GECKKNPTYMVGS 
Subjt:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCQAC
         ALGYCRKSC+AC
Subjt:  SALGYCRKSCQAC

XP_022134044.1 probable prolyl 4-hydroxylase 7 [Momordica charantia] | e value: 1.3e-181 | % identity: 100
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
        MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS

Query:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
        DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
Subjt:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCQAC
        SALGYCRKSCQAC
Subjt:  SALGYCRKSCQAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida] | e value: 1.4e-159 | % identity: 88.5
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
        MDS RFL+F LCFL VFT  ARLP++R+ KK SGSV+RLK + SPL+FDPTRVTQLSW+PRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADN SGKSVS
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS

Query:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL KAQDEIVAA+EARI+AWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
        D+SWSDCARKGYAVKA+KGDALLFFSL  DATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEK TR SR+ DCVDENENC  WAKRGECKKNPTYMVGSE
Subjt:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCQAC
         ALGYCRKSC+AC
Subjt:  SALGYCRKSCQAC

XP_038889687.1 probable prolyl 4-hydroxylase 7 isoform X2 [Benincasa hispida] | e value: 2.0e-158 | % identity: 88.5
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
        MDS RFL+F LCFL VFT  ARLP++R+ KK SGSV+RLK + SPL+FDPTRVTQLSW+PRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADN SGKSVS
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS

Query:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL KAQDEIVAA+EARI+AWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
        D+SWSDCARKGYAVKA+KGDALLFFSL  DATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEK TR SR+ DCVDENENC  WAKRGECKKNPTYMVGSE
Subjt:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCQAC
         ALGYCRKSC+AC
Subjt:  SALGYCRKSCQAC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KS38 Procollagen-proline 4-dioxygenase | e value: 6.0e-156 | % identity: 86.58
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
        MDS  FL+FSLCFL VFTA ARLP+ R HK+ SGSVLRLK + SPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS

Query:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL KAQDE+VA VEARIAAWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQ K
Subjt:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
        D+SWSDC+RKGYAVKA+KGDALLFFSL+LDATTD +SLHGSCPVI GEKWSATKWIHVRSFEK T    R  CVDENENC +WAK+GECKKNPTYMVGS 
Subjt:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCQAC
         ALGYCRKSC+AC
Subjt:  SALGYCRKSCQAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase | e value: 2.4e-157 | % identity: 87.38
Query:  MDSPRFLSFSLCFLFVFTALARLPDMR----AHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSG
        MDS  FL+FSLCFL VFTA ARLP+ R    ++K+ +GSVLRLK + SPLIFDPTRVTQLSWQPRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADN SG
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMR----AHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSG

Query:  KSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFL KAQD+IVA VEARIAAWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYM
        SQEKDDSWSDC+RKGYAVKA+KGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK  R SR+ DCVDENENC +WAKRGECKKNPTYM
Subjt:  SQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYM

Query:  VGSESALGYCRKSCQAC
        VGSE ALGYCRKSC+AC
Subjt:  VGSESALGYCRKSCQAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase | e value: 6.3e-182 | % identity: 100
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
        MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS

Query:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
        DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
Subjt:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCQAC
        SALGYCRKSCQAC
Subjt:  SALGYCRKSCQAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase | e value: 2.6e-151 | % identity: 85.62
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
        MDS RFL+FSL FL V T  ARLP+   HKK+SGSVL LK +   LIFDPTRVTQLSWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADN SGKSVS
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS

Query:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL KAQDEIVA +EARI+AWTFLP ENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNS F ESQEK
Subjt:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
        DDSWSDCARKGYAVKA+KGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSF+K TR S + DCVDEN+NC SWAKRGEC+KNPTYMVGSE
Subjt:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCQAC
         A+GYCRKSC+AC
Subjt:  SALGYCRKSCQAC

A0A6J1JWX0 Procollagen-proline 4-dioxygenase | e value: 2.0e-151 | % identity: 85.62
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS
        MDS RFL FSL FL V T  ARLP+   HKK+SGSVL LK +   LIFDPTRVTQLSWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADN SGKSVS
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVS

Query:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL KAQDEIVA +EARI+AWTFLP ENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNS F ESQEK
Subjt:  SEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE
        DDSWSDCARKGYAVKA+KGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSF+K TR S + DCVDEN+NC SWAKRGEC+KNPTYMVGSE
Subjt:  DDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE

Query:  SALGYCRKSCQAC
         A+GYCRKSC+AC
Subjt:  SALGYCRKSCQAC

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e value: 4.0e-117 | % identity: 67.52
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNNSGKSV
        MDS  FL+FSL  L +F+ ++           S SV            DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNNSGKSV

Query:  SSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQE
         SEVRTSSGMFL K QD+IVA VEA++AAWTFLP ENGE++QILHYENGQKY+PHFDYF+DK   ELGGHR+ATVLMYLSNV KGGET+FPN + K  Q 
Subjt:  SSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQE

Query:  KDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGS
        KDDSWS CA++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K     ++L CVD++E+C  WA  GEC+KNP YMVGS
Subjt:  KDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGS

Query:  ESALGYCRKSCQAC
        E++LG+CRKSC+AC
Subjt:  ESALGYCRKSCQAC

F4JAU3 Prolyl 4-hydroxylase 2 | e value: 2.6e-92 | % identity: 60.07
Query:  LKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGE
        L   PS +I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+ +E +++ WTFLP ENGE
Subjt:  LKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGE

Query:  SIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDV
         +Q+L YE+GQKY+ HFDYFHDKVN   GGHR+ATVL+YLSNV KGGET+FP++ EF  +   E  D  SDCA+KG AVK KKG+ALLFF+L  DA  D 
Subjt:  SIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDV

Query:  KSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSESALGYCRKSCQAC
         SLHG CPVIEGEKWSATKWIHV SF+K    +   +C D NE+C  WA  GEC KNP YMVG+    G CR+SC+AC
Subjt:  KSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSESALGYCRKSCQAC

Q8L970 Probable prolyl 4-hydroxylase 7 | e value: 2.5e-127 | % identity: 68.99
Query:  MDSPRFLSFSLCFLFVFTALARLPD---MRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGK
        MDS  FL+FSLCFLF    ++  P+    R+     GSV+++K   S   FDPTRVTQLSW PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSPRFLSFSLCFLFVFTALARLPD---MRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGK

Query:  SVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKES
        SV SEVRTSSGMFL K QD+IV+ VEA++AAWTFLP ENGES+QILHYENGQKYEPHFDYFHD+ N ELGGHR+ATVLMYLSNVEKGGET+FP  + K +
Subjt:  SVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKES

Query:  QEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMV
        Q KDDSW++CA++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+    +++  C+DEN +C  WAK GEC+KNPTYMV
Subjt:  QEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMV

Query:  GSESALGYCRKSCQAC
        GS+   GYCRKSC+AC
Subjt:  GSESALGYCRKSCQAC

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e value: 7.4e-95 | % identity: 60.44
Query:  SPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQIL
        S +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+ +E +I+ WTFLP ENGE IQ+L
Subjt:  SPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQIL

Query:  HYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHG
         YE+GQKY+ HFDYFHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDCA++G AVK +KGDALLFF+LH DA  D  SLHG
Subjt:  HYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHG

Query:  SCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSESALGYCRKSCQAC
         CPVIEGEKWSATKWIHV SF++   PS   +C D NE+C  WA  GEC KNP YMVG+    GYCR+SC+AC
Subjt:  SCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSESALGYCRKSCQAC

Q9LN20 Probable prolyl 4-hydroxylase 3 | e value: 3.0e-64 | % identity: 55.77
Query:  LSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHF
        LSW+PRAF+Y  FLS +EC++LI LAK  + KS V D+ +GKS  S VRTSSG FL + +D+I+  +E RIA +TF+PA++GE +Q+LHYE GQKYEPH+
Subjt:  LSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHF

Query:  DYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATK
        DYF D+ N + GG R+AT+LMYLS+VE+GGET+FP +     S    +  S+C +KG +VK + GDALLF+S+  DAT D  SLHG CPVI G KWS+TK
Subjt:  DYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATK

Query:  WIHVRSFE
        W+HV  ++
Subjt:  WIHVRSFE

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e value: 1.9e-93 | % identity: 60.07
Query:  LKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGE
        L   PS +I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+ +E +++ WTFLP ENGE
Subjt:  LKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGE

Query:  SIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDV
         +Q+L YE+GQKY+ HFDYFHDKVN   GGHR+ATVL+YLSNV KGGET+FP++ EF  +   E  D  SDCA+KG AVK KKG+ALLFF+L  DA  D 
Subjt:  SIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDV

Query:  KSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSESALGYCRKSCQAC
         SLHG CPVIEGEKWSATKWIHV SF+K    +   +C D NE+C  WA  GEC KNP YMVG+    G CR+SC+AC
Subjt:  KSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSESALGYCRKSCQAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e value: 1.8e-128 | % identity: 68.99
Query:  MDSPRFLSFSLCFLFVFTALARLPD---MRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGK
        MDS  FL+FSLCFLF    ++  P+    R+     GSV+++K   S   FDPTRVTQLSW PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSPRFLSFSLCFLFVFTALARLPD---MRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGK

Query:  SVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKES
        SV SEVRTSSGMFL K QD+IV+ VEA++AAWTFLP ENGES+QILHYENGQKYEPHFDYFHD+ N ELGGHR+ATVLMYLSNVEKGGET+FP  + K +
Subjt:  SVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKES

Query:  QEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMV
        Q KDDSW++CA++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+    +++  C+DEN +C  WAK GEC+KNPTYMV
Subjt:  QEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMV

Query:  GSESALGYCRKSCQAC
        GS+   GYCRKSC+AC
Subjt:  GSESALGYCRKSCQAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e value: 1.4e-120 | % identity: 65.12
Query:  MDSPRFLSFSLCFLFVFTALARLPD---MRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGK
        MDS  FL+FSLCFLF    ++  P+    R+     GSV+++K   S   FDPTRVTQLSW PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSPRFLSFSLCFLFVFTALARLPD---MRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGK

Query:  SVSSE-----VRTSSGMFLHKAQ---DEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIF
        SV SE     VR SS    +      D+IV+ VEA++AAWTFLP ENGES+QILHYENGQKYEPHFDYFHD+ N ELGGHR+ATVLMYLSNVEKGGET+F
Subjt:  SVSSE-----VRTSSGMFLHKAQ---DEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIF

Query:  PNSEFKESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGEC
        P  + K +Q KDDSW++CA++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+    +++  C+DEN +C  WAK GEC
Subjt:  PNSEFKESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGEC

Query:  KKNPTYMVGSESALGYCRKSCQAC
        +KNPTYMVGS+   GYCRKSC+AC
Subjt:  KKNPTYMVGSESALGYCRKSCQAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e value: 2.9e-118 | % identity: 67.52
Query:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNNSGKSV
        MDS  FL+FSL  L +F+ ++           S SV            DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNNSGKSV

Query:  SSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQE
         SEVRTSSGMFL K QD+IVA VEA++AAWTFLP ENGE++QILHYENGQKY+PHFDYF+DK   ELGGHR+ATVLMYLSNV KGGET+FPN + K  Q 
Subjt:  SSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQE

Query:  KDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGS
        KDDSWS CA++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K     ++L CVD++E+C  WA  GEC+KNP YMVGS
Subjt:  KDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGS

Query:  ESALGYCRKSCQAC
        E++LG+CRKSC+AC
Subjt:  ESALGYCRKSCQAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 5.2e-96 | % identity: 60.44
Query:  SPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQIL
        S +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+ +E +I+ WTFLP ENGE IQ+L
Subjt:  SPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVAAVEARIAAWTFLPAENGESIQIL

Query:  HYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHG
         YE+GQKY+ HFDYFHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDCA++G AVK +KGDALLFF+LH DA  D  SLHG
Subjt:  HYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHG

Query:  SCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSESALGYCRKSCQAC
         CPVIEGEKWSATKWIHV SF++   PS   +C D NE+C  WA  GEC KNP YMVG+    GYCR+SC+AC
Subjt:  SCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSESALGYCRKSCQAC
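
In the alignment blocks above, the unlabelled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below shows one way to count identities from a query line and its middle line. The function name `midline_identity` is hypothetical, and the percentage it gives for a single block need not match the page's % identity column, which is presumably computed over the whole alignment.

```python
from typing import Tuple


def midline_identity(query: str, midline: str) -> Tuple[int, int]:
    """Return (identical columns, alignment columns) for one alignment block.

    Assumes BLAST-style output: a letter in the middle line means the
    query and subject residues are identical at that column.
    """
    if len(midline) > len(query):
        raise ValueError("middle line is longer than the query line")
    # Trailing spaces of the middle line are often lost in copied text.
    midline = midline.ljust(len(query))
    identical = sum(1 for mark in midline if mark.isalpha())
    return identical, len(query)


# Last block of the first GenBank hit (XP_008458700.1).
identical, columns = midline_identity("VGSESALGYCRKSCQAC", "VGSE ALGYCRKSC+AC")
print(f"{identical}/{columns} identical ({100 * identical / columns:.2f}%)")
```

To get a figure for a whole hit, sum the two counts over all of its blocks before dividing.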


Sequences
CDS sequence
ATGGATTCCCCGCGGTTCCTCTCATTTTCTCTTTGTTTTCTCTTCGTGTTCACTGCCTTGGCTCGCTTGCCGGACATGCGCGCGCACAAGAAAATAAGTGGATCTGTACT
TCGATTGAAGGGGGAACCCTCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAAT
GTGACCATCTAATCGATCTGGCCAAGGATAAATTGGAGAAGTCAATGGTAGCAGATAATAATTCTGGTAAGAGTGTAAGTAGTGAAGTCCGGACGAGTTCTGGCATGTTC
CTTCACAAGGCCCAGGATGAAATAGTTGCTGCCGTTGAGGCCAGGATTGCTGCATGGACATTCCTTCCAGCAGAAAATGGAGAGTCCATTCAAATTCTGCACTATGAGAA
TGGTCAGAAGTATGAACCACATTTTGATTATTTTCACGACAAGGTGAACCAGGAGTTAGGTGGCCACCGAGTAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGG
GTGGAGAAACCATCTTTCCAAATTCAGAGTTTAAAGAATCTCAAGAAAAGGATGACAGCTGGTCTGATTGTGCTCGAAAGGGTTATGCAGTTAAAGCGAAGAAGGGTGAT
GCATTGCTGTTCTTCAGTCTCCATCTCGATGCAACAACAGATGTCAAAAGCTTGCACGGCAGTTGCCCCGTGATTGAGGGCGAGAAATGGTCTGCAACCAAATGGATTCA
TGTGAGATCCTTCGAGAAGCCAACTCGCCCAAGTCGTCGTCTAGATTGCGTTGATGAGAACGAAAATTGCGCTTCATGGGCCAAAAGGGGTGAGTGCAAAAAGAACCCTA
CTTACATGGTGGGTTCTGAAAGTGCTTTAGGATACTGTAGGAAGAGTTGCCAAGCCTGT
mRNA sequence
ATGGATTCCCCGCGGTTCCTCTCATTTTCTCTTTGTTTTCTCTTCGTGTTCACTGCCTTGGCTCGCTTGCCGGACATGCGCGCGCACAAGAAAATAAGTGGATCTGTACT
TCGATTGAAGGGGGAACCCTCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAAT
GTGACCATCTAATCGATCTGGCCAAGGATAAATTGGAGAAGTCAATGGTAGCAGATAATAATTCTGGTAAGAGTGTAAGTAGTGAAGTCCGGACGAGTTCTGGCATGTTC
CTTCACAAGGCCCAGGATGAAATAGTTGCTGCCGTTGAGGCCAGGATTGCTGCATGGACATTCCTTCCAGCAGAAAATGGAGAGTCCATTCAAATTCTGCACTATGAGAA
TGGTCAGAAGTATGAACCACATTTTGATTATTTTCACGACAAGGTGAACCAGGAGTTAGGTGGCCACCGAGTAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGG
GTGGAGAAACCATCTTTCCAAATTCAGAGTTTAAAGAATCTCAAGAAAAGGATGACAGCTGGTCTGATTGTGCTCGAAAGGGTTATGCAGTTAAAGCGAAGAAGGGTGAT
GCATTGCTGTTCTTCAGTCTCCATCTCGATGCAACAACAGATGTCAAAAGCTTGCACGGCAGTTGCCCCGTGATTGAGGGCGAGAAATGGTCTGCAACCAAATGGATTCA
TGTGAGATCCTTCGAGAAGCCAACTCGCCCAAGTCGTCGTCTAGATTGCGTTGATGAGAACGAAAATTGCGCTTCATGGGCCAAAAGGGGTGAGTGCAAAAAGAACCCTA
CTTACATGGTGGGTTCTGAAAGTGCTTTAGGATACTGTAGGAAGAGTTGCCAAGCCTGT
Protein sequence
MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMF
LHKAQDEIVAAVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCARKGYAVKAKKGD
ALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSESALGYCRKSCQAC
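
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is a minimal translation routine, assuming the standard genetic code (NCBI translation table 1), which this page does not state. `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acid codes."""
    # Remove line breaks and spaces from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven and last four codons of the CDS listed above.
print(translate("ATGGATTCCCCGCGGTTCCTC"))  # MDSPRFL
print(translate("TGCCAAGCCTGT"))           # CQAC
```

Joining the wrapped CDS lines into one string and passing it to `translate` gives a protein to compare against the protein sequence listed above. The CDS as shown ends on the last amino acid codon, so no trailing `*` is expected.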