; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0019548 (gene) of Chayote v1 genome

Gene IDSed0019548
OrganismSechium edule (Chayote v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationLG03:1100888..1105811
RNA-Seq ExpressionSed0019548
SyntenySed0019548
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]8.5e-15786.39Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFP----VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESG
        MDSR  L FS+C L VFTAFAR P    + H++   TGSVLRLKTD SPLIFDPTRV QLSW+PRAFLYKGFLSD+ECDHLIDLAKDKL+KSMVADNESG
Subjt:  MDSRRLLTFSVCSLFVFTAFARFP----VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKD
        KSVSSEVRTSSGMFLRKAQD+IVAG+E+RIAAWT LPAENGESIQILHYENGQKYEPHFDFF DKVNQELGGHRIATVLMYLSNVEKGGETIFPNS  K+
Subjt:  KSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKD

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMV
        SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTD+RSLHGSCPVIEGEKWSATKWIHVRSFEK  R S Q CVDE+E+CP WA +GECKKNP YMV
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMV

Query:  GSEGALGYCRKSCKAC
        GSEGALGYCRKSCKAC
Subjt:  GSEGALGYCRKSCKAC

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus]1.1e-15385.94Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS
        MDSR  L FS+C L VFTAFAR P + TH   +GSVLRLKTD SPLIFDPTRV QLSW+PRAFLYKGFLSD ECDHLIDLAKDKL+KSMVADN+SGKSVS
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK
        SEVRTSSGMFLRKAQDE+VAG+E+RIAAWT LPAENGESIQILHYENGQKYEPHFDFF DKVNQELGGHRIATVLMYLSNVEKGGETIFPNS  K+SQ K
Subjt:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE
        D+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTD+RSLHGSCPVI GEKWSATKWIHVRSFEK T R S QGCVDE+E+C  WA KGECKKNP YMVGS 
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE

Query:  GALGYCRKSCKAC
        GALGYCRKSCKAC
Subjt:  GALGYCRKSCKAC

XP_022134044.1 probable prolyl 4-hydroxylase 7 [Momordica charantia]3.3e-15384.35Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS
        MDS R L+FS+C LFVFTA AR P    H  ++GSVLRLK +PSPLIFDPTRV QLSW+PRAFLYKGFLSDKECDHLIDLAKDKL+KSMVADN SGKSVS
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK
        SEVRTSSGMFL KAQDEIVA +E+RIAAWTFLPAENGESIQILHYENGQKYEPHFD+F DKVNQELGGHR+ATVLMYLSNVEKGGETIFPNS  K+SQEK
Subjt:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQ-GCVDEDEHCPGWAAKGECKKNPAYMVGSE
        DDSWSDC+RKGYAVKA+KGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEKPTR S +  CVDE+E+C  WA +GECKKNP YMVGSE
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQ-GCVDEDEHCPGWAAKGECKKNPAYMVGSE

Query:  GALGYCRKSCKAC
         ALGYCRKSC+AC
Subjt:  GALGYCRKSCKAC

XP_031742194.1 probable prolyl 4-hydroxylase 7 isoform X2 [Cucumis sativus]1.7e-15285.94Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS
        MDSR  L FS+C L VFTAFAR P + TH   +GSVLRLKTD SPLIFDPTRV QLSW+PRAFLYKGFLSD ECDHLIDLAKDKL+KSMVADN+SGKSVS
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK
        SEVRTSSGMFLRKAQDE+VAG+E+RIAAWT LPAENGESIQILHYENGQKYEPHFDFF DKVNQELGGHRIATVLMYLSNVEKGGETIFPNS  K+SQ K
Subjt:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE
        D+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTD+RSLHGSCPVI GEKWSATKWIHVRSFEK T R S QGCVDE+E+C  WA KGECKKNP YMVGS 
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE

Query:  GALGYCRKSCKAC
        GALGYCRKSCKAC
Subjt:  GALGYCRKSCKAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]9.1e-15183.97Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS
        MDSRR L F +C L VFT FAR P   +    +GSV+RLKTD SPL+FDPTRV QLSWEPRAFLYKGFLSDKECDHLIDLAKDKL+KSMVADNESGKSVS
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK
        SEVRTSSGMFLRKAQDEIVA IE+RI+AWT LPAENGESIQILHYENGQKYEPHFDFF DKVNQELGGHRIATVLMYLSNVEKGGETIFPNS  K+SQEK
Subjt:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEG
        D+SWSDC+RKGYAVKA+KGDALLFFSL  DATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK TR S Q CVDE+E+C  WA +GECKKNP YMVGSE 
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEG

Query:  ALGYCRKSCKAC
        ALGYCRKSC+AC
Subjt:  ALGYCRKSCKAC

TrEMBL top hitse value%identityAlignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase5.6e-15485.94Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS
        MDSR  L FS+C L VFTAFAR P + TH   +GSVLRLKTD SPLIFDPTRV QLSW+PRAFLYKGFLSD ECDHLIDLAKDKL+KSMVADN+SGKSVS
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK
        SEVRTSSGMFLRKAQDE+VAG+E+RIAAWT LPAENGESIQILHYENGQKYEPHFDFF DKVNQELGGHRIATVLMYLSNVEKGGETIFPNS  K+SQ K
Subjt:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE
        D+SWSDCSRKGYAVKAQKGDALLFFSL+LDATTD+RSLHGSCPVI GEKWSATKWIHVRSFEK T R S QGCVDE+E+C  WA KGECKKNP YMVGS 
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE

Query:  GALGYCRKSCKAC
        GALGYCRKSCKAC
Subjt:  GALGYCRKSCKAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase4.1e-15786.39Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFP----VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESG
        MDSR  L FS+C L VFTAFAR P    + H++   TGSVLRLKTD SPLIFDPTRV QLSW+PRAFLYKGFLSD+ECDHLIDLAKDKL+KSMVADNESG
Subjt:  MDSRRLLTFSVCSLFVFTAFARFP----VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKD
        KSVSSEVRTSSGMFLRKAQD+IVAG+E+RIAAWT LPAENGESIQILHYENGQKYEPHFDFF DKVNQELGGHRIATVLMYLSNVEKGGETIFPNS  K+
Subjt:  KSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKD

Query:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMV
        SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTD+RSLHGSCPVIEGEKWSATKWIHVRSFEK  R S Q CVDE+E+CP WA +GECKKNP YMV
Subjt:  SQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMV

Query:  GSEGALGYCRKSCKAC
        GSEGALGYCRKSCKAC
Subjt:  GSEGALGYCRKSCKAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase1.6e-15384.35Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS
        MDS R L+FS+C LFVFTA AR P    H  ++GSVLRLK +PSPLIFDPTRV QLSW+PRAFLYKGFLSDKECDHLIDLAKDKL+KSMVADN SGKSVS
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK
        SEVRTSSGMFL KAQDEIVA +E+RIAAWTFLPAENGESIQILHYENGQKYEPHFD+F DKVNQELGGHR+ATVLMYLSNVEKGGETIFPNS  K+SQEK
Subjt:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQ-GCVDEDEHCPGWAAKGECKKNPAYMVGSE
        DDSWSDC+RKGYAVKA+KGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEKPTR S +  CVDE+E+C  WA +GECKKNP YMVGSE
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQ-GCVDEDEHCPGWAAKGECKKNPAYMVGSE

Query:  GALGYCRKSCKAC
         ALGYCRKSC+AC
Subjt:  GALGYCRKSCKAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase9.8e-15184.62Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS
        MDSRR L FS+  L V T FAR P   TH  L+GSVL LK D   LIFDPTRV QLSW+PRAFLYKGFL+D+ECDHLIDLAKDKL+KSMVADNESGKSVS
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK
        SEVRTSSGMFLRKAQDEIVAGIE+RI+AWTFLP ENGESIQILHYENGQKYEPHFDFF DKVNQELGGHRIATVLMYLSNVEKGGETIFPNS   +SQEK
Subjt:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEG
        DDSWSDC+RKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSF+K TR S+Q CVDE+++CP WA +GEC+KNP YMVGSEG
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEG

Query:  ALGYCRKSCKAC
        A+GYCRKSCKAC
Subjt:  ALGYCRKSCKAC

A0A6J1JWX0 Procollagen-proline 4-dioxygenase1.3e-15084.62Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS
        MDSRR L FS+  L V T FAR P   TH  L+GSVL LK D   LIFDPTRV QLSW+PRAFLYKGFL+D+ECDHLIDLAKDKL+KSMVADNESGKSVS
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK
        SEVRTSSGMFLRKAQDEIVAGIE+RI+AWTFLP ENGESIQILHYENGQKYEPHFDFF DKVNQELGGHRIATVLMYLSNVEKGGETIFPNS   +SQEK
Subjt:  SEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEK

Query:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEG
        DDSWSDC+RKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSF+K TR S+Q CVDE+++CP WA +GEC+KNP YMVGSEG
Subjt:  DDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEG

Query:  ALGYCRKSCKAC
        A+GYCRKSCKAC
Subjt:  ALGYCRKSCKAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 64.6e-11365.18Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSM-VADNESGKSV
        MDS+  L FS+  L +F+  + F  S                      DPTR+ QLSW PRAFLYKGFLSD+ECDHLI LAK KL+KSM VAD +SG+S 
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSM-VADNESGKSV

Query:  SSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQE
         SEVRTSSGMFL K QD+IVA +E+++AAWTFLP ENGE++QILHYENGQKY+PHFD+F DK   ELGGHRIATVLMYLSNV KGGET+FPN   K  Q 
Subjt:  SSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQE

Query:  KDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE
        KDDSWS C+++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K        CVD+ E C  WA  GEC+KNP YMVGSE
Subjt:  KDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE

Query:  GALGYCRKSCKAC
         +LG+CRKSCKAC
Subjt:  GALGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 29.9e-9259.57Show/hide
Query:  LKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGE
        L + PS +I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ LQ+S VADN++G+S  S+VRTSSG F+ K +D IV+GIE +++ WTFLP ENGE
Subjt:  LKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGE

Query:  SIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSM---LKDSQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDK
         +Q+L YE+GQKY+ HFD+F DKVN   GGHRIATVL+YLSNV KGGET+FP++     +   E  D  SDC++KG AVK +KG+ALLFF+L  DA  D 
Subjt:  SIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSM---LKDSQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDK

Query:  RSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEGALGYCRKSCKAC
         SLHG CPVIEGEKWSATKWIHV SF+K        C D +E C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  RSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEGALGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 78.3e-12366.67Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFP---VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGK
        MDSR  L FS+C LF     +  P   ++ + N   GSV+++KT  S   FDPTRV QLSW PR FLY+GFLSD+ECDH I LAK KL+KSMVADN+SG+
Subjt:  MDSRRLLTFSVCSLFVFTAFARFP---VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGK

Query:  SVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDS
        SV SEVRTSSGMFL K QD+IV+ +E+++AAWTFLP ENGES+QILHYENGQKYEPHFD+F D+ N ELGGHRIATVLMYLSNVEKGGET+FP    K +
Subjt:  SVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDS

Query:  QEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVG
        Q KDDSW++C+++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     + GC+DE+  C  WA  GEC+KNP YMVG
Subjt:  QEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVG

Query:  SEGALGYCRKSCKAC
        S+   GYCRKSCKAC
Subjt:  SEGALGYCRKSCKAC

Q8LAN3 Probable prolyl 4-hydroxylase 41.2e-9259.56Show/hide
Query:  SPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQIL
        S +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+GIE +I+ WTFLP ENGE IQ+L
Subjt:  SPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQIL

Query:  HYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQ---EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHG
         YE+GQKY+ HFD+F DKVN   GGHR+AT+LMYLSNV KGGET+FP++ +   +   E  +  SDC+++G AVK +KGDALLFF+LH DA  D  SLHG
Subjt:  HYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQ---EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHG

Query:  SCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEGALGYCRKSCKAC
         CPVIEGEKWSATKWIHV SF++    S   C D +E C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  SCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEGALGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 37.2e-6657.21Show/hide
Query:  LSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHF
        LSWEPRAF+Y  FLS +EC++LI LAK  + KS V D+E+GKS  S VRTSSG FLR+ +D+I+  IE RIA +TF+PA++GE +Q+LHYE GQKYEPH+
Subjt:  LSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHF

Query:  DFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLK-DSQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATK
        D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP + +   S    +  S+C +KG +VK + GDALLF+S+  DAT D  SLHG CPVI G KWS+TK
Subjt:  DFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLK-DSQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATK

Query:  WIHVRSFE
        W+HV  ++
Subjt:  WIHVRSFE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 27.1e-9359.57Show/hide
Query:  LKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGE
        L + PS +I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ LQ+S VADN++G+S  S+VRTSSG F+ K +D IV+GIE +++ WTFLP ENGE
Subjt:  LKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGE

Query:  SIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSM---LKDSQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDK
         +Q+L YE+GQKY+ HFD+F DKVN   GGHRIATVL+YLSNV KGGET+FP++     +   E  D  SDC++KG AVK +KG+ALLFF+L  DA  D 
Subjt:  SIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSM---LKDSQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDK

Query:  RSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEGALGYCRKSCKAC
         SLHG CPVIEGEKWSATKWIHV SF+K        C D +E C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  RSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEGALGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase5.9e-12466.67Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFP---VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGK
        MDSR  L FS+C LF     +  P   ++ + N   GSV+++KT  S   FDPTRV QLSW PR FLY+GFLSD+ECDH I LAK KL+KSMVADN+SG+
Subjt:  MDSRRLLTFSVCSLFVFTAFARFP---VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGK

Query:  SVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDS
        SV SEVRTSSGMFL K QD+IV+ +E+++AAWTFLP ENGES+QILHYENGQKYEPHFD+F D+ N ELGGHRIATVLMYLSNVEKGGET+FP    K +
Subjt:  SVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDS

Query:  QEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVG
        Q KDDSW++C+++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     + GC+DE+  C  WA  GEC+KNP YMVG
Subjt:  QEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVG

Query:  SEGALGYCRKSCKAC
        S+   GYCRKSCKAC
Subjt:  SEGALGYCRKSCKAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase5.9e-11662.85Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFP---VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGK
        MDSR  L FS+C LF     +  P   ++ + N   GSV+++KT  S   FDPTRV QLSW PR FLY+GFLSD+ECDH I LAK KL+KSMVADN+SG+
Subjt:  MDSRRLLTFSVCSLFVFTAFARFP---VSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGK

Query:  SVSSE-----VRTSSGMFLRKAQ---DEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIF
        SV SE     VR SS           D+IV+ +E+++AAWTFLP ENGES+QILHYENGQKYEPHFD+F D+ N ELGGHRIATVLMYLSNVEKGGET+F
Subjt:  SVSSE-----VRTSSGMFLRKAQ---DEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIF

Query:  PNSMLKDSQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECK
        P    K +Q KDDSW++C+++GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     + GC+DE+  C  WA  GEC+
Subjt:  PNSMLKDSQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECK

Query:  KNPAYMVGSEGALGYCRKSCKAC
        KNP YMVGS+   GYCRKSCKAC
Subjt:  KNPAYMVGSEGALGYCRKSCKAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase3.3e-11465.18Show/hide
Query:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSM-VADNESGKSV
        MDS+  L FS+  L +F+  + F  S                      DPTR+ QLSW PRAFLYKGFLSD+ECDHLI LAK KL+KSM VAD +SG+S 
Subjt:  MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSM-VADNESGKSV

Query:  SSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQE
         SEVRTSSGMFL K QD+IVA +E+++AAWTFLP ENGE++QILHYENGQKY+PHFD+F DK   ELGGHRIATVLMYLSNV KGGET+FPN   K  Q 
Subjt:  SSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQE

Query:  KDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE
        KDDSWS C+++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K        CVD+ E C  WA  GEC+KNP YMVGSE
Subjt:  KDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSE

Query:  GALGYCRKSCKAC
         +LG+CRKSCKAC
Subjt:  GALGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.4e-9459.56Show/hide
Query:  SPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQIL
        S +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+GIE +I+ WTFLP ENGE IQ+L
Subjt:  SPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIESRIAAWTFLPAENGESIQIL

Query:  HYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQ---EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHG
         YE+GQKY+ HFD+F DKVN   GGHR+AT+LMYLSNV KGGET+FP++ +   +   E  +  SDC+++G AVK +KGDALLFF+LH DA  D  SLHG
Subjt:  HYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQ---EKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDKRSLHG

Query:  SCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEGALGYCRKSCKAC
         CPVIEGEKWSATKWIHV SF++    S   C D +E C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  SCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEGALGYCRKSCKAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGTCGATTACTCACATTTTCCGTTTGCTCTCTGTTCGTCTTCACCGCCTTCGCTCGCTTTCCTGTATCGCATACGCACAACAATCTGACTGGATCTGTGCT
CCGGTTGAAGACTGATCCATCTCCGCTGATTTTTGATCCAACTCGAGTCGCCCAGCTCTCCTGGGAACCCAGGGCATTTTTGTATAAGGGGTTTTTATCTGATAAGGAAT
GTGATCATCTAATCGATCTGGCTAAGGATAAATTACAGAAGTCTATGGTAGCCGATAATGAGTCTGGTAAGAGTGTAAGTAGCGAAGTCCGGACGAGTTCTGGCATGTTT
CTCCGGAAGGCTCAGGATGAAATCGTTGCCGGTATTGAGTCAAGAATTGCTGCATGGACATTCCTTCCAGCAGAAAATGGAGAGTCCATTCAAATTCTGCACTATGAGAA
TGGCCAAAAGTATGAACCACATTTTGATTTTTTTCAAGACAAGGTGAATCAGGAGTTAGGTGGCCATCGAATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGG
GTGGAGAAACTATCTTTCCAAACTCAATGCTTAAAGATTCTCAAGAGAAGGATGATAGTTGGTCTGATTGTTCTCGAAAGGGCTATGCAGTTAAAGCGCAGAAGGGCGAT
GCATTGCTGTTCTTCAGCCTCCATCTCGATGCAACAACAGATAAGAGAAGCTTGCACGGTAGTTGCCCTGTGATTGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCA
TGTTAGATCCTTTGAGAAGCCAACTCGTGCAAGTACTCAGGGTTGCGTGGACGAGGACGAACATTGCCCTGGGTGGGCGGCAAAGGGTGAGTGCAAAAAGAACCCTGCTT
ACATGGTGGGTTCTGAAGGTGCTTTAGGATACTGTAGGAAGAGTTGCAAAGCGTGTTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAAAAAAAAAAGAAAAGAGATTGGAAATGGAGTTCCGAACAACAGCCGACTGATTCTCCATTCTATTGACAGTTGAATCTTCGTAAGAAAATTCAAATCCAT
CACTTTTTTTTTTCCTTAGCGTACACGAAAGAATCCTTCAATCGTCGAATTTTCACATCTCCCTTTCTCCGATTTGATATCGGAGAATCGAGATTTGGGCAATGGATTCC
CGTCGATTACTCACATTTTCCGTTTGCTCTCTGTTCGTCTTCACCGCCTTCGCTCGCTTTCCTGTATCGCATACGCACAACAATCTGACTGGATCTGTGCTCCGGTTGAA
GACTGATCCATCTCCGCTGATTTTTGATCCAACTCGAGTCGCCCAGCTCTCCTGGGAACCCAGGGCATTTTTGTATAAGGGGTTTTTATCTGATAAGGAATGTGATCATC
TAATCGATCTGGCTAAGGATAAATTACAGAAGTCTATGGTAGCCGATAATGAGTCTGGTAAGAGTGTAAGTAGCGAAGTCCGGACGAGTTCTGGCATGTTTCTCCGGAAG
GCTCAGGATGAAATCGTTGCCGGTATTGAGTCAAGAATTGCTGCATGGACATTCCTTCCAGCAGAAAATGGAGAGTCCATTCAAATTCTGCACTATGAGAATGGCCAAAA
GTATGAACCACATTTTGATTTTTTTCAAGACAAGGTGAATCAGGAGTTAGGTGGCCATCGAATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAA
CTATCTTTCCAAACTCAATGCTTAAAGATTCTCAAGAGAAGGATGATAGTTGGTCTGATTGTTCTCGAAAGGGCTATGCAGTTAAAGCGCAGAAGGGCGATGCATTGCTG
TTCTTCAGCCTCCATCTCGATGCAACAACAGATAAGAGAAGCTTGCACGGTAGTTGCCCTGTGATTGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCATGTTAGATC
CTTTGAGAAGCCAACTCGTGCAAGTACTCAGGGTTGCGTGGACGAGGACGAACATTGCCCTGGGTGGGCGGCAAAGGGTGAGTGCAAAAAGAACCCTGCTTACATGGTGG
GTTCTGAAGGTGCTTTAGGATACTGTAGGAAGAGTTGCAAAGCGTGTTAAACTAGAAAGAAGTCTTGTTTTTGCAGAGGTTAAGTTTTGATTCTGTGATGCTTATTATGT
ATATAGCATTGAGCAGTAACTGGGTATACAAGTGGATATTAAATTTCTCTGATTAAACCTTATAGTAGCAATTAGCCAATTGGTTCATTTGGTACACTATAAAATTTTCT
CTGTAATTTCATGTATTTGGTCTAACTCTTCTTTTGAACTTTACGTATTATATCTTACAAGTAATATACTTTGTCCCCTTGTTGAAGTATGTAATAAAATTTGCCTTCAA
TTATAAAATTGAGAATTTTAACATTTGG
Protein sequenceShow/hide protein sequence
MDSRRLLTFSVCSLFVFTAFARFPVSHTHNNLTGSVLRLKTDPSPLIFDPTRVAQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLQKSMVADNESGKSVSSEVRTSSGMF
LRKAQDEIVAGIESRIAAWTFLPAENGESIQILHYENGQKYEPHFDFFQDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSMLKDSQEKDDSWSDCSRKGYAVKAQKGD
ALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASTQGCVDEDEHCPGWAAKGECKKNPAYMVGSEGALGYCRKSCKAC