; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh11G010710 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh11G010710
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationCmo_Chr11:5991869..5995511
RNA-Seq ExpressionCmoCh11G010710
SyntenyCmoCh11G010710
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588394.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia]3.9e-18196.28Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFAR ANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEEC H++ +   + EQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCS WAKAGECEKNPGYMVG
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKEELGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

XP_022931100.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata]4.6e-190100Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKEELGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

XP_022971148.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima]2.4e-18698.45Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEP+QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL+YLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VF EENKDL DCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLP+DEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKEELGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

XP_023530715.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo]8.7e-18999.38Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VFEEENKDL DCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKE+LGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

XP_038905408.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]1.5e-14075.23Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        M SRFFLAFSLCFLC FP F+RSANRLPKLLL +   + SVIRMK  GS + IDPTRV++LSS+PRAFLYKGFLS +ECQHLI+LAK  L+QSLV  + T
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        G S +S +RTSTGMFL +AQD+IVA IE++IAAWTFLP+DNGEPIQILRYENGQ+Y PHFDFFQDPVN+A GGHRIAT+LMYLS+VE+GGETVFP+SP K
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        + E+E  DL DC+  GYGVKPK GDALLFFSL+PNVT D TSYHGSCPVIEGEKWSATKWIHMLP+ EIWRNP CVDEN  C AWA AGECEKNP YM  
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
          +GSK ELG+CR+SCK CSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

TrEMBL top hitse value%identityAlignment
A0A1S3B814 Procollagen-proline 4-dioxygenase1.3e-13473.37Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        M S F LAFS+ FL   PL + SANR PK+LL +    +SVIRMK  GS+I IDPTRV+QLSS+PRAFLYKGFLS EECQHLI LAK  L QSLV    T
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        G S +S +RTSTGMFL KAQD IVA IE++IAAWTFLP+DNGEPIQILRYENGQ+Y PHFDFFQDP N+A GGHRIAT+LMYLS+VE+GGETVFP+SP K
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        + EEE  DL +C+  GYGV+PK GDALLFFS++PNVT D TSYHGSCPVIEGEKWSATKWIHMLP+DE+WRNP CVDEN+HCSAWAKAGEC+KNP YM  
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
          +GSK ELG+CRLSCK CSP S
Subjt:  SSLGSKEELGYCRLSCKACSPPS

A0A6J1DTY4 Procollagen-proline 4-dioxygenase4.1e-13673.83Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLD-DTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI
        MDSR FLAFSLCFLC FPLF RS N +P+LL+D +     S+IRMK  GSSI IDP+RV QLSSQPRAF+YKGFLSAEEC+HLI+LAKD LE+SLV DD+
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLD-DTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI

Query:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPA
        TG S +S +RTSTGMFL K QD IVAGIE++IAAWTFLPVDNGEP+Q+LRYENGQ+Y PHFDFFQDPVN+A GGHRIATVLMYLSNVE GGETVFP+S  
Subjt:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPA

Query:  KVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV
        K+   E K+L DC+  GY VKPK GDALLFFSLH N TTD +SYHGSCPVI+GEKWSATKWIHML  DEIWR+PDCVD +  C+AWA  GEC KNPGYM+
Subjt:  KVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKACS
            GSK ELGYCR SC ACS
Subjt:  GSSLGSKEELGYCRLSCKACS

A0A6J1DX45 Procollagen-proline 4-dioxygenase4.1e-13673.83Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLD-DTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI
        MDSR FLAFSLCFLC FPLF RS N +P+LL+D +     S+IRMK  GSSI IDP+RV QLSSQPRAF+YKGFLSAEEC+HLI+LAKD LE+SLV DD+
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLD-DTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI

Query:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPA
        TG S +S +RTSTGMFL K QD IVAGIE++IAAWTFLPVDNGEP+Q+LRYENGQ+Y PHFDFFQDPVN+A GGHRIATVLMYLSNVE GGETVFP+S  
Subjt:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPA

Query:  KVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV
        K+   E K+L DC+  GY VKPK GDALLFFSLH N TTD +SYHGSCPVI+GEKWSATKWIHML  DEIWR+PDCVD +  C+AWA  GEC KNPGYM+
Subjt:  KVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKACS
            GSK ELGYCR SC ACS
Subjt:  GSSLGSKEELGYCRLSCKACS

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase2.2e-190100Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKEELGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

A0A6J1I5Z9 Procollagen-proline 4-dioxygenase1.1e-18698.45Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEP+QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL+YLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VF EENKDL DCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLP+DEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKEELGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 64.0e-9656.88Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI-
        MDS++FLAFSL  L  F                           ++   S  +DPTR+ QLS  PRAFLYKGFLS EEC HLI LAK  LE+S+VV D+ 
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI-

Query:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPA
        +G S  S  RTS+GMFL K QDDIVA +EAK+AAWTFLP +NGE +QIL YENGQ+Y PHFD+F D   +  GGHRIATVLMYLSNV +GGETVFP+   
Subjt:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPA

Query:  KVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV
        K  + ++     C+  GY VKP+KGDALLFF+LH N TTDP S HGSCPVIEGEKWSAT+WIH+    +  +   CVD++E C  WA AGECEKNP YMV
Subjt:  KVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKAC
            GS+  LG+CR SCKAC
Subjt:  GSSLGSKEELGYCRLSCKAC

F4JAU3 Prolyl 4-hydroxylase 23.8e-8657.2Show/hide
Query:  IDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYEN
        I+P++V Q+SS+PRAF+Y+GFL+  EC HLI LAK+ L++S V D+  G S+ S  RTS+G F+ K +D IV+GIE K++ WTFLP +NGE +Q+LRYE+
Subjt:  IDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYEN

Query:  GQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDS---PAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPV
        GQ+Y  HFD+F D VN+A GGHRIATVL+YLSNV +GGETVFPD+     +   E   DL DC+  G  VKPKKG+ALLFF+L  +   DP S HG CPV
Subjt:  GQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDS---PAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPV

Query:  IEGEKWSATKWIHMLPVDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC
        IEGEKWSATKWIH+   D+I   + +C D NE C  WA  GEC KNP YMV    G+ E  G CR SCKAC
Subjt:  IEGEKWSATKWIHMLPVDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC

Q8L970 Probable prolyl 4-hydroxylase 73.5e-10860.12Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSR FLAFSLCFL + PL + + NR   L       + SVI+MK   SS   DPTRV QLS  PR FLY+GFLS EEC H I LAK  LE+S+V D+ +
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        G S  S  RTS+GMFL K QDDIV+ +EAK+AAWTFLP +NGE +QIL YENGQ+Y PHFD+F D  N+  GGHRIATVLMYLSNVE+GGETVFP    K
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIW-RNPDCVDENEHCSAWAKAGECEKNPGYMV
          + ++    +C+  GY VKP+KGDALLFF+LHPN TTD  S HGSCPV+EGEKWSAT+WIH+   +  + +   C+DEN  C  WAKAGEC+KNP YMV
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIW-RNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKACS
        GS     ++ GYCR SCKACS
Subjt:  GSSLGSKEELGYCRLSCKACS

Q8LAN3 Probable prolyl 4-hydroxylase 41.5e-9056.83Show/hide
Query:  MDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPI
        +  SS+ ++P++V Q+SS+PRAF+Y+GFL+  EC H++ LAK  L++S V D+ +G S+ S  RTS+G F+ K +D IV+GIE KI+ WTFLP +NGE I
Subjt:  MDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPI

Query:  QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDS---PAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTS
        Q+LRYE+GQ+Y  HFD+F D VN+  GGHR+AT+LMYLSNV +GGETVFPD+     +V  E  +DL DC+  G  VKP+KGDALLFF+LHP+   DP S
Subjt:  QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDS---PAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTS

Query:  YHGSCPVIEGEKWSATKWIHMLPVDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC
         HG CPVIEGEKWSATKWIH+   D I   + +C D NE C  WA  GEC KNP YMVG++    E  GYCR SCKAC
Subjt:  YHGSCPVIEGEKWSATKWIHMLPVDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 36.1e-6053.43Show/hide
Query:  LSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHF
        LS +PRAF+Y  FLS EEC++LI LAK ++ +S VVD  TG S+ S  RTS+G FL + +D I+  IE +IA +TF+P D+GE +Q+L YE GQ+Y PH+
Subjt:  LSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHF

Query:  DFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAKVFEEE-NKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATK
        D+F D  N   GG R+AT+LMYLS+VE GGETVFP +           +L +C   G  VKP+ GDALLF+S+ P+ T DPTS HG CPVI G KWS+TK
Subjt:  DFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAKVFEEE-NKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATK

Query:  WIHM
        W+H+
Subjt:  WIHM

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 22.7e-8757.2Show/hide
Query:  IDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYEN
        I+P++V Q+SS+PRAF+Y+GFL+  EC HLI LAK+ L++S V D+  G S+ S  RTS+G F+ K +D IV+GIE K++ WTFLP +NGE +Q+LRYE+
Subjt:  IDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYEN

Query:  GQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDS---PAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPV
        GQ+Y  HFD+F D VN+A GGHRIATVL+YLSNV +GGETVFPD+     +   E   DL DC+  G  VKPKKG+ALLFF+L  +   DP S HG CPV
Subjt:  GQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDS---PAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPV

Query:  IEGEKWSATKWIHMLPVDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC
        IEGEKWSATKWIH+   D+I   + +C D NE C  WA  GEC KNP YMV    G+ E  G CR SCKAC
Subjt:  IEGEKWSATKWIHMLPVDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase2.5e-10960.12Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSR FLAFSLCFL + PL + + NR   L       + SVI+MK   SS   DPTRV QLS  PR FLY+GFLS EEC H I LAK  LE+S+V D+ +
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK
        G S  S  RTS+GMFL K QDDIV+ +EAK+AAWTFLP +NGE +QIL YENGQ+Y PHFD+F D  N+  GGHRIATVLMYLSNVE+GGETVFP    K
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAK

Query:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIW-RNPDCVDENEHCSAWAKAGECEKNPGYMV
          + ++    +C+  GY VKP+KGDALLFF+LHPN TTD  S HGSCPV+EGEKWSAT+WIH+   +  + +   C+DEN  C  WAKAGEC+KNP YMV
Subjt:  VFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIW-RNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKACS
        GS     ++ GYCR SCKACS
Subjt:  GSSLGSKEELGYCRLSCKACS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase7.8e-10356.84Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSR FLAFSLCFL + PL + + NR   L       + SVI+MK   SS   DPTRV QLS  PR FLY+GFLS EEC H I LAK  LE+S+V D+ +
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTS----TGMFLYKAQ----DDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGET
        G S  S D  S    +  F+        DDIV+ +EAK+AAWTFLP +NGE +QIL YENGQ+Y PHFD+F D  N+  GGHRIATVLMYLSNVE+GGET
Subjt:  GASRSSTDRTS----TGMFLYKAQ----DDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGET

Query:  VFPDSPAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIW-RNPDCVDENEHCSAWAKAGEC
        VFP    K  + ++    +C+  GY VKP+KGDALLFF+LHPN TTD  S HGSCPV+EGEKWSAT+WIH+   +  + +   C+DEN  C  WAKAGEC
Subjt:  VFPDSPAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIW-RNPDCVDENEHCSAWAKAGEC

Query:  EKNPGYMVGSSLGSKEELGYCRLSCKACS
        +KNP YMVGS     ++ GYCR SCKACS
Subjt:  EKNPGYMVGSSLGSKEELGYCRLSCKACS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase2.9e-9756.88Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI-
        MDS++FLAFSL  L  F                           ++   S  +DPTR+ QLS  PRAFLYKGFLS EEC HLI LAK  LE+S+VV D+ 
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI-

Query:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPA
        +G S  S  RTS+GMFL K QDDIVA +EAK+AAWTFLP +NGE +QIL YENGQ+Y PHFD+F D   +  GGHRIATVLMYLSNV +GGETVFP+   
Subjt:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPA

Query:  KVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV
        K  + ++     C+  GY VKP+KGDALLFF+LH N TTDP S HGSCPVIEGEKWSAT+WIH+    +  +   CVD++E C  WA AGECEKNP YMV
Subjt:  KVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKAC
            GS+  LG+CR SCKAC
Subjt:  GSSLGSKEELGYCRLSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-9156.83Show/hide
Query:  MDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPI
        +  SS+ ++P++V Q+SS+PRAF+Y+GFL+  EC H++ LAK  L++S V D+ +G S+ S  RTS+G F+ K +D IV+GIE KI+ WTFLP +NGE I
Subjt:  MDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPI

Query:  QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDS---PAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTS
        Q+LRYE+GQ+Y  HFD+F D VN+  GGHR+AT+LMYLSNV +GGETVFPD+     +V  E  +DL DC+  G  VKP+KGDALLFF+LHP+   DP S
Subjt:  QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDS---PAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTS

Query:  YHGSCPVIEGEKWSATKWIHMLPVDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC
         HG CPVIEGEKWSATKWIH+   D I   + +C D NE C  WA  GEC KNP YMVG++    E  GYCR SCKAC
Subjt:  YHGSCPVIEGEKWSATKWIHMLPVDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGATTTTTTCTTGCATTTTCTCTTTGTTTCCTCTGTTCATTCCCTCTATTTGCTCGCTCCGCCAATCGATTGCCGAAATTACTCTTGGACGACACGAAAAC
GGAAGATTCTGTTATTAGGATGAAAATGGACGGTTCCTCCATTAAAATCGATCCCACTCGGGTCGTTCAGCTTTCATCGCAACCCAGGGCTTTCTTGTACAAGGGATTTC
TGTCTGCAGAGGAGTGCCAGCATCTTATCGATTTGGCGAAGGATTACCTAGAGCAATCATTGGTGGTCGATGACATTACGGGTGCAAGTCGTTCGAGTACTGACCGGACG
AGTACCGGCATGTTTCTTTATAAGGCTCAGGATGACATAGTTGCTGGCATTGAGGCCAAGATTGCTGCATGGACGTTCCTTCCCGTCGATAATGGGGAGCCTATACAAAT
ACTAAGGTATGAAAATGGTCAGCAATATGTACCACATTTTGATTTTTTTCAAGATCCAGTTAATGTAGCTGCTGGTGGTCATCGGATAGCCACAGTCTTGATGTATTTGT
CCAATGTTGAAAGGGGTGGAGAAACTGTCTTTCCCGATTCTCCGGCTAAAGTATTCGAGGAGGAGAACAAGGATTTGTTCGATTGCTCTACGACCGGTTATGGAGTTAAG
CCAAAGAAGGGCGACGCTCTACTATTCTTCAGTCTCCATCCAAACGTGACGACAGACCCGACGAGCTATCACGGGAGCTGCCCAGTGATAGAGGGGGAGAAGTGGTCTGC
AACAAAATGGATTCACATGCTACCAGTAGATGAGATTTGGAGGAATCCAGATTGTGTGGATGAGAATGAGCACTGTAGTGCATGGGCCAAAGCAGGTGAATGTGAAAAGA
ACCCTGGTTATATGGTGGGTTCTTCCTTGGGTTCTAAGGAAGAACTTGGATATTGTAGGCTTAGTTGCAAAGCCTGCTCTCCTCCCTCATAA
mRNA sequenceShow/hide mRNA sequence
TTTCGTTCTTTGATTTGATTTCTCGTTCTTTTCCATTTATTCGTTTATATAATTCGCTTTTTTTCTCTTGAATTTCTGAGTTTCAGAGCTCGAGTTTCGCCATGGATTCT
CGATTTTTTCTTGCATTTTCTCTTTGTTTCCTCTGTTCATTCCCTCTATTTGCTCGCTCCGCCAATCGATTGCCGAAATTACTCTTGGACGACACGAAAACGGAAGATTC
TGTTATTAGGATGAAAATGGACGGTTCCTCCATTAAAATCGATCCCACTCGGGTCGTTCAGCTTTCATCGCAACCCAGGGCTTTCTTGTACAAGGGATTTCTGTCTGCAG
AGGAGTGCCAGCATCTTATCGATTTGGCGAAGGATTACCTAGAGCAATCATTGGTGGTCGATGACATTACGGGTGCAAGTCGTTCGAGTACTGACCGGACGAGTACCGGC
ATGTTTCTTTATAAGGCTCAGGATGACATAGTTGCTGGCATTGAGGCCAAGATTGCTGCATGGACGTTCCTTCCCGTCGATAATGGGGAGCCTATACAAATACTAAGGTA
TGAAAATGGTCAGCAATATGTACCACATTTTGATTTTTTTCAAGATCCAGTTAATGTAGCTGCTGGTGGTCATCGGATAGCCACAGTCTTGATGTATTTGTCCAATGTTG
AAAGGGGTGGAGAAACTGTCTTTCCCGATTCTCCGGCTAAAGTATTCGAGGAGGAGAACAAGGATTTGTTCGATTGCTCTACGACCGGTTATGGAGTTAAGCCAAAGAAG
GGCGACGCTCTACTATTCTTCAGTCTCCATCCAAACGTGACGACAGACCCGACGAGCTATCACGGGAGCTGCCCAGTGATAGAGGGGGAGAAGTGGTCTGCAACAAAATG
GATTCACATGCTACCAGTAGATGAGATTTGGAGGAATCCAGATTGTGTGGATGAGAATGAGCACTGTAGTGCATGGGCCAAAGCAGGTGAATGTGAAAAGAACCCTGGTT
ATATGGTGGGTTCTTCCTTGGGTTCTAAGGAAGAACTTGGATATTGTAGGCTTAGTTGCAAAGCCTGCTCTCCTCCCTCATAA
Protein sequenceShow/hide protein sequence
MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRT
STGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLMYLSNVERGGETVFPDSPAKVFEEENKDLFDCSTTGYGVK
PKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPVDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKACSPPS