; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh11G010480 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh11G010480
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationCma_Chr11:5762340..5766041
RNA-Seq ExpressionCmaCh11G010480
SyntenyCmaCh11G010480
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588394.1 putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia]1.5e-17794.74Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFAR ANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEEC H++ +   + EQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEP+QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL+YLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VF-EENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VF EENKDL DCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLP+DEIWRNPDCVDENEHCS WAKAGECEKNPGYMVG
Subjt:  VF-EENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKEELGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

XP_022931100.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata]1.8e-18698.45Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEP+QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL+YLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VF-EENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VF EENKDL DCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLP+DEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
Subjt:  VF-EENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKEELGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

XP_022971148.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima]2.3e-189100Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVGS
        VFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVGS
Subjt:  VFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVGS

Query:  SLGSKEELGYCRLSCKACSPPS
        SLGSKEELGYCRLSCKACSPPS
Subjt:  SLGSKEELGYCRLSCKACSPPS

XP_023530715.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo]8.1e-18798.45Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEP+QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL+YLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VF-EENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VF EENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLP+DEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
Subjt:  VF-EENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKE+LGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

XP_038905408.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]4.8e-13974.92Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        M SRFFLAFSLCFLC FP F+RSANRLPKLLL +   + SVIRMK  GS + IDPTRV++LSS+PRAFLYKGFLS +ECQHLI+LAK  L+QSLV  + T
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        G S +S +RTSTGMFL +AQD+IVA IE++IAAWTFLP+DNGEP+QILRYENGQ+Y PHFDFFQDPVN+A GGHRIAT+L+YLS+VE+GGETVFP+SP K
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VFE-ENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        + E E  DLSDC+  GYGVKPK GDALLFFSL+PNVT D TSYHGSCPVIEGEKWSATKWIHMLP+ EIWRNP CVDEN  C AWA AGECEKNP YM  
Subjt:  VFE-ENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
          +GSK ELG+CR+SCK CSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

TrEMBL top hitse value%identityAlignment
A0A1S3B814 Procollagen-proline 4-dioxygenase1.5e-13373.07Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        M S F LAFS+ FL   PL + SANR PK+LL +    +SVIRMK  GS+I IDPTRV+QLSS+PRAFLYKGFLS EECQHLI LAK  L QSLV    T
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        G S +S +RTSTGMFL KAQD IVA IE++IAAWTFLP+DNGEP+QILRYENGQ+Y PHFDFFQDP N+A GGHRIAT+L+YLS+VE+GGETVFP+SP K
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VFEENK-DLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        + EE K DLS+C+  GYGV+PK GDALLFFS++PNVT D TSYHGSCPVIEGEKWSATKWIHMLP+DE+WRNP CVDEN+HCSAWAKAGEC+KNP YM  
Subjt:  VFEENK-DLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
          +GSK ELG+CRLSCK CSP S
Subjt:  SSLGSKEELGYCRLSCKACSPPS

A0A6J1DTY4 Procollagen-proline 4-dioxygenase2.7e-13573.83Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLD-DTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI
        MDSR FLAFSLCFLC FPLF RS N +P+LL+D +     S+IRMK  GSSI IDP+RV QLSSQPRAF+YKGFLSAEEC+HLI+LAKD LE+SLV DD+
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLD-DTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI

Query:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPA
        TG S +S +RTSTGMFL K QD IVAGIE++IAAWTFLPVDNGEP+Q+LRYENGQ+Y PHFDFFQDPVN+A GGHRIATVL+YLSNVE GGETVFP+S  
Subjt:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPA

Query:  KV-FEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV
        K+   E K+LSDC+  GY VKPK GDALLFFSLH N TTD +SYHGSCPVI+GEKWSATKWIHML  DEIWR+PDCVD +  C+AWA  GEC KNPGYM+
Subjt:  KV-FEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKACS
            GSK ELGYCR SC ACS
Subjt:  GSSLGSKEELGYCRLSCKACS

A0A6J1DX45 Procollagen-proline 4-dioxygenase2.7e-13573.83Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLD-DTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI
        MDSR FLAFSLCFLC FPLF RS N +P+LL+D +     S+IRMK  GSSI IDP+RV QLSSQPRAF+YKGFLSAEEC+HLI+LAKD LE+SLV DD+
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLD-DTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI

Query:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPA
        TG S +S +RTSTGMFL K QD IVAGIE++IAAWTFLPVDNGEP+Q+LRYENGQ+Y PHFDFFQDPVN+A GGHRIATVL+YLSNVE GGETVFP+S  
Subjt:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPA

Query:  KV-FEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV
        K+   E K+LSDC+  GY VKPK GDALLFFSLH N TTD +SYHGSCPVI+GEKWSATKWIHML  DEIWR+PDCVD +  C+AWA  GEC KNPGYM+
Subjt:  KV-FEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKACS
            GSK ELGYCR SC ACS
Subjt:  GSSLGSKEELGYCRLSCKACS

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase8.8e-18798.45Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEP+QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL+YLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VF-EENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
        VF EENKDL DCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLP+DEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG
Subjt:  VF-EENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVG

Query:  SSLGSKEELGYCRLSCKACSPPS
        SSLGSKEELGYCRLSCKACSPPS
Subjt:  SSLGSKEELGYCRLSCKACSPPS

A0A6J1I5Z9 Procollagen-proline 4-dioxygenase1.1e-189100Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVGS
        VFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVGS
Subjt:  VFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVGS

Query:  SLGSKEELGYCRLSCKACSPPS
        SLGSKEELGYCRLSCKACSPPS
Subjt:  SLGSKEELGYCRLSCKACSPPS

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 64.0e-9657.5Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI-
        MDS++FLAFSL  L  F                           ++   S  +DPTR+ QLS  PRAFLYKGFLS EEC HLI LAK  LE+S+VV D+ 
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI-

Query:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPA
        +G S  S  RTS+GMFL K QDDIVA +EAK+AAWTFLP +NGE LQIL YENGQ+Y PHFD+F D   +  GGHRIATVL+YLSNV +GGETVFP+   
Subjt:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPA

Query:  KVFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV
        K  +   D  S C+  GY VKP+KGDALLFF+LH N TTDP S HGSCPVIEGEKWSAT+WIH+    +  +   CVD++E C  WA AGECEKNP YMV
Subjt:  KVFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKAC
            GS+  LG+CR SCKAC
Subjt:  GSSLGSKEELGYCRLSCKAC

F4JAU3 Prolyl 4-hydroxylase 21.5e-8758.67Show/hide
Query:  IDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYEN
        I+P++V Q+SS+PRAF+Y+GFL+  EC HLI LAK+ L++S V D+  G S+ S  RTS+G F+ K +D IV+GIE K++ WTFLP +NGE LQ+LRYE+
Subjt:  IDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYEN

Query:  GQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPD----SPAKVFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPV
        GQ+Y  HFD+F D VN+A GGHRIATVLLYLSNV +GGETVFPD    S   + E   DLSDC+  G  VKPKKG+ALLFF+L  +   DP S HG CPV
Subjt:  GQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPD----SPAKVFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPV

Query:  IEGEKWSATKWIHMLPLDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC
        IEGEKWSATKWIH+   D+I   + +C D NE C  WA  GEC KNP YMV    G+ E  G CR SCKAC
Subjt:  IEGEKWSATKWIHMLPLDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC

Q8L970 Probable prolyl 4-hydroxylase 71.0e-10760.12Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSR FLAFSLCFL + PL + + NR   L       + SVI+MK   SS   DPTRV QLS  PR FLY+GFLS EEC H I LAK  LE+S+V D+ +
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        G S  S  RTS+GMFL K QDDIV+ +EAK+AAWTFLP +NGE +QIL YENGQ+Y PHFD+F D  N+  GGHRIATVL+YLSNVE+GGETVFP    K
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIW-RNPDCVDENEHCSAWAKAGECEKNPGYMV
          +   D  ++C+  GY VKP+KGDALLFF+LHPN TTD  S HGSCPV+EGEKWSAT+WIH+   +  + +   C+DEN  C  WAKAGEC+KNP YMV
Subjt:  VFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIW-RNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKACS
        GS     ++ GYCR SCKACS
Subjt:  GSSLGSKEELGYCRLSCKACS

Q8LAN3 Probable prolyl 4-hydroxylase 41.1e-9057.19Show/hide
Query:  MDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPL
        +  SS+ ++P++V Q+SS+PRAF+Y+GFL+  EC H++ LAK  L++S V D+ +G S+ S  RTS+G F+ K +D IV+GIE KI+ WTFLP +NGE +
Subjt:  MDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPL

Query:  QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDS---PAKVFEENK-DLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTS
        Q+LRYE+GQ+Y  HFD+F D VN+  GGHR+AT+L+YLSNV +GGETVFPD+     +V  ENK DLSDC+  G  VKP+KGDALLFF+LHP+   DP S
Subjt:  QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDS---PAKVFEENK-DLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTS

Query:  YHGSCPVIEGEKWSATKWIHMLPLDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC
         HG CPVIEGEKWSATKWIH+   D I   + +C D NE C  WA  GEC KNP YMVG++    E  GYCR SCKAC
Subjt:  YHGSCPVIEGEKWSATKWIHMLPLDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 31.2e-6053.92Show/hide
Query:  LSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHF
        LS +PRAF+Y  FLS EEC++LI LAK ++ +S VVD  TG S+ S  RTS+G FL + +D I+  IE +IA +TF+P D+GE LQ+L YE GQ+Y PH+
Subjt:  LSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHF

Query:  DFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAKVFEE--NKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATK
        D+F D  N   GG R+AT+L+YLS+VE GGETVFP +           +LS+C   G  VKP+ GDALLF+S+ P+ T DPTS HG CPVI G KWS+TK
Subjt:  DFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAKVFEE--NKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATK

Query:  WIHM
        W+H+
Subjt:  WIHM

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.1e-8858.67Show/hide
Query:  IDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYEN
        I+P++V Q+SS+PRAF+Y+GFL+  EC HLI LAK+ L++S V D+  G S+ S  RTS+G F+ K +D IV+GIE K++ WTFLP +NGE LQ+LRYE+
Subjt:  IDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYEN

Query:  GQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPD----SPAKVFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPV
        GQ+Y  HFD+F D VN+A GGHRIATVLLYLSNV +GGETVFPD    S   + E   DLSDC+  G  VKPKKG+ALLFF+L  +   DP S HG CPV
Subjt:  GQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPD----SPAKVFEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPV

Query:  IEGEKWSATKWIHMLPLDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC
        IEGEKWSATKWIH+   D+I   + +C D NE C  WA  GEC KNP YMV    G+ E  G CR SCKAC
Subjt:  IEGEKWSATKWIHMLPLDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase7.3e-10960.12Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSR FLAFSLCFL + PL + + NR   L       + SVI+MK   SS   DPTRV QLS  PR FLY+GFLS EEC H I LAK  LE+S+V D+ +
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK
        G S  S  RTS+GMFL K QDDIV+ +EAK+AAWTFLP +NGE +QIL YENGQ+Y PHFD+F D  N+  GGHRIATVL+YLSNVE+GGETVFP    K
Subjt:  GASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAK

Query:  VFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIW-RNPDCVDENEHCSAWAKAGECEKNPGYMV
          +   D  ++C+  GY VKP+KGDALLFF+LHPN TTD  S HGSCPV+EGEKWSAT+WIH+   +  + +   C+DEN  C  WAKAGEC+KNP YMV
Subjt:  VFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIW-RNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKACS
        GS     ++ GYCR SCKACS
Subjt:  GSSLGSKEELGYCRLSCKACS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase2.3e-10256.84Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT
        MDSR FLAFSLCFL + PL + + NR   L       + SVI+MK   SS   DPTRV QLS  PR FLY+GFLS EEC H I LAK  LE+S+V D+ +
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDIT

Query:  GASRSSTDRTS----TGMFLYKAQ----DDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGET
        G S  S D  S    +  F+        DDIV+ +EAK+AAWTFLP +NGE +QIL YENGQ+Y PHFD+F D  N+  GGHRIATVL+YLSNVE+GGET
Subjt:  GASRSSTDRTS----TGMFLYKAQ----DDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGET

Query:  VFPDSPAKVFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIW-RNPDCVDENEHCSAWAKAGEC
        VFP    K  +   D  ++C+  GY VKP+KGDALLFF+LHPN TTD  S HGSCPV+EGEKWSAT+WIH+   +  + +   C+DEN  C  WAKAGEC
Subjt:  VFPDSPAKVFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIW-RNPDCVDENEHCSAWAKAGEC

Query:  EKNPGYMVGSSLGSKEELGYCRLSCKACS
        +KNP YMVGS     ++ GYCR SCKACS
Subjt:  EKNPGYMVGSSLGSKEELGYCRLSCKACS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase2.9e-9757.5Show/hide
Query:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI-
        MDS++FLAFSL  L  F                           ++   S  +DPTR+ QLS  PRAFLYKGFLS EEC HLI LAK  LE+S+VV D+ 
Subjt:  MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDI-

Query:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPA
        +G S  S  RTS+GMFL K QDDIVA +EAK+AAWTFLP +NGE LQIL YENGQ+Y PHFD+F D   +  GGHRIATVL+YLSNV +GGETVFP+   
Subjt:  TGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPA

Query:  KVFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV
        K  +   D  S C+  GY VKP+KGDALLFF+LH N TTDP S HGSCPVIEGEKWSAT+WIH+    +  +   CVD++E C  WA AGECEKNP YMV
Subjt:  KVFEENKD-LSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMV

Query:  GSSLGSKEELGYCRLSCKAC
            GS+  LG+CR SCKAC
Subjt:  GSSLGSKEELGYCRLSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.1e-9257.19Show/hide
Query:  MDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPL
        +  SS+ ++P++V Q+SS+PRAF+Y+GFL+  EC H++ LAK  L++S V D+ +G S+ S  RTS+G F+ K +D IV+GIE KI+ WTFLP +NGE +
Subjt:  MDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPL

Query:  QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDS---PAKVFEENK-DLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTS
        Q+LRYE+GQ+Y  HFD+F D VN+  GGHR+AT+L+YLSNV +GGETVFPD+     +V  ENK DLSDC+  G  VKP+KGDALLFF+LHP+   DP S
Subjt:  QILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDS---PAKVFEENK-DLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDPTS

Query:  YHGSCPVIEGEKWSATKWIHMLPLDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC
         HG CPVIEGEKWSATKWIH+   D I   + +C D NE C  WA  GEC KNP YMVG++    E  GYCR SCKAC
Subjt:  YHGSCPVIEGEKWSATKWIHMLPLDEI-WRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGATTTTTTCTTGCATTTTCTCTTTGTTTCCTCTGTTCATTCCCTCTATTTGCTCGCTCCGCCAATCGCTTGCCGAAATTACTCTTGGACGACACGAAAAC
GGAAGATTCTGTTATTAGGATGAAAATGGACGGTTCCTCCATTAAAATCGATCCCACTCGCGTCGTTCAGCTTTCATCGCAACCTAGGGCTTTCTTATACAAGGGATTTT
TGTCTGCAGAGGAGTGCCAGCATCTTATCGATTTGGCAAAGGATTACCTAGAGCAATCATTGGTGGTCGATGACATTACGGGTGCAAGTCGTTCGAGTACTGACCGGACG
AGTACCGGCATGTTTCTTTATAAGGCTCAGGATGACATAGTTGCTGGCATTGAGGCCAAGATTGCTGCGTGGACGTTCCTTCCCGTCGATAATGGGGAGCCTCTACAAAT
ACTAAGGTATGAAAATGGTCAGCAATATGTACCACATTTTGATTTTTTTCAAGATCCAGTTAATGTAGCTGCTGGTGGTCATCGGATAGCCACAGTCTTGTTGTATTTGT
CCAACGTTGAAAGGGGTGGAGAAACTGTCTTTCCCGATTCTCCGGCTAAAGTATTCGAGGAGAACAAGGATTTGTCCGATTGCTCTACAACCGGTTATGGAGTTAAGCCA
AAGAAGGGCGACGCTTTACTATTCTTCAGTCTCCATCCAAACGTAACGACAGACCCGACGAGCTATCACGGGAGCTGCCCAGTGATAGAGGGGGAGAAGTGGTCTGCAAC
AAAATGGATTCACATGCTACCACTCGATGAGATTTGGAGGAATCCAGATTGTGTGGATGAGAATGAGCACTGTAGTGCATGGGCCAAAGCAGGTGAATGTGAAAAGAACC
CTGGTTATATGGTGGGTTCTTCCTTGGGTTCTAAGGAAGAACTTGGATATTGTAGGCTTAGTTGCAAAGCCTGCTCTCCCCCCTCATAA
mRNA sequenceShow/hide mRNA sequence
ATTCTCGTCTTAAATGAAGCTCGAGTTTCGCCATGGATTCTCGATTTTTTCTTGCATTTTCTCTTTGTTTCCTCTGTTCATTCCCTCTATTTGCTCGCTCCGCCAATCGC
TTGCCGAAATTACTCTTGGACGACACGAAAACGGAAGATTCTGTTATTAGGATGAAAATGGACGGTTCCTCCATTAAAATCGATCCCACTCGCGTCGTTCAGCTTTCATC
GCAACCTAGGGCTTTCTTATACAAGGGATTTTTGTCTGCAGAGGAGTGCCAGCATCTTATCGATTTGGCAAAGGATTACCTAGAGCAATCATTGGTGGTCGATGACATTA
CGGGTGCAAGTCGTTCGAGTACTGACCGGACGAGTACCGGCATGTTTCTTTATAAGGCTCAGGATGACATAGTTGCTGGCATTGAGGCCAAGATTGCTGCGTGGACGTTC
CTTCCCGTCGATAATGGGGAGCCTCTACAAATACTAAGGTATGAAAATGGTCAGCAATATGTACCACATTTTGATTTTTTTCAAGATCCAGTTAATGTAGCTGCTGGTGG
TCATCGGATAGCCACAGTCTTGTTGTATTTGTCCAACGTTGAAAGGGGTGGAGAAACTGTCTTTCCCGATTCTCCGGCTAAAGTATTCGAGGAGAACAAGGATTTGTCCG
ATTGCTCTACAACCGGTTATGGAGTTAAGCCAAAGAAGGGCGACGCTTTACTATTCTTCAGTCTCCATCCAAACGTAACGACAGACCCGACGAGCTATCACGGGAGCTGC
CCAGTGATAGAGGGGGAGAAGTGGTCTGCAACAAAATGGATTCACATGCTACCACTCGATGAGATTTGGAGGAATCCAGATTGTGTGGATGAGAATGAGCACTGTAGTGC
ATGGGCCAAAGCAGGTGAATGTGAAAAGAACCCTGGTTATATGGTGGGTTCTTCCTTGGGTTCTAAGGAAGAACTTGGATATTGTAGGCTTAGTTGCAAAGCCTGCTCTC
CCCCCTCATAAACAAATGCTCACATGCCTCCTTATTTATACACAAATTTGAGGTTAGAGTTCTTTCTTTTTTACACGCACACATGTAAATGACATTGACA
Protein sequenceShow/hide protein sequence
MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQLSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRT
STGMFLYKAQDDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVLLYLSNVERGGETVFPDSPAKVFEENKDLSDCSTTGYGVKP
KKGDALLFFSLHPNVTTDPTSYHGSCPVIEGEKWSATKWIHMLPLDEIWRNPDCVDENEHCSAWAKAGECEKNPGYMVGSSLGSKEELGYCRLSCKACSPPS