; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005190 (gene) of Snake gourd v1 genome

Gene IDTan0005190
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationLG07:58700436..58718465
RNA-Seq ExpressionTan0005190
SyntenyTan0005190
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157663.1 probable prolyl 4-hydroxylase 7 isoform X1 [Momordica charantia]6.4e-14477.36Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQ-SVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL
        MD R  LAFSLCFLC FPLF RS N +P+LLM+  NMG+ S+IRMK GGSSI IDP+RV QLSSQPRAF+YKGFLSAEEC HLINLAKDKL++SLVADD+
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQ-SVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL

Query:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV
        TG SVTS ERTSTGMFL K QD+IVAGIES+IAAWTFLPVDNGEP+Q+LRYE GQKY+PHFDFFQDP N+A GGHRIAT+LMYL++VE+GGETVFPNS V
Subjt:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV

Query:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV
        KLS +EK  LSDCAK+GY VKPK GDALLFFSLH N T D +SYHGSCPVI+GEKWSATKW+HML  +EIWR+PDCVD S  C  WA+ GEC KNPGYM+
Subjt:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV

Query:  GSKDDLGYCRKSCKACSS
        GSK +LGYCRKSC ACSS
Subjt:  GSKDDLGYCRKSCKACSS

XP_022931100.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata]3.0e-14979.38Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        MD RF LAFSLCFLCSFPLF+RSANRLPKLL+++     SVIRMK  GSSI IDPTRV+QLSSQPRAFLYKGFLSAEEC HLI+LAKD L+QSLV DD+T
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD+IVAGIE+KIAAWTFLPVDNGEPIQILRYE GQ+Y PHFDFFQDP N+A GGHRIAT+LMYL++VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK

Query:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-
        + E+E  +L DC+  GYGVKPKKGDALLFFSLHPN T DPTSYHGSCPVIEGEKWSATKW+HMLPV+EIWRNPDCVDE+E+C  WA AGECEKNPGYMV 
Subjt:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-

Query:  ---GSKDDLGYCRKSCKACS
           GSK++LGYCR SCKACS
Subjt:  ---GSKDDLGYCRKSCKACS

XP_022971148.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima]2.5e-14879.06Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        MD RF LAFSLCFLCSFPLF+RSANRLPKLL+++     SVIRMK  GSSI IDPTRV+QLSSQPRAFLYKGFLSAEEC HLI+LAKD L+QSLV DD+T
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD+IVAGIE+KIAAWTFLPVDNGEP+QILRYE GQ+Y PHFDFFQDP N+A GGHRIAT+L+YL++VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK

Query:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-
        + E+ KD LSDC+  GYGVKPKKGDALLFFSLHPN T DPTSYHGSCPVIEGEKWSATKW+HMLP++EIWRNPDCVDE+E+C  WA AGECEKNPGYMV 
Subjt:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-

Query:  ---GSKDDLGYCRKSCKACS
           GSK++LGYCR SCKACS
Subjt:  ---GSKDDLGYCRKSCKACS

XP_023530715.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo]2.1e-15080Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        MD RF LAFSLCFLCSFPLF+RSANRLPKLL+++     SVIRMK  GSSI IDPTRV+QLSSQPRAFLYKGFLSAEEC HLI+LAKD L+QSLV DD+T
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD+IVAGIE+KIAAWTFLPVDNGEPIQILRYE GQ+Y PHFDFFQDP N+A GGHRIAT+LMYL++VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK

Query:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-
        + E+E  +LSDC+  GYGVKPKKGDALLFFSLHPN T DPTSYHGSCPVIEGEKWSATKW+HMLPV+EIWRNPDCVDE+E+C  WA AGECEKNPGYMV 
Subjt:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-

Query:  ---GSKDDLGYCRKSCKACS
           GSK+DLGYCR SCKACS
Subjt:  ---GSKDDLGYCRKSCKACS

XP_038905408.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]8.4e-15282.28Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        M  RF LAFSLCFLC FP FSRSANRLPKLL++N NM QSVIRMK  GS + IDPTRVI+LSS+PRAFLYKGFLS +EC HLINLAK KLQQSLVA + T
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK
        G SVTS+ERTSTGMFLT+AQDEIVA IES+IAAWTFLP+DNGEPIQILRYE GQKYEPHFDFFQDP NIAIGGHRIATILMYL+DVEKGGETVFPNSP+K
Subjt:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK

Query:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMVG
        LSEQE+ +LSDCAK+GYGVKPK GDALLFFSL+PN TPD TSYHGSCPVIEGEKWSATKW+HMLP+ EIWRNP CVDE+  CR WANAGECEKNP YM+G
Subjt:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMVG

Query:  SKDDLGYCRKSCKACS
        SK++LG+CR SCK CS
Subjt:  SKDDLGYCRKSCKACS

TrEMBL top hitse value%identityAlignment
A0A1S3B814 Procollagen-proline 4-dioxygenase3.1e-14477.99Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        M   FLLAFS+ FL   PL S SANR PK+L++N +M +SVIRMK GGS+I IDPTRVIQLSS+PRAFLYKGFLS EEC HLI+LAK KL+QSLVA   T
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK
        G SVTS+ERTSTGMFL KAQD+IVA IES+IAAWTFLP+DNGEPIQILRYE GQKYEPHFDFFQDPGNIAIGGHRIATILMYL+DVEKGGETVFPNSPVK
Subjt:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK

Query:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMVG
        LSE+EK +LS+CAK+GYGV+PK GDALLFFS++PN TPD TSYHGSCPVIEGEKWSATKW+HMLP++E+WRNP CVDE+++C  WA AGEC+KNP YM+G
Subjt:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMVG

Query:  SKDDLGYCRKSCKACSSS
        SK++LG+CR SCK CS S
Subjt:  SKDDLGYCRKSCKACSSS

A0A6J1DTY4 Procollagen-proline 4-dioxygenase3.1e-14477.36Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQ-SVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL
        MD R  LAFSLCFLC FPLF RS N +P+LLM+  NMG+ S+IRMK GGSSI IDP+RV QLSSQPRAF+YKGFLSAEEC HLINLAKDKL++SLVADD+
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQ-SVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL

Query:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV
        TG SVTS ERTSTGMFL K QD+IVAGIES+IAAWTFLPVDNGEP+Q+LRYE GQKY+PHFDFFQDP N+A GGHRIAT+LMYL++VE+GGETVFPNS V
Subjt:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV

Query:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV
        KLS +EK  LSDCAK+GY VKPK GDALLFFSLH N T D +SYHGSCPVI+GEKWSATKW+HML  +EIWR+PDCVD S  C  WA+ GEC KNPGYM+
Subjt:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV

Query:  GSKDDLGYCRKSCKACSS
        GSK +LGYCRKSC ACSS
Subjt:  GSKDDLGYCRKSCKACSS

A0A6J1DX45 Procollagen-proline 4-dioxygenase3.1e-14477.36Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQ-SVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL
        MD R  LAFSLCFLC FPLF RS N +P+LLM+  NMG+ S+IRMK GGSSI IDP+RV QLSSQPRAF+YKGFLSAEEC HLINLAKDKL++SLVADD+
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQ-SVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL

Query:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV
        TG SVTS ERTSTGMFL K QD+IVAGIES+IAAWTFLPVDNGEP+Q+LRYE GQKY+PHFDFFQDP N+A GGHRIAT+LMYL++VE+GGETVFPNS V
Subjt:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV

Query:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV
        KLS +EK  LSDCAK+GY VKPK GDALLFFSLH N T D +SYHGSCPVI+GEKWSATKW+HML  +EIWR+PDCVD S  C  WA+ GEC KNPGYM+
Subjt:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV

Query:  GSKDDLGYCRKSCKACSS
        GSK +LGYCRKSC ACSS
Subjt:  GSKDDLGYCRKSCKACSS

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase1.4e-14979.38Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        MD RF LAFSLCFLCSFPLF+RSANRLPKLL+++     SVIRMK  GSSI IDPTRV+QLSSQPRAFLYKGFLSAEEC HLI+LAKD L+QSLV DD+T
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD+IVAGIE+KIAAWTFLPVDNGEPIQILRYE GQ+Y PHFDFFQDP N+A GGHRIAT+LMYL++VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK

Query:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-
        + E+E  +L DC+  GYGVKPKKGDALLFFSLHPN T DPTSYHGSCPVIEGEKWSATKW+HMLPV+EIWRNPDCVDE+E+C  WA AGECEKNPGYMV 
Subjt:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-

Query:  ---GSKDDLGYCRKSCKACS
           GSK++LGYCR SCKACS
Subjt:  ---GSKDDLGYCRKSCKACS

A0A6J1I5Z9 Procollagen-proline 4-dioxygenase1.2e-14879.06Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        MD RF LAFSLCFLCSFPLF+RSANRLPKLL+++     SVIRMK  GSSI IDPTRV+QLSSQPRAFLYKGFLSAEEC HLI+LAKD L+QSLV DD+T
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD+IVAGIE+KIAAWTFLPVDNGEP+QILRYE GQ+Y PHFDFFQDP N+A GGHRIAT+L+YL++VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK

Query:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-
        + E+ KD LSDC+  GYGVKPKKGDALLFFSLHPN T DPTSYHGSCPVIEGEKWSATKW+HMLP++EIWRNPDCVDE+E+C  WA AGECEKNPGYMV 
Subjt:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV-

Query:  ---GSKDDLGYCRKSCKACS
           GSK++LGYCR SCKACS
Subjt:  ---GSKDDLGYCRKSCKACS

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 67.7e-10057.59Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL-
        MD ++ LAFSL  L  F   S                            S  +DPTR+ QLS  PRAFLYKGFLS EEC HLI LAK KL++S+V  D+ 
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL-

Query:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV
        +G S  SE RTS+GMFLTK QD+IVA +E+K+AAWTFLP +NGE +QIL YE GQKY+PHFD+F D   + +GGHRIAT+LMYL++V KGGETVFPN   
Subjt:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV

Query:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV
        K  + + D+ S CAK GY VKP+KGDALLFF+LH N T DP S HGSCPVIEGEKWSAT+W+H+    +  +   CVD+ E C+ WA+AGECEKNP YMV
Subjt:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV

Query:  GSKDDLGYCRKSCKAC
        GS+  LG+CRKSCKAC
Subjt:  GSKDDLGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 21.2e-8959.04Show/hide
Query:  SIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILR
        S +I+P++V Q+SS+PRAF+Y+GFL+  EC HLI+LAK+ LQ+S VAD+  G S  S+ RTS+G F++K +D IV+GIE K++ WTFLP +NGE +Q+LR
Subjt:  SIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILR

Query:  YEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPN----SPVKLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHG
        YE GQKY+ HFD+F D  NIA GGHRIAT+L+YL++V KGGETVFP+    S   LSE  KD+LSDCAK G  VKPKKG+ALLFF+L  +  PDP S HG
Subjt:  YEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPN----SPVKLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHG

Query:  SCPVIEGEKWSATKWLHMLPVNEI-WRNPDCVDESEYCRVWANAGECEKNPGYMVGSKDDLGYCRKSCKAC
         CPVIEGEKWSATKW+H+   ++I   + +C D +E C  WA  GEC KNP YMVG+ +  G CR+SCKAC
Subjt:  SCPVIEGEKWSATKWLHMLPVNEI-WRNPDCVDESEYCRVWANAGECEKNPGYMVGSKDDLGYCRKSCKAC

F4JNU8 Probable prolyl 4-hydroxylase 83.8e-6246.42Show/hide
Query:  LLAFSLCFLCSFPLFS-RSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQ-LSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGAS
        L+ F +  L    +FS  S N+   + M+   + Q++   +  G     +  R ++ +S +PRAF+Y  FL+ EEC HLI+LAK  + +S V D  TG S
Subjt:  LLAFSLCFLCSFPLFS-RSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQ-LSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGAS

Query:  VTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVKLSE
        + S  RTS+G FL +  DEIV  IE++I+ +TF+P +NGE +Q+L YE+GQ+YEPH D+F D  N+  GG RIAT+LMYL+DV++GGETVFP +   +S+
Subjt:  VTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVKLSE

Query:  QE-KDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVN
            D LS C K G  V PKK DALLF+S+ P+ + DP+S HG CPVI+G KWS+TKW H+   N
Subjt:  QE-KDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVN

Q8L970 Probable prolyl 4-hydroxylase 76.8e-11261.01Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        MD R  LAFSLCFL + PL S + NR   L  ++     SVI+MK   SS   DPTRV QLS  PR FLY+GFLS EEC H I LAK KL++S+VAD+ +
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK
        G SV SE RTS+GMFL+K QD+IV+ +E+K+AAWTFLP +NGE +QIL YE GQKYEPHFD+F D  N+ +GGHRIAT+LMYL++VEKGGETVFP    K
Subjt:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK

Query:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIW-RNPDCVDESEYCRVWANAGECEKNPGYMV
         ++ + D+ ++CAK GY VKP+KGDALLFF+LHPN T D  S HGSCPV+EGEKWSAT+W+H+      + +   C+DE+  C  WA AGEC+KNP YMV
Subjt:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIW-RNPDCVDESEYCRVWANAGECEKNPGYMV

Query:  GSKDDLGYCRKSCKACSS
        GS  D GYCRKSCKACSS
Subjt:  GSKDDLGYCRKSCKACSS

Q8LAN3 Probable prolyl 4-hydroxylase 41.6e-9258.3Show/hide
Query:  SSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQIL
        SS+ ++P++V Q+SS+PRAF+Y+GFL+  EC H+++LAK  L++S VAD+ +G S  SE RTS+G F++K +D IV+GIE KI+ WTFLP +NGE IQ+L
Subjt:  SSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQIL

Query:  RYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV---KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHG
        RYE GQKY+ HFD+F D  NI  GGHR+ATILMYL++V KGGETVFP++ +   ++  + K++LSDCAK G  VKP+KGDALLFF+LHP+  PDP S HG
Subjt:  RYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV---KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHG

Query:  SCPVIEGEKWSATKWLHMLPVNEI-WRNPDCVDESEYCRVWANAGECEKNPGYMVGSKDDLGYCRKSCKAC
         CPVIEGEKWSATKW+H+   + I   + +C D +E C  WA  GEC KNP YMVG+ +  GYCR+SCKAC
Subjt:  SCPVIEGEKWSATKWLHMLPVNEI-WRNPDCVDESEYCRVWANAGECEKNPGYMVGSKDDLGYCRKSCKAC

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 28.8e-9159.04Show/hide
Query:  SIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILR
        S +I+P++V Q+SS+PRAF+Y+GFL+  EC HLI+LAK+ LQ+S VAD+  G S  S+ RTS+G F++K +D IV+GIE K++ WTFLP +NGE +Q+LR
Subjt:  SIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILR

Query:  YEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPN----SPVKLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHG
        YE GQKY+ HFD+F D  NIA GGHRIAT+L+YL++V KGGETVFP+    S   LSE  KD+LSDCAK G  VKPKKG+ALLFF+L  +  PDP S HG
Subjt:  YEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPN----SPVKLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHG

Query:  SCPVIEGEKWSATKWLHMLPVNEI-WRNPDCVDESEYCRVWANAGECEKNPGYMVGSKDDLGYCRKSCKAC
         CPVIEGEKWSATKW+H+   ++I   + +C D +E C  WA  GEC KNP YMVG+ +  G CR+SCKAC
Subjt:  SCPVIEGEKWSATKWLHMLPVNEI-WRNPDCVDESEYCRVWANAGECEKNPGYMVGSKDDLGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase4.8e-11361.01Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        MD R  LAFSLCFL + PL S + NR   L  ++     SVI+MK   SS   DPTRV QLS  PR FLY+GFLS EEC H I LAK KL++S+VAD+ +
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK
        G SV SE RTS+GMFL+K QD+IV+ +E+K+AAWTFLP +NGE +QIL YE GQKYEPHFD+F D  N+ +GGHRIAT+LMYL++VEKGGETVFP    K
Subjt:  GASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVK

Query:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIW-RNPDCVDESEYCRVWANAGECEKNPGYMV
         ++ + D+ ++CAK GY VKP+KGDALLFF+LHPN T D  S HGSCPV+EGEKWSAT+W+H+      + +   C+DE+  C  WA AGEC+KNP YMV
Subjt:  LSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIW-RNPDCVDESEYCRVWANAGECEKNPGYMV

Query:  GSKDDLGYCRKSCKACSS
        GS  D GYCRKSCKACSS
Subjt:  GSKDDLGYCRKSCKACSS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase7.4e-10657.36Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT
        MD R  LAFSLCFL + PL S + NR   L  ++     SVI+MK   SS   DPTRV QLS  PR FLY+GFLS EEC H I LAK KL++S+VAD+ +
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLT

Query:  GASVTSEERTS----TGMFLTKAQ----DEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGET
        G SV SE+  S    +  F+        D+IV+ +E+K+AAWTFLP +NGE +QIL YE GQKYEPHFD+F D  N+ +GGHRIAT+LMYL++VEKGGET
Subjt:  GASVTSEERTS----TGMFLTKAQ----DEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGET

Query:  VFPNSPVKLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIW-RNPDCVDESEYCRVWANAGEC
        VFP    K ++ + D+ ++CAK GY VKP+KGDALLFF+LHPN T D  S HGSCPV+EGEKWSAT+W+H+      + +   C+DE+  C  WA AGEC
Subjt:  VFPNSPVKLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIW-RNPDCVDESEYCRVWANAGEC

Query:  EKNPGYMVGSKDDLGYCRKSCKACSS
        +KNP YMVGS  D GYCRKSCKACSS
Subjt:  EKNPGYMVGSKDDLGYCRKSCKACSS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase5.5e-10157.59Show/hide
Query:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL-
        MD ++ LAFSL  L  F   S                            S  +DPTR+ QLS  PRAFLYKGFLS EEC HLI LAK KL++S+V  D+ 
Subjt:  MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDL-

Query:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV
        +G S  SE RTS+GMFLTK QD+IVA +E+K+AAWTFLP +NGE +QIL YE GQKY+PHFD+F D   + +GGHRIAT+LMYL++V KGGETVFPN   
Subjt:  TGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV

Query:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV
        K  + + D+ S CAK GY VKP+KGDALLFF+LH N T DP S HGSCPVIEGEKWSAT+W+H+    +  +   CVD+ E C+ WA+AGECEKNP YMV
Subjt:  KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMV

Query:  GSKDDLGYCRKSCKAC
        GS+  LG+CRKSCKAC
Subjt:  GSKDDLGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-9358.3Show/hide
Query:  SSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQIL
        SS+ ++P++V Q+SS+PRAF+Y+GFL+  EC H+++LAK  L++S VAD+ +G S  SE RTS+G F++K +D IV+GIE KI+ WTFLP +NGE IQ+L
Subjt:  SSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGASVTSEERTSTGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQIL

Query:  RYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV---KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHG
        RYE GQKY+ HFD+F D  NI  GGHR+ATILMYL++V KGGETVFP++ +   ++  + K++LSDCAK G  VKP+KGDALLFF+LHP+  PDP S HG
Subjt:  RYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPV---KLSEQEKDNLSDCAKIGYGVKPKKGDALLFFSLHPNRTPDPTSYHG

Query:  SCPVIEGEKWSATKWLHMLPVNEI-WRNPDCVDESEYCRVWANAGECEKNPGYMVGSKDDLGYCRKSCKAC
         CPVIEGEKWSATKW+H+   + I   + +C D +E C  WA  GEC KNP YMVG+ +  GYCR+SCKAC
Subjt:  SCPVIEGEKWSATKWLHMLPVNEI-WRNPDCVDESEYCRVWANAGECEKNPGYMVGSKDDLGYCRKSCKAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGTCGATTTCTTCTCGCATTTTCTCTCTGTTTCCTTTGCTCCTTCCCTCTCTTTTCTCGCTCTGCCAATCGCTTGCCGAAATTACTCATGAACAACAAGAACAT
GGGACAATCTGTCATTAGGATGAAACCTGGCGGTTCCTCCATTGTAATCGATCCCACTCGTGTCATTCAGCTTTCATCGCAACCCAGGGCTTTCTTATATAAGGGATTCT
TGTCTGCCGAGGAGTGCCATCATCTTATCAATTTGGCGAAGGATAAGCTACAGCAATCATTGGTGGCCGATGACCTAACGGGTGCGAGTGTTACGAGTGAAGAACGGACG
AGTACCGGCATGTTTCTTACTAAGGCTCAGGATGAAATAGTTGCTGGCATTGAGTCCAAGATTGCTGCGTGGACCTTTCTTCCCGTCGATAATGGGGAGCCTATTCAAAT
ACTAAGGTATGAAATTGGTCAGAAATATGAGCCACATTTTGATTTCTTTCAAGATCCAGGTAATATAGCCATTGGTGGTCATCGGATCGCCACAATCTTGATGTATTTGA
CCGATGTTGAAAAGGGTGGAGAAACAGTCTTTCCGAATTCTCCGGTTAAATTATCCGAGCAGGAGAAGGACAACTTGTCCGATTGTGCTAAGATTGGTTATGGAGTAAAA
CCAAAGAAGGGTGATGCTTTACTGTTCTTCAGTCTCCATCCAAATCGGACGCCAGACCCGACCAGCTACCATGGGAGCTGCCCGGTGATAGAGGGCGAGAAGTGGTCTGC
AACAAAATGGCTTCACATGCTTCCAGTCAATGAGATTTGGAGGAATCCAGATTGTGTGGATGAGAGTGAATACTGTCGTGTGTGGGCAAATGCAGGTGAGTGTGAAAAGA
ATCCTGGTTATATGGTGGGTTCTAAGGATGATCTTGGATATTGTAGGAAGAGTTGCAAGGCGTGCTCTTCCTCATAA
mRNA sequenceShow/hide mRNA sequence
GCTGGAAGAAACTTCCATTTCCATGGTTGTGAACAGCGGTTTTGCAGCAGATCCTCTTCTGGGTTGGCTATTAATTTTCGTTTTAAATGAAGCTCGAGTTTCGACATGGA
TGGTCGATTTCTTCTCGCATTTTCTCTCTGTTTCCTTTGCTCCTTCCCTCTCTTTTCTCGCTCTGCCAATCGCTTGCCGAAATTACTCATGAACAACAAGAACATGGGAC
AATCTGTCATTAGGATGAAACCTGGCGGTTCCTCCATTGTAATCGATCCCACTCGTGTCATTCAGCTTTCATCGCAACCCAGGGCTTTCTTATATAAGGGATTCTTGTCT
GCCGAGGAGTGCCATCATCTTATCAATTTGGCGAAGGATAAGCTACAGCAATCATTGGTGGCCGATGACCTAACGGGTGCGAGTGTTACGAGTGAAGAACGGACGAGTAC
CGGCATGTTTCTTACTAAGGCTCAGGATGAAATAGTTGCTGGCATTGAGTCCAAGATTGCTGCGTGGACCTTTCTTCCCGTCGATAATGGGGAGCCTATTCAAATACTAA
GGTATGAAATTGGTCAGAAATATGAGCCACATTTTGATTTCTTTCAAGATCCAGGTAATATAGCCATTGGTGGTCATCGGATCGCCACAATCTTGATGTATTTGACCGAT
GTTGAAAAGGGTGGAGAAACAGTCTTTCCGAATTCTCCGGTTAAATTATCCGAGCAGGAGAAGGACAACTTGTCCGATTGTGCTAAGATTGGTTATGGAGTAAAACCAAA
GAAGGGTGATGCTTTACTGTTCTTCAGTCTCCATCCAAATCGGACGCCAGACCCGACCAGCTACCATGGGAGCTGCCCGGTGATAGAGGGCGAGAAGTGGTCTGCAACAA
AATGGCTTCACATGCTTCCAGTCAATGAGATTTGGAGGAATCCAGATTGTGTGGATGAGAGTGAATACTGTCGTGTGTGGGCAAATGCAGGTGAGTGTGAAAAGAATCCT
GGTTATATGGTGGGTTCTAAGGATGATCTTGGATATTGTAGGAAGAGTTGCAAGGCGTGCTCTTCCTCATAAAAAATGCTTCACATTCCAATCCACATGCCTCTTTTACT
TATACACAAATCGGAGGTATGGGTTCTTCTTTCTTACACATATACACATGTAAATGACTCGAG
Protein sequenceShow/hide protein sequence
MDGRFLLAFSLCFLCSFPLFSRSANRLPKLLMNNKNMGQSVIRMKPGGSSIVIDPTRVIQLSSQPRAFLYKGFLSAEECHHLINLAKDKLQQSLVADDLTGASVTSEERT
STGMFLTKAQDEIVAGIESKIAAWTFLPVDNGEPIQILRYEIGQKYEPHFDFFQDPGNIAIGGHRIATILMYLTDVEKGGETVFPNSPVKLSEQEKDNLSDCAKIGYGVK
PKKGDALLFFSLHPNRTPDPTSYHGSCPVIEGEKWSATKWLHMLPVNEIWRNPDCVDESEYCRVWANAGECEKNPGYMVGSKDDLGYCRKSCKACSSS