; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0009641 (gene) of Chayote v1 genome

Gene IDSed0009641
OrganismSechium edule (Chayote v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationLG08:28224791..28228023
RNA-Seq ExpressionSed0009641
SyntenySed0009641
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443446.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]5.7e-13774.45Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNN-----AANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT
        M   F LAFS+ FL   P  S SANRFPK+L++N     +   MKT GS++ IDPTRVIQLSS+PRAFLYKGFLS EEC  LI+LAK KL+ SLVA   T
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNN-----AANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT

Query:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK
        G SVTS+ERTSTGMFLRKAQDKIVA IES+IAAWTFLP+D+GEP+Q+LRYENGQKY PHFDFFQDP N+AIGGHRIAT+LMYLSDVEKGGETVFPNSPVK
Subjt:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVG
        LSEEEK DLS+CA++GYGV+P+ GDALLFFS+NPN+T D TSYHGSCPVIEGEKWSATKWIHMLP +++WRNP CVDE++HC  WA AG+C+KNP YM+G
Subjt:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVG

Query:  SKDDPGYCRQSCKVCSP
        SK++ G+CR SCKVCSP
Subjt:  SKDDPGYCRQSCKVCSP

XP_022931100.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata]2.3e-14175.39Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT
        M  +FFLAFSLCFLC FP F+RSANR PKLL+++         MK  GSS++IDPTRV+QLSSQPRAFLYKGFLSAEEC  LI+LAKD L+ SLV D +T
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT

Query:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD IVAGIE+KIAAWTFLPVD+GEP+Q+LRYENGQ+YVPHFDFFQDPVN+A GGHRIATVLMYLS+VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-
        + EEE  DL DC+  GYGVKP+KGDALLFFSL+PN+TTDPTSYHGSCPVIEGEKWSATKWIHMLP ++IWRNPDCVDE+EHC  WA AG+CEKNP YMV 
Subjt:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-

Query:  ---GSKDDPGYCRQSCKVCSP
           GSK++ GYCR SCK CSP
Subjt:  ---GSKDDPGYCRQSCKVCSP

XP_022971148.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima]1.5e-14075.39Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT
        M  +FFLAFSLCFLC FP F+RSANR PKLL+++         MK  GSS++IDPTRV+QLSSQPRAFLYKGFLSAEEC  LI+LAKD L+ SLV D +T
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT

Query:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD IVAGIE+KIAAWTFLPVD+GEP+Q+LRYENGQ+YVPHFDFFQDPVN+A GGHRIATVL+YLS+VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-
        + EE K DLSDC+  GYGVKP+KGDALLFFSL+PN+TTDPTSYHGSCPVIEGEKWSATKWIHMLP ++IWRNPDCVDE+EHC  WA AG+CEKNP YMV 
Subjt:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-

Query:  ---GSKDDPGYCRQSCKVCSP
           GSK++ GYCR SCK CSP
Subjt:  ---GSKDDPGYCRQSCKVCSP

XP_023530715.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo]1.6e-14276.01Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT
        M  +FFLAFSLCFLC FP F+RSANR PKLL+++         MK  GSS++IDPTRV+QLSSQPRAFLYKGFLSAEEC  LI+LAKD L+ SLV D +T
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT

Query:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD IVAGIE+KIAAWTFLPVD+GEP+Q+LRYENGQ+YVPHFDFFQDPVN+A GGHRIATVLMYLS+VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-
        + EEE  DLSDC+  GYGVKP+KGDALLFFSL+PN+TTDPTSYHGSCPVIEGEKWSATKWIHMLP ++IWRNPDCVDE+EHC  WA AG+CEKNP YMV 
Subjt:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-

Query:  ---GSKDDPGYCRQSCKVCSP
           GSK+D GYCR SCK CSP
Subjt:  ---GSKDDPGYCRQSCKVCSP

XP_038905408.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]9.2e-14377.29Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNN-----AANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT
        M  +FFLAFSLCFLCFFP FSRSANR PKLL++N     +   MKT GS V IDPTRVI+LSS+PRAFLYKGFLS +EC  LINLAK KLQ SLVA   T
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNN-----AANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT

Query:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK
        G SVTS+ERTSTGMFL +AQD+IVA IES+IAAWTFLP+D+GEP+Q+LRYENGQKY PHFDFFQDPVN+AIGGHRIAT+LMYLSDVEKGGETVFPNSP+K
Subjt:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVG
        LSE+E+ DLSDCA++GYGVKP+ GDALLFFSLNPN+T D TSYHGSCPVIEGEKWSATKWIHMLP  +IWRNP CVDE+  C  WANAG+CEKNP YM+G
Subjt:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVG

Query:  SKDDPGYCRQSCKVCSP
        SK++ G+CR SCKVCSP
Subjt:  SKDDPGYCRQSCKVCSP

TrEMBL top hitse value%identityAlignment
A0A0A0LG32 Procollagen-proline 4-dioxygenase3.8e-13473.67Show/hide
Query:  MGCQFFLAFSLCFLCFF--PRFSRSANRFPKLLMNN-----AANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADV
        M   FFL FS+ FL  F  P  S SANRFPKL+++N     +   MKT GS++ IDPTRVIQLSS+PRAFLYKGFLSAEEC  LIN AK KL  SLVA  
Subjt:  MGCQFFLAFSLCFLCFF--PRFSRSANRFPKLLMNN-----AANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADV

Query:  VTGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSP
         TG SVTS+ERTSTGMFL KAQD+IVA IES+IAAWTFLP+D+GEP+Q+LRYENGQKY PHFDFFQDP N+AIGGHRIAT+LMYLS+VEKGGETVFPNSP
Subjt:  VTGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSP

Query:  VKLSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYM
        VKLSEEEK DLS+C ++GYGV+P+ GDALLFFS+NPN+T D TSYHGSCPVIEGEKWSATKWIHMLP ++ WRNP CVDE++HC  WA AG+CEKNP YM
Subjt:  VKLSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYM

Query:  VGSKDDPGYCRQSCKVCSP
        +GSK++ G+CR SCKVCSP
Subjt:  VGSKDDPGYCRQSCKVCSP

A0A1S3B814 Procollagen-proline 4-dioxygenase2.8e-13774.45Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNN-----AANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT
        M   F LAFS+ FL   P  S SANRFPK+L++N     +   MKT GS++ IDPTRVIQLSS+PRAFLYKGFLS EEC  LI+LAK KL+ SLVA   T
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNN-----AANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT

Query:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK
        G SVTS+ERTSTGMFLRKAQDKIVA IES+IAAWTFLP+D+GEP+Q+LRYENGQKY PHFDFFQDP N+AIGGHRIAT+LMYLSDVEKGGETVFPNSPVK
Subjt:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVG
        LSEEEK DLS+CA++GYGV+P+ GDALLFFS+NPN+T D TSYHGSCPVIEGEKWSATKWIHMLP +++WRNP CVDE++HC  WA AG+C+KNP YM+G
Subjt:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVG

Query:  SKDDPGYCRQSCKVCSP
        SK++ G+CR SCKVCSP
Subjt:  SKDDPGYCRQSCKVCSP

A0A6J1DX45 Procollagen-proline 4-dioxygenase1.4e-13374.76Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLM--NNAAN----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVV
        M  + FLAFSLCFLC FP F RS N  P+LLM  NN        MKT GSS+ IDP+RV QLSSQPRAF+YKGFLSAEEC  LINLAKDKL+ SLVAD V
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLM--NNAAN----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVV

Query:  TGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV
        TG SVTS ERTSTGMFL K QDKIVAGIES+IAAWTFLPVD+GEPMQVLRYENGQKY PHFDFFQDPVNMA GGHRIATVLMYLS+VE+GGETVFPNS V
Subjt:  TGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV

Query:  KLSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV
        KLS  EK +LSDCA++GY VKP+ GDALLFFSL+ N TTD +SYHGSCPVI+GEKWSATKWIHML  ++IWR+PDCVD S  C  WA+ G+C KNP YM+
Subjt:  KLSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV

Query:  GSKDDPGYCRQSCKVCS
        GSK + GYCR+SC  CS
Subjt:  GSKDDPGYCRQSCKVCS

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase1.1e-14175.39Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT
        M  +FFLAFSLCFLC FP F+RSANR PKLL+++         MK  GSS++IDPTRV+QLSSQPRAFLYKGFLSAEEC  LI+LAKD L+ SLV D +T
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT

Query:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD IVAGIE+KIAAWTFLPVD+GEP+Q+LRYENGQ+YVPHFDFFQDPVN+A GGHRIATVLMYLS+VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-
        + EEE  DL DC+  GYGVKP+KGDALLFFSL+PN+TTDPTSYHGSCPVIEGEKWSATKWIHMLP ++IWRNPDCVDE+EHC  WA AG+CEKNP YMV 
Subjt:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-

Query:  ---GSKDDPGYCRQSCKVCSP
           GSK++ GYCR SCK CSP
Subjt:  ---GSKDDPGYCRQSCKVCSP

A0A6J1I5Z9 Procollagen-proline 4-dioxygenase7.1e-14175.39Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT
        M  +FFLAFSLCFLC FP F+RSANR PKLL+++         MK  GSS++IDPTRV+QLSSQPRAFLYKGFLSAEEC  LI+LAKD L+ SLV D +T
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAAN-----IMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVT

Query:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK
        GAS +S +RTSTGMFL KAQD IVAGIE+KIAAWTFLPVD+GEP+Q+LRYENGQ+YVPHFDFFQDPVN+A GGHRIATVL+YLS+VE+GGETVFP+SP K
Subjt:  GASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-
        + EE K DLSDC+  GYGVKP+KGDALLFFSL+PN+TTDPTSYHGSCPVIEGEKWSATKWIHMLP ++IWRNPDCVDE+EHC  WA AG+CEKNP YMV 
Subjt:  LSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMV-

Query:  ---GSKDDPGYCRQSCKVCSP
           GSK++ GYCR SCK CSP
Subjt:  ---GSKDDPGYCRQSCKVCSP

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.6e-9758.52Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSL-VADVVTGASV
        M  Q+FLAFSL  L  F + S                       S  +DPTR+ QLS  PRAFLYKGFLS EEC  LI LAK KL+ S+ VADV +G S 
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSL-VADVVTGASV

Query:  TSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLSEE
         SE RTS+GMFL K QD IVA +E+K+AAWTFLP ++GE +Q+L YENGQKY PHFD+F D   + +GGHRIATVLMYLS+V KGGETVFPN   K  + 
Subjt:  TSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLSEE

Query:  EKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDD
        + +  S CA+ GY VKPRKGDALLFF+L+ N TTDP S HGSCPVIEGEKWSAT+WIH+  F K  +   CVD+ E C  WA+AG+CEKNP YMVGS+  
Subjt:  EKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDD

Query:  PGYCRQSCKVC
         G+CR+SCK C
Subjt:  PGYCRQSCKVC

F4JAU3 Prolyl 4-hydroxylase 23.0e-8855.79Show/hide
Query:  LLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAW
        L++  ++  + ++ SS+ I+P++V Q+SS+PRAF+Y+GFL+  EC  LI+LAK+ LQ S VAD   G S  S+ RTS+G F+ K +D IV+GIE K++ W
Subjt:  LLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAW

Query:  TFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV---KLSEEEKNDLSDCARIGYGVKPRKGDALLFFS
        TFLP ++GE +QVLRYE+GQKY  HFD+F D VN+A GGHRIATVL+YLS+V KGGETVFP++     +   E K+DLSDCA+ G  VKP+KG+ALLFF+
Subjt:  TFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV---KLSEEEKNDLSDCARIGYGVKPRKGDALLFFS

Query:  LNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKI-WRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDDPGYCRQSCKVC
        L  +   DP S HG CPVIEGEKWSATKWIH+  F+KI   + +C D +E C  WA  G+C KNP YMVG+ + PG CR+SCK C
Subjt:  LNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKI-WRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDDPGYCRQSCKVC

Q8L970 Probable prolyl 4-hydroxylase 77.1e-10659.68Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMN---NAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGA
        M  + FLAFSLCFL   P  S + NRF     N    +   MKT+ SS   DPTRV QLS  PR FLY+GFLS EEC   I LAK KL+ S+VAD  +G 
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMN---NAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGA

Query:  SVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLS
        SV SE RTS+GMFL K QD IV+ +E+K+AAWTFLP ++GE MQ+L YENGQKY PHFD+F D  N+ +GGHRIATVLMYLS+VEKGGETVFP    K +
Subjt:  SVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLS

Query:  EEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIW-RNPDCVDESEHCGVWANAGDCEKNPAYMVGS
        + + +  ++CA+ GY VKPRKGDALLFF+L+PN TTD  S HGSCPV+EGEKWSAT+WIH+  F + + +   C+DE+  C  WA AG+C+KNP YMVGS
Subjt:  EEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIW-RNPDCVDESEHCGVWANAGDCEKNPAYMVGS

Query:  KDDPGYCRQSCKVCS
          D GYCR+SCK CS
Subjt:  KDDPGYCRQSCKVCS

Q8LAN3 Probable prolyl 4-hydroxylase 43.8e-9156.25Show/hide
Query:  FPKLLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKI
        F  LL ++ + I   + SSV ++P++V Q+SS+PRAF+Y+GFL+  EC  +++LAK  L+ S VAD  +G S  SE RTS+G F+ K +D IV+GIE KI
Subjt:  FPKLLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKI

Query:  AAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV---KLSEEEKNDLSDCARIGYGVKPRKGDALL
        + WTFLP ++GE +QVLRYE+GQKY  HFD+F D VN+  GGHR+AT+LMYLS+V KGGETVFP++ +   ++  E K DLSDCA+ G  VKPRKGDALL
Subjt:  AAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV---KLSEEEKNDLSDCARIGYGVKPRKGDALL

Query:  FFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKI-WRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDDPGYCRQSCKVC
        FF+L+P+   DP S HG CPVIEGEKWSATKWIH+  F++I   + +C D +E C  WA  G+C KNP YMVG+ + PGYCR+SCK C
Subjt:  FFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKI-WRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDDPGYCRQSCKVC

Q9LN20 Probable prolyl 4-hydroxylase 34.0e-6457.84Show/hide
Query:  LSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHF
        LS +PRAF+Y  FLS EEC  LI+LAK  +  S V D  TG S  S  RTS+G FLR+ +DKI+  IE +IA +TF+P DHGE +QVL YE GQKY PH+
Subjt:  LSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHF

Query:  DFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLSEEE-KNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATK
        D+F D  N   GG R+AT+LMYLSDVE+GGETVFP + +  S     N+LS+C + G  VKPR GDALLF+S+ P+ T DPTS HG CPVI G KWS+TK
Subjt:  DFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLSEEE-KNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATK

Query:  WIHM
        W+H+
Subjt:  WIHM

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 22.1e-8955.79Show/hide
Query:  LLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAW
        L++  ++  + ++ SS+ I+P++V Q+SS+PRAF+Y+GFL+  EC  LI+LAK+ LQ S VAD   G S  S+ RTS+G F+ K +D IV+GIE K++ W
Subjt:  LLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKIAAW

Query:  TFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV---KLSEEEKNDLSDCARIGYGVKPRKGDALLFFS
        TFLP ++GE +QVLRYE+GQKY  HFD+F D VN+A GGHRIATVL+YLS+V KGGETVFP++     +   E K+DLSDCA+ G  VKP+KG+ALLFF+
Subjt:  TFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV---KLSEEEKNDLSDCARIGYGVKPRKGDALLFFS

Query:  LNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKI-WRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDDPGYCRQSCKVC
        L  +   DP S HG CPVIEGEKWSATKWIH+  F+KI   + +C D +E C  WA  G+C KNP YMVG+ + PG CR+SCK C
Subjt:  LNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKI-WRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDDPGYCRQSCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase5.1e-10759.68Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMN---NAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGA
        M  + FLAFSLCFL   P  S + NRF     N    +   MKT+ SS   DPTRV QLS  PR FLY+GFLS EEC   I LAK KL+ S+VAD  +G 
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMN---NAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGA

Query:  SVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLS
        SV SE RTS+GMFL K QD IV+ +E+K+AAWTFLP ++GE MQ+L YENGQKY PHFD+F D  N+ +GGHRIATVLMYLS+VEKGGETVFP    K +
Subjt:  SVTSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLS

Query:  EEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIW-RNPDCVDESEHCGVWANAGDCEKNPAYMVGS
        + + +  ++CA+ GY VKPRKGDALLFF+L+PN TTD  S HGSCPV+EGEKWSAT+WIH+  F + + +   C+DE+  C  WA AG+C+KNP YMVGS
Subjt:  EEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIW-RNPDCVDESEHCGVWANAGDCEKNPAYMVGS

Query:  KDDPGYCRQSCKVCS
          D GYCR+SCK CS
Subjt:  KDDPGYCRQSCKVCS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase6.0e-10056.04Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMN---NAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGA
        M  + FLAFSLCFL   P  S + NRF     N    +   MKT+ SS   DPTRV QLS  PR FLY+GFLS EEC   I LAK KL+ S+VAD  +G 
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMN---NAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGA

Query:  SVTSEERTS----TGMFLRKAQ----DKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVF
        SV SE+  S    +  F+        D IV+ +E+K+AAWTFLP ++GE MQ+L YENGQKY PHFD+F D  N+ +GGHRIATVLMYLS+VEKGGETVF
Subjt:  SVTSEERTS----TGMFLRKAQ----DKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVF

Query:  PNSPVKLSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIW-RNPDCVDESEHCGVWANAGDCEK
        P    K ++ + +  ++CA+ GY VKPRKGDALLFF+L+PN TTD  S HGSCPV+EGEKWSAT+WIH+  F + + +   C+DE+  C  WA AG+C+K
Subjt:  PNSPVKLSEEEKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIW-RNPDCVDESEHCGVWANAGDCEK

Query:  NPAYMVGSKDDPGYCRQSCKVCS
        NP YMVGS  D GYCR+SCK CS
Subjt:  NPAYMVGSKDDPGYCRQSCKVCS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.1e-9858.52Show/hide
Query:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSL-VADVVTGASV
        M  Q+FLAFSL  L  F + S                       S  +DPTR+ QLS  PRAFLYKGFLS EEC  LI LAK KL+ S+ VADV +G S 
Subjt:  MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSL-VADVVTGASV

Query:  TSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLSEE
         SE RTS+GMFL K QD IVA +E+K+AAWTFLP ++GE +Q+L YENGQKY PHFD+F D   + +GGHRIATVLMYLS+V KGGETVFPN   K  + 
Subjt:  TSEERTSTGMFLRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLSEE

Query:  EKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDD
        + +  S CA+ GY VKPRKGDALLFF+L+ N TTDP S HGSCPVIEGEKWSAT+WIH+  F K  +   CVD+ E C  WA+AG+CEKNP YMVGS+  
Subjt:  EKNDLSDCARIGYGVKPRKGDALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDD

Query:  PGYCRQSCKVC
         G+CR+SCK C
Subjt:  PGYCRQSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.7e-9256.25Show/hide
Query:  FPKLLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKI
        F  LL ++ + I   + SSV ++P++V Q+SS+PRAF+Y+GFL+  EC  +++LAK  L+ S VAD  +G S  SE RTS+G F+ K +D IV+GIE KI
Subjt:  FPKLLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMFLRKAQDKIVAGIESKI

Query:  AAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV---KLSEEEKNDLSDCARIGYGVKPRKGDALL
        + WTFLP ++GE +QVLRYE+GQKY  HFD+F D VN+  GGHR+AT+LMYLS+V KGGETVFP++ +   ++  E K DLSDCA+ G  VKPRKGDALL
Subjt:  AAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPV---KLSEEEKNDLSDCARIGYGVKPRKGDALL

Query:  FFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKI-WRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDDPGYCRQSCKVC
        FF+L+P+   DP S HG CPVIEGEKWSATKWIH+  F++I   + +C D +E C  WA  G+C KNP YMVG+ + PGYCR+SCK C
Subjt:  FFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKI-WRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDDPGYCRQSCKVC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGTCAATTTTTTCTCGCATTTTCTCTCTGTTTCCTTTGCTTCTTCCCTCGCTTTTCTCGCTCTGCCAATCGCTTCCCGAAATTGCTCATGAACAACGCCGCCAA
CATCATGAAAACGGCCGGTTCCTCCGTTAGAATCGATCCCACCCGTGTCATTCAGCTTTCATCGCAACCCAGGGCTTTCTTATATAAGGGATTTTTGTCAGCAGAGGAGT
GCCATCAACTTATCAATTTGGCGAAGGATAAGTTGCAGCTGTCATTGGTGGCGGATGTGGTAACGGGTGCGAGTGTTACGAGTGAAGAACGGACGAGTACTGGCATGTTT
CTTAGAAAGGCTCAGGATAAAATAGTTGCTGGCATTGAGTCCAAGATTGCGGCGTGGACCTTCCTTCCTGTCGATCATGGTGAGCCTATGCAAGTACTAAGGTACGAGAA
CGGTCAGAAATACGTGCCACATTTTGATTTTTTTCAAGACCCTGTCAATATGGCTATTGGTGGTCATCGGATAGCGACTGTTTTGATGTATTTGTCCGATGTTGAAAAGG
GTGGTGAAACAGTCTTTCCCAATTCTCCGGTTAAATTATCTGAGGAGGAAAAGAATGACTTGTCCGATTGCGCTAGGATTGGTTATGGAGTAAAACCAAGGAAGGGAGAT
GCTTTACTGTTCTTCAGTCTCAATCCAAATATAACGACAGATCCAACGAGCTACCACGGGAGCTGCCCGGTGATAGAGGGGGAGAAGTGGTCTGCAACAAAATGGATTCA
CATGCTTCCATTCAATAAGATTTGGAGGAATCCAGATTGTGTGGATGAGAGTGAGCACTGTGGTGTGTGGGCAAATGCAGGTGATTGTGAGAAGAATCCTGCTTACATGG
TGGGTTCCAAGGATGATCCTGGATATTGTAGGCAGAGTTGCAAAGTCTGCTCTCCCAAATAA
mRNA sequenceShow/hide mRNA sequence
AAAGAAACTGGGAGTTTAGAGAGAGAAAATGCAAAGCAATTTCCCGGTGGAAGAAGAAACTCCCATTTCCATGGTTGTCATCAGCCGTTTCGAAGCAGATACTTATCTGG
GCTGGCTATTAATGTTCGTCTTAATTGAAGCTCGAGTTTCGACATGGGTTGTCAATTTTTTCTCGCATTTTCTCTCTGTTTCCTTTGCTTCTTCCCTCGCTTTTCTCGCT
CTGCCAATCGCTTCCCGAAATTGCTCATGAACAACGCCGCCAACATCATGAAAACGGCCGGTTCCTCCGTTAGAATCGATCCCACCCGTGTCATTCAGCTTTCATCGCAA
CCCAGGGCTTTCTTATATAAGGGATTTTTGTCAGCAGAGGAGTGCCATCAACTTATCAATTTGGCGAAGGATAAGTTGCAGCTGTCATTGGTGGCGGATGTGGTAACGGG
TGCGAGTGTTACGAGTGAAGAACGGACGAGTACTGGCATGTTTCTTAGAAAGGCTCAGGATAAAATAGTTGCTGGCATTGAGTCCAAGATTGCGGCGTGGACCTTCCTTC
CTGTCGATCATGGTGAGCCTATGCAAGTACTAAGGTACGAGAACGGTCAGAAATACGTGCCACATTTTGATTTTTTTCAAGACCCTGTCAATATGGCTATTGGTGGTCAT
CGGATAGCGACTGTTTTGATGTATTTGTCCGATGTTGAAAAGGGTGGTGAAACAGTCTTTCCCAATTCTCCGGTTAAATTATCTGAGGAGGAAAAGAATGACTTGTCCGA
TTGCGCTAGGATTGGTTATGGAGTAAAACCAAGGAAGGGAGATGCTTTACTGTTCTTCAGTCTCAATCCAAATATAACGACAGATCCAACGAGCTACCACGGGAGCTGCC
CGGTGATAGAGGGGGAGAAGTGGTCTGCAACAAAATGGATTCACATGCTTCCATTCAATAAGATTTGGAGGAATCCAGATTGTGTGGATGAGAGTGAGCACTGTGGTGTG
TGGGCAAATGCAGGTGATTGTGAGAAGAATCCTGCTTACATGGTGGGTTCCAAGGATGATCCTGGATATTGTAGGCAGAGTTGCAAAGTCTGCTCTCCCAAATAAAATTT
GTCTCTCATTCCAA
Protein sequenceShow/hide protein sequence
MGCQFFLAFSLCFLCFFPRFSRSANRFPKLLMNNAANIMKTAGSSVRIDPTRVIQLSSQPRAFLYKGFLSAEECHQLINLAKDKLQLSLVADVVTGASVTSEERTSTGMF
LRKAQDKIVAGIESKIAAWTFLPVDHGEPMQVLRYENGQKYVPHFDFFQDPVNMAIGGHRIATVLMYLSDVEKGGETVFPNSPVKLSEEEKNDLSDCARIGYGVKPRKGD
ALLFFSLNPNITTDPTSYHGSCPVIEGEKWSATKWIHMLPFNKIWRNPDCVDESEHCGVWANAGDCEKNPAYMVGSKDDPGYCRQSCKVCSPK