CuGenDBv2

CmoCh06G014520 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh06G014520
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: Cmo_Chr06:10488986..10491900
RNA-Seq Expression: CmoCh06G014520
Synteny: CmoCh06G014520
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
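
The genome location in this record (Cmo_Chr06:10488986..10491900) follows a chromosome:start..end pattern. The following is a minimal sketch of how such a string could be split into its parts and the span length computed. The function names are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Cmo_Chr06:10488986..10491900")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2915 bp; with half-open coordinates the figure would be one less.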


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597483.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.4e-167; identity: 94.3%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPIVSGIEDKIAAWTFLPK                +NGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETNEDLSDCAKKGIAVKPRKGDALLFF+LHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCK
        MVGSPELPGYCRKSCK
Subjt:  MVGSPELPGYCRKSCK

XP_022137761.1 probable prolyl 4-hydroxylase 4 [Momordica charantia] (e value: 4.9e-158; identity: 88.29%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MA+FCSC+LLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPI+SGIEDKIAAWTFLPK                +NGEDIQVLRYEYGQKYDAH+DYF+DKVNIARGGHRMATVLMYLSNVEKGGETVFP+AEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETNEDLSDCAKKGIAVKPR+GDALLFF+LHPNAVPDT SLHGGCPVIEGEKWSATKWIHVDSFDTI+ DHT+C D +ASCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCK
        MVGSPELPGYCRKSCK
Subjt:  MVGSPELPGYCRKSCK

XP_022954026.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata] (e value: 1.2e-167; identity: 94.62%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPIVSGIEDKIAAWTFLPK                +NGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCK
        MVGSPELPGYCRKSCK
Subjt:  MVGSPELPGYCRKSCK

XP_022973641.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima] (e value: 1.3e-166; identity: 93.99%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPIVSGIEDKIAAWTFLPK                +NGEDIQVLRYEYGQKYDAH+DYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETNEDLSDCAKKGIAVKPRKGDALLFF+LHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCK
        MVGSPELPGYCRKSCK
Subjt:  MVGSPELPGYCRKSCK

XP_023539189.1 probable prolyl 4-hydroxylase 4 [Cucurbita pepo subsp. pepo] (e value: 1.3e-166; identity: 93.99%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPIVSGIEDKIAAWTFLPK                +NGEDIQVLRYEYGQKYD HYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETNEDLSDCAKKGIAVKPRKGDALLFF+LHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCK
        MVGSPELPGYCRKSCK
Subjt:  MVGSPELPGYCRKSCK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase (e value: 1.3e-151; identity: 84.64%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MA+    NLLF+ ++SIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSG+SKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPIVSGIEDKIAAWTFLPK                 NGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETNEDLSDCAKKGIAVKPRKGDALLFF+LHPNA+PDT SLHGGCPVIEGEKWSATKWI VDSFD +V DHT+C D N SCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCKSLS
        MVGSPELPGYCRKSCK+ S
Subjt:  MVGSPELPGYCRKSCKSLS

A0A1S3AWU7 Procollagen-proline 4-dioxygenase (e value: 6.0e-154; identity: 85.58%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+ +++IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPIVSGIEDKIAAWTFLPK                +NGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETN+DLSDCAKKGIAVKPRKGDALLFF+LHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI  DHT+C D N SCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCKSLS
        MVGSPELPGYCRKSCK+ S
Subjt:  MVGSPELPGYCRKSCKSLS

A0A6J1C7M6 Procollagen-proline 4-dioxygenase (e value: 2.4e-158; identity: 88.29%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MA+FCSC+LLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPI+SGIEDKIAAWTFLPK                +NGEDIQVLRYEYGQKYDAH+DYF+DKVNIARGGHRMATVLMYLSNVEKGGETVFP+AEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETNEDLSDCAKKGIAVKPR+GDALLFF+LHPNAVPDT SLHGGCPVIEGEKWSATKWIHVDSFDTI+ DHT+C D +ASCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCK
        MVGSPELPGYCRKSCK
Subjt:  MVGSPELPGYCRKSCK

A0A6J1GPQ8 Procollagen-proline 4-dioxygenase (e value: 5.6e-168; identity: 94.62%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPIVSGIEDKIAAWTFLPK                +NGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCK
        MVGSPELPGYCRKSCK
Subjt:  MVGSPELPGYCRKSCK

A0A6J1I971 Procollagen-proline 4-dioxygenase (e value: 6.2e-167; identity: 93.99%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
        KAKDPIVSGIEDKIAAWTFLPK                +NGEDIQVLRYEYGQKYDAH+DYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ
Subjt:  KAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQ

Query:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
        RRQASETNEDLSDCAKKGIAVKPRKGDALLFF+LHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY
Subjt:  RRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEY

Query:  MVGSPELPGYCRKSCK
        MVGSPELPGYCRKSCK
Subjt:  MVGSPELPGYCRKSCK

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 (e value: 8.5e-89; identity: 56.17%)
Query:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        + L+ S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ 
Subjt:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNE
        +E K+AAWTFLP                ++NGE +Q+L YE GQKYD H+DYF DK  +  GGHR+ATVLMYLSNV KGGETVFPN    + +     ++
Subjt:  IEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNE

Query:  DLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPG
          S CAK+G AVKPRKGDALLFFNLH N   D  SLHG CPVIEGEKWSAT+WIHV SF         CVD++ SC+ WA+ GEC  NP YMVGS    G
Subjt:  DLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPG

Query:  YCRKSCKS
        +CRKSCK+
Subjt:  YCRKSCKS

F4JAU3 Prolyl 4-hydroxylase 2 (e value: 1.7e-121; identity: 69.61%)
Query:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        +L ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVAD+ +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDL
        DK++ WTFLPK                +NGED+QVLRYE+GQKYDAH+DYF DKVNIARGGHR+ATVL+YLSNV KGGETVFP+A+E  RR  SE  +DL
Subjt:  DKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDL

Query:  SDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYC
        SDCAKKGIAVKP+KG+ALLFFNL  +A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD I++   +C D N SCERWA LGEC  NPEYMVG+PE+PG C
Subjt:  SDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYC

Query:  RKSCKS
        R+SCK+
Subjt:  RKSCKS

Q8L970 Probable prolyl 4-hydroxylase 7 (e value: 2.1e-95; identity: 55.25%)
Query:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SGES  SEVRTSSG
Subjt:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG

Query:  AFIHKAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNA
         F+ K +D IVS +E K+AAWTFLP                ++NGE +Q+L YE GQKY+ H+DYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP  
Subjt:  AFIHKAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNA

Query:  EESQRRQASETNED-LSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECT
            + +A++  +D  ++CAK+G AVKPRKGDALLFFNLHPNA  D+ SLHG CPV+EGEKWSAT+WIHV SF+   +  + C+D N SCE+WA+ GEC 
Subjt:  EESQRRQASETNED-LSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECT

Query:  NNPEYMVGSPELPGYCRKSCKSLS
         NP YMVGS +  GYCRKSCK+ S
Subjt:  NNPEYMVGSPELPGYCRKSCKSLS

Q8LAN3 Probable prolyl 4-hydroxylase 4 (e value: 1.9e-125; identity: 72.4%)
Query:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRSAVAD+ SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNE
        IEDKI+ WTFLPK                +NGEDIQVLRYE+GQKYDAH+DYF DKVNI RGGHRMAT+LMYLSNV KGGETVFP+AE   RR  SE  E
Subjt:  IEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNE

Query:  DLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPG
        DLSDCAK+GIAVKPRKGDALLFFNLHP+A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD IV+   +C D N SCERWA LGECT NPEYMVG+ ELPG
Subjt:  DLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPG

Query:  YCRKSCKS
        YCR+SCK+
Subjt:  YCRKSCKS

Q9LN20 Probable prolyl 4-hydroxylase 3 (e value: 2.0e-61; identity: 52%)
Query:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGED
        +SW PRAFVY  FL+  EC++LISLAK  + +S V D  +G+SK S VRTSSG F+ + +D I+  IE +IA +TF+P                  +GE 
Subjt:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGED

Query:  IQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTK
        +QVL YE GQKY+ HYDYF D+ N   GG RMAT+LMYLS+VE+GGETVFP A  +    +     +LS+C KKG++VKPR GDALLF+++ P+A  D  
Subjt:  IQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTK

Query:  SLHGGCPVIEGEKWSATKWIHVDSF
        SLHGGCPVI G KWS+TKW+HV  +
Subjt:  SLHGGCPVIEGEKWSATKWIHVDSF

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 (e value: 1.2e-122; identity: 69.61%)
Query:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        +L ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVAD+ +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDL
        DK++ WTFLPK                +NGED+QVLRYE+GQKYDAH+DYF DKVNIARGGHR+ATVL+YLSNV KGGETVFP+A+E  RR  SE  +DL
Subjt:  DKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDL

Query:  SDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYC
        SDCAKKGIAVKP+KG+ALLFFNL  +A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD I++   +C D N SCERWA LGEC  NPEYMVG+PE+PG C
Subjt:  SDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYC

Query:  RKSCKS
        R+SCK+
Subjt:  RKSCKS

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase (e value: 1.5e-96; identity: 55.25%)
Query:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SGES  SEVRTSSG
Subjt:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG

Query:  AFIHKAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNA
         F+ K +D IVS +E K+AAWTFLP                ++NGE +Q+L YE GQKY+ H+DYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP  
Subjt:  AFIHKAKDPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNA

Query:  EESQRRQASETNED-LSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECT
            + +A++  +D  ++CAK+G AVKPRKGDALLFFNLHPNA  D+ SLHG CPV+EGEKWSAT+WIHV SF+   +  + C+D N SCE+WA+ GEC 
Subjt:  EESQRRQASETNED-LSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECT

Query:  NNPEYMVGSPELPGYCRKSCKSLS
         NP YMVGS +  GYCRKSCK+ S
Subjt:  NNPEYMVGSPELPGYCRKSCKSLS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase (e value: 6.1e-90; identity: 52.71%)
Query:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGES-----KVSEV
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SGES      VS V
Subjt:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGES-----KVSEV

Query:  RTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKG
        R SS    +      D IVS +E K+AAWTFLP                ++NGE +Q+L YE GQKY+ H+DYF D+ N+  GGHR+ATVLMYLSNVEKG
Subjt:  RTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKG

Query:  GETVFPNAEESQRRQASETNED-LSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCER
        GETVFP      + +A++  +D  ++CAK+G AVKPRKGDALLFFNLHPNA  D+ SLHG CPV+EGEKWSAT+WIHV SF+   +  + C+D N SCE+
Subjt:  GETVFPNAEESQRRQASETNED-LSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCER

Query:  WAELGECTNNPEYMVGSPELPGYCRKSCKSLS
        WA+ GEC  NP YMVGS +  GYCRKSCK+ S
Subjt:  WAELGECTNNPEYMVGSPELPGYCRKSCKSLS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase (e value: 6.1e-90; identity: 56.17%)
Query:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        + L+ S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ 
Subjt:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNE
        +E K+AAWTFLP                ++NGE +Q+L YE GQKYD H+DYF DK  +  GGHR+ATVLMYLSNV KGGETVFPN    + +     ++
Subjt:  IEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNE

Query:  DLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPG
          S CAK+G AVKPRKGDALLFFNLH N   D  SLHG CPVIEGEKWSAT+WIHV SF         CVD++ SC+ WA+ GEC  NP YMVGS    G
Subjt:  DLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPG

Query:  YCRKSCKS
        +CRKSCK+
Subjt:  YCRKSCKS

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.4e-126; identity: 72.4%)
Query:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRSAVAD+ SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNE
        IEDKI+ WTFLPK                +NGEDIQVLRYE+GQKYDAH+DYF DKVNI RGGHRMAT+LMYLSNV KGGETVFP+AE   RR  SE  E
Subjt:  IEDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNE

Query:  DLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPG
        DLSDCAK+GIAVKPRKGDALLFFNLHP+A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD IV+   +C D N SCERWA LGECT NPEYMVG+ ELPG
Subjt:  DLSDCAKKGIAVKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPG

Query:  YCRKSCKS
        YCR+SCK+
Subjt:  YCRKSCKS
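
In the alignments above, the unlabeled middle line of each block follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below counts those three cases for one block; the function name is hypothetical and it is not the database's own calculation. The percent identity reported for each hit covers the whole alignment, so a single block will generally give a different ratio.

```python
def midline_stats(midline, aligned_length):
    """Count identities and positives in one BLAST-style midline.

    midline        -- the middle line of an alignment block, without the
                      leading indentation
    aligned_length -- number of columns in the block (length of the
                      Query line's sequence part)
    """
    # Trailing blanks are often lost when a page is copied, so pad back.
    padded = midline.ljust(aligned_length)[:aligned_length]
    identities = sum(1 for ch in padded if ch.isalpha())
    positives = identities + padded.count("+")
    return identities, positives, aligned_length


# Last block of the A0A0A0L5Q6 alignment shown in this record.
query = "MVGSPELPGYCRKSCKSLS"
midline = "MVGSPELPGYCRKSCK+ S"
print(midline_stats(midline, len(query)))
```

For this block the function reports 17 identical columns and 18 positive columns out of 19.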


Sequences
CDS sequence
ATGGCGAAATTTTGTAGTTGCAATCTGCTGTTTATCCTCTCGATATCGATCTCGTTGCTTCTCCGGCGAGCTTCAAGCTCTTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTCGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCGCTTGCTAAAGCGGAGTTGA
AGAGATCTGCTGTTGCGGATCATTTGTCTGGAGAGAGCAAGGTCAGCGAGGTCCGAACTAGCTCTGGGGCATTTATTCATAAAGCCAAGGATCCGATCGTTTCTGGTATT
GAAGACAAAATTGCAGCATGGACATTCCTGCCAAAAGGTAAACCATTAGCAGAAGGCGCAAATTTCTGGTTAAATTCCATAGATAAAAATGGAGAAGACATTCAAGTGTT
GAGATATGAATATGGGCAGAAGTATGATGCCCATTACGATTACTTTACTGACAAGGTTAATATTGCCCGAGGTGGACACCGAATGGCAACTGTTCTTATGTATCTTTCCA
ATGTAGAAAAAGGTGGTGAAACTGTGTTTCCTAATGCCGAGGAATCTCAAAGACGGCAGGCTTCTGAAACAAATGAAGATCTCTCAGACTGTGCAAAGAAAGGCATAGCA
GTGAAACCACGGAAGGGTGATGCTCTTCTCTTCTTCAATCTTCATCCAAATGCTGTTCCAGACACAAAAAGTCTGCATGGAGGTTGCCCTGTGATTGAAGGAGAGAAATG
GTCAGCAACAAAGTGGATCCATGTCGATTCTTTCGACACGATCGTGAGTGATCATACGAGTTGCGTTGATAATAATGCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGT
GCACGAATAACCCGGAGTATATGGTGGGATCTCCTGAGCTTCCTGGCTATTGCAGGAAAAGTTGCAAGAGCTTGTCCAAAAACACAAATGGTTGTTTTCCTGATAGCATT
TTCATTGCAATGTTTTGTGACAGTTAG
mRNA sequence
AAGAAACCTTTTATTATTTTATTTTCATCGCTCTCTCTCTTTCTCCAGTGATCCGATTCAGTCAATGGCGAAATTTTGTAGTTGCAATCTGCTGTTTATCCTCTCGATAT
CGATCTCGTTGCTTCTCCGGCGAGCTTCAAGCTCTTATGCAGGTTCCGCTAGCTCAATCGTCAATCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTCGTG
TATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCGCTTGCTAAAGCGGAGTTGAAGAGATCTGCTGTTGCGGATCATTTGTCTGGAGAGAGCAAGGTCAG
CGAGGTCCGAACTAGCTCTGGGGCATTTATTCATAAAGCCAAGGATCCGATCGTTTCTGGTATTGAAGACAAAATTGCAGCATGGACATTCCTGCCAAAAGGTAAACCAT
TAGCAGAAGGCGCAAATTTCTGGTTAAATTCCATAGATAAAAATGGAGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTATGATGCCCATTACGATTACTTT
ACTGACAAGGTTAATATTGCCCGAGGTGGACACCGAATGGCAACTGTTCTTATGTATCTTTCCAATGTAGAAAAAGGTGGTGAAACTGTGTTTCCTAATGCCGAGGAATC
TCAAAGACGGCAGGCTTCTGAAACAAATGAAGATCTCTCAGACTGTGCAAAGAAAGGCATAGCAGTGAAACCACGGAAGGGTGATGCTCTTCTCTTCTTCAATCTTCATC
CAAATGCTGTTCCAGACACAAAAAGTCTGCATGGAGGTTGCCCTGTGATTGAAGGAGAGAAATGGTCAGCAACAAAGTGGATCCATGTCGATTCTTTCGACACGATCGTG
AGTGATCATACGAGTTGCGTTGATAATAATGCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCGGAGTATATGGTGGGATCTCCTGAGCTTCCTGG
CTATTGCAGGAAAAGTTGCAAGAGCTTGTCCAAAAACACAAATGGTTGTTTTCCTGATAGCATTTTCATTGCAATGTTTTGTGACAGTTAG
Protein sequence
MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGI
EDKIAAWTFLPKGKPLAEGANFWLNSIDKNGEDIQVLRYEYGQKYDAHYDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIA
VKPRKGDALLFFNLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKSLSKNTNGCFPDSI
FIAMFCDS
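
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a small sketch of that check using only the first and last codons of the CDS from this record. The helper name is hypothetical, and the use of the standard nuclear code for this gene is an assumption.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten and last four codons of the CDS listed in this record.
print(translate("ATGGCGAAATTTTGTAGTTGCAATCTGCTG"))
print(translate("TGTGACAGTTAG"))
```

The two fragments translate to MAKFCSCNLL and CDS followed by a stop, matching the start and end of the protein sequence shown above. Pasting the full CDS into `translate` would extend the same check to the whole protein.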