; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy3G013910 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy3G013910
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationGy14Chr3:10314639..10317916
RNA-Seq ExpressionCsGy3G013910
SyntenyCsGy3G013910
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049426.1 putative prolyl 4-hydroxylase 4 [Cucumis melo var. makuwa]9.11e-20894.72Show/hide
Query:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH
        MAE L FNLLFLFTL+IS+LL+RASASYAGSASSIVNPAKVKQISW   AFVYEGFLTDLECDHLISLAKAELKRSSVAD+LSG+SKVSEVRTSSGAFIH
Subjt:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFD + RDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

XP_004134175.2 probable prolyl 4-hydroxylase 4 [Cucumis sativus]7.55e-24699.71Show/hide
Query:  MTVNSLKKRWKSENFIAIFFFFFFISSLSLVIPFSSMAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH
        MTVNSLKKRWKSENFIAIFFFFFFISSLSLVIPFSSMAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH
Subjt:  MTVNSLKKRWKSENFIAIFFFFFFISSLSLVIPFSSMAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH

Query:  LISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLM
        LISLAKAELKRSSVAD+LSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLM
Subjt:  LISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLM

Query:  YLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGD
        YLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGD
Subjt:  YLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGD

Query:  ENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
        ENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
Subjt:  ENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS

XP_008438765.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo]1.92e-21295.71Show/hide
Query:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH
        MAE L FNLLFLFTL+IS+LL+RASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVAD+LSG+SKVSEVRTSSGAFIH
Subjt:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFD + RDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

XP_022137761.1 probable prolyl 4-hydroxylase 4 [Momordica charantia]1.42e-20090.07Show/hide
Query:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH
        MA     +LLF F+LSIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRS+VAD+LSG+SKVSEVRTSSGAFIH
Subjt:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWI VDSFD ++RDHTNC DE+ SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

XP_038903083.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida]5.89e-20792.74Show/hide
Query:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH
        MA+  CFNLLF F+LSIS LL+RAS+SYAGSASSIVNPAKVKQISW+PRAFVYEGFLTDLECDHLISLAKAELKRSSVAD+LSG+SKVSEVRTSSGAFIH
Subjt:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPD SSLHGGCPVIEGEKWSATKWI VDSFD ++RDHT+C DENPSCERWAEL ECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        AC+
Subjt:  ACS

TrEMBL top hitse value%identityAlignment
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase3.66e-24699.71Show/hide
Query:  MTVNSLKKRWKSENFIAIFFFFFFISSLSLVIPFSSMAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH
        MTVNSLKKRWKSENFIAIFFFFFFISSLSLVIPFSSMAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH
Subjt:  MTVNSLKKRWKSENFIAIFFFFFFISSLSLVIPFSSMAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH

Query:  LISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLM
        LISLAKAELKRSSVAD+LSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLM
Subjt:  LISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLM

Query:  YLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGD
        YLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGD
Subjt:  YLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGD

Query:  ENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
        ENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
Subjt:  ENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS

A0A1S3AWU7 Procollagen-proline 4-dioxygenase9.31e-21395.71Show/hide
Query:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH
        MAE L FNLLFLFTL+IS+LL+RASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVAD+LSG+SKVSEVRTSSGAFIH
Subjt:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFD + RDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

A0A5A7U593 Procollagen-proline 4-dioxygenase4.41e-20894.72Show/hide
Query:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH
        MAE L FNLLFLFTL+IS+LL+RASASYAGSASSIVNPAKVKQISW   AFVYEGFLTDLECDHLISLAKAELKRSSVAD+LSG+SKVSEVRTSSGAFIH
Subjt:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFD + RDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

A0A6J1C7M6 Procollagen-proline 4-dioxygenase6.89e-20190.07Show/hide
Query:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH
        MA     +LLF F+LSIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRS+VAD+LSG+SKVSEVRTSSGAFIH
Subjt:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWI VDSFD ++RDHTNC DE+ SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

A0A6J1I971 Procollagen-proline 4-dioxygenase2.30e-19990.07Show/hide
Query:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH
        MA+    NLLF+ ++SIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD LSG+SKVSEVRTSSGAFIH
Subjt:  MAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWI VDSFD +V DHT+C D N SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.7e-8957Show/hide
Query:  FLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDL-SGKSKVSEVRTSSGAFIHKAKDPIVSG
        +    S+S LL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S V  D+ SG+S+ SEVRTSSG F+ K +D IV+ 
Subjt:  FLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDL-SGKSKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K+AAWTFLP++NGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        GDALLFF+LH N   D +SLHG CPVIEGEKWSAT+WI V SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKAC
Subjt:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 23.1e-12373.04Show/hide
Query:  LFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSG
        L LF   +  LLQ +S     S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RS+VAD+ +G+S+VS+VRTSSG FI K KDPIVSG
Subjt:  LFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDK++ WTFLPK+NGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+K
Subjt:  IEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        G+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWI VDSFD ++    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKAC
Subjt:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 78.3e-10058.79Show/hide
Query:  LCFNLLFLFTLSI-----SFLLQRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEV
        L F+L FLFTL +     +  L R+S +  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SG+S  SEV
Subjt:  LCFNLLFLFTLSI-----SFLLQRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEV

Query:  RTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASET
        RTSSG F+ K +D IVS +E K+AAWTFLP++NGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++ 
Subjt:  RTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASET

Query:  NED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPE
         +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WI V SF+      + C DEN SCE+WA+ GEC  NP YMVGS +
Subjt:  NED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPE

Query:  LPGYCRKSCKACS
          GYCRKSCKACS
Subjt:  LPGYCRKSCKACS

Q8LAN3 Probable prolyl 4-hydroxylase 45.5e-12876.53Show/hide
Query:  LLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVS
        LL  F    S LLQ +S S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRS+VAD+ SG+SK SEVRTSSG FI K KDPIVS
Subjt:  LLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR
        GIEDKI+ WTFLPK+NGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPR
Subjt:  GIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWI VDSFD +V    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKAC
Subjt:  KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 32.7e-6656.94Show/hide
Query:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHF
        +SW PRAFVY  FL+  EC++LISLAK  + +S+V D  +GKSK S VRTSSG F+ + +D I+  IE +IA +TF+P D+GE +QVL YE GQKY+ H+
Subjt:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA
        DYF D+ N   GG RMAT+LMYLSDVE+GGETVFP+A  +    +     +LS+C KKG++VKPR GDALLF+S+ P+A  D +SLHGGCPVI G KWS+
Subjt:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA

Query:  TKWIRVDSF
        TKW+ V  +
Subjt:  TKWIRVDSF

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 22.2e-12473.04Show/hide
Query:  LFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSG
        L LF   +  LLQ +S     S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RS+VAD+ +G+S+VS+VRTSSG FI K KDPIVSG
Subjt:  LFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDK++ WTFLPK+NGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+K
Subjt:  IEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        G+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWI VDSFD ++    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKAC
Subjt:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase5.9e-10158.79Show/hide
Query:  LCFNLLFLFTLSI-----SFLLQRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEV
        L F+L FLFTL +     +  L R+S +  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SG+S  SEV
Subjt:  LCFNLLFLFTLSI-----SFLLQRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEV

Query:  RTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASET
        RTSSG F+ K +D IVS +E K+AAWTFLP++NGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++ 
Subjt:  RTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASET

Query:  NED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPE
         +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WI V SF+      + C DEN SCE+WA+ GEC  NP YMVGS +
Subjt:  NED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPE

Query:  LPGYCRKSCKACS
          GYCRKSCKACS
Subjt:  LPGYCRKSCKACS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase2.4e-9456.07Show/hide
Query:  LCFNLLFLFTLSI-----SFLLQRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKS-----
        L F+L FLFTL +     +  L R+S +  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SG+S     
Subjt:  LCFNLLFLFTLSI-----SFLLQRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKS-----

Query:  KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEES
         VS VR SS    +      D IVS +E K+AAWTFLP++NGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP     
Subjt:  KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEES

Query:  QRRQASETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNP
         + +A++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WI V SF+      + C DEN SCE+WA+ GEC  NP
Subjt:  QRRQASETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNP

Query:  EYMVGSPELPGYCRKSCKACS
         YMVGS +  GYCRKSCKACS
Subjt:  EYMVGSPELPGYCRKSCKACS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.2e-9057Show/hide
Query:  FLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDL-SGKSKVSEVRTSSGAFIHKAKDPIVSG
        +    S+S LL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S V  D+ SG+S+ SEVRTSSG F+ K +D IV+ 
Subjt:  FLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDL-SGKSKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K+AAWTFLP++NGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        GDALLFF+LH N   D +SLHG CPVIEGEKWSAT+WI V SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKAC
Subjt:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.9e-12976.53Show/hide
Query:  LLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVS
        LL  F    S LLQ +S S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRS+VAD+ SG+SK SEVRTSSG FI K KDPIVS
Subjt:  LLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVS

Query:  GIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR
        GIEDKI+ WTFLPK+NGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPR
Subjt:  GIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWI VDSFD +V    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKAC
Subjt:  KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGTTAACAGCTTGAAGAAGAGATGGAAATCTGAAAATTTTATTGCAATTTTCTTCTTCTTCTTCTTCATTTCTTCTCTTTCTCTAGTGATCCCATTCAGTTCCAT
GGCGGAATTACTCTGTTTCAATCTACTCTTTCTCTTCACATTATCCATCTCCTTTCTTCTCCAGCGAGCTTCAGCCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAACC
CTGCAAAAGTCAAACAGATTTCATGGTCTCCTCGGGCTTTTGTGTATGAAGGTTTTCTCACGGATTTAGAATGCGATCATCTCATTTCCCTCGCTAAAGCGGAGCTGAAG
AGATCTTCTGTTGCGGATGATTTGTCCGGAAAGAGCAAAGTCAGCGAGGTTCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATAGA
AGACAAAATTGCAGCATGGACATTTCTGCCAAAAGATAATGGCGAAGACATTCAAGTGTTGAGATATGAATATGGACAGAAGTACGATGCACACTTTGATTACTTTGCTG
ACAAGGTTAACATTGCTCGAGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCTGATGTAGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCGGAGGAATCTCAA
AGACGCCAGGCTTCCGAAACAAACGAAGATCTTTCAGATTGTGCAAAGAAAGGCATAGCAGTAAAACCCCGGAAAGGTGACGCTCTTCTCTTCTTCAGCCTCCATCCAAA
TGCTATTCCAGACACAAGTAGTCTACATGGTGGGTGCCCTGTGATTGAAGGTGAGAAATGGTCAGCAACAAAGTGGATTCGTGTCGATTCCTTCGACATGGTCGTGAGAG
ACCATACAAATTGCGGTGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCGGAGTATATGGTGGGATCTCCTGAGCTTCCTGGCTAC
TGCCGGAAAAGTTGTAAGGCATGTTCATAA
mRNA sequenceShow/hide mRNA sequence
CGTGCGAAGTGTGTGAGTGAGAGTTTAGAAAATATGACGGTTAACAGCTTGAAGAAGAGATGGAAATCTGAAAATTTTATTGCAATTTTCTTCTTCTTCTTCTTCATTTC
TTCTCTTTCTCTAGTGATCCCATTCAGTTCCATGGCGGAATTACTCTGTTTCAATCTACTCTTTCTCTTCACATTATCCATCTCCTTTCTTCTCCAGCGAGCTTCAGCCT
CCTATGCAGGTTCCGCTAGCTCAATCGTCAACCCTGCAAAAGTCAAACAGATTTCATGGTCTCCTCGGGCTTTTGTGTATGAAGGTTTTCTCACGGATTTAGAATGCGAT
CATCTCATTTCCCTCGCTAAAGCGGAGCTGAAGAGATCTTCTGTTGCGGATGATTTGTCCGGAAAGAGCAAAGTCAGCGAGGTTCGAACTAGCTCTGGGGCGTTTATTCA
TAAAGCCAAGGATCCTATTGTTTCTGGTATAGAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGATAATGGCGAAGACATTCAAGTGTTGAGATATGAATATGGAC
AGAAGTACGATGCACACTTTGATTACTTTGCTGACAAGGTTAACATTGCTCGAGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCTGATGTAGAAAAAGGCGGT
GAAACTGTGTTTCCTTCTGCGGAGGAATCTCAAAGACGCCAGGCTTCCGAAACAAACGAAGATCTTTCAGATTGTGCAAAGAAAGGCATAGCAGTAAAACCCCGGAAAGG
TGACGCTCTTCTCTTCTTCAGCCTCCATCCAAATGCTATTCCAGACACAAGTAGTCTACATGGTGGGTGCCCTGTGATTGAAGGTGAGAAATGGTCAGCAACAAAGTGGA
TTCGTGTCGATTCCTTCGACATGGTCGTGAGAGACCATACAAATTGCGGTGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCGGAG
TATATGGTGGGATCTCCTGAGCTTCCTGGCTACTGCCGGAAAAGTTGTAAGGCATGTTCATAAACTTGGTTCATTCTTTATAATGAGCATTTCCTTGCCTTTTCTTTTTT
GTACCAAAAACAGAAATGTTAGTTTTTCGAGAGCATTTTCGTTGAAATGTTTTGTGAGAGTTAGCATTTGTATTGATTACTCAATCATGTAACATTATTTGAACAATAAT
GTAGTTTGCAAAGTTCTTTTGGTTTTTATGGATAGTCCAATTATAACTCAGTTTAAAAATCTATTTGGATCGATAAGTGCTTACATATACGTATATGTGTAGACTTTAGA
AGTTGTAAAAGTTAAATGTCGAAGAAGA
Protein sequenceShow/hide protein sequence
MTVNSLKKRWKSENFIAIFFFFFFISSLSLVIPFSSMAELLCFNLLFLFTLSISFLLQRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELK
RSSVADDLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQ
RRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGY
CRKSCKACS