; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018483 (gene) of Snake gourd v1 genome

Gene IDTan0018483
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationLG10:7763039..7766806
RNA-Seq ExpressionTan0018483
SyntenyTan0018483
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597483.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia]1.5e-16695.03Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILS+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLSNVEKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT+SLHGGCPV+EGEKWSATKWIHVD+FD IV DHT C+D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_022137761.1 probable prolyl 4-hydroxylase 4 [Momordica charantia]6.3e-16594.04Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA+FCSC+LLF  SLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPV+EGEKWSATKWIHVD+FD I+RDHT C DE+ASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_022954026.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata]3.3e-16694.7Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILS+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLSNVEKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNA+PDT+SLHGGCPV+EGEKWSATKWIHVD+FD IV DHT C+D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_022973641.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima]6.7e-16795.36Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILS+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLSNVEKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT+SLHGGCPV+EGEKWSATKWIHVD+FD IV DHT C+D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_023539189.1 probable prolyl 4-hydroxylase 4 [Cucurbita pepo subsp. pepo]4.4e-16694.7Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILS+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYD H+DYF DKVNIARGGHRMATVLMYLSNVEKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT+SLHGGCPV+EGEKWSATKWIHVD+FD IV DHT C+D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

TrEMBL top hitse value%identityAlignment
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase3.9e-16092.38Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA+    NLLF+ +LSIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSG+SKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDT SLHGGCPV+EGEKWSATKWI VD+FDM+VRDHT C DEN SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

A0A1S3AWU7 Procollagen-proline 4-dioxygenase3.5e-16193.05Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+ +L+IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDT SLHGGCPV+EGEKWSATKWIHVD+FD I RDHT C DEN SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

A0A6J1C7M6 Procollagen-proline 4-dioxygenase3.0e-16594.04Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA+FCSC+LLF  SLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPV+EGEKWSATKWIHVD+FD I+RDHT C DE+ASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1GPQ8 Procollagen-proline 4-dioxygenase1.6e-16694.7Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILS+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLSNVEKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNA+PDT+SLHGGCPV+EGEKWSATKWIHVD+FD IV DHT C+D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1I971 Procollagen-proline 4-dioxygenase3.3e-16795.36Show/hide
Query:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILS+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLSNVEKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT+SLHGGCPV+EGEKWSATKWIHVD+FD IV DHT C+D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 62.5e-9258.36Show/hide
Query:  FILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSG
        + L+ S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ 
Subjt:  FILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLSNV KGGETVFP+    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LH N   D  SLHG CPV+EGEKWSAT+WIHV +F    +    C+D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  GDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

F4JAU3 Prolyl 4-hydroxylase 22.3e-12573.54Show/hide
Query:  ILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        +L ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RS+VADN +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  ILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGD
        DK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLSNV KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+L  +AIPD  SLHGGCPV+EGEKWSATKWIHVD+FD I+     C D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK C
Subjt:  ALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

Q8L970 Probable prolyl 4-hydroxylase 75.6e-10058.96Show/hide
Query:  CSCNLLFILSLSISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSG
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SGES  SEVRTSSG
Subjt:  CSCNLLFILSLSISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSG

Query:  AFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNED-L
         F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  
Subjt:  AFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNED-L

Query:  SDCAKKGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYC
        ++CAK+G AVKPRKGDALLFF+LHPNA  D+ SLHG CPV+EGEKWSAT+WIHV +F+      + C+DEN SCE+WA+ GEC  NP YMVGS +  GYC
Subjt:  SDCAKKGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYC

Query:  RKSCKVC
        RKSCK C
Subjt:  RKSCKVC

Q8LAN3 Probable prolyl 4-hydroxylase 41.5e-12976.45Show/hide
Query:  LFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRS+VADN SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLSNV KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+AIPD  SLHGGCPV+EGEKWSATKWIHVD+FD IV     C D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

Q9LN20 Probable prolyl 4-hydroxylase 32.6e-6555.5Show/hide
Query:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        +SW PRAFVY  FL+  EC++LISLAK  + +S+V D+ +G+SK S VRTSSG F+ + +D I+  IE +IA +TF+P ++GE +QVL YE GQKY+ H+
Subjt:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSA
        DYF D+ N   GG RMAT+LMYLS+VE+GGETVFP+A  +    +     +LS+C KKG++VKPR GDALLF+S+ P+A  D  SLHGGCPV+ G KWS+
Subjt:  DYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSA

Query:  TKWIHVDAF
        TKW+HV  +
Subjt:  TKWIHVDAF

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.6e-12673.54Show/hide
Query:  ILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        +L ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RS+VADN +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  ILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGD
        DK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLSNV KGGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+L  +AIPD  SLHGGCPV+EGEKWSATKWIHVD+FD I+     C D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK C
Subjt:  ALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase4.0e-10158.96Show/hide
Query:  CSCNLLFILSLSISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSG
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SGES  SEVRTSSG
Subjt:  CSCNLLFILSLSISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSG

Query:  AFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNED-L
         F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  
Subjt:  AFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNED-L

Query:  SDCAKKGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYC
        ++CAK+G AVKPRKGDALLFF+LHPNA  D+ SLHG CPV+EGEKWSAT+WIHV +F+      + C+DEN SCE+WA+ GEC  NP YMVGS +  GYC
Subjt:  SDCAKKGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYC

Query:  RKSCKVC
        RKSCK C
Subjt:  RKSCKVC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.6e-9456.19Show/hide
Query:  CSCNLLFILSLSISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGES-----KVSEV
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SGES      VS V
Subjt:  CSCNLLFILSLSISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGES-----KVSEV

Query:  RTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQA
        R SS    +      D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A
Subjt:  RTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQA

Query:  SETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVG
        ++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D+ SLHG CPV+EGEKWSAT+WIHV +F+      + C+DEN SCE+WA+ GEC  NP YMVG
Subjt:  SETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVG

Query:  SPELPGYCRKSCKVC
        S +  GYCRKSCK C
Subjt:  SPELPGYCRKSCKVC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.8e-9358.36Show/hide
Query:  FILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSG
        + L+ S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ 
Subjt:  FILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLSNV KGGETVFP+    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LH N   D  SLHG CPV+EGEKWSAT+WIHV +F    +    C+D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  GDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-13076.45Show/hide
Query:  LFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRS+VADN SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLSNV KGGETVFP AE   RR  SE  EDLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+AIPD  SLHGGCPV+EGEKWSATKWIHVD+FD IV     C D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAATTTTGTAGTTGCAATCTACTGTTCATCCTCTCATTATCGATCTCGTTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCGCTTGCTAAGGCGGAGCTGA
AGAGATCTTCTGTTGCGGATAATTTGTCCGGAGAGAGTAAGGTCAGCGAGGTCCGGACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTCTCTGGTATA
GAAGACAAAATTGCAGCGTGGACATTTCTGCCAAAAGAAAATGGGGAAGACATTCAAGTGTTGAGATATGAATATGGACAGAAGTATGATGCACACTTTGATTACTTTGC
TGACAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACTGTTCTCATGTATCTTTCTAATGTAGAAAAAGGCGGTGAAACTGTCTTTCCTTCTGCAGAGGAATCTC
AAAGGCGCCAAGCTTCTGAAACAAATGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCCGTGAAACCACGGAAAGGCGATGCTCTTCTCTTCTTCAGTCTCCATCCA
AATGCTATTCCAGACACAAGAAGTCTGCATGGAGGGTGCCCTGTGCTTGAAGGTGAGAAATGGTCAGCAACGAAATGGATTCATGTCGATGCTTTCGACATGATCGTTAG
AGACCATACGAAATGCATCGATGAGAATGCTAGTTGTGAGAGATGGGCCGAACTCGGCGAGTGCACGAACAACCCGGAGTATATGGTGGGATCGCCTGAGCTTCCTGGCT
ACTGCAGGAAAAGTTGTAAGGTGTGTTGA
mRNA sequenceShow/hide mRNA sequence
AATTTTCAGAAGAAAAGAAACATTTTGCTATTTTCATCGCTCTCTCTCTCTTTCTCCAGTGATCCGATTCAGTCAATGGCGAAATTTTGTAGTTGCAATCTACTGTTCAT
CCTCTCATTATCGATCTCGTTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAATCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCC
GGGCTTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCGCTTGCTAAGGCGGAGCTGAAGAGATCTTCTGTTGCGGATAATTTGTCCGGAGAG
AGTAAGGTCAGCGAGGTCCGGACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTCTCTGGTATAGAAGACAAAATTGCAGCGTGGACATTTCTGCCAAA
AGAAAATGGGGAAGACATTCAAGTGTTGAGATATGAATATGGACAGAAGTATGATGCACACTTTGATTACTTTGCTGACAAGGTTAATATTGCCCGAGGTGGACATCGAA
TGGCAACTGTTCTCATGTATCTTTCTAATGTAGAAAAAGGCGGTGAAACTGTCTTTCCTTCTGCAGAGGAATCTCAAAGGCGCCAAGCTTCTGAAACAAATGAAGATCTC
TCAGACTGTGCAAAGAAAGGGATAGCCGTGAAACCACGGAAAGGCGATGCTCTTCTCTTCTTCAGTCTCCATCCAAATGCTATTCCAGACACAAGAAGTCTGCATGGAGG
GTGCCCTGTGCTTGAAGGTGAGAAATGGTCAGCAACGAAATGGATTCATGTCGATGCTTTCGACATGATCGTTAGAGACCATACGAAATGCATCGATGAGAATGCTAGTT
GTGAGAGATGGGCCGAACTCGGCGAGTGCACGAACAACCCGGAGTATATGGTGGGATCGCCTGAGCTTCCTGGCTACTGCAGGAAAAGTTGTAAGGTGTGTTGATAAACT
TGCTCCATTCTTTTATGTGCATCCCTTCCAGGTCTCAGTGTTTTGCAGAGTTTGTCCAAGAACACAAATGTTAGTTTTTTTGAGAGCAGTTTCATTGCAATGCTTTGTGA
TAGTTAGCATGTGTATTTGATAACTCAATCATGTAACATTATTTGAAAAATAAATGTAGTTTTGGAAAGTTATTTGGATTTTAAGGGTTTGTATTGGAATTTTCCAATTG
TTTATGATTTCATTCAACTATTGAAAGGATGGTTCTAAAATGAGGGAAAAAAAAAATTGAAGATGGA
Protein sequenceShow/hide protein sequence
MAKFCSCNLLFILSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGI
EDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSNVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHP
NAIPDTRSLHGGCPVLEGEKWSATKWIHVDAFDMIVRDHTKCIDENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC