CuGenDBv2

Lag0021342 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021342
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: chr7:6589132..6592003
RNA-Seq Expression: Lag0021342
Synteny: Lag0021342
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
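
The genome location above is written as seqid:start..end. As a minimal sketch, the snippet below splits such a string and computes the span it covers. The function names are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: 1-based, inclusive coordinates (not stated on the page).
    return end - start + 1


seqid, start, end = parse_location("chr7:6589132..6592003")
print(seqid, span_length(start, end))  # chr7 2872
```

Under a 0-based, half-open convention the span would be one base shorter, so the convention should be confirmed before using the value.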


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597483.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.2e-163 | % identity: 94.04
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF +CNLLFI S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD IV DHT+C D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_008438765.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] | e value: 9.1e-164 | % identity: 94.37
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFD I RDHTNC DEN SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

XP_022137761.1 probable prolyl 4-hydroxylase 4 [Momordica charantia] | e value: 3.7e-165 | % identity: 94.37
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+F +C+LLF FSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH++SLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD I+RDHTNCADE+ASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_022973641.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima] | e value: 5.3e-164 | % identity: 94.37
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF +CNLLFI S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD IV DHT+C D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_038903083.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida] | e value: 2.6e-163 | % identity: 94.7
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF   NLLF FSLSIS LLRRASSSYAGSASSIVNPAKVKQISW+PRAFVYEGFLTDLECDHL+SLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPD SSLHGGCPVIEGEKWSATKWIHVDSFD I+RDHT+CADEN SCERWAEL ECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC
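
The unlabeled middle row of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. As a rough sketch, the % identity column can be approximated by counting the letters in that row. The function name is hypothetical, and the assumption that identity is identical columns divided by aligned columns is not confirmed by the page.

```python
def percent_identity(query_segment: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a residue letter."""
    if len(query_segment) != len(midline):
        raise ValueError("query segment and midline must have the same length")
    # Letters in the midline mark identical residues; '+' and ' ' do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query_segment)


# First 20 columns of the first GenBank alignment above.
query = "MAKFRTCNLLFIFSLSISLL"
midline = "MAKF +CNLLFI S+SISLL"
print(percent_identity(query, midline))  # 80.0
```

When copying a midline from the page, its leading whitespace has to be preserved so that columns stay aligned with the Query row.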

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase | e value: 1.1e-162 | % identity: 93.38
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+    NLLF+F+LSIS LL+RAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRS+VADNLSG+SKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFDM+VRDHTNC DEN SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

A0A1S3AWU7 Procollagen-proline 4-dioxygenase | e value: 4.4e-164 | % identity: 94.37
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFD I RDHTNC DEN SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

A0A6J1C7M6 Procollagen-proline 4-dioxygenase | e value: 1.8e-165 | % identity: 94.37
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MA+F +C+LLF FSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH++SLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD I+RDHTNCADE+ASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1GPQ8 Procollagen-proline 4-dioxygenase | e value: 1.3e-163 | % identity: 93.71
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF +CNLLFI S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD IV DHT+C D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1I971 Procollagen-proline 4-dioxygenase | e value: 2.6e-164 | % identity: 94.37
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        MAKF +CNLLFI S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHL+SLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFD IV DHT+C D NASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e value: 1.6e-91 | % identity: 58.42
Query:  FSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        FSLS+ L+  + S     S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHL+ LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E
Subjt:  FSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGD
         K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+ +    +   ++    S CAK+G AVKPRKGD
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+LH N   D +SLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

F4JAU3 Prolyl 4-hydroxylase 2 | e value: 6.0e-126 | % identity: 72.85
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        M+  R   LLF+   +I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHL+SLAK  L+RSAVADN +GES+VS+VRTSSG FI 
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        K KDPIVSGIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE   DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKP+KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I+    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

Q8L970 Probable prolyl 4-hydroxylase 7 | e value: 5.6e-100 | % identity: 58.15
Query:  FRTCNLLFIFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSE
        F   +L F+F+L +     +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH + LAK +L++S VADN SGES  SE
Subjt:  FRTCNLLFIFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSE

Query:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE
        VRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++
Subjt:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE

Query:  TNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSP
           D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKVC
        +  GYCRKSCK C
Subjt:  ELPGYCRKSCKVC

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e value: 4.0e-130 | % identity: 77.47
Query:  LFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH+VSLAKA LKRSAVADN SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  +DLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD IV    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

Q9LN20 Probable prolyl 4-hydroxylase 3 | e value: 4.1e-66 | % identity: 55.98
Query:  ISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        +SW PRAFVY  FL+  EC++L+SLAK  + +S V D+ +G+SK S VRTSSG F+ + +D I+  IE +IA +TF+P ++GE +QVL YE GQKY+ H+
Subjt:  ISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA
        DYF D+ N   GG RMAT+LMYLSDVE+GGETVFP+A  +    +     +LS+C KKG++VKPR GDALLF+S+ P+A  D +SLHGGCPVI G KWS+
Subjt:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSA

Query:  TKWIHVDSF
        TKW+HV  +
Subjt:  TKWIHVDSF

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e value: 4.2e-127 | % identity: 72.85
Query:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH
        M+  R   LLF+   +I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHL+SLAK  L+RSAVADN +GES+VS+VRTSSG FI 
Subjt:  MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK
        K KDPIVSGIEDK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE   DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKP+KG+ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I+    NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e value: 4.0e-101 | % identity: 58.15
Query:  FRTCNLLFIFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSE
        F   +L F+F+L +     +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH + LAK +L++S VADN SGES  SE
Subjt:  FRTCNLLFIFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSE

Query:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE
        VRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++
Subjt:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE

Query:  TNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSP
           D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKVC
        +  GYCRKSCK C
Subjt:  ELPGYCRKSCKVC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e value: 1.6e-94 | % identity: 55.45
Query:  FRTCNLLFIFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGES----
        F   +L F+F+L +     +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH + LAK +L++S VADN SGES    
Subjt:  FRTCNLLFIFSLSI-----SLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGES----

Query:  -KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEE
          VS VR SS    +      D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP    
Subjt:  -KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEE

Query:  SQRRQASETNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNN
          + +A++   D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  N
Subjt:  SQRRQASETNKD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNN

Query:  PEYMVGSPELPGYCRKSCKVC
        P YMVGS +  GYCRKSCK C
Subjt:  PEYMVGSPELPGYCRKSCKVC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e value: 1.2e-92 | % identity: 58.42
Query:  FSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        FSLS+ L+  + S     S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHL+ LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E
Subjt:  FSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRS-AVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGD
         K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+ +    +   ++    S CAK+G AVKPRKGD
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+LH N   D +SLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 2.8e-131 | % identity: 77.47
Query:  LFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH+VSLAKA LKRSAVADN SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  +DLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD IV    NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC


Sequences
CDS sequence
ATGGCGAAATTTCGTACTTGTAATCTACTCTTCATCTTCTCATTATCGATCTCCTTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTTGTGTACGAAGGTTTTCTCACGGACTTGGAATGCGATCATCTCGTCTCGCTTGCTAAAGCGGAGTTGA
AGAGATCTGCTGTTGCGGATAATTTGTCTGGAGAGAGTAAGGTCAGCGAAGTCCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTGCAGCATGGACATTCCTGCCAAAAGAAAATGGGGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTACGATGCACACTTTGATTACTTTGC
TGATAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACTGTTCTTATGTATCTTTCCGACGTAGAAAAAGGCGGTGAAACAGTGTTTCCTTCTGCAGAGGAATCTC
AAAGACGCCAGGCTTCTGAAACAAACAAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTGAAACCACGGAAAGGTGACGCTCTTCTCTTCTTCAGTCTTCATCCA
AATGCTATTCCAGACACGAGTAGTCTGCACGGAGGTTGCCCTGTGATTGAAGGTGAAAAATGGTCAGCAACGAAGTGGATTCATGTCGATTCTTTCGACATGATTGTGAG
AGACCATACCAATTGTGCTGATGAGAATGCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCGGAGTATATGGTCGGATCCCCTGAGCTTCCTGGCT
ACTGCAGGAAAAGTTGTAAGGTGTGTTGA
mRNA sequence
ATGGCGAAATTTCGTACTTGTAATCTACTCTTCATCTTCTCATTATCGATCTCCTTGCTTCTCCGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAA
TCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTTGTGTACGAAGGTTTTCTCACGGACTTGGAATGCGATCATCTCGTCTCGCTTGCTAAAGCGGAGTTGA
AGAGATCTGCTGTTGCGGATAATTTGTCTGGAGAGAGTAAGGTCAGCGAAGTCCGAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTGCAGCATGGACATTCCTGCCAAAAGAAAATGGGGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTACGATGCACACTTTGATTACTTTGC
TGATAAGGTTAATATTGCCCGAGGTGGACATCGAATGGCAACTGTTCTTATGTATCTTTCCGACGTAGAAAAAGGCGGTGAAACAGTGTTTCCTTCTGCAGAGGAATCTC
AAAGACGCCAGGCTTCTGAAACAAACAAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTGAAACCACGGAAAGGTGACGCTCTTCTCTTCTTCAGTCTTCATCCA
AATGCTATTCCAGACACGAGTAGTCTGCACGGAGGTTGCCCTGTGATTGAAGGTGAAAAATGGTCAGCAACGAAGTGGATTCATGTCGATTCTTTCGACATGATTGTGAG
AGACCATACCAATTGTGCTGATGAGAATGCAAGTTGTGAGAGATGGGCTGAACTCGGCGAGTGCACGAATAACCCGGAGTATATGGTCGGATCCCCTGAGCTTCCTGGCT
ACTGCAGGAAAAGTTGTAAGGTGTGTTGA
Protein sequence
MAKFRTCNLLFIFSLSISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLVSLAKAELKRSAVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGI
EDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNKDLSDCAKKGIAVKPRKGDALLFFSLHP
NAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDMIVRDHTNCADENASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
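
The protein sequence above should correspond to a translation of the CDS sequence. The sketch below shows one way to check that with the standard genetic code (NCBI translation table 1). The function and variable names are hypothetical, and the snippet assumes an uppercase or lowercase A/C/G/T sequence with line breaks already present or removed.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS sequence above.
print(translate("ATGGCGAAATTTCGTACTTGTAATCTACTC"))  # MAKFRTCNLL
print(translate("AAGGTGTGTTGA"))  # KVC
```

Passing the full CDS lines to `translate` and comparing the result with the joined protein lines would complete the check.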