CuGenDBv2

MELO3C006592 (gene) of Melon (DHL92) v4 genome

Gene ID: MELO3C006592
Organism: Cucumis melo DHL92 (Melon (DHL92) v4)
Description: Procollagen-proline 4-dioxygenase
Genome location: chr06:4346427..4349726
RNA-Seq Expression: MELO3C006592
Synteny: MELO3C006592
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
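
The genome location recorded for this gene is written as `chrom:start..end`. As a minimal sketch, the snippet below splits that string into its parts and reports the span of the locus. The helper `parse_location` and the `Location` type are hypothetical names introduced here for illustration, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location such as 'chr06:4346427..4349726'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr06:4346427..4349726")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus spans 3300 bp on chr06.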


Homology
GenBank top hits (e value, % identity, alignment)
KAA0049426.1 putative prolyl 4-hydroxylase 4 [Cucumis melo var. makuwa] (e value: 1.7e-169; identity: 99.01%)
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISW   AFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

XP_004134175.2 probable prolyl 4-hydroxylase 4 [Cucumis sativus] (e value: 4.3e-181; identity: 93.06%)
Query:  LTVNSLKKRWKSENFVKKKKIFIAFFFFFFFISSLSLLVRFSSMAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFL
        +TVNSLKKRWKSEN       FIA FFFFFFISSLSL++ FSSMAE L FNLLFLFTL+IS+LL+RASASYAGSASSIVNPAKVKQISWSPRAFVYEGFL
Subjt:  LTVNSLKKRWKSENFVKKKKIFIAFFFFFFFISSLSLLVRFSSMAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFL

Query:  TDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGH
        TDLECDHLISLAKAELKRSSVADNLSG+SKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGH
Subjt:  TDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGH

Query:  RMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIAR
        RMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFD + R
Subjt:  RMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIAR

Query:  DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
        DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
Subjt:  DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS

XP_008438765.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] (e value: 4.4e-173; identity: 100%)
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

XP_022137761.1 probable prolyl 4-hydroxylase 4 [Momordica charantia] (e value: 2.5e-160; identity: 92.05%)
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA F   +LLF F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI RDHTNC DE+ SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

XP_038903083.1 probable prolyl 4-hydroxylase 4 [Benincasa hispida] (e value: 4.1e-163; identity: 94.39%)
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA+F  FNLLF F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISW+PRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPD SSLHGGCPVIEGEKWSATKWIHVDSFD I RDHT+C DENPSCERWAEL ECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        AC+
Subjt:  ACS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5Q6 Procollagen-proline 4-dioxygenase (e value: 2.1e-181; identity: 93.06%)
Query:  LTVNSLKKRWKSENFVKKKKIFIAFFFFFFFISSLSLLVRFSSMAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFL
        +TVNSLKKRWKSEN       FIA FFFFFFISSLSL++ FSSMAE L FNLLFLFTL+IS+LL+RASASYAGSASSIVNPAKVKQISWSPRAFVYEGFL
Subjt:  LTVNSLKKRWKSENFVKKKKIFIAFFFFFFFISSLSLLVRFSSMAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFL

Query:  TDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGH
        TDLECDHLISLAKAELKRSSVADNLSG+SKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPK+NGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGH
Subjt:  TDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGH

Query:  RMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIAR
        RMATVLMYLSDVEKGGETVFPSAEESQRRQASETN+DLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWI VDSFD + R
Subjt:  RMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIAR

Query:  DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
        DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS
Subjt:  DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKACS

A0A1S3AWU7 Procollagen-proline 4-dioxygenase (e value: 2.1e-173; identity: 100%)
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

A0A5A7U593 Procollagen-proline 4-dioxygenase (e value: 8.3e-170; identity: 99.01%)
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISW   AFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  ACS
        ACS
Subjt:  ACS

A0A6J1C7M6 Procollagen-proline 4-dioxygenase (e value: 1.2e-160; identity: 92.05%)
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA F   +LLF F+L+IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRS+VADNLSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLS+VEKGGETVFPSAEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI RDHTNC DE+ SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

A0A6J1I971 Procollagen-proline 4-dioxygenase (e value: 6.6e-159; identity: 91.39%)
Query:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+ +++IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI  DHT+C D N SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  AC
         C
Subjt:  AC

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 (e value: 4.3e-91; identity: 60.29%)
Query:  SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENG
        S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E K+AAWTFLP+ENG
Subjt:  SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENG

Query:  EDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD
        E +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+ +    +   ++    S CAK+G AVKPRKGDALLFF+LH N   D
Subjt:  EDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD

Query:  TSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
         +SLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKAC
Subjt:  TSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 2 (e value: 7.8e-125; identity: 74.23%)
Query:  LFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        L  + I  +L ++S     S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RS+VADN +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  LFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGD
        DK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE   DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I     NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKAC
Subjt:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

F4JNU8 Probable prolyl 4-hydroxylase 8 (e value: 2.0e-64; identity: 56.13%)
Query:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF
        ISW PRAFVY  FLT+ EC+HLISLAK  + +S V D  +G+S  S VRTSSG F+++  D IV  IE++I+ +TF+P ENGE +QVL YE GQ+Y+ H 
Subjt:  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN--QDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKW
        DYF D+ N+ +GG R+ATVLMYLSDV++GGETVFP+A    +   S+     +LS C K+G++V P+K DALLF+S+ P+A  D SSLHGGCPVI+G KW
Subjt:  DYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETN--QDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKW

Query:  SATKWIHVDSFD
        S+TKW HV  ++
Subjt:  SATKWIHVDSFD

Q8L970 Probable prolyl 4-hydroxylase 7 (e value: 7.1e-102; identity: 60.19%)
Query:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSE
        FL F+L FLFTL +     +  L R+S +  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SGES  SE
Subjt:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSE

Query:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE
        VRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++
Subjt:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE

Query:  TNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSP
           D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKACS
        +  GYCRKSCKACS
Subjt:  ELPGYCRKSCKACS

Q8LAN3 Probable prolyl 4-hydroxylase 4 (e value: 4.0e-129; identity: 76.77%)
Query:  RFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDP
        R  LL  F    S LL ++S S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRS+VADN SGESK SEVRTSSG FI K KDP
Subjt:  RFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDP

Query:  IVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAV
        IVSGIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  +DLSDCAK+GIAV
Subjt:  IVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAV

Query:  KPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KPRKGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I     NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKAC
Subjt:  KPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 (e value: 5.6e-126; identity: 74.23%)
Query:  LFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        L  + I  +L ++S     S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RS+VADN +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  LFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGD
        DK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLS+V KGGETVFP A+E  RR  SE   DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        ALLFF+L  +AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I     NC D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCKAC
Subjt:  ALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase (e value: 5.1e-103; identity: 60.19%)
Query:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSE
        FL F+L FLFTL +     +  L R+S +  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SGES  SE
Subjt:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSE

Query:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE
        VRTSSG F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP      + +A++
Subjt:  VRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASE

Query:  TNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSP
           D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSP

Query:  ELPGYCRKSCKACS
        +  GYCRKSCKACS
Subjt:  ELPGYCRKSCKACS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase (e value: 2.1e-96; identity: 57.45%)
Query:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGES----
        FL F+L FLFTL +     +  L R+S +  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VADN SGES    
Subjt:  FLRFNLLFLFTLTI-----SYLLRRASASYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGES----

Query:  -KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEE
          VS VR SS    +      D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+VEKGGETVFP    
Subjt:  -KVSEVRTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEE

Query:  SQRRQASETNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNN
          + +A++   D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WIHV SF+      + C DEN SCE+WA+ GEC  N
Subjt:  SQRRQASETNQD-LSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNN

Query:  PEYMVGSPELPGYCRKSCKACS
        P YMVGS +  GYCRKSCKACS
Subjt:  PEYMVGSPELPGYCRKSCKACS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase (e value: 3.1e-92; identity: 60.29%)
Query:  SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENG
        S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ +E K+AAWTFLP+ENG
Subjt:  SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSS-VADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENG

Query:  EDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD
        E +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLS+V KGGETVFP+ +    +   ++    S CAK+G AVKPRKGDALLFF+LH N   D
Subjt:  EDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD

Query:  TSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
         +SLHG CPVIEGEKWSAT+WIHV SF    +    C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCKAC
Subjt:  TSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 2.9e-130; identity: 76.77%)
Query:  RFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDP
        R  LL  F    S LL ++S S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRS+VADN SGESK SEVRTSSG FI K KDP
Subjt:  RFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDP

Query:  IVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAV
        IVSGIEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLS+V KGGETVFP AE   RR  SE  +DLSDCAK+GIAV
Subjt:  IVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAV

Query:  KPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC
        KPRKGDALLFF+LHP+AIPD  SLHGGCPVIEGEKWSATKWIHVDSFD I     NC D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCKAC
Subjt:  KPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC


Sequences
CDS sequence
ACACGTGCGAAGTGTGCGAGTGAGACTTTAGAAAATTTGACAGTTAACAGCTTGAAGAAGAGATGGAAATCTGAAAATTTTGTGAAGAAAAAGAAAATTTTTATTGCATT
TTTCTTCTTCTTCTTCTTCATTTCTTCTCTTTCTCTTTTGGTCCGATTCAGTTCCATGGCTGAATTTCTCCGTTTCAATCTACTTTTTCTTTTCACATTAACCATTTCCT
ACCTTCTCCGGCGAGCTTCAGCCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAACCCTGCAAAAGTCAAACAGATTTCCTGGTCTCCTCGGGCTTTTGTCTATGAAGGT
TTTCTCACGGATTTAGAATGCGATCATCTCATTTCCCTTGCTAAAGCGGAGCTGAAGAGATCCTCTGTTGCGGATAATTTGTCCGGAGAGAGCAAAGTCAGCGAGGTCCG
AACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATAGAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATTC
AAGTATTGAGATATGAATATGGGCAGAAGTACGATGCTCACTTTGATTACTTTGCTGACAAGGTTAACATTGCCCGAGGTGGACATCGAATGGCAACCGTTCTCATGTAT
CTTTCCGACGTGGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCAGAGGAATCTCAAAGACGTCAGGCTTCCGAAACAAACCAAGATCTCTCAGATTGTGCAAAGAAAGG
CATAGCAGTAAAACCTCGGAAAGGCGATGCTCTTCTCTTCTTCAGCCTCCATCCAAATGCTATTCCAGACACAAGTAGTCTACATGGTGGGTGCCCTGTGATTGAAGGTG
AGAAATGGTCAGCAACAAAGTGGATTCATGTTGATTCCTTCGACACGATCGCGAGAGACCATACAAATTGTGGTGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACTC
GGCGAGTGCACGAATAACCCAGAGTATATGGTGGGATCTCCTGAGCTTCCTGGCTACTGCAGGAAAAGTTGCAAGGCATGCTCATAA
mRNA sequence
AACACGTGCGAAGTGTGCGAGTGAGACTTTAGAAAATTTGACAGTTAACAGCTTGAAGAAGAGATGGAAATCTGAAAATTTTGTGAAGAAAAAGAAAATTTTTATTGCAT
TTTTCTTCTTCTTCTTCTTCATTTCTTCTCTTTCTCTTTTGGTCCGATTCAGTTCCATGGCTGAATTTCTCCGTTTCAATCTACTTTTTCTTTTCACATTAACCATTTCC
TACCTTCTCCGGCGAGCTTCAGCCTCCTATGCAGGTTCCGCTAGCTCAATCGTCAACCCTGCAAAAGTCAAACAGATTTCCTGGTCTCCTCGGGCTTTTGTCTATGAAGG
TTTTCTCACGGATTTAGAATGCGATCATCTCATTTCCCTTGCTAAAGCGGAGCTGAAGAGATCCTCTGTTGCGGATAATTTGTCCGGAGAGAGCAAAGTCAGCGAGGTCC
GAACTAGCTCTGGGGCGTTTATTCATAAAGCCAAGGATCCTATTGTTTCTGGTATAGAAGACAAAATTGCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATT
CAAGTATTGAGATATGAATATGGGCAGAAGTACGATGCTCACTTTGATTACTTTGCTGACAAGGTTAACATTGCCCGAGGTGGACATCGAATGGCAACCGTTCTCATGTA
TCTTTCCGACGTGGAAAAAGGCGGTGAAACTGTGTTTCCTTCTGCAGAGGAATCTCAAAGACGTCAGGCTTCCGAAACAAACCAAGATCTCTCAGATTGTGCAAAGAAAG
GCATAGCAGTAAAACCTCGGAAAGGCGATGCTCTTCTCTTCTTCAGCCTCCATCCAAATGCTATTCCAGACACAAGTAGTCTACATGGTGGGTGCCCTGTGATTGAAGGT
GAGAAATGGTCAGCAACAAAGTGGATTCATGTTGATTCCTTCGACACGATCGCGAGAGACCATACAAATTGTGGTGATGAAAATCCAAGTTGTGAGAGATGGGCTGAACT
CGGCGAGTGCACGAATAACCCAGAGTATATGGTGGGATCTCCTGAGCTTCCTGGCTACTGCAGGAAAAGTTGCAAGGCATGCTCATAAACTTGGTCCATTCTTTATAATG
AGCATTCCCTTGCCTTTTCGTTTTTGTACCAAAAACAGAAATGTTAGTTTTTCGAGAGCATTTTCATTGAAATGTTTTGTGAGAGTTAGCATTTGTATTGATTACTCAAT
CATGTAACATTATTTGAACACTAACGTAGTTTGCAAAGTTATTTTGGTTTGTATGGATCGTCCAATTATAACTCAGTTAAAGACCTATTTGGATTGATAAGTGCTTAAAT
ATACATTTAGGAGTTGTGTAAATTTAATGTCGAAGAAATTGACATAGAAGGAAGAGAGACAATCTCTTTTTTTAAGAATAGAA
Protein sequence
TRAKCASETLENLTVNSLKKRWKSENFVKKKKIFIAFFFFFFFISSLSLLVRFSSMAEFLRFNLLFLFTLTISYLLRRASASYAGSASSIVNPAKVKQISWSPRAFVYEG
FLTDLECDHLISLAKAELKRSSVADNLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVLMY
LSDVEKGGETVFPSAEESQRRQASETNQDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIHVDSFDTIARDHTNCGDENPSCERWAEL
GECTNNPEYMVGSPELPGYCRKSCKACS
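
The CDS and protein listings above can be cross-checked by translating the nucleotide sequence codon by codon. The sketch below is a minimal illustration, assuming the standard genetic code and a reading frame that starts at the first base of the listed CDS. `translate` is a hypothetical helper written for this example, and the two test strings are the first 27 and last 12 nucleotides copied from the CDS listing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS listing (start and end of the sequence).
cds_start = "ACACGTGCGAAGTGTGCGAGTGAGACT"
cds_end = "GCATGCTCATAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `TRAKCASET` and `ACS`, which agree with the first and last residues of the protein listing. Passing the full CDS, with line breaks included, to `translate` would extend the same check to the whole sequence.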