CuGenDBv2

CmoCh17G002790 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh17G002790
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Procollagen-proline 4-dioxygenase
Genome location: Cmo_Chr17:1676125..1680515
RNA-Seq Expression: CmoCh17G002790
Synteny: CmoCh17G002790
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
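
The genome location field above has the form seqid:start..end. As a minimal sketch of how such a string could be split into its parts and turned into a span length, the snippet below assumes the coordinates are 1-based and inclusive (the page does not state this). The names `Locus` and `parse_locus` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' location string into a Locus."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match["seqid"], int(match["start"]), int(match["end"]))


locus = parse_locus("Cmo_Chr17:1676125..1680515")
print(locus.seqid, locus.length)  # Cmo_Chr17 4391
```

Under that assumption the gene spans 4391 bp on Cmo_Chr17, which is longer than the transcript listed further down and would be consistent with the presence of introns.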


Homology
GenBank top hits (e value, % identity, alignment)
KAG6575033.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.3e-169 | identity: 97.07%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDC+DLNESCERWAALGECTKNPEYMVGSPELPGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

KAG7013608.1 putative prolyl 4-hydroxylase 4 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.8e-175 | identity: 99.35%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        +ECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDC+DLNESCERWAALGECTKNPEYMVGSPELPGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

XP_022959148.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata] | e value: 6.6e-170 | identity: 97.72%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

XP_023006272.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima] | e value: 1.1e-167 | identity: 96.74%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFR LLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDC+DLNESCERWAALGECTKNPEYMVGS ELPGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

XP_023547984.1 probable prolyl 4-hydroxylase 4 [Cucurbita pepo subsp. pepo] | e value: 5.2e-167 | identity: 96.42%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        M KFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KD IVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDC+DLNESCERWAALGECTKNPEYMVGS ELPGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCQ5 Procollagen-proline 4-dioxygenase | e value: 1.1e-157 | identity: 89.58%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        M KF NLLF+FLIL SS +RES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKI+AWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVT+GGETVFPLAEK   RRA ETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        SECA++G+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+C+DLNESCERWAALGECTKNPEYMVGSPE+PGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

A0A1S3C816 Procollagen-proline 4-dioxygenase | e value: 8.7e-160 | identity: 91.21%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        M KFRNLLF FLILISS VRES+CSYAGSA++TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKI+AWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        SECA++GIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+C+DLNESCERWAALGECTKNPEYMVGSPE+PGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

A0A5A7SVW6 Procollagen-proline 4-dioxygenase | e value: 8.7e-160 | identity: 91.21%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        M KFRNLLF FLILISS VRES+CSYAGSA++TVDPS+VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKI+AWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEKS  RRA ETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        SECA++GIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLG+IG+C+DLNESCERWAALGECTKNPEYMVGSPE+PGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

A0A6J1H545 Procollagen-proline 4-dioxygenase | e value: 3.2e-170 | identity: 97.72%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

A0A6J1L4G1 Procollagen-proline 4-dioxygenase | e value: 5.1e-168 | identity: 96.74%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MSKFR LLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDKIAAWTFLPK       ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDC+DLNESCERWAALGECTKNPEYMVGS ELPGYC
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSCRIC
Subjt:  RRSCRIC

SwissProt top hits (e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e value: 8.3e-91 | identity: 57.89%
Query:  SYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGI
        S   S + +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K+AAWTFLP+   
Subjt:  SYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGI

Query:  SHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFP-LAEKSPRRRASETDEDLSECARQGIAVKPKKGDALLFFS
            ENGE +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLSNVTKGGETVFP    K+P+ +    D+  S+CA+QG AVKP+KGDALLFF+
Subjt:  SHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFP-LAEKSPRRRASETDEDLSECARQGIAVKPKKGDALLFFS

Query:  LEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC
        L  N   D NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  LEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC

F4JAU3 Prolyl 4-hydroxylase 2 | e value: 3.3e-124 | identity: 69.71%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MS+   LLF+ ++L   V+ +SS     S +S ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK 
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDK++ WTFLPK       ENGED+QVLRYEHGQKY++H+DYF DKVNIARGGHR+ATVL+YLSNVTKGGETVFP A++  RR  SE  +DL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        S+CA++GIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + G+C+D+NESCERWA LGEC KNPEYMVG+PE+PG C
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSC+ C
Subjt:  RRSCRIC

Q8L970 Probable prolyl 4-hydroxylase 7 | e value: 1.6e-97 | identity: 58.16%
Query:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIA
        SS  R+ S     ++ S+   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTSSGMF+SK +D IVS +E K+A
Subjt:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIA

Query:  AWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECARQGIAVKPK
        AWTFLP+       ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +    +     D+  +ECA+QG AVKP+
Subjt:  AWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECARQGIAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC
        KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e value: 3.6e-126 | identity: 72.28%
Query:  RNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPI
        R  L +    I SV+ +SS S   S++  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPI

Query:  VSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECA
        VSGIEDKI+ WTFLPK       ENGEDIQVLRYEHGQKY++H+DYF DKVNI RGGHR+AT+LMYLSNVTKGGETVFP AE   RR  SE  EDLS+CA
Subjt:  VSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECA

Query:  RQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSC
        ++GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   G+C+D+NESCERWA LGECTKNPEYMVG+ ELPGYCRRSC
Subjt:  RQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSC

Query:  RIC
        + C
Subjt:  RIC

Q9LN20 Probable prolyl 4-hydroxylase 3 | e value: 1.8e-61 | identity: 51.85%
Query:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHG
        +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+++G SK S VRTSSG F+ + +D I+  IE +IA +TF+P       +++GE +QVL YE G
Subjt:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHG

Query:  QKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVL
        QKYE HYDYFVD+ N   GG R+AT+LMYLS+V +GGETVFP A  +    +     +LSEC ++G++VKP+ GDALLF+S+ P+A  D  SLHGGCPV+
Subjt:  QKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVL

Query:  EGEKWSATKWIHVDSF
         G KWS+TKW+HV  +
Subjt:  EGEKWSATKWIHVDSF
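
The alignments above use a BLAST-style middle line: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way such a midline could be tallied for a single block. `tally_midline` is a hypothetical helper, and the example strings are the last Query and midline segment of the Q9LN20 alignment. The reported % identity covers the whole alignment, so a single block is not expected to reproduce it.

```python
def tally_midline(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter in the midline that equals the query residue is an identity.
    identical = sum(1 for q, m in zip(query, midline) if m != " " and m == q)
    # '+' marks a conservative substitution (a "positive" that is not an identity).
    conservative = midline.count("+")
    return {
        "aligned": len(query),
        "identical": identical,
        "positives": identical + conservative,
        "percent_identity": round(100.0 * identical / len(query), 2),
    }


query = "EGEKWSATKWIHVDSF"
midline = " G KWS+TKW+HV  +"
stats = tally_midline(query, midline)
print(stats)
```

For this 16-residue block the tally gives 9 identities and 12 positives, or 56.25% identity for the block alone.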

Arabidopsis top hits (e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e value: 2.4e-125 | identity: 69.71%
Query:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS
        MS+   LLF+ ++L   V+ +SS     S +S ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK 
Subjt:  MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL
        KDPIVSGIEDK++ WTFLPK       ENGED+QVLRYEHGQKY++H+DYF DKVNIARGGHR+ATVL+YLSNVTKGGETVFP A++  RR  SE  +DL
Subjt:  KDPIVSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDL

Query:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC
        S+CA++GIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + G+C+D+NESCERWA LGEC KNPEYMVG+PE+PG C
Subjt:  SECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYC

Query:  RRSCRIC
        RRSC+ C
Subjt:  RRSCRIC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e value: 1.1e-98 | identity: 58.16%
Query:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIA
        SS  R+ S     ++ S+   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTSSGMF+SK +D IVS +E K+A
Subjt:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIA

Query:  AWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECARQGIAVKPK
        AWTFLP+       ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +    +     D+  +ECA+QG AVKP+
Subjt:  AWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECARQGIAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC
        KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e value: 5.9e-92 | identity: 54.64%
Query:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDS-----KLSTVRTSSGMFISKSK---DPIV
        SS  R+ S     ++ S+   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S      +S VR SS    +      D IV
Subjt:  SSVVRESSCSYAGSATST--VDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDS-----KLSTVRTSSGMFISKSK---DPIV

Query:  SGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECAR
        S +E K+AAWTFLP+       ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGETVFP+ +    +     D+  +ECA+
Subjt:  SGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECAR

Query:  QGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCR
        QG AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+
Subjt:  QGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCR

Query:  IC
         C
Subjt:  IC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e value: 5.9e-92 | identity: 57.89%
Query:  SYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGI
        S   S + +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K+AAWTFLP+   
Subjt:  SYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPKAGI

Query:  SHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFP-LAEKSPRRRASETDEDLSECARQGIAVKPKKGDALLFFS
            ENGE +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLSNVTKGGETVFP    K+P+ +    D+  S+CA+QG AVKP+KGDALLFF+
Subjt:  SHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFP-LAEKSPRRRASETDEDLSECARQGIAVKPKKGDALLFFS

Query:  LEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC
        L  N   D NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  LEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 2.5e-127 | identity: 72.28%
Query:  RNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPI
        R  L +    I SV+ +SS S   S++  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPI

Query:  VSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECA
        VSGIEDKI+ WTFLPK       ENGEDIQVLRYEHGQKY++H+DYF DKVNI RGGHR+AT+LMYLSNVTKGGETVFP AE   RR  SE  EDLS+CA
Subjt:  VSGIEDKIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECA

Query:  RQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSC
        ++GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   G+C+D+NESCERWA LGECTKNPEYMVG+ ELPGYCRRSC
Subjt:  RQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSC

Query:  RIC
        + C
Subjt:  RIC


Sequences
CDS sequence
ATGTCCAAATTTCGCAATCTGTTATTTCTCTTCTTGATTTTGATCTCATCGGTTGTTCGGGAATCAAGTTGTTCGTATGCTGGTTCGGCTACCTCCACCGTAGATCCTAG
TAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGATTTCTCACGGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGAT
CTGAAGTTGCTGATAATGATTCAGGAGATAGCAAGCTCAGTACTGTTCGAACGAGCTCAGGAATGTTCATTTCTAAGAGCAAGGATCCTATTGTTTCTGGCATCGAGGAC
AAAATTGCTGCATGGACTTTTCTTCCAAAAGCGGGTATTTCCCATTTTTCAGAGAATGGAGAGGACATTCAGGTATTGAGATATGAGCATGGCCAGAAGTATGAATCACA
TTATGATTACTTTGTTGACAAGGTTAATATTGCCCGGGGAGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTTCCCT
TGGCTGAGAAATCTCCCCGCCGGAGGGCTTCTGAAACAGACGAGGATCTCTCAGAGTGTGCAAGGCAAGGAATTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTC
TTTAGTCTCGAACCAAATGCAATCCCAGACACCAATAGTCTCCATGGAGGCTGCCCTGTTCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTTGACTCTTT
CAGCAAAAACTTGGGAAACATTGGGGACTGTTCGGATCTAAACGAAAGCTGTGAGAGATGGGCTGCCTTAGGGGAGTGCACCAAAAACCCTGAGTATATGGTCGGATCTC
CGGAGCTTCCAGGCTACTGTAGGCGGAGTTGCAGGATCTGTTGA
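
The CDS above can be checked against the protein sequence on this page by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (the page does not name a translation table) and builds the codon table from the conventional TCAG ordering. `translate` is a hypothetical helper, and the two test strings are the first 21 and last 12 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTCCAAATTTCGCAATCTG"))  # MSKFRNL
print(translate("AGGATCTGTTGA"))           # RIC
```

Both fragments agree with the start (MSKFRNL) and end (RIC) of the protein sequence shown on this page. Passing the full CDS with its line breaks would work the same way, since whitespace is removed before translation.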
mRNA sequence
CTGGATTTGCAGACTTCGAATAAATTTTCTCCGATCACTCTCTCTCTCTCTCTCTCTCTCTCTCTAATTTAATCCGATCGAGACTATGTCCAAATTTCGCAATCTGTTAT
TTCTCTTCTTGATTTTGATCTCATCGGTTGTTCGGGAATCAAGTTGTTCGTATGCTGGTTCGGCTACCTCCACCGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAA
CCGAGAGCTTTTGTATATGAAGGATTTCTCACGGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGATCTGAAGTTGCTGATAATGATTCAGG
AGATAGCAAGCTCAGTACTGTTCGAACGAGCTCAGGAATGTTCATTTCTAAGAGCAAGGATCCTATTGTTTCTGGCATCGAGGACAAAATTGCTGCATGGACTTTTCTTC
CAAAAGCGGGTATTTCCCATTTTTCAGAGAATGGAGAGGACATTCAGGTATTGAGATATGAGCATGGCCAGAAGTATGAATCACATTATGATTACTTTGTTGACAAGGTT
AATATTGCCCGGGGAGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTTCCCTTGGCTGAGAAATCTCCCCGCCGGAG
GGCTTCTGAAACAGACGAGGATCTCTCAGAGTGTGCAAGGCAAGGAATTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGTCTCGAACCAAATGCAATCC
CAGACACCAATAGTCTCCATGGAGGCTGCCCTGTTCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTTGACTCTTTCAGCAAAAACTTGGGAAACATTGGG
GACTGTTCGGATCTAAACGAAAGCTGTGAGAGATGGGCTGCCTTAGGGGAGTGCACCAAAAACCCTGAGTATATGGTCGGATCTCCGGAGCTTCCAGGCTACTGTAGGCG
GAGTTGCAGGATCTGTTGATACACTCCAAATTTGGATACCTTTTGGCGCTATCTTATGCCTTTTGAACTCTGAAGCACGGATGAATGAAGCAGTAGCCGTCGGCGAACTC
CAGGAACCCAGCAGGACTTTTCAATCCATCTGAAACCAAAGGACAAAGGAAGAAAAGAGGTAAAAATAGGTATTGACAAAAAATTATCCATTTAATATGTAGGAGTGAGA
TCGAGCAAAAACTCGCGGTTTTTGGTAATTTATTAGAAAGAACGAAAGAAAGATTCCGTAAGCTACTTAGTAGTGGATAGGTAAAAAGAAAGCACTCCATTACGAATGTT
GGGACATATCAATAAAGCAATGTCTAAGATAATGAACAAGATGCTCACTCACTCAACTAGCGCACCTCTAAAGTGGCCATGGAGATTCTTCCTCCCATCACCCACCCAGT
AACTCAATCCCCTCTAATGTTAACTGTTTTTCTTCCTTCCTATTCTAAAAGATGGGAGCCTTCTTGGACAAGAAAAATGATATGTAGCATCAAACATTGACTATGATCCA
TCACGACTAACCCAAGCTAAAAGTAGTTGAAAACTATTAGTCAAACAAATTATTTATCTTTTAGATTGAACTCAATTGGTCAAAGAATCTTTTTTGACGGTATGCCTCTA
GGGGAATCTCAGGTACTTTCAACTTAATGATTATATTCCAGGTTAAGATCAACATCTTATGCACCCATTCCACATGTTAGGAGTAGCTGGTATTTCGGCGACTCCCTATT
CAATGTAGTGCATGGTTCCTTGATAACTTCTGGTTTGATCAGGAAAACCACAAAAAAAAAATGAATATGCTAATGAAAGTTACAGATTTAATCAAGAACAAGAAACTTAT
AATATCATAGCTACTCATAGTTATTTTGGTCAATTGATCTTCTAATGTACTAGTTTCAACAACTCTCG
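
Because the mRNA sequence is spliced, the CDS should occur in it as one contiguous substring, with the untranslated regions on either side. The sketch below shows how the two UTRs could be separated by a plain substring search. `split_transcript` is a hypothetical helper, and the example strings are deliberately shortened stand-ins (a few bases flanking the start and stop codons plus a truncated toy CDS), not the full sequences from this page.

```python
from typing import Tuple


def split_transcript(mrna: str, cds: str) -> Tuple[str, str]:
    """Return (five_prime_utr, three_prime_utr) for a CDS found inside an mRNA."""
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)  # first occurrence; assumes the CDS is contiguous
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return mrna[:start], mrna[start + len(cds):]


# Toy example: shortened flanks and a truncated, hypothetical CDS.
toy_cds = "ATGTCCAAATGA"
toy_mrna = "GAGACT" + toy_cds + "TACACTCC"
five_utr, three_utr = split_transcript(toy_mrna, toy_cds)
print(len(five_utr), len(three_utr))  # 6 8
```

Applied to the full mRNA and CDS sequences, the same call would give the 5' and 3' UTR sequences and hence their lengths.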
Protein sequence
MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIED
KIAAWTFLPKAGISHFSENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEKSPRRRASETDEDLSECARQGIAVKPKKGDALLF
FSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC