; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022103 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022103
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationChr05:20874457..20878029
RNA-Seq ExpressionHG10022103
SyntenyHG10022103
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648909.1 hypothetical protein Csa_008411 [Cucumis sativus]4.8e-16684.38Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSR FLAFSLCFLSVFT FARLPETRTHK+ SGSVL+LKTDSSPLIFDPTRVTQLSWQPRAFLYKGFL+D ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPA----DYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SEVRTSSGMFLRKAQDE+VAG+EARI+AWT LPA    DY+A+ ITY SFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPA----DYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSE----------------LKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSF
        LSNVEKGGETIFPNSE                 KESQ KD+SWSDC+ KGYAVKAQKGDALLFFSL+LDATTD++SLHGSCPVI GEKWSATKWIHVRSF
Subjt:  LSNVEKGGETIFPNSE----------------LKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSF

Query:  EKPT-RVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        EK T RVS Q C+DENENC  WAK+GECKKNPTYMVGS GALGYCRKSC+AC
Subjt:  EKPT-RVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]1.9e-16285.67Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETR----THKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESG
        MDSR FLAFSLCFLSVFT FARLPETR    ++K+ +GSVL+LKTDSSPLIFDPTRVTQLSWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETR----THKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        KSVSSEVRTSSGMFLRKAQD+IVAG+EARI+AWT LPA                   ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  KSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENE
        LSNVEKGGETIFPNSE KESQEKDDSWSDC+ KGYAVKAQKGDALLFFSLHLDATTD++SLHGSCPVIEGEKWSATKWIHVRSFEK  RVS QDC+DENE
Subjt:  LSNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENE

Query:  NCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        NCP WAKRGECKKNPTYMVGSEGALGYCRKSC+AC
Subjt:  NCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

XP_022938573.1 probable prolyl 4-hydroxylase 7 [Cucurbita moschata]6.7e-16086.71Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRRFLAFSL FLSV TGFARLPE  THKKLSGSVL+LK DS  LIFDPTRVTQLSWQPRAFLYKGFLTD+ECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
        SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLP                    ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
Subjt:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV

Query:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL
        EKGGETIFPNS   ESQEKDDSWSDCA KGYAVKAQKGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSF+K TR+SSQDC+DEN+NCP 
Subjt:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL

Query:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        WAKRGEC+KNPTYMVGSEGA+GYCRKSC+AC
Subjt:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]2.5e-16286.71Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRRFLAF LCFLSVFTGFARLPE R+ KK SGSV++LKTDSSPL+FDPTRVTQLSW+PRAFLYKGFL+DKECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
        SEVRTSSGMFLRKAQDEIVA IEARISAWT LPA                   ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
Subjt:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV

Query:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL
        EKGGETIFPNSE KESQEKD+SWSDCA KGYAVKA+KGDALLFFSL  DATTD KSLHGSCPVIEGEKWSATKWIHVRSFEK TRVS QDC+DENENC +
Subjt:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL

Query:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        WAKRGECKKNPTYMVGSE ALGYCRKSCRAC
Subjt:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC

XP_038889687.1 probable prolyl 4-hydroxylase 7 isoform X2 [Benincasa hispida]3.5e-16186.71Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRRFLAF LCFLSVFTGFARLPE R+ KK SGSV++LKTDSSPL+FDPTRVTQLSW+PRAFLYKGFL+DKECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
        SEVRTSSGMFLRKAQDEIVA IEARISAWT LPA                   ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
Subjt:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV

Query:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL
        EKGGETIFPNSE KESQEKD+SWSDCA KGYAVKA+KGDALLFFSL  DATTD KSLHGSCPVIEGEKWSATKWIHVRSFEK TRVS QDC+DENENC +
Subjt:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL

Query:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        WAKRGECKKNPTYMVGSE ALGYCRKSCRAC
Subjt:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC

TrEMBL top hitse value%identityAlignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase2.7e-15984.94Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSR FLAFSLCFLSVFT FARLPETRTHK+ SGSVL+LKTDSSPLIFDPTRVTQLSWQPRAFLYKGFL+D ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
        SEVRTSSGMFLRKAQDE+VAG+EARI+AWT LPA                   ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
Subjt:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV

Query:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RVSSQDCMDENENCP
        EKGGETIFPNSE KESQ KD+SWSDC+ KGYAVKAQKGDALLFFSL+LDATTD++SLHGSCPVI GEKWSATKWIHVRSFEK T RVS Q C+DENENC 
Subjt:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RVSSQDCMDENENCP

Query:  LWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
         WAK+GECKKNPTYMVGS GALGYCRKSC+AC
Subjt:  LWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase9.1e-16385.67Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETR----THKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESG
        MDSR FLAFSLCFLSVFT FARLPETR    ++K+ +GSVL+LKTDSSPLIFDPTRVTQLSWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETR----THKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        KSVSSEVRTSSGMFLRKAQD+IVAG+EARI+AWT LPA                   ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  KSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENE
        LSNVEKGGETIFPNSE KESQEKDDSWSDC+ KGYAVKAQKGDALLFFSLHLDATTD++SLHGSCPVIEGEKWSATKWIHVRSFEK  RVS QDC+DENE
Subjt:  LSNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENE

Query:  NCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        NCP WAKRGECKKNPTYMVGSEGALGYCRKSC+AC
Subjt:  NCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase4.0e-15884.04Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDS RFL+FSLCFL VFT  ARLP+ R HKK+SGSVL+LK + SPLIFDPTRVTQLSWQPRAFLYKGFL+DKECDHLIDLAKDKLEKSMVADN SGKSVS
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
        SEVRTSSGMFL KAQDEIVA +EARI+AWTFLPA                   ENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNV
Subjt:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV

Query:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQ-DCMDENENCP
        EKGGETIFPNSE KESQEKDDSWSDCA KGYAVKA+KGDALLFFSLHLDATTD KSLHGSCPVIEGEKWSATKWIHVRSFEKPTR S + DC+DENENC 
Subjt:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQ-DCMDENENCP

Query:  LWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
         WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  LWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase3.2e-16086.71Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRRFLAFSL FLSV TGFARLPE  THKKLSGSVL+LK DS  LIFDPTRVTQLSWQPRAFLYKGFLTD+ECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
        SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLP                    ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
Subjt:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV

Query:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL
        EKGGETIFPNS   ESQEKDDSWSDCA KGYAVKAQKGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSF+K TR+SSQDC+DEN+NCP 
Subjt:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL

Query:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        WAKRGEC+KNPTYMVGSEGA+GYCRKSC+AC
Subjt:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC

A0A6J1JWX0 Procollagen-proline 4-dioxygenase2.1e-15986.4Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRRFL FSL FLSV TGFARLPE  THKKLSGSVL+LK DS  LIFDPTRVTQLSWQPRAFLYKGFLTD+ECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
        SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLP                    ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV
Subjt:  SEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV

Query:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL
        EKGGETIFPNS   ESQEKDDSWSDCA KGYAVKAQKGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSF+K TR SSQDC+DEN+NCP 
Subjt:  EKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPL

Query:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        WAKRGEC+KNPTYMVGSEGA+GYCRKSC+AC
Subjt:  WAKRGECKKNPTYMVGSEGALGYCRKSCRAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.2e-11162.35Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSM-VADNESGKSV
        MDS+ FLAFSL  L +F+                     +  S     DPTR+TQLSW PRAFLYKGFL+D+ECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSM-VADNESGKSV

Query:  SSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN
         SEVRTSSGMFL K QD+IVA +EA+++AWTFLP                    ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSN
Subjt:  SSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN

Query:  VEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCP
        V KGGET+FPN + K  Q KDDSWS CA +GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K   V    C+D++E+C 
Subjt:  VEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCP

Query:  LWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
         WA  GEC+KNP YMVGSE +LG+CRKSC+AC
Subjt:  LWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

F4JAU3 Prolyl 4-hydroxylase 25.8e-9055.78Show/hide
Query:  LSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTF
        L  S   + + SS  I +P++V Q+S +PRAF+Y+GFLTD ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+GIE ++S WTF
Subjt:  LSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTF

Query:  LPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE---LKESQEKDDSWSDCAH
        LP                    ENGE +Q+L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP+++    +   E  D  SDCA 
Subjt:  LPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE---LKESQEKDDSWSDCAH

Query:  KGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSC
        KG AVK +KG+ALLFF+L  DA  D  SLHG CPVIEGEKWSATKWIHV SF+K       +C D NE+C  WA  GEC KNP YMVG+    G CR+SC
Subjt:  KGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSC

Query:  RAC
        +AC
Subjt:  RAC

Q8L970 Probable prolyl 4-hydroxylase 71.6e-12465.87Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPE---TRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGK
        MDSR FLAFSLCFL      +  P    TR+     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFL+D+ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPE---TRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGK

Query:  SVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYL
        SV SEVRTSSGMFL K QD+IV+ +EA+++AWTFLP                    ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYL
Subjt:  SVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYL

Query:  SNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENEN
        SNVEKGGET+FP  + K +Q KDDSW++CA +GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     S  CMDEN +
Subjt:  SNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENEN

Query:  CPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        C  WAK GEC+KNPTYMVGS+   GYCRKSC+AC
Subjt:  CPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

Q8LAN3 Probable prolyl 4-hydroxylase 42.8e-9257.53Show/hide
Query:  SSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTIT
        SS +  +P++V Q+S +PRAF+Y+GFLT+ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+GIE +IS WTFLP         
Subjt:  SSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTIT

Query:  YFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELKESQ---EKDDSWSDCAHKGYAVKAQKGD
                   ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E+   +   E  +  SDCA +G AVK +KGD
Subjt:  YFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELKESQ---EKDDSWSDCAHKGYAVKAQKGD

Query:  ALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        ALLFF+LH DA  D  SLHG CPVIEGEKWSATKWIHV SF++     S +C D NE+C  WA  GEC KNP YMVG+    GYCR+SC+AC
Subjt:  ALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

Q9LN20 Probable prolyl 4-hydroxylase 34.3e-6151.54Show/hide
Query:  LSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSEN
        LSW+PRAF+Y  FL+ +EC++LI LAK  + KS V D+E+GKS  S VRTSSG FLR+ +D+I+  IE RI+ +TF+PAD+                   
Subjt:  LSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSEN

Query:  GESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELK-ESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDD
        GE +Q+LHYE GQKYEPH+D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP + +   S    +  S+C  KG +VK + GDALLF+S+  DAT D 
Subjt:  GESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELK-ESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDD

Query:  KSLHGSCPVIEGEKWSATKWIHVRSFE
         SLHG CPVI G KWS+TKW+HV  ++
Subjt:  KSLHGSCPVIEGEKWSATKWIHVRSFE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 24.1e-9155.78Show/hide
Query:  LSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTF
        L  S   + + SS  I +P++V Q+S +PRAF+Y+GFLTD ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ K +D IV+GIE ++S WTF
Subjt:  LSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTF

Query:  LPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE---LKESQEKDDSWSDCAH
        LP                    ENGE +Q+L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP+++    +   E  D  SDCA 
Subjt:  LPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE---LKESQEKDDSWSDCAH

Query:  KGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSC
        KG AVK +KG+ALLFF+L  DA  D  SLHG CPVIEGEKWSATKWIHV SF+K       +C D NE+C  WA  GEC KNP YMVG+    G CR+SC
Subjt:  KGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSC

Query:  RAC
        +AC
Subjt:  RAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase1.1e-12565.87Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPE---TRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGK
        MDSR FLAFSLCFL      +  P    TR+     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFL+D+ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPE---TRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGK

Query:  SVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYL
        SV SEVRTSSGMFL K QD+IV+ +EA+++AWTFLP                    ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYL
Subjt:  SVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYL

Query:  SNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENEN
        SNVEKGGET+FP  + K +Q KDDSW++CA +GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     S  CMDEN +
Subjt:  SNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENEN

Query:  CPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        C  WAK GEC+KNPTYMVGS+   GYCRKSC+AC
Subjt:  CPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.1e-11762.28Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPE---TRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGK
        MDSR FLAFSLCFL      +  P    TR+     GSV+++KT +S   FDPTRVTQLSW PR FLY+GFL+D+ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPE---TRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGK

Query:  SVSSE-----VRTSSGMFLRKAQ---DEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHR
        SV SE     VR SS           D+IV+ +EA+++AWTFLP                    ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHR
Subjt:  SVSSE-----VRTSSGMFLRKAQ---DEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHR

Query:  IATVLMYLSNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQ
        IATVLMYLSNVEKGGET+FP  + K +Q KDDSW++CA +GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     S 
Subjt:  IATVLMYLSNVEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQ

Query:  DCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
         CMDEN +C  WAK GEC+KNPTYMVGS+   GYCRKSC+AC
Subjt:  DCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase8.5e-11362.35Show/hide
Query:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSM-VADNESGKSV
        MDS+ FLAFSL  L +F+                     +  S     DPTR+TQLSW PRAFLYKGFL+D+ECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSM-VADNESGKSV

Query:  SSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN
         SEVRTSSGMFL K QD+IVA +EA+++AWTFLP                    ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSN
Subjt:  SSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN

Query:  VEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCP
        V KGGET+FPN + K  Q KDDSWS CA +GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K   V    C+D++E+C 
Subjt:  VEKGGETIFPNSELKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCP

Query:  LWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
         WA  GEC+KNP YMVGSE +LG+CRKSC+AC
Subjt:  LWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.0e-9357.53Show/hide
Query:  SSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTIT
        SS +  +P++V Q+S +PRAF+Y+GFLT+ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ K +D IV+GIE +IS WTFLP         
Subjt:  SSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPADYKALTIT

Query:  YFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELKESQ---EKDDSWSDCAHKGYAVKAQKGD
                   ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E+   +   E  +  SDCA +G AVK +KGD
Subjt:  YFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELKESQ---EKDDSWSDCAHKGYAVKAQKGD

Query:  ALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        ALLFF+LH DA  D  SLHG CPVIEGEKWSATKWIHV SF++     S +C D NE+C  WA  GEC KNP YMVG+    GYCR+SC+AC
Subjt:  ALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGACGATTCCTCGCATTTTCTCTCTGCTTTCTGTCCGTCTTTACTGGCTTCGCTCGCTTGCCGGAAACGCGTACGCACAAAAAATTAAGTGGATCTGTGCT
TCAATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTAACTGATAAGGAAT
GTGATCATCTAATCGATCTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAAAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTC
CTTCGGAAGGCCCAGGATGAAATTGTTGCTGGCATTGAGGCCAGGATATCTGCGTGGACATTCCTTCCAGCAGATTATAAAGCTTTAACGATCACTTACTTTTCTTTCTT
GGAACTGTTGTTAAAATCAGAAAACGGAGAATCTATTCAAATTCTTCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGG
AGTTGGGTGGCCACCGAATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGTTGAAAGAATCTCAAGAAAAGGAT
GACAGCTGGTCTGATTGTGCTCATAAGGGTTATGCAGTTAAGGCACAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCCATCTTGATGCAACGACAGATGACAAAAGCTT
GCACGGTAGTTGCCCTGTGATTGAGGGCGAGAAATGGTCTGCAACCAAATGGATTCATGTGAGATCCTTCGAGAAGCCAACTCGTGTAAGTAGTCAGGATTGCATGGACG
AGAACGAAAATTGCCCGTTATGGGCAAAAAGAGGAGAGTGCAAAAAGAACCCTACTTACATGGTGGGTTCTGAAGGTGCTTTAGGATACTGTAGGAAGAGTTGCAGAGCA
TGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCCGACGATTCCTCGCATTTTCTCTCTGCTTTCTGTCCGTCTTTACTGGCTTCGCTCGCTTGCCGGAAACGCGTACGCACAAAAAATTAAGTGGATCTGTGCT
TCAATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTAACTGATAAGGAAT
GTGATCATCTAATCGATCTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAAAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTC
CTTCGGAAGGCCCAGGATGAAATTGTTGCTGGCATTGAGGCCAGGATATCTGCGTGGACATTCCTTCCAGCAGATTATAAAGCTTTAACGATCACTTACTTTTCTTTCTT
GGAACTGTTGTTAAAATCAGAAAACGGAGAATCTATTCAAATTCTTCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGG
AGTTGGGTGGCCACCGAATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGTTGAAAGAATCTCAAGAAAAGGAT
GACAGCTGGTCTGATTGTGCTCATAAGGGTTATGCAGTTAAGGCACAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCCATCTTGATGCAACGACAGATGACAAAAGCTT
GCACGGTAGTTGCCCTGTGATTGAGGGCGAGAAATGGTCTGCAACCAAATGGATTCATGTGAGATCCTTCGAGAAGCCAACTCGTGTAAGTAGTCAGGATTGCATGGACG
AGAACGAAAATTGCCCGTTATGGGCAAAAAGAGGAGAGTGCAAAAAGAACCCTACTTACATGGTGGGTTCTGAAGGTGCTTTAGGATACTGTAGGAAGAGTTGCAGAGCA
TGTTGA
Protein sequenceShow/hide protein sequence
MDSRRFLAFSLCFLSVFTGFARLPETRTHKKLSGSVLQLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLTDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMF
LRKAQDEIVAGIEARISAWTFLPADYKALTITYFSFLELLLKSENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSELKESQEKD
DSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDDKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVSSQDCMDENENCPLWAKRGECKKNPTYMVGSEGALGYCRKSCRA
C