CuGenDBv2

ClCG09G018490 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G018490
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Procollagen-proline 4-dioxygenase
Genome location: CG_Chr09:35629528..35633746
RNA-Seq Expression: ClCG09G018490
Synteny: ClCG09G018490
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase
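
The genome location listed above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("CG_Chr09:35629528..35633746")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 4219 bp on CG_Chr09.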


Homology
GenBank top hits (columns: e-value, % identity, alignment)
XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo] (e-value: 1.9e-162; identity: 86.14%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRM----HKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNES
        MDSR FLAFSLCFLSVF TAFARLPETRM    +K+ TGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADNES
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRM----HKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNES

Query:  GRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIAT
        G+SVSSEVRTSSGMFLRKAQ                      D IVAG+EARI+AWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIAT
Subjt:  GRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIAT

Query:  VLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCV
        VLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDC+ KGYAVKAQKGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK  RV RQDCV
Subjt:  VLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCV

Query:  DENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        DENENC  WAKRGECKKNPTYMVGSEGALGYCRKSC+AC
Subjt:  DENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus] (e-value: 1.4e-157; identity: 84.23%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV
        MDSR FLAFSLCFLSVF TAFARLPETR HK+ +GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG+SV
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV

Query:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SSEVRTSSGMFLRKAQ                      D +VAG+EARI+AWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RVGRQDCVDEN
        LSNVEKGGETIFPNSEFKESQ KD+SWSDC+ KGYAVKAQKGDALLFFSL+LDATTD +SLHGSCPVI GEKWSATKWIHVRSFEK T RV RQ CVDEN
Subjt:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RVGRQDCVDEN

Query:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        ENC  WAK+GECKKNPTYMVGS GALGYCRKSC+AC
Subjt:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

XP_022134044.1 probable prolyl 4-hydroxylase 7 [Momordica charantia] (e-value: 2.2e-158; identity: 83.93%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV
        MDS RFL+FSLCFL VF TA ARLP+ R HKK++GSVLRLK + SPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADN SG+SV
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV

Query:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SSEVRTSSGMFL KAQ                      D IVA +EARI+AWTFLPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMY
Subjt:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQ-DCVDEN
        LSNVEKGGETIFPNSEFKESQEKDDSWSDCA KGYAVKA+KGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTR  R+ DCVDEN
Subjt:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQ-DCVDEN

Query:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        ENC+ WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida] (e-value: 1.6e-161; identity: 85.97%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV
        MDSRRFLAF LCFLSVF T FARLPE R  KK +GSV+RLKTDSSPL+FDPTRVTQLSW+PRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG+SV
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV

Query:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SSEVRTSSGMFLRKAQ                      D IVA IEARISAWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENE
        LSNVEKGGETIFPNSEFKESQEKD+SWSDCA KGYAVKA+KGDALLFFSL  DATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEK TRV RQDCVDENE
Subjt:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENE

Query:  NCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        NC +WAKRGECKKNPTYMVGSE ALGYCRKSCRAC
Subjt:  NCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

XP_038889687.1 probable prolyl 4-hydroxylase 7 isoform X2 [Benincasa hispida] (e-value: 2.3e-160; identity: 85.97%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV
        MDSRRFLAF LCFLSVF T FARLPE R  KK +GSV+RLKTDSSPL+FDPTRVTQLSW+PRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG+SV
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV

Query:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SSEVRTSSGMFLRKAQ                      D IVA IEARISAWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENE
        LSNVEKGGETIFPNSEFKESQEKD+SWSDCA KGYAVKA+KGDALLFFSL  DATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEK TRV RQDCVDENE
Subjt:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENE

Query:  NCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        NC +WAKRGECKKNPTYMVGSE ALGYCRKSCRAC
Subjt:  NCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A0A0KS38 Procollagen-proline 4-dioxygenase (e-value: 6.8e-158; identity: 84.23%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV
        MDSR FLAFSLCFLSVF TAFARLPETR HK+ +GSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SG+SV
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV

Query:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SSEVRTSSGMFLRKAQ                      D +VAG+EARI+AWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RVGRQDCVDEN
        LSNVEKGGETIFPNSEFKESQ KD+SWSDC+ KGYAVKAQKGDALLFFSL+LDATTD +SLHGSCPVI GEKWSATKWIHVRSFEK T RV RQ CVDEN
Subjt:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPT-RVGRQDCVDEN

Query:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        ENC  WAK+GECKKNPTYMVGS GALGYCRKSC+AC
Subjt:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase (e-value: 9.2e-163; identity: 86.14%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRM----HKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNES
        MDSR FLAFSLCFLSVF TAFARLPETRM    +K+ TGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADNES
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRM----HKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNES

Query:  GRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIAT
        G+SVSSEVRTSSGMFLRKAQ                      D IVAG+EARI+AWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIAT
Subjt:  GRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIAT

Query:  VLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCV
        VLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDC+ KGYAVKAQKGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK  RV RQDCV
Subjt:  VLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCV

Query:  DENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        DENENC  WAKRGECKKNPTYMVGSEGALGYCRKSC+AC
Subjt:  DENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

A0A5D3CTS4 Procollagen-proline 4-dioxygenase (e-value: 1.2e-154; identity: 73.37%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRM----HKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNES
        MDSR FLAFSLCFLSVF TAFARLPETRM    +K+ TGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADNES
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRM----HKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNES

Query:  GRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIAT
        G+SVSSEVRTSSGMFLRKAQ                      D IVAG+EARI+AWT LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIAT
Subjt:  GRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIAT

Query:  VLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGY-----------------------------------------------------------AV
        VLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDC+ KGY                                                           AV
Subjt:  VLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGY-----------------------------------------------------------AV

Query:  KAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        KAQKGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK  RV RQDCVDENENC  WAKRGECKKNPTYMVGSEGALGYCRKSC+AC
Subjt:  KAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase (e-value: 1.1e-158; identity: 83.93%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV
        MDS RFL+FSLCFL VF TA ARLP+ R HKK++GSVLRLK + SPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADN SG+SV
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV

Query:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SSEVRTSSGMFL KAQ                      D IVA +EARI+AWTFLPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMY
Subjt:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQ-DCVDEN
        LSNVEKGGETIFPNSEFKESQEKDDSWSDCA KGYAVKA+KGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTR  R+ DCVDEN
Subjt:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQ-DCVDEN

Query:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        ENC+ WAKRGECKKNPTYMVGSE ALGYCRKSC+AC
Subjt:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase (e-value: 2.4e-155; identity: 83.88%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV
        MDSRRFLAFSL FLSV +T FARLPET  HKKL+GSVL LK DS  LIFDPTRVTQLSWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESG+SV
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSV

Query:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
        SSEVRTSSGMFLRKAQ                      D IVAGIEARISAWTFLP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY
Subjt:  SSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY

Query:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENE
        LSNVEKGGETIFPNS F ESQEKDDSWSDCA KGYAVKAQKGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSF+K TR+  QDCVDEN+
Subjt:  LSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENE

Query:  NCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        NC  WAKRGEC+KNPTYMVGSEGA+GYCRKSC+AC
Subjt:  NCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

SwissProt top hits (columns: e-value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 (e-value: 1.6e-111; identity: 62.2%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGRS
        MDS+ FLAFSL  L +F+                      +  S     DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG S
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGRS

Query:  VSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM
          SEVRTSSGMFL K Q                      D IVA +EA+++AWTFLP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLM
Subjt:  VSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM

Query:  YLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDEN
        YLSNV KGGET+FPN + K  Q KDDSWS CA +GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K   V    CVD++
Subjt:  YLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDEN

Query:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        E+C  WA  GEC+KNP YMVGSE +LG+CRKSC+AC
Subjt:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

F4JAU3 Prolyl 4-hydroxylase 2 (e-value: 2.9e-89; identity: 57.58%)
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIV
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G S  S+VRTSSG F+ K +                      D IV
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIV

Query:  AGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCAHKGYAVKA
        +GIE ++S WTFLP ENGE +Q+L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  SDCA KG AVK 
Subjt:  AGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCAHKGYAVKA

Query:  QKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEK-PTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        +KG+ALLFF+L  DA  D  SLHG CPVIEGEKWSATKWIHV SF+K  T  G  +C D NE+C  WA  GEC KNP YMVG+    G CR+SC+AC
Subjt:  QKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEK-PTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

Q8L970 Probable prolyl 4-hydroxylase 7 (e-value: 1.3e-121; identity: 65.09%)
Query:  MDSRRFLAFSLCF---LSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG
        MDSR FLAFSLCF   L + ++A  R   TR      GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG
Subjt:  MDSRRFLAFSLCF---LSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG

Query:  RSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATV
         SV SEVRTSSGMFL K Q                      D IV+ +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATV
Subjt:  RSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATV

Query:  LMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVD
        LMYLSNVEKGGET+FP  + K +Q KDDSW++CA +GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     +  C+D
Subjt:  LMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVD

Query:  ENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        EN +C  WAK GEC+KNPTYMVGS+   GYCRKSC+AC
Subjt:  ENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

Q8LAN3 Probable prolyl 4-hydroxylase 4 (e-value: 5.3e-91; identity: 56.27%)
Query:  SSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVA
        SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG S  SEVRTSSG F+ K +                      D IV+
Subjt:  SSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVA

Query:  GIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCAHKGYAVKAQ
        GIE +IS WTFLP ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDCA +G AVK +
Subjt:  GIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCAHKGYAVKAQ

Query:  KGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        KGDALLFF+LH DA  D  SLHG CPVIEGEKWSATKWIHV SF++       +C D NE+C  WA  GEC KNP YMVG+    GYCR+SC+AC
Subjt:  KGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

Q9LN20 Probable prolyl 4-hydroxylase 3 (e-value: 6.3e-60; identity: 50.43%)
Query:  LSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLP
        LSW+PRAF+Y  FLS +EC++LI LAK  + KS V D+E+G+S  S VRTSSG FLR+ +                      D I+  IE RI+ +TF+P
Subjt:  LSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLP

Query:  AENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDAT
        A++GE +Q+LHYE GQKYEPH+D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP +     S    +  S+C  KG +VK + GDALLF+S+  DAT
Subjt:  AENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDAT

Query:  TDVKSLHGSCPVIEGEKWSATKWIHVRSFE
         D  SLHG CPVI G KWS+TKW+HV  ++
Subjt:  TDVKSLHGSCPVIEGEKWSATKWIHVRSFE

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT3G06300.1 P4H isoform 2 (e-value: 2.1e-90; identity: 57.58%)
Query:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIV
        SSP  I +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G S  S+VRTSSG F+ K +                      D IV
Subjt:  SSP-LIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIV

Query:  AGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCAHKGYAVKA
        +GIE ++S WTFLP ENGE +Q+L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  SDCA KG AVK 
Subjt:  AGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCAHKGYAVKA

Query:  QKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEK-PTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        +KG+ALLFF+L  DA  D  SLHG CPVIEGEKWSATKWIHV SF+K  T  G  +C D NE+C  WA  GEC KNP YMVG+    G CR+SC+AC
Subjt:  QKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEK-PTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase (e-value: 9.2e-123; identity: 65.09%)
Query:  MDSRRFLAFSLCF---LSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG
        MDSR FLAFSLCF   L + ++A  R   TR      GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG
Subjt:  MDSRRFLAFSLCF---LSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG

Query:  RSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATV
         SV SEVRTSSGMFL K Q                      D IV+ +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATV
Subjt:  RSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATV

Query:  LMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVD
        LMYLSNVEKGGET+FP  + K +Q KDDSW++CA +GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     +  C+D
Subjt:  LMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVD

Query:  ENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        EN +C  WAK GEC+KNPTYMVGS+   GYCRKSC+AC
Subjt:  ENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase (e-value: 5.8e-117; identity: 62.72%)
Query:  MDSRRFLAFSLCF---LSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG
        MDSR FLAFSLCF   L + ++A  R   TR      GSV+++KT +S   FDPTRVTQLSW PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG
Subjt:  MDSRRFLAFSLCF---LSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG

Query:  RSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATV
         SV SE    S   +R++   ++    +E+           D IV+ +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATV
Subjt:  RSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATV

Query:  LMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVD
        LMYLSNVEKGGET+FP  + K +Q KDDSW++CA +GYAVK +KGDALLFF+LH +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+     +  C+D
Subjt:  LMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVD

Query:  ENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        EN +C  WAK GEC+KNPTYMVGS+   GYCRKSC+AC
Subjt:  ENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase (e-value: 1.1e-112; identity: 62.2%)
Query:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGRS
        MDS+ FLAFSL  L +F+                      +  S     DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG S
Subjt:  MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGRS

Query:  VSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM
          SEVRTSSGMFL K Q                      D IVA +EA+++AWTFLP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLM
Subjt:  VSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM

Query:  YLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDEN
        YLSNV KGGET+FPN + K  Q KDDSWS CA +GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K   V    CVD++
Subjt:  YLSNVEKGGETIFPNSEFKESQEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDEN

Query:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        E+C  WA  GEC+KNP YMVGSE +LG+CRKSC+AC
Subjt:  ENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e-value: 3.8e-92; identity: 56.27%)
Query:  SSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVA
        SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG S  SEVRTSSG F+ K +                      D IV+
Subjt:  SSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGMFLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVA

Query:  GIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCAHKGYAVKAQ
        GIE +IS WTFLP ENGE IQ+L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDCA +G AVK +
Subjt:  GIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCAHKGYAVKAQ

Query:  KGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC
        KGDALLFF+LH DA  D  SLHG CPVIEGEKWSATKWIHV SF++       +C D NE+C  WA  GEC KNP YMVG+    GYCR+SC+AC
Subjt:  KGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRKSCRAC


Sequences
CDS sequence
ATGGATTCCCGACGATTCCTCGCATTTTCTCTCTGCTTTCTATCCGTCTTTACTACTGCCTTCGCTCGCTTGCCGGAAACGCGTATGCACAAGAAATTAACTGGATCTGT
GCTTCGATTGAAGACGGATTCATCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTGTCTGATAAGG
AATGTGATCATCTAATCGATCTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAGAAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATG
TTCCTTCGGAAGGCCCAGATCGTAGTGTCATGGTATGTTCTTGTAGAATTAGTTGAGAAGGATAATGTAATTGTAGATGTCAATGATGCAATTGTTGCTGGCATTGAGGC
CAGGATTTCTGCATGGACATTCCTTCCAGCAGAAAATGGAGAATCCATTCAAATTCTTCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACA
AGGTGAATCAGGAGTTAGGTGGCCACCGAATCGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGTTTAAAGAATCT
CAAGAAAAGGATGACAGCTGGTCTGATTGTGCTCATAAGGGTTATGCAGTTAAAGCGCAGAAGGGTGATGCGTTGCTGTTCTTCAGCCTCCATCTCGATGCAACGACAGA
CGTCAAAAGCTTGCATGGTAGTTGCCCTGTGATTGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCACGTGAGATCCTTCGAGAAGCCAACTCGTGTAGGTAGGCAGG
ATTGTGTGGACGAGAACGAAAATTGCTCATTATGGGCAAAAAGAGGAGAGTGCAAAAAGAACCCTACTTACATGGTGGGTTCTGAAGGTGCTTTAGGATACTGTAGGAAG
AGCTGCAGAGCATGTTAA
mRNA sequence
TTATTTGAAGTTTCTAAATGTGCAATTTCCGAAAAACAGCCCACCGGTTCTCATTTTTCATTCATTGTCTTGATAGTTGAATTTTCGTAATAAATTCAAAGCCATTTATT
TTTCTTTTTCCCCTTCTCGTACACCCAAGAAACCTGCAGTTGAATTTTTTCTTGTTCATTTCTCCGATTTGATATCGGAGAAACAATCATGGATTCCCGACGATTCCTCG
CATTTTCTCTCTGCTTTCTATCCGTCTTTACTACTGCCTTCGCTCGCTTGCCGGAAACGCGTATGCACAAGAAATTAACTGGATCTGTGCTTCGATTGAAGACGGATTCA
TCTCCGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTGTCTGATAAGGAATGTGATCATCTAATCGATCT
GGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAGAAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTCCTTCGGAAGGCCCAGATCG
TAGTGTCATGGTATGTTCTTGTAGAATTAGTTGAGAAGGATAATGTAATTGTAGATGTCAATGATGCAATTGTTGCTGGCATTGAGGCCAGGATTTCTGCATGGACATTC
CTTCCAGCAGAAAATGGAGAATCCATTCAAATTCTTCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGG
CCACCGAATCGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGTTTAAAGAATCTCAAGAAAAGGATGACAGCTGGT
CTGATTGTGCTCATAAGGGTTATGCAGTTAAAGCGCAGAAGGGTGATGCGTTGCTGTTCTTCAGCCTCCATCTCGATGCAACGACAGACGTCAAAAGCTTGCATGGTAGT
TGCCCTGTGATTGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCACGTGAGATCCTTCGAGAAGCCAACTCGTGTAGGTAGGCAGGATTGTGTGGACGAGAACGAAAA
TTGCTCATTATGGGCAAAAAGAGGAGAGTGCAAAAAGAACCCTACTTACATGGTGGGTTCTGAAGGTGCTTTAGGATACTGTAGGAAGAGCTGCAGAGCATGTTAAACCT
TGGAAGAAAGTCCACATCTCTTTCTTTCTCTTTTTGTTTTGCAGAGCTTTAGTGTTGATTTTGTGATGGGTATGTAAATAACATTGGGCAGTAAGTGGGTATACAATACA
AGCGGATATTACATCTCTTTCATTAAACCTTGTAGTAGCAATTAGCCACAAAGTGTTTCATTTGGTTATTGAAACGCAATGAGAAGTTTTCTCATGTACGATGCTTATTG
GTTGGTTAACTTTTCTTTTTTCAACTTTACAAATGATATATTTTGTCCCTTCTTGAAAATTGC
Protein sequence
MDSRRFLAFSLCFLSVFTTAFARLPETRMHKKLTGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGRSVSSEVRTSSGM
FLRKAQIVVSWYVLVELVEKDNVIVDVNDAIVAGIEARISAWTFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES
QEKDDSWSDCAHKGYAVKAQKGDALLFFSLHLDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKPTRVGRQDCVDENENCSLWAKRGECKKNPTYMVGSEGALGYCRK
SCRAC
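
The CDS and protein sequences listed above should correspond codon by codon. The following minimal sketch shows one way to check this, using the standard genetic code. The helper name `translate` is hypothetical, and only the first 21 and last 18 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 21 bases of the CDS, then its last 18 bases (in frame, ending in TAA).
print(translate("ATGGATTCCCGACGATTCCTC"))  # MDSRRFL
print(translate("AGCTGCAGAGCATGTTAA"))     # SCRAC
```

Both fragments agree with the start (`MDSRRFL`) and end (`SCRAC`) of the protein sequence listed above. Passing the full CDS as a single string, with line breaks removed, would extend the same check to the whole record.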