; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0014741 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0014741
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr04:8946777..8950092
RNA-Seq ExpressionPI0014741
SyntenyPI0014741
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053723.1 putative prolyl 4-hydroxylase 7 isoform X2 [Cucumis melo var. makuwa]4.9e-15286.08Show/hide
Query:  ESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPI
        ESVIRMKTGGSAITIDPT V+QLSSKPRAFLYKGFLS EECQHLI+LAKGKL QSLVAAGTGESVTSKERTSTGMFLRKAQD+IVARIESRIAAWTFLP+
Subjt:  ESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPI

Query:  -----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKADLS
                                     DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEK DLS
Subjt:  -----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKADLS

Query:  ACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRL
         CAKVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPI+E WRNPACVDENDHCSAWAKAGEC+KNPVYMMGSKNELGFCRL
Subjt:  ACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRL

Query:  SCKVCSPPS
        SCKVCSP S
Subjt:  SCKVCSPPS

TYK17735.1 putative prolyl 4-hydroxylase 7 isoform X2 [Cucumis melo var. makuwa]1.1e-15695Show/hide
Query:  ESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPI
        ESVIRMKTGGSAITIDPT V+QLSSKPRAFLYKGFLS EECQHLI+LAKGKL QSLVAAGTGESVTSKERTSTGMFLRKAQD+IVARIESRIAAWTFLP+
Subjt:  ESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPI

Query:  DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPD
        DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEK DLS CAKVGYGVRPKLGDALLFFSMNPNVTPD
Subjt:  DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPD

Query:  TTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVCSPPS
         TSYHGSCPVIEGEKWSATKWIHMLPI+E WRNPACVDENDHCSAWAKAGEC+KNPVYMMGSKNELGFCRLSCKVCSP S
Subjt:  TTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVCSPPS

XP_004147455.1 probable prolyl 4-hydroxylase 7 [Cucumis sativus]8.6e-17392.45Show/hide
Query:  MASPFFLAFSIFCLWLF--PISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG
        MASPFFL FSIF L+LF  P SS SANRFPKL+ HNND+DESVIRMKTGGSA+TIDPT V+QLSSKPRAFLYKGFLS EECQHLIN AKGKLHQSLVAAG
Subjt:  MASPFFLAFSIFCLWLF--PISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG

Query:  TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV
        TG+SVTSKERTSTGMFL KAQDEIVARIESRIAAWTFLP+DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPV
Subjt:  TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV

Query:  KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMM
        KLSEEEKADLS C KVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPI+EFWRNPACVDENDHC+AWAKAGECEKNPVYMM
Subjt:  KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMM

Query:  GSKNELGFCRLSCKVCSP
        GSKNELGFCR SCKVCSP
Subjt:  GSKNELGFCRLSCKVCSP

XP_008443446.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]4.6e-17493.08Show/hide
Query:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTG
        MASPF LAFSIF LWL P+SS SANRFPK+L HNNDM ESVIRMKTGGSAITIDPT V+QLSSKPRAFLYKGFLS EECQHLI+LAKGKL QSLVAAGTG
Subjt:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTG

Query:  ESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
        ESVTSKERTSTGMFLRKAQD+IVARIESRIAAWTFLP+DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
Subjt:  ESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL

Query:  SEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGS
        SEEEK DLS CAKVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPI+E WRNPACVDENDHCSAWAKAGEC+KNPVYMMGS
Subjt:  SEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGS

Query:  KNELGFCRLSCKVCSPPS
        KNELGFCRLSCKVCSP S
Subjt:  KNELGFCRLSCKVCSPPS

XP_038905408.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]3.1e-16286.79Show/hide
Query:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTG
        MAS FFLAFS+  L  FP  SRSANR PKLL HNN+MD+SVIRMKT GS +TIDPT V++LSSKPRAFLYKGFLS++ECQHLINLAKGKL QSLVAA TG
Subjt:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTG

Query:  ESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
        ESVTS+ERTSTGMFL +AQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDP NIAIGGHRIATILMYLSDVEKGGETVFPNSP+KL
Subjt:  ESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL

Query:  SEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGS
        SE+E+ADLS CAKVGYGV+PK+GDALLFFS+NPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPI E WRNPACVDEN  C AWA AGECEKNPVYMMGS
Subjt:  SEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGS

Query:  KNELGFCRLSCKVCSPPS
        KNELG CR+SCKVCSPPS
Subjt:  KNELGFCRLSCKVCSPPS

TrEMBL top hitse value%identityAlignment
A0A0A0LG32 Procollagen-proline 4-dioxygenase4.2e-17392.45Show/hide
Query:  MASPFFLAFSIFCLWLF--PISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG
        MASPFFL FSIF L+LF  P SS SANRFPKL+ HNND+DESVIRMKTGGSA+TIDPT V+QLSSKPRAFLYKGFLS EECQHLIN AKGKLHQSLVAAG
Subjt:  MASPFFLAFSIFCLWLF--PISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG

Query:  TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV
        TG+SVTSKERTSTGMFL KAQDEIVARIESRIAAWTFLP+DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLS+VEKGGETVFPNSPV
Subjt:  TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV

Query:  KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMM
        KLSEEEKADLS C KVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPI+EFWRNPACVDENDHC+AWAKAGECEKNPVYMM
Subjt:  KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMM

Query:  GSKNELGFCRLSCKVCSP
        GSKNELGFCR SCKVCSP
Subjt:  GSKNELGFCRLSCKVCSP

A0A1S3B814 Procollagen-proline 4-dioxygenase2.2e-17493.08Show/hide
Query:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTG
        MASPF LAFSIF LWL P+SS SANRFPK+L HNNDM ESVIRMKTGGSAITIDPT V+QLSSKPRAFLYKGFLS EECQHLI+LAKGKL QSLVAAGTG
Subjt:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTG

Query:  ESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
        ESVTSKERTSTGMFLRKAQD+IVARIESRIAAWTFLP+DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL
Subjt:  ESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKL

Query:  SEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGS
        SEEEK DLS CAKVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPI+E WRNPACVDENDHCSAWAKAGEC+KNPVYMMGS
Subjt:  SEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGS

Query:  KNELGFCRLSCKVCSPPS
        KNELGFCRLSCKVCSP S
Subjt:  KNELGFCRLSCKVCSPPS

A0A5A7UCT9 Procollagen-proline 4-dioxygenase2.4e-15286.08Show/hide
Query:  ESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPI
        ESVIRMKTGGSAITIDPT V+QLSSKPRAFLYKGFLS EECQHLI+LAKGKL QSLVAAGTGESVTSKERTSTGMFLRKAQD+IVARIESRIAAWTFLP+
Subjt:  ESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPI

Query:  -----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKADLS
                                     DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEK DLS
Subjt:  -----------------------------DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKADLS

Query:  ACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRL
         CAKVGYGVRPKLGDALLFFSMNPNVTPD TSYHGSCPVIEGEKWSATKWIHMLPI+E WRNPACVDENDHCSAWAKAGEC+KNPVYMMGSKNELGFCRL
Subjt:  ACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRL

Query:  SCKVCSPPS
        SCKVCSP S
Subjt:  SCKVCSPPS

A0A5D3D1X2 Procollagen-proline 4-dioxygenase5.5e-15795Show/hide
Query:  ESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPI
        ESVIRMKTGGSAITIDPT V+QLSSKPRAFLYKGFLS EECQHLI+LAKGKL QSLVAAGTGESVTSKERTSTGMFLRKAQD+IVARIESRIAAWTFLP+
Subjt:  ESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPI

Query:  DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPD
        DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEK DLS CAKVGYGVRPKLGDALLFFSMNPNVTPD
Subjt:  DNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPD

Query:  TTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVCSPPS
         TSYHGSCPVIEGEKWSATKWIHMLPI+E WRNPACVDENDHCSAWAKAGEC+KNPVYMMGSKNELGFCRLSCKVCSP S
Subjt:  TTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVCSPPS

A0A6J1EYJ1 Procollagen-proline 4-dioxygenase3.7e-13773.99Show/hide
Query:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-T
        M S FFLAFS+  L  FP+ +RSANR PKLL  +   ++SVIRMK  GS+I IDPT VVQLSS+PRAFLYKGFLS EECQHLI+LAK  L QSLV    T
Subjt:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-T

Query:  GESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK
        G S +S +RTSTGMFL KAQD+IVA IE++IAAWTFLP+DNGEPIQILRYENGQ+Y PHFDFFQDP N+A GGHRIAT+LMYLS+VE+GGETVFP+SP K
Subjt:  GESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYM--
        + EEE  DL  C+  GYGV+PK GDALLFFS++PNVT D TSYHGSCPVIEGEKWSATKWIHMLP++E WRNP CVDEN+HCSAWAKAGECEKNP YM  
Subjt:  LSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYM--

Query:  --MGSKNELGFCRLSCKVCSPPS
          +GSK ELG+CRLSCK CSPPS
Subjt:  --MGSKNELGFCRLSCKVCSPPS

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.8e-9655.84Show/hide
Query:  MASPFFLAFSIFCLWLF-PISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAA--
        M S +FLAFS+  L +F  ISS S                            ++DPT + QLS  PRAFLYKGFLSDEEC HLI LAKGKL +S+V A  
Subjt:  MASPFFLAFSIFCLWLF-PISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAA--

Query:  GTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSP
         +GES  S+ RTS+GMFL K QD+IVA +E+++AAWTFLP +NGE +QIL YENGQKY+PHFD+F D   + +GGHRIAT+LMYLS+V KGGETVFPN  
Subjt:  GTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSP

Query:  VKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYM
         K  + +    S CAK GY V+P+ GDALLFF+++ N T D  S HGSCPVIEGEKWSAT+WIH+    +  +   CVD+++ C  WA AGECEKNP+YM
Subjt:  VKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYM

Query:  MGSKNELGFCRLSCKVC
        +GS+  LGFCR SCK C
Subjt:  MGSKNELGFCRLSCKVC

F4JAU3 Prolyl 4-hydroxylase 25.0e-8355.81Show/hide
Query:  IDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYEN
        I+P+ V Q+SSKPRAF+Y+GFL+D EC HLI+LAK  L +S VA    GES  S  RTS+G F+ K +D IV+ IE +++ WTFLP +NGE +Q+LRYE+
Subjt:  IDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYEN

Query:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPV
        GQKY+ HFD+F D  NIA GGHRIAT+L+YLS+V KGGETVFP++     +   E K DLS CAK G  V+PK G+ALLFF++  +  PD  S HG CPV
Subjt:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPV

Query:  IEGEKWSATKWIHMLPINEFWRNPA-CVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVC
        IEGEKWSATKWIH+   ++   +   C D N+ C  WA  GEC KNP YM+G+    G CR SCK C
Subjt:  IEGEKWSATKWIHMLPINEFWRNPA-CVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVC

Q8L970 Probable prolyl 4-hydroxylase 71.3e-10758.99Show/hide
Query:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-T
        M S  FLAFS+  L+  P+ S + NRF  L   +N  D SVI+MKT  S+   DPT V QLS  PR FLY+GFLSDEEC H I LAKGKL +S+VA   +
Subjt:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-T

Query:  GESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK
        GESV S+ RTS+GMFL K QD+IV+ +E+++AAWTFLP +NGE +QIL YENGQKYEPHFD+F D  N+ +GGHRIAT+LMYLS+VEKGGETVFP    K
Subjt:  GESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINE-FWRNPACVDENDHCSAWAKAGECEKNPVYMM
         ++ +    + CAK GY V+P+ GDALLFF+++PN T D+ S HGSCPV+EGEKWSAT+WIH+      F +   C+DEN  C  WAKAGEC+KNP YM+
Subjt:  LSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINE-FWRNPACVDENDHCSAWAKAGECEKNPVYMM

Query:  GSKNELGFCRLSCKVCS
        GS  + G+CR SCK CS
Subjt:  GSKNELGFCRLSCKVCS

Q8LAN3 Probable prolyl 4-hydroxylase 41.1e-8555.11Show/hide
Query:  SAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQIL
        S++ ++P+ V Q+SSKPRAF+Y+GFL++ EC H+++LAK  L +S VA   +GES  S+ RTS+G F+ K +D IV+ IE +I+ WTFLP +NGE IQ+L
Subjt:  SAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQIL

Query:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHG
        RYE+GQKY+ HFD+F D  NI  GGHR+ATILMYLS+V KGGETVFP++ +   ++  E K DLS CAK G  V+P+ GDALLFF+++P+  PD  S HG
Subjt:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHG

Query:  SCPVIEGEKWSATKWIHMLPINEFWR----NPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVC
         CPVIEGEKWSATKWIH   ++ F R    +  C D N+ C  WA  GEC KNP YM+G+    G+CR SCK C
Subjt:  SCPVIEGEKWSATKWIHMLPINEFWR----NPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVC

Q9LN20 Probable prolyl 4-hydroxylase 32.4e-6155.88Show/hide
Query:  LSSKPRAFLYKGFLSDEECQHLINLAKGKLHQS-LVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHF
        LS +PRAF+Y  FLS EEC++LI+LAK  + +S +V + TG+S  S+ RTS+G FLR+ +D+I+  IE RIA +TF+P D+GE +Q+L YE GQKYEPH+
Subjt:  LSSKPRAFLYKGFLSDEECQHLINLAKGKLHQS-LVAAGTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHF

Query:  DFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEE-KADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATK
        D+F D  N   GG R+AT+LMYLSDVE+GGETVFP + +  S      +LS C K G  V+P++GDALLF+SM P+ T D TS HG CPVI G KWS+TK
Subjt:  DFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEE-KADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATK

Query:  WIHM
        W+H+
Subjt:  WIHM

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 23.6e-8455.81Show/hide
Query:  IDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYEN
        I+P+ V Q+SSKPRAF+Y+GFL+D EC HLI+LAK  L +S VA    GES  S  RTS+G F+ K +D IV+ IE +++ WTFLP +NGE +Q+LRYE+
Subjt:  IDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYEN

Query:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPV
        GQKY+ HFD+F D  NIA GGHRIAT+L+YLS+V KGGETVFP++     +   E K DLS CAK G  V+PK G+ALLFF++  +  PD  S HG CPV
Subjt:  GQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPV

Query:  IEGEKWSATKWIHMLPINEFWRNPA-CVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVC
        IEGEKWSATKWIH+   ++   +   C D N+ C  WA  GEC KNP YM+G+    G CR SCK C
Subjt:  IEGEKWSATKWIHMLPINEFWRNPA-CVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase9.4e-10958.99Show/hide
Query:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-T
        M S  FLAFS+  L+  P+ S + NRF  L   +N  D SVI+MKT  S+   DPT V QLS  PR FLY+GFLSDEEC H I LAKGKL +S+VA   +
Subjt:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-T

Query:  GESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK
        GESV S+ RTS+GMFL K QD+IV+ +E+++AAWTFLP +NGE +QIL YENGQKYEPHFD+F D  N+ +GGHRIAT+LMYLS+VEKGGETVFP    K
Subjt:  GESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVK

Query:  LSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINE-FWRNPACVDENDHCSAWAKAGECEKNPVYMM
         ++ +    + CAK GY V+P+ GDALLFF+++PN T D+ S HGSCPV+EGEKWSAT+WIH+      F +   C+DEN  C  WAKAGEC+KNP YM+
Subjt:  LSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINE-FWRNPACVDENDHCSAWAKAGECEKNPVYMM

Query:  GSKNELGFCRLSCKVCS
        GS  + G+CR SCK CS
Subjt:  GSKNELGFCRLSCKVCS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.1e-10155.38Show/hide
Query:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-T
        M S  FLAFS+  L+  P+ S + NRF  L   +N  D SVI+MKT  S+   DPT V QLS  PR FLY+GFLSDEEC H I LAKGKL +S+VA   +
Subjt:  MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-T

Query:  GESVTSKERTS----TGMFLRKAQ----DEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGET
        GESV S++  S    +  F+        D+IV+ +E+++AAWTFLP +NGE +QIL YENGQKYEPHFD+F D  N+ +GGHRIAT+LMYLS+VEKGGET
Subjt:  GESVTSKERTS----TGMFLRKAQ----DEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGET

Query:  VFPNSPVKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINE-FWRNPACVDENDHCSAWAKAGEC
        VFP    K ++ +    + CAK GY V+P+ GDALLFF+++PN T D+ S HGSCPV+EGEKWSAT+WIH+      F +   C+DEN  C  WAKAGEC
Subjt:  VFPNSPVKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINE-FWRNPACVDENDHCSAWAKAGEC

Query:  EKNPVYMMGSKNELGFCRLSCKVCS
        +KNP YM+GS  + G+CR SCK CS
Subjt:  EKNPVYMMGSKNELGFCRLSCKVCS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.3e-9755.84Show/hide
Query:  MASPFFLAFSIFCLWLF-PISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAA--
        M S +FLAFS+  L +F  ISS S                            ++DPT + QLS  PRAFLYKGFLSDEEC HLI LAKGKL +S+V A  
Subjt:  MASPFFLAFSIFCLWLF-PISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAA--

Query:  GTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSP
         +GES  S+ RTS+GMFL K QD+IVA +E+++AAWTFLP +NGE +QIL YENGQKY+PHFD+F D   + +GGHRIAT+LMYLS+V KGGETVFPN  
Subjt:  GTGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSP

Query:  VKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYM
         K  + +    S CAK GY V+P+ GDALLFF+++ N T D  S HGSCPVIEGEKWSAT+WIH+    +  +   CVD+++ C  WA AGECEKNP+YM
Subjt:  VKLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYM

Query:  MGSKNELGFCRLSCKVC
        +GS+  LGFCR SCK C
Subjt:  MGSKNELGFCRLSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.7e-8755.11Show/hide
Query:  SAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQIL
        S++ ++P+ V Q+SSKPRAF+Y+GFL++ EC H+++LAK  L +S VA   +GES  S+ RTS+G F+ K +D IV+ IE +I+ WTFLP +NGE IQ+L
Subjt:  SAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAG-TGESVTSKERTSTGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQIL

Query:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHG
        RYE+GQKY+ HFD+F D  NI  GGHR+ATILMYLS+V KGGETVFP++ +   ++  E K DLS CAK G  V+P+ GDALLFF+++P+  PD  S HG
Subjt:  RYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPV---KLSEEEKADLSACAKVGYGVRPKLGDALLFFSMNPNVTPDTTSYHG

Query:  SCPVIEGEKWSATKWIHMLPINEFWR----NPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVC
         CPVIEGEKWSATKWIH   ++ F R    +  C D N+ C  WA  GEC KNP YM+G+    G+CR SCK C
Subjt:  SCPVIEGEKWSATKWIHMLPINEFWR----NPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCCATTTTTTCTCGCATTTTCTATCTTTTGCCTTTGGCTTTTCCCCATTTCTTCTCGCTCCGCCAATCGCTTCCCCAAATTGCTTTCACACAACAACGACAT
GGATGAATCTGTTATTAGGATGAAAACGGGTGGTTCCGCCATTACAATCGATCCCACTCATGTCGTTCAGCTTTCATCCAAACCCAGGGCTTTCTTATATAAGGGATTTT
TGTCTGATGAGGAGTGCCAACATCTTATCAATTTGGCGAAGGGTAAGCTACATCAATCATTGGTGGCGGCTGGAACAGGTGAGAGTGTTACAAGTAAAGAACGGACGAGT
ACTGGCATGTTTCTTCGTAAGGCCCAGGATGAAATAGTTGCTCGCATTGAGTCAAGGATTGCTGCGTGGACTTTCCTTCCCATTGATAATGGGGAGCCTATTCAAATACT
ACGGTATGAGAATGGACAGAAATACGAGCCACATTTTGATTTTTTTCAAGACCCAGGCAATATAGCCATTGGAGGTCATCGAATAGCGACAATCTTGATGTATTTATCTG
ATGTTGAAAAGGGTGGAGAAACAGTCTTTCCCAATTCCCCGGTTAAATTATCCGAGGAGGAAAAGGCTGACTTGTCTGCTTGCGCTAAGGTTGGCTATGGAGTAAGACCA
AAGTTGGGTGATGCTTTACTGTTCTTCAGTATGAATCCAAATGTGACGCCAGACACGACCAGCTATCACGGGAGCTGCCCAGTGATAGAGGGTGAGAAATGGTCTGCAAC
TAAATGGATTCACATGCTTCCAATCAATGAATTTTGGAGGAATCCAGCTTGTGTAGACGAAAATGACCACTGTAGTGCGTGGGCAAAAGCAGGTGAATGTGAAAAGAATC
CTGTTTATATGATGGGTTCTAAGAATGAACTTGGATTTTGTAGGTTGAGTTGCAAAGTATGCTCTCCTCCCTCATAG
mRNA sequenceShow/hide mRNA sequence
TTCAGAGAGAGTCAATGCAAAGCAATATCCCGCTGGAAGAAACTTCCATTTCCATGGTTATCAACGGCGGGTTTGCAATTGCAGCAAATCCGCTTCTGGGTTGCCTATTA
AATTTCATCATAAATGAGTATGAACCCCAAAAATCGAATCTACCTCCATTTTCGCTATGGCTTCTCCATTTTTTCTCGCATTTTCTATCTTTTGCCTTTGGCTTTTCCCC
ATTTCTTCTCGCTCCGCCAATCGCTTCCCCAAATTGCTTTCACACAACAACGACATGGATGAATCTGTTATTAGGATGAAAACGGGTGGTTCCGCCATTACAATCGATCC
CACTCATGTCGTTCAGCTTTCATCCAAACCCAGGGCTTTCTTATATAAGGGATTTTTGTCTGATGAGGAGTGCCAACATCTTATCAATTTGGCGAAGGGTAAGCTACATC
AATCATTGGTGGCGGCTGGAACAGGTGAGAGTGTTACAAGTAAAGAACGGACGAGTACTGGCATGTTTCTTCGTAAGGCCCAGGATGAAATAGTTGCTCGCATTGAGTCA
AGGATTGCTGCGTGGACTTTCCTTCCCATTGATAATGGGGAGCCTATTCAAATACTACGGTATGAGAATGGACAGAAATACGAGCCACATTTTGATTTTTTTCAAGACCC
AGGCAATATAGCCATTGGAGGTCATCGAATAGCGACAATCTTGATGTATTTATCTGATGTTGAAAAGGGTGGAGAAACAGTCTTTCCCAATTCCCCGGTTAAATTATCCG
AGGAGGAAAAGGCTGACTTGTCTGCTTGCGCTAAGGTTGGCTATGGAGTAAGACCAAAGTTGGGTGATGCTTTACTGTTCTTCAGTATGAATCCAAATGTGACGCCAGAC
ACGACCAGCTATCACGGGAGCTGCCCAGTGATAGAGGGTGAGAAATGGTCTGCAACTAAATGGATTCACATGCTTCCAATCAATGAATTTTGGAGGAATCCAGCTTGTGT
AGACGAAAATGACCACTGTAGTGCGTGGGCAAAAGCAGGTGAATGTGAAAAGAATCCTGTTTATATGATGGGTTCTAAGAATGAACTTGGATTTTGTAGGTTGAGTTGCA
AAGTATGCTCTCCTCCCTCATAGAAAAGAAAATGCTTCTTTTACTTATACACAAATCAGTGGTAAGAGTTTTTTCTTCTTACACATGTACACATGTAAATGACTTTGAGA
GGTATCACTTGGTATATTTAACTATTGAATCTCTCTGTCTCCTAGACACCTATATATCTATCTATTGTGCACACACGTACGTATGTATTTTTTTGTGGAATATTTTGATA
GGAAACTAGCTAGTCCTCTCGAATTAGCGAAGCTTGTGTCGGTTTGGTGCAACTTGTTAGCTTGCTGATTGTTAATGAATTTCTATAGTGATCATTTATATGTTTTGGTG
GAG
Protein sequenceShow/hide protein sequence
MASPFFLAFSIFCLWLFPISSRSANRFPKLLSHNNDMDESVIRMKTGGSAITIDPTHVVQLSSKPRAFLYKGFLSDEECQHLINLAKGKLHQSLVAAGTGESVTSKERTS
TGMFLRKAQDEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYLSDVEKGGETVFPNSPVKLSEEEKADLSACAKVGYGVRP
KLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPINEFWRNPACVDENDHCSAWAKAGECEKNPVYMMGSKNELGFCRLSCKVCSPPS