; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009550 (gene) of Snake gourd v1 genome

Gene IDTan0009550
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationLG07:7290145..7292833
RNA-Seq ExpressionTan0009550
SyntenyTan0009550
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579383.1 putative prolyl 4-hydroxylase 12, partial [Cucurbita argyrosperma subsp. sororia]5.9e-15084.08Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRLNFLLLL  AFSFSSCLA+SN ISGRKGLRDQ+V+S  LSYSNHS RIDPSRVVQ+SW+PR FLYKGFLSD ECDHLI+LASNSEDKPS N+AGSR
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF
        NTVSTKFL NSGAILNTTDDIIAR+ENRIAVWTFLPKD+SMP QIM YGGEEA  HKY FGNRSAM SSEPLMATVVLYLSDSASGGE+LFP SKVK  F
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF

Query:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS
        WSD+RKK+N LRPV+GNA+LFFSVHLNASPDKS YH+R+P+LDGKLWVATKFFY+RP  TGN++ VES + DDCIDED+SCP+WAAIGEC+RNAVFMIGS
Subjt:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

XP_022159842.1 probable prolyl 4-hydroxylase 12 [Momordica charantia]3.0e-15487.14Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRL  LLLL TA SF SCLA+SNLISGRKGLRDQL++SVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSD ECDHLISLA++SEDKPSGNS  S 
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFW
        NTV TK L +SGAILNTTDDIIAR+ENRIAVWTFLPKDYSMPLQI+ YGGEEAEHKY+FGNRSAM SSEPLMATVVLYLSDSASGGEM FPESKVKS FW
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFW

Query:  SDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        SDRRKK+NILRPV+GNA+L FSVHLNASPDKSS HTRSP+LDG+LW+ATKFFYLRP TGNK+T E  G DC DEDKSCPQWAAIGECERNAVFMIGSPDY
Subjt:  SDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

XP_022922237.1 probable prolyl 4-hydroxylase 12 [Cucurbita moschata]3.8e-14983.76Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRL FLLLL  AFSFSSCLA+SN ISGRKGLRDQ+V+S  LSYSNHS RIDPSRVVQ+SW+PR FLYKGFLSD ECDHLI+LASNSEDKPS N+AGSR
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF
        NTVSTKFL NSGAILNTTDDII R+ENRIAVWTFLPKD+SMP QIM YGGEEA  HKY FGNRSAM SSEPLMATVVLYLSDSASGGE+LFP SKVK  F
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF

Query:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS
        WSDRRKK+N LRPV+GNA+LFFSVHLNASPDKS YH+R+P+LDGKLWVATKFFY+RP  TGN++ VES + DDCIDED+SCP+WAAIGEC+RNAVFMIGS
Subjt:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

XP_023549812.1 probable prolyl 4-hydroxylase 12 [Cucurbita pepo subsp. pepo]6.5e-14983.12Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRLNFLLL   AFSFSSCLA+SN +SGRKGLRDQ+V+S  LSYSNH  RIDPSRVVQ+SW+PRVFLYKGFLSD ECDHLI+LASNSEDKPS ++AGSR
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF
        NTVSTKFL NSGA+LNTTDDIIAR+ENRIAVWTFLPKD+SMP QIM YGGEEA  HKY FGNRSAM SSEPLMATVVLYLSDSASGGE+LFP SKVK  F
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF

Query:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS
        WSDRRKK+N LRPV+GNA+LFFSVHLNASPDKS YH+R+P+LDGKLWVATKFFY+RP  TGN++ VES + DDCIDED+SCP+WAAIGEC+RNAVFMIGS
Subjt:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

XP_038906497.1 probable prolyl 4-hydroxylase 12 [Benincasa hispida]1.4e-15688.42Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRLNFLLLL TAFSFS+CLA+SNLISGRKGLRDQLVD  PLSYSNHSGRIDPSRVVQVSW+PRVFLYKGFLSD ECDHLISLASNSED PSGNSAGS 
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFW
        NTVSTK LN+SG ILNT+DDIIAR+EN+IAVWTFLPKD+ MP QIM Y GEEAEHKY +GN SAMSSSEPLMATVVLYLSDSA GGEMLFPESKVKS FW
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFW

Query:  SDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        SDRRKK+N LRPV+GNAILFFSVHLNASPDKSSYHTRSP+L+G+LWVATKFFYLRPTTGNK TVES  D CIDEDKSCPQWAAIGECERN VFMIGSPDY
Subjt:  SDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

TrEMBL top hitse value%identityAlignment
A0A1S3AT39 Procollagen-proline 4-dioxygenase5.4e-14984.94Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRLNFLLL  TAFSFS+CLA+SNLISGRKGLRDQLVD  PLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD ECDHLISLASNSED PS NSAGS 
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF
        NTVST+ LN SG ILNTTDDIIAR+ENRIAVWT LPKD+ MP QIM Y GEEA+HKY +GNRSAM SSSEPLMATVVLYLSDSASGGEMLFPESKVKS F
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF

Query:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD
        WS RRKK N LRPV+GNAILFFSVHLNASPDKSSYH R P+ +G+LWVATKF YLRP TGNK+T++S  D CIDEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD

Query:  YYGTCRKSCNAC
        YYGTCRKSCNAC
Subjt:  YYGTCRKSCNAC

A0A5A7TKX1 Procollagen-proline 4-dioxygenase2.0e-14884.62Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRLNFLLL  TAFSFS+CLA+SNLISGRKGLRDQLVD  PLSYSN S RIDPSRVVQVSWRPRVFLYKGFLSD ECDHLISLASNS+D PS NSAGS 
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF
        NTVST+ LN SG ILNTTDDIIAR+ENRIAVWT LPKD+ MP QIM Y GEEA+HKY +GNRSAM SSSEPLMATVVLYLSDSASGGEMLFPESKVKS F
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAM-SSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF

Query:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD
        WS RRKK N LRPV+GNAILFFSVHLNASPDKSSYH R P+ +G+LWVATKF YLRP TGNK+T++S  D CIDEDKSCPQWAAIGECERNAVFM+GSPD
Subjt:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPD

Query:  YYGTCRKSCNAC
        YYGTCRKSCNAC
Subjt:  YYGTCRKSCNAC

A0A6J1E0X9 Procollagen-proline 4-dioxygenase1.5e-15487.14Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRL  LLLL TA SF SCLA+SNLISGRKGLRDQL++SVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSD ECDHLISLA++SEDKPSGNS  S 
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFW
        NTV TK L +SGAILNTTDDIIAR+ENRIAVWTFLPKDYSMPLQI+ YGGEEAEHKY+FGNRSAM SSEPLMATVVLYLSDSASGGEM FPESKVKS FW
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFW

Query:  SDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY
        SDRRKK+NILRPV+GNA+L FSVHLNASPDKSS HTRSP+LDG+LW+ATKFFYLRP TGNK+T E  G DC DEDKSCPQWAAIGECERNAVFMIGSPDY
Subjt:  SDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDY

Query:  YGTCRKSCNAC
        YGTCRKSCNAC
Subjt:  YGTCRKSCNAC

A0A6J1E2P0 Procollagen-proline 4-dioxygenase1.8e-14983.76Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRL FLLLL  AFSFSSCLA+SN ISGRKGLRDQ+V+S  LSYSNHS RIDPSRVVQ+SW+PR FLYKGFLSD ECDHLI+LASNSEDKPS N+AGSR
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF
        NTVSTKFL NSGAILNTTDDII R+ENRIAVWTFLPKD+SMP QIM YGGEEA  HKY FGNRSAM SSEPLMATVVLYLSDSASGGE+LFP SKVK  F
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF

Query:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS
        WSDRRKK+N LRPV+GNA+LFFSVHLNASPDKS YH+R+P+LDGKLWVATKFFY+RP  TGN++ VES + DDCIDED+SCP+WAAIGEC+RNAVFMIGS
Subjt:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

A0A6J1IBS3 Procollagen-proline 4-dioxygenase3.5e-14883.12Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR
        MDSRLNFLLLL  AFSF SCLA+SN ISGRKGLRDQ+V+S  LSYSNHS RIDPSRVVQ+SW+PR FLYKGFLSD ECDHLI+LASNSEDKPS N+AGSR
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSR

Query:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF
        NTVSTKFL NSGAILNTTDDIIAR+ENRIAVW FLPKD+SMP QIM YGGEEA   KY FGNRSAM SSEPLMATVVLYLSDSA+GGE+LFP SKVK  F
Subjt:  NTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAE-HKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSF

Query:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS
        WSDRRKK+N LRPV+GNA+LFFSVHLNASPDKS YH+R+P+LDGKLWVATKFFY+RP  TGN++ VES + DDCIDED+SCP+WAAIGEC+RNAVFMIGS
Subjt:  WSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRP-TTGNKYTVES-IGDDCIDEDKSCPQWAAIGECERNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 62.2e-5441.09Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPS-GNSAGSRNTVSTKFLNNSGAIL-NTTDDIIARVENRIAVWTFLPKDYSMPL
        S+ S  +DP+R+ Q+SW PR FLYKGFLSD ECDHLI LA    +K        S  +  ++   +SG  L    DDI+A VE ++A WTFLP++    L
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPS-GNSAGSRNTVSTKFLNNSGAIL-NTTDDIIARVENRIAVWTFLPKDYSMPL

Query:  QIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHT
        QI+HY  G +   H   F ++ A+      +ATV++YLS+   GGE +FP  K     +K   WS   K+   ++P +G+A+LFF++HLN + D +S H 
Subjt:  QIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHT

Query:  RSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
          PV++G+ W AT++ ++R + G K  V      C+D+ +SC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  RSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 22.8e-4636.01Show/hide
Query:  LVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLA-SNSEDKPSGNSAGSRNTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFL
        L+ S     S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  N +     ++    + VS    ++   I    D I++ +E++++ WTFL
Subjt:  LVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLA-SNSEDKPSGNSAGSRNTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFL

Query:  PKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKS--------SFWSDRRKKSNILRPVRGNAILFFSVHL
        PK+    LQ++ Y  G +   H   F ++  ++     +ATV+LYLS+   GGE +FP+++  S           SD  KK   ++P +GNA+LFF++  
Subjt:  PKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKS--------SFWSDRRKKSNILRPVRGNAILFFSVHL

Query:  NASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +A PD  S H   PV++G+ W ATK+ ++     +   + +   +C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  NASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 122.8e-6243.55Show/hide
Query:  FLLLLVTAFSFSSCLARSNLISGRKGLRDQLV----DSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSRNT
        FL+L++T  S S           RK LRD+ +    D    SY   S  +DP+RV+Q+SW PRVFLY+GFLS+ ECDHLISL   + +  S ++ G    
Subjt:  FLLLLVTAFSFSSCLARSNLISGRKGLRDQLV----DSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSRNT

Query:  VSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKY-IFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFWS
                        D ++A +E +++ WTFLP +    +++  Y  E++  K   FG   +    E L+ATVVLYLS++  GGE+LFP S++K    +
Subjt:  VSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKY-IFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFWS

Query:  DRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
           +  NILRPV+GNAILFF+  LNAS D  S H R PV+ G+L VATK  Y +     +  +E  G +C DED++C +WA +GEC++N V+MIGSPDYY
Subjt:  DRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

Query:  GTCRKSCNAC
        GTCRKSCNAC
Subjt:  GTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 75.3e-5336.25Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSN-LISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGS
        MDSR+     L   F+     +  N  ++     RD  V  + +  S  S   DP+RV Q+SW PRVFLY+GFLSD ECDH I LA    +K       S
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSN-LISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGS

Query:  RNTVSTKFLNNSGAILN-TTDDIIARVENRIAVWTFLPKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFP-----
          +V ++   +SG  L+   DDI++ VE ++A WTFLP++    +QI+HY  G +   H   F +++ +      +ATV++YLS+   GGE +FP     
Subjt:  RNTVSTKFLNNSGAILN-TTDDIIARVENRIAVWTFLPKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFP-----

Query:  ESKVKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNA
         +++K   W++  K+   ++P +G+A+LFF++H NA+ D +S H   PV++G+ W AT++ +++ +    +  +S    C+DE+ SC +WA  GEC++N 
Subjt:  ESKVKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNA

Query:  VFMIGSPDYYGTCRKSCNAC
         +M+GS   +G CRKSC AC
Subjt:  VFMIGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 41.8e-4532.78Show/hide
Query:  LARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSRNTVSTKFLNNSGAILNT-TD
        +AR  L+     +   L+ S     S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA  S  + +     S  +  ++   +SG  ++   D
Subjt:  LARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSRNTVSTKFLNNSGAILNT-TD

Query:  DIIARVENRIAVWTFLPKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKS--------SFWSDRRKKSNI
         I++ +E++I+ WTFLPK+    +Q++ Y  G +   H   F ++  +      MAT+++YLS+   GGE +FP++++ S           SD  K+   
Subjt:  DIIARVENRIAVWTFLPKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKS--------SFWSDRRKKSNI

Query:  LRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN
        ++P +G+A+LFF++H +A PD  S H   PV++G+ W ATK+ ++     +   + +   +C D ++SC +WA +GEC +N  +M+G+ +  G CR+SC 
Subjt:  LRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCN

Query:  AC
        AC
Subjt:  AC

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 22.0e-4736.01Show/hide
Query:  LVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLA-SNSEDKPSGNSAGSRNTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFL
        L+ S     S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  N +     ++    + VS    ++   I    D I++ +E++++ WTFL
Subjt:  LVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLA-SNSEDKPSGNSAGSRNTVSTKFLNNSGAILNTTDDIIARVENRIAVWTFL

Query:  PKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKS--------SFWSDRRKKSNILRPVRGNAILFFSVHL
        PK+    LQ++ Y  G +   H   F ++  ++     +ATV+LYLS+   GGE +FP+++  S           SD  KK   ++P +GNA+LFF++  
Subjt:  PKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKS--------SFWSDRRKKSNILRPVRGNAILFFSVHL

Query:  NASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
        +A PD  S H   PV++G+ W ATK+ ++     +   + +   +C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  NASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase3.8e-5436.25Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSN-LISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGS
        MDSR+     L   F+     +  N  ++     RD  V  + +  S  S   DP+RV Q+SW PRVFLY+GFLSD ECDH I LA    +K       S
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSN-LISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGS

Query:  RNTVSTKFLNNSGAILN-TTDDIIARVENRIAVWTFLPKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFP-----
          +V ++   +SG  L+   DDI++ VE ++A WTFLP++    +QI+HY  G +   H   F +++ +      +ATV++YLS+   GGE +FP     
Subjt:  RNTVSTKFLNNSGAILN-TTDDIIARVENRIAVWTFLPKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFP-----

Query:  ESKVKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNA
         +++K   W++  K+   ++P +G+A+LFF++H NA+ D +S H   PV++G+ W AT++ +++ +    +  +S    C+DE+ SC +WA  GEC++N 
Subjt:  ESKVKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNA

Query:  VFMIGSPDYYGTCRKSCNAC
         +M+GS   +G CRKSC AC
Subjt:  VFMIGSPDYYGTCRKSCNAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.1e-5336.36Show/hide
Query:  MDSRLNFLLLLVTAFSFSSCLARSN-LISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLA------SNSEDKPS
        MDSR+     L   F+     +  N  ++     RD  V  + +  S  S   DP+RV Q+SW PRVFLY+GFLSD ECDH I LA      S   D  S
Subjt:  MDSRLNFLLLLVTAFSFSSCLARSN-LISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLA------SNSEDKPS

Query:  GNSAGSRNTV-----STKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGG
        G S  S ++V     S+ F+ N  ++    DDI++ VE ++A WTFLP++    +QI+HY  G +   H   F +++ +      +ATV++YLS+   GG
Subjt:  GNSAGSRNTV-----STKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGG

Query:  EMLFP-----ESKVKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQW
        E +FP      +++K   W++  K+   ++P +G+A+LFF++H NA+ D +S H   PV++G+ W AT++ +++ +    +  +S    C+DE+ SC +W
Subjt:  EMLFP-----ESKVKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQW

Query:  AAIGECERNAVFMIGSPDYYGTCRKSCNAC
        A  GEC++N  +M+GS   +G CRKSC AC
Subjt:  AAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.5e-5541.09Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPS-GNSAGSRNTVSTKFLNNSGAIL-NTTDDIIARVENRIAVWTFLPKDYSMPL
        S+ S  +DP+R+ Q+SW PR FLYKGFLSD ECDHLI LA    +K        S  +  ++   +SG  L    DDI+A VE ++A WTFLP++    L
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPS-GNSAGSRNTVSTKFLNNSGAIL-NTTDDIIARVENRIAVWTFLPKDYSMPL

Query:  QIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHT
        QI+HY  G +   H   F ++ A+      +ATV++YLS+   GGE +FP  K     +K   WS   K+   ++P +G+A+LFF++HLN + D +S H 
Subjt:  QIMHY--GGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESK-----VKSSFWSDRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHT

Query:  RSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
          PV++G+ W AT++ ++R + G K  V      C+D+ +SC +WA  GECE+N ++M+GS    G CRKSC AC
Subjt:  RSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase2.0e-6343.55Show/hide
Query:  FLLLLVTAFSFSSCLARSNLISGRKGLRDQLV----DSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSRNT
        FL+L++T  S S           RK LRD+ +    D    SY   S  +DP+RV+Q+SW PRVFLY+GFLS+ ECDHLISL   + +  S ++ G    
Subjt:  FLLLLVTAFSFSSCLARSNLISGRKGLRDQLV----DSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSRNT

Query:  VSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKY-IFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFWS
                        D ++A +E +++ WTFLP +    +++  Y  E++  K   FG   +    E L+ATVVLYLS++  GGE+LFP S++K    +
Subjt:  VSTKFLNNSGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKY-IFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFWS

Query:  DRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
           +  NILRPV+GNAILFF+  LNAS D  S H R PV+ G+L VATK  Y +     +  +E  G +C DED++C +WA +GEC++N V+MIGSPDYY
Subjt:  DRRKKSNILRPVRGNAILFFSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

Query:  GTCRKSCNAC
        GTCRKSCNAC
Subjt:  GTCRKSCNAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGTCTTAACTTTCTGCTTCTTTTAGTGACTGCATTTTCATTCTCAAGCTGCCTTGCACGAAGCAATTTGATTAGTGGTCGGAAGGGTTTAAGGGACCAGTT
GGTCGACAGTGTTCCTTTGAGCTACTCAAATCATTCTGGAAGAATCGACCCTTCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTCT
CAGATGCGGAATGTGATCACCTTATTTCTTTGGCTTCAAATTCAGAAGATAAACCTTCTGGGAACAGTGCTGGTTCTAGGAACACTGTCTCAACCAAATTCCTTAACAAT
TCAGGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGGTTGAAAATCGAATTGCAGTGTGGACTTTTCTCCCAAAAGATTATAGCATGCCTTTACAGATTATGCA
TTACGGGGGTGAAGAAGCAGAGCATAAGTACATTTTTGGCAACAGATCTGCAATGTCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTACCTCTCAGATTCCGCTA
GCGGTGGCGAGATGCTCTTTCCTGAATCAAAGGTAAAGAGCAGCTTTTGGTCAGACCGGAGAAAGAAAAGCAACATTCTGAGACCAGTGAGAGGCAATGCAATTCTTTTT
TTCTCTGTTCATCTTAATGCTTCTCCAGACAAGAGTAGCTACCATACCCGATCACCAGTACTTGATGGGAAATTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCAAC
CACTGGGAATAAATACACAGTTGAGTCCATTGGAGACGACTGCATTGATGAAGATAAAAGCTGCCCCCAGTGGGCTGCCATTGGCGAATGCGAACGAAACGCTGTTTTCA
TGATTGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAATGCATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTCGTCTTAACTTTCTGCTTCTTTTAGTGACTGCATTTTCATTCTCAAGCTGCCTTGCACGAAGCAATTTGATTAGTGGTCGGAAGGGTTTAAGGGACCAGTT
GGTCGACAGTGTTCCTTTGAGCTACTCAAATCATTCTGGAAGAATCGACCCTTCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTCT
CAGATGCGGAATGTGATCACCTTATTTCTTTGGCTTCAAATTCAGAAGATAAACCTTCTGGGAACAGTGCTGGTTCTAGGAACACTGTCTCAACCAAATTCCTTAACAAT
TCAGGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGGTTGAAAATCGAATTGCAGTGTGGACTTTTCTCCCAAAAGATTATAGCATGCCTTTACAGATTATGCA
TTACGGGGGTGAAGAAGCAGAGCATAAGTACATTTTTGGCAACAGATCTGCAATGTCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTACCTCTCAGATTCCGCTA
GCGGTGGCGAGATGCTCTTTCCTGAATCAAAGGTAAAGAGCAGCTTTTGGTCAGACCGGAGAAAGAAAAGCAACATTCTGAGACCAGTGAGAGGCAATGCAATTCTTTTT
TTCTCTGTTCATCTTAATGCTTCTCCAGACAAGAGTAGCTACCATACCCGATCACCAGTACTTGATGGGAAATTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCAAC
CACTGGGAATAAATACACAGTTGAGTCCATTGGAGACGACTGCATTGATGAAGATAAAAGCTGCCCCCAGTGGGCTGCCATTGGCGAATGCGAACGAAACGCTGTTTTCA
TGATTGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAATGCATGTTGA
Protein sequenceShow/hide protein sequence
MDSRLNFLLLLVTAFSFSSCLARSNLISGRKGLRDQLVDSVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDAECDHLISLASNSEDKPSGNSAGSRNTVSTKFLNN
SGAILNTTDDIIARVENRIAVWTFLPKDYSMPLQIMHYGGEEAEHKYIFGNRSAMSSSEPLMATVVLYLSDSASGGEMLFPESKVKSSFWSDRRKKSNILRPVRGNAILF
FSVHLNASPDKSSYHTRSPVLDGKLWVATKFFYLRPTTGNKYTVESIGDDCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC