; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005881 (gene) of Snake gourd v1 genome

Gene IDTan0005881
OrganismTrichosanthes anguina (Snake gourd v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationLG03:76050835..76054073
RNA-Seq ExpressionTan0005881
SyntenyTan0005881
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134841.1 probable prolyl 4-hydroxylase 10 [Cucumis sativus]1.2e-15795.82Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQSRFP+RKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGS K HDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAF+YHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        ISLAKPHMQKSTVVDSETG+SKDSRVRTSSGTFL RGRDK +RTIEKR++DFSFIP EHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
        LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRV+EYKA
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA

XP_008440878.1 PREDICTED: probable prolyl 4-hydroxylase 10 [Cucumis melo]4.1e-15896.86Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQSRFP+RKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGS K HDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        ISLAKPHMQKSTVVDSETG+SKDSRVRTSSGTFL RGRDK IRTIEKRI+DFSFIP EHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
        LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRV+EYKA
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA

XP_022978860.1 probable prolyl 4-hydroxylase 10 [Cucurbita maxima]6.6e-15694.77Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQSRFP+RKSSSSST++FTLLIMFTFVILILLALGILSIPGNSGGSPK HDLSSIVRKTS+DVDEEKGE+W EVISWEPRAF+YHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        IS AKPHMQKSTVVDSETGKSKDSRVRTSSGTFL RG DKII TIEKRIADF+FIP EHGEGLQVLHYEVGQKYEPHFDYFLD+YNTKNGGQRIATVLMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
        LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA

XP_023521904.1 probable prolyl 4-hydroxylase 10 [Cucurbita pepo subsp. pepo]1.5e-15594.77Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQSR P+RKSSSSSTL+FTLLIMFTFVILILLALGILSIPGNSGGSPK HDLSSIVRKTS+DVDEEKGE+W EVISWEPRAF+YHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        IS AKPHMQKSTVVDSETGKSKDSRVRTSSGTFL RG DKII TIEKRIADF+FIP EHGEGLQVLHYEVGQKYEPHFDYFLD+YNTKNGGQRIATVLMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
        LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA

XP_038881910.1 probable prolyl 4-hydroxylase 10 isoform X2 [Benincasa hispida]3.5e-15796.15Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQSRFP+RKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPK HDLSSIVRKTSDDVDEE+GEQWVEVISWEPRAF+YHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        ISLA PHMQKSTVVDSETGKSKDSRVRTSSGTFL RGRDK IRTIEKRIADFSFIP EHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK
        LS+VEEGGETVFPAAKGNFSSVPWWNELS+CGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRV+EYK
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK

TrEMBL top hitse value%identityAlignment
A0A0A0KJH9 Fe2OG dioxygenase domain-containing protein5.8e-15895.82Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQSRFP+RKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGS K HDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAF+YHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        ISLAKPHMQKSTVVDSETG+SKDSRVRTSSGTFL RGRDK +RTIEKR++DFSFIP EHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
        LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRV+EYKA
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA

A0A1S3B1P6 probable prolyl 4-hydroxylase 102.0e-15896.86Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQSRFP+RKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGS K HDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        ISLAKPHMQKSTVVDSETG+SKDSRVRTSSGTFL RGRDK IRTIEKRI+DFSFIP EHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
        LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRV+EYKA
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA

A0A6J1EFG6 probable prolyl 4-hydroxylase 102.7e-15594.77Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQ RFPSRKSSSSSTL+FTLLIMFTFVILILLALGILSIPGNS G+PKAHDLSSIVRKTSD+VDEEKGEQW EVISWEPRAF+YHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        ISLAKPHMQKS+VVDSETGKSKDSRVRTSSGTFL RGRDKIIR IEKRIADFSF+P EHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIAT+LMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
        LSDVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRV+EYKA
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA

A0A6J1GDV6 probable prolyl 4-hydroxylase 109.3e-15694.77Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQSR P+RKSSSSSTL+FTLLIMFTFVILILLALGILSIPGNSGGSPK HDLSSIVRKTS+DVDEEKGE+W EVISWEPRAF+YHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        IS AKPHMQKSTVVDSETGKSKDSRVRTSSGTFL RG DKII TIEKRIADF+FIP EHGEGLQVLHYEVGQKYEPHFDYFLD+YNTKNGGQRIATVLMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
        LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA

A0A6J1IRF1 probable prolyl 4-hydroxylase 103.2e-15694.77Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL
        MAKHRQSRFP+RKSSSSST++FTLLIMFTFVILILLALGILSIPGNSGGSPK HDLSSIVRKTS+DVDEEKGE+W EVISWEPRAF+YHNFLTKEECEYL
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYL

Query:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY
        IS AKPHMQKSTVVDSETGKSKDSRVRTSSGTFL RG DKII TIEKRIADF+FIP EHGEGLQVLHYEVGQKYEPHFDYFLD+YNTKNGGQRIATVLMY
Subjt:  ISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
        LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA

SwissProt top hitse value%identityAlignment
F4JNU8 Probable prolyl 4-hydroxylase 83.6e-11269.1Show/hide
Query:  KHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIV-----RKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEEC
        K +Q R   RKS S+ T  FT++++  FVILIL+ LGI S+P  +  S    DL++IV     R++  D ++  G++W+EVISWEPRAF+YHNFLT EEC
Subjt:  KHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIV-----RKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEEC

Query:  EYLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATV
        E+LISLAKP M KS VVD +TGKS DSRVRTSSGTFL RG D+I+  IE RI+DF+FIP E+GEGLQVLHYEVGQ+YEPH DYF DE+N + GGQRIATV
Subjt:  EYLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATV

Query:  LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEY
        LMYLSDV+EGGETVFPAAKGN S VPWW+ELS CGK+GLSV PK+ DALLFWSMKPDASLDPSSLHGGCPVIKGNKWS+TKW  V EY
Subjt:  LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEY

F4JZ24 Probable prolyl 4-hydroxylase 102.0e-13179.17Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKT--SDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECE
        MA+ R  R PS + SS STLVF +LIM TFVILILLA GILS+P N+ GS KA+DL+SIVRKT      D+ K E+WVE+ISWEPRA +YHNFLTKEEC+
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKT--SDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECE

Query:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL
        YLI LAKPHM+KSTVVD +TGKS DSRVRTSSGTFL RGRDK IR IEKRI+DF+FIP EHGEGLQVLHYE+GQKYEPH+DYF+DEYNT+NGGQRIATVL
Subjt:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL

Query:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK
        MYLSDVEEGGETVFPAAKGN+S+VPWWNELS+CGK GLSVKPK GDALLFWSM PDA+LDPSSLHGGC VIKGNKWS+TKW+RV EYK
Subjt:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK

Q24JN5 Prolyl 4-hydroxylase 56.1e-12073.67Show/hide
Query:  RFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRK--TSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAK
        R+  RKS S ST  FT+LI+   VILILL LGILS+P  +  S K +DL++IVRK  TS   +E  GE+WVEVISWEPRA +YHNFLT EECE+LISLAK
Subjt:  RFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRK--TSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAK

Query:  PHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVE
        P M KSTVVD +TG SKDSRVRTSSGTFL+RG D+++  IEKRI+DF+FIP E+GEGLQVLHY+VGQKYEPH+DYFLDE+NTKNGGQRIATVLMYLSDV+
Subjt:  PHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVE

Query:  EGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK
        +GGETVFPAA+GN S+VPWWNELS CGK+GLSV PK+ DALLFW+M+PDASLDPSSLHGGCPV+KGNKWS+TKW  V E+K
Subjt:  EGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK

Q8LAN3 Probable prolyl 4-hydroxylase 41.7e-6652.81Show/hide
Query:  SSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIP
        +S++  +S  V+  K    V+ +S +PRAF+Y  FLT+ EC++++SLAK  +++S V D+++G+SK S VRTSSGTF+ +G+D I+  IE +I+ ++F+P
Subjt:  SSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIP

Query:  AEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWN--ELSDCGKKGLSVKPKRGDALLFWSMKPD
         E+GE +QVL YE GQKY+ HFDYF D+ N   GG R+AT+LMYLS+V +GGETVFP A+     V   N  +LSDC K+G++VKP++GDALLF+++ PD
Subjt:  AEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWN--ELSDCGKKGLSVKPKRGDALLFWSMKPD

Query:  ASLDPSSLHGGCPVIKGNKWSATKWMRVDEY
        A  DP SLHGGCPVI+G KWSATKW+ VD +
Subjt:  ASLDPSSLHGGCPVIKGNKWSATKWMRVDEY

Q9LN20 Probable prolyl 4-hydroxylase 31.9e-12173.96Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVD--EEKGEQWVEVISWEPRAFIYHNFLTKEECE
        MAK R SRF +RK S+   ++F +L M T V+L+LLA G+ S+P N+  S    DLS   R  ++  +   ++G+QW EV+SWEPRAF+YHNFL+KEECE
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVD--EEKGEQWVEVISWEPRAFIYHNFLTKEECE

Query:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL
        YLISLAKPHM KSTVVDSETGKSKDSRVRTSSGTFL+RGRDKII+TIEKRIAD++FIPA+HGEGLQVLHYE GQKYEPH+DYF+DE+NTKNGGQR+AT+L
Subjt:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL

Query:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK
        MYLSDVEEGGETVFPAA  NFSSVPW+NELS+CGKKGLSVKP+ GDALLFWSM+PDA+LDP+SLHGGCPVI+GNKWS+TKWM V EYK
Subjt:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.3e-12273.96Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVD--EEKGEQWVEVISWEPRAFIYHNFLTKEECE
        MAK R SRF +RK S+   ++F +L M T V+L+LLA G+ S+P N+  S    DLS   R  ++  +   ++G+QW EV+SWEPRAF+YHNFL+KEECE
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVD--EEKGEQWVEVISWEPRAFIYHNFLTKEECE

Query:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL
        YLISLAKPHM KSTVVDSETGKSKDSRVRTSSGTFL+RGRDKII+TIEKRIAD++FIPA+HGEGLQVLHYE GQKYEPH+DYF+DE+NTKNGGQR+AT+L
Subjt:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL

Query:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK
        MYLSDVEEGGETVFPAA  NFSSVPW+NELS+CGKKGLSVKP+ GDALLFWSM+PDA+LDP+SLHGGCPVI+GNKWS+TKWM V EYK
Subjt:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.3e-12173.67Show/hide
Query:  RFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRK--TSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAK
        R+  RKS S ST  FT+LI+   VILILL LGILS+P  +  S K +DL++IVRK  TS   +E  GE+WVEVISWEPRA +YHNFLT EECE+LISLAK
Subjt:  RFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRK--TSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAK

Query:  PHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVE
        P M KSTVVD +TG SKDSRVRTSSGTFL+RG D+++  IEKRI+DF+FIP E+GEGLQVLHY+VGQKYEPH+DYFLDE+NTKNGGQRIATVLMYLSDV+
Subjt:  PHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVE

Query:  EGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK
        +GGETVFPAA+GN S+VPWWNELS CGK+GLSV PK+ DALLFW+M+PDASLDPSSLHGGCPV+KGNKWS+TKW  V E+K
Subjt:  EGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.5e-11369.1Show/hide
Query:  KHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIV-----RKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEEC
        K +Q R   RKS S+ T  FT++++  FVILIL+ LGI S+P  +  S    DL++IV     R++  D ++  G++W+EVISWEPRAF+YHNFLT EEC
Subjt:  KHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIV-----RKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEEC

Query:  EYLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATV
        E+LISLAKP M KS VVD +TGKS DSRVRTSSGTFL RG D+I+  IE RI+DF+FIP E+GEGLQVLHYEVGQ+YEPH DYF DE+N + GGQRIATV
Subjt:  EYLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATV

Query:  LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEY
        LMYLSDV+EGGETVFPAAKGN S VPWW+ELS CGK+GLSV PK+ DALLFWSMKPDASLDPSSLHGGCPVIKGNKWS+TKW  V EY
Subjt:  LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEY

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.4e-13279.17Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKT--SDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECE
        MA+ R  R PS + SS STLVF +LIM TFVILILLA GILS+P N+ GS KA+DL+SIVRKT      D+ K E+WVE+ISWEPRA +YHNFLTKEEC+
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKT--SDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECE

Query:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL
        YLI LAKPHM+KSTVVD +TGKS DSRVRTSSGTFL RGRDK IR IEKRI+DF+FIP EHGEGLQVLHYE+GQKYEPH+DYF+DEYNT+NGGQRIATVL
Subjt:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL

Query:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK
        MYLSDVEEGGETVFPAAKGN+S+VPWWNELS+CGK GLSVKPK GDALLFWSM PDA+LDPSSLHGGC VIKGNKWS+TKW+RV EYK
Subjt:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYK

AT5G66060.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.9e-10076.79Show/hide
Query:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKT--SDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECE
        MA+ R  R PS + SS STLVF +LIM TFVILILLA GILS+P N+ GS KA+DL+SIVRKT      D+ K E+WVE+ISWEPRA +YHNFL  EEC+
Subjt:  MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKT--SDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECE

Query:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL
        YLI LAKPHM+KSTVVD +TGKS DSRVRTSSGTFL RGRDK IR IEKRI+DF+FIP EHGEGLQVLHYE+GQKYEPH+DYF+DEYNT+NGGQRIATVL
Subjt:  YLISLAKPHMQKSTVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVL

Query:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKG
        MYLSDVEEGGETVFPAAKGN+S+VPWWNELS+CGK G
Subjt:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAGCACCGGCAATCTCGTTTTCCTTCTCGGAAGTCCTCTTCTTCTTCTACTCTCGTCTTTACCCTGCTCATTATGTTCACCTTCGTCATTCTCATTCTTCTTGC
CCTTGGAATCCTCTCCATCCCCGGCAATTCCGGCGGTTCGCCCAAGGCTCATGATCTGAGCTCGATTGTGCGGAAAACTTCCGACGATGTTGACGAGGAGAAAGGGGAGC
AGTGGGTTGAAGTGATCTCATGGGAACCTAGAGCCTTCATTTACCACAATTTTCTGACAAAGGAGGAATGTGAGTACCTAATCAGCCTTGCCAAGCCTCACATGCAAAAG
TCTACCGTCGTTGACAGTGAAACTGGAAAGAGCAAAGATAGCAGAGTTCGTACTAGCTCTGGAACATTTTTGCAGAGAGGACGTGATAAGATTATTAGAACTATTGAGAA
AAGAATTGCTGATTTCAGCTTCATACCCGCAGAGCATGGAGAAGGGCTTCAGGTTCTTCATTACGAAGTGGGACAGAAATATGAACCTCATTTTGATTACTTCCTTGATG
AATACAATACCAAGAATGGAGGTCAACGTATAGCAACAGTGCTGATGTATCTCTCAGACGTTGAAGAAGGAGGCGAGACAGTGTTCCCTGCTGCCAAAGGAAACTTTAGT
TCTGTACCTTGGTGGAATGAGCTTTCAGATTGTGGGAAGAAAGGACTTTCTGTTAAACCGAAGAGGGGTGATGCGTTGCTTTTCTGGAGCATGAAGCCTGATGCCTCTCT
CGATCCATCAAGTTTGCATGGTGGTTGCCCTGTTATCAAGGGGAATAAATGGTCTGCTACTAAATGGATGCGAGTAGACGAATACAAAGCTTGA
mRNA sequenceShow/hide mRNA sequence
GATCGGTCGAAACAGCTAGAGGCAGAGCCATTAGCCAAACCCCAAAAACAAAATCAAAACAAAATCAATCTCAAATTACGGTGTCAAAGTTCACCAATTTTTCACAGCAC
AGAATCCTTGGACTTTTGAGACGAAGACCAAACACGAACCGATACTCTTCTCCAAAGGGGCTCTGCAATTCCAGATCTTGCTCTCCTCCTTCCATGGCGAAGCACCGGCA
ATCTCGTTTTCCTTCTCGGAAGTCCTCTTCTTCTTCTACTCTCGTCTTTACCCTGCTCATTATGTTCACCTTCGTCATTCTCATTCTTCTTGCCCTTGGAATCCTCTCCA
TCCCCGGCAATTCCGGCGGTTCGCCCAAGGCTCATGATCTGAGCTCGATTGTGCGGAAAACTTCCGACGATGTTGACGAGGAGAAAGGGGAGCAGTGGGTTGAAGTGATC
TCATGGGAACCTAGAGCCTTCATTTACCACAATTTTCTGACAAAGGAGGAATGTGAGTACCTAATCAGCCTTGCCAAGCCTCACATGCAAAAGTCTACCGTCGTTGACAG
TGAAACTGGAAAGAGCAAAGATAGCAGAGTTCGTACTAGCTCTGGAACATTTTTGCAGAGAGGACGTGATAAGATTATTAGAACTATTGAGAAAAGAATTGCTGATTTCA
GCTTCATACCCGCAGAGCATGGAGAAGGGCTTCAGGTTCTTCATTACGAAGTGGGACAGAAATATGAACCTCATTTTGATTACTTCCTTGATGAATACAATACCAAGAAT
GGAGGTCAACGTATAGCAACAGTGCTGATGTATCTCTCAGACGTTGAAGAAGGAGGCGAGACAGTGTTCCCTGCTGCCAAAGGAAACTTTAGTTCTGTACCTTGGTGGAA
TGAGCTTTCAGATTGTGGGAAGAAAGGACTTTCTGTTAAACCGAAGAGGGGTGATGCGTTGCTTTTCTGGAGCATGAAGCCTGATGCCTCTCTCGATCCATCAAGTTTGC
ATGGTGGTTGCCCTGTTATCAAGGGGAATAAATGGTCTGCTACTAAATGGATGCGAGTAGACGAATACAAAGCTTGAGTTGATGGTGACTAAGGTTTGGTTGACTTTTCT
TGTTATTGCTCAAAGTCACAGATGCATTTGCTGTAGGGAATCATCATTTCTATACTTTAACCTTGTGCGCGACTATCCGATCCTTAAACCTATGAATTTTGATGTGCTTA
ATATTTTAATTACGAGCTTACATTGTCATTGATTTCTTGTTTATGGCCACGAGAGAATAAGAGTTCATATTTTTGCTAGAAATGCCACCTCCGAACCTCAGTTATAGATT
CATATGGTATGTTATGTAAAAAAAAACAAACAAACAAACAAAGGTTTTATGCTTGATGTACAACTCCCTTGCTGCAACATTGTTATGTTTTTTTTATTTGATAACACTTT
TTAAAGCCTTCTCATTCTTCCAACTGGCCAGCCATTTGGAACGAAAAATAGGATTAGGGCTCTCTTACTCAA
Protein sequenceShow/hide protein sequence
MAKHRQSRFPSRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKAHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAKPHMQK
STVVDSETGKSKDSRVRTSSGTFLQRGRDKIIRTIEKRIADFSFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFS
SVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA