; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy5G102700 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy5G102700
OrganismCucumis hystrix (Cucumber (hystrix) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationchrH05:16657775..16680030
RNA-Seq ExpressionChy5G102700
SyntenyChy5G102700
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF2543524.1 hypothetical protein F2Q68_00028798 [Brassica cretica]3.19e-19345.41Show/hide
Query:  DSWVECISWEPRAFIYHNFLSEKECSQLINLAKPRMERSLVSGQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVG
        + W+E ISWEPRAF+YHNFL+ +EC  LI+LAKP M +S V+   T      S  RTSSG FL  G +++VR IE +I++FTFIPVENGE L +LHYEVG
Subjt:  DSWVECISWEPRAFIYHNFLSEKECSQLINLAKPRMERSLVSGQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVG

Query:  QKFEPHYDYTHPDSFSFKSLGQRNATLVMYLSDVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHVSLGK
        QK+EPH+DY   D  + K  GQR AT++MYLSDV EGG TVFP AK   S    WW +L + GK+ GLSV PK  DALLFWS++PDG+LDP+SLH     
Subjt:  QKFEPHYDYTHPDSFSFKSLGQRNATLVMYLSDVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHVSLGK

Query:  YIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPLETSHHRFSSVRHTAFLSDGLGKKGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHME
          K+                           +LRF       H R                    W E IS +PRA +YHNFL  EEC +LISLAKP+M 
Subjt:  YIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPLETSHHRFSSVRHTAFLSDGLGKKGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHME

Query:  KSTVVDNETGKNVDSSVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDFFDDEYNLKEIGQRMATLLMYLSDVEEGGE
        +S V++  TG  V SS RTS+G  + RG DKI+  IEKRI++FTF+P+E+GE LQ++HYEVGQK D H+D             R+AT LMYLSDV+EGGE
Subjt:  KSTVVDNETGKNVDSSVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDFFDDEYNLKEIGQRMATLLMYLSDVEEGGE

Query:  TVFPAAKGNFSSVP-------WWNELSECGR--GGLSVKPKMGDALLFWSMKPDTTLDPTTLHVSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIA
        T FP + G  S  P       +WN+  +  +    LSV P    ALLFWSMKPD +LDP+TL  +  K                               +
Subjt:  TVFPAAKGNFSSVP-------WWNELSECGR--GGLSVKPKMGDALLFWSMKPDTTLDPTTLHVSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIA

Query:  LRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKI
        LRF  P E                        +W E IS +PRAF+YHNFL  EEC +LISLAKP+M +S V  S  G    S  RTS+G F++RG DK 
Subjt:  LRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKI

Query:  IRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPK
        +  IE+RI++FTF+P+E+GE LQ++HY++GQK+D H+D          G  R+AT LMYLSDV++GGET FP + G                  LSV PK
Subjt:  IRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPK

Query:  MGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH
         GDA+LFW+ +PD + DP+S H   PVI+GNKW+ TKW H
Subjt:  MGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH

KAG6586330.1 putative prolyl 4-hydroxylase 3, partial [Cucurbita argyrosperma subsp. sororia]2.84e-31269.69Show/hide
Query:  VSLGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPLETSH----HRFSSVRHTAFLSDGLGKKGDQWVEFISWEPRAFVYHNFLSKEECLYL
        VS GKYIK QG+KWSTF+LSK+IM  +LALG  M IA RFFSP E+SH    HR +SV+H+A  SDGLGK+ DQWVE ISWEPRAFVYHNFLSKEECLYL
Subjt:  VSLGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPLETSH----HRFSSVRHTAFLSDGLGKKGDQWVEFISWEPRAFVYHNFLSKEECLYL

Query:  ISLAKPHMEKSTVVDNETGKNVDSSVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDFFDDEYNLKEIGQRMATLLMY
        ISLAKP+MEKS+VVDN+TGKN+DS  RTSSGMFL RGQ+KI+ NIEKRIADFTFIP+EHGE LQILHYEVGQKYDAHYDFF DE+N+K  GQR+ATLLMY
Subjt:  ISLAKPHMEKSTVVDNETGKNVDSSVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDFFDDEYNLKEIGQRMATLLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTTLH-------------------------------------
        LSDVEEGGETVFPAA+GNFSS+P WNE SECG+GGLS+KPKMGDALLFWSM+PD TLDPT++H                                     
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTTLH-------------------------------------

Query:  ------------------------------------------------------------VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRF
                                                                    VSKGKYIK Q +KWSTF+LSK+IMA +LALG  MLIA RF
Subjt:  ------------------------------------------------------------VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRF

Query:  FSPPETSH----HRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDK
        FSPPE+SH    HR +SV+H A  SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKP+M+KSTVVD+KTG+S+DSRVRTSSGMFL RGQ+K
Subjt:  FSPPETSH----HRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDK

Query:  IIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKP
        I+ NIEKRIADFTFIP+EHGE LQILHY+VGQKYDAH+DYF DE+NIK+GGQRMATLLMYLSDVEEGGETVFPAA+GNFSS+P WNELSECGKGGLSVKP
Subjt:  IIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKP

Query:  KMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWM
        KMGDALLFWSMKPD T+DPTSLHGACPVIRGNKWSCTKWM
Subjt:  KMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWM

XP_008456388.1 PREDICTED: probable prolyl 4-hydroxylase 3 isoform X2 [Cucumis melo]6.07e-19296.79Show/hide
Query:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
        VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR  SVR TAF SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
Subjt:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA

Query:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
        KPHMEKSTVVDSKTG+SVDSRVRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGEGLQILHY+VGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
Subjt:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV

Query:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK
        EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH  K
Subjt:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK

XP_011648735.2 probable prolyl 4-hydroxylase 3 [Cucumis sativus]1.27e-19698.21Show/hide
Query:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
        +SKGKYIKLQG+KWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
Subjt:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA

Query:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
        KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHY+VGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
Subjt:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV

Query:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK
        EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH  K
Subjt:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK

XP_038889689.1 probable prolyl 4-hydroxylase 3 [Benincasa hispida]7.66e-18692.61Show/hide
Query:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSH----HRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYL
        +SKGKY K+QGKKWSTF+LSKMIMALVLALGFFML+ALRFFSPPETSH    H  +SVRH+A  SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYL
Subjt:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSH----HRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYL

Query:  ISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMY
        ISLAKPHMEKSTVVDSKTG+SVDSRVRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGEGLQILHY+VGQKYDAHYDYFVDEYNIKKGGQRMATLLMY
Subjt:  ISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMY

Query:  LSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK
        LSDVEEGGETVFPAA+GNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH  K
Subjt:  LSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK

TrEMBL top hitse value%identityAlignment
A0A0A0LFF5 Fe2OG dioxygenase domain-containing protein8.1e-14798.44Show/hide
Query:  MALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVR
        MALVLALGFFMLIALRF SPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVR
Subjt:  MALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVR

Query:  TSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNE
        TSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHY+VGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNE
Subjt:  TSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNE

Query:  LSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK
        LSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH  K
Subjt:  LSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK

A0A1S3C2P6 probable prolyl 4-hydroxylase 3 isoform X25.0e-15796.79Show/hide
Query:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
        VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR  SVR TAF SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
Subjt:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA

Query:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
        KPHMEKSTVVDSKTG+SVDSRVRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGEGLQILHY+VGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
Subjt:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV

Query:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK
        EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH  K
Subjt:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK

A0A1S3C367 probable prolyl 4-hydroxylase 3 isoform X12.9e-14497.31Show/hide
Query:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
        VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR  SVR TAF SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
Subjt:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA

Query:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
        KPHMEKSTVVDSKTG+SVDSRVRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGEGLQILHY+VGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
Subjt:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV

Query:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHG
        EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHG
Subjt:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHG

A0A1S4DZG7 probable prolyl 4-hydroxylase 33.5e-15092.5Show/hide
Query:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
        VS GKYIKLQGKKWSTFQLSKMIMALVLALGFFML AL FFSPPETSHHR SSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
Subjt:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA

Query:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
        KPHMEKSTVVDS+TG+SVDS VRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGE +QILHY VGQKYDAHYD+FVDEYN+K  GQRMATLLMYLSDV
Subjt:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV

Query:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK
        EEGGETVFPAAKGNFSSVPWWNELSECGK GLS+KPKMGDALLFWSMKPD TLDPTSLHGACPVIRGNKWSCTKW+H  K
Subjt:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIK

A0A5D3BDQ0 Putative prolyl 4-hydroxylase 3 isoform X12.9e-14497.31Show/hide
Query:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
        VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR  SVR TAF SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA
Subjt:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLA

Query:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
        KPHMEKSTVVDSKTG+SVDSRVRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGEGLQILHY+VGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV
Subjt:  KPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDV

Query:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHG
        EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHG
Subjt:  EEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHG

SwissProt top hitse value%identityAlignment
F4JNU8 Probable prolyl 4-hydroxylase 82.3e-9860.93Show/hide
Query:  KLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR---------FSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLIS
        +L+ K   +F      + +++     +L+ L  FS P T+              +++      D     GD+W+E ISWEPRAFVYHNFL+ EEC +LIS
Subjt:  KLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR---------FSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLIS

Query:  LAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLS
        LAKP M KS VVD KTG+S+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHY+VGQ+Y+ H+DYF DE+N++KGGQR+AT+LMYLS
Subjt:  LAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLS

Query:  DVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH
        DV+EGGETVFPAAKGN S VPWW+ELS+CGK GLSV PK  DALLFWSMKPDA+LDP+SLHG CPVI+GNKWS TKW H
Subjt:  DVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH

F4JZ24 Probable prolyl 4-hydroxylase 103.9e-10669.12Show/hide
Query:  SKMIMALVLALGFFMLIALRFFSPPETSHHRFSS--------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD
        S ++ A+++   F +LI L F      S++  SS        VR T   S     + ++WVE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD
Subjt:  SKMIMALVLALGFFMLIALRFFSPPETSHHRFSS--------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD

Query:  SKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAA
         KTG+S DSRVRTSSG FL RG+DK IR IEKRI+DFTFIP+EHGEGLQ+LHY++GQKY+ HYDYF+DEYN + GGQR+AT+LMYLSDVEEGGETVFPAA
Subjt:  SKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAA

Query:  KGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWM--HYIKV
        KGN+S+VPWWNELSECGKGGLSVKPKMGDALLFWSM PDATLDP+SLHG C VI+GNKWS TKW+  H  KV
Subjt:  KGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWM--HYIKV

Q24JN5 Prolyl 4-hydroxylase 59.6e-9759.45Show/hide
Query:  SKGK-YIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSS------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
        SK K +++ Q +K  +       + ++L +   +L+ L   S P  + +   +      VR +   S      G++WVE ISWEPRA VYHNFL+ EEC 
Subjt:  SKGK-YIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSS------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLL
        +LISLAKP M KSTVVD KTG S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP+E+GEGLQ+LHY+VGQKY+ HYDYF+DE+N K GGQR+AT+L
Subjt:  YLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLL

Query:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIKVERFK
        MYLSDV++GGETVFPAA+GN S+VPWWNELS+CGK GLSV PK  DALLFW+M+PDA+LDP+SLHG CPV++GNKWS TKW H   V  FK
Subjt:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIKVERFK

Q8L970 Probable prolyl 4-hydroxylase 78.8e-6655.5Show/hide
Query:  ISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHY
        +SW PR F+Y  FLS EEC + I LAK  +EKS V D+ +GESV+S VRTSSGMFL++ QD I+ N+E ++A +TF+P E+GE +QILHY+ GQKY+ H+
Subjt:  ISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHY

Query:  DYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTK
        DYF D+ N++ GG R+AT+LMYLS+VE+GGETVFP  KG  + +   +  +EC K G +VKP+ GDALLF+++ P+AT D  SLHG+CPV+ G KWS T+
Subjt:  DYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTK

Query:  WMHYIKVER
        W+H    ER
Subjt:  WMHYIKVER

Q9LN20 Probable prolyl 4-hydroxylase 31.0e-11471.99Show/hide
Query:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPP----ETSHHRFSSVRHTAF-LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLY
        ++K ++ + Q +KWST  L  + M  +L +   ML+A   FS P    E+S    S  R  A   S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC Y
Subjt:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPP----ETSHHRFSSVRHTAF-LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLY

Query:  LISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLM
        LISLAKPHM KSTVVDS+TG+S DSRVRTSSG FL RG+DKII+ IEKRIAD+TFIP +HGEGLQ+LHY+ GQKY+ HYDYFVDE+N K GGQRMAT+LM
Subjt:  LISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLM

Query:  YLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH
        YLSDVEEGGETVFPAA  NFSSVPW+NELSECGK GLSVKP+MGDALLFWSM+PDATLDPTSLHG CPVIRGNKWS TKWMH
Subjt:  YLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.3e-11671.99Show/hide
Query:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPP----ETSHHRFSSVRHTAF-LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLY
        ++K ++ + Q +KWST  L  + M  +L +   ML+A   FS P    E+S    S  R  A   S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC Y
Subjt:  VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPP----ETSHHRFSSVRHTAF-LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLY

Query:  LISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLM
        LISLAKPHM KSTVVDS+TG+S DSRVRTSSG FL RG+DKII+ IEKRIAD+TFIP +HGEGLQ+LHY+ GQKY+ HYDYFVDE+N K GGQRMAT+LM
Subjt:  LISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLM

Query:  YLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH
        YLSDVEEGGETVFPAA  NFSSVPW+NELSECGK GLSVKP+MGDALLFWSM+PDATLDPTSLHG CPVIRGNKWS TKWMH
Subjt:  YLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.8e-9859.45Show/hide
Query:  SKGK-YIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSS------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL
        SK K +++ Q +K  +       + ++L +   +L+ L   S P  + +   +      VR +   S      G++WVE ISWEPRA VYHNFL+ EEC 
Subjt:  SKGK-YIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSS------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLL
        +LISLAKP M KSTVVD KTG S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP+E+GEGLQ+LHY+VGQKY+ HYDYF+DE+N K GGQR+AT+L
Subjt:  YLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLL

Query:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIKVERFK
        MYLSDV++GGETVFPAA+GN S+VPWWNELS+CGK GLSV PK  DALLFW+M+PDA+LDP+SLHG CPV++GNKWS TKW H   V  FK
Subjt:  MYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIKVERFK

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.6e-9960.93Show/hide
Query:  KLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR---------FSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLIS
        +L+ K   +F      + +++     +L+ L  FS P T+              +++      D     GD+W+E ISWEPRAFVYHNFL+ EEC +LIS
Subjt:  KLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR---------FSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLIS

Query:  LAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLS
        LAKP M KS VVD KTG+S+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHY+VGQ+Y+ H+DYF DE+N++KGGQR+AT+LMYLS
Subjt:  LAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLS

Query:  DVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH
        DV+EGGETVFPAAKGN S VPWW+ELS+CGK GLSV PK  DALLFWSMKPDA+LDP+SLHG CPVI+GNKWS TKW H
Subjt:  DVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.8e-10769.12Show/hide
Query:  SKMIMALVLALGFFMLIALRFFSPPETSHHRFSS--------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD
        S ++ A+++   F +LI L F      S++  SS        VR T   S     + ++WVE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD
Subjt:  SKMIMALVLALGFFMLIALRFFSPPETSHHRFSS--------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD

Query:  SKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAA
         KTG+S DSRVRTSSG FL RG+DK IR IEKRI+DFTFIP+EHGEGLQ+LHY++GQKY+ HYDYF+DEYN + GGQR+AT+LMYLSDVEEGGETVFPAA
Subjt:  SKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAA

Query:  KGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWM--HYIKV
        KGN+S+VPWWNELSECGKGGLSVKPKMGDALLFWSM PDATLDP+SLHG C VI+GNKWS TKW+  H  KV
Subjt:  KGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWM--HYIKV

AT5G66060.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.1e-7966.36Show/hide
Query:  SKMIMALVLALGFFMLIALRFFSPPETSHHRFSS--------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD
        S ++ A+++   F +LI L F      S++  SS        VR T   S     + ++WVE ISWEPRA VYHNFL  EEC YLI LAKPHMEKSTVVD
Subjt:  SKMIMALVLALGFFMLIALRFFSPPETSHHRFSS--------VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD

Query:  SKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAA
         KTG+S DSRVRTSSG FL RG+DK IR IEKRI+DFTFIP+EHGEGLQ+LHY++GQKY+ HYDYF+DEYN + GGQR+AT+LMYLSDVEEGGETVFPAA
Subjt:  SKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAA

Query:  KGNFSSVPWWNELSECGKGG
        KGN+S+VPWWNELSECGKGG
Subjt:  KGNFSSVPWWNELSECGKGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGTTGGATGTCGAAGAAGGGGGCGAGACGTGTTCCCGGCTGCGAACAAATGCGTGCCATGGTGGAAGAAATTGCCTACACATGGTAAAGATGGACTCTCTATAAA
ACCAAAGATGGGAGATGCATTATTTTTCTGGAGCATGAAACCTGATGATGGTACATTGGATTATACAAGTTTGCATGGTTCCTACCCCGTTATAAGAGGGGACAGATGGG
CATGTGTAAAGCTCATGGGACAAATGGGCATCAGTGGTGGGATGAGGTTGAGGAGCCACAAGAAGGGAAAGGATTCATGGGTTGAGTGCATTTCTTGGGAGCCTAGAGCT
TTCATTTACCATAATTTCTTGTCGGAGAAAGAATGCTCGCAGTTGATTAATCTTGCAAAGCCTCGCATGGAGAGATCACTTGTTAGTGGCCAAAATACTAATTGGGAAGG
TGTAGTGAGCAGCCGCCGCACTAGTTCGGGTAGGTTTCTTGCTAAAGGGCAGAACCAACTCGTCCGTAGAATAGAGAAAAGAATAGCAGAATTTACATTCATTCCCGTAG
AAAATGGAGAAGGATTGAGTATTCTACATTATGAAGTTGGGCAGAAGTTTGAACCTCACTATGATTACACTCATCCTGATTCATTCAGCTTTAAAAGTTTGGGCCAAAGA
AATGCCACCCTCGTCATGTATCTGTCGGATGTCAAAGAAGGGGGTGCGACGGTGTTCCCGGAGGCGAAAAAATGCGCCAGCTCTGCACGACGATGGTGGAAGAAACTGCC
TGAATATGGTAAAGATAATGGACTCTCCGTAAAACCAAAGATGGGAGATGCTTTATTGTTTTGGAGCGTGAAGCCTGATGGTACATTGGATCCTACAAGTTTGCATGTAT
CTCTAGGCAAATACATCAAGTTACAGGGTAAGAAATGGTCCACATTCCAGCTTTCGAAGATGATCATGGCCCTCGTTTTGGCACTTGGGTTTTTCATGCTTATTGCTCTT
CGCTTCTTCTCTCCTCTTGAAACTTCTCACCACCGTTTCTCTTCAGTCCGGCATACAGCATTTCTAAGTGATGGGTTGGGGAAGAAAGGGGATCAGTGGGTTGAGTTCAT
TTCTTGGGAGCCTAGAGCTTTTGTTTATCATAACTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAAAACCCCACATGGAGAAATCAACTGTGGTTGATA
ACGAAACTGGCAAGAATGTGGATAGCAGTGTGCGAACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGAAACATAGAGAAAAGAATAGCAGATTTT
ACATTCATTCCTATAGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAGTATGATGCGCATTATGATTTCTTTGACGATGAGTACAACCTCAAAGA
AATAGGCCAAAGAATGGCCACCCTCCTCATGTATTTGTCGGATGTTGAAGAGGGGGGCGAGACGGTGTTCCCGGCTGCGAAAGGAAACTTTAGCTCTGTGCCATGGTGGA
ATGAACTGTCTGAATGTGGTAGAGGCGGACTCTCTGTAAAACCAAAAATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCCGATACTACCTTAGACCCTACAACTTTA
CATGTATCTAAAGGGAAATACATCAAGTTACAGGGTAAGAAATGGTCCACATTCCAGCTTTCGAAGATGATCATGGCCCTCGTTTTGGCACTTGGGTTTTTCATGCTTAT
TGCTCTTCGCTTCTTCTCTCCTCCTGAAACTTCTCACCACCGTTTCTCTTCCGTCCGGCATACAGCATTTCTAAGTGATGGGTTGGGGAAGAGAGGGGATCAGTGGGTTG
AGTTCATTTCTTGGGAGCCTAGAGCTTTTGTTTATCATAATTTCTTGTCCAAGGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCCCACATGGAGAAATCAACTGTG
GTTGATAGCAAAACTGGCGAGAGTGTGGATAGCAGGGTGCGAACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGAAACATAGAGAAAAGAATAGC
AGATTTTACATTCATTCCTATAGAGCATGGAGAAGGACTTCAAATTCTCCATTACAAAGTTGGGCAGAAGTATGATGCTCATTATGATTACTTTGTTGATGAGTACAACA
TCAAAAAAGGAGGCCAAAGAATGGCCACCCTTCTCATGTATTTGTCCGATGTTGAAGAGGGGGGCGAGACGGTGTTCCCGGCTGCAAAAGGAAACTTCAGCTCTGTGCCA
TGGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTCTCTGTAAAACCAAAAATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCCGATGCTACCTTAGACCCTAC
AAGTTTACATGGTGCCTGTCCTGTCATTAGAGGGAACAAATGGTCATGTACAAAGTGGATGCATTACATAAAGGTAGAACGTTTCAAGTCGCTAAAGATGAAGCTACAGC
TGAAACCATCAATTGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGGTTGGATGTCGAAGAAGGGGGCGAGACGTGTTCCCGGCTGCGAACAAATGCGTGCCATGGTGGAAGAAATTGCCTACACATGGTAAAGATGGACTCTCTATAAA
ACCAAAGATGGGAGATGCATTATTTTTCTGGAGCATGAAACCTGATGATGGTACATTGGATTATACAAGTTTGCATGGTTCCTACCCCGTTATAAGAGGGGACAGATGGG
CATGTGTAAAGCTCATGGGACAAATGGGCATCAGTGGTGGGATGAGGTTGAGGAGCCACAAGAAGGGAAAGGATTCATGGGTTGAGTGCATTTCTTGGGAGCCTAGAGCT
TTCATTTACCATAATTTCTTGTCGGAGAAAGAATGCTCGCAGTTGATTAATCTTGCAAAGCCTCGCATGGAGAGATCACTTGTTAGTGGCCAAAATACTAATTGGGAAGG
TGTAGTGAGCAGCCGCCGCACTAGTTCGGGTAGGTTTCTTGCTAAAGGGCAGAACCAACTCGTCCGTAGAATAGAGAAAAGAATAGCAGAATTTACATTCATTCCCGTAG
AAAATGGAGAAGGATTGAGTATTCTACATTATGAAGTTGGGCAGAAGTTTGAACCTCACTATGATTACACTCATCCTGATTCATTCAGCTTTAAAAGTTTGGGCCAAAGA
AATGCCACCCTCGTCATGTATCTGTCGGATGTCAAAGAAGGGGGTGCGACGGTGTTCCCGGAGGCGAAAAAATGCGCCAGCTCTGCACGACGATGGTGGAAGAAACTGCC
TGAATATGGTAAAGATAATGGACTCTCCGTAAAACCAAAGATGGGAGATGCTTTATTGTTTTGGAGCGTGAAGCCTGATGGTACATTGGATCCTACAAGTTTGCATGTAT
CTCTAGGCAAATACATCAAGTTACAGGGTAAGAAATGGTCCACATTCCAGCTTTCGAAGATGATCATGGCCCTCGTTTTGGCACTTGGGTTTTTCATGCTTATTGCTCTT
CGCTTCTTCTCTCCTCTTGAAACTTCTCACCACCGTTTCTCTTCAGTCCGGCATACAGCATTTCTAAGTGATGGGTTGGGGAAGAAAGGGGATCAGTGGGTTGAGTTCAT
TTCTTGGGAGCCTAGAGCTTTTGTTTATCATAACTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAAAACCCCACATGGAGAAATCAACTGTGGTTGATA
ACGAAACTGGCAAGAATGTGGATAGCAGTGTGCGAACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGAAACATAGAGAAAAGAATAGCAGATTTT
ACATTCATTCCTATAGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAGTATGATGCGCATTATGATTTCTTTGACGATGAGTACAACCTCAAAGA
AATAGGCCAAAGAATGGCCACCCTCCTCATGTATTTGTCGGATGTTGAAGAGGGGGGCGAGACGGTGTTCCCGGCTGCGAAAGGAAACTTTAGCTCTGTGCCATGGTGGA
ATGAACTGTCTGAATGTGGTAGAGGCGGACTCTCTGTAAAACCAAAAATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCCGATACTACCTTAGACCCTACAACTTTA
CATGTATCTAAAGGGAAATACATCAAGTTACAGGGTAAGAAATGGTCCACATTCCAGCTTTCGAAGATGATCATGGCCCTCGTTTTGGCACTTGGGTTTTTCATGCTTAT
TGCTCTTCGCTTCTTCTCTCCTCCTGAAACTTCTCACCACCGTTTCTCTTCCGTCCGGCATACAGCATTTCTAAGTGATGGGTTGGGGAAGAGAGGGGATCAGTGGGTTG
AGTTCATTTCTTGGGAGCCTAGAGCTTTTGTTTATCATAATTTCTTGTCCAAGGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCCCACATGGAGAAATCAACTGTG
GTTGATAGCAAAACTGGCGAGAGTGTGGATAGCAGGGTGCGAACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGAAACATAGAGAAAAGAATAGC
AGATTTTACATTCATTCCTATAGAGCATGGAGAAGGACTTCAAATTCTCCATTACAAAGTTGGGCAGAAGTATGATGCTCATTATGATTACTTTGTTGATGAGTACAACA
TCAAAAAAGGAGGCCAAAGAATGGCCACCCTTCTCATGTATTTGTCCGATGTTGAAGAGGGGGGCGAGACGGTGTTCCCGGCTGCAAAAGGAAACTTCAGCTCTGTGCCA
TGGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTCTCTGTAAAACCAAAAATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCCGATGCTACCTTAGACCCTAC
AAGTTTACATGGTGCCTGTCCTGTCATTAGAGGGAACAAATGGTCATGTACAAAGTGGATGCATTACATAAAGGTAGAACGTTTCAAGTCGCTAAAGATGAAGCTACAGC
TGAAACCATCAATTGAGTAA
Protein sequenceShow/hide protein sequence
MQVGCRRRGRDVFPAANKCVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDDGTLDYTSLHGSYPVIRGDRWACVKLMGQMGISGGMRLRSHKKGKDSWVECISWEPRA
FIYHNFLSEKECSQLINLAKPRMERSLVSGQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHYDYTHPDSFSFKSLGQR
NATLVMYLSDVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHVSLGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIAL
RFFSPLETSHHRFSSVRHTAFLSDGLGKKGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVDSSVRTSSGMFLNRGQDKIIRNIEKRIADF
TFIPIEHGEGLQILHYEVGQKYDAHYDFFDDEYNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGRGGLSVKPKMGDALLFWSMKPDTTLDPTTL
HVSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTV
VDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYKVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVP
WWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHYIKVERFKSLKMKLQLKPSIE