; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026032 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026032
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationtig00153031:1075597..1084881
RNA-Seq ExpressionSgr026032
SyntenySgr026032
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0032259 - methylation (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0032451 - demethylase activity (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR036443 - Zinc finger, RanBP2-type superfamily
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8531968.1 hypothetical protein F0562_006890 [Nyssa sinensis]1.8e-28457.44Show/hide
Query:  EGKEGDWECSGCKNRNYAFRSFCNRCKQPRLLVDNKTPPDSKWLPRIGDWICTGCTNNNYASREKCKKCGQPKEVAAMPAIAIPGASFPTYSHYFARTQG
        EG+EGDW+CSGC NRNYAFRSFCNRCKQPRLLVD KTP DSKWLPRIGDWICTGCTNNNYASREKCKKCGQPKE+AAMPAIA+PGAS PT+ HYFAR QG
Subjt:  EGKEGDWECSGCKNRNYAFRSFCNRCKQPRLLVDNKTPPDSKWLPRIGDWICTGCTNNNYASREKCKKCGQPKEVAAMPAIAIPGASFPTYSHYFARTQG

Query:  GLDTKMNLGLIGNGNSQQLHPLSSNWSLGG--------ADKYGIHAAPTFPLGGN-SSAISYMNSTNQILSVPKGWRNGDWICNCGFHNYSSRAQCKKCN
         L+ KMN+GL+GNG  QQ  PLSSNWSLGG        ADKYG   A T+PLGGN +S + Y N TNQ++  PKGWR+GDWICNCGFHNYSSRAQCKKCN
Subjt:  GLDTKMNLGLIGNGNSQQLHPLSSNWSLGG--------ADKYGIHAAPTFPLGGN-SSAISYMNSTNQILSVPKGWRNGDWICNCGFHNYSSRAQCKKCN

Query:  ASP-QALGMKRLASEELVHHWDNKRLNIGQANEQQQSYSGFEQMIGSSSDPNAGLYNSYPHESSGVAPNLEMPMQFPQQATAPTLLGKGSIA-----FSA
        AS   A+G KRLASEE VH WDNKRLN G  +  QQ+Y GFEQM GS S+   G+Y  YP  SS  APN ++ +Q P   T PTLLGKG+       +  
Subjt:  ASP-QALGMKRLASEELVHHWDNKRLNIGQANEQQQSYSGFEQMIGSSSDPNAGLYNSYPHESSGVAPNLEMPMQFPQQATAPTLLGKGSIA-----FSA

Query:  TLISSHFFGASEAVSSCAFLDRPF--PIAPSSLS---CL-DRLLNL--LRNRTSTVSGGVIAGGTCGGPLSPPVPVLPPSVPVALIDTFPIILSFSPHRL
        T  ++H + +    + C         P AP+ +S   C+ D ++     R+  S +S  +   G      SP +  + P+   A+      +L F    +
Subjt:  TLISSHFFGASEAVSSCAFLDRPF--PIAPSSLS---CL-DRLLNL--LRNRTSTVSGGVIAGGTCGGPLSPPVPVLPPSVPVALIDTFPIILSFSPHRL

Query:  SPPALLCPISCSLNTFLHNHRLRRSQLIGHCLPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPDM--------Q
            +LC  S                       M+    +  DPFL +Y  ++L+IASEFLTTWLPFLSRDLC  CT++LSDRIR+LDP++        +
Subjt:  SPPALLCPISCSLNTFLHNHRLRRSQLIGHCLPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPDM--------Q

Query:  ESN--------------GNHDDGCDANSLGSWK------DEAETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEE
        E N                + D C+ NSLGS K      D A+TNSLGSWKDG N  +E   + K S+         T++   RMSWADM  EDELE EE
Subjt:  ESN--------------GNHDDGCDANSLGSWK------DEAETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEE

Query:  DECESEKRLVNVNASTGKLTIS-KVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRE
        ++ E+ ++  + +AS+G+ T++ K   KP+L REQREYIRF NV RKKDFICLER KGK+VNIL+GLELHTG+FSAAEQKRIVD VY LQEMGKKGEL+E
Subjt:  DECESEKRLVNVNASTGKLTIS-KVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRE

Query:  RTFSAPKKWMKGKGRVTLQFGCCY--NYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCT
        RTF+AP+K M+GKG VT+QFGCCY  NYA DKN NP GILQNEIVDPIPPLFKVII RLVRWHV+PP+CVPDSCIVNIY+EGDCIPPHIDNHDF+RPFCT
Subjt:  RTFSAPKKWMKGKGRVTLQFGCCY--NYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCT

Query:  VSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSP
        VSFLSEC+IVFG+NL IVG G+ SG  AIPLPVGSVLVLN NGADVAKHC+PAVPTKRISITFR+MD+S+ P  + PEPDLQG+QPL YE   +      
Subjt:  VSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSP

Query:  VSEREIRRQPFRRGSHM-RTRGSGNRSETRFEPRNSGRAQHRPADRRS-RGNLD
         S R + RQ  RR  +    R S  R  T  EP  S + Q  P +RR  R NLD
Subjt:  VSEREIRRQPFRRGSHM-RTRGSGNRSETRFEPRNSGRAQHRPADRRS-RGNLD

XP_022133128.1 uncharacterized protein LOC111005802 [Momordica charantia]1.9e-25288.36Show/hide
Query:  MSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP----------------DMQESNGNHDDGCDANSLGSWKDEAETN
        M+HQHL+L+D FL NYKPSE RIASEFLTTWLPFLSRDLC DCTQLLSDRIRALDP                DM  SNGN DD CD NSLGSWKDEAETN
Subjt:  MSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP----------------DMQESNGNHDDGCDANSLGSWKDEAETN

Query:  SLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGR
        SLGSWKD INEG EAEG+P+TSSSE  SKL STKTS PRMSWADM QEDELE EEDECESEKR+VNVNASTGKLTISK+  KPKLSREQREYIRFMNVGR
Subjt:  SLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGR

Query:  KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPI
        KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYA DKNGNPPG+L+NEIVDPI
Subjt:  KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPI

Query:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK
        P LFKVIIRR+VRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFG+NL+IVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK
Subjt:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK

Query:  HCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPAD-RR
        HC+PAVPTKRISITFRR+DDSKRP EYA EPDLQGIQPLPYE SENDVPTSPV SEREIRRQPF RGSHMRTRGSGNRS+TRFEPRNSGRAQHRPAD RR
Subjt:  HCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPAD-RR

Query:  SRGNLDS
        SR NLDS
Subjt:  SRGNLDS

XP_022926828.1 uncharacterized protein LOC111433825 [Cucurbita moschata]3.6e-24886.05Show/hide
Query:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP-----------------DMQESNGNHDDGCDANSLGSWKDEA
        + +SHQHLKL+DPFL NYKPSELRIASEFLTTWLPFLSRDLC+DCTQLLSDRIRALDP                 DM ESNGN  D CDANSLGSWKDEA
Subjt:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP-----------------DMQESNGNHDDGCDANSLGSWKDEA

Query:  ETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMN
        ETNSLGSW+DGINEG EA+G+P+TSSSE  SKL STKTSGPR+SWADM QEDELEEEEDECE+EKRLVN NA  GKLTISKVIEKPKLSREQREYIRFM+
Subjt:  ETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMN

Query:  VGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIV
        VGRKKDFICLERFKGK VNILEGLELHTGIFSAAEQKRIVDHVYALQEMG KGELRERTFSAPKKWMKGKGRVTLQFGCCYNYA DKNGNPPGILQ+EIV
Subjt:  VGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIV

Query:  DPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGAD
        D IP LFKVIIRRLVRWHV+PPTCVPDSCIVNIY+EGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNL+IVGPG+FSGPIAIPLPVGSVLVLNGN AD
Subjt:  DPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGAD

Query:  VAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRR-GSHMRTRGSGNRSETRFEPRNSGRAQHRPA
        VA+HC+PAVPTKRISITFRR+D+ KRPIEYAPEPDLQGIQPLPYEAS+NDVPTSPV SEREIRRQPFRR G HMRTRGSGNRS T FEPRN GRA+H  +
Subjt:  VAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRR-GSHMRTRGSGNRSETRFEPRNSGRAQHRPA

Query:  DRRSRGNLD
        DRRSR NL+
Subjt:  DRRSRGNLD

XP_023518278.1 uncharacterized protein LOC111781804 [Cucurbita pepo subsp. pepo]9.5e-24986.05Show/hide
Query:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP-----------------DMQESNGNHDDGCDANSLGSWKDEA
        + +SHQHLKL+DPFL NYKPSELRIASEFLTTWLPFLSRDLC+DCTQLLSDRIRALDP                  M ESNGN DD CDANSLGSWKDEA
Subjt:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP-----------------DMQESNGNHDDGCDANSLGSWKDEA

Query:  ETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMN
        ETNSLGSW+DGINEG EA+G+P+TSSSE  SKL STKTSGPR+SWADM QEDELEEEEDECE+EKRLVN NA  GKLTISKVIEKPKLSREQREYIRFM+
Subjt:  ETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMN

Query:  VGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIV
        VGRKKDFICLERFKGK VNILEGLELHTGIFSAAEQKRIVDHVYALQEMG KGELRERTFSAPKKWMKGKGRVTLQFGCCYNYA DKNGNPPGILQ+EIV
Subjt:  VGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIV

Query:  DPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGAD
        D IP LFKVIIRRLVRWHV+PPTCVPDSCIVNIY+EGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNL+IVGPG+FSGPIAIPLPVGSVLVLNGN AD
Subjt:  DPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGAD

Query:  VAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRR-GSHMRTRGSGNRSETRFEPRNSGRAQHRPA
        VA+HC+PAVPTKRISITFRR+D+SKRPIEYAPEPDLQGIQPLPYEAS+NDVPTSPV SEREIRRQPFRR G HMRTRGSGNRS+T+FEPRN GRA++  +
Subjt:  VAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRR-GSHMRTRGSGNRSETRFEPRNSGRAQHRPA

Query:  DRRSRGNLD
        DRRSR NL+
Subjt:  DRRSRGNLD

XP_038882153.1 RNA demethylase ALKBH9B [Benincasa hispida]9.2e-25287.65Show/hide
Query:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP---------------DMQESNGNHDDGCDANSLGSWKDEAET
        + +SHQHLKL+DPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCT+LLSDRIRALDP               DM ESNGN DD CDANSLGSWKDE ET
Subjt:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP---------------DMQESNGNHDDGCDANSLGSWKDEAET

Query:  NSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVG
        NSLGSWKDGINEG EA+ VP+TSSSE  SKL STKTSGPRMSWADM QEDELEEE+DE ESEKRLV++N ST KLTISKVIEKP LSREQRE+IRFMNVG
Subjt:  NSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVG

Query:  RKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDP
        RKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYA DKNGNPPGIL++EIVDP
Subjt:  RKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDP

Query:  IPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVA
        IP LFKVIIRRLVRWHVLPPTCVPDSCIVNIY+EGDCIPPHIDNHDFVRPFCTVSFLSECNIVFG+NL+IVGPGEFSGPIAIPLPVGSVLVLNGNGADVA
Subjt:  IPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVA

Query:  KHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPADRR
        KHC+PAVPTKRISITFRR+D+SKRPIEYA EPDLQGIQPLPYEASENDVPTSPV SEREIRRQPFRRG HMR RGSGNRS+TRF+PRN GR +H  ADRR
Subjt:  KHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPADRR

Query:  SR
        +R
Subjt:  SR

TrEMBL top hitse value%identityAlignment
A0A5J5AP34 Uncharacterized protein8.9e-28557.44Show/hide
Query:  EGKEGDWECSGCKNRNYAFRSFCNRCKQPRLLVDNKTPPDSKWLPRIGDWICTGCTNNNYASREKCKKCGQPKEVAAMPAIAIPGASFPTYSHYFARTQG
        EG+EGDW+CSGC NRNYAFRSFCNRCKQPRLLVD KTP DSKWLPRIGDWICTGCTNNNYASREKCKKCGQPKE+AAMPAIA+PGAS PT+ HYFAR QG
Subjt:  EGKEGDWECSGCKNRNYAFRSFCNRCKQPRLLVDNKTPPDSKWLPRIGDWICTGCTNNNYASREKCKKCGQPKEVAAMPAIAIPGASFPTYSHYFARTQG

Query:  GLDTKMNLGLIGNGNSQQLHPLSSNWSLGG--------ADKYGIHAAPTFPLGGN-SSAISYMNSTNQILSVPKGWRNGDWICNCGFHNYSSRAQCKKCN
         L+ KMN+GL+GNG  QQ  PLSSNWSLGG        ADKYG   A T+PLGGN +S + Y N TNQ++  PKGWR+GDWICNCGFHNYSSRAQCKKCN
Subjt:  GLDTKMNLGLIGNGNSQQLHPLSSNWSLGG--------ADKYGIHAAPTFPLGGN-SSAISYMNSTNQILSVPKGWRNGDWICNCGFHNYSSRAQCKKCN

Query:  ASP-QALGMKRLASEELVHHWDNKRLNIGQANEQQQSYSGFEQMIGSSSDPNAGLYNSYPHESSGVAPNLEMPMQFPQQATAPTLLGKGSIA-----FSA
        AS   A+G KRLASEE VH WDNKRLN G  +  QQ+Y GFEQM GS S+   G+Y  YP  SS  APN ++ +Q P   T PTLLGKG+       +  
Subjt:  ASP-QALGMKRLASEELVHHWDNKRLNIGQANEQQQSYSGFEQMIGSSSDPNAGLYNSYPHESSGVAPNLEMPMQFPQQATAPTLLGKGSIA-----FSA

Query:  TLISSHFFGASEAVSSCAFLDRPF--PIAPSSLS---CL-DRLLNL--LRNRTSTVSGGVIAGGTCGGPLSPPVPVLPPSVPVALIDTFPIILSFSPHRL
        T  ++H + +    + C         P AP+ +S   C+ D ++     R+  S +S  +   G      SP +  + P+   A+      +L F    +
Subjt:  TLISSHFFGASEAVSSCAFLDRPF--PIAPSSLS---CL-DRLLNL--LRNRTSTVSGGVIAGGTCGGPLSPPVPVLPPSVPVALIDTFPIILSFSPHRL

Query:  SPPALLCPISCSLNTFLHNHRLRRSQLIGHCLPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPDM--------Q
            +LC  S                       M+    +  DPFL +Y  ++L+IASEFLTTWLPFLSRDLC  CT++LSDRIR+LDP++        +
Subjt:  SPPALLCPISCSLNTFLHNHRLRRSQLIGHCLPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPDM--------Q

Query:  ESN--------------GNHDDGCDANSLGSWK------DEAETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEE
        E N                + D C+ NSLGS K      D A+TNSLGSWKDG N  +E   + K S+         T++   RMSWADM  EDELE EE
Subjt:  ESN--------------GNHDDGCDANSLGSWK------DEAETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEE

Query:  DECESEKRLVNVNASTGKLTIS-KVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRE
        ++ E+ ++  + +AS+G+ T++ K   KP+L REQREYIRF NV RKKDFICLER KGK+VNIL+GLELHTG+FSAAEQKRIVD VY LQEMGKKGEL+E
Subjt:  DECESEKRLVNVNASTGKLTIS-KVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRE

Query:  RTFSAPKKWMKGKGRVTLQFGCCY--NYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCT
        RTF+AP+K M+GKG VT+QFGCCY  NYA DKN NP GILQNEIVDPIPPLFKVII RLVRWHV+PP+CVPDSCIVNIY+EGDCIPPHIDNHDF+RPFCT
Subjt:  RTFSAPKKWMKGKGRVTLQFGCCY--NYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCT

Query:  VSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSP
        VSFLSEC+IVFG+NL IVG G+ SG  AIPLPVGSVLVLN NGADVAKHC+PAVPTKRISITFR+MD+S+ P  + PEPDLQG+QPL YE   +      
Subjt:  VSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSP

Query:  VSEREIRRQPFRRGSHM-RTRGSGNRSETRFEPRNSGRAQHRPADRRS-RGNLD
         S R + RQ  RR  +    R S  R  T  EP  S + Q  P +RR  R NLD
Subjt:  VSEREIRRQPFRRGSHM-RTRGSGNRSETRFEPRNSGRAQHRPADRRS-RGNLD

A0A6J1BY69 uncharacterized protein LOC1110058029.0e-25388.36Show/hide
Query:  MSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP----------------DMQESNGNHDDGCDANSLGSWKDEAETN
        M+HQHL+L+D FL NYKPSE RIASEFLTTWLPFLSRDLC DCTQLLSDRIRALDP                DM  SNGN DD CD NSLGSWKDEAETN
Subjt:  MSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP----------------DMQESNGNHDDGCDANSLGSWKDEAETN

Query:  SLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGR
        SLGSWKD INEG EAEG+P+TSSSE  SKL STKTS PRMSWADM QEDELE EEDECESEKR+VNVNASTGKLTISK+  KPKLSREQREYIRFMNVGR
Subjt:  SLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGR

Query:  KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPI
        KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYA DKNGNPPG+L+NEIVDPI
Subjt:  KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPI

Query:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK
        P LFKVIIRR+VRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFG+NL+IVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK
Subjt:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK

Query:  HCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPAD-RR
        HC+PAVPTKRISITFRR+DDSKRP EYA EPDLQGIQPLPYE SENDVPTSPV SEREIRRQPF RGSHMRTRGSGNRS+TRFEPRNSGRAQHRPAD RR
Subjt:  HCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPAD-RR

Query:  SRGNLDS
        SR NLDS
Subjt:  SRGNLDS

A0A6J1EJB3 uncharacterized protein LOC1114338251.8e-24886.05Show/hide
Query:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP-----------------DMQESNGNHDDGCDANSLGSWKDEA
        + +SHQHLKL+DPFL NYKPSELRIASEFLTTWLPFLSRDLC+DCTQLLSDRIRALDP                 DM ESNGN  D CDANSLGSWKDEA
Subjt:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP-----------------DMQESNGNHDDGCDANSLGSWKDEA

Query:  ETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMN
        ETNSLGSW+DGINEG EA+G+P+TSSSE  SKL STKTSGPR+SWADM QEDELEEEEDECE+EKRLVN NA  GKLTISKVIEKPKLSREQREYIRFM+
Subjt:  ETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMN

Query:  VGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIV
        VGRKKDFICLERFKGK VNILEGLELHTGIFSAAEQKRIVDHVYALQEMG KGELRERTFSAPKKWMKGKGRVTLQFGCCYNYA DKNGNPPGILQ+EIV
Subjt:  VGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIV

Query:  DPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGAD
        D IP LFKVIIRRLVRWHV+PPTCVPDSCIVNIY+EGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNL+IVGPG+FSGPIAIPLPVGSVLVLNGN AD
Subjt:  DPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGAD

Query:  VAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRR-GSHMRTRGSGNRSETRFEPRNSGRAQHRPA
        VA+HC+PAVPTKRISITFRR+D+ KRPIEYAPEPDLQGIQPLPYEAS+NDVPTSPV SEREIRRQPFRR G HMRTRGSGNRS T FEPRN GRA+H  +
Subjt:  VAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRR-GSHMRTRGSGNRSETRFEPRNSGRAQHRPA

Query:  DRRSRGNLD
        DRRSR NL+
Subjt:  DRRSRGNLD

A0A6J1GGD7 uncharacterized protein LOC111453671 isoform X14.3e-24786.11Show/hide
Query:  QLIGHCLPM--SHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPD---------------MQESNGNHDDGCDANSLG
        +LI   LPM  SHQHLKL+DPFLHNYKPSELRIASEFLTTWLPFLSRDLCR+CT+LLSDRIRALDP                M ESNGN +D CDANSLG
Subjt:  QLIGHCLPM--SHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPD---------------MQESNGNHDDGCDANSLG

Query:  SWKDEAETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDEL-EEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQR
        SWKDEAETNSL SWKDGINEG EA+GVP+TSSS + SK+ STKTSGPRMSWADM QEDEL EEEEDE ESEKR+V+VN ST KLTISKVIEK KLSREQR
Subjt:  SWKDEAETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDEL-EEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQR

Query:  EYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPG
        E+IRF+NVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVY+LQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYA DKNGNPPG
Subjt:  EYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPG

Query:  ILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLV
        I+++EIVDPIP LFKVIIRRLVRWHVLPPTCVP+SCIVNIY EGDCIPPHID+HDFVRPFCTVSFLSECNIVFGTNL+IVGPGEFSGPIAIPLPVGSVLV
Subjt:  ILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLV

Query:  LNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGR
        LNGNGADVAKHC+PAVPTKRISITFRRMD SK PIEYAPEPDLQGIQPLPYEASEN+VPTSPV SEREIRRQPFRRGSHMRTRGSGNR++TRF+ RNSGR
Subjt:  LNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGR

Query:  AQHRPADRRSR
         +H  ADRRSR
Subjt:  AQHRPADRRSR

A0A6J1KUK5 uncharacterized protein LOC1114973369.6e-24785.46Show/hide
Query:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP-----------------DMQESNGNHDDGCDANSLGSWKDEA
        + +SHQHLKL+DPFL NYKPSELRIASEFLTTWLPFLSRDLC+DCTQLLSDRIRALDP                 DM ESNGN  D CDANSLGSWKDEA
Subjt:  LPMSHQHLKLKDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP-----------------DMQESNGNHDDGCDANSLGSWKDEA

Query:  ETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMN
        ETNSLGSW+DGINEG EA+G+P+TSSSE  SKL STKTSGPR+SWADM QEDELEEEEDECE+EKR+VN NA  GKLTISKVIEKPKLSREQREYIRFM+
Subjt:  ETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMN

Query:  VGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIV
        VGRKKDFICLERFKGK VNILEGLELHTGIFSAAEQKRIVDHVYALQEMG KGEL+ERTFSAPKKWMKGKGRVTLQFGCCYNYA DKNGNPPGILQ+EIV
Subjt:  VGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIV

Query:  DPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGAD
        D IP LFKVIIRRLVRWHV+PPTCVPDSCIVNIY+E DCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNL+IVGPG+FSGPIAIPLPVGSVLVLNGN AD
Subjt:  DPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGAD

Query:  VAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRR-GSHMRTRGSGNRSETRFEPRNSGRAQHRPA
        VA+HC+PAVPTKRISITFRR+D+SKRPIEYAPEPDLQGIQPLPYEAS+NDVPTS V SEREIRRQPFRR G HMRTRGSGNRS+T+FEPRN GRA H  +
Subjt:  VAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV-SEREIRRQPFRR-GSHMRTRGSGNRSETRFEPRNSGRAQHRPA

Query:  DRRSRGNLD
        DRRSR NL+
Subjt:  DRRSRGNLD

SwissProt top hitse value%identityAlignment
O13801 Uncharacterized RNA-binding protein C17H9.04c5.8e-0726.32Show/hide
Query:  RIGDWIC--TGCTNNNYASREKCKKCGQPKEVAAMPAIAIPGASFPTYSH-YFARTQGGLDTKMNLGLIGNGNSQQLHPLSSN--WSLGGADKYGIHAAP
        R GDW C   GC  +N+A    C +CG  +  AA+ A    G    +YSH  ++     + T      +   +   ++ +++N   + GG +        
Subjt:  RIGDWIC--TGCTNNNYASREKCKKCGQPKEVAAMPAIAIPGASFPTYSH-YFARTQGGLDTKMNLGLIGNGNSQQLHPLSSN--WSLGGADKYGIHAAP

Query:  TFPLGGNSSAISYMNSTNQILSVPKGWRNGDWICNCGFHNYSSRAQCKKCNA
            GGN S         ++         GDW+C CGF N+  R+ C +CNA
Subjt:  TFPLGGNSSAISYMNSTNQILSVPKGWRNGDWICNCGFHNYSSRAQCKKCNA

O95218 Zinc finger Ran-binding domain-containing protein 22.5e-0532.53Show/hide
Query:  EGDWEC--SGCKNRNYAFRSFCNRCKQPRLL-----------VDNKTPPDSKWLPRIGDWICTGCTNNNYASREKCKKCGQPK
        +GDW C    C N N+A R+ CNRC + +             +       S+ L    DW C  C+N N+A R +C  C  PK
Subjt:  EGDWEC--SGCKNRNYAFRSFCNRCKQPRLL-----------VDNKTPPDSKWLPRIGDWICTGCTNNNYASREKCKKCGQPK

Q66JG8 RNA demethylase ALKBH53.5e-1232.69Show/hide
Query:  IFSAAEQKRIVDHVYALQEMGKKGELRERTFS-AP--KKWMKGKGRVTLQFGCCYNYAHDKNGNPPG---ILQNEIVDPIPP-LFKVIIRRLVRWHVLPP
        +FS  E  RI   +  +    +KG  RE T   AP   K+  G+G         Y Y        PG   +     VD IP  + +++IRRLV   V+P 
Subjt:  IFSAAEQKRIVDHVYALQEMGKKGELRERTFS-AP--KKWMKGKGRVTLQFGCCYNYAHDKNGNPPG---ILQNEIVDPIPP-LFKVIIRRLVRWHVLPP

Query:  TCVPDSCIVNIYEEGDCIPPHIDN-HDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPI-AIPLPVGSVLVLNGNGADVAKHCI-PAVPTKRISITFR
          V +S ++N Y+ G CI  H+D  H F RP  +VSF S+  + FG       P   S P+  +P+  GSV VL+G  AD   HCI P    +R ++   
Subjt:  TCVPDSCIVNIYEEGDCIPPHIDN-HDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPI-AIPLPVGSVLVLNGNGADVAKHCI-PAVPTKRISITFR

Query:  RMDDSKRP
        R   ++ P
Subjt:  RMDDSKRP

Q9SL49 RNA demethylase ALKBH9B6.8e-15757.93Show/hide
Query:  DPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPDMQESNGNHDDG-CDANS------LGSWK----------------DEAETNSL
        DPFL  Y+PSEL+IASEFLT WLPFLS+DLC+DC  LLS+RIR+LDP    +N +  DG C   S      +GS +                ++ + +SL
Subjt:  DPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPDMQESNGNHDDG-CDANS------LGSWK----------------DEAETNSL

Query:  GSWKDGINEGTEA--EGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGR
        GSWK     G+E      P+  SS   S+    +T+ PRM WADM QEDE +EEE+E E E+           +   K  EKPKLSR+QRE +R +NV R
Subjt:  GSWKDGINEGTEA--EGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGR

Query:  KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPI
        KKDFICLER KGK+VN+L+GLELHTG+FSA EQKRIVD VY LQE G++GEL++RTF+AP KWM+GKGR T+QFGCCYNYA D+ GNPPGILQ E VDP+
Subjt:  KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPI

Query:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK
        P LFKVIIR+L++WHVLPPTCVPDSCIVNIY+EGDCIPPHIDNHDF+RPFCT+SFLSEC+I+FG+NL + GPG+FSG  +IPLPVGSVLVLNGNGADVAK
Subjt:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK

Query:  HCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPVSEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGR---------AQ
        HC+PAVPTKRISITFR+MD+SKRP+ + PEPDLQGI+PLP + + +   TS  S         +RG   R  G+G  S   + P  S           +Q
Subjt:  HCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPVSEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGR---------AQ

Query:  HRPADRRSRGN
         R   R SR N
Subjt:  HRPADRRSRGN

Q9ZT92 RNA demethylase ALKBH10B9.3e-2933.49Show/hide
Query:  KDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEI-VDPI
        K F   E+ KG  VN+++GL+L+  +    E  +++D V  L+E G  G+L   +F    K +KG  R  +Q G    + H K         N + ++PI
Subjt:  KDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEI-VDPI

Query:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK
        PPL + +I   V W ++P    P+ C++N +EEG+   P +      +P  T+  LSE  + +G  L+    G F GP+ + L  GS+LV+ GN AD+A+
Subjt:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK

Query:  HCIPAVPTKRISITFRRM
        H +     KR+SITF R+
Subjt:  HCIPAVPTKRISITFRRM

Arabidopsis top hitse value%identityAlignment
AT2G17970.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.8e-15857.93Show/hide
Query:  DPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPDMQESNGNHDDG-CDANS------LGSWK----------------DEAETNSL
        DPFL  Y+PSEL+IASEFLT WLPFLS+DLC+DC  LLS+RIR+LDP    +N +  DG C   S      +GS +                ++ + +SL
Subjt:  DPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPDMQESNGNHDDG-CDANS------LGSWK----------------DEAETNSL

Query:  GSWKDGINEGTEA--EGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGR
        GSWK     G+E      P+  SS   S+    +T+ PRM WADM QEDE +EEE+E E E+           +   K  EKPKLSR+QRE +R +NV R
Subjt:  GSWKDGINEGTEA--EGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGR

Query:  KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPI
        KKDFICLER KGK+VN+L+GLELHTG+FSA EQKRIVD VY LQE G++GEL++RTF+AP KWM+GKGR T+QFGCCYNYA D+ GNPPGILQ E VDP+
Subjt:  KKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPI

Query:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK
        P LFKVIIR+L++WHVLPPTCVPDSCIVNIY+EGDCIPPHIDNHDF+RPFCT+SFLSEC+I+FG+NL + GPG+FSG  +IPLPVGSVLVLNGNGADVAK
Subjt:  PPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAK

Query:  HCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPVSEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGR---------AQ
        HC+PAVPTKRISITFR+MD+SKRP+ + PEPDLQGI+PLP + + +   TS  S         +RG   R  G+G  S   + P  S           +Q
Subjt:  HCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPVSEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGR---------AQ

Query:  HRPADRRSRGN
         R   R SR N
Subjt:  HRPADRRSRGN

AT2G17970.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.5e-13858.77Show/hide
Query:  QESNGNHDDGCDANSLGSWKDEAETNSLGSWKDGINEGTEA--EGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTG
        + ++ N DD  D  S     ++ + +SLGSWK     G+E      P+  SS   S+    +T+ PRM WADM QEDE +EEE+E E E+          
Subjt:  QESNGNHDDGCDANSLGSWKDEAETNSLGSWKDGINEGTEA--EGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTG

Query:  KLTISKVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTL
         +   K  EKPKLSR+QRE +R +NV RKKDFICLER KGK+VN+L+GLELHTG+FSA EQKRIVD VY LQE G++GEL++RTF+AP KWM+GKGR T+
Subjt:  KLTISKVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTL

Query:  QFGCCYNYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGP
        QFGCCYNYA D+ GNPPGILQ E VDP+P LFKVIIR+L++WHVLPPTCVPDSCIVNIY+EGDCIPPHIDNHDF+RPFCT+SFLSEC+I+FG+NL + GP
Subjt:  QFGCCYNYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGP

Query:  GEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPVSEREIRRQPFRRGSHMRTR
        G+FSG  +IPLPVGSVLVLNGNGADVAKHC+PAVPTKRISITFR+MD+SKRP+ + PEPDLQGI+PLP + + +   TS  S         +RG   R  
Subjt:  GEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPVSEREIRRQPFRRGSHMRTR

Query:  GSGNRSETRFEPRNSGR---------AQHRPADRRSRGN
        G+G  S   + P  S           +Q R   R SR N
Subjt:  GSGNRSETRFEPRNSGR---------AQHRPADRRSRGN

AT2G17970.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.5e-13858.77Show/hide
Query:  QESNGNHDDGCDANSLGSWKDEAETNSLGSWKDGINEGTEA--EGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTG
        + ++ N DD  D  S     ++ + +SLGSWK     G+E      P+  SS   S+    +T+ PRM WADM QEDE +EEE+E E E+          
Subjt:  QESNGNHDDGCDANSLGSWKDEAETNSLGSWKDGINEGTEA--EGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTG

Query:  KLTISKVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTL
         +   K  EKPKLSR+QRE +R +NV RKKDFICLER KGK+VN+L+GLELHTG+FSA EQKRIVD VY LQE G++GEL++RTF+AP KWM+GKGR T+
Subjt:  KLTISKVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTL

Query:  QFGCCYNYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGP
        QFGCCYNYA D+ GNPPGILQ E VDP+P LFKVIIR+L++WHVLPPTCVPDSCIVNIY+EGDCIPPHIDNHDF+RPFCT+SFLSEC+I+FG+NL + GP
Subjt:  QFGCCYNYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGP

Query:  GEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPVSEREIRRQPFRRGSHMRTR
        G+FSG  +IPLPVGSVLVLNGNGADVAKHC+PAVPTKRISITFR+MD+SKRP+ + PEPDLQGI+PLP + + +   TS  S         +RG   R  
Subjt:  GEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPVSEREIRRQPFRRGSHMRTR

Query:  GSGNRSETRFEPRNSGR---------AQHRPADRRSRGN
        G+G  S   + P  S           +Q R   R SR N
Subjt:  GSGNRSETRFEPRNSGR---------AQHRPADRRSRGN

AT4G36090.2 oxidoreductase, 2OG-Fe(II) oxygenase family protein2.0e-12756.42Show/hide
Query:  DRIRALDPDMQESNGNHDDGCDANSLGSWKDEAETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRL
        D + + + D +  N      C ++SL S K  A     GS  D +     +  +P + S+        ++ +  +MSWADM +ED L EEED+ E+E   
Subjt:  DRIRALDPDMQESNGNHDDGCDANSLGSWKDEAETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPRMSWADMIQEDELEEEEDECESEKRL

Query:  VNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWM
          V+ S       K  EK KLSRE+RE  RFMNV + K F C E+ +G+ VNILEGLELHTG+FSA EQK+IVD VY LQE G++GELRERTF+AP KWM
Subjt:  VNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWM

Query:  KGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFG
        +GKGRVT+QFGCCYNYA DK GNPPGILQ   VDP+P +FKVII+RLV WHVLPPTCVPDSCIVNIYEE DCIPPHIDNHDF+RPFCTVSFLSECNI+FG
Subjt:  KGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFG

Query:  TNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV--SEREIRRQP
        +NL ++GPGEFSG  +IPLPVGSVLVL GNGADVAKHC+PAVPTKRISITFR+MD+SKRP+ + PEPDL+ I+PLPYE +    P   V  S R    Q 
Subjt:  TNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV--SEREIRRQP

Query:  FRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPADRR
            ++    G G++     +  +  R +     RR
Subjt:  FRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPADRR

AT4G36090.3 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.7e-14756.48Show/hide
Query:  KDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP------DMQESNGNHDDGCDANSLGSWKDEAETNSLGSWKDG----------
        +D FL  Y+ SEL+IASEFLT WLPFLSRDLC DC  +LSDRIR+LDP      +++  +G+  D  ++  + + + E   N + +  DG          
Subjt:  KDPFLHNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDP------DMQESNGNHDDGCDANSLGSWKDEAETNSLGSWKDG----------

Query:  INEGTEAEGVPKTSSSERFSKLPST--KTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGRKKDFIC
        +  G    G    S S       ST  + +  +MSWADM +ED L EEED+ E+E     V+ S       K  EK KLSRE+RE  RFMNV + K F C
Subjt:  INEGTEAEGVPKTSSSERFSKLPST--KTSGPRMSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGRKKDFIC

Query:  LERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPIPPLFKV
         E+ +G+ VNILEGLELHTG+FSA EQK+IVD VY LQE G++GELRERTF+AP KWM+GKGRVT+QFGCCYNYA DK GNPPGILQ   VDP+P +FKV
Subjt:  LERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKKGELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPIPPLFKV

Query:  IIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAV
        II+RLV WHVLPPTCVPDSCIVNIYEE DCIPPHIDNHDF+RPFCTVSFLSECNI+FG+NL ++GPGEFSG  +IPLPVGSVLVL GNGADVAKHC+PAV
Subjt:  IIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSECNIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAV

Query:  PTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV--SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPADRR
        PTKRISITFR+MD+SKRP+ + PEPDL+ I+PLPYE +    P   V  S R    Q     ++    G G++     +  +  R +     RR
Subjt:  PTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPV--SEREIRRQPFRRGSHMRTRGSGNRSETRFEPRNSGRAQHRPADRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGAGGAGGAGATAGAAGGAAAAGCAGCGTCACAAGTAGCAATGGCCGAAGGCAAAGAAGGCGATTGGGAGTGCAGTGGGTGCAAGAACAGGAACTATGCCTTCCG
ATCGTTCTGTAACAGATGCAAACAACCTCGCCTTCTCGTCGACAATAAAACCCCTCCAGATTCAAAATGGCTTCCTCGCATCGGTGACTGGATCTGCACGGGTTGCACCA
ACAACAATTATGCATCAAGAGAGAAGTGCAAAAAGTGTGGACAACCAAAGGAGGTAGCAGCAATGCCAGCAATTGCAATCCCTGGAGCTTCTTTTCCAACTTATTCACAC
TATTTTGCCAGGACCCAAGGAGGACTGGATACAAAGATGAATCTTGGATTAATAGGAAATGGCAATTCACAACAACTGCATCCTCTGAGCTCCAACTGGTCCCTGGGAGG
TGCTGATAAGTATGGAATTCATGCAGCTCCAACATTTCCTTTGGGTGGAAATAGTTCTGCAATCTCATATATGAATTCCACTAATCAGATCCTTTCAGTTCCAAAGGGTT
GGCGAAATGGTGACTGGATATGCAACTGTGGTTTCCATAATTACTCCTCACGTGCACAGTGCAAAAAATGTAATGCTTCACCACAAGCACTTGGAATGAAACGCTTAGCA
TCAGAAGAGCTTGTTCATCACTGGGATAACAAGAGATTAAATATTGGACAGGCAAATGAGCAACAGCAATCTTACTCAGGTTTCGAGCAGATGATAGGTTCCAGTAGTGA
CCCAAATGCGGGATTATATAATTCCTATCCCCACGAAAGTTCTGGTGTGGCTCCAAATTTGGAAATGCCGATGCAGTTTCCCCAACAAGCAACTGCACCGACACTCCTAG
GAAAAGGCTCCATTGCCTTCTCCGCCACGCTCATATCCTCCCACTTCTTTGGCGCATCCGAAGCAGTCTCCAGCTGCGCCTTTCTCGACCGCCCATTCCCCATTGCTCCA
TCCTCGCTCTCCTGCTTAGATCGTCTTCTCAACCTCCTCCGAAACCGAACTTCAACCGTCTCCGGCGGCGTTATCGCCGGCGGAACCTGCGGAGGCCCACTGTCGCCACC
GGTACCAGTGTTGCCGCCGTCAGTGCCGGTGGCATTGATCGACACGTTTCCCATAATTTTGAGTTTTAGCCCCCATCGCTTGAGTCCTCCGGCTTTGCTCTGTCCAATTT
CCTGCTCGCTGAACACCTTCCTTCACAACCACCGTCTCCGCCGCTCGCAGTTGATTGGCCACTGCTTGCCGATGAGTCATCAGCACCTCAAACTGAAAGACCCTTTCCTT
CATAATTATAAGCCCTCCGAGCTACGGATCGCGTCTGAATTTCTCACCACTTGGCTTCCCTTTCTGTCAAGAGATCTCTGCCGAGACTGCACCCAACTGCTCTCGGATCG
AATCCGCGCCCTTGATCCAGACATGCAAGAGAGTAATGGGAATCATGATGATGGTTGTGATGCAAATTCTCTTGGAAGTTGGAAAGATGAAGCGGAAACGAATTCATTAG
GGAGTTGGAAGGACGGCATAAATGAAGGGACTGAAGCTGAGGGAGTACCTAAAACTTCTTCTAGTGAACGATTTTCTAAATTACCTTCCACCAAGACCTCAGGGCCTCGA
ATGTCGTGGGCTGACATGATTCAGGAAGATGAACTAGAAGAAGAGGAAGATGAGTGTGAATCAGAAAAAAGATTAGTCAATGTTAATGCGTCCACAGGGAAATTAACAAT
ATCGAAGGTTATTGAGAAGCCAAAGCTTTCTAGGGAGCAGAGAGAGTACATCAGATTCATGAATGTTGGGCGGAAGAAAGATTTTATTTGTCTGGAGAGGTTTAAAGGAA
AATTAGTAAACATTCTTGAGGGACTCGAGCTTCATACAGGTATTTTTAGTGCTGCCGAACAGAAGAGAATAGTTGATCATGTTTATGCACTTCAGGAGATGGGCAAGAAG
GGAGAGTTAAGAGAACGAACATTTTCAGCTCCCAAAAAGTGGATGAAGGGAAAGGGACGTGTAACTCTTCAGTTTGGGTGCTGTTACAATTATGCACATGATAAAAATGG
CAATCCTCCTGGCATTCTTCAAAATGAAATTGTGGATCCAATACCTCCTCTCTTTAAGGTGATAATTAGAAGGTTGGTGAGATGGCACGTACTTCCTCCAACATGTGTTC
CTGATAGTTGCATTGTGAACATCTATGAAGAAGGGGACTGTATTCCTCCCCATATTGACAACCACGATTTTGTTCGACCTTTTTGTACTGTGTCATTCCTCAGTGAATGC
AATATTGTTTTTGGAACAAACCTTGCTATTGTAGGTCCTGGTGAATTTTCTGGGCCAATTGCAATCCCGCTGCCTGTGGGGTCTGTTCTTGTGTTAAATGGAAATGGAGC
TGATGTTGCTAAACATTGTATACCTGCAGTCCCCACAAAGAGGATATCAATTACATTTAGAAGAATGGATGATTCTAAGCGGCCAATTGAGTATGCTCCAGAACCAGATT
TGCAGGGAATTCAACCATTGCCCTATGAAGCTAGTGAAAATGATGTACCAACTTCACCAGTATCAGAAAGGGAAATAAGGAGGCAGCCATTTAGGAGAGGTAGTCATATG
AGAACCAGGGGATCTGGAAACAGAAGCGAAACCCGATTTGAGCCTCGCAATTCAGGTAGGGCTCAACATAGGCCAGCAGATAGGAGGAGCAGAGGAAATCTAGACAGCTG
A
mRNA sequenceShow/hide mRNA sequence
ATGAGGGAGGAGGAGATAGAAGGAAAAGCAGCGTCACAAGTAGCAATGGCCGAAGGCAAAGAAGGCGATTGGGAGTGCAGTGGGTGCAAGAACAGGAACTATGCCTTCCG
ATCGTTCTGTAACAGATGCAAACAACCTCGCCTTCTCGTCGACAATAAAACCCCTCCAGATTCAAAATGGCTTCCTCGCATCGGTGACTGGATCTGCACGGGTTGCACCA
ACAACAATTATGCATCAAGAGAGAAGTGCAAAAAGTGTGGACAACCAAAGGAGGTAGCAGCAATGCCAGCAATTGCAATCCCTGGAGCTTCTTTTCCAACTTATTCACAC
TATTTTGCCAGGACCCAAGGAGGACTGGATACAAAGATGAATCTTGGATTAATAGGAAATGGCAATTCACAACAACTGCATCCTCTGAGCTCCAACTGGTCCCTGGGAGG
TGCTGATAAGTATGGAATTCATGCAGCTCCAACATTTCCTTTGGGTGGAAATAGTTCTGCAATCTCATATATGAATTCCACTAATCAGATCCTTTCAGTTCCAAAGGGTT
GGCGAAATGGTGACTGGATATGCAACTGTGGTTTCCATAATTACTCCTCACGTGCACAGTGCAAAAAATGTAATGCTTCACCACAAGCACTTGGAATGAAACGCTTAGCA
TCAGAAGAGCTTGTTCATCACTGGGATAACAAGAGATTAAATATTGGACAGGCAAATGAGCAACAGCAATCTTACTCAGGTTTCGAGCAGATGATAGGTTCCAGTAGTGA
CCCAAATGCGGGATTATATAATTCCTATCCCCACGAAAGTTCTGGTGTGGCTCCAAATTTGGAAATGCCGATGCAGTTTCCCCAACAAGCAACTGCACCGACACTCCTAG
GAAAAGGCTCCATTGCCTTCTCCGCCACGCTCATATCCTCCCACTTCTTTGGCGCATCCGAAGCAGTCTCCAGCTGCGCCTTTCTCGACCGCCCATTCCCCATTGCTCCA
TCCTCGCTCTCCTGCTTAGATCGTCTTCTCAACCTCCTCCGAAACCGAACTTCAACCGTCTCCGGCGGCGTTATCGCCGGCGGAACCTGCGGAGGCCCACTGTCGCCACC
GGTACCAGTGTTGCCGCCGTCAGTGCCGGTGGCATTGATCGACACGTTTCCCATAATTTTGAGTTTTAGCCCCCATCGCTTGAGTCCTCCGGCTTTGCTCTGTCCAATTT
CCTGCTCGCTGAACACCTTCCTTCACAACCACCGTCTCCGCCGCTCGCAGTTGATTGGCCACTGCTTGCCGATGAGTCATCAGCACCTCAAACTGAAAGACCCTTTCCTT
CATAATTATAAGCCCTCCGAGCTACGGATCGCGTCTGAATTTCTCACCACTTGGCTTCCCTTTCTGTCAAGAGATCTCTGCCGAGACTGCACCCAACTGCTCTCGGATCG
AATCCGCGCCCTTGATCCAGACATGCAAGAGAGTAATGGGAATCATGATGATGGTTGTGATGCAAATTCTCTTGGAAGTTGGAAAGATGAAGCGGAAACGAATTCATTAG
GGAGTTGGAAGGACGGCATAAATGAAGGGACTGAAGCTGAGGGAGTACCTAAAACTTCTTCTAGTGAACGATTTTCTAAATTACCTTCCACCAAGACCTCAGGGCCTCGA
ATGTCGTGGGCTGACATGATTCAGGAAGATGAACTAGAAGAAGAGGAAGATGAGTGTGAATCAGAAAAAAGATTAGTCAATGTTAATGCGTCCACAGGGAAATTAACAAT
ATCGAAGGTTATTGAGAAGCCAAAGCTTTCTAGGGAGCAGAGAGAGTACATCAGATTCATGAATGTTGGGCGGAAGAAAGATTTTATTTGTCTGGAGAGGTTTAAAGGAA
AATTAGTAAACATTCTTGAGGGACTCGAGCTTCATACAGGTATTTTTAGTGCTGCCGAACAGAAGAGAATAGTTGATCATGTTTATGCACTTCAGGAGATGGGCAAGAAG
GGAGAGTTAAGAGAACGAACATTTTCAGCTCCCAAAAAGTGGATGAAGGGAAAGGGACGTGTAACTCTTCAGTTTGGGTGCTGTTACAATTATGCACATGATAAAAATGG
CAATCCTCCTGGCATTCTTCAAAATGAAATTGTGGATCCAATACCTCCTCTCTTTAAGGTGATAATTAGAAGGTTGGTGAGATGGCACGTACTTCCTCCAACATGTGTTC
CTGATAGTTGCATTGTGAACATCTATGAAGAAGGGGACTGTATTCCTCCCCATATTGACAACCACGATTTTGTTCGACCTTTTTGTACTGTGTCATTCCTCAGTGAATGC
AATATTGTTTTTGGAACAAACCTTGCTATTGTAGGTCCTGGTGAATTTTCTGGGCCAATTGCAATCCCGCTGCCTGTGGGGTCTGTTCTTGTGTTAAATGGAAATGGAGC
TGATGTTGCTAAACATTGTATACCTGCAGTCCCCACAAAGAGGATATCAATTACATTTAGAAGAATGGATGATTCTAAGCGGCCAATTGAGTATGCTCCAGAACCAGATT
TGCAGGGAATTCAACCATTGCCCTATGAAGCTAGTGAAAATGATGTACCAACTTCACCAGTATCAGAAAGGGAAATAAGGAGGCAGCCATTTAGGAGAGGTAGTCATATG
AGAACCAGGGGATCTGGAAACAGAAGCGAAACCCGATTTGAGCCTCGCAATTCAGGTAGGGCTCAACATAGGCCAGCAGATAGGAGGAGCAGAGGAAATCTAGACAGCTG
A
Protein sequenceShow/hide protein sequence
MREEEIEGKAASQVAMAEGKEGDWECSGCKNRNYAFRSFCNRCKQPRLLVDNKTPPDSKWLPRIGDWICTGCTNNNYASREKCKKCGQPKEVAAMPAIAIPGASFPTYSH
YFARTQGGLDTKMNLGLIGNGNSQQLHPLSSNWSLGGADKYGIHAAPTFPLGGNSSAISYMNSTNQILSVPKGWRNGDWICNCGFHNYSSRAQCKKCNASPQALGMKRLA
SEELVHHWDNKRLNIGQANEQQQSYSGFEQMIGSSSDPNAGLYNSYPHESSGVAPNLEMPMQFPQQATAPTLLGKGSIAFSATLISSHFFGASEAVSSCAFLDRPFPIAP
SSLSCLDRLLNLLRNRTSTVSGGVIAGGTCGGPLSPPVPVLPPSVPVALIDTFPIILSFSPHRLSPPALLCPISCSLNTFLHNHRLRRSQLIGHCLPMSHQHLKLKDPFL
HNYKPSELRIASEFLTTWLPFLSRDLCRDCTQLLSDRIRALDPDMQESNGNHDDGCDANSLGSWKDEAETNSLGSWKDGINEGTEAEGVPKTSSSERFSKLPSTKTSGPR
MSWADMIQEDELEEEEDECESEKRLVNVNASTGKLTISKVIEKPKLSREQREYIRFMNVGRKKDFICLERFKGKLVNILEGLELHTGIFSAAEQKRIVDHVYALQEMGKK
GELRERTFSAPKKWMKGKGRVTLQFGCCYNYAHDKNGNPPGILQNEIVDPIPPLFKVIIRRLVRWHVLPPTCVPDSCIVNIYEEGDCIPPHIDNHDFVRPFCTVSFLSEC
NIVFGTNLAIVGPGEFSGPIAIPLPVGSVLVLNGNGADVAKHCIPAVPTKRISITFRRMDDSKRPIEYAPEPDLQGIQPLPYEASENDVPTSPVSEREIRRQPFRRGSHM
RTRGSGNRSETRFEPRNSGRAQHRPADRRSRGNLDS