CuGenDBv2

CsGy4G025800 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy4G025800
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: RNA-binding (RRM/RBD/RNP motifs) family protein
Genome location: Gy14Chr4:31062796..31067424
RNA-Seq Expression: CsGy4G025800
Synteny: CsGy4G025800
Gene Ontology terms: GO:0003723 - RNA binding (molecular function)
InterPro domains:
IPR000504 - RNA recognition motif domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR035979 - RNA-binding domain superfamily
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
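
The genome location string in this record follows a seqid:start..end pattern. As a minimal sketch, the following Python splits such a string and computes the span length, assuming the coordinates are 1-based and inclusive (an assumption, since the record does not state its coordinate convention). The function names parse_location and span_length are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location("Gy14Chr4:31062796..31067424")
print(seqid, start, end, span_length(start, end))
# prints: Gy14Chr4 31062796 31067424 4629
```

Under that assumption the gene spans 4629 bp on Gy14Chr4, which is longer than the mRNA sequence and so would include intronic sequence.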


Homology
GenBank top hits (e value, % identity, alignment)
KAA0052265.1 alkylated DNA repair protein alkB-like protein 8 isoform X1 [Cucumis melo var. makuwa] | e value: 1.40e-242 | % identity: 92.76
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        MELPRF+RPNQ  GSSSS  PILYVANCGPAVGISHP IAAVFAHFGHVKGVH ADDTGARVIVCFSEESSA+AALE LHGRPCPLLGGRTLHIRYSI R
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSISQPNDS+SVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELP FVSHVVDRIS+FPNTED+A
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEG WH+FP SIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARY WHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKVSRYNAYVLSLPLVPVELTLRASIIIDVKTAII
        PHHKIDMVKDSSIRR  RRVSFTFRKV R NAYVLS+PLVPVELTL A IIIDVKTA+I
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKVSRYNAYVLSLPLVPVELTLRASIIIDVKTAII

KAG6591851.1 Alkylated DNA repair protein ALKBH8-like protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.03e-213 | % identity: 90.21
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        M+LPRF+RP  D GSSSSS P LYVANCGPAVGISH  +AAVF  FG VKGVH AD+TG RVIVCFSEESSARAALEALHGRPC LLGGRTLHIRYSI R
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSISQ NDS+SVSLSASELDIPGL+LLHDFV AKEEEDLL EVDARPWNNLAKRRVQHYGYEFCYQTRNVNT+HQLGELPSFVSHVVDRISMFPN ED+A
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DA LDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRY EGTWHK PSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARY WHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKV
        PHHKIDMVKDS+IRRG RRVSFTFRKV
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKV

XP_004145573.1 alkylated DNA repair protein ALKBH8 homolog [Cucumis sativus] | e value: 1.76e-242 | % identity: 100
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKV
        PHHKIDMVKDSSIRRGRRRVSFTFRKV
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKV

XP_008444260.1 PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform X1 [Cucumis melo] | e value: 1.34e-225 | % identity: 93.88
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        MELPRF+RPNQ  GSSSS  PILYVANCGPAVGISHP IAAVFAHFGHVKGVH ADDTGARVIVCFSEESSA+AALE LHGRPCPLLGGRTLHIRYSI R
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSISQPNDS+SVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELP FVSHVVDRIS+FPNTED+A
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEG WH+FP SIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARY WHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKV
        PHHKIDMVKDSSIRR  RRVSFTFRKV
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKV

XP_038897079.1 alkylated DNA repair protein ALKBH8 homolog [Benincasa hispida] | e value: 1.92e-216 | % identity: 91.13
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        MELP+F RP +D GSSSSS P LYVANCGPAVGISH  IAAVF  FG VKGVH AD+TGARVIVCFSEESSARAALEALHGR CPLLGGRTLHIRYSI R
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSIS PNDS+SVSLSASELDIPGLFLLHDFVNAKEEEDLL EVDARPW+NLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPN ED+A
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHK PSS+DLKMENSVNDSNYLR+ IYLPPRSMLLLSGEARY WHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKV
        PHHKIDMVKDS+IRRG RRVSFTFRKV
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3H6 Uncharacterized protein | e value: 8.51e-243 | % identity: 100
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKV
        PHHKIDMVKDSSIRRGRRRVSFTFRKV
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKV

A0A1S3B9H0 alkylated DNA repair protein alkB homolog 8 isoform X1 | e value: 6.48e-226 | % identity: 93.88
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        MELPRF+RPNQ  GSSSS  PILYVANCGPAVGISHP IAAVFAHFGHVKGVH ADDTGARVIVCFSEESSA+AALE LHGRPCPLLGGRTLHIRYSI R
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSISQPNDS+SVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELP FVSHVVDRIS+FPNTED+A
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEG WH+FP SIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARY WHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKV
        PHHKIDMVKDSSIRR  RRVSFTFRKV
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKV

A0A5D3BS48 Alkylated DNA repair protein alkB-like protein 8 isoform X1 | e value: 6.77e-243 | % identity: 92.76
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        MELPRF+RPNQ  GSSSS  PILYVANCGPAVGISHP IAAVFAHFGHVKGVH ADDTGARVIVCFSEESSA+AALE LHGRPCPLLGGRTLHIRYSI R
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSISQPNDS+SVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELP FVSHVVDRIS+FPNTED+A
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEG WH+FP SIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARY WHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKVSRYNAYVLSLPLVPVELTLRASIIIDVKTAII
        PHHKIDMVKDSSIRR  RRVSFTFRKV R NAYVLS+PLVPVELTL A IIIDVKTA+I
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKVSRYNAYVLSLPLVPVELTLRASIIIDVKTAII

A0A6J1FEN4 alkylated DNA repair protein alkB homolog 8 | e value: 2.08e-213 | % identity: 89.91
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        M+LPRF+RP  D GSSSSS P LYVANCGPAVGISH A+AAVF  FG VKGVH AD+TG RVIVCFSEESSARAALEALHGRPC LLGGRTLHIRYSI R
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSISQ NDS+SVSLSASELDIPGL+LLHDFV AKEEE+LL EVDARPWNNLAKRRVQHYGYEFCYQTRNVNT+HQLGELPSFVSHVVDRISMFPN ED+A
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DA LDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRY EGTWHK PSSIDLKMENSVNDSNYLR+AIYLPPRSMLLLSGEARY WHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKV
        PHHKIDMVKDS+IRRG RRVSFTFRKV
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKV

A0A6J1IPP9 alkylated DNA repair protein alkB homolog 8 isoform X1 | e value: 9.87e-212 | % identity: 89.3
Query:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR
        M+LPRF+RP  + GSSSSS P LYVANCGPAVGISH  +AAVF  FG VKGVH AD+TG RVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSI R
Subjt:  MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITR

Query:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA
        PSISQ NDS+SVSLSASEL+IPGL+LLHDFV AKEEE+LL EVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLG LPSFVSHVVDRISMFPN ED+A
Subjt:  PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIA

Query:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI
        DA LDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRY EGTWHK PSSIDLKMENSVNDSNYLR+AIYLPPRSMLLLSGEARY WHHYI
Subjt:  DASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI

Query:  PHHKIDMVKDSSIRRGRRRVSFTFRKV
        PHHKIDMVKDS+IRRG RRVSFTFRKV
Subjt:  PHHKIDMVKDSSIRRGRRRVSFTFRKV

SwissProt top hits (e value, % identity, alignment)
A1A4L5 Alkylated DNA repair protein alkB homolog 8 | e value: 9.6e-25 | % identity: 33.88
Query:  PGLFLLHDFVNAKEEEDLLREV----DARPWN---NLAKRRVQHYGYEFCYQTRNVNTKHQL-GELPSFVSHVVDRISMFPNTEDIADASLDQLTVNEYP
        PGL ++ + +++++E+ LL  V    D    N   +L  RRV+H+GYEF Y+  NV+    L G LP     ++++       E       DQLT+N+Y 
Subjt:  PGLFLLHDFVNAKEEEDLLREV----DARPWN---NLAKRRVQHYGYEFCYQTRNVNTKHQL-GELPSFVSHVVDRISMFPNTEDIADASLDQLTVNEYP

Query:  PGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYIPHHKIDM------
        PG G+  HIDTHSAFE  I SLSL    +M+F ++P+G                      +   + LP RS+L+++GE+RY+W H I   K D       
Subjt:  PGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYIPHHKIDM------

Query:  ---------VKDSSIRRGRRRVSFTFRKVSRYNAYVLSLPLV
                 V+D ++ +   R SFTFRKV R      S PLV
Subjt:  ---------VKDSSIRRGRRRVSFTFRKVSRYNAYVLSLPLV

Q07G10 Alkylated DNA repair protein alkB homolog 8 | e value: 8.3e-29 | % identity: 31.19
Query:  LYVANCGPAVGISHPAIAAVFAHFGHVKG-VHPADDTGARVIVCFSEESSARAALEALHGRP-CPLLGGRTLHIRYSITRPSISQPNDSLSVSLSASELD
        L VAN G   G+S   + AV    G V+  + P +   A   V +S    A  A  +L G+  C     + + +  S     + +  + LS SL      
Subjt:  LYVANCGPAVGISHPAIAAVFAHFGHVKG-VHPADDTGARVIVCFSEESSARAALEALHGRP-CPLLGGRTLHIRYSITRPSISQPNDSLSVSLSASELD

Query:  IPGLFLLHDFVNAKEEEDLLREVD----ARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQL-GELPSFVSHVVDRISMFPNTEDIADASLDQLTVNEYPPG
         PGL ++ DFV+ ++E  +L  +D         +L  R+V+HYGYEF Y   NV+    L G LP F +  + +         +     DQLT+N+Y PG
Subjt:  IPGLFLLHDFVNAKEEEDLLREVD----ARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQL-GELPSFVSHVVDRISMFPNTEDIADASLDQLTVNEYPPG

Query:  VGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYIPHHKIDMVKDS----
         G+ PH+DTHSAFE  I SLSL    +M+F ++P G+                         + LP RS+L++SGE+RY+W H I   K D+++ S    
Subjt:  VGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYIPHHKIDMVKDS----

Query:  -----------SIRRGRRRVSFTFRKV
                   ++ +   R SFTFRKV
Subjt:  -----------SIRRGRRRVSFTFRKV

Q80Y20 Alkylated DNA repair protein alkB homolog 8 | e value: 2.3e-26 | % identity: 33.76
Query:  PGLFLLHDFVNAKEEEDLLREVDARPW----------NNLAKRRVQHYGYEFCYQTRNVNTKHQL-GELPSFVSHVVDRISMFPNTEDIADASLDQLTVN
        PGL ++ + ++++EE+ LL  V+   W           +L  RRV+H+GYEF Y++  V+    L G LP   S +++++      E       DQLT+N
Subjt:  PGLFLLHDFVNAKEEEDLLREVDARPW----------NNLAKRRVQHYGYEFCYQTRNVNTKHQL-GELPSFVSHVVDRISMFPNTEDIADASLDQLTVN

Query:  EYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYIPHHKIDMVK-
        +Y PG G+  HIDTHSAFE  I SLSL    +M+F ++PEG                      +   + LP RS+L+++GE+RY+W H I   K D V+ 
Subjt:  EYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYIPHHKIDMVK-

Query:  --------------DSSIRRGRRRVSFTFRKVSR
                      D ++ +   R SFTFRKV R
Subjt:  --------------DSSIRRGRRRVSFTFRKVSR

Q8RWY1 Alkylated DNA repair protein ALKBH8 homolog | e value: 6.2e-117 | % identity: 66.57
Query:  MELPRFSRPNQDYGSSSSSTP---ILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYS
        M  PRF RP Q   SS S  P    LYVANCGPAVG++H AIAAVFA FG V GV+ ADD+G RVIV F++  SA+AALEAL GRPCP L GR+LHIRYS
Subjt:  MELPRFSRPNQDYGSSSSSTP---ILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYS

Query:  ITR-PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNT
        + + PS +Q ND + VSL  SEL+IPGLFLL DFV   EE+ LL  VDAR W  LAKRRVQHYGYEFCY TRNV+TK +LGELPSFVS +++RI +FPN 
Subjt:  ITR-PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNT

Query:  ED-IADASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYV
        ++  A  +LDQLTVNEYP GVGLSPHIDTHSAFE  IFSLSLAGPCIMEFRRY   TW    S+ D +      DS+ ++KA+YLPPRSMLLLSGEARY 
Subjt:  ED-IADASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYV

Query:  WHHYIPHHKIDMVKDSSIRRGRRRVSFTFRKVSRY
        W+HYIPHHKID VKD  IRR  RRVSFT RKV  +
Subjt:  WHHYIPHHKIDMVKDSSIRRGRRRVSFTFRKVSRY

Q95K79 Alkylated DNA repair protein alkB homolog 8 | e value: 1.5e-25 | % identity: 33.88
Query:  PGLFLLHDFVNAKEEEDLLREVD-------ARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQL-GELPSFVSHVVDRISMFPNTEDIADASLDQLTVNEYP
        PGL ++ + ++++EE+ LL  VD            +L  RRV+H+GYEF Y+  NV+    L G LP      +++       E       DQ+T+N+Y 
Subjt:  PGLFLLHDFVNAKEEEDLLREVD-------ARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQL-GELPSFVSHVVDRISMFPNTEDIADASLDQLTVNEYP

Query:  PGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYIPHHKIDMVK----
        PG G+  HIDTHSAFE  I SLSL    +M+F ++P+GT                         + LP RS+L+++GE+RY+W H I   K D V+    
Subjt:  PGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYIPHHKIDMVK----

Query:  -----------DSSIRRGRRRVSFTFRKVSRYNAYVLSLPLV
                   D ++ +   R SFTFRKV R      S PLV
Subjt:  -----------DSSIRRGRRRVSFTFRKVSRYNAYVLSLPLV

Arabidopsis top hits (e value, % identity, alignment)
AT1G31600.1 RNA-binding (RRM/RBD/RNP motifs) family protein | e value: 4.4e-118 | % identity: 66.57
Query:  MELPRFSRPNQDYGSSSSSTP---ILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYS
        M  PRF RP Q   SS S  P    LYVANCGPAVG++H AIAAVFA FG V GV+ ADD+G RVIV F++  SA+AALEAL GRPCP L GR+LHIRYS
Subjt:  MELPRFSRPNQDYGSSSSSTP---ILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYS

Query:  ITR-PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNT
        + + PS +Q ND + VSL  SEL+IPGLFLL DFV   EE+ LL  VDAR W  LAKRRVQHYGYEFCY TRNV+TK +LGELPSFVS +++RI +FPN 
Subjt:  ITR-PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNT

Query:  ED-IADASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYV
        ++  A  +LDQLTVNEYP GVGLSPHIDTHSAFE  IFSLSLAGPCIMEFRRY   TW    S+ D +      DS+ ++KA+YLPPRSMLLLSGEARY 
Subjt:  ED-IADASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYV

Query:  WHHYIPHHKIDMVKDSSIRRGRRRVSFTFRKVSRY
        W+HYIPHHKID VKD  IRR  RRVSFT RKV  +
Subjt:  WHHYIPHHKIDMVKDSSIRRGRRRVSFTFRKVSRY

AT1G31600.2 RNA-binding (RRM/RBD/RNP motifs) family protein | e value: 1.1e-116 | % identity: 66.17
Query:  MELPRFSRPNQDYGSSSSSTP---ILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYS
        M  PRF RP Q   SS S  P    LYVANCGPAVG++H AIAAVFA FG V GV+ ADD+G RVIV F++  SA+AALEAL GRPCP L GR+LHIRYS
Subjt:  MELPRFSRPNQDYGSSSSSTP---ILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYS

Query:  ITRPSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTE
        + +   S+ ND + VSL  SEL+IPGLFLL DFV   EE+ LL  VDAR W  LAKRRVQHYGYEFCY TRNV+TK +LGELPSFVS +++RI +FPN +
Subjt:  ITRPSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTE

Query:  D-IADASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVW
        +  A  +LDQLTVNEYP GVGLSPHIDTHSAFE  IFSLSLAGPCIMEFRRY   TW    S+ D +      DS+ ++KA+YLPPRSMLLLSGEARY W
Subjt:  D-IADASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVW

Query:  HHYIPHHKIDMVKDSSIRRGRRRVSFTFRKVSRY
        +HYIPHHKID VKD  IRR  RRVSFT RKV  +
Subjt:  HHYIPHHKIDMVKDSSIRRGRRRVSFTFRKVSRY

AT1G31600.3 RNA-binding (RRM/RBD/RNP motifs) family protein | e value: 4.4e-118 | % identity: 66.57
Query:  MELPRFSRPNQDYGSSSSSTP---ILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYS
        M  PRF RP Q   SS S  P    LYVANCGPAVG++H AIAAVFA FG V GV+ ADD+G RVIV F++  SA+AALEAL GRPCP L GR+LHIRYS
Subjt:  MELPRFSRPNQDYGSSSSSTP---ILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYS

Query:  ITR-PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNT
        + + PS +Q ND + VSL  SEL+IPGLFLL DFV   EE+ LL  VDAR W  LAKRRVQHYGYEFCY TRNV+TK +LGELPSFVS +++RI +FPN 
Subjt:  ITR-PSISQPNDSLSVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNT

Query:  ED-IADASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYV
        ++  A  +LDQLTVNEYP GVGLSPHIDTHSAFE  IFSLSLAGPCIMEFRRY   TW    S+ D +      DS+ ++KA+YLPPRSMLLLSGEARY 
Subjt:  ED-IADASLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYV

Query:  WHHYIPHHKIDMVKDSSIRRGRRRVSFTFRKVSRY
        W+HYIPHHKID VKD  IRR  RRVSFT RKV  +
Subjt:  WHHYIPHHKIDMVKDSSIRRGRRRVSFTFRKVSRY

AT4G02485.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 1.9e-12 | % identity: 26.48
Query:  DIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIADAS---------LDQLTVN
        +I GL+L  +F++   +  LL  +    W                +   ++N   + G+LPS+ + + D I     + D+   S          DQL VN
Subjt:  DIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIADAS---------LDQLTVN

Query:  EYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI--PHHKIDMV
         Y PG G+  H+D    FE  I  +SL  PC+M F                    +    + Y    + L P S++L+SGEARY W H I    +   + 
Subjt:  EYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYI--PHHKIDMV

Query:  KDSSIRRGRRRVSFTFRKV
        +   I + +RR+S T RK+
Subjt:  KDSSIRRGRRRVSFTFRKV
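
The alignments above use a BLAST-style middle line: a letter marks an identical residue, "+" a conservative substitution, and a blank a mismatch or gap. As a rough sketch of how a per-segment identity could be tallied from that middle line, the Python below counts letters on the midline and divides by the segment length. The helper name identity_from_midline is hypothetical, and the result covers only one segment, so it will not generally equal the whole-hit % identity reported in the hit headers.

```python
def identity_from_midline(query: str, midline: str) -> tuple[int, int]:
    """Return (identical_positions, segment_length) for one alignment segment."""
    # Trailing blanks on the midline are often lost in copied text; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    # On the midline, only identical positions are shown as letters.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query)

# Final segment of the KAG6591851.1 alignment, with the leading label removed.
query = "PHHKIDMVKDSSIRRGRRRVSFTFRKV"
midline = "PHHKIDMVKDS+IRRG RRVSFTFRKV"
identical, length = identity_from_midline(query, midline)
print(identical, length, round(100 * identical / length, 2))
# prints: 25 27 92.59
```

To approximate a whole-hit figure, the same count would need to be summed over every segment of a hit before dividing by the total aligned length.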


Sequences
CDS sequence
ATGGAGTTGCCAAGATTCAGCCGTCCAAACCAAGATTATGGATCTTCTTCTTCTTCTACCCCTATTCTCTACGTTGCTAACTGCGGACCCGCCGTCGGAATCAGCCACCC
CGCAATCGCGGCCGTTTTTGCCCATTTTGGCCATGTCAAAGGTGTTCACCCCGCCGACGATACCGGCGCTCGTGTCATCGTATGTTTCTCTGAAGAATCCAGCGCCCGAG
CCGCTCTTGAGGCGCTTCACGGCCGCCCTTGCCCTCTCCTTGGAGGCCGGACTTTGCACATACGTTATTCCATCACCAGACCATCCATTTCGCAACCTAATGATTCTCTT
TCAGTTTCTTTGTCGGCTTCGGAGCTGGACATTCCCGGACTTTTCTTATTGCACGATTTCGTTAATGCTAAAGAAGAGGAGGATTTGCTTAGGGAAGTTGATGCTCGACC
TTGGAATAATCTGGCGAAACGCAGAGTTCAGCATTATGGGTATGAGTTTTGTTATCAAACGAGGAATGTTAATACTAAACATCAGTTGGGGGAACTTCCATCATTTGTTT
CCCATGTAGTTGATAGGATCTCCATGTTTCCAAACACTGAGGATATTGCAGATGCTTCTCTTGATCAATTGACGGTTAATGAATACCCACCTGGGGTGGGTTTGTCCCCT
CATATAGACACCCATTCTGCATTTGAAGGATTAATTTTCAGCCTTTCCTTAGCAGGGCCATGCATTATGGAGTTTAGGAGATATCCTGAAGGCACTTGGCACAAATTCCC
TTCAAGTATAGATTTGAAAATGGAGAATTCTGTAAACGACTCAAATTATCTGAGGAAAGCCATTTACCTCCCCCCTCGGTCTATGCTTCTACTGTCTGGAGAGGCACGCT
ATGTTTGGCATCACTACATTCCTCACCATAAGATTGACATGGTAAAGGACAGTTCTATCAGAAGGGGTCGTAGGAGAGTTTCTTTTACATTTCGCAAGGTGAGTAGATAT
AATGCATATGTTCTTTCCCTCCCTCTCGTACCTGTTGAGCTTACCTTACGTGCTTCTATTATCATTGATGTCAAGACAGCTATTATCGTACCCATGGCAATTCTGAAATA
TTATAAATGTTAA
mRNA sequence
TTTTGATGTATTTGTACCTAAGTTAGCTGATTATGATATATTTGAAAACAAGAGATATAATTGCAATTTCTATCTACCAATGCCTTACATGCAGTTCTTGTAATTATTTC
AATAAATGGTCAAGAAGTCAAAAGGTCGAAATTAGCCAACGTAGATATCAGTTTACTTAGACACACATTTGTGAGTCAGATTGGACGAGTAATAAAAAGAAAAGCTAAGC
ATCAAAATCACCCGCCGTGCATTAATCCTCGAGTTCATAGTTATTTAGTTCCTTCTCCAAATCCTCAAATGGAGTTGCCAAGATTCAGCCGTCCAAACCAAGATTATGGA
TCTTCTTCTTCTTCTACCCCTATTCTCTACGTTGCTAACTGCGGACCCGCCGTCGGAATCAGCCACCCCGCAATCGCGGCCGTTTTTGCCCATTTTGGCCATGTCAAAGG
TGTTCACCCCGCCGACGATACCGGCGCTCGTGTCATCGTATGTTTCTCTGAAGAATCCAGCGCCCGAGCCGCTCTTGAGGCGCTTCACGGCCGCCCTTGCCCTCTCCTTG
GAGGCCGGACTTTGCACATACGTTATTCCATCACCAGACCATCCATTTCGCAACCTAATGATTCTCTTTCAGTTTCTTTGTCGGCTTCGGAGCTGGACATTCCCGGACTT
TTCTTATTGCACGATTTCGTTAATGCTAAAGAAGAGGAGGATTTGCTTAGGGAAGTTGATGCTCGACCTTGGAATAATCTGGCGAAACGCAGAGTTCAGCATTATGGGTA
TGAGTTTTGTTATCAAACGAGGAATGTTAATACTAAACATCAGTTGGGGGAACTTCCATCATTTGTTTCCCATGTAGTTGATAGGATCTCCATGTTTCCAAACACTGAGG
ATATTGCAGATGCTTCTCTTGATCAATTGACGGTTAATGAATACCCACCTGGGGTGGGTTTGTCCCCTCATATAGACACCCATTCTGCATTTGAAGGATTAATTTTCAGC
CTTTCCTTAGCAGGGCCATGCATTATGGAGTTTAGGAGATATCCTGAAGGCACTTGGCACAAATTCCCTTCAAGTATAGATTTGAAAATGGAGAATTCTGTAAACGACTC
AAATTATCTGAGGAAAGCCATTTACCTCCCCCCTCGGTCTATGCTTCTACTGTCTGGAGAGGCACGCTATGTTTGGCATCACTACATTCCTCACCATAAGATTGACATGG
TAAAGGACAGTTCTATCAGAAGGGGTCGTAGGAGAGTTTCTTTTACATTTCGCAAGGTGAGTAGATATAATGCATATGTTCTTTCCCTCCCTCTCGTACCTGTTGAGCTT
ACCTTACGTGCTTCTATTATCATTGATGTCAAGACAGCTATTATCGTACCCATGGCAATTCTGAAATATTATAAATGTTAAACTTCGGTATCACCAGACCATGTTATTCT
AGTTTGATCCATTAATATTGAGGCACCTTGGCAATGGCTATATAACAGTAAAGAGAGGAAGCTAGAATTTAAAATGTCAGCTCTCCTATGGTTTCTTTCTTCCTTCATTA
GATTTTGAATGATTGCTCACCTCTTAAAGTTCAGCGTATTTTCTCTTTTAGTGAACTTTAGTGCACAATCCATGGTGGTCACCTACTTAGGATGTAATATCCTTTAAGTT
TTTTTAACACTACACCCAAATATTGTAAGCTCAGTCCTGTGAGATCAGCAGAGATGTTTGTAAATTAGCTCGAACAATCGCAAATAAAAAAATAGCCCAATCTCCTCCAC
GATTAACTTTTGCTCGATCTGTAGATTTTAGTCCTCAAGAGGTGCTTGGCTCATGGGGGCTTTTGGAGCTCATCTGATAGAATATAGGCCACAAAACCACAAATCAATCA
CAGCCACAAAGAGTACATAAACAGAAATTTGAAACCTTATTCTTCTCAGCATAATTCATATTGAGGTTATTCAAATCGAGAAACCTTTTACATGAAGCACGGTTTTTGCT
TTACTAATGCTTGTTTGGCCATCCCGCTTTACAATGATGAAAACTTTTTGATAATAATGGAACTAAATATGTGGAAACTGGTTGAAATGTATGTGAATGGAATGACTAAG
CCATGCTACAAAGTTGAAAATCTCAAGCCAATGCCTTGCGACTTGAAGATATGGTGGTAAAAACATATCTAGTAACCAGTTATTCGTTTGCTTCGATTTTCTCGGGATGC
TATGCAAAATTCTACCGTTCTCTATGGTCTAAAAGTTGATGTTTTTATTTCATCTTATCATCTCACATGCCTTACCATGGTTGAATAAAGGGGAAAAAAATTGTCTTATT
GTAGGTGAGAACTGATCCTTGCCAATGCAAATTTCCCCATTATTGCGATTCTCAGAGATAAATGAAAGGGCTCAATCCTCATTGAAGAACAGGCTGAGTTTTCTTTTTAT
CCTCATTCATCCATCCTTTTACATATGGTTCTTTAAAGTAGATTTTCTACATATTTTTACTTGGTTATACTTACAGGGGGATTTTTCGATACCTCTCACATTTATTGTTT
AAATATTTTACCCTCTTATGTTCTAATTATGGATGCAGCAACAGCATTTTGTTAAACTCCCATTAATTACAGGGAATGAAACTGGGGATGTGGAAACGTTATAGAGGTTA
AAATTCAGGACCTCTTTTCTCACTTTTATTCATTTTAAAATCGTTTTAATTGTCTGTTTGATAATGTTATTTTGGCGTAAATGATTGTTTGATAACCATTTGATTTTTTA
ATTAGTTTCAAACTATTGGATAAAACATTCAACTTGAAGATTTCATCTCTCGTTCACATTTTTTTCTAGTTACAATGAAAGGTTGCGGGACATTAATCCATGGTCATTTA
GGATGTCAACAAGCACTTTATCTATTAGATGATGTTTGGATTTACTGTTTTACATTTTTTGAGTGAAAAATATTTTTTTAAAATTATTATTTCTTTACCAATTTCATTGT
GATAACTCAGTCAAATGTAGCTTTTTAGGGCCGTTCGGAACTAGAACTGAGACTGAACTGAGTTGTTATAGTCCTAGAGCCCGTTCGGGGAAAGGAAAGGAATAGGGATT
TGATCCCTCATTTTTCCTTCTTATTCCTTCCTCTTTCCCTCTTCAGCTCTCTTTCCCTCATTTATTTATTTTTATTATTTTGTTTCTTAAAATATCACTTTGTTATTTCC
CTCTCAGCCCCTTTCCTTATTAAAATATCATAAAATAAAGTTGATTAATTTATTTTTATTATTTTGTTTATTAAAATATCATTTTTAATTTGTGAGATTTAAATTTTAAT
TATTAGGACGTAAATATTCTAAGTTTAGCATTTTCTAAATTTAAGATTTAACCAAATTTAAGATTTCAAACAAATTGATATCTGATTGTGTTGTTAATTTGATATATATT
CTTTAGTATAATAAATTTTAGTCGGCTAATAATTCATAAAAATATTATAAATTTATTTGGGAAATTTAGCTTATAAAATATCAGTGTGGTG
Protein sequence
MELPRFSRPNQDYGSSSSSTPILYVANCGPAVGISHPAIAAVFAHFGHVKGVHPADDTGARVIVCFSEESSARAALEALHGRPCPLLGGRTLHIRYSITRPSISQPNDSL
SVSLSASELDIPGLFLLHDFVNAKEEEDLLREVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNTEDIADASLDQLTVNEYPPGVGLSP
HIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKFPSSIDLKMENSVNDSNYLRKAIYLPPRSMLLLSGEARYVWHHYIPHHKIDMVKDSSIRRGRRRVSFTFRKVSRY
NAYVLSLPLVPVELTLRASIIIDVKTAIIVPMAILKYYKC
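
A quick consistency check on the sequences above: counting the wrapped lines gives a CDS of 1113 nt (ten lines of 110 nt plus 13 nt) and a protein of 370 residues (three lines of 110 plus 40), which agrees with 371 codons including the terminal TAA stop. The sketch below expresses that check in Python; cds_protein_lengths_agree is a hypothetical helper, and it compares lengths only without translating codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def cds_protein_lengths_agree(cds: str, protein: str) -> bool:
    """Check that a CDS length is consistent with a protein length."""
    # Remove line wrapping and normalize case before measuring.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split())
    if len(cds) % 3 != 0:
        return False
    # A terminal stop codon does not contribute a residue.
    has_stop = cds[-3:] in STOP_CODONS
    codons = len(cds) // 3 - (1 if has_stop else 0)
    return codons == len(protein)

# First three codons of the CDS above plus its stop codon, against "MEL".
print(cds_protein_lengths_agree("ATGGAGTTGTAA", "MEL"))
# Lengths counted from the wrapped lines in this record.
print(1113 // 3 - 1 == 370)
```

Passing the full CDS and protein strings from this record to the same function would test the complete sequences rather than the counted lengths.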