; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G084360 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G084360
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionProtein of unknown function (DUF506)
Genome locationCiama_Chr05:4746794..4749030
RNA-Seq ExpressionCaUC05G084360
SyntenyCaUC05G084360
Gene Ontology termsNA
InterPro domainsIPR006502 - Protein of unknown function PDDEXK-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049317.1 uncharacterized protein E6C27_scaffold171G005590 [Cucumis melo var. makuwa]3.7e-20590.39Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT
        MPFPMKIQPIDIDV+T REQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPI VGE AQFSSKDGGGGGT+LEPSSVCLDKMVQNFIEE+NEKQPAT
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT

Query:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK
        VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLR+IVTDALSSLGYNSSICK
Subjt:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK

Query:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
        SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYK ILQTLPY+FVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
Subjt:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS

Query:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLAS
        +P RTADS+ N +P+ EP+E   P  PND LV  TDCGEFELIFGEES   +PN+S SGD ESP  ENK P G+AP WQPPAIKPKSIDKGAKIVTGLAS
Subjt:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLAS

Query:  LLKEKS
        LLKEKS
Subjt:  LLKEKS

XP_004134108.1 uncharacterized protein LOC101220013 [Cucumis sativus]5.7e-20690.91Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKD-GGGGGTELEPSSVCLDKMVQNFIEESNEKQPA
        MPFPMKIQPIDIDV+TVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGE AQFSSKD GGGGGTELEPSSVCLDKMVQNFIEE+NE+QPA
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKD-GGGGGTELEPSSVCLDKMVQNFIEESNEKQPA

Query:  TVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSIC
        TVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLR+IVTDALS LGYNSSIC
Subjt:  TVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSIC

Query:  KSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWL
        KSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTG YK ILQTLPY+FVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWL
Subjt:  KSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWL

Query:  SSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLA
        S+P RTADSI N +P+ EP+E   P I ND LV  TDCGEFELIFGEES   + N+S+SGD ESP+GENKPP G+APPWQPPAIKPKSIDKGAKIVTGLA
Subjt:  SSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLA

Query:  SLLKEKS
        SLLKEKS
Subjt:  SLLKEKS

XP_008438600.1 PREDICTED: uncharacterized protein LOC103483661 [Cucumis melo]2.3e-20289.41Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT
        MPFPMKIQPIDIDV+T REQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPI VGE AQFSSKDGGGGGT+LEPSSVCLDKMVQNFIEE+NEKQPAT
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT

Query:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK
        VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLR+IVTDALSSLGYNSSICK
Subjt:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK

Query:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
        SKWEKSPSFPAGEYEYVDVILDG+RLLIDIDFRSEFEIARSTGTYK ILQTLPY+FVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
Subjt:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS

Query:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLAS
        +P RTADS+ N +P+ EP+E   P  P+D LV  TDCGEFELIFGEES   +PN+S SGD ESP  ENK    +AP WQPPAIKPKSIDKGAKIVTGLAS
Subjt:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLAS

Query:  LLKEKS
        LLKEKS
Subjt:  LLKEKS

XP_023539817.1 uncharacterized protein LOC111800384 [Cucurbita pepo subsp. pepo]3.3e-19387.01Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT
        MPFPMKIQPIDIDVRTVREQVRT++AKP+FKSRLRRLFDRPFPSVLR S VEKP IVGEPAQFS       GTE EPSSVCLDKMVQNFIEESNEKQP  
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT

Query:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK
        VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILK LIPC SVTERNLLADASKIVEKHNK+HKRKDDLRRIVTD LSSLGYNSSICK
Subjt:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK

Query:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
        SKWEKSPSFPAGEYEY+DVI+DGERLLIDIDF+SEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
Subjt:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS

Query:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEESAPNVSVSGDPESP-----SGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGL
        SPIRTAD + N  PEAEP E   P I ND LV DTDCGEFELIFGEESAP+ S + DP SP     S + KPP GTA PWQPPA+KPKSIDKGAKIVTGL
Subjt:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEESAPNVSVSGDPESP-----SGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGL

Query:  ASLLKEKS
        ASLLKEKS
Subjt:  ASLLKEKS

XP_038898340.1 uncharacterized protein LOC120086017 [Benincasa hispida]8.8e-21595.04Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT
        MPFPMKIQPIDIDV+TVREQVRTESAKPVFKSRLRRLFDRPFPSVLRI+AVEKPIIVGEPAQFSSKD GGGGTELEPSSVCLDKMVQNFIEE+NEKQPAT
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT

Query:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK
        VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK
Subjt:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK

Query:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
        SKWEKSPSFPAGEYEYVDVILDGERLL+DIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVS+AARQSLKKKGMHFPPWRKAEYMLAKWLS
Subjt:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS

Query:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEESAPNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLASLLK
        SP RTADS+P   P+AEP EETKPPI ND LV DTDCGEFELIFGEES P+VS+SGD ESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLASLLK
Subjt:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEESAPNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLASLLK

Query:  EKS
        EKS
Subjt:  EKS

TrEMBL top hitse value%identityAlignment
A0A0A0L8I2 Uncharacterized protein2.8e-20690.91Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKD-GGGGGTELEPSSVCLDKMVQNFIEESNEKQPA
        MPFPMKIQPIDIDV+TVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGE AQFSSKD GGGGGTELEPSSVCLDKMVQNFIEE+NE+QPA
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKD-GGGGGTELEPSSVCLDKMVQNFIEESNEKQPA

Query:  TVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSIC
        TVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLR+IVTDALS LGYNSSIC
Subjt:  TVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSIC

Query:  KSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWL
        KSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTG YK ILQTLPY+FVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWL
Subjt:  KSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWL

Query:  SSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLA
        S+P RTADSI N +P+ EP+E   P I ND LV  TDCGEFELIFGEES   + N+S+SGD ESP+GENKPP G+APPWQPPAIKPKSIDKGAKIVTGLA
Subjt:  SSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLA

Query:  SLLKEKS
        SLLKEKS
Subjt:  SLLKEKS

A0A1S3AXG0 uncharacterized protein LOC1034836611.1e-20289.41Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT
        MPFPMKIQPIDIDV+T REQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPI VGE AQFSSKDGGGGGT+LEPSSVCLDKMVQNFIEE+NEKQPAT
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT

Query:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK
        VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLR+IVTDALSSLGYNSSICK
Subjt:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK

Query:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
        SKWEKSPSFPAGEYEYVDVILDG+RLLIDIDFRSEFEIARSTGTYK ILQTLPY+FVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
Subjt:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS

Query:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLAS
        +P RTADS+ N +P+ EP+E   P  P+D LV  TDCGEFELIFGEES   +PN+S SGD ESP  ENK    +AP WQPPAIKPKSIDKGAKIVTGLAS
Subjt:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLAS

Query:  LLKEKS
        LLKEKS
Subjt:  LLKEKS

A0A5A7U4P5 Uncharacterized protein1.8e-20590.39Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT
        MPFPMKIQPIDIDV+T REQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPI VGE AQFSSKDGGGGGT+LEPSSVCLDKMVQNFIEE+NEKQPAT
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT

Query:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK
        VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLR+IVTDALSSLGYNSSICK
Subjt:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK

Query:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
        SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYK ILQTLPY+FVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
Subjt:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS

Query:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLAS
        +P RTADS+ N +P+ EP+E   P  PND LV  TDCGEFELIFGEES   +PN+S SGD ESP  ENK P G+AP WQPPAIKPKSIDKGAKIVTGLAS
Subjt:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES---APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLAS

Query:  LLKEKS
        LLKEKS
Subjt:  LLKEKS

A0A6J1C8R5 uncharacterized protein LOC1110094231.0e-19285.34Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGG-----GTELEPSSVCLDKMVQNFIEESNE
        MPFPMKIQPIDID RT REQ+RT+SAKPVFKSRLRRLFDRPFPSVLRISAVEKP IVGEPAQFSSKDGGGG     GTE EP+SVCLDKMVQNFIE+SNE
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGG-----GTELEPSSVCLDKMVQNFIEESNE

Query:  KQPATVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYN
        KQPA VKYGRNRCNCFN NSNDSSDDEFDVFGGFGESITSGSSGGDACD+LK LIPC SVTERNLLADASKIVEKHNKIHKRKDDLRRIVTD LSSLGYN
Subjt:  KQPATVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYN

Query:  SSICKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYML
        SSIC+SKW+KSPSFPAGEYEYVDV +DGERLLIDIDFRSEFEIARSTGTYKAILQTLP VFVGKSDRL QIVSIVSEAARQSLKKKGMHFPPWRKAEY L
Subjt:  SSICKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYML

Query:  AKWLSSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFG--EESAPNVSVSGDPESP------SGENKPPTGTAPPWQPPAIKPKSIDK
        AKWLS P R  DS     PE    EE +P I ND LV DTDCGEFELIFG  EESAP+ +V GDPESP      SG+NK PTGTA PWQPPAIKPKS+DK
Subjt:  AKWLSSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFG--EESAPNVSVSGDPESP------SGENKPPTGTAPPWQPPAIKPKSIDK

Query:  GAKIVTGLASLLKEKS
        GAKIVTGLASLLKEKS
Subjt:  GAKIVTGLASLLKEKS

A0A6J1F9C5 uncharacterized protein LOC1114434881.0e-19286.52Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT
        MPFPMKIQPIDIDVRTVREQVRT++AKP+FK RLRRLFDRPFPSVLR S VEKP IV EPAQFS       GTE EPSSVCLDKMVQNFIEESNEKQP  
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPAT

Query:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK
        VKYGRN CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILK LIPC SVTERNLLADASKIVEKHNK+HKRKDDLRRIVTD LSSLGYNSSICK
Subjt:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICK

Query:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
        SKWEKSPSFPAGEYEY+DVI+DGERLL+DIDF+SEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
Subjt:  SKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS

Query:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEESAPNVSVSGDPESP-----SGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGL
        SPIRTADS+ N  PEAEP E   P I ND LV DTDCGEFELIFGEES+P+ SV+GDP SP     S + KPP GTA PWQPPA+KPKSIDKGAKIVTGL
Subjt:  SPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEESAPNVSVSGDPESP-----SGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGL

Query:  ASLLKEKS
        ASLLKEKS
Subjt:  ASLLKEKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38820.1 Protein of unknown function (DUF506)4.4e-7149.35Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESN--EKQP
        MP  MKIQPID +     E    E+ + + KSRL+RLF+R F +    +  EK    G   +     G  G  + EPSSVCL KMV NF+E++N  EKQ 
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESN--EKQP

Query:  ATVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSI
           + GR+RCNCF+G+  +SSDDE +             S G+AC+ILK L+ C S+  RNLL D +KI E                        Y++++
Subjt:  ATVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSI

Query:  CKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKW
        CKS+WEKSPS PAGEYEYVDVI+ GERLLIDIDF+S+FEIAR+T TYK++LQTLPY+FVGK+DRL +I+ ++ +AA+QSLKKKG+H PPWR+AEY+ +KW
Subjt:  CKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKW

Query:  LSSPIR
        LSS +R
Subjt:  LSSPIR

AT2G38820.2 Protein of unknown function (DUF506)3.5e-7650.98Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESN--EKQP
        MP  MKIQPID +     E    E+ + + KSRL+RLF+R F +    +  EK    G   +     G  G  + EPSSVCL KMV NF+E++N  EKQ 
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESN--EKQP

Query:  ATVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSI
           + GR+RCNCF+G+  +SSDDE +             S G+AC+ILK L+ C S+  RNLL D +KI E       +     + V + L SLGY++++
Subjt:  ATVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSI

Query:  CKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKW
        CKS+WEKSPS PAGEYEYVDVI+ GERLLIDIDF+S+FEIAR+T TYK++LQTLPY+FVGK+DRL +I+ ++ +AA+QSLKKKG+H PPWR+AEY+ +KW
Subjt:  CKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKW

Query:  LSSPIR
        LSS +R
Subjt:  LSSPIR

AT3G22970.1 Protein of unknown function (DUF506)1.2e-10555.64Show/hide
Query:  MPFPMKIQPIDIDVRTVREQVRTESA-KPVFKSRLRRLFDRPFPSVLRIS---AVEKPIIV-GEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNE
        MPF MKIQPIDID  +     R ES  KPV KSRL+RLFDRPF +VLR S     EKP +V G   Q      GG  TE EPSSVCL KMVQNFIEE+NE
Subjt:  MPFPMKIQPIDIDVRTVREQVRTESA-KPVFKSRLRRLFDRPFPSVLRIS---AVEKPIIV-GEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNE

Query:  KQPATVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYN
        KQ    K GRNRCNCFNGN++ SSDDE D+FGG          G DA D LK LIPCT+V ERNLLADA+KIV+K NK  KRKDD+++IV + L SL YN
Subjt:  KQPATVKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYN

Query:  SSICKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYML
        SSICKSKW+KSPSFPAGEYEY+DVI+  ERL+ID+DFRSEF+IAR T  YK +LQ+LP++FVGKSDRL QIV ++SEAA+QSLKKKGM FPPWRKAEYM 
Subjt:  SSICKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYML

Query:  AKWLSSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES-APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTG
        +KWLSS  R +  + + T            +  ++     D  E EL+F E+  +P V V+       G++                  ++++  K VTG
Subjt:  AKWLSSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEES-APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTG

Query:  LASLLKEK
        LASL KEK
Subjt:  LASLLKEK

AT3G22970.2 Protein of unknown function (DUF506)3.9e-6753.28Show/hide
Query:  ILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGT
        I+  LIPCT+V ERNLLADA+KIV+K NK  KRKDD+++IV + L SL YNSSICKSKW+KSPSFPAGEYEY+DVI+  ERL+ID+DFRSEF+IAR T  
Subjt:  ILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICKSKWEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGT

Query:  YKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIF
        YK +LQ+LP++FVGKSDRL QIV ++SEAA+QSLKKKGM FPPWRKAEYM +KWLSS  R +  + + T            +  ++     D  E EL+F
Subjt:  YKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSSPIRTADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIF

Query:  GEES-APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLASLLKEK
         E+  +P V V+       G++                  ++++  K VTGLASL KEK
Subjt:  GEES-APNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLASLLKEK

AT4G14620.1 Protein of unknown function (DUF506)4.2e-9051.97Show/hide
Query:  MKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPATVKYG
        MKIQPI+ D+   R +    S KPV KSRL+RL DRPF    RIS  EK +I G        DG   GTE EPS   L KMVQN++EE+N+KQ    K G
Subjt:  MKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPATVKYG

Query:  RN--RCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICKSK
        RN  RCNCFNGN ND SDDE D F                 D  K LI C S  E++LL +A+KI+EK NK  KRKD+LR+IV D LSSLGY+SSICKSK
Subjt:  RN--RCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICKSK

Query:  WEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSSP
        W+K+ S PAGEYEY+DVI++GERL+IDIDFRSEFEIAR T  YK +LQ+LP +FVGKSDR+ QIVSIVSEA++QSLKKKGMHFPPWRKA+YM AKWLSS 
Subjt:  WEKSPSFPAGEYEYVDVILDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSSP

Query:  IR-TADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEE------SAPNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGL
         R + +  P +T  A+   E            + D  E ELIF E+       +P  SV  D +  +                    +S+ K AK+VTGL
Subjt:  IR-TADSIPNITPEAEPEEETKPPIPNDLLVIDTDCGEFELIFGEE------SAPNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGL

Query:  ASLLKE
        A L KE
Subjt:  ASLLKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTTTCCGATGAAGATCCAACCGATTGATATCGATGTTCGGACGGTGAGAGAACAGGTTCGTACCGAGTCGGCCAAGCCTGTGTTCAAATCACGACTGAGGCGGCT
TTTCGATCGGCCGTTTCCGAGCGTTCTGAGAATTTCTGCGGTTGAGAAGCCGATTATTGTTGGAGAACCAGCTCAGTTTAGCAGCAAAGATGGAGGAGGAGGAGGGACGG
AGTTGGAGCCAAGCTCGGTTTGCTTAGATAAGATGGTACAGAATTTCATCGAGGAGAGCAACGAAAAACAACCGGCAACCGTCAAATATGGGCGCAATCGCTGCAATTGT
TTCAACGGGAATAGCAATGACAGCTCCGACGATGAATTCGATGTATTTGGGGGTTTTGGTGAATCGATCACCTCCGGATCATCCGGTGGGGATGCATGTGATATACTCAA
GGGTTTAATTCCTTGCACGAGCGTTACGGAGAGAAACCTCTTAGCAGACGCTTCGAAAATCGTCGAAAAGCATAACAAAATTCACAAACGGAAAGACGATTTACGCAGAA
TCGTAACCGATGCTCTCTCATCCCTCGGTTACAATTCTTCCATCTGCAAATCAAAATGGGAGAAATCCCCTTCTTTCCCAGCTGGTGAATACGAATACGTGGATGTGATT
CTGGACGGAGAACGATTGCTGATTGATATCGATTTTAGGTCTGAATTCGAGATTGCTCGTTCGACAGGAACATATAAGGCTATCCTGCAGACGCTGCCGTACGTCTTCGT
TGGAAAATCGGATCGTCTCGGACAAATTGTATCCATCGTATCGGAAGCTGCAAGACAGAGCCTGAAGAAGAAAGGAATGCACTTTCCGCCATGGAGGAAAGCCGAGTACA
TGCTTGCCAAATGGCTCTCTTCTCCGATCAGAACCGCCGATTCTATCCCCAACATTACTCCAGAGGCCGAACCCGAAGAAGAAACCAAACCCCCAATCCCAAATGACCTG
CTGGTAATCGACACCGATTGCGGTGAATTCGAGTTGATCTTCGGCGAGGAATCGGCGCCGAATGTGTCCGTCTCCGGCGATCCTGAATCGCCGTCCGGTGAAAATAAACC
GCCGACGGGGACTGCTCCCCCGTGGCAACCTCCTGCGATTAAGCCAAAGAGCATAGATAAAGGGGCTAAGATAGTCACCGGATTAGCTTCCCTTCTCAAAGAGAAATCTT
GA
mRNA sequenceShow/hide mRNA sequence
TCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCACAGAATTCATCTTCTCAGGCGACCTCCTCGTCGGAAGATGAATTTTCCGATCAACCTCCCCTTCAGATCTGGAC
TTGCTGTTTCCTTCTCCGATCATCGTCGTCATGTTTTTCACGAAAAATCTCTGAGATAGACCTCCCTCCTCTGTTTCCTTTCCTTATTCTGTACTTGCTTCTTGCTCCAG
CATTTTTCCGGAGGTATTTTCTCTTTCGGCCTTTGTATATACATTTTTGCGGTGGTTTTGTTTTTTCCTTTCTATCGCTGCACGGATATTCTGAGTGTGGAGTAGCTGAC
CGGTGGGTGTGTGGAGGTATTAGTGGAAGGGTGGCATTGATTCCAGAGCTGTTTGGTTCTTTGTTTTTGAGTGGGAGATTTTGATCGTACATGCAACTATCTGGAGCTCA
TTTCTCTGATTTTTCTCTGTGATTTGGTTTCTGCGTGTCCGTGAATTTCAATGCCAGACCGTGTTTGTCATTAGTCTCTGTAAATCGAGCTGCACAGAGAAGAGTTTCAG
GAGGCGTAGTTAGGGAGTTCCCCCGGAGCAAATTTTTTTCCTTTACCCCTTTTCATTTTTGGAAATTTCTCGAGGGAGAAGAGATCTTTCTGTTTGTTTCTTCTCGGTCA
TGCCTTTTCCGATGAAGATCCAACCGATTGATATCGATGTTCGGACGGTGAGAGAACAGGTTCGTACCGAGTCGGCCAAGCCTGTGTTCAAATCACGACTGAGGCGGCTT
TTCGATCGGCCGTTTCCGAGCGTTCTGAGAATTTCTGCGGTTGAGAAGCCGATTATTGTTGGAGAACCAGCTCAGTTTAGCAGCAAAGATGGAGGAGGAGGAGGGACGGA
GTTGGAGCCAAGCTCGGTTTGCTTAGATAAGATGGTACAGAATTTCATCGAGGAGAGCAACGAAAAACAACCGGCAACCGTCAAATATGGGCGCAATCGCTGCAATTGTT
TCAACGGGAATAGCAATGACAGCTCCGACGATGAATTCGATGTATTTGGGGGTTTTGGTGAATCGATCACCTCCGGATCATCCGGTGGGGATGCATGTGATATACTCAAG
GGTTTAATTCCTTGCACGAGCGTTACGGAGAGAAACCTCTTAGCAGACGCTTCGAAAATCGTCGAAAAGCATAACAAAATTCACAAACGGAAAGACGATTTACGCAGAAT
CGTAACCGATGCTCTCTCATCCCTCGGTTACAATTCTTCCATCTGCAAATCAAAATGGGAGAAATCCCCTTCTTTCCCAGCTGGTGAATACGAATACGTGGATGTGATTC
TGGACGGAGAACGATTGCTGATTGATATCGATTTTAGGTCTGAATTCGAGATTGCTCGTTCGACAGGAACATATAAGGCTATCCTGCAGACGCTGCCGTACGTCTTCGTT
GGAAAATCGGATCGTCTCGGACAAATTGTATCCATCGTATCGGAAGCTGCAAGACAGAGCCTGAAGAAGAAAGGAATGCACTTTCCGCCATGGAGGAAAGCCGAGTACAT
GCTTGCCAAATGGCTCTCTTCTCCGATCAGAACCGCCGATTCTATCCCCAACATTACTCCAGAGGCCGAACCCGAAGAAGAAACCAAACCCCCAATCCCAAATGACCTGC
TGGTAATCGACACCGATTGCGGTGAATTCGAGTTGATCTTCGGCGAGGAATCGGCGCCGAATGTGTCCGTCTCCGGCGATCCTGAATCGCCGTCCGGTGAAAATAAACCG
CCGACGGGGACTGCTCCCCCGTGGCAACCTCCTGCGATTAAGCCAAAGAGCATAGATAAAGGGGCTAAGATAGTCACCGGATTAGCTTCCCTTCTCAAAGAGAAATCTTG
AGAGAGAGAAAAAGAAGTTTTGGATTTTTTTTTTTTTTCCCCTTTAATTTTCAGGATTAAAAAAATATATATATGAACATAAAAAAAAAATTTATATATATATATATATA
TATAAATACCTTCATAATATTTTGATACTTCTCTCATTATTTCGGATTCCA
Protein sequenceShow/hide protein sequence
MPFPMKIQPIDIDVRTVREQVRTESAKPVFKSRLRRLFDRPFPSVLRISAVEKPIIVGEPAQFSSKDGGGGGTELEPSSVCLDKMVQNFIEESNEKQPATVKYGRNRCNC
FNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKGLIPCTSVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDALSSLGYNSSICKSKWEKSPSFPAGEYEYVDVI
LDGERLLIDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSSPIRTADSIPNITPEAEPEEETKPPIPNDL
LVIDTDCGEFELIFGEESAPNVSVSGDPESPSGENKPPTGTAPPWQPPAIKPKSIDKGAKIVTGLASLLKEKS