python - Scipy:ipython 笔记本中的并行计算？-6ren

python - Scipy:ipython 笔记本中的并行计算？

转载作者：行者123 更新时间：2023-12-05 05:25:02

30

4

我正在对数据集(点的集合)进行核密度估计。

估计过程没问题，问题是，当我尝试获取每个点的密度值时，速度很慢:

from sklearn.neighbors import KernelDensity
# this speed is ok
kde = KernelDensity(bandwidth=2.0,atol=0.0005,rtol=0.01).fit(sample) 
# this is very slow
kde_result = kde.score_samples(sample)

样本由 300,000 (x,y) 个点组成。

我想知道是否可以让它并行运行，这样速度会更快？

例如，也许我可以将sample 分成更小的集合并同时为每个集合运行score_samples？具体来说:

我根本不熟悉并行计算。所以我想知道它是否适用于我的情况？
如果这真的可以加快进程，我应该怎么做？我只是在 ipython notebook 中运行脚本，之前没有这方面的经验，有没有适合我的案例的简单好例子？

更新:

import cProfile
cProfile.run('kde.score_samples(sample)')

        64 function calls in 8.653 seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
        1    0.000    0.000    8.653    8.653 <string>:1(<module>)
        2    0.000    0.000    0.000    0.000 _methods.py:31(_sum)
        2    0.000    0.000    0.000    0.000 base.py:870(isspmatrix)
        1    0.000    0.000    8.653    8.653 kde.py:133(score_samples)
        4    0.000    0.000    0.000    0.000 numeric.py:464(asanyarray)
        2    0.000    0.000    0.000    0.000 shape_base.py:60(atleast_2d)
        2    0.000    0.000    0.000    0.000 validation.py:105(_num_samples)
        2    0.000    0.000    0.000    0.000 validation.py:126(_shape_repr)
        6    0.000    0.000    0.000    0.000 validation.py:153(<genexpr>)
        2    0.000    0.000    0.000    0.000 validation.py:268(check_array)
        2    0.000    0.000    0.000    0.000 validation.py:43(_assert_all_finite)
        6    0.000    0.000    0.000    0.000 {hasattr}
        4    0.000    0.000    0.000    0.000 {isinstance}
       12    0.000    0.000    0.000    0.000 {len}
        2    0.000    0.000    0.000    0.000 {method 'append' of 'list' objects}
        1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}
        2    0.000    0.000    0.000    0.000 {method 'join' of 'str' objects}
        1    8.652    8.652    8.652    8.652 {method 'kernel_density' of 'sklearn.neighbors.kd_tree.BinaryTree' objects}
        2    0.000    0.000    0.000    0.000 {method 'reduce' of 'numpy.ufunc' objects}
        2    0.000    0.000    0.000    0.000 {method 'sum' of 'numpy.ndarray' objects}
        6    0.000    0.000    0.000    0.000 {numpy.core.multiarray.array}

最佳答案

这是一个使用 multiprocessing built-in module 进行并行化的简单示例:

import numpy as np
import multiprocessing
from sklearn.neighbors import KernelDensity

def parrallel_score_samples(kde, samples, thread_count=int(0.875 * multiprocessing.cpu_count())):
    with multiprocessing.Pool(thread_count) as p:
        return np.concatenate(p.map(kde.score_samples, np.array_split(samples, thread_count)))

kde = KernelDensity(bandwidth=2.0,atol=0.0005,rtol=0.01).fit(sample) 
kde_result = parrallel_score_samples(kde, sample)

正如您从上面的代码中看到的，multiprocessing.Pool 允许您映射一个工作进程池，在您的样本子集上执行 kde.score_samples。
如果您的处理器有足够的核心，加速将是显着的。

关于python - Scipy:ipython 笔记本中的并行计算？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/32625094/

30

4

0

文章推荐： .net - ILMerge 错误 : Key needs to be greater than 0

文章推荐： php - 比较两个 SQL 查询结果集

ipython - 我可以在不离开 ipython 的情况下停止在 ipython 中执行当前模块吗
我想跳出当前正在运行的程序并返回到 shell(无需重新启动 ipython) 最佳答案在 Windows 上重新安装 console2 和 ipython 后，我遇到了同样的问题。如果您使用 ip
ipython - 将 IPython 控制台与 IPython 笔记本一起使用
在使用 IPython 笔记本时，我越来越希望笔记本上附有一个控制台，以进行交互式编程。我发现自己添加了几行来测试代码片段，然后删除它们，这就是很好的用法。在更糟糕的用法中，我会更改同一行中的命令，一
ipython - IPython 魔法是如何工作的
ipthon-sql 是 ipython 的扩展，我先通过 pip install ipython-sql 安装项目在这里:https://github.com/catherinedevlin/ip
ipython - IPython Notebook中的while循环的优美中断
我正在ipython Notebook中运行一些数据分析。一台单独的计算机收集一些数据并将其保存到服务器文件夹中，我的笔记本电脑会定期在该服务器上扫描新文件并进行分析。我在while循环中执行此操作
ipython - ipython Notebook在不同端口上的多个实例
我想让多个ipython Notebook实例在同一用户的不同端口上运行。可能吗？类似于“NotebookApp.port”的端口列表(带有默认端口)。最佳答案再次运行jupyter noteb
ipython - `get_ipython' 在 IPython/IPython Notebook 的启动脚本中不起作用...？
所以 - ROOT 社区中的好人创造了以下魔法: # This is for intercepting the output of ROOT # In a cell, put %%rootprint
ipython - IPython 笔记本的外部编辑器
我正在使用 IPython 笔记本，我想在外部编辑器中编辑程序。我如何获得 %edit file_name.py打开 Notepad++ 等编辑器。最佳答案运行 %edit?将为您提供%edit
ipython - ipython 笔记本中几何代数的良好输出
精简版我能否在 ipython 笔记本中获得 sympy 几何代数对象的漂亮 latex 风格打印？更长的版本在ipython笔记本，我可以从 sympy 得到各种数学对象的 pretty-pr
ipython - ipython 变量扩展如何工作？
我不明白第四个和第六个提示中的 ${} 正在做什么，并且我找不到任何关于此的文档，Python for Unix and Linux 一书系统管理员有一个类似于第六个提示中的示例，其中变量不仅前面加上
ipython - 启动 IPython
我想在已安装 Python 2.7 的 Windows XP 计算机上运行 IPython(版本 0.12)。我通过 Windows 二进制安装程序安装，但安装后 IPython 没有显示在菜单中，
ipython - ipython 中的括号匹配
IPython 中是否有自动关闭方括号、引号、圆括号等的选项？我希望有一个类似于 gedit 插件中的功能。最佳答案通过调整 ~/.inputrc 可以让应用程序(包括 IPython)使用 r
ipython - IPython 提示号有什么用？
我正在使用 IPython Web 笔记本，每个 block 之前都有一个提示编号，例如“In [68]:”。这个提示号码的用途是什么？你能用它做任何事吗？您可以/应该重置它吗？最佳答案 IPyth
ipython - iPython Notebook 中的默认笔记本目录 - iPython 3.0.0
我升级到 iPython 3.0.0(Python 3.4；使用 Anaconda 环境；Mac OSX 10.9.5)，打开新的 iPython Notebook session 的行为似乎发生了变
ipython - 打开 'ipython notebook' 为 : IPython notebook vs Jupyter
我希望能找到更多关于以下内容的文档: From one computer: C:\Python>ipython notebook opens the browser as 'IPython Noteb
ipython - 无法在 IPython 笔记本中上传大文件
我正在尝试在我的 IPython 笔记本中上传一个大小为 500MB 的网络日志文件。但是我收到错误消息“无法上传文件 >25Mb”。有什么方法可以克服这个错误。任何帮助将不胜感激。谢谢。最佳答
ipython - %precision 不尊重 IPython
简单地说，魔术函数 %precision 不考虑简单变量输出的浮点精度。 #Configure matplotlib to run on the browser %matplotlib noteboo
ipython - 设置 iPython 默认配置文件
安装 IPython 后，我立即创建了一个默认配置文件: $ ipython profile create 然后，我创建了另一个，这次我给它起了名字testing: $ ipython profile
ipython - jupyter/ipython 中的拆分单元格不起作用
我已经尝试使用命令来拆分单元格“m -”，但它不起作用。使用 esc 或 fn 键作为修饰符时，所有键命令的重置都可以正常工作。我也处于正确的模式(edititng 模式)。最佳答案在编辑模式下，
ipython - 更改 ipython 笔记本字体类型
我想将 ipython 笔记本中的字体类型更改为 consolas 字体类型。我首先使用 ipython profile create 但是，我不清楚在此配置文件中指定字体类型的语法。任何帮助表示赞
ipython - 我可以将 ipython 命令行历史保存到笔记本文件中吗？
我正在使用 iPython 命令行界面，经过一些操作后，我想将操作历史记录保存到笔记本文件中。但我从一开始就没有使用 iPython notebook。我还能做到吗？最佳答案来自@Thomas K

首页

博学

6Ren·AI

商城

python - Scipy:ipython 笔记本中的并行计算？