python - uWSGI 和 joblib 信号量 : Joblib will operate in serial mode

转载作者：太空宇宙更新时间：2023-11-03 14:39:13

我在 Docker 容器内的 Flask 应用程序中运行 joblib 以及由 supervisord 启动的 uWSGI(启动时启用线程)。

网络服务器启动显示如下错误:

unable to load configuration from from multiprocessing.semaphore_tracker import main;main(15)
/usr/local/lib/python3.5/dist-packages/sklearn/externals/joblib/_multiprocessing_helpers.py:38: UserWarning:

[Errno 32] Broken pipe.  joblib will operate in serial mode

知道如何解决这个问题并使 joblib 并行运行吗？谢谢!

docker 容器中安装了以下包:

pytest==4.0.1
pytest-cov==2.6.0
flake8==3.6.0
Cython==0.29.3
numpy==1.16.1
pandas==0.24.0
scikit-learn==0.20.2
fancyimpute==0.4.2
scikit-garden==0.1.3
category_encoders==1.3.0
boto3==1.9.86
joblib==0.13.1
dash==0.37.0
dash-renderer==0.18.0
dash-core-components==0.43.1
dash-table==3.4.0
dash-html-components==0.13.5
dash-auth==1.3.2
Flask-Caching==1.4.0
plotly==3.6.1
APScheduler==3.5.3

编辑

问题是由于 uWSGI、nginx 或 supervisord。缺少 dev/shm 的权限不是问题，因为如果我直接运行 flask 服务器就可以创建信号量。在下面找到三个服务的配置文件。免责声明，我是网络服务器菜鸟，配置是通过从不同的博客复制和粘贴而诞生的，只是为了让它工作:-D

所以这是我的 uwsgi 配置:

[uwsgi]
module = prism_dash_frontend.__main__
callable = server

uid = nginx
gid = nginx

plugins = python3

socket = /tmp/uwsgi.sock
chown-socket = nginx:nginx
chmod-socket = 664

# set cheaper algorithm to use, if not set default will be used
cheaper-algo = spare

# minimum number of workers to keep at all times
cheaper = 3

# number of workers to spawn at startup
cheaper-initial = 5

# maximum number of workers that can be spawned
workers = 5

# how many workers should be spawned at a time
cheaper-step = 1
processes = 5

die-on-term = true
enable-threads = true

nginx 配置:

# based on default config of nginx 1.12.1
# Define the user that will own and run the Nginx server
user nginx;
# Define the number of worker processes; recommended value is the number of
# cores that are being used by your server
# auto will default to number of vcpus/cores
worker_processes auto;

# altering default pid file location
pid /tmp/nginx.pid;

# turn off daemon mode to be watched by supervisord
daemon off;

# Enables the use of JIT for regular expressions to speed-up their processing.
pcre_jit on;

# Define the location on the file system of the error log, plus the minimum
# severity to log messages for
error_log /var/log/nginx/error.log warn;

# events block defines the parameters that affect connection processing.
events {
    # Define the maximum number of simultaneous connections that can be opened by a worker process
    worker_connections  1024;
}


# http block defines the parameters for how NGINX should handle HTTP web traffic
http {
    # Include the file defining the list of file types that are supported by NGINX
    include /etc/nginx/mime.types;
    # Define the default file type that is returned to the user
    default_type text/html;

    # Don't tell nginx version to clients.
    server_tokens off;

    # Specifies the maximum accepted body size of a client request, as
    # indicated by the request header Content-Length. If the stated content
    # length is greater than this size, then the client receives the HTTP
    # error code 413. Set to 0 to disable.
    client_max_body_size 0;

    # Define the format of log messages.
    log_format  main  '$remote_addr - $remote_user [$time_local] "$request" '
                        '$status $body_bytes_sent "$http_referer" '
                        '"$http_user_agent" "$http_x_forwarded_for"';

    # Define the location of the log of access attempts to NGINX
    access_log /var/log/nginx/access.log  main;

    # Define the parameters to optimize the delivery of static content
    sendfile       on;
    tcp_nopush     on;
    tcp_nodelay    on;

    # Define the timeout value for keep-alive connections with the client
    keepalive_timeout  65;

    # Define the usage of the gzip compression algorithm to reduce the amount of data to transmit
    #gzip  on;

    # Include additional parameters for virtual host(s)/server(s)
    include /etc/nginx/conf.d/*.conf;
}

supervisord 配置:

[supervisord]
nodaemon=true

[program:uwsgi]
command=/usr/bin/uwsgi --ini /etc/uwsgi/uwsgi.ini
stdout_logfile=/dev/stdout
stdout_logfile_maxbytes=0
stderr_logfile=/dev/stderr
stderr_logfile_maxbytes=0

[program:nginx]
command=/usr/sbin/nginx
stdout_logfile=/dev/stdout
stdout_logfile_maxbytes=0
stderr_logfile=/dev/stderr
stderr_logfile_maxbytes=0

第二次编辑

从 Python 3.5 迁移到 3.7.2 后，错误的性质略有变化:

unable to load configuration from from multiprocessing.semaphore_tracker import main;main(15)
/usr/local/lib/python3.7/multiprocessing/semaphore_tracker.py:55: UserWarning:

semaphore_tracker: process died unexpectedly, relaunching.  Some semaphores might leak.

unable to load configuration from from multiprocessing.semaphore_tracker import main;main(15)

非常感谢帮助，这目前对我来说是一个很大的障碍:-/

第三次编辑:

HERE on my github account是一个最小的、完整的、可验证的例子。

您可以通过以下方式轻松运行它make build 然后是 make run。

它将显示以下日志消息:

unable to load configuration from from multiprocessing.semaphore_tracker import main;main(14)

一旦您访问 http://127.0.0.1:8080/ 并出现以下错误，就会崩溃:

exception calling callback for <Future at 0x7fbc520c7eb8 state=finished raised TerminatedWorkerError>
Traceback (most recent call last):
  File "/usr/local/lib/python3.7/site-packages/joblib/externals/loky/_base.py", line 625, in _invoke_callbacks
    callback(self)
  File "/usr/local/lib/python3.7/site-packages/joblib/parallel.py", line 309, in __call__
    self.parallel.dispatch_next()
  File "/usr/local/lib/python3.7/site-packages/joblib/parallel.py", line 731, in dispatch_next
    if not self.dispatch_one_batch(self._original_iterator):
  File "/usr/local/lib/python3.7/site-packages/joblib/parallel.py", line 759, in dispatch_one_batch
    self._dispatch(tasks)
  File "/usr/local/lib/python3.7/site-packages/joblib/parallel.py", line 716, in _dispatch
    job = self._backend.apply_async(batch, callback=cb)
  File "/usr/local/lib/python3.7/site-packages/joblib/_parallel_backends.py", line 510, in apply_async
    future = self._workers.submit(SafeFunction(func))
  File "/usr/local/lib/python3.7/site-packages/joblib/externals/loky/reusable_executor.py", line 151, in submit
    fn, *args, **kwargs)
  File "/usr/local/lib/python3.7/site-packages/joblib/externals/loky/process_executor.py", line 1022, in submit
    raise self._flags.broken
joblib.externals.loky.process_executor.TerminatedWorkerError: A worker process managed by the executor was unexpectedly terminated. This could be caused by a segmentation fault while calling the function or by an excessive memory usage causing the Operating System to kill the worker. The exit codes of the workers are {EXIT(1), EXIT(1), EXIT(1), EXIT(1)}

最佳答案

这真是一个兔子洞。

Github 上的 joblib 问题页面有类似的 joblib failing with Uwsgi 帖子。但大多数是针对较旧的 multiprocessing 后端。新的 loky 后端应该可以解决这些问题。

有PR对于为 uwsgi 解决此问题的 multiprocessing 后端:

joblib.Parallel(n_jobs=4,backend="multiprocessing")(joblib.delayed(sqrt)(i ** 2) for i in range(10))

但它有时会随机失败并返回到上述 PR 试图解决的同一个问题。

进一步挖掘表明，当前后端 loky 默认情况下并行处理进程 (docs)。但是这些进程没有共享内存访问，因此需要序列化和排队的 channel 。这可能是 uWSGI 失败而 gunicorn 工作的原因。

所以我尝试切换到线程而不是进程:

joblib.Parallel(n_jobs=4,prefer="threads")(joblib.delayed(sqrt)(i ** 2) for i in range(10))

而且有效:)

关于python - uWSGI 和 joblib 信号量 : Joblib will operate in serial mode，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/54751297/

文章推荐： node.js - MongoDB - SSL 连接问题

文章推荐： ssl - 无需 SSL 的 IIS 多对一证书

文章推荐： python - 如何从文本文件中提取特定部分？

C 信号。信号()与信号集()？
所以我目前正在研究 C 中的 POSIX 线程和信号编程。我的讲师使用 sigset(int sigNumber, void* signalHandlerFUnction) 因为他的笔记不是世界上最好
c++ - while 和 for 循环中的 vector push_back 返回 SIGABRT 信号(信号 6)(C++)
我正在制作一个 C++ 游戏，它要求我将 36 个数字初始化为一个 vector 。你不能用初始化列表初始化一个 vector ，所以我创建了一个 while 循环来更快地初始化它。我想让它把每个数字
python-2.7 - 尝试通过 Popen() 使用 Python 发送 EOF 信号(Ctrl+D)信号
我正在尝试让 Python 发送 EOF信号 (Ctrl+D) 通过 Popen() .不幸的是，我找不到任何关于 Popen() 的引用资料。 *nix 类系统上的信号。这里有谁知道如何发送 EOF
python-2.7 - 尝试通过 Popen() 使用 Python 发送 EOF 信号(Ctrl+D)信号
我正在尝试让 Python 发送 EOF信号 (Ctrl+D) 通过 Popen() .不幸的是，我找不到任何关于 Popen() 的引用资料。 *nix 类系统上的信号。这里有谁知道如何发送 EOF
用于处理简单用户通知系统的 Django 信号
我正在学习编码并拥有一个实时的 Django 项目来保持我的动力。在我的 Django 应用程序中，用户留下评论，而其他人则回复所述评论。每次用户刷新他们的主页时，我都会计算他们是否收到了关于他们之
登录中的 Django 信号
登录功能中的django信号有什么用？用户已添加到请求 session 表中。那么 Django auth.login 函数中对信号的最后一行调用是什么？ @sensitive_post_param
用户创建时的 Django 信号
我已经将用户的创建与函数 create_user_profile 连接起来，当我创建我的用户时出现问题，我似乎连接的函数被调用了两次，而 UserProfile 试图被创建两次，女巫触发了一个错误列
插槽断开后的 Qt 信号
我有一个来自生产者对象处理的硬件的实时数据流。这会连接到一个消费者，该消费者在自己的线程中处理它以保持 gui 响应。 mainwindow::startProcessing(){ QObje
iphone - 如何正确处理异常情况(信号？)
在我的 iPhone 应用程序中，我想提供某种应用程序终止处理程序，该处理程序将在应用程序终止之前执行一些最终工作(删除一些敏感数据)。我想尽可能多地处理终止情况: 1) 用户终止应用 2) 设备电
Angular 信号 - 有什么优势？
我试图了解使用 Angular Signals 的优势。许多解释中都给出了计数示例，但我试图理解的是，与我下面通过变量 myCount 和 myCountDouble 所做的方式相比，以这种方式使用信
Django 信号 dispatch_uid
我对 dispatch_uid 的用法有疑问为信号。目前，我通过简单地添加 if not instance.order_reference 来防止信号的多次使用。 .我现在想知道是否dispatch
Django 信号。如何创建唯一的调度ID？
有时 django 中的信号会被触发两次。在文档中，它说创建(唯一)dispatch_uid 的一个好方法是模块的路径或名称[1] 或任何可哈希对象的 ID[2]。今天我尝试了这个: import
捕获 CTRL-\信号
我有一个用户定义的 shell 项目，我试图在其中实现 cat 命令，但允许用户单击 CTRL-/ 以显示下一个 x 行。我对信号很陌生，所以我认为我在某个地方有一些语法错误...... 主要...
使用定时器处理 C 信号
http://codepad.org/rHIKj7Cd (不是全部代码) 我想要完成的任务是， parent 在共享内存中写入一些内容，然后 child 做出相应的 react ，并每五秒写回一些内容
c++ - 信号/槽连接总数？
有没有一种方法可以找到 Qt 应用程序中信号/槽连接的总数有人向我推荐 Gamma 射线，但有没有更简单的解决方案？最佳答案检查 Qt::UniqueConnection . This is a
C++:信号/槽库中的线程安全
我正在实现一个信号/插槽框架，并且到了我希望它是线程安全的地步。我已经从 Boost 邮件列表中获得了很多支持，但由于这与 boost 无关，我将在这里提出我的未决问题。什么时候信号/槽实现(或任何
c++ - 信号 - 循环内的槽连接
在我的代码中，我在循环内创建相同类型的新对象并将信号连接到对象槽。这是我的试用版。 A * a; QList aList; int aCounter = 0; while(aCounter aLis
c++ - 如何在windows平台上使用c++信号
我知道 UNIX 上的 C 有 signal() 可以在某些操作后调用某些函数。我在 Windows 上需要它。我发现了，它存在什么 from here .但是我不明白如何正确使用它。我在 UNIX
c++ - 信号、槽和其他类
目前我正在将控制台 C++ 项目移植到 Qt。关于移植，我有一些问题。现在我的项目调整如下我有一个派生自 QWidget 的 Form 类，它使用派生自 QObject 的其他类。现在请告诉我我是否
c++ - 信号/槽基类多继承
在我的 Qt 多线程程序中，我想实现一个基于 QObject 的基类，以便从它派生的每个类都可以使用它的信号和槽(例如抛出错误)。我实现了 MyQObject : public QObject{..

太空宇宙

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - uWSGI 和 joblib 信号量 : Joblib will operate in serial mode

编辑

第二次编辑

第三次编辑: