selenium - 如何解决 urllib3.exceptions.MaxRetryError : HTTPConnectionPool(host ='127.0.0.1' , port=58408): Max retries exceeded with url-6ren

selenium - 如何解决 urllib3.exceptions.MaxRetryError : HTTPConnectionPool(host ='127.0.0.1' , port=58408): Max retries exceeded with url

转载作者：行者123 更新时间：2023-12-03 23:42:49

30

4

我正在尝试用 selenium 抓取网站的几页并使用结果，但是当我运行该函数两次时

[WinError 10061] No connection could be made because the target machine actively refused it'

第二个函数调用出现错误。
这是我的方法:

import os
import re
from selenium import webdriver
from selenium.webdriver.common.keys import Keys
from selenium.webdriver.chrome.options import Options
from bs4 import BeautifulSoup as soup

opts = webdriver.ChromeOptions()
opts.binary_location = os.environ.get('GOOGLE_CHROME_BIN', None)
opts.add_argument("--headless")
opts.add_argument("--disable-dev-shm-usage")
opts.add_argument("--no-sandbox")
browser = webdriver.Chrome(executable_path="CHROME_DRIVER PATH", options=opts)

lst =[]
def search(st):
    for i in range(1,3):
        url = "https://gogoanime.so/anime-list.html?page=" + str(i)
        browser.get(url)
        req = browser.page_source
        sou = soup(req, "html.parser")
        title = sou.find('ul', class_ = "listing")
        title = title.find_all("li")
        for j in range(len(title)):
            lst.append(title[j].getText().lower()[1:])
    browser.quit()
    print(len(lst))
    
search("a")
search("a")

输出

272
raise MaxRetryError(_pool, url, error or ResponseError(cause))
urllib3.exceptions.MaxRetryError: HTTPConnectionPool(host='127.0.0.1', port=58408): Max retries exceeded with url: /session/4b3cb270d1b5b867257dcb1cee49b368/url (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x000001D5B378FA60>: Failed to establish a new connection: [WinError 10061] No connection could be made because the target machine actively refused it'))

最佳答案

这个错误信息...

raise MaxRetryError(_pool, url, error or ResponseError(cause))
urllib3.exceptions.MaxRetryError: HTTPConnectionPool(host='127.0.0.1', port=58408): Max retries exceeded with url: /session/4b3cb270d1b5b867257dcb1cee49b368/url (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x000001D5B378FA60>: Failed to establish a new connection: [WinError 10061] No connection could be made because the target machine actively refused it'))

...暗示未能建立新的连接引发 MaxRetryError 因为无法建立连接。

几件事:

首先，根据讨论max-retries-exceeded exceptions are confusing追溯有些误导。为方便用户，请求包装了异常。原始异常是显示消息的一部分。

请求从不重试(它为 urllib3 的 retries=0 设置了 HTTPConnectionPool)，所以如果没有 ，错误会更加规范。 MaxRetryError 和 HTTPConnectionPool 关键词。所以理想的回溯应该是:

  ConnectionError(<class 'socket.error'>: [Errno 1111] Connection refused)

根本原因和解决方案
一旦您启动了 webdriver 和 web 客户端 session ，接下来是 def search(st)您正在调用 get() o 访问一个 url 并在随后的行中调用 browser.quit()用于调用 /shutdown端点，随后 webdriver 和 web 客户端实例被完全销毁，关闭所有页面/选项卡/窗口。因此不再存在连接。

You can find a couple of relevant detailed discussion in:

PhantomJS web driver stays in memory

Selenium : How to stop geckodriver process impacting PC memory, without callingdriver.quit()?

在下一次迭代中的这种情况下(由于 for 循环)当 browser.get()被调用时没有事件连接。因此你会看到错误。
所以一个简单的解决方案是删除行 browser.quit()并调用 browser.get(url)在相同的浏览上下文中。

结论
一旦升级到 Selenium 3.14.1 您将能够设置超时并查看规范的回溯，并能够采取必要的行动。

引用
您可以在以下位置找到相关的详细讨论:

MaxRetryError: HTTPConnectionPool: Max retries exceeded (Caused by ProtocolError('Connection aborted.', error(111, 'Connection refused')))

tl;博士
几个相关的讨论:

Adding max_retries as an argument

Removed the bundled charade and urllib3.

Third party libraries committed verbatim

关于selenium - 如何解决 urllib3.exceptions.MaxRetryError : HTTPConnectionPool(host ='127.0.0.1' , port=58408): Max retries exceeded with url，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/64745726/

30

4

0

文章推荐： c++ - 如何仅将可变参数模板与模板模板参数匹配？

文章推荐：使用 --app-image 选项创建安装程序时 jpackage 崩溃

文章推荐： emacs 键绑定(bind)在终端中不起作用

port - 你为什么说 "TCP port"？
我正在学习网络和套接字，但有些东西我不明白。我经常听说“TCP端口”但我认为端口与应用层有关(例如 HTTP 服务器为 80)。那你为什么不说“应用程序端口”呢？为什么端口似乎与 TCP 层相关联(它
Nginx，如何允许DOMAIN :PORT and IP:PORT requests
配置 Nginx 以允许像这样的 DOMAIN:PORT 请求的正确方法是什么: http://example.com:8080/?a=xxx&b=yyy&c=zzz over TCP or UDP
python - 访问域:port instead of IP:port
已关闭。此问题不符合Stack Overflow guidelines 。目前不接受答案。这个问题似乎不是关于 a specific programming problem, a software
Nginx:如何在以下配置中将每个 http:port 请求重定向到 HTTPS:port？
这是我的 nginx.conf，适用于 https。如果有人输入 HTTP://dev.local.org:3002，我该如何重定向到 HTTPS://dev.local.org:3002？这个
php - 解析 IP :Port from string with characters after the port #
我在这方面需要一点帮助，而我对这方面的 RegEx 知识有点欠缺。我有一个代理列表，我正在尝试解析该列表并将 IP 和端口号与字符串分开。正在读取的字符串看起来像这样。(示例 1) 121.121
java - Firefox port.emit 和 port.on 在扩展中不起作用
我正在尝试制作一个 Firefox 扩展。我需要与后台脚本 (main.js) 交换数据，所以我尝试使用端口，但它不起作用。 //Content.js self.port.on("alert",fun
bash - [[ -z "$PORT"]] && export PORT=8080 bash 命令有什么作用？
我正在学习教程，他们使用命令[[ -z "$PORT" ]] && export PORT=8080我不完全明白它在做什么。我对 bash 命令的了解非常基础，所以我什至不知道用什么谷歌来解决这个问题
port - PIC 18F 上 PORT 和 LATCH 的区别
我已经阅读了数据表和谷歌，但我仍然不明白。就我而言，我将 PIC18F26K20 的 PIN RC6 设置为 INPUT 模式: TRISCbits.TRISC6 = 1; 然后我用 PORT 和
Azure VM端点: mapping public port to a different local port
我想知道是否可以将公共(public) IP 端口(例如端口 80)映射到 Azure iaas VM 上的不同本地/私有(private) IP 端口(例如端口 81)。我相信这在旧门户中是可行的，
c - libuv : src port of response not same as port on which process is listening
我有一个用 python-twisted 编写的客户端，它将 UDP 数据包发送到 IP aaa.bbb.ccc.ddd 的端口 1234，然后等待响应。我还有用 C-libuv 编写的 UDP 服务
node.js - 为什么我可以访问我的网站(IP地址):port but not (domain name):port?
我有一个使用弹性 IP 12.34.56.78 运行的 Amazon EC2 实例。我拥有一个域名 example.com，我已将其设置为指向 EC2 实例。我在 EC2 实例的端口 80 上运行 A
linux - AWS : SSH port timeout after changing port number
我正在尝试在 AWS Lightsail 上配置网站。我做的第一件事是在中将端口号从 22 更改为 2200 /etc/ssh/sshd_config ，然后我像这样配置了简单的防火墙 sudo u
Docker 使用 "-p :"时忽略 iptable 规则
几天前才意识到 Docker 似乎绕过了我的 iptable 规则。我对 Docker 和 iptables 的经验并不令人难以置信。最近几天尝试了很多不同的东西。还看到最近的 docker 版本有很
ubuntu - Zerotier cli命令在ubuntu中给出错误 "missing port and zerotier-one.port not found"
我从他们的website 下载了零层使用以下命令: curl -s https://install.zerotier.com | sudo bash 每当我尝试使用 zerotier cli 时，都会
php - 如何用 "Port O' Brian 替换 "Port O' Brian”？
我是字符串操作的新手，只是试图替换列表中的值。我试图修复的两个输入是 MCAFEE和 PORT O'BRIAN . 所以我跑 ucwords(strtolower($rawTitle)) .但现在我
openafs - 警告 : remote port forwarding failed for listen port 52698
我正在使用 SSH 访问我大学的 afs 系统。我喜欢使用 rmate(远程 TextMate)，它需要 SSH 隧道，因此我在 .bashrc 中包含了这个别名。 alias sshr=ssh -R
port - Heroku 打开 "Puma Port 5000 Already In Use"导轨
当我使用 Control-C 退出“Heroku Open”(Heroku 工具栏服务器命令)时。我无法重新启动。我收到此错误: /vendor/bundle/gems/puma-2.14.0/lib
javascript - 无法让 port.emit 和 port.on 在 Firefox 附加组件中工作
我正在发送这样的消息: self.port.emit("nodes_grubed", textNodesValues); 并想对此使用react: worker.port.on("nodes_grub
javascript - 在 main.js 和附加到选项卡的脚本之间正确使用 port.emit 和 port.on
我正在尝试在此扩展中创建一个函数，该函数将打开具有给定网址的选项卡，并在该选项卡上使用给定文件名运行脚本。该功能大部分工作正常，只是我无法在主脚本和我在新选项卡上运行的脚本之间进行通信(我为此使用了
c# - .NET MVC 4调用localhost :port and localhost:port/home/index之间的区别
我在我的 .NET MVC 4 元素中使用 Bootstrap ，我使用 NuGet 导入 Bootstrap 我的元素，我有一个布局页面，我在这个页面中包含 Bootstrap 标签，我的索引页面正

首页

博学

6Ren·AI

商城

selenium - 如何解决 urllib3.exceptions.MaxRetryError : HTTPConnectionPool(host ='127.0.0.1' , port=58408): Max retries exceeded with url