Python:Socket.timeout 不由 except 处理-6ren

Python:Socket.timeout 不由 except 处理

转载作者：行者123 更新时间：2023-12-03 11:55:09

26

4

有时我可以有效地处理socket.timeout，尽管有时我会收到套接字超时错误并且我的脚本突然停止......我的异常处理中是否缺少一些东西？怎么会顺利通过呢？

在以下任一代码中随机发生:

第一个片段:

for _ in range(max_retries):
    try:
        req = Request(url, headers={'User-Agent' :'Mozilla/5.0'})
        response = urlopen(req,timeout=5)
        break
    except error.URLError as err: 
        print("URL that generated the error code: ", url)
        print("Error description:",err.reason)
    except error.HTTPError as err:
        print("URL that generated the error code: ", url)
        print("Error code:", err.code)
        print("Error description:", err.reason)
    except socket.timeout:
        print("URL that generated the error code: ", url)
        print("Error description: No response.")
    except socket.error:
        print("URL that generated the error code: ", url)
        print("Error description: Socket error.")

if response.getheader('Content-Type').startswith('text/html'):
    htmlBytes = response.read()
    htmlString = htmlBytes.decode("utf-8")
    self.feed(htmlString)

第二个片段

for _ in range(max_retries):
    try:
        req = Request(i, headers={'User-Agent' :'Mozilla/5.0'})
        with urlopen(req,timeout=5) as response, open(aux, 'wb') as out_file:
            shutil.copyfileobj(response, out_file)  
        with open(path, fname), 'a') as f:
            f.write(("link" + str(intaux) + "-" + auxstr + str(index) + i[-4:] + " --- " + metadata[index%batch] + '\n'))
        break
    except error.URLError as err:
        print("URL that generated the error code: ", i)
        print("Error description:",err.reason)
    except error.HTTPError as err:
        print("URL that generated the error code: ", i)
        print("Error code:", err.code)
        print("Error description:", err.reason)
    except socket.timeout:
        print("URL that generated the error code: ", i)
        print("Error description: No response.")
    except socket.error:
        print("URL that generated the error code: ", i)
        print("Error description: Socket error.")

错误:

Traceback (most recent call last):
  File "/mydir/crawler.py", line 202, in <module>
    spider("urls.txt", maxPages=0, debug=1, dailyRequests=9600) 
  File "/mydir/crawler.py", line 142, in spider
    parser.getLinks(url + "?start=" + str(currbot) + "&tab=" + auxstr,auxstr)
  File "/mydir/crawler.py", line 81, in getLinks
    htmlBytes = response.read()
  File "/usr/lib/python3.5/http/client.py", line 455, in read
    return self._readall_chunked()
  File "/usr/lib/python3.5/http/client.py", line 561, in _readall_chunked
    value.append(self._safe_read(chunk_left))
  File "/usr/lib/python3.5/http/client.py", line 607, in _safe_read
    chunk = self.fp.read(min(amt, MAXAMOUNT))
  File "/usr/lib/python3.5/socket.py", line 575, in readinto
    return self._sock.recv_into(b)
  File "/usr/lib/python3.5/ssl.py", line 929, in recv_into
    return self.read(nbytes, buffer)
  File "/usr/lib/python3.5/ssl.py", line 791, in read
    return self._sslobj.read(len, buffer)
  File "/usr/lib/python3.5/ssl.py", line 575, in read
    v = self._sslobj.read(len, buffer)
socket.timeout: The read operation timed out

编辑:

我注意到由于@tdelaney，我错过了几行代码我将它们添加到上面的代码中，如果您发布解决方案或者如果您有更好的方法来解决它，我将发布我编写的解决方案我会将答案标记为正确的

解决方案:

for _ in range(max_retries):
    try:
        req = Request(url, headers={'User-Agent' :'Mozilla/5.0'})
        response = urlopen(req,timeout=5)
        break
    except error.URLError as err: 
        print("URL that generated the error code: ", url)
        print("Error description:",err.reason)
    except error.HTTPError as err:
        print("URL that generated the error code: ", url)
        print("Error code:", err.code)
        print("Error description:", err.reason)
    except socket.timeout:
        print("URL that generated the error code: ", url)
        print("Error description: No response.")
    except socket.error:
        print("URL that generated the error code: ", url)
        print("Error description: Socket error.")

if response.getheader('Content-Type').startswith('text/html'):
    for _ in range(max_retries):
        try:
            htmlBytes = response.read()
            htmlString = htmlBytes.decode("utf-8")
            self.feed(htmlString)
            break
        except error.URLError as err: 
            print("URL that generated the error code: ", url)
            print("Error description:",err.reason)
        except error.HTTPError as err:
            print("URL that generated the error code: ", url)
            print("Error code:", err.code)
            print("Error description:", err.reason)
        except socket.timeout:
            print("URL that generated the error code: ", url)
            print("Error description: No response.")
        except socket.error:
            print("URL that generated the error code: ", url)
            print("Error description: Socket error.")

最佳答案

python "Requests"库使用它自己的一组异常来处理与 HTTP 协议(protocol)和套接字有关的错误。它会自动将嵌入的 socket() 函数返回的异常映射到 requests.exceptions 中定义的自定义异常。

因此，由此引发的异常......

import Requests

try:
    req = Request("http://stackoverflow.com", headers={'User-Agent' :'Mozilla/5.0'})
    urlopen(req,timeout=5)
except Timeout:
    print "Session Timed Out!"

等同于由此引发的异常...

import socket

s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
try:
    s.connect(("127.0.0.1", 80))
except socket.timeout:
    print "Session Timed Out"

你的固定密码...

for _ in range(max_retries):
try:
    req = Request(url, headers={'User-Agent' :'Mozilla/5.0'})
    response = urlopen(req,timeout=5)
    break
except error.URLError as err: 
    print("URL that generated the error code: ", url)
    print("Error description:",err.reason)
except error.HTTPError as err:
    print("URL that generated the error code: ", url)
    print("Error code:", err.code)
    print("Error description:", err.reason)
except Timeout:
    print("URL that generated the error code: ", url)
    print("Error description: Session timed out.")
except ConnectionError:
    print("URL that generated the error code: ", url)
    print("Error description: Socket error timed out.")

关于Python:Socket.timeout 不由 except 处理，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/43942710/

26

4

0

文章推荐： java - Java:在运行其他进程时从套接字监听

文章推荐： php - 带有代理的PHP CURL在套接字上导致CLOSE_WAIT

文章推荐： sockets - Hololens UDP服务器从不接收消息

文章推荐： python - ResourceWarning : python-memcached not closing socket?

node.js - socket.io - socket.emit, socket.on, socket.send
基于 socket.io 的官方网站 http://socket.io/#how-to-use , 我找不到任何术语。socket.emit 、 socket.on 和 socket.send 之间有
sockets - lua-socket : unix domain sockets?
我正在使用 lua-socket 3.0rc1.3(Ubuntu Trusty 附带)和 lua 5.1。我正在尝试监听 unix 域套接字，我能找到的唯一示例代码是 this -- send std
sockets - socket.emit() 与 socket.send()
这两者有什么区别？我注意到如果我在一个工作程序中从 socket.emit 更改为 socket.send ，服务器无法接收到消息，虽然我不明白为什么。我还注意到，在我的程序中，如果我从 sock
sockets - socket 可靠吗？
使用套接字在两台服务器之间发送数据是个好主意，还是应该使用 MQ 之类的东西来移动数据。我的问题:套接字是否可靠，如果我只需要一次/有保证的数据传输？还有其他解决方案吗？谢谢。最佳答案套接字
sockets - 被动和主动 socket
引自 this socket tutorial : Sockets come in two primary flavors. An active socket is connected to a
sockets - 如何绕过 socket ？
我已经安装了在端口81上运行的流服务器“Lighttpd”(light-tpd)。我有一个C程序，它使用套接字api创建的服务器套接字在端口80上监听http请求。我希望从客户端收到端口80上的请
sockets - socket 未正确关闭的原因？
这是我正在尝试做的事情: 当有新消息可用时，服务器会将消息发送给已连接的客户端。另一方面，客户端在连接时尝试使用send()向服务器发送消息，然后使用recv()接收消息，此后，客户端调用close(
sockets - socket.io动态发送和接收消息
如何将消息发送到动态 session 室，以及当服务器收到该消息时，如何将该消息发送到其他成员所在的同一个 session 室？ table_id是房间，它将动态设置。客户: var table_i
sockets - 如何使用NodeJS将监听一个端口的WebSocket连接到监听另一个端口的Net socket？
这是我尝试但不起作用的方法。我可以将传入的消息从WebSocket连接转发到NetSocket，但是只有NetSocket收到的第一个消息才到达WebSocket后面的客户端。 const WebSo
sockets - 如何使用升压冲洗 socket
我正在实现使用boost将xml发送到客户端的服务器。我面临的问题是缓冲区不会立即发送并累积到一个点，然后发送整个内容。这在我的客户端造成了一个问题，当它解析xml时，它可能具有不完整的xml标记(不
sockets - Nginx权限被拒绝连接到.socket
尝试使用Nginx代理Gunicorn套接字。 /etc/systemd/system/gunicorn.service文件 [Unit] Description=gunicorn daemon Af
sockets - 多个连接Lua socket
我正在使用Lua套接字和TCP制作像聊天客户端和服务器这样的IRC。我要弄清楚的主要事情是如何使客户端和服务器监听消息并同时发送它们。由于在服务器上执行socket:accept()时，它将暂停程序，
sockets - 具有自定义负载平衡功能的ZMQ socket
我看了一下ZMQ PUSH/PULL套接字，尽管我非常喜欢简单性(特别是与我现在正在通过UDP套接字在系统中实现的自定义碎片/ack相比)，但我还是希望有自定义负载平衡功能，而不是幼稚的回合-robi
javascript - Socket.io socket.emit/socket.on 不工作
我正在编写一个应用程序，其中有多个 socket.io 自定义事件，并且所有工作正常，除了这个: socket.on("incomingImg", function(data) {
sockets - socket recv() 是否强制刷新 socket send() 缓冲区？
在我的应用程序中，我向服务器发送了两条小消息(类似 memcached 的服务)。在类似 Python 的伪代码中，这看起来像: sock.send("add some-key 0") ignored
javascript - socket.io 重新连接 socket.socket.connect 不起作用
很抱歉再次发布此问题，但大多数相关帖子都没有回答我的问题。我在使用 socket.io 的多个连接时遇到问题我没有使用“socket.socket.connect”方法，但我从第一次连接中得到了反馈。
sockets - 带有非 socket.io 服务器的 Socket.io 客户端
我尝试使用 socket.io 客户端连接到非 socket.io websocket 服务器。但我做不到。我正在尝试像这样连接到套接字服务器: var socket = io.connect('ws
javascript - 已定义 Socket.io，但未定义 socket.io.sockets
我遇到了一个奇怪的问题。在我非常基本的服务器中，我有: server.listen(8001); io.listen(server); var sockets = io.sockets; 不幸的是，套
socket.io - 在sails js socket.io中使用io.socket.get
我正在使用带套接字 io 的sailsjs。帆的版本是 0.10.5。我有以下套接字客户端进行测试: var socketIOClient = require('socket.io-client');
sockets - TCP sockets 和 web sockets 的区别，再来一次
这个问题在这里已经有了答案: What is the fundamental difference between WebSockets and pure TCP? (4 个答案) 关闭 4 年前。

首页

博学

6Ren·AI

商城

Python:Socket.timeout 不由 except 处理