python - Why am I getting this (apparently) unusual AttributeError: 'bytes' object has no attribute '_all_strings'? Is there a way to fix it?

Reposted by: 太空宇宙 | Updated: 2023-11-03 14:24:58

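I have been looking for a fix for this AttributeError, but I have not found any solution that mentions '_all_strings'.

I want to write a web crawler, but the page has a lot of irrelevant content at the top and bottom, so I am trying to clean up the HTML as a first step toward excluding that unwanted noise.

When I run the code below, the last line raises an AttributeError: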

from __future__ import division
from urllib.request import urlopen
from bs4 import BeautifulSoup

textSource = 'http://celt.ucc.ie/irlpage.html'
html = urlopen(textSource).read()
raw = BeautifulSoup.get_text(html)

This is the full traceback I get:

Traceback (most recent call last):
  File "...Crawler_Celt_Namelink_Test.py", line 7, in <module>
    raw = BeautifulSoup.get_text(html)
  File "...Python\Python35\lib\site-packages\bs4\element.py", line 950, in get_text
    return separator.join([s for s in self._all_strings(
AttributeError: 'bytes' object has no attribute '_all_strings'

Has anyone run into this error before? Or can anyone suggest how to get past it?

Best answer

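get_text is an instance method, so calling BeautifulSoup.get_text(html) passes the raw bytes returned by read() in place of a parsed document, and the lookup of _all_strings fails on the 'bytes' object. As the BeautifulSoup docs show, you first build a BeautifulSoup object from the HTML and then extract the text from that object: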

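from urllib.request import urlopen
from bs4 import BeautifulSoup

textSource = 'http://celt.ucc.ie/irlpage.html'
html = urlopen(textSource).read()  # raw bytes, not yet parsed

# Parse the bytes into a BeautifulSoup object first
soup = BeautifulSoup(html, 'html.parser')

# get_text() is an instance method, so call it on the parsed object
raw = soup.get_text()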
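
To see why the original call fails with that particular message, it may help to reproduce the mechanism without bs4 or a network connection. The sketch below uses a hypothetical stand-in class, TinyDoc, which is not part of BeautifulSoup. It only borrows the method names get_text and _all_strings from the traceback, and assumes the real method works in a broadly similar way. Calling a method through the class puts whatever you pass first into the self slot, so bytes end up where a parsed document was expected.

```python
class TinyDoc:
    """Hypothetical stand-in for a parsed document; not part of bs4."""

    def __init__(self, strings):
        self._strings = list(strings)

    def _all_strings(self):
        # Yield the text fragments held by this instance.
        yield from self._strings

    def get_text(self, separator=""):
        # Works only if `self` really is a TinyDoc instance.
        return separator.join(self._all_strings())


def call_on_class_with_bytes():
    """Mimic the failing call: the bytes object lands in the `self` slot."""
    try:
        TinyDoc.get_text(b"<p>hello</p>")
    except AttributeError as exc:
        return str(exc)
    return None


# Wrong: method looked up on the class, bytes passed as `self`.
error_message = call_on_class_with_bytes()

# Right: build an instance first, then call the method on it.
text = TinyDoc(["hello", "world"]).get_text(" ")

print(error_message)
print(text)
```

The same pattern applies to the answer above: construct the object from the HTML first, then call get_text() on that object rather than on the class.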

Regarding "python - Why am I getting this (apparently) unusual AttributeError: 'bytes' object has no attribute '_all_strings'? Is there a way to fix it?", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/47703639/
