python - How do I use find() and find_all() in BeautifulSoup?

Reposted by: 行者123. Last updated: 2023-12-04 17:31:21

I'm currently doing some web scraping. I have this HTML:

<meta property="og:price:amount" content="1.89"/>
<meta property="og:price:standard_amount" content="6.31"/>
<meta property="og:price:currency" content="USD"/>

I'm using BeautifulSoup (Python).

The information I want to extract is 1.89 and 6.31 (the product prices).

Here is my code:

import requests
from bs4 import BeautifulSoup


page = requests.get('https://spanish.alibaba.com/product-detail/crazy-hot-selling-multifunctional-battery-powered-360-degree-rotation-led-light-makeup-mirror-60769168637.html?spm=a2700.8270666-66.2016122619262.17.5a4d5d09En8wm9')

# Create a BeautifulSoup object
soup = BeautifulSoup(page.text, 'html.parser')
#print(soup.get_text())
# get the repo list


v2 = soup.find_all("meta", {"property": "og:price:amount", "content": True}['content'] )
print("v2 is",v2)

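The error is in the .find_all() call, and I'm not sure how to extract the data. I tried .find() as well, with the same result.

This is the information I found on how the Beautiful Soup function works: Signature: find_all(name, attrs, recursive, string, limit, **kwargs)
Please help me set up the .find() call correctly. Thanks!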

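Best answer

Instead of find_all(), just use find(). find_all() returns a list of elements, which cannot be indexed with ['content']. Note also that in the question's code the ['content'] subscript sits inside the find_all() call, so it indexes the attrs dictionary (evaluating to True) rather than the tag that was found. Move it outside the call: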

v2 = soup.find("meta", {"property": "og:price:amount", "content": True})['content'] 
print("v2 is",v2)
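
To make the difference between the two calls concrete, here is a minimal sketch that parses the three meta tags from the question as an inline string instead of fetching the live page. The names `html`, `wanted`, `prices`, and the helper `meta_content` are illustrative assumptions, not part of the original answer.

```python
from bs4 import BeautifulSoup

# The three meta tags from the question, inlined so the sketch runs offline.
html = """
<meta property="og:price:amount" content="1.89"/>
<meta property="og:price:standard_amount" content="6.31"/>
<meta property="og:price:currency" content="USD"/>
"""

soup = BeautifulSoup(html, "html.parser")

# find() returns the first matching Tag (or None), so it can be subscripted directly.
amount = soup.find("meta", {"property": "og:price:amount", "content": True})["content"]

# find_all() returns a list-like ResultSet: subscript each Tag, not the list itself.
# A list as an attribute value matches any of the listed values.
wanted = ["og:price:amount", "og:price:standard_amount"]
prices = {
    tag["property"]: tag["content"]
    for tag in soup.find_all("meta", {"property": wanted, "content": True})
}


def meta_content(soup, prop):
    """Hypothetical helper: return the content of a meta tag, or None if it is absent."""
    tag = soup.find("meta", {"property": prop, "content": True})
    return tag["content"] if tag is not None else None


print("amount is", amount)
print("prices are", prices)
print("missing is", meta_content(soup, "og:price:missing"))
```

The None check in `meta_content` matters on real pages: if the tag is missing, find() returns None and subscripting it raises a TypeError.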

Alternatively, you can use a CSS selector:

v2 = soup.select_one('meta[property="og:price:amount"][content]')['content']
print("v2 is",v2)
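
Since the question asks for both 1.89 and 6.31, the selector approach can probably be extended with select(), which returns every match rather than only the first. The following is a sketch under the assumption that the page contains the same three meta tags shown in the question; `html`, `selector`, and `values` are illustrative names.

```python
from bs4 import BeautifulSoup

# The meta tags from the question, inlined so the sketch runs without a network request.
html = """
<meta property="og:price:amount" content="1.89"/>
<meta property="og:price:standard_amount" content="6.31"/>
<meta property="og:price:currency" content="USD"/>
"""

soup = BeautifulSoup(html, "html.parser")

# select_one() returns the first matching Tag (or None), like find().
amount = soup.select_one('meta[property="og:price:amount"][content]')["content"]

# select() returns a list of all matches, like find_all().
# A comma-separated selector list matches either of the two price tags.
selector = (
    'meta[property="og:price:amount"][content], '
    'meta[property="og:price:standard_amount"][content]'
)
values = [float(tag["content"]) for tag in soup.select(selector)]

print("amount is", amount)
print("values are", values)
```

Converting with float() is optional; the content attribute is always a string, so the conversion is only useful if the prices will be compared or summed.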

Regarding "python - How do I use find() and find_all() in BeautifulSoup?", a similar question was found on Stack Overflow: https://stackoverflow.com/questions/59377462/
