python - 如何用漂亮的汤跳过<span>-6ren

python - 如何用漂亮的汤跳过

转载作者：太空宇宙更新时间：2023-11-04 07:56:12

这是我的代码的输出

<h1 class="it-ttl" id="itemTitle" itemprop="name"><span class="g-hdn">Details about   </span>item name goes here</h1>

我只想获取项目名称，没有“详细信息”部分。

我的 Python 代码选择了特定的 div id 是

for content in soup.select('#itemTitle'):
    print(content.text)

最佳答案

您可以使用 decompose() clear()或 extract() .根据文档:

Tag.decompose() removes a tag from the tree, then completely destroys it and its contents

Tag.clear() removes the contents of a tag

PageElement.extract() removes a tag or string from the tree. It returns the tag or string that was extracted

from bs4 import BeautifulSoup
html = '''<h1 class="it-ttl" id="itemTitle" itemprop="name"><span class="g-hdn">Details about   </span>item name goes here</h1>'''

soup = BeautifulSoup(html, 'lxml')
for content in soup.select('#itemTitle'):
    content.span.decompose()
    print(content.text)

输出:

  item name goes here

关于python - 如何用漂亮的汤跳过<span>，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/48414407/

文章推荐： css - 在禁用的选定元素上更改 CSS

文章推荐： java - Esper窗口使用: Recalculation based on event leaving window

文章推荐： java - 在客户端和服务器之间以字节数组形式发送图像

文章推荐： python - 具有特定间隙的 numpy 排列

Ruby Greed Koan - 如何改进我的 if/then 汤？
我正在努力学习 Ruby Koans 以尝试学习 Ruby，到目前为止一切顺利。我已经得到了贪婪的公案，在撰写本文时它是 183。我有一个可行的解决方案，但我觉得我只是拼凑了一堆 if/then 逻辑
c++ - 使用 boost 图形库的模板化 typedef 汤
我正在尝试创建一个扩展 boost 图形库行为的类。我希望我的类是一个模板，用户提供一个类型(类)，用于在每个顶点存储属性。那只是背景。我正在努力创建一个更简洁的 typedef 来定义我的新类。基
python - 来自 SUDS.client 的未知字符串格式(汤？)的可能解析器
我正在使用 suds 包从网站查询 API，从他们的网站返回的数据如下所示: (1)。谁能告诉我这是什么格式？ (2)。如果是这样，解析数据的最简单方法是什么？我已经使用 BeautifulSoup
python (汤): get nested data and get last item in a tag
所以我有一个看起来像这样的 html 文档: Speaker Name: Title of Talk | Subtitle | website.com ... [Other Stuff] Poste

太空宇宙

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - 如何用漂亮的汤跳过