python - Can't understand how to use findNextSiblings


Reposted by: 行者123. Last updated: 2023-12-01 05:24:42

I have some HTML that I'm parsing with BeautifulSoup...

<div class="book-page-sidebar">
    <h4>Production details</h4>
    <dl class="product-details clearfix">
      <dt>Running Time:</dt>
      <dd>1:20:25</dd>

      <dt>Zip file size:</dt>
      <dd>38.6MB</dd>

      <dt>Catalog date:</dt>
      <dd>2010-05-04</dd>

      <dt>Read by:</dt>
      <dd><a href="https://librivox.org/reader/204">Mark F. Smith</a></dd> 

      <dt>Book Coordinator:</dt>
      <dd><a href="https://librivox.org/reader/204">Mark F. Smith</a></dd>

      <dt>Meta Coordinator:</dt>
      <dd><a href="https://librivox.org/reader/4009">Diana Majlinger</a></dd>

      <dt>Proof Listener:</dt>
      <dd><a href="https://librivox.org/reader/496">Ans Wink</a></dd>                 
    </dl>

</div>

I want to grab the text 1:20:25 from the dd tag that follows the dt tag containing the text Running Time:. So I tried this code...

paraText = soup.find( text = re.compile ('Running Time:') )
print paraText.findNextSiblings('dd')

...but I get no output. What am I doing wrong?

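Best answer

find_next (or its older alias findNext) should do the trick. soup.find(text=...) returns the text node inside the dt tag, and that text node has no siblings of its own. The dd tags are siblings of the dt tag, not of its text, so findNextSiblings('dd') called on the text node finds nothing. find_next is not limited to siblings: it searches everything that comes later in the document.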

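# find() with a text pattern returns the text node inside <dt>, not the <dt> tag
paraText = soup.find(text=re.compile('Running Time:'))
# findNext walks forward through the whole document, not just the siblings
print(paraText.findNext('dd'))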
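
For anyone who wants to try this end to end, here is a minimal sketch, assuming the bs4 package is installed and using a trimmed copy of the HTML from the question. The variable names (label, siblings_of_text, running_time, via_parent) are illustrative only, and string= is the newer spelling of the text= argument (Beautiful Soup 4.4.0 and later). It shows the empty result from the sibling search next to two ways of reaching the dd tag.

```python
import re

from bs4 import BeautifulSoup

# Trimmed copy of the markup from the question.
HTML = """
<dl class="product-details clearfix">
  <dt>Running Time:</dt>
  <dd>1:20:25</dd>
  <dt>Zip file size:</dt>
  <dd>38.6MB</dd>
</dl>
"""

soup = BeautifulSoup(HTML, "html.parser")

# Matching on string content returns the text node inside <dt>, not the <dt> tag.
label = soup.find(string=re.compile("Running Time:"))

# The text node is the only child of <dt>, so it has no following siblings.
siblings_of_text = label.find_next_siblings("dd")

# find_next searches everything later in the document, so it reaches the <dd>.
running_time = label.find_next("dd").get_text(strip=True)

# Alternative: step up to the <dt> tag first, then ask for its next <dd> sibling.
via_parent = label.parent.find_next_sibling("dd").get_text(strip=True)

print(siblings_of_text)
print(running_time)
print(via_parent)
```

Going through label.parent is a little stricter than find_next, since it only accepts a dd that sits at the same level as the matched dt.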

Regarding "python - Can't understand how to use findNextSiblings", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/21655474/
