python - Beautiful Soup 4: Need to add inverse paragraph tags to separate a field into two paragraphs

Reposted by: 行者123  Updated: 2023-11-30 21:56:41

Currently, a header tag has its content attached to it. I need to separate the header from its content by keeping them in separate paragraph tags.

block_tag = <p>1.1 <u>Header Information</u>.  Content of the header with multiple lines</p>

type(block_tag)
<class 'bs4.element.Tag'>

The header is enclosed in an inline tag such as <u>.

Expected result:

block_tag
<p>1.1 <u>Header Information</u>.</p><p>  Content of the header with multiple lines</p>

So far, I have tried adding paragraph tags using:

new_tag("p") creates <p></p>. What I need is the inverse, </p><p>.

Method 1

para_tag = soup.new_tag("p")
block_tag.insert(2,para_tag)
block_tag
<p>1.1 <u>Header Information</u>. <p></p> Content of the header with multiple lines</p>

Method 2

block_tag.insert(2,"<\p><p>")
block_tag
<p>1.1 <u>Header Information</u>&lt;\p&gt;&lt;p&gt;.  Content of the header with multiple lines</p>

Thanks

Best answer

You can take the remaining content after the header and wrap it in a new p tag. Then extract it from the original tag and insert_after the original tag.

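from bs4 import BeautifulSoup

html = """
<p>1.1 <u>Header Information</u>.  Content of the header with multiple lines</p>
"""
soup = BeautifulSoup(html, 'html.parser')
block_tag = soup.find('p')
# The last child is the text node that follows the <u> header.
remaining = block_tag.contents[-1]
# wrap() encloses that text node in a new <p> and returns the new tag.
new_tag = remaining.wrap(soup.new_tag("p"))
# Pull the new <p> out of the original paragraph and place it right after it.
block_tag.insert_after(new_tag.extract())
print(soup)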

Output:

<p>1.1 <u>Header Information</u></p><p>.  Content of the header with multiple lines</p>

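Almost perfect, except for the period, which ends up in the second paragraph instead of staying with the header.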
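
One possible way to keep the period with the header is to split the trailing text node by hand instead of wrapping it whole. This is only a sketch, and it assumes the text after the header always begins with a single "." (that assumption is not stated in the question, so adjust the check for other punctuation):

```python
from bs4 import BeautifulSoup, NavigableString

html = "<p>1.1 <u>Header Information</u>.  Content of the header with multiple lines</p>"
soup = BeautifulSoup(html, "html.parser")
block_tag = soup.find("p")

# The last child is the text node ".  Content of the header with multiple lines".
remaining = block_tag.contents[-1]
text = str(remaining)

# Assumption: the header is terminated by a single leading period.
if text.startswith("."):
    # Leave only the period in the original paragraph.
    remaining.replace_with(NavigableString("."))
    # Put the rest of the text into a new <p> placed right after the original.
    new_tag = soup.new_tag("p")
    new_tag.string = text[1:]
    block_tag.insert_after(new_tag)

result = str(soup)
print(result)
```

With the sample input this produces the markup shown under the expected result in the question. If the text does not start with a period, the document is left unchanged.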

Note: I'm not sure what the multi-line header content actually looks like, so don't treat this as an exact answer. You may need to improvise.
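
As one example of such improvising, the same idea can be extended to content that consists of several nodes after the header rather than a single text node. The helper below is a hypothetical sketch: the name split_after_header and the default header tag names ("u", "b") are assumptions for illustration, not part of the original answer. Like the answer's code, it leaves any punctuation that follows the header in the second paragraph.

```python
from bs4 import BeautifulSoup


def split_after_header(soup, block_tag, header_names=("u", "b")):
    """Move everything that follows the header tag into a new sibling tag.

    Hypothetical helper: returns the new tag, or None if nothing was moved.
    """
    header = block_tag.find(list(header_names))
    if header is None:
        return None
    # Snapshot the siblings first, because append() moves nodes out of block_tag.
    trailing = list(header.next_siblings)
    if not trailing:
        return None
    new_tag = soup.new_tag(block_tag.name)
    for node in trailing:
        new_tag.append(node)
    block_tag.insert_after(new_tag)
    return new_tag


html = "<p>2.1 <b>Scope</b>: first part <i>emphasis</i> tail</p>"
soup = BeautifulSoup(html, "html.parser")
split_after_header(soup, soup.find("p"))
result = str(soup)
print(result)
```

The helper only looks for the first matching header tag anywhere inside the paragraph, so nested or repeated headers would need extra handling.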

Regarding python - Beautiful Soup 4: Need to add inverse paragraph tags to separate a field into two paragraphs, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/55443180/
