python - 向 lxml 中的多个子项添加相同的元素-6ren

python - 向 lxml 中的多个子项添加相同的元素

转载作者：行者123 更新时间：2023-11-28 22:31:07

26

4

背景:我正在尝试使用从 Web 服务端点检索的元数据充实 XML 报告。报告列出了文本模块和图形，每个图形都有几种分辨率。我无法为每个分辨率添加元数据。

问题:这是简化的问题。

from lxml import etree as ET

myxml = """\
<report>
    <object id="foo">
        <reportitems>
            <reportitem id="1"/>
            <reportitem id="2"/>
            <reportitem id="3"/>
        </reportitems>
    </object>
</report>
"""

report = ET.fromstring(myxml)
test = ET.Element("test", foo="bar")

for r in report.findall("object/reportitems/reportitem"):
    r.append(test)

我得到这个输出:

<report>
    <object id="foo">
        <reportitems>
            <reportitem id="1"/>
            <reportitem id="2"/>
            <reportitem id="3"><test foo="bar"/></reportitem>
        </reportitems>
    </object>
</report>

现在，如果我像这样修改代码(使用相同的 XML 片段):

report = ET.fromstring(myxml)
myElements = [ET.Element("test1"), ET.Element("test2"), ET.Element("test3")]

counter = 0
for r in report.findall("object/reportitems/reportitem"):
    r.append(myElements[counter])
    counter += 1

...然后我得到这个输出:

<report>
    <object id="foo">
        <reportitems>
            <reportitem id="1"><test1/></reportitem>
            <reportitem id="2"><test2/></reportitem>
            <reportitem id="3"><test3/></reportitem>
        </reportitems>
    </object>
</report>

为什么我不能将相同(相同)的元素作为子元素添加到我迭代的多个元素中？

最佳答案

此行为在 lxml tutorial 中进行了描述:

There is another important case where the behaviour of Elements in lxml (in 2.0 and later) deviates from that of lists and from that of the original ElementTree (prior to version 1.3 or Python 2.7/3.2):

>>> for child in root:
...     print(child.tag)
child0 child1 child2 child3
>>> root[0] = root[-1]  # this moves the element in lxml.etree!
>>> for child in root:
...     print(child.tag)
child3 child1 child2

In this example, the last element is moved to a different position, instead of being copied, i.e. it is automatically removed from its previous position when it is put in a different place. In lists, objects can appear in multiple positions at the same time, and the above assignment would just copy the item reference into the first position, so that both contain the exact same item:

>>> l = [0, 1, 2, 3]
>>> l[0] = l[-1]
>>> l
[3, 1, 2, 3]

Note that in the original ElementTree, a single Element object can sit in any number of places in any number of trees, which allows for the same copy operation as with lists. The obvious drawback is that modifications to such an Element will apply to all places where it appears in a tree, which may or may not be intended. The upside of this difference is that an Element in lxml.etree always has exactly one parent, which can be queried through the getparent() method. This is not supported in the original ElementTree.

>>> root is root[0].getparent()  # lxml.etree only!
True

If you want to copy an element to a different position in lxml.etree, consider creating an independent deep copy using the copy module from Python's standard library:

>>> from copy import deepcopy

>>> element = etree.Element("neu")
>>> element.append( deepcopy(root[1]) )

>>> print(element[0].tag)
child1
>>> print([ c.tag for c in root ])
['child3', 'child1', 'child2']

关于python - 向 lxml 中的多个子项添加相同的元素，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/41766062/

26

4

0

文章推荐： python - 如何将项目 append 到 Pandas 中不同列的列表

文章推荐： tomcat - Nsis服务删除

文章推荐： ios - 如何将按钮链接到操作

文章推荐： tomcat - 在 8080 端口打开 Tomcat 页面时出错

lxml - 使用 lxml 删除带有文本的元素
我有以下xml文件 10.10.0.135::3111 10.10.0.130::3111 10.10.0.129::3111 10.
lxml - 如何为 PyPy 安装 lxml？
我已经为 PyPy 创建了一个 virtualenv: virtualenv test -p `which pypy` source test/bin/activate 我安装了以下依赖项: sudo
lxml - 在 python lxml 中查找前缀标签的提示？
我正在尝试使用 lxml 的 ElementTree etree 在我的 xml 文档中查找特定标签。标签如下所示: 12 我希望使用 etree.find('text:statedAge
Python lxml 自定义解析器不适用于 lxml.etree.parse(...) 函数
在 python 3.5 项目中，我必须读取一些 xml 文件，因此我决定使用 lxml 库。由于我正在阅读文件，因此根据文档执行此操作的最有效方法是使用 lxml.etree.parse(...)
python - pip install lxml centOSFailed building wheel for lxml
执行 pip install lxml 或 pip install pyquery 会出现此错误: gcc: error trying to exec 'cc1': execvp: No such f
Python、lxml 和使用 lxml.html.tostring(el) 删除外部标签
我正在使用下面的代码获取一个部分的所有 html 内容以保存到数据库 el = doc.get_element_by_id('productDescription') lxml.html.tostri
python - lxml(或 lxml.html): print tree structure
我想以可区分的方式打印出 etree 的树结构(由 html 文档形成)(意味着两个 etree 应该以不同的方式打印出来)。我所说的结构是指树的“形状”，基本上是指所有标签，但没有属性，也没有文本
python - lxml: cssselect(): AttributeError: 'lxml.etree._Element' 对象没有属性 'cssselect'
谁能解释为什么第一次调用 root.cssselect() 有效，而第二次失败？ from lxml.html import fromstring from lxml import etree htm
python - 导入错误 : No module named lxml - Even though LXML Is installed
我收到此错误“ImportError: No module named lxml”，即使确实安装了 LXML。具体来说，它安装在项目的 python Virtualenv 中。最终我正在研究 Pyth
python - Windows 上的 lxml 错误 - AttributeError : module 'lxml' has no attribute 'etree'
我在 Windows 32 位上使用 Anaconda v4.2 和 Python 3.5，并想使用 lxml etree。我的 Anaconda 发行版包括 lxml 3.6.4，但我的 IDE(P
python - lxml:来自 URL 的一些 XML 给出了这个 lxml.etree.XMLSyntaxError
我有一个脚本，可以从 URL 列表的 XML 文件中提取一些术语。所有 URL 都可以访问 XML 数据。它在第一次正确打开、解析和提取时工作正常，但随后在过程中被某些 XML 文件中断并出现此错误
python - 使用 lxml，是什么导致 "lxml.etree.XMLSyntaxError: Document is empty"错误？
我正在使用 mechanize/cookiejar/lxml 来读取页面，它适用于某些页面但不适用于其他页面。我在其中遇到的错误是标题中的错误。我不能在这里发布页面，因为它们不是 SFW，但是有没有办
python - 令人难以置信的基本 lxml 问题 : getting HTML/string content of lxml. etree._Element？
这是一个基本的问题，我实际上在文档中找不到它:-/ 如下: img = house_tree.xpath('//img[@id="mainphoto"]')[0] 如何获取的 HTML标记？我尝
Python 3 - 使用 pip(非 root)安装 lxml 后得到 "No module named ' lxml'"
我只想说，这个问题我已经在Pip is already installed: but I am getting no module named lxml看到了并且已经看到关于以非 root 用户身份安
python lxml 3.3.5 - 加载代码时出错 - "ValueError: lxml.etree._Element has the wrong size, try recompiling"
我通过 pip 安装了 lxml 3.3.5。现在我在运行一些 Django 测试时遇到了问题: Traceback (most recent call last): File "manage.p
python - 带有 Apache LXML 和 DefusedXML 的 Django Tastypie : Error: "Usage of the xml aspects requires lxml and defusedxml"
我有一个 Django 应用程序，其中包含 Tastypie 提供的 RESTFull 服务。我使用 easy_install 安装 lxml 和 defusedxml，这样我就可以使用 xml 而不
创建时的python lxml etree命名空间
我正在使用 lxml etree 创建 xml 或 REST 调用。我有命名空间的问题，因为如果没有正确制定，我会从服务器收到语法错误。正如您在以下 2 个示例中所看到的，我应该得到例如 ns1、n
python - lxml 在存在命名空间时搜索标签
我对导航具有 lxml.etree namespace 的 xml 文档感到有些困惑。 .我已经看到了一些关于这个主题的主题( 1 、 2 )以及 lxml docs但仍然没有想出答案。 xml =
lxml - 将lxml _Element转换为HtmlElement
由于各种原因，我试图从lxml.html.fromstring()切换到lxml.html.html5parser.document_fromstring()。两者之间的最大区别是，第一个返回lxml
Python:LXML - 如何将元素添加到现有元素树
我需要通过向现有元素添加子元素来修改现有的 xml 文件。我使用 lxml 库。 Eric Idle 999-999-999 555-555-555

首页

博学

6Ren·AI

商城

python - 向 lxml 中的多个子项添加相同的元素