Python eTree Parser 没有附加元素-6ren

Python eTree Parser 没有附加元素

转载作者：行者123 更新时间：2023-11-28 18:48:35

26

4

看看我的日志，看看它是怎么说我从 Postgres 返回的行已经从字符串变成了元素(我打印了字符串，打印了元素，打印了 isElement bool 值!)当我尝试附加它时，错误是它不是一个元素。呼，呼。

import sys
from HTMLParser import HTMLParser
from xml.etree import cElementTree as etree
import xml.etree.ElementTree as ET
from xml.etree.ElementTree import Element, SubElement, tostring
import psycopg2
import psycopg2.extras

def main():
    # Connect to an existing database
    conn = psycopg2.connect(dbname="**", user="**", password="**", host="/tmp/", port="**")

    # Open a cursor to perform database operations
    cur = conn.cursor(cursor_factory = psycopg2.extras.RealDictCursor)

    cur.execute("SELECT * FROM landingpagedata;")
    rows = cur.fetchall()

    class LinksParser(HTMLParser):
      def __init__(self):
          HTMLParser.__init__(self)
          self.tb = etree.TreeBuilder()

      def handle_starttag(self, tag, attributes):
          self.tb.start(tag, dict(attributes))

      def handle_endtag(self, tag):
          self.tb.end(tag)

      def handle_data(self, data):
          self.tb.data(data)

      def close(self):
          HTMLParser.close(self)
          return self.tb.close()

    template = 'template.html'



    # parser.feed(open('landingIndex.html').read()) #for testing
    # root = parser.close()

    for row in rows:
        parser = LinksParser()

        parser.feed(open(template).read())
        root = parser.close()




        #title
        title = root.find(".//title")
        title.text = row['title']

        #headline
        h1_id_headline = root.find(".//h1")
        h1_id_headline.text = row['h1_id_headline']
        # print row['h1_id_headline']

        #intro
        p_class_intro = root.find(".//p[@class='intro']")
        p_class_intro.text = row['p_class_intro']
        # print row['p_class_intro']

问题就出在这里!

        #recommended
        p_class_recommendedbackground = root.find(".//div[@class='recommended_background_div']")
        print p_class_recommendedbackground
        p_class_recommendedbackground.clear()
        newElement = ET.fromstring(row['p_class_recommendedbackground'])
        print row['p_class_recommendedbackground']
        print ET.iselement(newElement)
        p_class_recommendedbackground.append(newElement)

        html = tostring(root)
        f = open(row['page_name'], 'w').close()
        f = open(row['page_name'], 'w')
        f.write(html)
        f.close()
        # f = ''
        # html = ''
        parser.reset()
        root = ''

    # Close communication with the database
    cur.close()
    conn.close()

if __name__ == "__main__":
  main()

我的日志是这样的:

{background: url(/images/courses/azRealEstate.png) center no-repeat;}
<Element 'div' at 0x10a999720>
<p class="recommended_background">Materials are are aimed to all aspiring real estate sales associates who wish to obtain the Arizona Real Estate Salesperson license, which is provided by the <a href="http://www.re.state.az.us/" style="text-decoration: underline;">Arizona Department of Real Estate</a>.</p>
True
Traceback (most recent call last):
  File "/Users/Morgan13/Programming/LandingPageBuilder/landingPages/landingBuilderTest.py", line 108, in <module> main()
  File "/Users/Morgan13/Programming/LandingPageBuilder/landingPages/landingBuilderTest.py", line 84, in main
    p_class_recommendedbackground.append(newElement)
TypeError: must be Element, not Element
[Finished in 0.1s with exit code 1]

最佳答案

我可以这样重现错误消息:

from xml.etree import cElementTree as etree
import xml.etree.ElementTree as ET

croot = etree.Element('root')
child = ET.Element('child')
croot.append(child)
# TypeError: must be Element, not Element

问题的根本原因是我们将 ElementTree 的 cElementTree 实现与 xml.etree.ElementTree 实现混合在一起 元素树。永远不要让两人相遇。

所以解决方法就是选择一个，比如 etree，然后替换所有出现的另一个(例如，用 etree 替换 ET) .

关于Python eTree Parser 没有附加元素，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/16467640/

26

4

0

文章推荐： javascript - 独立的javascript同时移动同一个圆？

文章推荐： ios - NotificationCenter 没有调用选择器

parsing - 创建类型为 Parser a -> Parser b -> Parser (要么 a b)的解析器组合器
我想解析一些文本，其中某些字段在大多数情况下都具有结构，但偶尔(由于特殊大小写、拼写错误等)该结构会丢失。例如常规情况是 Cost: 5，但偶尔会显示 Cost: 5m 或 Cost: 3 + 1
java - parser.setFeature 与 parser.setValidating
以下有什么区别: parser.setFeature("http://xml.org/sax/features/validation",true); and parser.setFeature("ht
node.js - npm 错误! EPROTO : protocol error, 符号链接(symbolic link) '../@babel/parser/bin/babel-parser.js' -> '/home/vagrant/code/proadco.test/node_modules/.bin/parser'
我尝试在 Windows 8.1 上的 Git Bash 客户端中执行 npm install，但收到以下错误: npm WARN optional SKIPPING OPTIONAL DEPENDE
types - 理解这个 elm url-parser Parser 类型声明
试图理解 evancz/url-parser 模块时，我偶然发现了这种我难以理解的类型声明:( source ) type Parser a b = Parser (State a -> List
typescript - ESLint 配置中的 "parser"和 "parserOptions.parser"有什么区别？
我长期使用下面的 TypeScript 和 Vue 预设。它有效，但我还没有理解每个选项，现在要理解它。第一:parser之间有什么区别？和 @typescript-eslint/parser ？ p
javascript - node-sql-parser ， parser.astify(sqlquery) 不返回准确的输出
我正在尝试使用node-sql-parser在nodejs中解析sql查询。 const {Parser} = require('node-sql-parser'); const parser = n
javascript - AgnualrJs - $parsers.unshift 与 $parsers.push 的区别
自定义指令中的 ngModelCtrl.$parsers.unshift 和 ngModelCtrl.$parsers.push 之间的确切区别是什么。当发生对模型生效但对表单本身无效的事情时，我想
java - XML Pull Parser 和 SAX Parser 有什么区别
我正在寻找 SAX 和 Pull Parser 之间的主要区别。我知道 SAX 解析器非常适合处理大型 XML 文件，因为它不存储 XML 并且只在一个方向上遍历。与 DOM 相比。但我无法找到 SA
c++ - 对 QJson::Parser::Parser 的 undefined reference
我已经按照存档中的说明成功(？)安装了 QJson 库。但是编译器给我这个错误: Undefined reference to QJSon::Parser::Parser(). 我找到了安装库文件的位
html - 我应该使用 HTML::Parser 还是 XML::Parser 来提取和替换文本？
关闭。这个问题不符合Stack Overflow guidelines .它目前不接受答案。要求我们推荐或查找工具、库或最喜欢的场外资源的问题对于 Stack Overflow 来说是偏离主题的，
Java cucumber -java.lang.NoSuchMethodError : gherkin. parser.Parser.parse
尝试实现 Cucumber 来进行一些自动化测试。当我使用 junit 运行测试时，出现以下错误项目路径: src/test/java/ cucumberJava -cucumberjava,ja
node.js - Node Body Parser 和 cookie parser 有什么作用？我应该使用它们吗？
我已经阅读了我能找到的所有文档，但是我找不到关于这两个中间件的作用的简单解释。 body-parser 中的body 指的是什么？为什么需要解析正文？ Cookie 也类似。我是否更正了 cookie
java - Deorg.apache.xerces.parsers.XIncludeAwareParserConfiguration无法转换为org.apache.xerces.xni.parser.XMLParserConfiguration
我在gradle项目中使用Xerces jar，然后在gradle插件中使用该项目的jar:实际上，我正在开发一个gradle插件，其中包含使用gradle项目jar的某些功能的自定义任务；当我尝试运
java - org.apache.commons.net.ftp.parser.ParserInitializationException : Unknown parser type: Linux
我正在尝试连接到 Linux 上的 FTP 服务器，当我执行 ftp.listFiles(remote); 时遇到异常 SYST 215 Linux Exception in thread "AW
reactjs - 创建 react 应用程序-没有 typescript ，得到了错误: Failed to load parser '@typescript-eslint/parser'
我有react-app的示例安装，我得到了以下内容 Error: Failed to load parser '@typescript-eslint/parser' declared in '.esl
java - ClassCastException : org. apache.xerces.parsers.XIncludeAwareParserConfiguration 无法转换为 org.apache.xerces.xni.parser.XMLParserConfiguration
我在 there 中提到过类似的问题.我正在使用 mvn clean compile site 命令，我的版本是: cobertura.version: 2.5.1 findbugs.version:
java - ClassCastException : org. apache.xerces.parsers.XIncludeAwareParserConfiguration 无法转换为 org.apache.xerces.xni.parser.XMLParserConfiguration
我正在 Eclipse 中开发 GWT 应用程序并使用 jdom2 读取一些自定义 xml 属性文件。在最近的更新之后，我的应用程序现在在尝试读取 xml 文件时失败并出现上述错误。相关堆栈跟踪是:
java.lang.IllegalAccessError : class javax. xml.parsers.SecuritySupport12 无法访问其父类(super class) javax.xml.parsers.SecuritySupport
我正在使用 spring+maven。我正在 tomcat 服务器中部署我的应用程序。当我尝试运行我的应用程序时，突然出现以下错误。 INFO: Starting Servlet Engine: Ap
python - 使用 "import dateutil"和 "dateutil.parser.parse()"时出现 AttributeError 但使用 "from dateutil import parser"时没有问题
我在玩dateutil module在 Python 2.7.3 中。我只是想使用: import dateutil dateutil.parser.parse("01-02-2013") 但我得到了
Python基于argparse与ConfigParser库进行入参解析与ini parser
一.入参解析库 argparse 　　有时候写Python脚本，需要处理入参[-h][-v][-F]...等情况，如果自己来解析的话，会花费很多时间，而且也容易出问题，好在Python有现成的li

首页

博学

6Ren·AI

商城

Python eTree Parser 没有附加元素