gpt4 book ai didi

python - 将 FeedParser 对象序列化为 Atom

转载 作者:太空宇宙 更新时间:2023-11-04 01:40:12 39 4
gpt4 key购买 nike

我使用 feedparser http://www.feedparser.org/解析 Atom 提要,并对生成的 Python objetcs 进行一些操作。之后,我想将对象序列化回 Atom。但是 feedparser 似乎没有提供这样做的方法?

我注意到其他 Atom 库,例如 gdata http://code.google.com/p/gdata-python-client/或 demokritos http://jtauber.com/demokritos/但是,说实话,它们对初学者来说似乎很难。我使用 feedparser 正是因为它极其简单。

在 namsral 的良好反响下,我用我最喜欢的模板语言 SimpleTAL 编写了一个序列化程序

import feedparser

from simpletal import simpleTAL, simpleTALES, simpleTALUtils

mytemplate = """
<feed xmlns="http://www.w3.org/2005/Atom">
<title tal:condition="feed/title" tal:content="feed/title"/>
<link tal:condition="feed/link" tal:content="feed/link"/>
<updated tal:condition="feed/updated" tal:content="feed/updated"/>
<id tal:condition="feed/id" tal:content="feed/id"/>
<!-- TODO other feed variables -->
<entry xmlns='http://www.w3.org/2005/Atom'
xmlns:thr='http://purl.org/syndication/thread/1.0'
tal:repeat="entry entries">
<title tal:condition="entry/title" tal:content="entry/title"/>
<summary tal:condition="entry/summary" tal:content="entry/summary"/>
<content tal:condition="entry/content" tal:content="python: entry.content[0]['value']"/> <!-- TODO: metadata and the other items in content -->
<id tal:condition="entry/id" tal:content="entry/id"/>
<published tal:condition="entry/published" tal:content="entry/published"/>
<updated tal:condition="entry/updated" tal:content="entry/updated"/>
<!-- TODO other entry fields -->
</entry>
</feed>
"""
context = simpleTALES.Context(allowPythonPath=True)
template = simpleTAL.compileXMLTemplate (mytemplate)

class FeedParserPlus(feedparser.FeedParserDict):

def serialize(self):
context.addGlobal ("feed", self.feed)
context.addGlobal ("entries", self.entries)
result = simpleTALUtils.FastStringOutput()
template.expand (context, result)
return result.getvalue()

@classmethod
def parse(klass, text):
result = feedparser.parse(text)
return FeedParserPlus(result)

最佳答案

使用 Mako、Jinja 或 Django 等 Python 模板库生成提要相当容易。

一个使用 Bottle.py 的例子:

<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
<title>{{! d['title'] }}</title>
<subtitle>{{! d['subtitle'] }}</subtitle>
<link rel="alternate" type="text/html" href="{{! d['site_url'] }}" />
<link rel="self" type="application/atom+xml" href="{{! d['feed_url'] }}" />
<id>{{! d['feed_url'] }}</id>
<updated>{{! d['date_updated'] }}</updated>
<rights>{{! d['copyright'] }}</rights>

%for entry in entries:
<entry>
<title>{{! entry['title'] }}</title>
<link rel="alternate" type="text/html" href="{{! entry['url'] }}" />
<id>{{! entry['atom_id'] }}</id>
<published>{{! entry['date_published'] }}</published>
<updated>{{! entry['date_updated'] }}</updated>
<author>
<name>{{! d['author'] }}</name>
<uri>{{! d['site_url'] }}</uri>
</author>
<content type="html" xml:base="{{! d['site_url'] }}" xml:lang="en">
<![CDATA[{{! entry['body'] }}]]>
</content>
</entry>
%end

</feed>

有关使用 Django 尤其是 django-atompub 的更多信息:http://code.google.com/p/django-atompub/wiki/UserGuide

关于python - 将 FeedParser 对象序列化为 Atom,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/5916375/

39 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com