gpt4 book ai didi

python - 无法运行Scrapy程序

转载 作者:太空宇宙 更新时间:2023-11-03 17:49:37 28 4
gpt4 key购买 nike

我一直在通过以下链接学习如何使用 Scrapy:

http://doc.scrapy.org/en/master/intro/tutorial.html

当我尝试运行 Crawling(scrapy scrapy dmoz) 部分中编写的代码时,出现以下错误:

AttributeError:“模块”对象没有属性“Spider

但是,我将“Spider”更改为“spider”,但除了一个新错误之外什么也没有:

TypeError: Error when calling the metaclass bases
module.__init__() takes at most 2 arguments (3 given)

我很困惑,这是什么问题?任何帮助将不胜感激。谢谢。顺便说一下,我使用的是 Windows。

编辑(添加源):

首先,我使用 Scrapy 创建了一个项目,方法是转到一个目录并通过 cmd 运行以下命令,如下所示:

cd #DIRECTORY PATH#

scrapy startproject tutorial

这将在给定目录中创建一个名为教程的文件夹。教程文件夹包含:

教程/ scrapy.cfg 教程/ init.py 项目.py 管道.py 设置.py 蜘蛛/ init.py ...

然后我定义了我的项目:

import scrapy

class DmozItem(scrapy.Item):
title = scrapy.Field()
link = scrapy.Field()
desc = scrapy.Field()

之后,我创建了蜘蛛:

导入scrapy

class DmozSpider(scrapy.Spider):
name = "dmoz"
allowed_domains = ["dmoz.org"]
start_urls = [
"http://www.dmoz.org/Computers/Programming/Languages/Python/Books/",
"http://www.dmoz.org/Computers/Programming/Languages/Python/Resources/"
]

def parse(self, response):
filename = response.url.split("/")[-2]
with open(filename, 'wb') as f:
f.write(response.body)

之后,运行代码时,显示错误。我使用 Windows 7 64 位和 Python 2.7 32 位。

编辑2:

我尝试卸载并安装另一个 Scrapy 版本,但没有成功。这是日志:

C:\Users\Novin Pendar\Desktop\FS\tutorial>scrapy crawl dmoz
2015-03-26 17:48:29+0430 [scrapy] INFO: Scrapy 0.16.5 started (bot: tutorial)
2015-03-26 17:48:29+0430 [scrapy] DEBUG: Enabled extensions: LogStats, TelnetCon
sole, CloseSpider, WebService, CoreStats, SpiderState
C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\__init__.pyc
Traceback (most recent call last):
File "C:\Python27\lib\runpy.py", line 162, in _run_module_as_main
"__main__", fname, loader, pkg_name)
File "C:\Python27\lib\runpy.py", line 72, in _run_code
exec code in run_globals
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\cmdline.py"
, line 156, in <module>
execute()
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\cmdline.py"
, line 131, in execute
_run_print_help(parser, _run_command, cmd, args, opts)
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\cmdline.py"
, line 76, in _run_print_help
func(*a, **kw)
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\cmdline.py"
, line 138, in _run_command
cmd.run(args, opts)
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\commands\cr
awl.py", line 43, in run
spider = self.crawler.spiders.create(spname, **opts.spargs)
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\command.py"
, line 33, in crawler
self._crawler.configure()
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\crawler.py"
, line 40, in configure
self.spiders = spman_cls.from_crawler(self)
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\spidermanag
er.py", line 35, in from_crawler
sm = cls.from_settings(crawler.settings)
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\spidermanag
er.py", line 31, in from_settings
return cls(settings.getlist('SPIDER_MODULES'))
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\spidermanag
er.py", line 22, in __init__
for module in walk_modules(name):
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\utils\misc.
py", line 65, in walk_modules
submod = __import__(fullpath, {}, {}, [''])
File "tutorial\spiders\dmoz_spider.py", line 3, in <module>
class DmozSpider(scrapy.Spider):
AttributeError: 'module' object has no attribute 'Spider'

编辑3:

问题已经解决了。我下载了最新版本的Scrapy(0.24)并安装。一切都那么顺利。只是想对那些和我有同样问题的人说,所以,他们会节省很多时间。谢谢。

最佳答案

如果你的安装正确。试试这个

检查工作文件夹中的任何 scrapy.pyscrapy.pyc。如果存在,请将其重命名。不要将 Spider 更改为 蜘蛛

关于python - 无法运行Scrapy程序,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/29275958/

28 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com