python - Scrapy error: TypeError: __init__() got an unexpected keyword argument 'callback'


Reposted by 太空宇宙. Updated: 2023-11-04 00:48:15


I am trying to scrape a website by extracting every link that contains "huis" (Dutch for "house"). Following http://doc.scrapy.org/en/latest/topics/spiders.html, I tried:

import scrapy
from scrapy.spiders import CrawlSpider, Rule
from scrapy.linkextractors import LinkExtractor

from Funda.items import FundaItem

class FundaSpider(scrapy.Spider):
    name = "Funda"
    allowed_domains = ["funda.nl"]
    start_urls = [
        "http://www.funda.nl/koop/amsterdam/"
    ]

    rules = (
    Rule(LinkExtractor(allow=r'.*huis.*', callback='parse_item'))
    )

    def parse_item(self, response):
        item = FundaItem()
        item['title'] = response.extract()
        return item

However, I get this error message:

Rule(LinkExtractor(allow=r'.*huis.*', callback='parse_item'))
TypeError: __init__() got an unexpected keyword argument 'callback'

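Judging from an earlier post (Scrapy Error: TypeError: __init__() got an unexpected keyword argument 'deny'), a likely cause is mismatched parentheses, so that the keyword ends up passed to Rule instead of LinkExtractor. In this case, however, it looks to me as if callback sits inside the LinkExtractor parentheses, as intended.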

Any idea what is causing this error?

Best answer

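Yes, callback is definitely being passed to LinkExtractor. That actually appears to be the problem, because I don't see callback among the expected arguments for that class in the documentation.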

I do see that the Rule class has a callback parameter listed in the documentation. So perhaps you should pass it to Rule instead of LinkExtractor?

Rule(LinkExtractor(allow=r'.*huis.*'), callback='parse_item')
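To see why moving one closing parenthesis matters, here is a minimal sketch that reproduces the same kind of TypeError without Scrapy installed. FakeLinkExtractor and FakeRule are hypothetical stand-ins written for this illustration; they are not Scrapy's real classes and only mimic which constructor accepts a callback keyword.

```python
# Hypothetical stand-ins, NOT Scrapy's real classes. They only mimic
# which constructor accepts the `callback` keyword.
class FakeLinkExtractor:
    def __init__(self, allow=(), deny=()):
        self.allow = allow
        self.deny = deny


class FakeRule:
    def __init__(self, link_extractor, callback=None):
        self.link_extractor = link_extractor
        self.callback = callback


def build_wrong():
    # `callback` is inside the inner parentheses, so Python hands it to
    # FakeLinkExtractor, which does not accept it.
    return FakeRule(FakeLinkExtractor(allow=r'.*huis.*', callback='parse_item'))


def build_right():
    # The inner call is closed first, so `callback` goes to FakeRule.
    return FakeRule(FakeLinkExtractor(allow=r'.*huis.*'), callback='parse_item')


try:
    build_wrong()
except TypeError as exc:
    error_message = str(exc)
else:
    error_message = None

rule = build_right()
print(error_message)   # mentions: unexpected keyword argument 'callback'
print(rule.callback)
```

The exact prefix of the message can vary by Python version, but in each case the keyword is rejected by the inner constructor before the outer one is ever called.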

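If you're thinking, "but why did the answerer of the linked question put callback inside the LinkExtractor call?", I think you may have misread the nested parentheses, which admittedly are a bit confusing. Changing the layout makes it clearer: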

rules = (
    Rule(
        LinkExtractor(
            allow=[r'/*'], 
            deny=('blogs/*', 'videos/*', )
        ),
        callback='parse_html'
    ), 
)
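A side note on the layout above: it ends with a trailing comma after the single Rule, whereas the rules in the question do not. Assuming rules is meant to be an iterable of Rule objects, that comma is what makes it a one-element tuple. The sketch below shows the difference in plain Python, with a placeholder string standing in for a Rule.

```python
# Placeholder value standing in for a Rule object; any object would do.
rule = "rule-placeholder"

without_comma = (rule)   # parentheses only group: still the bare object
with_comma = (rule,)     # the trailing comma makes a one-element tuple

print(type(without_comma).__name__)
print(type(with_comma).__name__, len(with_comma))
```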

This question about the Scrapy error TypeError: __init__() got an unexpected keyword argument 'callback' comes from a similar question on Stack Overflow: https://stackoverflow.com/questions/38335472/
