python - Custom TensorFlow decoder TypeError: __call__() missing 1 required positional argument: 'inputs'

Reposted by: 行者123 · Updated: 2023-12-04 10:05:00

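I am using TensorFlow 2.0 to train my own attention model, but I ran into a major problem when building the decoder class. Calling the decoder fails like this: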

TypeError                                 Traceback (most recent call last)
<ipython-input-19-3042369c4295> in <module>
      9     enc_hidden_h=fw_sample_state_h,
     10     enc_hidden_c=fw_sample_state_c,
---> 11     enc_output=sample_output)
     12 
     13 print ('Decoder output shape: (batch_size, vocab size) {}'.format(sample_decoder_output.shape))

TypeError: __call__() missing 1 required positional argument: 'inputs'

My encoder-decoder attention model is shown below.
Encoder: a custom pBLSTM (pyramidal bidirectional LSTM)

class Encoder(tf.keras.Model):
    def __init__(self, lstm_units, final_units, batch_sz, conv_filters, mfcc_dims):
        super(Encoder, self).__init__()
        self.lstm_units = lstm_units
        self.final_units = final_units
        self.batch_sz = batch_sz
        self.conv_filters = conv_filters
        self.mfcc_dims = mfcc_dims

        # Convolution layer to extract feature after MFCC
        self.conv_feat = tf.keras.layers.Conv1D(filters=self.conv_filters, 
                                                kernel_size=self.mfcc_dims, 
                                                padding='valid', 
                                                activation='relu', 
                                                strides=self.mfcc_dims)

    def call(self, x):
        '''
        build a pyramidal LSTM neural network encoder
        '''
        # Convolution Feature Extraction
        x = self.conv_feat(x)

        # initialize states for forward and backward
        initial_state_fw = None
        initial_state_bw = None

        counter = 0
        while(x.shape[1] > self.final_units):
            counter += 1
            # forward LSTM
            fw_output, fw_state_h, fw_state_c = self.build_lstm(True)(x, initial_state=initial_state_fw)

            # backward LSTM
            bw_output, bw_state_h, bw_state_c = self.build_lstm(False)(x, initial_state=initial_state_bw)

            x = tf.concat([fw_output, bw_output], -1)
            x = self.reshape_pyramidal(x)

            initial_state_fw = [fw_state_h, fw_state_c]
            initial_state_bw = [bw_state_h, bw_state_c]

        print(f"Encoder pyramid layer number: {counter}\n")
        return x, (fw_state_h, fw_state_c), (bw_state_h, bw_state_c)

    def build_lstm(self, back=True):
        '''
        build LSTM layer for forward and backward
        '''
        return tf.keras.layers.LSTM(units=self.lstm_units, 
                                    return_sequences=True, 
                                    return_state=True, 
                                    go_backwards=back)

    def reshape_pyramidal(self, outputs):
        '''
        After concatenating forward and backward outputs
        return the reshaped output
        '''
        batch_size, time_steps, num_units = outputs.shape

        return tf.reshape(outputs, (batch_size, -1, num_units * 2))
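To make the pyramid loop easier to follow, here is a minimal shape-tracing sketch in plain Python (no TensorFlow). The helper name pyramid_shapes and the example sizes are hypothetical and not part of the original post. It assumes the time dimension is even at every layer, since reshaping (batch, time, 2 * lstm_units) into (batch, -1, 4 * lstm_units) needs an even number of time steps.

```python
def pyramid_shapes(time_steps, lstm_units, final_units):
    """Trace (time_steps, features) after each pyramid layer.

    Hypothetical helper mirroring the loop in Encoder.call():
    concatenate forward and backward outputs, then merge adjacent frames.
    """
    shapes = []
    while time_steps > final_units:
        # forward and backward LSTM outputs concatenated on the last axis
        features = 2 * lstm_units
        if time_steps % 2 != 0:
            raise ValueError("time_steps must be even to merge adjacent frames")
        # reshape_pyramidal: halve the time axis, double the feature axis
        time_steps //= 2
        features *= 2
        shapes.append((time_steps, features))
    return shapes


# Example sizes are assumptions chosen only for illustration.
shapes = pyramid_shapes(time_steps=64, lstm_units=128, final_units=8)
print(shapes)
```

Each layer halves the sequence length, so the number of entries in the returned list corresponds to the counter printed by the encoder.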

Attention model: built according to this paper:
https://arxiv.org/abs/1508.04025v5

class BahdanauAttention(tf.keras.layers.Layer):
    def __init__(self, units):
        super(BahdanauAttention, self).__init__()
        self.W1 = tf.keras.layers.Dense(units)
        self.W2 = tf.keras.layers.Dense(units)
        self.V = tf.keras.layers.Dense(1)

    def call(self, query, values):
        # query hidden state shape == (batch_size, hidden size)
        # query_with_time_axis shape == (batch_size, 1, hidden size)
        # values shape == (batch_size, max_len, hidden size)
        # we are doing this to broadcast addition along the time axis to calculate the score
        query_with_time_axis = tf.expand_dims(query, 1)

        # score shape == (batch_size, max_length, 1)
        # we get 1 at the last axis because we are applying score to self.V
        # the shape of the tensor before applying self.V is (batch_size, max_length, units)
        score = self.V(tf.nn.tanh(
            self.W1(query_with_time_axis) + self.W2(values)))

        # attention_weights shape == (batch_size, max_length, 1)
        attention_weights = tf.nn.softmax(score, axis=1)

        # context_vector shape after sum == (batch_size, hidden_size)
        context_vector = attention_weights * values
        context_vector = tf.reduce_sum(context_vector, axis=1)

        return context_vector, attention_weights
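The last two steps of the attention layer (softmax over the time axis, then a weighted sum of the values) can be sketched for a single batch element in plain Python. This is only an illustration: the function name attend is hypothetical, and the scores are assumed to be already computed, so the Dense layers W1, W2 and V are not modelled here.

```python
import math


def attend(scores, values):
    """Softmax over time steps, then a weighted sum of the value vectors.

    scores: one float per time step (assumed precomputed).
    values: one vector (list of floats) per time step.
    """
    # numerically stable softmax along the time axis
    max_score = max(scores)
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # context vector: sum over time of weight * value, per hidden unit
    hidden_size = len(values[0])
    context = [
        sum(w * v[i] for w, v in zip(weights, values))
        for i in range(hidden_size)
    ]
    return context, weights


# Two time steps with equal scores, so each gets the same weight.
context, weights = attend([0.0, 0.0], [[1.0, 2.0], [3.0, 4.0]])
print(context, weights)
```

The weights always sum to 1 across the time steps, which is why the context vector has the same size as a single value vector.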

Decoder: a single-layer LSTM decoder

class Decoder(tf.keras.Model):
    def __init__(self, target_sz, embedding_dim, decoder_units, batch_sz, **kwargs):
        super(Decoder, self).__init__(**kwargs)
        self.batch_sz = batch_sz
        self.decoder_units = decoder_units
        self.embedding = tf.keras.layers.Embedding(target_sz, embedding_dim)
        self.attention = BahdanauAttention(self.decoder_units)
        self.lstm = tf.keras.layers.LSTM(units=self.decoder_units, return_sequences=True, return_state=True)
        self.fc = tf.keras.layers.Dense(target_sz)


    def call(self, x, enc_hidden_h, enc_hidden_c, enc_output):
        '''
        build LSTM decoder
        '''
        # enc_output shape == (batch_size, max_length, hidden_size)
        context_vector, attention_weights = self.attention(enc_hidden_h, enc_output)

        # x shape after passing through embedding == (batch_size, 1, embedding_dim)
        x = self.embedding(x)

        # x shape after concatenation == (batch_size, 1, embedding_dim + hidden_size)
        x = tf.concat([tf.expand_dims(context_vector, 1), x], axis=-1)

        # passing the concatenated vector to the LSTM
        output, state_h, state_c = self.lstm(x)

        # output shape == (batch_size * 1, hidden_size)
        output = tf.reshape(output, (-1, output.shape[-1]))

        # output shape == (batch_size, vocab)
        x = self.fc(output)

        return x, (state_h, state_c), attention_weights

I hit this error when testing with my sample input, as shown below:

example_input_batch, example_target_batch = next(iter(dataset))
sample_output, (fw_sample_state_h, fw_sample_state_c), bw_sample_state = encoder(example_input_batch)
decoder = Decoder(target_sz=PHONEME_SIZE, 
                  embedding_dim=EMBEDDING_DIM, 
                  decoder_units=LSTM_UNITS, 
                  batch_sz=BATCH_SIZE)

sample_target_size = tf.random.uniform((BATCH_SIZE, 1))
sample_decoder_output, sample_decoder_hidden, attention_weights = decoder(
    x=sample_target_size, 
    enc_hidden_h=fw_sample_state_h, 
    enc_hidden_c=fw_sample_state_c, 
    enc_output=sample_output)

Best answer

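As discussed in the comments, the problem is that the poster inherits from tf.keras.Model when defining the Decoder class. In TensorFlow 2.0 this superclass expects its first argument, named inputs, in the __call__() operator. Because the decoder is called with keyword arguments only (x=..., enc_hidden_h=..., and so on), nothing is bound to inputs, and __call__() raises the TypeError before Decoder.call() is ever reached.

Therefore, this error can be resolved by renaming x to inputs in the Decoder.call() method (and passing inputs=... instead of x=... at the call site), like this: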

def call(self, inputs, enc_hidden_h, enc_hidden_c, enc_output):
    '''
    build LSTM decoder
    '''
    # enc_output shape == (batch_size, max_length, hidden_size)
    context_vector, attention_weights = self.attention(enc_hidden_h, enc_output)

    # x shape after passing through embedding == (batch_size, 1, embedding_dim)
    x = self.embedding(inputs)

    # x shape after concatenation == (batch_size, 1, embedding_dim + hidden_size)
    x = tf.concat([tf.expand_dims(context_vector, 1), x], axis=-1)

    # passing the concatenated vector to the LSTM
    output, state_h, state_c = self.lstm(x)

    # output shape == (batch_size * 1, hidden_size)
    output = tf.reshape(output, (-1, output.shape[-1]))

    # output shape == (batch_size, vocab)
    x = self.fc(output)

    return x, (state_h, state_c), attention_weights
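The mechanism behind the error can be reproduced without TensorFlow. The sketch below uses a simplified stand-in class, MiniLayer, which is hypothetical and is not the real Keras implementation. It only assumes what the traceback shows: that __call__() takes a first parameter named inputs and forwards everything to call().

```python
class MiniLayer:
    """Simplified stand-in for a Keras layer (assumption, not real Keras code)."""

    def __call__(self, inputs, *args, **kwargs):
        # the first argument must bind to `inputs` before call() is reached
        return self.call(inputs, *args, **kwargs)


class DecoderX(MiniLayer):
    # first parameter named `x`, as in the question
    def call(self, x, enc_output):
        return ("ok", x, enc_output)


class DecoderInputs(MiniLayer):
    # first parameter renamed to `inputs`, as in the answer
    def call(self, inputs, enc_output):
        return ("ok", inputs, enc_output)


error_message = None
try:
    # keyword-only call: nothing binds to `inputs`, so __call__ fails
    DecoderX()(x=1, enc_output=2)
except TypeError as err:
    error_message = str(err)

# passing the first argument positionally works even with the name `x`
positional_result = DecoderX()(1, enc_output=2)

# after the rename, the keyword form works as well
keyword_result = DecoderInputs()(inputs=1, enc_output=2)

print(error_message)
print(positional_result, keyword_result)
```

Under this assumption there are two ways out: rename the parameter as the answer does, or keep the name x and pass the first argument positionally.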

The original question about this error (python - custom TensorFlow decoder TypeError: __call__() missing 1 required positional argument: 'inputs') is on Stack Overflow: https://stackoverflow.com/questions/61631360/
