python - Tensorflow/Keras : Input 0 of layer lstm is incompatible with the layer: expected ndim=3, 发现 ndim=2-6ren

python - Tensorflow/Keras : Input 0 of layer lstm is incompatible with the layer: expected ndim=3, 发现 ndim=2

转载作者：行者123 更新时间：2023-12-04 07:40:01

我正在尝试实现联合训练 Keras/Tensorflow 模型来检测文本文章中的假新闻，但我在使用该模型时遇到了问题。当我尝试运行代码时，出现以下错误:

 ValueError: Input 0 of layer lstm is incompatible with the layer: expected ndim=3, found ndim=2. Full shape received: [None, 50]

以及以下警告:

WARNING:tensorflow:Model was constructed with shape (None, 400) for input Tensor("embedding_input:0", shape=(None, 400), dtype=float32), but it was called on an input with incompatible shape (None,).

直觉上我明白嵌入层输出应该是形状 (None, 400, 50) 但由于某种原因，它只提供一个 2d 输入，或者该层需要一个 3d 张量，但只提供一个 2d 张量。但是，我不知道如何修复它，也不知道如何更改输入/输出形状以使它们匹配。我已经在这个问题上停留了几天。我在 ML 和神经网络领域还是新手。任何建议都值得感谢，非常感谢您提前。
使用的模型:

max_words = 2000
max_len = 400
embed_dim = 50
lstm_out = 64
batch_size = 32

def getTextModel():
    model = Sequential()
    model.add(Embedding(max_words, embed_dim, input_length = max_len, input_shape=preprocessed_sample_dataset.element_spec))
    model.add(LSTM(lstm_out))
    model.add(Dense(256))
    model.add(Activation('relu'))
    model.add(Dropout(0.5))
    model.add(Dense(1, name='out_layer'))
    model.add(Activation('sigmoid'))
return model

型号概要:

Model: "sequential"
_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
embedding (Embedding)        (None, 400, 50)           100000    
_________________________________________________________________
lstm (LSTM)                  (None, 64)                29440     
_________________________________________________________________
dense (Dense)                (None, 256)               16640     
_________________________________________________________________
activation (Activation)      (None, 256)               0         
_________________________________________________________________
dropout (Dropout)            (None, 256)               0         
_________________________________________________________________
out_layer (Dense)            (None, 1)                 257       
_________________________________________________________________
activation_1 (Activation)    (None, 1)                 0         
=================================================================
Total params: 146,337
Trainable params: 146,337
Non-trainable params: 0

其他信息:
数据预处理:

def preprocess(dataset):

  def batch_format_fn(element):
    """Flatten a batch `pixels` and return the features as an `OrderedDict`."""
    print(element['features'])
    return collections.OrderedDict(
        x=element['features'],
        y=tf.reshape(element['label'], [-1, 1])
    )
  return dataset.repeat(NUM_EPOCHS).shuffle(SHUFFLE_BUFFER).batch(
      BATCH_SIZE).map(batch_format_fn).prefetch(PREFETCH_BUFFER)

preprocessed_sample_dataset = preprocess(sample_dataset)


def make_federated_data(client_data, client_ids):
    return [preprocess(client_data.create_tf_dataset_for_client(x)) for x in client_ids]

federated_train_data = make_federated_data(train_dataset, train_dataset.client_ids)

print('Number of client datasets: {l}'.format(l=len(federated_train_data)))
print('First dataset: {d}'.format(d=federated_train_data[0]))

数据集格式:

Number of client datasets: 4
First dataset: <PrefetchDataset shapes: OrderedDict([(x, (None,)), (y, (None, 1))]), types: OrderedDict([(x, tf.string), (y, tf.int64)])>

调用函数的代码:

def model_fn():

  keras_model = getTextModel() #create_keras_model()
  input_spec_aux = preprocessed_sample_dataset.element_spec
  return tff.learning.from_keras_model(
      keras_model,
      input_spec= input_spec_aux,
      loss=tf.keras.losses.SparseCategoricalCrossentropy(),
      metrics=[tf.keras.metrics.SparseCategoricalAccuracy()])

#Error occurs in iterative_process
iterative_process = tff.learning.build_federated_averaging_process(
    model_fn,
    client_optimizer_fn=lambda: tf.keras.optimizers.Adam(learning_rate=client_lr),
    server_optimizer_fn=lambda: tf.keras.optimizers.SGD(learning_rate=server_lr))

print(str(iterative_process.initialize.type_signature))

state = iterative_process.initialize()

最佳答案

数据集格式表示输入的形状 x是 (None,) (ndim/rank, = 1) 和数据类型 tf.string) . None来自这样一个事实，即数据集可能会产生不“完整”的批次，因此实际上第一个维度的范围是 [1, BATCH_SIZE] .这个形状意味着我们有一批单标量字符串。这可能是问题所在，通常在 LSTM 中，我们需要一批字符串序列，例如形状像 (None, SEQUENCE_LENGTH) .
嵌入层会将最后一个维度投影到嵌入维度z ，例如成型(x, y)并产生形状 (x, y, z) .所以我们在嵌入层之后的输入将是 (None, 50) (或 ndim/rank = 2)。回想一下 LSTM 需要序列，而 Keras 需要批处理，错误消息说所需的形状是 (None, SEQUENCE_LENGTH, 50) (ndim/等级 = 3)。
我建议返回数据集并确定 element['features'] 的格式。是。似乎在这种情况下，它可能是一个完整的句子，需要被标记为一系列单词(例如，对于英语在空格上分割)。
不过有一句警告:即使在修复了形状之后，我怀疑 Keras 接下来会提示 tf.string 的 dtype |不能用于嵌入层。首先需要将序列转换为整数 id，可能使用来自 tf.lookup 的东西。或来自 tf_text 的东西.
一些可能有用的资源:

Federated Learning for Text Generation Tutorial ，特别是数据集构建部分。

Load text tutorial

关于python - Tensorflow/Keras : Input 0 of layer lstm is incompatible with the layer: expected ndim=3, 发现 ndim=2，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/67533039/

文章推荐： regex - 如何使用 Oracle regexp_substr 从字符串中提取单词？

文章推荐： ruby-on-rails - 使用命名路由与使用 url_for()

文章推荐： ruby-on-rails - Rails 迁移 : primary key id with unsigned int(10)

文章推荐： python - Py_Finalize() 导致 Python 3.9 的段错误但不是 Python 2.7

expect - Expect 脚本还值得学习吗？
就目前而言，这个问题不适合我们的问答形式。我们希望答案得到事实、引用资料或专业知识的支持，但这个问题可能会引发辩论、争论、投票或扩展讨论。如果您觉得这个问题可以改进并可能重新打开，visit the
expect - Expect 脚本的用户输入
我是脚本新手。如何编写 Expect 脚本以通过 ssh 连接到设备并提示用户输入密码？我们使用 pin + RSA token 代码作为密码，因此我无法存储密码。 #!/usr/bin/expect
expect - Expect 脚本中的 Do-while
我编写了以下代码并尝试执行它。但我在执行 do {”时遇到“无效的命令名称“do”” 代码: #!/usr/bin/expect set val 0; set input 5; do { pu
expect - Expect 的 "-r"命令中的 "expect -r PATTERN"是什么意思？
我已经查看了 Expect 联机帮助页并用 Google 搜索了它，但我还没有找到 expect 的 -r 是什么。我看到这个选项以前是这样用的 expect -r "\r\n\r\n" 在 expe
expect - 如何将调试信息重定向到 expect 脚本中的文本文件？
我的 shebang 看起来像这样: #!/usr/bin/expect -d 当我从命令行运行脚本时，它会提供我想要的内容。但是，我通过 crontab 运行这个脚本。是否可以将调试开关保持打开状
expect - 在一个 Expect 脚本中处理多个语句
我是 Expect 脚本的新手。我在 Linux 机器上为 ssh 编写了一个 Expect 脚本，在那里我在 ssh 到不同的 Linux 机器时遇到了问题。下面我复制了脚本。 !/usr/loc
actionscript-3 - 语法错误: expecting identifier before this. expecting colon before leftparen. expecting identifier before rightbrace
Scene 1, Layer 'script', Frame 1, Line 9 1084: Syntax error: expecting identifier before this. Sc
expect - log_file 命令不在 Expect 脚本中记录命令的输出
我正在运行调试命令以将命令的输出记录到文件中。我尝试了 log_file 命令，但它没有记录输出。我的代码如下: log_file -a gdb.txt send "~/debugulator.sh
Expect - expect_user 和 expect 的超时时间不同？
我希望 expect_user 有一个无限的(或非常大的)超时和 expect 的默认超时。有没有办法设置不同的超时？或者我是否只需要在每次更改用途之前手动执行此操作？最佳答案 expect 和ex
iOS内联if else编译错误: "Expected : "; "Expected expression"
我正在学习 iOS 编程(我来自 Android)，我正在寻找更容易获取字符串的方法。有了这个建议，我定义了下一个宏并在一些代码片段中使用它: #define STRING_BASE @"InfoPl
ruby-on-rails - Rspec expect( ) 与 expect { }
你好我是 rspec 的新手，我想弄清楚将 block 传递给 expect{} 和只使用 expect() 之间的区别这是一个简单的例子 require "rails_helper" RSpec.
reactjs - expect(received).toEqual(expected) - 错误
我正在尝试为 React JS 运行单元测试 - 使用 jest/enzyme。目前测试失败。不太清楚为什么，也许我没有正确调用 expect(wrapper.find)。这是我测试的一部分: F
expect - 如何在连接到 ssh 服务器时执行 expect 脚本
例如，现在我有一个“root.exp”期望脚本如下: spawn ssh user@ip expect "Password:" send "password" 然后，我要发送到这个ssh服务器的exp
expect - 使用 Expect 脚本将 IP 地址提取到变量
您好，我是 Expect 脚本编写的新手，我一直在尝试使用以下方法将 IP 地址获取到变量中: set timeout -1 spawn $env(SHELL) match_max 100000 se
javascript - expect.anything() 不适用于 expect.toBe()
expect.anything() 不适用于 expect.toBe()，但适用于 expect.toEqual() test("this will pass", () => { expect("
Linux shell : my `expect` script doesn't work as expected
我有一个如下所示的简单脚本，从命令行读取 2 个数字并将它们加在一起: $cat runexp.sh #!/bin/bash echo "read 1st number" read n1 echo "
linux - expect script + fit expect 以防不需要密码
当 Linux 机器的 $IP 登录后询问密码时，下面的 expect 脚本工作正常但在某些情况下，某些Linux机器不需要ssh密码(我们可以不用密码登录)，所以我需要更改我的期望脚本以支持没有
linux - Expect 脚本 - 发送字符串所需的引号与 expect 所需的引号冲突
我正在尝试使用 expect 远程登录服务器并更改用户密码。该应用程序要求，如果您要更改的密码包含特殊字符，则将其引用。问题是，还需要引用 expect send 语句，当我尝试将两者结合起来时，脚本
linux - expect + 如何识别 expect break 因为超时？
下面这个简单的 expect 脚本的目标是获取远程机器上的 hostname 名称有时期望脚本无法执行到 $IP_ADDRESS 的 ssh(因为远程机器不活动等) 所以在这种情况下，expect
rust - .expect( format!() ) : expected `&str` , 找到结构 `String`
我试图创建一个宏来替换， first: Some(first.as_ref().parse::().expect("Could not parse 'first'")) 我在其他模块(如 Clap w

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - Tensorflow/Keras : Input 0 of layer lstm is incompatible with the layer: expected ndim=3, 发现 ndim=2