tensorflow - Invalid argument: Input size should match but they differ by 2


I am trying to train a DL model using tf.keras. My image directory contains 67 classes of images, such as airport, bookstore, and casino, with at least 100 images per class. The data comes from the MIT Indoor Scenes dataset, but whenever I try to train the model I keep getting this error:

tensorflow.python.framework.errors_impl.InvalidArgumentError: 2 root error(s) found.
(0) Invalid argument: Input size should match (header_size + row_size * abs_height) but they differ by 2
[[{{node decode_image/DecodeImage}}]]
[[IteratorGetNext]]
(1) Invalid argument: Input size should match (header_size + row_size * abs_height) but they differ by 2
[[{{node decode_image/DecodeImage}}]]
[[IteratorGetNext]]
[[IteratorGetNext/_7]]
0 successful operations.
0 derived errors ignored. [Op:__inference_train_function_1570]

Function call stack:
train_function -> train_function
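
For context, image_dataset_from_directory infers one label per sub-directory, so the on-disk layout is expected to look roughly like this (folder names are illustrative, taken from the classes mentioned above):

data/Images/
├── airport/
│   ├── image_0001.jpg
│   └── ...
├── bookstore/
│   └── ...
└── casino/
    └── ...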

I tried to solve the problem by resizing the images with a Resizing layer, by passing labels='inferred' and label_mode='categorical' to the image_dataset_from_directory method, and by using loss='categorical_crossentropy' in the model's compile method. Previously labels and label_mode were not set and the loss was sparse_categorical_crossentropy, which I thought was incorrect, so I changed them as described above. But I still get the error.
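
For what it's worth, neither pairing is wrong by itself: the two losses compute the same quantity and differ only in the label encoding they expect. label_mode="int" (the default) yields integer class ids for sparse_categorical_crossentropy; label_mode="categorical" yields one-hot vectors for categorical_crossentropy. A minimal sketch demonstrating the equivalence, assuming TF 2.x:

import numpy as np
import tensorflow as tf

num_classes = 67
int_labels = np.array([3, 41])                  # what label_mode="int" produces
one_hot = tf.one_hot(int_labels, num_classes)   # what label_mode="categorical" produces
probs = tf.nn.softmax(tf.random.uniform((2, num_classes)))

sparse = tf.keras.losses.sparse_categorical_crossentropy(int_labels, probs)
dense = tf.keras.losses.categorical_crossentropy(one_hot, probs)
print(np.allclose(sparse.numpy(), dense.numpy()))  # True: same value, different label format

So a mismatched loss would raise a shape error, not a decode error like the one above.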

There is a related question on Stack Overflow, but its author did not explain how they solved the problem; they only added an update: "My suggestion is to check the metadata of the dataset. It helped to fix my problem." There is no mention of which metadata to look for or what they actually did.
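
If "metadata" means anything concrete here, it is most likely the real file format versus the file extension: TF's DecodeImage op dispatches on the file's magic bytes, so for example a BMP saved with a .jpg extension still goes through the BMP decoder and can fail its size check. A minimal sketch with Pillow (the data/Images path is assumed to match the script below) that surfaces unreadable files and extension mismatches:

from pathlib import Path
from PIL import Image

# extensions that legitimately carry each format Pillow can report here
expected = {"JPEG": {".jpg", ".jpeg"}, "PNG": {".png"}, "BMP": {".bmp"}, "GIF": {".gif"}}

for path in Path("data/Images").rglob("*.*"):
    try:
        with Image.open(path) as img:
            fmt = img.format  # what the bytes actually are, regardless of extension
        if fmt in expected and path.suffix.lower() not in expected[fmt]:
            print(f"{path}: extension {path.suffix} but content is {fmt}")
    except Exception as exc:
        print(f"{path}: Pillow cannot parse it ({exc})")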

The code I use to train the model:

import os

# silence TF's C++ logs; this must be set before tensorflow is imported to take effect
os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2'

import PIL
import numpy as np
import pandas as pd
import tensorflow as tf
from tensorflow.keras.layers import Conv2D, Dense, MaxPooling2D, GlobalAveragePooling2D
from tensorflow.keras.layers import Flatten, Dropout, BatchNormalization, Rescaling
from tensorflow.keras.models import Sequential
from tensorflow.keras.callbacks import ModelCheckpoint, EarlyStopping
from tensorflow.keras.regularizers import l1, l2
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
from pathlib import Path

# define directory paths
PROJECT_PATH = Path.cwd()
DATA_PATH = PROJECT_PATH.joinpath('data', 'Images')

# create a dataset
batch_size = 32
img_height = 180
img_width = 180

train = tf.keras.utils.image_dataset_from_directory(
    DATA_PATH,
    validation_split=0.2,
    subset="training",
    labels="inferred",
    label_mode="categorical",
    seed=123,
    image_size=(img_height, img_width),
    batch_size=batch_size
)

valid = tf.keras.utils.image_dataset_from_directory(
    DATA_PATH,
    validation_split=0.2,
    subset="validation",
    labels="inferred",
    label_mode="categorical",
    seed=123,
    image_size=(img_height, img_width),
    batch_size=batch_size
)

class_names = train.class_names

for image_batch, label_batch in train.take(1):
    print("\nImage shape:", image_batch.shape)
    print("Label Shape", label_batch.shape)

# resize images (redundant here: image_dataset_from_directory already resizes to image_size, but harmless)
resize_layer = tf.keras.layers.Resizing(img_height, img_width)
train = train.map(lambda x, y: (resize_layer(x), y))
valid = valid.map(lambda x, y: (resize_layer(x), y))

# standardize the data
normalization_layer = tf.keras.layers.Rescaling(1./255)
train = train.map(lambda x, y: (normalization_layer(x), y))
valid = valid.map(lambda x, y: (normalization_layer(x), y))

image_batch, labels_batch = next(iter(train))
first_image = image_batch[0]
print("\nImage (min, max) value:", (np.min(first_image), np.max(first_image)))
print()

# configure the dataset for performance
AUTOTUNE = tf.data.AUTOTUNE

train = train.cache().prefetch(buffer_size=AUTOTUNE)
valid = valid.cache().prefetch(buffer_size=AUTOTUNE)


# create a basic model architecture

num_classes = len(class_names)

# initiate a sequential model
model = Sequential()

# CONV1
model.add(Conv2D(filters=64, kernel_size=3, activation="relu",
                 input_shape=(img_height, img_width, 3)))
model.add(BatchNormalization())

# CONV2
model.add(Conv2D(filters=64, kernel_size=3,
                 activation="relu"))
model.add(BatchNormalization())

# Pool + Dropout
model.add(MaxPooling2D(pool_size=2))
model.add(Dropout(0.3))

# CONV3
model.add(Conv2D(filters=128, kernel_size=3,
                 activation="relu"))
model.add(BatchNormalization())

# CONV4
model.add(Conv2D(filters=128, kernel_size=3,
                 activation="relu"))
model.add(BatchNormalization())

# POOL + Dropout
model.add(MaxPooling2D(pool_size=2))
model.add(Dropout(0.3))

# FC5
model.add(Flatten())
model.add(Dense(128, activation="relu"))
model.add(Dense(num_classes, activation="softmax"))


# compile the model

model.compile(loss="categorical_crossentropy",
              optimizer="adam", metrics=['accuracy'])

# train the model
epochs = 25
early_stopping_cb = EarlyStopping(patience=10, restore_best_weights=True)

history = model.fit(train, validation_data=valid, epochs=epochs,
                    callbacks=[early_stopping_cb], verbose=2)

result = pd.DataFrame(history.history)
print()
print(result.head())

Note: I deliberately simplified the code as much as possible to narrow down the error. The model runs for a few batches before the error above appears again, which suggests it is only triggered when a particular batch (i.e. a particular file) is loaded:

Epoch 1/10
732/781 [===========================>..] - ETA: 22s - loss: 3.7882Traceback (most recent call last):
File ".\02_model1.py", line 139, in <module>
model.fit(train, epochs=10, validation_data=valid)
File "C:\Users\BHOLA\anaconda3\lib\site-packages\keras\engine\training.py", line 1184, in fit
tmp_logs = self.train_function(iterator)
File "C:\Users\BHOLA\anaconda3\lib\site-packages\tensorflow\python\eager\def_function.py", line 885, in __call__
result = self._call(*args, **kwds)
File "C:\Users\BHOLA\anaconda3\lib\site-packages\tensorflow\python\eager\def_function.py", line 917, in _call
return self._stateless_fn(*args, **kwds) # pylint: disable=not-callable
File "C:\Users\BHOLA\anaconda3\lib\site-packages\tensorflow\python\eager\function.py", line 3039, in __call__
return graph_function._call_flat(
File "C:\Users\BHOLA\anaconda3\lib\site-packages\tensorflow\python\eager\function.py", line 1963, in _call_flat
return self._build_call_outputs(self._inference_function.call(
File "C:\Users\BHOLA\anaconda3\lib\site-packages\tensorflow\python\eager\function.py", line 591, in call
outputs = execute.execute(
File "C:\Users\BHOLA\anaconda3\lib\site-packages\tensorflow\python\eager\execute.py", line 59, in quick_execute
tensors = pywrap_tfe.TFE_Py_Execute(ctx._handle, device_name, op_name,
tensorflow.python.framework.errors_impl.InvalidArgumentError: 2 root error(s) found.
(0) Invalid argument: Input size should match (header_size + row_size * abs_height) but they differ by 2
[[{{node decode_image/DecodeImage}}]]
[[IteratorGetNext]]
(1) Invalid argument: Input size should match (header_size + row_size * abs_height) but they differ by 2
[[{{node decode_image/DecodeImage}}]]
[[IteratorGetNext]]
[[IteratorGetNext/_2]]
0 successful operations.
0 derived errors ignored. [Op:__inference_train_function_11840]

Function call stack:
train_function -> train_function

Modified code:

# create a dataset
batch_size = 16
img_height = 256
img_width = 256

train = image_dataset_from_directory(
    DATA_PATH,
    validation_split=0.2,
    subset="training",
    labels="inferred",
    label_mode="categorical",
    seed=123,
    image_size=(img_height, img_width),
    batch_size=batch_size
)

valid = image_dataset_from_directory(
    DATA_PATH,
    validation_split=0.2,
    subset="validation",
    labels="inferred",
    label_mode="categorical",
    seed=123,
    image_size=(img_height, img_width),
    batch_size=batch_size
)

model = tf.keras.applications.Xception(
    weights=None, input_shape=(img_height, img_width, 3), classes=67)
model.compile(optimizer='rmsprop', loss='categorical_crossentropy')
model.fit(train, epochs=10, validation_data=valid)

Best Answer

I think this is probably a corrupted file. The exception is raised after a data-integrity check in the DecodeBMPv2 function (https://github.com/tensorflow/tensorflow/blob/0b6b491d21d6a4eb5fbab1cca565bc1e94ca9543/tensorflow/core/kernels/image/decode_image_op.cc#L594).

If that is the problem and you want to find out which files are raising the exception, you can try something like the following on the directory that contains them. Remove or replace any files it finds and the model should train fine.

import os
import glob
import tensorflow as tf

# assuming you point to the directory containing the label folders
img_paths = glob.glob(os.path.join(<path_to_dataset>, '*/*.*'))

bad_paths = []

for image_path in img_paths:
    try:
        img_bytes = tf.io.read_file(image_path)
        decoded_img = tf.io.decode_image(img_bytes)
    except tf.errors.InvalidArgumentError as e:
        print(f"Found bad path {image_path}...{e}")
        bad_paths.append(image_path)
    else:
        print(f"{image_path}: OK")

print("BAD PATHS:")
for bad_path in bad_paths:
    print(f"{bad_path}")

The original question and answer on "tensorflow - Invalid argument: Input size should match but they differ by 2" can be found on Stack Overflow: https://stackoverflow.com/questions/69607117/
