python - 数据增强完成后会发生什么？-6ren

python - 数据增强完成后会发生什么？

转载作者：行者123 更新时间：2023-11-30 09:19:35

26

4

我使用 Kaggle 的“狗与猫”date set ，并按照TensorFlow的cifar-10教程(为了方便我没有使用权重衰减，移动平均和L2损失)，我已经成功地训练了我的网络，但是当我将数据增强部分添加到我的代码中时，奇怪的事情发生了，即使经过数千步，损失也从未减少(在添加之前，一切都很好)。代码如下:

def get_batch(image, label, image_w, image_h, batch_size, capacity, test_flag=False):
  '''
  Args:
      image: list type
      label: list type
      image_w: image width
      image_h: image height
      batch_size: batch size
      capacity: the maximum elements in queue 
      test_flag: create training batch or test batch
  Returns:
      image_batch: 4D tensor [batch_size, width, height, 3], dtype=tf.float32
      label_batch: 1D tensor [batch_size], dtype=tf.int32
  '''

  image = tf.cast(image, tf.string)
  label = tf.cast(label, tf.int32)

  # make an input queue
  input_queue = tf.train.slice_input_producer([image, label])

  label = input_queue[1]
  image_contents = tf.read_file(input_queue[0])
  image = tf.image.decode_jpeg(image_contents, channels=3)

  ####################################################################
  # Data argumentation should go to here
  # but when we want to do test, stay the images what they are

  if not test_flag:
     image = tf.image.resize_image_with_crop_or_pad(image, RESIZED_IMG, RESIZED_IMG)
     # Randomly crop a [height, width] section of the image.
     distorted_image = tf.random_crop(image, [image_w, image_h, 3])

    # Randomly flip the image horizontally.
     distorted_image = tf.image.random_flip_left_right(distorted_image)

    # Because these operations are not commutative, consider randomizing
    # the order their operation.
    # NOTE: since per_image_standardization zeros the mean and makes
    # the stddev unit, this likely has no effect see tensorflow#1458.
     distorted_image = tf.image.random_brightness(distorted_image, max_delta=63)

     image = tf.image.random_contrast(distorted_image, lower=0.2, upper=1.8)
  else:
     image = tf.image.resize_image_with_crop_or_pad(image, image_w, image_h)

  ######################################################################

  # Subtract off the mean and divide by the variance of the pixels.
  image = tf.image.per_image_standardization(image)
  # Set the shapes of tensors.
  image.set_shape([image_h, image_w, 3])
  # label.set_shape([1])

  image_batch, label_batch = tf.train.batch([image, label],
                                            batch_size=batch_size,
                                            num_threads=64,
                                            capacity=capacity)

  label_batch = tf.reshape(label_batch, [batch_size])
  image_batch = tf.cast(image_batch, tf.float32)

  return image_batch, label_batch

最佳答案

确保您使用的限制(例如，亮度的 max_delta=63、对比度的 upper=1.8)足够低，以便图像仍可识别。其他问题之一可能是一遍又一遍地应用增强，因此经过几次迭代后它完全扭曲了(尽管我没有在您的代码片段中发现这个错误)。

我建议您将数据可视化添加到tensorboard中。要可视化图像，请使用 tf.summary.image方法。您将能够清楚地看到增强的结果。

tf.summary.image('input', image_batch, 10)

This gist可以作为例子。

关于python - 数据增强完成后会发生什么？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/45002525/

26

4

0

文章推荐： java - 如何在 Java 中将类名解析为完全限定？

文章推荐： python - Keras 包安装

发生 VBA 编译错误
下面的代码旨在在首次打开工作簿时运行。 Sub Auto_Open() Dim LastRow As Integer LastRow = Sheet6.UsedRange.Rows.Count Act
发生 C++ 堆损坏检测错误
当我尝试操作我的代码时，除了弹出调试错误外，它执行得很好。错误信息在这里。我的完整代码在这里。 #include using namespace std; class String { publi
c# - 发生 XMLParseException
The invocation of the constructor on type 'WpfApplication1.MainWindow' that matches the specified bi
android - 发生 ArrayIndexOutOfBoundsException
我正在使用 BaseAdapter: public class MyAdapter extends BaseAdapter{ private final LayoutInflater mInflate
mysql - 发生 ER_PARSE_ERROR
我想做网页抓取。我写了代码 var connection = require('./mysqlConnection'); var c = new Crawler({ maxConnections
发生 Java 堆空间错误
我的系统中发生 Java 堆空间错误。我尝试了很多来自 Stack Overflow 的解决方案，但没有任何效果。当我工作时当按下 OK 然后 (我的项目没有错误) 我的 eclipse.ini 是
c++ - D3DXERR_INVALIDDATA 发生
环境: i5 750 DDR3 4GWin7 专业版 x64 sp1 DXSDK 9.0c 2010 年 6 月 GeForce GT240(驱动程序 275.33)512MB MSVC 2008 s
发生 Python 套接字错误
这段代码是我写的。 import socket host = 'localhost' port = 3794 s = socket.socket(socket.AF_INET, socket.SOCK
c# - 发生 DateTimeInvalidLocalFormat
我正在尝试引用 UTC 时间间隔获取本地日期时间，我正在执行下面的代码。 var dtString =DateTime.UtcNow.ToString(@"yyyy-MM-ddTHH\:mm\:ss
c# - LoadFromContext 发生
我有一个非常简单的 C# 问题，它从库中加载 Windows WPF 窗口。这是代码: public partial class App : Application { public App(
android - 发生 fragment 加载闪烁时带有导航组件的底部导航
我目前正在使用带有导航组件的底部导航，它工作正常但是当我们点击导航项 fragment 正在加载然后闪烁正在发生，即使当前选择的项目也会发生闪烁。它在加载 fragment 时发生。我的应用程序屏幕背
nullpointerexception - Kotlin NullPointerException 发生
我是新来的 kotlin , 当我开始 Null Safety 时，我对下面的情况感到困惑. There's some data inconsistency with regard to initia
css - 发生 css 转换时如何阻止我的文本移动
我有一个框，其中包含同时发生的两个独立的 css 转换。当转换发生时，图标下方的标题和段落文本移动位置参见 JS Fiddle:http://jsfiddle.net/Lsnbpt8r/ 这是我的
cordova - 发生 native 打包程序异常
在为黑莓 10 构建电话间隙应用程序时，我遇到了异常情况。 [BUILD] Populating application source [BUILD] Parsing config.xml [
java - 发生 JNI 代码错误时如何正确停止线程？
这个问题在这里已经有了答案: How to properly stop the Thread in Java? (8 个回答) 3年前关闭。我看过How to properly stop the T
发生 fatal error 时php重新加载页面
我试图弄清楚发生 fatal error 时如何刷新页面。基本上我正在访问图像 api 并将图像复制到我的服务器。我还每次都创建照片的缩略图版本。我会每隔一段时间收到一条错误消息，指出我的脚本试图分配
java - 使用断言检查元素是否在屏幕上，发生 NoSuchElementException
我正在尝试使用断言函数检查元素是否在屏幕上。我在我的测试应用程序 (AndroidDriver) 中使用 Appium 和 Java。我期望的是，如果元素在屏幕上，则返回 1；如果不在屏幕上，则返回
java - 发生 MaxUploadSizeExceededException 时如何关闭套接字？
我正在开发图像上传系统。我使用 CommonsMultipartResolver 设置 maxUploadSize。当我尝试上传超过最大尺寸的图像文件时，会发生 MaxUploadSizeExcced
java - 发生 UnsatisfiedDependencyException 错误
我有以下代码和@ComponentScan(basePackages = "com.project.shopping")，包结构为 com.project.shopping.Controller co
java - 发生 JNI 错误
我尝试运行此程序作为测试，但收到错误“发生了 JNI 错误，请检查您的安装并重试”，然后是“发生了 Java 异常”。关于如何解决这个问题有什么想法吗？ package java; public cl

首页

博学

6Ren·AI

商城

python - 数据增强完成后会发生什么？