tensorflow - 如何在 Tensorflow Estimator 的 input_fn 中执行数据增强

转载作者：行者123 更新时间：2023-11-30 09:17:27

26

4

使用 Tensorflow 的 Estimator API，我应该在管道中的哪个点执行数据增强？

据此官方Tensorflow guide ，执行数据增强的一个位置是 input_fn:

def parse_fn(example):
  "Parse TFExample records and perform simple data augmentation."
  example_fmt = {
    "image": tf.FixedLengthFeature((), tf.string, ""),
    "label": tf.FixedLengthFeature((), tf.int64, -1)
  }
  parsed = tf.parse_single_example(example, example_fmt)
  image = tf.image.decode_image(parsed["image"])

  # augments image using slice, reshape, resize_bilinear
  #         |
  #         |
  #         |
  #         v
  image = _augment_helper(image)

  return image, parsed["label"]

def input_fn():
  files = tf.data.Dataset.list_files("/path/to/dataset/train-*.tfrecord")
  dataset = files.interleave(tf.data.TFRecordDataset)
  dataset = dataset.map(map_func=parse_fn)
  # ...
  return dataset

我的问题

如果我在 input_fn 内执行数据增强，parse_fn 是否返回单个示例或包含原始输入图像 + 所有增强变体的批处理？如果它只返回一个[增强]示例，我如何确保数据集中的所有图像以及所有变体都以其未增强的形式使用？

最佳答案

如果您在数据集上使用迭代器，则您的 _augment_helper 函数将在输入的每个数据 block 上的数据集的每次迭代中被调用(就像您在 dataset.map 中调用 parse_fn 一样)

将代码更改为

  ds_iter = dataset.make_one_shot_iterator()
  ds_iter = ds_iter.get_next()
  return ds_iter

我已经用一个简单的增强函数对此进行了测试

  def _augment_helper(image):
       print(image.shape)
       image = tf.image.random_brightness(image,255.0, 1)
       image = tf.clip_by_value(image, 0.0, 255.0)
       return image

将 255.0 更改为数据集中的最大值，我使用 255.0 作为示例的数据集采用 8 位像素值

关于tensorflow - 如何在 Tensorflow Estimator 的 input_fn 中执行数据增强，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/51594027/

26

4

0

文章推荐： machine-learning - keras新手: how to get better accuracy

文章推荐： java - schemagen:如何共享类，而不是它们的命名空间？

文章推荐： javascript - 单击外部事件 div 以停用功能

文章推荐： java - Spring 不使用默认语言环境中的源

machine-learning - TensorFlow 中 tf.estimator.Estimator 和 tf.contrib.learn.Estimator 有什么区别
几个月前，我使用了tf.contrib.learn.DNNRegressor来自 TensorFlow 的 API，我发现它使用起来非常方便。最近几个月我没有跟上TensorFlow的发展。现在我有一
tensorflow - 使用 tf.estimator.Estimator 加载检查点和微调
我们正在尝试将旧的训练代码转换为更符合 tf.estimator.Estimator 的代码。在初始代码中，我们针对目标数据集微调原始模型。在使用 variables_to_restore 和 ini
python - tf.estimator.Estimator 不记录任何事件文件，张量板不显示任何内容
我目前运行的是 TensorFlow 1.9.0。我的自定义估算器是使用 tf.estimator.Estimator 创建的，并且运行时没有出现任何故障。但是，我在 model_dir 下没有找到任
python - 如何从检查点使用 tf.estimator.Estimator 进行预测？
我刚刚用 tensorflow 训练了一个 CNN 来识别太阳黑子。我的模型与 this 几乎相同.问题是我无法在任何地方找到关于如何使用训练阶段生成的检查点进行预测的明确解释。尝试使用标准恢复方法
python - 使用 tf.estimator.Estimator 框架进行迁移学习
我正在尝试使用我自己的数据集和类对在 imagenet 上预训练的 Inception-resnet v2 模型进行迁移学习。我的原始代码库是对 tf.slim 的修改我再也找不到的示例，现在我正在尝
python - 如何从 tf.estimator.Estimator 获取最后一个 global_step
在 train(...) 完成后，如何从 tf.estimator.Estimator 获取最后一个 global_step ？例如，典型的基于估算器的训练例程可能如下设置: n_epochs = 1
tensorflow - tf.estimator.Estimator.train() 是否保持 input_fn 状态
一年多来我一直在使用自己的 Estimator/Experiment 之类的代码，但我最终想加入 Dataset+Estimator 的行列。我想做如下的事情: for _ in range(N):
python-3.x - 如何将张量板与 tf.estimator.Estimator 一起使用
我正在考虑将我的代码库移动到 tf.estimator.Estimator ，但我找不到如何将它与张量板摘要结合使用的示例。 MWE: import numpy as np import tensor
python - tf.estimator.Estimator.evaluate() 是否总是在一个 GPU 上运行？
我的印象是在 tf.estimator.Estimator 实例上调用 evaluate() 不会在多个 GPU 上运行模型，即使分配策略是 MirroredStrategy，配置为至少使用 2 个
python - 如何使用MonitoredTrainingSession像 `global_step/sec`一样打开日志 `tf.estimator.Estimator`？
我遇到了一些小问题，但我不知道如何处理。当我使用 tf.estimator.Estimator 时，它会在每个步骤中记录两行，例如: INFO:tensorflow:global_step/sec:
python - 如何在 tf.estimator.Estimator() 中记录 tensorflow 层输出
在此tutorial ，他们通过为 tf.nn.softmax 节点命名成功地记录了 softmax 函数。 tf.nn.softmax(logits, name="softmax_tensor")
python - 推荐什么？ tensorflow train_and_evaluate 或 estimator.train, estimator.evaluate
我发现 tensorflow train_and_evaluate 的工作方式与传统的 tf.estimator train 和 evaluate 相比有点不同。train_and_evaluate
python - 模块 'tensorflow_estimator.python.estimator.api._v2.estimator' 没有属性 'inputs'
我正在使用 tensorflow 版本 2.0.0-beta1。打电话时 tf.estimator.inputs.pandas_input_fn 它给了我一个错误。 module 'tensorflo
python - Tensorflow，在另一个 tf.estimator model_fn 中使用 tf.estimator 训练模型
有没有办法在另一个模型 B 中使用经过 tf.estimator 训练的模型 A？这是情况，假设我有一个训练有素的“模型 A”和 model_a_fn()。“模型 A”获取图像作为输入，并输出一些类
tensorflow - Estimator 的 model_fn 包含 params 参数，但 params 不会传递给 Estimator
我正在尝试在本地运行对象检测 API。我相信我已经按照 TensorFlow Object Detection API 中的描述设置了所有内容。但是，当我尝试运行 model_main.py 时，会
python - gridSearch in loop **estimator should be an estimator implementing 'fit' method, 0 was passed** 错误
请原谅我的编码经验。我正在尝试使用 GridSearch 进行一系列回归。我正在尝试循环整个过程以使过程更快，但我的代码不够好并且不介意提高效率。这是我的代码: classifiers=[Lasso(
python - 使用 `tensorflow.python.keras.estimator.model_to_estimator` 将 Keras 模型转换为 Estimator API 时如何通知类权重？
我在将纯 Keras 模型转换为不平衡数据集上的 TensorFlow Estimator API 时遇到了一些麻烦。使用纯 Keras API 时，class_weight 参数在 model.f
python - 当使用 tf-tutorials 运行时，发生了 :AttributeError: module 'tensorflow.python.estimator.api.estimator' has no attribute 'SessionRunHook'
当发生上述错误时，我经常使用有关估计器的tensorflow官方教程，而它在google.colab中正常运行。我使用的环境是win10-64bit＆tensorflow-gpu==1.12.0＆p
estimation - 不花费大量时间进行估算的最佳方法是什么？
Closed. This question is opinion-based。它当前不接受答案。想要改善这个问题吗？更新问题，以便editing this post用事实和引用来回答。已关闭6年。
estimation - 您如何完善估算过程？
Closed. This question is opinion-based。它当前不接受答案。想要改善这个问题吗？更新问题，以便editing this post用事实和引用来回答。 1年前关闭。

首页

博学

6Ren·AI

商城

tensorflow - 如何在 Tensorflow Estimator 的 input_fn 中执行数据增强

我的问题