tensorflow - 使用对象检测API的默认配置时，图像缩放器的不同尺寸有何影响-6ren

tensorflow - 使用对象检测API的默认配置时，图像缩放器的不同尺寸有何影响

转载作者：行者123 更新时间：2023-12-03 00:36:58

25

4

我尝试使用 Tensorflow 的对象检测 API 来训练模型。我正在使用更快的 rcnn resnet101 ( https://github.com/tensorflow/models/blob/master/object_detection/samples/configs/faster_rcnn_resnet101_voc07.config ) 的示例配置。
以下代码是我不太理解的配置文件的一部分:

image_resizer {
  keep_aspect_ratio_resizer {
    min_dimension: 600
    max_dimension: 1024
  }
}

我的问题是:

min_dimension 和 max_dimension 的确切含义是什么？这是否意味着输入图像的大小将调整为 600x1024 或 1024x600？
如果我有不同尺寸的图像，并且其中一些图像相对大于 600x1024(或 1024x600)，我可以/应该增加 min_dimension 和 max_dimension 的值？

我之所以有这样的疑问，是来自这篇文章: TensorFlow Object Detection API Weird Behaviour

在这篇文章中，作者自己也给出了这个问题的答案:

Then I decided to crop the input image and provide that as an input. Just to see if the results improve and it did!
It turns out that the dimensions of the input image were much larger than the 600 x 1024 that is accepted by the model. So, it was scaling down these images to 600 x 1024 which meant that the cigarette boxes were losing their details :)

它使用的配置与我使用的相同。我不确定是否可以更改这些参数，如果它们是这个特殊模型的默认或推荐设置，faster_rcnn_resnet101。

最佳答案

经过一些测试，我想我找到了答案。如有错误请指正。

在.config文件中:

image_resizer {
  keep_aspect_ratio_resizer {
    min_dimension: 600
    max_dimension: 1024
  }
}

根据'object_detection/builders/image_resizer_builder.py'的图像缩放设置

if image_resizer_config.WhichOneof(
    'image_resizer_oneof') == 'keep_aspect_ratio_resizer':
  keep_aspect_ratio_config = image_resizer_config.keep_aspect_ratio_resizer
  if not (keep_aspect_ratio_config.min_dimension
          <= keep_aspect_ratio_config.max_dimension):
    raise ValueError('min_dimension > max_dimension')
  return functools.partial(
      preprocessor.resize_to_range,
      min_dimension=keep_aspect_ratio_config.min_dimension,
      max_dimension=keep_aspect_ratio_config.max_dimension)

然后它尝试使用“object_detection/core/preprocessor.py”的“resize_to_range”函数

  with tf.name_scope('ResizeToRange', values=[image, min_dimension]):
    image_shape = tf.shape(image)
    orig_height = tf.to_float(image_shape[0])
    orig_width = tf.to_float(image_shape[1])
    orig_min_dim = tf.minimum(orig_height, orig_width)

    # Calculates the larger of the possible sizes
    min_dimension = tf.constant(min_dimension, dtype=tf.float32)
    large_scale_factor = min_dimension / orig_min_dim
    # Scaling orig_(height|width) by large_scale_factor will make the smaller
    # dimension equal to min_dimension, save for floating point rounding errors.
    # For reasonably-sized images, taking the nearest integer will reliably
    # eliminate this error.
    large_height = tf.to_int32(tf.round(orig_height * large_scale_factor))
    large_width = tf.to_int32(tf.round(orig_width * large_scale_factor))
    large_size = tf.stack([large_height, large_width])

    if max_dimension:
      # Calculates the smaller of the possible sizes, use that if the larger
      # is too big.
      orig_max_dim = tf.maximum(orig_height, orig_width)
      max_dimension = tf.constant(max_dimension, dtype=tf.float32)
      small_scale_factor = max_dimension / orig_max_dim
      # Scaling orig_(height|width) by small_scale_factor will make the larger
      # dimension equal to max_dimension, save for floating point rounding
      # errors. For reasonably-sized images, taking the nearest integer will
      # reliably eliminate this error.
      small_height = tf.to_int32(tf.round(orig_height * small_scale_factor))
      small_width = tf.to_int32(tf.round(orig_width * small_scale_factor))
      small_size = tf.stack([small_height, small_width])

      new_size = tf.cond(
          tf.to_float(tf.reduce_max(large_size)) > max_dimension,
          lambda: small_size, lambda: large_size)
    else:
      new_size = large_size

    new_image = tf.image.resize_images(image, new_size,
                                       align_corners=align_corners)

从上面的代码中，我们可以知道是否有一张尺寸为800*1000的图像。最终输出图像的尺寸为600*750。

也就是说，此图像调整器将始终根据“min_dimension”和“max_dimension”的设置调整您的输入图像的大小。

关于tensorflow - 使用对象检测API的默认配置时，图像缩放器的不同尺寸有何影响，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/45137835/

25

4

0

文章推荐： ruby-on-rails - 如何修改rails 4中的bootstrap flash消息

文章推荐： tensorflow - 计算整个训练集的准确率

文章推荐： php - 如何隐藏实际的下载文件夹位置

文章推荐： Tensorflow cifar同步点

java - 配置 logback 以遵循 Java 配置，也就是 Logback 的纯 Java 配置
我只是不喜欢 Logback 的 XML 或 Groovy 配置，而更喜欢用 Java 进行配置(这也是因为我将在初始化后的不同时间在运行时更改配置)。似乎对 Logback 进行 Java 配置的
yaml - sphinx 配置 ||配置/sphinx.yml
我的 sphinx 配置是: ================================ config/sphinx.yml development: bin_path: "/usr/loc
Sitecore 性能优化 - Sitecore 配置、IIS 配置
我们计划在生产服务器中部署我们的系统。我有兴趣了解更多有关优化网站性能的信息。 Sitecore 有哪些优化建议？ (缓存，网络配置中的其他设置) 我们可以在 IIS 中做哪些优化？找不到关于这些主
python - 根目录上静态站点的 Apache 配置，子位置上的 Django 配置
我有一个 Django 应用程序，可以处理网站的两个(或更多)部分，例如网站的“admin”和“api”部分。我还为网站的其余部分提供了普通的 html 页面，其中不需要 Django。例如，我希望
node.js - 配置 Dockerfile 以设置 AWS 配置
我刚刚开始研究Docker。我有一个 Node 应用程序，可以调整大小和图像，然后在完成后向 aws 发送 SQS 消息。我已成功创建应用程序的 docker 镜像，并从本地计算机复制它，但遇到了无法
ant - 如何在 Hudson 中为 Ant 配置 checkstyle 配置？
如何配置 checkstyle(在 Ant nt Maven 中)任务？我尝试了一点，但没有正确收到报告。这是我的 Ant 脚本。
java - 如何将 xml 配置 bean 转换为 Java 配置 bean？
我正在使用 Quartz 和 Spring 框架重写一个遗留项目。原始配置是 XML 格式，现在我将其转换为 Java Config。 xml 配置使用 jobDetail 设置触发器 bean 的作
mysql - 最佳 Mysql 配置(分区)和索引/Hypertable/RAID 配置(大数据库)
tl;rd: 使用主键对数据库进行分区索引大小问题。数据库大小每天增长约 1-3 GB 突袭设置。您有使用 Hypertable 的经验吗？长版: 我刚刚建立/购买了一个家庭服务器: 至强 E
使用图形 API 配置 GCP 的 Azure AD saml 配置 "Sign on URL"
在安装 gcp 应用程序后，我们尝试使用 GCP 的图形 api 配置 Azure Active Directory saml 配置。我们正在遵循相同的 AWS graph api saml 设置 U
java - 如何使用 Spring-Security 3 和 Hibernate 4 将 spring security xml 配置 hibernate 转换为 java 配置
我刚刚了解了 spring security 并想使用 java hibernate 配置连接到数据库，但我发现的示例或教程很少。我通过使用 xml 配置找到了更多。我在这里使用 Spring 4.0
java - 是否可以通过 jboss-deployment-struction.xml 配置 JPA 2.1，为 Spring 4 和 Hibernate 4.3.10 配置 JBoss EAP 6.4.x？
我们最近切换到 Java 8 以使用 java.time API(LocalDate、LocalDateTime，...)。因此，我们将 Hibernate 依赖项更新到版本 4.3.10。我们编写了
quarkus实战之六：配置
欢迎访问我的GitHub 这里分类和汇总了欣宸的全部原创(含配套源码)：https://github.com/zq2599/blog_demos 本篇概览本文是《quarkus实战》系列的第六篇，咱
NGINX 配置 :
我是 NGINX 的新手，我正在尝试对我们的 ERP 网络服务器进行负载平衡。我有 3 个网络服务器在由 websphere 提供支持的端口 80 上运行，这对我来说是一个黑盒子: * web01.e
Gerrit 配置
我们想使用 gerrit 进行代码审查，但我们在 webview 中缺少一些设置。是否可以禁止提交者审查/验证他们自己的提交？是否有可能两个审稿人给 +1 一个累积它到+2，以便可以提交？谢
AEM 配置
配置根据运行模式应用于 AEM 实例。在多个运行模式和多个配置的情况下，AEM 如何确定要选择的配置文件？假设以下配置在 AEM 项目中可用， /apps /myproject - con
Neo4j 配置
我正在使用 Neo4j 服务器。我遇到了负载相对较低的问题。但是，响应时间相当长。我认为为请求提供服务的线程数太少了。有没有办法调整为 HTTP 请求提供服务的线程池的大小。那可能吗？最佳答案线程
CELERYD_OPTS 配置
我在/etc/default/celeryd 中有以下配置 CELERYD_NODES = "worker1 worker2 worker3" CELERYD_CHDIR = "path to pro
Plone 配置
Plone 在其页面中显示来 self 的母语(巴西葡萄牙语)的特殊字符。但是，当我使用我创建的 spt 页面时，它会显示转义序列，例如: Educa\xc3\xa7\xc3\xa3o 代替 Educ
Emacs 配置
我正在尝试开始使用 Emacs/Clojure。安装 emacs 扩展的正确方法是什么。我正在尝试安装以下插件: https://bitbucket.org/kotarak/vimclojure 我已
CMake 配置
我有一个简单的 C 项目结构: proj/ src/ docs/ build/ tests/ lib/ 尝试编写合适的 CMake 文件。到目前为止我的尝试:http://pas

首页

博学

6Ren·AI

商城

tensorflow - 使用对象检测API的默认配置时，图像缩放器的不同尺寸有何影响