pytorch - torchvision 的 AnchorGenerator 中的 anchor 框大小是否与输入图像、特征图或其他内容有关？-6ren

pytorch - torchvision 的 AnchorGenerator 中的 anchor 框大小是否与输入图像、特征图或其他内容有关？

转载作者：行者123 更新时间：2023-12-05 05:56:01

26

4

这不是关于 anchor 框、Faster-RCNN 或任何与理论相关的一般性问题。这是一个关于如何在pytorch中实现 anchor 框的问题，因为我是新手。我已经阅读了这段代码，以及 torch 仓库中的许多其他内容:

https://github.com/pytorch/vision/blob/main/torchvision/models/detection/anchor_utils.py

AnchorGenerator 的“大小”参数是关于原始图像大小，还是关于从主干输出的特征图？

为了更加清晰和简化，假设我只对检测输入图像中 32x32 像素的对象感兴趣。所以我的 anchor 框长宽比肯定是 1.0，因为高度=宽度。但是，我放入 AnchorGenerator 32 的大小是多少？或者我是否需要使用主干进行一些数学运算(例如，我有 2 个 2x2 最大池化层，步幅为 2，所以我给 AnchorGenerator 的大小应该是 32/(2^2) = 8)？

最佳答案

Is the "sizes" argument to AnchorGenerator with respect to theoriginal image size, or with respect to the feature map being outputfrom the backbone?

sizes 参数是应用于输入图像的每个边界框的大小。如果您有兴趣检测 32x32 像素的对象，您应该使用

anchor_generator = AnchorGenerator(sizes=((32,),),
                                   aspect_ratios=((1.0,),))

关于pytorch - torchvision 的 AnchorGenerator 中的 anchor 框大小是否与输入图像、特征图或其他内容有关？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/69307589/

26

4

0

文章推荐： python - 一维卷积层尺寸不匹配问题

python - 如何进行自定义 torchvision 转换？
我有一个以 20% 的几率改变图像像素的函数，但不确定如何让它在 transforms.Compose([]) 中工作。请帮忙! def random_t(img): im = Image.o
python - 有没有办法检索在随机 Torchvision 变换中使用的特定参数？
我可以在训练期间通过应用随机变换(旋转/平移/重新缩放)来增加我的数据，但我不知道选择的值。我需要知道应用了哪些值。我可以手动设置这些值，但是我失去了 Torch Vision 转换提供的很多好处。
python - MNIST、torchvision 中的输出和广播形状不匹配
在 Torchvision 中使用 MNIST 数据集时出现以下错误 RuntimeError: output with shape [1, 28, 28] doesn't match the bro
python - 有没有办法通过字符串加载 torchvision 模型？
这个问题在这里已经有了答案: How to call Python function by name dynamically using a string? (1 个回答) 关闭去年。目前，我使用
python - 修复 torchvision 变换的随机种子
我使用一些类似于以下的代码 - 用于数据增强: from torchvision import transforms #... augmentation = transform
python - Keras、TorchVision 中的预训练模型
我有以下代码，它使用 Keras 中预先训练的 ResNet50 模型和 imagenet 数据集: from keras.applications.resnet50 import ResNet50
python - Pyinstaller 可执行文件无法导入 torchvision
这是我的main.py: import torchvision input("Press key") 在命令行中正确运行:python main.py 我需要一个适用于 Windows 的可执行文
python - 尽管已安装，但无法加载 torchvision
我已经使用以下方法安装了 pytorch 和 torchvision: conda install pytorch-cpu -c pytorch pip install torchvision 当我尝
image-processing - 获取 torchvision 预训练网络的分类标签
Pytorch 的 torchvision 包提供了用于图像分类的 pre-trained neural networks。我一直在使用以下代码使用 Alexnet 对图像进行分类(注意:部分代码来自
pytorch - 没有这样的运算符(operator) torchvision::nms
当我尝试运行 yoloV3 检测时，发生了错误 op = torch._C._jit_get_operation(qualified_op_name) RuntimeError: No such op
python - 将 Torchvision ImageFolder 与测试集结合使用
我正在尝试使用 Sample Notebook 解决 Kaggle 上的 Dogs-vs-Cats 挑战Udacity 类(class)中提供了这一点。我已将文件重新排列到 train/ 目录中的两个
image-processing - 仅使用预训练的 torchvision 网络的某些层
我尝试仅使用预训练的 torchvision Faster-RCNN 网络中的某些层，该网络初始化为: model = torchvision.models.detection.fasterrcnn_
python - torchvision 没有安装 torch 怎么办？
不知何故，当我进行安装时，它会安装 torchvision 但不会安装 torch。我按照主网站的指示运行的命令: conda install pytorch torchvision cudatool
python - 使用 torchvision 下载 celebA 数据集时出错
使用 torchvision 模块数据集，我无法下载 celebA 图像数据集。我很确定我做的一切都是正确的。 dataset = datasets.CelebA( root='../data
python - Pytorch - 无法切片 torchvision MNIST 数据集
在Pytorch中，当使用torchvision的MNIST数据集时，我们可以得到一个数字如下: from torchvision import datasets, transforms from t
python - Faster R-CNN torchvision 实现的说明
我正在挖掘 source code torchvision 的 Faster R-CNN 实现我正面临一些我不太明白的事情。也就是说，假设我想创建一个 Faster R-CNN 模型，而不是在 COC
python - 如何修改 PyTorch 中的预训练 Torchvision 模型以返回两个输出以进行多标签图像分类
输入:一组十个“元音”，一组十个“辅音”，图像数据集，其中每个图像中都写有一个元音和一个辅音。任务:从给定图像中识别元音和辅音。方法:首先在图像上应用 CNN 隐藏层，然后应用两个平行的全连接/密
pytorch - 无法从 'ResNet50_Weights' 导入名称 'torchvision.models.resnet'
我之前成功加载了带有 ResNet50_Weights 参数的 ResNet 模型，但突然我开始收到以下错误: Traceback (most recent call last): File "s
image - 在同一图像张量上两次使用 torchvision.utils.save_image 会使第二次保存无效。这是怎么回事？
(此处详细介绍了快速梯度符号攻击方法:https://pytorch.org/tutorials/beginner/fgsm_tutorial.html) 我有一个训练有素的分类器，准确率 >90%，
python - 通过 torchvision 下载 pytorch 数据集时出现 SSLCertVerificationError
我在从 pytorch 下载 CIFAR-10 数据集时遇到问题。大多数情况下，它似乎是一些我真的不知道如何解释的 SSL 错误。我也尝试过将根目录更改为其他各种文件夹，但它们都不起作用。我想知道这是

首页

博学

6Ren·AI

商城

pytorch - torchvision 的 AnchorGenerator 中的 anchor 框大小是否与输入图像、特征图或其他内容有关？