python - 为什么对于 Keras 中的 Theano 和 Tensorflow 后端，一对图像之间的 L2 距离会得到不同的结果？-6ren

python - 为什么对于 Keras 中的 Theano 和 Tensorflow 后端，一对图像之间的 L2 距离会得到不同的结果？

转载作者：行者123 更新时间：2023-11-30 09:20:06

我附上一个脚本，该脚本从 Keras 的预训练 Resnet50 卷积网络中提取特征。您可以向它提供任何一对图像，它会从网络的某一层打印出特征嵌入的 L2 距离(我选择“activation_43”)。

我的问题是，在 Keras 使用的两个可用的数值计算后端(Theano 和 Tensorflow)之间切换时，我得到不同的结果。据我所知，我已经尝试处理它们各自的图像轴约定，但我一定忽略了一些东西。

以下是打印一对图像的 L2 距离的代码:

'''

Compare the L2 distance between features extracted from 2 images. Which specific images we use doesn't matter --
what we're interested in comparing is the L2 distance between an image pair in the THEANO backend vs the TENSORFLOW
backend.

I pasted my personal results at the bottom of the script in comments.

usage: $python this_script.py image1.jpg image2.jpg

'''

import cv2
import numpy as np
import keras.backend as K
from keras.applications import ResNet50
from keras.models import Model
from sklearn.preprocessing import normalize
import sys

def preprocess_cv2(images, dim_ordering='default'):
    '''
    :param images: rank 4 tensor of concatenated cv2_images
                    note: channels will be ordered BGR by default
    :param dim_ordering: keras backend - either 'tf' or 'th'
                    note: if 'th', images must be (batch, channels, height, width)
                          if 'tf', images must be (batch, height, width channels)
    :return: preprocessed batch of images
    '''
    images = images.astype(np.float64)
    if dim_ordering == 'default':
        dim_ordering = K.image_dim_ordering()
        assert dim_ordering in {'tf', 'th'}
        if dim_ordering == 'th':
            # need to transpose axes to make (batch, channels, height, width)
            print('Image batch arrived with shape: {}'.format(str(images.shape)))
            images = np.transpose(images, (0, 3, 1, 2))
            print('Image batch axes were transposed to shape: {} for THEANO dim-ordering convention'.format(str(images.shape)))
            # # 'RGB'->'BGR'
            # x = x[:, ::-1, :, :]
            # Zero-center by mean pixel
            images[:, 0, :, :] -= 103.939
            images[:, 1, :, :] -= 116.779
            images[:, 2, :, :] -= 123.68
        else:
            # 'RGB'->'BGR'
            # x = x[:, :, :, ::-1]
            # # Zero-center by mean pixel
            images[:, :, :, 0] -= 103.939
            images[:, :, :, 1] -= 116.779
            images[:, :, :, 2] -= 123.68
        return images

def extract_resnet_features(x, layer_name):
    net = ResNet50(include_top=False, weights='imagenet')
    model = Model(input=net.input, output=net.get_layer(layer_name).output)
    return model.predict(x)

def l2_distance(A, B):
    return np.linalg.norm(A - B)

########################################################################################################################

print('Using backend {}'.format(K.image_dim_ordering()))

layer = 'activation_43'

# img_path_1 = '/home/hal9000/Pictures/eeeeeeeeeeeeeeeeee.png'
# img_path_2 = '/home/hal9000/Pictures/joe_camel2.png'
img_path_1 = sys.argv[1]
img_path_2 = sys.argv[2]
im_1 = cv2.imread(img_path_1)
im_2 = cv2.imread(img_path_2)

# resize both images such they they have the same size (so that the extracted features have the same dimension)
h, w, c = 320, 320, 3
im_1 = cv2.resize(im_1, (w, h), interpolation=cv2.INTER_CUBIC)
im_2 = cv2.resize(im_2, (w, h), interpolation=cv2.INTER_CUBIC)

# construct a batch
batch = np.zeros(shape=(2, h, w, c))
batch[0] = im_1
batch[1] = im_2

# preprocess the batch
x = preprocess_cv2(batch)

# forward pass the network and extract features
print('Computing features...')
features = extract_resnet_features(x, layer)
features_shape = features.shape
print('Finished computing features!')
feature_1 = features[0]
feature_2 = features[1]

# save the features for later use
# with open('features_{}_1.nparray'.format(K.image_dim_ordering()), 'wb') as f:
#     np.save(f, feature_1)
# with open('features_{}_2.nparray'.format(K.image_dim_ordering()), 'wb') as f:
#     np.save(f, feature_2)

# l2 normalize the features
normalized_feature_1 = normalize(feature_1.flatten(), norm='l2', axis=0)
normalized_feature_2 = normalize(feature_2.flatten(), norm='l2', axis=0)

distance = l2_distance(normalized_feature_1, normalized_feature_2)
print('Distance between features: {}'.format(distance))

以下是我分别得到的 Theano 和 Tensorflow 的结果:

# /usr/bin/python2.7 /home/hal9000/tf_vs_th/comparison.py /home/hal9000/Pictures/eeeeeeeeeeeeeeeeee.png /home/hal9000/Pictures/joe_camel2.png
# Using Theano backend.
# WARNING (theano.sandbox.cuda): CUDA is installed, but device gpu is not available  (error: Unable to get the number of gpus available: CUDA driver version is insufficient for CUDA runtime version)
# Using backend th
# Image batch arrived with shape: (2, 320, 320, 3)
# Image batch axes were transposed to shape: (2, 3, 320, 320) for THEANO dim-ordering convention
# Computing features...
# Finished computing features!
# /usr/local/lib/python2.7/dist-packages/sklearn/utils/validation.py:395: DeprecationWarning: Passing 1d arrays as data is deprecated in 0.17 and will raise ValueError in 0.19. Reshape your data either using X.reshape(-1, 1) if your data has a single feature or X.reshape(1, -1) if it contains a single sample.
#   DeprecationWarning)
# /usr/local/lib/python2.7/dist-packages/sklearn/utils/validation.py:395: DeprecationWarning: Passing 1d arrays as data is deprecated in 0.17 and will raise ValueError in 0.19. Reshape your data either using X.reshape(-1, 1) if your data has a single feature or X.reshape(1, -1) if it contains a single sample.
#   DeprecationWarning)
# Distance between features: 446.848968506
# 
# Process finished with exit code 0

# /usr/bin/python2.7 /home/hal9000/tf_vs_th/comparison.py /home/hal9000/Pictures/eeeeeeeeeeeeeeeeee.png /home/hal9000/Pictures/joe_camel2.png
# Using TensorFlow backend.
# Using backend tf
# Computing features...
# Finished computing features!
# /usr/local/lib/python2.7/dist-packages/sklearn/utils/validation.py:395: DeprecationWarning: Passing 1d arrays as data is deprecated in 0.17 and will raise ValueError in 0.19. Reshape your data either using X.reshape(-1, 1) if your data has a single feature or X.reshape(1, -1) if it contains a single sample.
#   DeprecationWarning)
# /usr/local/lib/python2.7/dist-packages/sklearn/utils/validation.py:395: DeprecationWarning: Passing 1d arrays as data is deprecated in 0.17 and will raise ValueError in 0.19. Reshape your data either using X.reshape(-1, 1) if your data has a single feature or X.reshape(1, -1) if it contains a single sample.
#   DeprecationWarning)
# Distance between features: 261.067047119
# 
# Process finished with exit code 0

最佳答案

事实证明，问题在于我如何计算输出特征的 L2 标准化版本。

而不是按特征归一化

normalized_feature_1 = normalize(feature_1.flatten(), norm='l2', axis=0)

我应该像这样进行样本归一化

normalized_feature_1 = normalize(feature_1.flatten(), norm='l2', axis=1)

这会导致 TF 和 TH 模型执行相同的距离测量(取决于两个库处理舍入的方式之间的差异)

关于python - 为什么对于 Keras 中的 Theano 和 Tensorflow 后端，一对图像之间的 L2 距离会得到不同的结果？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/42841839/

文章推荐： java - javafx 应用程序中的 solr 而不是 apache lucene

文章推荐： python - 如何在keras训练期间调用reset_states()？

文章推荐： scala - 使用 Spark MLib 与 PredictionIO 进行 Play 框架比较

文章推荐： machine-learning - Tensorflow 中 cifar10 数据集的 CNN

c# - 使用{设置;得到;}而不是{得到;放;}
在 C# 及其同类语言中，我们总是使用 public string SomeString { get; set;} 但是你也可以使用(我最近才发现这个，而且是在和编译器闲逛的时候发现的) public
php - Laravel Swagger 得到 [语法错误] 预期值，得到 '@'
我已经为 Controller 中的函数编写了 Swagger 注释，但在生成 swagger-ui 代码时出现错误。以下是我的注释代码 /*** End of Annotation For dele
java - 得到??通过php代码调用jar文件时的字符
我正在 PHP 中开发一项服务，该服务使用 exec 函数调用 jar 文件，如下所示: $text = "string with accents á, ó, ú or العربية"; exec(
c - 随机大小缓冲区的缓冲区溢出？ (得到)
我正在尝试了解有关在程序中利用/防止缓冲区溢出的方法的更多信息。我知道如果大小是恒定的，下面的代码很容易受到攻击，但是如果大小每次都是随机的怎么办？是否还有办法从堆栈中获取它并以某种方式动态改变溢出字
c# - 得到;不能正常工作
对于一项学校作业，我应该制作一个可以以小时、分钟和秒为单位存储时间的时间类。一切正常，但仅声明 get 时属性总是返回 0；并设置； private int seconds, minutes, hou
javascript - 得到“未定义ReferenceError”
我正在遍历一些测验对象并将结果存储到json变量中。出现"ReferenceError is not defined"错误，不确定原因。 JS代码 // This function will send
mysql - 得到？？？通过Nifi将非拉丁数据放入mysql时
使用 Nifi 的 PutDatabaseRecord 处理器在 MySQL 中插入阿拉伯字符(非拉丁语)时，字符被“？？？？？？”替换插入后，阿拉伯字符串被替换为??????。我已经使用 utf8
c++ - 得到(变量)
谁能告诉我为什么 gets(abc) 使用 char[] 而不是使用 int？ int abc; char name[] = "lolrofl"; printf("Hello %s.\n",na
r as.POSIXct 得到 NA
为什么在使用 as.POSIXct 转换下面的时间戳时得到所有 NA？ > head(tmp$timestamp_utc) [1] Fri Jul 03 00:15:00 EDT 2015 Fri J
python - 得到 n 的所有约数的这个算法的运行时间复杂度是多少？
def get_submultiples(n): # Get all submultiples of n if n == 1: return [1] i = 2
Django 模型继承 - 得到 child
有没有办法访问基本模型的实际 child ，意思是:继续使用 django Docs 中的示例，让我们假设我正在建模不同的外卖餐厅，它们只是有共同点姓名都有deliver方法至此: class
javascript - 范围总和---得到 "undefined"
我正在寻找一个范围的总和，但我总是得到“未定义”。我相信有些东西出现在错误的位置，但我不确定它是什么。第 1 部分:“编写一个范围函数，它接受两个参数(start 和 end)，并返回一个包含从 s
java - 得到 JdkVersion classNotFoundException
我已将 spring 版本从 4.2.3 更新到 5.0.2，并将安全性从 5.0.1 更新到 5.0.10 并使用 spring -flex版本1.6.0.RC1。像这样使用 BlazeDS 依赖
java - 得到 0 输出而不是正确的输出
我可以输入但在输出中，我得到的结果为零。我使用两门类(class)，一门是主要的，是日志，另一门是成绩计算。在成绩计算器中，我编写了方法和构造函数，在日志中，类通过构造函数调用这些方法。 import
go - 构建时出错，得到 : "suspect or "
我在使用 go 时遇到了构建问题。我想知道这是编译器中的错误还是代码的问题。 // removed the error handling for sake of clarity file, _ :=
c# - 尝试与光线转换命中的对象上的组件进行交互，得到 NullReferenceException
我的角色在与盒子互动时出现问题。我有一个 GameObject Player 附加了一个脚本来与游戏中的盒子交互，脚本是: using UnityEngine; using System.Collec
javascript - 无法可视化百分比数字(得到 NaN)
有谁知道为什么我不能在下面生成百分比 codeIshere (第 97-117 行)？ var format=d3.format(".1%"); var percent = format(functi
python - 需要整数参数，得到 float
我正在尝试编写图像识别代码，以针对不同动物图像训练系统，这就是代码。我使用 anaconda 作为解释器，使用pycharm作为环境。 import tensorflow as tf import o
java - 使用初始化的字符串初始化匹配器，得到 NullPointerException
我正在尝试在 Java 中初始化 Matcher，但无论字符串是否已初始化且不为 null，都会继续获取 NPE。这是代码: pattern.compile("\\s"); System.out.p
javascript语法错误预期表达式，得到 '<'
所以我有这段代码: ; (function (g) { var d = document, i, am = d.createElement('script'), h = d.head || d.g

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - 为什么对于 Keras 中的 Theano 和 Tensorflow 后端，一对图像之间的 L2 距离会得到不同的结果？