machine-learning - TensorFlow:多次评估测试集但得到不同的精度-6ren

machine-learning - TensorFlow:多次评估测试集但得到不同的精度

转载作者：行者123 更新时间：2023-11-30 09:19:31

25

4

我使用CNN训练了MNIST模型，但是当我在训练后用测试数据检查模型的准确性时，我发现我的准确性会提高。这是代码。

BATCH_SIZE = 50
LR = 0.001              # learning rate
mnist = input_data.read_data_sets('./mnist', one_hot=True)  # they has been normalized to range (0,1)
test_x = mnist.test.images[:2000]
test_y = mnist.test.labels[:2000]

def new_cnn(imageinput, inputshape):
    weights = tf.Variable(tf.truncated_normal(inputshape, stddev = 0.1),name = 'weights')
    biases = tf.Variable(tf.constant(0.05, shape = [inputshape[3]]),name = 'biases')
    layer = tf.nn.conv2d(imageinput, weights, strides = [1, 1, 1, 1], padding = 'SAME')
    layer = tf.nn.relu(layer)
    return weights, layer

tf_x = tf.placeholder(tf.float32, [None, 28 * 28])
image = tf.reshape(tf_x, [-1, 28, 28, 1])              # (batch, height, width, channel)
tf_y = tf.placeholder(tf.int32, [None, 10])            # input y

# CNN
weights1, layer1 = new_cnn(image, [5, 5, 1, 32])
pool1 = tf.layers.max_pooling2d(
    layer1,
    pool_size=2,
    strides=2,
)           # -> (14, 14, 32)
weight2, layer2 = new_cnn(pool1, [5, 5, 32, 64])    # -> (14, 14, 64)
pool2 = tf.layers.max_pooling2d(layer2, 2, 2)    # -> (7, 7, 64)
flat = tf.reshape(pool2, [-1, 7 * 7 * 64])          # -> (7*7*64, )
hide = tf.layers.dense(flat, 1024, name = 'hide')              # hidden layer
output = tf.layers.dense(hide, 10, name = 'output')
loss = tf.losses.softmax_cross_entropy(onehot_labels=tf_y, logits=output)           # compute cost
accuracy = tf.metrics.accuracy( labels=tf.argmax(tf_y, axis=1), predictions=tf.argmax(output, axis=1),)[1]
train_op = tf.train.AdamOptimizer(LR).minimize(loss)



sess = tf.Session()
init_op = tf.group(tf.global_variables_initializer(), tf.local_variables_initializer()) # the local var is for accuracy
sess.run(init_op)     # initialize var in graph
saver = tf.train.Saver()
for step in range(101):
    b_x, b_y = mnist.train.next_batch(BATCH_SIZE)
    _, loss_ = sess.run([train_op, loss], {tf_x: b_x, tf_y: b_y})
    if step % 50 == 0:
        print(loss_)
        accuracy_, loss2 = sess.run([accuracy, loss], {tf_x: test_x, tf_y: test_y })
        print('Step:', step, '| test accuracy: %f' % accuracy_)

为了简化问题，我只使用 100 次训练迭代。测试集的最终准确率约为0.655000。

但是当我运行以下代码时:

for i in range(5):
  accuracy2 = sess.run(accuracy, {tf_x: test_x, tf_y: test_y })
  print(sess.run(weight2[1,:,0,0])) # To show that the model parameters won't update 
  print(accuracy2)

输出为

[-0.06928255 -0.13498515  0.01266837  0.05656774  0.09438231]
0.725875
[-0.06928255 -0.13498515  0.01266837  0.05656774  0.09438231]
0.7684
[-0.06928255 -0.13498515  0.01266837  0.05656774  0.09438231]
0.79675
[-0.06928255 -0.13498515  0.01266837  0.05656774  0.09438231]
0.817
[-0.06928255 -0.13498515  0.01266837  0.05656774  0.09438231]
0.832187

这让我很困惑，有人可以告诉我出了什么问题吗？感谢您的耐心等待!

最佳答案

tf.metrics.accuracy并不像你想象的那么微不足道。看一下它的文档:

The accuracy function creates two local variables, total and
count that are used to compute the frequency with which predictions matches labels. This frequency is ultimately returned as accuracy: an idempotent operation that simply divides total by count.

Internally, an is_correct operation computes a Tensor with elements 1.0 where the corresponding elements of predictions and labels match and 0.0 otherwise. Then update_op increments total with the reduced sum of the product of weights and is_correct, and it increments count with the reduced sum of weights.

For estimation of the metric over a stream of data, the function creates an update_op operation that updates these variables and returns the accuracy.

...

Returns:

accuracy: A Tensor representing the accuracy, the value of total divided by count.

update_op: An operation that increments the total and count variables appropriately and whose value matches accuracy.

请注意，它返回一个元组，并且您获取第二项，即update_op。连续调用 update_op 被视为数据流，这不是您想要做的(因为训练期间的每个评估都会影响 future 的评估)。事实上，这个运行指标是 pretty counter-intuitive .

您的解决方案是使用简单的精度计算。将此行更改为:

accuracy = tf.reduce_mean(tf.cast(tf.equal(tf.argmax(tf_y, axis=1), tf.argmax(output, axis=1)), tf.float32))

您将获得稳定的准确度计算。

关于machine-learning - TensorFlow:多次评估测试集但得到不同的精度，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/45127327/

25

4

0

文章推荐： python - 如何找到 tensorflow 数组中的第二低值

文章推荐： python - Python的sklearn(DecisionTreeClassifier，SVM)之间的区别？

文章推荐： python - Keras 训练循环如何过滤损失值？

文章推荐： java - ArrayList 类型的完整名称

JavaScript 精度
关于这个话题已经说了很多，但是我找不到我的问题的确切答案。 JavaScript 无法准确表示 0.1 等小数，这是可以理解的。例如，由于乘法运算期间发生舍入误差，这是正确的: 0.1 * 3 ==
之字形打印 float 精度
在 zig 中，可以使用“{d}”以十进制表示法打印浮点值。这将自动以全精度打印该值。有没有办法指定位数？是针对每个值，还是作为某种全局设置？最佳答案这将限制小数点后的位数，四舍五入和零填充: f
减法时的 MATLAB 精度
我正在进行的项目需要高精度。减法时我遇到的问题在这里说明: >> 1-0.9999999999999999 ans = 1.1102e-16 >> 1-0.99999999999999999 ans
Fortran - 在运行时设置变量的种类/精度
是否可以使变量本身的精度成为将在运行时定义的变量？说，如果我尝试编译: SUBROUTINE FOO( VARIABLE, PRECISION_VALUE ) IMPLICI
sqlite - 纬度查询-精度
我正在查询 SQLite 数据库以获取纬度/经度详细信息。 SELECT * FROM tblMain where latitude > -33.866 and latitude 151.20
fortran - 使用下划线定义种类/精度
我一直使用下划线将整数定义为 Fortran 中的特定类型。下面是一段代码，用于演示 1_8 的含义，例如: program main implicit none integer(2)
python - 品脱有效数字，精度
我正在寻找一种方法来告诉 pint 要打印多少个有效数字。例如，当我输入以下内容时: import pint ureg = pint.UnitRegistry() print(3*ureg.m /9)
安卓 IMU 精度
我正在从事一个项目，目标是从山上追踪动物。在第一个实地考察季中，我们使用了 OpenTags 和经过校准的摄像头，虽然可以正常工作，但需要大量的处理/校准，而且至关重要的是，当系统出现问题时无法提供任
划分时的 Javascript 精度
在 JavaScript 中有没有一种方法可以确定一个数除以另一个数是否会得到整数？就像 18.4/0.002 给我们 9200，但是 18.4/0.1 给我们 183.99999999999997。
数学库的 Javascript 精度
我正在尝试使用 Big.js 在 javascript 中完成此计算 r = (a * b)/ sqrt( ( a*sin(θ) )^2 + ( b*cos(θ) )^2 ) 我也试过 math.js
ios - gl_FragCoord 精度
我有这个片段着色器代码，它在 iOS 模拟器(非视网膜)和 iPad2(非视网膜)之间显示不同: highp vec2 textCoord; textCoord.x = gl_Fr
c++ double /精度
这个问题在这里已经有了答案: C++ calculating more precise than double or long double (2 个答案) 关闭 6 年前。是否有任何浮点类型在小
java - 位运算符 - 精度
我似乎一直困惑的三个问题: 为什么代码是 x & ~077比这行代码 x & 0177700 更好。是因为精度损失较小吗？为什么此代码对于设置数字中的第 5 位不正确？ num = num + 0x
c - float 精度
这个问题在这里已经有了答案: 关闭 10 年前。 Possible Duplicate: Precision of Floating Point 我正在尝试使用一些 float 来计算概率，但我的最
c - float 精度
由于微 Controller 的精度，我定义了一个包含两个 float 比率的符号，而不是直接写结果。 #define INTERVAL (0.01F/0.499F) 代替 #defi
c - c语言搜索算法执行时间(精度)
我试图比较这 3 种搜索算法，起初我使用 time.h 库但没有任何反应，输出始终是 0.00000 秒。现在我试图在循环中使用一些计数器。但我在这里也有问题, 任何人都可以帮我处理代码吗？这是我的
c - printf() 精度
char buf[10]; int counter, x = 0; snprintf (buf, sizeof buf , "%.100d%n", x, &counter); printf("Coun
matlab:评估向量时的错误/精度
我注意到在评估向量时对我来说是不可预测的行为。直接执行它与在循环中进行索引似乎是完全不同的。谁能帮我解决这个问题？我知道可能在它如何进行每个操作中都有解释，所以我需要一些关于如何查找它的键多谢指教提
安卓 GPS 精度
我想在我的应用程序中使用精确的 gps 定位。所以我遵循了一个简单的教程(LocationManager 的基本用法，明确要求 GPS 提供商，要求更新 0 ms，0 m)并创建了一个应用程序。我对更
c++ - float 精度
float 在 1.0f 和 0.0f 之间有多少位精度，这样每个值都可以唯一表示？例如，如果第一个小数 float 不能表示 0.13f，答案就是 float 只有一位精度。最佳答案 std::

首页

博学

6Ren·AI

商城

machine-learning - TensorFlow:多次评估测试集但得到不同的精度