python - Tensorflow 精度为 0.99，但预测很糟糕-6ren

python - Tensorflow 精度为 0.99，但预测很糟糕

转载作者：太空宇宙更新时间：2023-11-03 13:13:14

也许我预测错了？

这是项目...我有一个灰度输入图像，我正在尝试对其进行分割。分割是一个简单的二元分类(考虑前景与背景)。所以基本事实 (y) 是一个由 0 和 1 组成的矩阵——所以有 2 个分类。哦，输入图像是一个正方形，所以我只使用一个名为 n_input

的变量

我的准确率基本上收敛到 0.99，但当我做出预测时，我得到的结果全为零。编辑 --> 每个输出矩阵中都有一个 1，都在同一个地方...

这是我的 session 代码(其他一切正常)...

with tf.Session() as sess:
    sess.run(init)
    summary = tf.train.SummaryWriter('/tmp/logdir/', sess.graph_def)
    step = 1
    from tensorflow.contrib.learn.python.learn.datasets.scroll import scroll_data
    data = scroll_data.read_data('/home/kendall/Desktop/')
    # Keep training until reach max iterations
    flag = 0
    # while flag == 0:
    while step * batch_size < training_iters:
        batch_y, batch_x = data.train.next_batch(batch_size)
        # pdb.set_trace()
        # batch_x = batch_x.reshape((batch_size, n_input))
        batch_x = batch_x.reshape((batch_size, n_input, n_input))
        batch_y = batch_y.reshape((batch_size, n_input, n_input))
        batch_y = convert_to_2_channel(batch_y, batch_size)
        # batch_y = batch_y.reshape((batch_size, n_output, n_classes))
        batch_y = batch_y.reshape((batch_size, 200, 200, n_classes))
        sess.run(optimizer, feed_dict={x: batch_x, y: batch_y,
                                       keep_prob: dropout})
        if step % display_step == 0:
            flag = 1
            # Calculate batch loss and accuracy
            loss, acc = sess.run([cost, accuracy], feed_dict={x: batch_x,
                                                              y: batch_y,
                                                              keep_prob: 1.})
            print "Iter " + str(step*batch_size) + ", Minibatch Loss= " + \
                  "{:.6f}".format(loss) + ", Training Accuracy= " + \
                  "{:.5f}".format(acc)
        step += 1
    print "Optimization Finished!"
    save_path = "model.ckpt"
    saver.save(sess, save_path)

    im = Image.open('/home/kendall/Desktop/HA900_frames/frame0635.tif')
    batch_x = np.array(im)
    pdb.set_trace()
    batch_x = batch_x.reshape((1, n_input, n_input))
    batch_x = batch_x.astype(float)
    # pdb.set_trace()
    prediction = sess.run(pred, feed_dict={x: batch_x, keep_prob: 1.})
    print prediction
    arr1 = np.empty((n_input,n_input))
    arr2 = np.empty((n_input,n_input))
    for i in xrange(n_input):
        for j in xrange(n_input):
            for k in xrange(2):
                if k == 0:
                    arr1[i][j] = prediction[0][i][j][k]
                else:
                    arr2[i][j] = prediction[0][i][j][k]
    # prediction = np.asarray(prediction)
    # prediction = np.reshape(prediction, (200,200))
    # np.savetxt("prediction.csv", prediction, delimiter=",")
    np.savetxt("prediction1.csv", arr1, delimiter=",")
    np.savetxt("prediction2.csv", arr2, delimiter=",")

由于有两个分类，最后部分(带有几个循环)只是将预测划分为两个 2x2 矩阵。

我将预测数组保存到 CSV 文件中，正如我所说，它们全为零。

我还确认所有数据都是正确的(尺寸和值)。

为什么训练会收敛，但预测却很糟糕？

如果你想查看所有代码，就在这里...

import tensorflow as tf
import pdb
import numpy as np
from numpy import genfromtxt
from PIL import Image

# Import MINST data
# from tensorflow.examples.tutorials.mnist import input_data
# mnist = input_data.read_data_sets("/tmp/data/", one_hot=True)


# Parameters
learning_rate = 0.001
training_iters = 20000
batch_size = 128
display_step = 1

# Network Parameters
n_input = 200 # MNIST data input (img shape: 28*28)
n_output = 40000 # MNIST total classes (0-9 digits)
n_classes = 2
#n_input = 200

dropout = 0.75 # Dropout, probability to keep units

# tf Graph input
x = tf.placeholder(tf.float32, [None, n_input, n_input])
y = tf.placeholder(tf.float32, [None, n_input, n_input, n_classes])
keep_prob = tf.placeholder(tf.float32) #dropout (keep probability)

# Create some wrappers for simplicity
def conv2d(x, W, b, strides=1):
    # Conv2D wrapper, with bias and relu activation
    x = tf.nn.conv2d(x, W, strides=[1, strides, strides, 1], padding='SAME')
    x = tf.nn.bias_add(x, b)
    return tf.nn.relu(x)

def maxpool2d(x, k=2):
    # MaxPool2D wrapper
    return tf.nn.max_pool(x, ksize=[1, k, k, 1], strides=[1, k, k, 1],
                          padding='SAME')


# Create model
def conv_net(x, weights, biases, dropout):
    # Reshape input picture
    x = tf.reshape(x, shape=[-1, n_input, n_input, 1])

    # Convolution Layer
    conv1 = conv2d(x, weights['wc1'], biases['bc1'])
    # Max Pooling (down-sampling)
    conv1 = maxpool2d(conv1, k=2)
    conv1 = tf.nn.local_response_normalization(conv1)

    # Convolution Layer
    conv2 = conv2d(conv1, weights['wc2'], biases['bc2'])
    # Max Pooling (down-sampling)
    conv2 = tf.nn.local_response_normalization(conv2)
    conv2 = maxpool2d(conv2, k=2)

    # Convolution Layer
    conv3 = conv2d(conv2, weights['wc3'], biases['bc3'])
    # Max Pooling (down-sampling)
    conv3 = tf.nn.local_response_normalization(conv3)
    conv3 = maxpool2d(conv3, k=2)

    # pdb.set_trace()

    # Fully connected layer
    # Reshape conv2 output to fit fully connected layer input
    fc1 = tf.reshape(conv3, [-1, weights['wd1'].get_shape().as_list()[0]])
    fc1 = tf.add(tf.matmul(fc1, weights['wd1']), biases['bd1'])
    fc1 = tf.nn.relu(fc1)
    # Apply Dropout
    fc1 = tf.nn.dropout(fc1, dropout)

    output = []
    for i in xrange(2):
        output.append(tf.nn.softmax(tf.add(tf.matmul(fc1, weights['out']), biases['out'])))

    return output
    # return tf.nn.softmax(tf.add(tf.matmul(fc1, weights['out']), biases['out']))


# Store layers weight & bias
weights = {
    # 5x5 conv, 1 input, 32 outputs
    'wc1': tf.Variable(tf.random_normal([5, 5, 1, 32])),
    # 5x5 conv, 32 inputs, 64 outputs
    'wc2': tf.Variable(tf.random_normal([5, 5, 32, 64])),
    # 5x5 conv, 32 inputs, 64 outputs
    'wc3': tf.Variable(tf.random_normal([5, 5, 64, 128])),
    # fully connected, 7*7*64 inputs, 1024 outputs
    'wd1': tf.Variable(tf.random_normal([25*25*128, 1024])),
    # 1024 inputs, 10 outputs (class prediction)
    'out': tf.Variable(tf.random_normal([1024, n_output]))
}

biases = {
    'bc1': tf.Variable(tf.random_normal([32])),
    'bc2': tf.Variable(tf.random_normal([64])),
    'bc3': tf.Variable(tf.random_normal([128])),
    'bd1': tf.Variable(tf.random_normal([1024])),
    'out': tf.Variable(tf.random_normal([n_output]))
}

# Construct model
pred = conv_net(x, weights, biases, keep_prob)
# pdb.set_trace()
pred = tf.pack(tf.transpose(pred,[1,2,0]))
pred = tf.reshape(pred, [-1,n_input,n_input,n_classes])
# Define loss and optimizer
cost = tf.reduce_mean(tf.nn.sigmoid_cross_entropy_with_logits(pred, y))
optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate).minimize(cost)

# Evaluate model
correct_pred = tf.equal(tf.argmax(pred, 1), tf.argmax(y, 1))
accuracy = tf.reduce_mean(tf.cast(correct_pred, tf.float32))

# Initializing the variables
init = tf.initialize_all_variables()
saver = tf.train.Saver()

def convert_to_2_channel(x, batch_size):
    #assume input has dimension (batch_size,x,y)
    #output will have dimension (batch_size,x,y,2)
    output = np.empty((batch_size, 200, 200, 2))

    temp_arr1 = np.empty((batch_size, 200, 200))
    temp_arr2 = np.empty((batch_size, 200, 200))

    for i in xrange(batch_size):
        for j in xrange(200):
            for k in xrange(200):
                if x[i][j][k] == 1:
                    temp_arr1[i][j][k] = 1
                    temp_arr2[i][j][k] = 0
                else:
                    temp_arr1[i][j][k] = 0
                    temp_arr2[i][j][k] = 1

    for i in xrange(batch_size):
        for j in xrange(200):
            for k in xrange(200):
                for l in xrange(2):
                    if l == 0:
                        output[i][j][k][l] = temp_arr1[i][j][k]
                    else:
                        output[i][j][k][l] = temp_arr2[i][j][k]

    return output

# Launch the graph
with tf.Session() as sess:
    sess.run(init)
    summary = tf.train.SummaryWriter('/tmp/logdir/', sess.graph_def)
    step = 1
    from tensorflow.contrib.learn.python.learn.datasets.scroll import scroll_data
    data = scroll_data.read_data('/home/kendall/Desktop/')
    # Keep training until reach max iterations
    flag = 0
    # while flag == 0:
    while step * batch_size < training_iters:
        batch_y, batch_x = data.train.next_batch(batch_size)
        # pdb.set_trace()
        # batch_x = batch_x.reshape((batch_size, n_input))
        batch_x = batch_x.reshape((batch_size, n_input, n_input))
        batch_y = batch_y.reshape((batch_size, n_input, n_input))
        batch_y = convert_to_2_channel(batch_y, batch_size)
        # batch_y = batch_y.reshape((batch_size, n_output, n_classes))
        batch_y = batch_y.reshape((batch_size, 200, 200, n_classes))
        sess.run(optimizer, feed_dict={x: batch_x, y: batch_y,
                                       keep_prob: dropout})
        if step % display_step == 0:
            flag = 1
            # Calculate batch loss and accuracy
            loss, acc = sess.run([cost, accuracy], feed_dict={x: batch_x,
                                                              y: batch_y,
                                                              keep_prob: 1.})
            print "Iter " + str(step*batch_size) + ", Minibatch Loss= " + \
                  "{:.6f}".format(loss) + ", Training Accuracy= " + \
                  "{:.5f}".format(acc)
        step += 1
    print "Optimization Finished!"
    save_path = "model.ckpt"
    saver.save(sess, save_path)

    im = Image.open('/home/kendall/Desktop/HA900_frames/frame0635.tif')
    batch_x = np.array(im)
    pdb.set_trace()
    batch_x = batch_x.reshape((1, n_input, n_input))
    batch_x = batch_x.astype(float)
    # pdb.set_trace()
    prediction = sess.run(pred, feed_dict={x: batch_x, keep_prob: 1.})
    print prediction
    arr1 = np.empty((n_input,n_input))
    arr2 = np.empty((n_input,n_input))
    for i in xrange(n_input):
        for j in xrange(n_input):
            for k in xrange(2):
                if k == 0:
                    arr1[i][j] = prediction[0][i][j][k]
                else:
                    arr2[i][j] = prediction[0][i][j][k]
    # prediction = np.asarray(prediction)
    # prediction = np.reshape(prediction, (200,200))
    # np.savetxt("prediction.csv", prediction, delimiter=",")
    np.savetxt("prediction1.csv", arr1, delimiter=",")
    np.savetxt("prediction2.csv", arr2, delimiter=",")

    # Calculate accuracy for 256 mnist test images
    print "Testing Accuracy:", \
        sess.run(accuracy, feed_dict={x: data.test.images[:256],
                                      y: data.test.labels[:256],
                                      keep_prob: 1.})

最佳答案

代码中的错误

您的代码中存在多个错误:

你不应该调用tf.nn.sigmoid_cross_entropy_with_logits使用 softmax 层的输出，但使用未缩放的 logits:

WARNING: This op expects unscaled logits, since it performs a softmax on logits internally for efficiency. Do not call this op with the output of softmax, as it will produce incorrect results.

事实上，由于您有 2 个类，您应该使用 softmax 损失，使用 tf.nn.softmax_cross_entropy_with_logits
使用 tf.argmax(pred, 1) 时，您仅在轴 1 上应用 argmax，这是输出图像的高度。您应该在最后一个轴(大小为 2)上使用 tf.argmax(pred, 3)。
- 这或许可以解释为什么你的准确率是 0.99
- 在输出图像上，它会采用图像高度上的 argmax，默认情况下为 0(因为每个 channel 的所有值都相等)

错误的模型

最大的缺点是您的模型通常很难优化。

您有超过 40,000 个类别的 softmax，这是巨大的。
您根本没有利用想要输出图像(预测前景/背景)这一事实。
- 例如预测 2,345 与预测 2,346 和预测 2,545 高度相关，但您没有考虑到这一点

我建议先阅读一些有关语义分割的内容:

this paper : 用于语义分割的全卷积网络
these slides来自 CS231n (Stanford):特别是关于上采样和反卷积的部分

建议

如果您想使用 TensorFlow，您需要从小处着手。首先尝试一个可能只有 1 个隐藏层的非常简单的网络。

您需要绘制张量的所有形状，以确保它们符合您的想法。例如，如果您绘制了 tf.argmax(y, 1)，您会意识到形状是 [batch_size, 200, 2] 而不是预期的 [batch_size, 200, 200].

TensorBoard 是您的好 helper ，您应该尝试在此处绘制输入图像以及您的预测，看看它们是什么样子。

尝试小，使用包含 10 张图像的非常小的数据集，看看您是否可以过度拟合它并预测几乎准确的响应。

总而言之，我不确定我的所有建议，但它们值得一试，我希望这能帮助您走向成功!

关于python - Tensorflow 精度为 0.99，但预测很糟糕，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/37898795/

文章推荐： c# - 如果控件已更改，如何中断 mouseUp 事件

文章推荐： c# - 快速应用恢复不工作

文章推荐： python - 为什么 "raise"和 "del"不是 Python 中的函数？

JavaScript 精度
关于这个话题已经说了很多，但是我找不到我的问题的确切答案。 JavaScript 无法准确表示 0.1 等小数，这是可以理解的。例如，由于乘法运算期间发生舍入误差，这是正确的: 0.1 * 3 ==
之字形打印 float 精度
在 zig 中，可以使用“{d}”以十进制表示法打印浮点值。这将自动以全精度打印该值。有没有办法指定位数？是针对每个值，还是作为某种全局设置？最佳答案这将限制小数点后的位数，四舍五入和零填充: f
减法时的 MATLAB 精度
我正在进行的项目需要高精度。减法时我遇到的问题在这里说明: >> 1-0.9999999999999999 ans = 1.1102e-16 >> 1-0.99999999999999999 ans
Fortran - 在运行时设置变量的种类/精度
是否可以使变量本身的精度成为将在运行时定义的变量？说，如果我尝试编译: SUBROUTINE FOO( VARIABLE, PRECISION_VALUE ) IMPLICI
sqlite - 纬度查询-精度
我正在查询 SQLite 数据库以获取纬度/经度详细信息。 SELECT * FROM tblMain where latitude > -33.866 and latitude 151.20
fortran - 使用下划线定义种类/精度
我一直使用下划线将整数定义为 Fortran 中的特定类型。下面是一段代码，用于演示 1_8 的含义，例如: program main implicit none integer(2)
python - 品脱有效数字，精度
我正在寻找一种方法来告诉 pint 要打印多少个有效数字。例如，当我输入以下内容时: import pint ureg = pint.UnitRegistry() print(3*ureg.m /9)
安卓 IMU 精度
我正在从事一个项目，目标是从山上追踪动物。在第一个实地考察季中，我们使用了 OpenTags 和经过校准的摄像头，虽然可以正常工作，但需要大量的处理/校准，而且至关重要的是，当系统出现问题时无法提供任
划分时的 Javascript 精度
在 JavaScript 中有没有一种方法可以确定一个数除以另一个数是否会得到整数？就像 18.4/0.002 给我们 9200，但是 18.4/0.1 给我们 183.99999999999997。
数学库的 Javascript 精度
我正在尝试使用 Big.js 在 javascript 中完成此计算 r = (a * b)/ sqrt( ( a*sin(θ) )^2 + ( b*cos(θ) )^2 ) 我也试过 math.js
ios - gl_FragCoord 精度
我有这个片段着色器代码，它在 iOS 模拟器(非视网膜)和 iPad2(非视网膜)之间显示不同: highp vec2 textCoord; textCoord.x = gl_Fr
c++ double /精度
这个问题在这里已经有了答案: C++ calculating more precise than double or long double (2 个答案) 关闭 6 年前。是否有任何浮点类型在小
java - 位运算符 - 精度
我似乎一直困惑的三个问题: 为什么代码是 x & ~077比这行代码 x & 0177700 更好。是因为精度损失较小吗？为什么此代码对于设置数字中的第 5 位不正确？ num = num + 0x
c - float 精度
这个问题在这里已经有了答案: 关闭 10 年前。 Possible Duplicate: Precision of Floating Point 我正在尝试使用一些 float 来计算概率，但我的最
c - float 精度
由于微 Controller 的精度，我定义了一个包含两个 float 比率的符号，而不是直接写结果。 #define INTERVAL (0.01F/0.499F) 代替 #defi
c - c语言搜索算法执行时间(精度)
我试图比较这 3 种搜索算法，起初我使用 time.h 库但没有任何反应，输出始终是 0.00000 秒。现在我试图在循环中使用一些计数器。但我在这里也有问题, 任何人都可以帮我处理代码吗？这是我的
c - printf() 精度
char buf[10]; int counter, x = 0; snprintf (buf, sizeof buf , "%.100d%n", x, &counter); printf("Coun
matlab:评估向量时的错误/精度
我注意到在评估向量时对我来说是不可预测的行为。直接执行它与在循环中进行索引似乎是完全不同的。谁能帮我解决这个问题？我知道可能在它如何进行每个操作中都有解释，所以我需要一些关于如何查找它的键多谢指教提
安卓 GPS 精度
我想在我的应用程序中使用精确的 gps 定位。所以我遵循了一个简单的教程(LocationManager 的基本用法，明确要求 GPS 提供商，要求更新 0 ms，0 m)并创建了一个应用程序。我对更
c++ - float 精度
float 在 1.0f 和 0.0f 之间有多少位精度，这样每个值都可以唯一表示？例如，如果第一个小数 float 不能表示 0.13f，答案就是 float 只有一位精度。最佳答案 std::

太空宇宙

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城