python - 使用 Google 的 TensorFlow 添加额外的隐藏层-6ren

python - 使用 Google 的 TensorFlow 添加额外的隐藏层

转载作者：太空狗更新时间：2023-10-29 20:49:11

我正在尝试使用 TensorFlow 创建多标签分类器。虽然我在添加和连接隐藏层时遇到了问题。

我正在学习本教程:http://jrmeyer.github.io/tutorial/2016/02/01/TensorFlow-Tutorial.html

我使用的数据是 UCI 的 Iris 数据，编码为 one-hot:

训练 X [105,4]

5,3.2,1.2,0.2
5.5,3.5,1.3,0.2
4.9,3.1,1.5,0.1
4.4,3,1.3,0.2
5.1,3.4,1.5,0.2
.
.
.

训练 Y [105,3]

0,0,1
0,0,1
0,0,1
0,0,1
0,0,1
0,0,1
.
.
.

我还使用了测试数据 X 和 Y，它们分别是 [45,4] 和 [45,3]。

这是我的python代码:

import tensorflow as tf
import numpy as np
import tarfile
import os
import matplotlib.pyplot as plt
import time

## Import data
def csv_to_numpy_array(filePath, delimiter):
    return np.genfromtxt(filePath, delimiter=delimiter, dtype=None)

trainX = csv_to_numpy_array("Iris_training_x.csv", delimiter=",").astype(np.float32)
trainY = csv_to_numpy_array("Iris_training_y.csv", delimiter=",").astype(np.float32)
testX = csv_to_numpy_array("Iris_testing_x.csv", delimiter=",").astype(np.float32)
testY = csv_to_numpy_array("Iris_testing_y.csv", delimiter=",").astype(np.float32)


# Data Set Paramaters
numFeatures = trainX.shape[1]
numLabels = trainY.shape[1]

# Training Session Parameters
numEpochs = 1000

learningRate = tf.train.exponential_decay(learning_rate=0.008,
                                          global_step= 1,
                                          decay_steps=trainX.shape[0],
                                          decay_rate= 0.95,
                                          staircase=True)

# Placeholders
X=tf.placeholder(tf.float32, [None, numFeatures])
y=tf.placeholder(tf.float32, [None, numLabels])

# Initialize our weights and biases

Weights = tf.Variable(tf.random_normal([numFeatures, numLabels],
                                       mean=0,
                                       stddev=(np.sqrt(6 / numFeatures + numLabels + 1)),
                                       name="Weights"))

bias = tf.Variable(tf.random_normal([1, numLabels],
                                    mean=0,
                                    stddev=(np.sqrt(6 / numFeatures + numLabels + 1)),
                                    name="bias"))


# Prediction algorithm (feedforward)

apply_weights_OP = tf.matmul(X, Weights, name="apply_weights")
add_bias_OP = tf.add(apply_weights_OP, bias, name="add_bias")
activation_OP = tf.nn.sigmoid(add_bias_OP, name="activation")

numFeatures = activation_OP

apply_weights_OP = tf.matmul(X, Weights, name="apply_weights")
add_bias_OP = tf.add(apply_weights_OP, bias, name="add_bias")
activation_OP = tf.nn.sigmoid(add_bias_OP, name="activation")

init_OP = tf.initialize_all_variables()


# Cost function (Mean Squeared Error)
cost_OP = tf.nn.l2_loss(activation_OP-y, name="squared_error_cost")

# Optimization Algorithm (Gradient Descent)
training_OP = tf.train.GradientDescentOptimizer(learningRate).minimize(cost_OP)

# Visualize
epoch_values=[]
accuracy_values=[]
cost_values=[]
# Turn on interactive plotting
plt.ion()
# Create the main, super plot
fig = plt.figure()
# Create two subplots on their own axes and give titles
ax1 = plt.subplot("211")
ax1.set_title("TRAINING ACCURACY", fontsize=18)
ax2 = plt.subplot("212")
ax2.set_title("TRAINING COST", fontsize=18)
plt.tight_layout()

# Create a tensorflow session
sess = tf.Session()

# Initialize all tensorflow variables
sess.run(init_OP)

## Ops for vizualization
# argmax(activation_OP, 1) gives the label our model thought was most likely
# argmax(y, 1) is the correct label
correct_predictions_OP = tf.equal(tf.argmax(activation_OP,1),tf.argmax(y,1))
# False is 0 and True is 1, what was our average?
accuracy_OP = tf.reduce_mean(tf.cast(correct_predictions_OP, "float"))
# Summary op for regression output
activation_summary_OP = tf.histogram_summary("output", activation_OP)
# Summary op for accuracy
accuracy_summary_OP = tf.scalar_summary("accuracy", accuracy_OP)
# Summary op for cost
cost_summary_OP = tf.scalar_summary("cost", cost_OP)
# Summary ops to check how variables (W, b) are updating after each iteration
weightSummary = tf.histogram_summary("Weights", Weights.eval(session=sess))
biasSummary = tf.histogram_summary("biases", bias.eval(session=sess))
# Merge all summaries
all_summary_OPS = tf.merge_all_summaries()
# Summary writer
writer = tf.train.SummaryWriter("summary_logs", sess.graph_def)

# Initialize reporting variables
cost = 0
diff = 1

# Training epochs
for i in range(numEpochs):
    if i > 1 and diff < .0001:
        print("change in cost %g; convergence."%diff)
        break
    else:
        # Run training step
        step = sess.run(training_OP, feed_dict={X: trainX, y: trainY})
        # Report occasional stats
        if i % 10 == 0:
            #Add epoch to epoch_values
            epoch_values.append(i)
            #Generate accuracy stats on test data
            summary_results, train_accuracy, newCost = sess.run(
                [all_summary_OPS, accuracy_OP, cost_OP],
                feed_dict={X: trainX, y: trainY}
            )
            # Add accuracy to live graphing variable
            accuracy_values.append(train_accuracy)
            # Add cost to live graphing variable
            cost_values.append(newCost)
            #Write summary stats to writer
            #writer.add_summary(summary_results, i)
            # Re-assign values for variables
            diff = abs(newCost - cost)
            cost = newCost

            #generate print statements
            print("step %d, training accuracy %g"%(i, train_accuracy))
            print("step %d, cost %g"%(i, newCost))
            print("step %d, change in cost %g"%(i, diff))

            # Plot progress to our two subplots
            accuracyLine, = ax1.plot(epoch_values, accuracy_values)
            costLine, = ax2.plot(epoch_values, cost_values)
            fig.canvas.draw()
            #time.sleep(1)


# How well do we perform on held-out test data?
print("final accuracy on test set: %s" %str(sess.run(accuracy_OP, feed_dict={X: testX, y: testY})))

# Create Saver
saver = tf.train.Saver()
# Save variables to .ckpt file
# saver.save(sess, "trained_variables.ckpt")

# Close tensorflow session
sess.close()

问题在这里:

# Prediction algorithm (feedforward)

apply_weights_OP = tf.matmul(X, Weights, name="apply_weights")
add_bias_OP = tf.add(apply_weights_OP, bias, name="add_bias")
activation_OP = tf.nn.sigmoid(add_bias_OP, name="activation")

numFeatures = activation_OP

apply_weights_OP = tf.matmul(activation_OP, Weights, name="apply_weights")
add_bias_OP = tf.add(apply_weights_OP, bias, name="add_bias")
activation_OP = tf.nn.sigmoid(add_bias_OP, name="activation")

我的理解是一层的输出应该连接到下一层的输入。我只是不知道如何修改图层的输出或输入；它一直给我这个兼容性错误:

/usr/bin/python3.5 /home/marco/PycharmProjects/NN_Iris/main
Traceback (most recent call last):
  File "/home/marco/PycharmProjects/NN_Iris/main", line 132, in <module>
    apply_weights_OP = tf.matmul(activation_OP, Weights, name="apply_weights")
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/ops/math_ops.py", line 1346, in matmul
    name=name)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/ops/gen_math_ops.py", line 1271, in _mat_mul
    transpose_b=transpose_b, name=name)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/framework/op_def_library.py", line 703, in apply_op
    op_def=op_def)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/framework/ops.py", line 2312, in create_op
    set_shapes_for_outputs(ret)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/framework/ops.py", line 1704, in set_shapes_for_outputs
    shapes = shape_func(op)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/framework/common_shapes.py", line 94, in matmul_shape
    inner_a.assert_is_compatible_with(inner_b)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/framework/tensor_shape.py", line 108, in assert_is_compatible_with
    % (self, other))
ValueError: Dimensions 3 and 4 are not compatible

Process finished with exit code 1

关于如何正确连接两个隐藏层有什么建议吗？谢谢。

最佳答案

如果您想要一个具有一个隐藏层和一个输出层的全连接网络，那么它们的形状应该是这样的:

# hidden layer
weights_hidden = tf.Variable(tf.random_normal([numFeatures, num_nodes])
bias_hidden = tf.Variable(tf.random_normal([num_nodes])
preactivations_hidden = tf.add(tf.matmul(X, weights_hidden), bias_hidden)
activations_hidden = tf.nn.sigmoid(preactivations_hidden)

# output layer
weights_output = tf.Variable(tf.random_normal([num_nodes, numLabels])
bias_output = tf.Variable(tf.random_normal([numLabels]) 
preactivations_output = tf.add(tf.matmul(activations_hidden, weights_output), bias_output)

其中 num_nodes 是您自己选择的隐藏层中的节点数。 X是一个[105, numFeatures]矩阵，weights_hidden是[numFeatures, num_nodes]矩阵，所以输出第一个隐藏层是[105, num_nodes]。同样，[105, num_nodes] 乘以 [num_nodes, numLabels] 产生 [105, numLabels] 输出。

关于python - 使用 Google 的 TensorFlow 添加额外的隐藏层，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/39414060/

文章推荐： c# - 使用 ResourceLoader.GetString 方法检索键中带点的资源

文章推荐： c++ - MFC 扩展库(GUI/控件)推荐

文章推荐： c# - 将以@符号开头的JSON属性反序列化为C#动态对象？

dart - 额外/重定向默认路由时
我配置了我的RouteInitializer如下: class AppRouteInitializer implements RouteInitializer { init(Router rout
Android PendingIntent 额外
我正在尝试从 Android 应用程序发送短信。我正在使用 PendingIntent 以便我可以使用 Broadcast Receiver 检查它是否发送正常。由于 sendTextMessage
mongoDB使用投影剔除‘额外’字段的操作过程
目录简介 1 "额外"字段是什么 1.1 "额外"是指与业务无关 1.2 产生
java - android发送 Intent 额外
应用程序读取 JSON 数据。然后它会将其放入 ListView (正确)，但在按下某个项目后，我总是会得到显示的相同值。下面的代码我认为是问题所在，但我找不到。 try{ JSONArray
android - 覆盖 Intent 额外
我正在使用以下代码 (Kotlin) 创建通知 val builder = NotificationCompat.Builder(ctx) ........ .set
Java JPanel 自己稍微移动 + 额外
我有一个问题。现在我正在使用 3 个面板，mainPanel 和其他 2 个面板(btnPanel 和 iconPanel)。所以问题是当我按下“重置”按钮时，我删除了 iconPanel 并再次添加
css - 跨度，额外 margin
这是我的 html: Settings Export Import 和CSS: span.button { float:right; margin-righ
mongodb - Golang Mongodb %!(额外
我正在尝试将一个结构编码为 JSON，然后将其插入我的 Mongo 数据库，但不断出现此错误:%!(EXTRA main.Test={575590180 Me})。我究竟做错了什么？我完全从我从事的另
latex - !额外}，或者忘记\endgroup。 latex
嘿，我遇到了这些 latex 格式问题，有人可以提供一些帮助吗？ .tex 文件: \begin{table}{} \renewcommand{\arraystretch}{1.1} \c
java - fragment 作为可序列化 bundle 额外
我在 FragmentPagerAdapter 中使用了 Fragment 的 ArrayList。我想在 saveState() 中保存此 ArrayList 的状态，并在 restoreStat
Swift MapKit - 向注释添加(额外)属性？
我做了this MapKit-教程一切正常，但如何为我的 pin 添加额外的属性？这是我的课车: import Foundation import MapKit class Car: NSObje
android - Android 额外 Intent 的文档
关于 Android intent 将提供的附加功能有哪些文档？更新: 我做了一些进一步的调查。我知道我们可以假设每个 Intent 都不会解析任何数据或额外内容，除非有明确记录。此外，一些(但不是
python - SqlAlchemy TIMESTAMP 'on update' 额外
我在 python3.4.3 上使用 SqlAlchemy 来管理 MySQL 数据库。我正在创建一个表: from datetime import datetime from sqlalchemy
html - Bootstrap 1 额外 px in chrome
我正在使用 bootstrap 创建网页。我在两个 block (内容和标题)上派生了正文。在内容 block 中，我有 div 类 .container .sameTable 在里面我有 div 类
mingw - 构建无需 MinGW 和 MSYS 额外 dll 即可运行的程序
我在Windows 7上的MinGW和MSYS下使用gfortran构建了一些fortran程序。但是当我在未安装MinGW和MSYS的其他计算机上运行它们时，系统总是要求一些dll，例如libgfo
javascript - 使用 jQuery 将多个元素彼此相邻放置后，第一个元素的(额外)填充/边距是多少
第一个元素的右侧似乎有额外的间距，我不知道它是从哪里来的。有人可以帮助我吗？这是我使用的代码: http://jsfiddle.net/srabeat/tenx4y1c/1/ for (i = 0;
javascript - fs-额外 : Error: EPERM: operation not permitted, 取消链接
我使用 fs-extra 收到以下错误: ERROR { [Error: EPERM: operation not permitted, unlink 'C:\Projects\xxx\branche
c++ - AES-128 CBC 模式下加密流末尾的 Crypto++ 额外 block
我正在尝试在 CBC 模式下使用 AES-128 加密 320 字节的二进制数据，并将密码存储到一个文件中。输出文件应该是 320 字节，但我得到了 336 字节。这是我的代码: #include
java - 使用带有 Android 额外 header 的 URL 打开浏览器
我有一个特定的要求，我必须从我的 Activity 中触发浏览器上的 url。我可以使用以下代码执行此操作: Intent browserIntent = new Intent( Intent.A
php - JMS DI 额外 bundle : How do I inject a dependency only if it exists
我正在使用 JMS DI 注入(inject)带有注解的服务: use JMS\DiExtraBundle\Annotation as DI; /** * @DI\Service("foo.bar.

太空狗

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - 使用 Google 的 TensorFlow 添加额外的隐藏层