
tensorflow - How does TensorFlow know which variables to change for optimization?


Code taken from: http://adventuresinmachinelearning.com/python-tensorflow-tutorial/

import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data
mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)
# Python optimisation variables
learning_rate = 0.5
epochs = 10
batch_size = 100

# declare the training data placeholders
# input x - for 28 x 28 pixels = 784
x = tf.placeholder(tf.float32, [None, 784])
# now declare the output data placeholder - 10 digits
y = tf.placeholder(tf.float32, [None, 10])
# now declare the weights connecting the input to the hidden layer
W1 = tf.Variable(tf.random_normal([784, 300], stddev=0.03), name='W1')
b1 = tf.Variable(tf.random_normal([300]), name='b1')
# and the weights connecting the hidden layer to the output layer
W2 = tf.Variable(tf.random_normal([300, 10], stddev=0.03), name='W2')
b2 = tf.Variable(tf.random_normal([10]), name='b2')
# calculate the output of the hidden layer
hidden_out = tf.add(tf.matmul(x, W1), b1)
hidden_out = tf.nn.relu(hidden_out)
# now calculate the output of the output layer - in this case, let's use a softmax
# activated output layer
y_ = tf.nn.softmax(tf.add(tf.matmul(hidden_out, W2), b2))
y_clipped = tf.clip_by_value(y_, 1e-10, 0.9999999)
cross_entropy = -tf.reduce_mean(tf.reduce_sum(y * tf.log(y_clipped)
                                              + (1 - y) * tf.log(1 - y_clipped), axis=1))
# add an optimiser
optimiser = tf.train.GradientDescentOptimizer(learning_rate=learning_rate).minimize(cross_entropy)
# finally setup the initialisation operator
init_op = tf.global_variables_initializer()

# define an accuracy assessment operation
correct_prediction = tf.equal(tf.argmax(y, 1), tf.argmax(y_, 1))
accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))
# start the session
with tf.Session() as sess:
    # initialise the variables
    sess.run(init_op)
    total_batch = int(len(mnist.train.labels) / batch_size)
    for epoch in range(epochs):
        avg_cost = 0
        for i in range(total_batch):
            batch_x, batch_y = mnist.train.next_batch(batch_size=batch_size)
            _, c = sess.run([optimiser, cross_entropy],
                            feed_dict={x: batch_x, y: batch_y})
            avg_cost += c / total_batch
        print("Epoch:", (epoch + 1), "cost =", "{:.3f}".format(avg_cost))
    print(sess.run(accuracy, feed_dict={x: mnist.test.images, y: mnist.test.labels}))

I would like to ask how TensorFlow identifies the parameters it needs to optimize. In the code above, W1, W2, b1 and b2 need to be optimized, but we never specified that anywhere. We did ask the GradientDescentOptimizer to minimize cross_entropy, but we never told it that it has to change the values of W1, W2, b1 and b2 to do so. So how does it know which parameters cross_entropy depends on?

Best Answer

Cory Nezin's answer is only partially correct and could lead to wrong assumptions!

You actually do specify which parameters are optimized (i.e. which ones are trainable), namely by doing the following:

# now declare the weights connecting the input to the hidden layer
W1 = tf.Variable(tf.random_normal([784, 300], stddev=0.03), name='W1')
b1 = tf.Variable(tf.random_normal([300]), name='b1')
# and the weights connecting the hidden layer to the output layer
W2 = tf.Variable(tf.random_normal([300, 10], stddev=0.03), name='W2')
b2 = tf.Variable(tf.random_normal([10]), name='b2')
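
To make this explicit, here is a minimal sketch (reusing only the W1/b1 declarations quoted above) showing that every tf.Variable created with the default trainable=True is registered in the GraphKeys.TRAINABLE_VARIABLES collection, which is exactly what minimize() falls back to when no explicit var_list is passed:

import tensorflow as tf

W1 = tf.Variable(tf.random_normal([784, 300], stddev=0.03), name='W1')
b1 = tf.Variable(tf.random_normal([300]), name='b1')
# every trainable tf.Variable lands in the TRAINABLE_VARIABLES collection
print([v.name for v in tf.trainable_variables()])
# prints something like ['W1:0', 'b1:0']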

In short, TensorFlow will only update tf.Variables. If you use something like tf.Variable(..., trainable=False), you will not get any updates for it, no matter what "the network depends on". You still specify it, and the network still propagates through that part, but you will never receive any updates for that particular variable.
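
As an illustration, here is a minimal sketch (the names W_frozen and W_train are made up for this example) of a variable that takes part in the forward pass but is excluded from the updates via trainable=False:

import tensorflow as tf

h = tf.placeholder(tf.float32, [None, 300])
# W_frozen is used in the forward pass, but trainable=False keeps it
# out of tf.trainable_variables(), so the optimiser never touches it
W_frozen = tf.Variable(tf.random_normal([300, 10]), trainable=False, name='W_frozen')
W_train = tf.Variable(tf.random_normal([300, 10]), name='W_train')
loss = tf.reduce_mean(tf.square(tf.matmul(h, W_frozen) + tf.matmul(h, W_train)))
# minimize() only creates update ops for W_train; W_frozen keeps its initial value
train_op = tf.train.GradientDescentOptimizer(0.5).minimize(loss)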

Cory's answer is correct in that the network automatically identifies which values to update, but you must first define which values it is allowed to update!
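
Roughly speaking, and only as a sketch of the idea rather than TensorFlow's actual internal code, minimize(cross_entropy) in the question's graph boils down to: collect the trainable variables, differentiate the loss with respect to them by walking the graph backwards, and apply the gradients (this snippet reuses cross_entropy and learning_rate from the question's code):

var_list = tf.trainable_variables()            # [W1, b1, W2, b2] in the question's graph
grads = tf.gradients(cross_entropy, var_list)  # backprop from the loss to each variable
optimiser = tf.train.GradientDescentOptimizer(learning_rate=learning_rate)
train_op = optimiser.apply_gradients(list(zip(grads, var_list)))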

Regarding "tensorflow - How does TensorFlow know which variables to change for optimization?", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/51757209/
