
python - Conceptual understanding of GradientTape.gradient


Background

In TensorFlow 2 there is a class called GradientTape. It is used to record operations on tensors, whose results can then be differentiated and fed to some minimization algorithm. For example, from the documentation we have this example:

x = tf.constant(3.0)
with tf.GradientTape() as g:
    g.watch(x)
    y = x * x
dy_dx = g.gradient(y, x)  # Will compute to 6.0

The docstring of the gradient method implies that the first argument can be not only a tensor but also a list of tensors:
def gradient(self,
             target,
             sources,
             output_gradients=None,
             unconnected_gradients=UnconnectedGradients.NONE):
  """Computes the gradient using operations recorded in context of this tape.

  Args:
    target: a list or nested structure of Tensors or Variables to be
      differentiated.
    sources: a list or nested structure of Tensors or Variables. `target`
      will be differentiated against elements in `sources`.
    output_gradients: a list of gradients, one for each element of
      target. Defaults to None.
    unconnected_gradients: a value which can either hold 'none' or 'zero' and
      alters the value which will be returned if the target and sources are
      unconnected. The possible values and effects are detailed in
      'UnconnectedGradients' and it defaults to 'none'.

  Returns:
    a list or nested structure of Tensors (or IndexedSlices, or None),
    one for each element in `sources`. Returned structure is the same as
    the structure of `sources`.

  Raises:
    RuntimeError: if called inside the context of the tape, or if called more
      than once on a non-persistent tape.
    ValueError: if the target is a variable or if unconnected gradients is
      called with an unknown value.
  """

In the first example above, it is easy to see that y, the target, is the function to be differentiated, and x is the variable with respect to which the "gradient" is taken.

From my limited experience, it seems that the gradient method returns a list of tensors, one for each element of sources, and each of these gradients is a tensor with the same shape as the corresponding member of sources.



This description of the behavior of gradient only makes sense if target contains a single 1x1 "tensor" to be differentiated, because mathematically a gradient vector should have the same dimensionality as the domain of the function.

However, if target is a list of tensors, the output of gradient still has the same shape. Why is this the case? If target is thought of as a list of functions, shouldn't the output resemble something like a Jacobian? How do I interpret this behavior conceptually?
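
To make the observation concrete, here is a small sketch I added (assuming TF 2.x): even with a list as target, the returned value has the structure and shape of sources, not a Jacobian-like structure.

import tensorflow as tf

x = tf.Variable([1.0, 2.0, 3.0])

with tf.GradientTape() as g:
    y1 = tf.reduce_sum(x * x)    # d(y1)/dx = 2*x
    y2 = tf.reduce_sum(3.0 * x)  # d(y2)/dx = 3

grads = g.gradient([y1, y2], x)  # list target, single source
print(grads.numpy())             # [5. 7. 9.] -- same shape as x, not a 2x3 Jacobian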

Best Answer

That is simply how tf.GradientTape().gradient() is defined. It has the same functionality as tf.gradients(), except that the latter cannot be used in eager mode. From the docs of tf.gradients():

It returns a list of Tensor of length len(xs) where each tensor is the sum(dy/dx) for y in ys



哪里 xssourcesystarget .

Example 1:

So let's say target = [y1, y2] and sources = [x1, x2]. The result will be:
[dy1/dx1 + dy2/dx1, dy1/dx2 + dy2/dx2]
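
A quick numeric check of this (my own sketch, using scalar variables for x1 and x2):

import tensorflow as tf

x1 = tf.Variable(2.0)
x2 = tf.Variable(3.0)

with tf.GradientTape() as g:
    y1 = x1 * x2  # dy1/dx1 = x2 = 3, dy1/dx2 = x1 = 2
    y2 = x1 * x1  # dy2/dx1 = 2*x1 = 4, dy2/dx2 = 0

grads = g.gradient([y1, y2], [x1, x2])
print([t.numpy() for t in grads])  # [7.0, 2.0] == [dy1/dx1 + dy2/dx1, dy1/dx2 + dy2/dx2]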

Example 2:

Gradients of a per-sample loss (a tensor) versus a reduced loss (a scalar):
Let w, b be two variables.
xentropy = [y1, y2]                 # tensor
reduced_xentropy = 0.5 * (y1 + y2)  # scalar
grads = [dy1/dw + dy2/dw, dy1/db + dy2/db]
reduced_grads = [d(reduced_xentropy)/dw, d(reduced_xentropy)/db]
              = [d(0.5 * (y1 + y2))/dw, d(0.5 * (y1 + y2))/db]
              == 0.5 * grads

A TensorFlow example of the snippet above:

import tensorflow as tf

print(tf.__version__)  # 2.1.0

inputs = tf.convert_to_tensor([[0.1, 0], [0.5, 0.51]])  # two two-dimensional samples
w = tf.Variable(initial_value=inputs)
b = tf.Variable(tf.zeros((2,)))
labels = tf.convert_to_tensor([0, 1])

def forward(inputs, labels, var_list):
    w, b = var_list
    logits = tf.matmul(inputs, w) + b
    xentropy = tf.nn.sparse_softmax_cross_entropy_with_logits(
        labels=labels, logits=logits)
    return xentropy

# `xentropy` has two elements (gradient of a tensor - one loss value
# per sample in the batch)
with tf.GradientTape() as g:
    xentropy = forward(inputs, labels, [w, b])
    reduced_xentropy = tf.reduce_mean(xentropy)
grads = g.gradient(xentropy, [w, b])
print(xentropy.numpy())  # [0.6881597  0.71584916]
print(grads[0].numpy())  # [[ 0.20586157 -0.20586154]
                         #  [ 0.2607238  -0.26072377]]

# `reduced_xentropy` is a scalar (gradient of a scalar)
with tf.GradientTape() as g:
    xentropy = forward(inputs, labels, [w, b])
    reduced_xentropy = tf.reduce_mean(xentropy)
grads_reduced = g.gradient(reduced_xentropy, [w, b])
print(reduced_xentropy.numpy())  # 0.70200443 <-- scalar
print(grads_reduced[0].numpy())  # [[ 0.10293078 -0.10293077]
                                 #  [ 0.1303619  -0.13036188]]

If you compute the loss (xentropy) for each element in the batch, the final gradient with respect to each variable will be the sum of the gradients of all samples in the batch (which makes sense).
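
For completeness, here is a sketch I added (not part of the original answer) showing that this "sum over the batch" behavior is exactly what you get by summing the rows of the Jacobian, which GradientTape.jacobian computes explicitly:

import tensorflow as tf

x = tf.Variable([[1.0, 2.0], [3.0, 4.0]])  # two samples, two features

with tf.GradientTape(persistent=True) as g:
    y = tf.reduce_sum(x * x, axis=1)       # per-sample "loss", shape (2,)

grad = g.gradient(y, x)  # shape (2, 2) -- same shape as x
jac = g.jacobian(y, x)   # shape (2, 2, 2) -- d y[i] / d x[j, k]

# Summing the Jacobian over the target dimension reproduces `gradient`.
print(tf.reduce_max(tf.abs(tf.reduce_sum(jac, axis=0) - grad)).numpy())  # 0.0
del g  # release the persistent tape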

Regarding python - Conceptual understanding of GradientTape.gradient, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/60665006/
