tensorflow - Why does the tf.contrib.layers.instance_norm layer contain a StopGradient operation?


Reposted by: 行者123. Last updated: 2023-12-05 06:57:53


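Why does the tf.contrib.layers.instance_norm layer contain a StopGradient operation? In other words, what is it needed for?

StopGradient seems to be present even in the simpler tf.nn.moments (which can serve as a building block of tf.contrib.layers.instance_norm).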

x_m, x_v = tf.nn.moments(x, [1, 2], keep_dims=True)

I also found a comment about StopGradient in the tf.nn.moments source code:

# The dynamic range of fp16 is too limited to support the collection of
# sufficient statistics. As a workaround we simply perform the operations
# on 32-bit floats before converting the mean and variance back to fp16
y = math_ops.cast(x, dtypes.float32) if x.dtype == dtypes.float16 else x
# Compute true mean while keeping the dims for proper broadcasting.
mean = math_ops.reduce_mean(y, axes, keepdims=True, name="mean")
# sample variance, not unbiased variance
# Note: stop_gradient does not change the gradient that gets
#       backpropagated to the mean from the variance calculation,
#       because that gradient is zero
variance = math_ops.reduce_mean(
    math_ops.squared_difference(y, array_ops.stop_gradient(mean)),
    axes,
    keepdims=True,
    name="variance")

So is this just an optimization, given that the gradient is always zero?

Best answer

An attempt at an answer.

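This design tells us that, when minimizing the second moment, we do not want to propagate gradients through the first moment. Does that make sense? If we try to minimize E[x^2] - E[x]^2, we minimize E[x^2] while simultaneously maximizing E[x]^2. The first term reduces the absolute value of every element (dragging them toward the center). The second term increases all values along its gradient, which does nothing to minimize the variance but may negatively affect other gradient paths.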
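
To make the source comment ("that gradient is zero") concrete, here is a short derivation. It is only a sketch in notation introduced here for illustration: N is the number of elements reduced over, m is the mean, and v is the sample variance as computed in the tf.nn.moments snippet quoted in the question.

```latex
v = \frac{1}{N}\sum_{i=1}^{N}(x_i - m)^2,
\qquad
m = \frac{1}{N}\sum_{j=1}^{N} x_j

% Partial derivative of v with respect to the mean, evaluated at the true mean:
\frac{\partial v}{\partial m}
  = -\frac{2}{N}\sum_{i=1}^{N}(x_i - m)
  = -2\left(\frac{1}{N}\sum_{i=1}^{N} x_i - m\right)
  = 0

% Total derivative with respect to one element x_k:
\frac{\mathrm{d} v}{\mathrm{d} x_k}
  = \frac{2}{N}(x_k - m)
  + \underbrace{\frac{\partial v}{\partial m}}_{=\,0}\cdot\frac{1}{N}
  = \frac{2}{N}(x_k - m)
```

The same result follows from the E[x^2] - E[x]^2 form: the derivative with respect to x_k is 2 x_k / N - 2 m / N. Under this reading, stop_gradient leaves the backpropagated value unchanged and only removes a path whose contribution is zero.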

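Therefore, we do not propagate the gradient of the second moment through the first moment, because that gradient has no effect on the second moment, at least when using plain SGD.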
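
The claim can also be checked numerically. The following is a minimal sketch in plain Python rather than TensorFlow: the helper names (`variance`, `grad_with_stop_gradient`, `grad_full_numeric`) and the sample values are hypothetical and chosen only for illustration. It compares the analytic gradient obtained when the mean is held constant (the stop_gradient case) with a finite-difference gradient of the full variance, where the mean is allowed to move with the input.

```python
def mean(xs):
    return sum(xs) / len(xs)


def variance(xs):
    """Sample (biased) variance, the same quantity tf.nn.moments returns."""
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)


def grad_with_stop_gradient(xs):
    """Analytic gradient of the variance when the mean is treated as a constant."""
    m = mean(xs)
    n = len(xs)
    return [2.0 * (x - m) / n for x in xs]


def grad_full_numeric(xs, eps=1e-6):
    """Central finite differences of the full variance (the mean moves with x)."""
    grads = []
    for k in range(len(xs)):
        hi = list(xs)
        lo = list(xs)
        hi[k] += eps
        lo[k] -= eps
        grads.append((variance(hi) - variance(lo)) / (2 * eps))
    return grads


xs = [0.5, -1.0, 2.0, 3.5]  # arbitrary illustrative values
g_stop = grad_with_stop_gradient(xs)
g_full = grad_full_numeric(xs)

# Largest disagreement between the two gradients; expected to be at rounding level.
max_diff = max(abs(a - b) for a, b in zip(g_stop, g_full))
print(max_diff)
```

If the two gradients agree up to rounding error, that supports the reading that stop_gradient here does not alter the result and mainly prunes a zero-valued branch from the backward graph.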

Regarding "tensorflow - Why does the tf.contrib.layers.instance_norm layer contain a StopGradient operation?", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/64776769/
