tensorflow - tf.tape.gradient() returns None for some losses


I am trying to figure out why tf.GradientTape().gradient sometimes returns None, so I tried the three loss functions below (mmd0(), mmd1(), mmd2()). Although their formats and return values differ somewhat, the gradients returned for mmd0, mmd1 and mmd2 are all None. I printed the losses from the three functions; can anyone explain why this happens?

def mmd0(x, y):  # x and y are lists of arbitrary length
    return x

def mmd1(x1, x2):  # x1 and x2 are lists of arbitrary length
    dis = sum([x**2 for x in x1])/len(x1) - sum([x**2 for x in x2])/len(x2)
    return dis**2

def mmd2(x, y):
    dis = x - y
    return [tf.convert_to_tensor(elem) for elem in dis]

def get_MMD_norm(errors, sigma=0.1):
    x2 = np.random.normal(0, sigma, len(errors))
    loss0 = mmd0(errors, x2)
    loss1 = mmd1(errors, x2)
    loss2 = mmd2(errors, x2)
    print("loss0:", loss0)
    print("loss1:", loss1)
    print("loss2:", loss2)
    return tf.cast(loss2, tf.float32)

def loss(model, x, y, sigma=0.1):
    y_ = model(x)  # y_.shape is (batch_size, 3) for the Iris dataset
    losses = []
    loss_object = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
    for i in range(y.shape[0]):
        loss = loss_object(y_true=y[i], y_pred=y_[i])
        losses.append(loss)
    batch_loss = get_MMD_norm(losses)
    single_losses_list = [loss.numpy() for loss in losses]
    return tf.convert_to_tensor(batch_loss, dtype=np.float32), single_losses_list

def grad(model, inputs, targets, sigma=0.1):
    with tf.GradientTape() as tape:
        tape.watch(model.trainable_variables)
        batch_loss, single_losses = loss(model, inputs, targets, sigma=0.1)
    return tape.gradient(batch_loss, model.trainable_variables), batch_loss, single_losses

grads, batch_loss, single_losses = grad(model, features, labels)
print("grads:", grads)
print("batch_loss:", batch_loss)
##########################################################
loss0: [<tf.Tensor: id=39621, shape=(), dtype=float32, numpy=2.1656876>, <tf.Tensor: id=39659, shape=(), dtype=float32, numpy=2.057112>, <tf.Tensor: id=39697, shape=(), dtype=float32, numpy=2.2769136>, <tf.Tensor: id=39735, shape=(), dtype=float32, numpy=2.0263004>, <tf.Tensor: id=39773, shape=(), dtype=float32, numpy=2.1568372>, <tf.Tensor: id=39811, shape=(), dtype=float32, numpy=0.7392154>, <tf.Tensor: id=39849, shape=(), dtype=float32, numpy=0.7742219>, <tf.Tensor: id=39887, shape=(), dtype=float32, numpy=2.2176154>, <tf.Tensor: id=39925, shape=(), dtype=float32, numpy=1.0187237>, <tf.Tensor: id=39963, shape=(), dtype=float32, numpy=2.160415>, <tf.Tensor: id=40001, shape=(), dtype=float32, numpy=0.80997854>, <tf.Tensor: id=40039, shape=(), dtype=float32, numpy=0.70803094>, <tf.Tensor: id=40077, shape=(), dtype=float32, numpy=0.8207226>, <tf.Tensor: id=40115, shape=(), dtype=float32, numpy=0.82957774>, <tf.Tensor: id=40153, shape=(), dtype=float32, numpy=0.88732547>, <tf.Tensor: id=40191, shape=(), dtype=float32, numpy=0.90633464>, <tf.Tensor: id=40229, shape=(), dtype=float32, numpy=0.7932346>, <tf.Tensor: id=40267, shape=(), dtype=float32, numpy=2.1767666>, <tf.Tensor: id=40305, shape=(), dtype=float32, numpy=0.80166155>, <tf.Tensor: id=40343, shape=(), dtype=float32, numpy=0.7831647>, <tf.Tensor: id=40381, shape=(), dtype=float32, numpy=0.77431095>, <tf.Tensor: id=40419, shape=(), dtype=float32, numpy=0.82067406>, <tf.Tensor: id=40457, shape=(), dtype=float32, numpy=0.74510425>, <tf.Tensor: id=40495, shape=(), dtype=float32, numpy=2.1666338>, <tf.Tensor: id=40533, shape=(), dtype=float32, numpy=0.7922478>, <tf.Tensor: id=40571, shape=(), dtype=float32, numpy=0.73235756>, <tf.Tensor: id=40609, shape=(), dtype=float32, numpy=2.1792874>, <tf.Tensor: id=40647, shape=(), dtype=float32, numpy=0.919183>, <tf.Tensor: id=40685, shape=(), dtype=float32, numpy=0.761979>, <tf.Tensor: id=40723, shape=(), dtype=float32, numpy=2.1664479>, <tf.Tensor: id=40761, shape=(), dtype=float32, numpy=0.77892226>, <tf.Tensor: id=40799, shape=(), dtype=float32, numpy=0.99058735>]
loss1: tf.Tensor(4.158007, shape=(), dtype=float32)
loss2: [<tf.Tensor: id=40935, shape=(), dtype=float64, numpy=2.325676997771268>, <tf.Tensor: id=40936, shape=(), dtype=float64, numpy=1.9988182000798667>, <tf.Tensor: id=40937, shape=(), dtype=float64, numpy=2.303379813455908>, <tf.Tensor: id=40938, shape=(), dtype=float64, numpy=2.0615775258879356>, <tf.Tensor: id=40939, shape=(), dtype=float64, numpy=2.2949723624257774>, <tf.Tensor: id=40940, shape=(), dtype=float64, numpy=0.7019287657319235>, <tf.Tensor: id=40941, shape=(), dtype=float64, numpy=0.8522054859739794>, <tf.Tensor: id=40942, shape=(), dtype=float64, numpy=2.0819949907118125>, <tf.Tensor: id=40943, shape=(), dtype=float64, numpy=1.065878291073558>, <tf.Tensor: id=40944, shape=(), dtype=float64, numpy=2.1225998300026805>, <tf.Tensor: id=40945, shape=(), dtype=float64, numpy=0.9485520218242218>, <tf.Tensor: id=40946, shape=(), dtype=float64, numpy=0.7221746903906889>, <tf.Tensor: id=40947, shape=(), dtype=float64, numpy=0.9985009994522388>, <tf.Tensor: id=40948, shape=(), dtype=float64, numpy=0.9143119687525019>, <tf.Tensor: id=40949, shape=(), dtype=float64, numpy=0.9230117922853999>, <tf.Tensor: id=40950, shape=(), dtype=float64, numpy=1.0220225043292934>, <tf.Tensor: id=40951, shape=(), dtype=float64, numpy=0.8735972169951878>, <tf.Tensor: id=40952, shape=(), dtype=float64, numpy=2.1279260795512753>, <tf.Tensor: id=40953, shape=(), dtype=float64, numpy=0.9597649765787801>, <tf.Tensor: id=40954, shape=(), dtype=float64, numpy=0.8338326272407959>, <tf.Tensor: id=40955, shape=(), dtype=float64, numpy=0.6674084331022461>, <tf.Tensor: id=40956, shape=(), dtype=float64, numpy=0.8679296826013285>, <tf.Tensor: id=40957, shape=(), dtype=float64, numpy=0.8174893483228802>, <tf.Tensor: id=40958, shape=(), dtype=float64, numpy=2.212290299049252>, <tf.Tensor: id=40959, shape=(), dtype=float64, numpy=0.7304098620074719>, <tf.Tensor: id=40960, shape=(), dtype=float64, numpy=0.8463413221121661>, <tf.Tensor: id=40961, shape=(), dtype=float64, numpy=2.3081013094190443>, <tf.Tensor: id=40962, shape=(), dtype=float64, numpy=1.0314178020997722>, <tf.Tensor: id=40963, shape=(), dtype=float64, numpy=0.774951045805575>, <tf.Tensor: id=40964, shape=(), dtype=float64, numpy=2.127838465488091>, <tf.Tensor: id=40965, shape=(), dtype=float64, numpy=0.909498425717612>, <tf.Tensor: id=40966, shape=(), dtype=float64, numpy=1.0217239989370837>]
grads: [None, None, None, None, None, None]
batch_loss: tf.Tensor(
[2.325677 1.9988182 2.3033798 2.0615776 2.2949724 0.7019288
0.8522055 2.081995 1.0658783 2.1225998 0.948552 0.7221747
0.998501 0.91431195 0.9230118 1.0220225 0.8735972 2.127926
0.95976496 0.8338326 0.6674084 0.8679297 0.8174893 2.2122903
0.73040986 0.8463413 2.3081014 1.0314178 0.77495104 2.1278384
0.90949845 1.021724 ], shape=(32,), dtype=float32)

Best answer

Have you seen this answer? I think I had a similar problem, and I believe yours may be related to mine. It comes down to a loss being computed at some step in the pipeline where the tensor of interest gets "lost" between the start and the end of the tape. The referenced answer points out that the original poster had a spot where a numpy array was returned instead of a TensorFlow tensor, which prevented GradientTape from computing the gradient.

I may be wrong, since I am far from a TensorFlow expert, but this is the issue I kept seeing while searching for a solution to a similar problem.
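
As a minimal sketch of that failure mode (my own illustration, not the poster's exact code): anything that round-trips through numpy produces values with no connection to the watched variables, so the tape returns None. In the posted code the x - y step in mmd2 appears to do exactly that, since x2 is a numpy array, and tf.convert_to_tensor then rebuilds disconnected tensors. The hypothetical get_MMD_norm_tf below shows one way to keep the same elementwise computation inside TensorFlow ops:

    import numpy as np
    import tensorflow as tf

    w = tf.Variable(3.0)

    with tf.GradientTape(persistent=True) as tape:
        loss_tf = w * w                                  # pure TF op: stays on the tape
        loss_np = tf.convert_to_tensor(loss_tf.numpy())  # round-trips through numpy: leaves the tape

    print(tape.gradient(loss_tf, w))   # tf.Tensor(6.0, shape=(), dtype=float32)
    print(tape.gradient(loss_np, w))   # None -- the numpy detour broke the gradient chain

    # Hedged rewrite mirroring the loss2 = errors - x2 branch of the poster's code,
    # but built entirely from TF ops so the result stays differentiable w.r.t. the model.
    def get_MMD_norm_tf(errors, sigma=0.1):
        errors = tf.stack(errors)                        # list of scalar loss tensors -> 1-D tensor
        x2 = tf.constant(np.random.normal(0, sigma, errors.shape[0]), dtype=errors.dtype)
        return errors - x2                               # elementwise TF op, connected to the tape

With a batch_loss built this way, tape.gradient(batch_loss, model.trainable_variables) sums over the batch dimension and should return real gradients instead of a list of None.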

Regarding "tensorflow - tf.tape.gradient() returns None for some losses", there is a similar question on Stack Overflow: https://stackoverflow.com/questions/56858378/
