python - 无法弄清楚 pytorch 代码中的就地操作？-6ren

python - 无法弄清楚 pytorch 代码中的就地操作？

转载作者：行者123 更新时间：2023-11-30 09:03:54

27

4

我在 PyTorch 中有以下实现，用于使用 LSTM 进行学习:

https://gist.github.com/rahulbhadani/f1d64042cc5a80280755cac262aa48aa

但是，代码遇到就地操作错误

我的错误输出是:

/home/ivory/anaconda3/lib/python3.7/site-packages/ipykernel_launcher.py:10: UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.clone().detach() or sourceTensor.clone().detach().requires_grad_(True), rather than torch.tensor(sourceTensor).
  # Remove the CWD from sys.path while we load stuff.
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
<ipython-input-86-560ec78f2b64> in <module>
     27 linear = torch.nn.Linear(hidden_nums, output_dim)
     28 
---> 29 global_loss_list = global_training(lstm2)

<ipython-input-84-152890a3028c> in global_training(optimizee)
      3     adam_global_optimizer = torch.optim.Adam([{'params': optimizee.parameters()}, 
      4                                      {'params':linear.parameters()}], lr = 0.0001)
----> 5     _, global_loss_1 = learn2(LSTM_Optimizee, training_steps, retain_graph_flag=True, reset_theta=True)
      6 
      7     print(global_loss_1)

<ipython-input-83-0357a528b94d> in learn2(optimizee, unroll_train_steps, retain_graph_flag, reset_theta)
     43             # requires_grad=True. These are accumulated into x.grad for every
     44             # parameter x. In pseudo-code: x.grad += dloss/dx
---> 45             loss.backward(retain_graph = retain_graph_flag) #The default is False, when the optimized LSTM is set to True
     46 
     47             print('x.grad: {}'.format(x.grad))

~/anaconda3/lib/python3.7/site-packages/torch/tensor.py in backward(self, gradient, retain_graph, create_graph)
    116                 products. Defaults to ``False``.
    117         """
--> 118         torch.autograd.backward(self, gradient, retain_graph, create_graph)
    119 
    120     def register_hook(self, hook):

~/anaconda3/lib/python3.7/site-packages/torch/autograd/__init__.py in backward(tensors, grad_tensors, retain_graph, create_graph, grad_variables)
     91     Variable._execution_engine.run_backward(
     92         tensors, grad_tensors, retain_graph, create_graph,
---> 93         allow_unreachable=True)  # allow_unreachable flag
     94 
     95 

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [1, 10]] is at version 1; expected version 0 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

我试图追踪错误，但没有成功。在这方面的任何帮助将不胜感激。

谢谢。

最佳答案

我认为问题在于以下行:

global_loss_list.append(global_loss.detach_())

PyTorch 中就地操作的惯例是在函数名称末尾使用 _ (如 detach_ 中所示)。我相信你不应该就地分离。换句话说，将 detach_ 更改为 detach

关于python - 无法弄清楚 pytorch 代码中的就地操作？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/57500582/

27

4

0

文章推荐： python - 预期类型错误 : numpy. ndarray 或 cuda.ndarray

文章推荐： python - 是否可以在不同的 Keras 回调中设置值？

文章推荐： tensorflow - 低密度区域中神经网络的回归精度

c++ - 对一系列数字进行排序 - 就地
我正在尝试解决以下问题: We are given an array containing ‘n’ objects. Each object, when created, was assigned a
arrays - 将单个变量分配给数组(就地)
考虑以下代码: a=(1 2 3) a='seven' export a declare -p a 输出(来自declare)是: declare -ax a='([0]="seven" [1]="2
python - 列表转换 - 就地
我正在尝试将 ['1','2','3','4'] 转换为 [1,2,3,4]我想就地进行此转换。有可能做到吗？如果不是，最佳解决方案是什么。最佳答案我觉得用map比较好对于这类任务。这会创建迭代器
c - 从字符串中删除多余的空格(就地)
好的，所以我之前发布了关于尝试(没有任何预建函数)删除额外空间的信息 "this is a test"会回来的 "this is a test" Remove spaces from a strin
grails - Grails内联(就地)插件无法解决依赖关系
我有一个名为Media的插件，该插件应负责图像大小调整等工作。它具有以下依赖性: dependencies { compile group: 'org.ccil.cowan.tagsoup'
C - 在不使用内存(就地)的情况下读取然后写入文件？
我需要将一个大字符串向左“移动”X 个空格。它太大了，无法放入内存，所以我需要就地做。我需要使用最少量的系统调用来完成此操作。我知道我可以使用缓冲区并重用内存来最大限度地减少内存消耗，然后使用 fs
c++ - 就地 Cholesky 逆
我想知道是否可以在不需要临时数组的情况下通过 Cholesky 分解获得矩阵的逆。截至目前，我可以在不使用临时数组的情况下进行 cholesky 分解，但从那里我还没有想出一种方法来获得原始矩阵的逆矩
javascript - 就地 Javascript 编辑插件
是否有任何用于 Javascript 的就地编辑插件..像 firebug 之类的东西，它对即时 CSS 编辑和预览非常有用，但不允许就地 JS 编辑..那么，有没有我们可以立即更新和更新的工具或插件
java - 将链表的所有其他元素(就地)移动到java中链表的末尾
题目如下:给定一个 linked list，将备用 indices 移到 list 的后面例如: input: : [0] -> [1] -> [2] -> [3] -> [4]
c++ - 就地 std::copy_if
在我看来，std::copy_if 对于过滤容器非常有用: std::vector vec { 1, 2, 3, 4 }; auto itEnd = std::copy_if(vec.begin(),
c++ - 就地 C++ 设置交集
在 C++ 中相交两个集合的标准方法是执行以下操作: std::set set_1; // With some elements std::set set_2; // With some othe
python - Python中的“就地”字符串修改
在 Python 中，字符串是不可变的。逐个字符遍历字符串并对其进行修改的标准习语是什么？我能想到的唯一方法是一些与加入结果字符串相关的真正臭名昭著的黑客攻击。 -- 在 C 中: for(i
scala 移除(就地)ListBuffer 中满足条件的所有元素
我有一个 ListBuffer。我想删除满足特定条件的所有元素。我可以迭代它并删除每个元素。但是 Scala 对改变你正在迭代的列表有什么看法呢？它会起作用，还是会删除错误的元素/不返回所有元素？
r - rbind 的内存高效替代方案 - 就地 rbind？
我需要重新绑定(bind)两个大数据帧。现在我用的是 df 根据 nikola 的评论，这里是 ?rbindlist 的描述(v1.8.2 中的新增功能): Same as do.call("rbi
python - 如何使用 python 复制字符串中的字符(就地)？
关闭。此题需要details or clarity 。目前不接受答案。想要改进这个问题吗？通过 editing this post 添加详细信息并澄清问题. 已关闭 3 年前。 Improve th
c++ - 就地 Win32 ListView 编辑总是被取消
我在带有 LVS_EDITLABELS 的无模式 Win32 对话框中有一个小图标模式的 ListView 放。无论编辑是通过单击鼠标还是通过调用 ListView_LabelEdit() 以编程方式
java - 如何在 Java 中交换字符数组(就地)
所以基本上不能/允许创建一个新数组。除了实际更改和操作当前数组外，无法返回任何内容。您如何获取字符数组并简单地翻转/反转它们。 Starting array: ['P','e','r','f','e'
c++ - 对 vector 的一部分进行排序(就地)
我不明白为什么下面的代码没有对 vector 的前两个元素进行排序: int main() { std::vector v = {2,1,3,1,2}; std::sort(v.beg
Python - 取每个第 n 个元素(就地)
我有以下(简化的)代码: a = a[::3] b = b[::3] c = c[::3] d = d[::3] a,b,c,d,其实都是很复杂的表达式，所以我想这样写: for l in [a, b
c# - 就地 reshape c# 数组
可以对数组进行不依赖于数组秩的操作。迭代器也不总是合适的解决方案。给定数组 double[,] myarray = new double[10,5]; 实现以下工作流程是可取的: 将 Rank>1 的

首页

博学

6Ren·AI

商城

python - 无法弄清楚 pytorch 代码中的就地操作？