Storing a struct as a blob of data breaks with some optimization passes(将结构存储为BLOB数据会中断某些优化过程)-6ren

Storing a struct as a blob of data breaks with some optimization passes(将结构存储为BLOB数据会中断某些优化过程)

转载作者：bug小助手更新时间：2023-10-25 14:11:59

I'm writing a compiler with LLVM as backend. Recently I turned on the optimizations, and saw my programs break in strange ways. I managed to boil things down to a minimal code, and set of optimization passes, that reproduce the problem. Here's the code:

我正在编写一个以LLVM为后端的编译器。最近，我打开了优化，看到我的程序以奇怪的方式崩溃。我设法将事情归结为最少的代码和一组重现问题的优化过程。代码如下：

define i1 @main() {
entry:
  ; Allocate space on the stack for a record
  %record = alloca { i8, i64, float }, align 8

  ; Store 1.0 in the third field of the record
  %value = getelementptr inbounds { i8, i64, float }, { i8, i64, float }* %record, i32 0, i32 2
  store float 1.000000e+00, float* %value, align 4

  ; Cast the record's location as a [3 x i64] blob, and load it
  %tmpptr = bitcast { i8, i64, float }* %record to [3 x i64]*
  %tmp = load [3 x i64], [3 x i64]* %tmpptr, align 4

  ; Store that blob on the stack
  %blob = alloca [3 x i64], align 8
  store [3 x i64] %tmp, [3 x i64]* %blob, align 4

  ; Load `value` in blob (the third field)
  %record2 = bitcast [3 x i64]* %blob to { i8, i64, float }*
  %value2ptr = getelementptr inbounds { i8, i64, float }, { i8, i64, float }* %record2, i32 0, i32 2
  %value2 = load float, float* %value2ptr, align 4

  ; Check that value is `1.0`
  %eq = fcmp oeq float %value2, 1.000000e+00

  ; Without optimization, returns 1 (true). With optimization, returns 0 (false)
  ret i1 %eq
}

And here are the command I run to execute it, without and with optimizations:

下面是我运行来执行它的命令，没有经过优化，也有经过优化：

$ cat program.ll | opt | llc --filetype=obj -o /tmp/program.o && clang /tmp/program.o -o a.out && ./a.out
$ echo $?
1
$ cat program.ll | opt --instcombine --gvn | llc --filetype=obj -o /tmp/program.o && clang /tmp/program.o -o a.out && ./a.out
$ echo $?
0

Also, here's the version of LLVM I use:

另外，以下是我使用的LLVM版本：

$ llc --version
LLVM (http://llvm.org/):
  LLVM version 14.0.6
  Optimized build.
  Default target: x86_64-unknown-linux-gnu
  Host CPU: tigerlake

...

It seems like the part that breaks is loading then storing the blob of data on the stack, then reading the value from there. If I only do two bitcasts in a row, without store/load, the problem vanishes.

看起来中断的部分是加载，然后将数据块存储在堆栈上，然后从那里读取值。如果我只连续进行两个位广播，而不存储/加载，这个问题就会消失。

Note that I left the first two fields of the record undefined for terseness, but if you do write data to them, the problem remains. Also, if you remove either the i8 or the i64 field, the problem disappears.

请注意，为了简洁起见，我没有定义记录的前两个字段，但如果您确实向它们写入数据，问题仍然存在。此外，如果删除i8或i64字段，问题也会消失。

I also noticed that if I manually pad the structure as { i8, i8, i16, i32, i64, float, i32 }, the problem also disappears. I don't understand why, however, because I computed the size of the array to be [3 x i64] based on the store size of the struct, which LLVM tells me is 24 bytes on my machine.

我还注意到，如果我手动将结构填充为{i8，i8，i16，i32，i64，Float，I32}，问题也会消失。然而，我不明白为什么，因为我根据结构的存储大小计算出数组的大小为[3xi64]，LLVM告诉我在我的机器上是24字节。

Looking a the IR generated by the optimization pass, it seems like it stores the float value in the 2nd i64 location in the array, instead of the 3rd. I cannot understand why. I imagine something in this code is undefined behavior, but I have no idea what.

从优化过程生成的IR来看，它似乎将浮点值存储在数组中的第二个i64位置，而不是第三个位置。我不明白为什么。我想这段代码中有一些东西是未定义的行为，但我不知道是什么。

更多回答

Haven't the time to really look at this, but in general, running the module verifier points to almost all such problems. It leaves the solution as an exercise for the readers, though ;)

我没有时间真正考虑这一点，但总的来说，运行模块验证器会指出几乎所有这样的问题。然而，它将解决方案留给读者作为练习；)

Thanks for the tip! Unfortunately, running the module verifier yields no results :/

谢谢你告诉我!遗憾的是，运行模块验证器没有产生任何结果：/

Shocking. Try -print-after-all to see which pass does the bad change, perhaps? I'll try to have a look when I'm back from vacation on Tuesday. No promises.

令人震惊。尝试打印之后，看看糟糕的传球可能会改变哪一种传球？当我星期二度假回来时，我会试着去看看。不能保证。

Thank you. It seems like the --instcombine pass is causing issues, essentially by inferring that the 3rd i64 in the array is unused, and that the float is in the 2nd i64. I don't understand why, which troubles me. For my own purposes, I managed to fix this in my compiler by replacing the store with a memcpy. I'm still however really curious to know what the problem might be, if you end up finding the time for this.

谢谢。似乎--instCombine传递引起了问题，主要是通过推断数组中的第三个i64未使用，而浮点数位于第二个i64。我不明白为什么，这让我很困扰。出于我自己的目的，我设法在我的编译器中修复了这个问题，用一个Memcpy替换了存储。然而，我仍然非常好奇地想知道问题可能是什么，如果你最终找到时间来做这个的话。

优秀答案推荐

更多回答

python - Azure Blob 存储 - 可以列出 blob，但不能删除 blob
我正在尝试从 Azure 容器中删除 blob。我能够连接到它并列出此问题中代码后面的所有 blob:Upload and Delete Azure Storage Blob using azure-
python - Azure Blob 存储 - 可以列出 blob，但不能删除 blob
我正在尝试从 Azure 容器中删除 blob。我能够连接到它并列出此问题中代码后面的所有 blob:Upload and Delete Azure Storage Blob using azure-
Azure blob 错误 :The specified blob does not exist, 但 Blob 存在
运行我的 azure 函数(用于读取 azure blob 存储)后出现错误。错误是 ID 0dad768d-36d4-4c1a-85ae-2a5122533b3c fail: Func
Azure blob 错误 :The specified blob does not exist, 但 Blob 存在
运行我的 azure 函数(用于读取 azure blob 存储)后出现错误。错误是 ID 0dad768d-36d4-4c1a-85ae-2a5122533b3c fail: Func
c# - Azure Blob 存储 - 上传 Blob 后如何获取 Blob 存储 ID？
我正在使用 C# 控制台应用程序 (.NET Core 3.1) 从 Azure Blob 存储读取大量图像文件并生成这些图像的缩略图。新图像将保存回 Azure，并将 Blob ID 存储在我们的数
c# - 如何使用 Azure.Storage.Blobs BlobClient 检索 Blob 目录路径中的 Blob？
我没有在网上看到任何有关如何获取位于 BlobContainerClient 内特定目录内的所有 blob 的示例。以前，我使用的是 Microsoft.Azure.Storage 软件包，但这些软
c# - Azure Blob 存储 - 上传 Blob 后如何获取 Blob 存储 ID？
我正在使用 C# 控制台应用程序 (.NET Core 3.1) 从 Azure Blob 存储读取大量图像文件并生成这些图像的缩略图。新图像将保存回 Azure，并将 Blob ID 存储在我们的数
c# - 如何使用 Azure.Storage.Blobs BlobClient 检索 Blob 目录路径中的 Blob？
我没有在网上看到任何有关如何获取位于 BlobContainerClient 内特定目录内的所有 blob 的示例。以前，我使用的是 Microsoft.Azure.Storage 软件包，但这些软
javascript - 如何使用 Azure Blob 服务将 Blob 上传到 Azure Blob 存储
我正在编写一些代码，允许用户使用麦克风录制自己的声音，然后将录音上传到 Azure Blob 存储。为了录制音频，我使用类似于下面的代码 let recordedBlobs = []; this.m
azure - Golang azure blob 存储，0b blob 并覆盖下载的 blob 数据
当前使用:https://github.com/Azure/azure-sdk-for-go 概述:我当前正在从 azure blob 存储中下载一个 blob，解析该 blob，然后将转录的 blo
blob - 二进制文件和 BLOB 之间的区别
正在观看 this video about how to design Tinder ，在 06:50 提出了关于文件与 BLOBS 的观点。我想知道大二进制文件和 BLOB(二进制大对象)之间有什
java - 如何创建 blob/blob？
目前我有 hibernate JPA HSQLDB 来自动创建我的数据库表。如何告诉 JPA 或 Hibernate 将字符串保存为 clob/blob 字段？即一个很长的字符串。到目前为止我找不
python - 消除一维阵列中的 Blob / Blob
我有一个一维 NumPy 数组，其中包含一些“坏”值。我想剔除它们。每个坏值的邻居只是“顽皮”，但我也想剔除它们。对不良值的可靠测试是询问: arr<0.1 但是，(我能想到的)对于顽皮值的唯一可
Azure Blob 存储 REST API : Why "Get Blob Properties" and "Get Blob" requests are the same?
查看有关获取 Blob 和获取 Blob 属性的 MSDN 文档。两个请求看起来相同 "https://myaccount.blob.core.windows.net/mycontainer/mybl
azure-blob-storage - 无法通过 SAS 使用 azcopy 从一个 blob 到另一个 blob
我有 2 个 Blob 存储，一个在 eastus，一个在 canadaeast，我想将一个 .vhd 从 eastus 复制到 canadaeast。我去了 eastus，在我想要复制的 blob
azure - 拥有许多小型 Azure 存储 Blob 容器(每个容器都包含一些 Blob)更好，还是拥有一个包含大量 Blob 的大型容器更好？
所以场景如下: 我有多个 Web 服务实例，用于将 blob 数据写入 Azure 存储。我需要能够根据收到的时间将 blob 分组到容器(或虚拟目录)中。偶尔(最坏的情况是每天)旧的 blob 会被
angular - 仅列出 Azure Blob 存储中 100 个 Blob 中的 10 个 Blob
在 Azure Blobstorage 中，我有 100 个 Blob，但我只想列出前 10 个 Blob。我该怎么做？我写的{maxResults:1}没有任何效果，它仍然列出了我所有的 Blob
azure - 使用 Azure SDK v1.8 创建的 Blob 是页 Blob 还是 block Blob？
我们当前的代码使用 Azure SDK 1.8，为了生成共享访问签名，它将首先调用 CloudBlobContainer.GetBlobReference()，然后调用 CloudBlob.GetSh
blob - 隐藏 Azure Blob 网址
我有大量文件存储在公共(public) Azure blob 容器中，所有这些文件都通过我的 ASP.NET MVC Web 应用程序中的 HTML 直接引用。例如，blob 存储中一个图像的路径如下
JavaScript Azure Blob 存储移动 Blob
我有一个 NodeJS 后端，它使用 Microsoft 的官方 Blob 存储库 (@azure/storage-blob) 来管理我的 Blob 存储: https://www.npmjs.com

bug小助手

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

Storing a struct as a blob of data breaks with some optimization passes(将结构存储为BLOB数据会中断某些优化过程)