Swift SIMD 或 Accelerate Sum UInt32-6ren

Swift SIMD 或 Accelerate Sum UInt32

转载作者：搜寻专家更新时间：2023-11-01 06:17:29

24

4

是否有内置工具来加速或在其他地方使用加速矢量运算对 UInt32 数组求和？

最佳答案

我想你想加速这样的功能

func scalarsum (_ test_array: [UInt32]) -> UInt32 {
   var result : UInt32 = 0
   for x in test_array {
     result = result &+ x
   }
   return result
}

所以也许你可以写一些像这样复杂的东西......

func simdsum (_ test_array: [UInt32]) -> UInt32 {
   var tmpvector=uint4(0)
   // assume test_array.count is divisible by four
   let limit = test_array.count/4
   for i in 0..<limit {
     let thisvector = uint4(test_array[4*i],test_array[4*i+1],test_array[4*i+2],test_array[4*i+3])
     tmpvector = tmpvector &+ thisvector
   }
   return tmpvector[0] + tmpvector[1] + tmpvector[2] + tmpvector[3]
}

但是，让我们看看 swift 汇编为第一个函数生成了什么...

simdsum[0x100001070] <+448>: movdqu 0x20(%rcx,%rdi,4), %xmm2 simdsum[0x100001076] <+454>: movdqu 0x30(%rcx,%rdi,4), %xmm3 (...) simdsum[0x10000107c] <+460>: paddd %xmm2, %xmm0 simdsum[0x100001080] <+464>: paddd %xmm3, %xmm1

啊!啊! Swift 足够聪明，可以将总和向量化。

所以简短的回答是，如果您尝试在 Swift 中使用 SIMD 指令手动设计求和函数，您可能是在浪费时间……编译器会自动为您完成工作。

关于Swift SIMD 或 Accelerate Sum UInt32，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/41257678/

24

4

0

文章推荐： arrays - 确定字符串来自哪个数组

文章推荐： ios - App被拒但被告知要回复继续审核

文章推荐： ios - 使 UICollectionViewFlowLayout 不在同一行上居中单元格

将 uint** 转换为 uint
我有以下代码 unsigned int headerbytes = 0U; headerbytes = (unsigned int*)strtoull(packet_space->header
c# - 无法将 uint* 转换为 uint[]
我有这段无法编译的代码: public struct MyStruct { private fixed uint myUints[32]; public uint[] MyUints
performance - 返回指向 uint 或 uint 的指针哪个更有效？
在 Go 中，从函数返回哪个更有效:返回 uint 还是返回 *uint？该函数在 cpu 密集型库的内部 for 循环中调用。最佳答案一般来说，只要效率是个问题，您就应该运行基准测试。让我们
c++ - 为什么 int 加上 uint 返回 uint？
int 加上 unsigned int 返回一个 unsigned int。应该这样吗？考虑这段代码: #include #include #include class test {
ios - 无法将类型 'UInt' 的值转换为预期的参数类型 'UnsafeMutablePointer'
我正在尝试从可通过 URL 访问的内容中初始化一个字符串: actualresponse.response = String(contentsOfURL: url, usedEncoding: NSU
c - uint vs. unsigned int - 为什么不用 typedef uint？
关闭。这个问题是opinion-based .它目前不接受答案。想改进这个问题？更新问题，以便 editing this post 提供事实和引用来回答它. 1年前关闭。 Improve this
swift - 在 Swift 中将 UnsafeMutablePointer 转换为 UInt
我从函数 Swift 得到类型为 UnsafeMutablePointer 的结果我可以把它转换到UInt吗？？最佳答案只需使用memory 属性来访问底层数据。 let ptr: Unsaf
c# - (uint) index >= (uint)_size 比 index >= _size 更好吗？
我深入了解了 List并发现了以下代码: public T this[int index] { get { // Following trick can red
c - 如何将四个 16 位 uint 编码为 64 位 uint，然后再次解码它们？
我在 this page on bit twiddling 的帮助下编写了这个函数: uint16_t *decode(uint64_t instr) { // decode instr (thi
将两个 8 位 uint 转换为 1 个 12 位 uint
我正在从微 Controller 读取两个寄存器。一个具有 4 位 MSB(前 4 位有一些其他内容)，另一个具有 8 位 LSB。我想将其转换为一个 12 位 uint(准确地说是 16 位)。到目
c# - 常数值 '-1' 无法转换为 'uint' ，当尝试 `uint mask = ~0;`
要演示的示例代码: public int FindComplement(int num) { //uint mask = ~0; //<-- error CS0031 //
types - 不匹配的类型 : `expected fn@(&&@type) -> uint` but found `extern fn(@map_a) -> uint` (expected argument mode++ but found &&)
$ rustc --test mapAsMapKey.rs mapAsMapKey.rs:18:43: 18:52 error: mismatched types: expected `fn@(&&@
哈希函数从整数坐标对中提供唯一的 uint
一般问题:我有一个很大的二维点空间，里面稀疏地分布着点。把它想象成一 block 撒满黑点的白色大 Canvas 。我必须多次迭代和搜索这些点。 Canvas (点空间)可能很大，接近极限int 的值
ethereum - uint 启动什么？
假设我们只是调用一个普通数字，数字会启动什么。 uint256 plainNumber 我明白它是零。但是我要问的是，有没有办法检测该数字是由编译器还是用户变量设置的。例如... uint256 pl
c# - uint 的二进制表示是什么样的？
我试图在 leetcode.com ( https://leetcode.com/problems/number-of-1-bits/ ) 上解决一个简单的问题，我遇到了一个奇怪的行为，这可能是我缺乏
c# - uint 中的按位移位
uint number = 0x418 in bits : 0000010000011000 uint number1 = 0x8041 in bits: 1000000001000001 uint
c# - 如何生成具有最大值的随机 uint？
我如何在 C# 中生成具有某个最大值的伪随机 uint？ (不需要最低限度。)似乎有很多问题要求完全随机，但没有上限。澄清:此上限可能大于 int.MaxValue，因此仅强制转换 Random.N
c# - 当没有这样的运算符时显式转换为 uint
我已经用私有(private)数据成员围绕 ulong 编写了一个简单的包装器。我希望能够将包装器转换为 ulong 以检索数据。我希望强制转换为 uint 并丢失数据是非法的，因此我没有编写对 ui
C++ - 哪些是可变的 "Uint "？
哪些是“Uint”变量？就是有“Uint8”、“Uint16”等…… 但是它们是什么？现在我有一些时间使用 C++，但我从来不需要使用这些变量并引起我的好奇。提前致谢。最佳答案 uint 不是标
c# - 如何使用编码从指针中读取 uint？
我有一个 native 方法，它需要一个指针来写出一个双字(uint)。现在我需要从 (Int) 指针中获取实际的 uint 值，但是 Marshal 类只有方便的方法来读取(有符号)整数。如何从

首页

博学

6Ren·AI

商城

Swift SIMD 或 Accelerate Sum UInt32