c - unsigned long long int problem with Sparc assembly code on sparc64

Reposted by: 太空宇宙 | Updated: 2023-11-04 03:17:45

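I have a problem with the C code below, which includes Sparc assembly. The code is compiled and run on Debian 9.0 Sparc64. It performs a simple summation and prints the result, which should be equal to nLoop.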

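The problem is that for an initial number of iterations greater than 1e+9, the final sum is systematically equal to 1410065408. I don't understand why, because I explicitly declared the sum variable as unsigned long long int, so sum can lie anywhere in the range [0, 18,446,744,073,709,551,615].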

For example, for nLoop = 1e+9, I expect sum to be equal to 1e+9.

Does the problem come from the included Sparc assembly code, which may be unable to handle 64-bit variables (as input or output)?

#include <stdio.h>
#include <stdlib.h>

int main (int argc, char *argv[])
{
  int i;
  // Init sum
  unsigned long long int sum = 0ULL;
  // Number of iterations
  unsigned long long int nLoop = 10000000000ULL;

   // Loop with Sparc assembly into C source
   asm volatile ("clr %%g1\n\t"
                 "clr %%g2\n\t"
                 "mov %1, %%g1\n" // %1 = input parameter
                 "loop:\n\t"
                 "add %%g2, 1, %%g2\n\t"
                 "subcc %%g1, 1, %%g1\n\t"
                 "bne loop\n\t"
                 "nop\n\t"
                 "mov %%g2, %0\n" // %0 = output parameter
                 : "=r" (sum)     // output
                 : "r" (nLoop)    // input
                 : "g1", "g2");   // clobbers

  // Print results
  printf("Sum = %llu\n", sum);

  return 0;

}

How can I fix this range problem and use 64-bit variables in the Sparc assembly code?

PS: I tried compiling with gcc -m64, but the problem remains.

Update 1

At @zwol's request, here is the Sparc assembly output generated with: gcc -O2 -m64 -S loop.c -o loop.s

        .file   "loop.c"
        .section        ".text"
        .section        .rodata.str1.8,"aMS",@progbits,1
        .align 8
.LC0:
        .asciz  "Sum = %llu\n"
        .section        .text.startup,"ax",@progbits
        .align 4
        .global main
        .type   main, #function
        .proc   04
main:
        .register       %g2, #scratch
        save    %sp, -176, %sp
        sethi   %hi(_GLOBAL_OFFSET_TABLE_-4), %l7
        call    __sparc_get_pc_thunk.l7
         add    %l7, %lo(_GLOBAL_OFFSET_TABLE_+4), %l7
        sethi   %hi(9764864), %o1
        or      %o1, 761, %o1
        sllx    %o1, 10, %o1
#APP
! 13 "loop.c" 1
        clr %g1
        clr %g2
        mov %o1, %g1
loop:
        add %g2, 1, %g2
        subcc %g1, 1, %g1
        bne loop
        nop
        mov %g2, %o1

! 0 "" 2
#NO_APP
        mov     0, %i0
        sethi   %gdop_hix22(.LC0), %o0
        xor     %o0, %gdop_lox10(.LC0), %o0
        call    printf, 0
         ldx    [%l7 + %o0], %o0, %gdop(.LC0)
        return  %i7+8
         nop
        .size   main, .-main
        .ident  "GCC: (Debian 7.3.0-15) 7.3.0"
        .section        .text.__sparc_get_pc_thunk.l7,"axG",@progbits,__sparc_get_pc_thunk.l7,comdat
        .align 4
        .weak   __sparc_get_pc_thunk.l7
        .hidden __sparc_get_pc_thunk.l7
        .type   __sparc_get_pc_thunk.l7, #function
        .proc   020
__sparc_get_pc_thunk.l7:
        jmp     %o7+8
         add    %o7, %l7, %l7
        .section        .note.GNU-stack,"",@progbits

Update 2:

Following @Martin Rosenau's suggestion, I made the following change:

loop:
        add %g2, 1, %g2
        subcc %g1, 1, %g1
        bpne %icc, loop
        bpne %xcc, loop
        nop
        mov %g2, %o1

But when compiling, I get:

Error: Unknown opcode: `bpne'

What could be the cause of this compilation error?

Best answer

subcc %%g1, 1, %%g1
bne loop

Your problem is the bne instruction:

Unlike x86-64 CPUs, Sparc64 CPUs do not have separate 32-bit and 64-bit subtraction instructions:

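If you subtract 1 from 0x12345678, the result is 0x12345677. If you subtract 1 from 0xF00D12345678, the result is 0xF00D12345677. So if you only use the lower 32 bits of a register, a 64-bit subtraction has the same effect as a 32-bit subtraction.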

Sparc64 CPUs therefore do not have separate 64-bit and 32-bit instructions for addition, subtraction, multiplication, left shift and so on.

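Only where the upper 32 bits influence the lower 32 bits (for example, in a right shift) do these CPUs have separate instructions for 32-bit and 64-bit operations.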
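The point about the lower 32 bits can be checked in plain C. The following is only an illustrative sketch: the helper names (low32_of_sub64, sub32, low32_of_shr64, shr32) are hypothetical and do not correspond to any Sparc instruction. It shows that subtraction gives the same low 32 bits at either width, while a right shift does not.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helpers for illustration only. */

/* Lower 32 bits of a full 64-bit subtraction. */
static uint32_t low32_of_sub64(uint64_t a, uint64_t b)
{
    return (uint32_t)(a - b);
}

/* Subtraction performed on the lower 32 bits only. */
static uint32_t sub32(uint64_t a, uint64_t b)
{
    return (uint32_t)((uint32_t)a - (uint32_t)b);
}

/* Lower 32 bits of a 64-bit logical right shift. */
static uint32_t low32_of_shr64(uint64_t a, unsigned shift)
{
    return (uint32_t)(a >> shift);
}

/* Logical right shift performed on the lower 32 bits only. */
static uint32_t shr32(uint64_t a, unsigned shift)
{
    return (uint32_t)a >> shift;
}

/* Checks the values used in the answer's example. */
static void width_self_check(void)
{
    /* Subtraction: both widths agree on the lower 32 bits. */
    assert(low32_of_sub64(0xF00D12345678ULL, 1) == sub32(0xF00D12345678ULL, 1));
    /* Right shift: upper bits move into the lower half, so the results differ. */
    assert(low32_of_shr64(0xF00D12345678ULL, 4) != shr32(0xF00D12345678ULL, 4));
}
```

Calling width_self_check() from a test program exercises both cases; the right-shift case is the kind of operation for which a separate 64-bit instruction is needed.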

However, the zero flag depends on the result of the subcc operation.

To solve this problem, Sparc64 CPUs keep every integer flag (zero, overflow, carry, sign) twice:

The 32-bit zero flag is set if the lower 32 bits of the register are zero; the 64-bit zero flag is set if all 64 bits of the register are zero.

For compatibility with existing 32-bit programs, the bne instruction checks the 32-bit zero flag, not the 64-bit zero flag.
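As a rough model of the two zero flags described above, the sketch below computes both from a 64-bit subtraction result. The struct and function names (zero_flags, subcc_zero_flags) are hypothetical and only mirror the description in this answer; they are not part of any Sparc toolchain.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical model of the two zero flags after a subcc-style subtraction. */
struct zero_flags {
    bool icc_z; /* set when the lower 32 bits of the result are zero */
    bool xcc_z; /* set when all 64 bits of the result are zero */
};

static struct zero_flags subcc_zero_flags(uint64_t a, uint64_t b)
{
    uint64_t result = a - b;
    struct zero_flags flags;
    flags.icc_z = ((uint32_t)result == 0);
    flags.xcc_z = (result == 0);
    return flags;
}

/* The case that breaks the loop in the question: 0x200000001 - 1. */
static void flags_self_check(void)
{
    struct zero_flags f = subcc_zero_flags(0x200000001ULL, 1);
    assert(f.icc_z);  /* a branch on the 32-bit flag sees "zero" and stops */
    assert(!f.xcc_z); /* a branch on the 64-bit flag would keep looping */
}
```

In this model, a branch that tests icc_z stops as soon as the counter reaches a multiple of 2^32, which matches the behaviour reported in the question.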

is systematically equal to 1410065408

1e10 = 0x200000000 + 1410065408, so after 1410065408 steps the counter reaches the value 0x200000000, whose lower 32 bits are 0, and bne no longer jumps.

For 1e11, however, you should not get 1410065408 but 1215752192, because 1e11 = 0x1700000000 + 1215752192.
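The arithmetic above can be reproduced with a few lines of C. This is a sketch under the assumption that the loop decrements once per pass and exits when the lower 32 bits of the counter become zero; the function name iterations_until_icc_zero is made up for this illustration.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical helper: number of passes a decrementing loop makes before
 * the lower 32 bits of the counter first become zero.
 * If the lower 32 bits start at zero, the first decrement wraps them to
 * 0xFFFFFFFF, so a full 2^32 passes are needed.
 */
static uint64_t iterations_until_icc_zero(uint64_t n)
{
    uint64_t low = n & 0xFFFFFFFFULL;
    return low != 0 ? low : (1ULL << 32);
}

/* Reproduces the decompositions given in the answer. */
static void iterations_self_check(void)
{
    assert(10000000000ULL == 0x200000000ULL + 1410065408ULL);
    assert(100000000000ULL == 0x1700000000ULL + 1215752192ULL);
    assert(iterations_until_icc_zero(10000000000ULL) == 1410065408ULL);
    assert(iterations_until_icc_zero(100000000000ULL) == 1215752192ULL);
}
```

For any starting value below 2^32, such as 1e9, the helper returns the value itself, which is consistent with the loop behaving correctly for small counts.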

bne

There is a new instruction named bpne that takes up to 4 arguments!

In its simplest variant (with only two arguments), the instruction should work like this (I have not used Sparc for 5 years, so I am not sure):

bpne %icc, loop   # Like bne (based on the 32-bit result)
bpne %xcc, loop   # Like bne, but based on the 64-bit result

Edit

Error: Unknown opcode: 'bpne'

I just tried it with the GNU assembler:

The GNU assembler names the new instruction bne, just like the old one:

bne loop         # Old variant
bne %icc, loop   # New variant based on the 32-bit result
bne %xcc, loop   # (New variant) Based on the 64-bit result

  subcc %g1, 1, %g1
  bpne %icc, loop
  bpne %xcc, loop
  nop

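The first bpne (or bne) is pointless: whenever the first line would jump, the second line would jump as well. And if you do not use .reorder (although it is the default), you also need to add a nop between the two branch instructions...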

The code should look like this (assuming your assembler also calls bpne "bne"):

   subcc %g1, 1, %g1
   bne %xcc, loop
   nop
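Putting the fix back into the question's inline assembly might look like the sketch below. It has not been run on Sparc hardware here, and several details are assumptions rather than part of the original answer: the function name count_loop, the use of a numeric local label (1: / 1b) instead of loop, the added "cc" clobber, the early return for n == 0, and the __sparc__ / __arch64__ macro check that selects the assembly path. On other targets a plain C loop of the same shape is compiled instead.

```c
#include <assert.h>

/* Hypothetical wrapper around the loop from the question, using bne %xcc. */
static unsigned long long count_loop(unsigned long long n)
{
    unsigned long long sum = 0ULL;

    /* The loop decrements before testing, so n == 0 would wrap around. */
    if (n == 0)
        return 0;

#if defined(__sparc__) && defined(__arch64__)
    /* Assumed to be the 64-bit Sparc build (gcc -m64). */
    asm volatile ("clr %%g1\n\t"
                  "clr %%g2\n\t"
                  "mov %1, %%g1\n"        /* %1 = input parameter */
                  "1:\n\t"
                  "add %%g2, 1, %%g2\n\t"
                  "subcc %%g1, 1, %%g1\n\t"
                  "bne %%xcc, 1b\n\t"     /* branch on the 64-bit zero flag */
                  "nop\n\t"               /* delay slot */
                  "mov %%g2, %0\n"        /* %0 = output parameter */
                  : "=r" (sum)            /* output */
                  : "r" (n)               /* input */
                  : "g1", "g2", "cc");    /* clobbers */
#else
    /* Portable fallback so the sketch also builds on non-Sparc targets. */
    unsigned long long counter = n;
    do {
        sum += 1;
        counter -= 1;
    } while (counter != 0);
#endif

    return sum;
}

/* Small sanity check with a count that finishes quickly. */
static void count_loop_self_check(void)
{
    assert(count_loop(5ULL) == 5ULL);
}
```

Called from main as count_loop(10000000000ULL) and printed with %llu, this should report the full count on a 64-bit build if the assumptions above hold, since the branch no longer stops when only the lower 32 bits reach zero.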

A similar question about "c - unsigned long long int problem with Sparc assembly code on sparc64" can be found on Stack Overflow: https://stackoverflow.com/questions/49801769/
