Smallest 32-bit Bit Reversal in C using inline __asm()

Reposted by: bug小助手 · Updated: 2023-10-25 09:16:42

Trying to create the smallest 32-bit Bit Reversal in C using inline __asm().

So far, I've managed to get the __asm() code size to 23 bytes.

I'm curious if there are ways to further decrease the code size by using:

- compiler intrinsics
- specialized assembly instructions
- vanilla C code

Example

godbolt

#include <stdio.h>

unsigned int CodeSize;


// Print the significant bits of number, most significant bit first.
void print_binary(unsigned int number)
{
    if (number >> 1)
        print_binary(number >> 1);
    putc((number & 1) ? '1' : '0', stdout);
}


void reverseBits(unsigned int OriginalValue)
{
    unsigned int ReversedValue = 0;

    start_asm:
    __asm__ (
        "movl  %1, %%eax\n"       // load the value
        "xorl  %%ebx, %%ebx\n"    // clear EBX (optimized)
        "bsrl  %%eax, %%ecx\n"    // find the highest set bit in EAX (result is undefined if EAX is 0)
        "incl  %%ecx\n"           // increment to get the correct number of iterations
    "reverse:\n\t"                // local label
        "shrl  $1, %%eax\n"       // shift the LSB of EAX to the carry flag CF
        "rcll  $1, %%ebx\n"       // the MSB of EBX goes into CF and CF's previous value goes into the LSB of EBX
        "loop  reverse\n"         // loop back to the local label
        "movl  %%ebx, %0"         // move the result to the output
        : "=r" (ReversedValue)    // output
        : "r" (OriginalValue)     // input
        : "eax", "ebx", "ecx"     // clobbered registers
    );
    end_asm:

    CodeSize = (char *)&&end_asm - (char *)&&start_asm;

    printf("\nOriginal: 0x%X ", OriginalValue); print_binary(OriginalValue);
    printf("\nReversed: 0x%X ", ReversedValue); print_binary(ReversedValue);
    printf("\n");
}


int main()
{
    reverseBits(0xfeedface);
    reverseBits(0xfeedfa);    // this call produces the second case shown in the output
    reverseBits(0xfeed);
    reverseBits(0xfe);

    printf("\nCodeSize: %u bytes\n", CodeSize); 
    return 0;
}

Output

Original: 0xFEEDFACE 11111110111011011111101011001110
Reversed: 0x735FB77F 1110011010111111011011101111111

Original: 0xFEEDFA 111111101110110111111010
Reversed: 0x5FB77F 10111111011011101111111

Original: 0xFEED 1111111011101101
Reversed: 0xB77F 1011011101111111

Original: 0xFE 11111110
Reversed: 0x7F 1111111

CodeSize: 23 bytes
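
For reference, the behaviour shown in the output above (only the bits up to the highest set bit are reversed, so leading zeros are dropped rather than moved to the low end) can be sketched in plain C. The function name `reverse_significant_bits` is made up for this illustration, and this is only a model of what the asm loop computes, not a claim about the code size a compiler would produce for it.

```c
#include <stdint.h>

/* Reverse only the significant bits of x (bit 0 up to the highest set bit).
   Returns 0 for x == 0. The name is hypothetical, chosen for this sketch. */
uint32_t reverse_significant_bits(uint32_t x)
{
    uint32_t result = 0;
    while (x != 0) {
        result = (result << 1) | (x & 1u);  /* like rcl: append the LSB of x */
        x >>= 1;                            /* like shr: drop that bit */
    }
    return result;
}
```

A function like this is handy as an oracle when checking the hand-written asm variants against the values listed in the output.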

Update

From the helpful comments below, the code size is now 13 bytes

#include <stdint.h>

uint32_t reverseBits(uint32_t OriginalValue) {
    uint32_t ReversedValue = 0;

    __asm__ (
        "start_asm:\n"             // start label
        "xorl  %%ebx, %%ebx\n"     // clear EBX
        "bsrl  %1, %%ecx\n"        // find highest order bit set to 1 in EAX (which is %1)
    "reverse:\n"                   // local label for looping
        "shrl  $1, %1\n"           // shift the LSB of EAX (which is %1) to the carry flag CF
        "rcll  $1, %%ebx\n"        // the MSB of EBX goes into CF and CF's previous value goes into the LSB of EBX
        "decl  %%ecx\n"            // manually decrement ECX
        "jns   reverse\n"          // loop while ECX is still non-negative (jns jumps when the sign flag is clear)
        "end_asm:\n"               // end label
        : "=b" (ReversedValue),    // output directly from EBX
          "+a" (OriginalValue)     // read-write in EAX: the loop shifts it down to zero
        :                          // no input-only operands
        : "ecx", "cc"              // clobbered register and flags
    );

    return ReversedValue;
}

Comments

Since you specified input and output in registers it makes little sense to hardcode eax and ebx. Just use those registers instead. Also, if you are optimizing for size, you can loop a fixed 32 times.
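
As a rough illustration of the "loop a fixed 32 times" suggestion, here is a plain C sketch (the name `reverse32_fixed` is an assumption of this example). Note that it changes the semantics compared with the question's code: all 32 bits are mirrored, so leading zeros in the input become trailing zeros in the result.

```c
#include <stdint.h>

/* Full 32-bit reversal with a fixed iteration count.
   The function name is hypothetical, used only for this sketch. */
uint32_t reverse32_fixed(uint32_t x)
{
    uint32_t result = 0;
    for (int i = 0; i < 32; i++) {
        result = (result << 1) | (x & 1u);  /* move bit i of x toward bit 31 - i */
        x >>= 1;
    }
    return result;
}
```

With this version 0xFE maps to 0x7F000000 rather than 0x7F, which may or may not be what is wanted.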

This may be better suited for codegolf.stackexchange.com. Incidentally, such a challenge already exists there. You can do it in 12 bytes.

Though I see that you wish to only reverse as many bits as the number is large. Try this (input in ecx, output in eax): xor %eax, %eax; 0: rcl %eax; shr %ecx; jbe 0b. 8 bytes in total.

Added x86-64 since that appears to be the only architecture you are interested in. Please always use an architecture tag for assembly questions. The answer could be very different for other architectures, e.g. ARMv8 simply has an rbit instruction.
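
On the "compiler intrinsics" angle: Clang offers `__builtin_bitreverse32`, which can map to a single `rbit` on targets that have one. Other compilers may not provide it, so the sketch below checks with `__has_builtin` and otherwise falls back to the classic mask-and-swap sequence. The wrapper name `reverse32` is an assumption of this example, and like any full 32-bit reversal it does not drop leading zeros the way the question's code does.

```c
#include <stdint.h>

#ifndef __has_builtin
#define __has_builtin(x) 0   /* compilers without __has_builtin take the fallback */
#endif

/* Full 32-bit reversal; the name is hypothetical, used only for this sketch. */
uint32_t reverse32(uint32_t x)
{
#if __has_builtin(__builtin_bitreverse32)
    return __builtin_bitreverse32(x);        /* Clang builtin, where available */
#else
    /* Portable fallback: swap adjacent bits, then pairs, nibbles, bytes, halves. */
    x = ((x >> 1) & 0x55555555u) | ((x & 0x55555555u) << 1);
    x = ((x >> 2) & 0x33333333u) | ((x & 0x33333333u) << 2);
    x = ((x >> 4) & 0x0F0F0F0Fu) | ((x & 0x0F0F0F0Fu) << 4);
    x = ((x >> 8) & 0x00FF00FFu) | ((x & 0x00FF00FFu) << 8);
    return (x >> 16) | (x << 16);
#endif
}
```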

@vengy: If you want to minimize machine-code size using inline asm, don't make it a naked function; let the compiler inline it so you don't need a ret. A naked function is defeating the purpose of inline asm(); it's basically equivalent to writing a separate .s.
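
A minimal sketch of that advice, assuming GCC or Clang on x86: a `static inline` wrapper whose asm uses a numeric local label (`1:` / `1b`), so the compiler can inline it at several call sites without duplicate-symbol errors, and `"+r"` constraints so the compiler picks the registers. The name `reverse_bits_inline` and the plain C fallback for other targets are assumptions of this example. If the compiler happens to pick r8d..r15d, each instruction needs a REX prefix and grows by a byte.

```c
#include <stdint.h>

/* Reverses the significant bits of value; hypothetical name for this sketch. */
static inline uint32_t reverse_bits_inline(uint32_t value)
{
    uint32_t result = 0;
#if defined(__GNUC__) && (defined(__i386__) || defined(__x86_64__))
    __asm__ (
        "1:\n\t"                 /* numeric local label: safe to inline repeatedly */
        "shrl  $1, %1\n\t"       /* LSB of value -> CF, ZF set when value hits 0 */
        "rcll  $1, %0\n\t"       /* CF -> LSB of result; rcl leaves ZF untouched */
        "jnz   1b"               /* loop until value is zero */
        : "+r" (result), "+r" (value)   /* both operands are read and written */
        :
        : "cc"                          /* flags are modified */
    );
#else
    /* Fallback for other targets (an assumption of this sketch). */
    while (value != 0) {
        result = (result << 1) | (value & 1u);
        value >>= 1;
    }
#endif
    return result;
}
```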

Recommended answer

10 bytes:

__asm__ (
    "start_asm:\n"
    "xorl  %%ebx, %%ebx\n"    //Clear destination and CF
 "repeat:\n\t"
    "rcll  $1, %%ebx\n"       // Shift CF into destination LSB
    "shrl  $1, %%eax\n"       // Shift LSB from source into CF
    "jnz  repeat\n"           // If source is not zero - repeat
                              // else (source is now zero; CF holds the last bit shifted out, which is 1 for any nonzero input)
    "rcll  $1, %%ebx\n"       // Shift the last 1 into destination
    "end_asm:\n"
    : "=b" (ReversedValue),   // output
      "+a" (OriginalValue)    // read-write: EAX is shifted down to zero by the loop
    :                         // no input-only operands
    : "cc"                    // flags are modified
);

6 bytes:

#include <stdint.h>

uint32_t reverseBits(uint32_t OriginalValue) {
    uint32_t ReversedValue = 0;

    __asm__ (
    "repeat:\n\t"
        "shrl  $1, %%eax\n"        // Shift LSB from source into CF
        "rcll  $1, %%ebx\n"        // Shift CF into destination LSB
        "jnz   repeat\n"           // If source EAX is not zero - repeat
        : "+b" (ReversedValue),    // read-write: starts as 0 in EBX, ends as the result
          "+a" (OriginalValue)     // read-write: EAX is shifted down to zero by the loop
        :                          // no input-only operands
        : "cc"                     // flags are modified
    );    

    return ReversedValue;
}

Comments

Nice! So the last instruction that affects ZF is shrl $1, %%eax. The jnz checks if EAX has become zero after the shift operation. If it's zero, all the bits from the original number have been processed, and we can move on. If it's non-zero, there are still bits left to reverse, so we keep looping. The final instruction rcll $1, %%ebx processes the last bit still in the carry flag (CF) to ensure that the final bit from EAX is shifted into EBX. Very clever! :)

I generally prefer adc reg, reg to rcl reg, 1, but it does not matter for code size here (+1)

Oh, RCL does not affect ZF. Didn't know that.

I was surprised too that RCL does not affect the SF, ZF, AF, and PF flags. Credit really goes to your ideas. Thanks!
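
The flag behaviour discussed in these comments matters if `rcl reg, 1` is swapped for `adc reg, reg`. Both encode in two bytes, but `adc` rewrites ZF, so it cannot sit between `shr` and `jnz` as in the 6-byte loop. It does fit the 10-byte ordering, where the shift-in comes before `shr`. Below is a hedged sketch of that variant, assuming GCC or Clang on x86; the name `reverse_bits_adc` and the C fallback are assumptions of this example.

```c
#include <stdint.h>

/* adc-based variant of the 10-byte loop; hypothetical name for this sketch. */
uint32_t reverse_bits_adc(uint32_t value)
{
    uint32_t result;
#if defined(__GNUC__) && (defined(__i386__) || defined(__x86_64__))
    __asm__ (
        "xorl  %0, %0\n\t"       /* result = 0 and CF = 0 */
        "1:\n\t"
        "adcl  %0, %0\n\t"       /* result = 2*result + CF (also rewrites ZF) */
        "shrl  $1, %1\n\t"       /* LSB of value -> CF; ZF reflects the new value */
        "jnz   1b\n\t"           /* ZF here comes from shr, so the test is valid */
        "adcl  %0, %0"           /* shift in the last bit still held in CF */
        : "=&r" (result), "+r" (value)
        :
        : "cc"
    );
#else
    /* Fallback for other targets (an assumption of this sketch). */
    result = 0;
    while (value != 0) {
        result = (result << 1) | (value & 1u);
        value >>= 1;
    }
#endif
    return result;
}
```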
