performance - 在我的速度测试中，Lua 表哈希索引比数组索引更快。为什么？-6ren

performance - 在我的速度测试中，Lua 表哈希索引比数组索引更快。为什么？

转载作者：行者123 更新时间：2023-12-02 00:56:23

我正在做一些测试，看看我可以在哪里提高我的 lua 代码的性能。

我正在阅读这份文件:https://www.lua.org/gems/sample.pdf
我认为使用整数作为表索引应该快得多，因为它使用表的数组部分并且不需要散列。

所以我写了这个测试程序:

    print('local x=0 local y=0 local z=0')
    local x=0 local y=0 local z=0
    t0 = os.clock()
    for i=1,1e7 do
        x = 1
        y = 2
        z = 3
    end
    print(os.clock()-t0 .. "\n")


    print("tab = {1,2,3}")
    tab = {1,2,3}
    t0 = os.clock()
    for i=1,1e7 do
        tab[1] = 1
        tab[2] = 2
        tab[3] = 3
    end
    print(os.clock()-t0 .. "\n")


    print("tab = {[1]=1,[2]=2,[3]=3}")
    tab = {[1]=1,[2]=2,[3]=3}
    t0 = os.clock()
    for i=1,1e7 do
        tab[1] = 1
        tab[2] = 2
        tab[3] = 3
    end
    print(os.clock()-t0 .. "\n")


    print("tab = {a=1,b=2,c=3}")
    tab = {a=1,b=2,c=3}
    t0 = os.clock()
    for i=1,1e7 do
        tab.a = 1
        tab.b = 2
        tab.c = 3
    end
    print(os.clock()-t0 .. "\n")


    print('tab = {["bli"]=1,["bla"]=2,["blu"]=3}')
    tab = {["bli"]=1,["bla"]=2,["blu"]=3}
    t0 = os.clock()
    for i=1,1e7 do
        tab["bli"] = 1
        tab["bla"] = 2
        tab["blu"] = 3
    end
    print(os.clock()-t0 .. "\n")


    print("tab = {verylongfieldname=1,anotherevenlongerfieldname=2,superincrediblylongfieldname=3}")
    tab = {verylongfieldname=1,anotherevenlongerfieldname=2,superincrediblylongfieldname=3}
    t0 = os.clock()
    for i=1,1e7 do
        tab.verylongfieldname = 1
        tab.anotherevenlongerfieldname = 2
        tab.superincrediblylongfieldname = 3
    end
    print(os.clock()-t0 .. "\n")


    print('local f = function(p1, p2, p3)')
    local f = function(p1, p2, p3)
        x = p1
        y = p2
        z = p3
        return x,y,z
    end

    local a=0
    local b=0
    local c=0
    t0 = os.clock()
    for i=1,1e7 do
        a,b,c = f(1,2,3)
    end
    print(os.clock()-t0 .. "\n")


    print('local g = function(params)')
    local g = function(params)
        x = params.p1
        y = params.p2
        z = params.p3
        return {x,y,z}
    end

    t0 = os.clock()
    for i=1,1e7 do
        t = g{p1=1, p2=2, p3=3}
    end
    print(os.clock()-t0 .. "\n")

我已经按照我预期会增加的时间消耗来订购块。 (我不确定函数调用，那只是一个测试。)但这里是令人惊讶的结果:

    local x=0 local y=0 local z=0
    0.093613

    tab = {1,2,3}
    0.678514

    tab = {[1]=1,[2]=2,[3]=3}
    0.83678

    tab = {a=1,b=2,c=3}
    0.62888

    tab = {["bli"]=1,["bla"]=2,["blu"]=3}
    0.733916

    tab = {verylongfieldname=1,anotherevenlongerfieldname=2,superincrediblylongfieldname=3}
    0.536726

    local f = function(p1, p2, p3)
    0.475592

    local g = function(params)
    3.576475

甚至应该导致最长散列过程的长字段名称也比使用整数访问数组更快。难道我做错了什么？

最佳答案

您的文档第6页(实际第20页)linked解释你所看到的。

If you write something like {[1] = true, [2] = true, [3] = true}, however, Lua is not smart enough to detect that the given expressions (literal numbers, in this case) describe array indices, so it creates a table with four slots in its hash part, wasting memory and CPU time.

您只有在 时才能获得阵列部分的主要好处。分配 不使用键的表。

table = {1,2,3}

如果您正在读取/写入已存在的表或数组，您将不会看到处理时间的大偏差。

文档中的例子包括for循环中表的创建

for i = 1, 1000000 do
    local a = {true, true, true}
    a[1] = 1; a[2] = 2; a[3] = 3
end

循环内所有局部变量的结果。编辑:如 siffiejoe 所指出的，将长字符串加长到 40 个字节

local x=0 local y=0 local z=0
0.18

tab = {1,2,3}
3.089

tab = {[1]=1,[2]=2,[3]=3}
4.59

tab = {a=1,b=2,c=3}
3.79

tab = {["bli"]=1,["bla"]=2,["blu"]=3}
3.967

tab = {verylongfieldnameverylongfieldnameverylongfieldname=1,anotherevenlongerfieldnameanotherevenlongerfieldname=2,superincrediblylongfieldnamesuperincrediblylongfieldname=3}
4.013

local f = function(p1, p2, p3)
1.238

local g = function(params)
6.325

此外，lua 对不同的键类型以不同的方式执行散列。

源代码可以在这里查看 5.2.4 ltable.c ，这包含我将要讨论的代码。
mainposition函数处理关于要执行哪个散列的决策

/*
** returns the `main' position of an element in a table (that is, the index
** of its hash value)
*/
static Node *mainposition (const Table *t, const TValue *key) {
  switch (ttype(key)) {
    case LUA_TNUMBER:
      return hashnum(t, nvalue(key));
    case LUA_TLNGSTR: {
      TString *s = rawtsvalue(key);
      if (s->tsv.extra == 0) {  /* no hash? */
        s->tsv.hash = luaS_hash(getstr(s), s->tsv.len, s->tsv.hash);
        s->tsv.extra = 1;  /* now it has its hash */
      }
      return hashstr(t, rawtsvalue(key));
    }
    case LUA_TSHRSTR:
      return hashstr(t, rawtsvalue(key));
    case LUA_TBOOLEAN:
      return hashboolean(t, bvalue(key));
    case LUA_TLIGHTUSERDATA:
      return hashpointer(t, pvalue(key));
    case LUA_TLCF:
      return hashpointer(t, fvalue(key));
    default:
      return hashpointer(t, gcvalue(key));
  }
}

当键是 Lua_Number 时，我们调用 hashnum

/*
** hash for lua_Numbers
*/
static Node *hashnum (const Table *t, lua_Number n) {
  int i;
  luai_hashnum(i, n);
  if (i < 0) {
    if (cast(unsigned int, i) == 0u - i)  /* use unsigned to avoid overflows */
      i = 0;  /* handle INT_MIN */
    i = -i;  /* must be a positive value */
  }
  return hashmod(t, i);
}

以下是其他类型的其他哈希实现:

#define hashpow2(t,n)           (gnode(t, lmod((n), sizenode(t))))

#define hashstr(t,str)          hashpow2(t, (str)->tsv.hash)
#define hashboolean(t,p)        hashpow2(t, p)


/*
** for some types, it is better to avoid modulus by power of 2, as
** they tend to have many 2 factors.
*/
#define hashmod(t,n)    (gnode(t, ((n) % ((sizenode(t)-1)|1))))


#define hashpointer(t,p)        hashmod(t, IntPoint(p))

这些散列解析为 2 个路径 hashpow2 和 hashmod。 LUA_TNUMBER使用 hashnum > hashmod 和 LUA_TSHRSTR使用 hashstr > hashpow2

关于performance - 在我的速度测试中，Lua 表哈希索引比数组索引更快。为什么？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/53921190/

文章推荐： java - 如何处理PreparedStatement中的单引号？

文章推荐： java - 如何从字符串反序列化请求多部分正文

文章推荐： java - 检查对象的过期日期以在后台删除

regex - Grep 所有不以#(哈希)或贪心空格和#(哈希)开头的行
我正在尝试 grep conf 文件中所有不以开头的有效行哈希(或) 任意数量的空格(0 个或多个)和一个散列下面的正则表达式似乎不起作用。 grep ^[^[[:blank:]]*#] /op
带斜线的 Laravel 哈希
我正在使用哈希通过 URL 发送 protected 电子邮件以激活帐户 Hash::make($data["email"]); 但是哈希结果是 %242y%2410%24xaiB/eO6knk8sL
来自文本文件的 Perl 哈希
我是 Perl 的新手，正在尝试从文本文件创建散列。我有一个代码外部的文本文件，旨在供其他人编辑。前提是他们应该熟悉 Perl 并且知道在哪里编辑。文本文件本质上包含几个散列的散列，具有正确的语法、缩
perl 哈希 - 比较键和值
我一直在阅读 perl 文档，但我不太了解哈希。我正在尝试查找哈希键是否存在，如果存在，则比较其值。让我感到困惑的是，我的搜索结果表明您可以通过 if (exists $files{$key}) 找到
当键和值都是数组引用时的 Perl 哈希
我遇到了数字对映射到其他数字对的问题。例如，(1,2)->(12,97)。有些对可能映射到多个其他对，所以我真正需要的是将一对映射到列表列表的能力，例如 (1,2)->((12,97),(4,1))。
Mustache:从模板中检索标签列表/哈希？
我见过的所有 Mustache 文档和示例都展示了如何使用散列来填充模板。我有兴趣去另一个方向。 EG，如果我有这个: Hello {{name}} mustache 能否生成这个(伪代码): tag
hash - ColdFusion 哈希
我正在尝试使用此公式创建密码摘要以获取以下变量，但我的代码不匹配。不确定我做错了什么，但当我需要帮助时我会承认。希望有人在那里可以提供帮助。文档中的公式:Base64(SHA1(NONCE + TI
arrays - 遍历数据数组/哈希
我希望遍历我传递给定路径的这些数据结构(基本上是目录结构)。目标是列出根/基本路径，然后列出所有子 path s 如果它们存在并且对于每个子 path存在，列出 file从那个子路径。我知道这可能
子函数的 Perl 哈希
我希望有一个包含对子函数的引用的散列，我可以在其中根据用户定义的变量调用这些函数，我将尝试给出我正在尝试做的事情的简化示例。 my %colors = ( vim => setup_vim()
vim - 为什么写入文件会更改内容(哈希)？
我注意到，在使用 vim 将它们复制粘贴到文件中后尝试生成一些散列时，散列不是它应该的样子。打开和写出文件时相同。与 nano 的行为相同，所以一定有我遗漏的地方。 $ echo -n "foo"
perl - 为什么我们不能在列表上下文中初始化状态数组/哈希？
数组和散列作为状态变量存在限制。从 Perl 5.10 开始，我们无法在列表上下文中初始化它们: 所以 state @array = qw(a b c); #Error! 为什么会这样？为什么这是不允
Varnish vcl_backend_response检测vcl_recv返回(哈希)
在端口 80 上使用 varnish 5.1 的多网站设置中，我不想缓存所有域。这在 vcl_recv 中很容易完成。 if ( req.http.Host == "cache.this.domai
Django 管道缓存破坏不更新缓存文件/哈希
基本上，缓存破坏文件上的哈希不会更新。 class S3PipelineStorage(PipelineMixin, CachedFilesMixin, S3BotoStorage): pa
eclipse - 调试Dart应用程序时变量的唯一ID(哈希？)
eclipse dart插件在“变量” View 中显示如下内容: 在“值”列中可见的“id”是什么意思？ “id”是唯一的吗？在调试期间，如何确定两个实例是否相同？我是否需要在所有类中重写toStr
arrays - 将相同类型的命令行参数读入Powershell中的数组/哈希
如何将Powershell中的命令行参数读入数组？就像是 myprogram -file file1 -file file2 -file file3 然后我有一个数组 [file1,file2,fil
用于安全支付网关的 coldfusion 哈希
我正尝试在 coldfusion 中为我们的安全支付网关创建哈希密码以接受交易。很遗憾，支付网关拒绝接受我生成的哈希值。表单发送交易的所有元素，并发送基于五个不同字段生成的哈希值。在 PHP 中
Ruby - 哈希 - 组合
例如，我有一个包含 5 个元素的哈希: my_hash = {a: 'qwe', b: 'zcx', c: 'dss', d: 'ccc', e: 'www' } 我的目标是每次循环哈希时都返回，但没
哈希问题的 Perl 哈希
我在这里看到了令人作呕的类似问题，但没有一个能具体回答我自己的问题。我正在尝试以编程方式创建哈希的哈希。我的问题代码如下: my %this_hash = (); if ($user_hash{$u
用于安全支付网关的 coldfusion 哈希
我正尝试在 coldfusion 中为我们的安全支付网关创建哈希密码以接受交易。很遗憾，支付网关拒绝接受我生成的哈希值。表单发送交易的所有元素，并发送基于五个不同字段生成的哈希值。在 PHP 中
Java 哈希(简单)
这个问题已经有答案了: Java - how to convert letters in a string to a number? (9 个回答) 已关闭 7 年前。我需要一种简短的方法将字符串转

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

performance - 在我的速度测试中，Lua 表哈希索引比数组索引更快。为什么？