common-lisp - Getting more speed out of Common Lisp/SBCL

Reposted by: 行者123 · Updated: 2023-12-02 22:28:04

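This paper claims to make a certain Lisp program run faster than its equivalent in C. Trying to reproduce the results, I was able to get close (Lisp is 50% slower than C), but I would like to know whether anyone knows how to squeeze more performance out of SBCL 1.3.1.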

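The target problem is adding a constant single float to every cell of an 800 x 800 array of single floats. The method is to write the program in C and in Common Lisp and compare the times. Using this portable timer, the C code is as follows: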

#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>
#include <assert.h>
#include <time.h>

#include "./modules/tictoc/tictoc.h"

const int HORZ = 800;
const int VERT = 800;

#define PERF_REPS 1000

typedef float DATA_T;

struct image_s {
    size_t n;
    size_t w, h;
    DATA_T * data;
};
typedef struct image_s image;

image * make_image (size_t w, size_t h) {
    size_t n = w * h;
    DATA_T * data = (DATA_T *)malloc(sizeof(DATA_T) * n);
    assert (NULL != data);
    image * result = (image *)malloc(sizeof(image));
    assert (NULL != result);
    result->n = n;
    result->w = w;
    result->h = h;
    result->data = data;
    return result;
}

void free_image (image * it) {
    assert (NULL != it);
    assert (NULL != it->data);
    free (it->data);
    free (it);
}

image * init_to_value (image * it, DATA_T val) {
    assert (NULL != it);
    assert (NULL != it->data);
    size_t i;
    const size_t n = it->n;
    for (i = 0; i < n; ++i) {
        it->data[i] = val;
    }
    return it;
}

void add (image * to, image * from, DATA_T val) {
    assert (NULL != to);
    assert (NULL != to->data);
    assert (NULL != from);
    assert (NULL != from->data);
    size_t i;
    const size_t n = to->n;
    for (i = 0; i < n; ++i) {
        to->data[i] = from->data[i] + val;
    }
}

int main (int argc, char ** argv) {
    image * from = init_to_value (make_image (HORZ, VERT), 0.0f);
    image * to = init_to_value (make_image (HORZ, VERT), 0.0f);
    TicTocTimer clock = tic();
    for (size_t i = 0; i < PERF_REPS; ++i)
        add (to, from, 42.0);
    printf("Elapsed time %f seconds.\n",toc(&clock));
    free_image (to);
    free_image (from);
    return 0;
}

I compile and run the code as follows:

gcc -O3 image-add.c ./modules/tictoc/libtictoc.a && ./a.out

Typical times on my MacBook Pro are about 0.178 seconds. Pretty good.

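The equivalent Lisp code, using every option I could find in the Lisp HyperSpec, in the new book Common Lisp Recipes, and in the SBCL user manual, is as follows. The comments point out some things I tried that made no difference.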

;;; None of the commented-out declarations made any difference in speed. 

(declaim (optimize speed (safety 0)))

(defun image-add (to from val)
  (declare (type (simple-array single-float (*))
                 to from))
  (declare (type single-float val))
  (let ((size (array-dimension to 0)))
    ;(declare (type fixnum size))
    (dotimes (i size)
      ;(declare (type fixnum i))
      (setf (aref to i) (+ (aref from i) val)))))

(defparameter HORZ 800)
(defparameter VERT 800)

(defparameter PERF-REPS 1000)

(let ((to (make-array (* HORZ VERT) :element-type 'single-float))
      (fm (make-array (* HORZ VERT) :element-type 'single-float)))
  ;(declare (type fixnum HORZ))
  ;(declare (type fixnum VERT))
  (time (dotimes (_ PERF-REPS)
          ;(declare (type fixnum PERF-REPS))
          ;(declare (type fixnum _))
          ;(declare (inline image-add))
          (image-add to fm 42.0))))

I compile and run it as follows:

sbcl --script image-perf.lisp

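Typical run times are 0.276 seconds. Not bad, but I would like better. Of course, the whole point of this exercise is that the Lisp code is shorter, but does anyone know a way to make it as fast or faster?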

Best answer

Here are some results from a slightly modified version, for reference.

C version

The C version takes 0.197 s on average.

Lisp version

(declaim (optimize (speed 3) (debug 0) (safety 0)))

(defconstant HORZ 800)
(defconstant VERT 800)
(defconstant PERF-REPS 1000)

(defun test ()
  (let ((target #1=(make-array (* HORZ VERT)
                               :element-type 'single-float
                               :initial-element 0f0))
        (source #1#))
    (declare (type (simple-array single-float (*)) target source))
    (time 
      (dotimes (_ PERF-REPS)
        (map-into target
                  (lambda (x)
                    (declare (single-float x))
                    (the single-float (+ x 42f0)))
                  source)))))

Here is the output:

Evaluation took:
  0.372 seconds of real time
  0.372024 seconds of total run time (0.368023 user, 0.004001 system)
  100.00% CPU
  965,075,988 processor cycles
  0 bytes consed

Replacing map-into with lparallel:pmap-into, the shortest time is obtained with a kernel made of 4 worker threads and gives:

Evaluation took:
  0.122 seconds of real time
  0.496031 seconds of total run time (0.492030 user, 0.004001 system)
  406.56% CPU
  316,445,789 processor cycles
  753,280 bytes consed

Note the difference in memory usage.
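The pmap-into variant is only described in words above, so here is a minimal sketch of what it might look like. It assumes Quicklisp is installed and loaded so that lparallel can be fetched; the names test-parallel, +size+ and +reps+ are illustrative and do not come from the original answer. The worker pool is created with lparallel:make-kernel and bound to lparallel:*kernel* before pmap-into is called.

```lisp
;; Sketch only: assumes Quicklisp is available in the running image.
(eval-when (:compile-toplevel :load-toplevel :execute)
  (ql:quickload :lparallel))

(declaim (optimize (speed 3) (debug 0) (safety 0)))

;; Illustrative names (not from the original answer).
(defconstant +size+ (* 800 800))
(defconstant +reps+ 1000)

;; A kernel with 4 worker threads, matching the timing quoted above.
(setf lparallel:*kernel* (lparallel:make-kernel 4))

(defun test-parallel ()
  (let ((target (make-array +size+
                            :element-type 'single-float
                            :initial-element 0f0))
        (source (make-array +size+
                            :element-type 'single-float
                            :initial-element 0f0)))
    (declare (type (simple-array single-float (*)) target source))
    (time
     (dotimes (_ +reps+)
       ;; Same call shape as map-into; lparallel splits the work
       ;; across the kernel's worker threads.
       (lparallel:pmap-into target
                            (lambda (x)
                              (declare (single-float x))
                              (the single-float (+ x 42f0)))
                            source)))
    target))

;; Spot-check one element after the timed loop, then shut the workers down.
(format t "first element: ~a~%" (aref (test-parallel) 0))
(lparallel:end-kernel :wait t)
```

Load it from an SBCL session that has Quicklisp set up (for example via the init file) rather than with --script, which skips the init file. Timings and the bytes consed figure will vary by machine. The nonzero consing in the parallel run presumably comes from lparallel's per-call task bookkeeping, whereas the plain map-into loop allocates nothing.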

Regarding common-lisp - Getting more speed out of Common Lisp/SBCL, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/34999052/
