performance - 了解 Clojure 转换器性能-6ren

performance - 了解 Clojure 转换器性能

转载作者：行者123 更新时间：2023-12-03 16:47:03

26

4

在较高的层次上，我理解使用转换器不会创建任何中间数据结构，而通过 ->> 产生一长串操作。确实如此，因此换能器方法的性能更高。这在我下面的一个例子中被证明是正确的。但是，当我添加 clojure.core.async/chan 时我没有得到我期望的相同性能改进。显然有些东西我不明白，我希望得到解释。

(ns dev
  (:require [clojure.core.async :as async]

            [criterium.core :as crit]))

;; Setup some toy data.
(def n 1e6)
(def data (repeat n "1"))


;; Reusable thread-last operation (the "slower" one).
(defn tx [x]
  (->> x
       (map #(Integer. %))
       (map inc) (map inc) (map inc) (map inc) (map inc) (map inc)
       (map inc) (map inc) (map inc) (map inc) (map inc)))

;; Reusable transducer (the "faster" one).
(def xf (comp
          (map #(Integer. %))
          (map inc) (map inc) (map inc) (map inc) (map inc) (map inc)
          (map inc) (map inc) (map inc) (map inc) (map inc)))

;; For these first two I expect the second to be faster and it is.
(defn nested []
  (last (tx data)))

(defn into-xf []
  (last (into [] xf data)))

;; For the next two I again expect the second to be faster but it is NOT.
(defn chan-then-nested []
  (let [c (async/chan n)]
    (async/onto-chan! c data)
    (->> c
         (async/into [])
         async/<!!
         tx
         last)))

(defn chan-xf []
  (let [c (async/chan n xf)]
    (async/onto-chan! c data)
    (->> c
         (async/into [])
         async/<!!
         last)))

(comment

  (crit/quick-bench (nested)) ; 1787.672 ms
  (crit/quick-bench (into-xf)) ; 822.8626 ms
  (crit/quick-bench (chan-then-nested)) ; 1535.628 ms
  (crit/quick-bench (chan-xf)) ; 2072.626 ms

  ;; Expected ranking fastest to slowest
  ;; into-xf
  ;; nested
  ;; chan-xf
  ;; chan-then-nested

  ;; Actual ranking fastest to slowest
  ;; into-xf
  ;; chan-then-nested
  ;; nested
  ;; chan-xf

  )

最后有两个结果我不明白。首先，为什么使用带有 channel 的传感器比从 channel 读取所有内容然后进行嵌套映射慢？看起来，使用带有 channel 的换能器的“开销”或其他任何东西要慢得多，以至于它压倒了不创建中间数据结构的 yield 。其次，这个真的出乎意料，为什么把数据放到一个 channel 上然后取下来然后使用嵌套映射技术比不做 channel 舞只使用嵌套映射技术更快？ (说得更短，为什么 chan-then-nested 比 nested 快？)这一切可能只是基准测试或随机性的产物吗？ (我已经为每个这些都运行了几次 quick-bench，结果相同。)我想知道它是否与 into 有关。调用 transduce而在 channel 版本中根本没有以相同的方式实现。转换器提供了相同的界面来应用跨向量或 channel 的转换，但应用转换的方式不同；而这种差异决定了一切。

最佳答案

关于你的方法的一些评论:

拥有一个缓冲区大小为 100 万的 channel 是非常不寻常的。我不希望从这种用法得出的基准对现实世界的程序有很大的适用性。只需使用大小为 1 的缓冲区。这足以让此应用程序成功，并且更接近真实世界的使用情况。

没必要挑这么大的n .如果您的函数运行得更快，则标准可以采集更多样本，从而更准确地估计其平均时间。 n=100 足够了。

进行这些更改后，这是我看到的基准数据:

Evaluation count : 14688 in 6 samples of 2448 calls.
             Execution time mean : 39.978735 µs
    Execution time std-deviation : 1.238587 µs
   Execution time lower quantile : 38.870558 µs ( 2.5%)
   Execution time upper quantile : 41.779784 µs (97.5%)
                   Overhead used : 10.162171 ns
Evaluation count : 20094 in 6 samples of 3349 calls.
             Execution time mean : 30.557295 µs
    Execution time std-deviation : 562.641738 ns
   Execution time lower quantile : 29.936152 µs ( 2.5%)
   Execution time upper quantile : 31.330094 µs (97.5%)
                   Overhead used : 10.162171 ns
Evaluation count : 762 in 6 samples of 127 calls.
             Execution time mean : 740.642963 µs
    Execution time std-deviation : 176.879454 µs
   Execution time lower quantile : 515.588780 µs ( 2.5%)
   Execution time upper quantile : 949.109898 µs (97.5%)
                   Overhead used : 10.162171 ns

Found 2 outliers in 6 samples (33.3333 %)
    low-severe   1 (16.6667 %)
    low-mild     1 (16.6667 %)
 Variance from outliers : 64.6374 % Variance is severely inflated by outliers
Evaluation count : 816 in 6 samples of 136 calls.
             Execution time mean : 748.782942 µs
    Execution time std-deviation : 7.157018 µs
   Execution time lower quantile : 740.139618 µs ( 2.5%)
   Execution time upper quantile : 756.102312 µs (97.5%)
                   Overhead used : 10.162171 ns

关键要点是:

异步开销在实际处理时间中占主导地位。两个 channel 版本都比非 channel 版本慢得多，所以我们不再需要担心“为什么把整个东西放到一个 channel 然后再取下来会更快”。

chan-then-nested的区别和 chan-xf比你的版本小得多。 chan-xf仍然有点慢，但很容易在一个标准偏差内:并不是一个了不起的结果。

关于performance - 了解 Clojure 转换器性能，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/65757419/

26

4

0

文章推荐： python - 在python中使用未指定的加密 key 解码加密文本

文章推荐： javascript - 如何使用样式化组件设置嵌套功能组件的样式

文章推荐： javascript - react : Calling Render() during animation. 会发生什么？

clojure - Clojure 为什么命名为 Clojure
为什么该语言的名称是“Clojure”？我用谷歌搜索了一下，在#clojure 中询问。到目前为止，还没有运气。最佳答案 Rich Hickey(他是 Clojure 的设计者)对此的评论是 wi
clojure - 从 Clojure 应用程序转换到最新的 Clojure 时遇到问题
我不明白为什么升级后会出现以下编译错误: Compiling addr-verify.core Exception in thread "main" java.lang.NoClassDefFound
clojure - 如何防止 Clojure 异常 : clojure. lang.LazySeq cannot be cast to clojure.lang.IFn
我试图将从映射操作返回的(惰性)序列传递给另一个映射操作，以便我可以在第一个序列中查找元素。代码从文本文件(以行/列格式)解析一些足球装置，清理它，然后返回一张 map 。这是代码: (ns fix
clojure - 在 Clojure 中过滤集合 clojure.set/select 与 clojure.core/filter
我想过滤一组，例如: (filter-set even? #{1 2 3 4 5}) ; => #{2 4} 如果我使用clojure.core/filter我得到一个不是集合的seq: (filte
clojure - clojure 中具有相同名称的多个函数
(defn hi[](+ 5 6)) (hi) (defn hi[](+ 6 7)) (hi) 你好，我是 clojure 的新手。如上所述，我编写了两个具有相同名称的函数。我们可以在 cloj
clojure - 十进制到二进制 Clojure
我按照这个伪代码递归地将十进制转换为二进制。 findBinary(decimal) if (decimal == 0) binary = 0 else binar
clojure - Clojure 中的浮点精度
我正在尝试学习 Clojure 并尝试定义这个简单的函数: user=> (defn triple [arg] (* 3 arg)) #'user/triple user=> (triple 1) 3
clojure - Clojure 中的宏 -> ->>
是->和 ->>宏只是为了使代码更具可读性还是它们还有其他特定功能？最佳答案线程优先( -> )和线程最后( ->> )是为了使代码更具可读性。但这已经很重要了! 它允许取消嵌套函数调用(示例取自
clojure - Clojure 中的线程安全流行？
我在 http://www.learningclojure.com/2010/11/yet-another-way-to-write-factorial.html 上找到了这个代码，但我不明白 pop
clojure - Clojure 序列推导式中是否存在变量？
我正在阅读 Programming Clojure 2nd edition，在第 49 页它涵盖了 Clojure 的 for 循环结构，它说它实际上是一个序列理解。作者建议使用以下代码: (def
clojure - Clojure 中的双端队列
Clojure 中有双端队列吗？我的印象是 Clojure 的 PersistentQueue 是单端的(我错了吗？)。我需要能够从队列的任一端删除(即“pop”)和“peek”数据。我所说的双端队列
clojure - clojure 有标识符宏吗？
换句话说，有没有办法在看起来不像 (MACRO arg* ...) 的表单上触发宏扩展？ . 举一个假设的例子: (defmacro my-var (do (printf "Using my-va
clojure - Clojure 中最简单的惰性函数
我很难理解懒惰。有人能帮我理解为什么我下面的函数不是懒惰的吗 (defn my-red ([f coll] (my-red f (first coll) (rest coll) ))
clojure - Clojure 中函数参数排序的经验法则
在 Clojure 核心中决定参数函数顺序的规则是什么(如果有的话)？类似 map 的函数和 filter期望数据结构作为最后一个争论。类似 assoc 的函数和 select-keys期待数据
clojure - Clojure 中的完成函数有什么作用？
我在 clojuredocs 上遇到过 completing 函数，但目前没有文档。你能提供一些例子吗？最佳答案 completing 用于扩充可能没有具有一元“完成”元数的一元重载的二元归约函数
clojure - Clojure 中记录的模式匹配
这个现在支持吗？我能找到的唯一信息是来自维基的示例( https://github.com/clojure/core.match/wiki/Deftype-and-defrecord-matching
clojure - clojure 代码示例中的这个函数定义有什么作用？
我正在关注“Clojure in Action”，对此我感到困惑: (defn with-log [function-to-call log-statement ] (fn [& args
clojure - Clojure 中函数名称中的箭头
对于下面的代码，箭头是宏还是函数名称中的简单字符？ (来自 here) (defn file->map [file] ;; TODO ) 最佳答案箭头是函数名称的一部分。有一个函数定义，不是
clojure - clojure 是否具有完全包含的范围功能？
Clojure 的 range函数包含来自 start独家在end (如果提供)。核心库中是否有一个函数可以提供完全包含(开始和结束)的范围？我发现在某些情况下必须调整最终值的代码 - 例如向下而不
clojure - Clojure 中的动态记录
当我尝试从 REPL 运行以下代码时(使用动态记录): (defrecord (symbol "rec2") (vec (map symbol ["f1" "f2"]))) 我收到错误 Compile

首页

博学

6Ren·AI

商城

performance - 了解 Clojure 转换器性能