gpt4 book ai didi

haskell - 为什么 writeSTRef 比 if 表达式快?

转载 作者:行者123 更新时间:2023-12-04 10:01:26 26 4
gpt4 key购买 nike

每次迭代 writeSTRef 两次

fib3 :: Int -> Integer
fib3 n = runST $ do
a <- newSTRef 1
b <- newSTRef 1
replicateM_ (n-1) $ do
!a' <- readSTRef a
!b' <- readSTRef b
writeSTRef a b'
writeSTRef b $! a'+b'
readSTRef b

每次迭代 writeSTRef 一次
fib4 :: Int -> Integer
fib4 n = runST $ do
a <- newSTRef 1
b <- newSTRef 1
replicateM_ (n-1) $ do
!a' <- readSTRef a
!b' <- readSTRef b
if a' > b'
then writeSTRef b $! a'+b'
else writeSTRef a $! a'+b'
a'' <- readSTRef a
b'' <- readSTRef b
if a'' > b''
then return a''
else return b''

基准,给定 n = 20000 :

benchmarking 20000/fib3 mean: 5.073608 ms, lb 5.071842 ms, ub 5.075466 ms, ci 0.950 std dev: 9.284321 us, lb 8.119454 us, ub 10.78107 us, ci 0.950

benchmarking 20000/fib4 mean: 5.384010 ms, lb 5.381876 ms, ub 5.386099 ms, ci 0.950 std dev: 10.85245 us, lb 9.510215 us, ub 12.65554 us, ci 0.950



fib3 比 fib4 快一点。

最佳答案

我想你已经从#haskell 那里得到了一些答案;基本上,每个 writeSTRef 归结为对内存的一两次写入,在这种情况下这很便宜,因为它们甚至可能永远不会超过 1 级缓存。

另一方面,由 fib3 中的 if-then-else 产生的分支创建了两条在连续迭代中交替采用的路径,这对许多 CPU 分支预测器来说是一个糟糕的情况,给管道添加了气泡。见 http://en.wikipedia.org/wiki/Instruction_pipeline .

纯版怎么样?

fib0 :: Int -> Integer
fib0 = go 0 1 where
go :: Integer -> Integer -> Int -> Integer
go a b n = case n > 0 of
True -> go b (a + b) (n - 1)
False -> b

它甚至更快:
benchmarking fib0 40000
mean: 17.14679 ms, lb 17.12902 ms, ub 17.16739 ms, ci 0.950
std dev: 97.28594 us, lb 82.39644 us, ub 120.1041 us, ci 0.950

benchmarking fib3 40000
mean: 17.32658 ms, lb 17.30739 ms, ub 17.34931 ms, ci 0.950
std dev: 106.7610 us, lb 89.69371 us, ub 126.8279 us, ci 0.950

benchmarking fib4 40000
mean: 18.13887 ms, lb 18.11173 ms, ub 18.16868 ms, ci 0.950
std dev: 145.9772 us, lb 127.6892 us, ub 168.3347 us, ci 0.950

关于haskell - 为什么 writeSTRef 比 if 表达式快?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/9883791/

26 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com