Java伪尾调用递归产生更好的性能-6ren

Java伪尾调用递归产生更好的性能

转载作者：行者123 更新时间：2023-12-01 21:49:26

我只是编写一些简单的实用程序来计算 linkedList 的长度，这样 linkedList 就不会托管其大小/长度的“内部”计数器。考虑到这一点，我有 3 个简单的方法:

迭代linkedList，直到到达“结束”
递归计算长度
递归计算长度，无需将控制权返回给调用函数(使用某种尾部调用递归)

下面是一些捕获这 3 种情况的代码:

// 1. iterative approach
public static <T> int getLengthIteratively(LinkedList<T> ll) {

    int length = 0;
    for (Node<T> ptr = ll.getHead(); ptr != null; ptr = ptr.getNext()) {
        length++;
    }

    return length;
}

// 2. recursive approach
public static <T> int getLengthRecursively(LinkedList<T> ll) {
    return getLengthRecursively(ll.getHead());
}

private static <T> int getLengthRecursively(Node<T> ptr) {

    if (ptr == null) {
        return 0;
    } else {
        return 1 + getLengthRecursively(ptr.getNext());
    }
}

// 3. Pseudo tail-recursive approach
public static <T> int getLengthWithFakeTailRecursion(LinkedList<T> ll) {
    return getLengthWithFakeTailRecursion(ll.getHead());
}

private static <T> int getLengthWithFakeTailRecursion(Node<T> ptr) {
    return getLengthWithFakeTailRecursion(ptr, 0);
}

private static <T> int getLengthWithFakeTailRecursion(Node<T> ptr, int result) {
    if (ptr == null) {
        return result;
    } else {
        return getLengthWithFakeTailRecursion(ptr.getNext(), result + 1);
    }
}

现在我知道 JVM 不支持开箱即用的尾递归，但是当我运行一些具有约 10k 节点的字符串 linkedList 的简单测试时，我注意到 getLengthWithFakeTailRecursion 始终优于 getLengthRecursively 方法(约 40%)。增量是否仅归因于案例#2 的控制权被传回每个节点并且我们被迫遍历所有堆栈帧这一事实？

编辑:这是我用来检查性能数字的简单测试:

public class LengthCheckerTest {

@Test
public void testLengthChecking() {

    LinkedList<String> ll = new LinkedList<String>();
    int sizeOfList = 12000;
    // int sizeOfList = 100000; // Danger: This causes a stackOverflow in recursive methods!
    for (int i = 1; i <= sizeOfList; i++) {
        ll.addNode(String.valueOf(i));
    }

    long currTime = System.nanoTime();
    Assert.assertEquals(sizeOfList, LengthChecker.getLengthIteratively(ll));
    long totalTime = System.nanoTime() - currTime;
    System.out.println("totalTime taken with iterative approach: " + (totalTime / 1000) + "ms");

    currTime = System.nanoTime();
    Assert.assertEquals(sizeOfList, LengthChecker.getLengthRecursively(ll));
    totalTime = System.nanoTime() - currTime;
    System.out.println("totalTime taken with recursive approach: " + (totalTime / 1000) + "ms");

    // Interestingly, the fakeTailRecursion always runs faster than the vanillaRecursion
    // TODO: Look into whether stack-frame collapsing has anything to do with this
    currTime = System.nanoTime();
    Assert.assertEquals(sizeOfList, LengthChecker.getLengthWithFakeTailRecursion(ll));
    totalTime = System.nanoTime() - currTime;
    System.out.println("totalTime taken with fake TCR approach: " + (totalTime / 1000) + "ms");
}
}

最佳答案

您的基准测试方法存在缺陷。您在同一个 JVM 中执行所有三个测试，因此它们的位置并不相同。当执行假尾测试时，LinkedList 和 Node 类已经经过 JIT 编译，因此运行速度更快。您可以更改测试顺序，您会看到不同的数字。每个测试都应该在单独的 JVM 中执行。

我们写简单的JMH microbenchmark对于您的情况:

import java.util.concurrent.TimeUnit;

import org.openjdk.jmh.infra.Blackhole;
import org.openjdk.jmh.annotations.*;

// 5 warm-up iterations, 500 ms each, then 10 measurement iterations 500 ms each
// repeat everything three times (with JVM restart)
// output average time in microseconds
@Warmup(iterations = 5, time = 500, timeUnit = TimeUnit.MILLISECONDS)
@Measurement(iterations = 10, time = 500, timeUnit = TimeUnit.MILLISECONDS)
@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.MICROSECONDS)
@Fork(3)
@State(Scope.Benchmark)
public class ListTest {
    // You did not supply Node and LinkedList implementation
    // but I assume they look like this
    static class Node<T> {
        final T value;
        Node<T> next;

        public Node(T val) {value = val;}
        public void add(Node<T> n) {next = n;}

        public Node<T> getNext() {return next;}
    }

    static class LinkedList<T> {
        Node<T> head;

        public void setHead(Node<T> h) {head = h;}
        public Node<T> getHead() {return head;}
    }

    // Code from your question follows

    // 1. iterative approach
    public static <T> int getLengthIteratively(LinkedList<T> ll) {

        int length = 0;
        for (Node<T> ptr = ll.getHead(); ptr != null; ptr = ptr.getNext()) {
            length++;
        }

        return length;
    }

    // 2. recursive approach
    public static <T> int getLengthRecursively(LinkedList<T> ll) {
        return getLengthRecursively(ll.getHead());
    }

    private static <T> int getLengthRecursively(Node<T> ptr) {

        if (ptr == null) {
            return 0;
        } else {
            return 1 + getLengthRecursively(ptr.getNext());
        }
    }

    // 3. Pseudo tail-recursive approach
    public static <T> int getLengthWithFakeTailRecursion(LinkedList<T> ll) {
        return getLengthWithFakeTailRecursion(ll.getHead());
    }

    private static <T> int getLengthWithFakeTailRecursion(Node<T> ptr) {
        return getLengthWithFakeTailRecursion(ptr, 0);
    }

    private static <T> int getLengthWithFakeTailRecursion(Node<T> ptr, int result) {
        if (ptr == null) {
            return result;
        } else {
            return getLengthWithFakeTailRecursion(ptr.getNext(), result + 1);
        }
    }

    // Benchmarking code

    // Measure for different list length        
    @Param({"10", "100", "1000", "10000"})
    int n;
    LinkedList<Integer> list;

    @Setup    
    public void setup() {
        list = new LinkedList<>();
        Node<Integer> cur = new Node<>(0);
        list.setHead(cur);
        for(int i=1; i<n; i++) {
            Node<Integer> next = new Node<>(i);
            cur.add(next);
            cur = next;
        }
    }

    // Do not forget to return result to the caller, so it's not optimized out
    @Benchmark    
    public int testIteratively() {
        return getLengthIteratively(list);
    }

    @Benchmark    
    public int testRecursively() {
        return getLengthRecursively(list);
    }

    @Benchmark    
    public int testRecursivelyFakeTail() {
        return getLengthWithFakeTailRecursion(list);
    }
}

这是我的机器上的结果(x64 Win7，Java 8u71)

Benchmark                           (n)  Mode  Cnt   Score    Error  Units
ListTest.testIteratively             10  avgt   30   0,009 ±  0,001  us/op
ListTest.testIteratively            100  avgt   30   0,156 ±  0,001  us/op
ListTest.testIteratively           1000  avgt   30   2,248 ±  0,036  us/op
ListTest.testIteratively          10000  avgt   30  26,416 ±  0,590  us/op
ListTest.testRecursively             10  avgt   30   0,014 ±  0,001  us/op
ListTest.testRecursively            100  avgt   30   0,191 ±  0,003  us/op
ListTest.testRecursively           1000  avgt   30   3,599 ±  0,031  us/op
ListTest.testRecursively          10000  avgt   30  40,071 ±  0,328  us/op
ListTest.testRecursivelyFakeTail     10  avgt   30   0,015 ±  0,001  us/op
ListTest.testRecursivelyFakeTail    100  avgt   30   0,190 ±  0,002  us/op
ListTest.testRecursivelyFakeTail   1000  avgt   30   3,609 ±  0,044  us/op
ListTest.testRecursivelyFakeTail  10000  avgt   30  41,534 ±  1,186  us/op

如您所见，假尾速度与简单递归速度相同(在误差范围内)，但比迭代方法慢 20-60%。所以你的结果没有被重现。

如果您实际上想要的不是稳态测量结果，而是单次测量结果(无需预热)，您可以使用以下选项启动相同的基准测试:-ss -wi 0 -i 1 -f 10 。结果如下:

Benchmark                           (n)  Mode  Cnt    Score     Error  Units
ListTest.testIteratively             10    ss   10   16,095 ±   0,831  us/op
ListTest.testIteratively            100    ss   10   19,780 ±   6,440  us/op
ListTest.testIteratively           1000    ss   10   74,316 ±  26,434  us/op
ListTest.testIteratively          10000    ss   10  366,496 ±  42,299  us/op
ListTest.testRecursively             10    ss   10   19,594 ±   7,084  us/op
ListTest.testRecursively            100    ss   10   21,973 ±   0,701  us/op
ListTest.testRecursively           1000    ss   10  165,007 ±  54,915  us/op
ListTest.testRecursively          10000    ss   10  563,739 ±  74,908  us/op
ListTest.testRecursivelyFakeTail     10    ss   10   19,454 ±   4,523  us/op
ListTest.testRecursivelyFakeTail    100    ss   10   25,518 ±  11,802  us/op
ListTest.testRecursivelyFakeTail   1000    ss   10  158,336 ±  43,646  us/op
ListTest.testRecursivelyFakeTail  10000    ss   10  755,384 ± 232,940  us/op

正如您所看到的，第一次启动比后续启动慢很多倍。而且你的结果仍然没有重现。我观察到，对于 n = 10000，testRecursivelyFakeTail 实际上较慢(但在预热后，它达到了与 testRecursively 相同的峰值速度。

关于Java伪尾调用递归产生更好的性能，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/35423375/

文章推荐： java - 制作Java游戏，发现很容易作弊，不知道如何预防

文章推荐： java - ListView不显示ArrayList中的数据

http - 伪/冒号标题字段的用途
如标题所示，我正在寻找有关伪/冒号 header 字段用途的一些信息，即我想知道为什么我们有第二种类型的 header 字段... 另外 - 我知道在 http2 中使用伪/冒号 header 字段代
深入解析PHP中的(伪)多线程与多进程
(伪)多线程：借助外力利用WEB服务器本身的多线程来处理，从WEB服务器多次调用我们需要实现多线程的程序。 QUOTE: 我们知道PHP本身是不支持多线程的, 但是我们的WEB服务器是支持多线程的
random - 如何实现(伪)硬件随机数生成器
您如何在 HDL (verilog) 中实现硬件随机数生成器？需要考虑哪些选项？这个问题是在self-answer之后格式。鼓励添加答案和更新。最佳答案正如摩根的回答中所指出的，这只会产生一个
css - 伪 :before outside of div
我写了这个CSS: div { width: 500px; height:150px; margin-left:150px; background: lightblue; } div:
java - 将“伪”时间戳存储到数据库中
这是我要解决的问题：从数据库A读取一个字符串，将该字符串转换为Date对象，将Date对象存储到数据库B中。例）数据库A：从数据库A读入日期字符串“ 2015-03-08 02:00:00”，转换为
c++ - 如何在(伪？)运行时迭代可变参数模板包？
我想创建 std::fscanf() 的 sibling (我知道这是一个 C 函数)。所以，我的界面是这样的: template std::size_t ts_scanf(is, format,
sql - 伪 IF/Case 帮助
运行 PostgreSQL 7.x(是的，我正在升级) 问题: 如果没有返回数据，我有三到四个字段需要设置。正在考虑这样的事情 SELECT CASE WHEN default_fie
javascript - 重复无效的用户输入，直到其有效(伪)
出于某种原因，我很难在 JS 中为我的游戏执行以下代码: 假设我们要求用户在棋盘上移动一个棋子。他们可以做的位置是位置A、位置B或位置C。每个位置一次只能容纳一件。否则为无效移动。第一个用户决定
c++ - 霍夫曼编码 - 伪 EOF
我已经毫无问题地编写了霍夫曼树的代码，但现在我希望在文件和树中添加伪 EOF，以便我知道何时停止从文件中读取。我完全掌握了伪 EOF 的概念。我还了解到没有 ASCII 值 > 255 的字符。我
javascript - 伪 :after class 的切换类
给定一个按钮 ::after 当被触发时，伪 :after 类需要有一个类 search-active 切换，为按钮设置背景颜色 .primary .search:after, .primary
自动编号前的 CSS 伪 - 缩进
我想让第一行的文本像第二行一样缩进 (50px)。有什么办法吗？非常感谢! body{ counter-reset: h2counter; } h1{ counter-reset: h2counter
不同继承之前/之后的 CSS 伪
:before 或 :after 这样的伪元素是否可以从父元素的不同属性继承值？在我的例子中，我有一个第三方组件设置其元素运行时的背景颜色...我需要继承该颜色并将其设置为伪元素的边框颜色。最佳答
matlab - Matlab中并行循环中的不同(伪)随机数
在并行循环中请求随机数总是返回相同的伪随机数。我怎样才能避免这种情况？ % workers initialization: if matlabpool('size') == 0 matlabp
c# - 伪 IPv4 正则表达式
假设最大IP可以包含每个“点”括号中的最大数量999，即999.999.999.999 是最大的可用值。我已经在计算器中检查了正则表达式 ([0-9]+.){3}[0-9]。那么，为什么程序抛出运行
c - 很好的介绍(伪)随机数生成
我对随机数生成的概念非常陌生，我需要为用c编写的工作创建自己的算法（内置的随机数生成器对我不起作用）。有人能给我介绍一个很好的主题，这样我就可以理解这个概念了吗？到目前为止，我所发现的一切似乎都是用
algorithm - 寻找一种算法以(伪)随机顺序吐出一系列数字
假设我有一个数字序列:{n, n+1, n+2, ... n+m} 在不提前存储数字的情况下，我想创建一个函数 f()，给定序列 {1,2,3,...m} 将以随机(或至少伪)的方式吐出原始集合随机)
tcp - 伪 Tcp channel
什么是伪 tcp channel ，如何实现？最佳答案伪 TCP 是一种协议(protocol)，它实现了 TCP 的一些思想，以通过不可靠的、基于数据包的接口(interface)提供可靠的数据
python - 以(伪)随机顺序从大列表中高效地生成元素
我正在尝试展开一些嵌套循环，以牺牲内存为代价(可能)获得更好的性能。在我的场景中，我最终会得到一个包含大约 3 亿个元素(元组)的列表，我必须以(或多或少)随机顺序产生这些元素。在这个数量级上，ra
php - 生成(伪)随机字母数字字符串
如何在 PHP 中生成(伪)随机字母数字字符串，例如:'d79jd8c'？最佳答案首先创建一个包含所有可能字符的字符串: $characters = 'abcdefghijklmnopqrstu
c++ - 如何获得两个(伪)随机但彼此不同的容器迭代器/元素？
我有一段代码可以为玩家生成迷你任务。这很简单，要获得两个不同的点(起点和终点)，我有一个如下所示的算法: std::vector missions; missions.push_bac

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

Java伪尾调用递归产生更好的性能