Slower performance after upgrading from OptaPlanner 8.22.1 to Timefold 1.1.0 or OptaPlanner 8.37.0

Reposted by: bug小助手. Updated: 2023-10-24 22:29:14

I have a course scheduling application based on OptaPlanner 8.22.1.Final.

After upgrading to Timefold 1.1.0, the execution time of the performance test cases increased by about 100%. The application code is the same apart from the changes needed to point to the Timefold library. The JDK version also went up from 11 to 17.

Here are some details about the tests.

Test case 1

Before

2023-09-10 13:02:17,390 INFO Solving started: time spent (5), best score (-730init/0hard/0medium/0soft), environment mode (REPRODUCIBLE), move thread count (4), random (JDK with seed 0).
2023-09-10 13:02:17,902 INFO Construction Heuristic phase (0) ended: time spent (517), best score (-40hard/-2265medium/0soft), score calculation speed (32158/sec), step total (365).
2023-09-10 13:03:31,283 INFO Local Search phase (1) ended: time spent (73898), best score (0hard/0medium/0soft), score calculation speed (24932/sec), step total (28873).
2023-09-10 13:03:31,283 INFO Solving ended: time spent (73898), best score (0hard/0medium/0soft), score calculation speed (24979/sec), phase total (2), environment mode (REPRODUCIBLE), move thread count (4).

After

2023-09-10 15:38:24,216 INFO Solving started: time spent (11), best score (-730init/0hard/0medium/0soft), environment mode (REPRODUCIBLE), move thread count (4), random (JDK with seed 0).
2023-09-10 15:38:24,590 INFO Construction Heuristic phase (0) ended: time spent (385), best score (-40hard/-2265medium/0soft), score calculation speed (87852/sec), step total (365).
2023-09-10 15:42:03,882 INFO Local Search phase (1) ended: time spent (219677), best score (0hard/-10medium/0soft), score calculation speed (35648/sec), step total (31041).
2023-09-10 15:42:03,883 INFO Solving ended: time spent (219677), best score (0hard/-10medium/0soft), score calculation speed (35734/sec), phase total (2), environment mode (REPRODUCIBLE), move thread count (4).

Test case 2:

Before

2023-09-10 13:03:32,508 INFO Solving started: time spent (16), best score (-3796init/0hard/0medium/0soft), environment mode (REPRODUCIBLE), move thread count (4), random (JDK with seed 13).
2023-09-10 13:03:34,728 INFO Construction Heuristic phase (0) ended: time spent (2236), best score (-10hard/-6460medium/0soft), score calculation speed (40084/sec), step total (1898).
2023-09-10 13:08:37,166 INFO Local Search phase (1) ended: time spent (304674), best score (0hard/0medium/0soft), score calculation speed (13550/sec), step total (83120).
2023-09-10 13:08:37,167 INFO Solving ended: time spent (304675), best score (0hard/0medium/0soft), score calculation speed (13742/sec), phase total (2), environment mode (REPRODUCIBLE), move thread count (4).

After

2023-09-10 15:42:04,616 INFO Solving started: time spent (32), best score (-3796init/0hard/0medium/0soft), environment mode (REPRODUCIBLE), move thread count (4), random (JDK with seed 13).
2023-09-10 15:42:07,385 INFO Construction Heuristic phase (0) ended: time spent (2801), best score (-10hard/-6460medium/0soft), score calculation speed (64265/sec), step total (1898).
2023-09-10 15:52:28,340 INFO Local Search phase (1) ended: time spent (623756), best score (0hard/0medium/0soft), score calculation speed (12726/sec), step total (82742).
2023-09-10 15:52:28,341 INFO Solving ended: time spent (623757), best score (0hard/0medium/0soft), score calculation speed (12954/sec), phase total (2), environment mode (REPRODUCIBLE), move thread count (4).

I also tried OptaPlanner 8.37.0.Final and a few other versions released after the introduction of Bavet. They all caused performance degradation.

What changes do I need to make so that the application runs faster? I expected the application to run slightly faster after the upgrade.

More replies

Would you mind doing the runs on the same JDK version? We have reason to believe the JDK could be involved and would love to rule that out.

You can use Timefold 0.8.40 with JDK 11 as a comparison. Or the original code on JDK 17.

I will run more tests.

Recommended answers

We believe the original code might be overfitting and the change in JDK version exposed that. If that's true, the change in OptaPlanner/Timefold version isn't relevant. In fact, the upgrade will probably improve your production quality (= paradox).

A benchmark on the same JDK version should (dis)prove that.

Motivation

The testcases use a different random seed according to the log:

Testcase 1 (before+after): Solving started: ... random (... seed 0)

Testcase 2 (before+after): Solving started: ... random (... seed 13)

That is very unusual. It's a sign of overfitting. Overfitting gives better results during testing, but equal or worse results in production.
I suspect optaplanner-benchmarker has been (mis)used to find the "best random seed" for each dataset. That makes it vulnerable to any changes in the random implementation.
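
To illustrate why a hand-picked seed is fragile, here is a minimal sketch in plain Java. It is not OptaPlanner or Timefold code, and the class and method names are made up for illustration. It only shows that a seeded `java.util.Random` repeats itself exactly for one seed and produces an unrelated sequence for another, so a result tuned to one seed says little about any other stream of random numbers.

```java
import java.util.Arrays;
import java.util.Random;

// Hypothetical demo class, not part of OptaPlanner or Timefold.
public class SeedSensitivityDemo {

    // Draws the first `count` ints from a generator created with the given seed.
    static int[] firstDraws(long seed, int count) {
        Random random = new Random(seed);
        int[] draws = new int[count];
        for (int i = 0; i < count; i++) {
            draws[i] = random.nextInt();
        }
        return draws;
    }

    public static void main(String[] args) {
        // Seeds 0 and 13 are the ones that appear in the logs above.
        int[] seed0 = firstDraws(0L, 5);
        int[] seed0Again = firstDraws(0L, 5);
        int[] seed13 = firstDraws(13L, 5);
        System.out.println("seed 0 is repeatable: " + Arrays.equals(seed0, seed0Again));
        System.out.println("seed 0 matches seed 13: " + Arrays.equals(seed0, seed13));
    }
}
```

A solver run is a long chain of such draws, so a seed that happened to work well for one data set offers no guarantee once anything changes the order or source of those draws.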

JDK 17 changed the random implementation.

The time spent more than doubled, but the score calculation speed and LS steps remained the same. That means the number of moves per step more than doubled. It just generated more unlucky moves.
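
The arithmetic behind that statement can be sketched from the log lines of test case 2. This is a rough estimate, not something the solver reports: it assumes the logged "time spent" is cumulative solver time (so the Construction Heuristic time is subtracted), and it uses score calculations per step as a proxy for evaluated moves per step. The class and method names are hypothetical.

```java
// Hypothetical helper, not part of OptaPlanner or Timefold.
public class MovesPerStepEstimate {

    // Approximates score calculations per step for one phase:
    // (phase duration in seconds) * (score calculations per second) / (step total).
    static double calculationsPerStep(long phaseMillis, long calculationsPerSecond, long steps) {
        return (phaseMillis / 1000.0) * calculationsPerSecond / steps;
    }

    public static void main(String[] args) {
        // Test case 2, Local Search phase, values copied from the logs above.
        // Phase duration = time spent at the end of Local Search
        // minus time spent at the end of Construction Heuristic (assumption).
        double before = calculationsPerStep(304674 - 2236, 13550, 83120);
        double after = calculationsPerStep(623756 - 2801, 12726, 82742);
        System.out.printf("before: %.1f per step%n", before);
        System.out.printf("after:  %.1f per step%n", after);
        System.out.printf("ratio:  %.2f%n", after / before);
    }
}
```

Under these assumptions the estimate comes out at roughly 49 calculations per step before and roughly 96 after, which is close to a doubling while speed and step count barely moved.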

From looking at the test logs, it seems to me that you're comparing apples to oranges. Specifically:

Test case 1:

"before" ran for ~ 1 minute 15 seconds.

"after" ran for ~ 3 minutes 40 seconds.

The score calculation speed "after" was significantly higher than "before" (~ +43 %).

Test case 2:

"before" ran for ~ 5 minutes.

"after" ran for ~ 10 minutes.

The score calculation speed "after" was lower by only about 6 %, which is easily explained by natural variance in JVM performance from one run to another.

Considering that the input conditions (specifically solving time or target score) are not the same and that this is not a scientific benchmark, I would be cautious in drawing any conclusions from it.

Here are a few guidelines to improve the reliability of your benchmarks:

Set a shared termination condition. If you're measuring performance, time-based terminations aren't ideal. Maybe a score-based or a step count-based termination.

Run each experiment multiple times, average the results. 10 iterations each should suffice.

Try to eliminate variables. If you say you've reproduced this with OptaPlanner 8, don't benchmark Timefold (yet). Instead, try to reproduce it without a switch to JDK 17. That way, we'll know if it's JDK-related or not.
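
As a small illustration of the "run multiple times, average the results" guideline, the sketch below summarises repeated runs with a mean and a sample standard deviation. It is plain Java with a hypothetical class name, and the timings in `main` are made-up placeholders, not measurements from this thread.

```java
import java.util.Arrays;

// Hypothetical helper for summarising repeated benchmark runs.
public class RunSummary {

    // Arithmetic mean of the measurements.
    static double mean(double[] values) {
        return Arrays.stream(values).average().orElseThrow();
    }

    // Sample standard deviation (divides by n - 1), needs at least 2 values.
    static double sampleStdDev(double[] values) {
        double mean = mean(values);
        double sumSquares = Arrays.stream(values)
                .map(v -> (v - mean) * (v - mean))
                .sum();
        return Math.sqrt(sumSquares / (values.length - 1));
    }

    public static void main(String[] args) {
        // Made-up solve times in seconds for 10 runs of one configuration.
        double[] seconds = {74.1, 81.3, 69.8, 90.2, 77.5, 72.0, 85.6, 79.9, 76.4, 83.2};
        System.out.printf("mean %.1f s, std dev %.1f s%n", mean(seconds), sampleStdDev(seconds));
    }
}
```

Comparing two configurations by their means only makes sense when the gap between them is clearly larger than the spread within each set of runs.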

As a final note, it is not the first time that I'm hearing that something happened some time after OptaPlanner 8.22 which caused it to go significantly slower in some cases. Unfortunately, no one has yet provided code that would show the slowdown. You can be the first.

More replies

Here are the steps I have taken.

Thanks for the analysis. Here are the steps I took. 1. Upgraded the JDK from 11 to 17. That made score calculation roughly 15% to 80% faster. 2. Upgraded from OptaPlanner 8.22.1.Final to Timefold 1.1.0. The score calculation speed was about the same as in step 1, and the number of steps was about the same. As you mentioned, the difference was in the number of moves per step. What should I do to keep down the number of steps?

I meant to ask how to keep down the number of moves per step.

Regarding overfitting, I can think of a few things. AcceptedCountLimit and StepCountingHillClimbingSize. Should I tweak them?

What we see is not directly related to number of steps, or any advanced configuration of the algorithm. We believe we are seeing that, for each solution, your program chooses a very specific random seed that has been benchmarked at some point in the past to work best for that data set. Unfortunately, the RNG changed in the JDK and those hand-crafted random seeds no longer work well. The solution is to either re-benchmark every data set to find the new "best" random seeds, or stop this practice altogether. We think this is not a solver performance regression.

Thanks. I use Step Counting Hill Climbing. The termination conditions include: UnimprovedStepCountLimit, SecondsSpentLimit, UnimprovedSecondsSpentLimit. I will try the other suggestion.
