Scala 2.10 基准测试 : generic methods from the collections are useless when performance is important?-6ren

Scala 2.10 基准测试 : generic methods from the collections are useless when performance is important?

转载作者：行者123 更新时间：2023-12-01 20:27:30

24

4

我对折叠大量基元的几种方法(“直接”和使用迭代器)进行了基准测试，结果令人失望。 (是的，我已经完成了预热、中间 GC 和许多运行过程，在服务器模式下运行 JVM 并启用了 scalac 优化(并且禁用了调试信息)。

我认为代码太大，无法在这里发布，所以这里是链接:http://pastebin.com/18dWWBM4唯一运行得几乎与普通的命令式循环一样好的方法是这个不那么通用的手写函数:

@inline def array_foldl[@specialized A, @specialized B](init: B)(src: Array[A])(fun: (B, A) => B) = {
  var res = init
  var i = 0
  var len = src.length
  while (i < len) {
    res = fun(res, src(i))
    i += 1
  }
  res
}

其他视觉上不错的方法完全是局外人。此外，使用迭代器抽象在所有情况下都会失败，对称为 SpecializedIterator 的标准迭代器的手写模仿会稍微快一些。所以有什么问题？可以以某种方式改进吗？有没有办法制作“快速”迭代器，或者原理本身有很大问题？
感谢您的关注。

最佳答案

问题是拳击。创建一个对象比将两个数字相加花费的时间要长得多，但是如果您使用通用(非专用)折叠，则每次都必须创建一个对象。只专门化所有内容的问题是，您会使整个库增大 100 倍，因为您需要两个基本参数(包括非基本参数)的每种组合，以及原始的无类型参数版本。 (100x，因为有 8 个基元加上 Unit 加上 AnyRef/非专用 T。)这是站不住脚的，因为没有现成的可用方法作为替代解决方案，这些集合目前尚未专门化。

此外，特化本身相对较新，因此在实现中仍然存在一些缺陷。特别是，您似乎用 SpecializedIterator 击中了一个:foreach 中的函数最终并没有专门化(我将特征/对象事物折叠到一个类中以使得更容易追踪):

public class Main$SpecializedArrayIterator$mcJ$sp extends Main$SpecializedArrayIterator{
public final void foreach$mcJ$sp(scala.Function1);
  Code:
   0:   aload_0
   1:   invokevirtual   #39; //Method Main$SpecializedArrayIterator.hasNext:()Z
   4:   ifeq    24
   7:   aload_1
   8:   aload_0
   9:   invokevirtual   #14; //Method next$mcJ$sp:()J
   12:  invokestatic    #45; //Method scala/runtime/BoxesRunTime.boxToLong:(J)Ljava/lang/Long;
   15:  invokeinterface #51,  2; //InterfaceMethod scala/Function1.apply:(Ljava/lang/Object;)Ljava/lang/Object;
   20:  pop
   21:  goto    0
   24:  return

看到第 12 行的框，后面是对非专用 Function1 的调用吗？哎呀。 (sum 中使用的元组 (A, (A,A) => A) 也搞乱了专门化。)像这样的实现是全速的:

class SpecializedArrayIterator[@specialized A](src: Array[A]) {
  var i = 0
  val l = src.length
  @inline final def hasNext: Boolean = i < l
  @inline final def next(): A = { val res = src(i); i += 1; res }
  @inline final def foldLeft[@specialized B](z: B)(op: (B, A) => B): B = {
    var result = z
    while (hasNext) result = op(result,next)
    result
  }
}

...
measure((new SpecializedArrayIterator[Long](test)).foldLeft(0L)(_ + _))
...

结果如下:

Launched 51298 times in 2000 milliseconds, ratio = 25.649    // New impl
Launched 51614 times in 2000 milliseconds, ratio = 25.807    // While loop

关于Scala 2.10 基准测试 : generic methods from the collections are useless when performance is important?，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/14822700/

24

4

0

文章推荐： java - 在java中通过pdfbox阅读pdf

文章推荐： wget - 比较文件大小，如果不同则通过 wget 下载

文章推荐： java - 什么是NumberFormatException，我该如何解决？

import - ES2015 `import * as` 与普通 `import` 之间的区别
我刚刚通过更改 import * as CodeMirror 修复了一个错误简单明了import CodeMirror . 我复制了this code . (从 TypeScript 移植) impo
python : "de-import"， "re-import"， "reset import"？
我调试(在 PyCharm 中)一个脚本。我在断点处停止，然后转到调试控制台窗口，然后从那里调用导入行，如下所示: import my_util1 from my_utils 然后我调用 my_uti
import - `import` 语句的用法
谁能给我解释一下 import 语句是如何工作的？例如，我在 myapp/app/models 包中有一个类型 User: package models type User struct {
import - Idris2 中的 `import using` 或 `import hiding`
我想导入 Control.App进入一个引用 PrimIO.PrimIO 的模块通过不合格的名称 PrimIO在很多地方。当然，问题在于 Control.App还导出一个名为 PrimIO 的定义.我
python - from ... import OR import ... 对于模块
我应该使用 from foo import bar 或者 import foo.bar as bar 当导入模块还有无需/希望更改名称 (bar)？有什么不同吗？有关系吗？最佳答案假设 bar
import - "import theano"运行需要多长时间？
我正在 Windows 上使用 Theano 进行深度学习实验的第一步，我很惊讶仅仅加载库需要多少时间。这是小测试程序: from time import time t0 = time() impo
import - TypeScript import * 不创建别名
在 TypeScript 中，如何在不创建任何别名的情况下从文件“导入 *”？例如我有一个包含顶级导出函数的文件“utils”，我想导入所有这些函数而不为每个函数重新创建别名。像这样: impor
python - from ... import OR import ... 对于模块
我应该使用 from foo import bar 或 import foo.bar as bar 当导入模块并且不需要/希望更改名称(bar)？有什么不同吗？有关系吗？最佳答案假设bar是fo
python - `from ... import` 与 `import .`
这个问题在这里已经有了答案: Use 'import module' or 'from module import'? (23 个回答) 关闭8年前。我想知道代码片段之间是否有任何区别 from u
python - 'import x' vs "' from x import y' and 'import x.y' "
我试过了 from urllib import request mine = request.Request() 和 import urllib.request mine = urllib.reque
python - 为什么是: Python 'Import x' then Re- 'import x as y' clobbers import of x?
所以，我有一个关于 Python 导入的小谜团。我确信出于某种原因事情应该是这样的，因为 Guido 很少出错。但是，为什么会这样呢？ $ cat myModule.py #!/usr/bin/pyt
import - Rails4 : @import "my_folder" (with index. css.sass 在里面)不再工作，只有 @import "my_folder/index"可以
我们正在将 Rails 3.2 应用程序升级到 Rails 4.0。我们有一个 assets/stylesheets/application/index.css.sass加载一些其他 sass 文件
import - typescript 中的 `from foo import *`
我正在开发一个相当小的 Typescript 代码库，该代码库已经足够大，可以拆分到多个文件中。这是一个二十一点游戏。我目前有一堆代码，看起来像: var player = new Player();
import - Perl6 : implicit and explicit import
是否可以以当模块为 use 时的方式编写模块？ d 没有显式导入所有子例程都被导入，当它是 use d 显式导入只有这些显式导入的子程序可用？ #!/usr/bin/env perl6 use v6;
import - Sass 不观察@import 文件中的变化
这个问题在这里已经有了答案: how to watch changes in whole directory/folder containing many sass files (9 个回答) 5年前
import - xcode4 工作区中的两个项目(#import 失败)
我真的很难让它在 xcode 4 中工作。我有一个项目将在许多应用程序(网络)中重用，因此我创建一个工作区并添加我的两个项目。到目前为止，一切都很好....这就是失败的地方.. #import "J
import.io - import.io 中的经典提取器和新提取器有什么区别？
经典提取器和新提取器之间的主要区别是什么，哪个最好用？最佳答案经典提取器使用原始工作流程，与爬虫和连接器相同。新的提取器更加精简，通常看起来和感觉都更好，并且经典提取器中的许多小错误已在新提取器
import - 动态构建 less @import url
在处理 google webfont import mixin 时，我注意到无法动态构建 @import URL。 .gFontImport (@name, @weights, @subsets) {
python - "from . import views": Unresolved import
我正在关注Django 1.8 tutorial 。在我的项目中mysite ，有一个源文件夹polls 。文件夹中有views.py模块其中 index函数已定义。还有一个urls.py文件: fr
import - 第三方库上的 Rust `unresolved import`
我想使用名为 warp 的第三方库编译一个简单的 Rust 程序: [package] name = "hello-world-warp" version = "0.1.0" [dependencie

首页

博学

6Ren·AI

商城

Scala 2.10 基准测试 : generic methods from the collections are useless when performance is important?