java - 在REPL中的Scala中具有java.util.concurrent.

java - 在REPL中的Scala中具有java.util.concurrent._的死锁

转载作者：行者123 更新时间：2023-12-04 21:30:09

我在学习Paul Chiusano和Runar Bjanarson的著作“Scala中的函数编程”(第7章-纯函数并行性)时遇到了以下情况。

    package fpinscala.parallelism

    import java.util.concurrent._
    import language.implicitConversions


    object Par {
      type Par[A] = ExecutorService => Future[A]

      def run[A](s: ExecutorService)(a: Par[A]): Future[A] = a(s)

      def unit[A](a: A): Par[A] = (es: ExecutorService) => UnitFuture(a) // `unit` is represented as a function that returns a `UnitFuture`, which is a simple implementation of `Future` that just wraps a constant value. It doesn't use the `ExecutorService` at all. It's always done and can't be cancelled. Its `get` method simply returns the value that we gave it.

      private case class UnitFuture[A](get: A) extends Future[A] {
        def isDone = true
        def get(timeout: Long, units: TimeUnit) = get
        def isCancelled = false
        def cancel(evenIfRunning: Boolean): Boolean = false
      }

      def map2[A,B,C](a: Par[A], b: Par[B])(f: (A,B) => C): Par[C] = // `map2` doesn't evaluate the call to `f` in a separate logical thread, in accord with our design choice of having `fork` be the sole function in the API for controlling parallelism. We can always do `fork(map2(a,b)(f))` if we want the evaluation of `f` to occur in a separate thread.
        (es: ExecutorService) => {
          val af = a(es)
          val bf = b(es)
          UnitFuture(f(af.get, bf.get)) // This implementation of `map2` does _not_ respect timeouts. It simply passes the `ExecutorService` on to both `Par` values, waits for the results of the Futures `af` and `bf`, applies `f` to them, and wraps them in a `UnitFuture`. In order to respect timeouts, we'd need a new `Future` implementation that records the amount of time spent evaluating `af`, then subtracts that time from the available time allocated for evaluating `bf`.
        }

      def fork[A](a: => Par[A]): Par[A] = // This is the simplest and most natural implementation of `fork`, but there are some problems with it--for one, the outer `Callable` will block waiting for the "inner" task to complete. Since this blocking occupies a thread in our thread pool, or whatever resource backs the `ExecutorService`, this implies that we're losing out on some potential parallelism. Essentially, we're using two threads when one should suffice. This is a symptom of a more serious problem with the implementation, and we will discuss this later in the chapter.
        es => es.submit(new Callable[A] {
          def call = a(es).get
        })

      def lazyUnit[A](a: => A): Par[A] = fork(unit(a))

 def equal[A](e: ExecutorService)(p: Par[A], p2: Par[A]): Boolean =
    p(e).get == p2(e).get

}

您可以在Github here上找到原始代码。有关java.util.concurrent文档，请参见 here。

我关注 fork的实现。特别地，当ThreadPool太小时，据称 fork可能导致死锁。

我考虑以下示例:

val a = Par.lazyUnit(42 + 1)
val es: ExecutorService = Executors.newFixedThreadPool(2)
println(Par.fork(a)(es).get)

我不希望这个示例最终陷入僵局，因为有两个线程。但是，当我在Scala REPL中运行它时，它将在我的计算机上运行。为什么会这样呢？

初始化 ExecutorService时的输出为
es:java.util.concurrent.ExecutorService =

java.util.concurrent.ThreadPoolE
xecutor@73a86d72[Running, pool size = 0, active threads = 0, queued tasks =
 0, completed tasks = 0]

pool size = 0在这里正确吗？换句话说，这是不了解 java.util.concurrent._的问题还是不了解Scala部分的问题？

最佳答案

好吧，经过长时间的调查，我相信我会回答。完整的故事很长，但是我将尝试通过简化和避免许多细节来缩短它。

注意:可以将Scala编译为各种不同的目标平台，但是这个特定问题发生在以Java/JVM为目标的情况下，因此这就是此答案的内容。

您看到的死锁与线程池的大小无关。实际上是挂起的外部fork调用。它与REPL实现细节和多线程结合在一起，但是需要学习一些知识才能理解它是如何发生的:

Scala REPL如何工作

Scala如何将object编译为Java/JVM

Scala如何在Java/JVM上模拟名称参数

Java/JVM如何运行类

的静态初始化程序

一个简短的版本(另请参见摘要结尾)是该代码卡在REPL之下，因为在REPL执行该代码时，它在逻辑上类似于以下代码:

object DeadLock {

  import scala.concurrent._
  import scala.concurrent.duration.Duration
  import scala.concurrent.ExecutionContext.Implicits.global

  val foo: Int = Await.result(Future(calc()), Duration.Inf)

  def printFoo(): Unit = {
    println(s"Foo = $foo")
  }

  private def calc(): Int = {
    println("Before calc")
    42
  }
}


def test(): Unit = {
  println("Before printFoo")
  DeadLock.printFoo()
  println("After printFoo")
}

或在Java世界中非常相似:

class Deadlock {
    static CompletableFuture<Integer> cf;
    static int foo;

    public static void printFoo() {
        System.out.println("Print foo " + foo);
    }

    static {
        cf = new CompletableFuture<Integer>();
        new Thread(new Runnable() {
            @Override
            public void run() {
                calcF();
            }
        }).start();
        try {
            foo = cf.get();
            System.out.println("Future result = " + cf.get());
        } catch (InterruptedException e) {
            e.printStackTrace();f
        } catch (ExecutionException e) {
            e.printStackTrace();
        }
    }


    private static void calcF() {
        cf.complete(42);
    }
}

public static void main(String[] args) {
    System.out.println("Before foo");
    Deadlock.printFoo();
    System.out.println("After foo");
}

如果您清楚此代码为何会陷入僵局，那么您已经了解了大部分内容，并且可以自己推断出其余内容。您可能只需要看一下最后的摘要部分。

Java静态初始化程序如何死锁？

让我们从这个故事的结尾开始:为什么Java代码挂起？发生这种情况是因为Java/JVM对静态初始化程序有两个保证(有关更多详细信息，请参见JLS的 12.4.2. Detailed Initialization Procedure部分):

静态初始化程序将在对

类的任何其他“外部”使用之前运行

静态初始化程序将只运行一次，并通过全局锁定

完成

静态初始化程序使用的锁是隐式的，由JVM管理，但在那里。这意味着代码在逻辑上类似于以下内容:

class Deadlock {

    static boolean staticInitFinished = false;
    // unique value for each thread!
    static ThreadLocal<Boolean> currentThreadRunsStaticInit = ThreadLocal.withInitial(() -> Boolean.FALSE);


    static CompletableFuture<Integer> cf;
    static int foo;

    static void enforceStaticInit() {
        synchronized (Deadlock.class) {
            // is init finished?
            if (staticInitFinished)
                return;
            // are we the thread already running the init?
            if(currentThreadRunsStaticInit.get())
                return;
            currentThreadRunsStaticInit.set(true);

            cf = new CompletableFuture<Integer>();
            new Thread(new Runnable() {
                @Override
                public void run() {
                    calcF();
                }
            }).start();
            try {
                foo = cf.get();
                System.out.println("Future result = " + cf.get());
            } catch (InterruptedException e) {
                e.printStackTrace();
            } catch (ExecutionException e) {
                e.printStackTrace();
            }
            currentThreadRunsStaticInit.set(false);
            staticInitFinished = true;
        }
    }

    private static void calcF() {
        enforceStaticInit();
        cf.complete(42);
    }

    public static void printFoo() {
        enforceStaticInit();
        System.out.println("Print foo " + foo);
    }
}

现在很清楚为什么此代码会死锁:我们的静态初始化程序启动一个新线程并阻止等待其结果。但是，该新线程尝试访问相同的类( calcF方法)，并且作为另一个线程，它必须等待已经运行的静态初始化程序完成。请注意，如果 calcF方法在另一个类中，那么一切都会正常进行。

Scala REPL的工作方式

现在让我们回到有关Scala REPL如何工作的故事的开始。这个答案是对实际交易的极大简化，但是它捕获了有关此情况的详细信息的重要信息。幸运的是，对于REPL实现者来说，Scala编译器是用Scala编写的。这意味着REPL不必以某种方式解释代码，它可以通过标准编译器运行，然后通过Java Reflection API运行已编译的代码。这仍然需要对代码进行一些修饰，以使编译器满意并返回结果。

当您键入类似的内容时，可以稍微简化一下(或者很多)

val a = Par.lazyUnit(42 + 1)

到REPL中，对代码进行分析并将其转换为类似以下内容的代码:

package line3

object read {
    val a = Par.lazyUnit(42 + 1)
    val res3 = a
}

object eval {
    def print() = {
        println("a: Par.Par[Int] = " + read.res3)
    }
}

然后通过反射调用 line3.eval.print()。

类似的故事发生在:

val es: ExecutorService = Executors.newFixedThreadPool(2)

最后当你这样做

Par.fork(a)(es).get

事情变得更加有趣了，因为您对前面的行有依赖性，可以使用 import巧妙地实现它:

package line5

object read {
    import line2.read.Par
    import line3.read.a
    import line4.read.es

    val res5 = Par.fork(a)(es).get
}

object eval {
    def print() = {
        println("res5: Int = " + read.res5)
    }
}

在这里的重要之处在于，您写入REPL的所有内容都被包装到全新的object中，然后作为常规代码进行编译和运行。

Scala如何在Java/JVM上模拟名称参数
fork方法的定义使用by-name parameter:
def fork[A](a: => Par[A]): Par[A] =

在这里，它用于懒惰地评估a，这对于fork的整个逻辑至关重要。 Java/JVM不对延迟评估提供标准支持，但是可以对其进行仿真，这就是Scala编译器的作用。在内部将签名更改为使用Function0:

def fork[A](aWrapper: () => Par[A]): Par[A] =

每次对a的访问都将替换为对aWrapper.apply()的调用。魔术的另一部分发生在带有by-name参数的方法的调用方:该参数也应该包装在Function0中，这样代码就变成了类似

object read { import line2.read.Par import line3.read.a import line4.read.es val res5 = Par.fork(() => a)(es).get }

但这实际上有点不同。天真的，只为这个小功能要花一个类，而对于这样一个简单的逻辑来说，这是浪费的。在Scala 2.12中的实践中，使用了Java 8 LambdaMetafactory的魔力，因此代码的确变成了类似

object read { import line2.read.Par import line3.read.a import line4.read.es def aWrapper():Int = a val res5 = Par.fork(aWrapper _)(es).get }

其中aWrapper _表示将方法转换为Funciton0完成的LambdaMetafactory。您可能会在Java静态初始化程序死锁一章中对此有所怀疑，def aWrapper的引入是的关键区别。您已经可以看到该代码与挂起的答案中的第一个Scala代码段非常相似。

Scala如何在Java/JVM上编译object

最后一个难题是如何在Java/JVM中编译Scala object。好吧，实际上它被编译为类似于“静态类”的东西，但是由于您可以将object用作对象参数，因此它必须稍微复杂一些。实际上，所有初始化逻辑都移至object类的构造函数，并且有一个简单的静态初始化程序对其进行调用。因此，我们在Java中最后一个read对象将(忽略import)如下所示:
class read$ { static read$ MODULE$ static { new read$() } private Par[Int] res5; private read$() { MODULE$ = this; res5 = Par.fork(read$::aWrapper)(es).get } private static int aWrapper(){ return line3.read$.MODULE$.a; } }

这里read$::aWrapper再次表示使用Function0从aWrapper方法构建LambdaMetafactory。换句话说，Scala object 的 初始化被转换为作为Java静态初始化程序 的一部分运行的代码。

摘要

总结一下如何弄糟:
REPL将您的代码转换为每行的新object并对其进行编译
object初始化逻辑被翻译成Java静态初始化逻辑
在简单情况下，将带有名字参数的方法的
调用转换为包装“返回值”逻辑的方法，并将该方法添加到相同的class或object
中
作为Par.fork初始化的一部分(即Java静态初始化程序的一部分)执行的
object尝试在不同的线程上评估by-name参数(即，调用同一类的方法)，并阻止等待结果的执行。该线程
Java静态初始化程序在逻辑上在全局锁下执行，因此它阻止了不同的线程调用该方法。但是它本身被阻止等待该方法调用完成。

关于java - 在REPL中的Scala中具有java.util.concurrent._的死锁，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/54390881/

文章推荐： SQL Server : Update, 设置了另一个选择中的值

文章推荐： CSS 嵌套给出了意想不到的结果

javascript - 我需要将文本放在一个中，它位于一个 Div 中，该 Div 位于另一个 Div 中，该 Div 位于另一个 Div 中
我需要将文本放在中在一个 Div 中，在另一个 Div 中，在另一个 Div 中。所以这是它的样子: #document Change PIN
html - 两个背景图像。一个在 HTML 中，一个在 BODY 中。在 Firefox 中，主体图像未呈现
奇怪的事情发生了。我有一个基本的 html 代码。 html，头部， body 。(因为我收到了一些反对票，这里是完整的代码) 这是我的CSS: html { backgroun
ios - 将图像从 asset.xcassets 加载到 imageArray 中，并将其动态加载到 UIImageView 中，该 UIImageView 存在于 UICollectionView 中 - swift
我正在尝试将 Assets 中的一组图像加载到 UICollectionview 中存在的 ImageView 中，但每当我运行应用程序时它都会显示错误。而且也没有显示图像。我在ViewDidLoa
linux - 在 BASH 中，我需要根据 perl 脚本的输出更改一些环境变量。在 tcsh 中，我可以使用别名 eval 组合。不能在 bash 中
我需要根据带参数的 perl 脚本的输出更改一些环境变量。在 tcsh 中，我可以使用别名命令来评估 perl 脚本的输出。 tcsh: alias setsdk 'eval `/localhome/
asp.net - Windows 身份验证适用于 IIS，但不适用于 Kestrel/Microsoft.AspNetCore.Authentication.Negotiate(不在 Chrome 中，有时在 Edge 中，始终在 IE 中)？
我使用 Windows 身份验证创建了一个新的 Blazor(服务器端)应用程序，并使用 IIS Express 运行它。它将显示一条消息“Hello Domain\User!”来自右上方的以下 Ra
java - java 中 Kotlin 中的等价物是什么？
这是我的方法 void login(Event event);我想知道 Kotlin 中应该如何最佳答案在 Kotlin 中通配符运算符是 * 。它指示编译器它是未知的，但一旦知道，就不会有其他类
express - 在 Jade 中，为什么有时我可以按原样使用变量而有时必须将它们包含在#{......} 中？
看下面的代码 for story in book if story.title.length < 140 - var story
c - C 中 strstr() 中 for 循环的错误使用
我正在尝试用 C 语言学习字符串处理。我写了一个程序，它存储了一些音乐轨道，并帮助用户检查他/她想到的歌曲是否存在于存储的轨道中。这是通过要求用户输入一串字符来完成的。然后程序使用 strstr()
c - * 在 sscanf 中，* 在 [] 中
我正在学习 sscanf 并遇到如下格式字符串: sscanf("%[^:]:%[^*=]%*[*=]%n",a,b,&c); 我理解 %[^:] 部分意味着扫描直到遇到 ':' 并将其分配给 a。:
python - 在 Python (2.7.3) 中，如果 str(x) 中的任何字符在 str(y) 中(或 str(y) 在 str(x) 中)，我如何编写一个函数来回答？
def char_check(x,y): if (str(x) in y or x.find(y) > -1) or (str(y) in x or y.find(x) > -1):
ansible - 在 Ansible 中，如何将一行移动到一个 block 中？
我有一种情况，我想将文本文件中的现有行包含到一个新 block 中。 line 1 line 2 line in block line 3 line 4 应该变成 line 1 line 2 line
Django 调试工具栏显示在根 URL 中，但不显示在应用程序 URL 中
我有一个新项目，我正在尝试设置 Django 调试工具栏。首先，我尝试了快速设置，它只涉及将 'debug_toolbar' 添加到我的已安装应用程序列表中。有了这个，当我转到我的根 URL 时，调试
r - 在 R 中，Matlab 中 @ 函数句柄的等价物是什么？
在 Matlab 中，如果我有一个函数 f，例如签名是 f(a,b,c)，我可以创建一个只有一个变量 b 的函数，它将使用固定的 a=a1 和 c=c1 调用 f: g = @(b) f(a1, b,
swiftui - SwiftUI 中 ScrollView 中 VStack 元素中的神秘间距或填充
我不明白为什么 ForEach 中的元素之间有多余的垂直间距在 VStack 里面在 ScrollView 里面使用 GeometryReader 时渲染自定义水平分隔线。 Scrol
cookies - 什么应该存储在 session 中，什么应该存储在 cookie 中？
我想知道，是否有关于何时使用 session 和 cookie 的指南或最佳实践？什么应该和什么不应该存储在其中？谢谢! 最佳答案这些文档很好地了解了 session cookie 的安全问题以及
python - Python 中 matplotlib 中 3d 直方图的奇怪行为
我在 scipy/numpy 中有一个 Nx3 矩阵，我想用它制作一个 3 维条形图，其中 X 轴和 Y 轴由矩阵的第一列和第二列的值、高度确定每个条形的是矩阵中的第三列，条形的数量由 N 确定。
c - c 中 sem_init(...) 中 value 参数的不同用法
假设我用两种不同的方式初始化信号量 sem_init(&randomsem,0,1) sem_init(&randomsem,0,0) 现在， sem_wait(&randomsem) 在这两种情况下
c - 实际值存储在 pstr 中，但是该值如何存储在数组 "WORD"中
我怀疑该值如何存储在“WORD”中，因为 PStr 包含实际输出。？既然Pstr中存储的是小写到大写的字母，那么在printf中如何将其给出为“WORD”。有人可以吗？解释一下？ #include
javascript - 数组索引选择像在 numpy 中，但在 javascript 中
我有一个 3x3 数组: var my_array = [[0,1,2], [3,4,5], [6,7,8]]; 并想获得它的第一个 2
javascript - 在 Javascript 中，如何检测浏览器窗口何时在 View 中？
我意识到您可以使用如下方式轻松检查焦点: var hasFocus = true; $(window).blur(function(){ hasFocus = false; }); $(win

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

java - 在REPL中的Scala中具有java.util.concurrent._的死锁