
python - Spark/PySpark: An error occurred while trying to connect to the Java server (127.0.0.1:39543)


Good afternoon,

Over the past two days I have been running into a lot of connection problems with the Java server. It is a bit unusual, because the error does not always occur, only sometimes...

I am using PySpark together with a Jupyter Notebook. Everything runs on a VM instance in Google Cloud. I am using this machine type in Google Cloud:

custom (8 vCPUs, 200 GB) 

These are the other settings:

import pyspark

conf = pyspark.SparkConf().setAppName("App")
conf = (conf.setMaster('local[*]')
        .set('spark.executor.memory', '180G')
        .set('spark.driver.memory', '180G')
        .set('spark.driver.maxResultSize', '180G'))

sc = pyspark.SparkContext(conf=conf)
sq = pyspark.sql.SQLContext(sc)

I trained a random forest model and made predictions:

model = rf.fit(train)
predictions = model.transform(test)
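
(The question does not show how rf, train and test were created; presumably something along these lines, using pyspark.ml. The "features" and "label" column names and the numTrees value are assumptions:)

from pyspark.ml.classification import RandomForestClassifier

# Assumed setup -- not shown in the question. `data` stands for a DataFrame
# with a "features" vector column and a numeric "label" column.
rf = RandomForestClassifier(featuresCol="features", labelCol="label", numTrees=100)
train, test = data.randomSplit([0.8, 0.2], seed=42)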

Then I created the ROC curve and calculated the AUC value.
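
(For reference, a minimal sketch of that step with pyspark.ml; the question does not include this code, so the evaluator settings are assumptions:)

from pyspark.ml.evaluation import BinaryClassificationEvaluator

# Assumed AUC computation -- not shown in the question.
evaluator = BinaryClassificationEvaluator(labelCol="label",
                                          rawPredictionCol="rawPrediction",
                                          metricName="areaUnderROC")
auc = evaluator.evaluate(predictions)
print("AUC = {:.4f}".format(auc))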

Then I wanted to look at the confusion matrix:

confusion_mat = metrics.confusionMatrix().toArray()
print(confusion_mat)
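
(metrics is not defined in the snippet above; presumably it is a pyspark.mllib MulticlassMetrics object built from the predictions, roughly like this sketch, with the column names being assumptions:)

from pyspark.mllib.evaluation import MulticlassMetrics

# Assumed construction of `metrics` -- not shown in the question.
# MulticlassMetrics expects an RDD of (prediction, label) pairs of floats.
prediction_and_labels = predictions.select("prediction", "label").rdd \
    .map(lambda row: (float(row[0]), float(row[1])))
metrics = MulticlassMetrics(prediction_and_labels)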

And this is where the error occurs:

Traceback (most recent call last):
  File "/usr/lib/python2.7/SocketServer.py", line 290, in _handle_request_noblock
    self.process_request(request, client_address)
  File "/usr/lib/python2.7/SocketServer.py", line 318, in process_request
    self.finish_request(request, client_address)
  File "/usr/lib/python2.7/SocketServer.py", line 331, in finish_request
    self.RequestHandlerClass(request, client_address, self)
  File "/usr/lib/python2.7/SocketServer.py", line 652, in __init__
    self.handle()
  File "/usr/local/lib/python2.7/dist-packages/pyspark/accumulators.py", line 235, in handle
    num_updates = read_int(self.rfile)
  File "/usr/local/lib/python2.7/dist-packages/pyspark/serializers.py", line 577, in read_int
    raise EOFError
EOFError
ERROR:root:Exception while sending command.
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/py4j/java_gateway.py", line 883, in send_command
    response = connection.send_command(command)
  File "/usr/local/lib/python2.7/dist-packages/py4j/java_gateway.py", line 1040, in send_command
    "Error while receiving", e, proto.ERROR_ON_RECEIVE)
Py4JNetworkError: Error while receiving
ERROR:py4j.java_gateway:An error occurred while trying to connect to the Java server (127.0.0.1:39543)
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/py4j/java_gateway.py", line 963, in start
    self.socket.connect((self.address, self.port))
  File "/usr/lib/python2.7/socket.py", line 228, in meth
    return getattr(self._sock,name)(*args)
error: [Errno 111] Connection refused

This is the console output:

OpenJDK 64-Bit Server VM warning: INFO: os::commit_memory(0x00007f4998300000, 603979776, 0) failed; error='Cannot allocate memory' (errno=12)
#
# There is insufficient memory for the Java Runtime Environment to continue.
# Native memory allocation (mmap) failed to map 603979776 bytes for committing reserved memory.

The log file:

#
# There is insufficient memory for the Java Runtime Environment to continue.
# Native memory allocation (mmap) failed to map 603979776 bytes for committing reserved memory.
# Possible reasons:
# The system is out of physical RAM or swap space
# In 32 bit mode, the process size limit was hit
# Possible solutions:
# Reduce memory load on the system
# Increase physical memory or swap space
# Check if swap backing store is full
# Use 64 bit Java on a 64 bit OS
# Decrease Java heap size (-Xmx/-Xms)
# Decrease number of Java threads
# Decrease Java thread stack sizes (-Xss)
# Set larger code cache with -XX:ReservedCodeCacheSize=
# This output file may be truncated or incomplete.
#
# Out of Memory Error (os_linux.cpp:2643), pid=2377, tid=0x00007f1c94fac700
#
# JRE version: OpenJDK Runtime Environment (8.0_151-b12) (build 1.8.0_151-8u151-b12-0ubuntu0.16.04.2-b12)
# Java VM: OpenJDK 64-Bit Server VM (25.151-b12 mixed mode linux-amd64 )
# Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
#

--------------- S Y S T E M ---------------

OS:DISTRIB_ID=Ubuntu
DISTRIB_RELEASE=16.04
DISTRIB_CODENAME=xenial
DISTRIB_DESCRIPTION="Ubuntu 16.04.3 LTS"

uname:Linux 4.13.0-1008-gcp #11-Ubuntu SMP Thu Jan 25 11:08:44 UTC 2018 x86_64
libc:glibc 2.23 NPTL 2.23
rlimit: STACK 8192k, CORE 0k, NPROC 805983, NOFILE 1048576, AS infinity
load average:7.69 4.51 3.57

/proc/meminfo:
MemTotal: 206348252 kB
MemFree: 1298460 kB
MemAvailable: 250308 kB
Buffers: 6812 kB
Cached: 438232 kB
SwapCached: 0 kB
Active: 203906416 kB
Inactive: 339540 kB
Active(anon): 203804300 kB
Inactive(anon): 8392 kB
Active(file): 102116 kB
Inactive(file): 331148 kB
Unevictable: 3652 kB
Mlocked: 3652 kB
SwapTotal: 0 kB
SwapFree: 0 kB
Dirty: 4688 kB
Writeback: 0 kB
AnonPages: 203805168 kB
Mapped: 23076 kB
Shmem: 8776 kB
Slab: 114476 kB
SReclaimable: 50640 kB
SUnreclaim: 63836 kB
KernelStack: 4752 kB
PageTables: 404292 kB
NFS_Unstable: 0 kB
Bounce: 0 kB
WritebackTmp: 0 kB
CommitLimit: 103174124 kB
Committed_AS: 205956256 kB
VmallocTotal: 34359738367 kB
VmallocUsed: 0 kB
VmallocChunk: 0 kB
HardwareCorrupted: 0 kB
AnonHugePages: 0 kB
ShmemHugePages: 0 kB
ShmemPmdMapped: 0 kB
CmaTotal: 0 kB
CmaFree: 0 kB
HugePages_Total: 0
HugePages_Free: 0
HugePages_Rsvd: 0
HugePages_Surp: 0
Hugepagesize: 2048 kB
DirectMap4k: 71628 kB
DirectMap2M: 4122624 kB
DirectMap1G: 207618048 kB


CPU:total 8 (initial active 8) (4 cores per cpu, 2 threads per core) family 6 model 85 stepping 3, cmov, cx8, fxsr, mmx, sse, sse2, sse3, ssse3, sse4.1, sse4.2, popcnt, avx, avx2, aes, clmul, erms, rtm, 3dnowpref, lzcnt, ht, tsc, tscinvbit, bmi1, bmi2, adx

Does anyone have an idea what the problem might be and how I can solve it? I'm desperate. :(

// I think the Java Runtime Environment does not have enough memory to continue... but what should I do?

Thank you very much!

Best answer

If you are

using this one in Google Cloud:

custom (8 vCPUs, 200 GB)

then you are significantly oversubscribing memory. Ignoring spark.executor.memory, which has no effect in local mode, spark.driver.memory accounts only for the JVM heap and does not include:

  • PySpark worker memory.
  • PySpark driver memory.

Even within the JVM, only a fraction of the memory is available for data processing (see the Memory Management Overview), so setting spark.driver.maxResultSize equal to the total allocated memory makes no sense.
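
As an illustration only (the exact numbers below are not from the original answer), a far more conservative local-mode configuration on a 200 GB machine would leave most of the RAM to the Python workers and the operating system, for example:

import pyspark

# Illustrative local-mode settings for a 200 GB machine -- the exact numbers
# are assumptions, not part of the original answer. In local[*] mode only the
# driver JVM exists, so spark.executor.memory is omitted entirely.
conf = (pyspark.SparkConf()
        .setAppName("App")
        .setMaster("local[*]")
        .set("spark.driver.memory", "64g")          # JVM heap only
        .set("spark.driver.maxResultSize", "16g"))  # well below the heap

sc = pyspark.SparkContext(conf=conf)

Note that spark.driver.memory only takes effect if it is set before the driver JVM is launched, i.e. before the first SparkContext is created.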

Regarding python - Spark/PySpark: An error occurred while trying to connect to the Java server (127.0.0.1:39543), we found a similar question on Stack Overflow: https://stackoverflow.com/questions/48523629/
