
api - Spark job submitted - waiting (TaskSchedulerImpl: Initial job not accepted)


The job is submitted by calling an API. Response status - Running

On the cluster UI -

Worker (slave) - worker-20160712083825-172.31.17.189-59433 is Alive

Core 1 out of 2 used

Memory 1Gb out of 6 used

Running application -

app-20160713130056-0020 - Waiting for 5 hours

Cores - unlimited

Job description of the application -

Active Stages

reduceByKey at /root/wordcount.py:23

Pending Stages

takeOrdered at /root/wordcount.py:26

Running Drivers -

stderr log page for driver-20160713130051-0025 

WARN scheduler.TaskSchedulerImpl: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources

According to Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources, the slaves have not been started - hence they have no resources.

But in my case - slave 1 is working.

According to Unable to Execute More than a spark Job "Initial job has not accepted any resources", I am using deploy mode = cluster (not client), since I have 1 master and 1 slave, and the submit API is being called through Postman / from anywhere.
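
For reference, a minimal sketch of such a submission call is below, assuming the API in use is Spark's standalone REST submission endpoint (the "REST URL ... (cluster mode)" on port 6066 that the master UI advertises). The body follows the CreateSubmissionRequest format; the jar path, main class and resource values are placeholders rather than the question's actual request, and the documented examples submit a JAR rather than a .py file, so mapping this onto wordcount.py is not shown.

import json
import requests  # placeholder HTTP client; Postman or curl would send the same body

# Assumption: submitting through the standalone REST endpoint on port 6066.
MASTER_REST = "http://ec2-54-209-108-127.compute-1.amazonaws.com:6066"

payload = {
    "action": "CreateSubmissionRequest",
    "appResource": "file:/path/to/app.jar",   # placeholder; the question submits wordcount.py instead
    "appArgs": [],
    "clientSparkVersion": "1.6.1",
    "environmentVariables": {"SPARK_ENV_LOADED": "1"},
    "mainClass": "com.example.MyJob",          # hypothetical main class for the placeholder jar
    "sparkProperties": {
        "spark.app.name": "MyApp",
        "spark.master": "spark://ec2-54-209-108-127.compute-1.amazonaws.com:7077",
        "spark.submit.deployMode": "cluster",
        "spark.driver.supervise": "false",
        # Illustrative caps so driver + executor fit on one small worker:
        "spark.cores.max": "1",
        "spark.executor.memory": "512m",
        "spark.driver.memory": "512m",
    },
}

r = requests.post(MASTER_REST + "/v1/submissions/create",
                  headers={"Content-Type": "application/json;charset=UTF-8"},
                  data=json.dumps(payload))
print(r.json())  # on success this includes a submissionId

If this is the endpoint in use, the matching status call is GET <REST URL>/v1/submissions/status/<submissionId>, and it reports the state of the driver; that would explain a "Running" response from the API even while the application itself sits in WAITING with no executors.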

The cluster also has cores, RAM and memory available - still the job throws the error reported above, as shown by the UI.

Following TaskSchedulerImpl: Initial job has not accepted any resources; I assigned, in

~/spark-1.5.0/conf/spark-env.sh

the Spark environment variables

SPARK_WORKER_INSTANCES=1
SPARK_WORKER_MEMORY=1000m
SPARK_WORKER_CORES=2

and copied these to the slaves with

sudo /root/spark-ec2/copy-dir /root/spark/conf/spark-env.sh

All the cases from the answers to the questions above apply, yet no solution has been found. So, since I am using an API together with Apache Spark, perhaps some other help is needed.
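
One discrepancy worth checking: spark-env.sh above limits the worker to SPARK_WORKER_MEMORY=1000m, while the cluster UI reports the worker as offering 6 GB, which suggests the worker may not have been restarted after the file was copied. A quick sketch for checking what the worker actually registered with follows; it assumes the master web UI's JSON view on the default port 8080, and the field names used are assumptions about what that view returns, not a documented API.

import requests  # assumption: the master web UI (default port 8080) exposes a /json status view

MASTER_UI = "http://ec2-54-209-108-127.compute-1.amazonaws.com:8080/json/"

state = requests.get(MASTER_UI).json()

# Field names below ("workers", "cores", "coresused", "memory", "memoryused",
# "activeapps") are assumptions about that JSON view.
for w in state.get("workers", []):
    print("%s  %s  cores %s/%s  memory %s/%s MB" % (
        w.get("id"), w.get("state"),
        w.get("coresused"), w.get("cores"),
        w.get("memoryused"), w.get("memory")))

for app in state.get("activeapps", []):
    print("%s  %s  cores granted: %s" % (app.get("id"), app.get("state"), app.get("cores")))

If the worker still reports the old memory and cores, restarting the worker (or the whole standalone cluster) after copying spark-env.sh would make the new limits take effect - which is also in line with the restart workaround in the accepted answer below.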

Edited July 18, 2016

Wordcount.py - My PySpark application code -

from pyspark import SparkContext, SparkConf

logFile = "/user/root/In/a.txt"

conf = (SparkConf().set("num-executors", "1"))

sc = SparkContext(master = "spark://ec2-54-209-108-127.compute-1.amazonaws.com:7077", appName = "MyApp", conf = conf)
print("in here")
lines = sc.textFile(logFile)
print("text read")
c = lines.count()
print("lines counted")

Error

Starting job: count at /root/wordcount.py:11
16/07/18 07:46:39 INFO scheduler.DAGScheduler: Got job 0 (count at /root/wordcount.py:11) with 2 output partitions
16/07/18 07:46:39 INFO scheduler.DAGScheduler: Final stage: ResultStage 0 (count at /root/wordcount.py:11)
16/07/18 07:46:39 INFO scheduler.DAGScheduler: Parents of final stage: List()
16/07/18 07:46:39 INFO scheduler.DAGScheduler: Missing parents: List()
16/07/18 07:46:39 INFO scheduler.DAGScheduler: Submitting ResultStage 0 (PythonRDD[2] at count at /root/wordcount.py:11), which has no missing parents
16/07/18 07:46:39 INFO storage.MemoryStore: Block broadcast_1 stored as values in memory (estimated size 5.6 KB, free 56.2 KB)
16/07/18 07:46:39 INFO storage.MemoryStore: Block broadcast_1_piece0 stored as bytes in memory (estimated size 3.4 KB, free 59.7 KB)
16/07/18 07:46:39 INFO storage.BlockManagerInfo: Added broadcast_1_piece0 in memory on 172.31.17.189:43684 (size: 3.4 KB, free: 511.5 MB)
16/07/18 07:46:39 INFO spark.SparkContext: Created broadcast 1 from broadcast at DAGScheduler.scala:1006
16/07/18 07:46:39 INFO scheduler.DAGScheduler: Submitting 2 missing tasks from ResultStage 0 (PythonRDD[2] at count at /root/wordcount.py:11)
16/07/18 07:46:39 INFO scheduler.TaskSchedulerImpl: Adding task set 0.0 with 2 tasks
16/07/18 07:46:54 WARN scheduler.TaskSchedulerImpl: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources

According to Spark UI showing 0 cores even when setting cores in App,

the Spark WebUI states that zero cores are used, and the job waits indefinitely with no tasks running. The application also uses no memory or cores while running, and enters a WAITING state immediately on startup.

Spark version 1.6.1, Ubuntu, Amazon EC2

Best answer

I have the same problem. Here are my remarks from when this happened.

1:17:46 WARN TaskSchedulerImpl: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources

I noticed that it only happened during the first query from the Scala shell, where I ran an action that fetches data from HDFS.

When the problem occurs, the WebUI shows that no application is running.

URL: spark://spark1:7077
REST URL: spark://spark1:6066 (cluster mode)
Alive Workers: 4
Cores in use: 26 Total, 26 Used
Memory in use: 52.7 GB Total, 4.0 GB Used
Applications: 0 Running, 0 Completed
Drivers: 0 Running, 0 Completed
Status: ALIVE

It seems that something fails to start, but I cannot tell exactly what.

However, restarting the cluster a second time sets the Applications value to 1 and everything works fine.

URL: spark://spark1:7077
REST URL: spark://spark1:6066 (cluster mode)
Alive Workers: 4
Cores in use: 26 Total, 26 Used
Memory in use: 52.7 GB Total, 4.0 GB Used
Applications: 1 Running, 0 Completed
Drivers: 0 Running, 0 Completed
Status: ALIVE

I am still investigating; this quick workaround saves time while waiting for the ultimate solution.

Regarding api - Spark job submitted - waiting (TaskSchedulerImpl: Initial job not accepted), we found a similar question on Stack Overflow: https://stackoverflow.com/questions/38359801/
