gpt4 book ai didi

apache-spark - Yarn上的Spark如何存储随机播放的文件?

转载 作者:行者123 更新时间:2023-12-04 04:10:41 24 4
gpt4 key购买 nike

我正在使用Yarn在Spark中执行过滤器,并收到以下错误。感谢您的帮助,但是我的主要问题是为什么找不到该文件。

/hdata/10/yarn/nm/usercache/spettinato/appcache/application_1428497227446_131967/spark-local-20150708124954-aa00/05/merged_shuffle_1_343_1

改组后,Spark似乎找不到已存储到HDFS的文件。

为什么Spark访问目录“/hdata/”?
该目录在HDFS中不存在,应该是本地目录还是HDFS目录?
我可以配置随机数据的存储位置吗?

15/07/08 12:57:03 WARN TaskSetManager: Loss was due to java.io.FileNotFoundException
java.io.FileNotFoundException: /hdata/10/yarn/nm/usercache/spettinato/appcache/application_1428497227446_131967/spark-local-20150708124954-aa00/05/merged_shuffle_1_343_1 (No such file or directory)
at java.io.FileOutputStream.open(Native Method)
at java.io.FileOutputStream.<init>(FileOutputStream.java:221)
at org.apache.spark.storage.DiskBlockObjectWriter.open(BlockObjectWriter.scala:116)
at org.apache.spark.storage.DiskBlockObjectWriter.write(BlockObjectWriter.scala:177)
at org.apache.spark.scheduler.ShuffleMapTask$$anonfun$runTask$1.apply(ShuffleMapTask.scala:161)
at org.apache.spark.scheduler.ShuffleMapTask$$anonfun$runTask$1.apply(ShuffleMapTask.scala:158)
at scala.collection.Iterator$class.foreach(Iterator.scala:727)
at scala.collection.AbstractIterator.foreach(Iterator.scala:1157)
at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:158)
at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:99)
at org.apache.spark.scheduler.Task.run(Task.scala:51)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:187)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)

编辑:我想通了一些。 spark.local.dir配置的目录是根据 http://spark.apache.org/docs/latest/configuration.html用于将RDD存储到磁盘的本地目录

最佳答案

最有可能的答案是任务死了。例如来自OutOfMemory或其他异常。

关于apache-spark - Yarn上的Spark如何存储随机播放的文件?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/31303568/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com