
apache-spark - What does "Stage Skipped" mean in the Apache Spark Web UI?

Reposted. Author: 行者123. Updated: 2023-12-04 04:23:27

This is from my Spark UI. What does "skipped" mean?

[Screenshot from the Spark UI; image not available]

Best answer

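Typically it means that the data was fetched from cache, so the given stage did not need to be re-executed. This is consistent with your DAG, which shows that the next stage requires a shuffle (reduceByKey). Whenever a shuffle is involved, Spark automatically caches the generated data: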

Shuffle also generates a large number of intermediate files on disk. As of Spark 1.3, these files are preserved until the corresponding RDDs are no longer used and are garbage collected. This is done so the shuffle files don’t need to be re-created if the lineage is re-computed.
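
To make the mechanism concrete, here is a minimal sketch in plain Python. It is a toy model, not Spark's actual scheduler: the class ToyScheduler, its log, and the shuffle_id argument are all hypothetical names invented for illustration. It only shows the idea from the quote above: once the map side of a shuffle has written its output, a later job over the same lineage can reuse that output, so the map stage is reported as skipped.

```python
from collections import defaultdict
from functools import reduce


class ToyScheduler:
    """Toy model (NOT Spark's real scheduler) of shuffle-output reuse."""

    def __init__(self):
        self.shuffle_outputs = {}  # hypothetical: shuffle id -> grouped map output
        self.log = []              # (stage name, "ran" or "skipped")

    def map_stage(self, shuffle_id, records):
        # If the shuffle output already exists, the stage is skipped.
        if shuffle_id in self.shuffle_outputs:
            self.log.append(("map", "skipped"))
        else:
            grouped = defaultdict(list)
            for key, value in records:
                grouped[key].append(value)
            self.shuffle_outputs[shuffle_id] = dict(grouped)
            self.log.append(("map", "ran"))
        return self.shuffle_outputs[shuffle_id]

    def reduce_by_key(self, shuffle_id, records, func):
        # The reduce side always runs; only the map side can be skipped.
        grouped = self.map_stage(shuffle_id, records)
        self.log.append(("reduce", "ran"))
        return {key: reduce(func, values) for key, values in grouped.items()}


scheduler = ToyScheduler()
pairs = [("a", 1), ("b", 1), ("a", 1)]

# Two "jobs" over the same lineage: the second reuses the shuffle output.
first = scheduler.reduce_by_key(0, pairs, lambda x, y: x + y)
second = scheduler.reduce_by_key(0, pairs, lambda x, y: x + y)

for stage, status in scheduler.log:
    print(stage, status)
```

In real Spark the equivalent situation typically arises when two actions are run on the same shuffled RDD, for example calling collect and then count on the result of reduceByKey: the second job usually shows the map-side stage as skipped in the Web UI.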

Regarding apache-spark - What does "Stage Skipped" mean in the Apache Spark Web UI?, a similar question was found on Stack Overflow: https://stackoverflow.com/questions/43766087/
