gpt4 book ai didi

hadoop - 在Hadoop中, reducer 在随机播放阶段在哪里复制其输入

转载 作者:行者123 更新时间:2023-12-02 21:27:57 25 4
gpt4 key购买 nike

在Hadoop中,映射器的输出在随机播放阶段复制到reducer。还原器必须从不同的映射器复制其对应的分区。 reducer 在开始实际的减速过程之前将输入存储在哪里?

最佳答案

The map outputs are copied to the reduce task JVM’s memory if they are small enough (the buffer’s size is controlled by mapred.job.shuffle.input.buffer.percent, which specifies the proportion of the heap to use for this purpose); otherwise, they are copied to disk. When the in-memory buffer reaches a threshold size (controlled by mapred.job.shuffle.merge.percent) or reaches a threshold number of map outputs (mapred.inmem.merge.threshold), it is merged and spilled to disk. If a combiner is specified, it will be run during the merge to reduce the amount of data written to disk.



引用-Hadoop权威指南

关于hadoop - 在Hadoop中, reducer 在随机播放阶段在哪里复制其输入,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/35286549/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com