gpt4 book ai didi

hadoop - Hadoop Pig 中的点击率计算

转载 作者:可可西里 更新时间:2023-11-01 16:47:06 25 4
gpt4 key购买 nike

需要计算点击率(点击次数超过展示次数,在国家/地区级别),我列出了表结构(印象表和点击表),以及我在 Hadoop Pig 中的代码。我的问题是下面的实现是否最有效,还有更有效的解决方案吗?谢谢。

表格印象:

impressionID、时间戳、countryID

点击表格:

impressionID, 时间戳

joined_feed = join impression by impressionID, click by impression ID;
joined_feed = foreach joined_feed generate impression::countryID, click::impressionID is null? 0 : 1 as clicked;
ctr_result = foreach (group joined_feed by country) generate group as countryID, SUM(clicked)/COUNT(joined_feed)

最佳答案

虽然你应该添加类型转换,但你获得点击率的方法非常有效,否则你会得到一堆零和一

ctr_result = foreach (group joined_feed by country) generate group as countryID, (double) SUM(clicked)/(double) COUNT(joined_feed) as ctr

关于hadoop - Hadoop Pig 中的点击率计算,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/36140556/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com