gpt4 book ai didi

hadoop - 如何在 pig 中按项目分组的两列

转载 作者:可可西里 更新时间:2023-11-01 16:52:57 26 4
gpt4 key购买 nike

我已经从“n”列中生成了两列(起点和终点)。现在我想为这两列组合生成计数。我无法得到结果。我收到错误消息,错误 1070:无法使用导入解析计数:下面是我的脚本,

mydata = load '/Projects/Flightdata/1987/Rawdata' using PigStorage(',') as (year:int, month:int, dom:int, dow:int, deptime:long, crsdeptime:long, arrtime:long, crsarrtime:long, uniqcarcode:chararray, flightnum:long, tailnum:chararray, actelaptime:long, crselaptime:long, airtime:long, arrdeltime:long, depdeltime:long, origcode:chararray, destcode:chararray, dist:long, taxintime:long, taxiouttime:long, flightcancl:int, canclcode:chararray, diverted:int, carrierdel:long, weatherdel:long, nasdel:long, securitydel:long, lateaircraftdel:long);

Step2 = foreach mydata generate origcode, destcode;
grpby = group Step2 by (origcode, destcode) ;
step3 = foreach grpby generate group.origcode as source, group.destcode as destination, Count(step2);

这里我想为每个起点和终点的组合生成计数。任何指导都会有所帮助。

最佳答案

请参阅Pig documentation about case sensitivity

The names of Pig Latin functions are case sensitive.

关于hadoop - 如何在 pig 中按项目分组的两列,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/31244248/

26 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com