gpt4 book ai didi

hadoop - Apache pig : Calculate number of days between a date and current date

转载 作者:可可西里 更新时间:2023-11-01 16:40:32 34 4
gpt4 key购买 nike

我有一个格式为 (#,title,year,rating,duration) 的电影列表:

1,The Nightmare Before Christmas,1993,3.9,4568
2,The Mummy,1932,3.5,4388
3,Orphans of the Storm,1921,3.2,9062
4,The Object of Beauty,1991,2.8,6150
5,Night Tide,1963,2.8,5126
6,One Magic Christmas,1985,3.8,5333
7,Muriel's Wedding,1994,3.5,6323
8,Mother's Boys,1994,3.4,5733
9,Nosferatu: Original Version,1929,3.5,5651
10,Nick of Time,1995,3.4,5333
...

我在每个元组中都有年份,我需要将其视为每年 1 月 1 日

我需要计算这个日期和当前日期之间的天数

我的方法:

movies = LOAD 'movies_data.csv' USING PigStorage(',') as (id,name,year,rating,duration);
daysbetween_data = foreach movies generate DaysBetween(ToDate(year,'<WHAT FORMAT TO GIVE HERE>'), ToDate(<CURRENT DATE HERE>));

知道怎么做吗?

最佳答案

将年份加载到字符数组字段,使用 CONCAT 将 01-01- 附加到年份字段,以便获得格式“MM-dd-yyyy”,然后使用 ToDate 和 DaysBetween。

movies = LOAD 'movies_data.csv' USING PigStorage(',') as (id:int,name:chararray,year:chararray,rating:double,duration:int);
daysbetween_data = foreach movies generate DaysBetween(ToDate(CONCAT('01-01-',year),'MM-dd-yyyy'),CurrentTime());

关于hadoop - Apache pig : Calculate number of days between a date and current date,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/41647644/

34 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com