r - Counting past and future occurrences of a specific event, by group

Reposted by: 行者123 | Updated: 2023-12-05 00:53:12


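This question is a modification of a question I posted here. I have events of a specific type occurring on different days, but this time they are assigned to multiple users, for example: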

df = data.frame(user_id = c(rep(1:2, each=5)),
            cancelled_order = c(rep(c(0,1,1,0,0), 2)),
            order_date = as.Date(c('2015-01-28', '2015-01-31', '2015-02-08', '2015-02-23',  '2015-03-23',
                                   '2015-01-25', '2015-01-28', '2015-02-06', '2015-02-21',  '2015-03-26')))


user_id cancelled_order order_date
      1               0 2015-01-28
      1               1 2015-01-31
      1               1 2015-02-08
      1               0 2015-02-23
      1               0 2015-03-23
      2               0 2015-01-25
      2               1 2015-01-28
      2               1 2015-02-06
      2               0 2015-02-21
      2               0 2015-03-26

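I want to calculate:

1) the number of cancelled orders each customer will have in the next x days (e.g. 7, 14), excluding the current one, and

2) the number of cancelled orders each customer had in the past x days (e.g. 7, 14), excluding the current one.

The desired output looks like this: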

solution
user_id cancelled_order order_date plus14 minus14
      1               0 2015-01-28      2       0
      1               1 2015-01-31      1       0
      1               1 2015-02-08      0       1
      1               0 2015-02-23      0       0
      1               0 2015-03-23      0       0
      2               0 2015-01-25      2       0
      2               1 2015-01-28      1       0
      2               1 2015-02-06      0       1
      2               0 2015-02-21      0       0
      2               0 2015-03-26      0       0
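
The two definitions above can be pinned down with a brute-force base R sketch. It is only a reference for checking results, not an efficient solution (it compares every row with every other row), and `count_window()` is a hypothetical helper introduced here for illustration. It assumes the window is inclusive of day x and excludes the current day.

```r
# Sample data from the question.
df <- data.frame(
  user_id = rep(1:2, each = 5),
  cancelled_order = rep(c(0, 1, 1, 0, 0), 2),
  order_date = as.Date(c("2015-01-28", "2015-01-31", "2015-02-08", "2015-02-23", "2015-03-23",
                         "2015-01-25", "2015-01-28", "2015-02-06", "2015-02-21", "2015-03-26"))
)

# Hypothetical helper: for each row, sum the cancelled orders of the same user
# whose date lies between lo and hi days away from that row's date (inclusive).
count_window <- function(data, lo, hi) {
  vapply(seq_len(nrow(data)), function(k) {
    # Signed distance in days from row k to every row.
    offset <- as.numeric(data$order_date - data$order_date[k])
    in_window <- data$user_id == data$user_id[k] & offset >= lo & offset <= hi
    sum(data$cancelled_order[in_window])
  }, numeric(1))
}

df$plus14  <- count_window(df, 1, 14)    # next 14 days, current day excluded
df$minus14 <- count_window(df, -14, -1)  # past 14 days, current day excluded
print(df)
```

Changing the bounds to `1, 7` and `-7, -1` gives the 7-day variant.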

The solution that @joel.wilson proposed using data.table works very well for that purpose:

library(data.table)
vec <- c(14, 30) # Specify desired ranges
setDT(df)[, paste0("x", vec) := 
        lapply(vec, function(i) sum(df$cancelled_order[between(df$order_date, 
                                                 order_date, 
                                                 order_date + i, # this part can be changed to reflect the past date ranges
                                                 incbounds = FALSE)])),
        by = order_date]

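However, it does not take the grouping by user_id into account. When I tried to modify the formula by adding this grouping as by = c("user_id", "order_date") or by = list(user_id, order_date), it did not work. This seems like something very basic. Any hints on how to handle this detail?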
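
One likely reason changing `by` alone has no effect is that the expression reads `df$cancelled_order` and `df$order_date`, which always refer to the whole table rather than the current group. A minimal sketch of an adaptation, assuming that diagnosis is right, filters on the user explicitly. The table name `orders` is chosen here for illustration, and plain comparisons replace `between()` so that day x itself is included in the window.

```r
library(data.table)

# Sample data from the question, built directly as a data.table.
orders <- data.table(
  user_id = rep(1:2, each = 5),
  cancelled_order = rep(c(0, 1, 1, 0, 0), 2),
  order_date = as.Date(c("2015-01-28", "2015-01-31", "2015-02-08", "2015-02-23", "2015-03-23",
                         "2015-01-25", "2015-01-28", "2015-02-06", "2015-02-21", "2015-03-26"))
)

vec <- c(14, 30)  # window lengths in days
orders[, (paste0("x", vec)) := lapply(vec, function(i) {
  # user_id and order_date are the current group's values;
  # orders$... is the full column, so the user filter must be explicit.
  same_user <- orders$user_id == user_id
  in_window <- orders$order_date > order_date & orders$order_date <= order_date + i
  sum(orders$cancelled_order[same_user & in_window])
}), by = .(user_id, order_date)]
print(orders)
```

This still scans the full table once per group, so it is a correctness fix rather than an efficient approach.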

Also, please keep in mind that I am looking for an efficient solution, even if it is not based on the code above or on data.table at all!

Thanks!

Best answer

Here is one way:

library(data.table)
orderDT = with(df, data.table(id = user_id, completed = !cancelled_order, d = order_date))

vec = list(minus = 14L, plus = 14L)
orderDT[, c("dplus", "dminus") := .(
    orderDT[!(completed)][orderDT[, .(id, d_plus = d + vec$plus, d_tom = d + 1L)], on=.(id, d <= d_plus, d >= d_tom), .N, by=.EACHI]$N
    ,
    orderDT[!(completed)][orderDT[, .(id, d_minus = d - vec$minus, d_yest = d - 1L)], on=.(id, d >= d_minus, d <= d_yest), .N, by=.EACHI]$N
)]


    id completed          d dplus dminus
 1:  1      TRUE 2015-01-28     2      0
 2:  1     FALSE 2015-01-31     1      0
 3:  1     FALSE 2015-02-08     0      1
 4:  1      TRUE 2015-02-23     0      0
 5:  1      TRUE 2015-03-23     0      0
 6:  2      TRUE 2015-01-25     2      0
 7:  2     FALSE 2015-01-28     1      0
 8:  2     FALSE 2015-02-06     0      1
 9:  2      TRUE 2015-02-21     0      0
10:  2      TRUE 2015-03-26     0      0

(I found the OP's column names cumbersome, so I shortened them.)
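
Since the question mentions several window lengths (e.g. 7 and 14 days), the same join can be repeated in a loop. The following is a sketch under assumptions, not part of the original answer: the names `orders`, `cancelled_only`, `lo`, `hi` and the `plus<x>` / `minus<x>` column naming are chosen here for illustration.

```r
library(data.table)

orders <- data.table(
  id = rep(1:2, each = 5),
  cancelled_order = rep(c(0, 1, 1, 0, 0), 2),
  d = as.Date(c("2015-01-28", "2015-01-31", "2015-02-08", "2015-02-23", "2015-03-23",
                "2015-01-25", "2015-01-28", "2015-02-06", "2015-02-21", "2015-03-26"))
)

# Only cancelled orders are counted, so they form the left side of each join.
cancelled_only <- orders[cancelled_order == 1]

for (x in c(7L, 14L)) {
  # Cancelled orders in [d + 1, d + x] for each row of orders.
  future_n <- cancelled_only[orders[, .(id, lo = d + 1L, hi = d + x)],
                             on = .(id, d >= lo, d <= hi), .N, by = .EACHI]$N
  # Cancelled orders in [d - x, d - 1] for each row of orders.
  past_n <- cancelled_only[orders[, .(id, lo = d - x, hi = d - 1L)],
                           on = .(id, d >= lo, d <= hi), .N, by = .EACHI]$N
  orders[, (paste0("plus", x)) := future_n]
  orders[, (paste0("minus", x)) := past_n]
}
print(orders)
```

The counts come back in the row order of the inner table, which is why they can be assigned straight back onto `orders`.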

How it works

Each column can be run on its own, e.g.

orderDT[!(completed)][orderDT[, .(id, d_plus = d + vec$plus, d_tom = d + 1L)], on=.(id, d <= d_plus, d >= d_tom), .N, by=.EACHI]$N

This can be broken down into steps by simplifying:

orderDT[!(completed)][
  orderDT[, .(id, d_plus = d + vec$plus, d_tom = d + 1L)], 
  on=.(id, d <= d_plus, d >= d_tom), 
  .N, 
  by=.EACHI]$N
# original version

orderDT[!(completed)][
  orderDT[, .(id, d_plus = d + vec$plus, d_tom = d + 1L)], 
  on=.(id, d <= d_plus, d >= d_tom), 
  .N, 
  by=.EACHI] 
# don't extract the N column of counts

orderDT[!(completed)][
  orderDT[, .(id, d_plus = d + vec$plus, d_tom = d + 1L)], 
  on=.(id, d <= d_plus, d >= d_tom)]
# don't create the N column of counts

orderDT[!(completed)]
# don't do the join

orderDT[, .(id, d_plus = d + vec$plus, d_tom = d + 1L)]
# see the second table used in the join

This uses a "non-equi" join, which uses inequalities to define the date range. For more details, see the documentation page found by typing ?data.table.
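
To see the non-equi join in isolation, here is a toy sketch with made-up tables. `events` and `windows` and their contents are illustrative, not taken from the answer. Each row of `windows` is matched against the events of the same `id` whose date falls inside `[lo, hi]`, and `by = .EACHI` returns one count per window row, including 0 for a window with no match.

```r
library(data.table)

# Illustrative data: three events and three date windows.
events <- data.table(
  id = c(1L, 1L, 2L),
  d  = as.Date(c("2015-01-31", "2015-02-08", "2015-02-06"))
)
windows <- data.table(
  id = c(1L, 2L, 2L),
  lo = as.Date(c("2015-01-29", "2015-01-26", "2015-03-01")),
  hi = as.Date(c("2015-02-11", "2015-02-08", "2015-03-10"))
)

# Equality on id, inequalities on the date: lo <= d <= hi.
counts <- events[windows, on = .(id, d >= lo, d <= hi), .N, by = .EACHI]
print(counts$N)
```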

The original question is on Stack Overflow: https://stackoverflow.com/questions/41615967/
