python Pandas : case statement in agg function-6ren

python Pandas : case statement in agg function

转载作者：太空宇宙更新时间：2023-11-04 05:27:33

27

4

我有这样的sql语句:

select id
        , avg(case when rate=1 then rate end) as "P_Rate"
        , stddev(case when rate=1 then rate end) as "std P_Rate",
        , avg(case when f_rate = 1 then f_rate else 0 end) as "A_Rate"
        , stddev(case when f_rate = 1 then f_rate else 0 end) as "std A_Rate"
from (
 select id, connected_date,payment_type,acc_type,
  max(case when is s_rate > 1 then 1 else 0 end) / count(open) as rate
  sum(case when is hire_days <= 5 and paid>1000 then 1 else 0 end )/count(open) as f_rate
from analysis_table where alloc_date <= '2016-01-01' group by 1,2
) a group by id

我尝试使用 Pandas 重写:首先，我将为“内部”表创建数据框:

filtered_data = data.where(data['alloc_date'] <= analysis_date)

然后我将这些数据分组

grouped = filtered_data.groupby(['id','connected_date'])

但是我必须使用什么来过滤每一列并在其上使用 max/sum。

我试过这样的:

`def my_agg_function(hire_days,paid,open):
     r_arr = []
     if hire_days <= 5 and paid > 1000:
        r_arr.append(1)
     else:
        r.append(0)
     return np.max(r_arr)/len(????)
inner_table['f_rate'] = grouped.agg(lambda row: my_agg_function(row['hire_days'],row['paid'],row['open'])`

还有类似的速率

最佳答案

你应该在你的问题中加入一点 DataFrame 以使其更容易回答。

根据您的需要，您可能希望使用 groupby 数据帧的 agg 方法。假设您有以下数据框:

    connected_date  id      number_of_clicks    time_spent
0   Mon             matt    15                  124
1   Tue             john    13                  986
2   Mon             matt    48                  451
3   Thu             jack    68                  234
4   Sun             john    52                  976
5   Sat             sabrina 13                  156

并且您想获得用户每天花费的时间和单次 session 中的最大点击次数的总和。然后以这种方式使用 groupby:

df.groupby(['id','connected_date'],as_index = False).agg({'number_of_clicks':max,'time_spent':sum})

输出:

    id      connected_date  time_spent  number_of_clicks
0   jack    Thu             234         68
1   john    Sun             976         52
2   john    Tue             986         13
3   matt    Mon             575         48
4   sabrina Sat             156         13

请注意，为了输出清晰，我只传递了 as_index=False。

关于 python Pandas : case statement in agg function，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/38247763/

27

4

0

文章推荐： c - 这个 C 函数有什么问题？

文章推荐： css - 为什么我的网站在 Internet Explorer 8 中没有 CSS 样式？

if-statement - "If with a short statement"有什么好处
在 go lang 中使用“If with a short statement”有什么好处。引用:go tour if v := math.Pow(x, n); v < lim { retur
mysql - 警告 : Unsafe statement written to the binary log using statement format since BINLOG_FORMAT = STATEMENT
完全错误: Warning: Unsafe statement written to the binary log using statement format since BINLOG_FORMAT
mysql - 警告 : Unsafe statement written to the binary log using statement format since BINLOG_FORMAT = STATEMENT
完全错误: Warning: Unsafe statement written to the binary log using statement format since BINLOG_FORMAT
sql-server - 错误: "INSERT EXEC statement cannot be nested." and "Cannot use the ROLLBACK statement within an INSERT-EXEC statement." How to solve this?
我有三个存储过程 Sp1、Sp2 和 Sp3。第一个 (Sp1) 将执行第二个 (Sp2) 并将返回的数据保存到 @tempTB1 中，第二个将执行第三个 (Sp3) 并将数据保存到 @tempTB
mainframe - JCL : IF Statement with SET Statement
我已将 FLAG 设置为 1，并且正在执行 ARG 值应该仅为 DEV。但是我得到的是 ARG= DEV + CLIENTID 000023 // FLAG=1 000026 // I
mainframe - JCL : IF Statement with SET Statement
我已将 FLAG 设置为 1，并且正在执行 ARG 值应该仅为 DEV。但是我得到的是 ARG= DEV + CLIENTID 000023 // FLAG=1 000026 // I
C程序: if statement within switch statement
已关闭。这个问题是 not reproducible or was caused by typos 。目前不接受答案。这个问题是由拼写错误或无法再重现的问题引起的。虽然类似的问题可能是 on-top
java - 效率 : switch statements over if statements
PMD告诉我 A switch with less than 3 branches is inefficient, use a if statement instead. 这是为什么呢？为什么是3？他
if-statement - 新 Racket : why is the if statement affecting the return?
我刚开始学习 Racket，所以我仍在努力弄清楚这门语言的复杂性。我正在尝试在列表中实现我自己的搜索功能。如果函数找到它，则返回索引，否则返回 -1。 (define (find-index list
Kotlin - 相同条件 : multiple if statements or one if statement
在 Kotlin 中，您可以使用类似于三元运算符的 if 语句。我们可以选择做这样的事情: val x = if (isOdd) 1 else 2 但是如果我们有多个变量需要根据某些条件进行设置，那
java - 安卓 : Nested if statements jump between statements
在我的 Android 应用程序中，我尝试使用 XMLPullParser 使用以下代码读取 xml 文件: while (eventType != XmlPullParser.END_DOCUMEN
java - if (!statement) 和 if (statement != true) 之间的区别？
这个问题已经有答案了: Boolean expressions in Java (8 个回答) 已关闭 7 年前。我遇到过一些情况，其中第一个似乎改变了 boolean 值，而第二个却没有!两者之间
mysql - Eloquent : "WHERE (equals statement) AND (LIKE statement)"
我基本上想做的是: select * from request where id = 1 and created_at like (today's date); 但使用 Eloquent 。我试过:
javascript - 语法错误: missing ; before statement on normal statement
我不确定为什么会收到此代码。基本上我希望能够动态定位我的发射器，但是当我添加一个选项来检查位置并根据需要进行纠正时，我不断收到此错误。添加的代码是 if (
java - 什么时候使用 Statement 而不是 Prepared Statement？
何时使用语句而不是准备语句。我想语句用于没有参数的查询，但为什么不使用准备好的语句呢？对于没有参数的查询，哪个更快。最佳答案 I suppose statement is used in queri
MySQL : Select statement inside Check statement
我必须创建一个表，如下所示借款人(客户编号，贷款编号) 如果客户没有超过 3 笔贷款，则可以贷款。我创建的表如下 create table borrower( customerno int(5),
java - case-statement 或 if-statement 效率视角
这个问题在这里已经有了答案: 关闭 12 年前。 Possible Duplicates: Is "else if" faster than "switch() case"? What is the
typescript : require statement not part of an import statement
typescript 版本 2.2.2 我在我的 UserRoutzr.ts 中写了这个要求 const users = require(path.join(process.cwd() + "/da
java - EclipseLink JPQL : Generic COUNT statement for any statement
我有一个用 JPQL 编写的应用程序，它可以命中非常不同的查询(在不同的资源上)。对于很多此类查询，我需要知道结果总数(计数)，因为我没有应用任何 LIMIT/OFFSET 由于此查询的性质非常不同
java - "constructor statement"和 "declaration statement"是描述这些语句的正确名称吗？
我对以下 Java 语句感到困惑: ArtClass artClass0 = new ArtClass(); int int3 = 73; boolean boolean0 = artClass0.f

首页

博学

6Ren·AI

商城

python Pandas : case statement in agg function