python - Pandas 数据帧 : how to get column mean valuebut taking into account only the rows that have lower index than the one I want to get the mean-6ren

python - Pandas 数据帧 : how to get column mean valuebut taking into account only the rows that have lower index than the one I want to get the mean

转载作者：行者123 更新时间：2023-12-01 09:06:07

我遇到的问题是我想预测一支球队对另一支球队的胜利，为此我希望在比赛日期之前获得每场比赛的胜率。

但是，使用 df.groupBy("teamName").agg({"isVictory":"mean"}) 为我提供了团队的全局信息，但该信息不可用，因为您不应该知道此时所有比赛的胜率。

所以我想要的是获取本场比赛之前比赛的胜率，知道我的 DataFrame 中有一个列 index 来保持比赛的顺序(即，如果索引匹配的索引低于当前匹配的索引，这意味着之前已经进行过匹配，因此应将本次匹配视为平均值)

请注意，我的专栏是:

indexMatch, nameTeam, isVictoryTeam

(isVictoryTeam= 如果团队 1 获胜，则为 0 如果团队失败)

数据集示例:

   IndexMatch  isVictoryTeam team   winrate
0           1              1    a       NaN
1           2              0    a         1
2           3              1    a       0.5
3           4              1    a    0.6667

胜率是预期的输出。
预先感谢您的帮助。

最佳答案

一定有更好的方法，但这个有效:

df = pd.DataFrame({'team': [' a', ' a', ' a', ' a', 'b', 'b', 'c'],
                   'IndexMatch': [1, 2, 3, 4, 5, 6, 7],
                   'isVictoryTeam': [1, 0, 1, 1, 0, 1, 1]})
df['winrate'] = df.groupby('team')['isVictoryTeam'].expanding().mean().reset_index().groupby('team')['isVictoryTeam'].shift().reset_index(drop=True)
df
#   IndexMatch  isVictoryTeam team   winrate
#0           1              1    a       NaN
#1           2              0    a  1.000000
#2           3              1    a  0.500000
#3           4              1    a  0.666667
#4           5              0    b       NaN
#5           6              1    b  0.000000
#6           7              1    c       NaN

关于python - Pandas 数据帧 : how to get column mean valuebut taking into account only the rows that have lower index than the one I want to get the mean，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/52037095/

文章推荐： gmail - 如何查找其他语言的 Gmail 邮箱

文章推荐： java - 新创建的文件无法在 Java 中打开

文章推荐： sql - 采样 SQL 时间序列

python - Pandas 数据帧 : how to get column mean valuebut taking into account only the rows that have lower index than the one I want to get the mean
我遇到的问题是我想预测一支球队对另一支球队的胜利，为此我希望在比赛日期之前获得每场比赛的胜率。但是，使用 df.groupBy("teamName").agg({"isVictory":"mean"

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - Pandas 数据帧 : how to get column mean valuebut taking into account only the rows that have lower index than the one I want to get the mean