python - Dask: Groupby 和 'First'/'Last' in agg-6ren

python - Dask: Groupby 和 'First'/'Last' in agg

转载作者：太空宇宙更新时间：2023-11-04 04:48:29

26

4

我想按单个列分组，然后对几列使用 agg 和均值，但只需选择 first 或 last对于其余的列。这在 Pandas 中是可能的，但目前在 Dask 中不受支持。这个怎么做？谢谢。

aggs = {'B': 'mean', 'C': 'mean', 'D': 'first', 'E': 'first'}
ddf.groupby(by='A').agg(aggs)

最佳答案

您可以使用 dask.dataframe.DataFrame.drop_duplicates然后加入聚合DataFrame:

df = pd.DataFrame({'F':list('abcdef'),
                   'B':[4,5,4,5,5,4],
                   'C':[7,8,9,4,2,3],
                   'D':[1,3,5,7,1,0],
                   'E':[5,3,6,9,2,4],
                   'A':list('aaabbb')})

print (df)
   A  B  C  D  E  F
0  a  4  7  1  5  a
1  a  5  8  3  3  b
2  a  4  9  5  6  c
3  b  5  4  7  9  d
4  b  5  2  1  2  e
5  b  4  3  0  4  f

from dask import dataframe as dd 
ddf = dd.from_pandas(df, npartitions=3)
#print (ddf)


c = ['B','C']
a = ddf.groupby(by='A')[c].mean()
b = ddf.drop(c, axis=1).drop_duplicates(subset=['A'])
df = b.join(a, on='A').compute()
print (df)
   A  D  E  F         B    C
0  a  1  5  a  4.333333  8.0
3  b  7  9  d  4.666667  3.0

关于python - Dask: Groupby 和 'First'/'Last' in agg，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/48961304/

26

4

0

文章推荐： css - 作为响应式设计的背景图像

文章推荐： CSS 媒体查询样式被覆盖

文章推荐： javascript - 用javascript出现onclick效果？

文章推荐： python - 使用 Homebrew 在 Mac 上使用 Python 3 安装 GDB

mysql - 通过 "Last Week"、 "Last Month"和 "Last Year"搜索记录的选择查询可能是什么？
我有一个系统，我需要在其中显示记录列表，这样我们可以使用三个选项上周上个月去年我的表结构有一个名为 createdate 的字段，它是 BIGINT 类型并保存从 PHP 的 time() 函
javascript - .last() 和 :last 的性能差异
假设，我对某些功能使用哪一个并不重要，是否使用它会在性能方面产生任何差异 $('div:last'); 或 $('div').last(); 谢谢! 最佳答案 last() 在大型 DOM 集上明显比
javascript - .last() 和 :last? 有什么区别
有人可以解释 .last() 和 :last 之间的区别吗？我似乎找不到明确的解释。为什么 $('td.cellsOfSpecificClass:last', '.table tr') 返回每个 t
jquery - 获取可见li :last and not hidden li:last
我想用 li:last 做点什么: var p = $("li:last"); 我需要它的位置:position.left 通过该位置，我可以对齐一些元素。问题是，在某些情况下，最后一个 li 被 e
jquery - :last and :last-of-type in jQuery? 和有什么区别
请看这张图片: 谁能解释一下其中的区别吗？编辑让我指出什么让我困惑。请注意: $row.is('tr.items:last') === false $row[0].id === $('tr.ite
jquery - :last vs :last-child selector
我注意到 $( 'filter:last' ) 与 jQuery 中的 $( 'filter:last-child' ) 不同。我尝试了 jQuery 文档，但很难理解 :last 的额外用途以及它
html - 如何选择 "last-child of a last-child"？
我正在尝试使用 CSS 选择最后一个 col-xs-12 div 中包含的最后一个元素。关键是元素是动态的，所以它可以是 h2 或 h3 等。
css - :last-child doesn't target last div
当我使用 :last-child 定位 div 时，它不起作用。使用 :first-child 没关系。 :last-of-type 也可以。有任何想法吗？谢谢。 HTML Lorem
html - Last-Of Type 和 Last-Of-Child
HTML: ... ... ... ... CSS: .plan-box:last-of-type { ... } 在上面的 CSS 代码中，如果我在
javascript - .filter (':last' ) 与 .last()
我想知道 .filter(':last') 和 .last() 之间是否有任何区别？对我来说，他们似乎也在做同样的事情，但我是 jQuery 的新手。如果结果没有差异，推荐使用哪一种还是只是个人喜好
c++ - std::adjacent_find(last, last) 是否未定义？
std::adjacent_find searches the range [first, last) for two consecutive identical elements. Return v
css - :last-child does not select the last element
这个问题在这里已经有了答案: How can I select the last element with a specific class, not last child inside of pa
excel - 将excel中的名称从(Last，First)重组为(First Last)
目标创建一个辅助列，将单元格的值从 Last, First 转换至First Last 下面的公式工作正常。 A1包含 Last, First下面的公式转换为所需的输出。 A2 = MID(A1,
last.fm - 获取专辑 last.fm api 的发布日期
我需要获取歌曲的发行日期。在 last.fm API 中，如文档中所述，足以向服务器发出 HTTP 请求，它将使用包含字段“”的 XML(或 JSON)进行回复(如示例响应中所示在网站上)。问题是
R .Last.call 功能 - 类似于 .Last.value
类似于 .Last.value有什么办法可以访问上次通话吗？低于预期的潜在结果.Last.call . sum(1, 2) # [1] 3 str(.Last.call) # language su
perl - 为什么 'last' 在 Perl 中被称为 'last' ？
在 Perl 中调用 last 而不是在 C 中调用 break 的历史原因是什么？ Perl 的设计受到 C 的影响(此外还有 awk、sed 和 sh - 请参阅下面的手册页)，因此不采用熟悉的
powershell - 在文本文件中交换 last.first 到 first.last
我正在尝试交换字符串中的两个单词。我目前有一个 txt 文件，其中有一列用户格式为 last.first。我如何将它交换为 first.last？最佳答案 -split 字符串并连接: $Last,
html - ":last-child"和 ":not(:last-child)"之间的不同行为
我有以下 html 代码: Text1 Text2 Text3 使用: nav :last-child { text-transform: upperca
java - Java 中的惯用 "First Last"=> "Last, First"
我主要是一名Python程序员，正在学习一些Java。我需要一个函数将包含“First Last”形式的名称的字符串转换为“Last, First”(我还需要它能够处理单个名称:“Cher”=>“Ch
css - :last-child/last-of-type pseudo difficulties
难以尝试将集合中的特定元素作为最后一个匹配的选择器。 this is today this is today

首页

博学

6Ren·AI

商城

python - Dask: Groupby 和 'First'/'Last' in agg