python - 执行时 Pandas Pivot_table 错误 -Type 错误-6ren

python - 执行时 Pandas Pivot_table 错误 -Type 错误

转载作者：太空宇宙更新时间：2023-11-04 06:11:48

26

4

我是 Pandas 的新手，正在测试和学习。从 Excel 导入的数据框有以下问题:- 数据框包含以下变量:

<class 'pandas.core.frame.DataFrame'>
Int64Index: 48062 entries, 0 to 48061
Data columns (total 11 columns):
Konskund_MEAB         48062  non-null values
Strukturordn          48062  non-null values
Antal_forsandelser    48062  non-null values
ProdID                48062  non-null values
Sort                  48062  non-null values
Storstad              48062  non-null values
Year                  48062  non-null values
snittvikt             48062  non-null values
Totsum                48062  non-null values
Prodsum               48062  non-null values
snittpris             48062  non-null values
dtypes: float64(9), object(2)

运行:
np.average(df['snittpris'],weights=df['Antal_forsandelser'])

产生正确的结果

当我尝试使用以下命令运行数据透视表时:
df_sum=pd.pivot_table(df,rows=['Konskund_MEAB','ProdID'],cols=['年'], aggfunc=np.average(df ['snittpris'],weights=df['Antal_forsandelser']))

我收到以下错误消息。

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-90-9fd03896c806> in <module>()
----> 1 df_sum=pd.pivot_table(df,rows=['Konskund_MEAB','ProdID'],cols=['Year'],
aggfunc=np.average(df['snittpris'],weights=df['Antal_forsandelser']))

C:\Users\Bengtw\AppData\Local\Enthought\Canopy32\User\lib\site-packages\pandas\tools\pivot.pyc
in pivot_table(data, values, rows, cols, aggfunc, fill_value, margins, dropna)
    101 
    102     grouped = data.groupby(keys)
--> 103     agged = grouped.agg(aggfunc)
    104 
    105     table = agged

C:\Users\Bengtw\AppData\Local\Enthought\Canopy32\User\lib\site-packages\pandas\core\groupby.pyc
 in agg(self, func, *args, **kwargs)
342     @Appender(_agg_doc)
343     def agg(self, func, *args, **kwargs):
--> 344         return self.aggregate(func, *args, **kwargs)
345 
346     def _iterate_slices(self):

C:\Users\Bengtw\AppData\Local\Enthought\Canopy32\User\lib\site-packages\pandas\core\groupby.pyc
 in aggregate(self, arg, *args, **kwargs)
   1741 
   1742             if self.grouper.nkeys > 1:
-> 1743                 return self._python_agg_general(arg, *args, **kwargs)
   1744             else:
   1745                 result = self._aggregate_generic(arg, *args, **kwargs)

C:\Users\Bengtw\AppData\Local\Enthought\Canopy32\User\lib\site-packages\pandas\core\groupby.pyc
 in _python_agg_general(self, func, *args, **kwargs)
    480 
    481         if len(output) == 0:
--> 482             return self._python_apply_general(f)
    483 
    484         if self.grouper._filter_empty_groups:

C:\Users\Bengtw\AppData\Local\Enthought\Canopy32\User\lib\site-packages\pandas\core\groupby.pyc
 in _python_apply_general(self, f)
    332 
    333     def _python_apply_general(self, f):
--> 334         keys, values, mutated = self.grouper.apply(f, self.obj, self.axis)
    335 
    336         return self._wrap_applied_output(keys, values,

C:\Users\Bengtw\AppData\Local\Enthought\Canopy32\User\lib\site-packages\pandas\core\groupby.pyc
 in apply(self, f, data, axis, keep_internal)
    628             # group might be modified
    629             group_axes = _get_axes(group)
--> 630             res = f(group)
    631             if not _is_indexed_like(res, group_axes):
    632                 mutated = True

C:\Users\Bengtw\AppData\Local\Enthought\Canopy32\User\lib\site-packages\pandas\core\groupby.pyc
 in <lambda>(x)
    468     def _python_agg_general(self, func, *args, **kwargs):
    469         func = _intercept_function(func)
--> 470         f = lambda x: func(x, *args, **kwargs)
    471 
    472         # iterate through "columns" ex exclusions to populate output dict

TypeError: 'numpy.float64' object is not callable

问题是什么？？行变量 Konskund_MEAB 包含字符串(几百个不同的字符串)，ProdID 是数字并且有 4 个唯一值。年份是什么(4 个离散值)。

最佳答案

参数 aggfunc 应该是一个函数，但你传入的是一个 float 。
因此类型错误:

TypeError: 'numpy.float64' object is not callable

您可以传入一个匿名 (lambda) 函数，这可能就是您想要的:

aggfunc=lambda x: np.average(x['snittpris'], weights=x['Antal_forsandelser'])

不幸的是，这在这种情况下不起作用(因为 aggfunc 无法访问未使用的列)...

相反，您可以使用 groupby :

rows = ['Konskund_MEAB','ProdID']
cols = ['Year']
g = df.groupby(rows + columns)

并将函数应用到每个组，然后是unstack从 Series 到 DataFrame:

s_av = g.apply(lambda x: np.average(x['snittpris'], weights=x['Antal_forsandelser']))
df_av = s_av.unstack(cols)

关于python - 执行时 Pandas Pivot_table 错误 -Type 错误，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/18382632/

26

4

0

文章推荐： python - vim python-mode 中的自动滚动输出

文章推荐： css - react native 返回 ImageBackground 居中

文章推荐： c - 展开链表比链表好在哪里？

文章推荐：仅在c中比较第一个小数点

python - Pandas :pivot 和 pivot_table 之间的区别。为什么只有 pivot_table 工作？
我有以下数据框。 df.head(30) struct_id resNum score_type_name score_value 0 4294967297 1
Python pivot_table - 添加差异列
我是python的新手。我有以下数据框。我能够在 Excel 中旋转。我想添加差异列(在图像中，我手动添加了它)。区别在于B-A值。我能够使用 Python 数据透视表复制差异列和总计。下面是我的
dataframe - 如何使用Dask Pivot_table？
我正在尝试在 Dask 上使用 Pivot_table 和以下数据框: date store_nbr item_nbr unit_sales year month 0
python - Pivot_table 来自列值中的列表
我有一个像这样的数据框: ID Sim Items 1 0.345 [7,7] 2
python - pivot_table 没有要聚合的数字类型
我想根据以下数据框制作一个数据透视表，其中包含列 sales、rep。数据透视表显示 sales 但没有 rep。当我尝试仅使用 rep 时，出现错误 DataError: No numeric ty
pandas pivot_table() 按日期分多列数据的方法
如下所示： date 20170307 20170308 iphone4 2 0
python - pandas pivot_table 的边距仅垂直聚合
考虑一个数据框: df = pd.DataFrame( {'last_year': [1, 2, 3], 'next_year': [4, 5, 6]}, index=['foo',
python - 操作 pivot_table 中的多索引列
我看到这个问题被问过多次，但其他问题的解决方案没有奏效! 我有这样的数据框 df = pd.DataFrame({ "date": ["20180920"] * 3 + ["20180921"] *
python - Pandas pivot_table 的更快替代品
我正在使用 Pandas pivot_table在大型数据集(1000 万行，6 列)上运行。由于执行时间是最重要的，我尝试加快进程。目前处理整个数据集需要大约 8 秒，这很慢，我希望找到提高速度/性
python - Pandas pivot_table 因列和边距而失败
我收到了 KeyError: "... not in index"使用pandas的pivot_table时。这是示例代码: arrays = [['bar', 'bar', 'foo', 'foo
python - 带有pd.grouper和Margins的Pivos Pivot_table
当将列设置为Margins=True时，pd.grouper datetime在 Pandas 数据透视表中将不起作用。这是我的代码，可以按预期工作- p = df.pivot_table(value
python - Pandas pivot_table 保留顺序
>>> df A B C D 0 foo one small 1 1 foo one large 2 2 foo one large 2 3 foo two sm
python - Pandas pivot_table 包含空身份
数据集 x y a 1 3 0 1 1 0 1 2 0 3 6 0 5 3 1 1 5 0 1 7 0 1 6 0 1 4
python - Pandas pivot_table 包含空身份
数据集 x y a 1 3 0 1 1 0 1 2 0 3 6 0 5 3 1 1 5 0 1 7 0 1 6 0 1 4
python - 使用 pivot_table 时应用不同的聚合函数
我有这个样本: import pandas as pd import numpy as np dic = {'name': ['j','c','q','j','c','q','j','c
python - Pandas pivot_table 不符合值顺序
我对 pandas pivot_table 有疑问。有时，“值”列表中指定的列的顺序不匹配 In [11]: p = pivot_table(df, values=["x","y"], cols=[
python - Pandas pivot_table 百分位数
我试图通过平均值、中位数、第 25 个百分位数、第 75 个百分位数、标准差来描述 A 列、B 列。 df = pd.DataFrame({'A':[1,9,3,4,6,8,2,7],
python pandas pivot_table 列一级错误名称
我有下表: ID Metric Level Level(% Change) Level(Diff) Index 0 2016 A 10
python - Pivot_table MultiIndex 到列
我有下表: In [303]: table.head() Out[303]: people weekday weekofyear 2012-01-01 119
python - 将总列的百分比添加到 Pandas pivot_table
我似乎无法弄清楚如何将每个 date_submitted 组的总列百分比添加到下面的 pandas 数据透视表中: In [177]: pass_rate_pivot date_submitted

首页

博学

6Ren·AI

商城

python - 执行时 Pandas Pivot_table 错误 -Type 错误