python - 不使用元组的多索引列的 set

python - 不使用元组的多索引列的 set_index

转载作者：行者123 更新时间：2023-12-02 03:07:06

25

4

设置:

np.random.seed(0)
iix = pd.MultiIndex.from_product([['bar', 'baz', 'foo'],['one', 'two']])
df = pd.DataFrame(np.random.randn(3, 6), columns=iix)

Out[120]:
        bar                 baz                 foo
        one       two       one       two       one       two
0  1.764052  0.400157  0.978738  2.240893  1.867558 -0.977278
1  0.950088 -0.151357 -0.103219  0.410599  0.144044  1.454274
2  0.761038  0.121675  0.443863  0.333674  1.494079 -0.205158

在 level = 1 和所有列 two 上set_index。我需要使用元组列表，如下所示

df.set_index([('bar', 'two'), ('baz', 'two'), ('foo', 'two')])

Out[121]:
                                       bar       baz       foo
                                       one       one       one
(bar, two) (baz, two) (foo, two)
 0.400157  2.240893   -0.977278   1.764052  0.978738  1.867558
-0.151357  0.410599    1.454274   0.950088 -0.103219  0.144044
 0.121675  0.333674   -0.205158   0.761038  0.443863  1.494079

问题:是否有其他简单的方法可以在不使用上述元组列表的情况下实现此set_index？

注意:我知道我可以使用列表理解和get_level_values来概括元组列表的构造。不过，我对不使用元组列表的方式感兴趣。

最佳答案

让我们尝试一下这种疯狂的做法:

df.set_index(pd.MultiIndex.from_frame(df.loc(axis=1)[:, 'two'])).loc(axis=1)[:,'one']

输出:

                                       bar       baz       foo
                                       one       one       one
(bar, two) (baz, two) (foo, two)                              
 0.400157  2.240893   -0.977278   1.764052  0.978738  1.867558
-0.151357  0.410599    1.454274   0.950088 -0.103219  0.144044
 0.121675  0.333674   -0.205158   0.761038  0.443863  1.494079

使用一个鲜为人知的参数.loc，axis。 See docs

You can also specify the axis argument to .loc to interpret the passed slicers on a single axis.

关于python - 不使用元组的多索引列的 set_index，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/59570054/

25

4

0

文章推荐： php - 在 CodeIgniter 中路由所有 URL 请求？

python - 不使用元组的多索引列的 set_index
设置: np.random.seed(0) iix = pd.MultiIndex.from_product([['bar', 'baz', 'foo'],['one', 'two']]) df =
python - `set_index` 为什么要为列名创建索引标签？
我有一个这样开头的 CSV 文件: Year,Boys,Girls 1996,333490,315995 1997,329577,313518 1998,325903,309998 当我将它读入 pa
dataframe - Julia 数据帧上的 set_index()
我在 Julia 数据帧的 python 中寻找类似 .set_index() 的函数。我搜索并发现 NamedArray 可以给出与 Python 中的 .set_index() 类似的结果，如下
Pandas set_index 创建了不需要的新行——我该怎么做才能做到这一点？
我正在尝试从我的数据框中的列之一设置数据框的索引。这个数据框的旧索引本质上是没有意义的。但是当我使用 set_index(['Name']) 时，我添加了一个新列，这不是我想要的行为。我找不到解决方
python - “DataFrameGroupBy”对象没有属性 'set_index'
我有一个数据帧数据。分组并重置索引后，我无法将日期列设置为索引。 data = data.groupby('Payment Date ') data['Payment Amount '].sum().
dask 数据框 set_index 抛出错误
我有一个从 HDFS 上的 parquet 文件创建的 dask 数据框。使用 api: set_index 创 build 置索引时，它失败并出现以下错误。 File "/ebs/d1/agent/
python - Pandas set_index 多索引查找
我找不到在 Pandas 0.14 中查找多重索引的方法。这是我遇到问题的一些模拟数据。代码: row1 = ['red', 'ferrari', 'mine'] row2 = ['blue', '
python - Dataframe set_index 产生重复的索引值而不是进行分层分组
我有一个看起来像这样的数据框(索引未显示) Time Letter Type Value 0 A x 10 0 B y
python - df.set_index() 没有按我的预期工作
从上面，你可以看到我已经将索引设置为“index”。我的期望是能够使用“索引”列来删除行，并且仅使用“Barangay”列作为功能而不是数据框的索引。如上所示，仍然使用“Barangay”列作为引用
python - 如何在不丢失索引名称的情况下删除 set_index() 之后的多余行？
我想用 df.set_index() 函数更改我的 DataFrame 索引列。虽然这提供了一个功能解决方案，但它创建了一个我想摆脱的“额外”行。 df = pd.DataFrame({'A': ['
python - for 循环中的 pandas set_index
我有很多大约这种类型的 DataFrame: import pandas as pd import numpy as np x1 = pd.DataFrame(np.vstack((np.random
python - 为已被 set_index() 更改的数据帧提供正常索引
我有一个如下所示的数据框: Priority RID_solve Prob RID_prob Remarks 0 1 5001 34
python - Pandas:Set_index 函数不删除列
我有以下数据框: df = pd.DataFrame({ 'Trader': 'Carl Mark Carl Joe Joe Carl Joe Carl'.split(), 'Product': li
python - Pandas set_index 不会删除该列
我在我的数据框上运行以下代码函数: del dfname["Unnamed: 0"] dfname["date"] = pd.to_datetime(dfname["date"]) dfname.se
python - 列标题的 set_index 等效项
在 Pandas 中，如果我有一个如下所示的 DataFrame: 0 1 2 3 4 5 6 0
python - 数据框 set_index 未设置
我有一个数据框，正在尝试将索引设置为“时间戳”列。目前索引只是一个行号。时间戳格式的一个例子是:2015-09-03 16:35:00 我试过设置索引: df.set_index('Timestamp
python - Pandas set_index 不设置索引
假设我创建了一个带有两列的 pandas DataFrame，b(一个 DateTime)和 c(一个整数)。现在我想从第一列 (b) 中的值创建一个 DatetimeIndex: import pa
Python Pandas datetime set_index 给出了意想不到的结果
我有一个看起来像 like this 的数据框: 我想将 'TIME_STAMP_NEW' 列作为索引。当前代码: twoweektable['TIME_STAMP_NEW'] = pd.to_dat
python - Pandas 意外的 set_index 行为
data = [['g1','a',1],['g1','b',2],['g2','b',3],['g2','a',4]] df = pandas.DataFrame(data=data, column
python - pd.DataFrame.set_index 可以维护数据类型吗？
我正在尝试调用 df.set_index，使我设置索引的列的 dtype 是新的 index.dtype。不幸的是，在下面的示例中，set_index 更改了 dtype。 df = pd.DataF

首页

博学

6Ren·AI

商城

python - 不使用元组的多索引列的 set_index