python - Pandas DataFrame 列的 boolean 掩码-6ren

python - Pandas DataFrame 列的 boolean 掩码

转载作者：太空宇宙更新时间：2023-11-04 07:16:48

我的目标是使用 boolean 掩码从 DataFrame 中获取有用的列。

我尝试了这样一段代码:

df = pd.DataFrame({'a': [1,2,3,4,5], 'b': [101, 101, 102, 101, 102], 'c': [23, 12, 54, 65, 21]})
mask = [True, False, True]
df.columns[mask]

结果就是我真正需要的:

Index([u'a', u'c'], dtype='object')

然后我尝试使用相同的代码，但使用另一个掩码:

mask_i = [1, 0, 1]

df = pd.DataFrame({'a': [1,2,3,4,5], 'b': [101, 101, 102, 101, 102], 'c': [23, 12, 54, 65, 21]})
mask_i = [1, 0, 1]
df.columns[mask]

我期望得到相同的结果，但得到了所有索引:

Index([u'b', u'a', u'b'], dtype='object')

然后我检查:

mask_i = [1, 0, 1]
mask = [True, False, True]
print mask == mask_i`

# Result: `True`

有人可以解释一下为什么掩码是相等的，但我得到不同的结果。

最佳答案

这是因为 Pandas 使用将 boolean 切片视为掩码，而将整数切片视为查找。在您的示例中，您可以看到 columns[[1, 0, 1]] 查找第二列，然后是第一列，然后是第二列:["b", "a", "b"].

要将整数索引转换为 boolean 值，您可以使用:

>>> np.array([1, 0, 1]).astype(bool)
array([ True, False,  True], dtype=bool)
>>> map(bool, [1, 0, 1])
[True, False, True]

关于python - Pandas DataFrame 列的 boolean 掩码，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/41333935/

文章推荐： c - 将参数传递到 ./configure

文章推荐： javascript - Bootstrap 警报附加和淡入/淡出

文章推荐： c - PIC16f877a 构建失败

文章推荐： python - 无法读取所需的 excel 文件作为 Python 中的输出

太空宇宙

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - Pandas DataFrame 列的 boolean 掩码