gpt4 book ai didi

python - 如何从基于三列的DataFrame中提取数据?

转载 作者:行者123 更新时间:2023-12-01 08:00:52 24 4
gpt4 key购买 nike

我试图从基于三列的数据框中洗牌并提取重复的列。

我尝试将三列转换为字典并存储它们的索引并比较高度。第 4 行不是唯一删除的。

df['C']=df[["Color1","Color2","Color3"]].stack().apply(tuple)
df = df.duplicated(subset=["Color1","Color2","Color3"], keep=False)


Height Color1 Color2 Color3
0 Short NaN Blue Red
1 High Red Blue NaN
2 Medium Blue Red NaN
3 Short NaN NaN Blue
4 Short NaN Red Blue
5 High NaN NaN NaN

代码的输出应该是:

     Height    Color1    Color2    Color3
0 Short NaN Blue Red
1 High Red Blue NaN
2 Medium Blue Red NaN

最佳答案

您可以使用drop_duplicates :

df.drop_duplicates(subset="Height")

关于python - 如何从基于三列的DataFrame中提取数据?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/55738144/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com