
python - Remove strings from a Pandas DataFrame column

Reposted. Author: 行者123. Updated: 2023-12-01 01:49:10

I have a pandas DataFrame that looks like this.

DF1 =

sid path
1 '["rome","is","in","province","lazio"]'
1 "['rome', 'is', 'in', 'province', 'naples']"
1 ['N']
1 "['rome', 'is', 'in', 'province', 'in', 'campania']"
....

I want to remove all the unnecessary characters from the path column, so the result should look like this:

DF2 =

sid path
1 rome is in province lazio
1 rome is in province naples
1 N
1 rome is in province in campania
....

I tried replacing all the unnecessary characters like this:

 DF1["path"].replace("[","").replace("]","").replace('"',"").replace(","," ").replace("'","")

But it did not work. I suspect this is because of the entry ["N"].

How can I do this? Any help would be appreciated!

Best answer

Use ast.literal_eval and str.join.
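As a rough illustration of what these two calls do to a single cell value (the names `cell`, `words`, and `cleaned` are made up for this sketch): `ast.literal_eval` parses the string form of a list into a real Python list, and `str.join` puts the items back together separated by spaces.

```python
import ast

# One cell value as it appears in the question: a list stored as a string.
cell = "['rome', 'is', 'in', 'province', 'naples']"

# literal_eval only parses Python literals; it does not run arbitrary code.
words = ast.literal_eval(cell)

# Join the list items with single spaces.
cleaned = " ".join(words)
print(cleaned)
```
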

Demo:

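import ast
import pandas as pd
# Sample data: two lists stored as strings plus one real Python list.
df = pd.DataFrame({"path": ['["rome","is","in","province","lazio"]', "['rome', 'is', 'in', 'province', 'naples']", ['N']]})
# Cast every entry to str so literal_eval can parse it, then join the words with spaces.
df['path'] = df['path'].astype(str).apply(ast.literal_eval).apply(lambda x: " ".join(x))
print(df)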

Output:

                         path
0 rome is in province lazio
1 rome is in province naples
2 N
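A likely reason the chained `replace` in the question had no visible effect is that `Series.replace` matches whole cell values by default, not substrings. Below is a minimal sketch of a string-based alternative using `Series.str.replace` with regular expressions. It assumes every entry can be cast to a string and that the words themselves contain no brackets, quotes, or commas; the name `quotes_and_brackets` is illustrative only.

```python
import pandas as pd

df = pd.DataFrame({"path": ['["rome","is","in","province","lazio"]',
                            "['rome', 'is', 'in', 'province', 'naples']",
                            ['N']]})

# Character class matching [, ], double quotes and single quotes.
quotes_and_brackets = r"[\[\]\"']"

df["path"] = (
    df["path"]
    .astype(str)                                     # real lists become "['N']"
    .str.replace(quotes_and_brackets, "", regex=True)  # drop brackets and quotes
    .str.replace(r"\s*,\s*", " ", regex=True)        # each comma becomes one space
    .str.strip()
)
print(df)
```

Unlike `ast.literal_eval`, this variant never parses the value as Python, so it tolerates entries that are not valid list literals, at the cost of also stripping any quotes or brackets that belong to the data.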

Regarding python - Remove strings from a Pandas DataFrame column, a similar question was found on Stack Overflow: https://stackoverflow.com/questions/50912376/
