gpt4 book ai didi

python - 如何提取单个字符串并删除其他字符串,类似于DataFrame

转载 作者:行者123 更新时间:2023-12-01 00:35:23 24 4
gpt4 key购买 nike

我将类别名称与技能名称结合起来,按类别名称对其进行排序。现在我的表格列如下

(Category1) Skill 1
(Category1) Skill 2
(Category1) Skill 3
(Category1) Skill 4
(Category1) Skill 5
(Category1) Skill 6
(Category2) Skill 7
(Category2) Skill 8
(Category2) Skill 9
(Category2) Skill 10
(Category2) Skill 11
(Category2) Skill 12

我想为每个第一项技能只保留一个类别标题并删除其他标题,类似于这样的表格

(Category1) Skill 1
Skill 2
Skill 3
Skill 4
Skill 5
Skill 6
(Category2) Skill 7
Skill 8
Skill 9
Skill 10
Skill 11
Skill 12

有什么想法吗?谢谢

最佳答案

您可以拆分字符串并检索最后一部分 Skill x,并检查 Categoryx 的重复位置,并使用结果替换拆分的部分:

import numpy as np

m = df.col1.str.split(r'\) ', expand=True)
df['col1'] = np.where(m.duplicated(subset=0), m[1], df.col1)

col1
0 (Category1) Skill 1
1 Skill 2
2 Skill 3
3 Skill 4
4 Skill 5
5 Skill 6
6 (Category2) Skill 7
7 Skill 8
8 Skill 9
9 Skill 10
10 Skill 11
11 Skill 12
<小时/>

输入数据 -

 col1
0 (Category1) Skill 1
1 (Category1) Skill 2
2 (Category1) Skill 3
3 (Category1) Skill 4
4 (Category1) Skill 5
5 (Category1) Skill 6
6 (Category2) Skill 7
7 (Category2) Skill 8
8 (Category2) Skill 9
9 (Category2) Skill 10
10 (Category2) Skill 11
11 (Category2) Skill 12

关于python - 如何提取单个字符串并删除其他字符串,类似于DataFrame,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/57820325/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com