
R: Removing multiple text strings from a data frame

Reposted. Author: 行者123. Updated: 2023-12-01 09:25:47

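New to R. I want to remove certain words from a data frame. Since there are several words, I would like to define the list of words once, as a character vector, and remove them with gsub. The result should then be converted back to a data frame that keeps the same structure.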

wordstoremove <- c("ai", "computing", "ulitzer", "ibm", "privacy", "cognitive")

a
id text time username
1 "ai and x" 10 "me"
2 "and computing" 5 "you"
3 "nothing" 15 "everyone"
4 "ibm privacy" 0 "know"

I was thinking of something like this:
a2 <- apply(a, 1, gsub(wordstoremove, "", a)

But this clearly does not work, and the result would still need to be converted back to a data frame.

Best answer

wordstoremove <- c("ai", "computing", "ulitzer", "ibm", "privacy", "cognitive")

(dat <- read.table(header = TRUE, text = 'id text time username
1 "ai and x" 10 "me"
2 "and computing" 5 "you"
3 "nothing" 15 "everyone"
4 "ibm privacy" 0 "know"'))

# id text time username
# 1 1 ai and x 10 me
# 2 2 and computing 5 you
# 3 3 nothing 15 everyone
# 4 4 ibm privacy 0 know

# Join the words into one regex ("ai|computing|...") and strip every match from every column
(dat1 <- as.data.frame(sapply(dat, function(x)
  gsub(paste(wordstoremove, collapse = '|'), '', x))))

# id text time username
# 1 1 and x 10 me
# 2 2 and 5 you
# 3 3 nothing 15 everyone
# 4 4 0 know
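
One caveat with the answer above: sapply runs gsub over every column, so id and time come back as character, and a bare pattern like "ai" also matches inside longer words. The following is a minimal sketch of a variant, not part of the original answer, that touches only the text column and wraps the pattern in word boundaries. The names pattern and dat2, the use of perl = TRUE, and the whitespace cleanup step are assumptions added for illustration.

wordstoremove <- c("ai", "computing", "ulitzer", "ibm", "privacy", "cognitive")

# Same data as above, built directly so the snippet is self-contained
dat <- data.frame(
  id = 1:4,
  text = c("ai and x", "and computing", "nothing", "ibm privacy"),
  time = c(10, 5, 15, 0),
  username = c("me", "you", "everyone", "know"),
  stringsAsFactors = FALSE
)

# One alternation pattern; \\b keeps "ai" from matching inside a longer word
pattern <- paste0("\\b(", paste(wordstoremove, collapse = "|"), ")\\b")

# Copy the data frame and rewrite only the text column, so id and time stay numeric
dat2 <- dat
dat2$text <- gsub(pattern, "", dat2$text, perl = TRUE)

# Collapse the whitespace left behind by the removed words
dat2$text <- trimws(gsub("\\s+", " ", dat2$text))

dat2

Because only one column is reassigned, the data frame keeps its original structure and column types. If several text columns need cleaning, the same two gsub calls could be applied to each of them in turn.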

This question about removing multiple text strings from a data frame in R comes from a similar question on Stack Overflow: https://stackoverflow.com/questions/24645390/
