r - 通过分组变量折叠列(在基础中)-6ren

r - 通过分组变量折叠列(在基础中)

转载作者：行者123 更新时间：2023-12-04 10:49:37

24

4

我有一个文本变量和一个分组变量。我想按因子将文本变量折叠成每行(组合)一个字符串。所以只要组列显示 m 我想将文本组合在一起等等。我在之前和之后提供了一个示例数据集。我正在为一个包写这篇文章，到目前为止，除了 wordcloud 之外，我已经避免了对其他包的所有依赖，并希望保持这种方式。

我怀疑 rle 可能对 cumsum 有用，但还没弄清楚这一点。

提前谢谢你。

数据是什么样的

                                 text group
1       Computer is fun. Not too fun.     m
2               No its not, its dumb.     m
3              How can we be certain?     f
4                    There is no way.     m
5                     I distrust you.     m
6         What are you talking about?     f
7       Shall we move on?  Good then.     f
8 Im hungry.  Lets eat.  You already?     m

我希望数据看起来像什么

                                                       text group
1       Computer is fun. Not too fun. No its not, its dumb.     m
2                                    How can we be certain?     f
3                          There is no way. I distrust you.     m
4 What are you talking about? Shall we move on?  Good then.     f
5                       Im hungry.  Lets eat.  You already?     m

数据

dat <- structure(list(text = c("Computer is fun. Not too fun.", "No its not, its dumb.", 
"How can we be certain?", "There is no way.", "I distrust you.", 
"What are you talking about?", "Shall we move on?  Good then.", 
"Im hungry.  Lets eat.  You already?"), group = structure(c(2L, 
2L, 1L, 2L, 2L, 1L, 1L, 2L), .Label = c("f", "m"), class = "factor")), .Names = c("text", 
"group"), row.names = c(NA, 8L), class = "data.frame")

编辑:我发现我可以为组变量的每次运行添加唯一列:

x <- rle(as.character(dat$group))[[1]]
dat$new <- as.factor(rep(1:length(x), x))

产量:

                                 text group new
1       Computer is fun. Not too fun.     m   1
2               No its not, its dumb.     m   1
3              How can we be certain?     f   2
4                    There is no way.     m   3
5                     I distrust you.     m   3
6         What are you talking about?     f   4
7       Shall we move on?  Good then.     f   4
8 Im hungry.  Lets eat.  You already?     m   5

最佳答案

这利用 rle 创建一个 id 来对句子进行分组。它使用 tapply 和 paste 将输出放在一起

## Your example data
dat <- structure(list(text = c("Computer is fun. Not too fun.", "No its not, its dumb.", 
"How can we be certain?", "There is no way.", "I distrust you.", 
"What are you talking about?", "Shall we move on?  Good then.", 
"Im hungry.  Lets eat.  You already?"), group = structure(c(2L, 
2L, 1L, 2L, 2L, 1L, 1L, 2L), .Label = c("f", "m"), class = "factor")), .Names = c("text", 
"group"), row.names = c(NA, 8L), class = "data.frame")


# Needed for later
k <- rle(as.numeric(dat$group))
# Create a grouping vector
id <- rep(seq_along(k$len), k$len)
# Combine the text in the desired manner
out <- tapply(dat$text, id, paste, collapse = " ")
# Bring it together into a data frame
answer <- data.frame(text = out, group = levels(dat$group)[k$val])

关于r - 通过分组变量折叠列(在基础中)，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/9857787/

24

4

0

文章推荐： r - 基于另一个数据框的两行比较

文章推荐： r - 从数据框中提取列并对其进行排序

文章推荐： r - 如何加快 R data.table 中缺失的搜索过程

文章推荐： r - 在 roxygen2 R 包文档中插入 Markdown 表

javascript if( 变量 = =(变量 2 || 变量 3 || ...))
这个问题在这里已经有了答案: 关闭 10 年前。 Possible Duplicate: How to nest OR statements in JavaScript? 有没有办法做到这一点:
JavaScript 变量 = 变量
在 JavaScript 中有没有办法让一个变量总是等于一个变量？喜欢var1 = var2但是当var2更新，也是var1 . 例子 var var1 = document.getElementBy
python - 如何阅读此 python 代码？变量 1 = 变量 2 == 变量 3
我正在努力理解这代表什么 var1 = var2 == var3 我的猜测是这等同于: if (var2 == var3): var1 = var2 最佳答案赋值 var1 = var2
php - 变量 $_GET 变量
这个问题已经有答案了: What does the PHP error message "Notice: Use of undefined constant" mean? (2 个回答) 已关闭 8
MySQL:变量=变量+select语句
我在临时表中有几条记录，我想从每条记录中获取一个值并将其添加到一个变量中，例如 color | caption -------------------------------- re
linux - 如何将原始字符串转换为变量(变量 --> $变量)？
如何将字符串转为变量(字符串变量--> $variable)？或者用逗号分隔的变量列表然后转换为实际变量。我有 2 个文件: 列名文件行文件我需要根据字符串匹配行文件中的整行，并根据列名文件命
PHP:来自与变量(变量-变量)连接的字符串的新变量
我有一个我无法解决的基本 php 问题，我也想了解为什么! $upperValueCB = 10; $passNodeMatrixSource = 'CB'; $topValue= '$uppe
php 变量 = 变量 1 ||变量2
这可能吗？ php $variable = $variable1 || $variable2? 如果 $variable1 为空则使用 $variable2 是否存在类似的东西？最佳答案 PHP 5
perl - for 循环不会修改 `my` 变量，但会修改 `our` 变量
在 Perl 5.20 中，for 循环似乎能够修改模块作用域的变量，但不能修改父作用域中的词法变量。 #!/usr/bin/env perl use strict; use warnings; ou
JavaScript: 变量 = 变量.concat(另一个变量);
为什么这不起作用: var variable; variable = variable.concat(variable2); $('#lunk').append(variable) 我无法弄清楚这一点
c++ - 指针的大小(*变量 VS 变量)
根据我的理解，在32位机器上，指针的sizeof是32位(4字节)，而在64位机器上，它是8字节。无论它们指向什么数据类型，它们都有固定的大小。我的计算机在 64 位上运行，但是当我打印包含 * 的大
java - 变量+=值和变量=变量+值之间的区别；
例如: int a = 10; a += 1.5; 这运行得很完美，但是 a = a+1.5; 此作业表示类型不匹配:无法从 double 转换为 int。所以我的问题是:+= 运算符和= 运算符
MySQL 语法错误 |变量 = 变量 + 整数
您好，我写了这个 MySQL 存储过程，但我一直收到这个语法错误 #1064 - You have an error in your SQL syntax; check the manual that
swift - 如果(变量 == 变量 + 5)
我试图在我的场景中显示特定的奖牌，这取决于你的高分是基于关卡的目标。 // Get Medal Colour if levelHighscore goalScore { sc
c++ - 变量 = !!变量与变量 =(变量!= 0)
我必须维护相当古老的 Visual C++ 源代码的大型代码库。我发现代码如下: bIsOk = !!m_ptr->isOpen(some Parameters) bIsOk的数据类型是bool，is
php - Javascript 变量，发送到 PHP 变量
我有一个从 MySQL 数据库中提取的动态产品列表。在 list 上有一个立即联系按钮，我正在使用一个 jquery Modal 脚本，它会弹出一个表单。我的问题是尝试将产品信息变量传递给该弹出窗
c++ - 类型(变量)与(类型)变量
这个问题在这里已经有了答案: 关闭 10 年前。 Possible Duplicate: What is the difference between (type)value and type(va
javascript - 变量 === 未定义与 typeof 变量 === "undefined"
jQuery Core Style Guidelines建议两种不同的方法来检查变量是否已定义。全局变量:typeof variable === "undefined" 局部变量:variable
jquery - 动态(变量)变量(如 php 中的？)
这个问题已经有答案了: 已关闭11 年前。 Possible Duplicate: “Variable” Variables in Javascript? 我想肯定有一种方法可以在 JavaScrip
c# - 变量 1 = 变量 2 = 真；优点缺点？
在语句中使用多重赋值有什么优点或缺点吗？在简单的例子中 var1 = var2 = true; 赋值是从右到左的(我相信 C# 中的所有赋值都是如此，而且可能是 Java，尽管我没有检查后者)。但是，

首页

博学

6Ren·AI

商城

r - 通过分组变量折叠列(在基础中)