gpt4 book ai didi

r - 计算每个因素按另一个因素分组的数量

转载 作者:行者123 更新时间:2023-12-04 14:20:11 25 4
gpt4 key购买 nike

我知道这个问题的答案很简单,但我广泛搜索了论坛,但一直找不到解决方案。

我有一个名为 Data_source 的列,这是我想对变量进行分组的一个因素。

我有一系列 symptom* 变量,我希望根据 Data_source 进行计数。

出于某种原因,我不知道该怎么做。正常的 group_by 函数似乎无法正常工作。

这是有问题的数据框

 df <- wrapr::build_frame(
"Data_source" , "Sex" , "symptoms_decLOC", "symptoms_nausea_vomitting" |
"1" , "Female", NA_character_ , NA_character_ |
"1" , "Female", NA_character_ , NA_character_ |
"1" , "Female", "No" , NA_character_ |
"1" , "Female", "Yes" , "No" |
"1" , "Female", "Yes" , "No" |
"1" , "Female", "Yes" , "No" |
"1" , "Male" , "Yes" , "No" |
"1" , "Female", "Yes" , "No" |
"2" , "Female", NA_character_ , NA_character_ |
"2" , "Male" , NA_character_ , NA_character_ |
"2" , "Male" , NA_character_ , NA_character_ |
"2" , "Female", "Yes" , "No" |
"2" , "Female", "Yes" , "No" |
"2" , "Male" , NA_character_ , NA_character_ |
"2" , "Male" , NA_character_ , NA_character_ |
"2" , "Male" , NA_character_ , NA_character_ |
"2" , "Female", NA_character_ , NA_character_ |
"2" , "Female", NA_character_ , NA_character_ |
"2" , "Male" , NA_character_ , NA_character_ |
"2" , "Female", NA_character_ , NA_character_ )

请注意,性别和症状变量都是包含 NA 的因素。我尝试了以下内容

df %>% na.omit() %>% group_by(Data_source) %>% count("symptoms_decLOC")

这是行不通的,而且不是最优的,因为我必须为每一列重复它。理想的是使用类似于 lapply(df, count) 的东西,但这并没有给我每个组的描述。

编辑

为了回答下面的问题,我添加了预期的输出。我已经在 excel 中对此进行了编辑,为了清楚起见,对 group_by 进行了颜色编码。

enter image description here

请注意我是如何对每个可能的答案进行分解的。当我使用 dplyr 运行它时,这里是输出。

> df %>% na.omit() %>% group_by(Data_source) %>% count("symptoms_decLOC")
# A tibble: 2 x 3
# Groups: Data_source [2]
Data_source `"symptoms_decLOC"` n
<chr> <chr> <int>
1 1 symptoms_decLOC 5
2 2 symptoms_decLOC 2

最佳答案

大部分情况:还没有想出如何包含零计数组……应该添加 .drop=FALSE takes care of this ,但它对我不起作用(使用 dplyr v. 0.8.0.9001)。

library(dplyr)
library(tidyr)
(df
%>% tidyr::gather(var,val,-Data_source)
%>% count(Data_source,var,val, .drop=FALSE)
%>% na.omit()
)

结果:

  Data_source var                       val        n
<chr> <chr> <chr> <int>
1 1 Sex Female 7
2 1 Sex Male 1
3 1 symptoms_decLOC No 1
4 1 symptoms_decLOC Yes 5
5 1 symptoms_nausea_vomitting No 5
6 2 Sex Female 6
7 2 Sex Male 6
8 2 symptoms_decLOC Yes 2
9 2 symptoms_nausea_vomitting No 2

关于r - 计算每个因素按另一个因素分组的数量,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/55714290/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com