r - dplyr::mutate 给出 x/y = NA，summary 给出 x/y = 实数-6ren

r - dplyr::mutate 给出 x/y = NA，summary 给出 x/y = 实数

转载作者：行者123 更新时间：2023-12-04 20:37:18

26

4

我正在验证一个函数来计算我实验室中某个标准的通过率。这背后的数学原理非常简单:给定一些通过或失败的测试，通过的百分比是多少。

数据将作为一列值提供，即 P1 (第一次测试通过)，F1 (第一次测试失败)，P2或 F2 (分别在第二次测试中通过或失败)。我写了函数passRate下面有助于计算整体(第一次和第二次尝试)以及第一次测试和第二次测试的通过率。

为验证设置参数的质量专家给了我一个通过和失败计数的列表，我正在使用 test_vector 将其转换为向量。下面的功能。

一切看起来都很棒，直到我到达 Pass 的第三排数据框，其中包含来自我的质量专家的通过/失败计数。它没有返回 100% 的第二次测试通过率，而是返回 NA...但仅当我使用 mutate 时

library(dplyr)

Pass <- structure(list(P1 = c(2L, 0L, 10L), 
                       F1 = c(0L, 2L, 0L), 
                       P2 = c(0L, 3L, 2L), 
                       F2 = c(0L, 2L, 0L), 
                       id = 1:3), 
                  .Names = c("P1", "F1", "P2", "F2", "id"), 
                  class = c("tbl_df", "data.frame"), 
                  row.names = c(NA, -3L))

所以这类似于我对 mutate 所做的事情.

Pass %>%
  group_by(id) %>%
  mutate(pass_rate = (P1 + P2) / (P1 + P2 + F1 + F2) * 100,
         pass_rate1 = P1 / (P1 + F1) * 100,
         pass_rate2 = P2 / (P2 + F2) * 100)

Source: local data frame [3 x 8]
Groups: id [3]

     P1    F1    P2    F2    id pass_rate pass_rate1 pass_rate2
  (int) (int) (int) (int) (int)     (dbl)      (dbl)      (dbl)
1     2     0     0     0     1 100.00000        100         NA
2     0     2     3     2     2  42.85714          0         60
3    10     0     3     1     3 100.00000        100         NA

我用时比较 summarise

Pass %>%
  group_by(id) %>%
  summarise(pass_rate = (P1 + P2) / (P1 + P2 + F1 + F2) * 100,
            pass_rate1 = P1 / (P1 + F1) * 100,
            pass_rate2 = P2 / (P2 + F2) * 100)

Source: local data frame [3 x 4]

     id pass_rate pass_rate1 pass_rate2
  (int)     (dbl)      (dbl)      (dbl)
1     1 100.00000        100         NA
2     2  42.85714          0         60
3     3 100.00000        100        100

我原以为这些会返回相同的结果。我的猜测是 mutate某处有问题，因为它假设 n每组行应该映射到 n结果中的行(是否在计算 n 时感到困惑？)，而 summarise知道无论它从多少行开始，它都会以 1 行结束。

有没有人对这种行为背后的机制有任何想法？

最佳答案

在我看来，dplyr 之间有点干扰和 plyr .我在另一个不平衡的数据集上遇到了同样的问题(所以分组是必要的)，正好在 中。第三个 组变异变量错误地为 NA!然后我在家里复制了你的例子。首先，之后

library("dplyr", lib.loc="~/R/x86_64-pc-linux-gnu-library/3.2")

我得到了你的结果。然后我执行了我自己的脚本，其中包 plyr已加载。警告后不要加载 plyr之后 dplyr ，我的 NA 第三个 组不见了，你的例子也计算正确!这是我所做的(我又添加了一行以查看 NA 是否仍留在第三组中):

> Pass <- structure(list(P1 = c(2L, 0L, 10L,8L), 
+                        F1 = c(0L, 2L, 0L, 4L), 
+                        P2 = c(0L, 3L, 2L, 2L), 
+                        F2 = c(0L, 2L, 0L, 1L), 
+                        id = 1:4), 
+                   .Names = c("P1", "F1", "P2", "F2", "id"), 
+                   class = c("tbl_df", "data.frame"), 
+                   row.names = c(NA, -4L))
> Pass %>%
+     group_by(id) %>%
+     mutate(pass_rate = (P1 + P2) / (P1 + P2 + F1 + F2) * 100,
+            pass_rate1 = P1 / (P1 + F1) * 100,
+            pass_rate2 = P2 / (P2 + F2) * 100)
Source: local data frame [4 x 8]
Groups: id [4]

 P1    F1    P2    F2    id pass_rate pass_rate1 pass_rate2
(int) (int) (int) (int) (int)     (dbl)      (dbl)      (dbl)
 1     2     0     0     0     1 100.00000  100.00000         NA
 2     0     2     3     2     2  42.85714    0.00000   60.00000
 3    10     0     2     0     3 100.00000  100.00000         NA
 4     8     4     2     1     4  66.66667   66.66667   66.66667

然后我做了:

> library("plyr", lib.loc="~/R/x86_64-pc-linux-gnu-library/3.2")
> Pass %>%
+     group_by(id) %>%
+     mutate(pass_rate = (P1 + P2) / (P1 + P2 + F1 + F2) * 100,
+            pass_rate1 = P1 / (P1 + F1) * 100,
+            pass_rate2 = P2 / (P2 + F2) * 100)
Source: local data frame [4 x 8]
Groups: id [4]

 P1    F1    P2    F2    id pass_rate pass_rate1 pass_rate2
(int) (int) (int) (int) (int)     (dbl)      (dbl)      (dbl)
 1     2     0     0     0     1 100.00000  100.00000        NaN
 2     0     2     3     2     2  42.85714    0.00000   60.00000
 3    10     0     2     0     3 100.00000  100.00000  100.00000
 4     8     4     2     1     4  66.66667   66.66667   66.66667

我知道这不是一个令人满意的答案，因为 plyr应该不是后加载 dplyr ，但也许它可以帮助那些需要 group_by(id) 的人.或使用 plyr::mutate() .然后你可以加载 dplyr之后 plyr :

 > Pass %>%
+     group_by(id) %>%
+     plyr::mutate(pass_rate = (P1 + P2) / (P1 + P2 + F1 + F2) * 100,
+            pass_rate1 = P1 / (P1 + F1) * 100,
+            pass_rate2 = P2 / (P2 + F2) * 100)
Source: local data frame [4 x 8]
Groups: id [4]

 P1    F1    P2    F2    id pass_rate pass_rate1 pass_rate2
(int) (int) (int) (int) (int)     (dbl)      (dbl)      (dbl)
 1     2     0     0     0     1 100.00000  100.00000        NaN
 2     0     2     3     2     2  42.85714    0.00000   60.00000
 3    10     0     2     0     3 100.00000  100.00000  100.00000
 4     8     4     2     1     4  66.66667   66.66667   66.66667

关于r - dplyr::mutate 给出 x/y = NA，summary 给出 x/y = 实数，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/33107956/

26

4

0

文章推荐： visual-studio - 在 F# Interactive 上运行 F# 测试

文章推荐： php结合url和变量

文章推荐： YII2 在 bootstrap.js 之前调用 jquery-ui 代码

文章推荐： spring - spring-security中基于路径变量的授权

vue.js - Nuxt 错误 : [vuex] Do not mutate vuex store state outside mutation handlers when mutating from plugin
我在 Nuxt 项目旁边使用 Firebase，在下面的插件中，我调用 onAuthStateChanged 来检查用户是否已经登录，如果他是，我设置用户状态并将他重定向到仪表板如下: import
r - mutate `:=` 和 mutate `=` 之间的 tidyeval 差异
这两个代码块都可以工作，即使它们使用不同的等号，一个使用 :=，另一个使用 =。哪个是正确的，为什么？我认为 tidyeval 在使用 dplyr 函数时需要 := ，但奇怪的是 = 在我的 muta
c++ - 是否可以在 C++ 的 Mutator 中使用 Mutator？
下午好! 我做了一些快速搜索，我很难弄清楚我应该如何去做我需要做的事情。对于这个程序，我们正在创建一个基本的工作票类。每个属性都有自己的修改器和访问器，但除此之外还有一个修改器将所有属性作为参数并一
php - Laravel 5 - 分形更改器(mutator) - 将参数发送到更改器(mutator)以缩小响应范围
所以我有一个名为 VIP 的模型，其中包含大量相关信息。因此，当我们转到路线 vip/{id} 时，我会返回大部分信息。但是，当我转到 vips/{per-page} 时，我不想返回所有数据，因为 A
javascript - Vuex和mysql连接对象: Do not mutate vuex store state outside mutation handlers
我有一个电子应用程序，它使用 mysql 包直接连接到我的数据库。我想做的是将使用 mysql.createConnection() 创建的 connection 对象存储在 Vuex 状态中。然后我
C++ : Suggest names for mutating and non-mutating versions of a member function
假设我有一个 Image 类，我想提供一些图像操作，比如缩放、旋转等。我想为每个操作提供两种类型的功能。一种修改对象，另一种不修改。在 Ruby 中，有些函数以 !并指出这个将修改参数。因为这在 C
javascript - DOM Mutation Observers 是否比 DOM Mutation Events 慢？
以下代码利用 DOM 突变事件 DOMNodeInserted检测 body 的存在元素并包裹它的 innerHTML放入 wrapper 中。 functi
vuejs2 - Vuex - 'do not mutate vuex store state outside mutation handlers'
我正在尝试从 Firestore 初始化我的 Vuex 商店。最后一行代码 context.commit('SET_ACTIVITIES', acts) 是产生错误的原因。我不认为我在直接改变状态，因
javascript - 尝试从 indexedDB 中的对象存储中删除对象时出现错误 "A mutation operation was attempted on a database that did not allow mutations."
所以基本上我已经阅读了相当多的教程、演示和 API 规范本身，但并没有深入了解，非常感谢你们的帮助。我最近一直在努力更好地掌握 IndexedDB，但遇到了一些问题，希望对这段代码提出一些批评/反馈
javascript - 在 indexedDB 中检索数据时出现错误 "A mutation operation was attempted on a database that did not allow mutations."
我有这个简单的示例代码: var request = mozIndexedDB.open('MyTestDatabase'); request.onsuccess = function(event){
javascript - .push end 位于 "Do not mutate vuex store state outside mutation handlers"
我定义了一个 Vuex 存储( Action 、状态、突变和 getter) 当我在突变中向状态数组添加新的待办事项时，出现以下错误:错误:[vuex] 不要在突变处理程序之外改变 vuex 存储状态
vue.js - 更好的方法处理 : 'Do not mutate vuex store state outside mutation handlers' errors
事前:我的应用程序按预期工作，但我想知道是否有更好的方法来解决我遇到的问题。情况:我有一个项目，目前正在实现权限系统。当前的流程是加载特定对象(在本例中让我们采用 user)，然后注入(inject
swift 4 : "cannot use mutating member on immutable value: ' self' is immutable"in mutating function
这段代码 extension Collection { mutating func f() { removeFirst() } } 处理错误 cannot use mutating m
r - 在 R 和 dplyr 中，使用 "mutate"和 "mutate"将多次调用 "across"替换为一次调用
我们在 R 中有以下数据框 # Create example dataframe df % dplyr::mutate(col1A = ifelse(gp == 0, col1B, col1A))
vue.js - nuxt 应用程序中的 Vuex 抛出 "Do not mutate vuex store state outside mutation handlers"!
在我的 NUXT 应用程序中，我正在使用 vuex 存储模块!当我运行应用程序并调用时 this.$store.dispatch('userStore/setLoggedInUser',current
r - "Error in UseMethod("mutate ") : no applicable method for ' mutate ' applied to an object of class "function"尝试分隔列时
所以我有这个数据集 # A tibble: 268 x 1 `Which of these social media platforms do you have an account in ri
VUEX使用学习三:mutations
转载请注明出处：　　在 Vuex 中 store 数据改变的唯一方法就是提交 mutations 。 mutations 里面装着一些改变数据方法的集合，这是Vuex 设
r - 在使用变量调用的函数中实现 mutate
我想用不同的变量多次调用一个函数，每次都为数据框中的一个新变量设置一个值。这是我失败的尝试。感谢您的帮助! dat % mutate({{var3}} := ifelse({{var1}} >
r - 如何在列表中使用 mutate？
改变列表的正确方法是什么？在这种特定情况下，列表由 split 返回。 library(dplyr) csv%split(.,.$participant_number)%>%mutate(.,var(
mutation-testing - 哪些编程语言可以支持变异测试？
在某些语言中比其他语言更难(或不可能)实现变异测试吗？例如，是否可以在功能编程语言中实现变异测试？最佳答案我看不出任何语言都无法做到的任何理由。我当然不是专家，但是我认为使用功能语言进行突变测试

首页

博学

6Ren·AI

商城

r - dplyr::mutate 给出 x/y = NA，summary 给出 x/y = 实数