R矩阵包: Demean sparse matrix-6ren

R矩阵包: Demean sparse matrix

转载作者：行者123 更新时间：2023-12-03 03:40:38

38

4

是否有一种简单的方法可以按列贬低稀疏矩阵，同时将零值视为缺失(使用 Matrix 包)？

我似乎遇到两个问题:

找到合适的列意味着

空单元格被视为零而不是缺失:

M0 <- matrix(rep(1:5,4),nrow = 4)
M0[2,2] <- M0[2,3] <- 0
M <- as(M0, "sparseMatrix")
M
#[1,] 1 5 4 3 2
#[2,] 2 . . 4 3
#[3,] 3 2 1 5 4
#[4,] 4 3 2 1 5
colMeans(M)
#[1] 2.50 2.50 1.75 3.25 3.50

正确的结果应该是:

colMeans_correct <- colSums(M) / c(4,3,3,4,4)
colMeans_correct
#[1] 2.500000 3.333333 2.333333 3.250000 3.500000

减去列平均值

对缺失的单元格也进行减法:

sweep(M, 2, colMeans_correct)
#4 x 5 Matrix of class "dgeMatrix"
#     [,1]       [,2]       [,3]  [,4] [,5]
#[1,] -1.5  1.6666667  1.6666667 -0.25 -1.5
#[2,] -0.5 -3.3333333 -2.3333333  0.75 -0.5
#[3,]  0.5 -1.3333333 -1.3333333  1.75  0.5
#[4,]  1.5 -0.3333333 -0.3333333 -2.25  1.5

附注希望发布由两个问题组成的问题不是问题。它们连接到相同的任务，似乎反射(reflect)了相同的问题 - 区分缺失值和实际零值。

最佳答案

一种选择是将 colSums 除以非零逻辑矩阵的 colSums

colSums(M)/colSums(M!=0)
#[1] 2.500000 3.333333 2.333333 3.250000 3.500000

或者另一种选择是将 0 替换为 NA 并使用 na.rm = TRUE 参数获取 colMeans

colMeans(M*NA^!M, na.rm = TRUE)
#[1] 2.500000 3.333333 2.333333 3.250000 3.500000

<小时/>

或者@user20650评论

colSums(M) / diff(M@p)
#[1] 2.500000 3.333333 2.333333 3.250000 3.500000

其中“p”是?sparseMatrix中提到的指针

In typical usage, p is missing, i and j are vectors of positive integers and x is a numeric vector. These three vectors, which must have the same length, form the triplet representation of the sparse matrix.

If i or j is missing then p must be a non-decreasing integer vector whose first element is zero. It provides the compressed, or “pointer” representation of the row or column indices, whichever is missing. The expanded form of p, rep(seq_along(dp),dp) where dp <- diff(p), is used as the (1-based) row or column indices.

关于R矩阵包: Demean sparse matrix，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/43824496/

38

4

0

文章推荐： R 对连续重复的奇数列表求和并删除除第一个列表之外的所有列表

文章推荐： javascript - Chart.js不会在IE中显示图表

文章推荐： sql - 我需要在每年的 Oracle SQL 查询中找到最高投票者

R矩阵包: Demean sparse matrix
是否有一种简单的方法可以按列贬低稀疏矩阵，同时将零值视为缺失(使用 Matrix 包)？我似乎遇到两个问题: 找到合适的列意味着空单元格被视为零而不是缺失: M0 或者@user20650评论
r - Demean R 数据框
我想贬低 R data.frame 中的多个列。使用 this question 中的示例 set.seed(999) library(plyr) library(plm) # random data

首页

博学

6Ren·AI

商城

R矩阵包: Demean sparse matrix