r - ggplot : transperancy of histogram as function of stat(count)-6ren

r - ggplot : transperancy of histogram as function of stat(count)

转载作者：行者123 更新时间：2023-12-04 03:51:30

24

4

我正在尝试以这种方式制作缩放直方图，每个“列”(bin？)的透明度取决于给定 x 范围内的观察数量。这是我的代码:

set.seed(1)
test = data.frame(x = rnorm(200, mean = 0, sd = 10),
                  y = as.factor(sample(c(0,1), replace=TRUE, size=100)))
threshold = 20 
ggplot(test,
       aes(x = x))+
  geom_histogram(aes(fill = y, alpha = stat(count) > threshold),
                 position = "fill", bins = 10)

基本上我想制作看起来像这样的图:

但是我的代码生成的图表基于分组后的计数应用透明度，最终以这样的挂列结束:

对于这个例子，为了模拟一个“正确的”图，我只是调整了阈值，但我需要 alpha 来考虑给定“列”(bin) 中两组的计数总和。

更新:我还希望它以这样一种方式处理分面图，即每个分面中的突出显示区域独立于其他分面。 @Stefan 提出的方法非常适合单个情节，但在多面情节中突出显示所有方面的同一区域。

library(ggplot2)

set.seed(1)
test = data.frame(x = rnorm(1000, mean = 0, sd = 10),
                  y = as.factor(sample(c(0,1), replace=TRUE, size=1000)),
                  n = as.factor(sample(c(0,1,2), replace=TRUE, size=1000)),
                  m = as.factor(sample(c(0,1,3,4), replace=TRUE, size=1000)))
f = function(..count.., ..x..) tapply(..count.., factor(..x..), sum)[factor(..x..)]
threshold = 10 
ggplot(test,
       aes(x = x))+
  geom_histogram(aes(fill = y, alpha = f(..count.., ..x..) > threshold),
                 position = "fill", bins = 10)+
  facet_grid(rows = vars(n),
             cols = vars(m))

最佳答案

这可以这样实现:

由于 stat_count 计算的 count 是分组后的 obs 数量，我们必须手动汇总各组的 count 以获得总数count 每个 bin。
为了汇总每个 bin 的计数，我使用 tapply，其中我使用 .. 符号来获取由 stat_count 计算的变量.
作为分组变量，我使用了计算变量 ..x..，据我所知，它没有记录。基本上 ..x.. 默认包含 bin 的中点，因此可以用作 bin 的标识符。但是，由于这些是连续值，我们已将它们转换为一个因子。

最后，为了使代码更具可读性，我使用了一个辅助函数来计算聚合计数。此外，我将 threshold 值加倍到 20。

library(ggplot2)

set.seed(1)
test <- data.frame(
  x = rnorm(200, mean = 0, sd = 10),
  y = as.factor(sample(c(0, 1), replace = TRUE, size = 100))
)
threshold <- 20

f <- function(..count.., ..x..) tapply(..count.., factor(..x..), sum)[factor(..x..)]
p <- ggplot(
  test,
  aes(x = x)
) +
  geom_histogram(aes(fill = y, alpha = f(..count.., ..x..) > threshold),
    position = "fill", bins = 10
  )
p

EDIT 为了允许分面，我们必须将 ..PANEL.. 标识符作为附加参数传递给函数。我不再使用 tapply，而是使用 dplyr::group_by 和 dplyr::add_count 来计算每个 bin 和 facet 面板的总数:

library(ggplot2)
library(dplyr)

set.seed(1)
test <- data.frame(
  x = rnorm(200, mean = 0, sd = 10),
  y = as.factor(sample(c(0, 1), replace = TRUE, size = 100)),
  type = rep(c("A", "B"), each = 100)
)
threshold <- 20

f <- function(count, x, PANEL) {
  data.frame(count, x, PANEL) %>% 
    add_count(x, PANEL, wt = count) %>% 
    pull(n)
}
p <- ggplot(
  test,
  aes(x = x)
) +
  geom_histogram(aes(fill = y, alpha = f(..count.., ..x.., ..PANEL..) > threshold),
                 position = "fill", bins = 10
  ) +
  facet_wrap(~type)
p
#> Warning: Using alpha for a discrete variable is not advised.
#> Warning: Removed 2 rows containing missing values (geom_bar).

关于r - ggplot : transperancy of histogram as function of stat(count)，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/64382175/

24

4

0

文章推荐： python - Pandas 如何在数据框列中提取整数和 float 的混合

文章推荐： c# - 从列表中获取通用数据

文章推荐： python - Bot 在 Top.gg 上投票时没有发送 DM

c - struct stat 和 stat 函数失败
所以，我正在尝试创建一种 ls 函数。这是我对每个文件的描述的代码 struct stat fileStat; struct dirent **files; num_entries = scandir
C sys/stat.h 并非 stat 结构的每个字段都被初始化
我最近一直在尝试实现我自己的 linux ls 命令版本。一切都很好，但是当我尝试使用 ls -l 功能时，struct stat 的某些字段未初始化 - 我得到 NULL 指针或垃圾值，尽管它似乎只
php - 相关模型的 STAT 关系的 STAT 关系，Yii
我在 Yii 中遇到 STAT 关系问题。我不确定我正在寻找的东西是否可以通过本地 Yii 关系实现。我会尽力描述我的问题，如果不清楚，请询问任何具体细节。我有三个表，因此有三个模型 | table
python - 部署后在 Django 中使用 scipy.stats.stats
我正在为一个严重依赖 scipy.stats.stats(scipy 版本 0.9.0)的包创建一个 django-powered (1.3) 接口(interface)，称为 ovl 。在早期开发阶
c++ - 我想重置 C++ struct stat，我可以以某种方式使用 stat() 语法吗？
为了安全起见，我喜欢显式初始化我的变量(当您编写大量代码时，它通常会使它更安全，因为您的代码最终不会崩溃那么多。) 对于大多数类型，无论是结构还是整数等基本 C++ 类型，我都可以编写以下内容: ti
c - 是否有 stat() 的宽字符版本(来自 sys/stat.h)？
我一直在使用 stat() 检查文件是否存在，据我所知，这比尝试打开文件更好。但是，stat() 不适用于包含其他语言的 unicode 字符的文件名。是否有 stat() 的宽字符版本或我可以使用的
python - 无法从 scipy.stats.stats 导入 ss 函数
错误: File "/usr/lib/python2.7/dist-packages/statsmodels/regression/linear_model.py", line 36, in
linux - 在一行中使用 awk 和 stat 来打印 stat 的值
下面是我要运行的脚本。我不能在 awk 中使用 stat。 cat /etc/passwd | awk 'BEGIN{FS=":"}{print $6 }' | (stat $6 | sed -n '
python - Seaborn regplot 拟合线与来自 stats.linregress 或 stats 模型的计算拟合不匹配
我正在尝试拟合 xlog 线性回归。我使用 Seaborn regplot 来绘制拟合，看起来很合适(绿线)。然后，因为 regplot 不提供系数。我使用 stats.linregress 来查找系
c - libc_nonshared.a(stat.oS) 中的隐藏符号 `stat' 被 DSO 引用
我正在尝试使用共享库 (libscplugin.so) 中包含的方法。我已经满足了库的所有要求: libc.so 带有指向 libc.so.6 的符号链接(symbolic link) libz.s
c - C 编程中的错误 : stat: No Such File Or Directory, opendir() 和 stat()
嘿，感谢阅读。我正在制作一个程序，它接受 1 个参数(目录)并使用 opendir()/readdir() 读取目录中的所有文件，并使用 stat 显示文件类型(reg、链接、目录等)。当我在 sh
c - 非设备文件上的 major(stat.st_rdev) 和 minor(stat.st_rdev)
简单问题:在 Linux 中，我 stat() 一个不是设备的文件。 st_rdev 字段的期望值是多少？我可以运行 major(stat.st_rdev) 和 minor(stat.st_rdev)
使用 --stats-json 构建 Angular 6 不生成 stats.json 文件
我正在尝试为我的 Angular 6 应用程序生成 stats.json 文件。下面的事情我已经尝试过，但根本没有生成文件。我的系统需要有 “npm 运行”在每个 angular cli 命令之前。
c - [C][Stat][Fileinfo] 当我使用 stat() 调用返回结构时，为什么 st_mode 定义为不在结构中的内容？
我正在尝试使用返回的 stat 结构中的 st_mode，该结构是我通过以下方式从 stat() 调用获得的； char *fn = "test.c" struct s
c++ - stat(char[], stat) 当 string.c_str() = "c:/"时返回 -1
关闭。这个问题需要debugging details .它目前不接受答案。编辑问题以包含 desired behavior, a specific problem or error, and th
c - 系统/stat.h :456: error: nested function 'stat' declared 'extern'
我有一个程序，是我通过修改原始暗网(深度学习图像识别，Yolov2)的许多地方而制作的。几个月前我一直在使用它，但是今天当我编译它时，它给了我一个错误: gcc -DSAVE_LAYER_INPUT
python - 为什么 scipy.stats.mstats.pearsonr 结果与 scipy.stats.pearsonr 不一致？
我预计 scipy.stats.mstats.pearsonr 对于屏蔽数组输入的结果将与 scipy.stats.pearsonr 对于输入数据的 unmasked 值给出相同的结果，但它不会't:
从调用 stat 失败并出现 "Value too large for defined data type"错误
给定 tmp.c: #include #include #include int main(int argc, const char *argv[]) { struct stat st;
python - 为什么 scipy.stats.entropy(a, b) 返回 inf 而 scipy.stats.entropy(b, a) 不返回？
In [15]: a = np.array([0.5, 0.5, 0, 0, 0]) In [16]: b = np.array([1, 0, 0, 0, 0]) In [17]: entropy(a
linux - Stat 命令将具有更改日期的文件列入候选名单
当我们运行 stat filename我们得到 Access: 2021-06-25 15:40:18.532621916 +0530 Modify: 2020-08-13 15:57:30.0000

首页

博学

6Ren·AI

商城

r - ggplot : transperancy of histogram as function of stat(count)