r - 如何为 html 和 pdf 创建带标签的加权汇总表？-6ren

r - 如何为 html 和 pdf 创建带标签的加权汇总表？

转载作者：行者123 更新时间：2023-12-04 15:41:54

24

4

我想创建大型交叉表，其中包含多个列变量的多个行变量的汇总统计信息 - 并找到了两个包，可以非常轻松地制作漂亮的大表:Duncan Murdoch 的 tables 包和 Gregory Demin 的 expss 包(它在 table 旁边做了很多令人惊奇的事情)。还有一些其他的包，如 moonBook(与同一作者的 ztable 包一起工作)，但据我所知，它们中的大多数都缺少我需要的东西:

我要……

(可重复地)创建大型汇总表
这些汇总统计数据的用例权重
为变量使用变量和值标签
轻松创建 html 和 pdf 表格(无需更改函数选项/ block 选项)。

ad 1) tables 和 expss 都可以轻松创建具有多个行和列变量的复杂表。作为示例，我们使用 tables 和 expss 中的表函数生成 iris 数据的汇总表。

library(magrittr)  # Pipes %>% 

data(iris)  # Use iris data for examples

## Tables with the `tables` packages
library(tables)
tabular( (Species + 1) ~ (n=1) + Format(digits=2)*(Sepal.Length + Sepal.Width)*(mean + sd + Format(digits=0)*(n=1)), data=iris )

## Tables with the `expss` package
library(expss)
iris %>%
  tab_cells(Sepal.Length, Sepal.Width) %>% 
  tab_cols(Species, total()) %>% 
  tab_stat_mean_sd_n() %>% 
  tab_transpose() %>% 
  tab_pivot()

ad 2) 使用 expss 很容易使用标签

iris_raw <- iris  # Save iris for later without any labels (they cause some problems with tabular)

iris <- apply_labels(iris,
                     Sepal.Length = "Sepal Length",
                     Sepal.Width = "Sepal With",
                     Species = "Species of Iris", 
                     Species = c("Setosa" = "setosa",
                                 "Versicolor" = "versicolor",
                                 "Virginica" = "virginica"))
str(iris)  # we can see the labels

library(expss)
expss_digits(digits = 2)
iris %>%
  tab_cells(Sepal.Length, Sepal.Width) %>% 
  tab_cols(Species, total()) %>% 
  tab_stat_mean_sd_n() %>% 
  tab_transpose() %>% 
  tab_pivot()

ad 3) 对于调查，我们通常需要个案权重，例如抽样设计权重、无响应权重、分层后权重 - 确定单个案例在描述性统计计算和模型估计中的权重。 expss 只需添加一行即可使用案例权重:tab_weight(caseweight)

set.seed(123)  # Make following random numbers reproducible
iris$caseweight <- rnorm(nrow(iris), mean = 1, sd = .5) %>% abs() # Add some positive random caseweight to each case

library(expss)
expss_digits(digits = 2)
iris %>%
  tab_cells(Sepal.Length, Sepal.Width) %>% 
  tab_cols(Species, total()) %>% 
  tab_weight(caseweight) %>%  # Weight the cases
  tab_stat_mean_sd_n(weighted_valid_n = TRUE) %>% 
  tab_last_round(digits = 2) %>% 
  tab_transpose() %>% 
  tab_pivot()

在 tables 中，也可以使用(自定义)函数来计算加权汇总统计信息，但不像使用 expss 那样容易(我在这里可能是错的 - 如果那是情况，请指正)。

ad 4) 来到我愿望 list 的最后一点:html 和 pdf 表格。现在使用 tables 很容易，而使用 expss 就难多了。在 tables 中，函数 toKable() 并将输出通过管道传输到 kableExtra 以进一步细化是关键。

library(tables)
tab <- tabular( (Species + 1) ~ (n=1) + Format(digits=2)*(Sepal.Length + Sepal.Width)*(mean + sd + Format(digits=0)*(n=1)), data=iris_raw)
tab %>% 
  toKable() %>%  # Translates the tabular-object to a kable-object
  kable_styling(full_width = TRUE) %>%  # Functions from kableExtra
  column_spec(2, color = "red")


# Remark: in expss it's possible to transform the (html-) table output to a data frame via `split_table_to_df()`, s. https://stackoverflow.com/questions/55542838/formatting-tables-in-r-markdown-to-export-to-ms-word-document/55576202#55576202
# But all the formatting gets lost - since it's just a df then.

library(expss)
expss_digits(digits = 2)
tab <- iris %>%
  tab_cells(Sepal.Length, Sepal.Width) %>% 
  tab_cols(Species, total()) %>% 
  tab_stat_mean_sd_n() %>% 
  tab_transpose() %>% 
  tab_pivot()

tab.df <- split_table_to_df(tab)  # Here the table 
str(tab.df)  # a df with the numbers and the labels
kable(tab.df)  # We could use kable on this (but the result does not look like a usual kable table)

所以这两个包中的每一个都有它的超能力:expss 在创建带有标签和案例权重的表格方面做得非常出色，而 tables 使得使用表格输出变得容易从 tabular() 通过 toKable、kable 和 kableExtra 创建 html 和 pdf 表格 - 因为 Hao Zhu 的 kableExtra 包生成 html或 pdf，具体取决于编织的文档类型 - 这非常简单，无需更改任何代码(例如，从“html”到“latex”)，只需按 Knit to pdf/html - 和效果非常好。

the rendered table for html

the rendered table in the pdf

问题:对于一个简单可重现的工作流程来说，最好同时拥有所有这些功能(1 到 4)，从而将 expss 与 knitrExtra 结合起来 - 是否有像 toKable 这样的函数用于 expss 中的表(或更通用的 html 表)，可以通过 kableExtra 和简单的 html 和 pdf 导出进行改进不改变任何可能的选择？还是有其他工作流程可以实现1到4？感谢您的宝贵时间和任何提示!

最佳答案

另一种选择是使用 pander 包中的 pander。它识别来自 split_table_to_df 的数据框，并为 PDF 文档生成一个很好的输出。

关于r - 如何为 html 和 pdf 创建带标签的加权汇总表？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/57548982/

24

4

0

文章推荐： playstation - 索尼 PS4 应用程序开发

文章推荐： haskell - 具有 Bound 的相互递归语法

首页

博学

6Ren·AI

商城