gpt4 book ai didi

r - 在 1 个变量和变量列表之间执行所有可能的线性回归

转载 作者:行者123 更新时间:2023-12-02 20:21:03 27 4
gpt4 key购买 nike

我正在使用以下代码(开发 in a previous post )来执行以下任务:在第一个变量和其他变量之间执行所有可能的线性回归,并将结果保存在新的数据框中。

library(broom)
library(dplyr)
x <- names(data[,-1])
out <- unlist(lapply(1, function(n) combn(x, 1, FUN=function(row)
paste0("tlv ~ ", paste0(row, collapse = "+")))))
## get the regression coefficients
tmp1 = bind_rows(lapply(out, function(frml) {
a = tidy(lm(frml, data=data))
a$frml = frml
return(a)
}))
reg_coeff2 <- tmp1
## Get regression results i.e. R2, AIC, BIC
tmp2 = bind_rows(lapply(out, function(frml) {
a = glance(lm(frml, data=data))
a$frml = frml
return(a)
}))
reg_results2 <- tmp2
reg_results2$frml <- sub("tlv ~ ", "", reg_results2$frml)

该代码运行得很好,但我想实现它以便执行以下操作。

我有以下数据框(数据)

structure(list(id = c(5309039, 5284969, 5300279, 5270289, 5259957, 
5267086, 5173196), var1 = c(0, 0, 0, 0, 0, 0, 0), var2 = c(23,
24, 20, 32, 31, 37, 43), var3 = c(162, 154, 156, 154, 151.5,
171, 154), var4 = c(62.8, 52.7, 64.5, 70.9, 63, 66.2, 60.3),
tlv = c(1049, 978, 1131, 1292, 1228, 1593, 1265), form20 = c(1674.12110392683,
1517.06018080512, 1666.03606715029, 1726.99450999549, 1627.94506984781,
1754.74878787639, 1608.54623766777), form19 = c(1062.84280028848,
902.364998653641, 1054.58187260355, 1116.8664734097, 1015.66220125765,
1145.22454880977, 995.841345244203), form18 = c(1050.91941325579,
891.3634649201, 1026.84722464179, 1073.58291322486, 980.997498562542,
1147.23019335865, 971.271632531001), form17 = c(1404.10436829839,
1220.98291088203, 1419.72032143583, 1517.11065788694, 1386.31581471687,
1477.21675910098, 1347.52393410332), form16 = c(1248.12292187059,
1126.73082253566, 1229.80850901466, 1265.36558733196, 1194.92548170827,
1321.39733067342, 1187.52592495257), form15 = c(990.132,
866.003, 1011.025, 1089.681, 992.59, 1031.918, 959.407),
form14 = c(1590.6052, 1436.4718, 1582.993, 1830.3706, 1688.692,
1812.3808, 1786.5202), form13 = c(1300.81321145176, 1130.23869905075,
1292.03253463863, 1358.23586808642, 1250.66417156907, 1388.37813595599,
1277.89625553694), form12 = c(1329.6, 1104.4, 1272, 1322.8,
1195.5, 1487.4, 1195.6)), row.names = c(NA, -7L), class = c("tbl_df",
"tbl", "data.frame"))

我需要在变量 tlv 和名称以前缀 "form"开头的所有变量之间执行线性回归,因此排除其他变量(即 var1 , var2, var3, ...)

最佳答案

考虑使用 apply 系列来构建所有可能组合所需的公式,然后迭代地传递到 lm 中。除了 broom 函数之外,下面演示了基本 R:

indvar_list <- lapply(1:9, function(x) combn(paste0("form", 12:20), x, simplify = FALSE)) 

formulas_list <- rapply(indvar_list, function(x) as.formula(paste("tlv ~", paste(x, collapse="+"))))

tmp1 <- do.call(rbind, lapply(formulas_list, function(f)
transform(tidy(lm(f, data=data)), frml = f)
))

tmp2 <- do.call(rbind, (lapply(formulas_list, function(f)
transform(glance(lm(f, data=data)), frml = f)
))

关于r - 在 1 个变量和变量列表之间执行所有可能的线性回归,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/59482181/

27 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com