r - 为什么 do(lm...) 和 geom_smooth(method ="lm") 之间有区别？-6ren

r - 为什么 do(lm...) 和 geom_smooth(method ="lm") 之间有区别？

转载作者：行者123 更新时间：2023-12-04 14:20:39

26

4

我有一个稍微进入饱和状态的外部校准曲线。所以我拟合了一个二阶多项式和一个测量样本的数据框，我想知道其中的浓度。

df_calibration=structure(list(dilution = c(0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 
0.8, 0.9, 1, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1), 
    area = c(1000, 2000, 3000, 4000, 5000, 6000, 7000, 7800, 
    8200, 8500, 1200, 2200, 3200, 4200, 5200, 6200, 7200, 8000, 
    8400, 8700), substance = c("A", "A", "A", "A", "A", "A", 
    "A", "A", "A", "A", "b", "b", "b", "b", "b", "b", "b", "b", 
    "b", "b")), row.names = c(NA, 20L), class = "data.frame")

df_samples=structure(list(area = c(1100, 1800, 2500, 3200, 3900, 1300, 2000, 
2700, 3400, 4100), substance = c("A", "A", "A", "A", "A", "b", 
"b", "b", "b", "b")), row.names = c(NA, 10L), class = "data.frame")

现在为了计算测量 sample 的实际稀释度，我采用了由此拟合生成的参数:

df_fits=df_calibration %>% group_by(substance) %>% 
  do(fit = lm(area ~ poly(dilution,2), data = .))%>%
  tidy(fit) %>% 
  select(substance, term, estimate) %>% 
  spread(term, estimate)

df_fits=df_fits %>% rename(a=`poly(dilution, 2)2`,b=`poly(dilution, 2)1`,c=`(Intercept)`)

#join parameters with sample data
df_samples=left_join(df_samples,df_fits)

而这个公式

#calculate with general solution for polynomial 2nd order
df_samples$dilution_calc=
  (df_samples$b*(-1)+sqrt(df_samples$b^2-(4*df_samples$a*(df_samples$c-df_samples$area))))/(2*df_samples$a)

但是，当我现在绘制此图时，我注意到一些非常奇怪的事情。
计算出的 x 值(稀释)没有出现在来自 stat_smooth() 的曲线上.附加的虚线与物质“A”的图形中方程的参数(与数据框中的数字匹配)一起放置。所以我的计算应该是正确的(或不正确？)为什么会有差异？我究竟做错了什么？我如何从 stat_smooth() 完成的拟合中获取参数?

my.formula=y ~ poly(x,2)
ggplot(df_calibration, aes(x = dilution, y = area)) +
  stat_smooth(method = "lm", se=FALSE, formula = my.formula) +

  stat_function(fun=function(x){5250+(7980*x)+(-905*x^2)},      
              inherit.aes = F,linetype="dotted")+

  stat_poly_eq(formula = my.formula, 
               aes(label = paste(..eq.label.., ..rr.label.., sep = "~~~")), 
               parse = TRUE) +         
  geom_point(shape=17)+
  geom_point(data=df_samples,
           aes(x=dilution_calc,y=area),
           shape=1,color="red")+
  facet_wrap(~substance,scales = "free")

任何建议将不胜感激:-)

最佳答案

默认情况下，poly计算正交多项式。您可以使用 raw=TRUE 关闭正交化。争论。

请注意，该公式有两次出现:一次在拟合回归时使用原始变量名称，然后在 stat_smooth 中使用。使用通用变量名 x和 y .但否则它应该是相同的公式，与raw=TRUE .

library("tidyverse")

# Define/import your data here....

df_fits <- df_calibration %>%
  group_by(substance) %>%
  do(fit = lm(area ~ poly(dilution, 2, raw = TRUE), data = .)) %>%
  broom::tidy(fit) %>%
  select(substance, term, estimate) %>%
  spread(term, estimate) %>%
  # It is simpler to rename the coefficients here
  setNames(c("substance", "c", "b", "a"))

# join parameters with sample data
df_samples <- left_join(df_samples, df_fits)

# calculate with general solution for polynomial 2nd order
df_samples <- df_samples %>%
  mutate(dilution_calc = (b * (-1) + sqrt(b^2 - (4 * a * (c - area)))) / (2 * a))

my.formula <- y ~ poly(x, 2, raw = TRUE)

df_calibration %>%
  ggplot(aes(x = dilution, y = area)) +
  stat_smooth(method = "lm", se = FALSE, formula = my.formula) +
  geom_point(shape = 17) +
  geom_point(
    data = df_samples,
    aes(x = dilution_calc, y = area),
    shape = 1, color = "red"
  ) +
  facet_wrap(~substance, scales = "free")

创建于 2019-03-31 由 reprex package (v0.2.1)

关于r - 为什么 do(lm...) 和 geom_smooth(method ="lm") 之间有区别？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/55437248/

26

4

0

文章推荐： angular - ngx-spinner 与 Angular 页面的加载不同步

文章推荐： php - Composer 脚本 echo

文章推荐： java - Spring boot 中 Post 映射的 Cors 错误

c++ - 为什么for循环中的 "++i"和 "i++"有区别？
我觉得 for(int i = 0; i < 2; i++) 和 for(int i = 0; i < 2; ++i) 不应该做同样的事情。对于第二个例子，从循环开始 i 应该等于 1 对我来说更符合
haskell - 为什么 throw 和 throwIO 有区别？
我试图牢牢掌握异常情况，以便改进我的conditional loop implementation .为此，我进行了各种实验，扔东西，看看会被抓到什么。这个让我惊喜不已: % cat X.hs mo
css - 为什么内联 CSS 和普通 CSS 有区别
我只是想回答一个问题，但我遇到了一些我不明白的事情!为什么如果我在文件中使用内联 CSS 或 CSS，如本例中的颜色，结果就不一样! 代码相同，但第一段是绿色，第二段是红色! 我真的不明白为什么？谢
HTML 为什么我的边框中的 span 和 div 有区别？
我目前正在学习 CSS 并进行试验，我偶然发现了输出中的这种差异。所以这是代码: .red-text { color: red;
python - 为什么 "import"与 "import *"有区别？
"""module a.py""" test = "I am test" _test = "I am _test" __test = "I am __test" ============= ~ $ p
firebase - 为什么 servertimestamp 和 Firestore 中的 js new Date() 有区别？
在向 Firestore 写入文档时，我经常看到 serverTimestamp() 标记和 new Date() 对象之间的差异不为零。差异范围从几秒到几十分钟。他们不是在做同样的事情吗？
python - 为什么 round(x) 和 round(np.float64(x)) 有区别？
据我了解，2.675 和 numpy.float64(2.675) 都是相同的数字。然而，round(2.675, 2) 给出 2.67，而 round(np.float64(2.675), 2) 给
linux - 为什么使用 std::thread::hardware_concurrency() 和 boost::thread::hardware_concurrency() 有区别？
问题本身的描述很简单。我正在测试 C++11 中 std::thread 库和 boost::thread 库的区别。这些的输出: #include #include #include int
apache-spark - 为什么 sqlContext.read.load 和 sqlContext.read.text 有区别？
我只是想将文本文件读入 pyspark RDD，我注意到 sqlContext.read.load 之间的巨大差异和 sqlContext.read.text . s3_single_file_inp
.net - 使用 SC.exe 或 InstallUtil.exe 安装 Windows 服务 - 有区别，但哪个？
SC.exe 和 InstallUtil 都可以安装/卸载 Windows 服务。但它们的工作方式似乎并不相同。有什么区别？例如，InstallUtil 失败(找不到某些文件或依赖项错误)，而 S
Why is there difference between static Thread.currentThread().getName() and getName()?(为什么Static Thread.CurrentThread().getName()和getName()有区别？)
我认为Thread对象就像是带有名称和静态Thread.CurrentThread()的抽象对象，就像访问Thread对象的方式一样。显然，这是错误的假设。。是这样的吗？
Why is there difference between static Thread.currentThread().getName() and getName()?(为什么Static Thread.CurrentThread().getName()和getName()有区别？)
我认为Thread对象就像是带有名称和静态Thread.CurrentThread()的抽象对象，就像访问Thread对象的方式一样。显然，这是错误的假设。。是这样的吗？

首页

博学

6Ren·AI

商城

r - 为什么 do(lm...) 和 geom_smooth(method ="lm") 之间有区别？