r - 回归和 PCA 的视觉比较-6ren

r - 回归和 PCA 的视觉比较

转载作者：行者123 更新时间：2023-12-04 03:59:59

受博客 Cerebral Mastication 的启发，我正在尝试完善一种比较回归和 PCA 的方法。这也已在 SO 上从不同角度进行了讨论.在我忘记之前，非常感谢 JD Long 和 Josh Ulrich 的大部分核心内容。我将在下学期的类(class)中使用它。对不起，这很长!

更新:我发现了一种几乎可以工作的不同方法(如果可以的话，请修复它!)。我把它贴在了底部。比我想出的更聪明、更短的方法!

我基本上遵循了之前的方案:生成随机数据，找出最佳拟合线，绘制残差。这显示在下面的第二个代码块中。但我也挖掘并编写了一些函数来绘制垂直于通过随机点(本例中的数据点)的线的线。我认为这些工作正常，它们显示在 First Code Chunk 中以及它们工作的证明。

现在，第二个代码块使用与@JDLong 相同的流程显示了整个操作，我正在添加结果图的图像。黑色数据，红色是带有残差粉红色的回归，蓝色是第一个 PC，浅蓝色应该是法线，但显然不是。 First Code Chunk 中绘制这些法线的函数看起来不错，但演示中有些地方不对:我想我一定是误解了某些东西或传递了错误的值。我的法线是水平的，这似乎是一个有用的线索(但到目前为止，对我来说不是)。谁能看到这里有什么问题？

谢谢，这个问题困扰了我一段时间...
Plot showing problem

第一个代码块(绘制法线并证明它们有效的功能):

##### The functions below are based very loosely on the citation at the end

pointOnLineNearPoint <- function(Px, Py, slope, intercept) {
    # Px, Py is the point to test, can be a vector.
    # slope, intercept is the line to check distance.

    Ax <- Px-10*diff(range(Px))
    Bx <- Px+10*diff(range(Px))
    Ay <- Ax * slope + intercept
    By <- Bx * slope + intercept
    pointOnLine(Px, Py, Ax, Ay, Bx, By)
    }

pointOnLine <- function(Px, Py, Ax, Ay, Bx, By) {

    # This approach based upon comingstorm's answer on
    # stackoverflow.com/questions/3120357/get-closest-point-to-a-line
    # Vectorized by Bryan

    PB <- data.frame(x = Px - Bx, y = Py - By)
    AB <- data.frame(x = Ax - Bx, y = Ay - By)
    PB <- as.matrix(PB)
    AB <- as.matrix(AB)
    k_raw <- k <- c()
    for (n in 1:nrow(PB)) {
        k_raw[n] <- (PB[n,] %*% AB[n,])/(AB[n,] %*% AB[n,])
        if (k_raw[n] < 0)  { k[n] <- 0
            } else { if (k_raw[n] > 1) k[n] <- 1
                else k[n] <- k_raw[n] }
        }
    x = (k * Ax + (1 - k)* Bx)
    y = (k * Ay + (1 - k)* By)
    ans <- data.frame(x, y)
    ans
    }

# The following proves that pointOnLineNearPoint
# and pointOnLine work properly and accept vectors

par(mar = c(4, 4, 4, 4)) # otherwise the plot is slightly distorted
# and right angles don't appear as right angles

m <- runif(1, -5, 5)
b <- runif(1, -20, 20)
plot(-20:20, -20:20, type = "n", xlab = "x values", ylab = "y values")
abline(b, m )

Px <- rnorm(10, 0, 4)
Py <- rnorm(10, 0, 4)

res <- pointOnLineNearPoint(Px, Py, m, b)
points(Px, Py, col = "red")
segments(Px, Py, res[,1], res[,2], col = "blue")

##========================================================
##
##  Credits:
##  Theory by Paul Bourke http://local.wasp.uwa.edu.au/~pbourke/geometry/pointline/
##  Based in part on C code by Damian Coventry Tuesday, 16 July 2002
##  Based on VBA code by Brandon Crosby 9-6-05 (2 dimensions)
##  With grateful thanks for answering our needs!
##  This is an R (http://www.r-project.org) implementation by Gregoire Thomas 7/11/08
##
##========================================================

第二个代码块(绘制演示):

set.seed(55)
np <- 10 # number of data points
x <- 1:np
e <- rnorm(np, 0, 60)
y <- 12 + 5 * x + e

par(mar = c(4, 4, 4, 4)) # otherwise the plot is slightly distorted

plot(x, y, main = "Regression minimizes the y-residuals & PCA the normals")
yx.lm <- lm(y ~ x)
lines(x, predict(yx.lm), col = "red", lwd = 2)
segments(x, y, x, fitted(yx.lm), col = "pink")

# pca "by hand"
xyNorm <- cbind(x = x - mean(x), y = y - mean(y)) # mean centers
xyCov <- cov(xyNorm)
eigenValues <- eigen(xyCov)$values
eigenVectors <- eigen(xyCov)$vectors

# Add the first PC by denormalizing back to original coords:

new.y <- (eigenVectors[2,1]/eigenVectors[1,1] * xyNorm[x]) + mean(y)
lines(x, new.y, col = "blue", lwd = 2)

# Now add the normals

yx2.lm <- lm(new.y ~ x) # zero residuals: already a line
res <- pointOnLineNearPoint(x, y, yx2.lm$coef[2], yx2.lm$coef[1])
points(res[,1], res[,2], col = "blue", pch = 20) # segments should end here
segments(x, y, res[,1], res[,2], col = "lightblue1") # the normals

＃＃＃＃＃＃＃＃＃＃＃＃更新

在 Vincent Zoonekynd's Page我几乎完全找到了我想要的东西。但是，它并不完全有效(显然曾经有效)。这是该站点的代码摘录，它绘制了通过垂直轴反射的第一台 PC 的法线:

set.seed(1)
x <- rnorm(20)
y <- x + rnorm(20)
plot(y~x, asp = 1)
r <- lm(y~x)
abline(r, col='red')

r <- princomp(cbind(x,y))
b <- r$loadings[2,1] / r$loadings[1,1]
a <- r$center[2] - b * r$center[1]
abline(a, b, col = "blue")
title(main='Appears to use the reflection of PC1')

u <- r$loadings
# Projection onto the first axis
p <- matrix( c(1,0,0,0), nrow=2 )
X <- rbind(x,y)
X <- r$center + solve(u, p %*% u %*% (X - r$center))
segments( x, y, X[1,], X[2,] , col = "lightblue1")

结果如下:

最佳答案

好吧，我将不得不回答我自己的问题!经过进一步阅读和比较人们在互联网上的方法，我已经解决了这个问题。我不确定我是否可以清楚地说明我“修复”了什么，因为我经历了很多次迭代。无论如何，这是情节和代码(MWE)。为了清楚起见，帮助函数在最后。

Working Demo

# Comparison of Linear Regression & PCA
# Generate sample data

set.seed(39) # gives a decent-looking example
np <- 10 # number of data points
x <- -np:np
e <- rnorm(length(x), 0, 10)
y <- rnorm(1, 0, 2) * x + 3*rnorm(1, 0, 2) + e

# Plot the main data & residuals

plot(x, y, main = "Regression minimizes the y-residuals & PCA the normals", asp = 1)
yx.lm <- lm(y ~ x)
lines(x, predict(yx.lm), col = "red", lwd = 2)
segments(x, y, x, fitted(yx.lm), col = "pink")

# Now the PCA using built-in functions
# rotation = loadings = eigenvectors

r <- prcomp(cbind(x,y), retx = TRUE)
b <- r$rotation[2,1] / r$rotation[1,1] # gets slope of loading/eigenvector 1
a <- r$center[2] - b * r$center[1]
abline(a, b, col = "blue") # Plot 1st PC

# Plot normals to 1st PC

X <- pointOnLineNearPoint(x, y, b, a)
segments( x, y, X[,1], X[,2], col = "lightblue1")

###### Needed Functions

pointOnLineNearPoint <- function(Px, Py, slope, intercept) {
    # Px, Py is the point to test, can be a vector.
    # slope, intercept is the line to check distance.

    Ax <- Px-10*diff(range(Px))
    Bx <- Px+10*diff(range(Px))
    Ay <- Ax * slope + intercept
    By <- Bx * slope + intercept
    pointOnLine(Px, Py, Ax, Ay, Bx, By)
    }

pointOnLine <- function(Px, Py, Ax, Ay, Bx, By) {

    # This approach based upon comingstorm's answer on
    # stackoverflow.com/questions/3120357/get-closest-point-to-a-line
    # Vectorized by Bryan

    PB <- data.frame(x = Px - Bx, y = Py - By)
    AB <- data.frame(x = Ax - Bx, y = Ay - By)
    PB <- as.matrix(PB)
    AB <- as.matrix(AB)
    k_raw <- k <- c()
    for (n in 1:nrow(PB)) {
        k_raw[n] <- (PB[n,] %*% AB[n,])/(AB[n,] %*% AB[n,])
        if (k_raw[n] < 0)  { k[n] <- 0
            } else { if (k_raw[n] > 1) k[n] <- 1
                else k[n] <- k_raw[n] }
        }
    x = (k * Ax + (1 - k)* Bx)
    y = (k * Ay + (1 - k)* By)
    ans <- data.frame(x, y)
    ans
    }

关于r - 回归和 PCA 的视觉比较，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/8457279/

文章推荐： r - 将工具提示格式化为货币并打开比较模式

文章推荐： java - 合并Spring引用的多个属性文件

文章推荐： java - 将 Exchange 文件夹分类为可分辨文件夹和普通文件夹

文章推荐： Erlang 和 Singleton 的好习惯？

java - 缓慢移动的物体(视觉)
当我尝试以非整数的步长(例如，每帧 0.5 像素)在屏幕上移动图形对象时，这会导致移动不稳定和“滞后”；因为对象只会每两帧移动 1 个像素。我理解为什么会发生这种情况，因为对象的 x/y 值必须是整
Android - 如何创建家谱图(视觉)
市面上有大量的家谱应用程序，但出于某种原因，我找不到一个示例来说明如何为 Android 应用程序创建一个。我是否使用 Canvas ，是否有图表库？我的基本要求是画一个三层的树(节点)图/图表，其
vim:视觉 block 选择直到每一行的字符
[ {name: 'John'}, {name: 'Plasmody'}, {name: 'Kugelschreiber'}, {name: 'Sarrah'}, ] 如果我在 J并做
php - 视觉 Composer 在哪里存储链接？
我试图定位所有没有 www 的链接。在数据库中。 https://launchhousing.org.au 并替换为 https://www.launchhousing.org.au 我使用了“搜索和
svn - 视觉 SVN 还是替代方案？
关闭。这个问题不符合Stack Overflow guidelines .它目前不接受答案。想改进这个问题？将问题更新为 on-topic对于堆栈溢出。 6年前关闭。 Improve this qu
svn - (视觉)SVN 通过正则表达式忽略文件
我需要排除具有以下模式的文件: ProjectFoo.Data[0-9]{14}.lgp 如何将 RegEx 用于 (Visual)SVN 忽略列表？最佳答案 subversion 忽略列表不支持正
audio - 创建绘制的音频 react 视觉
我正在寻找在处理中创建该项目的方法，但是，我发现该术语有点困难。我不确定如何调用在整个歌曲中线条永久保持的效果来“绘制”音乐数据。对于我可以查看哪些教程或某人的回答，我将不胜感激。我的目标是创建尽
android:视觉/艺术 Assets
我正在尝试为 android 制作游戏。我目前已将所有美术资源加载到 drawables 文件夹中，但我的问题是如何实际引用特定资源来渲染它？我知道每个文件都有一个唯一的@id，我可能必须在onDr
android - 视觉，图标，图像等移动应用程序设计
Closed. This question is off-topic。它当前不接受答案。
ios - 视觉 QuartzCore iOS
只是一个简单的问题。有一个简单的可视化工具可以生成iOS/QuartzCore的源代码吗？例如，我会制作一个带有路径和a的CAKeyframeAnimation(例如CGPathMoveToPoi
html - 如何调整文本宽度并将其放在帖子(视觉)编辑器的中心？
编辑 3:我想这已经解决了。我刚刚启用了古腾堡编辑器并发现了它的“经典编辑器”部分，即代码编辑器。我唯一需要习惯的是我无法轻易修改的编辑器行高，这还不错。这对我有用，它超过了修改 functions.
jquery - 视觉 Composer 挑战
我想在具有背景 slider 的可视 Composer 行内创建一个下拉菜单，最重要的是我要切换的链接。我在编辑自定义 css 时面临的问题是链接没有设置为 bottom:0;已设置position:
c++ - 视觉 C++ 开发
我正在学习 C++，并且了解一点 Visual Basic 和 Delphi。但我想知道，有没有像 Delphi 这样的程序，但适用于 C++。您可以将按钮拖到窗体上，双击它，就像在 Delphi
python - 找不到匹配的 GLX 视觉
我正在努力使用 pygame 初始化 OpenGL 显示。和pyopengl . import pygame pygame.init() pygame.display.set_mode((1920,
html - Wordpress 视觉 - 没有换行符被识别
不确定我做错了什么。我创建了一个主题，除了我在可视化编辑器中创建帖子外，一切都很好。对我来说，这很好，但大多数用户不了解 HTML，因此无法真正进入并编辑代码。在元素检查器(Chrome)中，文章是
c# - 如何在所谓的事件中找到哪个控件引发了事件？视觉 C#
我正在编写一个 C# 程序，它接受一堆参数并对数据点进行一些转换，然后将它们绘制到屏幕上。在我的一个表单上，我有一堆文本框，我都想执行相同的 KeyPress 事件。在我只做一个 switch 语句
uml - 视觉 2013 : Where are the signal shapes?
我正在创建 UML 事件图，我需要使用发送和接受信号，但我似乎找不到它。我试图用谷歌搜索它，但我似乎找不到任何东西。有谁知道我在哪里可以找到它们，或者它们在 Visio 中不存在？最佳答案想知道为
visual-studio - 视觉 haskell 2008
是 Haskell for Visual Studio 2005兼容VS2008 SP1 ? 最佳答案您最初问题的答案是否定的。visual haskell 的代码是用 Haskell 编写的，并通
wordpress - 是否可以在可分页容器(视觉 Composer )上使用箭头？
我正在使用 Visual Composer 开发我的 WordPress 网站。我需要包含一个可分页的容器，但如果它可以像幻灯片一样就更好了。 This is my pageable contain
rest - 视觉 REST API Playground
有哪些 Web 应用程序可以让我直观地(通过单击)使用任何 REST API 并生成一些代码(以任何语言)来捕捉我所描述的视觉内容？与 Swagger 或 Google API Playground

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

r - 回归和 PCA 的视觉比较