gpt4 book ai didi

r - 如何在 R 中编写 trycatch

转载 作者:太空宇宙 更新时间:2023-11-04 13:01:40 25 4
gpt4 key购买 nike

我想写trycatch代码来处理从网络下载的错误。

url <- c(
"http://stat.ethz.ch/R-manual/R-devel/library/base/html/connections.html",
"http://en.wikipedia.org/wiki/Xz")
y <- mapply(readLines, con=url)

这两条语句运行成功。下面,我创建了一个不存在的网址:

url <- c("xxxxx", "http://en.wikipedia.org/wiki/Xz")

url[1] 不存在。如何编写一个 trycatch 循环(函数)以便:

  1. 当 URL 错误时,输出将是:"web URL is wrong, can't get"。
  2. 当 URL 错误时,代码不会停止,而是继续下载,直到 URL 列表结束?

最佳答案

那么:欢迎来到 R 世界 ;-)

给你

设置代码

urls <- c(
"http://stat.ethz.ch/R-manual/R-devel/library/base/html/connections.html",
"http://en.wikipedia.org/wiki/Xz",
"xxxxx"
)
readUrl <- function(url) {
out <- tryCatch(
{
# Just to highlight: if you want to use more than one
# R expression in the "try" part then you'll have to
# use curly brackets.
# 'tryCatch()' will return the last evaluated expression
# in case the "try" part was completed successfully

message("This is the 'try' part")

readLines(con=url, warn=FALSE)
# The return value of `readLines()` is the actual value
# that will be returned in case there is no condition
# (e.g. warning or error).
# You don't need to state the return value via `return()` as code
# in the "try" part is not wrapped inside a function (unlike that
# for the condition handlers for warnings and error below)
},
error=function(cond) {
message(paste("URL does not seem to exist:", url))
message("Here's the original error message:")
message(cond)
# Choose a return value in case of error
return(NA)
},
warning=function(cond) {
message(paste("URL caused a warning:", url))
message("Here's the original warning message:")
message(cond)
# Choose a return value in case of warning
return(NULL)
},
finally={
# NOTE:
# Here goes everything that should be executed at the end,
# regardless of success or error.
# If you want more than one expression to be executed, then you
# need to wrap them in curly brackets ({...}); otherwise you could
# just have written 'finally=<expression>'
message(paste("Processed URL:", url))
message("Some other message at the end")
}
)
return(out)
}

应用代码

> y <- lapply(urls, readUrl)
Processed URL: http://stat.ethz.ch/R-manual/R-devel/library/base/html/connections.html
Some other message at the end
Processed URL: http://en.wikipedia.org/wiki/Xz
Some other message at the end
URL does not seem to exist: xxxxx
Here's the original error message:
cannot open the connection
Processed URL: xxxxx
Some other message at the end
Warning message:
In file(con, "r") : cannot open file 'xxxxx': No such file or directory

调查输出

> head(y[[1]])
[1] "<!DOCTYPE html PUBLIC \"-//W3C//DTD HTML 4.01 Transitional//EN\">"
[2] "<html><head><title>R: Functions to Manipulate Connections</title>"
[3] "<meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\">"
[4] "<link rel=\"stylesheet\" type=\"text/css\" href=\"R.css\">"
[5] "</head><body>"
[6] ""

> length(y)
[1] 3

> y[[3]]
[1] NA

补充说明

tryCatch

tryCatch 返回与执行 expr 相关的值,除非出现错误或警告。在这种情况下,可以通过提供相应的处理函数(参见参数 errorwarning 来指定特定的返回值(参见上面的 return(NA)) > 在 ?tryCatch 中)。这些可以是已经存在的函数,但您也可以在 tryCatch() 中定义它们(就像我在上面所做的那样)。

选择处理函数的特定返回值的含义

正如我们指定的那样,NA 应该在出错的情况下返回,y 中的第三个元素是 NA。如果我们选择 NULL 作为返回值,则 y 的长度将只是 2 而不是 3 as lapply() 将简单地“忽略”NULL 的返回值。另请注意,如果您未通过 return() 指定 explicit 返回值,则处理函数将返回 NULL(即,如果错误或警告条件)。

“不受欢迎的”警告消息

由于 warn=FALSE 似乎没有任何效果,另一种抑制警告的方法(在这种情况下并不是真正感兴趣的)是使用

suppressWarnings(readLines(con=url))

代替

readLines(con=url, warn=FALSE)

多重表达

请注意,如果将多个表达式用大括号括起来(就像我在 finally 部分进行了说明)。

关于r - 如何在 R 中编写 trycatch,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/33733102/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com