r - 加速 Rcpp `anyNA` 等效-6ren

r - 加速 Rcpp `anyNA` 等效

转载作者：行者123 更新时间：2023-12-03 16:54:38

这个问题与this old question有关和 this old question .
R 有很好的 wrapper-ish 函数 anyNA更快地评估 any(is.na(x)) .在 Rcpp 中工作时，可以通过以下方式给出类似的最小实现:

// CharacterVector example
#include <Rcpp.h>
using namespace Rcpp;
template<typename T, typename S>
bool any_na(S x){
  T xx = as<T>(x);
  for(auto i : xx){
    if(T::is_na(i))
      return true;
  }
  return false;
}

// [[Rcpp::export(rng = false)]]
LogicalVector any_na(SEXP x){
  return any_na<CharacterVector>(x);
}

// [[Rcpp::export(rng = false)]]
SEXP overhead(SEXP x){
  CharacterVector xx = as<CharacterVector>(x);
  return wrap(xx);
}
/***R

library(microbenchmark)
vec <- sample(letters, 1e6, TRUE)
vec[1e6] <- NA_character_
any_na(vec)
# [1] TRUE
*/

但是将其性能与 anyNA 进行比较我对下面的基准感到惊讶

library(microbenchmark)
microbenchmark(
  Rcpp = any_na(vec), 
  R = anyNA(vec),
  overhead = overhead(vec), 
  unit = "ms"
)
Unit: milliseconds
     expr      min        lq     mean    median       uq      max neval cld
     Rcpp 2.647901 2.8059500 3.243573 3.0435010 3.675051 5.899100   100   c
        R 0.800300 0.8151005 0.952301 0.8577015 0.961201 3.467402   100  b 
 overhead 0.001300 0.0029010 0.011388 0.0122510 0.015751 0.048401   100 a

其中最后一行是从 SEXP 来回转换所产生的“开销”。至 CharacterVector (结果可以忽略不计)。显而易见，Rcpp 版本比 R 版本慢约 3.5 倍。我很好奇所以我检查了 Rcpp 的来源 is_na并没有发现性能缓慢的明显原因，我继续检查 source for anyNA for R's own character vectors's并使用 R 的 C API 思想重新实现该功能以加快速度

// Added after SEXP overhead(SEXP x){ --- }
inline bool anyNA2(SEXP x){
  R_xlen_t n = Rf_length(x);
  for(R_xlen_t i = 0; i < n; i++){
    if(STRING_ELT(x, i) == NA_STRING)
      return true;
  }
  return false;
}
// [[Rcpp::export(rng = false)]]
SEXP any_na2(SEXP x){
  bool xx = anyNA2(x);
  return wrap(xx);
}
// [[Rcpp::export(rng = false)]]
SEXP any_na3(SEXP x){
  Function anyNA("anyNA");
  return anyNA(x);
}
/***R
microbenchmark(
  Rcpp = any_na(vec), 
  R = anyNA(vec),
  R_C_api = any_na2(vec),
  Rcpp_Function = any_na3(vec),
  overhead = overhead(vec),
  unit = "ms"
)
# Unit: milliseconds
# expr      min        lq       mean    median       uq      max neval  cld
# Rcpp 2.654901 2.8650515 3.54936501 3.2392510 3.997901 8.074201   100    d
# R 0.803701 0.8303015 1.01017200 0.9400015 1.061751 2.019902   100  b
# R_C_api 2.336402 2.4536510 3.01576302 2.7220010 3.314951 6.905101   100   c
# Rcpp_Function 0.844001 0.8862510 1.09259990 0.9597505 1.120701 3.011801   100  b
# overhead 0.001500 0.0071005 0.01459391 0.0146510 0.017651 0.101401   100 a
*/

请注意，我包含了一个简单的包装器，调用 anyNA通过 Rcpp::Function以及。再次实现 anyNA不只是一点点而是很多比基本实现慢。
所以问题变成了2折:

为什么 Rcpp 这么慢？

源自 1:如何“更改”以加快代码速度？

这些问题本身并不是很有趣，但是如果这会影响 Rcpp 实现的多个部分，这些部分可能会总体上获得显着的性能提升，这很有趣。
类(class)信息()

sessionInfo()
R version 4.0.3 (2020-10-10)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 19042)

Matrix products: default

locale:
[1] LC_COLLATE=English_Denmark.1252  LC_CTYPE=English_Denmark.1252    LC_MONETARY=English_Denmark.1252 LC_NUMERIC=C                     LC_TIME=English_Denmark.1252    

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
[1] microbenchmark_1.4-7    cmdline.arguments_0.0.1 glue_1.4.2              R6_2.5.0                Rcpp_1.0.6             

loaded via a namespace (and not attached):
 [1] codetools_0.2-18 lattice_0.20-41  mvtnorm_1.1-1    zoo_1.8-8        MASS_7.3-53      grid_4.0.3       multcomp_1.4-15  Matrix_1.2-18    sandwich_3.0-0   splines_4.0.3   
[11] TH.data_1.0-10   tools_4.0.3      survival_3.2-7   compiler_4.0.3

编辑(不仅是 Windows 问题):
我想确保这不是“Windows 问题”，因此我在运行 linux 的 Docker 容器中检查并执行了该问题。结果如下所示，非常相似

# Unit: milliseconds
#           expr    min      lq     mean  median      uq     max neval
#           Rcpp 2.3399 2.62155 4.093380 3.12495 3.92155 26.2088   100
#              R 0.7635 0.84415 1.459659 1.10350 1.42145 12.1148   100
#        R_C_api 2.3358 2.56500 3.833955 3.11075 3.65925 14.2267   100
#  Rcpp_Function 0.8163 0.96595 1.574403 1.27335 1.56730 11.9240   100
#       overhead 0.0009 0.00530 0.013330 0.01195 0.01660  0.0824   100

session 信息:

sessionInfo()
R version 4.0.2 (2020-06-22)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 20.04 LTS

Matrix products: default
BLAS/LAPACK: /usr/lib/x86_64-linux-gnu/openblas-openmp/libopenblasp-r0.3.8.so

locale:
 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=C
 [7] LC_PAPER=en_US.UTF-8       LC_NAME=C
 [9] LC_ADDRESS=C               LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] microbenchmark_1.4-7 Rcpp_1.0.5

loaded via a namespace (and not attached):
[1] compiler_4.0.2 tools_4.0.2

最佳答案

这是一个有趣的问题，但答案很简单:STRING_ELT 有两个版本。 R 内部使用的一种，或者如果您设置了 USE_RINTERNALS宏在 Rinlinedfuns.h和一个在 memory.c 中的平民.
对比两个版本，可以看到pleb版本的check比较多，完全说明了速度上的差异。
如果你真的想要速度而不关心安全，你通常可以至少比 R 少一点。

// [[Rcpp::export(rng = false)]]
bool any_na_unsafe(SEXP x) {
  SEXP* ptr = STRING_PTR(x);
  R_xlen_t n = Rf_xlength(x);
  for(R_xlen_t i=0; i<n; ++i) {
    if(ptr[i] == NA_STRING) return true;
  }
  return false;
}

长椅:

> microbenchmark(
+   R = anyNA(vec),
+   R_C_api = any_na2(vec),
+   unsafe = any_na_unsafe(vec),
+   unit = "ms"
+ )
Unit: milliseconds
    expr    min      lq     mean  median      uq     max neval
       R 0.5058 0.52830 0.553696 0.54000 0.55465  0.7758   100
 R_C_api 1.9990 2.05170 2.214136 2.06695 2.10220 12.2183   100
  unsafe 0.3170 0.33135 0.369585 0.35270 0.37730  1.2856   100

尽管这样写是不安全的，但如果您在开始时在循环之前添加一些检查，那就没问题了。

关于r - 加速 Rcpp `anyNA` 等效，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/66532459/

文章推荐： ios - Cocoa/iOS 错误处理风格，没有 NSError

文章推荐： regex - 用一个空行替换两个空行的正则表达式

文章推荐： flutter - 什么是 MaterialStateProperty？

文章推荐： android - 底部导航栏与 Jetpack Compose 中的屏幕内容重叠

r - 加速 Rcpp `anyNA` 等效
这个问题与this old question有关和 this old question . R 有很好的 wrapper-ish 函数 anyNA更快地评估 any(is.na(x)) .在 Rcpp
r - complete.cases 和 anyNA 没有在行中检测到 NA？
为什么 complete.cases 或 anyNA 都不为第 1 行返回 FALSE？根据他们的文档，他们似乎应该......我读错了文档吗？ csv csv c.100..50..100. c.1

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

r - 加速 Rcpp `anyNA` 等效