c++ - Python + alglib + NumPy: how to avoid converting arrays to lists?

Reposted by: 太空狗 · Updated: 2023-10-29 20:21:59

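Context: I recently discovered the alglib library (for numerical computation), which seems to be what I was looking for (robust interpolation, data analysis, ...) and could not find in numpy or scipy.

However, I am concerned that (for interpolation, for example) it does not accept numpy arrays as a valid input format, only regular Python list objects.

Problem: I dug into the code and documentation and found (as expected) that this list format is only a transitional step, since the library converts it to ctypes anyway (the CPython library is just an interface to the underlying C/C++ library).

This is where my concern comes from: my code works with numpy arrays because they greatly speed up the scientific computations I run on them. I therefore fear that having to convert any data passed to alglib routines into lists (which are then converted to ctypes) will have a huge impact on performance (I am working with arrays that may hold hundreds of thousands of floats, and with thousands of such arrays).

Question: do you think I will indeed see a performance loss, or should I start modifying the alglib code (only the Python interface) so that it accepts numpy arrays and performs a single conversion (from numpy arrays to ctypes)? I do not even know whether this is feasible, since it is a fairly large library... Maybe you have better ideas or suggestions (even about similar but different libraries)...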
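As a rough illustration of the "single conversion" idea, NumPy can already expose an array's buffer to ctypes without building a Python list first. The sketch below uses only NumPy and ctypes (no alglib); the helper name `as_c_doubles` is hypothetical, and whether alglib's Python interface can be made to accept such a pointer is a separate question.

```python
import ctypes
import numpy as np

def as_c_doubles(arr):
    """Return (pointer, array) for a float64, C-contiguous view of arr.

    Hypothetical helper for illustration only. The returned array must be
    kept alive for as long as the pointer is in use.
    """
    arr = np.ascontiguousarray(arr, dtype=np.float64)  # copies only if needed
    ptr = arr.ctypes.data_as(ctypes.POINTER(ctypes.c_double))
    return ptr, arr

x = np.linspace(0.0, 1.0, 5)
ptr, keep_alive = as_c_doubles(x)

# The pointer reads the memory that NumPy owns: no list, no copy.
first_three = [ptr[i] for i in range(3)]

# Writing through the pointer is visible from the NumPy side as well.
ptr[0] = 42.0
```

Because the pointer aliases the array's memory, the array has to outlive every use of the pointer, which is why the helper returns both.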

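Edit

It seems my question did not attract much interest, or it is unclear or not relevant. Or perhaps nobody has a solution or advice, though I doubt that with so many experts around :) Anyway, I have written a small, quick and dirty test script to illustrate the problem...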

#!/usr/bin/env python

import xalglib as al
import timeit
import numpy as np

def func(x):
    return (3.14 *x**2.3 + x**3 -x**2.34 +x)/(1.+x)**2

def fa(x, y, val=3.14):
    s = al.spline1dbuildakima(x, y)
    return (al.spline1dcalc(s, val), func(val))

def fb(x, y, val=3.14):
    _x = list(x)
    _y = list(y)
    s = al.spline1dbuildakima(_x, _y)
    return (al.spline1dcalc(s, val), func(val))

ntot = 10000
maxi = 100
x = np.random.uniform(high=maxi, size=ntot)
y = func(x)
xl = list(x)
yl = list(y)

print "Test for len(x)=%d, and x between [0 and %.2f):" % (ntot, maxi)
print "Function: (3.14 *x**2.3 + x**3 -x**2.34 +x)/(1.+x)**2"
a, b = fa(xl, yl)
err = np.abs(a-b)/b * 100
print "(x=3.14) interpolated, exact =", (a, b)
print "(x=3.14) relative error should be <= 1e-2: %s (=%.2e)" % ((err <= 1e-2), err)

if __name__ == "__main__":
    t = timeit.Timer(stmt="fa(xl, yl)", setup="from __main__ import fa, xl, yl, func")
    tt = timeit.Timer(stmt="fb(x, y)", setup="from __main__ import fb, x, y, func")
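    # timeit returns total seconds, so 1000 * total / 100 is milliseconds per
    # pass, although the label printed below says "usec".
    v = 1000 * t.timeit(number=100)/100
    vv = 1000 * tt.timeit(number=100)/100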
    print "%.2f usec/pass" % v
    print "%.2f usec/pass" % vv
    print "%.2f %% less performant using numpy arrays" % ((vv-v)/v*100.)

Running it, I get:

"""
Test for len(x)=10000, and x between [0 and 100.00):
Function: (3.14 *x**2.3 + x**3 -x**2.34 +x)/(1.+x)**2
(x=3.14) interpolated, exact = (3.686727834705164, 3.6867278531266905)
(x=3.14) relative error should be <= 1e-2: True (=5.00e-07)
25.85 usec/pass
28.46 usec/pass
10.09 % less performant using numpy arrays
"""

The performance loss fluctuates between 8% and 14%, which is huge for me...
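To see how much of that gap comes from the conversion itself, the list-building step can be timed in isolation, without alglib. The sketch below compares `list(x)` with `x.tolist()`, a standard NumPy method that returns built-in Python floats and is usually the faster of the two. No timings are quoted here because they depend on the machine and the NumPy version.

```python
import timeit
import numpy as np

x = np.random.uniform(high=100, size=10000)

def via_list(arr):
    # Iterates the array, wrapping each element in a numpy.float64 scalar.
    return list(arr)

def via_tolist(arr):
    # Converts inside NumPy and returns built-in Python floats.
    return arr.tolist()

n = 100
t_list = timeit.timeit(lambda: via_list(x), number=n) / n
t_tolist = timeit.timeit(lambda: via_tolist(x), number=n) / n

# timeit returns seconds, so multiply by 1e3 for milliseconds per call.
print("list(x):    %.3f ms/call" % (1e3 * t_list))
print("x.tolist(): %.3f ms/call" % (1e3 * t_tolist))
```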

Best answer

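You can write your own wrapper function that passes the numpy array's data buffer directly to the vector's data pointer. This avoids copying the data and can speed up the wrapper considerably. The code below passes x.ctypes.data to x_vector.ptr.p_ptr, where x is a numpy array.

When you pass numpy arrays, you must make sure their elements are stored in contiguous memory. The code below does not check this.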

import xalglib as al
import numpy as np
import ctypes

def spline1dbuildakima(x, y):
    n = len(x)
    _error_msg = ctypes.c_char_p(0)
    __c = ctypes.c_void_p(0)
    __n = al.c_ptrint_t(n)
    __x = al.x_vector(cnt=n, datatype=al.DT_REAL, owner=al.OWN_CALLER, 
                      last_action=0,ptr=al.x_multiptr(p_ptr=x.ctypes.data))
    __y = al.x_vector(cnt=n, datatype=al.DT_REAL, owner=al.OWN_CALLER, 
                      last_action=0,ptr=al.x_multiptr(p_ptr=y.ctypes.data))

    al._lib_alglib.alglib_spline1dbuildakima(
        ctypes.byref(_error_msg), 
        ctypes.byref(__x), 
        ctypes.byref(__y), 
        ctypes.byref(__n), 
        ctypes.byref(__c))

    __r__c = al.spline1dinterpolant(__c)
    return __r__c    

def func(x):
    return (3.14 *x**2.3 + x**3 -x**2.34 +x)/(1.+x)**2

def fa(x, y, val=3.14):
    s = spline1dbuildakima(x, y)
    return al.spline1dcalc(s, val), func(val)

def fb(x, y, val=3.14):
    s = al.spline1dbuildakima(x, y)
    return al.spline1dcalc(s, val), func(val)

ntot = 10000
maxi = 100
x = np.random.uniform(high=maxi, size=ntot)
y = func(x)
xl = list(x)
yl = list(y)

import time
start = time.clock()
for i in xrange(100):
    a, b = fa(x, y)
print time.clock()-start
err = np.abs(a-b)/b * 100
print a, b, err

start = time.clock()
for i in xrange(100):
    a, b = fb(xl, yl)
print time.clock()-start
err = np.abs(a-b)/b * 100
print a, b, err

The output is:

0.722314760822 <- seconds of numpy array version
3.68672728107 3.68672785313 1.55166878281e-05
3.22011891502  <- seconds of list version
3.68672728107 3.68672785313 1.55166878281e-05
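
The wrapper above trusts that `x` and `y` are contiguous `float64` arrays. A hedged sketch of a guard that could run before taking `x.ctypes.data` is shown below. `checked_buffer` is a hypothetical helper name, and the example uses only NumPy, so it does not call alglib.

```python
import numpy as np

def checked_buffer(arr):
    """Return a 1-D, C-contiguous float64 array suitable for raw-pointer use.

    Hypothetical helper: arrays that already qualify are returned without
    copying; anything else (strided views, other dtypes, lists) is copied.
    """
    out = np.ascontiguousarray(arr, dtype=np.float64)
    if out.ndim != 1:
        raise ValueError("expected a 1-D array, got %d dimensions" % out.ndim)
    return out

base = np.arange(10, dtype=np.float64)
strided = base[::2]                     # every other element: not contiguous

safe_base = checked_buffer(base)        # already fine, shares memory with base
safe_strided = checked_buffer(strided)  # copied into a contiguous block

print(strided.flags['C_CONTIGUOUS'], safe_strided.flags['C_CONTIGUOUS'])
```

The checked array, rather than the original, would then supply the `.ctypes.data` pointer, and it should stay referenced until the library call returns.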

Regarding "c++ - Python + alglib + NumPy: how to avoid converting arrays to lists?", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/9448915/
