python - CUDA GPU处理: TypeError: compile_kernel() got an unexpected keyword argument 'boundscheck'-6ren

python - CUDA GPU处理: TypeError: compile_kernel() got an unexpected keyword argument 'boundscheck'

转载作者：行者123 更新时间：2023-12-03 16:45:06

26

4

今天，我开始使用CUDA和GPU处理。我找到了本教程:
https://www.geeksforgeeks.org/running-python-script-on-gpu/

不幸的是，我第一次运行gpu代码的尝试失败了:

from numba import jit, cuda 
import numpy as np 
# to measure exec time 
from timeit import default_timer as timer 

# normal function to run on cpu 
def func(a):                                 
    for i in range(10000000): 
        a[i]+= 1    

# function optimized to run on gpu 
@jit(target ="cuda")                         
def func2(a): 
    for i in range(10000000): 
        a[i]+= 1
if __name__=="__main__": 
    n = 10000000                            
    a = np.ones(n, dtype = np.float64) 
    b = np.ones(n, dtype = np.float32) 

    start = timer() 
    func(a) 
    print("without GPU:", timer()-start)     

    start = timer() 
    func2(a) 
    print("with GPU:", timer()-start)

输出:

/home/amu/anaconda3/bin/python /home/amu/PycharmProjects/gpu_processing_base/gpu_base_1.py
without GPU: 4.89985659904778
Traceback (most recent call last):
  File "/home/amu/PycharmProjects/gpu_processing_base/gpu_base_1.py", line 30, in <module>
    func2(a)
  File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/cuda/dispatcher.py", line 40, in __call__
    return self.compiled(*args, **kws)
  File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/cuda/compiler.py", line 758, in __call__
    kernel = self.specialize(*args)
  File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/cuda/compiler.py", line 769, in specialize
    kernel = self.compile(argtypes)
  File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/cuda/compiler.py", line 785, in compile
    **self.targetoptions)
  File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/core/compiler_lock.py", line 32, in _acquire_compile_lock
    return func(*args, **kwargs)
TypeError: compile_kernel() got an unexpected keyword argument 'boundscheck'

Process finished with exit code 1

我已经在pycharm的anaconda环境中安装了教程中提到的 numba和 cudatoolkit。

最佳答案

添加答案以使此答案脱离未答复的队列。

该示例中的代码已损坏。您的numba或CUDA安装没有任何问题。问题中的代码(或从其复制博客的博客)无法发出博客帖子声明的结果。

有很多方法可以将其修改为起作用。一个会是这样的:

from numba import vectorize, jit, cuda 
import numpy as np 
# to measure exec time 
from timeit import default_timer as timer 

# normal function to run on cpu 
def func(a):                                 
    for i in range(10000000): 
        a[i]+= 1    

# function optimized to run on gpu 
@vectorize(['float64(float64)'], target ="cuda")                         
def func2(x): 
    return x+1

if __name__=="__main__": 
    n = 10000000                            
    a = np.ones(n, dtype = np.float64) 

    start = timer() 
    func(a) 
    print("without GPU:", timer()-start)     

    start = timer() 
    func2(a) 
    print("with GPU:", timer()-start)

在这里， func2变成为设备编译的ufunc。然后，它将在GPU的整个输入阵列上运行。这样做是这样的:

$ python bogoexample.py 
without GPU: 4.314514834433794
with GPU: 0.21419800259172916

因此速度更快，但请记住，GPU时间包括编译GPU ufunc所需的时间

另一种选择是实际编写GPU内核。像这样:

from numba import vectorize, jit, cuda 
import numpy as np 
# to measure exec time 
from timeit import default_timer as timer 

# normal function to run on cpu 
def func(a):                                 
    for i in range(10000000): 
        a[i]+= 1    

# function optimized to run on gpu 
@vectorize(['float64(float64)'], target ="cuda")                         
def func2(x): 
    return x+1

# kernel to run on gpu
@cuda.jit
def func3(a, N):
    tid = cuda.grid(1)
    if tid < N:
        a[tid] += 1


if __name__=="__main__": 
    n = 10000000                            
    a = np.ones(n, dtype = np.float64) 

    for i in range(0,5):
         start = timer() 
         func(a) 
         print(i, " without GPU:", timer()-start)     

    for i in range(0,5):
         start = timer() 
         func2(a) 
         print(i, " with GPU ufunc:", timer()-start) 

    threadsperblock = 1024
    blockspergrid = (a.size + (threadsperblock - 1)) // threadsperblock
    for i in range(0,5):
         start = timer() 
         func3[blockspergrid, threadsperblock](a, n) 
         print(i, " with GPU kernel:", timer()-start)

像这样运行:

$ python bogoexample.py 
0  without GPU: 4.885275377891958
1  without GPU: 4.748716968111694
2  without GPU: 4.902181145735085
3  without GPU: 4.889955999329686
4  without GPU: 4.881594380363822
0  with GPU ufunc: 0.16726416163146496
1  with GPU ufunc: 0.03758022002875805
2  with GPU ufunc: 0.03580896370112896
3  with GPU ufunc: 0.03530424740165472
4  with GPU ufunc: 0.03579768259078264
0  with GPU kernel: 0.1421878095716238
1  with GPU kernel: 0.04386183246970177
2  with GPU kernel: 0.029975440353155136
3  with GPU kernel: 0.029602501541376114
4  with GPU kernel: 0.029780613258481026

在这里，您可以看到内核的运行速度比ufunc快，并且缓存(这是JIT编译函数的缓存，而不是调用的内存)大大提高了GPU上的调用速度。

关于python - CUDA GPU处理: TypeError: compile_kernel() got an unexpected keyword argument 'boundscheck' ，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/61982672/

26

4

0

文章推荐： Deno 权限 (--allow-net)

文章推荐： r - 如何在自定义包中使用 tidyselect "where"？

c - 应用程序接受 : *argument but not of the form argument* or *argument* 形式的命令行参数
例如，如果我的程序名称是 test.c 然后对于以下运行命令，argc = 2 而不是 4。 $test abc pqr* *xyz* 最佳答案尝试运行: $ echo abc pqr* *xyz*
flutter - “Positional arguments must occur before named arguments. Try moving all of the positional arguments before the named arguments”错误抖动
我正在尝试使用一个容器来显示TextField，但是该容器不喜欢我的操作顺序。这是我的代码: Widget build(BuildContext context) { return Scaffol
javascript - 未捕获的 SyntaxError : Unexpected eval or arguments in strict mode: window. gtag = (arguments) => dataLayer.push(arguments);
我有以下代码: class MetricGoogleGateway extends AMetricGateway{ constructor(id, name, token) {
javascript - this.argument 和 argument 之间的区别？
我像这样调用下面的对象方法。 new Cout( elem1 ).load( 'body' ) new COut( elem1 ).display( 'email' ) 我一次只使用一个实例。因为我一
c++ - 可变模板函数 : argument number for each argument
我正在尝试使用 C++11 中的可变参数函数模板，并通过如下代码了解了基本思想: void helper() { std::cout void helper( T&& arg ) {
javascript - "arguments"变量从哪里来 "this.callParent(arguments)"？
在学习 ExtJS 4 时，我发现在定义一个新类时，在 initComponent 中方法可以使用 this.callParent(arguments) 调用父类的构造函数. 我想知道这个 argum
swift 4 : Cannot convert value of type '(_) -> ()' to expected argument type '() -> ()' or Argument passed to call that takes no arguments
使用 XCode 9，Beta 3。Swift 4。 statsView.createButton("Button name") { [weak self] Void in //stuff st
javascript - 如果其中一个参数称为 `arguments` ，我可以获得 "arguments"对象吗？
以下代码将打印1: (function (arguments) { console.log(arguments); }(1, 2)); 实际上，arguments 对象已被覆盖。是否可以恢复函
php - 编译错误 : Cannot use positional argument after named argument
/** * @param $name * @return Response * @Route ("/afficheN/{name}",name="afficheN") */ public fu
Scala scopt : argument required() based on one or more other arguments
我习惯使用Scala scopt用于命令行选项解析。您可以选择参数是否为 .required()通过调用刚刚显示的函数。如何定义仅在定义了另一个参数时才需要的参数？例如，我有一个标志 --writ
python - 语法错误 : positional argument follows keyword argument:
所以这是我的代码: def is_valid_move(board, column): '''Returns True if and only if there is an o
python - 我该如何解决SyntaxError : positional argument follows keyword argument
我试图在这里运行此代码: threads = [threading.Thread(name='ThreadNumber{}'.format(n),target=SB, args(shoe_type,m
haskell - 输入 FP : Tuple Arguments and Curriable Arguments
在静态类型函数编程语言(例如 Standard ML、F#、OCaml 和 Haskell)中，编写函数时通常将参数彼此分开，并通过空格与函数名称分开: let add a b = a + b
javascript - 获取被调用者 Function.Arguments 之一的 Function.Arguments
function validateArguments(args) { if(args.length 2) { throw new RangeError("Invalid amo
django - 无反向匹配 : with arguments '()' and keyword arguments
我正在使用 Django 1.5 并尝试将参数传递到我的 URL。当我使用前两个参数时，下面的代码工作正常，使用第三个参数时我收到错误。我已经引用了新的 Django 1.5 更新中的 url 用法，
ember.js - emberjs : What does the . ..arguments in this._super(...arguments) 表示什么？
我刚刚开始使用 ember js 并且多次被这个功能绊倒有人可以简要介绍一下 this._super() 的使用，并解释 ...arguments 的重要性谢谢最佳答案每当您覆盖类/函数(例如
ios - 错误 : Argument passed to call that takes no arguments
这个问题在这里已经有了答案: How to fix an "Argument passed to call that takes no arguments" error? (2 个答案) 关闭 3
ios - 错误 : Argument passed to call that takes no arguments
我正在创建一个简单的登录注册应用程序。但是我遇到了错误，我不知道如何解决，请帮忙!这是我的代码: // // ViewController.swift // CHLogbook-Applicati
Swift 构造函数未出现在方法列表中， "Arguments passed to call that takes no arguments"
我是 Swift 的初学者。我尝试创建一个表示 Meal 的简单类。它有一些属性和一个返回可选的构造函数但是当我尝试测试它或在任何地方实例化它时，我得到的只是一个错误。似乎无法弄清楚发生了什么。
java - Linux 终端 : How to pass an argument to another argument
我有一个在特殊环境下运行其他程序的系统程序: cset shield -e PROGRAM .现在要运行一个 java 程序，我输入了 cset shield -e java PROGRAM ，但这不

首页

博学

6Ren·AI

商城

python - CUDA GPU处理: TypeError: compile_kernel() got an unexpected keyword argument 'boundscheck'