c++ - How to implement thrust::transform with a custom functor that skips part of a device_vector?

Reposted by: 行者123  Updated: 2023-12-05 07:21:07

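I am working on a project (essentially a physics simulation) in which I need to perform computations on a large number of nodes over many time steps. I currently implement each type of computation by writing a custom functor that is called from thrust::transform.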

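As a minimal example (in pseudocode), suppose I have some data that all share a common structure but can be split into different types (A, B, and C); for example, every item has a double value.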

I therefore store this data in a single device_vector, as follows:

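#include <thrust/device_vector.h>

struct Data {
    thrust::device_vector<double> values;
    // Begin/end indices of each type; the end indices are used as
    // exclusive bounds in the code below.
    unsigned values_begin_A, values_end_A;
    unsigned values_begin_B, values_end_B;
    unsigned values_begin_C, values_end_C;
};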

Type A occupies the first part of the vector, followed by type B, then type C. To keep track, I store the begin/end index of each type.

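Different types of data need to be acted on by different functors (for example, functor 1 applies to types A and B; functor 2 applies to A, B, and C; functor 3 applies to A and C). Each functor needs access to the index of the value in the vector, supplied by a counting_iterator, and stores its result in a separate vector.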
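To see why only the third functor is awkward: A+B and A+B+C are contiguous in the vector, so each can be covered by one iterator range, whereas A+C leaves a gap where B sits. The host-only C++ sketch below (no Thrust involved) merges adjacent ranges into contiguous spans. `Range` and `merge_adjacent` are hypothetical names introduced for illustration, and the end indices are assumed to be exclusive.

```cpp
#include <cassert>
#include <vector>

// Half-open index range [begin, end) inside the shared values vector.
// Hypothetical type for illustration; end is assumed to be exclusive.
struct Range {
    unsigned begin;
    unsigned end;
};

// Merge ranges that touch (one ends exactly where the next begins)
// into contiguous spans. Assumes the input is sorted by begin and
// that the ranges do not overlap.
std::vector<Range> merge_adjacent(const std::vector<Range>& selected) {
    std::vector<Range> spans;
    for (const Range& r : selected) {
        // Precondition check: sorted and non-overlapping input.
        assert(spans.empty() || spans.back().end <= r.begin);
        if (!spans.empty() && spans.back().end == r.begin) {
            spans.back().end = r.end;   // extend the current span
        } else {
            spans.push_back(r);         // start a new span
        }
    }
    return spans;
}
```

With this helper, selecting A and B yields one span, while selecting A and C yields two, which is the case the question is about.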

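struct my_functor : public thrust::unary_function< thrust::tuple<unsigned, double>, double > {

    __host__ __device__
    double operator() (const thrust::tuple<unsigned, double>& index_value) const {
        const unsigned index = thrust::get<0>(index_value);
        const double value = thrust::get<1>(index_value);

        // Do something with the index and value.
        // (Placeholder: the actual computation is not given in the question.)
        double result = value;

        return result;
    }
};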

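My problem is that I don't know the best way to implement the last functor, which acts on type A and C values while skipping B. In particular, I am looking for a Thrust-friendly solution that scales reasonably as I add more node types and more functors (acting on combinations of old and new types), while still getting the benefits of parallelization.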

I have come up with four options:

Option 1:

Make one transform call per data type, e.g.

void Option_One(thrust::device_vector<double>& result) {
    // Multiple transform calls.

    thrust::counting_iterator<unsigned> index(0);
    auto first = thrust::make_zip_iterator(thrust::make_tuple(index, values.begin()));

    // Apply functor to 'A' values.
    thrust::transform(
        first + values_begin_A,
        first + values_end_A,
        result.begin() + values_begin_A,
        my_functor());

    // Apply functor to 'C' values.
    thrust::transform(
        first + values_begin_C,
        first + values_end_C,
        result.begin() + values_begin_C,
        my_functor());
}

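This looks fairly simple, but it costs efficiency, because I give up the ability to compute A and C in parallel.

Option 2:

Copy the values into a temporary vector, call transform on the temporary vector, then copy the temporary results back into result. This seems like a lot of copying back and forth, but it needs only a single transform call for A and C together.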

void Option_Two(thrust::device_vector<double>& result) {

    const unsigned size_A = values_end_A - values_begin_A;
    const unsigned size_C = values_end_C - values_begin_C;

    // Copy 'A' and 'C' values into temporary vector
    thrust::device_vector<double> temp_values_A_and_C(size_A + size_C);
    thrust::copy(values.begin() + values_begin_A, values.begin() + values_end_A, temp_values_A_and_C.begin());
    thrust::copy(values.begin() + values_begin_C, values.begin() + values_end_C, temp_values_A_and_C.begin() + size_A);

    // Store results in temporary vector.
    thrust::device_vector<double> temp_results_A_and_C(size_A + size_C);

    // Note: index now counts positions in the temporary vector,
    // so 'C' values no longer see their original indices.
    thrust::counting_iterator<unsigned> index(0);
    auto first = thrust::make_zip_iterator(thrust::make_tuple(index, temp_values_A_and_C.begin()));

    thrust::transform(first, first + size_A + size_C, temp_results_A_and_C.begin(), my_functor());

    // Copy temp results back into result
    // ....
}

Option 3:

Call transform on all values, but change the functor to check the index and act only on indices within the A or C ranges.

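struct my_functor_with_index_checking : public thrust::unary_function< thrust::tuple<unsigned, double>, double > {

    // Range bounds are copied into the functor so they are available on the device.
    unsigned values_begin_A, values_end_A;
    unsigned values_begin_C, values_end_C;

    __host__ __device__
    my_functor_with_index_checking(unsigned begin_A, unsigned end_A, unsigned begin_C, unsigned end_C)
        : values_begin_A(begin_A), values_end_A(end_A), values_begin_C(begin_C), values_end_C(end_C) {}

    __host__ __device__
    double operator() (const thrust::tuple<unsigned, double>& index_value) const {
        const unsigned index = thrust::get<0>(index_value);
        const double value = thrust::get<1>(index_value);

        // End indices are exclusive, matching the iterator arithmetic in Option 1.
        if ( (index >= values_begin_A && index < values_end_A) ||
             (index >= values_begin_C && index < values_end_C) ) {

            // Do something with the index and value.
            double result = value; // placeholder computation
            return result;
        }
        else {
            // Do nothing.
            return 0; // Result is 0 by default.
        }
    }
};

void Option_Three(thrust::device_vector<double>& result) {

    thrust::counting_iterator<unsigned> index(0);
    auto first = thrust::make_zip_iterator(thrust::make_tuple(index, values.begin()));

    // Apply functor to all values, but check index inside functor.
    thrust::transform(
        first,
        first + values.size(),
        result.begin(),
        my_functor_with_index_checking(values_begin_A, values_end_A, values_begin_C, values_end_C));
}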
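The index check in Option 3 is easy to get off by one. Assuming the end indices are exclusive (as the iterator arithmetic in Option 1 suggests), the test needs a strict `<` on the upper bound, otherwise the first B element would be treated as part of A. A minimal host-only sketch, with `in_half_open` and `in_A_or_C` as hypothetical helper names:

```cpp
#include <cassert>

// True when index lies in the half-open range [begin, end):
// begin is included, end is excluded.
bool in_half_open(unsigned index, unsigned begin, unsigned end) {
    assert(begin <= end);
    return index >= begin && index < end;
}

// True when index lies in either the A range or the C range.
// The parameter names mirror the members used in the question.
bool in_A_or_C(unsigned index,
               unsigned begin_A, unsigned end_A,
               unsigned begin_C, unsigned end_C) {
    return in_half_open(index, begin_A, end_A) ||
           in_half_open(index, begin_C, end_C);
}
```

For a layout such as A = [0, 4), B = [4, 7), C = [7, 10), index 4 belongs to B and is rejected, while index 7 is the first C element and is accepted.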

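Option 4:

The last option I came up with is to create a custom iterator based on counting_iterator that counts normally through the A range but jumps to the beginning of C once it reaches the end of A. This seems like an elegant solution, but I don't know how to do it.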

void Option_Four(thrust::device_vector<double>& result) {

    // Create my own version of a counting iterator
    // that skips from the end of 'A' to the beginning of 'C'
    // I don't know how to do this!
    FancyCountingIterator fancyIndex(0); 

    thrust::transform( 
        thrust::make_zip_iterator(thrust::make_tuple(fancyIndex, values.begin())),
        thrust::make_zip_iterator(thrust::make_tuple(fancyIndex, values.begin())) + values.size(),
        result.begin(),
        my_functor());
}

Best answer

Use a permutation_iterator together with a custom transform_iterator (the fancy iterator you are looking for).

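Data d; // assuming this has values; result is the output vector from the question.
// Copy the scalars the lambda needs; a device lambda must not reference host variables.
const unsigned A_begin = d.values_begin_A;
const unsigned C_begin = d.values_begin_C;
const unsigned A_size = d.values_end_A - d.values_begin_A;
const unsigned C_size = d.values_end_C - d.values_begin_C;

// Maps a compact index in [0, A_size + C_size) to an index in the A or C range.
// Extended lambdas require nvcc's --extended-lambda flag; __host__ __device__
// lets host code see the lambda's return type.
auto A_C_index_iter = thrust::make_transform_iterator(
    thrust::make_counting_iterator(0u),
    [=] __host__ __device__ (unsigned i) -> unsigned {
        return (i < A_size) ? (i + A_begin) : (i - A_size + C_begin);
    });
auto permuted_input_iter = thrust::make_permutation_iterator(d.values.begin(), A_C_index_iter);
auto permuted_output_iter = thrust::make_permutation_iterator(result.begin(), A_C_index_iter);

// Zip the original index with the value so my_functor receives (index, value).
auto first = thrust::make_zip_iterator(thrust::make_tuple(A_C_index_iter, permuted_input_iter));
thrust::transform(first, first + A_size + C_size, permuted_output_iter, my_functor());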

This exposes the full parallelism: a single transform call over A_size + C_size elements.
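The question also asks how this scales to more node types. One possible generalization, sketched here in host-only C++ rather than Thrust, replaces the two-branch index arithmetic with a lookup over any number of selected ranges: store the running element count before each range, then binary-search it. `Range` and `CompactIndexMap` are hypothetical names, the end indices are assumed to be exclusive, and this is an illustration of the index mapping only, not the answer author's code.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Half-open index range [begin, end) in the source vector (hypothetical type).
struct Range {
    unsigned begin;
    unsigned end;
};

// Maps a compact index in [0, total()) onto the selected ranges, in order.
// Hypothetical helper, not part of Thrust.
class CompactIndexMap {
public:
    explicit CompactIndexMap(const std::vector<Range>& ranges) : ranges_(ranges) {
        // offsets_[k] = number of selected elements that precede ranges_[k].
        unsigned running = 0;
        for (const Range& r : ranges_) {
            assert(r.begin <= r.end);
            offsets_.push_back(running);
            running += r.end - r.begin;
        }
        total_ = running;
    }

    unsigned total() const { return total_; }

    // Source index for compact index i; requires i < total().
    unsigned operator()(unsigned i) const {
        assert(i < total_);
        // Find the last range whose starting offset is <= i.
        auto it = std::upper_bound(offsets_.begin(), offsets_.end(), i);
        const std::size_t k = static_cast<std::size_t>(it - offsets_.begin()) - 1;
        return ranges_[k].begin + (i - offsets_[k]);
    }

private:
    std::vector<Range> ranges_;
    std::vector<unsigned> offsets_;
    unsigned total_ = 0;
};
```

For A = [0, 4) and C = [7, 10), compact indices 0 to 3 map to 0 to 3 and compact indices 4 to 6 map to 7 to 9, the same result as the two-branch lambda. Using such a lookup inside a device functor would additionally require the range and offset arrays to live in device memory, which this sketch does not cover.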

Regarding "c++ - How to implement thrust::transform with a custom functor that skips part of a device_vector?", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/57085412/
