c++ - 在 C++ 中使用 MPI_Gatherv() 和 MPI_Datatype 到 'gather' 动态分配的二维数组-6ren

c++ - 在 C++ 中使用 MPI_Gatherv() 和 MPI_Datatype 到 'gather' 动态分配的二维数组

转载作者：行者123 更新时间：2023-11-30 00:36:33

我认为描述问题的最简单方法是使用简单的代码。在每个处理器上，我都动态分配了“2D 数组”(通过 new*[rows]、new[cols] 形式实现，请参阅下面的代码进行说明)。无论对错，我都在尝试使用已提交的 MPI_Datatype 来帮助我执行 MPI_Gatherv() 以将所有数组收集到根处理器上的单个二维数组中。

这是代码，在它下面我突出显示了它的要点(如果编译和运行它应该很容易理解 - 它询问你想要的数组的维度):

#include <iostream>
#include <string>
#include <cmath>
#include <cstdlib>
#include <time.h>

#include "mpi.h" 


using namespace std;

// A function that prints out the 2D arrays to the terminal.
void print_2Darray(int **array_in,int dim_rows, int dim_cols) {
    cout << endl;
    for (int i=0;i<dim_rows;i++) {
        for (int j=0;j<dim_cols;j++) {
            cout << array_in[i][j] << " ";
            if (j==(dim_cols-1)) {
                cout << endl;
            }
        }
    }
    cout << endl;
}


int main(int argc, char *argv[]) {

    MPI::Init(argc, argv);


    // Typical MPI incantations...

    int size, rank;

    size = MPI::COMM_WORLD.Get_size(); 
    rank = MPI::COMM_WORLD.Get_rank();

    cout << "size = " << size << endl;
    cout << "rank = " << rank << endl;

    sleep(1);

    // Dynamically allocate a 2D square array of user-defined size 'dim'.

    int dim;
    if (rank == 0) {
        cout << "Please enter dimensions of 2D array ( dim x dim array ): ";
        cin >> dim;
        cout << "dim = " << dim << endl;
    }   

    MPI_Bcast(&dim,1,MPI_INT,0,MPI_COMM_WORLD);

    int **array2D;
    array2D = new int*[dim];
    for (int i=0; i<dim; i++) {
        array2D[i] = new int[dim](); // the extra '()' initializes to zero.
    }

    // Fill the arrays with i*j+rank where i and j are the indices.
    for (int i=0;i<dim;i++) {
        for (int j=0;j<dim;j++) {
            array2D[i][j] = i*j + rank;
        }
    }

    // Print out the arrays.
    print_2Darray(array2D,dim,dim);

    // Commit a MPI_Datatype for these arrays.
    MPI_Datatype MPI_ARRAYROW;
    MPI_Type_contiguous(dim, MPI_INT, &MPI_ARRAYROW);
    MPI_Type_commit(&MPI_ARRAYROW);

    // Declare 'all_array2D[][]' which will contain array2D[][] from all procs.
    int **all_array2D;
    all_array2D = new int*[size*dim];
    for (int i=0; i<size*dim; i++) {
        all_array2D[i] = new int[dim]();  // the extra '()' initializes to zero.
    }

    // Print out the arrays.
    print_2Darray(all_array2D,size*dim,dim);


    // Displacement vector for MPI_Gatherv() call.
    int *displace;
    displace = (int *)calloc(size,sizeof(int));
    int *dim_list;
    dim_list = (int *)calloc(size,sizeof(int));
    int j = 0;
    for (int i=0; i<size; i++) {
        displace[i] = j;
        cout << "displace[" << i << "] = " << displace[i] << endl;
        j += dim;
        dim_list[i] = dim;
    }

    // MPI_Gatherv call.
    MPI_Barrier(MPI_COMM_WORLD);
    MPI_Gatherv(array2D,dim,MPI_ARRAYROW,all_array2D,&dim_list[rank],&displace[rank],MPI_ARRAYROW,0,MPI_COMM_WORLD);

    // Print out the arrays.
    print_2Darray(all_array2D,size*dim,dim);

    MPI::Finalize();

    return 0;
}

代码可以编译，但会遇到段错误(我使用“mpic++”编译并使用“mpirun -np 2”来使用 2 个处理器):

[unknown-78-ca-39-b4-09-4f:02306] *** Process received signal ***
[unknown-78-ca-39-b4-09-4f:02306] Signal: Segmentation fault (11)
[unknown-78-ca-39-b4-09-4f:02306] Signal code: Address not mapped (1)
[unknown-78-ca-39-b4-09-4f:02306] Failing at address: 0x0
[unknown-78-ca-39-b4-09-4f:02306] [ 0] 2   libSystem.B.dylib                   0x00007fff844021ba _sigtramp + 26
[unknown-78-ca-39-b4-09-4f:02306] [ 1] 3   ???                                 0x0000000000000001 0x0 + 1
[unknown-78-ca-39-b4-09-4f:02306] [ 2] 4   gatherv2Darrays.x                   0x00000001000010c2 main + 1106
[unknown-78-ca-39-b4-09-4f:02306] [ 3] 5   gatherv2Darrays.x                   0x0000000100000a98 start + 52
[unknown-78-ca-39-b4-09-4f:02306] *** End of error message ***
mpirun noticed that job rank 0 with PID 2306 on node unknown-78-ca-39-b4-09-4f.home exited on signal 11 (Segmentation fault). 
1 additional process aborted (not shown)

在代码末尾附近执行“print_2Darray(all_array2D,size*dim,dim)”函数时发生段错误，其中“all_array2D”“应该”包含收集的数组。更具体地说，代码似乎为从主处理器收集的位打印“all_array2D”OK，但当 print_2Darray() 函数开始处理来自其他处理器的位时，会给出段错误。

代码要点:

我声明了一个 MPI_Datatype，它是一个连续的内存块，其大小足以存储二维数组的一行。然后，我使用 MPI_Gatherv() 尝试收集这些行。
代码的 sleep(1) 调用只是为了帮助用户更清楚地看到“dims”提示，否则它会被埋在“size”和“rank”cout 之间。
二维数组的元素被初始化为值“i*j + rank”，其中 i 和 j 分别是行和列索引。我的理由是，生成的数字很容易泄露生成该数组的处理器的等级。

我想这归结为我不知道 MPI_Gatherv() 动态分配数组的正确方式......我应该使用 MPI_Datatypes 吗？动态分配数组对我来说非常重要。

如果有任何帮助/建议，我将不胜感激!我几乎没有想法了!

最佳答案

MPI_Gatherv、MPI_Scatterv，实际上所有其他采用数组参数的 MPI 通信调用，都希望数组元素在内存中连续排列。这意味着在调用 MPI_Gatherv(array2D, dim, MPI_ARRAYROW, ...) 时，MPI 期望 MPI_ARRAYROW 类型的第一个元素从 的内存位置开始>array2D 指向，第二个元素开始于(BYTE*)array2D + extent_of(MPI_ARRAYROW)，第三个元素开始于(BYTE*)array2D + 2*extent_of( MPI_ARRAYROW)，等等。这里的extent_of()是MPI_ARRAYROW类型的范围，可以通过调用MPI_Type_get_extent获取。

很明显，二维数组的行在内存中不是连续的，因为它们中的每一行都是通过单独调用 new 运算符分配的。此外，array2D 不是指向数据的指针，而是指向指向每一行的指针的 vector 的指针。这在 MPI 中不起作用，StackOverflow 上有无数其他问题，其中讨论了这个事实 - 只需搜索 MPI 2D 并亲自查看。

解决方案是使用一大块单独分配的内存块和伴随的掺杂 vector - 参见 this question以及答案中提到的 arralloc() 函数。

关于c++ - 在 C++ 中使用 MPI_Gatherv() 和 MPI_Datatype 到 'gather' 动态分配的二维数组，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/15800720/

文章推荐： c++ - 抽象类作为参数(接口(interface)案例)

文章推荐： php - Zend框架教程——连接数据库的问题

文章推荐： php - 使用php语句更新MySQL数据库

文章推荐： c++ - 使用 unordered_map 时编译时错误

解析Pytorch中的torch.gather()函数
参数说明以官方说明为例，gather()函数需要三个参数，输入input，维度dim，以及索引index input必须为Tensor类型 dim为int类型，代表从哪个维度进行索引 in
r - dplyr:gather 中的两个键
我知道如何在 melt 中使用两个 id.vars .这很简单: x = data.frame(subject = c("John", "Mary"), time = c
r - 无法将 "gather"输出的列名称更改为默认名称以外的任何名称
我正在尝试使用 gather在 tidyr包，但我无法从默认名称更改输出的列名称。例如: df = data.frame(time = 1:100,a = 1:100,b = 101:200) df.
python - 带有生成器表达式的 asyncio.gather
为什么 asyncio.gather 不适用于生成器表达式？ import asyncio async def func(): await asyncio.sleep(2) # Works a
R:使用 Gather() 来整理具有两个列标题的数据
我想整理一些不幸的是在前两行中设置了两个列标题的数据: 第一行(标题):实际上是度量的类型(例如。估计、标准误差、上限、下限)。第二行(也是标题):是度量的年份。有什么方法可以使用gather()
NuGet "Gather Dependencies"挂起
当我添加 NuGet 包(最新版本的 NuGet 和 Visual Studio 2015)时，它在安装包之前在“尝试收集依赖项”处挂起大约 5 分钟。我可以指向 NuGet.org、我们的内部服务器
r - 在melt/gather 中为新列指定类
我想在 melt 中指定输出列的类别(或 gather)。我想为所有列和不同的类做这件事。例如，我有一些数据: example example day max min 1 1 20
R tidyr gather() 基于查找的两组列
我有一个按地区进行满意度调查的结果数据集。调查中的每个问题都采用 4 分制评分(从非常满意到非常不满意)。数据集中的每一行都包含给定“财政年度”结束时给定区域中给定问题的汇总结果。它还包含每个级别的受
r - 键排序与使用 gather() 的原始列排序
键排序是否取决于我是否首先列出要收集的列与不收集的列？这是我的数据框: library(tidyr) wide_df <- data.frame(c("a", "b"), c("oh", "ah")
python - 在超时中包装 asyncio.gather
我见过asyncio.gather vs asyncio.wait ，但不确定这是否解决了这个特定问题。我想做的是将 asyncio.gather() 协程包装在 asyncio.wait_for()
c++ - AVX2 Gather 指令使用细节
我正在尝试了解 AVX2 intel intrinsic 的收集功能。根据官方文档Link ，函数定义为， __m256i _mm256_i32gather_epi32 (int const* ba
c - MPI Gather 仅从根进程收集
首先，我一直在使用 this code作为引用，它显示了不使用 MPI_Scatter 的 MPI_Gather 的使用，因为这就是我在这里想要实现的目标。我已经为此工作了很长时间，只是无法弄清楚这个
c - MPI Gather 没有按预期合并数组
我正在使用 MPI 开发 mandelbrot 生成器，它在完成时输出 PPM 文件。我使用 MPI gather 将计算结果 block 收集到最终数组中。代码生成文件但不完整；仅显示图片的上半部分
r - 在 gather 函数中使用变量
我正在使用 R 将宽格式数据表转换为长格式。它有效，除了必须为新列使用变量: library(readr) library(tidyr) files <- Sys.glob("sources/*.cs
python - 使用 asyncio.gather 不会引发内部异常
使用 Python 3.7，我试图捕获异常并通过 following an example I found on StackOverflow 重新引发它.虽然该示例确实有效，但它似乎并不适用于所有情况
r 使用 dplyr 'gather' 函数
我有一个数据框，看起来像下面“输入”中显示的图片。我尝试每行获取 1 个日期(请参见下面“所需输出”中的图片)。换句话说，我尝试为每一行做一种“转置”。让我们规定组合 'LC' 和 'Prod'
python - 使用索引张量和 tf.gather 对张量进行切片
我正在尝试使用索引张量对张量进行切片。为此，我尝试使用 tf.gather . 但是，我很难理解 documentation并且不要让它像我期望的那样工作: 我有两个张量。安 activations形
r - Gather() 列出列到 R 中的行
我想 gather() 列出列以在我的数据框中创建新行。我正在使用 repurrrsive 包中的《权力的游戏》数据集。下面是我设置问题的代码: library(tidyverse) got_char
python - 如何在 asyncio.gather 中使用条件逻辑？
我想有条件地运行异步函数，如下所示: one, two, three = await asyncio.gather( some_async_method1(), some_async_
python - 带轴参数的 Tensorflow tf.gather
我正在使用tensorflow的tf.gather从多维数组中获取元素，如下所示: import tensorflow as tf indices = tf.constant([0, 1, 1]) x

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

c++ - 在 C++ 中使用 MPI_Gatherv() 和 MPI_Datatype 到 'gather' 动态分配的二维数组