python - 如何使用 mpi4py 收集长度不等的数组-6ren

python - 如何使用 mpi4py 收集长度不等的数组

转载作者：行者123 更新时间：2023-12-01 12:27:20

期望的行为:

我正在尝试在不同节点上获取许多不同长度的列表，将它们收集到一个节点中，然后让该主节点将它们放在一组中。此列表名为 rout_array在每个节点。注意 rout_array 中的元素只是整数，并且在节点之间不唯一。

错误:

Traceback (most recent call last):
  File "prout.py", line 160, in <module>
    main()
  File "prout.py", line 153, in main
    num = DetermineRoutingNumber(steps, goal, vertexSetSize)
  File "prout.py", line 129, in DetermineRoutingNumber
    comm.Gather(send_buffer, recv_buffer, root = 0)

  File "MPI\Comm.pyx", line 589, in mpi4py.MPI.Comm.Gather (c:\projects\mpi4py\src\mpi4py.MPI.c:97806)
  File "MPI\msgbuffer.pxi", line 525, in mpi4py.MPI._p_msg_cco.for_gather (c:\projects\mpi4py\src\mpi4py.MPI.c:34678)
  File "MPI\msgbuffer.pxi", line 446, in mpi4py.MPI._p_msg_cco.for_cco_send (c:\projects\mpi4py\src\mpi4py.MPI.c:33938)
  File "MPI\msgbuffer.pxi", line 148, in mpi4py.MPI.message_simple (c:\projects\mpi4py\src\mpi4py.MPI.c:30349)
  File "MPI\msgbuffer.pxi", line 93, in mpi4py.MPI.message_basic (c:\projects\mpi4py\src\mpi4py.MPI.c:29448)

  KeyError: 'O'

我不知道如何得到 'O' 的 KeyError当我的代码中没有字符串时。所有列表都包含整数，numpy 数组包含整数，这里唯一事件的字典只有整数作为键。需要注意的是，每个节点都会输出这个错误。

代码:

import numpy, math
from mpi4py import MPI
from sympy.combinatorics import Permutation as Perm     

def GetEdges(size,file):
    """This function takes in a file of edges in a graph in the form 'u,v'
    without quotes, where u and v are vertices of the graph. It then
    generates a permutation that swaps those vertices, and returns these
    transpositions."""

    edgeFile = open(file, "r")
    edges = []
    for line in edgeFile:
        line = line.strip()
        line = line.split(",")
        for vertex in line:
            line[line.index(vertex)] = int(vertex)
        edges.append(Perm([line], size = size))

    edgeFile.close()
    edges.append(Perm([[size - 1]], size = size))

    return edges


def AreDisjoint(p1,p2):
    """This function determines whether or not two permutations move any
    common elements, and returns the appropriate boolean."""
    v1 = set(p1.support())
    v2 = set(p2.support())

    return len(v1 & v2) == 0


def GetMatchings(edges, maxMatching, size):
    """This function takes in a set of edges given by GetEdges(), and 
    generates all possible matchings in the given graph. It then converts
    each matching into its rank given by lexicographical order, and appends
    that rank to a set, which is then returned."""

    stepDict = {1:set(edges)}
    steps = set(edges)
    for i in range(1,maxMatching):
        temp = set()
        for p1 in stepDict[1]:
            for p2 in stepDict[i]:
                newPerm = p1 * p2
                if AreDisjoint(p1,p2) and newPerm not in steps:
                    temp.add(newPerm)
                    steps.add(newPerm)

        stepDict[i+1] = temp

    newSteps = set()
    for step in steps:
        newSteps.add(step.rank())
    return newSteps


def FromRank(rank,level):
    """This function takes in a rank and size of a permutation, then returns
    the permutation that lies at the rank according to lexicographical 
    ordering. """

    lst = list(range(level + 1))
    perm = []
    while lst:
        fact = math.factorial(len(lst) - 1)
        index, rank = divmod(rank, fact)
        perm.append(lst.pop(index))
    assert rank == 0 
    return perm


def SplitArrayBetweenNodes(rank, rem, length):
    """This function takes in the rank of a node and any remainder after
    dividing up an array between all the nodes. It then returns a starting
    and ending partition index unique to each node."""
    if rem != 0:
        if rank in list(range(rem)):
            if rank == 0:
                part_start = 0
                part_end = length
            else:
                part_start = rank * (length + 1)
                part_end = part_start + length
        else:
            part_start = rank * length + rem
            part_end = part_start + length - 1
    else:
        part_start = rank * length
        part_end = part_start + length - 1

    return part_start, part_end


def DetermineRoutingNumber(steps, goal, vertexSetSize):
    """This function takes in the matchings created by GetMatchings(), 
    and calculates all possible products between its own elements. It then
    takes all unique products, and calculates all possible prducts between
    the matching set and the previous output. This repeats until all 
    permutations of a given type are found. The level at which this occurs
    is then returned."""

    comm = MPI.COMM_WORLD
    rank = comm.Get_rank()
    size = comm.Get_size()

    length = len(steps)
    rem = length % size
    part_len = length // size

    part_start, part_end = SplitArrayBetweenNodes(rank,rem, part_len)

    permDict = {1: steps}
    i = 1
    while True:
        rout_array = set()
        work_array = set(list(permDict[i])[part_start:part_end + 1])

        #Calculate all possible products    
        for p1 in permDict[1]:
            for p2 in work_array:
                p2_perm = Perm(FromRank(p2,vertexSetSize - 1))
                p1_perm = Perm(FromRank(p1,vertexSetSize - 1))
                new = p2_perm * p1_perm

                if new(0) == 0 or new(0) == 1:
                    order = new.rank()
                    rout_array.add(order)

        #All nodes send their work to master node
        comm.Barrier()

        send_buffer = numpy.array(rout_array)
        sendcounts = numpy.array(comm.gather(len(rout_array), root = 0))

        if rank == 0:
            recv_buffer = numpy.empty(sum(sendcounts), dtype = int)
        else:
            recv_buffer = None

        comm.Gatherv(sendbuf = send_buffer, recvbuf = (recv_buffer, sendcounts), root = 0) 

        #Generate input for next level of the loop, and weed out repeats.
        permDict[i+1] = rout_array
        for j in range(1,i+1):
            permDict[i+1] = permDict[i+1] - permDict[j]


def main():
    file = "EdgesQ2.txt"
    maxMatching = 2
    vertexSetSize = 4

    edges = GetEdges(vertexSetSize, file)
    steps = GetMatchings(edges, maxMatching, vertexSetSize)
    goal = 2 * math.factorial(vertexSetSize-1)

    num = DetermineRoutingNumber(steps, goal, vertexSetSize)
    print(num)


main()

测试用例:

EdgesQ2.txt:

请注意 maxMatching = 2和 vertexSetSize = 4在这个例子中。输出应为 3 .

0,1
1,2
2,3
0,3

EdgesQ3.txt:

请注意 maxMatching = 4和 vertexSetSize = 8在这个例子中。输出应为 4 .

0,1
0,3
0,4
1,2
1,5
2,3
2,6
3,7
4,5
4,7
5,6
6,7

最佳答案

如果不同进程的长度不同，则需要使用向量变体 Gatherv .使用该函数，您可以提供一个包含各种长度(recvcounts)的数组。

不幸的是，mpi4py 文档目前没有描述如何使用 Gatherv或任何其他载体变体。这是一个简单的例子:

#!/usr/bin/env python3

import numpy as np
from mpi4py import MPI
import random

comm = MPI.COMM_WORLD
rank = comm.Get_rank()
root = 0

local_array = [rank] * random.randint(2, 5)
print("rank: {}, local_array: {}".format(rank, local_array))

sendbuf = np.array(local_array)

# Collect local array sizes using the high-level mpi4py gather
sendcounts = np.array(comm.gather(len(sendbuf), root))

if rank == root:
    print("sendcounts: {}, total: {}".format(sendcounts, sum(sendcounts)))
    recvbuf = np.empty(sum(sendcounts), dtype=int)
else:
    recvbuf = None

comm.Gatherv(sendbuf=sendbuf, recvbuf=(recvbuf, sendcounts), root=root)
if rank == root:
    print("Gathered array: {}".format(recvbuf))

如您所见，mpi4py 不将 sendcounts 或 recvcounts 作为额外参数，而是作为 recvbuf 的元组/列表。范围。如果您通过 (recvbuf, sendcounts)它将从 recvbuf 派生类型。将进行位移/偏移，以使来自所有等级的数据连续存储并按等级排序。

基本上，mpi4py 会疯狂地猜测您对 recvbuf 的各种形式可能意味着什么。范围。完整且明确的形式是 (buffer, counts, displacements, type) .

关于 KeyError 的编辑:

相当困惑的名称 rout_array是 set ，这不是 numpy.array 的有效输入.一个 set既不是序列也没有数组接口(interface)。不幸的是， numpy.array 并没有失败，而是创建一个非常奇怪的 ndarray没有尺寸的物体。您可以将数组创建包装在列表中:

send_buffer = numpy.array(list(rout_array))

集体工作，但循环不会终止，考虑到没有 return，这并不奇怪。或 break在 while true循环进入 DetermineRoutingNumber .

关于python - 如何使用 mpi4py 收集长度不等的数组，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/37999866/

文章推荐： java - 这个符号在Java中是什么意思？

文章推荐： java - 将元素从不同的类添加到 ArrayList 时出现问题

文章推荐： java - 在java中调整svg图像的大小

文章推荐： java - 缓冲区和数据包

python - 将键-长度-值 (KLV) 字符串拆分为键、长度、值的小列表
将 KLV 字符串拆分为键、长度、值作为元素的列表/元组的更有效方法是什么？为了添加一点背景，前 3 位数字作为键，接下来的 2 位表示值的长度。我已经能够使用以下代码解决该问题。但我不认为我的代
javascript - 上传前如何获取视频的元数据(长度)？
首先，我试图从文件中提取视频持续时间，然后在无需实际上传文件的情况下显示它。当用户选择视频时 - 信息将显示在其下方，包括文件名、文件大小、文件类型。不管我的技能多么糟糕 - 我无法显示持续时间。我
scala - 获取数组列的大小/长度
我是 Scala 编程新手，这是我的问题:如何计算每行的字符串数量？我的数据框由一列 Array[String] 类型组成。 friendsDF: org.apache.spark.sql.DataF
javascript - 查找上传到React应用程序的音频文件的持续时间/长度
我有一个React Web应用程序(create-react-app)，该应用程序使用react-hook-forms上传歌曲并使用axios将其发送到我的Node / express服务器。我想确
Java - 如何将网络掩码转换为位/长度
如果给你一个网络掩码(例如 255.255.255.0)，你如何在 Java 中获得它的长度/位(例如 8)？最佳答案如果您想找出整数低端有多少个零位，请尝试 Integer.numberOfTr
追加和删除点击时的 Jquery 长度
我需要使用 jQuery 获取 div 数量的长度。我可以得到它，但在两个单击事件中声明变量，但这似乎是错误的，然后我还需要使用它来根据数字显示隐藏按钮。我觉得我不必将代码加倍。在这里摆弄 htt
python散点图面积大小比例 Axis 长度
我对此感到非常绝望，到目前为止我在 www 上找不到任何东西。情况如下: 我正在使用 Python。我有 3 个数组:x 坐标、y 坐标和半径。我想使用给定的 x 和 y 坐标创建散点图。到目
jQuery:动态加载元素的大小/长度
我有一个表单，我通过 jQuery 的加载函数动态添加新的输入和选择元素。有时加载的元素故意为空，在这种情况下我想隐藏容器 div，这样它就不会破坏样式。问题是，我似乎无法计算加载的元素，因此不知道
jQuery 长度 > 0
我决定通过替换来使我的代码更清晰 if (wrappedSet.length > 0) 类似 if (wrappedSet.exists()) 是否有任何 native jq 函数可以实现此目的？或者
MySQL更改 key 长度
简单的问题。如果我有一个如下表: CREATE TABLE `exampletable` ( `id` int(11) NOT NULL AUTO_INCREMENT, `textfield`
mysql - 参数化查询中的数据类型/长度
我正在使用经典 ASP/MySQL 将长用户输入插入到我的数据库中，该输入是从富文本编辑器生成的。该列设置为 LONG-TEXT。作为参数化查询(准备语句)的新手，我不确定用于此特定查询的数据长度。
javascript - 如何在文本输入字段上设置自动宽度/长度？
我正在获取 Stripe 交易费用的值(value)并通过禁用的文本字段显示它。由于输入文本域，句子出现较大空隙 This is the amount $3.50____________that n
css - 背景大小包含 AND 长度
我有一个 div，其背景图像的大小设置为包含。但是，图像是视网膜计算机(Macbook Pro 等)的双分辨率图像，所以我希望能够以某种方式让页面知道即使我说的是背景大小:包含 200x200 图像，
javascript - 从dom中删除元素时如何更改计数器/长度
我正在开发一个具有“已保存”和“已完成”模块的小部件。当我删除元素时，它会从 dom 中删除/淡化它，但是当我将其标记为完成时，它会将其克隆到已完成的选项卡。这工作很棒，但顶部括号内的数字不适合我。这
android JSONArray 长度
我有一个来自 json 提要的数组，我知道在 jArray 中有一个联盟，但我需要计算出该数组的计数，以防稍后将第二个添加到提要中。目前 log cat 没有注销“teamFeedStructure”
嵌套列表的 Javascript 长度
目标:给定一个混合类型的数组，确定每个级别的元素数量。如果同一层有两个子数组，则它们的每个元素都计入该层元素的总数。方法: Array.prototype.elementsAtLevels = fu
java - 测量Java中单链表的大小/长度？
我需要帮助为 Java 中的单链表制作 int size(); 方法。这是我目前所拥有的，但它没有返回正确的列表大小。 public int size() { int size = 0;
Java DataInputStream 长度
我正在为学校作业创建一个文件服务器应用程序。我目前拥有的是一个简单的 Client 类，它通过 TCP 发送图像，还有一个 Server 类接收图像并将其写入文件。这是我的客户端代码 import
haskell - 理解(,)<$>长度<*>头部的类型
我有这对功能 (,) length :: Foldable t => t a -> b -> (Int, b) 和， head :: [a] -> a 我想了解的类型 (,) length he
github - 如何在Jekyll's Liquid模板中比较字符串大小/长度？
我正在GitHub Pages上使用Jekyll来构建博客，并希望获得传递给YAML前题中Liquid模板的page.title字符串的长度，该字符串在每个帖子的YAML主题中。我还没有找到一种简单的

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - 如何使用 mpi4py 收集长度不等的数组