Python, SciPy: Building triplets using large adjacency matrix

Reposted by: 太空狗 · Updated: 2023-10-29 18:06:19

I am using an adjacency matrix to represent a network of friends, which can be visually interpreted as

Mary     0        1      1      1
Joe      1        0      1      1
Bob      1        1      0      1
Susan    1        1      1      0
         Mary     Joe    Bob    Susan

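Using this matrix, I want to compile a list of all possible friendship triangles, with the condition that user 1 is a friend of user 2 and user 2 is a friend of user 3. For my list, user 1 does not need to be a friend of user 3.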

(joe, mary, bob)
(joe, mary, susan)
(bob, mary, susan)
(bob, joe, susan)

I have some code that works well for small triangles, but I need it to scale to very large sparse matrices.

import time
import numpy as np

def buildTriangles(G):
    # G is a sparse adjacency matrix (e.g. scipy.sparse.csr_matrix)
    start = time.time()
    ctr = 0
    G = G + G.T          # I do this to make sure it is symmetric
    triples = []
    for i in np.arange(G.shape[0] - 1):  # for each row but the last one
        _, J = G[i,:].nonzero()       # J: primary friends of user i (column indices);
                                      # the row indices are discarded
        J = J[ J < i ]                # only compute the lower triangle to avoid repetition
        for j in J:
            K, _ = G[:,j].nonzero()   # K: secondary friends of user i (row indices of column j)
            K = K[ K > i ]            # only compute below i to avoid repetition
            for k in K:
                ctr = ctr + 1
                triples.append( (i,j,k) )
    print("total number of triples: %d" % ctr)
    print("run time is %.2f" % (time.time() - start))  # start is a float, not a callable
    return triples

I was able to run the code on a csr_matrix in about 21 minutes. The matrix was 1032570 x 1032570 and contained 88910 stored elements. A total of 2178893 triplets were generated.

I need to be able to do something similar with a 1968654 x 1968654 sparse matrix that has 9428596 stored elements.

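I am new to Python (less than a month of experience) and not the best at linear algebra, which is why my code does not take advantage of matrix operations. Can anyone suggest improvements, or let me know whether my goal is realistic?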
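One likely cost in the loop above is that G[i,:] and G[:,j] each build a new sparse matrix on every iteration. The following is a minimal sketch of the same enumeration that reads the CSR arrays (indptr, indices) directly. It assumes G is symmetric after G + G.T, so column j can be read as row j. The function name build_triples_csr is hypothetical, and no timing is claimed for it.

```python
import numpy as np
from scipy import sparse

def build_triples_csr(G):
    """Enumerate (i, j, k) with j < i < k, where j is a friend of both i and k.

    Hypothetical helper: same triples as the loop in the question, but
    without slicing the sparse matrix inside the loops.
    """
    G = sparse.csr_matrix(G)
    G = sparse.csr_matrix(G + G.T)   # symmetrize, as in the question
    G.eliminate_zeros()              # keep only truly nonzero entries
    G.sort_indices()                 # ascending column indices in each row
    indptr, indices = G.indptr, G.indices
    triples = []
    for i in range(G.shape[0] - 1):  # for each row but the last one
        row_i = indices[indptr[i]:indptr[i + 1]]      # friends of i
        for j in row_i[row_i < i]:
            # By symmetry, the nonzero rows of column j are the
            # nonzero columns of row j.
            row_j = indices[indptr[j]:indptr[j + 1]]
            for k in row_j[row_j > i]:
                triples.append((i, int(j), int(k)))
    return triples

# The 4 x 4 example from the question (Mary=0, Joe=1, Bob=2, Susan=3).
G_example = np.array([[0, 1, 1, 1],
                      [1, 0, 1, 1],
                      [1, 1, 0, 1],
                      [1, 1, 1, 0]])
print(build_triples_csr(G_example))
```

With Mary=0, Joe=1, Bob=2, Susan=3, the result corresponds to the four triples listed earlier: (joe, mary, bob), (joe, mary, susan), (bob, mary, susan), (bob, joe, susan).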

Best answer

I think you can find triangles only in rows or columns. For example:

Susan    1        1      1      0 
        Mary     Joe    Bob    Susan

This means Mary, Joe, and Bob are all friends of Susan, so using combinations to choose two people from [Mary, Joe, Bob] and combining them with Susan gives one triangle. itertools.combinations() does this quickly.

Here is the code:

import itertools
import numpy as np

G = np.array(   # clear half of the matrix first
    [[0,0,0,0],
     [1,0,0,0],
     [1,1,0,0],
     [1,1,1,0]])
triples = []
for i in range(G.shape[0]):   # range() replaces Python 2's xrange()
    row = G[i,:]
    J = np.nonzero(row)[0].tolist() # combinations() with list is faster than NumPy array.
    for t1,t2 in itertools.combinations(J, 2):
        triples.append((i,t1,t2))
print(triples)
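The code above uses a small dense array, while the question concerns a large sparse matrix. As a rough sketch under that assumption, the same row-wise idea can be applied to a SciPy CSR matrix: scipy.sparse.tril clears the upper half, and the indptr and indices arrays give each row's friends without densifying. The names triples_from_rows and count_triples are hypothetical. In each triple (i, t1, t2), i is the shared friend, as in the code above.

```python
import itertools
import numpy as np
from scipy import sparse

def triples_from_rows(G):
    """Yield (i, t1, t2) for every pair of friends t1 < t2 < i of user i."""
    # Keep only the strictly lower triangle ("clear half of the matrix").
    L = sparse.tril(sparse.csr_matrix(G), k=-1, format="csr")
    L.eliminate_zeros()
    L.sort_indices()
    indptr, indices = L.indptr, L.indices
    for i in range(L.shape[0]):
        friends = indices[indptr[i]:indptr[i + 1]].tolist()
        for t1, t2 in itertools.combinations(friends, 2):
            yield (i, t1, t2)

def count_triples(G):
    """Count the triples without building them: d * (d - 1) / 2 per row."""
    L = sparse.tril(sparse.csr_matrix(G), k=-1, format="csr")
    L.eliminate_zeros()
    deg = np.diff(L.indptr).astype(np.int64)   # stored entries per row
    return int((deg * (deg - 1) // 2).sum())

G_full = np.array([[0, 1, 1, 1],
                   [1, 0, 1, 1],
                   [1, 1, 0, 1],
                   [1, 1, 1, 0]])
print(list(triples_from_rows(G_full)))
print(count_triples(G_full))
```

Counting first can help judge whether the full list of triples will fit in memory before it is generated; the generator form avoids holding all triples at once if they can be processed one at a time.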

Regarding Python, SciPy: Building triplets using large adjacency matrix, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/6931985/
