python-3.x - 为什么 heapq.heapify 这么快？-6ren

python-3.x - 为什么 heapq.heapify 这么快？

转载作者：行者123 更新时间：2023-12-04 11:26:11

26

4

我尝试重新实现 heapify 方法以使用 _siftup和 _siftdown用于更新或删除堆中的任何节点并保持 O(log(n)) 的时间复杂度。
我做了一些努力来优化我的代码，但与 heapq.heapify 相比，它们被证明更糟。 (就所用的总时间而言)。所以我决定调查source code .并将复制的代码与模块的代码进行比较。

# heap invariant.
def _siftdown(heap, startpos, pos):
    newitem = heap[pos]
    # Follow the path to the root, moving parents down until finding a place
    # newitem fits.
    while pos > startpos:
        parentpos = (pos - 1) >> 1
        parent = heap[parentpos]
        if newitem < parent:
            heap[pos] = parent
            pos = parentpos
            continue
        break
    heap[pos] = newitem

def _siftup(heap, pos):
    endpos = len(heap)
    startpos = pos
    newitem = heap[pos]
    # Bubble up the smaller child until hitting a leaf.
    childpos = 2*pos + 1    # leftmost child position
    while childpos < endpos:
        # Set childpos to index of smaller child.
        rightpos = childpos + 1
        if rightpos < endpos and not heap[childpos] < heap[rightpos]:
            childpos = rightpos
        # Move the smaller child up.
        heap[pos] = heap[childpos]
        pos = childpos
        childpos = 2*pos + 1
    # The leaf at pos is empty now.  Put newitem there, and bubble it up
    # to its final resting place (by sifting its parents down).
    heap[pos] = newitem
    _siftdown(heap, startpos, pos)


def heapify(x):
    """Transform list into a heap, in-place, in O(len(x)) time."""
    n = len(x)
    # Transform bottom-up.  The largest index there's any point to looking at
    # is the largest with a child index in-range, so must have 2*i + 1 < n,
    # or i < (n-1)/2.  If n is even = 2*j, this is (2*j-1)/2 = j-1/2 so
    # j-1 is the largest, which is n//2 - 1.  If n is odd = 2*j+1, this is
    # (2*j+1-1)/2 = j so j-1 is the largest, and that's again n//2-1.
    for i in reversed(range(n//2)):
        _siftup(x, i)

a = list(reversed(range(1000000)))
b = a.copy()

import heapq

import time

cp1 = time.time()
heapq.heapify(a)
cp2 = time.time()
heapify(b)
cp3 = time.time()

print(a == b)
print(cp3 - cp2, cp2 - cp1)

我总是发现 cp3 - cp2 >= cp2 - cp1和不一样，heapify花了比heapq.heaify更多的时间即使两者相同。
在某些情况下 heapify花了 3 秒和 heapq.heapify用了 0.1 秒
heapq.heapfy 模块比相同的 heapify 执行得更快，它们仅通过导入而不同。
请让我知道原因，如果我犯了一些愚蠢的错误，我很抱歉。

最佳答案

heapify来自 heapq模块实际上是一个内置函数:

>>> import heapq
>>> heapq
<module 'heapq' from 'python3.9/heapq.py'>
>>> heapq.heapify
<built-in function heapify>

help(heapq.heapify)说:

Help on built-in function heapify in module _heapq...

所以它实际上是在导入内置模块 _heapq从而运行 C 代码，而不是 Python。
如果您滚动 heapq.py进一步代码，你会看到 this :

# If available, use C implementation
try:
    from _heapq import *
except ImportError:
    pass

这将覆盖像 heapify 这样的函数与他们的 C 实现。例如， _heapq.heapify是 here .

关于python-3.x - 为什么 heapq.heapify 这么快？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/66442043/

26

4

0

文章推荐： typescript - EventSource 使用 TypeScript 命名事件

文章推荐： flutter - Getx Flutter - 更新列表中的项目不是 react 性的

python - 为什么使用 heapq.heapify 创建的堆与迭代 heapq.heappush 创建的堆不同
我注意到给定一个列表，如果我使用 heapq.heapify() 创建一个堆，元素的顺序与我在列表上迭代并执行 heap.heappush() 时获得的顺序不同。谁能帮我理解为什么？此外，对于可迭
Python heapq 替换优先级
我正在尝试使用 Python 的 heapq 来实现 Dijkstra 的算法。如果发现通往它的较短路径，则该算法需要更改单元格的值。我正在通过此检查执行此操作: if curr_cell[0] +
python - heapq 成员资格测试和替换
来自官方heapq的示例: >>> heap = [] >>> data = [(1, 'J'), (4, 'N'), (3, 'H'), (2, 'O')] >>> for item in data
Python heapq 没有按正确的顺序推送？
工具.py import heapq class PriorityQueue: def __init__(self): self.heap=[] def push(se
Python heapq 与预排序列表的排序速度
我有相当多的 n=10000 个排序列表，每个列表的长度为 k=100。由于合并两个排序列表需要线性时间，我认为在深度为 log(n) 的树中递归合并长度为 O(nk) 的排序列表与 heapq.me
Python heapq 与排序的复杂性和性能
我是 python 的新手(使用 v3.x 语法)，希望得到有关 heapq 与排序的复杂性和性能的说明。我已经为贪婪的“找到最佳工作安排”算法实现了基于 heapq 的解决方案。但后来我了解了将“
python - heapq 库中函数的时间复杂度是多少
我的问题来自下面leetcode中的解决方案，我不明白为什么是O(k+(n-k)log(k))。补充:可能复杂度不是这个，其实我不知道heappush()和heappop()的时间复杂度 # O(k
python - 具有自定义比较谓词的 heapq
我正在尝试使用自定义排序谓词构建堆。由于进入它的值是“用户定义”类型，我无法修改它们的内置比较谓词。有没有办法做类似的事情: h = heapq.heapify([...], key=my_lt_p
python - 如果项目不可比较，heapq 无法处理具有相同优先级的元组
>>> from heapq import heappush >>> heap = [] >>> heappush(heap,(0,{"k":0})) >>> heappush(heap,(0,{"k
python - 如何使用对象比较函数反转 heapq 堆中元素的顺序？
首先，我阅读了这个SO question但它实际上不包括我想要的方法。此外，否定实际值不适用于我的用例。 Heapq 文档:https://docs.python.org/3.6/library/he
python - 检查 heapq 是否包含值
我正在使用 heapq 对象来存储我实现的类的对象。 import heapq heap = [] element1 = Element('A', 1) element2 = Element('B',
Python:heapq.heappop() 给出奇怪的结果
我试图在我的程序中使用 Python 模块 heapq，但我在使用 heapq.heappop() 时遇到了一个奇怪的问题。该函数似乎没有返回堆中的最小元素。看看下面的代码: Python 2.7.1
python - heapq.nlargest() 的关键函数
我有一本包含 {key: count} 的字典，比如说status_count = {'管理分析':13859，'计算机程序员':72112}我正在尝试为 heapq.nlargest() 编写一个键
python - 访问 heapq 的索引和长度？
我最初尝试使用优先级队列编写算法来解决 15 题，但我的导师告诉我们，我们需要编写 a* 实现，并建议我们使用 heapq 而不是优先级队列。我无法找到我的 heapq 的长度/大小，也无法访问我的
python - heapq python - 如何修改堆排序的值
我将一个名为 UNVISITED 的空列表转换为一个堆，这样: UNVISITED = [] heapq.heappush(UNVISITED, (a.f, a)) 我推送的对象a，是从一个类中实例化
python - python中的堆(heapq)稳定吗？
如果堆化此 [(10,'Mike'),(20,'Jack'),(10,'Bob')] 并返回堆的最小值，它会保证返回 (10,'Mike') 而不是 (10,'Bob') 吗？最佳答案 no hea
python - heapq.heapify 不适用于子类列表
每次 heapq.heapify 函数更改我的堆列表中的元素时，我都希望得到回调通知(顺便说一句，这是跟踪列表中的对象以及它们的索引如何获取所需要的改变了)。我的计划是从 list 继承并重写 __
python - heapq.n 返回结果在原始序列中的最大索引
如何返回可迭代的第n大项的原始列表中的索引 heapq.nlargest(2, [100, 2, 400, 500, 400]) output = [(3,500), (2, 400)] 这已经花费了
python - 使用 heapq 降序
我正在使用 Python 的 heapq 模块按升序和降序获取数据。对于升序，我使用的是最小堆，它运行良好，如下所示: >>> from heapq import heapify, heappop
python - 如何让 heapq 评估特定属性的堆？
我希望拥有一堆对象，而不仅仅是数字。它们将具有堆可以排序的整数属性。在python中使用堆最简单的方法是heapq，但是在使用heapq时如何告诉它按特定属性排序呢？最佳答案根据 document

首页

博学

6Ren·AI

商城

python-3.x - 为什么 heapq.heapify 这么快？