How to convert a pydub `AudioSegment` for streaming purposes without an intermediate audio file?

Reposted by: bug小助手 | Updated: 2023-10-24 18:41:27

I'm working on software that streams audio from a source over HTTP using a Flask web service. I can get the sound frames through sounddevice and route them to a browser through a Flask route with yield and the right mimetype, but raw audio is cumbersome for remote streaming and a poor choice for client compatibility.

I'd love to use pydub to convert the raw audio frames to a format like mp3 or ogg, but neither the documentation nor the source code makes it clear to me how to convert formats on the fly without dumping the output to a file through .export().

The skeleton of my code so far is something like:

### audio.py

import queue

import sounddevice as sd
from pydub.audio_segment import AudioSegment


def input_stream(device, sample_width=2, sample_rate=44100,
                 channels=1, latency=0, blocksize=2048, timeout=5.0):
    audio_queue = queue.Queue()

    def audio_callback(indata, frames, time_duration, status):
        # indata is a NumPy array; AudioSegment expects raw PCM bytes
        audio = AudioSegment(indata.tobytes(), sample_width=sample_width,
                             channels=channels, frame_rate=sample_rate)

        # Some pydub magic should happen here to convert the raw frame to mp3/ogg

        audio_queue.put(audio.raw_data)

    # dtype='int16' matches sample_width=2 (sounddevice defaults to float32)
    with sd.InputStream(samplerate=sample_rate, device=device,
                        channels=channels, callback=audio_callback,
                        dtype='int16', latency=latency, blocksize=blocksize):
        # recording_terminated() is not defined in this skeleton; it stands
        # for the application's own stop condition
        while not recording_terminated():
            yield audio_queue.get(block=True, timeout=timeout)


### web.py

from flask import Flask, request, Response

from audio import input_stream

app = Flask(__name__)


@app.route('/sound/stream', methods=['GET'])
def get_sound_feed():
    device = request.args.get('device')
    # The mimetype should match whatever format input_stream() yields
    return Response(input_stream(device), mimetype='audio/ogg')
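
The skeleton above relies on a queue that is filled by a callback and drained by a generator. A minimal, standard-library-only sketch of that pattern follows. It is not the asker's code: the `_STOP` sentinel, `drain` and `fake_producer` are hypothetical names, and the producer thread merely stands in for the sounddevice callback. The sentinel is one possible way to end the stream in place of the undefined `recording_terminated()`.

```python
import queue
import threading

_STOP = object()  # hypothetical sentinel marking the end of the stream


def drain(audio_queue, timeout=5.0):
    """Yield chunks from the queue until the sentinel arrives."""
    while True:
        chunk = audio_queue.get(block=True, timeout=timeout)
        if chunk is _STOP:
            return
        yield chunk


def fake_producer(audio_queue, n_chunks=3):
    """Stand-in for the audio callback: pushes dummy 4-byte chunks."""
    for i in range(n_chunks):
        audio_queue.put(bytes([i]) * 4)
    audio_queue.put(_STOP)


q = queue.Queue()
threading.Thread(target=fake_producer, args=(q,), daemon=True).start()
received = list(drain(q))
print(received)
```

A generator like `drain` could be handed to Flask's `Response` in the same way as `input_stream(device)`.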

How would one convert the raw AudioSegment object in audio_callback into compressed mp3/ogg data suitable for web streaming? I know that it's possible to create a segment from an mp3 through AudioSegment.from_file, or to dump it to an mp3 file through .export(), but that isn't really an option, since such I/O operations would introduce non-negligible latency. It might be theoretically possible to get .export() to write to a socket or a FIFO file descriptor, but that sounds like a hacky workaround to me. I'm also not sure whether it's sufficient for the file descriptor to provide a .write() method, or whether it would break because other methods (e.g. seek) are required.

More replies

I've bumped into this question from 2014: stackoverflow.com/questions/25469161/…. The last comment seems to confirm my suspicion that exporting on-the-fly (not to a file) is not an option in pydub. I'm not sure if any progress has been made in the meantime or if some user has already made a fork to support it. If that's not the case, I might give it a try and submit a PR to pydub.

Have you found out how to do this? I'm trying to save the file in a BytesIO to remove the persistence aspect.

Recommended answers

I don't know if you can prevent pydub from saving the file to disk, but you can get the file at the end of the conversion without reopening it.
The .export() function returns the file object when it finishes.

convert_file = audio_file.export(format="flac")

I have done this and could process convert_file as if I had obtained it from open(). (I convert to flac for my own project, but you can use any format.)

I found that if you don't provide a file name, .export() doesn't write the file to disk, and it doesn't raise any errors either.

I hope you can find a workaround for your issue.

In case anyone is interested: you can use a BytesIO object to export the converted audio into an in-memory object.

from io import BytesIO

from pydub import AudioSegment

# Load input song
song = AudioSegment.from_mp3("my_song.mp3")
# create new empty BytesIO
exported_io = BytesIO()
# export to BytesIO object
song.export(exported_io, format="wav")

One can then perform any action on this BytesIO object. No file will be created using this approach.
