gpt4 book ai didi

python - 使用 Python 3 捕获 192 kHz 音频

转载 作者:太空狗 更新时间:2023-10-29 17:50:19 24 4
gpt4 key购买 nike

我需要使用 Python 3 为一些生物声学实验捕获 192 kHz 音频。我有硬件、Sound Devices USBPre 2 声卡、具有高达 100 kHz 良好频率响应曲线的麦克风,并且我已启用我的操作系统 (ubuntu 13.04) 以 192 kHz 从这张卡中采样。

我试过用 PyAudio 录音。它似乎可以工作,并且会给我一个采样率为 192 kHz 的 wav 文件。然而,当我查看频谱时,没有超过 24 kHz 的功率,这表明 PyAudio 并不是真正在 192 kHz 捕获,而是在 48 kHz 捕获。然而,当我使用来自 JACK 的输入使用 Audacity 进行录音时,我得到了一个不错的录音,功率高达 96kHz。所以,我的印象是 PyAudio 实际上并没有对 192 kHz 的声音进行采样,尽管它应该能够。如何解决这个问题?

我启动 JACK 没有错误:

/usr/bin/jackd -R -dalsa -Chw:1,0 -n3 -o1 -p2048 -r192000

jackd 0.122.0
Copyright 2001-2009 Paul Davis, Stephane Letz, Jack O'Quinn, Torben Hohn and others.
jackd comes with ABSOLUTELY NO WARRANTY
This is free software, and you are welcome to redistribute it
under certain conditions; see the file COPYING for details

JACK compiled with System V SHM support.
loading driver ..
apparent rate = 192000
creating alsa driver ... -|hw:1,0|2048|3|192000|0|1|nomon|swmeter|-|32bit
control device hw:0
configuring for 192000Hz, period = 2048 frames (10.7 ms), buffer = 3 periods
ALSA: final selected sample format for capture: 24bit little-endian
ALSA: use 3 periods for capture

初始化 PyAudio(没有任何真正的错误(据我所知)):

p = pyaudio.PyAudio()
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.rear
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.center_lfe
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.side
bt_audio_service_open: connect() failed: Connection refused (111)
bt_audio_service_open: connect() failed: Connection refused (111)
bt_audio_service_open: connect() failed: Connection refused (111)
bt_audio_service_open: connect() failed: Connection refused (111)
ALSA lib pcm_dmix.c:957:(snd_pcm_dmix_open) The dmix plugin supports only playback stream

打开 PyAudio 流:

stream = p.open(format=pyaudio.paInt32,
channels=1,rate=192000,
input=True,
frames_per_buffer=2048)

我有频谱图的图像,以防有人想验证我的解释,即 PyAudio 没有在 192 kHz 捕获(但 Audacity 可以):

使用 PyAudio 捕获的声音频谱图 Spectrogram of sound captured usig PyAudio

使用 Audacity 捕获的声音频谱图 Spectrogram of sound captured usig Audacity

如何使用 PyAudio 以 192 000 采样/秒的速度录制声音?也欢迎提出使用 Python 3 捕获声音的其他方法的建议。

最佳答案

这并不是一个决定性的答案,而是试图帮助您自己找出问题。

当尝试在 OS X 上使用 PyAudio 重现您的问题时,我总是遇到 [Errno Input overflowed] -9981(例如 several other people ,它 seems )。p.is_format_supported() 报告为正常的配置也导致了这些错误。所以我编写了一个脚本,它只是尝试记录所有可能的记录设置排列。

此脚本探测设备采样率格式 channel 列表的所有排列,在一个防御方式,并将结果保存到根据录制设置命名的文件中。

import os
import pyaudio
import sys

# === These parameters will be permuted ===========
DEVICES = [0, 1, 2]
RATES = [44100, 48000, 192000]
FORMATS = ['Float32', 'Int32', 'Int24', 'Int16', 'Int8', 'UInt8']
CHANNELS = [1, 2]
# =================================================

CHUNK = 1024
COLUMNS = (('filename', 30),
('result', 9),
('dev', 5),
('rate', 8),
('format', 9),
('channels', 10),
('chunk', 7),
('reason', 0))
STATUS_MSG = "Recording... "

pa = pyaudio.PyAudio()


def get_format(format):
fmt = getattr(pyaudio, 'pa%s' % format)
return fmt


def record(filename=None,
duration=5,
dev=0,
rate=44100,
format='Float32',
channels=2,
chunk=1024,):
"""Record `duration` seconds of audio from the device with index `dev`.
Store the result in a file named according to recording settings.
"""
if filename is None:
filename = "dev{dev}-{rate}-{format}-{channels}ch.raw".format(**locals())
result = 'FAILURE'
reason = ''

outfile = open(filename, 'w')
print STATUS_MSG,
sys.stdout.flush()

try:
stream = pa.open(input_device_index=dev,
rate=rate,
format=get_format(format),
channels=channels,
frames_per_buffer=chunk,
input=True,
)

try:
for i in range(0, rate / (chunk) * duration):
a = stream.read(chunk)
outfile.write(a)
result = 'SUCCESS'
# Catch exceptions when trying to read from stream
except Exception, e:
reason = "'%s'" % e
# Catch exceptions when trying to even open the stream
except Exception, e:
reason = "'%s'" % e

outfile.close()

# Don't leave files behind for unsuccessful attempts
if result == 'FAILURE':
os.remove(filename)
filename = ''

info = {}
for col_name, width in COLUMNS:
info[col_name] = str(locals()[col_name]).ljust(width)

msg = "{filename}{result}{dev}{rate}{format}{channels}{chunk}{reason}"
print msg.format(**info)

def main():
# Build the header line
header = 'STATUS'.ljust(len(STATUS_MSG) + 1)
for col_name, width in COLUMNS:
header += col_name.upper().ljust(width)
print header
print "=" * len(header)

# Record samples for all permutations of our parameter lists
for dev in DEVICES:
for rate in RATES:
for format in FORMATS:
for channels in CHANNELS:
record(duration=2,
dev=dev,
rate=rate,
format=format,
channels=channels,
chunk=CHUNK)

if __name__ == '__main__':
main()

样本输出(简化):

STATUS        FILENAME                      RESULT   DEV  RATE    FORMAT   CHANNELS  CHUNK  REASON
==================================================================================================
Recording... dev0-44100-Float32-1ch.raw SUCCESS 0 44100 Float32 1 1024
Recording... dev0-44100-Float32-2ch.raw SUCCESS 0 44100 Float32 2 1024
Recording... dev0-44100-Int16-1ch.raw SUCCESS 0 44100 Int16 1 1024
Recording... dev0-44100-Int16-2ch.raw SUCCESS 0 44100 Int16 2 1024
Recording... FAILURE 0 192000 Float32 1 1024 '[Errno Input overflowed] -9981'
Recording... FAILURE 0 192000 Float32 2 1024 '[Errno Input overflowed] -9981'
Recording... FAILURE 0 192000 Int16 1 1024 '[Errno Input overflowed] -9981'
Recording... FAILURE 0 192000 Int16 2 1024 '[Errno Input overflowed] -9981'
Recording... dev1-44100-Float32-1ch.raw SUCCESS 1 44100 Float32 1 1024
Recording... dev1-44100-Float32-2ch.raw SUCCESS 1 44100 Float32 2 1024
Recording... dev1-44100-Int16-1ch.raw SUCCESS 1 44100 Int16 1 1024
Recording... dev1-44100-Int16-2ch.raw SUCCESS 1 44100 Int16 2 1024
Recording... FAILURE 1 192000 Float32 1 1024 '[Errno Input overflowed] -9981'
Recording... FAILURE 1 192000 Float32 2 1024 '[Errno Input overflowed] -9981'
Recording... FAILURE 1 192000 Int16 1 1024 '[Errno Input overflowed] -9981'
Recording... FAILURE 1 192000 Int16 2 1024 '[Errno Input overflowed] -9981'
Recording... FAILURE 2 44100 Float32 1 1024 '[Errno Invalid number of channels] -9998'
Recording... FAILURE 2 44100 Float32 2 1024 '[Errno Invalid number of channels] -9998'
Recording... FAILURE 2 44100 Int16 1 1024 '[Errno Invalid number of channels] -9998'
Recording... FAILURE 2 44100 Int16 2 1024 '[Errno Invalid number of channels] -9998'
Recording... FAILURE 2 192000 Float32 1 1024 '[Errno Invalid number of channels] -9998'
Recording... FAILURE 2 192000 Float32 2 1024 '[Errno Invalid number of channels] -9998'
Recording... FAILURE 2 192000 Int16 1 1024 '[Errno Invalid number of channels] -9998'
Recording... FAILURE 2 192000 Int16 2 1024 '[Errno Invalid number of channels] -9998'

关于python - 使用 Python 3 捕获 192 kHz 音频,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/19692003/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com