python - PyAudio - 将stream.read转换为int以获得幅度-6ren

python - PyAudio - 将stream.read转换为int以获得幅度

转载作者：行者123 更新时间：2023-12-01 04:01:28

25

4

我正在尝试录制音频并同时打印录制信号的幅度。所以我将所有数据保存在stream.read中。但是当我尝试打印它们时，我有一个字节字符串，没有整数。我想知道如何转换这些符号以获得幅度。

这是我的代码:

import pyaudio
import wave

CHUNK = 1024 
FORMAT = pyaudio.paInt16
CHANNELS = 1 
RATE = 44100 
RECORD_SECONDS = 5
WAVE_OUTPUT_FILENAME = "output.wav"

p = pyaudio.PyAudio()

stream = p.open(format=FORMAT,
                channels=CHANNELS,
                rate=RATE,
                input=True,
                frames_per_buffer=CHUNK) 

print("* recording")

frames = []

for i in range(0, int(RATE / CHUNK * RECORD_SECONDS)):
    data = stream.read(CHUNK)
    frames.append(data) # 2 bytes(16 bits) per channel

print("* done recording")

stream.stop_stream()
stream.close()
p.terminate()

for data in frames:
    print(data)

这就是我得到的:

       ����#  ����
          
 !$
          

                 ��  ���� ��������������������������
           ������  �� ��                                           
��

   �� ������ ����������������������������
                            ��    
                                     ����

��

% �� (��)��,��.��%��#��

最佳答案

您当然可以通过以下代码来启发自己:

#!/usr/bin/python

# open a microphone in pyAudio and listen for taps

import pyaudio
import struct
import math

INITIAL_TAP_THRESHOLD = 0.010
FORMAT = pyaudio.paInt16 
SHORT_NORMALIZE = (1.0/32768.0)
CHANNELS = 2
RATE = 44100  
INPUT_BLOCK_TIME = 0.05
INPUT_FRAMES_PER_BLOCK = int(RATE*INPUT_BLOCK_TIME)
# if we get this many noisy blocks in a row, increase the threshold
OVERSENSITIVE = 15.0/INPUT_BLOCK_TIME                    
# if we get this many quiet blocks in a row, decrease the threshold
UNDERSENSITIVE = 120.0/INPUT_BLOCK_TIME 
# if the noise was longer than this many blocks, it's not a 'tap'
MAX_TAP_BLOCKS = 0.15/INPUT_BLOCK_TIME

def get_rms( block ):
    # RMS amplitude is defined as the square root of the 
    # mean over time of the square of the amplitude.
    # so we need to convert this string of bytes into 
    # a string of 16-bit samples...

# we will get one short out for each 
# two chars in the string.
count = len(block)/2
format = "%dh"%(count)
shorts = struct.unpack( format, block )

# iterate over the block.
    sum_squares = 0.0
    for sample in shorts:
        # sample is a signed short in +/- 32768. 
        # normalize it to 1.0
        n = sample * SHORT_NORMALIZE
        sum_squares += n*n

    return math.sqrt( sum_squares / count )

class TapTester(object):
    def __init__(self):
        self.pa = pyaudio.PyAudio()
        self.stream = self.open_mic_stream()
        self.tap_threshold = INITIAL_TAP_THRESHOLD
        self.noisycount = MAX_TAP_BLOCKS+1 
        self.quietcount = 0 
        self.errorcount = 0

    def stop(self):
        self.stream.close()

    def find_input_device(self):
        device_index = None            
        for i in range( self.pa.get_device_count() ):     
            devinfo = self.pa.get_device_info_by_index(i)   
            print( "Device %d: %s"%(i,devinfo["name"]) )

            for keyword in ["mic","input"]:
                if keyword in devinfo["name"].lower():
                    print( "Found an input: device %d - %s"%        (i,devinfo["name"]) )
                    device_index = i
                    return device_index

    if device_index == None:
        print( "No preferred input found; using default input device." )

    return device_index

def open_mic_stream( self ):
    device_index = self.find_input_device()

    stream = self.pa.open(   format = FORMAT,
                             channels = CHANNELS,
                             rate = RATE,
                             input = True,
                             input_device_index = device_index,
                             frames_per_buffer = INPUT_FRAMES_PER_BLOCK)

    return stream

def tapDetected(self):
    print "Tap!"

def listen(self):
    try:
        block = self.stream.read(INPUT_FRAMES_PER_BLOCK)
    except IOError, e:
        # dammit. 
        self.errorcount += 1
        print( "(%d) Error recording: %s"%(self.errorcount,e) )
        self.noisycount = 1
        return

    amplitude = get_rms( block )
    if amplitude > self.tap_threshold:
        # noisy block
        self.quietcount = 0
        self.noisycount += 1
        if self.noisycount > OVERSENSITIVE:
            # turn down the sensitivity
            self.tap_threshold *= 1.1
    else:            
        # quiet block.

        if 1 <= self.noisycount <= MAX_TAP_BLOCKS:
            self.tapDetected()
        self.noisycount = 0
        self.quietcount += 1
        if self.quietcount > UNDERSENSITIVE:
            # turn up the sensitivity
            self.tap_threshold *= 0.9

if __name__ == "__main__":
tt = TapTester()

for i in range(1000):
    tt.listen()

它来自这篇文章:[Detect tap with pyaudio from live mic

您可以轻松地对其进行调整，将 RMS 放入表格中并绘制表格。

关于python - PyAudio - 将stream.read转换为int以获得幅度，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/36413567/

25

4

0

文章推荐： javascript - CSS 动画在 HTML 注入(inject)时重新启动

文章推荐： python - 我可以在 OS X 上使用 Numba 吗？

文章推荐： python - 获取 IMDbPY 中的所有列表 movieID

文章推荐： javascript - 这是一种在 Grafana 行周围放置边框的方法吗？

Scala: (Int, Int) => Int 不匹配 (Int, Int) => Int
我正在尝试使用 y 组合器在 Scala 中定义 gcd: object Main { def y[A,B]( f : (A => B) => A => B ) : A => B = f(y(f)
c++ - 无法将 int (*(int))(int) 转换为 int (*(int))(int)
我正在尝试了解返回指向函数的指针的函数，在我尝试编译代码后，它给了我这种错误: cannot convert int (*(int))(int) to int (*(int))(int) in ass
java - BufferedImage.getRGB(int, int, int, int, int[], int, int) 如何工作？
所以我一直在关注 youtube 上的游戏编程教程，然后弹出了这段代码:bufferedImageObject.getRGB(int, int, int, int, int[], int, int);
c# - 将格式化的日期字符串转换为 DateTime(int,int,int,int,int,int) 以传递给函数
我正在将时间现在与存储在数据库某处的时间进行比较。数据库中存储的时间格式为“yyyyMMddHHmmss”。例如，数据库可能会为存储的时间值返回 201106203354。然后我使用一个函数将时间现
java - 如何以这种格式编写java模式 : any characters (int, int) (int,int) number number any number of (int,int,int)
例如 Maze0.bmp (0,0) (319,239) 65 120 Maze0.bmp (0,0) (319,239) 65 120 (254,243,90) Maze0.bmp (0,0) (
haskell - 理解类型错误 : "expected signature Int*Int->Int but got Int*Int->Int"
评论 Steve Yegge的post关于 server-side Javascript开始讨论语言中类型系统的优点和这个 comment描述: ... examples from H-M style
c - int(*function)(int,int) 和 int*function(int,int) 的区别
我正在研究 C 的指针，从 Deitel 的书中我不明白 int(*function)(int,int) 和 int*function(int, int) 表示函数时。最佳答案 C 中读取类型的经验
java - joda new DateTime(int，int，int，int，int，int)的问题
您好，我使用 weblogic 11g 创建 war 应用程序，我对 joda time 的方法有疑问 new DateTime(int, int, int, int, int, int); 这抛出了
java - 方法 sum(int, int, int, int) 不适用于参数 (int)
Create a method called average that calculates the average of the numbers passed as parameters. The
swift - 二元运算符 "=="不能应用于 (Int, Int, Int, Int) -> Int 类型的操作数
var a11: Int = 0 var a12: Int = 0 var a21: Int = 0 var a22: Int = 0 var valueDeterminant = a11 * a12
c++ - 阿杜伊诺错误 : too few arguments to function 'int getMode(int, int, int, int, int)'
我正在为一个项目设置 LED 阵列。我得到了一个 LED 阵列，可以根据引脚变化电压进行更改，但我无法添加更多引脚。当我尝试时，编译失败并显示错误:函数“int getMode(int, int,
haskell - 创建 Int 和函数列表 Int -> Int -> Int
除了创建对列表执行简单操作的函数之外，我对 haskell 还是很陌生。我想创建一个列表，其中包含 Int 类型的内容, 和 Int -> Int -> Int 类型的函数. 这是我尝试过的: dat
Java-高效地执行 .setBounds(int, int, int, int);
这个问题已经有答案了: Java add buttons dynamically as an array [duplicate] (4 个回答) 已关闭 7 年前。 StackOverFlow问题今天
android - setCompoundDrawablesWithIntrinsicBounds(int，int，int，int)不起作用
我有几个 EditText View ，我想在其中设置左侧的图像，而 setCompoundDrawablesWithIntrinsicBounds 似乎不起作用。图形似乎没有改变。有人知道为什么会
c++ - 为什么 `is_constructible, int(*)(int,int)>::value`在VC2015RC下为true
#include using namespace std; int main() { static_assert(is_constructible, int(*)(int,int)>::val
java - Kotlin:用 Pair 调用 (Int, Int) -> Int 的惯用方式？
fun sum(a: Int, b: Int) = a + b val x = 1.to(2) 我在找: sum.tupled(x)，或者 sum(*x) 当然，以上都不能用 Kotlin 1.1.3
ios - 类型 "Int -> Bool","Int-> Bool -> Int","Int-> String -> Int－> Bool"
有一个函数: func (first: Int) -> Int -> Bool -> String { return ? } 返回值怎么写？我对上面 func 的返回类型感到很困惑。最
ocaml - OCaml 求和类型中的 int * int 与 (int * int)
type foo = A of int * int | B of (int * int) int * int 和 (int * int) 有什么区别？我看到的唯一区别在于模式匹配: let test_
java - 找不到符号方法drawImage(SlidingBlockModel, int, int, int, int, )
我正在尝试制作一个 slider 游戏。在这个类中，我使用 Graphics 对象 g2 的 drawImage 方法来显示“拼图”的 block 。但在绘制类方法中，我收到此错误:找不到符号方法dr
c# - int int.operator(int left, int right) &
我试着理解这个表达: static Func isOdd = i => (i & 1) == 1; 但是这是什么意思呢？例如我有 i = 3。然后 (3 & 1) == 1 或 i = 4。然后

首页

博学

6Ren·AI

商城

python - PyAudio - 将stream.read转换为int以获得幅度