
python-3.x - librosa.util.exceptions.ParameterError: Invalid shape for monophonic audio: ndim=2, shape=(1025, 5341)

Reposted · Author: 行者123 · Updated: 2023-12-02 22:31:28

I am trying to use Python to separate the vocals in an audio file from the background noise, and then extract MFCC features,

but I get the error "librosa.util.exceptions.ParameterError: Invalid shape for monophonic audio: ndim=2, shape=(1025, 5341)".
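The shape in the error message is itself a clue: with librosa's default `n_fft=2048`, the STFT of a mono signal has `1 + n_fft // 2 = 1025` frequency bins, so a `(1025, 5341)` array is a spectrogram, not the 1-D audio signal that `mfcc` expects. A minimal arithmetic check (assuming the default `n_fft`):

```python
n_fft = 2048  # librosa's default FFT size
# an STFT of mono audio has shape (1 + n_fft // 2, n_frames)
n_bins = 1 + n_fft // 2
print(n_bins)  # prints 1025 -- matching the first dimension in the error
```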

Here is the code:

from __future__ import print_function
import numpy as np
import matplotlib.pyplot as plt
import librosa

import librosa.display

import scipy
from scipy.io.wavfile import read, write
import soundfile as sf
from sklearn.preprocessing import normalize
from scipy.fftpack import rfft, irfft

y, sr = librosa.load('/home/osboxes/Desktop/AccentReco1/audio-files/egyptiansong.mp3', duration=124)

y=rfft(y)

# And compute the spectrogram magnitude and phase
S_full, phase = librosa.magphase(librosa.stft(y))


# We'll compare frames using cosine similarity, and aggregate similar frames
# by taking their (per-frequency) median value.
#
# To avoid being biased by local continuity, we constrain similar frames to be
# separated by at least 2 seconds.
#
# This suppresses sparse/non-repetitive deviations from the average spectrum,
# and works well to discard vocal elements.

S_filter = librosa.decompose.nn_filter(S_full,
                                       aggregate=np.median,
                                       metric='cosine',
                                       width=int(librosa.time_to_frames(2, sr=sr)))

# The output of the filter shouldn't be greater than the input
# if we assume signals are additive. Taking the pointwise minimum
# with the input spectrum forces this.
S_filter = np.minimum(S_full, S_filter)

# We can also use a margin to reduce bleed between the vocals and instrumentation masks.
# Note: the margins need not be equal for foreground and background separation
margin_i, margin_v = 2, 10
power = 2

mask_i = librosa.util.softmask(S_filter,
                               margin_i * (S_full - S_filter),
                               power=power)

mask_v = librosa.util.softmask(S_full - S_filter,
                               margin_v * S_filter,
                               power=power)

# Once we have the masks, simply multiply them with the input spectrum
# to separate the components

S_foreground = mask_v * S_full
S_background = mask_i * S_full

# extract mfcc feature from data
mfccs = np.mean(librosa.feature.mfcc(y=S_foreground, sr=sr, n_mfcc=40).T,axis=0)
print(mfccs)
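For reference, `librosa.util.softmask` with a finite `power` is essentially the elementwise ratio `X**p / (X**p + X_ref**p)`. A numpy-only sketch of that core formula (an approximation: librosa additionally rescales its inputs and handles all-zero entries, which is omitted here):

```python
import numpy as np

def softmask_sketch(X, X_ref, power=2):
    # elementwise X**p / (X**p + X_ref**p); larger X relative to X_ref
    # pushes the mask toward 1, equal magnitudes give 0.5
    return X**power / (X**power + X_ref**power)

S = np.array([[3.0, 1.0]])
N = np.array([[1.0, 1.0]])
mask = softmask_sketch(S, N)
print(mask)  # [[0.9 0.5]]
```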

Any ideas?

Best Answer

You are trying to compute MFCCs of a spectrogram.

You must convert it back to audio samples using the inverse STFT.

from librosa.core import istft
vocals = istft(S_foreground)

Regarding "python-3.x - librosa.util.exceptions.ParameterError: Invalid shape for monophonic audio: ndim=2, shape=(1025, 5341)", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/51753936/
