python - 如何在 python 中将 pocketsphinx (5prealpha) 与 gstreamer-1.0 一起使用？-6ren

python - 如何在 python 中将 pocketsphinx (5prealpha) 与 gstreamer-1.0 一起使用？

转载作者：行者123 更新时间：2023-11-28 17:29:48

26

4

我正在尝试创建一个小的 Python 脚本，它将通过网络接收音频流，通过 pocketspinx 将其馈送以将语音转换为文本，并根据 pocketsphinx 的输出运行一些命令。

我已经在 Ubuntu 15.10 虚拟机上安装了 sphinxbase 和 pocketsphinx (5prealpha)，并且能够在 Python 中正确处理示例音频文件(pocketsphinx 安装的一部分)的内容。所以我有理由相信我的 sphinx 安装工作正常。不幸的是，测试 python 脚本无法处理连续音频并使用 native pocketsphinx API。根据 cmusphinx 网站，我应该使用 gstreamer 进行连续翻译。不幸的是，关于如何在 Python 中将 pocketsphinx 与 gstreamer 结合使用的信息相当有限。根据我能找到的示例，我拼凑了以下脚本。

import gi
gi.require_version('Gst', '1.0')
from gi.repository import GObject, Gst
GObject.threads_init()
Gst.init(None)

def element_message( bus, msg ):
         msgtype = msg.get_structure().get_name()
         if msgtype != 'pocketsphinx':
                 return
         print "hypothesis= '%s'  confidence=%s\n" % (msg.get_structure().get_value('hypothesis'), msg.get_structure().get_value('confidence'))

pipeline = Gst.parse_launch('udpsrc port=3000 name=src caps=application/x-rtp ! rtppcmadepay name=rtpp ! alawdec name=decoder ! queue ! pocketsphinx name=asr ! fakesink')

asr = pipeline.get_by_name("asr")
asr.set_property("configured", "true")

bus = pipeline.get_bus()
bus.add_signal_watch()
bus.connect('message::element', element_message)

pipeline.set_state(Gst.State.PLAYING)

# enter into a mainloop
loop = GObject.MainLoop()
loop.run()

发送方看起来像:

import gobject, pygst
pygst.require("0.10")
import gst

pipeline = gst.parse_launch('alsasrc ! audioconvert ! audioresample ! alawenc ! rtppcmapay ! udpsink  port=3000 host=192.168.13.120')
pipeline.set_state(gst.STATE_PLAYING)
loop = gobject.MainLoop()
loop.run()

这应该从网络接收一个 udp 流，将其馈送到 pocketsphinx 并将输出打印到终端。如果我更换“队列!口袋狮身人面像! fakesink'部分由'wavenc! filesink'，我确实得到了一个内容正确的有效音频文件，所以我知道网络发送部分工作正常。 (我的测试机器上没有音频，所以我无法使用本地音频源进行测试)。

当我启动脚本时，我看到 pocketspinx 配置经过，但随后脚本似乎不再执行任何操作。当我使用 GST_DEBUG=*:4 启动脚本时，我看到以下输出:

0:00:04.789157687  2220      0x86fff70 INFO               GST_EVENT gstevent.c:760:gst_event_new_segment: creating segment event time segment start=0:00:00.000000000, offset=0:00:00.000000000, stop=99:99:99.999999999, rate=1.000000, applied_rate=1.000000, flags=0x00, time=0:00:00.000000000, base=0:00:00.000000000, position 0:00:00.000000000, duration 99:99:99.999999999
0:00:04.789616981  2220      0x86fff70 INFO                 basesrc gstbasesrc.c:2838:gst_base_src_loop:<src> marking pending DISCONT
0:00:04.789995780  2220      0x86fff70 INFO               GST_EVENT gstevent.c:760:gst_event_new_segment: creating segment event time segment start=0:00:00.000000000, offset=0:00:00.000000000, stop=99:99:99.999999999, rate=1.000000, applied_rate=1.000000, flags=0x00, time=0:00:00.000000000, base=0:00:00.000000000, position 0:00:04.079311489, duration 99:99:99.999999999
0:00:04.790420834  2220      0x86fff70 INFO               GST_EVENT gstevent.c:679:gst_event_new_caps: creating caps event audio/x-raw, format=(string)S16LE, layout=(string)interleaved, rate=(int)8000, channels=(int)1
0:00:04.790851965  2220      0x86fff70 WARN                GST_PADS gstpad.c:3989:gst_pad_peer_query:<decoder:src> could not send sticky events
0:00:04.791258320  2220      0x86fff70 WARN                 basesrc gstbasesrc.c:2943:gst_base_src_loop:<src> error: Internal data flow error.
0:00:04.791572605  2220      0x86fff70 WARN                 basesrc gstbasesrc.c:2943:gst_base_src_loop:<src> error: streaming task paused, reason not-negotiated (-4)
0:00:04.791917073  2220      0x86fff70 INFO        GST_ERROR_SYSTEM gstelement.c:1837:gst_element_message_full:<src> posting message: Internal data flow error.
0:00:04.792305347  2220      0x86fff70 INFO        GST_ERROR_SYSTEM gstelement.c:1860:gst_element_message_full:<src> posted error message: Internal data flow error.
0:00:04.792633841  2220      0x86fff70 INFO                    task gsttask.c:315:gst_task_func:<src:src> Task going to paused

根据我在谷歌上找到的信息和示例，我不明白出了什么问题。

如有任何帮助，我们将不胜感激。

妮可

最佳答案

Gstreamer 元素需要 16000 khz 音频，您正在尝试超过 8000。您必须修改 pocketsphinx 源以在 pocketsphinx 元素中启用 8000。您需要更新元素规范率、pocketsphinx 的采样率配置参数和声学模型。

或者，您需要通过网络发送宽带音频。在这种情况下，您不应使用非法编解码器。

关于python - 如何在 python 中将 pocketsphinx (5prealpha) 与 gstreamer-1.0 一起使用？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/35232989/

26

4

0

文章推荐： python - 从 ManyToMany 在 admin.py 中添加字段

文章推荐： javascript - 更改给定脚本的 html

文章推荐： html - 将页眉和页脚添加到布局

文章推荐： javascript - jQuery 计数字符达到限制

Python pocketsphinx RequestError : missing PocketSphinx module: ensure that PocketSphinx is set up correctly
我正在尝试制作一个 Python 应用程序，它可以使用 PyAudio、SpeechRecognition 和 PocketSphinx 录制音频并将其翻译成英文文本。我在 Mac OS X El C
ubuntu - 错误 : Failed building wheel for pocketsphinx while trying to install pocketsphinx with pip
我正在尝试安装 pocketsphinx 并出现以下错误: jandornhege@JanDornhegeUbuntu:~/Hermes/Basefunktions$ pip install pock
来自麦克风的连续语音识别 pocketsphinx
如何使用 pocketsphinx 从麦克风执行连续语音识别。如何在 C 中使用 gstreamer 插件 api？最佳答案 how to perform continuous speech rec
cmusphinx - Pocketsphinx 如何确定不符合语法的单词
我目前正在使用 Pocketsphix 演示(android 和 Visual Studio 2010)并且我已经配置了一个 jsgf 语法像这样 #JSGF V1.0; grammar Nam
sphinx - Pocketsphinx-添加单词并提高准确性
我终于设法构建并运行Pocketsphinx（pocketsphinx_continuous）。我遇到的问题是如何提高准确性。据我了解，您可以指定一个字典文件（-dict test.dic）。因此，我
cmusphinx - pocketsphinx 中配置解码器的参数
我开始研究口袋狮身人面像。我有一个用于配置解码器的可能参数列表。但没有说明哪个参数负责哪个配置。在 tutorial CMUSphinx这只是其中的一小部分。这对我来说还不够。有人有资料，解释了哪些参
python - 树莓派 + PocketSphinx
我正在尝试使用带有 PyDev 插件的 Eclipse (Juno) IDE 在 Windows 8 上开发 python 应用程序。我已经设置了环境，并配有解释器。能够运行诸如“hello, wo
python - pocketsphinx 的正确导入查询是什么？
关闭。这个问题是not reproducible or was caused by typos .它目前不接受答案。这个问题是由于错别字或无法再重现的问题引起的。虽然类似的问题可能是on-topi
c++ - Pocketsphinx 为假设返回空字符串
我在自定义 C++ 应用程序中使用 pocketshpinx 进行语音识别。我注意到有时 ps_get_hyp() 方法返回的假设字符串是一个空字符串。问题:这是预期的行为吗？如果是这样，有没有办法
android - Pocketsphinx - 完善热词检测
我重新访问了 CMU Sphinx最近并尝试为 Android 设置一个基本的热词检测器，从 tutorial 开始并调整 sample application . 我遇到了各种问题，尽管深入研究了他
speech-recognition - pocketsphinx - 如何从关键字识别切换到语法模式
我正在使用 pocketsphinx 和树莓派来实现家庭自动化。我用支持的命令编写了一个简单的 JSGF 语法文件。现在，我想在命令之前使用激活短语，例如“嘿计算机”，以避免错误检测，并且仅在说出激活
speech-recognition - 正确配置 Pocketsphinx
我一直在运行 Debian Squeeze 的虚拟机上安装 Pocketsphinx0.7。这工作得很好，我可以尝试识别文件中的语音。有了这个，我构建了一些 python 脚本，这些脚本可以识别我得到
speech-recognition - pocketsphinx 之前的降噪降低了识别准确率
我正在尝试提高 pocketsphinx 在嘈杂环境中的识别准确率。但是，用户可能会在可变环境中使用该应用程序。因此，噪声训练不是我想做的事情。我的问题是，在将语音信号输入 pocketsphinx
cmusphinx - Pocketsphinx setKeywordThreshold() 问题
我正在考虑为我的应用程序使用 Pocketsphinx 离线语音识别，但它的文档不清楚。如果有人可以给出以下问题的答案，那么它真的会对我有很大帮助。 setKeywordThreshold(1e-5f
c - 构建 pocketsphinx 应用程序的未解析符号
我尝试开始使用 pocketsphinx 但收到此错误: gcc -I /home/noahchalifour/libraries/pocketsphinx/include -I /home/noah
java - Pocketsphinx Demo 中空指针异常切换到搜索
我想在 android 中开发一个语音识别器。我用过 this thread和 this video在 Android 设备中使用语音识别。这是我的代码: MainActivity.java: pa
android - pocketsphinx android 动态语法
我正在开发一个使用 pocketsphinx 的 android 应用程序。不幸的是识别准确率很差，因此我想将语法中的单词限制为真正需要的单词。目前我使用的是 pocketsphinx 的演示应用程
android - PocketSphinx native 崩溃
在某些设备(不是实际测试设备)上，当我开始 PocketSphinx 识别时，我得到了一个强制关闭。我正在尝试从其中一台设备获取日志文件，但这很困难，因为我测试过的设备都没有出现此错误。让我知道从我的
android - PocketSphinx android 即使没有我说话也能显示结果
我在 android 上使用 pocketsphinx 来识别关键字，但它无法识别所需关键字以外的任何其他关键字。而且它甚至不等我说话并在 logcat 中显示关键字。这是我的代码: public
python - python pocketsphinx 中的单词时间戳
我在 ubuntu 11.10 上使用 pocketsphinx python 绑定(bind)。我将语法和音频文件传递给它，它运行良好。我现在正在寻找音频文件中每个单词的时间戳。我见过: void

首页

博学

6Ren·AI

商城

python - 如何在 python 中将 pocketsphinx (5prealpha) 与 gstreamer-1.0 一起使用？