android - Pocketsphinx - 完善热词检测-6ren

android - Pocketsphinx - 完善热词检测

转载作者：搜寻专家更新时间：2023-11-01 09:42:32

我重新访问了 CMU Sphinx最近并尝试为 Android 设置一个基本的热词检测器，从 tutorial 开始并调整 sample application .

我遇到了各种问题，尽管深入研究了他们的文档，但我一直无法解决这些问题，直到我无法阅读更多...

为了复制它们，我做了一个基本项目，旨在检测关键字wakeup you 和wakeup me .

我的字典:

me M IY
wakeup W EY K AH P
you Y UW

我的语言模型:

\data\
ngram 1=5
ngram 2=5
ngram 3=4

\1-grams:
-0.9031 </s> -0.3010
-0.9031 <s> -0.2430
-1.2041 me -0.2430
-0.9031 wakeup -0.2430
-1.2041 you -0.2430

\2-grams:
-0.3010 <s> wakeup 0.0000
-0.3010 me </s> -0.3010
-0.6021 wakeup me 0.0000
-0.6021 wakeup you 0.0000
-0.3010 you </s> -0.3010

\3-grams:
-0.6021 <s> wakeup me
-0.6021 <s> wakeup you
-0.3010 wakeup me </s>
-0.3010 wakeup you </s>

\end\

以上两个都是使用 suggested tool 创建的.

还有我的关键短语文件:

wakeup you /1e-20/
wakeup me /1e-20/

改编上面链接的示例应用程序，这是我的代码:

public class PocketSphinxActivity extends Activity implements RecognitionListener {

    private static final String CLS_NAME = PocketSphinxActivity.class.getSimpleName();

    private static final String HOTWORD_SEARCH = "hot_words";

    private volatile SpeechRecognizer recognizer;

    @Override
    public void onCreate(Bundle state) {
        super.onCreate(state);
        setContentView(R.layout.main);

        new AsyncTask<Void, Void, Exception>() {
            @Override
            protected Exception doInBackground(Void... params) {
                Log.i(CLS_NAME, "doInBackground");

                try {

                    final File assetsDir = new Assets(PocketSphinxActivity.this).syncAssets();

                    recognizer = defaultSetup()
                            .setAcousticModel(new File(assetsDir, "en-us-ptm"))
                            .setDictionary(new File(assetsDir, "basic.dic"))
                            .setKeywordThreshold(1e-20f)
                            .setBoolean("-allphone_ci", true)
                            .setFloat("-vad_threshold", 3.0)
                            .getRecognizer();

                    recognizer.addNgramSearch(HOTWORD_SEARCH, new File(assetsDir, "basic.lm"));
                    recognizer.addKeywordSearch(HOTWORD_SEARCH, new File(assetsDir, "hotwords.txt"));
                    recognizer.addListener(PocketSphinxActivity.this);

                } catch (final IOException e) {
                    Log.e(CLS_NAME, "doInBackground IOException");
                    return e;
                }

                return null;
            }

            @Override
            protected void onPostExecute(final Exception e) {
                Log.i(CLS_NAME, "onPostExecute");

                if (e != null) {
                    e.printStackTrace();
                } else {
                    recognizer.startListening(HOTWORD_SEARCH);
                }
            }
        }.execute();
    }

    @Override
    public void onBeginningOfSpeech() {
        Log.i(CLS_NAME, "onBeginningOfSpeech");
    }

    @Override
    public void onPartialResult(final Hypothesis hypothesis) {
        Log.i(CLS_NAME, "onPartialResult");

        if (hypothesis == null)
            return;

        final String text = hypothesis.getHypstr();
        Log.i(CLS_NAME, "onPartialResult: text: " + text);

    }

    @Override
    public void onResult(final Hypothesis hypothesis) {
        // unused
        Log.i(CLS_NAME, "onResult");
    }

    @Override
    public void onEndOfSpeech() {
        // unused
        Log.i(CLS_NAME, "onEndOfSpeech");
    }


    @Override
    public void onError(final Exception e) {
        Log.e(CLS_NAME, "onError");
        e.printStackTrace();
    }

    @Override
    public void onTimeout() {
        Log.i(CLS_NAME, "onTimeout");
    }

    @Override
    public void onDestroy() {
        super.onDestroy();
        Log.i(CLS_NAME, "onDestroy");

        recognizer.cancel();
        recognizer.shutdown();
    }
}

注意:- 我是否应该更改我选择的关键短语(和其他相关文件)以使其更加不同并且我在安静的环境中测试实现，应用的设置和阈值工作非常成功。

问题

当我说叫醒你或叫醒我时，两者都会被检测到。

我无法确定如何对结束音节应用增加的权重。

当我说只是唤醒时，通常(但不总是)两者都会被检测到。

我无法确定如何避免这种情况的发生。

在针对背景噪音进行测试时，误报过于频繁。

我不能降低我正在使用的基本阈值，否则在正常情况下无法始终如一地检测到关键短语。

如果长时间针对背景噪音进行测试(5 分钟应该足以复制)，立即返回安静的环境并说出关键短语，则不会检测到任何结果。

成功并重复检测关键短语需要一段不确定的时间——就好像测试是在安静的环境中开始的。

我找到了一个 potentially related question ，但链接不再有效。我想知道我是否应该更频繁地重置识别器，以便以某种方式重置背景噪声，使其不被平均到检测阈值？

最后，我想知道我对有限关键词的要求是否允许我减小声学模型的大小？

在我的应用程序中打包时的任何开销当然都是有益的。

最后(老实说!)，特别希望 @NikolayShmyrev会发现这个问题，有没有计划完全通过 gradle 包装基本的 Android 实现/sdk？

我感谢那些走到这一步的人......

最佳答案

My language model:

您不需要语言模型，因为您不使用它。

I can't lower the base thresholds I am using, otherwise the keyphrases are not detected consistently under normal conditions.

1e-20 是一个合理的阈值，您可以提供错误检测的样本记录，让我更好地了解发生了什么。

When testing against background noise for a long period (5 minutes should be sufficient to replicate), returning immediately to a quiet environment and uttering the keyphrases, results in no detection.

这是预期的行为。总的来说，长背景噪声使识别器更难快速适应音频参数。如果您的任务是在嘈杂的地方识别单词，最好使用某种硬件降噪功能，例如具有降噪功能的蓝牙耳机。

Finally, I wonder if my requirements for limited keyphrases, would allow me to reduce the size of the acoustic model?

现在不可能了。如果你只是为了发现你可以尝试 https://snowboy.kitt.ai

关于android - Pocketsphinx - 完善热词检测，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/39310163/

文章推荐： c - 什么是用c写的windows系统调用？

文章推荐： .net - 非托管到托管(.Net)

文章推荐： PHP文件上传

MySQL:完善“查找表中最常出现的值”的答案
昨天发布于: MySQL: Finding most frequently occuring values in table 但想知道是否有办法改进答案，因为您不必在最后重复代码来获取MAX(COUN
ruby - 递归 self 完善
在开发了一个展示基本智能的简单算法后，我渴望进行递归 self 改进，我遇到的问题是由于我对递归缺乏理解。我明白，如果我有一些东西来评估我正在使用的算法的“适应性”以进行改进(我为算法提供其自身的二
sql - 完善 SQL 二进制到 IP 地址字符串的转换
我们使用二进制 (16) 字段来存储 IP 地址。我们这样做是因为它可以同时保存 IPv4 和 IPv6 地址，并且可以轻松地与 .Net IPAddress 类一起使用。但是，我创建了以下 SQL
python - 如何润色/完善 wxPython TextCtrls 的定位和外观？
我在使用 wxPython 时遇到了一些纯粹的外观问题。例如，标签和它们所代表的控件之间的关系——一切看起来总是至少偏离目标 2-3 个像素，有时甚至更糟。在我当前正在创建的对话中，我已将文本控件的字
ios - 完善 UICollectionView 布局转换中的 subview 大小调整
我正在开发一个具有 UICollectionView 的应用程序 - Collection View 的工作是显示来自网络服务的数据。我正在尝试实现的应用程序的一个功能是使用户能够将此 UIColl
c# - 玻璃 : Solution found, 上的渲染控件需要双缓冲/完善
我(终于!)找到了一种在玻璃上呈现 Windows.Forms 控件的方法，它似乎没有任何重大缺点，也没有任何大的实现时间。它的灵感来自 this article来自 Coded，它基本上解释了如何

搜寻专家

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

android - Pocketsphinx - 完善热词检测