android - 有可能使 SpeechRecognizer 更快吗？-6ren

android - 有可能使 SpeechRecognizer 更快吗？

转载作者：塔克拉玛干更新时间：2023-11-03 01:15:23

我正在开发一个使用 android SpeechRecognizer 的应用程序。我用它来做一些简单的事情。我点击一个按钮，我的 SpeechRecognizer 开始收听，我从我所说的内容中得到了一些结果。

简单吧？好吧，我的问题是我需要快速制作 SpeechRecognizer。我的意思是，我单击我的按钮，说“你好”，然后 SpeechRecognizer 需要大约 3-4 秒的时间返回一个包含可能结果的数组。我的问题是:

是否可以让 SpeechRecognizer 更快地返回结果？或者花更少的时间关闭 Listening Intent 并开始处理它所听的内容？也许另一种方法来做到这一点？哪个性能会比这个更好？

我在检查库时看到了这 3 个参数:

EXTRA_SPEECH_INPUT_MINIMUM_LENGTH_MILLIS:

The amount of time that it should take after we stop hearing speech to consider the input complete.

EXTRA_SPEECH_INPUT_COMPLETE_SILENCE_LENGTH_MILLIS

The minimum length of an utterance.

EXTRA_SPEECH_INPUT_POSSIBLY_COMPLETE_SILENCE_LENGTH_MILLIS

The amount of time that it should take after we stop hearing speech toconsider the input possibly complete.

http://developer.android.com/intl/es/reference/android/speech/RecognizerIntent.html

我已经尝试了所有这些方法，但都不起作用，或者我没有正确使用它们。这是我的代码:

public class MainActivity extends Activity {
private static final String TIME_FORMAT = "%02d:%02d:%02d";
private final String TAG = "MainActivity";

private StartTimerButton mSpeakButton;
private CircleProgressBar mCountdownProgressBar;
private CountDownTimer mCountDownTimer;
private TextView mTimer;
private int mRunSeconds = 0;
private SpeechRecognizer mSpeechRecognizer;
private Intent mSpeechRecognizerIntent;
private boolean mIsListening = false;

@Override
protected void onCreate(Bundle savedInstanceState) {
    super.onCreate(savedInstanceState);
    setContentView(R.layout.activity_main);
    mRunSeconds = 0;
    mTimer = (TextView) findViewById(R.id.timerText);
    mCountdownProgressBar = (CircleProgressBar) findViewById(R.id.progressBar);
    mSpeechRecognizer = SpeechRecognizer.createSpeechRecognizer(this);
    mSpeechRecognizerIntent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);
    mSpeechRecognizerIntent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL,
            RecognizerIntent.LANGUAGE_MODEL_FREE_FORM);
    mSpeechRecognizerIntent.putExtra(RecognizerIntent.EXTRA_CALLING_PACKAGE,
            this.getPackageName());

//          mSpeechRecognizerIntent.putExtra(RecognizerIntent.EXTRA_SPEECH_INPUT_MINIMUM_LENGTH_MILLIS,
//                1000);
//        mSpeechRecognizerIntent.putExtra(RecognizerIntent.EXTRA_SPEECH_INPUT_COMPLETE_SILENCE_LENGTH_MILLIS,
//                1000);
//        mSpeechRecognizerIntent.putExtra(RecognizerIntent.EXTRA_SPEECH_INPUT_POSSIBLY_COMPLETE_SILENCE_LENGTH_MILLIS,
//                1000);

    SpeechRecognitionListener listener = new SpeechRecognitionListener();
    mSpeechRecognizer.setRecognitionListener(listener);
    mSpeakButton = (StartTimerButton) findViewById(R.id.btnSpeak);
    mSpeakButton.setReadyState(false);
    mSpeakButton.setOnClickListener(new View.OnClickListener() {

        @Override
        public void onClick(View v) {
            if (mSpeakButton.isReady()) {
                if (!mIsListening)
                    mSpeechRecognizer.startListening(mSpeechRecognizerIntent);
            } else
                mSpeakButton.setReadyState(true);
        }
    });

}     

@Override
public boolean onCreateOptionsMenu(Menu menu) {
    // Inflate the menu; this adds items to the action bar if it is present.
    return true;
}

public void onSpeechResults(ArrayList<String> matches) {
    for (String match : matches) {

        match = match.toLowerCase();
        Log.d(TAG, "Got speech: " + match);

        if (match.contains("go")) {
            //Do Something
            mSpeechRecognizer.stopListening();
        }
        if (match.contains("stop")) {
            //Do Something
            mSpeechRecognizer.stopListening();
        }
    }
}

protected class SpeechRecognitionListener implements RecognitionListener
{

    @Override
    public void onBeginningOfSpeech()
    {
        //Log.d(TAG, "onBeginingOfSpeech");
    }

    @Override
    public void onBufferReceived(byte[] buffer)
    {

    }

    @Override
    public void onEndOfSpeech()
    {
        //Log.d(TAG, "onEndOfSpeech");
    }

    @Override
    public void onError(int error)
    {
        mSpeechRecognizer.startListening(mSpeechRecognizerIntent);

        //Log.d(TAG, "error = " + error);
    }

    @Override
    public void onEvent(int eventType, Bundle params)
    {

    }

    @Override
    public void onPartialResults(Bundle partialResults)
    {
        ArrayList<String> matches = partialResults.getStringArrayList(SpeechRecognizer.RESULTS_RECOGNITION);
        for (String match : matches) {
            match = match.toLowerCase();
            Log.d(TAG, "onPartialResults : " + match);
        }
    }

    @Override
    public void onReadyForSpeech(Bundle params)
    {
        Log.d(TAG, "onReadyForSpeech"); //$NON-NLS-1$
    }

    @Override
    public void onResults(Bundle results)
    {
        //Log.d(TAG, "onResults"); //$NON-NLS-1$
        ArrayList<String> matches = results.getStringArrayList(SpeechRecognizer.RESULTS_RECOGNITION);
        onSpeechResults(matches);
        // matches are the return values of speech recognition engine
        // Use these values for whatever you wish to do
    }

    @Override
    public void onRmsChanged(float rmsdB)
    {
    }
}}

最佳答案

是的，可以减少关机前的延迟....

您无法更改 Google 认为在用户讲话结束时处于静音状态的时间长度。 EXTRA_SPEECH_* 参数曾经有效，现在它们似乎充其量只是偶尔有效，或者根本无效。

您可以做的是使用部分结果来检测您想要的单词或短语，然后手动关闭识别服务。

这是一个如何做到这一点的例子:

public boolean isHelloDetected(@NonNull final Context ctx, @NonNull final Locale loc, @NonNull final Bundle results) {

        boolean helloDetected = false;

        if (!results.isEmpty()) {

            final String hello = ctx.getString(R.string.hello);

            final ArrayList<String> partialData = results.getStringArrayList(SpeechRecognizer.RESULTS_RECOGNITION);

                /* handles empty string bug */
            if (partialData != null && !partialData.isEmpty()) {
                partialData.removeAll(Collections.singleton(""));

                if (!partialData.isEmpty()) {
                    final ListIterator<String> itr = partialData.listIterator();

                    String vd;
                    while (itr.hasNext()) {
                        vd = itr.next().toLowerCase(loc).trim();

                        if (vd.startsWith(hello)) {
                            helloDetected = true;
                            break;
                        }
                    }
                }
            }

            if (!helloDetected) {
                final ArrayList<String> unstableData = results.getStringArrayList("android.speech.extra.UNSTABLE_TEXT");

                    /* handles empty string bug */
                if (unstableData != null && !unstableData.isEmpty()) {
                    unstableData.removeAll(Collections.singleton(""));

                    if (!unstableData.isEmpty()) {
                        final ListIterator<String> itr = unstableData.listIterator();

                        String vd;
                        while (itr.hasNext()) {
                            vd = itr.next().toLowerCase(loc).trim();

                            if (vd.startsWith(hello)) {
                                helloDetected = true;
                                break;
                            }
                        }
                    }
                }
            }
        }

        return helloDetected;
    }

每次从 onPartialResults()

接收到信息时，您都会运行此方法

如果返回 true，则需要在主线程上调用 stopListening()(可能通过 new Handler(Looper.getMainLooper()))。post(...

不过请注意，一旦您关闭识别器，您在 onResults() 中收到的后续结果和最终结果可能不包含“hello”。因为那个词可能只被归类为不稳定的。

您需要编写额外的逻辑来防止在检测到 hello 后使用 detectHello()(否则您将重复调用 stopListening())- 一些简单的 bool 标记可以解决这个问题。

最后，使用 Collections.singleton("") 删除空字符串是内部错误报告的一部分，details to replicate here并且使用 ListIterator 可能对您的样本来说太过分了；一个简单的 for 循环就足够了。

祝你好运。

关于android - 有可能使 SpeechRecognizer 更快吗？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/36947200/

文章推荐： c++ - 代理调用功能如何工作？

文章推荐： python - 在用户接受的 python 中验证 `su` 密码

文章推荐： c++ - 如何使迭代器成为 "automatically dereferenced"？

android - SpeechRecognizer 突然无法离线工作
SpeechRecognizer 正常工作了很长时间，突然它不工作了，现在没有检测到任何东西.. 示例代码是这样的。当它在线时它可以正常工作但离线时不能，我错过了吗什么地方都有？？？ Spee
Android SpeechRecognizer 录音错误
我正在尝试在没有 Intent Dialog 的情况下在 Android 上使用 SpeechRecognizer。它在大多数设备上都可以正常工作，但有些设备会返回音频录制错误(错误代码 3)并且没有
audio - SpeechRecognizer WinRT不需要的音频输出
我正在编写WinRT应用，并在VS 2015，x86调试中对其进行测试。我正进入(状态我的应用程序未创建不需要的音频输出。这是我的代码: using Windows.Media.SpeechReco
Android SpeechRecognizer 准确性差
我在 Activity 中使用 Native SpeechRecognizer 服务，它“有效”，但我遇到了一些严重的问题。返回的“语音转文本”坦率地说非常令人震惊，而且非常非常差，在应用程序中肯定
android - SpeechRecognizer 未启动
简介我需要在我的代码中实现语音识别。我按照这里的其他帖子和一些教程来获取它，但它不适合我。方法这是在 onCreate 中初始化它的代码: Log.d("SPEECH", "speech rec
Android SpeechRecognizer 在第二次收听时失败
我遇到一个问题，SpeechRecognizer 将正确收听，当我下次收听它时转到 SpeechRecognizer.ERROR_NO_MATCH，识别器立即无法 SpeechRecognizer.E
Android SpeechRecognizer 没有重新开始
我知道 SpeechRecognizer 在 android 中是如何工作的。我有一个要求，我需要在一段时间后调用 SpeechRecognizer.stopListening() 方法。但在那之后，
ios - speechRecognizer 不返回答案
我正在学习有关 iOS 10 语音识别 API 的一些教程 (https://code.tutsplus.com/tutorials/using-the-speech-recognition-api-
android - SpeechRecognizer - 时间限制
我正在使用 SppechRecognizer用于语音识别器应用程序。它工作正常。我的要求是我想在 1 秒或 2 秒后停止收听语音。如何实现？最佳答案 1 或 2 秒似乎不是很多时间，但如果你想设置一
java - 在通话期间运行 SpeechRecognizer
有没有办法在通话时运行 SpeechRecognizer？我是这样做的: BroadcastReceiver 处理电话状态的变化(例如摘机)。 SpeechRecognizer 在当前(主)线程中启动
java - SpeechRecognizer 在启动后立即不匹配
我正在尝试在我的 Android 应用程序中使用 SpeechRecognizer 库，到目前为止，它的工作给我留下了很多问题。首先，当我停止说话时它不会停止。如果我试图停止语音识别自己，下次它会给我
java - 如何查询默认的 SpeechRecognizer
如何找到默认系统语音识别器的 ComponentName，即 createSpeechRecognizer(Context context) 时返回的那个叫做？ (其实我只需要知道它支持哪些输入语言，
Android SpeechRecognizer 网络错误
我正在尝试在 Android 4.4 中创建连续语音识别，简单地在 TextView 中显示口语，就像命令一样。我遵循了多个教程，例如 https://github.com/fcrisciani/an
android - SpeechRecognizer 在第一个结果后没有听到
我在 Android 中使用 SpeechRecognizer 和 RecognizerIntent 来实现语音识别。我的目标是在我的语音识别器在屏幕上显示结果后重新开始收听语音。为此，我使用了以下代
安卓， Kotlin 。 SpeechRecognizer 由于某种原因无法正常工作
我正在使用名为 SpeechRecognizer 的 Android 语音 API 尝试将语音翻译成文本，但由于某种原因，只要我点击按钮 - 我就会看到消息 “目前无法访问谷歌” ，不等我讲话， wi
java - 使用 SpeechRecognizer 获取已识别数据的转录
我使用 SpeechRecognizer 来实现“语音到文本”功能，她的工作结果是文本数据: Intent intent = new Intent(RecognizerIntent.ACTION_RE
java - 没有从 SpeechRecognizer 类获得结果？
我有一个 maind UI 类，其中有一个按钮实例化一个类，该类正在实现 SpeechRecognizer 库以将语音从文本转换为文本。 btnSpeak = (ImageButton) findVi
java - SpeechRecognizer onResults 未触发
我有一个相当奇怪的问题。我有一个 Android 应用程序，我使用 SpeechRecognizer 类为其添加语音识别。我创建了实现 RecognitionListener 的类，它只打印每个事件的
C# UWP SpeechRecognizer 问题
我正在开发 UWP 并想使用 SpeechRecognizer。它应该只对“Next”和“Back”这两个词使用react。但通常，它会将“NExt”识别为“Back”。我的代码如下。如何解决？ va
java - SpeechRecognizer 什么都不做......我错过了什么
请看下面的代码。 onBeginningOfSpeech() 被调用(甚至在我开始说话之前，顺便说一句)，但随后 - 没有。我错过了什么？我承认代码大部分是由不同的例子组成的，我没有完全理解。但我希

塔克拉玛干

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

android - 有可能使 SpeechRecognizer 更快吗？