jquery - Microsoft Azure Cognitive Project Oxford Speech API text-to-speech jQuery REST example

Reposted by: 行者123. Last updated: 2023-12-03 04:42:07

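Can anyone provide a working jQuery REST code example for the Azure Project Oxford Speech API?

I have already set up an Azure app service and a key. I just need a simple prototype page that plays some specific text when it loads.

Can this be done purely on the client side with JavaScript/jQuery and REST, without any server-side code?

Also, I installed the server-side sample, but it only plays audio when run from localhost. There are no errors, but it does not play from the Azure website.

Update: using only client-side JS code, I can authenticate, and I receive data that starts with RIFF AWAVEfmt >}, but I can't figure out how to play it from the browser. I don't get any errors.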

    $.ajax({
        url: ttsServiceUri,
        beforeSend: function (xhrObj) {
            xhrObj.setRequestHeader("Content-Type", "application/ssml+xml");
            xhrObj.setRequestHeader("X-Microsoft-OutputFormat", "riff-16khz-16bit-mono-pcm");
            xhrObj.setRequestHeader("Authorization", "Bearer " + response.access_token);
            xhrObj.setRequestHeader("User-Agent", "TTSNodeJS");
            xhrObj.setRequestHeader("X-Search-AppId", "xxxxxxxxxxxDAA29772419F436CA");
            xhrObj.setRequestHeader("X-Search-ClientID", "xxxxxxxxxxxx1A480F00935DC390960");
        },
        data: post_data,
        type: "POST"
    })
    .done(function (response) {
        var audio = new Audio(response);
        audio.play();
    });

Thanks.
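A note on the update above: the `RIFF ... WAVEfmt` bytes are the start of a WAV file header, so the service is returning audio. `new Audio(...)` expects a URL rather than the audio bytes themselves, and `$.ajax` by default hands the response back as text, which is likely to damage binary data. The following is a minimal sketch, not a tested fix for this exact service: it assumes the response has been obtained as an `ArrayBuffer` (for example with `XMLHttpRequest` and `responseType = 'arraybuffer'`). The names `isRiffWave` and `playWav` are hypothetical helpers, not part of any API.

```javascript
// Hypothetical helper: returns true when the buffer starts with a
// RIFF/WAVE header (bytes 0-3 are "RIFF", bytes 8-11 are "WAVE").
function isRiffWave(arrayBuffer) {
  if (!arrayBuffer || arrayBuffer.byteLength < 12) {
    return false;
  }
  var bytes = new Uint8Array(arrayBuffer, 0, 12);
  var tag = function (offset) {
    return String.fromCharCode(
      bytes[offset], bytes[offset + 1], bytes[offset + 2], bytes[offset + 3]
    );
  };
  return tag(0) === "RIFF" && tag(8) === "WAVE";
}

// Hypothetical helper, browser only: wraps the bytes in a Blob and plays
// them through an object URL, which is something Audio can load.
function playWav(arrayBuffer) {
  if (!isRiffWave(arrayBuffer)) {
    throw new Error("Response is not RIFF/WAVE audio");
  }
  var blob = new Blob([arrayBuffer], { type: "audio/wav" });
  var url = URL.createObjectURL(blob);
  var audio = new Audio(url);
  // Release the object URL once playback has finished.
  audio.onended = function () {
    URL.revokeObjectURL(url);
  };
  // Browsers may block playback that is not triggered by a user gesture.
  return audio.play();
}
```

Checking the header first makes it easier to tell a corrupted or textual response apart from a playback problem.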

Best answer

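You can find the API documentation here.
For a working example, take a look at this sample code that uses the Cognitive Services Speech API: http://github.com/Danielius1012/Text-To-Speech
A useful snippet from that code: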

    // language, nameLanguage, audioURL, token, context and playAudio are not
    // defined in this snippet; they come from the rest of the linked sample.
    function sendAudioRequest()
    {
        textToSpeak = $("#my-text")[0].value;
        sendString = "<speak version='1.0' xml:lang='"+language+"'><voice xml:lang='"+language+"' xml:gender='Female' name='"+nameLanguage+"'>"+textToSpeak+"</voice></speak>";

        console.info($("#text-to-speak"));

        var xhttp = new XMLHttpRequest();

        xhttp.onreadystatechange = function()
        {
            // readyState 4 means the request has completed
            if (xhttp.readyState == 4 && xhttp.status == 200)
            {
                // xhttp.response is an ArrayBuffer holding the WAV data
                context.decodeAudioData(xhttp.response, function(buffer)
                {
                    speechBuffer = buffer;
                    console.info(speechBuffer);
                    playAudio(speechBuffer);
                });
            }
        };

        xhttp.open("POST", audioURL, true);
        xhttp.setRequestHeader("Content-type", 'application/ssml+xml');
        xhttp.setRequestHeader("Authorization", 'Bearer ' + token);
        xhttp.setRequestHeader("X-Microsoft-OutputFormat", 'riff-16khz-16bit-mono-pcm');
        xhttp.setRequestHeader("X-Search-AppId", '07D3234E49CE426DAA29772419F436CA');
        xhttp.setRequestHeader("X-Search-ClientID", '1ECFAE91408841A480F00935DC390960');
        // Request the body as binary so decodeAudioData can consume it
        xhttp.responseType = 'arraybuffer';

        xhttp.send(sendString);
    }
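The snippet leaves two details open: the text is concatenated into the SSML without escaping, so input containing `&` or `<` would produce invalid XML, and `playAudio` is not shown. Below is a rough sketch of how both could look. It is not taken from the linked repository: `escapeXml`, `buildSsml` and `playDecodedAudio` are hypothetical names, and the audio context is passed in explicitly instead of being read from a global as in the snippet above.

```javascript
// Hypothetical helper: escapes the five XML special characters so that
// user-supplied text cannot break the SSML document.
function escapeXml(text) {
  return String(text)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Hypothetical helper: builds the same SSML shape as sendString above,
// with every interpolated value escaped.
function buildSsml(text, language, voiceName) {
  return "<speak version='1.0' xml:lang='" + escapeXml(language) + "'>" +
    "<voice xml:lang='" + escapeXml(language) + "' xml:gender='Female'" +
    " name='" + escapeXml(voiceName) + "'>" +
    escapeXml(text) +
    "</voice></speak>";
}

// Hypothetical stand-in for playAudio: plays an AudioBuffer produced by
// decodeAudioData through the given Web Audio context.
function playDecodedAudio(audioContext, buffer) {
  var source = audioContext.createBufferSource();
  source.buffer = buffer;
  source.connect(audioContext.destination);
  source.start(0);
  return source;
}
```

In a browser this might be wired up as `xhttp.send(buildSsml(textToSpeak, language, nameLanguage))` and, inside the `decodeAudioData` callback, `playDecodedAudio(context, buffer)`.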

This post is based on a similar question found on Stack Overflow: https://stackoverflow.com/questions/36534876/
