amazon-web-services - AWS Transcribe Streaming BadRequestException: "Could not decode the audio stream..."

Reposted by: 行者123. Updated: 2023-12-04 17:17:56


I am building a Transcribe Streaming app in Dart/Flutter using websockets. When I stream test audio (extracted from a mono, 16 kHz, 16-bit signed little-endian WAV file), I get...

BadRequestException: Could not decode the audio stream that you provided. Check that the audio stream is valid and try your request again.

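As a test, I stream the audio from a file. I send 32k bytes of data per second (roughly simulating a real-time microphone stream). I get the error even if I stream all 0x00, all 0xFF, or random bytes. If I halve the chunk size to 16k and the interval to 0.5 seconds, it gets one frame further before erroring out...
As for the data, I simply pack the bytes into the data portion of the EventStream frame exactly as they appear in the file. Evidently the Event Stream packing is correct (byte layout, CRCs), otherwise I would get an error saying so, wouldn't I?
What would tell AWS Transcribe that the stream is not decodable? Any other thoughts on how to proceed?
Thanks for your help...
Here is the code that does the packing. The full version is here (if you dare... it's a bit messy right now): https://pastebin.com/PKTj5xM2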

Uint8List createEventStreamFrame(Uint8List audioChunk) {
  final headers = [
    EventStreamHeader(":content-type", 7, "application/octet-stream"),
    EventStreamHeader(":event-type", 7, "AudioEvent"),
    EventStreamHeader(":message-type", 7, "event")
  ];
  final headersData = encodeEventStreamHeaders(headers);
 
  final int totalLength = 16 + audioChunk.lengthInBytes + headersData.lengthInBytes;
  // final prelude = [headersData.length, totalLength];
  // print("Prelude: " + prelude.toString());
 
  // Convert a 32b int to 4 bytes
  List<int> int32ToBytes(int i) { return [(0xFF000000 & i) >> 24, (0x00FF0000 & i) >> 16, (0x0000FF00 & i) >> 8, (0x000000FF & i)]; }
 
  final audioBytes = ByteData.sublistView(audioChunk);
  var offset = 0;
  var audioDataList = <int>[];
  while (offset < audioBytes.lengthInBytes) {
    audioDataList.add(audioBytes.getInt16(offset, Endian.little));
    offset += 2;
  }
 
  final crc = CRC.crc32();
  final messageBldr = BytesBuilder();
  messageBldr.add(int32ToBytes(totalLength));
  messageBldr.add(int32ToBytes(headersData.length));
 
  // Now we can calc the CRC. We need to do it on the bytes, not the Ints
  final preludeCrc = crc.calculate(messageBldr.toBytes());
 
  // Continue adding data
  messageBldr.add(int32ToBytes(preludeCrc));
  messageBldr.add(headersData.toList());
  // messageBldr.add(audioChunk.toList());
  messageBldr.add(audioDataList);
  final messageCrc = crc.calculate(messageBldr.toBytes().toList());
  messageBldr.add(int32ToBytes(messageCrc));
  final frame = messageBldr.toBytes();
  //print("${frame.length} == $totalLength");
  return frame;
}

Best answer

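The BadRequestException, at least in my case, meant that the frame was encoded incorrectly, not that the audio data was bad.
The AWS event stream encoding details are here.
I had some problems with endianness and byte sizes. You need to be very careful with both the message encoding and the audio buffer. The audio needs to be 16-bit, signed (int), little-endian (see here). The length fields in the message wrapper are 32-bit (4-byte) big-endian. ByteData is your friend in Dart. Here is a snippet from my updated code: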

final messageBytes = ByteData(totalLength);

...

for (var i=0; i<audioChunk.length; i++) {
  messageBytes.setInt16(offset, audioChunk[i], Endian.little);
  offset += 2;
}

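Note that a 16-bit int actually occupies 2 bytes. If you don't pass an Endian argument, Dart's ByteData accessors default to Endian.big, which suits the header integers but is wrong for the audio samples, so state the byte order explicitly in both places.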
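The byte-size and byte-order points above can be illustrated with a minimal sketch. The helper names `packPcm16le`, `uint32be`, and `packTruncated` are hypothetical and are not taken from the original answer. The third helper shows a likely pitfall when 16-bit sample values are appended straight to a `BytesBuilder`: each value is truncated to 8 bits, so only one byte per sample survives.

```dart
import 'dart:typed_data';

/// Packs signed 16-bit samples as little-endian PCM, two bytes per sample.
/// (Hypothetical helper name, for illustration only.)
Uint8List packPcm16le(List<int> samples) {
  final data = ByteData(samples.length * 2);
  for (var i = 0; i < samples.length; i++) {
    data.setInt16(i * 2, samples[i], Endian.little);
  }
  return data.buffer.asUint8List();
}

/// Encodes a 32-bit length field as 4 big-endian bytes.
Uint8List uint32be(int value) {
  final data = ByteData(4)..setUint32(0, value, Endian.big);
  return data.buffer.asUint8List();
}

/// Appends sample values directly to a BytesBuilder. Each value is
/// truncated to its low 8 bits, so the output has one byte per sample
/// instead of two.
Uint8List packTruncated(List<int> samples) {
  final builder = BytesBuilder()..add(samples);
  return builder.toBytes();
}
```

For the samples `[0x1234, -2]`, `packPcm16le` yields the four bytes `34 12 FE FF`, while `packTruncated` yields only `34 FE`, half the length that a `totalLength` computed from the original byte count would claim.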
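The best way to make sure everything is right is to write the decoding function you will need for the AWS responses anyway, then decode your own encoded frames and check that the result matches the input. Use audio test data such as [-32000, -100, 0, 200, 31000] so that you can verify that the byte order and so on are correct.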
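A round-trip check along those lines might look like the sketch below. It is not the answerer's code: the function names (`crc32`, `encodeStringHeader`, `encodeAudioFrame`, `decodeAudioFrame`) are hypothetical, the CRC is a plain bitwise CRC-32 written out so that no package is needed, and the decoder skips over the headers without parsing them. The layout assumed is the one described in the question and answer: a prelude of two 32-bit big-endian lengths plus a prelude CRC, then the headers, the payload, and a trailing message CRC.

```dart
import 'dart:convert';
import 'dart:typed_data';

/// Bitwise CRC-32 (reflected polynomial 0xEDB88320).
int crc32(List<int> bytes) {
  var crc = 0xFFFFFFFF;
  for (final b in bytes) {
    crc ^= b & 0xFF;
    for (var k = 0; k < 8; k++) {
      crc = (crc & 1) != 0 ? (crc >> 1) ^ 0xEDB88320 : crc >> 1;
    }
  }
  return (crc ^ 0xFFFFFFFF) & 0xFFFFFFFF;
}

/// One string header: name length (1 byte), name, value type 7 (string),
/// value length (2 bytes, big-endian), value.
Uint8List encodeStringHeader(String name, String value) {
  final n = utf8.encode(name);
  final v = utf8.encode(value);
  final out = ByteData(1 + n.length + 1 + 2 + v.length);
  var o = 0;
  out.setUint8(o++, n.length);
  for (final b in n) {
    out.setUint8(o++, b);
  }
  out.setUint8(o++, 7);
  out.setUint16(o, v.length, Endian.big);
  o += 2;
  for (final b in v) {
    out.setUint8(o++, b);
  }
  return out.buffer.asUint8List();
}

/// Builds one AudioEvent frame from signed 16-bit samples.
Uint8List encodeAudioFrame(List<int> samples) {
  final headerBytes = (BytesBuilder()
        ..add(encodeStringHeader(':content-type', 'application/octet-stream'))
        ..add(encodeStringHeader(':event-type', 'AudioEvent'))
        ..add(encodeStringHeader(':message-type', 'event')))
      .toBytes();
  // 12 bytes of prelude + headers + 2 bytes per sample + 4 bytes of CRC.
  final total = 16 + headerBytes.length + samples.length * 2;
  final bytes = Uint8List(total);
  final msg = ByteData.sublistView(bytes);
  msg.setUint32(0, total, Endian.big);
  msg.setUint32(4, headerBytes.length, Endian.big);
  msg.setUint32(8, crc32(bytes.sublist(0, 8)), Endian.big);
  bytes.setRange(12, 12 + headerBytes.length, headerBytes);
  var offset = 12 + headerBytes.length;
  for (final s in samples) {
    msg.setInt16(offset, s, Endian.little); // audio is little-endian
    offset += 2;
  }
  msg.setUint32(offset, crc32(bytes.sublist(0, offset)), Endian.big);
  return bytes;
}

/// Checks lengths and both CRCs, then returns the samples from the payload.
List<int> decodeAudioFrame(Uint8List frame) {
  final msg = ByteData.sublistView(frame);
  final total = msg.getUint32(0, Endian.big);
  final headersLen = msg.getUint32(4, Endian.big);
  if (total != frame.length) {
    throw FormatException('total length does not match frame size');
  }
  if (msg.getUint32(8, Endian.big) != crc32(frame.sublist(0, 8))) {
    throw FormatException('bad prelude CRC');
  }
  if (msg.getUint32(total - 4, Endian.big) !=
      crc32(frame.sublist(0, total - 4))) {
    throw FormatException('bad message CRC');
  }
  final samples = <int>[];
  for (var o = 12 + headersLen; o < total - 4; o += 2) {
    samples.add(msg.getInt16(o, Endian.little));
  }
  return samples;
}
```

Encoding `[-32000, -100, 0, 200, 31000]` and decoding the result should return the same five values. If the byte order or the length fields are wrong, the values come back scrambled or one of the checks throws.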

Regarding amazon-web-services - AWS Transcribe Streaming BadRequestException: "Could not decode the audio stream...", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/68037614/
