google-cloud-platform - 谷歌云平台 : Speech to Text Conversion of Large Media Files-6ren

google-cloud-platform - 谷歌云平台 : Speech to Text Conversion of Large Media Files

转载作者：行者123 更新时间：2023-12-04 12:11:18

58

4

我正在尝试从从 youtube 下载的 mp4 媒体文件中提取文本。由于我正在使用谷歌云平台，所以想尝试一下谷歌云语音。

在所有安装和配置之后，我复制了以下代码片段以开始使用:

with io.open(file_name, 'rb') as audio_file:
    content = audio_file.read()
    audio = types.RecognitionAudio(content=content)

config = types.RecognitionConfig(encoding=enums.RecognitionConfig.AudioEncoding.LINEAR16, sample_rate_hertz=16000, language_code='en-US')   

response = client.long_running_recognize(config, audio)

但是我收到以下关于文件大小的错误:

InvalidArgument: 400 Inline audio exceeds duration limit. Please use a GCS URI.

然后我读到我应该对大型媒体文件使用流。所以，我尝试了以下代码片段:

with io.open(file_name, 'rb') as audio_file:
    content = audio_file.read()

#In practice, stream should be a generator yielding chunks of audio data.

stream = [content]
requests = (types.StreamingRecognizeRequest(audio_content=chunk)for chunk in stream)

config = types.RecognitionConfig(encoding=enums.RecognitionConfig.AudioEncoding.LINEAR16,sample_rate_hertz=16000,language_code='en-US')

streaming_config = types.StreamingRecognitionConfig(config=config)

responses = client.streaming_recognize(streaming_config, requests)

但我仍然收到以下错误:

InvalidArgument: 400 Invalid audio content: too long.

因此，任何人都可以提出一种转录 mp4 文件和提取文本的方法。我对非常大的媒体文件没有任何复杂的要求。媒体文件最长可达 10-15 分钟。谢谢

最佳答案

该错误消息表示文件太大，您需要先将媒体文件复制到 Google Cloud Storage，然后指定 Cloud Storage URI，例如 gs://bucket/path/mediafile。

使用 Cloud Storage URI 的关键是:

RecognitionAudio audio = RecognitionAudio.newBuilder().setUri(gcsUri).build();

以下代码将向您展示如何为输入指定 GCS URI。 Google 有一个 complete example在github上。

  public static void syncRecognizeGcs(String gcsUri) throws Exception {
    // Instantiates a client with GOOGLE_APPLICATION_CREDENTIALS
    try (SpeechClient speech = SpeechClient.create()) {
      // Builds the request for remote FLAC file
      RecognitionConfig config =
          RecognitionConfig.newBuilder()
              .setEncoding(AudioEncoding.FLAC)
              .setLanguageCode("en-US")
              .setSampleRateHertz(16000)
              .build();
      RecognitionAudio audio = RecognitionAudio.newBuilder().setUri(gcsUri).build();

      // Use blocking call for getting audio transcript
      RecognizeResponse response = speech.recognize(config, audio);
      List<SpeechRecognitionResult> results = response.getResultsList();

      for (SpeechRecognitionResult result : results) {
        // There can be several alternative transcripts for a given chunk of speech. Just use the
        // first (most likely) one here.
        SpeechRecognitionAlternative alternative = result.getAlternativesList().get(0);
        System.out.printf("Transcription: %s%n", alternative.getTranscript());
      }
    }
  }

关于google-cloud-platform - 谷歌云平台 : Speech to Text Conversion of Large Media Files，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/53307696/

58

4

0

文章推荐： apache-kafka - Kafka 代理自动扩展

文章推荐： powershell - 在 Windows 8 上使用 PowerShell 2 作为默认版本

文章推荐： vim - 当文件被另一个应用程序更改时如何得到通知？

文章推荐： shell - Google Cloud DNS 更改记录资源更新时间

google-cloud-platform - 如何自动启动 AI-Platform 作业？
我创建了一个训练作业，我从大查询中获取数据、执行训练和部署模型。我想在这两种情况下自动开始训练: 向数据集添加了 1000 多个新行有时间表(例如，每周一次) 我检查了 GCP Cloud Sche
google-cloud-platform - Google Cloud Platform 服务帐户无法访问项目
我遇到以下警告: WARNING: You do not appear to have access to project [$PROJECT] or it does not exist. 在本地运行
google-cloud-platform - Google Cloud Platform 中的身份验证
我正在使用 Google Cloud Platform，我必须使用 java 非 Web 应用程序访问云功能，就像我尝试使用 Google Cloud Storage JSON API 从 Googl
google-cloud-platform - Google Identity Platform 第三方访问？
我的问题是第三方开发人员如何通过我的身份平台登录用户？我查看了文档，但一无所获。本质上，我想将 Identity Platform 用作 OIDC 提供者，但我不知道这是否受支持。最佳答案 Clo
google-cloud-platform - Google Cloud Platform 凭据页面未加载
在我去这里的过去 12 个小时左右: https://console.developers.google.com/apis/credentials?project=MYPROJECTNAME 我只是得
python - platform.system 和 platform.linux_distribution 究竟输出什么？
我正在尝试创建一个 python 脚本来在 linux 机器上自动安装和配置某些程序。我的想法是使用平台和多处理库来询问系统信息(platform.system、platform.linux_dis
google-cloud-platform - 在没有控制台页面的情况下创建 Google Cloud Platform 项目。
我正在尝试创建没有控制台网页的 Google Cloud Platform 项目，因为我考虑创建多个项目。因为我查了gcloud，目前只支持project describe和list。 https:
google-cloud-platform - 如何在 Google Cloud Platform 中获取用户托管服务帐户的公钥
我正在使用 Google Cloud Scheduler 调用外部应用程序。 Google Cloud Scheduler 使用 OIDC 身份验证并使用服务帐户。我只能从 Google 服务帐户 U
google-cloud-platform - 在 Google Cloud Platform 中启用双因素身份验证
如何在我的 Google Cloud Platform 帐户上启用 Google Authenticator 双重身份验证？我在 Web 界面中上下查看了“IAM 和管理员”，但没有看到在帐户上启用
google-cloud-platform - 如何在 Google Cloud Platform 上安排虚拟机的开启和关闭？
我们在 Google Cloud 上设置了一个虚拟机，并希望能够自动或计划打开和关闭它。我们内部有自动脚本，之后可以完成工作，到目前为止，我在 google 的文献中读到的更多与这些实例有关，但我找
google-cloud-platform - 无法删除 Google Cloud Platform 项目
我试图删除一个 GCP 项目，但不断弹出以下错误。 Lien origin You cannot delete this project because it is linked with a Dia
google-cloud-platform - 在 Google Cloud Platform 中重命名组织的权限
我从 Google Domains 购买了一个域，称为 example.com。我已订阅 G Suite 基本版并创建了一个 admin@example.com 帐户以在 GCP 上使用，而不是我的
google-cloud-platform - Google AI Platform 训练 - 等待作业完成
我构建了一个包含许多并行进程的 AI Platform 流水线。每个流程都会在 AI Platform 上启动一个训练作业，如下所示: gcloud ai-platform jobs submit t
windows-runtime - 如何区分空 Platform.String 和空 Platform.String^
我们正在验证函数输入时方法参数不为空，但这不适用于 Platform::String (或 Platform.String ，C# 或 C++ 之间没有区别)，因为它们用空实例重载空字符串的语义。考
google-cloud-platform - Google Cloud Platform HTTP 函数是否支持路由参数？
这个问题比我想来这里的问题要简单一些，但我一直在努力寻找答案，但我绝对不能—— 谷歌云平台 HTTP 函数是否支持路由参数，如此处？ http://expressjs.com/en/guide/rou
google-cloud-platform - 如何增加 Google Cloud Platform 中的后端服务配额？
我正在使用 Kubernetes，我正在尝试创建一个 ingress resource .我使用以下方法创建它: $ kubectl create -f my-ingress.yaml 我等了一会儿，
google-cloud-platform - 您能否将项目从一个 Google Cloud Platform 组织转移到另一个组织
我是 Google Cloud 的新手，所以我希望得到一些有关“组织”的指导。我可以将项目从一个“组织”转移到另一个“组织”吗？我正在我的个人 GSuite 组织下启动一些项目，但我必须将它们转移到
api-platform.com - 如何在 api-platform GET 操作中始终过滤特定字段值的集合？
在 GET 操作中，我想从返回的集合中排除具有等于“true”的“存档”字段的实体。我希望这是我的端点(如/users 或/companies)的默认设置，并且我想避免手动添加 URL 过滤器，如
google-cloud-platform - 在 Google Cloud Platform 中创建实例模板
实例模板对于创建托管实例组至关重要。事实上，托管实例组对于在 GCP 中创建自动扩缩组至关重要。这个问题是另一个问题 question's answer 的一部分，这是关于构建一个自动缩放和负载平衡
google-cloud-platform - Google Cloud Platform GPU 配额并不总是显示
我正在将 GCP 用于多个相同的项目。对于每个新项目我都需要一个1 个 GPU 的配额(Tesla K80)。为了申请增加我的GPU配额，我打开console并导航至“IAM 和管理”>“配额”。我在

首页

博学

6Ren·AI

商城

google-cloud-platform - 谷歌云平台 : Speech to Text Conversion of Large Media Files