gpt4 book ai didi

audio - 使用 Google Speech 时从 Google Cloud Storage 访问音频文件

转载 作者:行者123 更新时间:2023-12-02 22:22:25 25 4
gpt4 key购买 nike

我已经使用下面的这段代码使用 Google Speech 成功地将包含语音的 .wav 文件解析为文本。

但我想访问一个不同的 .wav 文件,我已将其(公开)放在 Google Cloud Storage 上,而不是放在我的本地硬盘上。为什么不简单地改变

speech_file = 'my/local/system/sample.wav'


speech_file = 'https://console.cloud.google.com/storage/browser/speech_proj_files/sample.wav'

工作可以接受吗?

这是我的代码:

speech_file = 'https://console.cloud.google.com/storage/browser/speech_proj_files/sample.wav'

DISCOVERY_URL = ('https://{api}.googleapis.com/$discovery/rest?'
'version={apiVersion}')


def get_speech_service():
credentials = GoogleCredentials.get_application_default().create_scoped(
['https://www.googleapis.com/auth/cloud-platform'])
http = htt|plib2.Http()
credentials.authorize(http)

return discovery.build(
'speech', 'v1beta1', http=http, discoveryServiceUrl=DISCOVERY_URL)

def main(speech_file):
"""Transcribe the given audio file.

Args:
speech_file: the name of the audio file.
"""
with open(speech_file, 'rb') as speech:
speech_content = base64.b64encode(speech.read())

service = get_speech_service()
service_request = service.speech().syncrecognize(
body={
'config': {
'encoding': 'LINEAR16', # raw 16-bit signed LE samples
'sampleRate': 44100, # 16 khz
'languageCode': 'en-US', # a BCP-47 language tag
},
'audio': {
'content': speech_content.decode('UTF-8')
}
})
response = service_request.execute()
return response

最佳答案

我不确定为什么您的方法不起作用,但我想提供一个快速建议。

Google Cloud Speech API 原生支持 Google Cloud Storage 对象。与其下载整个对象只是为了将其上传回 Cloud Speech API,不如通过换掉这一行来指定对象:

        'audio': {
# Remove this: 'content': speech_content.decode('UTF-8')
'uri': 'gs://speech_proj_files/sample.wav' # Do this!
}

另一个建议。您可能会发现 google-cloud Python 库更易于使用。试试这个:

from google.cloud import speech
speech_client = speech.Client()

audio_sample = speech_client.sample(
content=None,
source_uri='gs://speech_proj_files/sample.wav',
encoding='LINEAR16',
sample_rate_hertz= 44100)
results_list = audio_sample.sync_recognize(language_code='en-US')

这里有一些很好的例子:https://github.com/GoogleCloudPlatform/python-docs-samples/tree/master/speech/cloud-client

关于audio - 使用 Google Speech 时从 Google Cloud Storage 访问音频文件,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/43532247/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com