
python - Watson Speech to Text API not returning speaker labels/diarization

Reposted. Author: 行者123. Updated: 2023-12-03 17:12:10

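I am trying to get speaker labels from IBM Watson Speech to Text.
In my final output, I want to show the transcript of the entire audio, the confidence, and the speaker labels. My code is below: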

import json
from os.path import join, dirname
from ibm_watson import SpeechToTextV1
from ibm_watson.websocket import RecognizeCallback, AudioSource
import threading
from ibm_cloud_sdk_core.authenticators import IAMAuthenticator
import pandas as pd

authenticator = IAMAuthenticator('rXXXYYZZ')
service = SpeechToTextV1(authenticator=authenticator)
service.set_service_url('https://api.us-east.speech-to-text.watson.cloud.ibm.com')

models = service.list_models().get_result()
#print(json.dumps(models, indent=2))

model = service.get_model('en-US_BroadbandModel').get_result()
#print(json.dumps(model, indent=2))

with open(join(dirname('__file__'), 'testvoicejen.wav'),
          'rb') as audio_file:
    # print(json.dumps(
    # Note: the trailing comma after get_result() makes output a one-element tuple.
    output = service.recognize(
        audio=audio_file,
        speaker_labels=True,
        content_type='audio/wav',
        #timestamps=True,
        #word_confidence=True,
        model='en-US_NarrowbandModel',
        continuous=True).get_result(),
    indent=2  # leftover from the commented-out json.dumps call
# Collect every alternative from every result block into one DataFrame.
df = pd.DataFrame([i for elts in output for alts in elts['results'] for i in alts['alternatives']])

However, the output of df is:
df
Out[22]:
timestamps ... transcript
0 [[thank, 3.88, 4.04], [you, 4.04, 4.13], [for,... ... thank you for calling my name is Britney and h...
1 [[thank, 30.21, 30.56], [you, 30.56, 30.74], [... ... thank you %HESITATION and then %HESITATION you..

As you can see, I do get the transcript, but I get timestamps instead of the speaker diarization or labels. A speaker label looks like this:
"from": 0.68,
"to": 1.19,
"speaker": 2

How do I get this?

Best answer

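When you turn on speaker_labels, you automatically get timestamps as well. Look at the sample output in the service documentation: https://cloud.ibm.com/docs/speech-to-text?topic=speech-to-text-output#speaker_labels

There you will see that the speaker_labels section is separate from the results/alternatives section. Your code only parses the results/alternatives section. To get the speaker labels, you need: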

df = pd.DataFrame([i for elts in output for i in elts['speaker_labels']])
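If the goal is one table with the words and their speakers together, the two sections can be joined on their shared start and end times. The following is only a minimal sketch under stated assumptions: `sample_response` is a hypothetical dictionary with made-up values, shaped like the documented output, and `words_with_speakers` is an illustrative helper, not part of the Watson SDK. It also assumes each word timestamp has a speaker label with exactly the same `from` and `to` values.

```python
# Hypothetical sample shaped like the documented speaker_labels output.
# The values are made up for illustration; this is not real service output.
sample_response = {
    "results": [
        {
            "alternatives": [
                {
                    "timestamps": [["hello", 0.68, 1.19], ["yeah", 1.47, 1.93]],
                    "confidence": 0.82,
                    "transcript": "hello yeah ",
                }
            ],
            "final": True,
        }
    ],
    "result_index": 0,
    "speaker_labels": [
        {"from": 0.68, "to": 1.19, "speaker": 2, "confidence": 0.52, "final": False},
        {"from": 1.47, "to": 1.93, "speaker": 1, "confidence": 0.62, "final": True},
    ],
}


def words_with_speakers(response):
    """Attach a speaker id to each word by matching (from, to) times.

    Illustrative helper (an assumption, not an SDK function).
    """
    # Index the speaker labels by their (start, end) time span.
    speaker_by_span = {
        (label["from"], label["to"]): label["speaker"]
        for label in response.get("speaker_labels", [])
    }
    rows = []
    for result in response.get("results", []):
        # Use the first alternative of each result block.
        best = result["alternatives"][0]
        for word, start, end in best.get("timestamps", []):
            rows.append({
                "word": word,
                "from": start,
                "to": end,
                # None when no label has exactly this time span.
                "speaker": speaker_by_span.get((start, end)),
                "confidence": best.get("confidence"),
            })
    return rows


rows = words_with_speakers(sample_response)
```

With the question's code, `output` is a one-element tuple because of the trailing comma, so the call would be `words_with_speakers(output[0])`, and `pd.DataFrame(rows)` then gives one row per word with its speaker.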

Regarding "python - Watson Speech to Text API not returning speaker labels/diarization", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/61092036/
