
java - Parsing LIUM Speaker Diarization output

Reposted. Author: 塔克拉玛干. Updated: 2023-11-02 08:03:13

How can I find out how long each speaker talked when using the LIUM Speaker Diarization toolkit?

For example, this is my .seg file.

;; cluster S0 [ score:FS = -33.93166562542459 ] [ score:FT = 
-34.24966646974656 ] [ score:MS = -34.05223781565528 ] [ score:MT =
-34.32834794609819 ]
Seq06 1 0 237 F S U S0
Seq06 1 2960 278 F S U S0
;; cluster S1 [ score:FS = -33.33289449700619 ] [ score:FT =
-33.64489165914674 ] [ score:MS = -32.71833169822944 ] [ score:MT =
-33.380835069917275 ]
Seq06 1 238 594 M S U S1
Seq06 1 1327 415 M S U S1
Seq06 1 2311 649 M S U S1
;; cluster S2 [ score:FS = -33.354874450638064 ] [ score:FT =
-33.46618707052516 ] [ score:MS = -32.70702429201772 ] [ score:MT =
-33.042146088874844 ]
Seq06 1 832 495 M S U S2
Seq06 1 1742 569 M S U S2

How do I extract the times from this file?

Best answer

In this line

Seq06 1 2960 278 F S U S0

you have

field 1: Seq06 = the show name
field 2: 1 = the channel number
field 3: 2960 = the start of the segment (in features)
field 4: 278 = the length of the segment (in features)
field 5: F = the speaker gender (U=unknown, F=female, M=male)
field 6: S = the type of band (T=telephone, S=studio)
field 7: U = the type of environment (music, speech only, …)
field 8: S0 = the speaker label
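
As an illustration of the field layout above, here is a minimal Java sketch that splits one segment line into its eight fields. The class name `SegLine` and its field names are hypothetical helpers invented for this example and are not part of the LIUM toolkit. The sketch assumes fields are separated by whitespace and that comment lines start with `;;`; a comment that has been wrapped over several lines, as in the listing in the question, would have to be rejoined or filtered out first.

```java
/** One segment line of a .seg file (hypothetical helper, not a LIUM class). */
final class SegLine {
    final String show;          // field 1: show name
    final int channel;          // field 2: channel number
    final int startFeatures;    // field 3: start of the segment, in features
    final int lengthFeatures;   // field 4: length of the segment, in features
    final String gender;        // field 5: U, F or M
    final String band;          // field 6: T or S
    final String environment;   // field 7: type of environment
    final String speaker;       // field 8: speaker label

    private SegLine(String[] f) {
        show = f[0];
        channel = Integer.parseInt(f[1]);
        startFeatures = Integer.parseInt(f[2]);
        lengthFeatures = Integer.parseInt(f[3]);
        gender = f[4];
        band = f[5];
        environment = f[6];
        speaker = f[7];
    }

    /** Parses one line; returns null for blank lines and ";;" comment lines. */
    static SegLine parse(String line) {
        String trimmed = line.trim();
        if (trimmed.isEmpty() || trimmed.startsWith(";;")) {
            return null;
        }
        String[] fields = trimmed.split("\\s+");
        if (fields.length != 8) {
            throw new IllegalArgumentException("Expected 8 fields but got "
                    + fields.length + ": " + line);
        }
        return new SegLine(fields);
    }

    public static void main(String[] args) {
        SegLine seg = SegLine.parse("Seq06 1 2960 278 F S U S0");
        System.out.println("speaker=" + seg.speaker
                + " start=" + seg.startFeatures
                + " length=" + seg.lengthFeatures);
    }
}
```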

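Times are expressed in features, so 2960 is 29.60 seconds (divide by 100 to convert features to seconds). The length is also in features, so your segment is 2.78 seconds long.

This is documented in the LIUM wiki.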
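
To answer the original question (how long each speaker talked), the segment lengths can be summed per speaker label and divided by 100. The following is a rough Java sketch under stated assumptions, not LIUM's own API: the class `SpeakerTimes` and its method are hypothetical, the factor of 100 features per second is taken from the answer above, and a regular expression is used so that comment lines (including wrapped ones) are simply skipped.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Sums speaking time per speaker label (hypothetical helper, not a LIUM class). */
final class SpeakerTimes {
    // show, channel, start (group 1), length (group 2),
    // gender, band, environment, speaker label (group 3)
    private static final Pattern SEGMENT = Pattern.compile(
            "^\\S+\\s+\\d+\\s+(\\d+)\\s+(\\d+)\\s+\\S+\\s+\\S+\\s+\\S+\\s+(\\S+)$");

    // Assumption from the answer: 100 features per second.
    private static final double FEATURES_PER_SECOND = 100.0;

    /** Returns total speaking time in seconds, keyed by speaker label. */
    static Map<String, Double> secondsPerSpeaker(List<String> lines) {
        Map<String, Long> features = new LinkedHashMap<>();
        for (String line : lines) {
            Matcher m = SEGMENT.matcher(line.trim());
            if (!m.matches()) {
                continue; // comment, blank, or otherwise non-segment line
            }
            features.merge(m.group(3), Long.parseLong(m.group(2)), Long::sum);
        }
        Map<String, Double> seconds = new LinkedHashMap<>();
        features.forEach((speaker, n) -> seconds.put(speaker, n / FEATURES_PER_SECOND));
        return seconds;
    }

    public static void main(String[] args) throws IOException {
        // Usage: java SpeakerTimes path/to/file.seg
        List<String> lines = Files.readAllLines(Paths.get(args[0]));
        secondsPerSpeaker(lines).forEach((speaker, secs) ->
                System.out.println(String.format(Locale.ROOT, "%s: %.2f s", speaker, secs)));
    }
}
```

For the .seg file in the question, S0 has segments of 237 and 278 features, which add up to 515 features, or 5.15 seconds.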

Regarding java - Parsing LIUM Speaker Diarization output, a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/45309366/
