
Question on training with labeled and unlabeled data

Reposted. Author: bug小助手. Updated: 2023-10-25 11:14:49



I have a large labeled dataset of 26.7M reviews written in Modern Standard Arabic, and a second, unlabeled dataset of 16K reviews written in both Modern Standard Arabic and colloquial Arabic.



What are sound approaches to labeling the unlabeled dataset, given that the goal is also to increase accuracy?



Please provide some Python examples that could help.
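One common way to label a small unlabeled set from a large labeled one is pseudo-labeling (self-training): train on the labeled reviews, predict on the unlabeled ones, keep only the high-confidence predictions, then retrain on the combined data. The sketch below is a minimal, standard-library-only illustration of that loop, not a definitive implementation. Everything in it is an assumption made for illustration: the names `TinyNaiveBayes`, `char_ngrams`, and `pseudo_label`, the `pos`/`neg` label set, the 0.9 threshold, and the toy reviews. In practice a real classifier (for example one from scikit-learn or a pretrained Arabic language model) would replace the toy model. Character n-grams are used here because they tend to tolerate dialectal spelling variation better than whole words.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n=3):
    """Split text into overlapping character n-grams (padded with spaces)."""
    padded = f" {text} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


class TinyNaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (toy stand-in model)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.feature_counts[label].update(char_ngrams(text))
        self.vocab = {g for c in self.feature_counts.values() for g in c}
        return self

    def predict_proba(self, text):
        """Return {label: probability}; n-grams never seen in training are ignored."""
        total_docs = sum(self.class_counts.values())
        log_scores = {}
        for label, n_docs in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)
            for gram in char_ngrams(text):
                if gram in self.vocab:
                    score += math.log((counts[gram] + 1) / denom)
            log_scores[label] = score
        # Normalize the log scores into probabilities (softmax).
        top = max(log_scores.values())
        exp = {k: math.exp(v - top) for k, v in log_scores.items()}
        z = sum(exp.values())
        return {k: v / z for k, v in exp.items()}


def pseudo_label(model, unlabeled, threshold=0.9):
    """Split unlabeled texts into confident (text, label, prob) and uncertain texts."""
    confident, uncertain = [], []
    for text in unlabeled:
        probs = model.predict_proba(text)
        label, p = max(probs.items(), key=lambda kv: kv[1])
        if p >= threshold:
            confident.append((text, label, p))
        else:
            uncertain.append(text)
    return confident, uncertain


# Hypothetical toy data standing in for the labeled MSA reviews.
texts = ["الخدمة ممتازة والطعام لذيذ", "تجربة رائعة وسأعود مرة أخرى",
         "الخدمة سيئة والطعام بارد", "تجربة سيئة ولن أعود أبدا"]
labels = ["pos", "pos", "neg", "neg"]
# Hypothetical stand-ins for the unlabeled mixed MSA/colloquial reviews.
unlabeled = ["الاكل لذيذ والخدمة ممتازة", "المكان سيئ", "مش عارف"]

model = TinyNaiveBayes().fit(texts, labels)
confident, uncertain = pseudo_label(model, unlabeled, threshold=0.9)

# Retrain on the original labels plus the confident pseudo-labels.
model = TinyNaiveBayes().fit(texts + [t for t, _, _ in confident],
                             labels + [l for _, l, _ in confident])
print(len(confident), "pseudo-labeled;", len(uncertain), "left for manual review")
```

The reviews that fall below the threshold are the ones most worth labeling by hand, since those are where the model trained on Modern Standard Arabic is least sure about colloquial text. Hand-labeling a small held-out sample of the 16K reviews is also the only reliable way to check whether accuracy on that dataset actually improved.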



