
python - What is the best way to train a model and favor recall/precision?


I have a binary classification problem whose dataset contains 5% positive labels, and I am training my model with TensorFlow. Here are the results during training:

Step 3819999: loss = 0.22 (0.004 sec)
Accuracy = 0.955; Recall = 0.011; Precision = 0.496

Step 3820999: loss = 0.21 (0.003 sec)
Accuracy = 0.955; Recall = 0.011; Precision = 0.496

Step 3821999: loss = 0.15 (0.003 sec)
Accuracy = 0.955; Recall = 0.011; Precision = 0.496

Step 3822999: loss = 0.15 (0.003 sec)
Accuracy = 0.955; Recall = 0.011; Precision = 0.496
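
For context, these numbers are what you would expect from a model that predicts almost everything as negative: with 5% positive labels, such a model already reaches about 95% accuracy while recall stays near zero. A minimal sanity check (the confusion-matrix counts below are hypothetical, chosen only to roughly reproduce the logged metrics):

# Hypothetical confusion-matrix counts for 20,000 examples with 5% positives.
tp, fn = 11, 989      # 1,000 positives, almost all missed
fp, tn = 11, 18989    # 19,000 negatives, almost all correct

accuracy = (tp + tn) / (tp + tn + fp + fn)  # 0.95 -- dominated by negatives
recall = tp / (tp + fn)                     # 0.011 -- matches the logs
precision = tp / (tp + fp)                  # 0.5 -- matches the logs

print(accuracy, recall, precision)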

What are the main strategies for improving recall? Changing the dataset by adding more positive samples might solve the problem, but altering the reality of the problem seems odd...

It seems to me there should be a way to favor true positives over false negatives, but I can't find one.

Best Answer

You should use weighted cross entropy instead of the classic cross entropy (CE). From the TensorFlow documentation:

This is like sigmoid_cross_entropy_with_logits() except that pos_weight allows one to trade off recall and precision by up- or down-weighting the cost of a positive error relative to a negative error. The usual cross-entropy cost is defined as:

targets * -log(sigmoid(logits)) + (1 - targets) * -log(1 - sigmoid(logits))

A value pos_weight > 1 decreases the false negative count, hence increasing the recall. Conversely, setting pos_weight < 1 decreases the false positive count and increases the precision. This can be seen from the fact that pos_weight is introduced as a multiplicative coefficient for the positive targets term in the loss expression:

targets * -log(sigmoid(logits)) * pos_weight + (1 - targets) * -log(1 - sigmoid(logits))
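
A minimal sketch of how this could be applied, assuming TensorFlow 2.x and its tf.nn.weighted_cross_entropy_with_logits; the pos_weight of 19.0 is only an illustrative starting point derived from the 5% positive rate in the question (0.95 / 0.05 = 19), not a tuned value:

import tensorflow as tf

# Hypothetical logits and labels for a batch of four examples.
logits = tf.constant([[1.2], [-0.8], [0.3], [-2.1]])
labels = tf.constant([[1.0], [0.0], [1.0], [0.0]])

# pos_weight > 1 makes false negatives more expensive than false
# positives, pushing the model toward higher recall. With 5% positives,
# the negative/positive ratio 0.95 / 0.05 = 19 is a common first guess.
loss = tf.nn.weighted_cross_entropy_with_logits(
    labels=labels, logits=logits, pos_weight=19.0)

# Reduce the per-example losses to a scalar for the optimizer.
mean_loss = tf.reduce_mean(loss)

In practice you would sweep pos_weight on a validation set and pick the value with the best precision/recall trade-off for your application, since raising it buys recall at the cost of precision.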

Regarding python - What is the best way to train a model and favor recall/precision?, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/51157904/
