python - cv_result中的 'mean_test_score'是什么意思？-6ren

python - cv_result中的 'mean_test_score'是什么意思？

转载作者：太空狗更新时间：2023-10-30 02:16:17

25

4

你好，我正在做一个 GridSearchCV，我正在使用 scikit learn 的 .cv_results_ 函数打印结果。

我的问题是，当我手动评估所有测试分数拆分的平均值时，我得到的数字与 'mean_test_score' 中所写的数字不同。哪个与标准的 np.mean() 不同？

我在这里附上代码和结果:

n_estimators = [100]
max_depth = [3]
learning_rate = [0.1]

param_grid = dict(max_depth=max_depth, n_estimators=n_estimators, learning_rate=learning_rate)

gkf = GroupKFold(n_splits=7)


grid_search = GridSearchCV(model, param_grid, scoring=score_auc, cv=gkf)
grid_result = grid_search.fit(X, Y, groups=patients)

grid_result.cv_results_

这个操作的结果是:

{'mean_fit_time': array([ 8.92773601]),
 'mean_score_time': array([ 0.04288721]),
 'mean_test_score': array([ 0.83490629]),
 'mean_train_score': array([ 0.95167036]),
 'param_learning_rate': masked_array(data = [0.1],
              mask = [False],
        fill_value = ?),
 'param_max_depth': masked_array(data = [3],
              mask = [False],
        fill_value = ?),
 'param_n_estimators': masked_array(data = [100],
              mask = [False],
        fill_value = ?),
 'params': ({'learning_rate': 0.1, 'max_depth': 3, 'n_estimators': 100},),
 'rank_test_score': array([1]),
 'split0_test_score': array([ 0.74821666]),
 'split0_train_score': array([ 0.97564995]),
 'split1_test_score': array([ 0.80089016]),
 'split1_train_score': array([ 0.95361201]),
 'split2_test_score': array([ 0.92876979]),
 'split2_train_score': array([ 0.93935856]),
 'split3_test_score': array([ 0.95540287]),
 'split3_train_score': array([ 0.94718634]),
 'split4_test_score': array([ 0.89083901]),
 'split4_train_score': array([ 0.94787374]),
 'split5_test_score': array([ 0.90926355]),
 'split5_train_score': array([ 0.94829775]),
 'split6_test_score': array([ 0.82520379]),
 'split6_train_score': array([ 0.94971417]),
 'std_fit_time': array([ 1.79167576]),
 'std_score_time': array([ 0.02970254]),
 'std_test_score': array([ 0.0809713]),
 'std_train_score': array([ 0.0105566])}

如您所见，对所有 test_score 执行 np.mean 得到的值大约为 0.8655122606479532，而“mean_test_score”为 0.83490629

谢谢你的帮助，莱昂纳多。

最佳答案

由于代码太多，我会将其作为新答案发布:

折叠的测试和训练分数是:(取自您在问题中发布的结果)

test_scores = [0.74821666,0.80089016,0.92876979,0.95540287,0.89083901,0.90926355,0.82520379]
train_scores = [0.97564995,0.95361201,0.93935856,0.94718634,0.94787374,0.94829775,0.94971417]

这些折叠中的训练样本数量是:(取自 print([(len(train), len(test)) for train, test in gkf.split(X, groups=patients) 的输出)]))

train_len = [41835, 56229, 56581, 58759, 60893, 60919, 62056]
test_len = [24377, 9983, 9631, 7453, 5319, 5293, 4156]

然后以每折的训练样本量为权重的测试和训练均值是:

train_avg = np.average(train_scores, weights=train_len)
-> 0.95064898361714389
test_avg = np.average(test_scores, weights=test_len)
-> 0.83490628649308296

所以这正是 sklearn 给你的值(value)。它也是分类的正确平均准确度。折叠的平均值是不正确的，因为它取决于您选择的有点随意的拆分/折叠。

所以在脑震荡中，这两种解释确实是相同和正确的。

关于python - cv_result中的 'mean_test_score'是什么意思？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/44947574/

25

4

0

文章推荐： Python round() 太慢，更快的方法来降低精度？

文章推荐： python - 如何打印 FF(换页)字符？

文章推荐： python - Pandas :将成对的列转换为单元格

JavaScript:意思？
关闭。这个问题需要多问focused 。目前不接受答案。想要改进此问题吗？更新问题，使其仅关注一个问题 editing this post . 已关闭 9 年前。 Improve this ques
python - 装箱(意思)
关闭。这个问题是not reproducible or was caused by typos .它目前不接受答案。这个问题是由于错别字或无法再重现的问题引起的。虽然类似的问题可能是on-topi
c - 返回什么；意思？
在main()中声明其原型(prototype)的函数的返回数据类型为void。它包含一个指令返回；如 main() { void create(int *p); *some code
scala 奇怪的符号 "_@"意思
我想知道这个 scala 符号是什么:_@。 (搜索引擎无法识别奇怪的字符，因此很难在 google 上找到任何内容...) 这里是上下文: def doNodeParse(json: JValue)
gcc - 什么 '__asm__(".previous");'意思？
在尝试编译我的项目时，它使用了一些第三方头文件，使用 mingw 4.4，我遇到了以下错误: Assembler messages: Error: junk at end of line, first
c - 什么 ((void (*)())buf)();意思？
我正在解决 picoCTF 上的二进制漏洞利用挑战，并遇到了这段代码: ((void (*)())buf)(); 哪里buf是一个字符数组。我解决了挑战，但似乎无法理解它到底在做什么。我看了this
javascript - export default something() 意思
我正在浏览 React Navigation docs我在那里遇到了这样的事情: import Ionicons from 'react-native-vector-icons/Ionicons';
selenium -//按钮[@type ='submit']意思
selenium 中以下命令的含义是什么？我尝试创建一个自动测试用例。然后如下://button[@type='submit'] 我在 selenium 工具中看到的语法。最佳答案这是一个 XP
c - a[0] 在多维数组中表示(意思)是什么？
我刚开始看书学习 C 语言，对他们讨论指针和数组的部分并没有感到困惑。如果有一个名为 a[NUM_ROW][NUM_COLS] 的多维数组(我只是将此数组讨论为特定的二维数组)，那么 a[0] 是什么
c - while (*p2++ = *p1++); 是做什么的？意思？
这个问题在这里已经有了答案: How does "while(*s++ = *t++)" copy a string? (17 个答案) 关闭 9 年前。我有一个代码块: int main ()
linux - 什么信号(SIGCHLD，SIG_DFL);意思？
我没有在我的代码中处理 SIGCHLD。我的进程在终止后仍然立即被删除。我希望它成为僵尸进程。如果我将 SIGCHLD 设置为 SIG_DFL 那么它会起作用吗？如何将 SIGCHLD 设置为 SI
python - fig, ax = plt.subplots() 意思
我已经使用 matplotlib 一段时间了，但我并不真正理解这一行的作用。 fig, ax = plt.subplots() 谁能解释一下？最佳答案 plt.subplots() 基本上是一个(非
c - double* (*p[3]) (void* (*)()); 是什么意思？意思？
我很难理解以下声明的含义。这个申报标准吗？ double* (*p[3]) (void* (*)()); 谁能帮我理解这个声明的意思？最佳答案阅读复杂声明的规则:找到最左边的标识符并向外工作，记住
c - 这个声明 typedef void foo(); 是什么意思？意思？
关闭。这个问题需要details or clarity .它目前不接受答案。想改进这个问题吗？通过 editing this post 添加细节并澄清问题. 关闭 8 年前。 Improve t
shell - 1>/dev/null 2>&1 & pid1=$!意思？
我正在学习如何并行运行多个进程 ./script1.sh param1 1>/dev/null 2>&1 & pid1=$! ./script1.sh param2 1>/dev/null
javascript - 在 Chaplin js 框架中，以 ! 为前缀的事件是做什么的？意思？
我看到这些事件散布在 chaplin 示例代码中，但在文档或源代码中没有任何解释。似乎这意味着它是一个全局事件，触发了一个 Action 。那是对的吗？它们只是一个惯例，还是以某种方式强制执行？ #
c - C 表达式 ((void(*)(void))0)(); 是什么意思？意思？
((void(*)(void))0)(); 所以我们将整数 0 类型转换为这个棘手的类型 (void(*))(void) 然后执行它。消息来源声称这应该有效，但实际上是什么？我想这一定是像 #def
javascript - following a function with (jQuery, window, document) 是什么意思？意思？
这个问题在这里已经有了答案: How does this JavaScript/jQuery syntax work: (function( window, undefined ) { })(win
fortran - 在 Fortran 中，.feq 有什么作用？或.fne。意思？
if(a .feq. 5.0_dp) then **** if(a .fne. 5.2_dp) then ***我遇到了一些这样的代码。 .feq 有什么作用？或.fne。意思？是“=”还是“\=”？
java - Java 中的通配符 Generic 和意思，下界或上界
所以我在阅读泛型方法时感到很困惑。先说一下这里的问题: 在这个例子中:假设我需要一个适用于任何类型 T 的 selectionSort 版本，方法是使用调用者提供的外部可比较对象。第一次尝试: pu

首页

博学

6Ren·AI

商城

python - cv_result中的 'mean_test_score'是什么意思？