具有warm_start结果的Scikit-learn Randomforest(不可广播的输出...)-6ren

具有warm_start结果的Scikit-learn Randomforest(不可广播的输出...)

转载作者：行者123 更新时间：2023-12-03 09:02:18

我正在尝试构建一个在线随机森林分类器。在 for 循环中，我遇到了一个错误，但找不到原因。

clf = RandomForestClassifier(n_estimators=1, warm_start=True)

在 for 循环中，我在读取新数据时增加估计器的数量。

clf.n_estimators = (clf.n_estimators + 1)
clf = clf.fit(data_batch, label_batch)

循环3次后，运行代码时在循环中预测如下:

predicted = clf.predict(data_batch)

我收到以下错误:

ValueError: non-broadcastable output operand with shape (500,1) doesn't match the broadcast shape (500,2)

数据的形状为 (500,153)，标签为 (500,)。

这是更完整的代码:

clf = RandomForestClassifier(n_estimators=1, warm_start=True)
clf = clf.fit(X_train, y_train)
predicted = clf.predict(X_test)

batch_size = 500

for i in xrange(batch_init_size, records, batch_size):
    from_ = (i + 1)
    to_ = (i + batch_size + 1)

    data_batch = data[from_:to_, :]
    label_batch = label[from_:to_]

    predicted = clf.predict(data_batch)

    clf.n_estimators = (clf.n_estimators + 1)
    clf = clf.fit(data_batch, label_batch)

最佳答案

是的，该错误是由于批处理中样本类数量不等造成的。我通过使用包含所有类的批量大小解决了这个问题。

关于具有warm_start结果的Scikit-learn Randomforest(不可广播的输出...)，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/49334063/

文章推荐： c# - 无法从 Dictionary 转换为 IDictionary

文章推荐： php - 从 WooCommerce 3 中的电子邮件模板获取产品 ID

文章推荐： networking - Erlang:从主机名获取IP地址

文章推荐： r - 在ggplot中添加正交回归线

scikit-learn - `warm_start` 参数及其对计算时间的影响
我有一个logistic regression具有一组已定义参数的模型 (warm_start=True)。一如既往，我调用 LogisticRegression.fit(X_train, y_tr
python - Keras 训练 warm_start
是否有可能像在 scikit-learn 中使用 warm_start 参数一样，使用所有超参数(包括降低学习率)和从以前的 epoch 保存的权重继续训练 Keras 估计器？像这样: estima
python - Warm_Start=True 的 MLPClassifier 在一次迭代中收敛
我正在使用 scikit learn 的 MLPClassifier 和以下参数 mlp = MLPClassifier(hidden_layer_sizes=(3,2,),solver='sgd',

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

具有warm_start结果的Scikit-learn Randomforest(不可广播的输出...)