r - 比例尺错误。默认 : length of 'center' must equal the number of columns of 'x'-6ren

r - 比例尺错误。默认 : length of 'center' must equal the number of columns of 'x'

转载作者：行者123 更新时间：2023-12-01 23:20:24

我正在使用 mboost 包进行一些分类。这是代码

library('mboost')
load('so-data.rdata')
model <- glmboost(is_exciting~., data=training, family=Binomial())
pred <- predict(model, newdata=test, type="response")

但是 R 在进行预测时提示

Error in scale.default(X, center = cm, scale = FALSE) : 
  length of 'center' must equal the number of columns of 'x'

数据(训练和测试)可以在此处下载( 7z 、 zip )。错误的原因是什么以及如何消除它？谢谢。

更新:

> str(training)
'data.frame':   439599 obs. of  24 variables:
 $ is_exciting                           : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ school_state                          : Factor w/ 52 levels "AK","AL","AR",..: 15 5 5 23 47 5 44 42 42 5 ...
 $ school_charter                        : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ school_magnet                         : Factor w/ 2 levels "f","t": 1 1 1 1 2 1 1 1 1 1 ...
 $ school_year_round                     : Factor w/ 2 levels "f","t": 1 1 1 1 1 2 1 1 1 2 ...
 $ school_nlns                           : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ school_charter_ready_promise          : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ teacher_prefix                        : Factor w/ 6 levels "","Dr.","Mr.",..: 5 5 3 5 6 5 6 6 5 6 ...
 $ teacher_teach_for_america             : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 2 1 2 1 ...
 $ teacher_ny_teaching_fellow            : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ primary_focus_subject                 : Factor w/ 28 levels "","Applied Sciences",..: 19 17 18 18 10 4 17 17 18 17 ...
 $ primary_focus_area                    : Factor w/ 8 levels "","Applied Learning",..: 6 5 5 5 5 4 5 5 5 5 ...
 $ secondary_focus_subject               : Factor w/ 28 levels "","Applied Sciences",..: 28 18 17 19 26 18 18 28 24 25 ...
 $ secondary_focus_area                  : Factor w/ 8 levels "","Applied Learning",..: 7 5 5 6 8 5 5 7 7 4 ...
 $ resource_type                         : Factor w/ 7 levels "","Books","Other",..: 4 4 2 5 5 2 2 5 5 5 ...
 $ poverty_level                         : Factor w/ 4 levels "high poverty",..: 2 2 4 2 1 2 2 1 2 1 ...
 $ grade_level                           : Factor w/ 5 levels "","Grades 3-5",..: 5 5 2 5 5 2 3 2 4 2 ...
 $ fulfillment_labor_materials           : num  30 35 35 30 30 35 30 35 35 35 ...
 $ total_price_excluding_optional_support: num  1274 477 892 548 385 ...
 $ total_price_including_optional_support: num  1499 562 1050 645 453 ...
 $ students_reached                      : int  31 20 250 36 19 28 90 21 60 56 ...
 $ eligible_double_your_impact_match     : Factor w/ 2 levels "f","t": 1 2 1 2 1 2 1 1 1 1 ...
 $ eligible_almost_home_match            : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 2 2 1 1 ...
 $ essay_length                          : int  236 285 194 351 383 273 385 437 476 159 ...


> str(test)
'data.frame':   44772 obs. of  23 variables:
 $ school_state                          : Factor w/ 51 levels "AK","AL","AR",..: 22 35 11 46 5 35 11 28 28 10 ...
 $ school_charter                        : Factor w/ 2 levels "f","t": 1 1 1 1 2 1 1 1 1 1 ...
 $ school_magnet                         : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ school_year_round                     : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ school_nlns                           : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ school_charter_ready_promise          : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ teacher_prefix                        : Factor w/ 6 levels "","Dr.","Mr.",..: 3 5 6 6 3 5 5 5 3 5 ...
 $ teacher_teach_for_america             : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ teacher_ny_teaching_fellow            : Factor w/ 2 levels "f","t": 1 2 1 1 1 1 1 1 1 1 ...
 $ primary_focus_subject                 : Factor w/ 28 levels "","Applied Sciences",..: 5 16 17 17 18 11 16 17 2 17 ...
 $ primary_focus_area                    : Factor w/ 8 levels "","Applied Learning",..: 2 4 5 5 5 2 4 5 6 5 ...
 $ secondary_focus_subject               : Factor w/ 28 levels "","Applied Sciences",..: 25 1 19 1 17 9 17 11 1 1 ...
 $ secondary_focus_area                  : Factor w/ 8 levels "","Applied Learning",..: 4 1 6 1 5 6 5 2 1 1 ...
 $ resource_type                         : Factor w/ 7 levels "","Books","Other",..: 5 5 5 2 5 6 4 5 5 4 ...
 $ poverty_level                         : Factor w/ 4 levels "high poverty",..: 1 2 4 4 1 2 2 2 1 2 ...
 $ grade_level                           : Factor w/ 5 levels "","Grades 3-5",..: 4 3 3 5 4 5 5 4 3 5 ...
 $ fulfillment_labor_materials           : num  30 30 30 30 30 30 30 30 30 30 ...
 $ total_price_excluding_optional_support: num  2185 149 1017 156 860 ...
 $ total_price_including_optional_support: num  2571 175 1197 183 1012 ...
 $ students_reached                      : int  200 110 10 22 180 51 30 15 260 20 ...
 $ eligible_double_your_impact_match     : Factor w/ 2 levels "f","t": 1 1 1 1 1 1 1 1 1 1 ...
 $ eligible_almost_home_match            : Factor w/ 2 levels "f","t": 2 1 1 1 1 1 1 1 2 1 ...
 $ essay_length                          : int  221 137 313 243 373 344 304 431 231 173 ...


> summary(model)

     Generalized Linear Models Fitted via Gradient Boosting

Call:
glmboost.formula(formula = is_exciting ~ ., data = training,     family = Binomial())


     Negative Binomial Likelihood 

Loss function: { 
     f <- pmin(abs(f), 36) * sign(f) 
     p <- exp(f)/(exp(f) + exp(-f)) 
     y <- (y + 1)/2 
     -y * log(p) - (1 - y) * log(1 - p) 
 } 


Number of boosting iterations: mstop = 100 
Step size:  0.1 
Offset:  -1.197806 

Coefficients: 

NOTE: Coefficients from a Binomial model are half the size of coefficients
 from a model fitted via glm(... , family = 'binomial').
See Warning section in ?coef.mboost

                       (Intercept)                     school_stateDC 
                     -0.5250166130                       0.0426909965 
                    school_stateIL                    school_chartert 
                      0.0084191638                       0.0729272310 
                teacher_prefixMrs.                  teacher_prefixMs. 
                     -0.0181489492                       0.0438425925 
        teacher_teach_for_americat                 resource_typeBooks 
                      0.2593005345                       0.0046126706 
           resource_typeTechnology        fulfillment_labor_materials 
                     -0.0313904871                       0.0120086140 
eligible_double_your_impact_matcht        eligible_almost_home_matcht 
                     -0.0316376431                      -0.0522717398 
                      essay_length 
                      0.0004993224 
attr(,"offset")
[1] -1.197806

Selection frequencies:
       fulfillment_labor_materials         teacher_teach_for_americat 
                              0.24                               0.15 
                      essay_length                    school_chartert 
                              0.15                               0.09 
                 teacher_prefixMs.            resource_typeTechnology 
                              0.08                               0.07 
eligible_double_your_impact_matcht        eligible_almost_home_matcht 
                              0.07                               0.07 
                teacher_prefixMrs.                     school_stateDC 
                              0.04                               0.02 
                    school_stateIL                 resource_typeBooks 
                              0.01                               0.01

我也尝试了glm，但它说

Error in model.frame.default(Terms, newdata, na.action = na.action, xlev = object$xlevels) : 
  factor teacher_prefix has new levels

但我在 teacher_prefix 变量中没有看到任何新级别:

> levels(training$teacher_prefix)
[1] ""           "Dr."        "Mr."        "Mr. & Mrs." "Mrs."       "Ms."       
> levels(test$teacher_prefix)
[1] ""           "Dr."        "Mr."        "Mr. & Mrs." "Mrs."       "Ms."

最佳答案

实际上，glmboost和glm的问题是相关的。您的 teacher_prefix 变量有问题。

正如glm示例所指出的，测试中的某些级别不在训练中(某种程度)。虽然这两个因素具有相同的 levels()，但训练集没有观察到 teacher_prefix=="" 但测试有的情况。比较

table(test$teacher_prefix)
table(training$teacher_prefix)

所以 glm 实际上给出了更准确、更有用的错误消息。问题与 glmboost 相同，尽管它没有那么直接地说出来。

这样做似乎可以“修复”它

test2 <- subset(test, teacher_prefix %in% c("Dr.","Mr.","Mrs.","Ms."))
test2$teacher_prefix <- droplevels(test2$teacher_prefix)
pred <- predict(model, newdata=test2, type="response")

我们只是去掉未使用的级别，然后进行标准预测。

关于r - 比例尺错误。默认 : length of 'center' must equal the number of columns of 'x' ，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/24224554/

文章推荐： java - 使用状态来管理 onTouchEvent

文章推荐： apache-spark - rdd后面的数字是什么意思

文章推荐： iis - Chrome 是否忽略缓存控制 : max-age?

文章推荐： ios - Swift Codable 将空 json 解码为 nil 或空对象

css - 使用flexbox对齐2个div : one center-center and the other bottom center
这个问题在这里已经有了答案: Flexbox: Align between bottom and center? [duplicate] (2 个答案) 关闭 6 年前。我正在尝试使用 flexb
html - 是还是？
用于居中元素的标签是或 ? 我知道或标签不再使用，但哪一个是正确的？我很迷惑。 Visual Studio 代码说错了，但只有适用于 Microsoft Edge。编辑:Microsof
game-center - Game Center 回合制自动匹配不工作
我使用的是标准匹配用户界面和两台 iPad iOS6。问题是当我在第一台设备中创建新的匹配项时，第二台设备应该在我查看匹配用户界面时看到现有的匹配项，但事实并非如此。我确定我的代码是正确的。这是方法:
java - setHorizontalAlignment(CENTER) - CENTER 无法解析为变量
在我的项目的两个不同的类(都扩展 JFrame)中，我尝试这样做: header = new JLabel("Header"); header.setHorizontalAlignme
jQuery - 视差效果 - 将背景位置更改为 "Center Center"
我已经在我的网站中实现了以下 jQuery 视差效果(取自 http://www.brandaiddesignco.com/insights/parallax-scrolling-made-easy/
css - 文本对齐 :center centers images too
每当我在我的 css 中居中文本时，它也会使图像居中。这是为什么？文本位于一个 div(“左”)内，图像位于第二个 div(“右”)内。这是 HTML - BANKS 还有
html - 没有怎么居中？
我现在有这段代码: 1 enE jk 5.6 mtE hp 6.4 hpE eb HML reE pm 514 hpE e
html - 我怎样才能让我的按钮居中，不起作用
我只是制作一个页面或类似的东西。我想让我的按钮居中，但它不起作用。我尝试用不同的方法将它居中，但它不起作用。看看代码。它有但它并没有将它置于整个页面的中心但只是喜欢......我不知道。照片:htt
css - 文本对齐 :center not centering text
我想知道如何将此页面中的字形居中，它们是使用@font-face 的图标字体，但即使在我将 text-align:center 应用于包含每个字形的 anchor 之后，它们也不是居中。它们显示为左
html - 有没有办法消除 "..."标签对特定元素的影响？
考虑以下 HTML 片段， Hyperlink 如上所示，由于“...”标签，所有内容都将居中对齐。但是，我不希望带有“id=three”的 HTML 元素的中心对齐
CSS 文本对齐 : center; is not centering things
我有以下 html: Site Map Privacy Policy Terms & Condi
安卓蜂窝 : Off-center button text - unable to center
我在将我正在处理的布局中的按钮上的文本居中时遇到问题。请查看下面的屏幕截图(在 eclipse 的“图形布局”选项卡中截取): 我不确定是什么原因造成的。我试着玩弄按钮的各个布局属性，但这没有任何效果
css - 有没有办法确保所有文本对齐 : center text is indeed centered?
我正在尝试使用 Bootstrap 组装页脚。由于某些原因，尽管确保应用了 text-align: center;，但页脚链接看起来略微偏离中心。有没有办法确保所有文本元素确实正确对齐？ HTML
css - Flexbox 对齐元素 :center not perfect in the center
我在使用 flexbox 工具 align-items:center; 时遇到了问题，它在中间并不完美，但文本、图标等的像素太高了……有人知道如何解决这个问题？ screenshot how it l
html - flex 对齐元素 : center not centering items
关闭。这个问题需要debugging details .它目前不接受答案。编辑问题以包含 desired behavior, a specific problem or error, and t
game-center - 以编程方式邀请 Game Center friend 参加比赛
GameKit 是否允许您以编程方式邀请特定的 Game Center friend 参加比赛，即不提供 GC ViewController？以下 handleInviteFromGameCenter
Android - 布局问题 - Textviews top center and bottom center
我有一个布局问题。假设我有一个 RelativeLayout 出现在我的屏幕底部。在此，我想添加 2 个 TextView ，一个在中心，一个在顶部中心，一个在底部中心。 |------------
安卓布局 : Center Text and Image View to screen center?
我有一个启动画面的 Activity layout.xml，它基本上显示了我想要的内容。但是如果我在更大或更小的屏幕设备上打开应用程序，它就不再居中了。有没有办法在元素周围放置一个包装器并将其在屏幕
javascript - 背景位置为 : center center without jump 的视差
尝试了几十种方法后，我自己也想不出解决方案 #banner .slide { position: static; top: 0px; left: 0px; z-inde
c++ - 避免过多的函数参数 : class-centered or function-centered approach?
您将如何修复以下传递过多参数的错误代码？ void helper1(int p1, int p3, int p5, int p7, int p9, int p10) { // ... } void

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

r - 比例尺错误。默认 : length of 'center' must equal the number of columns of 'x'