python - 数字错误 "sequence item 0: expected str instance, type found"-6ren

python - 数字错误 "sequence item 0: expected str instance, type found"

转载作者：行者123 更新时间：2023-11-28 17:29:25

26

4

我想在多元回归分析中选择变量。我尝试使用此代码 http://planspace.org/20150423-forward_selection_with_statsmodels/ .问题是我想从 50 个变量中选择，这需要太多时间。我使用 Numba 使其更快并编写了以下代码:

@jit
def forward_selected(data, response):
"""Linear model designed by forward selection.

Parameters:
-----------
data : pandas DataFrame with all possible predictors and response

response: string, name of response column in data

Returns:
--------
model: an "optimal" fitted statsmodels linear model
       with an intercept
       selected by forward selection
       evaluated by adjusted R-squared
"""
remaining = set(data.columns)
remaining.remove(response)
selected = [str]
current_score, best_new_score = 0.0, 0.0
while remaining and current_score == best_new_score:
    scores_with_candidates = [str]
    for candidate in remaining:
        formula = "{} ~ {} + 1".format(response,
                                       ' + '.join(selected + [candidate]))
        score = smf.ols(formula, data).fit().rsquared_adj
        scores_with_candidates.append((score, candidate))
    scores_with_candidates.sort()
    best_new_score, best_candidate = scores_with_candidates.pop()
    if current_score < best_new_score:
        remaining.remove(best_candidate)
        selected.append(best_candidate)
        current_score = best_new_score
formula = "{} ~ {} + 1".format(response,
                               ' + '.join(selected))
model = smf.ols(formula, data).fit()
return model

model = forward_selected(df, col)

但它返回以下错误:

TypeError: sequence item 0: expected str instance, type found

请告诉我如何修复它。如果您不明白我的问题，我很乐意在评论中提供更多信息。

Traceback (most recent call last):

File "~/PycharmProjects/anacondaenv/touhu_1.py", line 164, in

submit = forecast(col)

File "~/PycharmProjects/anacondaenv/touhu_1.py", line 75, in forecast

model = forward_selected(df, col)TypeError: sequence item 0: expected str instance, type found

最佳答案

我认为查看 numba 是否真的起到助推器作用的最佳方法之一是尝试使用 njit 而不是 jit 装饰器。 njit 强制 no-python-mode 并在有任何退回到 python 时中断(它根本没有速度优势)。简短回答:不要使用除 np.ndarrays 之外的任何东西。因此，没有字符串、没有元组、没有列表，也没有调用未编译的函数。

所以我修复了错误:numba 不允许在主函数主体中使用空列表...不知道为什么(也许是错误？!)但是如果你将它移动到 while 阻止。

import statsmodels.formula.api as smf
import numba as nb

@nb.jit
def forward_selected_nojit(data, response):
    """Linear model designed by forward selection.

    Parameters:
    -----------
    data : pandas DataFrame with all possible predictors and response

    response: string, name of response column in data

    Returns:
    --------
    model: an "optimal" fitted statsmodels linear model
           with an intercept
           selected by forward selection
           evaluated by adjusted R-squared
    """
    remaining = set(data.columns)
    remaining.remove(response)
    selected = None  # Changed this line
    current_score, best_new_score = 0.0, 0.0
    while remaining and current_score == best_new_score:
        if selected is None:  # Changed this and next line
            selected = []
        scores_with_candidates = []
        for candidate in remaining:
            formula = "{} ~ {} + 1".format(response,
                                           ' + '.join(selected + [candidate]))
            score = smf.ols(formula, data).fit().rsquared_adj
            scores_with_candidates.append((score, candidate))
        scores_with_candidates.sort()
        best_new_score, best_candidate = scores_with_candidates.pop()
        if current_score < best_new_score:
            remaining.remove(best_candidate)
            selected.append(best_candidate)
            current_score = best_new_score
    formula = "{} ~ {} + 1".format(response,
                                   ' + '.join(selected))
    model = smf.ols(formula, data).fit()
    return model

这可能可以用更好的方式解决，但这里重要的是时间安排。但首先检查 numba 是否会产生任何奇怪的东西:

# With numba
sl ~ rk + yr + 1
0.835190760538

# Without numba
sl ~ rk + yr + 1
0.835190760538

所以结果是一样的，现在让我们看看它们的表现如何:

# with numba
10 loops, best of 3: 264 ms per loop

# without numba
10 loops, best of 3: 252 ms per loop

所以这完全符合我的预期。使用 python 类型并调用未编译的外部函数，您不会获得任何速度增益。您可能可以使用 numba 使其更快，但请务必阅读 numba 文档并查看支持的内容:Python types和 Numpy Types

关于python - 数字错误 "sequence item 0: expected str instance, type found"，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/35605333/

26

4

0

文章推荐： python - 如何检查 docker 实例是否正在运行？

文章推荐： python - 用 lambda pickle defaultdict

文章推荐： javascript - Array.prototype.find() 在异步函数中返回未定义

文章推荐： python - 将 pandas dataframe 转换为新格式的有效方法

swift - str = str + "abc"比 str = "abc"+ str 慢？
你信吗？我有一个这样的循环(请原谅任何错误，我不得不大量编辑大量信息和变量名称，相信我它有效)。 ...旧示例已删除，请参见下面的代码... 如果我将那些中间的 str = "Blah\(odat.c
c# - 为什么是 str = str.Replace().Replace();比 str = str.Replace(); 快str = str.替换()？
我正在做一个本地测试来比较 C# 中 String 和 StringBuilder 的 Replace 操作性能，但是对于 String 我使用了以下代码: String str = "String
c++ - 使用 str += "A"或 str = str + "A"连接字符串之间的性能差异
我想知道为什么str += "A"和 str = str + "A"有不同的表现。在实践中， string str = "cool" for(int i = 0; i approximately
python - 转换类型列表 [ ("[' str' ]", int), ("['str' ]", int)] to [(' str', int), ('str' , int)]
我有一个类型列表 [("['106.52.116.101']", 1), ("['45.136.108.85']", 1)] 并想将其转换为 [('106.52.116.101', 1), ('45.
python - 转换类型列表 [ ("[' str' ]", int), ("['str' ]", int)] to [(' str', int), ('str' , int)]
我有一个类型列表 [("['106.52.116.101']", 1), ("['45.136.108.85']", 1)] 并想将其转换为 [('106.52.116.101', 1), ('45.
string - 为什么遍历 HashMap<&str,&str> 会产生 &&str？
我正在遍历 HashMap并通过一些本地变量中的模式匹配将值放入其中。委托(delegate)者 fn lyrics_no_bottles(song_template:&mut String){
python - 为什么是 str.count ('' ) ≠ (from str.count ('A' ) + str.count ('B' ) + ... + str.count ('Z' ))
如果字符串(短语)中只有元音，它(对我而言)说True；否则说 False。我不明白为什么它总是返回 False，因为 (x >= x) 总是返回 True。我感谢任何人检查此查询的解决方案。 (st
rust - 我如何实现一种方法来处理 &str、Box、Rc 等？
我有代码以某种方式转换字符串引用，例如取第一个字母 trait Tr { fn trim_indent(self) -> Self; } impl Tr for &'a str { f
c++ - char* str ="ab", str 和 &str 的混淆
我正在学习指针，这是我的代码。我定义了一个指向 char(实际上是字符串)的指针 *str 和一个指向 int *a 的指针，它们的定义方式相同。我认为 str 和 a 都应该是一个地址，但是当我试图
python - Mypy 索引类型 "str"为 "Union[str, Dict[str, str]]"无效；预期类型 "Union[int, slice]"
为什么我会收到错误消息？我已经正确添加了类型，对吗？ Invalid index type "str" for "Union[str, Dict[str, str]]"; expected type
javascript - ['null' ,'' ,'undefined' ].indexOf(str) < 0 和 (str !== null || str !== '' || str !== undefined) 等价吗？
你知道下面两个函数是否等价吗？ function validate(str) { return ( ['null','','undefined'].indexOf(str) [v, valida
python - pd.Series.str.lower.replace ('str' , 'replace_str' ) 不起作用但 pd.Series.str.replace。 ('STR' , 'replace_str' ) 呢？
我正在解决这里的 Dataquest 问题:https://app.dataquest.io/m/293/data-cleaning-basics/5/removing-non-digit-chara
python - 将 str 列表排序为成对的 str，其中一个 str 具有 -R
我有一个字符串列表，如下所示: ["A TB", "A-R TB", "B TB", "B-R TB", "C TB", "C-R TB"...] 但字符串的顺序是随机的。我如何编写一个将元素配对的函
python - Pandas str.extract : AttributeError: 'str' object has no attribute 'str'
我正在尝试将此函数从使用 split 改为使用 str.extract (正则表达式)。 def bull_lev(x): spl = x.rsplit(None, 2)[-2].strip(
python - 将 [{str :int}, {str :int}, ... ] 的字典列表转换为 {str:int} 的单个字典
给定这样的数据结构: [{'a':1, 'b': 2}, {'c':3 }, {'a':4, 'c':9}, {'d':0}, {'d': 0, 'b':6}] 目标是解析数据以产生: {'a': 2
python - 将 [{str :int}, {str :int}, ... ] 的字典列表转换为 {str:int} 的单个字典
给定这样的数据结构: [{'a':1, 'b': 2}, {'c':3 }, {'a':4, 'c':9}, {'d':0}, {'d': 0, 'b':6}] 目标是解析数据以产生: {'a': 2
python - pyside/pyqt : when converting str() to QTreeWidgetItem() the str() is shortened to the [0] of str()
s = 'someString' s = QTreeWidgetItem(s) print(s.text(0)) # 0 being 'column' 输出: 's' 如果我对另一
c++ - 黑白 char* str[]、char *str 和 char str[] 的区别
黑白有什么区别: function(char* str ) function(char* str[] ) function(char str[] ) 它们是如何被调用的(通过什么类型的string/c
javascript - JavaScript 中的 str.fun()/str.fun/fun(str) 有什么区别？
我试过谷歌搜索但找不到准确的答案，所以请允许我尝试在这里提问。如果问题看起来不合适，请告诉我，我会删除它。在 JS 中，您可以通过三种不同的方式编写特定的内置功能: 字符串长度 str.toStri
c - *str 和 *str++
我有这段代码(我的 strlen 函数) size_t slen(const char *str) { size_t len = 0; while (*str) {

首页

博学

6Ren·AI

商城

python - 数字错误 "sequence item 0: expected str instance, type found"