python - 关于使用 dict 分割数据帧的错误-6ren

python - 关于使用 dict 分割数据帧的错误

转载作者：太空宇宙更新时间：2023-11-03 20:15:27

28

4

训练数据如下所示:

p,x,s,n,t,p,f,c,n,k,e,e,s,s,w,w,p,w,o,p,k,s,u
e,x,s,y,t,a,f,c,b,k,e,c,s,s,w,w,p,w,o,p,n,n,g
e,b,s,w,t,l,f,c,b,n,e,c,s,s,w,w,p,w,o,p,n,n,m
p,x,y,w,t,p,f,c,n,n,e,e,s,s,w,w,p,w,o,p,k,s,u
e,x,s,g,f,n,f,w,b,k,t,e,s,s,w,w,p,w,o,e,n,a,g
e,x,y,y,t,a,f,c,b,n,e,c,s,s,w,w,p,w,o,p,k,n,g
e,b,s,w,t,a,f,c,b,g,e,c,s,s,w,w,p,w,o,p,k,n,m

第一栏是关于该蘑菇是否可以食用的标签。(e:可食用，p:有毒)我想根据是否可食用将这些数据分成两部分。我的代码如下:

mushdf = pd.read_csv('agaricus-lepiota.data') #load in two data for mushroom and iris
mushdf.columns = ['edible?','cap-shape','cap-surface','cap-color','bruises?','odor',
                    'gill-attachment','gill-spacing','gill-size','gill-color',
                    'stalk-shape','stalk-root','stalk-surface-above-ring','stalk-surface-below-ring',
                    'stalk-color-above-ring','stalk-color-below-ring','veil-type','veil-color',
                    'ring-number','ring-type','spore-print-color','population','habitat']
print(mushdf)
mushdic = {key: mushdf for (key, mushdf) in mushdf.groupby('edible?')}
for key in mushdic:
    print(f'mushdic[{key}]')
    print(mushdic[key])
    print('-'*50)

问题是，当我删除第 2 行到第 6 行中的 mushdf.columns 时，此代码有效。但是，当我执行 mushdf.columns 时，终端返回错误消息。

用另一列同样的方法就可以了。例如，mushdic = {key: mushdf for (key, mushdf) in mushdf.groupby('bruises?')} 运行正确。

我对此一无所知。

Traceback (most recent call last):
  File "e:\Visual Studio Project\LiMing\vs2017_python\.vscode\helloworld.py", line 11, in <module>
    mushdic = {key: mushdf for (key, mushdf) in mushdf.groupby('edible?')}
  File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python36_64\lib\site-packages\pandas\core\generic.py", line 7894, in groupby
    **kwargs
  File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python36_64\lib\site-packages\pandas\core\groupby\groupby.py", line 2522, in groupby
    return klass(obj, by, **kwds)
  File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python36_64\lib\site-packages\pandas\core\groupby\groupby.py", line 391, in __init__
    mutated=self.mutated,
  File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python36_64\lib\site-packages\pandas\core\groupby\grouper.py", line 621, in _get_grouper
    raise KeyError(gpr)
KeyError: 'edible?'
The terminal process terminated with exit code: 1

最佳答案

pandas.read_csv意味着 csv 文件中的第一行是标题。由于您的 csv 文件没有标题，因此您需要在导入过程中告知这一点。您还应该在此处传递列名称:

mushdf = pd.read_csv('agaricus-lepiota.data', header=None, names=[
                'edible?','cap-shape','cap-surface','cap-color','bruises?','odor',
                'gill-attachment','gill-spacing','gill-size','gill-color',
                'stalk-shape','stalk-root','stalk-surface-above-ring','stalk-surface-below-ring',
                'stalk-color-above-ring','stalk-color-below-ring','veil-type','veil-color',
                'ring-number','ring-type','spore-print-color','population','habitat'])

关于python - 关于使用 dict 分割数据帧的错误，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/58476374/

28

4

0

文章推荐： python - Django 模型 : Default for Choice Foreign Key

文章推荐： html - 完整的背景图像变得可点击

文章推荐： c# - 我应该创建单元测试基地吗

python - 通过 dict of dict of dict 计算值
我有以下数据: foo red test foo red test foo red test2 foo blue test bar red test bar blue test bar red tes
python - 将 2d dict 的 dict 内的值除以与该键匹配的另一个 dicts 值
这些字典很长，我需要循环遍历多个键值。举一个简短的例子。从长远来看，我需要将所有数字除以适当的计数以获得平均值。 counts = {'A':10, 'B':14} totals = {'A':{'a
python - 如何在 dict 创建期间从另一个 dict 构建 Python dict 值？
我想构建一个字典，其中一个值是从另一个值构建的。我想写 d = { 'a':1, 'b':self['a']+1 } 但它没有按预期工作: >>> {'a':1, 'b':self[
python - 当 dict 用作另一个 dict 的键时，TypeError : unhashable type: 'dict' ,
这个问题在这里已经有了答案: TypeError: unhashable type: 'dict' (4 个回答) 关闭5年前。我有这段代码: for element in json[referen
Python:对这个字典进行排序(dict in dict)
d = { 'a':{'k':1, 'b':'whatever'}, 'b':{'k':2, 'b':'sort by k'} } 想在 python 中按 k 降序对这个字典进行排序。有点棘手，
python - 如何将 defaultdicts [of defaultdicts] 的 defaultdict 转换为 dicts [of dicts] 的 dict？
使用 this answer ，我创建了 defaultdict 的 defaultdict。现在，我想把那个嵌套很深的 dict 对象变回一个普通的 python dict。 from collec
python - Pandas:将数据框列转换为 dict，其中 col 标题作为 dict 键，col 值作为 dict 值
我有一个如下所示的数据框: NAME ID 155 ARBITRARY_A 697381 208 ARBITRARY_B 691820 2
Python 类型列表[Dict] 与列表[dict]
在添加类型提示 python 函数中哪个是首选？ from typing import List, Dict def example_1() -> List[Dict]: pass def e
python - 字典结构(dict -> dict)与比较中的列表
有这个字典 -> 字典 -> 列表结构想要比较这种类型的 2 个结构。 one = {"1iG5NDGVre": {"118": ["test1", "test2", "test3", "tcp",
python - 递归搜索和修改复杂对象(dict ofists of dicts)
我有一个复杂的对象。目前它是字典列表的字典。但将来可能会发生变化，同时仍然只使用列表和字典。我想查找所有类型为“datetime”的列表元素或字典值并将它们更改为字符串。递归搜索似乎有效，但无法
python - Django-不支持的操作数类型/: 'dict' and 'dict'
我不知道如何在 html 中显示“净利润”的结果，这是 net_profit/sales 的结果。我不想把这个除法公式直接放在html中，因为除此之外还有很多其他复杂的计算。那么如何把这个除法结果同
列表的 Python dict 到单项匹配索引的 dict
我找不到与此用例类似的问题。我有一个包含列表的字典，我想从每个列表中提取一个特定的索引，并将其分配到一个具有相同键的新字典中。 dict1 = { 'key1': ['a', 'b', 'c'],
python - 为什么 dict(dict) 得到相同的字典？
这听起来可能很愚蠢。当我在 python 中重写 dict 时: class idict(dict): def __init__ (self, *args, **kwargs):
python - 将 dict 的元组键转换为新的 dict
我有一个像这样的 dict: { ('America', 25, 'm', 'IT'): 10000, ('America', 22, 'm', 'IT'): 8999, ('
键上的 Python dict of dict 正则表达式
我仍在努力学习 Python 中的字典。是否可以使用正则表达式或 startswith 函数在字典中引用键？我有以下格式的字典，我正在尝试对“AreaOfInterest1”、“AreaOfInte
python - 如何将 dict 功能重新分配回 'dict' ？
这个问题在这里已经有了答案: How to restore a builtin that I overwrote by accident? (3 个答案) 关闭 5 年前。我不小心为关键字 dic
Python:如何遍历一个列表[Dict{List[Dict{}]}]
我只是想知道是否有一种简单的方法可以做到这一点。我有一个从文件解析的特定结构，输出是一个字典列表的一个列表。目前，我只有一些看起来像这样的代码: for i in xrange(len(data)):
Python:dict within dict，通配符第一个键
我想在字典中匹配字典的数据。这: print(a["myval"]["val1"]) 努力获得所需的输出。但是我想“通配” myval 条目。同时输出 myval2 的结果 print(a['*'][
dictionary - React JS setState({dict : dict })
我想知道这是否是用两个字典更新状态的正确解决方案 var PopulationCityView = React.createClass({ getInitialState: function(
python - CSV 到 dict，dict 找不到该项目
我正在将 CSV 转换为 dict，所有值均已正确加载，但有一个问题。 CSV: Testing testing\nwe are into testing mode My\nServer Thi

首页

博学

6Ren·AI

商城

python - 关于使用 dict 分割数据帧的错误