python - Pandas pd.merge "TypeError: string indices must be integers, not str"-6ren

python - Pandas pd.merge "TypeError: string indices must be integers, not str"

转载作者：行者123 更新时间：2023-12-01 05:37:38

我已经广泛研究了这个简单的问题，但找不到答案。我正在尝试使用 pandas 的 pd.merge 基于名为“JN”的公共(public)列来合并两个文件。我相信它将我的“加入”(os.path.join)文件名视为字符串而不是数据帧/csv 文件。在我调用 pd.merge 函数后，错误提示“字符串索引必须是整数，而不是 str”。

import pandas as pd
import os

path = r"C:/Users/St/Documents/House/m2"

dirs = os.listdir(path)

for file in dirs:
    if file.endswith("J.csv"):
        J = file
        if len(J) is 12: #some filenames are 12 chars others 11
            jroot = J[:7]
        else:
            jroot = J[:6]

for file in dirs:
    if file.endswith("2.csv"):
        W = file
        if len(W) is 12:
            root2 = W[:7]
        else:
            root2 = W[:6]

JJ = os.path.join(path, J)
WW = os.path.join(path, W)

if jroot == root2:          # if the first 7 (or 6) characters match, then merge
    JW = pd.merge(JJ, WW, on="JN")

在与上述 pd.merge 函数调用相关的过程中，我收到此错误:

TypeError: string indices must be integers, not str

我想知道如何让它读取我的文件名字符串作为实际文件或数据帧。 JJ 和 WW 相当于打印出来的完整路径。我尝试使用 pd.DataFrame 创建这些“文件名”数据帧，但无法做到这一点。

最佳答案

您无法合并两个字符串。我认为您对 os.path.join 返回的内容感到困惑。它返回一个字符串。您必须实际从名为 JJ 和 WW 的文件中读取 DataFrame，然后执行合并。

以下是编写 2 个 DataFrame、使用 read_csv 读回它们，然后将它们合并到列 group 上的完整示例:

In [49]: df1 = DataFrame(randn(10, 1), columns=['a'])

In [50]: df1['group'] = np.random.choice(['b', 'c'], size=len(df1))

In [51]: df2 = DataFrame(randn(10, 1), columns=['b'])

In [52]: df2['group'] = np.random.choice(['b', 'c'], size=len(df1))

In [53]: df1.to_csv('df1.csv', index=False)

In [54]: cat df1.csv
a,group
-1.590035935931282,b
0.5496398501891229,c
-0.6484689548035797,b
0.19162302248253205,b
-0.9852064283582675,c
0.5975155551821989,b
0.29443634291217047,b
-0.7929994157215382,b
-1.9546460886048795,b
0.19195457928475546,c

In [55]: df2.to_csv('df2.csv', index=False)

In [56]: cat df2.csv
b,group
-1.2874060006117918,c
1.1037959548210117,b
0.47172389260467507,c
0.12802538607490285,c
-0.8753708425917293,b
-0.09187827793091947,b
1.140204215271196,c
0.4862940170888638,b
-1.1080430563137758,b
-1.3698112665693232,c

In [57]: df1_csv = read_csv('df1.csv', index_col=None)

In [58]: df2_csv = read_csv('df2.csv', index_col=None)

In [59]: df1_csv
Out[59]:
       a group
0 -1.590     b
1  0.550     c
2 -0.648     b
3  0.192     b
4 -0.985     c
5  0.598     b
6  0.294     b
7 -0.793     b
8 -1.955     b
9  0.192     c

In [60]: df2_csv
Out[60]:
       b group
0 -1.287     c
1  1.104     b
2  0.472     c
3  0.128     c
4 -0.875     b
5 -0.092     b
6  1.140     c
7  0.486     b
8 -1.108     b
9 -1.370     c

In [61]: df3 = pd.merge(df1_csv, df2_csv, on='group')

In [62]: df3
Out[62]:
        a group      b
0  -1.590     b  1.104
1  -1.590     b -0.875
2  -1.590     b -0.092
3  -1.590     b  0.486
4  -1.590     b -1.108
5  -0.648     b  1.104
6  -0.648     b -0.875
7  -0.648     b -0.092
8  -0.648     b  0.486
9  -0.648     b -1.108
10  0.192     b  1.104
11  0.192     b -0.875
12  0.192     b -0.092
13  0.192     b  0.486
14  0.192     b -1.108
15  0.598     b  1.104
16  0.598     b -0.875
17  0.598     b -0.092
18  0.598     b  0.486
19  0.598     b -1.108
20  0.294     b  1.104
21  0.294     b -0.875
22  0.294     b -0.092
23  0.294     b  0.486
24  0.294     b -1.108
25 -0.793     b  1.104
26 -0.793     b -0.875
27 -0.793     b -0.092
28 -0.793     b  0.486
29 -0.793     b -1.108
30 -1.955     b  1.104
31 -1.955     b -0.875
32 -1.955     b -0.092
33 -1.955     b  0.486
34 -1.955     b -1.108
35  0.550     c -1.287
36  0.550     c  0.472
37  0.550     c  0.128
38  0.550     c  1.140
39  0.550     c -1.370
40 -0.985     c -1.287
41 -0.985     c  0.472
42 -0.985     c  0.128
43 -0.985     c  1.140
44 -0.985     c -1.370
45  0.192     c -1.287
46  0.192     c  0.472
47  0.192     c  0.128
48  0.192     c  1.140
49  0.192     c -1.370

其他一些事情:

不要使用is来比较对象是否相等，而是使用==。只有在小整数的情况下，它才能可靠地工作，即使这样，您也不应该依赖它，因为这是 CPython 的实现细节。

不必使用 str.endswith 检查文件名，只需首先通过通配符迭代您想要的内容即可:

import glob

for f in glob.glob(os.path.join(path, '*J.csv')):
    if len(f) == 12:
        # do all the thingz!

关于python - Pandas pd.merge "TypeError: string indices must be integers, not str"，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/18553893/

文章推荐： java - 创建我自己的 YAJSW Java 服务包装器实现？

文章推荐： jquery - 如何计算每一行的乘法并用jquery显示

文章推荐： java - SOAP 故障堆栈跟踪

文章推荐： javascript - 如何连接数组值作为键来获取 json 对象值

javascript - native 基础 toast - TypeError : TypeError: TypeError: null is not an object (evaluating 'this.toastInstance._root.getModalState' )
我正在使用 React Native 构建移动应用程序。我面临 Nativ Base Toast 问题。当我第一次加载应用程序然后导航到工单状态时，如果我返回带有 android 后退按钮的主页，则会
TypeError: $(...).perfectScrollbar is not a function(TypeError：$(...).Perfect滚动条不是函数)
我正在尝试创建一个“完美的滚动条”，它是这样的：。Https://github.com/noraesae/perfect-scrollbar-bower。使用尽可能简单的代码：。我犯了以下错误：。当然
javascript - Draftjs: TypeError: TypeError: this.getImmutable(...) 未定义
我正在尝试在简单的 Draftjs 编辑器上应用自定义装饰器: import React from 'react'; import {Editor, EditorState, RichUtils} f
TypeError - read csv functionality(TypeError-读取CSV功能)
读取以钟形字符作为分隔符的CSV文件时，出现类型错误。我不想使用熊猫，我需要使用CSV库来解决这个问题。。示例标题：。数据类型。样本数据：。示例代码。我明白这个错误-。铃声字符参考-https://w
reactjs - TypeError : TypeError: (0, _reactRedux.useSelector) 不是函数
我正在处理 useSelector的 react-redux在我的 React Native 应用程序中，我收到以下错误: TypeError: TypeError: (0, _reactRedux.
javascript - Node 子进程生成 "TypeError: Bad argument TypeError"？
当我用 Node 运行以下代码时: var command = "/home/myScript.sh"; fs.exists(command, function(exists){ if(exi
reactjs - TypeError : wrapper. 存在不是函数 && TypeError : wrapper. find 不是函数
我正在为我的一个组件编写测试用例，该组件具有路由器(使用 withrouter)。我收到错误 wrapper.find is not a function。基本要求是需要检查我的渲染中是否存在标签，还
javascript - jquery TypeError : $(. ..).validate 和 TypeError : $(. ..).modal 不是函数
我一直在研究一个简单的表单提交。首先，我想在提交表单之前创建一个模式警报。于是，我使用了bootstrap的modal函数，反复得到 TypeError: $(...).modal is not a
python - is_authenticated() 引发 TypeError TypeError : 'bool' object is not callable
这个问题在这里已经有了答案: Flask-Login raises TypeError: 'bool' object is not callable when trying to override
TypeError: 'ListNode' object has no attribute '__getitem__'(TypeError：‘ListNode’对象没有属性‘__getitem__’)
这是我在leetcode中遇到的问题。您将看到两个非空链接表，表示两个非负整数。数字以相反的顺序存储，并且它们的每个节点都包含一个数字。将这两个数字相加，然后以链表的形式返回总和。。你可以假设这两个数
Why am I seeing "TypeError: string indices must be integers"?(为什么我看到“TypeError：字符串索引必须是整数”？)
我正在尝试学习Python，并试图将GitHub问题变成一种可读的形式。根据关于如何将JSON转换为CSV的建议，我得出了以下结论：。其中“Issues.json”是包含GitHub问题的JSON文件
javascript - 代理类的 TypeError - TypeError : 'set' on proxy: trap returned truish for property
我在使用 Proxy 类时遇到了这个有趣的错误: TypeError: 'set' on proxy: trap returned truish for property 'users' which
TypeError:unsupported format string passed to function .__format__(TypeError：传递给函数的格式字符串不受支持。__FORMAT__)
在研究Jupyter笔记本电脑时，我遇到了这个问题：。这是代码开始的地方：。下面的代码是在jupyter笔记本的另一个单元上运行的。我怎么才能解决它呢？。尝试更改参数和一系列其他内容，但所有这些都弹出
TypeError:unsupported format string passed to function .__format__(TypeError：传递给函数的格式字符串不受支持。__FORMAT__)
Working on jupyter notebooks, I came across this problem:在研究Jupyter笔记本电脑时，我遇到了这个问题： TypeError:un
javascript - TypeError : object is not a function - Javascript, ExtJS、Jasmine 和 TypeError:将循环结构转换为 JSON
我对此很陌生(对于 Jasmine 测试、ExtJs 和 JS 来说确实很陌生)，我必须修复这个错误/错误。我正在运行一些单元测试，但不断收到以下错误: TypeError: object is no
TypeError: run_simple() got an unexpected keyword argument 'jupyter_mode'(TypeError：Run_Simple()获得意外的关键字参数‘jupyter_mode’)
在下面的文档中，我们可以不使用JupyterDash在笔记本中运行应用程序，而只需运行app.run(jupyter_mode=“外部”)。。Https://dash.plotly.com/dash-
angular - ionic 错误地理定位 ionic 未捕获( promise ): TypeError: Object(…) is not a function TypeError: Object(…) is not a function
导入地理位置时: import { Geolocation } from '@ionic-native/geolocation/ngx'; 获取错误: ionic Geolocation :Ionic
python - TypeError: __getitem__() takes exactly 2 arguments (2 given) TypeError? ( python 3)
我定义了以下函数: def eigval(matrix): a = matrix[0, 0] b = matrix[0, 1] c = matrix[1, 0] d =
Diffusers SDXL "TypeError: argument of type 'NoneType' is not iterable"(Differs SDXL“TypeError：‘NoneType’类型的参数不可迭代”)
刚刚获得了SDXL模型的访问权限，希望为即将发布的版本进行测试...不幸的是，我们当前用于我们服务的代码似乎不能与稳定ai/稳定-扩散-xl-base-0.9一起工作，我不完全确定SDXL有什么不同，
ERROR: TypeError: Cannot read properties of undefined (reading 'username')(错误：TypeError：无法读取未定义的属性(正在读取‘UserName’))
这是我的全部代码。我试图通过/insta/：id在我的page.ejs页面上查找，但它显示错误：。无法读取未定义的属性(正在读取‘UserName’)。。我希望获得uuidv4()将提供的id，但它返

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - Pandas pd.merge "TypeError: string indices must be integers, not str"