gpt4 book ai didi

python - 这个错误: python pandas的来源是什么

转载 作者:行者123 更新时间:2023-12-01 03:23:34 25 4
gpt4 key购买 nike

import pandas as pd
census_df = pd.read_csv('census.csv')
#census_df.head()

def answer_seven():

census_df_1 = census_df[(census_df['SUMLEV'] == 50)].set_index('CTYNAME')
census_df_1['highest'] = census_df_1[['POPESTIAMTE2010','POPESTIAMTE2011','POPESTIAMTE2012','POPESTIAMTE2013','POPESTIAMTE2014','POPESTIAMTE2015']].max()
census_df_1['lowest'] =census_df_1[['POPESTIAMTE2010','POPESTIAMTE2011','POPESTIAMTE2012','POPESTIAMTE2013','POPESTIAMTE2014','POPESTIAMTE2015']].min()
x = abs(census_df_1['highest'] - census_df_1['lowest']).tolist()
return x[0]
answer_seven()

这是尝试使用census.csv中的数据来查找2010-2015年人口绝对变化最大的县(POPESTIMATES),我想简单地找出abs之间的差异.每年/列的最大值和最小值。您必须返回一个字符串。另外 [(census_df['SUMLEV'] ==50)] 表示仅采用县,因为它们设置为 50。但是代码给出了一个以

结尾的错误

KeyError: "['POPESTIAMTE2010' 'POPESTIAMTE2011' 'POPESTIAMTE2012' 'POPESTIAMTE2013'\n 'POPESTIAMTE2014' 'POPESTIAMTE2015'] not in index"

我是否索引了错误的数据结构?我对数据科学和编码非常陌生。

最佳答案

我认为代码中的列名称有拼写错误。模式是“POPESTIMATE201?”而不是“POPESTIAMTE201?”

任何有关缩短代码的帮助将不胜感激。这是有效的代码 -

census_df = pd.read_csv('census.csv')
def answer_seven():
cdf = census_df[(census_df['SUMLEV'] == 50)].set_index('CTYNAME')
columns = ['POPESTIMATE2010', 'POPESTIMATE2011', 'POPESTIMATE2012', 'POPESTIMATE2013', 'POPESTIMATE2014', 'POPESTIMATE2015']
cdf['big'] = cdf[columns].max(axis =1)
cdf['sml'] = cdf[columns].min(axis =1)
cdf['change'] = cdf[['big']].sub(cdf['sml'], axis=0)
return cdf['change'].idxmax()

关于python - 这个错误: python pandas的来源是什么,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/41663030/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com