gpt4 book ai didi

python - 如何将 csv 字符串转换为 pandas 中的列表?

转载 作者:太空宇宙 更新时间:2023-11-03 14:10:52 24 4
gpt4 key购买 nike

我正在处理具有以下格式的 csv 文件:

"Id","Sequence"
3,"1,3,13,87,1053,28576,2141733,508147108,402135275365,1073376057490373,9700385489355970183,298434346895322960005291,31479360095907908092817694945,11474377948948020660089085281068730"
7,"1,2,1,5,5,1,11,16,7,1,23,44,30,9,1,47,112,104,48,11,1,95,272,320,200,70,13,1,191,640,912,720,340,96,15,1,383,1472,2464,2352,1400,532,126,17,1,767,3328,6400,7168,5152,2464,784,160,19,1,1535,7424"
8,"1,2,4,5,8,10,16,20,32,40,64,80,128,160,256,320,512,640,1024,1280,2048,2560,4096,5120,8192,10240,16384,20480,32768,40960,65536,81920,131072,163840,262144,327680,524288,655360,1048576,1310720,2097152"
11,"1,8,25,83,274,2275,132224,1060067,3312425,10997342,36304451,301432950,17519415551,140456757358,438889687625,1457125820233,4810267148324,39939263006825,2321287521544174,18610239435360217"

我想将其读入一个类型为 df['Id'] 的数据框,类型为类整数,类型为 df['Sequence'] 类似于列表。

我目前有以下笨拙的代码:

def clean(seq_string):
return list(map(int, seq_string.split(',')))

# Read data
training_data_file = "data/train.csv"
train = pd.read_csv(training_data_file)
train['Sequence'] = list(map(clean, train['Sequence'].values))

这似乎可行,但我觉得使用 pandas 和 numpy native 也可以实现同样的效果。

有人推荐吗?

最佳答案

您可以指定 converter对于 Sequence 列:

converters: dict, default None

Dict of functions for converting values in certain columns. Keys can either be integers or column labels

train = pd.read_csv(training_data_file, converters={'Sequence': clean})

关于python - 如何将 csv 字符串转换为 pandas 中的列表?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/38171072/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com