string - 执行 pandas to_hdf 时出现错误消息 "Exception: cannot find the correct atom type"-6ren

string - 执行 pandas to_hdf 时出现错误消息 "Exception: cannot find the correct atom type"

转载作者：行者123 更新时间：2023-12-02 09:57:36

27

4

我想将数据帧 df 保存到 .h5 文件 MainDataFile.h5 :

df.to_hdf ("c:/Temp/MainDataFile.h5", "MainData", mode = "w", format = "table", data_columns=['_FirstDayOfPeriod','Category','ChannelId'])

并出现以下错误:

*** Exception: cannot find the correct atom type -> > [dtype->object,items->Index(['Libellé_Article', 'Libellé_segment'], dtype='object')]

现在，如果我从 df 中删除“Libellé_Article”列(这是一个字符串列)，我将不再收到错误消息。

此专栏可能存在什么问题？我怀疑其中有一个特殊的、禁止的字符，但目前还无法找到。

更新1

根据 Jeff 的评论，我尝试对“Libellé_Article”列进行编码:

df['Libellé_Article'] = df['Libellé_Article'].str.encode('utf-8')

该列现在显示如下:

df['Libellé_Article']
0                                               b'PAPETERIE'
2                                    b'NR CONTRIBUTION DEEE'
4                                         b'NON UTILISE 103'
7                         b"L'ENFANT SOUS TERREUR/MILLER A."
10                 b'ENERGIE VITALE ET AUTOGUERISON/CHIA M.'
12         b'ENERGIE COSMIQUE CETTE PUISSANCE QUI EST EN ...
13         b'ENERGIE COSMIQUE CETTE PUISSANCE QUI EST EN ...
18                     b"COMMENT ATTIRER L'ARGENT/MURPHY J."
19                     b"COMMENT ATTIRER L'ARGENT/MURPHY J."

当我执行命令 to_hdf 时，我得到:

*** TypeError: Cannot serialize the column [Libellé_Article] because its data contents are [mixed] object dtype

最佳答案

这将在 py2 中工作。对于 py3，这应该无需编码步骤即可工作。这实际上是一个“混合”列，因为它包含字符串和 unicode。

In [24]: from pandas.compat import u

In [25]: df = DataFrame({'unicode':[u('\u03c3')] * 5 + list('abc') })

In [26]: df
Out[26]: 
  unicode
0       ?
1       ?
2       ?
3       ?
4       ?
5       a
6       b
7       c

In [27]: df['unicode'] = df.unicode.str.encode('utf-8')

In [28]: df.to_hdf('test.h5','df',mode='w',data_columns=['unicode'],format='table')

In [29]: pd.read_hdf('test.h5','df')
Out[29]: 
  unicode
0       ?
1       ?
2       ?
3       ?
4       ?
5       a
6       b
7       c

关于string - 执行 pandas to_hdf 时出现错误消息 "Exception: cannot find the correct atom type"，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/30101088/

27

4

0

文章推荐： sql-server - 替换sql数据透视表中的空值

文章推荐： python - 如何使用 scikit-learn 按标准差标准化数据帧？

文章推荐： java - Freemarker自定义字符转义(转义LaTeX特殊字符)

文章推荐： java - 如何将 HTML 标签与 Java 字符串一起添加？

python - Pandas to_hdf 溢出错误
Python新手在这里。我正在尝试使用 to_hdf 将大数据帧保存到带有 lz4 压缩的 HDF 文件中。我使用 Windows 10、Python 3、Pandas 20.2 我收到错误“溢出
python - pandas to_hdf 函数获取非法指令
这似乎是 OS X 10.10 和 docker 特定的错误。当我尝试 import pandas as pd df = pd.DataFrame([[1,2,3], [2,3,4]], column
python - dataframe.to_hdf() 中的参数键是什么意思
DataFrame.to_hdf(path_or_buf, key, **kwargs) pandas官方文档中说key是store中组的标识。但是，这是什么意思？尽管如此，我还是找不到足够的例子。
python-3.x - Pandas to_hdf() 溢出警告
我需要将一个大数据帧保存到 hdf5 文件，所以我使用了: self.feature_concated.to_hdf(self.h5_result_name, key='feature_data',
python - 使用 to_hdf 将 pandas 数据帧保存到磁盘时出现段错误
我正在尝试使用 hdf5 格式将数据帧保存到磁盘。即使是这段简单的代码也会给我“段错误(核心已转储)” import pandas as pd import tables df=pd.DataFram
python - 缺少可选依赖项 'tables' 。在 Pandas to_hdf
以下代码给我错误。 import pandas as pd df = pd.DataFrame({'a' : [1,2,3]}) df.to_hdf('temp.h5', key='df', mode
python - 追加带有 pandas' to_hdf 的行是 H5 文件大小的倍数吗？
我有一个大约 13,000 行 × 5 列的 HDF5，随着时间的推移，这些行通过 DF.to_hdf(Filename, 'df',append=True, format='table') 这是大小
python - 使用 pandas.to_hdf 快速读取 df 中的指定列
我有一个 2Gb 的数据帧，一次写入，多次读取 df。我想在 pandas 中使用 df，因此我使用了固定格式的 df.read_hdf 和 df.to_hdf，在读写方面效果很好。但是，df随着列
python - 尽管有 'to_hdf'，但用 pandas 'index=None' 重复索引
我想将数据存储在 HDFS 文件中，但将新数据附加到该文件会使索引重复。我可以知道如何避免吗？ In [35]: hdf = pd.HDFStore('temp.h5') In [36]: hdf.i
python - Pandas to_hdf() TypeError : object of type 'int' has no len()
我想存储一个 pandas DataFrame，这样当我稍后再次加载它时，我只加载它的某些列而不是整个东西。因此，我试图以 hdf 格式存储 pandas DataFrame。 DataFrame 包
python - pandas，使用 pd.to_hdf 将多个数据集存储在一个 h5 文件中
假设我有两个数据框， import pandas as pd df1 = pd.DataFrame({'col1':[0,2,3,2],'col2':[1,0,0,1]}) df2 = pd.Data
string - 执行 pandas to_hdf 时出现错误消息 "Exception: cannot find the correct atom type"
我想将数据帧 df 保存到 .h5 文件 MainDataFile.h5 : df.to_hdf ("c:/Temp/MainDataFile.h5", "MainData", mode = "w",

首页

博学

6Ren·AI

商城

string - 执行 pandas to_hdf 时出现错误消息 "Exception: cannot find the correct atom type"