How do I properly set the Datetimeindex for a Pandas datetime object in a dataframe?(如何正确设置DataFrame中Pandas DateTime对象的Datetimeindex？)-6ren

How do I properly set the Datetimeindex for a Pandas datetime object in a dataframe?(如何正确设置DataFrame中Pandas DateTime对象的Datetimeindex？)

转载作者：bug小助手更新时间：2023-10-25 09:26:16

24

4

I have a pandas dataframe:

我有一个熊猫数据框：

    lat         lng         alt days              date        time
0   40.003834   116.321462  211 39745.175405      2008-10-24  04:12:35
1   40.003783   116.321431  201 39745.175463  2008-10-24      04:12:40
2   40.003690   116.321429  203 39745.175521      2008-10-24      04:12:45
3   40.003589   116.321427  194 39745.175579      2008-10-24      04:12:50
4   40.003522   116.321412  190 39745.175637      2008-10-24      04:12:55
5   40.003509   116.321484  188 39745.175694      2008-10-24      04:13:00

For which I am trying to convert the df['date'] and df['time'] columns into a datetime. I can do:

为此，我尝试将df[‘date’]和df[‘time’]列转换为日期时间。我可以做到：

df['Datetime'] = pd.to_datetime(df['date']+df['time'])
df = df.set_index(['Datetime'])
del df['date']
del df['time']

And I get:

我得到了：

                    lat         lng         alt days
Datetime                            
2008-10-2404:12:35  40.003834   116.321462  211 39745.175405    
2008-10-2404:12:40  40.003783   116.321431  201 39745.175463
2008-10-2404:12:45  40.003690   116.321429  203 39745.175521    
2008-10-2404:12:50  40.003589   116.321427  194 39745.175579    
2008-10-2404:12:55  40.003522   116.321412  190 39745.175637

But then if I try:

但如果我试一试：

df.between_time(time(1),time(22,59,59))['lng'].std()

I get an error - 'TypeError: Index must be DatetimeIndex'

我收到错误-‘TypeError：Index必须为DatetimeIndex’

So, I've also tried setting the DatetimeIndex:

因此，我还尝试设置了DatetimeIndex：

df['Datetime'] = pd.to_datetime(df['date']+df['time'])
#df = df.set_index(['Datetime'])
df = df.set_index(pd.DatetimeIndex(df['Datetime']))
del df['date']
del df['time']

And this throws an error also - 'DateParseError: unknown string format'

这也抛出了一个错误--‘DateParseError：未知的字符串格式’

How do I create the datetime column and DatetimeIndex correctly so that df.between_time() works right?

如何正确地创建datetime列和DatetimeIndex，以便df.between_time（）正常工作？

更多回答

The 'DateParseError: unknown string format' is that it cannot figure out the "2008-10-2404:12:35" format since the 'DD' and 'HH' are adjacent.

‘DateParseError：未知字符串格式’是因为‘DD’和‘HH’是相邻的，所以它无法识别“2008-10-2404：12：35”格式。

优秀答案推荐

To simplify Kirubaharan's answer a bit:

将Kirubaharan的回答简单化一点：

df['Datetime'] = pd.to_datetime(df['date'] + ' ' + df['time'])
df = df.set_index('Datetime')

And to get rid of unwanted columns (as OP did but did not specify per se in the question):

并删除不需要的列(就像OP所做的那样，但没有在问题中具体说明其本身)：

df = df.drop(['date','time'], axis=1)

You are not creating datetime index properly,

您没有正确创建日期时间索引，

format = '%Y-%m-%d %H:%M:%S'
df['Datetime'] = pd.to_datetime(df['date'] + ' ' + df['time'], format=format)
df = df.set_index(pd.DatetimeIndex(df['Datetime']))

You may also want to set inplace=True. This way it returns the same df

您可能还希望设置inplace=True。通过这种方式，它返回相同的df

df["datetime"] = pd.to_datetime(df["date"] + " " + df["time"], format = "%Y-%m-%d %H:%M:%S")
df.set_index(["datetime"], inplace=True)

This worked best for me:

这对我来说效果最好：

format = '%Y-%m-%d%H:%M:%S'
df['Datetime'] = pd.to_datetime(df['date'] + df['time'].astype("string"), format=format)

In some cases Python treats df['date'] as column of integers.

在某些情况下，Python将df[‘date’]视为整数列。

I had trouble with setting a column formatted as YYYY-MM-DD as a date time index column in a data frame I needed for time series forecasting. This is how I solved it for a dateframe where I wanted "dateCol" to be the datetime index:

我在将YYYY-MM-DD格式的列设置为时间序列预测所需的数据框中的日期时间索引列时遇到了麻烦。这就是我如何解决日期框问题的方法，在该日期框中，我希望将“date Col”作为日期时间索引：

idx = pd.DatetimeIndex(self.df[dateCol])
self.df = self.df.set_index(idx)

Then to drop the column so it's not duplicated in the dataframe

然后删除该列，这样它就不会在数据帧中重复

self.df = self.df.drop(dateCol, axis=1)

更多回答

So the trick here is adding a space between the date and time and then the pd.to_datetime() Does The Right Thing with the resultant strings?

所以这里的诀窍是在日期和时间之间添加一个空格，然后pd.to_Datetime()对结果字符串做正确的事情？

if inplace=True, does it really return anything? can we simply remove the assignment operator and just use the right-hand-side?

如果inplace=True，它是否真的返回任何内容？我们可以简单地删除赋值操作符，只使用右侧吗？

@MJK In place prevents you from creating df object by performing the operation on the same df. To use the assignment on the right, you'd have to type in the operation as the first argument to df.set_index, It is cleaner to use the assignment operator first.

@MJK in Place防止您通过在同一个DF上执行操作来创建DF对象。要使用右侧的赋值，您必须键入操作作为df.set_index的第一个参数，首先使用赋值操作符会更简洁。

24

4

0

python - pd.DatetimeIndex.weekday 和 pd.DatetimeIndex.dayofweek 有什么区别
在 pandas datetimeindex 中，dayofweek和 weekday似乎是一样的。他们只是彼此的别名吗？我发现了这些功能 here 最佳答案根据pandas源码定义的Datetim
Python DatetimeIndex 错误 - TypeError : ("cannot do label indexing on
到目前为止，我有 EdChum 提供的以下代码: In [1]: df = pd.DataFrame({'a': [None] * 6, 'b': [2, 3, 10, 3, 5, 8]}) df["

python - 如何使用包含过滤数据帧 DatetimeIndex
我有一个按日期时间索引的数据框。我正在尝试创建某种过滤器，它只提供包含特定时间的帧。例如，所有包含“09:30”的帧 df.dtypes open float64 high
python - 没有频率的差异pandas.DateTimeIndex
不规则时间序列 data存储在 pandas.DataFrame 中.一个 DatetimeIndex已经设置好了。我需要索引中连续条目之间的时间差。我以为就这么简单 data.index.diff
Pandas DatetimeIndex 到数据帧
如何将 DatetimeIndex 更改为像这样的简单数据框: month 0 2013-07-31 1 2013-08-31 2 2013-09-30 3 2013-10-3
python - 从日期列表中删除单词 DateTimeIndex
我在 pandas 数据框中有多个以下格式的日期列表: col1 col2 1 [DatetimeInde
python - DatetimeIndex 对象中的日期列表
我有一个 DatetimeIndex 对象，它由两个日期组成，如下所示: import pandas as pd timestamps = pd.DatetimeIndex(['2014-1-1',
python - DatetimeIndex 偏移量
我有一个数据框，使用以下代码生成: time_index = pd.date_range(start=datetime(2013, 1, 1, 3), e
python - x轴不连续时如何删除冗余日期时间pandas DatetimeIndex
我想绘制一个 pandas 系列，其索引是不计其数的 DatatimeIndex。我的代码如下: import matplotlib.dates as mdates index = pd.Dateti
python - 如何有效地重新采样 DatetimeIndex
Pandas 在系列/数据帧上有一个 resample 方法，但似乎没有办法单独对 DatetimeIndex 进行重采样？具体来说，我有一个每日 Datetimeindex，其中可能缺少日期，我想
python - 如何提取 DateTimeIndex 以在新列中使用？
我已将一组 Excel 文件中的文件名中的日期提取到 DateTimeIndex 对象列表中。我现在需要将每个提取的日期写入我从每个 Excel 工作表创建的数据框的新日期列。我的代码的工作原理是将新
pandas - 计算 DateTimeIndex 的时间差
我想计算 DateTimeIndex 中时间之间的时间差 import pandas as pd p = pd.DatetimeIndex(['1985-11-14', '1985-11-28', '
date - 如何舍入 Pandas `DatetimeIndex` ？
我有一个 pandas.DatetimeIndex ，例如: pd.date_range('2012-1-1 02:03:04.000',periods=3,freq='1ms') >>> [2012
python - Pandas:重新采样数据帧以匹配不同数据帧的 DatetimeIndex
我在单独的 pandas.dataframe 中有两个时间序列，第一个 - series1与第二个条目相比，条目较少且开始数据时间不同 - series2 : index1 = pd.date_ran
python - 使用 DatetimeIndex 选择单行作为数据框
我在数据框中有一个带有 DatetimeIndex 的时间序列，如下所示: import pandas as pd dates= ["2015-10-01 00:00:00", "2
python - 从 DateTimeIndex 中截断毫秒
当我使用pandas.date_range()时，有时我的时间戳有很多我不想保留的毫秒数。假设我... import pandas as pd dr = pd.date_range('2011-01
python - 根据 DateTimeIndex 更新列
我有一个带有 DateTimeIndex 的 Pandas 数据框和一个名为 WEEKEND 的空列。如果索引中的日期时间是周末，我想将该列的值设置为“YES”，以便生成的数据帧如下所示: TIME
python - 创建每月的 pandas DatetimeIndex
我有一个包含 12 个值的数据框，我想将其转换为 DatetimeIndex 类型 months = df['date'] #e.g. '2016-04-01' idx = pd.date_range
python - Pandas DatetimeIndex 奇怪的行为
我处理一个DataFrame，其索引是字符串，年月，例如: index = ['2007-01', '2007-03', ...] 但是，索引未满。例如缺少 2007-02。我想要的是使用完整索引重新
python - 仅包含时间部分的 DatetimeIndex : is it possible
我一直被这样的问题困扰。我有一套客流量的观察。数据存储在.xlsx文件中，结构如下:观察日期、时间、车站名称、登机、下车。我想知道如果我只需要日期时间的“时间”组件，是否可以从此类数据创建带有 Da

首页

博学

6Ren·AI

商城

How do I properly set the Datetimeindex for a Pandas datetime object in a dataframe?(如何正确设置DataFrame中Pandas DateTime对象的Datetimeindex？)