python - Very poor predictions: LSTM time-series


Reposted by: 行者123. Updated: 2023-11-28 20:01:48

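I am trying to implement an LSTM model for time-series prediction. Below is my trial code. It runs without errors, and you can try it yourself without any extra dependencies.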

import numpy as np, pandas as pd, matplotlib.pyplot as plt
from sklearn.preprocessing import MinMaxScaler
from keras.models import Sequential
from keras.layers import LSTM, Dense, TimeDistributed, Bidirectional
from sklearn.metrics import mean_squared_error, accuracy_score
from scipy.stats import linregress
from sklearn.utils import shuffle

fi = 'pollution.csv'
raw = pd.read_csv(fi, delimiter=',')
raw = raw.drop('Dates', axis=1)
print (raw.shape)

scaler = MinMaxScaler(feature_range=(-1, 1))
raw = scaler.fit_transform(raw)

time_steps = 7
def create_ds(data, t_steps):
    data = pd.DataFrame(data)
    data_s = data.copy()
    for i in range(time_steps):
        data = pd.concat([data, data_s.shift(-(i+1))], axis = 1)   
    data.dropna(axis=0, inplace=True)
    return data.values

ds = create_ds(raw, time_steps)
print (ds.shape)
n_feats = raw.shape[1]
n_obs = time_steps * n_feats

n_rows = ds.shape[0]
train_size = int(n_rows * 0.8)

train_data = ds[:train_size, :]
train_data = shuffle(train_data)

test_data = ds[train_size:, :]

x_train = train_data[:, :n_obs]
y_train = train_data[:, n_obs:]
x_test = test_data[:, :n_obs]
y_test = test_data[:, n_obs:]

x_train = x_train.reshape(1, x_train.shape[0], x_train.shape[1])
y_train = y_train.reshape(1, y_train.shape[0], y_train.shape[1])
x_test = x_test.reshape(1, x_test.shape[0], x_test.shape[1])

print (x_train.shape)
print (y_train.shape)
print (x_test.shape)
print (y_test.shape)

model = Sequential()
model.add(LSTM(64, return_sequences=True, input_shape=(None, x_train.shape[2]), stateful=True, batch_size=1))
model.add(LSTM(32, return_sequences=True, stateful=True))
model.add(LSTM(n_feats, return_sequences=True, stateful=True)) 

model.compile(loss='mse', optimizer='rmsprop')
model.fit(x_train, y_train, epochs=10, batch_size=1, verbose=2)  
y_predict = model.predict(x_test)
y_predict = y_predict.reshape(y_predict.shape[1], y_predict.shape[2])

y_predict = scaler.inverse_transform(y_predict)

y_test = scaler.inverse_transform(y_test)
y_test = y_test[:,0]
y_predict = y_predict[:,0]

print (y_test.shape)
print (y_predict.shape)

plt.plot(y_test, label='True')
plt.plot(y_predict,  label='Predict')
plt.legend()
plt.show()

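However, the predictions are extremely poor. How can I improve them? Do you have any ideas?

Are there any ideas for improving the predictions by redesigning the architecture and/or the layers?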

Best answer

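If you want to use the model from my code (the link you passed), you need to shape the data correctly: (1 sequence, total_time_steps, 5 features).

Important: I don't know whether this is the best way to do it or the best model, but this model predicts 7 time steps ahead of the input (time_shift=7).

Data and initial variables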

fi = 'pollution.csv'
raw = pd.read_csv(fi, delimiter=',')
raw = raw.drop('Dates', axis=1)
print("raw shape:")
print (raw.shape)
#(1789,5) - 1789 time steps / 5 features

scaler = MinMaxScaler(feature_range=(-1, 1))
raw = scaler.fit_transform(raw)

time_shift = 7 #shift is the number of steps we are predicting ahead
n_rows = raw.shape[0] #n_rows is the number of time steps of our sequence
n_feats = raw.shape[1]
train_size = int(n_rows * 0.8)


#I couldn't understand how "ds" worked, so I simply removed it because in the code below it's not necessary

#getting the train part of the sequence
train_data = raw[:train_size, :] #first train_size steps, all 5 features
test_data = raw[train_size:, :] #I'll use the beginning of the data as state adjuster


#train_data = shuffle(train_data) !!!!!! we cannot shuffle time steps!!! we lose the sequence doing this

x_train = train_data[:-time_shift, :] #the entire train data, except the last shift steps 
x_test = test_data[:-time_shift,:] #the entire test data, except the last shift steps
x_predict = raw[:-time_shift,:] #the entire raw data, except the last shift steps

y_train = train_data[time_shift:, :] 
y_test = test_data[time_shift:,:]
y_predict_true = raw[time_shift:,:]

x_train = x_train.reshape(1, x_train.shape[0], x_train.shape[1]) #ok shape (1,steps,5) - 1 sequence, many steps, 5 features
y_train = y_train.reshape(1, y_train.shape[0], y_train.shape[1])
x_test = x_test.reshape(1, x_test.shape[0], x_test.shape[1])
y_test = y_test.reshape(1, y_test.shape[0], y_test.shape[1])
x_predict = x_predict.reshape(1, x_predict.shape[0], x_predict.shape[1])
y_predict_true = y_predict_true.reshape(1, y_predict_true.shape[0], y_predict_true.shape[1])

print("\nx_train:")
print (x_train.shape)
print("y_train")
print (y_train.shape)
print("x_test")
print (x_test.shape)
print("y_test")
print (y_test.shape)
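
The slicing above pairs every input step with the target that lies `time_shift` steps later. The following is a minimal sketch of that alignment on a toy array; the array `toy` and its values are made up for illustration and are not taken from pollution.csv.

```python
import numpy as np

time_shift = 7
# Toy stand-in for the scaled data: 20 time steps, 2 features (hypothetical values).
toy = np.arange(40, dtype=float).reshape(20, 2)

x = toy[:-time_shift, :]   # inputs: every step except the last `time_shift`
y = toy[time_shift:, :]    # targets: the same series, `time_shift` steps later

# Keras recurrent layers expect (sequences, steps, features).
x = x.reshape(1, x.shape[0], x.shape[1])
y = y.reshape(1, y.shape[0], y.shape[1])

print(x.shape, y.shape)
# Step t of the targets is step t + time_shift of the original series.
print(np.array_equal(y[0, 0], toy[time_shift]))
```

Both arrays end up with the same number of steps, which is what a model with return_sequences=True on every layer needs in order to emit one prediction per input step.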

Model

Your model is not very powerful for this task, so I tried a bigger one (this one, on the other hand, is too powerful).

model = Sequential()
model.add(LSTM(64, return_sequences=True, input_shape=(None, x_train.shape[2])))
model.add(LSTM(128, return_sequences=True))
model.add(LSTM(256, return_sequences=True))
model.add(LSTM(128, return_sequences=True))
model.add(LSTM(64, return_sequences=True))
model.add(LSTM(n_feats, return_sequences=True)) 

model.compile(loss='mse', optimizer='adam')

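Fitting

Note that I had to train for more than 2000 epochs for the model to give good results.
I added validation data so that we can compare the loss on the training and test data.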

#notice that I'm predicting from the ENTIRE sequence, including x_train      
#is important for the model to adjust its states before predicting the end
model.fit(x_train, y_train, epochs=1000, batch_size=1, verbose=2, validation_data=(x_test,y_test))

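Predicting

Important: since we are predicting the end of a sequence based on its beginning, the model needs to see the beginning in order to adjust its internal states. That is why I predict on the entire data (x_predict), not only on the test data.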

y_predict_model = model.predict(x_predict)

print("\ny_predict_true:")
print (y_predict_true.shape)
print("y_predict_model: ")
print (y_predict_model.shape)


def plot(true, predicted, divider):

    predict_plot = scaler.inverse_transform(predicted[0])
    true_plot = scaler.inverse_transform(true[0])

    predict_plot = predict_plot[:,0]
    true_plot = true_plot[:,0]

    plt.figure(figsize=(16,6))
    plt.plot(true_plot, label='True',linewidth=5)
    plt.plot(predict_plot,  label='Predict',color='y')

    if divider > 0:
        maxVal = max(true_plot.max(),predict_plot.max())
        minVal = min(true_plot.min(),predict_plot.min())

        plt.plot([divider,divider],[minVal,maxVal],label='train/test limit',color='k')

    plt.legend()
    plt.show()

test_size = n_rows - train_size
print("test length: " + str(test_size))

plot(y_predict_true,y_predict_model,train_size)
plot(y_predict_true[:,-2*test_size:],y_predict_model[:,-2*test_size:],test_size)

Showing the entire data

Showing its end portion for more detail
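
To put numbers next to what the two plots show, the error of a full-sequence prediction can be computed separately on each side of the train/test limit. This is only a sketch: `split_mse` is a hypothetical helper, and the arrays are random stand-ins for `y_predict_true` and `y_predict_model`. With the variables above, the targets start at `raw[time_shift]`, so a boundary of roughly `train_size - time_shift` would be the assumption to make.

```python
import numpy as np

def split_mse(y_true, y_pred, boundary):
    """Return (train_mse, test_mse) for arrays shaped (1, steps, features)."""
    err = (y_true[0] - y_pred[0]) ** 2
    return err[:boundary].mean(), err[boundary:].mean()

# Hypothetical data standing in for y_predict_true / y_predict_model.
rng = np.random.default_rng(0)
y_true = rng.uniform(-1, 1, size=(1, 100, 5))
y_pred = y_true.copy()
y_pred[:, 80:] += 0.5   # pretend the model is off by 0.5 after step 80

train_mse, test_mse = split_mse(y_true, y_pred, boundary=80)
print(train_mse, test_mse)
```

A test error far above the train error is the numeric counterpart of a prediction curve that follows the true curve up to the limit line and drifts away after it.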

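Note that this model is overfitting: it learns the training data well and gives bad results on the test data.

To fix this, you have to experiment with smaller models, dropout layers, and other techniques that prevent overfitting.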
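
As one possible starting point for such experiments, here is a sketch of a much smaller stack that uses the dropout arguments built into the Keras LSTM layer. The layer size (32) and the dropout rates (0.2) are arbitrary assumptions, not values tested on this data.

```python
import numpy as np
from keras.models import Sequential
from keras.layers import LSTM

n_feats = 5  # number of features, as in the data above

model = Sequential()
# dropout acts on the layer inputs, recurrent_dropout on the recurrent state
model.add(LSTM(32, return_sequences=True, input_shape=(None, n_feats),
               dropout=0.2, recurrent_dropout=0.2))
model.add(LSTM(n_feats, return_sequences=True))
model.compile(loss='mse', optimizer='adam')

# Shape check on dummy data: (1 sequence, 50 steps, n_feats features)
dummy = np.zeros((1, 50, n_feats), dtype='float32')
pred = model.predict(dummy, verbose=0)
print(pred.shape)
```

Dropout is only active during training, so predictions stay deterministic. Whether this particular size helps can only be judged by comparing the training and validation losses.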

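Also note that this data very likely contains a lot of random factors, which means the model may not be able to learn anything useful from them. As you make the model smaller to avoid overfitting, you may also find that it predicts the training data worse.

Finding the perfect model is not an easy task; it is an open question and you have to experiment. Maybe LSTM models simply aren't the solution. Maybe your data is simply not predictable, and so on. There is no definitive answer to this.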

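How to know whether the model is good

With validation data in training, you can compare the loss on the training data with the loss on the test data.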

Train on 1 samples, validate on 1 samples
Epoch 1/1000
9s - loss: 0.4040 - val_loss: 0.3348
Epoch 2/1000
4s - loss: 0.3332 - val_loss: 0.2651
Epoch 3/1000
4s - loss: 0.2656 - val_loss: 0.2035
Epoch 4/1000
4s - loss: 0.2061 - val_loss: 0.1696
Epoch 5/1000
4s - loss: 0.1761 - val_loss: 0.1601
Epoch 6/1000
4s - loss: 0.1697 - val_loss: 0.1476
Epoch 7/1000
4s - loss: 0.1536 - val_loss: 0.1287
Epoch 8/1000
.....

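Both should go down together. When the test loss stops going down while the training loss keeps improving, your model is starting to overfit.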
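
The rule above can be sketched as a small function that scans a list of validation losses and reports the last epoch that still improved. The helper name `best_epoch_before_stall`, the `patience` value, and the loss values are made up for illustration; they are not the losses from the log.

```python
def best_epoch_before_stall(val_losses, patience=3):
    """Return the 1-based epoch with the lowest validation loss once
    `patience` consecutive epochs bring no improvement, else None."""
    best = float('inf')
    best_epoch = None
    waited = 0
    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best:
            best, best_epoch, waited = loss, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                return best_epoch
    return None  # validation loss was still improving at the end

# Hypothetical validation losses: improving until epoch 5, then rising.
val_losses = [0.33, 0.27, 0.20, 0.17, 0.16, 0.18, 0.19, 0.21]
stop_at = best_epoch_before_stall(val_losses)
print(stop_at)
```

Keras offers the same idea as the `EarlyStopping` callback (with `monitor='val_loss'` and a `patience` argument), which can be passed to `model.fit` through `callbacks`.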

Trying another model

The best I could do (though I didn't really try very hard) was with this model:

model = Sequential()
model.add(LSTM(64, return_sequences=True, input_shape=(None, x_train.shape[2])))
model.add(LSTM(128, return_sequences=True))
model.add(LSTM(128, return_sequences=True))
model.add(LSTM(64, return_sequences=True))
model.add(LSTM(n_feats, return_sequences=True)) 

model.compile(loss='mse', optimizer='adam')

When the losses were about:

loss: 0.0389 - val_loss: 0.0437

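After this point the validation loss started going up (so training beyond this point is completely useless).

Result:

This shows that all the model can learn is very general behaviour, such as regions with higher values.

But the high-frequency component is either too random, or the model wasn't good enough for it...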

Regarding python - Very poor predictions: LSTM time-series, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/50054419/
