gpt4 book ai didi

python - 如何正确塑造 RNN 的时间序列数据?

转载 作者:太空宇宙 更新时间:2023-11-03 21:10:42 26 4
gpt4 key购买 nike

我已经开始使用 Python 中的 TensorFlow 开发一个简单的项目,通过循环网络预测股票市场价格。到目前为止,这是我的代码:

n_steps = 30
n_inputs = 1
n_neurons = 100
n_outputs = 1

X = tf.placeholder(tf.float32, [1, n_steps, n_inputs])
y = tf.placeholder(tf.float32, [None, n_steps, n_inputs])
cell = tf.contrib.rnn.OutputProjectionWrapper(
tf.contrib.rnn.BasicRNNCell(num_units=n_neurons, activation=tf.nn.relu),
output_size = n_outputs
)
outputs, states = tf.nn.dynamic_rnn(cell, X, dtype=tf.float32)

learning_rate = 0.001

loss = tf.reduce_mean(tf.square(outputs - y))
optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate)
training_op = optimizer.minimize(loss)

init = tf.global_variables_initializer()
n_iterations = numStocks
batch_size = 1

def priceArrayToRNNFormat(priceArray):
list = []
print(priceArray)
for price in priceArray:
list.append(price)
return np.array(list)

with tf.Session() as sess:
init.run()
for iteration in range(n_iterations):
dataOrig = [allStocksDict[list(allStocksDict.keys())[iteration]]]
data = priceArrayToRNNFormat(dataOrig)
print(data)
X_batch = data
y_batch = data
sess.run(training_op, feed_dict={X: X_batch, y: y_batch})
if iteration % 100 == 0:
mse = loss.eval(feed_dict={X: X_batch, y: y_batch})
print(iteration, "\tMSE", mse)

作为引用,allStocksDict 只是一个字典,其中每个键都是股票代码,值是随时间变化的价格的 30 个元素数组。运行代码时,我得到以下输出:

[['14.9400', '15.0000', '14.8800', '14.6900', '14.6300', '15.0000', '14.9400', '15.1300', '15.5600', '15.3100', '15.3800', '14.6900', '15.0000', '15.1300', '14.6300', '14.0600', '14.1300', '14.9400', '14.4400', '13.6300', '13.0000', '12.3800', '12.5000', '12.6300', '13.0000', '12.6900', '13.1300', '13.1900', '13.0600', '12.9400']]
[['14.9400' '15.0000' '14.8800' '14.6900' '14.6300' '15.0000' '14.9400'
'15.1300' '15.5600' '15.3100' '15.3800' '14.6900' '15.0000' '15.1300'
'14.6300' '14.0600' '14.1300' '14.9400' '14.4400' '13.6300' '13.0000'
'12.3800' '12.5000' '12.6300' '13.0000' '12.6900' '13.1300' '13.1900'
'13.0600' '12.9400']]
Traceback (most recent call last):
File "/home/john/Python/StockProject/monthlyRnn1.py", line 127, in <module>
sess.run(training_op, feed_dict={X: X_batch, y: y_batch})
File "/home/john/.local/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 929, in run
run_metadata_ptr)
File "/home/john/.local/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1128, in _run
str(subfeed_t.get_shape())))
ValueError: Cannot feed value of shape (1, 30) for Tensor 'Placeholder:0', which has shape '(1, 30, 1)'

我尝试过单独提供列表,而不将其转换为数组,并且在将其转换为数组之前不将数组转换为向量,尽管此错误仍然存​​在。我非常感谢这方面的帮助。

最佳答案

一种可能的解决方案是

def priceArrayToRNNFormat(priceArray):
#list = []
#print(priceArray)
#for price in priceArray:
# list.append(price)
#return np.array(list)
return np.reshape(np.asarray(priceArray, dtype=np.float32), (1, n_steps, n_inputs))

嵌套列表也是可以接受的,另一种选择是转置priceArray并将其作为小批量再次包装到列表中。
但前一个选项 np.reshape() 既简单又快速。

关于python - 如何正确塑造 RNN 的时间序列数据?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/55085971/

26 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com