gpt4 book ai didi

python - tensorflow 中的多个 gpus 和 AdamOptimizer

转载 作者:行者123 更新时间:2023-11-28 19:08:40 24 4
gpt4 key购买 nike

我正在使用多个 GPU 练习 tensorflow。每个 gpu 计算的平均梯度。但是,当我的优化器是 AdamOptimizer 时它不起作用。当我使用 GradientDescent 时,它总是有效。

代码如下:

G = tf.Graph()
with G.as_default(), tf.device('/cpu:0'):
full_data_dims = [batch_size*num_gpus] + data_dims
data = tf.placeholder(dtype=tf.float32, shape=full_data_dims, name='data')
labels = tf.placeholder(dtype=tf.int32, shape=[batch_size*num_gpus], name='labels')

split_data = tf.split(data, num_gpus, axis=0)
split_labels = tf.split(labels, num_gpus, axis=0)

optimizer = tf.train.AdamOptimizer(learning_rate)

replica_grads = []
for i in range(num_gpus):
with tf.name_scope('tower_{}'.format(i)), tf.device('/gpu:{}'.format(i)):

model = build_model(split_data[i], split_labels[i])
loss = model['loss']
grads = optimizer.compute_gradients(loss)
replica_grads.append(grads)
tf.get_variable_scope().reuse_variables()


tf.get_variable_scope().reuse_variables()
average_grad = average_gradients_layer(replica_grads)
grad_step = optimizer.apply_gradients(average_grad)
train_step = tf.group(grad_step)
init = tf.global_variables_initializer()

# Part3
config_proto = tf.ConfigProto(allow_soft_placement=True)
sess = tf.Session(graph=G, config=config_proto)
sess.run(init)
tf.train.start_queue_runners(sess=sess)
with sess.as_default():
for step in range(num_steps):
data_batch, label_batch = batch_maker(X_ok, y_ok, X_ng, y_ng, batch_size*num_gpus)
results = sess.run([train_step, loss], feed_dict={data : data_batch, labels : label_batch})
if step % flag == 0:
print('\n')
print('step : %s loss : %s' % (step, results[1]))
sys.stdout.write('\r'+str(step)+'/'+str(num_steps))

这是我的错误信息:

 32     tf.get_variable_scope().reuse_variables()
33 average_grad = average_gradients_layer(replica_grads)
---> 34 grad_step = optimizer.apply_gradients(average_grad)
35 train_step = tf.group(grad_step)
36 init = tf.global_variables_initializer()

Variable conv1_1/weight/Adam/ does not exist, or was not created with
tf.get_variable(). Did you mean to set reuse=None in VarScope?

AdamOptimizer 似乎在我的变量名后寻找额外的 '/Adam/'。谁能修好?

最佳答案

我不知道是否有错误,但问题是“任何人都可以修复它”。是的。

用“with tf.variable_scope”上下文管理器封装 gpu 循环(而不是 apply_gradients 代码),以便在退出 gpu 循环后停止重用作用域。

关于python - tensorflow 中的多个 gpus 和 AdamOptimizer,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/43281626/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com