gpt4 book ai didi

python - 如何使用Tensorflow的PTB模型示例?

转载 作者:太空狗 更新时间:2023-10-30 01:05:59 26 4
gpt4 key购买 nike

我正在尝试 Tensorflow's rnn example .开始时遇到一些问题,我可以运行该示例以训练 ptb,现在我已经训练了一个模型。

我现在如何准确地使用该模型来创建句子,而不必每次都重新训练?

我正在使用像 python ptb_word_lm.py --data_path=/home/data/--model medium --save_path=/home/medium 这样的命令运行它

是否有关于如何使用经过训练的模型造句的示例?

最佳答案

1.在PTBModel:__init__()函数的最后一行添加如下代码:

self._output_probs = tf.nn.softmax(logits)

2.在PTBModel中添加如下函数:

@property
def output_probs(self):
return self._output_probs

3.尝试运行以下代码:

raw_data = reader.ptb_raw_data(FLAGS.data_path)
train_data, valid_data, test_data, vocabulary, word_to_id, id_to_word = raw_data

eval_config = get_config()
eval_config.batch_size = 1
eval_config.num_steps = 1

sess = tf.Session()

initializer = tf.random_uniform_initializer(-eval_config.init_scale,
eval_config.init_scale)
with tf.variable_scope("model", reuse=None, initializer=initializer):
mtest = PTBModel(is_training=False, config=eval_config)

sess.run(tf.initialize_all_variables())

saver = tf.train.Saver()

ckpt = tf.train.get_checkpoint_state('/home/medium') # __YOUR__MODEL__SAVE__PATH__
if ckpt and gfile.Exists(ckpt.model_checkpoint_path):
msg = 'Reading model parameters from %s' % ckpt.model_checkpoint_path
print(msg)
saver.restore(sess, ckpt.model_checkpoint_path)

def pick_from_weight(weight, pows=1.0):
weight = weight**pows
t = np.cumsum(weight)
s = np.sum(weight)
return int(np.searchsorted(t, np.random.rand(1) * s))

while True:
number_of_sentences = 10 # generate 10 sentences one time
sentence_cnt = 0
text = '\n'
end_of_sentence_char = word_to_id['<eos>']
input_char = np.array([[end_of_sentence_char]])
state = sess.run(mtest.initial_state)
while sentence_cnt < number_of_sentences:
feed_dict = {mtest.input_data: input_char,
mtest.initial_state: state}
probs, state = sess.run([mtest.output_probs, mtest.final_state],
feed_dict=feed_dict)
sampled_char = pick_from_weight(probs[0])
if sampled_char == end_of_sentence_char:
text += '.\n'
sentence_cnt += 1
else:
text += ' ' + id_to_word[sampled_char]
input_char = np.array([[sampled_char]])
print(text)
raw_input('press any key to continue ...')

关于python - 如何使用Tensorflow的PTB模型示例?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/40786771/

26 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com