python - "Decoder"模型的 "Sequence-to-Sequence"应该输入什么？-6ren

python - "Decoder"模型的 "Sequence-to-Sequence"应该输入什么？

转载作者：行者123 更新时间：2023-11-30 09:48:10

25

4

我正在开发一个用于文本生成的序列到序列模型 ( paper )。我没有在解码器端使用“教师强制”，即 t0 时解码器的输出被馈送到 t1 时解码器的输入。

现在，实际上，解码器(LSTM/GRU)的输出通过密集层传递，然后密集层生成单词的索引，该索引被视为解码器的输出。

但是，为了将输出馈送到下一层，我们应该将 h_t (即解码器的输出/解码器的隐藏状态)馈送到下一步，还是下一个单词的单词嵌入是正确的选择吗？

最佳答案

简短的答案是:可能两者都有，但隐藏状态 h_t 至关重要。

需要馈送隐藏状态 h_t 才能将整个句子(不仅仅是前一个单词)的信息从一个解码器层传递到下一个解码器层。

提供所选单词的嵌入并不是必需的，但这可能是一个好主意。这允许解码器以之前被迫做出的选择为条件。

关于python - "Decoder"模型的 "Sequence-to-Sequence"应该输入什么？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/49611510/

25

4

0

文章推荐： python - 将初始状态输入 LSTMCell

文章推荐： machine-learning - 训练集准确度随着集合大小的增加而降低

文章推荐： java - Maven 依赖于项目——没有 jar，只有类

Perl:utf8::decode 与 Encode::decode
我得到了一些有趣的结果，试图辨别使用 Encode::decode("utf8", $var) 之间的区别。和 utf8::decode($var) .我已经发现，在一个变量上多次调用前者最终会导致错
android - 下载图片并在ImageView中查看时为"decoder->decode returned false"
我尝试使用 FlushedInputStream :Android decoder->decode returned false for Bitmap download 但没有任何变化，因为我使用:B
python-2.7 - Pyasn1 decoder.decode 是如何工作的？
我有一小部分代码: from pyasn1.type import univ from pyasn1.codec.ber import decoder decoder.decode(binary_fi
ios - decoder.decode 将有效的 iso8601 日期设置为 nil
这个问题在这里已经有了答案: Instantiated optional variable shows as nil in Xcode debugger (2 个答案) 关闭 2 年前。在 Swi
swift - 无法使用类型为 'decode' 的参数列表调用 '(Decodable, from: Data)'
我在 Playground 中有以下示例代码。如果结果符合 Decodable 协议(protocol)，我想解码网络请求的结果。知道为什么这段代码不起作用吗？ protocol APIReques
php - Imagecreatefromwebp() : WebP decode: fail to decode input data
我正在尝试使用 imagecreatefromwebp() 将 webp 文件转换为 JPEG，但不幸的是，它向我发出警告:警告:imagecreatefromwebp():WebP 解码:无法解码输
swift - 为什么在使用 JSONDecoder.decode 方法时没有调用 Decodable 的 init 方法？
我试图覆盖 JSONDecoder 解码数据的方式。我尝试了以下方法: struct Response : Decodable { init(from decoder: Decoder) t
python - '"sss 的用途.decode ("base64".decode ("zlib")'
ACTIVATE_THIS = """ eJx1UsGOnDAMvecrIlYriDRlKvU20h5aaY+teuilGo1QALO4CwlKAjP8fe1QGGalRoLEefbzs+Mk Sb7
ios - fatal error : Dictionary 不符合 Decodable 因为 Any 不符合 Decodable
我正在尝试使用 swift 4 来解析本地 json 文件: { "success": true, "lastId": null, "hasMore": false,
file - 错误: Uncaught Ext.JSON.decode(): You're trying to decode an invalid JSON String
我的代码有问题。我正在尝试使用ExtJS和Codeigniter制作上传文件格式。这是我的下面的代码， Ext.require([ 'Ext.form.field.File',
java - sun.net.www.ParseUtil.decode() 与 java.net.URLDecoder.decode()
我有一些遗留代码正在调用 sun.net.www.ParseUtil.decode()。我想避免调用供应商特定的函数，所以我想用其他东西替换调用。我可以使用 java.net.URLDecoder.
extjs - 访问 Nexus 配置 - Ext.JSON.decode() : You're trying to decode an invalid JSON String:
使用 Sonatype Nexus，我仅在访问 /nexus/#admin/support/status 时收到此错误消息. Ext.JSON.decode(): You're trying to d
json - 榆树 'Json.Decode.succeed' : how is it used in a decode pipeline if it is supposed to always return the same value?
我正在学习 Elm，让我感到困惑的一件事是“Json.Decode.succeed”。根据docs succeed : a -> Decoder a Ignore the JSON and produ
Java - URLDecoder.decode(String s) 与 URLDecoder.decode(String s, String enc)
有什么区别 URLDecoder.decode(String s) 和 URLDecoder.decode(String s, String enc) 我有一个 cookie 值，例如 val=%22
javascript - 气体 : parse XML - decode HTML entity name fails - decode entity decimal code succeeds
使用 Google Apps 脚本，我想解码 HTML，例如: Some text & text ¢ 存储为: Some text & text ¢ 所以，类似的问题:How t
ffmpeg - 忽略错误 "Invalid UTF-8 in decoded subtitles text; maybe missing -sub_charenc option Error while decoding stream"是否安全？
我正在对带有字幕的视频进行编码，但出现错误“解码的字幕文本中的 UTF-8 无效；可能缺少 -sub_charenc 选项。解码流时出错”，但视频还是编码了。忽略此错误的后果是什么？谷歌搜索显示一个人
python - Unicode解码错误: 'utf-8' codec can't decode byte 0x9d in position 0: invalid start byte when I execute the ` b.decode()`
我有如下代码: cn_bytes = [157, 188, 156] cn_str = "" clen = len(cn_bytes) count = int(clen / 3) for x in r
decode - 分析部分或损坏的二维码
关闭。这个问题不满足Stack Overflow guidelines .它目前不接受答案。想改善这个问题吗？更新问题，使其成为 on-topic对于堆栈溢出。 4年前关闭。 Improve thi
VBE decoder
This script give you a decoded listing from an encoded file. Supports *,je, ,vbe, .asp, .hta, .htm,
decode - telnet 客户端响应如何解码
telnet客户端响应如何解码我认为这是一个特定的响应，因为所有思科服务器都有相同的响应.这段文字的名称是什么，我如何解密它 '\xff\xfb\x01\xff\xfb\x03\xff\xfd\x1

首页

博学

6Ren·AI

商城

python - "Decoder"模型的 "Sequence-to-Sequence"应该输入什么？