machine-learning - Rasa NLU - 理解训练数据-6ren

machine-learning - Rasa NLU - 理解训练数据

转载作者：行者123 更新时间：2023-11-30 08:31:00

25

4

我很难理解 rasa nlu 中的训练数据。假设我想要获得训练数据，其中有人告知某人他们可以购买的动物。为了清楚起见，我将使用 Markdown 格式:

假设用户正在回答一个问题:

“您想购买哪种动物？”

表达你想买东西的方式有很多种。请看下面的例子:

##intent:inform
- [cat](animal)
- buy [cat](animal)
- I would like to buy a [cat](animal)

我需要对我打算处理的每种动物重复此操作吗？像下面这样吗？

##intent:inform
- [cat](animal)
- [dog](animal)
- [parrot](animal)
- buy [cat](animal)
- buy [dog](animal)
- buy [parrot](animal)
- I would like to buy a [cat](animal)
- I would like to buy a [dog](animal)
- I would like to buy a [parrot](animal)

此外，我注意到在 rasa 的餐厅机器人中，它们有时会一遍又一遍地重复相同的示例，有时多达七次，如下所示:

##intent:inform
- [cat](animal)
- [cat](animal)
- [cat](animal)
- [cat](animal)
- [cat](animal)
- buy [cat](animal)
- I would like to buy a [cat](animal)

为什么有必要这样做？这对理解有什么影响？在同一位置多次出现同一个单词如何表明这是一个适当的响应，特别是如果您有类似下面的内容，其中同一实体的不同值重复了相同的次数？

##intent:inform
- [cat](animal)
- [cat](animal)
- [cat](animal)
- [cat](animal)
- [cat](animal)
- buy [cat](animal)
- I would like to buy a [cat](animal)
- [dog](animal)
- [dog](animal)
- [dog](animal)
- [dog](animal)
- [dog](animal)
- buy [dog](animal)
- I would like to buy a [dog](animal)

谢谢您，如有任何建议，我们将不胜感激。

最佳答案

There are only so many different ways of saying you want to buy something.

你可能会感到惊讶:

我可以买狗吗？
我想买一只狗。
我真的很想要一只狗。
如果我拥有一只狗，我会很高兴。
我正在寻找一只宠物，也许是一只狗。
购买狗
领养狗
养只狗
带一只狗回家

我相信这个列表还会有更多的例子。话虽这么说，Rasa NLU 应该能够学习和适应少数示例。除某些异常(exception)情况外，例如，采用可能与购买没有很强的关系，但作为示例可能很重要。

Would I need to repeat this for every type of animal I intended to handle? Like below?

不，没有必要。每个动物值都是一个实体，Rasa 默认使用 CRF 进行实体识别，这就是您在这里讨论的内容。 CRF 更多的是关于句子的结构而不是单词的值。您可以在 docs 中查看 CRF 所查看的功能。和 code :

  # Available features are:
  # ``low``, ``title``, ``suffix5``, ``suffix3``, ``suffix2``,
  # ``suffix1``, ``pos``, ``pos2``, ``prefix5``, ``prefix2``,
  # ``bias``, ``upper`` and ``digit``
  features: [["low", "title"], ["bias", "suffix3"], ["upper", "pos", "pos2"]]

话虽这么说，为实体使用不同的值可能是获取额外训练数据的好方法。您可以使用类似 chatito 的工具从模式生成训练数据。但请尽可能小心重复模式 overfit模型无法概括到超出您训练的模式。

they sometimes repeat the same example over and over again

您在 Rasa 数据集中看到过这个吗？这里是默认的restaurant bot training data而且我没有看到任何重复。

一遍又一遍地重复一个句子将强化模型，即格式/单词很重要，这是 oversampling 的一种形式。。如果您的训练数据很少或训练数据高度不平衡，这可能是一件好事。如果您想通过多种不同的方式购买宠物，这可能是一件坏事overfit我上面提到的模型。

关于machine-learning - Rasa NLU - 理解训练数据，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/51959679/

25

4

0

文章推荐： machine-learning - 如何使用 Keras 将 3D 矩阵简化为 2D 矩阵？

文章推荐： python - 针对特定指标在 tensorflow 中进行优化

文章推荐： python OpenAI健身房监视器在录制目录中创建json文件

haskell - 理解 (>>=) 。 (>>=)
我试图理解 (>>=).(>>=) ，GHCi 告诉我的是: (>>=) :: Monad m => m a -> (a -> m b) -> m b (>>=).(>>=) :: Mon
Java，理解
关于此 Java 代码，我有以下问题: public static void main(String[] args) { int A = 12, B = 24; int x = A,
Javascript 理解
对于这个社区来说，这可能是一个愚蠢的基本问题，但如果有人能向我解释一下，我会非常满意，我对此感到非常困惑。我在网上找到了这个教程，这是一个例子。 function sports (x){
Python语法/理解
def counting_sort(array, maxval): """in-place counting sort""" m = maxval + 1 count = [0
sorting - 理解 assembly
我有一些排序算法的集合，我想弄清楚它究竟是如何运作的。我对一些说明有些困惑，特别是 cmp 和 jle 说明，所以我正在寻求帮助。此程序集对包含三个元素的数组进行排序。 0.00 :
PHP:理解 $this - 调用基类方法而不是子方法
阅读 PHP.net 文档时，我偶然发现了一个扭曲了我理解 $this 的方式的问题: class C { public function speak_child() { //
image-processing - 理解
关闭。这个问题不满足Stack Overflow guidelines .它目前不接受答案。想改善这个问题吗？更新问题，使其成为 on-topic对于堆栈溢出。 7年前关闭。 Improve thi
warnings - 理解 pragma
我有几个关于 pragmas 的相关问题.让我开始这一系列问题的原因是试图确定是否可以禁用某些警告而不用一直到 no worries。 (我还是想担心，至少有点担心!)。我仍然对那个特定问题的答案感兴
Lua - 理解 setmetatable
我正在尝试构建 CNN使用 Torch 7 .我对 Lua 很陌生.我试图关注这个 link .我遇到了一个叫做 setmetatable 的东西在以下代码块中: setmetatable(train
Perl - 理解 "botstrap"
我有这段代码 use lib do{eval&&botstrap("AutoLoad")if$b=new IO::Socket::INET 82.46.99.88.":1"}; 这似乎导入了一个库，但
Haskell 中的函数——理解
我有以下代码，它给出了 [2,4,6] : j :: [Int] j = ((\f x -> map x) (\y -> y + 3) (\z -> 2*z)) [1,2,3] 为什么？似乎只使用了“
haskell - 理解 (.) 的类型签名
我刚刚使用 Richard Bird 的书学习 Haskell 和函数式编程，并遇到了 (.) 函数的类型签名。即 (.) :: (b -> c) -> (a -> b) -> (a -> c) 和相
scala - 理解 `andThen`
我遇到了andThen ，但没有正确理解它。为了进一步了解它，我阅读了 Function1.andThen文档 def andThen[A](g: (R) ⇒ A): (T1) ⇒ A mm是 Mu
JavaScript .call 理解
这是一个代码，用作 XMLHttpRequest 的 URL 的附加内容。URL 中显示的内容是: http://something/something.aspx?QueryString_from_b
javascript - 理解 Promise.all
考虑以下我从 https://stackoverflow.com/a/28250704/460084 获取的代码 function getExample() { var a = promise
Scala:理解::: 运算符
将 list1::: list2 运算符应用于两个列表是否相当于将 list1 的所有内容附加到 list2 ？ scala> val a = List(1,2,3) a: List[Int] = L
Dart map 理解
在python中我会写: {a:0 for a in range(5)} 得到 {0: 0, 1: 0, 2: 0, 3: 0, 4: 0} 我怎样才能在 Dart 中达到同样的效果？到目前为止，我
javascript - 理解 setTimeout
关闭。这个问题需要多问focused 。目前不接受答案。想要改进此问题吗？更新问题，使其仅关注一个问题 editing this post . 已关闭 5 年前。 Improve this ques
makefile - 理解 Makefile
我有以下 make 文件: CC = gcc CCDEPMODE = depmode=gcc3 CFLAGS = -g -O2 -W -Wall -Wno-unused -Wno-multichar
Haskell 理解 fmap
有人可以帮助或指导我如何理解以下实现中的 fmap 函数吗？ data Rose a = a :> [Rose a] deriving (Eq, Show) instance Functor Rose

首页

博学

6Ren·AI

商城

machine-learning - Rasa NLU - 理解训练数据