c# - 具有动态时间扭曲内核的 SVM 返回错误率大于 0-6ren

c# - 具有动态时间扭曲内核的 SVM 返回错误率大于 0

转载作者：太空宇宙更新时间：2023-11-03 23:38:32

我在研究中使用 Accord.net。我有一个可变大小的向量序列作为输入，所以我使用 DynamicTimeWarping 作为 MulticlassSupportVectorMachine 的内核。

IKernel kernel = new DynamicTimeWarping(dimension);
        var machine = new MulticlassSupportVectorMachine(0, kernel, 2);
        // Create the Multi-class learning algorithm for the machine
        var teacher = new MulticlassSupportVectorLearning(machine, inputs.ToArray(), outputs.ToArray());
        // Configure the learning algorithm to use SMO to train the
        //  underlying SVMs in each of the binary class subproblems.
        teacher.Algorithm = (svm, classInputs, classOutputs, i, j) =>
            new SequentialMinimalOptimization(svm, classInputs, classOutputs)
            {
                Complexity = 1.5
            };
        // Run the learning algorithm
        double error = teacher.Run();

输入和输出如下所示:

?inputs.ToArray()
{double[22][]}
    [0]: {double[10656]}
    [1]: {double[9360]}
    [2]: {double[9216]}
    [3]: {double[9864]}
    [4]: {double[10296]}
    [5]: {double[10152]}
    [6]: {double[9936]}
    [7]: {double[9216]}
    [8]: {double[10944]}
    [9]: {double[9504]}
    [10]: {double[11880]}
    [11]: {double[22752]}
    [12]: {double[23688]}
    [13]: {double[29880]}
    [14]: {double[32328]}
    [15]: {double[37224]}
    [16]: {double[30024]}
    [17]: {double[27288]}
    [18]: {double[26064]}
    [19]: {double[22032]}
    [20]: {double[21672]}
    [21]: {double[22680]}
?inputs[0]
{double[10656]}
    [0]: 7.6413027545068823
    [1]: -61.607203372756942
    [2]: 7.7375128997886513
    [3]: -25.704529598536471
    [4]: -0.4124927191531238
    [5]: 9.6820255661415011
    [6]: 3.0674374003781861
    [7]: 4.6364653722537668
    [8]: 3.3559314278499177
    [9]: 0.93969394152714925
    [10]: -6.3800159552064146
    [11]: 1.4239779356781062
    [12]: -2.25349154655782
    [13]: -1.5457194406236221
    [14]: -0.7612541874802764
    [15]: -3.3364791133985348
    [16]: 0.67816801816804861
    [17]: -3.4117217877592343
    [18]: 1.5785492543017225
    [19]: 0.31091690789261689
    [20]: -2.5526646739208712
    [21]: -1.0550268575680164
    [22]: -0.9598271201088191
    [23]: -1.1797916101998056
    [24]: 0.56157735657438412
    [25]: -0.16309890421998655
    [26]: 0.29765136770064271
    [27]: -0.35684735108472643
    [28]: -0.52382117896006564
    [29]: -0.052087258844925849
    [30]: -0.45363669419489172
    [31]: -0.16216259086709361
    [32]: -0.25958480481802632
    [33]: 0.081248839173330589
    [34]: -0.019783293216956807
    [35]: 0.14139773316964666
    [36]: 0.088466551256948273
    [37]: -0.019528343614348152
    [38]: 0.087073343332064762
    [39]: 0.048432068369313144
    [40]: -0.0069171431858626713
    [41]: -0.0095272766950126042
    [42]: 0.016639887499893875
    [43]: -0.009108847017642599
    [44]: 0.0017424263597747487
    [45]: 0.0042160613810267641
    [46]: -0.002793626734247919
    [47]: 0.00092130299196750763
    [48]: 0.0024488939699103319
    [49]: 0.0021684669072286468
    [50]: 0.000000000000010673294119695543
    [51]: -0.000000000000014072530108313123
    [52]: 0.000000000000000069063495074940116
    [53]: 8.73342598612937E-17
    [54]: 0.000000000000000030048643853749834
    [55]: -6.95380121971215E-17
    [56]: 0.00000000000000010093927767292201
    [57]: 0.000000000000000046158366228268829
    [58]: 0.000000000000000039070100378142324
    [59]: 0.00000000000000010492059540665321
    [60]: -0.000000000000000014254591247067773
    [61]: -0.0000000000000000015902697756329909
    [62]: 0.0000000000000000017024249964704589
    [63]: 0.0000000000000000010277956708903136
    [64]: 3.5875442986020568E-28
    [65]: -2.215158998843094E-31
    [66]: 1.041379591973569E-31
    [67]: -4.3897715186113276E-31
    [68]: 4.248432864156974E-34
    [69]: 4.3718530099471368E-47
    [70]: 1.4551856970655856E-50
    [71]: 0.0
    [72]: 11.031182384920639
    [73]: -63.434486026723626
    [74]: 1.7731679007864651
    [75]: -23.968196466652273
    [76]: 2.2753564408666507
    [77]: 9.5492641110324534
    [78]: 3.4465209481281054
    [79]: 4.7979691924966161
    [80]: 2.0147801482840508
    [81]: 1.1858337013571998
    [82]: -4.607944757859336
    [83]: 0.75637871318664485
    [84]: -3.8397810581420115
    [85]: -2.1276086210477514
    [86]: -0.4060620782117581
    [87]: -2.9313848427777227
    [88]: 0.052605148372525556
    [89]: -1.5948208186863277
    [90]: 0.36061926783486992
    [91]: -0.12623742266247567
    [92]: -1.1889713301479885
    [93]: -0.33299631607409635
    [94]: -0.00912650336180437
    [95]: -0.52707950657313729
    [96]: 0.52115933681848092
    [97]: 0.46870463636533816
    [98]: -0.18482093982467213
    [99]: -0.49350561475314514
    < More... (The first 100 of 10656 items were displayed.) >
?outputs
Count = 22
    [0]: 0
    [1]: 0
    [2]: 0
    [3]: 0
    [4]: 0
    [5]: 0
    [6]: 0
    [7]: 0
    [8]: 0
    [9]: 0
    [10]: 0
    [11]: 1
    [12]: 1
    [13]: 1
    [14]: 1
    [15]: 1
    [16]: 1
    [17]: 1
    [18]: 1
    [19]: 1
    [20]: 1
    [21]: 1

使用该代码，错误返回值为 0.5。

问题:

这是否意味着我的训练数据有问题？
是否有任何其他内核可用于我的可变大小序列？

谢谢。

最佳答案

我将举例说明如何结合使用 DynamicTimeWarping 内核和高斯内核来执行序列分类，这有望提供更好的结果。

序列分类问题的首要任务是正确组织序列以提供给学习算法。每个序列都可以由多元向量组成，因此，输入数据必须相应地组织(这是使用序列机器时最有可能的错误来源，所以请花一两分钟来理解下面的代码在做什么).

// Suppose you have sequences of multivariate observations, and that
// those sequences could be of arbitrary length. On the other hand, 
// each observation have a fixed, delimited number of dimensions.

// In this example, we have sequences of 3-dimensional observations. 
// Each sequence can have an arbitrary length, but each observation
// will always have length 3:

double[][][] sequences =
{
    new double[][] // first sequence
    {
        new double[] { 1, 1, 1 }, // first observation of the first sequence
        new double[] { 1, 2, 1 }, // second observation of the first sequence
        new double[] { 1, 4, 2 }, // third observation of the first sequence
        new double[] { 2, 2, 2 }, // fourth observation of the first sequence
    },

    new double[][] // second sequence (note that this sequence has a different length)
    {
        new double[] { 1, 1, 1 }, // first observation of the second sequence
        new double[] { 1, 5, 6 }, // second observation of the second sequence
        new double[] { 2, 7, 1 }, // third observation of the second sequence
    },

    new double[][] // third sequence 
    {
        new double[] { 8, 2, 1 }, // first observation of the third sequence
    },

    new double[][] // fourth sequence 
    {
        new double[] { 8, 2, 5 }, // first observation of the fourth sequence
        new double[] { 1, 5, 4 }, // second observation of the fourth sequence
    }
};

那些是我们的输入序列。现在，由于我们正在尝试执行分类问题，因此我们必须有一个与每个序列相关联的输出类标签。如果我们有 4 个序列，那么我们将需要 4 个类标签:

// Now, we will also have different class labels associated which each 
// sequence. We will assign -1 to sequences whose observations start 
// with { 1, 1, 1 } and +1 to those that do not:

int[] outputs =
{
    -1,-1,  // First two sequences are of class -1 (those start with {1,1,1})
        1, 1,  // Last two sequences are of class +1  (don't start with {1,1,1})
};

既然问题已经定义好了，就必须对其进行一些转换，以便 DTW-SVM 可以对其进行处理:

// At this point, we will have to "flat" out the input sequences from double[][][]
// to a double[][] so they can be properly understood by the SVMs. The problem is 
// that, normally, SVMs usually expect the data to be comprised of fixed-length 
// input vectors and associated class labels. But in this case, we will be feeding
// them arbitrary-length sequences of input vectors and class labels associated with
// each sequence, instead of each vector.

double[][] inputs = new double[sequences.Length][];
for (int i = 0; i < sequences.Length; i++)
    inputs[i] = Matrix.Concatenate(sequences[i]);


// Now we have to setup the Dynamic Time Warping kernel. We will have to
// inform the length of the fixed-length observations contained in each
// arbitrary-length sequence:
// 
var kernel = new Gaussian<DynamicTimeWarping>(new DynamicTimeWarping(length: 3));

// Now we can create the machine. When using variable-length
// kernels, we will need to pass zero as the input length:
var svm = new KernelSupportVectorMachine(kernel, inputs: 0);

// Create the Sequential Minimal Optimization learning algorithm
var smo = new SequentialMinimalOptimization(svm, inputs, outputs);

// And start learning it!
double error = smo.Run(); // error will be 0.0

此时，应该已经创建了机器。参数 C 应该是由机器自动猜测的，但您可以事后对其进行微调，以尝试提高机器的泛化性能。无论如何，

// At this point, we should have obtained an useful machine. Let's
// see if it can understand a few examples it hasn't seem before:

double[][] a = 
{ 
    new double[] { 1, 1, 1 },
    new double[] { 7, 2, 5 },
    new double[] { 2, 5, 1 },
};

double[][] b =
{
    new double[] { 7, 5, 2 },
    new double[] { 4, 2, 5 },
    new double[] { 1, 1, 1 },
};

// Following the aforementioned logic, sequence (a) should be
// classified as -1, and sequence (b) should be classified as +1.

int resultA = System.Math.Sign(svm.Compute(Matrix.Concatenate(a))); // -1
int resultB = System.Math.Sign(svm.Compute(Matrix.Concatenate(b))); // +1

关于c# - 具有动态时间扭曲内核的 SVM 返回错误率大于 0，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/29887010/

文章推荐： c - 在至少出现 3 次的数组中查找最频繁的元素

文章推荐： django - 将socket.io连接到不同的地址

文章推荐： C# 防火墙，删除特定条目

文章推荐： c# - 使用 LINQ C# 从 List 中获取最长的字符串

python - 扭曲-工厂对象线程安全吗？
以下代码在2端口上监听，当有消息时修改全局dict对象。并且有一个计时器也会修改字典。 d = {} class x(Protocol): def dataReceived(self, dat
asynchronous - 扭曲/延期的工作原理
Twisted 怎么样？知道函数应该以异步方式执行吗？异步函数应该返回一个带有call-/errbacks的Deferred(immeadiately)，当收到“异步”数据时将被调用。接收到的数据作
logging - 扭曲。如何为每个请求在日志中写入唯一的前缀
我扭曲了服务器。它与插件一起运行。我想根据请求为每个条目编写唯一的前缀。这意味着当user1发出请求时，它将生成一个唯一的字符串，该字符串将以日志记录为前缀（仅用于此请求）。当user2发出请求时，
matlab - 扭曲/弯曲的点云
我正在使用校准的立体声对进行稀疏重建。这是我一步一步采取的方法: 1- 我使用 MATLAB 中的立体相机校准器应用程序校准了我的立体相机。 2-我拍摄了一对立体图像，并对每个图像进行了不失真处理。
python - 扭曲在客户端获取响应数据
我关注了这个tutorial但我不知道如何从服务器获取响应数据。 class Service(Resource): def render_POST(self, request):
javascript - 显示具有固定宽度和高度的图像而不会变形/扭曲
我的网站上有一个页面，它从数据库中获取大量图像并将它们放在一个网格中。图像的形状和大小各不相同。我想要做的是显示图像，每个图像都具有相同的宽度和高度，但不会扭曲。现在我的CSS是 .image{
Python 扭曲 - 需要遍历所有连接并找到客户端
我正在尝试创建一个简单的代金券程序。客户端连接到服务器并询问凭证上是否还有时间，如果是，服务器会响应多少时间。我控制服务器和客户端，客户端也由我编写代码。现在这就是我的服务器端，客户端是不言自明
python - 扭曲:减慢接收到的数据
假设我通过 TCP 连接快速接收数据。我必须对其进行某种处理。因为我不想阻塞 react 器线程，所以我将处理卸载到后台线程。数据到达的速度超过了我处理它的速度。如果我将数据放入队列中，队列会无限增
python - 循环http客户端python，扭曲
我有一个简单的客户端，它向服务器发送请求并接收响应: from StringIO import StringIO from twisted.internet import reactor fro
python - 扭曲、延迟的请求以及文件与错误页面
我目前正在使用 python/twisted 构建一个 http 服务器。该服务器必须在另一个 Web 服务器上获取内容，将其存储在本地并将响应发送回客户端。如果遇到 404，它必须尝试提供本地文件
python - 扭曲/执行异步http请求
我有一个扭曲的 react 堆监听传入的数据。我有第二个 react 器在特定时间间隔执行 http 请求，将结果发送到第一个 react 器。两者都运行良好。现在我想把它放在一起在一个 react
c# - ImageMagick 扭曲
我正在尝试使用 ImageMagick 的透视功能。我看过这些例子，但我无法理解值对应的是什么。我有这段代码: var stream = new MemoryStream(); using (Mag
algorithm - 根据形状的已知尺寸消除倾斜/扭曲
我有一个应用程序的想法，该应用程序采用每个角落有四个正方形的打印页面，并允许您在至少有两个正方形可见的情况下测量纸上的对象。我希望能够让用户从不太完美的角度拍照，但仍能准确测量物体。由于我在该领域缺
c# - 计算一个字符串的所有可能组合，扭曲
我试图让用户在文本框中输入文本，并让程序生成所有可能的组合，但最少 3 个字符和最多 6 个字符除外。我不需要像 ' 这样的无用词as'、'a'、'i'、'to' 等弄乱了我的阵列。我还将根据字典检查
algorithm - 查找数组中元素的最大总和(扭曲)
给定一个包含 +ve 和 -ve 整数的数组，找出不允许跳过 2 个连续元素的最大总和(即，您必须至少选择其中一个才能向前移动)。例如:- 10、20、30、-10、-50、40、-50、-1、-3
python - 扭曲:失败与错误
什么时候应该使用 twisted.python.failure.Failure，什么时候应该使用 twisted.internet.error.ConnectionDone？或者我应该做 twiste
Python 扭曲 : how to schedule?
在 Twisted 中有 1 天的经验，我尝试安排消息发送以回复 tcp 客户端: import os, sys, time from twisted.internet import protocol
Python 扭曲 : where to start
就目前而言，这个问题不适合我们的问答形式。我们希望答案得到事实、引用资料或专业知识的支持，但这个问题可能会引发辩论、争论、投票或扩展讨论。如果您觉得这个问题可以改进并可能重新打开，visit the
python - 扭曲:正确捕获键盘中断和关闭
更新:为了便于阅读，这里是如何在 react 器关闭之前添加回调: reactor.addSystemEventTrigger('before', 'shutdown', callable) 原始问题
Python - 扭曲、代理和修改内容
所以我已经查看了一些涉及使用 python 和 Twisted 框架编写 HTTP 代理的事情。基本上，就像其他一些问题一样，我希望能够修改将发送回浏览器的数据。也就是说，浏览器请求资源，代理将获取

太空宇宙

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

c# - 具有动态时间扭曲内核的 SVM 返回错误率大于 0