
java - Correct implementation of hinge loss minimization with gradient descent

Reposted. Author: 行者123. Updated: 2023-11-30 09:12:38

I copied the hinge loss function from here (along with the LossC and LossFunc it is based on). I then plugged it into my gradient descent algorithm as follows:

do
{
    iteration++;
    error = 0.0;
    cost = 0.0;

    // loop through all instances (complete one epoch)
    for (p = 0; p < number_of_files__train; p++)
    {
        // 1. Calculate the hypothesis h = X * theta
        hypothesis = calculateHypothesis( theta, feature_matrix__train, p, globo_dict_size );

        // 2. Calculate the loss = h - y and maybe the squared cost (loss^2)/2m
        //cost = hypothesis - outputs__train[p];
        cost = HingeLoss.loss(hypothesis, outputs__train[p]);
        System.out.println( "cost " + cost );

        // 3. Calculate the gradient = X' * loss / m
        gradient = calculateGradent( theta, feature_matrix__train, p, globo_dict_size, cost, number_of_files__train);

        // 4. Update the parameters theta = theta - alpha * gradient
        for (int i = 0; i < globo_dict_size; i++)
        {
            theta[i] = theta[i] - LEARNING_RATE * gradient[i];
        }
    }

    // summation of squared error (error value for all instances)
    error += (cost*cost);

    /* Root Mean Squared Error */
    System.out.println("Iteration " + iteration + " : RMSE = " + Math.sqrt( error/number_of_files__train ) );

}
while( error != 0 );
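One structural point worth noting in the loop above, independent of the loss function: `error += (cost*cost)` sits outside the per-instance `for` loop, so the reported RMSE reflects only the last instance of the epoch. A minimal sketch of per-instance accumulation, using illustrative names rather than the exact variables above:

```java
public class EpochRmse {
    // Accumulates squared per-instance losses inside the loop and
    // returns the epoch RMSE. (In the question's code the accumulation
    // happens after the loop, so only the last cost is counted.)
    static double rmse(double[] losses) {
        double error = 0.0;
        for (double cost : losses) {
            error += cost * cost; // inside the per-instance loop
        }
        return Math.sqrt(error / losses.length);
    }

    public static void main(String[] args) {
        // sqrt((9 + 16) / 2) = sqrt(12.5)
        System.out.println(rmse(new double[]{3.0, 4.0}));
    }
}
```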

But this doesn't work at all. Is that because of the loss function, or perhaps because of how I wired the loss function into my code?

I suspect my gradient descent implementation itself might also be wrong.

Here is how I calculate the gradient and the hypothesis. Is this right?

static double calculateHypothesis( double[] theta, double[][] feature_matrix, int file_index, int globo_dict_size )
{
    double hypothesis = 0.0;

    for (int i = 0; i < globo_dict_size; i++)
    {
        hypothesis += ( theta[i] * feature_matrix[file_index][i] );
    }
    // bias
    hypothesis += theta[ globo_dict_size ];

    return hypothesis;
}

static double[] calculateGradent( double theta[], double[][] feature_matrix, int file_index, int globo_dict_size, double cost, int number_of_files__train)
{
    double m = number_of_files__train;

    double[] gradient = new double[ globo_dict_size ]; // one for bias?

    for (int i = 0; i < gradient.length; i++)
    {
        gradient[i] = (1.0/m) * cost * feature_matrix[ file_index ][ i ];
    }

    return gradient;
}

The rest of the code is here, if you are interested in taking a look.

Below is what those loss functions look like. Should I use loss or deriv, and are they correct?

/**
 * Computes the HingeLoss loss
 *
 * @param pred the predicted value
 * @param y the target value
 * @return the HingeLoss loss
 */
public static double loss(double pred, double y)
{
    return Math.max(0, 1 - y * pred);
}

/**
 * Computes the first derivative of the HingeLoss loss
 *
 * @param pred the predicted value
 * @param y the target value
 * @return the first derivative of the HingeLoss loss
 */
public static double deriv(double pred, double y)
{
    if (pred * y > 1)
        return 0;
    else
        return -y;
}

Best Answer

The gradient code you provided does not look like the gradient of the hinge loss. For a valid formulation see, for example: https://stats.stackexchange.com/questions/4608/gradient-of-hinge-loss
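Concretely, for hinge loss max(0, 1 - y·pred) with pred = θᵀx, the per-instance gradient is 0 when y·pred > 1 and -y·x otherwise, i.e. deriv(pred, y) · x rather than loss(pred, y) · x. A minimal sketch under that formulation (names are illustrative, not the question's code):

```java
public class HingeGradient {
    // Per-instance hinge-loss gradient: grad_i = deriv(pred, y) * x_i,
    // where deriv is 0 if y*pred > 1 (margin satisfied), else -y.
    static double[] gradient(double[] theta, double[] x, double y) {
        double pred = 0.0;
        for (int i = 0; i < x.length; i++) pred += theta[i] * x[i];
        double d = (pred * y > 1) ? 0.0 : -y;
        double[] g = new double[x.length];
        for (int i = 0; i < x.length; i++) g[i] = d * x[i];
        return g;
    }

    public static void main(String[] args) {
        // theta = 0 so pred = 0, margin violated, d = -y = -1
        double[] g = gradient(new double[]{0.0, 0.0}, new double[]{1.0, 2.0}, 1.0);
        System.out.println(g[0] + " " + g[1]); // -1.0 -2.0
    }
}
```

In the question's loop this would mean passing HingeLoss.deriv(hypothesis, outputs__train[p]) into the gradient computation instead of HingeLoss.loss(...).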

Regarding "java - Correct implementation of hinge loss minimization with gradient descent", a similar question was found on Stack Overflow: https://stackoverflow.com/questions/28988732/
