matlab - Logistic回归中的麻烦计算成本-6ren

matlab - Logistic回归中的麻烦计算成本

转载作者：行者123 更新时间：2023-11-30 09:09:28

25

4

我正在从Andrewra Ng上Coursera上的机器学习课程。在这项工作中，我正在使用MatLab中的逻辑回归来计算成本函数，但正在收到“使用sfminbx出错（第27行）
目标函数在开始时未定义。 fminunc无法继续。”。

我应该补充一点，下面的costFunction函数中的成本J是NaN，因为log（sigmoid（X * theta））是-Inf向量。我确定这与异常有关。你能帮忙吗？

我的成本函数如下所示：

function [J, grad] = costFunction(theta, X, y)

  m = length(y); % number of training examples
  J = 0;
  grad = zeros(size(theta));

  h = sigmoid(theta * X);
  J    = - (1 / m) * ((log(h)' * y) + (log(1 - h)' * (1 - y)));
  grad = (1 / m) * X' * (h - y);

end

我的调用此函数的代码如下所示：

data = load('ex2data1.txt');
X = data(:, [1, 2]); y = data(:, 3);

[m, n] = size(X);

% Add intercept term to x and X_test
X = [ones(m, 1) X];

% Initialize fitting parameters
initial_theta = zeros(n + 1, 1);

% Compute and display initial cost and gradient
[cost, grad] = costFunction(initial_theta, X, y);

fprintf('Cost at initial theta (zeros): %f\n', cost);
fprintf('Expected cost (approx): 0.693\n');
fprintf('Gradient at initial theta (zeros): \n');
fprintf(' %f \n', grad);
fprintf('Expected gradients (approx):\n -0.1000\n -12.0092\n -11.2628\n');

% Compute and display cost and gradient with non-zero theta
test_theta = [-24; 0.2; 0.2];
[cost, grad] = costFunction(test_theta, X, y);

fprintf('\nCost at test theta: %f\n', cost);
fprintf('Expected cost (approx): 0.218\n');
fprintf('Gradient at test theta: \n');
fprintf(' %f \n', grad);
fprintf('Expected gradients (approx):\n 0.043\n 2.566\n 2.647\n');
fprintf('\nProgram paused. Press enter to continue.\n');
pause;


%% ============= Part 3: Optimizing using fminunc  =============
%  In this exercise, you will use a built-in function (fminunc) to find the
%  optimal parameters theta.

%  Set options for fminunc
options = optimset('GradObj', 'on', 'MaxIter', 400, 'Algorithm', 'trust-
region');

%  Run fminunc to obtain the optimal theta
%  This function will return theta and the cost 

[theta, cost] = ...
    fminunc(@(t)(costFunction(t, X, y)), initial_theta, options);

end

数据集如下所示：

34.62365962451697,78.0246928153624,0
30.28671076822607,43.89499752400101,0
35.84740876993872,72.90219802708364,0
60.18259938620976,86.30855209546826,1
79.0327360507101,75.3443764369103,1
45.08327747668339,56.3163717815305,0
61.10666453684766,96.51142588489624,1
75.02474556738889,46.55401354116538,1
76.09878670226257,87.42056971926803,1
84.43281996120035,43.53339331072109,1
95.86155507093572,38.22527805795094,0
75.01365838958247,30.60326323428011,0
82.30705337399482,76.48196330235604,1
69.36458875970939,97.71869196188608,1
39.53833914367223,76.03681085115882,0
53.9710521485623,89.20735013750205,1
69.07014406283025,52.74046973016765,1
67.94685547711617,46.67857410673128,0
70.66150955499435,92.92713789364831,1
76.97878372747498,47.57596364975532,1
67.37202754570876,42.83843832029179,0
89.67677575072079,65.79936592745237,1
50.534788289883,48.85581152764205,0
34.21206097786789,44.20952859866288,0
77.9240914545704,68.9723599933059,1
62.27101367004632,69.95445795447587,1
80.1901807509566,44.82162893218353,1
93.114388797442,38.80067033713209,0
61.83020602312595,50.25610789244621,0
38.78580379679423,64.99568095539578,0
61.379289447425,72.80788731317097,1
85.40451939411645,57.05198397627122,1
52.10797973193984,63.12762376881715,0
52.04540476831827,69.43286012045222,1
40.23689373545111,71.16774802184875,0
54.63510555424817,52.21388588061123,0
33.91550010906887,98.86943574220611,0
64.17698887494485,80.90806058670817,1
74.78925295941542,41.57341522824434,0
34.1836400264419,75.2377203360134,0
83.90239366249155,56.30804621605327,1
51.54772026906181,46.85629026349976,0
94.44336776917852,65.56892160559052,1
82.36875375713919,40.61825515970618,0
51.04775177128865,45.82270145776001,0
62.22267576120188,52.06099194836679,0
77.19303492601364,70.45820000180959,1
97.77159928000232,86.7278223300282,1
62.07306379667647,96.76882412413983,1
91.56497449807442,88.69629254546599,1
79.94481794066932,74.16311935043758,1
99.2725269292572,60.99903099844988,1
90.54671411399852,43.39060180650027,1
34.52451385320009,60.39634245837173,0
50.2864961189907,49.80453881323059,0
49.58667721632031,59.80895099453265,0
97.64563396007767,68.86157272420604,1
32.57720016809309,95.59854761387875,0
74.24869136721598,69.82457122657193,1
71.79646205863379,78.45356224515052,1
75.3956114656803,85.75993667331619,1
35.28611281526193,47.02051394723416,0
56.25381749711624,39.26147251058019,0
30.05882244669796,49.59297386723685,0
44.66826172480893,66.45008614558913,0
66.56089447242954,41.09209807936973,0
40.45755098375164,97.53518548909936,1
49.07256321908844,51.88321182073966,0
80.27957401466998,92.11606081344084,1
66.74671856944039,60.99139402740988,1
32.72283304060323,43.30717306430063,0
64.0393204150601,78.03168802018232,1
72.34649422579923,96.22759296761404,1
60.45788573918959,73.09499809758037,1
58.84095621726802,75.85844831279042,1
99.82785779692128,72.36925193383885,1
47.26426910848174,88.47586499559782,1
50.45815980285988,75.80985952982456,1
60.45555629271532,42.50840943572217,0
82.22666157785568,42.71987853716458,0
88.9138964166533,69.80378889835472,1
94.83450672430196,45.69430680250754,1
67.31925746917527,66.58935317747915,1
57.23870631569862,59.51428198012956,1
80.36675600171273,90.96014789746954,1
68.46852178591112,85.59430710452014,1
42.0754545384731,78.84478600148043,0
75.47770200533905,90.42453899753964,1
78.63542434898018,96.64742716885644,1
52.34800398794107,60.76950525602592,0
94.09433112516793,77.15910509073893,1
90.44855097096364,87.50879176484702,1
55.48216114069585,35.57070347228866,0
74.49269241843041,84.84513684930135,1
89.84580670720979,45.35828361091658,1
83.48916274498238,48.38028579728175,1
42.2617008099817,87.10385094025457,1
99.31500880510394,68.77540947206617,1
55.34001756003703,64.9319380069486,1
74.77589300092767,89.52981289513276,1

最佳答案

我看到的唯一问题是您应该写h = sigmoid(X * theta)而不是h = sigmoid(theta * X)。更改此代码后，我从您的代码中得到的答案与我从同一任务的代码中得到的答案相同。

关于matlab - Logistic回归中的麻烦计算成本，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/43578301/

25

4

0

文章推荐： machine-learning - SVM中的决策边界计算

文章推荐： TensorFlow - MNIST 数据中的训练准确性没有提高

文章推荐： machine-learning - 没有为任何变量提供梯度

azure - 二分类 Logistic VS 二元 Logistic 回归
在 Azure 机器学习工作室的测试项目中，根据我的理解，我有一些问题。在我的项目(在 R 中)中，我使用了二元 Logistic 回归，但在 AML 中我发现了两个 Logistic 回归:二类和多
python - 如何在 Logistic 回归中查找 Logistic/Sigmoidal 函数参数
我想估计医疗数据逻辑回归中使用的 sigmoidal/logistic 的最佳参数(在最后提到:斜率和截距)。这是我用 python 所做的: import numpy as np from skle
r - Logistic 回归的模型拟合统计量
我在 R 中运行逻辑回归模型。我使用了 Zelig 和 Car 包。但是，我想知道是否有一种简单的方法可以获得模型的模型拟合统计数据。 (伪 R 方、卡方、对数似然等) 最佳答案假设 glm1 is
r - Logistic 回归中的排序
在逻辑回归中，SAS 可以选择使用“降序”选项对 1 而不是 0 进行建模。 R 中有什么方法可以让我们做同样的事情吗？我正在使用的代码如下: glm(y~x1+x2+x3, family=bino
r - 具有定量和定性解释变量之间相互作用的多元 Logistic 回归
作为后续 this question ，我拟合了具有定量和定性解释变量之间相互作用的多元 Logistic 回归。 MWE如下: Type |z|) (Intercept) -0.65518
logistic-regression - Vowpal Wabbit逻辑回归的正确性？
我已经开始使用 Vowpal Wabbit 对于逻辑回归，但是我无法重现它给出的结果。也许它确实有一些未记录的“魔法”，但是有没有人能够复制/验证/检查逻辑回归的计算？例如，使用下面的简单数据，我们
python - Scikit Logistic 回归汇总输出？
有没有办法像 statsmodels 一样为 scikit 逻辑回归模型提供类似的、不错的输出？有了所有的 p 值，标准。一张表中的错误等？最佳答案正如您和其他人所指出的，这是 scikit le
logistic-regression - 在Vowpal wabbit中如何选择保留集
我正在使用 vowpal wabbit 进行逻辑回归。我了解到，vowpal wabbit 从给定的训练数据中选择一个保留集进行验证。这组是随机选择的吗？我有一个非常不平衡的数据集，包含 100 多个
optimization - 多类 Logistic 回归的学习曲线
我使用逻辑回归编写了一个多类分类器，该分类器使用一对多方法进行训练。我想绘制经过训练的分类器的学习曲线。学习曲线应该按类别绘制，还是应该作为整个分类器的单个图？这有什么不同吗？需要澄清的是，学习曲
python - logistic/sigmoid 函数实现数值精度
在scipy.special.expit中，逻辑函数实现如下: if x < 0 a = exp(x) a / (1 + a) else 1 / (1 + exp(-x)) 但
使用python画出逻辑斯蒂映射(logistic map)中的分叉图案例
逻辑斯蒂映射在混沌数学中是一个很经典的例子，它可以说明混沌可以从很简单的非线性方程中产生。逻辑斯蒂映射公式如下： x_n表示当前人口与最大人口数量的比值，mu为参数，相当于人口增长速率。
python - Logistic 回归仅预测 1 个类别
我是数据科学或机器学习的新手。我尝试从 here 实现代码，但预测只返回 1 个类。这是我的代码: classification_data = data.drop([10], axis=1).val
weka - 如何解释 Weka Logistic 回归输出？
请帮助解释 Weka 库中由 weka.classifiers.functions.Logistic 生成的逻辑回归结果。我使用来自 Weka 示例的数字数据: @relation weather
r - 除了 Logistic，RSNNS 包中还有哪些激活函数？
RSNNS 上的 CRAN 文档仅提及 Act_Logistic 作为隐藏层激活函数的示例。 RSNNS 中是否有所有可用激活函数的列表？我专门寻找双曲正切函数的语法。最佳答案是的，大多数(全部
python - 在 Python Logistic 回归中为求解器提供种子值
我正在使用 scikit-learn 的 linear_model.LogisticRegression 来执行多项逻辑回归。我想初始化求解器的种子值，即我想给求解器它的初始猜测作为系数的值。有谁知
r - 如何为 Lasso Logistic 回归生成所有一阶交互项？
glmnet 中有没有办法进行一阶交互？例如，如果我的 X 矩阵是: V1 V2 V3 0 1 0 1 0 1 1 0 0 ... 有没有办法指定它在不手动创建列的情况下按照 `y
java - 将 Logistic 回归损失函数转换为 Softmax
我目前有一个程序，它采用特征向量和分类，并将其应用于已知的权重 vector ，以使用逻辑回归生成损失梯度。这是代码: double[] grad = new double[featureSize];
machine-learning - 使用梯度下降理解 Logistic 回归的代码
我正在关注 Siraj Raval 关于使用梯度下降的逻辑回归的视频: 1) 较长视频的链接: https://www.youtube.com/watch?v=XdM6ER7zTLk&t=2686s
machine-learning - Logistic 函数加法或减法
我目前正在学习机器学习，但没有统计学背景。无论我在哪里看到物流功能，它总是: wx + b 但是this example in Theano documentation使用: wx - b 请问是哪一
function - 神经激活函数 - Logistic/Tanh/等之间的差异
我正在编写一些基本的神经网络方法 - 特别是激活函数 - 并且已经达到了我垃圾数学知识的极限。我理解各自的范围(-1/1)(0/1)等，但不同的描述和实现让我感到困惑。具体来说，sigmoid、lo

首页

博学

6Ren·AI

商城

matlab - Logistic回归中的麻烦计算成本