java

java - 使用自己的 Java 代码在 WEKA 中获取风险预测

转载作者：行者123 更新时间：2023-11-30 08:38:46

27

4

我已经检查了"Making predictions" WEKA 文档，它包含命令行和 GUI 预测的明确说明。

我想知道如何获得预测值，就像下面我使用Agrawal从GUI获得的预测值一样。我自己的 Java 代码中的数据集 ( weka.datagenerators.classifiers.classification.Agrawal ):

inst#,  actual,     predicted,  error,  prediction
1,      1:0,        2:1,        +,      0.941
2,      1:0,        1:0,        ,       1
3,      1:0,        1:0,        ,       1
4,      1:0,        1:0,        ,       1
5,      1:0,        1:0,        ,       1
6,      1:0,        1:0,        ,       1
7,      1:0,        2:1,        +,      0.941
8,      2:1,        2:1,        ,       0.941
9,      2:1,        2:1,        ,       0.941
10,     2:1,        2:1,        ,       0.941
1,      1:0,        1:0,        ,       1
2,      1:0,        1:0,        ,       1
3,      1:0,        1:0,        ,       1

<小时/>

即使it我也无法复制这个结果说:

Java

If you want to perform the classification within your own code, see the classifying instances section of this article, explaining the Weka API in general.

我去了link它说:

Classifying instances

In case you have an unlabeled dataset that you want to classify with your newly trained classifier, you can use the following code snippet. It loads the file /some/where/unlabeled.arff, uses the previously built classifier tree to label the instances, and saves the labeled data as /some/where/labeled.arff.

这不是我想要的情况，因为我只想对当前数据集建模的 k 倍交叉验证预测。

<小时/>

更新

predictions

public FastVector predictions()

Returns the predictions that have been collected.

Returns:

a reference to the FastVector containing the predictions that have been collected. This should be null if no predictions have been collected.

我找到了 predictions() Evaluation 类型对象的方法并使用代码:
Object[] preds = evaluation.predictions().toArray();
for(Object pred : preds) {
    System.out.println(pred);
}
结果是:
...
NOM: 0.0 0.0 1.0 0.9466666666666667 0.05333333333333334
NOM: 0.0 0.0 1.0 0.8947368421052632 0.10526315789473684
NOM: 0.0 0.0 1.0 0.9934883720930232 0.0065116279069767444
NOM: 0.0 0.0 1.0 0.9466666666666667 0.05333333333333334
NOM: 0.0 0.0 1.0 0.9912575655682583 0.008742434431741762
NOM: 0.0 0.0 1.0 0.9934883720930232 0.0065116279069767444
...
这和上面的一样吗？

最佳答案

经过深入的谷歌搜索(并且因为 documentation provides minimal help )我终于找到了答案。

我希望这个明确的答案将来对其他人有所帮助。

对于示例代码，我看到了问题 "How to print out the predicted class after cross-validation in WEKA"我很高兴能够解读这个不完整的答案，其中有些内容很难理解。
这是我的代码，其工作方式与 GUI 的输出类似
```
StringBuffer predictionSB = new StringBuffer();
Range attributesToShow = null;
Boolean outputDistributions = new Boolean(true);

PlainText predictionOutput = new PlainText();
predictionOutput.setBuffer(predictionSB);
predictionOutput.setOutputDistribution(true);

Evaluation evaluation = new Evaluation(data);
evaluation.crossValidateModel(j48Model, data, numberOfFolds,
        randomNumber, predictionOutput, attributesToShow,
        outputDistributions);
```
为了帮助您理解，我们需要实现 StringBuffer将被类型转换在 AbstractOutput 对象以便函数crossValidateModel能认出来。
使用StringBuffer只会导致java.lang.ClassCastException使用 PlainText 时与问题中的类似没有StringBuffer将显示java.lang.IllegalStateException .
谢谢ManChon U (Kevin)和他们的问题"How to identify the cross-evaluation result to its corresponding instance in the input data set?"让我了解这意味着什么:

... you just need a single addition argument that is a concrete subclass of weka.classifiers.evaluation.output.prediction.AbstractOutput. weka.classifiers.evaluation.output.prediction.PlainText is probably the one you want to use. Source

和

... Try creating a PlainText object, which extends AbstractOutput (called output for example) instance and calling output.setBuffer(forPredictionsPrinting) and passing that in instead of the buffer. Source

这些实际上只是为了创建一个 PlainText对象，放置一个StringBuffer并使用它通过方法 setOutput(boolean) 调整输出以及其他。
最后，要获得我们想要的预测，只需使用:
```
System.out.println(predictionOutput.getBuffer());
```
其中predictionOutput是 AbstractOutput 中的一个对象家庭( PlainText 、 CSV 、 XML 等)。
此外，evaluation.predictions()的结果与 WEKA GUI 中提供的不同。幸运的是，马克·霍尔在问题"Print out the predict class after cross-validation"中解释了这一点。

Evaluation.predictions() returns a FastVector containing either NominalPrediction or NumericPrediction objects from the weka.classifiers.evaluation package. Calling Evaluation.crossValidateModel() with the additional AbstractOutput object results in the evaluation object printing the prediction/distribution information from Nominal/NumericPrediction objects to the StringBuffer in the format that you see in the Explorer or from the command line.

引用文献:

关于java - 使用自己的 Java 代码在 WEKA 中获取风险预测，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/21424248/

27

4

0

文章推荐： Java任务控制——可视化事件

文章推荐： java - Jsprit : Can't add multiple related jobs

文章推荐： java - 如何对读取属性文件的类进行单元测试

文章推荐： machine-learning - 什么是 sklearn.cross_validation.cross_val_score

Magento 重新索引数据 - 风险
我有一个 Magento 网站，其中似乎没有出现交叉销售产品。在查看 Stack 和 Google 之后，似乎“重新索引数据”已经为很多人解决了这个问题。我的问题是，执行此任务是否有任何风险？或者
r - 一元运算符重载:风险？
为了避免在某些简单命令中使用括号，我编写了以下运算符以创建新的图形窗口。我的问题是：除了明显无法在我的变量“ newdev”上执行“ not”功能之外，我是否有“破坏” R中任何内容的风险？ # fu
php - 使用邮件功能是否存在注入(inject)风险
你好我正在开发一个联系表格。我正在使用邮件功能将其通过电子邮件发送给网站管理员。是否存在有人可以注入(inject)恶意 javascript 和任何其他注入(inject)攻击的风险？ $to
objective-c - 将消息传递为零的开销/风险
我想知道在 Objective C 中将消息传递给 nil 对象不会执行任何操作，这样依赖是否存在任何风险。在我的代码中，我有很多对 UIKit 和其他对象的弱引用，这些引用可能随时被清除。由于我来
javascript - 提交表单仍然存在 XSS 风险
我被指派修复遗留代码的安全问题，并获得了安全扫描的结果: Poor Error Handling: Server Error Message ( 10932 ) 基本上，当扫描尝试使用一些奇怪的代码进
java - 这段代码是否存在额外的 NullPointerException 风险？
我有一个匿名类，在创建时需要使用自引用。我的业务代码可以简化为如下代码，我知道这段代码: final Runnable runnable=new Runnable() { @Override
c++ - 对象切片或 UB 风险？
有几个基类是我无法控制的:- class BaseNode // Just a POD class. No VTable { void foo(); } class BaseHost { publ
mysql - 下拉菜单是否存在MySQL注入(inject)风险
这个问题已经有答案了: Do I have to guard against SQL injection if I used a dropdown? (11 个回答) 已关闭 7 年前。我希望我的网
mysql - 动态建表时如何避免SQL注入(inject)风险？
这是根据用户提供的输入创建表的简单过程: PROCEDURE `hackProcedure`( IN tab_name VARCHAR(63)) BEGIN IF (tab_name REGEXP '
html - 将未经过滤的用户输入放入文本输入中是否存在 XSS 风险？
我知道在您的网站上显示用户输入时，使用 php 中的 htmlentities() 之类的函数来清理用户输入很有用。但是这样的事情会带来 XSS 风险吗？ " /> 像这样清理输入会更好吗？ " />
python - Django SECRET_KEY 风险
关闭。这个问题是off-topic .它目前不接受答案。想改进这个问题吗？ Update the question所以它是on-topic用于堆栈溢出。关闭 10 年前。 Improve thi
c# - NCrunch 风险/进度窗口上的图表是如何计算的？
谁能阐明图表上曲线的含义、阴影区域的含义以及轴的含义？最佳答案这是对风险随时间变化的预测。在 NCrunch wiki 中有一些文档对此进行了描述:http://wiki.ncrunch.net/
azure - Cosmosdb mongo 在大集合上创建索引 - 风险
我有一个包含 200 万条记录的集合，如果可能的话，我希望在不停机的情况下在其上添加单个字段索引。我的问题是:是否可以/需要备份集合？使用 azure 门户创建索引在后台运行它而不会造成任何服务停机
ajax - Ajax 中的 CSRF 风险
我正在使用Symfony2并使用 CSRF token 保护我的表单。我有一个基于 Ajax 调用的评论系统。如果用户想要编辑他的评论，则会发生以下情况: 用户点击编辑按钮。通过 ajax 加载“
java - Checkmarx 显示代码存在二阶注入(inject)风险
Checkmark 扫描了我们的代码并显示这些代码存在二阶注入(inject)的风险像这样的代码 @SuppressWarnings("unchecked") public List> findByS
javascript - 如果没有将用户输入发送到数据库，是否存在注入(inject)风险？
我有一个包含几百行的小型 MySQL 数据库(全部为文本，无图像)。我正在使用 iQuery 请求所有行并在客户端进行所有过滤。 iQuery 代码如下: $(document).ready( fun
php - 如果用户只查看自己的数据——是否存在 XSS 风险？
如果我的网站只允许用户查看他们自己提交的数据，而永远不允许其他用户提交的数据(即没有一般的“帖子”等)——那么我的网站是否真的存在 XSS 风险？我仍将致力于 XSS 解决方案(如 httmlspe
ecmascript-6 - 使用带有不受信任字符串的模板文字设置属性值时，是否存在 XSS 风险？
我正在构建一个 iframe，而不是 innerHTML , 但带有 createElement .. 我使用了两个不受信任的字符串: iframeEl.title = untrustedStr1;
java - 排序方向上的 SQL 注入(inject)风险？
我正在生成如下动态prepareStatement(字段可能会根据请求输入而变化)。 Veracode 扫描持续报告 SQL 注入(inject)风险 - CWE-89:SQL 命令中使用的特殊元素的
java - Java 中的 Locale setDefault() 风险
我有一个可以在英语和德语之间切换语言的应用程序。当使用德语时，我希望货币显示将自动转换为德语格式。因此，在我的程序中，我必须检查区域设置，然后根据所选语言转换货币。我选择使用 locale.setDe

首页

博学

6Ren·AI

商城

java - 使用自己的 Java 代码在 WEKA 中获取风险预测

Classifying instances

更新