tensorflow - 多目标和多类预测

转载作者：行者123 更新时间：2023-12-02 15:08:38

我对机器学习和 TensorFlow 都比较陌生。我想训练数据，以便可以对 2 个目标和多个类进行预测。这是可以做到的吗？我能够为 1 个目标实现该算法，但不知道我还需要如何为第二个目标实现该算法。

示例数据集: DayOfYear 温度流量可见性

316 8   1   4
285 -1  1   4
326 8   2   5
323 -1  0   3
10  7   3   6
62  8   0   3
56  8   1   4
347 7   2   5
363 7   0   3
77  7   3   6
1   7   1   4
308 -1  2   5
364 7   3   6

如果我训练(DayOfYear Temperature Flow)，我可以很好地预测能见度。但我也需要以某种方式预测 Flow。我很确定 Flow 会影响 Visibility，所以我不确定如何处理它。

这是我的实现

from __future__ import absolute_import
from __future__ import division
from __future__ import print_function

import os
import urllib

import numpy as np
import tensorflow as tf

# Data sets
TRAINING = "/ml_baetterich_learn.csv"
TEST = "/ml_baetterich_test.csv"
VALIDATION = "/ml_baetterich_validation.csv"

def main():

  # Load datasets.
  training_set = tf.contrib.learn.datasets.base.load_csv_without_header(
      filename=TRAINING,
      target_dtype=np.int,
      features_dtype=np.int,
      target_column=-1)
  test_set = tf.contrib.learn.datasets.base.load_csv_without_header(
      filename=TEST,
      target_dtype=np.int,
      features_dtype=np.int,
      target_column=-1)
  validation_set = tf.contrib.learn.datasets.base.load_csv_without_header(
      filename=VALIDATION,
      target_dtype=np.int,
      features_dtype=np.int,
      target_column=-1)

  # Specify that all features have real-value data
  feature_columns = [tf.contrib.layers.real_valued_column("", dimension=3)]

  # Build 3 layer DNN with 10, 20, 10 units respectively.
  classifier = tf.contrib.learn.DNNClassifier(feature_columns=feature_columns,
                                              hidden_units=[10, 20, 10],
                                              n_classes=9,
                                              model_dir="/tmp/iris_model")
  # Define the training inputs
  def get_train_inputs():
    x = tf.constant(training_set.data)
    y = tf.constant(training_set.target)

    return x, y

  # Fit model.
  classifier.fit(input_fn=get_train_inputs, steps=4000)

  # Define the test inputs
  def get_test_inputs():
    x = tf.constant(test_set.data)
    y = tf.constant(test_set.target)

    return x, y

  # Define the test inputs
  def get_validation_inputs():
    x = tf.constant(validation_set.data)
    y = tf.constant(validation_set.target)

    return x, y

  # Evaluate accuracy.
  accuracy_test_score = classifier.evaluate(input_fn=get_test_inputs,
                                       steps=1)["accuracy"]

  accuracy_validation_score = classifier.evaluate(input_fn=get_validation_inputs,
                                       steps=1)["accuracy"]

  print ("\nValidation Accuracy: {0:0.2f}\nTest Accuracy: {1:0.2f}\n".format(accuracy_validation_score,accuracy_test_score))

  # Classify two new flower samples.
  def new_samples():
    return np.array(
      [[327,8,3],
       [47,8,0]], dtype=np.float32)

  predictions = list(classifier.predict_classes(input_fn=new_samples))

  print(
      "New Samples, Class Predictions:    {}\n"
      .format(predictions))

if __name__ == "__main__":
    main()

最佳答案

选项 1:多头模型

您可以使用多头 DNNEstimator 模型。这将 Flow 和 Visibility 视为两个独立的 softmax 分类目标，每个都有自己的一组类。我不得不修改 load_csv_without_header 辅助函数以支持多个目标(这可能更清晰，但这不是重点 - 请随意忽略其细节)。

import numpy as np
import tensorflow as tf
from tensorflow.python.platform import gfile
import csv
import collections

num_flow_classes = 4
num_visib_classes = 7

Dataset = collections.namedtuple('Dataset', ['data', 'target'])

def load_csv_without_header(fn, target_dtype, features_dtype, target_columns):
    with gfile.Open(fn) as csv_file:
        data_file = csv.reader(csv_file)
        data = []
        targets = {
            target_cols: []
            for target_cols in target_columns.keys()
        }
        for row in data_file:
            cols = sorted(target_columns.items(), key=lambda tup: tup[1], reverse=True)
            for target_col_name, target_col_i in cols:
                targets[target_col_name].append(row.pop(target_col_i))
            data.append(np.asarray(row, dtype=features_dtype))

        targets = {
            target_col_name: np.array(val, dtype=target_dtype)
            for target_col_name, val in targets.items()
        }
        data = np.array(data)
        return Dataset(data=data, target=targets)

feature_columns = [
    tf.contrib.layers.real_valued_column("", dimension=1),
    tf.contrib.layers.real_valued_column("", dimension=2),
]
head = tf.contrib.learn.multi_head([
    tf.contrib.learn.multi_class_head(
        num_flow_classes, label_name="Flow", head_name="Flow"),
    tf.contrib.learn.multi_class_head(
        num_visib_classes, label_name="Visibility", head_name="Visibility"),
])
classifier = tf.contrib.learn.DNNEstimator(
    feature_columns=feature_columns,
    hidden_units=[10, 20, 10],
    model_dir="iris_model",
    head=head,
)

def get_input_fn(filename):
    def input_fn():
        dataset = load_csv_without_header(
            fn=filename,
            target_dtype=np.int,
            features_dtype=np.int,
            target_columns={"Flow": 2, "Visibility": 3}
        )
        x = tf.constant(dataset.data)
        y = {k: tf.constant(v) for k, v in dataset.target.items()}
        return x, y
    return input_fn

classifier.fit(input_fn=get_input_fn("tmp_train.csv"), steps=4000)
res = classifier.evaluate(input_fn=get_input_fn("tmp_test.csv"), steps=1)

print("Validation:", res)

选项2:多标签头

如果您将 CSV 数据用逗号分隔，并保留一行可能包含的所有类的最后一列(用空格等标记分隔)，您可以使用以下代码:

import numpy as np
import tensorflow as tf

all_classes = ["0", "1", "2", "3", "4", "5", "6"]

def k_hot(classes_col, all_classes, delimiter=' '):
    table = tf.contrib.lookup.index_table_from_tensor(
        mapping=tf.constant(all_classes)
    )
    classes = tf.string_split(classes_col, delimiter)
    ids = table.lookup(classes)
    num_items = tf.cast(tf.shape(ids)[0], tf.int64)
    num_entries = tf.shape(ids.indices)[0]

    y = tf.SparseTensor(
        indices=tf.stack([ids.indices[:, 0], ids.values], axis=1),
        values=tf.ones(shape=(num_entries,), dtype=tf.int32),
        dense_shape=(num_items, len(all_classes)),
    )
    y = tf.sparse_tensor_to_dense(y, validate_indices=False)
    return y

def feature_engineering_fn(features, labels):
    labels = k_hot(labels, all_classes)
    return features, labels

feature_columns = [
    tf.contrib.layers.real_valued_column("", dimension=1), # DayOfYear
    tf.contrib.layers.real_valued_column("", dimension=2), # Temperature
]
classifier = tf.contrib.learn.DNNEstimator(
    feature_columns=feature_columns,
    hidden_units=[10, 20, 10],
    model_dir="iris_model",
    head=tf.contrib.learn.multi_label_head(n_classes=len(all_classes)),
    feature_engineering_fn=feature_engineering_fn,
)

def get_input_fn(filename):
    def input_fn():
        dataset = tf.contrib.learn.datasets.base.load_csv_without_header(
            filename=filename,
            target_dtype="S100", # strings of length up to 100 characters
            features_dtype=np.int,
            target_column=-1
        )
        x = tf.constant(dataset.data)
        y = tf.constant(dataset.target)
        return x, y
    return input_fn

classifier.fit(input_fn=get_input_fn("tmp_train.csv"), steps=4000)
res = classifier.evaluate(input_fn=get_input_fn("tmp_test.csv"), steps=1)

print("Validation:", res)

我们将 DNNEstimator 与 multi_label_head 一起使用，它使用 sigmoid 交叉熵而不是 softmax 交叉熵作为损失函数。这意味着每个输出单位/logits 都通过 sigmoid 函数传递，它给出了数据点属于该类的可能性，即这些类是独立计算的，并不像 softmax 交叉熵那样相互排斥。这意味着您可以为训练集中的每一行和最终预测设置 0 到 len(all_classes) 类。

另请注意，类表示为字符串(k_hot 转换为标记索引)，因此您可以在电子商务设置中使用任意类标识符，例如类别 UUID。如果第 3 列和第 4 列中的类别不同(Flow ID 1 != Visibility ID 1)，您可以将列名称添加到每个类 ID 之前，例如

316,8,flow1 visibility4 285,-1,flow1 可见度4 326,8,flow2 能见度5

有关k_hot 工作原理的描述，请参阅my other SO answer .我决定使用 k_hot 作为一个单独的函数(而不是直接在 feature_engineering_fn 中定义它，因为它是一个独特的功能，并且 TensorFlow 可能很快就会有一个类似的实用函数。

请注意，如果您现在使用前两列来预测最后两列，您的准确性肯定会下降，因为最后两列高度相关并且使用其中一列会为您提供很多关于另一个。实际上，您的代码仅使用了第 3 列，如果目标是预测第 3 列和第 4 列，这无论如何都是一种欺骗。

关于tensorflow - 多目标和多类预测，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/45514517/

文章推荐： amazon-web-services - AWS SimpleDB CLI : How to use the 'select' command?

文章推荐： php - 从 Laravel 中的 url 获取最新的 slug

文章推荐： css - 带有 CSS 的略微弧形页脚

R 预测 - 如何仅绘制子集？
我正在使用 R 预测包拟合模型，如下所示: fit <- auto.arima(df) plot(forecast(fit,h=200)) 打印原始数据框和预测。当 df 相当大时，这
r - 预测-回归的神经网络预测相同的值
我正在尝试预测自有住房的中位数，这是一个行之有效的例子，给出了很好的结果。 https://heuristically.wordpress.com/2011/11/17/using-neural-ne
r - 预测()函数的类型参数
type="class"函数中的type="response"和predict有什么区别？例如： predict(modelName, newdata=testData, type = "class
python - 如何以图像的形式保存CNN模型的输出(预测)？
我有一个名为 Downloaded 的文件夹，其中包含经过训练的 CNN 模型必须对其进行预测的图像。下面是导入图片的代码: import os images = [] for filename i
区间内的 R 预测
关于预测的快速问题。我尝试预测的值是 0 或 1(它设置为数字，而不是因子)，因此当我运行随机森林时: fit , data=trainData, ntree=50) 并预测: pred, data
python - 预测，(找到正确的模型)
使用 Python，我尝试使用历史销售数据来预测产品的 future 销售数量。我还试图预测各组产品的这些计数。例如，我的专栏如下所示: Date Sales_count Department It
R SVM 预测
我是 R 新手，所以请帮助我了解问题所在。我试图预测一些数据，但预测函数返回的对象(这是奇怪的类(因子))包含低数据。测试集大小为 5886 obs。 160 个变量，当预测对象长度为 110 时..
java - 预测/识别电话号码的国家代码
关闭。这个问题需要更多focused .它目前不接受答案。想改进这个问题吗？更新问题，使其只关注一个问题 editing this post . 关闭 6 年前。 Improve this qu
python - 您如何从训练有素的网络对给定输入进行预测(预测)？
下面是我的神经网络代码，有 3 个输入和 1 个隐藏层和 1 个输出: #Data ds = SupervisedDataSet(3,1) myfile = open('my_file.csv','r
php - 预测/纠正全文搜索
我正在开发一个 Web 应用程序，它具有全文搜索功能，可以正常运行。我想对此进行改进并向其添加预测/更正功能，这意味着如果用户输入错误或结果为 0，则会查询该输入的更正版本，而不是查询结果。基本上类似
python - 具有单一分类特征的 LSTM 预测
我对时间序列还很陌生。这是我正在处理的数据集: Date Price Location 0 2012-01-01 1771.0
sequence - 如何使用隐马尔可夫模型进行 future 预测
我有许多可变长度的序列。对于这些，我想训练一个隐马尔可夫模型，稍后我想用它来预测(部分)序列的可能延续。到目前为止，我已经找到了两种使用 HMM 预测 future 的方法: 1) 幻觉延续并获得该延
映射到标签的 Tensorflow Serving 预测
我正在使用 TensorFlow 服务提供初始模型。我在 Azure Kubernetes 上这样做，所以不是通过更标准和有据可查的谷歌云。无论如何，这一切都在起作用，但是我感到困惑的是预测作为浮点
r - AWS 预测。项目数量的观察值太少
我正在尝试使用 Amazon Forecast 进行一些测试。我现在尝试了两个不同的数据集，它们看起来像这样: 13,2013-03-31 19:25:00,93.10999 14,2013-03-3
python - 预测 ufunc 输出的内存布局
使用 numpy ndarray大多数时候我们不需要担心内存布局的问题，因为结果并不依赖于它。除非他们这样做。例如，考虑这种设置 3x2 矩阵对角线的稍微过度设计的方法 >>> a = np.zer
R:如何在同一时间序列上绘制多个 ARIMA 预测
我想在同一个地 block 上用不同颜色绘制多个预测，但是，比例尺不对。我对任何其他方法持开放态度。可重现的例子: require(forecast) # MAKING DATA data
r - 通过分类变量和连续变量的交互可视化 GLMM 预测
我正在 R 中使用 GLMM，其中混合了连续变量和 calcategories 变量，并具有一些交互作用。我使用 MuMIn 中的 dredge 和 model.avg 函数来获取每个变量的效果估计。
output - 在命令行中导出 Weka 预测
我能够在 GUI 中成功导出分类器错误，但无法在命令行中执行此操作。有什么办法可以在命令行上完成此操作吗？我使用的是 Weka 3.6.x。在这里，您可以右键单击模型，选择“可视化分类器错误”并从那
R:如何在同一时间序列上绘制多个 ARIMA 预测
我想在同一个地 block 上用不同颜色绘制多个预测，但是，比例尺不对。我对任何其他方法持开放态度。可重现的例子: require(forecast) # MAKING DATA data
r - 预测 R 中的内存使用情况
我从 UCI 机器学习数据集库下载了一个巨大的文件。 (~300mb)。有没有办法在将数据集加载到 R 内存之前预测加载数据集所需的内存？ Google 搜索了很多，但我到处都能找到如何使用 R-p

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

tensorflow - 多目标和多类预测

选项 1:多头模型

选项2:多标签头