Balancing Data for Multiple-Instance Learning with Unbalanced Classes

Reposted by: bug小助手 · Updated: 2023-10-22 17:35:02

Problem Statement (Simplified):

I have a CSV file in which each row is labeled as either class A or class B. Class A has 906 instances and class B has 255. I want to classify the data with this multiple-instance learning (MIL) classifier: https://github.com/garydoranjr/misvm. However, the data is clearly imbalanced.

Additional Details:

I'm analyzing time-series patterns of specific activities, in particular brain activity. Each row in the CSV file represents a 5-second window for a single instance. The experiment lasts 'n' seconds in total, which gives approximately 'n/5' 5-second windows with a 1-second shift between them (ignore this if you are unfamiliar with the concept). The total number of rows in the CSV file is therefore roughly:

Total Rows = 906 * (n/5) + 255 * (n/5)
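
The number of windows per instance depends on the shift between windows. As a rough sketch (the function name, the whole-second durations, and the value of n below are assumptions made for illustration, not part of the original data), non-overlapping windows give about n/5 windows, while a 1-second shift gives n - 4:

```python
def window_count(n_seconds: int, window: int = 5, shift: int = 1) -> int:
    """Number of full windows of length `window` taken every `shift` seconds."""
    if n_seconds < window:
        return 0
    return (n_seconds - window) // shift + 1


n = 60  # hypothetical experiment length in seconds
print(window_count(n, shift=5))  # non-overlapping windows: n / 5
print(window_count(n, shift=1))  # 1-second shift: n - 4
```

Whichever count applies, both classes are multiplied by the same per-instance factor, so the 906 : 255 ratio between A and B is unchanged.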

Question:

I'm considering duplicating the rows of class B a certain number of times (e.g., 3 times) to balance the dataset. Is this a valid approach? Are there other approaches to tackle this kind of problem? Thanks in advance!
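
For reference, the duplication idea described above is usually called random oversampling. A minimal sketch with only the standard library might look like the following; the function name `oversample_minority`, the toy `(feature, label)` rows, and the fixed seed are all assumptions for illustration and do not come from the misvm package. It draws extra copies of minority rows at random until the classes match, rather than repeating every row a fixed number of times.

```python
import random
from collections import Counter


def oversample_minority(rows, label_of, seed=0):
    """Randomly duplicate rows of smaller classes until every class matches the largest."""
    rng = random.Random(seed)
    by_class = {}
    for row in rows:
        by_class.setdefault(label_of(row), []).append(row)
    target = max(len(group) for group in by_class.values())
    balanced = []
    for group in by_class.values():
        balanced.extend(group)
        # Draw the missing rows with replacement from the same class.
        balanced.extend(rng.choices(group, k=target - len(group)))
    rng.shuffle(balanced)
    return balanced


# Toy rows standing in for the CSV: (feature, label), using the class counts from the question.
rows = [(i, "A") for i in range(906)] + [(i, "B") for i in range(255)]
balanced = oversample_minority(rows, label_of=lambda r: r[1])
print(Counter(label for _, label in balanced))
```

Two caveats apply to this sketch. Duplicates should be created only inside the training split, since copies that land in both training and test data inflate the measured accuracy. In a multiple-instance setting the unit being duplicated would presumably be a whole bag rather than a single row, which this toy example does not model.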
