python - Normalization: how to avoid zero standard deviation


Reposted by: 行者123. Last updated: 2023-12-01 06:22:53


I have the following task:

Normalize the matrix by columns. From each value in column subtract average (in column) and divide it by standard deviation (in the column). Your output should not contain nan (caused by division by zero). Replace Nans with 1. Don't use if and while/for.

I am using numpy, so I wrote the following code:

import numpy as np

def normalize(matrix: np.ndarray) -> np.ndarray:
    # Subtract each column's mean, then divide by that column's standard deviation.
    res = (matrix - np.mean(matrix, axis=0)) / np.std(matrix, axis=0, dtype=np.float64)
    return res
matrix = np.array([[1, 4, 4200], [0, 10, 5000], [1, 2, 1000]])
assert np.allclose(
    normalize(matrix),
    np.array([[ 0.7071, -0.39223,  0.46291],
              [-1.4142,  1.37281,  0.92582],
              [ 0.7071, -0.98058, -1.38873]])
)

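The result is correct.

However, my question is: how do I avoid division by zero? If a column contains identical numbers, its standard deviation is 0 and the result contains NaN values. How can I fix this? Any help would be appreciated!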
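The failure described above can be reproduced with a small sketch. The input matrix here is an assumed example whose first column is constant, and `np.errstate` is used only to silence the warning NumPy emits for `0 / 0`:

```python
import numpy as np

# Assumed example input: the first column is constant, so its standard deviation is 0.
matrix = np.array([[2, 4, 4200], [2, 10, 5000], [2, 2, 1000]])

centered = matrix - np.mean(matrix, axis=0)
std = np.std(matrix, axis=0, dtype=np.float64)

# Suppress the RuntimeWarning raised for the 0 / 0 division in the first column.
with np.errstate(divide="ignore", invalid="ignore"):
    res = centered / std

print(std[0])                     # 0.0
print(np.isnan(res[:, 0]).all())  # True: every entry of the constant column is nan
```

Only the constant column is affected; the other columns are normalized as usual.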

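Best answer

Your task says that the output must not contain nan and that any nan that occurs should be replaced with 1. It does not say that intermediate results may not contain nan. One valid solution is to apply numpy.nan_to_num to res before returning: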

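import numpy as np

def normalize(matrix: np.ndarray) -> np.ndarray:
    # A constant column gives 0 / 0 = nan here (NumPy also emits a RuntimeWarning).
    res = (matrix - np.mean(matrix, axis=0)) / np.std(matrix, axis=0, dtype=np.float64)
    # Replace every nan with 1.0, modifying res in place.
    return np.nan_to_num(res, copy=False, nan=1.0)

matrix = np.array([[2, 4, 4200], [2, 10, 5000], [2, 2, 1000]])
print(normalize(matrix))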

Output:

[[ 1.         -0.39223227  0.46291005]
 [ 1.          1.37281295  0.9258201 ]
 [ 1.         -0.98058068 -1.38873015]]
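As a possible alternative to the answer's approach, the division can be skipped for zero-deviation columns altogether, so that no nan is produced in the first place. The sketch below relies on the `out` and `where` arguments of `np.divide`; the function name `normalize_safe` is hypothetical and does not come from the original answer.

```python
import numpy as np

def normalize_safe(matrix: np.ndarray) -> np.ndarray:
    # Hypothetical helper: same normalization, but without producing nan.
    centered = matrix - np.mean(matrix, axis=0)
    std = np.std(matrix, axis=0, dtype=np.float64)
    # Pre-fill the result with 1, then divide only in columns where std is non-zero.
    out = np.ones_like(centered, dtype=np.float64)
    return np.divide(centered, std, out=out, where=std != 0)

matrix = np.array([[2, 4, 4200], [2, 10, 5000], [2, 2, 1000]])
print(normalize_safe(matrix))
```

For finite input this should give the same matrix as the `nan_to_num` version, still without `if`, `for`, or `while`, and without triggering the division warning.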

This post is based on a similar question found on Stack Overflow: https://stackoverflow.com/questions/60283097/
