Return row containing value(返回包含值的行)-6ren

Return row containing value(返回包含值的行)

转载作者：bug小助手更新时间：2023-10-25 09:46:22

32

4

I have a df where the last row is the median.

我有一个df，最后一行是中位数。

print(income.head(7))
geo_code                      1       2       3  ...     114     115     116
 1 228 801 -  2 457 600     NaN     NaN     NaN  ...     NaN     NaN     NaN
 1228801 -  2457600       305.0   104.0    74.0  ...     6.0   251.0    15.0
 153601 -  307200        2028.0  2330.0  2341.0  ...   153.0  2256.0  1149.0
 153 801 -  307 600         NaN     NaN     NaN  ...     NaN     NaN     NaN
 19201 -  38400           408.0   642.0   505.0  ...  2215.0   659.0  1006.0
 19 601 -  38 200           NaN     NaN     NaN  ...     NaN     NaN     NaN
 1 -  4800                 28.0    38.0    31.0  ...   497.0    80.0   106.0

print(income.tail(3))
geo_code                      1       2       3  ...     114     115     116
 9601 -  19200            167.0   401.0   237.0  ...  1551.0   476.0   583.0
 9601 -  19 600             NaN     NaN     NaN  ...     NaN     NaN     NaN
median                    408.0   627.0   505.0  ...   497.0   659.0   494.0

I need the index (the row) of the median please. How do I return the row that matches the last value in a column?

So the median of column 1, which is 408, will return: 19201 - 38400.

请给我中位数的指数(排)。如何返回与列中最后一个值匹配的行？因此，第1列的中位数为408，将返回：19201-38400。

更多回答

I just noticed, the columns with spaces in the numbers seem to be duplicates. So for the sake of a minimal reproducible example, you could remove them. See also reproducible pandas examples.

我刚刚注意到，数字中有空格的栏似乎是重复的。因此，为了达到最小的可重现性，您可以删除它们。另见可繁殖大熊猫的例子。

优秀答案推荐

You can find the median row using .tail(1).squeeze() and then iterate through the columns, finding the row index where the median value is located. Then a dictionary median_rows stores the row indices of the median values for each column. Note that the .index[0] part in (income[column_name] == median_value) extracts the index (row number) of the first row where the condition is met, assuming that there is only one median value per column.

您可以使用.ail(1).Squeeze()找到中位数行，然后迭代列，找到中位数所在的行索引。然后，字典MIDENT_ROWS存储每一列的中值的行索引。请注意，(Income[Column_Name]==Medium_Value)中的.index[0]部分提取满足条件的第一行的索引(行号)，假设每列只有一个中值。

# Calculate the median row for each column
median_row = income.tail(1).squeeze()

# Iterate through columns and find the row with the median value
median_rows = {}
for column_name, median_value in median_row.items():
    median_rows[column_name] = income[income[column_name] == median_value].index[0]

# Print 
for column_name, median_index in median_rows.items():
    print(f"Median of column {column_name}: {median_row[column_name]} is in row {median_index}")

IIUC, you can use idxmax:

IIUC，您可以使用idxmax：

df.loc[df.iloc[:-1, 1:].eq(df.iloc[-1, 1:], axis=1).idxmax(), 'geo_code']

Output:

产出：

4            19201 - 38400
0    1 228 801 - 2 457 600
4            19201 - 38400
6                 1 - 4800
4            19201 - 38400
0    1 228 801 - 2 457 600
Name: geo_code, dtype: object

If geo_code is the Index, you can simplify to:

如果Geo_code是索引，则可以简化为：

out = df.iloc[:-1].eq(df.iloc[-1], axis=1).idxmax()

And if you can have no-matches for some columns you further need to mask:

如果某些列没有匹配项，则需要进一步掩码：

m = df.iloc[:-1].eq(df.iloc[-1], axis=1)

m.idxmax().where(m.any())

1      19201 - 38400
2                NaN
3      19201 - 38400
114         1 - 4800
115    19201 - 38400
116              NaN
dtype: object

更多回答

income.tail(1).squeeze() can be simplified to income.iloc[-1].

可以将income.ail(1).挤压()简化为income.iloc[-1]。

In OP's df, geo_code is the index. That lets you simplify to df.iloc[:-1].eq(df.iloc[-1], axis=1).idxmax() or df.loc[df.index != 'median'].eq(df.loc['median'], ..., with the result being indexed by column instead of by row.

在op的df中，geo_code是索引。这使您可以简化为df.iloc[：-1].eq(df.iloc[-1]，轴=1).idxmax()或df.loc[df.index！=‘Medium’].eq(df.loc[‘Medium’]，...)，结果按列而不是按行索引。

It's worth noting that the data in the question is incomplete, so the two results at row 0 are incorrect.

值得注意的是，问题中的数据不完整，因此第0行的两个结果是不正确的。

@wjandrea good point, I assumed this was a column but the formatting suggests it could be otherwise. I included other options and showed how to mask the values should there be no match

@wjandrea很好，我以为这是一个列，但格式显示它可能不是。我还包括了其他选项，并展示了如何在没有匹配的情况下屏蔽值

32

4

0

文章推荐： failure to send email by SendInBlue(SendInBlue发送电子邮件失败)

containers - Sparql查询集合和rdf :containers?
大家好，所有rdf/sparql开发人员。这是一个困扰了我一段时间的问题，但是自从发布rdf和sparql规范以来，似乎没人能准确回答这个问题。为了说明这种情况，RDF定义了几种方法来处理资源的多值
containers - Bootstrap .container 元素的边距不够大
我在我的应用程序中使用 Bootstrap ，现在遇到了一个大问题。问题是 .container 元素在 1360 px 的屏幕上具有 274px 的左右边距，这是相当大的。结果，一切看起来都被挤到了
docker - “docker container rm ”和“docker rm ”
我在删除Docker容器时遇到问题-当我使用前一个命令时，它不起作用(Docker报告了容器ID，但没有删除它)。后者起作用了。据我所知，Docker语法是相同的: C:\Users\user>doc
c++ - 我可以始终使用 std::inserter(container, container.end()) 而不是 std::back_inserter(container) 吗？
std::back_inserter 仅适用于带有 push_back 的容器，因此它不适用于 set 和 map 另一方面，std::inserter 适用于所有容器类型。那么我可以一直使用 std
java - Caused by : java. lang.IllegalArgumentException : CONTAINING (1): [IsContaining, Containing, Contains]不支持redis查询推导-Redis
我正在开发 Spring Boot + Redis 示例。在此示例中，我开发了一些自定义方法，这些方法基于 RoleName 提取详细信息。对于以下方法 userRepository.findByRo
ios - GoogleTagManager 警告 : No default container found. Container needs to be added to a container folder and added to the target
在我的 Swift 应用程序中尝试实现 Google Tag Manager v5 时，我遇到了以下警告，这给我带来了一些麻烦: GoogleTagManager warning: No defaul
php - Illuminate\Container\Container::get($id) 的声明必须与 Psr\Container\ContainerInterface::get(string $id) 兼容
安装了新的 Laravel 8 项目并在加载第一个实例时，出现以下错误。这很奇怪，因为我把它放在一边，后来从 Laravel 5.8 -> 6 升级了另一个项目(工作正常)，当我去检查网站时遇到了类似
containers - Octave container.map 在成员函数中不起作用
我有以下测试代码，它只创建一个空的 hashmap (containers.map) 并在之后填充它: hashtable = containers.Map('KeyType','char','Va
containers - Google Container Engine和容器优化的Compute Engine有什么区别？
我对它们之间的差异有一点了解，但是拥有专家意见将是很棒的。 Container-Optimized Google Compute Engine Images Google Container Engi
c++ - 模板 : How to return container of container
我会模板化一个函数，以便将它与 vector、set 或任何其他 STL 容器(具有正确的 API...)一起使用我的函数当前原型(prototype)是: vector> f ( const ve
python Pandas : String Contains and Doesn't Contain
我正在尝试匹配包含和不包含某些字符串的 Pandas DataFrame 的行。例如: import pandas df = pandas.Series(['ab1', 'ab2', 'b2', 'c
sql - 在 SQL Server FullText 中使用 'CONTAINS(Foo, "A") OR CONTAINS(Foo, "B") 与 CONTAINS(Foo, '"A"OR "B"')
我需要在一个非常庞大的全文索引数据库中找到一些文本，但我不知道在我的查询术语变体中使用什么更好。我看过一些使用的例子 SELECT Foo.Bar FROM Foo WHERE
python - OpenCV 错误:(-215:断言失败)函数 'CvtHelper' 中的 VScn::contains(scn) && VDcn::contains(dcn) && VDepth::contains(depth)
Traceback (most recent call last): File "demo.py", line 132, in `result = find_strawberry(image
Excel公式: If cell contains substring "this" AND does not contain substring "that"
我正在尝试编写一个函数，其中一列包含一个子字符串并且不包含另一个子字符串。在下面的示例中，如果我的行包含“某些项目”并且不包含“开销”，我希望我的函数返回 1。 row| example strin
java - String.contains 注册为 !String.contains
我试图在文本文件中 append 包含给定字符串集的任何行。我创建了一个测试文件，在其中放置了这些字符串之一。我的代码应该将文本文件中包含这些字符串之一的任何行打印在与文本文件中的上一行相同的行上。这
containers - D: 不清楚如何使用 std.container 结构
我正在尝试学习如何使用 std.container 中可用的各种容器结构，但我无法理解如何执行以下操作: 1) 如何创建一个空容器？例如，假设我有一个用户定义的类 Foo，并且想要创建一个应该包含 F
mysql - contains 和 contained in sequelize Where 子句有什么用？
$contains: [1, 2] // @> [1, 2] (PG array contains operator) $contained: [1, 2] // <@ [1,
CSS:为什么使用 "div#container"语法而不只是 #container？
我看到 CSS 中使用了这种“div#container”语法，我想知道它是如何工作的。有人有它的资源吗？最佳答案除了作为上面提到的唯一引用之外，ID 还增加了特异性(我强烈建议您阅读这篇文章或一
c++ - "Inherit not, contain"或 "inherit, not contain"
我有一个生成很多子对象的应用程序，每个子对象都与一些全局应用程序对象一起工作，例如在全局应用程序注册表中注册自己，更新应用程序统计信息等。应用程序应该如何将访问这些全局对象的能力传递给 child
javascript - 如何让 Container 中的多个组件继承 Container 的计算宽度？
Here is a Sencha fiddle of my tab panel setup.按钮被动态添加到 vbox 选项卡容器中，该容器是 hbox 布局设置的一部分。选项卡容器的宽度由 flex

首页

博学

6Ren·AI

商城

Return row containing value(返回包含值的行)