Hierarchical Data frame from a flat dataframe(来自平面数据帧的分层数据帧)-6ren

Hierarchical Data frame from a flat dataframe(来自平面数据帧的分层数据帧)

转载作者：bug小助手更新时间：2023-10-25 13:37:19

I having a nested json object, I am able to parse and flatten it to a single level dataframe by preserving hierarchy. Now I need to generate hierarchical data frame need some help on that.

我有一个嵌套的json对象，我能够通过保留层次结构将其解析和扁平化为单层数据帧。现在我需要生成分层数据帧，需要一些帮助。

Sample Object:

示例对象：

{"rr":{"bp": {
"0": "0 - 10",
"1": "10 - 20",
"2": "20 - 30"
},
"al": {
"0": 11.8,
"1": 77.2,
"2": 98.4
}}
}

{“rr”：{“BP”：{“0”：“0-10”，“1”：“10-20”，“2”：“20-30”}，“al”：{“0”：11.8，“1”：77.2，“2”：98.4}}

flattened Dataframe:
  rr.bp.0  rr.bp.1  rr.bp.2  rr.al.0  rr.al.1  rr.al.2
0  0 - 10  10 - 20  20 - 30     11.8     77.2     98.4

expected hierarchy dataframe 
Header1     Header2 0   1   2   3   4   5   6   7   8
rr      bp  0 - 10  10 - 20 20 - 30                      
rr      al  11.8    77.2    98.4

expected hierarchy dataframe

预期的层次结构数据帧

trying something like this

尝试像这样的事情


``
for key, value in flattened_data.items():
    keys = key.split(".")
    column_headers = keys[:-1]
    index_label = keys[-1]
    columns = pd.MultiIndex.from_tuples([tuple(column_headers)], names=column_headers)
    temp_df = pd.DataFrame([value],columns=columns)
    temp_df.index = [index_label]
df = pd.concat([df,temp_df])
``

but everything is coming as NaN values

但一切都是以NaN价值观为基础的

Update
I want the output to be dynamic . You can assume that my key names are in order and depth is preserved in flattened dataframe.

更新我希望输出是动态的。您可以假定我的密钥名称是有序的，并且深度被保留在扁平的数据帧中。

Thanks @Timeless. Based on your answer I tried this, it is working
df will be a single level flattened df
example: For this json {"rr":{"bp": { "0": "0 - 10", "1": "10 - 20", "2": "20 - 30" }, "al": { "0": 11.8, "1": 77.2, "2": 98.4 }} }
df will be flattened Dataframe:
rr.bp.0 rr.bp.1 rr.bp.2 rr.al.0 rr.al.1 rr.al.2
0 0 - 10 10 - 20 20 - 30 11.8 77.2 98.4

谢谢@Timeless。根据你的回答我试了这个，它工作的df将是一个单层扁平化的df例子：对于这个json {“rr”：{“bp”：{“0”：“0 - 10”，“1”：“10 - 20”，“2”：“20 - 30”}，“al”：{“0”：11.8，“1”：77.2，“2”：98.4 } df将变平数据帧：rr.bp.0 rr.bp.1 rr.bp.2 rr.al.0 rr.al.1 rr.al.2 0 0 - 10 10 - 20 20 - 30 11.8 77.2 98.4

If more nested keys are there then more dots and keys are added in the same order

如果存在更多嵌套键，则会按相同顺序添加更多点和键

max_depth = 0
max_depth_list = []
for col in flatten_copy.columns:
    max_depth_list.append(len(re.findall('\.', col)))
max_depth = max(max_depth_list)
df = (
    pd.DataFrame(df).pipe(lambda x: x.set_axis(x.columns.str.split(".", expand=True), axis=1))
    .stack(list(range(max_depth))).droplevel(0).rename_axis([f"Header{i}" for i in range(max_depth)]).reset_index()
)

更多回答

优秀答案推荐

Your expected output is ambiguous or at least doesn't match with the title of your question.

您的预期输出不明确，或者至少与您问题的标题不匹配。

I suppose that you're expecting a DataFrame like this one :

我想您正在期待这样的DataFrame：

sample_obj = {
    "rr": {
        "bp": {"0": "0 - 10", "1": "10 - 20", "2": "20 - 30"},
        "al": { "0": 11.8, "1": 77.2, "2": 98.4}
          }
}

df = (
    pd.DataFrame(sample_obj).stack()
        .apply(pd.Series) # with a FutureWarning in 2.1.0
        .swaplevel().rename_axis(["Header1", "Header2"]).reset_index()
)

Output :

输出：

print(df)

  Header1 Header2       0        1        2
0      rr      al    11.8     77.2     98.4
1      rr      bp  0 - 10  10 - 20  20 - 30

UPDATE :

更新：

If you start from flattened_data, you can use this :

如果从FLATEED_DATA开始，则可以使用以下命令：

flattened_data = {
    'rr.bp.0': {0: '0 - 10'},
    'rr.bp.1': {0: '10 - 20'},
    'rr.bp.2': {0: '20 - 30'},
    'rr.al.0': {0: 11.8},
    'rr.al.1': {0: 77.2},
    'rr.al.2': {0: 98.4}
} # which is the result of pd.json_normalize(sample_obj).to_dict()

df = (
    pd.DataFrame(flattened_data)
        .pipe(lambda x: x.set_axis(
            x.columns.str.split(".", expand=True), axis=1))
        .stack([0, 1]).droplevel(0)
        .rename_axis(["Header1", "Header2"])
        .reset_index()
)

更多回答

Thanks you solution gives me the required output. But I need it to process from flattened dataframe. As I am handling list of arrays also to single level while flattening. Is it possible. Again thanks for the solution

谢谢您的解决方案给了我所需的输出。但我需要它来处理扁平的数据帧。因为我正在处理数组列表，同时也将其展平到单级。有没有可能。再次感谢您的解决方案

You're welcome! I update the answer to address your comment. If not, you need to provide a clear/explicit input that matches with your actual data.

不客气！我更新了答案，以回应您的评论。如果不是，您需要提供与您的实际数据相匹配的明确/显式输入。

your code is working really well. I tried to make it dynamic. Can you check it. [ flatten_copy = df.copy() max_depth = 0 max_depth_list = [] for col in flatten_copy.columns: max_depth_list.append(len(re.findall('\.', col))) max_depth = max(max_depth_list) df = ( pd.DataFrame(df).pipe(lambda x: x.set_axis(x.columns.str.split(".", expand=True), axis=1)) .stack(list(range(max_depth))).droplevel(0).rename_axis([f"Header{i}" for i in range(max_depth)]).reset_index() ) ] Thanks for the Help.

您的代码运行得非常好。我试着让它充满活力。你能查一下吗？[Flatten_Copy=df.Copy()max_Depth=0 For ol in Flatten_Copy.Columns：max_Depth_list.append(len(re.findall(‘\.’，ol)max_Depth=max(Max_Deep_List)df=(pd.DataFrame(Df).tube(lambda x：x.set_axis(x.Columns.str.Split(“.”，Expand=True))，Axis=1)).stack(list(range(max_depth))).droplevel(0).rename_axis([f“Header{i}”for i in Range(Max_Depth)]).Reset_Index()]感谢您的帮助。

As said before, you need to provide a clear input and include it to your question.

如前所述，您需要提供一个明确的输入，并将其包含在您的问题中。

Sorry for inconvenience. Really appreciate your support. Thanks I will update properly. But for now it is working Thanks

很抱歉给您带来不便。非常感谢您的支持。谢谢，我会及时更新的。但就目前而言，它正在发挥作用

javascript - 将复杂对象转换为表格格式(平面)
我有一个对象: [ { TEAMGROUP: "AB", TEAMNAME: "TEAM1", SPRINTS: [ { ID: 1,
colors - 平面、半平面和交错格式之间有什么区别？
颜色模型和颜色空间之间的差异 RGB565 与 RGB888 有何不同任何建议链接 YUV vs RGB vs YCbCr。？最佳答案 RGB 是一种加法颜色模型，其中红色、绿色和蓝色强度以不同的组
c++ - GLSL无法编译没有插值的着色器(平面)
我正在从单个顶点/索引缓冲区绘制一个具有多个网格的完整对象，并且它们具有不同的纹理。因此，我想到将纹理 ID 与顶点一起从顶点着色器传递到片段着色器中的片段。问题是禁用插值。我正在使用 GLSL ve
android - 如何创建具有所需宽度和长度段数的网格/平面？
我有一个包含 40000 个 float 的数组，用于指定 map 上的高度级别。我想在 OpenGL ES 2.0 中创建一个网格/平面，为该网格中的每个顶点分配一个来自该数组的高度值，以便它们创建
glsl - 有符号距离函数 - 3D 平面
我真的很喜欢 IQ 的页面以及有关 SDF 的信息: ( https://www.iquilezles.org/www/articles/distfunctions/distfunctions.htm
qt - 平面 QPushButton，背景颜色不起作用
我创建了 QPushButton在带有此样式表的 Qt Designer 中: QPushButton#pushButton { background-color: #ffffff; } QP
Javascript - 平面 map 的解决方法
所以我正在寻找一些平面 map 的解决方法，因为它在 IE 上不起作用，我找到了这个:但我不太明白为什么它会起作用 var gadjets = [ {computers:['asus', 'hp'
scala - 平面 Actor 树
child Actor 会不会太多？例如，如果我有一个有 10000 个 child Actor 的 Actor ，与每个有 1000 个 child Actor 的 10 个 Actor 相比，这会
3d - 如何有效地旋转和平移 3D 平面
我有一个由法线 (n) 和距离 (d)(距原点)定义的平面。我想把它改造成一个新的系统。长路是这样的: 1) 将距离 (d) 与法线 (n) 相乘得到一个向量 (p) 2) 旋转 (R) 并平移 (
javascript - 期望体积结果时的 Threecsg 平面
问题: 从球体中减去立方体会得到一个结果，其中 z 轴保留体积，但 y 轴和 x 轴产生平面圆盘，如图所示。我不确定为什么球体在那些方面正在失去体积。我正在使用 threeCSG 的典型减法。代码:
c# - 从单个(平面)数据库查询创建复合对象的方法
我通过 SQL 查询从我们的 ERP 获取产品数据，由此返回的数据在大小级别非常平坦。一个产品有 3 个级别: 风格颜色尺寸一种款式有多种颜色，一种颜色有多种尺码。我创建了以下模型: publ
javascript - 平面 JSON 展开为具有多个父级的层次结构作为字符串
我正在尝试展开一些 json 数据。如果我像下面这样使用我的测试数据，一切正常! var data = [ { "title": 1, "parentids": [0] }, { "title
ios - 绘制 3D 平面
我希望使用 SceneKit 在 Swift 中的 3D 空间中绘制多个平面。具体来说，这些表面都将位于双曲面内。我以前从未绘制过自定义形状/对象，而且在尝试理解文档时我已经迷失了方向。关于在 3D
ios - ARKit 平面，上面有现实世界的物体
预先感谢您阅读我的问题。我对 ARKit 非常陌生，并且已经学习了几个教程，这些教程向我展示了如何使用平面检测以及如何为平面使用不同的纹理。这个功能真的很棒，但这是我的问题。玩家是否可以先将飞机放置在
java - 使用(平面)映射优于简单的空检查的优点？
我正在阅读下面的源代码，我想知道我到底为什么要使用平面图方式。正如我所看到的，与通过 if 语句进行简单的 null 检查相比，实例化了更多的对象，执行了更多代码，这将在第一个 null 时终止，而不
javascript - 平面 UI 复选框样式不起作用
我正在编写一个 Rails 应用程序并使用 Flat UI 进行样式设置。我目前正在将 flatui-rails gem 与 twitter-bootstrap-rails gem 结合使用。一切正常
c++ - 平面/射线交点与点/平面投影的区别
我在维基百科中找到了射线平面相交代码的解决方案，该解决方案有效，我只是在其中求解线性方程组。后来我找到了一些点到平面投影的代码，显然实现方式不同，并且在特定条件下也会产生不同的解决方案。但是，我并
javascript - 平面 UI 复选框样式在启动时不应用
我正在使用 http://designmodo.github.io/Flat-UI/ 中的扁平 UI 我复制了复选框示例页面中的所有文件和代码。但是我注意到该复选框并未显示为样式复选框，但在我单击初
javascript - 如何用图像绘制等 Angular 平面？
这个问题已经有答案了: True Isometric Projection with HTML5 Canvas (3 个回答) 已关闭 7 年前。我想创建一个等轴测图。该 map 存在等距矩形，如图
CSS 下拉菜单 + 平面 UI
http://designmodo.github.io/Flat-UI/ 我想创建一个 Css 下拉菜单，我已经完成了下拉部分，但是我似乎无法模拟转换，也不知道如何编写这些代码。这是我目前所知道的，在

bug小助手

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

Hierarchical Data frame from a flat dataframe(来自平面数据帧的分层数据帧)