gpt4 book ai didi

python - 在 pandas/numpy 中,如何制作包含字符串项计数的数据透视表?

转载 作者:行者123 更新时间:2023-12-01 09:11:53 24 4
gpt4 key购买 nike

在 python3 和 pandas 中,我有这个数据框:

df_selecao_atual.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 63 entries, 2 to 72
Data columns (total 24 columns):
nome 63 non-null object
nome_completo 63 non-null object
partido 63 non-null object
uf 63 non-null object
cargo_parlamentar 63 non-null object
tipo 63 non-null object
classe 63 non-null object
numero 63 non-null object
único 63 non-null object
assunto 63 non-null object
data_inicial 63 non-null object
data_final 63 non-null object
andamento 63 non-null object
link 63 non-null object
transparencia 63 non-null object
conferencia 63 non-null object
data_conferencia 63 non-null object
resumo 62 non-null object
observacao 60 non-null object
link_noticia_tribunal 22 non-null object
interessa 62 non-null object
ministro_relator 63 non-null object
processo_conectado 20 non-null object
situacao 63 non-null object
dtypes: object(24)
memory usage: 12.3+ KB

列“nomes”包含人名。 “tipos”列包含这些人拥有的流程类型 - “INQ”或“AP”

我制作了一个数据透视表来计算每个进程有多少种类型:

df_selecao_atual.pivot_table('tipo','nome',aggfunc='count').fillna(0).astype(int)

tipo
nome
ARTHUR LIRA 10
BENEDITO DE LIRA 3
CÉSAR MESSIAS 2
CÍCERO ALMEIDA 4
FERNANDO COLLOR 12
FLAVIANO MELO 4
GLADSON CAMELI 3
JORGE VIANA 5
MARX BELTRÃO 1
MAURÍCIO QUINTELLA LESSA 1
PAULÃO 2
RENAN CALHEIROS 3
ROCHA 1
RONALDO LESSA 3
SIBÁ MACHADO 1
SÉRGIO PETECÃO 8

但结果只是计算“tipo”列中有多少行

我寻找这样的结果:

ARTHUR LIRA     
INQ 9
AP 1
BENEDITO DE LIRA
INQ 3
AP 0
CÉSAR MESSIAS
INQ 1
AP 1
...

即统计每个名称中存在多少种“INQ”和“AP”类型

请问有人知道我该怎么做吗?

数据样本:

df_selecao_atual[['nome','tipo']]
nome tipo
2 CÉSAR MESSIAS INQ
3 CÉSAR MESSIAS AP
4 FLAVIANO MELO INQ
5 FLAVIANO MELO INQ
6 FLAVIANO MELO AP
7 FLAVIANO MELO INQ
10 ROCHA AP
13 SIBÁ MACHADO INQ
14 GLADSON CAMELI INQ
15 GLADSON CAMELI INQ
16 GLADSON CAMELI INQ
17 JORGE VIANA INQ
18 JORGE VIANA INQ
19 JORGE VIANA INQ
20 JORGE VIANA INQ
21 JORGE VIANA INQ
22 SÉRGIO PETECÃO INQ
23 SÉRGIO PETECÃO INQ
...

最佳答案

您可以使用:

df_selecao_atual.pivot_table(index=['tipo','nome'],aggfunc='size')

或者:

df_selecao_atual.groupby(['tipo','nome']).size()

关于python - 在 pandas/numpy 中,如何制作包含字符串项计数的数据透视表?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/51594431/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com