gpt4 book ai didi

python-3.x - 如何使用 Pandas txt 文件读取每行列名

转载 作者:行者123 更新时间:2023-12-04 13:34:32 25 4
gpt4 key购买 nike

我是 Python 初学者,我需要读取一个 txt 文件,其中每一行的列名,列是无序的,并不是所有的列都被告知。有没有办法用 Pandas 读取这种文件?
这是一个示例(3 行):

pepe01@mail.com:{ssha}fiy9XI6d:created="1575487257" fwd="" spf_block="" quota="1024mb" full_name="Full Name" mailaccess="envia" mailstatus="cancelled"

pepe02@mail.com:{ssha}Q0H90Rf9:created="1305323967" mailaccess="1" mailstatus="active" admin_access="" quota="" expire="0" full_name="Full Name" pais="CO"

pepe03@mail.com:{ssha}sCPC3HOE:created="1550680636" fwd="" pass_question="" pass_answer="" disabled="Y" mailstatus="cancelled" full_name="Name"

最佳答案

您可以使用 re模块来解析文件。
例如:

import re
import pandas as pd


all_data = []
with open('<YOUR FILE>', 'r') as f_in:
for line in f_in:
m = re.search(r'^(.*?):(.*?):', line)
if not m:
continue
data = dict(re.findall(r'([^\s]+)="([^"]+)"', line.split(':', maxsplit=2)[-1]))
data['mail'] = m.group(1)
data['password'] = m.group(2)
all_data.append(data)

df = pd.DataFrame(all_data).fillna('')
print(df)
打印数据框:
      created   quota  full_name mailaccess mailstatus             mail        password expire pais disabled
0 1575487257 1024mb Full Name envia cancelled pepe01@mail.com {ssha}fiy9XI6d
1 1305323967 Full Name 1 active pepe02@mail.com {ssha}Q0H90Rf9 0 CO
2 1550680636 Name cancelled pepe03@mail.com {ssha}sCPC3HOE Y

关于python-3.x - 如何使用 Pandas txt 文件读取每行列名,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/62961933/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com