gpt4 book ai didi

python - 无法以 CSV 格式剥离和存储某些文件的内容

转载 作者:太空宇宙 更新时间:2023-11-03 15:39:29 25 4
gpt4 key购买 nike

我有一个看起来像这样的文件:

它们被放置在

~/ansible-environments/aws/random_name_1/inventory/group_vars/all 
~/ansible-environments/aws/random_name_2/inventory/group_vars/all
~/ansible-environments/aws/random_name_3/inventory/group_vars/all

我写道:

    import os
import sys
rootdir='/home/USER/ansible-environments/aws'
#print "aa"
for root, subdirs, files in os.walk(rootdir):
for subdir in subdirs:
all_path = os.path.join(rootdir, subdir, "inventory", "group_vars", "all")
if not os.path.isfile(all_path):
continue
try:
with open(all_path, "r") as f:
all_content = f.readlines()
except (OSError, IOError):
continue # ignore errors
csv_line = [""] * 3
for line in all_content:
if line[:9] == "isv_alias:":
csv_line[0] = line[7:].strip()
elif line[:21] == "LMID:":
csv_line[1] = line[6:].strip()
elif line[:17] == "products:":
csv_line[2] = line[10:].strip()
if all(value != "" for value in csv_line):
with open(os.path.join("/home/nsingh/nishlist.csv"), "a") as csv:
csv.write(",".join(csv_line))
csv.write("\n")

我只需要 LMIT、isv_alias、以下格式的产品:

alias,LMIT,product
bloodyhell,80,rms_scl
something_else,434,some_other_prod

最佳答案

这里存在三个问题:

  1. 查找所有键值文件
  2. 从每个文件中提取键和值
  3. 将每个文件中的键和值转换为 CSV 中的行

首先使用os.listdir()查找以下内容~/ansible-environments/aws,然后构建预期的路径每个 using 内的 inventory/group_vars 目录os.path.join(),并查看哪些确实存在。然后列出那些确实存在的目录的内容,并假设所有里面的文件(例如all)是键值文件。这个例子该答案末尾的代码假设所有文件都可以找到了这个方法;如果他们不能,你可能需要调整这个例子使用 os.walk() 或其他方法查找文件的代码。

每个键值文件都是一系列行,其中每一行都是一个键和值以冒号分隔(":")。您使用搜索的方法对于子字符串(运算符in)将失败,例如, key 包含字符串“LMIT”。相反,在冒号处分割线。表达式 line.split(":", 1) 在第一个位置分割行冒号,但不包括后续冒号,以防值本身具有冒号。然后从键和值中去除多余的空格,并构建一个键和值的字典。

现在选择您要保留的 key 。一旦你解析完每一个文件,从中查找字典中的关联值文件,并从中构建一个列表。然后添加值列表从此文件到所有文件的值列表的列表,以及使用 csv.writer 将列表列表写为 CSV 文件。

它可能看起来像这样:

#!/usr/bin/env python2
from __future__ import with_statement, print_function, division
import os
import csv

def read_kv_file(filename):
items = {}
with open(filename, "rU") as infp:
for line in infp:
# Split at a colon and strip leading and trailing space
line = [x.strip() for x in line.split(":", 1)]

# Add the key and value to the dictionary
if len(line) > 1:
items[line[0]] = line[1]
return items

# First find all random names
outer_dir = os.path.expanduser("~/ansible-environments/aws")
random_names = os.listdir(outer_dir)
inner_dirs = [
os.path.join(outer_dir, name, "inventory/group_vars")
for name in random_names
]

# Now filter it to those directories that actually exist
inner_dirs = [name for name in inner_dirs if os.path.isdir(name)]

wanted_keys = ["alias", "LMIT", "products"]
out_columns = ["alias", "LMIT", "product"]

# Collect key-value pairs from all files in these folders
rows = []
for dirname in inner_dirs:
for filename in os.listdir(dirname):
path = os.path.join(dirname, filename)

# Skip non-files in this directory
if not os.path.isfile(path):
continue

# If the file has a non-blank value for any of the keys of
# interest, add a row
items = read_kv_file(path)
this_file_values = [items.get(key) for key in wanted_keys]
if any(this_file_values):
rows.append(this_file_values)

# And write them out
with open("out.csv", "wb") as outfp:
writer = csv.writer(outfp, "excel")
writer.writerow(out_columns)
writer.writerows(rows)

关于python - 无法以 CSV 格式剥离和存储某些文件的内容,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/42288538/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com