gpt4 book ai didi

python - 使用python的elasticsearch批量索引

转载 作者:IT王子 更新时间:2023-10-29 00:12:01 26 4
gpt4 key购买 nike

我正在尝试使用 python pyes 模块将具有 6M 记录的 csv 文件索引到 elasticsearch,代码逐行读取记录并将其推送到 elasticsearch ...知道如何将其作为批量发送吗?

import csv
from pyes import *
import sys

header = ['col1','col2','col3','col3', 'col4', 'col5', 'col6']

conn = ES('xx.xx.xx.xx:9200')

counter = 0

for row in reader:
#print len(row)
if counter >= 0:
if counter == 0:
pass
else:
colnum = 0
data = {}
for j in row:
data[header[colnum]] = str(j)
colnum += 1
print data
print counter
conn.index(data,'accidents-index',"accidents-type",counter)
else:
break

counter += 1

最佳答案

pyelasticsearch支持批量索引:

bulk_index(index, doc_type, docs, id_field='id', parent_field='_parent'[, other kwargs listed below])

例如,

cities = []
for line in f:
fields = line.rstrip().split("\t")
city = { "id" : fields[0], "city" : fields[1] }
cities.append(cities)
if len(cities) == 1000:
es.bulk_index(es_index, "city", cities, id_field="id")
cities = []
if len(cities) > 0:
es.bulk_index(es_index, "city", cities, id_field="id")

关于python - 使用python的elasticsearch批量索引,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/19271943/

26 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com