gpt4 book ai didi

elasticsearch - Elassandra索引数据大小比实际数据大10倍

转载 作者:行者123 更新时间:2023-12-03 00:46:26 26 4
gpt4 key购买 nike

在Elassandra中,cassandra数据大小为8GB,但elasticsearch.data大小为83GB。我们的数据输入流为5 msgs / sec,以下是用于创建表和索引的查询:

表创建:

CREATE TABLE IF NOT EXISTS x.abc (
internal_tag text,
generated_at timestamp,
collected_at timestamp,
data_type text,
metadata text,
recorded_at timestamp,
value text,
PRIMARY KEY(internal_tag, generated_at)
)
WITH CLUSTERING ORDER BY(generated_at ASC)
AND bloom_filter_fp_chance = 0.01
AND caching = { 'keys': 'ALL', 'rows_per_partition': 'NONE' }
AND comment = ''
AND compaction = { 'class': 'org.apache.cassandra.db.compaction.SizeTieredCompactionStrategy', 'max_threshold': '32', 'min_threshold': '4' }
AND compression = { 'chunk_length_in_kb': '64', 'class': 'org.apache.cassandra.io.compress.LZ4Compressor' }
AND crc_check_chance = 1.0
AND default_time_to_live = 0
AND gc_grace_seconds = 864000
AND max_index_interval = 2048
AND memtable_flush_period_in_ms = 0
AND min_index_interval = 128
AND read_repair_chance = 0.0
AND speculative_retry = '99PERCENTILE';

索引创建:
curl -XPUT -H 'Content-Type: application/json' 'http://10.0.0.01:9200/x_abc_index' -d '{
"settings": {
"keyspace": "x"
},
"mappings":{
"abc" : {
"discover":".*"
}
}
}'

请提出解决数据大小问题的任何解决方案。
谢谢

最佳答案

我由LeBigCat建议,您可以通过减少映射中索引字段的数量来减少Elasticsearch索引的大小,或者选择正确的映射。

关于elasticsearch - Elassandra索引数据大小比实际数据大10倍,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/60315993/

26 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com