gpt4 book ai didi

javascript - Elasticsearch 按自定义项目重量排序

转载 作者:行者123 更新时间:2023-12-01 00:31:07 25 4
gpt4 key购买 nike

我已经存储了包含状态属性的文档。我想按状态优先级(而不是按状态字母顺序)对文档进行排序。我遵循了之前的答案并编写了以下函数,但它仍然无法按预期工作;文档按状态名称排序(按字母顺序):

function getESSortingByStatusQuery(query, order) {
let statusOrder = ['BLUE', 'RED', 'BLACK', 'YELLOW', 'GREEN'];
if(order == 'desc'){
statusOrder.reverse();
}
const functions = statusOrder.map((item) => {
const idx = statusOrder.indexOf(item);
return {filter: {match: {statusColor: item}},
weight: (idx + 1) * 50}
});
const queryModified = {
"function_score": {
"query": {"match_all": {}}, // this is for testing purposes and should be replaced with original query
"boost": "5",
"functions": functions,
"score_mode": "multiply",
"boost_mode": "replace"
}
}
return queryModified;
}

如果有人建议根据属性的预定义优先级(在本例中为状态)对项目进行排序的方法,我将不胜感激。

最佳答案

以下是示例 custom sort script我认为这就是你正在寻找的。我已经添加了示例映射、文档、查询和响应,如下所示。

映射:

PUT color_index
{
"mappings": {
"properties": {
"color":{
"type": "keyword"
},
"product":{
"type": "text"
}
}
}
}

示例文档:

POST color_index/_doc/1
{
"color": "BLUE",
"product": "adidas and nike"
}

POST color_index/_doc/2
{
"color": "GREEN",
"product": "adidas and nike and puma"
}

POST color_index/_doc/3
{
"color": "GREEN",
"product": "adidas and nike"
}

POST color_index/_doc/4
{
"color": "RED",
"product": "nike"
}

POST color_index/_doc/5
{
"color": "RED",
"product": "adidas and nike"
}

查询:

POST color_index/_search
{
"query": {
"bool": {
"must": [
{
"query_string": {
"default_field": "*",
"query": "adidas OR nike"
}
}
]
}
},
"sort": [
{ "_score": { "order": "desc"} }, <---- First sort by score
{ "_script": { <---- Second sort by Colors
"type": "number",
"script": {
"lang": "painless",
"source": "if(params.scores.containsKey(doc['color'].value)) { return params.scores[doc['color'].value];} return 100000;",
"params": {
"scores": {
"BLUE": 0,
"RED": 1,
"BLACK": 2,
"YELLOW": 3,
"GREEN": 4
}
}
},
"order": "asc"
}

}
]
}

首先,它将返回按分数排序的文档,然后将第二个排序逻辑应用于该结果。

对于第二次排序,即使用脚本排序,请注意我如何将数值添加到 scores 部分中的颜色。您需要相应地构造您的查询。

其工作原理位于 source 部分,我相信这是不言而喻的,我在其中使用了 doc['color'].value ,因为它是我正在应用自定义排序逻辑的领域。

响应:

{
"took" : 5,
"timed_out" : false,
"_shards" : {
"total" : 1,
"successful" : 1,
"skipped" : 0,
"failed" : 0
},
"hits" : {
"total" : {
"value" : 5,
"relation" : "eq"
},
"max_score" : null,
"hits" : [
{
"_index" : "color_index",
"_type" : "_doc",
"_id" : "1",
"_score" : 0.5159407,
"_source" : {
"color" : "BLUE",
"product" : "adidas and nike"
},
"sort" : [
0.5159407, <--- This value is score(desc by nature)
0.0 <--- This value comes from script sort as its BLUE and I've used value 0 in the script which is in 'asc' order
]
},
{
"_index" : "color_index",
"_type" : "_doc",
"_id" : "5",
"_score" : 0.5159407,
"_source" : {
"color" : "RED",
"product" : "adidas and nike"
},
"sort" : [
0.5159407,
1.0
]
},
{
"_index" : "color_index",
"_type" : "_doc",
"_id" : "3",
"_score" : 0.5159407,
"_source" : {
"color" : "GREEN",
"product" : "adidas and nike"
},
"sort" : [
0.5159407,
4.0
]
},
{
"_index" : "color_index",
"_type" : "_doc",
"_id" : "2",
"_score" : 0.40538198,
"_source" : {
"color" : "GREEN",
"product" : "adidas and nike and puma"
},
"sort" : [
0.40538198,
4.0
]
},
{
"_index" : "color_index",
"_type" : "_doc",
"_id" : "4",
"_score" : 0.10189847,
"_source" : {
"color" : "RED",
"product" : "nike"
},
"sort" : [
0.10189847,
1.0
]
}
]
}
}

注意前三个文档,它具有精确的 product 值,但不同的 color,您可以看到它们被分组在一起,因为我们首先按 _score 排序 然后我们按颜色排序

请告诉我这是否有帮助!

关于javascript - Elasticsearch 按自定义项目重量排序,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/58522821/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com