Elasticsearch: search results on clicking a hashtag

Reposted by: 行者123 · Updated: 2023-11-29 02:47:39

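I have hashtags written in camel case, for example #teamIndia. When such a hashtag is clicked, the search should fetch all results related to "#teamIndia": results containing "#teamIndia" should come first, then results containing "teamIndia", then "team India", then "team" or "India", and so on.

What I am doing:

Search text: "#teamIndia", "#NEWYORK", "#profession", "#2016"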

POST /clip
{
    "settings": {
        "analysis": {
            "char_filter" : {
                "space_hashtags" : {
                    "type" : "mapping",
                    "mappings" : ["#=>|#"]
                }
            },
            "filter": {
                "substring": {
                    "max_gram": "20",
                    "type": "nGram",
                    "min_gram": "1",
                    "token_chars": [
                        "whitespace"
                    ]
                },
                "camelcase": {
                    "type": "word_delimiter",
                    "type_table": ["# => ALPHANUM", "@ => ALPHANUM"]
                },
                "stopword": {
                    "type":       "stop",
                    "stopwords": ["and", "is", "the"]
                }
            },
            "analyzer": {
                "substring_analyzer": {
                    "filter": [
                        "lowercase",
                        "substring"
                    ],
                    "tokenizer": "standard"
                },
                "camelcase_analyzer": {
                    "type" : "custom",
                    "char_filter" : "space_hashtags",
                    "tokenizer" : "whitespace",
                    "filter": [
                        "camelcase",
                        "lowercase",
                        "stopword"
                    ]
                }
            }
        }
    },
    "mappings": {
        "Clip": {
            "properties": {
                "description": {
                    "type": "multi_field",
                    "fields": {
                        "description": {
                            "type": "string",
                            "analyzer": "substring_analyzer",
                            "search_analyzer": "standard"
                        },
                        "raw": {
                            "type": "string",
                            "index": "not_analyzed"
                        },
                        "hashtag": {
                            "type": "string",
                            "index": "analyzed",
                            "analyzer": "camelcase_analyzer"
                        }
                    }
                },
                ....
            }
        }
    }
}

Sample documents:

POST /clip/Clip/2 {"id" : 1, "description" : "TheBestAndTheBeast"}

POST /clip/Clip/3 {"id" : 2, "description" : "bikes in DUBAI TheBestAndTheBeast profession"}

POST /clip/Clip/3 {"id" : 2, "description" : "Know how a software engineer surprised his wife!    &lt;a href=&quot;search/clips?q=%23theProvider&amp;source=hashtag&quot; ng-click=&quot;handleModalClick()&quot;&gt; #theProvider &lt;/a&gt;     rioOlympic   &lt;a href=&quot;search/clips?q=%23DUBAI&amp;source=hashtag&quot; ng-click=&quot;handleModalClick()&quot;&gt; #DUBAI &lt;/a&gt;    &lt;a href=&quot;search/clips?q=%23TheBestAndTheBeast&amp;source=hashtag&quot; ng-click=&quot;handleModalClick()&quot;&gt; #TheBestAndTheBeast &lt;/a&gt;   &lt;a href=&quot;search/clips?q=%23rioOlympic&amp;source=hashtag&quot; ng-click=&quot;handleModalClick()&quot;&gt; #rioOlympic &lt;/a&gt;"}

Search query:

GET clip/_search
{
    "size": 100,
    "query": {
        "filtered": {
            "query": {
                "bool": {
                    "must": {
                        "query_string": {
                            "fields": [
                                "description.hashtag"
                            ],
                            "query": "teamIndia"
                        }
                    },
                    "should": {
                        "match": { "description.raw": "#teamIndia" }
                    }
                }
            }
        }
    }
}

Expected results, in order: "#teamIndia", "teamIndia", "team India", "team", "India".

The same applies to the other test keywords.

Best answer

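One reason the query in the original post does not work as expected is that description.raw is not_analyzed. As a result, #teamIndia will never match a document with description: "Animals and Pets and #teamIndia", because description.raw holds the whole unanalyzed string "Animals and Pets and #teamIndia" as a single term rather than the term #teamIndia.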
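The point about not_analyzed fields can be illustrated with a small Python sketch. It is only a toy model of term matching, not Elasticsearch code: the function names are made up for illustration, and the "analyzer" here is just a whitespace split plus lowercasing.

```python
# Toy model: a not_analyzed field stores the whole string as ONE term,
# while an analyzed field stores one term per token.

def index_not_analyzed(text):
    """Terms a not_analyzed field would hold: the full value, unchanged."""
    return {text}

def index_whitespace_lowercase(text):
    """Terms from a toy analyzer: whitespace split plus lowercase."""
    return {token.lower() for token in text.split()}

description = "Animals and Pets and #teamIndia"

raw_terms = index_not_analyzed(description)
analyzed_terms = index_whitespace_lowercase(description)

# The search term only matches if it equals a stored term exactly.
raw_hit = "#teamIndia" in raw_terms
analyzed_hit = "#teamindia" in analyzed_terms

print(raw_hit)       # False: the only stored term is the full sentence
print(analyzed_hit)  # True: "#teamindia" is one of the stored tokens
```

In this model the raw field can only ever match a query that is identical to the full description, which is why the "should" clause on description.raw never contributes.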

Assume the documents you have look like the second example in the original post.

Example:

{"id" : 2, "description" : "Animals and Pets and #teamIndia"}

or

{"id":7,"description":"This &lt;a href=&quot;search/clips?q=%23teamIndia&amp;source=hashtag&quot;&gt;#teamIndia&lt;/a&gt;"}

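Then you should be able to rank the documents in the following order:

1) descriptions containing "#teamIndia"
2) descriptions containing "teamIndia"
3) descriptions containing "team India"
4) descriptions containing "India"

by enabling preserve_original and catenation (catenate_all in the settings below) in the word_delimiter filter, as shown in the following example.

Example:

Create the index and index the documents: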

PUT clip
{
   "settings": {
      "analysis": {
         "char_filter": {
            "zwsp_normalize": {
               "type": "mapping",
               "mappings": [
                  "\\u200B=>",
                  "\\u200C=>",
                  "\\u200D=>"
               ]
            },
            "html_decoder": {
               "type": "mapping",
               "mappings": [
                  "&lt;=> <",
                  "&gt;=> >"
               ]
            }
         },
         "filter": {
            "camelcase": {
               "type": "word_delimiter",
               "preserve_original": "true",
               "catenate_all": "true"
            },
            "stopword": {
               "type": "stop",
               "stopwords": [
                  "and",
                  "is",
                  "the"
               ]
            }
         },
         "analyzer": {
            "camelcase_analyzer": {
               "type": "custom",
               "tokenizer": "whitespace",
               "filter": [
                  "camelcase",
                  "lowercase",
                  "stopword"
               ],
               "char_filter": [
                  "zwsp_normalize",
                  "html_decoder",
                  "html_strip"
               ]
            }
         }
      }
   },
   "mappings": {
      "Clip": {
         "properties": {
            "description": {
               "type": "multi_field",
               "fields": {
                  "hashtag": {
                     "type": "string",
                     "index": "analyzed",
                     "analyzer": "camelcase_analyzer",
                     "norms": {
                         "enabled": false
                     }
                  }
               }
            }
         }
      }
   }
}
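To get a feel for what the camelcase filter above emits, here is a rough Python approximation of word_delimiter with preserve_original and catenate_all, followed by lowercase. It is a sketch under simplifying assumptions (a single regex for camel-case and letter/digit boundaries, duplicates dropped), not the real Lucene implementation, and camelcase_tokens is a hypothetical helper name.

```python
import re

def camelcase_tokens(word):
    """Approximate word_delimiter (preserve_original + catenate_all) + lowercase.

    Simplified illustration only; the real filter has more rules and options.
    """
    # Sub-words: runs of capitals, capitalised or lowercase words, digit runs.
    # Characters such as '#' match nothing and therefore act as delimiters.
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|[0-9]+", word)
    tokens = [word] + parts              # preserve_original, then the sub-words
    if len(parts) > 1:
        tokens.append("".join(parts))    # catenate_all: sub-words glued together
    # Lowercase and drop duplicates while keeping first-seen order.
    seen = []
    for token in tokens:
        token = token.lower()
        if token not in seen:
            seen.append(token)
    return seen

hashtag_tokens = camelcase_tokens("#teamIndia")
plain_tokens = camelcase_tokens("teamIndia")
upper_tokens = camelcase_tokens("#DUBAI")

print(hashtag_tokens)  # ['#teamindia', 'team', 'india', 'teamindia']
print(plain_tokens)    # ['teamindia', 'team', 'india']
print(upper_tokens)    # ['#dubai', 'dubai']
```

The key observation is that only text containing the literal hashtag produces the "#teamindia" token, so a document with "#teamIndia" shares more tokens with the query than one with "teamIndia" or "team India".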



POST /clip/Clip/1
{
   "id": 1,
   "description": "Animals and Pets and #teamIndia"
}

POST /clip/Clip/2 
{
   "id": 2,
   "description": "Animals and Pets and teamIndia"
}


POST /clip/Clip/3
{
   "id": 3,
   "description": "Animals and Pets and team India"
}


POST /clip/Clip/4 
{
   "id": 4,
   "description": "Animals and Pets and India"
}



POST /clip/Clip/7
{
   "id": 7,
   "description": "This &lt;a href=&quot;search/clips?q=%23teamIndia&amp;source=hashtag&quot;&gt;#teamIndia&lt;/a&gt;"
}

Query:

POST clip/_search?search_type=dfs_query_then_fetch
{
   "size": 100,
   "query": {
      "filtered": {
         "query": {
            "bool": {
               "must": [
                  {
                     "query_string": {
                        "fields": [
                           "description.hashtag"
                        ],
                        "query": "#teamIndia"
                     }
                  }
               ]
            }
         }
      }
   }
}

Results:

      "hits": {
      "total": 5,
      "max_score": 1.4969246,
      "hits": [
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "7",
            "_score": 1.4969246,
            "_source": {
               "id": 7,
               "description": "This &lt;a href=&quot;search/clips?q=%23teamIndia&amp;source=hashtag&quot;&gt;#teamIndia&lt;/a&gt;"
            }
         },
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "1",
            "_score": 1.4969246,
            "_source": {
               "id": 1,
               "description": "Animals and Pets and #teamIndia"
            }
         },
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "2",
            "_score": 1.0952718,
            "_source": {
               "id": 2,
               "description": "Animals and Pets and teamIndia"
            }
         },
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "3",
            "_score": 0.5207714,
            "_source": {
               "id": 3,
               "description": "Animals and Pets and team India"
            }
         },
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "4",
            "_score": 0.11123338,
            "_source": {
               "id": 4,
               "description": "Animals and Pets and India"
            }
         }
      ]
   }
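A rough intuition for this ordering is that documents sharing more analyzed tokens with the query score higher. The Python sketch below counts shared tokens for documents 1 to 4. It is a deliberate simplification and not Lucene's scoring formula (which also weighs term frequency, rarity and other factors); the analyze function is a hypothetical stand-in for camelcase_analyzer.

```python
import re

STOPWORDS = {"and", "is", "the"}

def analyze(text):
    """Toy stand-in for camelcase_analyzer: returns a set of lowercase tokens."""
    tokens = set()
    for word in text.split():
        # Sub-words split on camel-case boundaries and non-alphanumerics.
        parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|[0-9]+", word)
        # Original word, its sub-words, and the sub-words glued together.
        candidates = [word] + parts + ["".join(parts)]
        tokens.update(c.lower() for c in candidates if c)
    return tokens - STOPWORDS

docs = {
    1: "Animals and Pets and #teamIndia",
    2: "Animals and Pets and teamIndia",
    3: "Animals and Pets and team India",
    4: "Animals and Pets and India",
}

query_tokens = analyze("#teamIndia")

# Number of query tokens each document contains.
overlap = {doc_id: len(query_tokens & analyze(text)) for doc_id, text in docs.items()}
ranking = sorted(overlap, key=overlap.get, reverse=True)

print(overlap)  # {1: 4, 2: 3, 3: 2, 4: 1}
print(ranking)  # [1, 2, 3, 4]
```

The token counts follow the same order as the _score values in the results above, which is consistent with the hashtag form matching the most query tokens.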

Example: #dubai

POST /clip/Clip/5
{
   "id": 5,
   "description": "#dubai is hot"
}

POST /clip/Clip/6
{
   "id": 6,
   "description": "dubai airport is huge"
}

POST clip/_search?search_type=dfs_query_then_fetch
{
   "size": 100,
   "query": {
      "filtered": {
         "query": {
            "bool": {
               "must": [
                  {
                     "query_string": {
                        "fields": [
                           "description.hashtag"
                        ],
                        "query": "#dubai"
                     }
                  }
               ]
            }
         }
      }
   }
}

   "hits": {
      "total": 2,
      "max_score": 1.820827,
      "hits": [
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "5",
            "_score": 1.820827,
            "_source": {
               "id": 5,
               "description": "#dubai is hot"
            }
         },
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "6",
            "_score": 0.5856731,
            "_source": {
               "id": 6,
               "description": "dubai airport is huge"
            }
         }
      ]
   }

Example: #professionalAndPunctual

POST /clip/Clip/7
{
   "id": 7,
   "description": "professionalAndPunctual"
}
POST clip/_search?search_type=dfs_query_then_fetch
{
   "size": 100,
   "query": {
      "filtered": {
         "query": {
            "bool": {
               "must": [
                  {
                     "query_string": {
                        "fields": [
                           "description.hashtag"
                        ],
                        "query": "#professionalAndPunctual"
                     }
                  }
               ]
            }
         }
      }
   }
} 

 "hits": [
     {
        "_index": "clip",
        "_type": "Clip",
        "_id": "7",
        "_score": 2.2149992,
        "_source": {
           "id": 7,
           "description": "professionalAndPunctual"
        }
     }
  ]

Edited examples

Example: #TheBestAndTheBeast

POST /clip/Clip/10
{"id" : 10, "description" : "TheBestAndTheBeast"}

POST /clip/Clip/11
{"id" :11, "description" : "bikes in DUBAI TheBestAndTheBeast profession"}

POST /clip/Clip/12
{"id" : 12, "description" : "Know how a software engineer surprised his wife! <a href=\"search/clips?q=%23theProvider&source=hashtag\" ng-click=\"handleModalClick()\"> #theProvider </a> rioOlympic <a href=\"search/clips?q=%23DUBAI&source=hashtag\" ng-click=\"handleModalClick()\"> #DUBAI </a> <a href=\"search/clips?q=%23TheBestAndTheBeast&source=hashtag\" ng-click=\"handleModalClick()\"> #TheBestAndTheBeast </a> <a href=\"search/clips?q=%23rioOlympic&source=hashtag\" ng-click=\"handleModalClick()\"> #rioOlympic </a>"}

POST clip/_search?search_type=dfs_query_then_fetch
{
   "size": 100,
   "query": {
      "filtered": {
         "query": {
            "bool": {
               "must": [
                  {
                     "query_string": {
                        "fields": [
                           "description.hashtag"
                        ],
                        "query": "#TheBestAndTheBeast"
                     }
                  }
               ]
            }
         }
      }
   }
}

Results:

 "hits": [
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "12",
            "_score": 1.8701664,
            "_source": {
               "id": 12,
               "description": "Know how a software engineer surprised his wife! <a href=\"search/clips?q=%23theProvider&source=hashtag\" ng-click=\"handleModalClick()\"> #theProvider </a> rioOlympic <a href=\"search/clips?q=%23DUBAI&source=hashtag\" ng-click=\"handleModalClick()\"> #DUBAI </a> <a href=\"search/clips?q=%23TheBestAndTheBeast&source=hashtag\" ng-click=\"handleModalClick()\"> #TheBestAndTheBeast </a> <a href=\"search/clips?q=%23rioOlympic&source=hashtag\" ng-click=\"handleModalClick()\"> #rioOlympic </a>"
            }
         },
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "10",
            "_score": 0.9263139,
            "_source": {
               "id": 10,
               "description": "TheBestAndTheBeast"
            }
         },
         {
            "_index": "clip",
            "_type": "Clip",
            "_id": "11",
            "_score": 0.9263139,
            "_source": {
               "id": 11,
               "description": "bikes in DUBAI TheBestAndTheBeast profession"
            }
         }
      ]

Analyzer examples:

GET clip/_analyze?analyzer=camelcase_analyzer&text=%23DUBAI

   {
   "tokens": [
      {
         "token": "#dubai",
         "start_offset": 0,
         "end_offset": 6,
         "type": "word",
         "position": 0
      },
      {
         "token": "dubai",
         "start_offset": 1,
         "end_offset": 6,
         "type": "word",
         "position": 0
      }
   ]
}

GET clip/_analyze?analyzer=camelcase_analyzer&text=This%20%26lt%3Ba%20href%3D%26quot%3Bsearch%2Fclips%3Fq%3D%2523teamIndia%26amp%3Bsource%3Dhashtag%26quot%3B%26gt%3B%23teamIndia%26lt%3B%2Fa%26gt%3B

  {
   "tokens": [
      {
         "token": "this",
         "start_offset": 0,
         "end_offset": 4,
         "type": "word",
         "position": 0
      },
      {
         "token": "#teamindia",
         "start_offset": 78,
         "end_offset": 98,
         "type": "word",
         "position": 1
      },
      {
         "token": "india",
         "start_offset": 78,
         "end_offset": 98,
         "type": "word",
         "position": 2
      },
      {
         "token": "team",
         "start_offset": 78,
         "end_offset": 98,
         "type": "word",
         "position": 2
      },
      {
         "token": "teamindia",
         "start_offset": 78,
         "end_offset": 98,
         "type": "word",
         "position": 2
      }
   ]
}
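The second analyzer output contains no tokens from the anchor markup because the char filters run before tokenization. The following Python sketch mimics that chain (zwsp_normalize, html_decoder, html_strip) on the description of document 7. It is an approximation: the tag removal is a plain regex, entity decoding by html_strip is not modelled, and apply_char_filters is a hypothetical helper name.

```python
import re

def apply_char_filters(text):
    """Approximate the analyzer's char_filter chain, in the configured order."""
    # zwsp_normalize: drop zero-width space, non-joiner and joiner.
    for ch in ("\u200b", "\u200c", "\u200d"):
        text = text.replace(ch, "")
    # html_decoder: turn escaped angle brackets back into real ones.
    text = text.replace("&lt;", "<").replace("&gt;", ">")
    # html_strip (approximated): remove anything that looks like a tag.
    text = re.sub(r"<[^>]+>", "", text)
    # Collapse runs of whitespace for readability.
    return " ".join(text.split())

description = (
    "This &lt;a href=&quot;search/clips?q=%23teamIndia&amp;"
    "source=hashtag&quot;&gt;#teamIndia&lt;/a&gt;"
)

cleaned = apply_char_filters(description)
zero_width_cleaned = apply_char_filters("#TheBestAndTheBea\u200cst")

print(cleaned)             # This #teamIndia
print(zero_width_cleaned)  # #TheBestAndTheBeast
```

After this step only "This #teamIndia" is left for the whitespace tokenizer, which matches the tokens listed in the _analyze response above.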

Regarding "Elasticsearch : search results on clicking on Hashtag", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/39345299/
