Elasticsearch。如何结合快速搜索实现以下原则？

My mapping is:

"current_name" => [
    "type"     => "string",
    "index"    => "analyzed",
    "analyzer" => "russian",
    "fields"   => [
        "raw"           => [
            "type"  => "string",
            "index" => "not_analyzed"
        ],
        "raw_lowercase" => [
            "type"     => "string",
            "analyzer" => "tolowercase"
        ]
    ]
],

I need to search the field using the following examples of principles (all together):

Indexed string - "monkeys". I need to find this document by "monkey".
Indexed string - "hello my beautiful world". I need to have possibility to find this document by "hello big world".
Indexed string - "appropriate". I need to have possibility to find this document by "apropriat".

Overall: Indexed - "the Earth planet is the most beautiful in our Solar system". I want to find this document by "earth is beautifal".

All those principles should be applied while user type in his query - quick search. Language is Russian.

Optional: 1) Indexed - "great job". I want to find the document by synonim word "good". 2) Indexed - "beautiful world" find by "beaut worl"

How can I realize described? What are your remarks about combining those principles with quick search?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongshan4518 2015-12-15 05:19
关注
Autosuggest considerations

Searchers expect autosuggest to be highly responsive. If any one of your lenient suggestion features costs >100ms, consider moving it out of autosuggest and into search results.

Autosuggest helps to affirm that a searcher is headed in the right direction. For each new lenient suggestion feature you describe and implement, be conscious of the ratio of bad suggestions introduced alongside the good ones. With the limited screen real-estate available for auto-suggest, it's often better to be precise rather than comprehensive.

Strategies to accomplish what you're asking

1) Indexed string - "monkeys". I need to find this document by "monkey".

This is an example of stemming or reducing common inflections of a term to a root form.

For example, mapping inputs of "fitted", "fitting", "fits", "fit" all to a common form, "fit".

Stemming has to occur both for indexed terms and for query terms, so that searches for any of the inflections will yield results containing any other inflections.

Within the Elasticsearch distribution are included two Russian stemmers, russian and light_russian, listed here (follow links to implementation descriptions).

Any of the suggester implementations can be parameterized with a custom analyzer. By default, they use the analyzer defined in the mapping for the field being suggested.

2) Indexed string - "hello my beautiful world". I need to have possibility to find this document by "hello big world"

One solution is simply a boolean search: hello OR my OR beautiful OR world. The implementation of the Elasticsearch match query defaults to boolean and would do what you describe given the phrase "hello my beautiful world" (assuming "hello" and "world" are tokens generated by the searched field's analyzer)

Another solution try would be using the phrase suggester to piece-together the matching terms in the query. (with max_errors >= 0.5 so that terms my beautiful could be considered misspellings.)

3) Indexed string - "appropriate". I need to have possibility to find this document by "apropriat".

You're describing a fuzzy search. This search provides 1-2 characters of leniency in the spelling of a term, and would certainly help chronic misspellers, and poor typists.

Both the completion suggester (which only needs a word prefix to provide suggestions), and the term suggester (which only suggests based on entire terms being entered) have the ability to specify fuzziness or leniency in the "edit distance" between the query and the field value.

Overall: Indexed - "the Earth planet is the most beautiful in our Solar system". I want to find this document by "earth is beautifal".

Optional: 1) Indexed - "great job". I want to find the document by synonim word "good". 2) Indexed - "beautiful world" find by "beaut worl"

(Overall) The phrase suggester may not be able to suggest "the Earth planet is the most beautiful in our Solar system" given the typed phrase "earth is beautifal". This is because there are a number of unrelated terms seperating "earth" and "beautiful" in the source document. A phrase search, with slop set to allow, say a gap of four terms (as in the example), would satisfy this solution. But you'd have to execute a (slower) search request inside your completion logic.

(Optional 1) Synonyms are discussed here, and can be included in your analyzer. Though, I would split-test this thoroughly, as searchers may not expect to see synonyms in their suggestions.

(Optional 1) I doubt the completion suggester will complete multiple terms like "beaut worl" you may have to use edge-ngrams. Practically speaking, however, I doubt anyone will ever type this, even accidentally.

Multiple suggester types can be requested within a _suggest call. You may end up running with a combination of completion and phrase suggesters to cover all of your bases.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

Elasticsearch。如何结合快速搜索实现以下原则？ elasticsearch php
2015-12-13 19:46

回答 1 已采纳 Autosuggest considerations Searchers expect autosuggest to be highly responsive. If any one of
求问以下elastic search query要变成Java query builder怎么实现？ elasticsearch java 有问必答
2021-03-16 11:52

回答 1 已采纳 public SearchSourceBuilder build(){ SearchSourceBuilder builder = new SearchSourceBuilder()
Java程序向elasticsearch服务器发出搜索请求 elasticsearch java javascript 全文检索搜索引擎
2020-09-22 21:01

回答 1 已采纳 https://blog.csdn.net/ROAOR1/article/details/88356225
Elasticsearch：什么是搜索引擎？
2024-02-19 11:58

Elastic 中国社区官方博客的博客搜索引擎对于希望快速有效地查找特定信息的用户来说是有用的工具。它们的范围、功能和索引的内容类型各不相同。这种多功能性可以满足不同环境下的特定用户需求。搜索引擎可以是巨大的互联网搜索引擎，旨在对网络上的...
ElasticSearch内容推荐实现 elasticsearch
2018-07-05 08:42

回答 1 已采纳 https://www.cnblogs.com/luckcs/articles/7052942.html
请问沙箱cuckoosandbox2.0.7的elasticsearch搜索怎么配置？ python 数据库
2023-02-13 20:39

回答 1 已采纳回答不易求求您采纳点赞哦感激不尽如果您的 Cuckoo Sandbox 2.0.7 安装没有安装 Elasticsearch，您需要安装它才能使用搜索功能。您可以通过以下步骤来安装 Ela
IK 分词，当英文与数字混合搜索时，遇到 Elasticsearch 分词问题。 elasticsearch
2021-09-06 12:29

回答 5 已采纳 PUT /test_analyzer { "settings": { "analysis": { "analyzer": { "test_analyzer":
ElasticSearch 之文本搜索
2022-08-02 23:00

Kuo-Teng的博客 1. 作为一款搜索引擎框架，文本搜索...2. ES在文本索引的建立和搜索过程中依赖两大组件，即Lucene和分析器。 3. Lucene负责进行倒排索引的物理构建，分析器负责在建立倒排索引前和搜索前对文本进行分词和语法处理。...
keyword和text能一起搜索吗？ elasticsearch lucene 中文分词全文检索搜索引擎
2020-05-21 23:43

回答 2 已采纳可以这样写： ``` POST YOUR_INDEX/_search { "query": { "bool": { "must": [ {
Elasticsearch 7.8版本能和JDK8完美匹配吗？ elasticsearch
2022-01-03 10:26

回答 1 已采纳 ES7.8版本的官网说明：https://www.elastic.co/guide/en/elasticsearch/reference/7.8/targz.html7.8版本的 JAVA需求：htt
JFinal如何与Elasticsearch连接？ elasticsearch java 搜索引擎
2017-07-06 00:45

回答 1 已采纳 http://blog.csdn.net/tianyaleixiaowu/article/details/72844584 通过这个工具类就OK
全文搜索引擎 ElasticSearch 还是 Solr？
2022-08-29 23:54

ThinkWon的博客最近项目组安排了一个任务，项目中用到了全文搜索，基于全文搜索 Solr，但是该 Solr 搜索云项目不稳定，经常查询不出来数据，需要手动...所以考虑开发一个适配层，如果 Solr 搜索出问题，自动切换到新的搜索--ES。...
ES的聚合搜索可以对所有数据进行分组吗？ elasticsearch java 有问必答
2021-09-06 20:32

回答 1 已采纳 terms支持一个size参数
ElasticSearch 分布式搜索引擎详解
2021-12-23 14:30

Modify_QmQ的博客 The Elastic Stack, 包括 Elasticsearch、Kibana、Beats 和 Logstash（也称为 ELK Stack）。能够安全可靠地获取任何来源、任何格式的数据，然后实时地对数据进行搜索、分析和可视化。Elaticsearch，简称为 ES，ES 是...
【搜索引擎:Elasticsearch】从0了解ES，整合springboot，京东搜索实战
2022-04-13 11:50

冷环渊的博客像类似百度、谷歌这种大数据全文搜索引擎的场景都可以使用Elasticsearch作为底层支持框架，可见Elasticsearch提供的搜索能力确实强大,市面上很多时候我们简称Elasticsearch为es。Logstash是ELK的中央数据流引擎，...
没有解决我的问题, 去提问

悬赏问题

¥15 2024-五一综合模拟赛
¥15 下图接收小电路，谁知道原理
¥15 装 pytorch 的时候出了好多问题，遇到这种情况怎么处理？
¥20 IOS游览器某宝手机网页版自动立即购买JavaScript脚本
¥15 手机接入宽带网线，如何释放宽带全部速度
¥30 关于#r语言#的问题：如何对R语言中mfgarch包中构建的garch-midas模型进行样本内长期波动率预测和样本外长期波动率预测
¥15 ETLCloud 处理json多层级问题
¥15 matlab中使用gurobi时报错
¥15 这个主板怎么能扩出一两个sata口
¥15 不是，这到底错哪儿了😭

Elasticsearch。 如何结合快速搜索实现以下原则？

1条回答 默认 最新

悬赏问题

Elasticsearch。如何结合快速搜索实现以下原则？

1条回答默认最新