weixin_39949386
2020-12-25 22:54 Views: 0

Searching for Chinese characters returns entries whose descriptions contain the searched characters scattered randomly through the text

Hi! When I search for Chinese text, I get results that do not exactly match the text I searched for. For example, searching for "自动提取代码库标签" returns, among others, a result with the following description (the bold characters are the ones I searched for): SqlMapConfig.xml文件使用连接池的方式解决了数据连接创建和释放频繁所造成的性能影响。 2. 大量的sql存在于代码之中,造成代码的可维护性低。mybatis只用xml文件对sql进行统一管理,方便维护。 3. jdbc操作中存在参数时,需要准确的定位参数的位置和对应占位符的个数,否则会出错。mybatis通过供参数对象的方式解决了该问题。 4. sql语句在编写时,如果存在态条件则不容易处理。mybatis态sql编写机制,时用户可以根据己传入参数的情况进行sql语句的态编写

Is there any way to search for an exact combination of characters? Thanks!

This question comes from the open-source project: ipfs-search/ipfs-search


4 answers

  • weixin_39993989 2020-12-25 22:54

    I'm not sure. We're using Elasticsearch. The query is made in this file: https://github.com/ipfs-search/ipfs-search-api/blob/master/search.js

    As you can see, we take the query literally from the input. But perhaps there are UTF-8 issues, or perhaps Elasticsearch has difficulties with Chinese. You could run the crawler on your own machine using Docker, index a few Chinese files, and experiment with it. You could also look for any known issues involving Elasticsearch and Chinese characters.
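To illustrate the difference between matching any term and matching an exact phrase, here is a minimal sketch of three Elasticsearch query bodies built as plain Python dictionaries. The field name `description` and the helper names are assumptions made for illustration; they are not taken from search.js. Elasticsearch's default `standard` analyzer emits each Chinese character as its own token, so a `match` query (whose operator defaults to OR) can hit documents that contain only some of the characters, scattered anywhere in the text. A `match_phrase` query, or a double-quoted phrase in `query_string` syntax, requires the tokens to appear adjacent and in order.

```python
import json


def match_query(field: str, text: str) -> dict:
    """Full-text query: a document matches if ANY analyzed token matches."""
    return {"query": {"match": {field: text}}}


def match_phrase_query(field: str, text: str) -> dict:
    """Phrase query: all tokens must appear adjacent and in the same order."""
    return {"query": {"match_phrase": {field: text}}}


def quoted_query_string(text: str) -> dict:
    """query_string query with the input wrapped in double quotes (phrase search).

    Backslashes and double quotes in the input are escaped so that they
    cannot terminate the quoted phrase early.
    """
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return {"query": {"query_string": {"query": f'"{escaped}"'}}}


if __name__ == "__main__":
    term = "自动提取代码库标签"
    # "description" is a hypothetical field name used only for this example.
    for body in (
        match_query("description", term),
        match_phrase_query("description", term),
        quoted_query_string(term),
    ):
        print(json.dumps(body, ensure_ascii=False))
```

If the API passes the user's input straight into a `query_string` query, typing the search term inside double quotes may already be enough to get phrase matching; this has not been verified against the live index.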

  • weixin_39949386 2020-12-25 22:54

    Are you using any analyzer plugin for Chinese (see https://www.sitepoint.com/efficient-chinese-search-elasticsearch/)?

  • weixin_39993989 2020-12-25 22:54

    We’re not using any analyser for any languages at the moment. We’d love to but we’d need help integrating it into our infrastructure. We currently have all languages in the same index.
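One way to add language-aware analysis while keeping all languages in a single index is a multi-field mapping: the text stays indexed with the `standard` analyzer, and a sub-field indexes the same text with Elasticsearch's built-in `cjk` analyzer, which emits overlapping two-character tokens for CJK text. The sketch below is only an assumption about how this could look. The field name `description`, the sub-field name `cjk`, and the helper names are hypothetical, and the typeless mapping format assumes Elasticsearch 7 or later. Plugin analyzers such as `smartcn` could be swapped in for `cjk` if the plugin is installed.

```python
import json


def mapping_with_cjk_subfield(field: str = "description") -> dict:
    """Index body whose text field gets an extra sub-field using the `cjk` analyzer."""
    return {
        "mappings": {
            "properties": {
                field: {
                    "type": "text",
                    "analyzer": "standard",  # unchanged default behaviour
                    "fields": {
                        # Same text, analyzed a second time as CJK bigrams.
                        "cjk": {"type": "text", "analyzer": "cjk"}
                    },
                }
            }
        }
    }


def phrase_query_over_both(text: str, field: str = "description") -> dict:
    """Phrase search over the main field and its hypothetical `.cjk` sub-field."""
    return {
        "query": {
            "multi_match": {
                "query": text,
                "type": "phrase",
                "fields": [field, f"{field}.cjk"],
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(mapping_with_cjk_subfield(), indent=2))
    print(json.dumps(phrase_query_over_both("自动提取代码库标签"), ensure_ascii=False))
```

Documents that are already indexed would need to be reindexed before the new sub-field contains any tokens.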

    On 13 Feb 2019, at 10:21, Rkrushanovskij wrote:

    Are you using any analyzer plugin for Chinese (see https://www.sitepoint.com/efficient-chinese-search-elasticsearch/)?

  • weixin_39949386 2020-12-25 22:54

    Got it, thanks!

