用PHP创建自然语言搜索引擎

I'm trying to code up a natural language parser and search engine in PHP. All of the ways that I have thought of thus far have been either cumbersome to implement, use, or not that efficient.

One of my ideas included a script that would perform regular expression on a simplified string, ie. various words removed from the string, and then the resulting string checked first for what the user is looking for - ie, "opening times", then if possible the venue they're searching for - lets say "Derngate". The rest is similar to that.

Can anyone point me in the direction of a more efficient way of doing things? I don't want to be doing 25 different regular expressions - or what ever the count is - per each page load if I can help it.

Many thanks!

Edit: I'm just curious, that's all. I'd rather make my own (to see how it works) rather than jumping into something like Lucene.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongwen2794 2011-08-23 19:02
关注
I think that after a review of the state of the art, I'd look at root/stem word extraction as a start. (Not too heavy a task if your document corpus is relatively static, since this can be done at document-capture time.)

There's a PHP extension for that, stem. http://pecl.php.net/package/stem

There's the Porter Stemmer implemented in PHP, that's the key operation in the above, implemented as a function.

本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(2条)

报告相同问题？

关注问题

用PHP创建自然语言搜索引擎 mysql php
2011-03-24 15:14

回答 3 已采纳 I think that after a review of the state of the art, I'd look at root/stem word extraction as a st
PHP动态创建多维数组 php 后端开发语言
2022-05-05 16:40

回答 2 已采纳 <?php $resultArr = array(); $arr = array(); while ($data = mysqli_fetch_assoc($result)) { $re
php能否实现以下功能？ mysql php 搜索引擎
2022-01-20 13:31

回答 2 已采纳肯定可以啊
2万字详解，彻底讲透全文搜索引擎 Elasticsearch
2022-04-11 09:30

Hollis Chuang的博客来源：cnblogs.com/jajian/p/11223992.html由于近期在公司内部做了一次 Elasticsearch 的...生活中的数据搜索引擎是对数据的检索，所以我们先从生活中的数据说起。我们生活中的数据总体分为两种：结构化数据非结构化...
PHP preg_match_all不处理大数据 laravel php
2018-05-16 06:59

回答 1 已采纳 The pattern at play matches balanced curly brackets using regex recursion. The pattern itself look
Yii2为ajax搜索字段创建多语言网站 php
2018-04-09 07:50

回答 1 已采纳 First set up multi language for your site there is doc for this. Best way of auto support multi l
我想问一下数据库中的索引和搜索引擎中所说的索引是不是不一样啊？？？ php 搜索引擎
2019-04-24 15:40

回答 2 已采纳不一样的，es中的索引是一个存储的模块，相当于关系数据库的DB；关系数据库中的索引是帮助表中的列，提升查询效率的，实际基于某个列创建。
php使用百度自定义ocr_使用PHP构建自定义搜索引擎
2020-06-30 18:26

cuxiong8996的博客在互联网时代，人们希望信息像快餐食品一样包装：即食，无忧且按口大小（或字节大小）包装。实际上，为了养活那些急躁而又饥饿的人，即使现在最谦虚的网站也可以提供各种快速格式... 搜索就像在您当地的自助餐厅吃...
php数据库bewteen使用，如何查询100万-200万之间的数据 php 搜索引擎有问必答算法
2021-08-26 23:30

回答 2 已采纳是什么数据库呢？如果是mysql的话，并且中文的字都是万的话，可以参考sql语句： select * from 表名 where REPLACE(`字段`, '万', '') between 100
PHP+MYSQL 搜索条件使用变量时双引号的运用 php 开发语言
2019-11-05 17:06

回答 3 已采纳我一般都是单引号操作,然后如果有需要拼接的话,则直接用英文符号 ' . '来拼接一条sql语句比如: $fieldVal = *; $where = 'where field = ' .
PHP 创建图片找不到 imageCreateTrueColor() php
2021-06-28 14:29

回答 1 已采纳 imagecreatetruecolor，都是小写的，其他的函数也是
全文搜索引擎 ElasticSearch 还是 Solr？
2022-05-18 14:06

程序IT圈的博客前言最近项目组安排了一个任务，项目中用到了基于 Solr 的全文搜索，但是该 Solr 搜索云项目不稳定，经常查询不出来数据，需要手动全量同步。而且它还是其他团队在维护，依赖性太强，导致 Solr 服务一出问题，我们的...
如何使用自定义字段构建php搜索引擎[关闭] github php
2013-02-02 07:41

回答 1 已采纳 Take a look at Sphinx and at PHP Sphinx clien.
ElasticSearch（搜索引擎）
2023-03-30 16:34

KeWS的博客 ElasticSearch：智能搜索，分布式的搜索引擎Elaticsearch，简称为 ES，是一个开源的高扩展的分布式全文检索引擎，特点：近乎实时的存储、检索数据；扩展性好，可以扩展到上百台服务器，处理PB级别的数据；使用 Java...
Elasticsearch搜索引擎
2022-10-26 23:37

栀铭、辞花洛的博客 Elasticsearch搜索引擎的介绍和基本使用
没有解决我的问题, 去提问

悬赏问题

¥15 HFSS 中的 H 场图与 MATLAB 中绘制的 B1 场部分对应不上
¥15 如何在scanpy上做差异基因和通路富集？
¥20 关于#硬件工程#的问题，请各位专家解答！
¥15 关于#matlab#的问题：期望的系统闭环传递函数为G(s)=wn^2/s^2+2¢wn+wn^2阻尼系数¢=0.707，使系统具有较小的超调量
¥15 FLUENT如何实现在堆积颗粒的上表面加载高斯热源
¥30 截图中的mathematics程序转换成matlab
¥15 动力学代码报错，维度不匹配
¥15 Power query添加列问题
¥50 Kubernetes&Fission&Eleasticsearch
¥15 報錯：Person is not mapped，如何解決？

用PHP创建自然语言搜索引擎

3条回答 默认 最新

悬赏问题

3条回答默认最新