使用mysql提高数据库查询的速度

I have the following query for a type-ahead search (as you type into the form it displays matches in a drop down). This query worked well until I switched to a database with about a million records. Now it takes 15 seconds for the match to be displayed. Because search hits are displayed as you type, the query is inside a loop. Is there anything about this query that can be changed to speed it up?

$diagnosis = isset($_GET['diagnosis']) ? $_GET['diagnosis'] : '';

$data = array();

if ($diagnosis) {
$query = explode(' ', $diagnosis);

for ($i = 0, $c = count($query); $i < $c; $i ++) {
    $query[$i] = '+' . mysql_real_escape_string($query[$i]) . '*';
}

$query = implode(' ', $query);

$sql = "SELECT diagnosis, icd9, MATCH(diagnosis) AGAINST('$query' IN BOOLEAN MODE) AS relevance 
        FROM icd10 WHERE MATCH(diagnosis) AGAINST('$query' IN BOOLEAN MODE) HAVING relevance > 0 ORDER BY relevance ";

$r = mysql_query($sql);

    while ($row = mysql_fetch_array($r)) {
        $data[] = $row;
    }
}

echo json_encode($data);
exit;

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
duandi2853 2016-04-25 19:57
关注
You can try some stuff:

First, make sure you have a fulltext index for diagnosis. Second, make sure you have a fulltext index for diagnosis! A million rows isn't that much (depending on the number of words in diagnosis of course), so that just already might be the problem.

Then try the following code:

SELECT diagnosis, icd9, MATCH(diagnosis) AGAINST('$query' IN BOOLEAN MODE) AS relevance FROM icd10 ORDER BY relevance desc limit 30

(It might not be obvious that this is faster, and it might not be, so just try it).

If you need to support short words, e.g. if 3 digit icd9-codes are entered often, you should check your ft_min_word_len / innodb_ft_min_token_size-values (depending on your database) to make sure they are included in the index - but be aware it will increase your index size. Maybe check the stopwords.

You didn't specify your setup; you can often improve general database performance by e.g. changing settings, hdds or ram. Especially ram.

Some general ideas: You might want to call the function asynchronously (the user should be able to type while the query runs). As soon as you hit less than 30 results (or whatever limit you set), you can just filter the remaining results on the fly in php (as long as the query gets longer/no words are removed) - it's the closest you get to a cache. Or set the limit to 1000 and filter manually afterwards, php regex is fast too, you just need a score-function. Depending on your data, you might want to not run the query when you just add a single letter to the query (every text will contain a word beginning with an "a", so you might not get a better result - that might not be the case for "q" though). That won't reduce runtime of the query, but you can just save one execution.
解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

在AJAX请求中从MySQL数据库中获取大数据 ajax mysql php
2014-05-25 17:56

回答 1 已采纳 First var_dump($row); and check it is as you expect. Then instead of echo $key.' <input
大学数据库mysql 求解答下 mysql
2022-10-11 15:18

回答 3 已采纳 -- 1 写出创建Course表的语句。 CREATE TABLE course ( cno char(6) NOT NULL COMMENT '课程号', cname varchar
Mysql查询结果分组 mysql 数据库
2022-03-10 11:43

回答 1 已采纳你是只需要一个汇总sql,还是要用sql输出json数组？还有你mysql的版本是多少？假设只是汇总数据 select sum(今日上报) 今日上报, sum(办理中) 办理中, sum(待受理)
大数据知识点mysql数据库
2023-04-24 08:41

大数据知识点mysql数据库
关于mysql关系数据库问题 mysql
2022-04-26 16:37

回答 4 已采纳是的，很多搜索引擎也就是通过这样把数据记录成json保存在内存中
请教MySql多表查询统计的相关问题 mysql 数据库开发
2022-03-15 23:16

回答 1 已采纳 select 子查询思路，如果子查询报错就把case when放到外面，里面只计算count返回一行一列 select a.* , (select case when count(data3) &
MySQL replace 使用中出现了问题 mysql sql 数据库
2021-12-16 10:06

回答 2 已采纳已自行解决，改用when...then...来解决 CASE 字段名 WHEN 'Coupe' THEN 'Tayro
大数据环境下基于MySQL的数据库架构设计与实现.pdf
2021-10-10 09:29

大数据环境下基于MySQL的数据库架构设计与实现.pdf
mysql 搭建本地数据库 mysql
2015-06-05 01:26

回答 13 已采纳请先使用“数据库设计”文件夹中的“导入数据库用脚本（完整） wl_db.sql”文件中的SQL脚本来创建程序所需要使用的数据库、表、触发器、存储过程、以及表的数据。数据库为mysql。，这句话是说
mysql数据库锁机制及隔离级别 mysql 数据库
2017-08-14 05:53

回答 3 已采纳数据库的隔离机制是针对数据库的，比如innodb引擎，默认会是可重复读等，你可以通过修改设置来使数据库用其他的隔离机制而锁是针对数据库的表和行，当你对数据库进行操作时，数据库内部会根据条件对表
mysql表记录很多查询慢是否影响插入 mysql 数据库
2017-05-03 08:00

回答 3 已采纳在查询一个表时，服务器首先会对这个对象加锁，为了保证数据的统一性，所以在查询的过程中，如果前台插入语句，是需要等待的。具体解释可以看下sql执行过程： http://www.cnblogs.
Java大数据期末：银行管理系统(MySQL数据库)
2024-02-19 20:43

通过Java控制台开发一个银行管理系统，使用MySQL作为后台数据，实现银行管理员工功能和顾客功能。具体要求如下：（1）管理员功能：登录、添加顾客、删除顾客、计算存储金额、富豪排行榜、退出。（2）顾客功能：...
在sql中多大的数据才算是大数据？ java mysql 数据库
2022-03-31 17:24

回答 5 已采纳其实没有实际的标准明确定义多少数据量算大数据，不过阿里开发手册中建议，表数据超过500万条时，建议考虑分表，以防影响查询效率，不过我们公司也有单表超过几千万条的数据，效率确实不高，所以理论上百万级别以
Java提升数据库大数据查询速度的几种方式
2023-03-01 11:17

吾疾唯君医的博客 Java提升数据查询速度常见的几种方案
大数据-Superset安装篇（二）Python3.8环境+MySQL元数据库
2023-06-18 03:34

base.txt
没有解决我的问题, 去提问

悬赏问题

¥15 下图接收小电路，谁知道原理
¥15 装 pytorch 的时候出了好多问题，遇到这种情况怎么处理？
¥20 IOS游览器某宝手机网页版自动立即购买JavaScript脚本
¥15 手机接入宽带网线，如何释放宽带全部速度
¥30 关于#r语言#的问题：如何对R语言中mfgarch包中构建的garch-midas模型进行样本内长期波动率预测和样本外长期波动率预测
¥15 ETLCloud 处理json多层级问题
¥15 matlab中使用gurobi时报错
¥15 这个主板怎么能扩出一两个sata口
¥15 不是，这到底错哪儿了😭
¥15 2020长安杯与连接网探

使用mysql提高数据库查询的速度

1条回答 默认 最新

悬赏问题

1条回答默认最新