MySQL SELECT query performance has degraded

I have a MySQL query that counts the clicks for each hour of a day. It worked well until we had a lot of click entries in our database. Now it sometimes needs several seconds (up to 9!) to return the data.

The query is:

SELECT h.clickHour, COUNT(clicktime) AS c
      FROM ( SELECT 0 AS clickHour
             UNION ALL SELECT 1
             UNION ALL SELECT 2
             UNION ALL SELECT 3
             UNION ALL SELECT 4
             UNION ALL SELECT 5
             UNION ALL SELECT 6
             UNION ALL SELECT 7
             UNION ALL SELECT 8
             UNION ALL SELECT 9
             UNION ALL SELECT 10
             UNION ALL SELECT 11
             UNION ALL SELECT 12
             UNION ALL SELECT 13
             UNION ALL SELECT 14
             UNION ALL SELECT 15
             UNION ALL SELECT 16
             UNION ALL SELECT 17
             UNION ALL SELECT 18
             UNION ALL SELECT 19
             UNION ALL SELECT 20
             UNION ALL SELECT 21
             UNION ALL SELECT 22
             UNION ALL SELECT 23 ) AS h
    INNER JOIN links l ON l.user_id = 1
    LEFT OUTER
      JOIN clicks
        ON EXTRACT(HOUR FROM clicks.clicktime) = h.clickHour
          AND DATE(clicks.clicktime) = '2014-09-21'
          AND clicks.link_id = l.id
    GROUP
        BY h.clickHour

I use these unions because I need a click count for every hour, including hours with no clicks. Please help!

OK, so we are talking about 0 to several thousand rows for the table clicks. The click time is stored as a timestamp, and every click has a unique id. I see that the union approach is bad and that I have to change it.

What I am trying now is to select all clicks of a day grouped by HOUR(clicktime), but when I do so I get too many results, about 10x more than there should be.

dsy19890123 2014-09-29 05:20
I'd rewrite the query like this:

SELECT h.clickHour
     , IFNULL(d.clickCount,0) AS c
  FROM ( SELECT 0 AS clickHour
         UNION ALL SELECT 1
         UNION ALL SELECT 2
         UNION ALL SELECT 3
         UNION ALL SELECT 4
         UNION ALL SELECT 5
         UNION ALL SELECT 6
         UNION ALL SELECT 7
         UNION ALL SELECT 8
         UNION ALL SELECT 9
         UNION ALL SELECT 10
         UNION ALL SELECT 11
         UNION ALL SELECT 12
         UNION ALL SELECT 13
         UNION ALL SELECT 14
         UNION ALL SELECT 15
         UNION ALL SELECT 16
         UNION ALL SELECT 17
         UNION ALL SELECT 18
         UNION ALL SELECT 19
         UNION ALL SELECT 20
         UNION ALL SELECT 21
         UNION ALL SELECT 22
         UNION ALL SELECT 23
       ) h
  LEFT
  JOIN ( SELECT EXTRACT(HOUR FROM c.clicktime) AS clickHour
              , SUM(1) AS clickCount
           FROM clicks c
           JOIN links l
             ON l.user_id = 1
            AND l.id = c.link_id
          WHERE c.clicktime >= '2014-09-21'
            AND c.clicktime <  '2014-09-21' + INTERVAL 1 DAY
          GROUP
             BY EXTRACT(HOUR FROM c.clicktime)
       ) d
    ON d.clickHour = h.clickHour

The approach here is to get the inline view query d to return a maximum of 24 rows. This cranks through the clicks table to get the counts. We're going to defer the join operation to the fixed set of 24 rows until after we have calculated the hourly counts. (The join to h is there only to get rows with zero counts returned, which would otherwise just be "missing" rows.)

You can test the performance of the inline view query d and of the entire query; I suspect there won't be much difference. The cost of materializing the inline view h isn't that much (there's some overhead, but it's very likely that it will use the Memory storage engine; it's small enough and it should be a simple integer datatype). And that join operation of 24 rows to 24 rows won't be that expensive, even without any indexes available.

I suspect that the majority of time will be in materializing the derived table d.

We're going to want an index with a leading column of clicktime, so that MySQL can use a more efficient index range scan operation and avoid evaluating expressions for every flipping row in the table.

I changed this predicate, DATE(clickTime) = '2014-09-21', into predicates that reference the bare column. This enables MySQL to consider an efficient range scan operation on the clickTime column (to quickly eliminate a boatload of rows from consideration), rather than requiring that MySQL evaluate a function on every flipping row in the table.
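
One way to check the effect of this change is to compare the EXPLAIN output for the two forms of the predicate. This is only a sketch: it assumes an index whose leading column is clicktime already exists on clicks, and the plan MySQL actually picks depends on the server version and the data.

```sql
-- Function wrapped around the column: an index on clicktime typically
-- cannot be used for a range scan here, so expect every row (or every
-- index entry) to be examined.
EXPLAIN
SELECT COUNT(*)
  FROM clicks c
 WHERE DATE(c.clicktime) = '2014-09-21';

-- Bare column compared against a half-open range: a range scan on an
-- index that leads with clicktime becomes possible.
EXPLAIN
SELECT COUNT(*)
  FROM clicks c
 WHERE c.clicktime >= '2014-09-21'
   AND c.clicktime <  '2014-09-21' + INTERVAL 1 DAY;
```

In the EXPLAIN output, the type and rows columns are the ones to compare between the two statements.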

Some performance gain may be obtained by making covering indexes available on the clicks and links tables (so that the query can be satisfied from the indexes, without a need to visit pages in the underlying table.)

At a minimum on the clicks table:

ON clicks (clickTime, link_id)

If id is unique (or primary key) on the links table, this index may not give any performance benefit:

ON links (id, user_id)

If a covering index is used, the EXPLAIN output should show "Using index".
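
The index definitions above could be written out as the following statements. The index names clicks_ix1 and links_ix1 are made up for illustration; the table and column names are the ones used in the query. Running EXPLAIN on the inline view d afterwards is one way to check whether "Using index" appears in the Extra column.

```sql
-- Hypothetical index names; columns as listed above.
CREATE INDEX clicks_ix1 ON clicks (clicktime, link_id);
CREATE INDEX links_ix1  ON links  (id, user_id);

-- Inspect the plan for the inline view d on its own.
EXPLAIN
SELECT EXTRACT(HOUR FROM c.clicktime) AS clickHour
     , SUM(1) AS clickCount
  FROM clicks c
  JOIN links l
    ON l.user_id = 1
   AND l.id = c.link_id
 WHERE c.clicktime >= '2014-09-21'
   AND c.clicktime <  '2014-09-21' + INTERVAL 1 DAY
 GROUP
    BY EXTRACT(HOUR FROM c.clicktime);
```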

I don't see a way around the "Using filesort" operation, not without adding a column to clicks table that stores the clickTime truncated to the hour. With a column like that, and an appropriate index, it's possible that we could get the GROUP BY operation optimized using the index, avoiding the "Using filesort" operation.
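
A sketch of that idea, under stated assumptions: the column name click_hour and the index name clicks_ix2 are hypothetical, and the application (or a trigger) would have to keep the new column in step with clicktime on every insert. Whether MySQL then avoids "Using filesort" depends on the plan it chooses, so it is worth confirming with EXPLAIN.

```sql
-- Hypothetical extra column holding clicktime truncated to the hour.
ALTER TABLE clicks ADD COLUMN click_hour DATETIME NULL;

-- Backfill existing rows. clicktime is assigned to itself so that a
-- TIMESTAMP column defined with ON UPDATE CURRENT_TIMESTAMP is not
-- silently reset to the current time by this statement.
UPDATE clicks
   SET click_hour = DATE_FORMAT(clicktime, '%Y-%m-%d %H:00:00')
     , clicktime  = clicktime;

CREATE INDEX clicks_ix2 ON clicks (click_hour, link_id);

-- The inline view d, rewritten to filter and group on the bare column.
SELECT HOUR(c.click_hour) AS clickHour
     , SUM(1) AS clickCount
  FROM clicks c
  JOIN links l
    ON l.user_id = 1
   AND l.id = c.link_id
 WHERE c.click_hour >= '2014-09-21'
   AND c.click_hour <  '2014-09-21' + INTERVAL 1 DAY
 GROUP
    BY c.click_hour;
```

Because the range covers a single day, grouping by click_hour yields at most 24 groups, the same as grouping by the extracted hour.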
This answer was selected as the best answer by the asker.
