从后缀表中获取前十名设备

I've been developing an application that show what are the top equipment that has problems in the system. For that I´ve create a tables like:

---------------------      ---------------------
- equipments_201404 -      - equipments_201405 -
---------------------      ---------------------
- id                -      - id                -
- equipName         -      - equipName         -
- dateTime          -      - dateTime          -
- ...               -      - ...               -
---------------------      ---------------------

This kind of separation has to be with the amount of data that has to be storage. Because that, I wondering if there is a way to obtain the top ten equipment in a query or through PHP.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
drpqxogph15436713 2014-05-02 14:37
关注
If you have less than approximately one hundred thousand trouble reports (entries in your equipments_* tables per month, than splitting the records into separate tables by month, or partititioning the tables, is definitely a bad idea. MySQL does just fine at handling tables containing dozens of millions of rows. Just fine. Seriously.

There are tens of thousands of successful applications in the world on modestly sized MySQL servers handling data sets of this size.

On the other hand, systems that employ partitioning require constant maintenance.

If your experience is to the contrary, it's because you haven't figured out how to use indexing and querying correctly. We can't tell from your question what kind of queries you are running in routine production, so it's not possible to give you clear advice about indexing. That being said, I guess it makes sense to put an index on (dateTime,id).

If you had one table rather than one per month as I suggest, you could do this to get your top ten equipment failures.

SELECT equipName FROM equipments GROUP BY equipName ORDER BY COUNT(*) DESC LIMIT 10

If you wanted the top ten failures for the 6 month period ending at the present time, you could use this query.

SELECT equipName FROM equipments WHERE dateTime >= NOW() - INTERVAL 6 MONTH GROUP BY equipName ORDER BY COUNT(*) DESC LIMIT 10

This query would be made very efficient by a compound index on (dateTime, equipName) even for a dataset containing millions of rows spanning decades of time.

As it is, you have split your data into monthly tables. Here's how you can deal with that. First: use a sequence of UNION ALL operations to create a virtual table containing all the data. If all your monthly tables have the same columns in the same order, that's pretty easy if a little boring.

SELECT * FROM equipments_201404 UNION ALL SELECT * FROM equipments_201403 UNION ALL SELECT * FROM equipments_201402 UNION ALL SELECT * FROM equipments_201401 UNION ALL SELECT * FROM equipments_201312 UNION ALL SELECT * FROM equipments_201311 UNION ALL SELECT * FROM equipments_201310 UNION ALL SELECT * FROM equipments_201309 UNION ALL SELECT * FROM equipments_201308 UNION ALL SELECT * FROM equipments_201307 UNION ALL SELECT * FROM equipments_201306 UNION ALL SELECT * FROM equipments_201305 UNION ALL SELECT * FROM equipments_201304 /* etc etc you get the idea */

If you issue this query you'll get all your records as if they were in one table. Then you can use that as a subquery in the query shown above, as follows.

SELECT equipName FROM ( SELECT * FROM equipments_201404 UNION ALL SELECT * FROM equipments_201403 UNION ALL SELECT * FROM equipments_201402 UNION ALL SELECT * FROM equipments_201401 UNION ALL SELECT * FROM equipments_201312 UNION ALL SELECT * FROM equipments_201311 UNION ALL SELECT * FROM equipments_201310 UNION ALL SELECT * FROM equipments_201309 UNION ALL SELECT * FROM equipments_201308 UNION ALL SELECT * FROM equipments_201307 UNION ALL SELECT * FROM equipments_201306 UNION ALL SELECT * FROM equipments_201305 UNION ALL SELECT * FROM equipments_201304 /* etc etc you get the idea */ ) AS equipments WHERE dateTime >= NOW() - INTERVAL 6 MONTH GROUP BY equipName ORDER BY COUNT(*) DESC LIMIT 10

This lets you fake your main summary query into thinking it has one set of uniform data to process. Of course, indexes won't help much here.

Obviously, I have included too many monthly tables in the six-month query. You can fix that. But you'll need to fix it every month.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

从后缀表中获取前十名设备 mysql php
2014-05-02 10:54

回答 1 已采纳 If you have less than approximately one hundred thousand trouble reports (entries in your equipmen
python怎么实现多文件夹后缀名与excel数据一一对应并呈现在一张表中 python 数据分析
2022-07-26 17:33

回答 2 已采纳已解决代码现成
利用链表堆栈进行后缀表达式运算 c++ 链表
2021-08-18 23:53

回答 1 已采纳利用栈实现中缀转化成后缀表达式并求值_溪月~的博客-CSDN博客后缀表达式假设我们计算一个表达式：4*2+5+6*7=，他的计算顺序可以是将4*2的值存为A1，然后将A1和5相加，在将结果
大数据技术
2022-09-16 22:13

clown空城的博客 大数据架构
关于在共享后缀的链表遇到段错误求解链表
2022-03-21 20:42

回答 1 已采纳我觉得应该是这段代码导致段错误。 while(Q&&S&&Q->Data==S->Data)//这里要判断一下Q和S是否是NULL { Q=Q->Next; S=S-
关于数据结构前中后缀表达式的问题 c++ 开发语言
2019-06-07 17:41

回答 1 已采纳 ``` 有中缀转前缀，因为中缀是人们日常使用的。所以一般都是中缀转前/后缀。而前缀转后缀，则可以用中缀过渡一下。好比我们将x进制转换为y进制，一般来说都是x进制转10进制，10进制转y进制。
如何用C++获取程序名？ c++
2022-04-03 17:10

回答 3 已采纳 emmm，默认情况下不会给main函数传参吧。获取程序名似乎可以用GetModuleFileName还有，你这个s2没内存啊，第二个for循环改成 for (int i = flag; i <
大数据技术——Linux常用命令（入门Hadoop前需掌握）
2022-08-09 20:42

彬彬弟的博客 Linux命令大全——每天都是工作在黑色背景的命令行环境中. 自己记忆力不好, 很多有用的Linux命令不能很好的记忆, 现在逐渐总结一下, 以便后续查看。
python1-100中的练习:设计一个函数返回给定文件名的后缀名 python
2021-11-04 23:41

回答 1 已采纳点后面至少有一个字符，否则当点在最后一个不是后缀
后缀名改了Java没有用，求解答 java
2022-09-13 11:17

回答 3 已采纳谁让你乱改扩展名的扩展名只是用来告诉系统，这个文件应该用什么工具打开它本身的信息（二进制数据）并没有改变比喻一下：扩展名好比门锁上贴的小纸条，告诉你这里应该用长钥匙还是短钥匙你把小纸条撕掉重新贴，并不
关于excel的后缀名从xlsx自动变换成xlsm，里面的数据全部没了 windows
2022-01-12 14:13

回答 1 已采纳我之前也遇到过这种问题，是在别人的电脑上，我自己电脑就没事
大数据概论
2022-09-05 21:27

zjh0101的博客 大数据概轮
机器语言的后缀名（扩展名）和机器语言的编译或解释代码方法是什么？
2020-02-09 14:38

回答 1 已采纳机器语言和后缀名没有必然关系，但是在windows系统上，典型的是 .exe .dll .com .sys 等机器语言不存在所谓的编译或者解释（除非异构cpu的模拟器）。高级语言才需要编译和解释。
大数据实训
2023-08-04 15:33

码喵喵的博客 大数据实训，hadoop分布式集群，mapreduce数据处理，spark数据处理，大数据可视化，企业级项目实战。
【大数据面试题大全】大数据真实面试题（持续更新）
2023-04-28 16:53

bmyyyyyy的博客 Flink 是一个分布式的流式数据的处理引擎，对于有界和无界数据进行状态计算，提供了很多便于用户编写分布式任务的 API，有 DataSetAPI，但是新版本中已经被舍弃了，即将淘汰了，现在用的是 DataStreamAPI，还有一些 ...
没有解决我的问题, 去提问

悬赏问题

¥15 微信会员卡接入微信支付商户号收款
¥15 如何获取烟草零售终端数据
¥15 数学建模招标中位数问题
¥15 phython路径名过长报错不知道什么问题
¥15 深度学习中模型转换该怎么实现
¥15 HLs设计手写数字识别程序编译通不过
¥15 Stata外部命令安装问题求帮助！
¥15 从键盘随机输入A-H中的一串字符串，用七段数码管方法进行绘制。提交代码及运行截图。
¥15 TYPCE母转母，插入认方向
¥15 如何用python向钉钉机器人发送可以放大的图片？

从后缀表中获取前十名设备

1条回答 默认 最新

悬赏问题

1条回答默认最新