过滤MySQL结果的最佳实践

I want to implement a filter-function in my PHP project. To implement a filter, I usually just add a WHERE clause in my query to show filtered results.

My Problem is: These filters require not only a smple added WHERE clause, but a huge Query including multiple JOINs. The resulting Query has > 30 lines.

Later, there should also be a search function which would then also require this huge query. I wonder if this is a good practice or if I should add a "redundant" Database column to my database table where I compute the attribute I need for filtering on every update. With this column, I wouldnt have my huge query on different places over my project, but have a redundant column.

What do you think?

Greetings

As questioned, here the table structure/code. This is not the exact code, because there is also a revision system which makes it even more complex, but for understanding this is enough:

table submissions:

ID (primary)
(additionalColumns)

table reports:

ID (primary)
submissionID (reference to submission table)
(additionalColumns)

table report_objects:

reportID (reference to reports table, multiple report_object for one report)

table accounting:

ID (primary)
reportID (reference to reports table, multiple accountings for one report)
(additionalColumns)

table accounting_objects:

ID
accountingID (reference to accounting table, multiple accounting_object for one accounting)
(additionalColumns)

For a submission, one or multiple reports are being create with multiple objects to account (report_objects). For each report, I can create multiple accountings, where each accounting is for a few objects of the report. The accounted report_objects are stored in accounting_object

My query/filter checks, if each report_object of a submissionID is accounted (accounting_object exists) for one submissionID.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongquan8753 2012-12-14 10:21
关注
There isn't one definitive answer and, in practice, if it works and runs quickly enough for your needs then you can leave it as is. Optimization is always something you can come back to.

Joining correctly

If you are simply checking for the existence of a join table and only including the results with that join you can do this through the correct LEFT / RIGHT JOIN expressions. This is always the first call.

Expressiveness

Also be as expressive as you can with SQL, you want to give it the best chance to optimize your query, there are keywords such as EXISTS, for example, make sure to use them.

Denormalization

You can add in a column that stores the computed value, the complexity that arises out of this is ensuring that the value is always up to date. This can be done by triggers or manually. The pros:

It is the easiest method of getting around slowness introduced by computed columns.

The cons:

Ruins your nice normalized schema

If you do it manually in code, you will forget to do it somewhere, causing headaches.

Triggers can be a bit of a pain.

Materialized view

This is like denormalization but prevents polluting your normalized tables by created a stored view. This is achieved in MySQL by storing the result of your complex select into a results table when the values change. Again, the same as denormalization, the complexity is keeping this up to date. It is typically done with triggers. This can be a pain but keeps the complexity out of your schema. As mentioned by@eggyal it isn't a supported feature of MySQL yet so you will have to DIY... Materialized views with MySQL

Pros:

Keeps dirty denormalized stuff away from your nice normalized schema.

Cons:

Materialized views aren't supported so setting them up requires work.

If you trigger the refresh of your views in code you get stale data, but isn't quite as painful as the single column staleness of denormalization.

Triggers can be a bit of a pain.

If you aren't sure, and it really matters, do some benchmarking.

EDIT If you code has this query in one form or another across your code base then that has the possibility of cause headaches in future as you will have to remember to change the statements in all of the places if or when they change.

If by doing the above you have made your statements really simple and concise then they may differ enough from each other for it to not be a problem.

You can do some things to help you out:

Put all of the related queries in a single place, i.e. a single class or script that handles this query in its various forms. This way at least all of the changes are limited to the one file.

You can, to help yourself out a bit more, do a bit of refactoring it to remove duplication between the queries.

Also, If you feel the database information is too exposed to the code you may want to abstract it behind a view.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

Mysql查询结果分组 mysql 数据库
2022-03-10 11:43

回答 1 已采纳你是只需要一个汇总sql,还是要用sql输出json数组？还有你mysql的版本是多少？假设只是汇总数据 select sum(今日上报) 今日上报, sum(办理中) 办理中, sum(待受理)
mysql数据同步到大数据后通过查询大数据实现报表的实时展示 big data java mysql 有问必答
2021-09-18 10:38

回答 4 已采纳你好像描述的有些问题吧？为什么查询大数据展示？你口中的大数据指定到底是什么？如果你是学习大数据（hadoop相关），使用java语言在做数仓项目，那么你可以参考一下这张图查询数据一般从数据库里面查
mysql大数据如何存储方便 java mysql
2019-12-28 10:42

回答 2 已采纳你修改过问题？500亿以上数据，根本不适合使用mysql这样的关系型数据库，应该用neo4j这种图数据库 https://baike.baidu.com/item/Neo4j/9952114?fr=
开源大数据OLAP引擎最佳实践
2022-04-18 09:11

zhisheng_blog的博客本篇内容将通过六个部分来介绍开源大数据OLAP引擎最佳实践。一、开源OLAP综述二、开源数仓解决方案三、ClickHouse介绍四、StarRocks介绍五、Trino介绍六、客户案例01开源OLAP综述如今的开源数据引擎多种多样，不同...
mysql两个查询结果求和 mysql
2021-01-12 18:03

回答 2 已采纳参考：，直接将sql放字段就行了 select (select 1 from dual) ，(select 2 from dual) from dual
Ubuntu java连接Mysql 失败 java mysql 大数据
2022-07-09 11:38

回答 3 已采纳 dbUrl 不要以&结尾，另外，试试在dbUrl中加上 &useSSL=false，禁用ssl
Mysql插入大数据 mysql php
2013-09-28 04:38

回答 7 已采纳 Assuming that you are using InnoDB engine (which is default in most recent MySQL versions), you sh
大数据最佳实践-hive
2021-04-21 08:06

猿与禅的博客笛卡尔积行列过滤合理设置 Map 及 Reduce 数合理设置 Reduce 数并行执行 JVM重用压缩安装 Tez 引擎（了解）小文件问题分区表 join优化 Group By优化 HiveServer2内存配置 HiveServer2性能最佳实践 ...
mysql多个查询结果组合 mysql
2019-06-11 14:19

回答 2 已采纳 select * from article where title like '%"+search+"%' order by time union select * from article wh
mysql,将两个查询语局结果横向合并 mysql
2018-05-21 14:02

回答 4 已采纳 ``` select (第二个查询) as repairnum, (第一个查询) as leftnum ```
mysql修改数据没有反应是为什么 mysql sql 大数据
2023-04-02 16:21

回答 4 已采纳先用select查一波，确定这个编号有数据，再update
大数据最佳实践-hbase
2021-04-18 22:45

猿与禅的博客 Bloomfilter设置是否合理 Bloomfilter主要用来在查询时，过滤HFile的，避免不需要的IO操作。Bloomfilter能提高读取的性能，一般情况下创建表，都会默认设置为：BLOOMFILTER => ‘ROW’ 提升实时读数据效率调用...
虚拟机中安装mysql服务器失败找不到路径 hadoop mysql 大数据
2022-06-07 16:24

回答 2 已采纳你是要从自己挂载的镜像里面来安装吗？如果是这样的话你需要在yum的配置文件里面添加一个挂载路径的本地源，这样才可以正常工作的。可以参考： CentOS软件管理 - YUM
大数据最佳实践-kafka
2021-04-16 12:48

猿与禅的博客如何实现消息幂等如何提高消费速度消息过滤消息广播消息格式开发规范 kafkastream API 代码实战如何处理大消息群集大小调整 zookeeper调优 kafka 与 spark streaming 集成,如何保证 exactly once 语义通过...
开源大数据 OLAP 引擎最佳实践
2022-03-23 09:30

云祁的博客本篇内容将通过六个部分来介绍开源大数据OLAP引擎最佳实践。01开源OLAP综述如今的开源数据引擎多种多样，不同种类的引擎满足了我们不同的需求。现在ROLAP计算存储一体的数据仓库主要有三种，即StarRocks...
没有解决我的问题, 去提问

悬赏问题

¥15 Vue3 大型图片数据拖动排序
¥15 划分vlan后不通了
¥15 GDI处理通道视频时总是带有白色锯齿
¥20 用雷电模拟器安装百达屋apk一直闪退
¥15 算能科技20240506咨询（拒绝大模型回答）
¥15 自适应 AR 模型参数估计Matlab程序
¥100 角动量包络面如何用MATLAB绘制
¥15 merge函数占用内存过大
¥15 使用EMD去噪处理RML2016数据集时候的原理
¥15 神经网络预测均方误差很小但是图像上看着差别太大

过滤MySQL结果的最佳实践

1条回答 默认 最新

Joining correctly

Expressiveness

Denormalization

Materialized view

悬赏问题

1条回答默认最新