Searching multiple content types

This is a generalization of a question that I've been struggling with for a while. My case is that I have a WordPress site with multiple "post types" (e.g. Articles, Blog Posts, Products, etc.). As is common practice these days, I want to display search results from each post type in separate categories.

The problem I have is in structuring the search. Should I run a separate database query for each post type, or should I run one big query and separate everything out via PHP? I lean towards the latter, but the problem I'm running into is with pagination. I would probably have to leave the LIMIT off the query, because with a LIMIT, many matches from one post type could fill the result set and crowd out every result from the other post types.

So, from a performance and general best practices standpoint, is it better to have one big query without a LIMIT clause, or to run several queries for each search?

Note: This is similar to a question I asked on the WordPress Stack Exchange site a while back. I accepted the multiple query solution then, but I'm still pretty unsure about this.

Accepted answer by dongliang_bj2016, 2012-08-16 19:19
In my experience, it is usually better to ask the database to do as little work as possible and let PHP do most of the heavy lifting. That is usually faster.

So, I would try doing two very simple queries (one for each table) and then merging/sorting them with PHP code.
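As a rough illustration of the "several small queries" approach, here is a minimal sketch in Python with an in-memory SQLite database rather than PHP and MySQL. The `posts` table, its columns, and the function names are all made up for the example and are not WordPress's actual schema. The point it shows is that each query carries its own LIMIT, so one post type with many matches cannot crowd out the others.

```python
import sqlite3


def search_by_type(conn, term, post_types, per_type_limit=5):
    """Run one small query per post type so each group gets its own LIMIT."""
    results = {}
    for post_type in post_types:
        rows = conn.execute(
            "SELECT id, title FROM posts "
            "WHERE post_type = ? AND title LIKE ? "
            "ORDER BY id LIMIT ?",
            (post_type, f"%{term}%", per_type_limit),
        ).fetchall()
        results[post_type] = rows
    return results


def make_demo_db():
    """Build a tiny hypothetical posts table for the demonstration."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE posts (id INTEGER PRIMARY KEY, post_type TEXT, title TEXT)"
    )
    conn.executemany(
        "INSERT INTO posts (post_type, title) VALUES (?, ?)",
        [
            ("article", "Red widget review"),
            ("article", "Blue widget review"),
            ("article", "Green widget review"),
            ("product", "Widget deluxe"),
            ("blog", "Gardening tips"),
        ],
    )
    return conn


if __name__ == "__main__":
    grouped = search_by_type(
        make_demo_db(), "widget", ["article", "blog", "product"], per_type_limit=2
    )
    for post_type, rows in grouped.items():
        print(post_type, rows)
```

The grouped dictionary can then be rendered as one results section per post type, and each section can be paginated independently by adding an OFFSET to its own query.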

If your data set is very big, or if your web host is crap, then your PHP script may run out of memory... then, and only then, it is a good idea to start hunting around for the right way to do it in MySQL (I suspect temporary tables might be the right place to look).

But if you run into PHP's performance limits, then I suspect anything you do in MySQL is actually going to be even slower and you'll have to change your database structure to get good performance. One way to do this is to keep your existing table structure, but have a third table that contains duplicate data from all the tables - just for searching, and some code to keep everything in sync.
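One possible shape for that third table, sketched in Python with SQLite purely for illustration: the table names (`articles`, `products`, `search_copy`) and helper functions are assumptions, not anything from the original setup. Every write to a source table also refreshes the matching row in the search-only table, so a search touches a single table.

```python
import sqlite3


def make_db():
    """Two hypothetical source tables plus one duplicate table used only for search."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE articles (id INTEGER PRIMARY KEY, title TEXT);
        CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE search_copy (
            source TEXT,
            source_id INTEGER,
            body TEXT,
            PRIMARY KEY (source, source_id)
        );
        """
    )
    return conn


def sync_row(conn, source, source_id, text):
    """Insert or overwrite the searchable copy of one source row."""
    conn.execute(
        "INSERT OR REPLACE INTO search_copy (source, source_id, body) VALUES (?, ?, ?)",
        (source, source_id, text),
    )


def add_article(conn, title):
    cur = conn.execute("INSERT INTO articles (title) VALUES (?)", (title,))
    sync_row(conn, "articles", cur.lastrowid, title)
    return cur.lastrowid


def add_product(conn, name):
    cur = conn.execute("INSERT INTO products (name) VALUES (?)", (name,))
    sync_row(conn, "products", cur.lastrowid, name)
    return cur.lastrowid


def search(conn, term):
    """Search only the duplicate table; return (source, source_id) pairs."""
    return conn.execute(
        "SELECT source, source_id FROM search_copy "
        "WHERE body LIKE ? ORDER BY source, source_id",
        (f"%{term}%",),
    ).fetchall()
```

The cost of this design is the sync code itself: updates and deletes on the source tables need the same treatment as the inserts shown here, or the search table drifts out of date.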

For example, we have a table that contains every PDF document uploaded by the website's users, another table that contains every word that appears in any document, and a many-to-many linking table between the two.

Whenever a new pdf is uploaded, we find every word in it, and insert records into the linking table. This way we never actually have to search in the PDF documents, we only search the index tables which have been structured to allow for fast searching.
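The three-table layout described above might look something like the following sketch, written in Python with SQLite for illustration only. The table and function names are hypothetical, the tokenizer is a naive regular expression, and the PDF text is assumed to have been extracted already by some other step.

```python
import re
import sqlite3


def make_index_db():
    """Documents, distinct words, and a many-to-many linking table between them."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE documents (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE words (id INTEGER PRIMARY KEY, word TEXT UNIQUE);
        CREATE TABLE document_words (
            document_id INTEGER REFERENCES documents(id),
            word_id INTEGER REFERENCES words(id),
            PRIMARY KEY (document_id, word_id)
        );
        """
    )
    return conn


def index_document(conn, name, text):
    """Store a document and link it to every distinct word in its text."""
    doc_id = conn.execute(
        "INSERT INTO documents (name) VALUES (?)", (name,)
    ).lastrowid
    for word in set(re.findall(r"[a-z0-9]+", text.lower())):
        # Reuse the existing word row if this word was seen before.
        conn.execute("INSERT OR IGNORE INTO words (word) VALUES (?)", (word,))
        word_id = conn.execute(
            "SELECT id FROM words WHERE word = ?", (word,)
        ).fetchone()[0]
        conn.execute(
            "INSERT INTO document_words (document_id, word_id) VALUES (?, ?)",
            (doc_id, word_id),
        )
    return doc_id


def find_documents(conn, word):
    """Look up documents through the index tables without reading document text."""
    rows = conn.execute(
        "SELECT d.name FROM documents d "
        "JOIN document_words dw ON dw.document_id = d.id "
        "JOIN words w ON w.id = dw.word_id "
        "WHERE w.word = ? ORDER BY d.id",
        (word.lower(),),
    ).fetchall()
    return [name for (name,) in rows]
```

The indexing work happens once at upload time, and a search becomes an exact-match lookup on `words.word`, which the UNIQUE constraint backs with an index.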
