针对80k行的PHP数组优化

I need help to find workaround for getting over memory_limit. My limit is 128MB, from database I'm getting something about 80k rows, script stops at 66k. Thanks for help.

Code:

$posibilities = [];
    foreach ($result as $item) {
            $domainWord = str_replace("." . $item->tld, "", $item->address);

            for ($i = 0; $i + 2 < strlen($domainWord); $i++) {
                $tri = $domainWord[$i] . $domainWord[$i + 1] . $domainWord[$i + 2];


                if (array_key_exists($tri, $possibilities)) {
                    $possibilities[$tri] += 1;
                } else {
                    $possibilities[$tri] = 1;
                }
            }
        }

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongmeng9048 2015-06-29 13:55
关注
Your bottleneck, given your algorithm, is most possibly not the database query, but the $possibilities array you're building.

If I read your code correctly, you get a list of domain names from the database. From each of the domain names you strip off the top-level-domain at the end first.

Then you walk character-by-character from left to right of the resulting string and collect triplets of the characters from that string, like this:

example.com => ['exa', 'xam', 'amp', 'mpl', 'ple']

You store those triplets in the keys of the array, which is nice idea, and you also count them, which doesn't have any effect on the memory consumption. However, my guess is that the sheer number of possible triplets, which is for 26 letters and 10 digits is 36^3 = 46656 possibilities each taking 3 bytes just for key inside array, don't know how many boilerplate code around it, take quite a lot from your memory limit.

Probably someone will tell you how PHP uses memory with its database cursors, I don't know it, but you can do one trick to profile your memory consumption.

Put the calls to memory-get-usage:

before and after each iteration, so you'll know how many memory was wasted on each cursor advancement,

before and after each addition to $possibilities.

And just print them right away. So you'll be able to run your code and see in real time what and how seriously uses your memory.

Also, try to unset the $item after each iteration. It may actually help.

Knowledge of specific database access library you are using to obtain $result iterator will help immensely.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

针对80k行的PHP数组优化 php
2015-06-29 12:52

回答 2 已采纳 Your bottleneck, given your algorithm, is most possibly not the database query, but the $possibili
80K行mysql用户表导致登录问题[关闭] database mysql php
2016-10-24 15:18

回答 1 已采纳 Is it a good solution!? No, it is not a good solution. It is, in fact, a terrible solution.
PHP MySQL - 非常慢的循环 mysql php
2014-05-08 18:23

回答 1 已采纳 For every row from speechesLCMcoded (400K rows), you exec str_replace and sql query. You can r
php选取数组里最大的数据库,php&mysql：针对数据库检查大型数组的最有效方法
2021-05-06 05:53

weixin_39711914的博客我有一大堆数据存储在一个多维数组中.示例结构如下：Array([1] => Array([0] => motomummy.com[1] => 1921[2] => 473)[4] => Array([0] => kneedraggers.com[1] => 3051[2] => 5067))我在...
转换复杂的文本列表 mysql php
2014-05-07 18:05

回答 1 已采纳 Explanation: The code opens your text file and grabs them into an array, splits them into chunks
删除小数，跟随数字并舍入值 php
2013-11-06 03:21

回答 2 已采纳 <?php $number = '$96.2k'; $number = str_replace(array('$', 'k'), '', $number); $number = roun
按查询排序需要花费太多时间 mysql php sql
2013-10-18 06:46

回答 4 已采纳 You need an index on the recipient_firstname field (so really customers.customers_firstname). An i
php mysql查询出来二叉树的数据,php&mysql：针对数据库检查大型数组的最有效方法...
2021-03-13 10:46

刘幺幺的博客我有一大堆数据存储在一个多维数组中.示例结构如下：Array([1] => Array([0] => motomummy.com[1] => 1921[2] => 473)[4] => Array([0] => kneedraggers.com[1] => 3051[2] => 5067))我在...
数据桌 vs dplyr: 一个人能做好别人做不好的事情吗？ r语言
2014-01-29 15:21

回答 3 已采纳 We need to cover at least these aspects to provide a comprehensive answer/comparison (in no partic
PHP文件处理
2019-07-21 13:22

转圈的行星的博客查看文件名称： ...$path = '/var/www/web/default.php'; echo besename($path); echo '<br>'; echo besename($path,'.php'); //不存在扩展名时，加上.php扩展名 ?> 显示目录名称： d...
php echo 输出最大值,谈谈PHP中echo输出大量内容的效率问题
2021-03-23 23:17

混乱博物馆的博客 PHP中我们最常用的输出内容函数应该是echo了，但在我们使用的同时应该注意一点就是，使用echo输出大量内容会影响网页速度。可能很多朋友之前都没发现过这个问题，包括我之前也没遇到这个问题。今天帮朋友处理一个...
一个php代码太多执行太慢,PHP的echo输出内容过多会很慢
2021-03-23 23:58

吃掉惆怅的博客问题页的情况如下：apache + php使用smarty模板输出内容页面最终输出内容较大，80k+页面执行时间在500ms以上祭出法宝xhprof对问题页面做了细致检查，发现页面的瓶颈竟然是模板(编译后的)中的一个echo语句，这个echo...
echo输出大花括号 php_PHP的echo输出内容过多会很慢
2020-12-30 09:07

瞬光寂暗的博客问题页的情况如下：apache + php使用smarty模板输出内容页面最终输出内容较大，80k+页面执行时间在500ms以上祭出法宝xhprof对问题页面做了细致检查，发现页面的瓶颈竟然是模板(编译后的)中的一个echo语句，这个echo...
echo输出大花括号 php_谈谈PHP中echo输出大量内容的效率问题
2021-01-12 16:32

weixin_39824191的博客 PHP中我们最常用的输出内容函数应该是echo了，但在我们使用的同时应该注意一点就是，使用echo输出大量内容会影响网页速度。可能很多朋友之前都没发现过这个问题，包括我之前也没遇到这个问题。今天帮朋友处理一个...
php判断是否为3的n次方,不骗你，没读这一篇，你不可能懂二分
2021-04-26 14:44

一只小风的博客上篇文章讲动态规划获得了80k浏览，这次的二分也值得你们一看，这个系列是特别用心写的，准备出书的哦3.0 引子图书馆自习的时候,一女生背着一堆书进阅览室,结果警报响了,大妈让女生看是哪本书把警报弄响了，女生把书...
冰蝎2流量分析,解密以及其防守姿势
2021-11-12 11:00

瑞雪掩晨曦❀的博客文章目录冰蝎2流量分析(php)1.先来截图一下一下webshell2.通过上面解读的webshell,按照流程绘制了一个大致流程图3.接下来来测试冰蝎2的流量特征流量解密1.首先将第二次的秘钥填入密码内，将post请求内容放入解密2....
GET和POST提交数据方式的区别和使用
2016-07-08 15:29

_acme_的博客 ==> 学习汇总（持续更新） ==> 从零搭建后端基础设施系列（一）-- 背景介绍数据提交到服务器一般有两种方式，GET和POST。...2.可以通过url传递数据，查找..._POST都是一个关联数组,所以可以通过name取value.
Vue的常见性能优化
2022-09-09 22:43

跳跳的小古风的博客防抖、节流运用第三方模块按需导入大数据列表和表格性能优化 - 虚拟列表/虚拟表格搜索引擎 SEO 优化预渲染服务端渲染 SSR，nuxt.js 打包优化压缩代码使用 CDN 加载第三方模块多线程打包 happypack ...
PHP的echo输出内容过多会很慢
2020-12-30 07:58

weixin_34281537的博客 //对大字节字符串进行分割保存到数组中，然后循环输出 function echoBigChar($string,$bufferSize=8192){ $splitString=str_split($string,$bufferSize); foreach($splitString as $chunk){ echo ...
没有解决我的问题, 去提问

悬赏问题

¥20 数学建模，尽量用matlab回答，论文格式
¥15 昨天挂载了一下u盘，然后拔了
¥30 win from 窗口最大最小化，控件放大缩小，闪烁问题
¥20 易康econgnition精度验证
¥15 msix packaging tool打包问题
¥28 微信小程序开发页面布局没问题，真机调试的时候页面布局就乱了
¥15 python的qt5界面
¥15 无线电能传输系统MATLAB仿真问题
¥50 如何用脚本实现输入法的热键设置
¥20 我想使用一些网络协议或者部分协议也行，主要想实现类似于traceroute的一定步长内的路由拓扑功能

针对80k行的PHP数组优化

2条回答 默认 最新

悬赏问题

2条回答默认最新