dsfs21312 2018-10-15 18:09 · 56 views · accepted

How do I manage PHP memory?

I wrote a one-off script that I use to parse PDFs saved in the database. It was working fine until I ran out of memory after parsing 2,700+ documents.

The basic flow of the script is as follows:

  1. Get a list of all the document IDs to be parsed and save it as an array in the session (~155k documents).
  2. Display a page that has a button to start parsing
  3. When that button is clicked, make an AJAX request that parses the first 50 documents in the session array:

// Make sure the session is started before reading from it
if(session_status() == PHP_SESSION_NONE) {
  session_start();
}

$files = $_SESSION['files'];

$ids = array();

// Take the first 50 IDs for this batch; note that array_slice() copies the (large) list
$slice = array_slice($files, 0, 50);
$files = array_slice($files, 50, null); // remove the 50 we are parsing on this request

// Persist the shortened queue, then release the session lock early
$_SESSION['files'] = $files;
session_write_close();

// Build one named bind placeholder per document ID in the batch
for($i = 0; $i < count($slice); $i++) {
  $ids[] = ":id_{$i}";
}
$ids = implode(", ", $ids);

$sql = "SELECT d.id, d.filename, d.doc_content
  FROM proj_docs d
  WHERE d.id IN ({$ids})";

$stmt = oci_parse($objConn, $sql);
for($i = 0; $i < count($slice); $i++) {
  oci_bind_by_name($stmt, ":id_{$i}", $slice[$i]);
}
oci_execute($stmt, OCI_DEFAULT);
$cnt = oci_fetch_all($stmt, $data); // buffers every row of the batch in $data
oci_free_statement($stmt);

# Do the parsing..
# Output a table row..

  4. The response to each AJAX request includes a status indicating whether the script has finished parsing all ~155k documents; if it has not, another AJAX request is made to parse the next 50, with a 5 second delay between requests. A minimal sketch of such a status response follows this list.
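For concreteness, here is what that status payload might look like. This is an editorial sketch, not code from the original script; the done/remaining field names are assumptions:

// Hypothetical tail of the batch endpoint: tell the client whether to keep polling.
header('Content-Type: application/json');
echo json_encode(array(
  'done'      => count($files) === 0, // queue emptied, client can stop
  'remaining' => count($files),       // documents still waiting to be parsed
));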

Questions

  • Why am I running out of memory now? I expected peak memory usage to come at step #1, when the list of all ~155k document IDs is loaded, since that array holds every possible document, not a few minutes later, when the session array holds 2,700 fewer elements.
  • A few similar questions suggested either raising the memory limit to unlimited, which I don't want to do at all, or setting my variables to null when appropriate. I did the latter, as shown below, but I still ran out of memory after parsing ~2,700 documents. What other approaches should I try? (One alternative is sketched after the code below.)

# Freeing some memory space
$batch_size = null;
$with_xfa = null;
$non_xfa = null;
$total = null;
$files = null;
$ids = null;
$slice = null;
$sql = null;
$stmt = null;
$objConn = null;
$i = null;
$data = null;
$cnt = null;
$display_class = null;
$display = null;
$even = null;
$tr_class = null;
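One alternative worth trying (an editorial sketch, not from the original post): oci_fetch_all() buffers the entire batch in $data, so a batch that happens to contain several very large doc_content values drives peak memory up regardless of how variables are nulled afterwards. Fetching one row at a time and releasing each CLOB as soon as it is parsed keeps at most one document in memory. This assumes doc_content is a CLOB column and uses a hypothetical parse_document() in place of the real parsing code:

// Fetch rows one at a time instead of buffering the whole batch.
// Without the OCI_RETURN_LOBS flag, CLOB columns come back as OCI-Lob objects.
oci_execute($stmt, OCI_DEFAULT);
while(($row = oci_fetch_array($stmt, OCI_ASSOC)) !== false) {
  $content = $row['DOC_CONTENT']->load(); // read this one CLOB into memory
  $row['DOC_CONTENT']->free();            // release the LOB descriptor right away
  parse_document($row['ID'], $row['FILENAME'], $content); // hypothetical parser
  unset($content);                        // drop the string before the next row
}
oci_free_statement($stmt);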

1 Answer

  • doufan3958 2018-10-15 20:13

    So I'm not really sure why, but reducing the number of documents I parse per batch from 50 down to 10 seems to fix the issue. I've gone past 5,000 documents now and the script is still running. My only guess is that at 50 per batch I must have hit runs of large files that used up all of the allotted memory.
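    To test that guess, one could log each document's size and the per-batch peak memory (a hedged sketch; it assumes $data comes from the oci_fetch_all() call in the question and that doc_content arrives as a string there):

    // Log each document's size plus the batch's peak memory to spot outliers.
    foreach($data['DOC_CONTENT'] as $i => $content) {
      error_log(sprintf("doc %s: %.1f MB", $data['ID'][$i], strlen($content) / 1048576));
    }
    error_log(sprintf("batch peak: %.1f MB", memory_get_peak_usage(true) / 1048576));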

    Update #1

    I got another out-of-memory error at 8,500+ documents. I've reduced the batches further, down to 5 documents each, and will see tomorrow whether it parses everything. If that fails, I'll just increase the memory allocated temporarily.

    Update #2

    So it turns out that the only reason I was running out of memory is that we apparently have multiple PDF files over 300 MB uploaded to the database. I increased the memory allotted to PHP to 512 MB, and that seems to have let me finish parsing everything.
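    For reference, the limit can also be raised for just this one script rather than globally; memory_limit is changeable at runtime (the 512M value mirrors what worked above):

    // Raise the limit for this request only; the global php.ini setting stays untouched.
    ini_set('memory_limit', '512M');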

