I'm currently writing an application in Laravel and am setting up some scheduled tasks (Artisan commands) that will run periodically -- maybe once or twice a month. These tasks grab some CSV files using cURL, parse them, and import them into my local MySQL database.
However, a couple of these CSV files are quite large -- around 500,000 lines each -- so I have run into some memory issues.
Obviously I can increase the memory in my VM, but I'm wondering what other practices I can implement to ensure that I don't run out of memory. If I parse the files in chunks rather than all at once, will each chunk be treated as a separate system process, or will it still be treated as one long execution by the server?
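To make the chunking idea concrete, here is roughly what I mean (plain `fgetcsv` from the PHP standard library rather than League CSV, just for illustration -- the function name `csvChunks` is mine):

```php
<?php
// Sketch of chunked CSV reading with a generator. Each yielded chunk can be
// processed and then discarded, so memory stays bounded by the chunk size --
// but it is all still ONE PHP process, not a separate process per chunk.
function csvChunks(string $path, int $chunkSize): Generator
{
    $handle = fopen($path, 'r');
    $chunk  = [];
    while (($row = fgetcsv($handle)) !== false) {
        $chunk[] = $row;
        if (count($chunk) === $chunkSize) {
            yield $chunk;
            $chunk = []; // drop the processed rows so PHP can free them
        }
    }
    if ($chunk !== []) {
        yield $chunk; // the final, partial chunk
    }
    fclose($handle);
}
```

Iterating with `foreach (csvChunks($file, 10000) as $chunk) { ... }` keeps at most one chunk of rows in memory at a time, as long as nothing holds a reference to previous chunks.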
EDIT: My current approach. I'm using League CSV for the parsing.
protected $offset = 1;
protected $parse_chunk = 10000;

public function parse()
{
    $this->data = [];

    $this->reader->setOffset($this->offset)->setLimit($this->parse_chunk)->fetchAll(function ($row, $offset) {
        array_push($this->data, array_combine($this->keys, $row));
        $this->offset = $offset;
    });
}
So currently, the above fetches the first 10,000 rows and does the processing I need, which stores them in a new $data array. This data then gets passed to an Import class, but my scripts are currently timing out during the parsing. The above works on its own, but if I put it within a while loop to access all 500,000 records, then it uses too much memory.
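For clarity, the while loop I tried looks roughly like this. The sketch is self-contained, so a plain array stands in for the reader (in the real code each chunk would come from League CSV via setOffset()/setLimit()); the names `importInChunks` and `$import` are mine:

```php
<?php
// Sketch of the chunked while loop: advance the offset each pass, hand the
// chunk off for import, then drop it so the next pass starts from a clean
// slate. If $this->data kept accumulating across passes instead, memory
// would grow with every chunk -- which is the problem described above.
function importInChunks(array $rows, int $chunkSize, callable $import): int
{
    $offset   = 0;
    $imported = 0;
    while (true) {
        $chunk = array_slice($rows, $offset, $chunkSize);
        if ($chunk === []) {
            break; // no rows left
        }
        $import($chunk);           // e.g. hand off to the Import class
        $imported += count($chunk);
        $offset   += count($chunk);
        unset($chunk);             // free the chunk before the next pass
    }
    return $imported;
}
```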
Should I be dispatching each chunk to a queue which gets processed in the background?
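To make the queue idea concrete: I'd compute the offset/limit ranges up front and dispatch one queued job per range. The sketch below covers only the range computation (plain PHP); the dispatch itself would be Laravel's, and `ImportChunkJob` is a hypothetical job class name:

```php
<?php
// Sketch: split the file into (offset, limit) ranges, one per queued job.
// Each job would open the CSV itself and parse only its own range, so no
// chunk of rows has to be serialised onto the queue. The actual dispatch --
// ImportChunkJob::dispatch($path, $offset, $limit) -- is omitted here.
function chunkRanges(int $totalRows, int $chunkSize): array
{
    $ranges = [];
    for ($offset = 0; $offset < $totalRows; $offset += $chunkSize) {
        $ranges[] = [
            'offset' => $offset,
            'limit'  => min($chunkSize, $totalRows - $offset),
        ];
    }
    return $ranges;
}
```

With 500,000 rows and a chunk size of 10,000 this yields 50 ranges, i.e. 50 independent jobs, each with its own memory lifetime.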
EDIT: Added Benchmarks
I spent some time testing how long the different processes take. The parsing had interesting results. Keep in mind, this is JUST parsing -- nothing is being stored in the DB, but everything is being stored in an array in memory.
Parsing 1,000 records: .017 seconds
Parsing 10,000 records: .188 seconds
Parsing 100,000 records: 2.273 seconds
Parsing 500,000 records: Never completes.
I'm using the code above to execute this. What might cause the parsing to take so long (or perhaps fail) at 500,000 records?
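For reference, this is roughly how timings like the above can be taken -- a sketch using only stdlib `microtime()` and `memory_get_peak_usage()`; the `benchmark` helper name is mine, and the closure stands in for the real parse call:

```php
<?php
// Sketch: wrap a unit of work in microtime() and report elapsed seconds plus
// peak memory. Watching peak memory per chunk (not just time) is what shows
// whether memory grows linearly with the number of chunks processed.
function benchmark(callable $work): array
{
    $start = microtime(true);
    $work();
    return [
        'seconds' => microtime(true) - $start,
        'peak_mb' => memory_get_peak_usage(true) / 1048576,
    ];
}
```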