dongshanfan1941 2015-08-16 19:48

Importing large CSV files into MySQL: best practices

Looking for insight on the best approach for importing large CSV files into MySQL and managing the dataset. This is for an ecommerce storefront "startup". All product data will be read from CSV files, which are downloaded via curl (server to server).

Each CSV file represents a different supplier/warehouse with up to 100,000 products. In total there are roughly 1.2 million products spread over 90-100 suppliers. At least 75% of the row data (51 columns) is redundant garbage and will not be needed.

Would it be better to use mysqli's LOAD DATA LOCAL INFILE into a 'temp_products' table, make the needed data adjustments per row, and then insert into the live 'products' table, or simply to use fgetcsv() and go row by row? The import will be handled by a cron job using the site's php.ini with a memory limit of 128M.

  • Apache V2.2.29
  • PHP V5.4.43
  • MySQL V5.5.42-37.1-log
  • memory_limit 128M

I'm not looking for how-tos. I'm simply looking for the "best approach" from the community's perspective and experience.


1 answer

  • doujuanxun7167 2015-08-16 21:27

    I have direct experience of doing something virtually identical to what you describe -- lots of third party data sources in different formats all needing to go into a single master table.

    I needed to take different approaches for different data sources, because some were in XML, some in CSV, some large, some small, etc. For the large CSV ones, I did indeed roughly follow your suggested route:

    • I used LOAD DATA INFILE to dump the raw contents into a temporary table.
    • I took the opportunity to transform or discard some of the data within this query; LOAD DATA INFILE allows some quite complex queries. This allowed me to use the same temp table for several of the import processes even though they had quite different CSV data, which made the next step easier.
    • I then used a set of secondary SQL queries to pull the temp data into the various main tables. All told, I had about seven steps to the process (a minimal sketch of the load and transfer steps follows below).
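
    To make that concrete, here is a minimal PHP/mysqli sketch of the load-and-transfer pattern described above. The table and column names (temp_products, products, sku, price, and so on) and the file path are invented for illustration; the @dummy placeholders show how unwanted CSV columns can be discarded during the load.

    ```php
    <?php
    // Sketch with hypothetical table, column, and file names:
    //   1) LOAD DATA LOCAL INFILE into a staging table, discarding unwanted columns
    //   2) a secondary INSERT ... SELECT from the staging table into the live table
    $db = mysqli_init();
    $db->options(MYSQLI_OPT_LOCAL_INFILE, true);   // must be set before connecting
    $db->real_connect('localhost', 'user', 'pass', 'shop');

    $db->query("TRUNCATE temp_products");          // reuse the same staging table for every feed

    // @dummy discards CSV columns we don't need; SET transforms values as they load.
    $db->query("
        LOAD DATA LOCAL INFILE '/feeds/supplier1.csv'
        INTO TABLE temp_products
        FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '\"'
        LINES TERMINATED BY '\\n'
        IGNORE 1 LINES
        (sku, @dummy, title, @dummy, @price, stock, @dummy)
        SET price = ROUND(@price * 1.2, 2),
            supplier_id = 17
    ");

    // One of the 'secondary SQL queries': move the staged rows into the live table.
    $db->query("
        INSERT INTO products (sku, title, price, stock, supplier_id)
        SELECT sku, title, price, stock, supplier_id
        FROM temp_products
        ON DUPLICATE KEY UPDATE price = VALUES(price), stock = VALUES(stock)
    ");
    ```

    With ON DUPLICATE KEY UPDATE (assuming sku is a unique key in this hypothetical schema), re-running the same feed simply refreshes prices and stock rather than duplicating rows.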

    I had a set of PHP classes to do the imports, which all implemented a common interface. This meant that I could have a common front-end program which could run any of the importers.

    Since a lot of the importers did similar tasks, I put the commonly used code in traits so that the code could be shared.
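
    As a sketch of that structure (all class, trait, and method names here are invented, not the original code), a common interface plus a trait of shared helpers lets one cron-driven front-end run any importer. It is written to stay within PHP 5.4 syntax, so there are no scalar type hints or return types:

    ```php
    <?php
    // Hypothetical names throughout -- a sketch of the structure, not the original code.

    interface ImporterInterface
    {
        /** Human-readable name, used by the front-end for logging. */
        public function importerName();

        /** Run the whole import for one supplier feed. */
        public function run(mysqli $db);
    }

    // Commonly used code shared between importers via a trait.
    trait CsvLoadTrait
    {
        /** Load a CSV file into the staging table via LOAD DATA LOCAL INFILE. */
        protected function loadCsvIntoTemp(mysqli $db, $path, $columnClause)
        {
            $db->query("TRUNCATE temp_products");
            $db->query("
                LOAD DATA LOCAL INFILE '{$path}'
                INTO TABLE temp_products
                FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '\"'
                IGNORE 1 LINES
                {$columnClause}
            ");
        }
    }

    class Supplier1Importer implements ImporterInterface
    {
        use CsvLoadTrait;

        public function importerName()
        {
            return 'Supplier 1';
        }

        public function run(mysqli $db)
        {
            $this->loadCsvIntoTemp($db, '/feeds/supplier1.csv',
                '(sku, @dummy, title, @price) SET price = @price');
            $db->query("INSERT INTO products (sku, title, price)
                        SELECT sku, title, price FROM temp_products
                        ON DUPLICATE KEY UPDATE price = VALUES(price)");
        }
    }

    // Common front-end, run from cron: every importer is driven the same way.
    $db = mysqli_init();
    $db->options(MYSQLI_OPT_LOCAL_INFILE, true);
    $db->real_connect('localhost', 'user', 'pass', 'shop');

    $importers = array(new Supplier1Importer() /* , new Supplier2Importer(), ... */);
    foreach ($importers as $importer) {
        echo 'Importing ' . $importer->importerName() . PHP_EOL;
        $importer->run($db);
    }
    ```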

    Some thoughts based on the things you said in your question:

    • LOAD DATA INFILE will be orders of magnitude quicker than fgetcsv() with a PHP loop.
    • LOAD DATA INFILE queries can be very complex and achieve very good data mapping without ever having to run any other code, as long as the imported data is going into a single table.
    • Your memory limit is likely to need to be raised. However, using LOAD DATA INFILE means that it will be MySQL which uses the memory, not PHP, so the PHP limit won't come into play for that. 128M is still likely to be too low for you, though.
    • If you struggle to import the whole thing in one go, try using some simple Linux shell commands to split the file into several smaller chunks; the CSV data format should make that fairly simple (a sketch follows below).
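
    A sketch of that chunking approach, assuming hypothetical paths and a CSV with no quoted embedded newlines: the shell commands strip the header and split the feed, and PHP then loads each chunk with its own LOAD DATA statement so no single load has to handle the whole file.

    ```php
    <?php
    // Hypothetical paths and table name. Split the feed first from the shell,
    // stripping the header line so it doesn't end up inside a chunk, e.g.:
    //   tail -n +2 /feeds/supplier1.csv | split -l 20000 - /feeds/chunks/supplier1_
    // Then load each chunk separately.
    $db = mysqli_init();
    $db->options(MYSQLI_OPT_LOCAL_INFILE, true);
    $db->real_connect('localhost', 'user', 'pass', 'shop');

    foreach (glob('/feeds/chunks/supplier1_*') as $chunk) {
        $db->query("
            LOAD DATA LOCAL INFILE '{$chunk}'
            INTO TABLE temp_products
            FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '\"'
            (sku, @dummy, title, @price)
            SET price = @price
        ");
    }
    ```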
