duanmao1872 2013-09-11 14:39
浏览 509

PHP MYSQL ETL。 我应该使用ETL工具,存储过程还是php脚本?

We're restructuring our entire database. Said that, the database structure has changed drastically. Some table fields will be translated to table rows; there will be a lot of validations; some tables are broken up into multiple tables and other are combined. Basically we're changing the legacy database to 3NF.

New schema is ready and I am tasked to do ETL. How should I proceed? I am unfamiliar with any ETL tools so there is a learning curve there. I was thinking to write stored procedures, are they capable of dealing with all these complex stuff I am trying to achieve? I can write PHP scripts so I have better control over data validations but not sure if that is a way to go because of timeout and long scripts. The database size is about 6 GB and it has about 450 tables.

Suggestions please.

  • 写回答

2条回答 默认 最新

  • doudaotui4297 2013-09-11 14:56
    关注

    I would recommend creating a PHP migration script, which is run from the command line. PHP scripts that are run from the command line do not time out.

    You could also add "set_time_limit(0)" at the top of your script:

    <?php
    set_time_limit(0);
    //rest of the code goes here
    

    I'm not sure stored procedures or commercial ETL tools are necessary. Just make sure to break the migration script code into appropriate functions / methods, and you should be fine.

    评论

报告相同问题?

悬赏问题

  • ¥15 eclipse运行项目时遇到的问题
  • ¥15 关于#c##的问题:最近需要用CAT工具Trados进行一些开发
  • ¥15 南大pa1 小游戏没有界面,并且报了如下错误,尝试过换显卡驱动,但是好像不行
  • ¥15 没有证书,nginx怎么反向代理到只能接受https的公网网站
  • ¥50 成都蓉城足球俱乐部小程序抢票
  • ¥15 yolov7训练自己的数据集
  • ¥15 esp8266与51单片机连接问题(标签-单片机|关键词-串口)(相关搜索:51单片机|单片机|测试代码)
  • ¥15 电力市场出清matlab yalmip kkt 双层优化问题
  • ¥30 ros小车路径规划实现不了,如何解决?(操作系统-ubuntu)
  • ¥20 matlab yalmip kkt 双层优化问题