Every month I receive a CSV file, around 2 GB size. I import this file in a table in MySql database and this is almost instant.
Then using PHP, I query this table, filter data from this table and write relevant data to several other tables. This take several days - all queries are optimized.
I want to move this data to Hadoop but do not understand what should be the starting point. I am studying Hadoop and I know this can be done using Sqoop but still too confused, where to start in terms of how to migrate this data to Hadoop.