du_1993 2017-08-03 07:35

PHP: Removing special characters such as � when importing data from CSV into a database [duplicate]

I created a PHP script that imports a large CSV file into a database. While importing, I'd like to replace a special character such as � with the letter c. Below is my code:

    $sql = "INSERT INTO bill_of_materials(allotment_code, category_name, activity, quantity, end_unit_quantity, unit, description,
    unit_cost, regular_labor_cost, end_unit_labor_cost, type, batch) VALUES";

    while (($line = fgets($handle)) !== false) {
        $sql .= "('".implode("', '", explode(";", sanitize($line)))."'),";
        $counter++;
    }

    // drop the trailing comma
    $sql = substr($sql, 0, strlen($sql) - 1);

    if (mysqli_query($new_conn, $sql) === TRUE) {
        echo 1;

        // database file name
        $new_database_file = $new_database.'.sql';

        // remove any previous backup first
        if (file_exists('backup/'.$new_database_file)) {
            unlink('backup/'.$new_database_file);
        }

        // backup main database
        $command = "C:/xampp/mysql/bin/mysqldump --host=$host --user=$user --password=$pass $database_name > backup/$new_database_file";
        system($command);
    } else {
        echo $sql;
    }

In addition, my CSV contains a value such as W2-A1 2/F Front Fa�ade - B, and I'd like the output to be W2-A1 2/F Front Facade - B. How can I do this?


1 answer

  • doufan3408 2017-08-03 07:53

    First of all, make sure the database client connection uses the correct charset and collation. If the charset and collation are correct, you can use iconv and preg_replace to sanitize the offending characters like so:

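    function sanitize($line){
        // Transliterate accented characters to their closest ASCII equivalent (e.g. ç -> c).
        // Assumes $line is valid UTF-8; iconv() returns false on invalid input.
        $clean = iconv('UTF-8', 'ASCII//TRANSLIT', $line);
        // Drop anything outside printable ASCII; spaces and the ";" delimiter are kept.
        $clean = preg_replace('/[^\x20-\x7E]/', '', $clean);
        return $clean;
    }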
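
The transliteration above relies on two preconditions: the connection charset matches the data, and the input really is UTF-8 before `iconv` sees it. A minimal sketch of checking both, under these assumptions: `use_utf8mb4()` and `to_utf8()` are hypothetical helper names, the table is assumed to be stored as utf8mb4, and lines that are not valid UTF-8 are assumed to come from a Windows-1252 export (the question does not say which encoding the CSV uses). It needs the mbstring extension.

```php
<?php
// Hypothetical helper: switch the client connection to utf8mb4 so MySQL and PHP
// agree on the encoding. $conn is expected to be a mysqli connection object.
function use_utf8mb4($conn): void
{
    // mysqli::set_charset() returns false on failure.
    if (!$conn->set_charset('utf8mb4')) {
        throw new RuntimeException('Could not switch the connection to utf8mb4');
    }
}

// Hypothetical helper: return $line as UTF-8. Lines that are not valid UTF-8 are
// ASSUMED to be Windows-1252 (a common Excel export encoding); adjust if needed.
function to_utf8(string $line): string
{
    if (mb_check_encoding($line, 'UTF-8')) {
        return $line; // already valid UTF-8, leave untouched
    }
    return mb_convert_encoding($line, 'UTF-8', 'Windows-1252');
}

// "\xE7" is "ç" in Windows-1252; after conversion it is the UTF-8 form of U+00E7.
$fixed = to_utf8("W2-A1 2/F Front Fa\xE7ade - B");
var_dump($fixed === "W2-A1 2/F Front Fa\u{E7}ade - B"); // bool(true)
```

In the import loop from the question this would become `sanitize(to_utf8($line))`, with `use_utf8mb4($new_conn)` called once after connecting.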
    

    If that doesn't help (e.g. you have a truly corrupted byte stream, for example after saving to CSV from an old Excel source file), you may want to map the corrupted byte sequences to the intended characters explicitly. First you must find the corrupted byte sequence, e.g. by dumping the byte values with ord($line[$position]) or bin2hex($line). Example:

    function sanitize($line){
        $map = [
            // corrupted chars sequence -> fixed chars
            "\xC3\xA8" => 'č',
            "\xC3\x88" => 'Č',
            "\xC3\xB9" => 'ů',
            "\xC3\x99" => 'Ů',
            "\xC3\xAC" => 'ě',
            "\xC3\x8C" => 'Ě',
            "\xC3\xB8" => 'ř',
            "\xC3\x98" => 'Ř',
            "\x53\xC2\x8D" => 'Š',
            "\xC2\xA9" => 'Š',
        ];
        return str_replace(array_keys($map), $map, $line);
    }
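
To fill in a map like this you first need the raw byte values. A minimal sketch of one way to find them, assuming you can feed it a single problematic line from the CSV; `non_ascii_bytes()` is a hypothetical helper name, not part of PHP:

```php
<?php
// Hypothetical helper: list every non-ASCII byte in $line as offset => hex value,
// so a corrupted sequence can be copied into a replacement map.
function non_ascii_bytes(string $line): array
{
    $found = [];
    $length = strlen($line); // strlen() counts bytes, not characters
    for ($position = 0; $position < $length; $position++) {
        $byte = ord($line[$position]);
        if ($byte > 0x7F) {
            $found[$position] = sprintf('%02X', $byte);
        }
    }
    return $found;
}

// Example: a single Windows-1252 "ç" byte (0xE7) at offset 2.
print_r(non_ascii_bytes("Fa\xE7ade"));
```

A result of `[2 => 'E7']` would translate into a map key of `"\xE7"`; consecutive offsets belong to one multi-byte sequence and go into the key together.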
    
    This answer was selected as the best answer by the asker.