duanni5726 2013-05-07 16:38

PHP script for web scraping gives me an empty array

I'm trying to scrape a web page from my virtual web server. I'm looking for the project names and the names of their creators on the page, for example: Bring THE PEOPLE TO COME to New York City by Yanira Castro

This information is located in bbcard_name.

My problem is that the array and the CSV I get at the end of the script are always empty...

<?php

set_time_limit(0);

$data = array();

$listpage = file_get_contents('http://www.kickstarter.com/discover/categories/dance/');

preg_match_all('#<h2> <a href="([A-Z]+)\.html">([a-za-Z ]+)</a></li>#', $listpage, $pagesurl);

foreach ($pageurl[1] as $pagesurl) {

    $projectPage = file_get_contents('http://www.kickstarter.com/discover/categories/dance/' . $pagesurl . '.html');

    preg_match('#<h2>bbcard_name ([a-zA-Z ]+)</h2>#', $projectPage, $name);
    $name = $name[1];

    preg_match_all('#<h2><a href="https?://.+\.[a-z]{2,5}">([^<]+)</a>#', $projectPage, $namefound);

    foreach ($namefound[1] as $name) {
        if (!isset($data[$name]))
            $data[$name] = array('name' => $name);
        else
            $data[$name]['name'] .= ' - ' . $name;
    }
}

print_r($data);

$out = fopen('data.csv', 'w');
fputcsv($out, array('Titre'));

foreach ($data as $name => $data) {
    $name = (isset($data['name'])) ? $data['name'] : '';
    fputcsv($out, array($data, $name));
}

fclose($out);

echo "FINITO";
exit;

?>

Thanks
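
To make the goal concrete, here is a minimal sketch of the extraction I am after, written with DOMDocument and DOMXPath instead of regular expressions. It is only an illustration under assumptions: the sample markup is hypothetical (the real page may be structured differently), `extractProjects` is a made-up helper name, and I am assuming that each `bbcard_name` element contains a link holding the project title and a `span` holding "by" plus the creator.

```php
<?php
// Hypothetical sample markup; the real page structure is an assumption.
$html = <<<'HTML'
<ul>
  <li><h2 class="bbcard_name"><strong><a href="/projects/1/example">Bring THE PEOPLE TO COME to New York City</a></strong> <span>by Yanira Castro</span></h2></li>
  <li><h2 class="bbcard_name"><strong><a href="/projects/2/other">Another Project</a></strong> <span>by Jane Doe</span></h2></li>
</ul>
HTML;

// Hypothetical helper: returns a list of ['title' => ..., 'creator' => ...].
function extractProjects(string $html): array
{
    $doc = new DOMDocument();
    // Collect libxml warnings internally instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    // Match any element whose class list contains the token "bbcard_name".
    $query = '//*[contains(concat(" ", normalize-space(@class), " "), " bbcard_name ")]';

    $projects = array();
    foreach ($xpath->query($query) as $node) {
        $link = $xpath->query('.//a', $node)->item(0);
        $span = $xpath->query('.//span', $node)->item(0);
        if ($link === null) {
            continue; // no title link in this card, skip it
        }
        $title = trim($link->textContent);
        // Strip the leading "by " from the creator text, if present.
        $creator = $span !== null
            ? trim(preg_replace('/^\s*by\s+/i', '', $span->textContent))
            : '';
        $projects[] = array('title' => $title, 'creator' => $creator);
    }
    return $projects;
}

$projects = extractProjects($html);
print_r($projects);

// Write one row per project, with explicit CSV delimiter, enclosure and escape.
$out = fopen('data.csv', 'w');
fputcsv($out, array('Title', 'Creator'), ',', '"', '\\');
foreach ($projects as $project) {
    fputcsv($out, array($project['title'], $project['creator']), ',', '"', '\\');
}
fclose($out);
```

In a real run, `$html` would come from `file_get_contents()` on the listing URL rather than from the inline sample, and the XPath queries would need adjusting to whatever markup the page actually uses.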
