donglingsai2880 2015-01-27 11:44
浏览 17
已采纳

file_get_contents不适用于某些域

As a part of the requirement I need to find out whether a domain is parked or not. As there is no efficient way to find out this, I'm going to check the DOM for phrases like "Buy this domain", "may be for sale".. etc.

I found some parked domains which can be accessed through browser, but cannot able to get them using file_get_contents.

Example

$url = 'http://buythisdomain.com/'
$get = file_get_contents($url);

For the above got the following message at output.

Warning: file_get_contents(http://buythisdomain.com/): failed to open stream: HTTP request failed!

But able to access the same URL via browser.I tried fopen method too, but same result. Is there any way to achieve this?

  • 写回答

1条回答 默认 最新

  • dsfsad089111 2015-01-27 11:56
    关注

    Many sites, not only parked domains use some mechanism to block basic requests without valid browser headers.

    Try to use stream context that send that required headers like a browser like this

    $url = "http://buythisdomain.com/"
    $context = stream_context_create(array(
        'http' => array(
            'method' => "GET",
            'header' =>
                "Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
    " .
                "Accept-Language: en-US,en;q=0.8
    ".
                "Keep-Alive: timeout=3, max=10
    ",
                "Connection: keep-alive",
            'user_agent' => "User-Agent: Mozilla/5.0 (Windows NT 6.1) AppleWebKit/535.11 (KHTML, like Gecko) Chrome/17.0.963.66 Safari/535.11",
            "ignore_errors" => true,
            "timeout" => 3
        )
    ));
    file_get_contents($url, false, $context);
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 目前主流的音乐软件,像网易云音乐,QQ音乐他们的前端和后台部分是用的什么技术实现的?求解!
  • ¥60 pb数据库修改与连接
  • ¥15 spss统计中二分类变量和有序变量的相关性分析可以用kendall相关分析吗?
  • ¥15 拟通过pc下指令到安卓系统,如果追求响应速度,尽可能无延迟,是不是用安卓模拟器会优于实体的安卓手机?如果是,可以快多少毫秒?
  • ¥20 神经网络Sequential name=sequential, built=False
  • ¥16 Qphython 用xlrd读取excel报错
  • ¥15 单片机学习顺序问题!!
  • ¥15 ikuai客户端多拨vpn,重启总是有个别重拨不上
  • ¥20 关于#anlogic#sdram#的问题,如何解决?(关键词-performance)
  • ¥15 相敏解调 matlab