使用浏览器打开URL并且URL有效时，file_get_contents返回404

I get the following Error:

Warning: file_get_contents(https://www.readability.com/api/content/v1/parser?url=http://www.redmondpie.com/ps1-and-ps2-games-will-be-playable-on-playstation-4-very-soon/?utm_source=dlvr.it&utm_medium=twitter&token=MYAPIKEY) [function.file-get-contents]: failed to open stream: HTTP request failed! HTTP/1.1 404 NOT FOUND in /home/DIR/htdocs/readability.php on line 23

With some Echoes I got the URL parsed by the function and it is fine and valid, I do the request from my Browser and it is OK.

The thing is that I get the Error Above with file_get_contents and I really don't understand why.

The URL is Valid and the Function is NOT Blocked by the Free Hosting Service (So I don't need Curl).

If someone could spot the error in my Code, I would appreciate it! Thanks...

Here is my Code:

<?php

class jsonRes{
    public $url;
    public $author;
    public $url;
    public $image;
    public $excerpt;
}

function getReadable($url){
 $api_key='MYAPIKEY';
 if(isset($url) && !empty($url)){

    // I tried changing to http, no 'www' etc... -THE URL IS VALID/The browser opens it normally-

    $requesturl='https://www.readability.com/api/content/v1/parser?url=' . urlencode($url) . '&token=' . $api_key;
    $response = file_get_contents($requesturl);   // * here the code FAILS! *

    $g = json_decode($response);

    $article_link=$g->url;
    $article_author='';
    if($g->author != null){
       $article_author=$g->author;
    }

    $article_url=$g->url;
    $article_image=''; 
    if($g->lead_image_url != null){
        $article_image=$g->lead_image_url;
    }
    $article_excerpt=$g->excerpt;

    $toJSON=new jsonRes();
    $toJSON->url=$article_link;
    $toJSON->author=$article_author;
    $toJSON->url=$article_url;
    $toJSON->image=$article_image;
    $toJSON->excerpt->$article_excerpt;

    $retJSONf=json_encode($toJSON);
    return $retJSONf;
 }
}
?>

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
drtpbx3606 2014-01-29 18:29
关注
Sometimes a website will block crawlers(from remote servers) from getting to their pages.

What they do to work around this is spoof a browsers headers. Like pretend to be Mozilla Firefox instead of the sneaky PHP web scraper they are.

This is a function which uses the cURL library to do just that.

function get_data($url) { $userAgent = 'Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.13) Gecko/20080311 Firefox/2.0.0.13'; $ch = curl_init(); curl_setopt($ch, CURLOPT_USERAGENT, $userAgent); curl_setopt($ch, CURLOPT_URL, $url); curl_setopt($ch, CURLOPT_FAILONERROR, true); curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true); curl_setopt($ch, CURLOPT_AUTOREFERER, true); curl_setopt($ch, CURLOPT_RETURNTRANSFER,true); curl_setopt($ch, CURLOPT_TIMEOUT, 10); $html = curl_exec($ch); if (!$html) { echo "<br />cURL error number:" .curl_errno($ch); echo "<br />cURL error:" . curl_error($ch); exit; } else{ return $html; } //End of cURL function }

One would then call it as below:

$response = get_data($requesturl);

Curl offers much more options in fetching of remote content and error checking than file_get_contents does. If you even want to customize it further, check out the list of cURL options here - Abridged list of cURL options
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

使用浏览器打开URL并且URL有效时，file_get_contents返回404 json php
2014-01-29 18:15

回答 1 已采纳 Sometimes a website will block crawlers(from remote servers) from getting to their pages. What th
file_get_contents - URL中的特殊字符 - 特殊情况 php
2015-07-30 09:48

回答 2 已采纳 URLs cannot contain "Ö"! Start from this basic premise. Any characters not within a narrowly defin
file_get_contents（）失败，URL中包含特殊字符 php
2015-06-28 08:48

回答 2 已采纳 The problem is likely due to urlencode escaping your protocol: https://en.wikipedia.org/wiki/Ålan
php伪造页面url地址,php 伪造HTTP_REFERER网页URL来源的三种方法
2021-03-24 11:17

凿船尸爷的博客 php获取当前网页的前一个网页URL地址，即当前网页是从哪个网页链接过来的，可以使用$_SERVER['HTTP_REFERER']，但是这个来源网页的URL地址是可以被伪造和欺骗的，本文章向大家简介伪造HTTP_REFERER网页URL的三种方法...
使用file_get_contents从url获取JSON json php
2016-11-24 11:20

回答 3 已采纳 Note on the docs for json_decode: http://php.net/manual/en/function.json-decode.php This function
file_get_contents不适用于无协议URL？ php
2015-05-04 15:11

回答 2 已采纳 you can't use file_get_contents with //google.com because what it's actually doing is file:///goog
来自多个URL的'file_get_contents'函数和重定向限制已达到警告 json php
2018-12-25 21:18

回答 1 已采纳 I would suggest using cURL for fetching remote data. You could do this: $urls = [ "https://ww
php parse url ctf,XTCTF Web_php_wrong_nginx_config
2021-05-08 13:15

聂飞琼的博客 $o = ob_get_contents(); ob_end_clean(); $d = base64_encode(x(gzcompress($o), $k)); print("<$k>$d"); @session_destroy(); } } } } 然后就是代码审计。。去读个几遍，因此代码本身的逻辑不难理解。可以参考： ...
为什么我的POST file_get_contents返回HTTP错误请求？ php
2016-02-03 16:24

回答 2 已采纳 http_build_query() converts an array to a URL-encoded query string like name=john&password=s3cr3t
PHP使用file_get_contents（）检查外部服务器上是否存在文件 php
2014-08-18 01:29

回答 3 已采纳 I think best method for me is using this script: $file = "http://website.com/dir/filename.php"; $
在PHP中，使用file_get_contents && file_get_contents会返回不同的值吗？ php
2016-03-27 18:06

回答 2 已采纳 file_get_contents never returns true. It returns file (or URL) contents or false if the contents
easyui datagrid url不请求请求_Go Web编程--深入学习解析HTTP请求
2020-11-22 10:17

weixin_39942451的博客之前这个系列的文章一直在讲用Go语言怎么编写HTTP...不过一直漏掉了一个环节是服务器接收到请求后如何解析请求拿到想要的数据，Go语言使用net/http包中的Request结构体对象来表示HTTP请求，通过Request结构对象上定...
在浏览器中访问时，file_put_contents不起作用 apache php
2018-10-30 04:07

回答 1 已采纳 Right. So it is a permissions problem. You need to make sure that the "user" that runs the php c
AJAX（Asynchronous JavaScript And Xml）、get和post请求、url和跨域问题、JSON.parse和JSON.stringify方法
2021-05-02 21:12

YF-SOD的博客 xml.open(method,url,async) xml.setRequestHeader(name,value) xml.send(string) xml.getAllResponseHeaders() xml.getResponseHeader(name) xml.abort() XMLHttpRequest属性 xml.onreadystatechange ..
BUGKU-WEB never_give_up_never_give_up bugku，大厂的前端面试难吗
2024-04-21 06:22

m0_60607289的博客这是一段js代码，作用就是嵌入在HTML文档中，...当你给window.location.href赋值时，浏览器会立即导航到指定的新URL。已经被弃用（有漏洞，这里利用的就是这个漏洞，称为。2.1.6 v-show 与 v-if 选择。2.2.3 目录说明。
没有解决我的问题, 去提问

悬赏问题

¥15 逻辑谓词和消解原理的运用
¥15 请求分析基于spring boot+vue的前后端分离的项目
¥15 三菱伺服电机按启动按钮有使能但不动作
¥15 js，页面2返回页面1时定位进入的设备
¥200 关于#c++#的问题，请各位专家解答！网站的邀请码
¥50 导入文件到网吧的电脑并且在重启之后不会被恢复
¥15 （希望可以解决问题）ma和mb文件无法正常打开，打开后是空白，但是有正常内存占用，但可以在打开Maya应用程序后打开场景ma和mb格式。
¥20 ML307A在使用AT命令连接EMQX平台的MQTT时被拒绝
¥20 腾讯企业邮箱邮件可以恢复么
¥15 有人知道怎么将自己的迁移策略布到edgecloudsim上使用吗？

使用浏览器打开URL并且URL有效时，file_get_contents返回404

1条回答 默认 最新

悬赏问题

1条回答默认最新