PHP cURL: 405 Not Allowed

Final update: It appears that the target website blocks DO IPs, which caused the problems I had been trying to resolve for days. I spun up an EC2 instance and managed to get the code working there, together with caching to reduce the hits on the website, so that my users can share links to it.
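The caching mentioned above could look roughly like the following sketch. It is not the code actually deployed: `cached_fetch`, the cache directory, and the one-hour TTL are assumptions chosen for illustration, and `$fetcher` stands in for whatever function retrieves the og: data.

```php
<?php
// Hypothetical file cache: $fetcher is any callable that returns the data for a URL.
function cached_fetch(string $url, callable $fetcher, string $cacheDir, int $ttl = 3600)
{
    $file = rtrim($cacheDir, '/') . '/' . md5($url) . '.json';
    clearstatcache(true, $file); // make sure is_file()/filemtime() see the current state

    // Serve from the cache while the entry is younger than $ttl seconds.
    if (is_file($file) && (time() - filemtime($file)) < $ttl) {
        $cached = json_decode(file_get_contents($file), true);
        if ($cached !== null) {
            return $cached;
        }
    }

    $data = $fetcher($url); // only hit the remote site on a cache miss
    file_put_contents($file, json_encode($data), LOCK_EX);
    return $data;
}
```

Wrapping the scraping function this way means repeated shares of the same URL trigger at most one request to the target site per TTL window.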

UPDATE: I managed to get the HTML by turning off CURLOPT_FAILONERROR. However, besides returning a 405 error, the website also does not set some of the cookies that are required for its content to load.

curl_setopt($ch, CURLOPT_FAILONERROR, FALSE);
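A minimal sketch of what turning CURLOPT_FAILONERROR off makes possible: the body is returned even for a 4xx status, and the status code can be checked separately. The names `fetch_with_status` and `has_usable_body` are illustrative assumptions, not part of the original code.

```php
<?php
// Hypothetical helper: fetch a URL and return both the status code and the body,
// even when the server answers with a 4xx/5xx status.
function fetch_with_status(string $url): array
{
    $ch = curl_init($url);
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true,
        CURLOPT_FOLLOWLOCATION => true,
        CURLOPT_FAILONERROR    => false, // keep the body on HTTP errors
        CURLOPT_TIMEOUT        => 30,
    ]);
    $body   = curl_exec($ch);
    $status = (int) curl_getinfo($ch, CURLINFO_HTTP_CODE);
    $error  = curl_error($ch);
    curl_close($ch);

    return [
        'status' => $status,
        'body'   => $body === false ? '' : $body,
        'error'  => $error,
    ];
}

// Hypothetical check: treat any non-empty body as parseable, whatever the status.
function has_usable_body(array $response): bool
{
    return trim($response['body']) !== '';
}
```

With this shape, a 405 response that still carries HTML can be passed on to the parser, while the status remains available for logging.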

I'm using the following code, called via AJAX, to retrieve the og: meta tags of websites. The code works seamlessly for the majority of websites, but one or two specific sites return an error and no information is retrieved. The errors are as follows:

Warning: DOMDocument::loadHTML(): Empty string supplied as input in /my/home/path/getUrlMeta.php on line 58

From curl_error() in my error_log:

The requested URL returned error: 405 Not Allowed

And

Failed to connect to www.something.com port 443: Connection refused
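The DOMDocument warning above follows from the cURL failures: when curl_exec() fails, there is no HTML to hand to loadHTML(). A minimal sketch of guarding the parsing step, assuming a hypothetical wrapper named `extract_og_meta` (not part of the original code):

```php
<?php
// Hypothetical wrapper around the DOM parsing step: returns an empty array
// instead of triggering a warning when cURL produced no HTML.
function extract_og_meta($html): array
{
    if (!is_string($html) || trim($html) === '') {
        return []; // curl_exec() returned false or an empty body
    }

    libxml_use_internal_errors(true); // collect parse errors instead of emitting warnings
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();

    $xpath = new DOMXPath($doc);
    $metas = [];
    foreach ($xpath->query("//meta[starts-with(@property, 'og:')]") as $meta) {
        // Strip the "og:" prefix so the keys read "title", "image", and so on.
        $metas[substr($meta->getAttribute('property'), 3)] = $meta->getAttribute('content');
    }
    return $metas;
}
```

The caller can then treat an empty array as "nothing retrieved" and report that to the front end instead of emitting a PHP warning into the JSON response.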

I have no problem getting the HTML of the website when I use curl from my server console, and no problem retrieving the information needed for the majority of websites using the code below:

function file_get_contents_curl($url)
{
    $ch = curl_init();
    $header[0] = "Accept: text/html, text/xml,application/xml,application/xhtml+xml,";
    $header[0] .= "text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5";
    $header[] = "Cache-Control: max-age=0";
    $header[] = "Connection: keep-alive";
    $header[] = "Keep-Alive: 300";
    $header[] = "Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7";
    $header[] = "Accept-Language: en-us,en;q=0.5";
    $header[] = "Pragma: no-cache";
    curl_setopt($ch, CURLOPT_HTTPHEADER, $header);

    curl_setopt($ch, CURLOPT_HEADER, 0);
    curl_setopt($ch, CURLOPT_URL, $url);
    //curl_setopt($ch, CURLOPT_CUSTOMREQUEST, 'GET');

    curl_setopt($ch, CURLOPT_FAILONERROR, true);
    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
    curl_setopt($ch, CURLOPT_TIMEOUT, 30);
    curl_setopt($ch, CURLOPT_USERAGENT,"Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:31.0) Gecko/20100101 Firefox/31.0 " );
    curl_setopt($ch, CURLOPT_SSL_VERIFYHOST, false);
    curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, false);
    //The following 2 set up lines work with sites like www.nytimes.com

    //Update: Added option for cookie jar since some websites recommended it. cookies.txt is set to permission 777. Still doesn't work.
    $cookiefile = '/home/my/folder/cookies.txt';
    curl_setopt( $ch, CURLOPT_COOKIESESSION, true );
    curl_setopt( $ch, CURLOPT_COOKIEJAR,  $cookiefile );
    curl_setopt( $ch, CURLOPT_COOKIEFILE, $cookiefile );

    $data = curl_exec($ch);

    if (curl_error($ch)) {
        error_log(curl_error($ch));
    }
    curl_close($ch);

    return $data;
}

$html = file_get_contents_curl($url);

libxml_use_internal_errors(true); // Collect libxml parse warnings internally instead of suppressing them with @
$doc = new DOMDocument();
$doc->loadHTML($html);
$xpath = new DOMXPath($doc);
$query = '//*/meta[starts-with(@property, \'og:\')]';
$metas = $xpath->query($query);
$rmetas = array();
foreach ($metas as $meta) {
    $property = substr($meta->getAttribute('property'),3);
    $content = $meta->getAttribute('content');
    $rmetas[$property] = $content;
}

/* If og:image is empty, the code below falls back to the first image wider than 600px. */
if (empty($rmetas['image'])) {
    //$src = $xpath->evaluate("string(//img/@src)");
    //echo "src=" . $src . "\n";
    $query = '//*/img';
    $srcs = $xpath->query($query);
    foreach ($srcs as $src) {

        $property = $src->getAttribute('src');


        // Only consider absolute URLs ending in a common image extension ('peg' matches .jpeg).
        if (substr($property,0,4) == 'http' && in_array(substr($property,-3), array('jpg','png','peg'), true)) {
            $size = getimagesize($property);
            if ($size !== false && $size[0] > 600) {
                $rmetas['image'] = $property;
                break; // stop at the first image wider than 600px
            }
        }

    }
}

echo json_encode($rmetas);


die();

UPDATE: It was an error on my part: that website is not HTTPS-enabled. I still get the 405 Not Allowed error, though.

Output of curl_getinfo():

{
    "url": "http://www.example.com/",
    "content_type": null,
    "http_code": 405,
    "header_size": 0,
    "request_size": 458,
    "filetime": -1,
    "ssl_verify_result": 0,
    "redirect_count": 0,
    "total_time": 0.326782,
    "namelookup_time": 0.004364,
    "connect_time": 0.007725,
    "pretransfer_time": 0.007867,
    "size_upload": 0,
    "size_download": 0,
    "speed_download": 0,
    "speed_upload": 0,
    "download_content_length": -1,
    "upload_content_length": -1,
    "starttransfer_time": 0.326634,
    "redirect_time": 0,
    "redirect_url": "",
    "primary_ip": "SOME IP",
    "certinfo": [],
    "primary_port": 80,
    "local_ip": "SOME IP",
    "local_port": 52966
}

Update: If I run curl -i from the console, I get the following response: a 405 error, but followed by all the HTML that I need.

Home> curl -i http://www.domain.com
HTTP/1.1 405 Not Allowed
Server: nginx
Date: Wed, 22 Feb 2017 17:57:03 GMT
Content-Type: text/html; charset=UTF-8
Transfer-Encoding: chunked
Vary: Accept-Encoding
Vary: Accept-Encoding
Set-Cookie: PHPSESSID2=ko67tfga36gpvrkk0rtqga4g94; path=/; domain=.domain.com
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Set-Cookie: __PAGE_REFERRER=deleted; expires=Thu, 01-Jan-1970 00:00:01 GMT; Max-Age=0; path=/; domain=www.domain.com
Set-Cookie: __PAGE_SITE_REFERRER=deleted; expires=Thu, 01-Jan-1970 00:00:01 GMT; Max-Age=0; path=/; domain=www.domain.com
X-Repository: legacy
X-App-Server: production-web23:8018
X-App-Server: distil2-kvm:80
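The same view of status line, cookies, and body can be obtained from PHP by enabling CURLOPT_HEADER and splitting the raw result at the offset reported by CURLINFO_HEADER_SIZE. The sketch below shows only the splitting step; `split_raw_response` is a hypothetical helper, and it assumes a single header block (no redirects were followed).

```php
<?php
// Hypothetical helper: split a raw response (as returned by curl_exec() with
// CURLOPT_HEADER enabled) into status line, Set-Cookie values, and body.
function split_raw_response(string $raw, int $headerSize): array
{
    $headerText = substr($raw, 0, $headerSize);
    $body       = (string) substr($raw, $headerSize);

    $lines   = preg_split('/\r\n|\n/', trim($headerText));
    $cookies = [];
    foreach ($lines as $line) {
        // Header names are case-insensitive, so match without regard to case.
        if (stripos($line, 'Set-Cookie:') === 0) {
            $cookies[] = trim(substr($line, strlen('Set-Cookie:')));
        }
    }

    return ['status_line' => $lines[0], 'cookies' => $cookies, 'body' => $body];
}
```

In practice the inputs would come from `$raw = curl_exec($ch);` and `curl_getinfo($ch, CURLINFO_HEADER_SIZE)`, which makes it easy to see which cookies the server did or did not set on a 405 response.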

Accepted answer by doulv8162 (2017-02-22 15:46):
Add the following to your code to help debug the issue:

$info = curl_getinfo($ch); print_r( $info );

More than likely, the issues are as follows:

405 Not Allowed - the cURL call you are trying to make is not allowed, e.g. making a GET call when only POST is permitted.

443: Connection refused - the site you are trying to access does not support HTTPS. Or, the site only accepts cryptographic protocols that your code does not use, e.g. the site requires TLSv1.2 while your code may be using TLSv1.1.
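As a rough illustration of reading the curl_getinfo() output along these lines, the sketch below maps the `http_code` and `primary_port` fields to a one-line diagnosis. `diagnose_curl_info` and its messages are assumptions made for this example, not part of cURL.

```php
<?php
// Hypothetical helper: turn the array from curl_getinfo() into a short diagnosis.
function diagnose_curl_info(array $info): string
{
    $code = (int) ($info['http_code'] ?? 0);
    $port = (int) ($info['primary_port'] ?? 0);

    if ($code === 0) {
        // No status line was received at all, e.g. the connection was refused.
        return "No HTTP response (connection problem on port $port)";
    }
    if ($code === 405) {
        return "405 on port $port: the server rejected the request as not allowed";
    }
    if ($code >= 400) {
        return "HTTP error $code on port $port";
    }
    return "OK ($code) on port $port";
}
```

Note that curl_getinfo() has to be called before curl_close(), otherwise the handle no longer carries the transfer details.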