PHP cURL: 405 Not Allowed

Final update: It appears that the target website blocks DO IPs, which caused the problems I had been trying to resolve for days. I spun up an EC2 instance and managed to get the code working there, together with caching to reduce the load on the website and allow my users to share the website.
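The caching mentioned in this update is not shown in the post. A minimal sketch of one way it could be done is below; the function name `cached_fetch`, the cache directory and the one-hour lifetime are all assumptions for illustration. The fetcher is passed in as a callable, so the existing `file_get_contents_curl()` could be plugged in unchanged.

```php
<?php
/**
 * Hypothetical helper: return the cached body for $url while it is fresh,
 * otherwise call $fetch($url) and store the result on disk.
 */
function cached_fetch(string $url, callable $fetch, string $cacheDir, int $ttl = 3600): string
{
    if (!is_dir($cacheDir)) {
        mkdir($cacheDir, 0700, true);
    }
    // One cache file per URL, named after a hash of the URL.
    $file = $cacheDir . '/' . sha1($url) . '.html';

    // Serve from the cache while the file is younger than $ttl seconds.
    if (is_file($file) && (time() - filemtime($file)) < $ttl) {
        return (string) file_get_contents($file);
    }

    // curl_exec() may return false; the cast turns that into ''.
    $body = (string) $fetch($url);
    if ($body !== '') {
        // Cache only non-empty responses so a failed fetch is retried next time.
        file_put_contents($file, $body, LOCK_EX);
    }
    return $body;
}
```

Usage would be something like `$html = cached_fetch($url, 'file_get_contents_curl', '/tmp/og_cache');`, where the cache path is again only an example.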

UPDATE: I managed to get the HTML by turning off CURLOPT_FAILONERROR. However, besides returning a 405 error, the website is also not setting some cookies that are required for its content to load.

curl_setopt($ch, CURLOPT_FAILONERROR, FALSE);
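With `CURLOPT_FAILONERROR` off, `curl_exec()` returns the body even for a 4xx/5xx response, so the caller has to look at the status code itself. A rough sketch of that idea follows; `fetch_page` and `body_is_usable` are made-up names, and the option list is trimmed to the minimum rather than copied from the code in the question.

```php
<?php
/** Hypothetical check: a body is worth parsing only if it is a non-blank string. */
function body_is_usable($body): bool
{
    return is_string($body) && trim($body) !== '';
}

/** Hypothetical wrapper: fetch $url and report status, body and transport error. */
function fetch_page(string $url): array
{
    $ch = curl_init($url);
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true,
        CURLOPT_FOLLOWLOCATION => true,
        CURLOPT_FAILONERROR    => false, // keep the body even on 4xx/5xx
        CURLOPT_TIMEOUT        => 30,
    ]);

    $body   = curl_exec($ch);                              // false on transport failure
    $status = (int) curl_getinfo($ch, CURLINFO_HTTP_CODE); // 0 if no HTTP response arrived
    $error  = curl_error($ch);

    return [
        'status' => $status,
        'body'   => body_is_usable($body) ? $body : '',
        'error'  => $error,
    ];
}
```

A caller could then decide for itself, for example parsing the page whenever `body` is non-empty and only logging `status` when it is 400 or higher.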

I'm using the following code (AJAX -> PHP) to retrieve og: meta tags for websites. However, one or two specific sites return an error and their info cannot be retrieved; the errors are shown below. The code works seamlessly for the majority of websites.

Warning: DOMDocument::loadHTML(): Empty string supplied as input in /my/home/path/getUrlMeta.php on line 58

From curl_error in my error_log

The requested URL returned error: 405 Not Allowed

And

Failed to connect to www.something.com port 443: Connection refused

I have no problem getting the HTML of the website when I use curl from my server console, and no problem retrieving the information needed for the majority of websites using the code below:

function file_get_contents_curl($url)
{
    $ch = curl_init();
    $header[0] = "Accept: text/html, text/xml,application/xml,application/xhtml+xml,";
    $header[0] .= "text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5";
    $header[] = "Cache-Control: max-age=0";
    $header[] = "Connection: keep-alive";
    $header[] = "Keep-Alive: 300";
    $header[] = "Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7";
    $header[] = "Accept-Language: en-us,en;q=0.5";
    $header[] = "Pragma: no-cache";
    curl_setopt($ch, CURLOPT_HTTPHEADER, $header);

    curl_setopt($ch, CURLOPT_HEADER, 0);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
    curl_setopt($ch, CURLOPT_URL, $url);
    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, 1);
    //curl_setopt($ch, CURLOPT_CUSTOMREQUEST, 'GET');

    curl_setopt($ch, CURLOPT_FAILONERROR, true);
    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
    curl_setopt($ch, CURLOPT_TIMEOUT, 30);
    curl_setopt($ch, CURLOPT_USERAGENT,"Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:31.0) Gecko/20100101 Firefox/31.0 " );
    curl_setopt($ch, CURLOPT_SSL_VERIFYHOST, false);
    curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, false);
    //The following 2 set up lines work with sites like www.nytimes.com

    //Update: Added option for cookie jar since some websites recommended it. cookies.txt is set to permission 777. Still doesn't work.
    $cookiefile = '/home/my/folder/cookies.txt';
    curl_setopt( $ch, CURLOPT_COOKIESESSION, true );
    curl_setopt( $ch, CURLOPT_COOKIEJAR,  $cookiefile );
    curl_setopt( $ch, CURLOPT_COOKIEFILE, $cookiefile );

    $data = curl_exec($ch);

    if (curl_error($ch)) {
        error_log(curl_error($ch));
    }
    curl_close($ch);

    return $data;
}

$html = file_get_contents_curl($url);

libxml_use_internal_errors(true); // Yeah if you are so worried about using @ with warnings
$doc = new DomDocument();
$doc->loadHTML($html);
$xpath = new DOMXPath($doc);
$query = '//*/meta[starts-with(@property, \'og:\')]';
$metas = $xpath->query($query);
$rmetas = array();
foreach ($metas as $meta) {
    $property = substr($meta->getAttribute('property'),3);
    $content = $meta->getAttribute('content');
    $rmetas[$property] = $content;
}

/* The code below picks the first image wider than 600px if og:image is empty. */
if (empty($rmetas['image'])) {
    //$src = $xpath->evaluate("string(//img/@src)");
    //echo "src=" . $src . "\n";
    $query = '//*/img';
    $srcs = $xpath->query($query);
    foreach ($srcs as $src) {

        $property = $src->getAttribute('src');


        if (substr($property,0,4) == 'http' && in_array(substr($property,-3), array('jpg','png','peg'), true)) {
            // getimagesize() returns false on failure; stop at the first image wider than 600px.
            $size = getimagesize($property);
            if ($size !== false && $size[0] > 600) {
                $rmetas['image'] = $property;
                break;
            }
        }

    }
}

echo json_encode($rmetas);


die();
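The `DOMDocument::loadHTML(): Empty string supplied as input` warning appears because `curl_exec()` returned nothing for the failing sites and that value was passed straight to the parser. One possible guard, sketched with a hypothetical function name `extract_og_meta`, is to return an empty result before touching DOMDocument:

```php
<?php
/**
 * Hypothetical helper: extract og:* meta tags from an HTML string.
 * Returns an empty array when there is nothing to parse.
 */
function extract_og_meta($html): array
{
    // curl_exec() returns false on failure; treat that and blank strings alike.
    if (!is_string($html) || trim($html) === '') {
        return [];
    }

    // Collect libxml warnings internally instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $metas = [];
    $xpath = new DOMXPath($doc);
    foreach ($xpath->query("//meta[starts-with(@property, 'og:')]") as $meta) {
        // Strip the leading "og:" so the keys are "title", "image", and so on.
        $metas[substr($meta->getAttribute('property'), 3)] = $meta->getAttribute('content');
    }
    return $metas;
}
```

This only silences the symptom; the underlying 405 still has to be dealt with separately.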

UPDATE: It was an error on my part: the website is not HTTPS-enabled. I still get the 405 Not Allowed error.

curl info

{
    "url": "http://www.example.com/",
    "content_type": null,
    "http_code": 405,
    "header_size": 0,
    "request_size": 458,
    "filetime": -1,
    "ssl_verify_result": 0,
    "redirect_count": 0,
    "total_time": 0.326782,
    "namelookup_time": 0.004364,
    "connect_time": 0.007725,
    "pretransfer_time": 0.007867,
    "size_upload": 0,
    "size_download": 0,
    "speed_download": 0,
    "speed_upload": 0,
    "download_content_length": -1,
    "upload_content_length": -1,
    "starttransfer_time": 0.326634,
    "redirect_time": 0,
    "redirect_url": "",
    "primary_ip": "SOME IP",
    "certinfo": [],
    "primary_port": 80,
    "local_ip": "SOME IP",
    "local_port": 52966
}

Update: If I run curl -i from the console, I get the following response: a 405 error, but it is followed by all the HTML that I need.

Home> curl -i http://www.domain.com
HTTP/1.1 405 Not Allowed
Server: nginx
Date: Wed, 22 Feb 2017 17:57:03 GMT
Content-Type: text/html; charset=UTF-8
Transfer-Encoding: chunked
Vary: Accept-Encoding
Vary: Accept-Encoding
Set-Cookie: PHPSESSID2=ko67tfga36gpvrkk0rtqga4g94; path=/; domain=.domain.com
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Set-Cookie: __PAGE_REFERRER=deleted; expires=Thu, 01-Jan-1970 00:00:01 GMT; Max-Age=0; path=/; domain=www.domain.com
Set-Cookie: __PAGE_SITE_REFERRER=deleted; expires=Thu, 01-Jan-1970 00:00:01 GMT; Max-Age=0; path=/; domain=www.domain.com
X-Repository: legacy
X-App-Server: production-web23:8018
X-App-Server: distil2-kvm:80

doulv8162 2017-02-22 15:46
Add the following to your code to help debug the issue:

$info = curl_getinfo($ch); print_r( $info );

More than likely, the issues are as follows:

405 Not Allowed - the cURL call you are trying to make is not allowed, e.g. making a GET call when only POST is permitted.

443: Connection refused - the site you are trying to access does not support HTTPS, or it uses cryptographic protocols not supported by your code, e.g. it accepts only TLSv1.2 while your code may be using TLSv1.1.
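As a small extension of the `curl_getinfo()` suggestion, the two cases above can be told apart from the info array itself: a connection failure leaves `http_code` at 0, while a 405 carries a real status code. The helper below is only an illustration with a made-up name, `describe_curl_result`.

```php
<?php
/**
 * Hypothetical helper: turn curl_getinfo() output plus curl_error()
 * into a one-line summary suitable for error_log().
 */
function describe_curl_result(array $info, string $error = ''): string
{
    $code = (int) ($info['http_code'] ?? 0);
    $url  = (string) ($info['url'] ?? '');

    if ($code === 0) {
        // No HTTP response at all, e.g. "Connection refused" on port 443.
        return "transport error for $url: " . ($error !== '' ? $error : 'unknown');
    }
    if ($code === 405) {
        return "HTTP 405 (Method Not Allowed) for $url";
    }
    return "HTTP $code for $url";
}
```

It would be called before `curl_close($ch)`, for example `error_log(describe_curl_result(curl_getinfo($ch), curl_error($ch)));`.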
This answer was selected by the asker as the best answer.
