PHP Multi-cURL请求延迟到超时

Summary

I have some PHP 5.4 code which fetches a batch of Facebook/Instagram photos in parallel using multi curl. This code has been working for years, and nothing has changed as far as I can tell.

I add multiple curl requests to a 'multi' request. Each curl request gets a CURLOPT_TIMEOUT. The problem I'm seeing is that, all of a sudden, some of my requests don't complete until this timeout is reached (no matter what timeout I set).

Code

I do something like this (simplified):

do {
    while (CURLM_CALL_MULTI_PERFORM === curl_multi_exec($mh, $running));

    // Wait for activity on any curl-connection (optional, reduces CPU)
    curl_multi_select($mh);

    // a request was just completed -- find out which one
    while($done = curl_multi_info_read($mh))
    {
        $completedCurlRequest = $done['handle'];

        //save the file
        do_some_work(completedCurlRequest);

        curl_multi_remove_handle($mh, $completedCurlRequest);
    }
} while ($running);

I use this script to run batches of about 40 parallel requests to fetch some images (from Facebook). Most of them take about 500ms to complete. However, a few of the requests "hang" (until the CURLOPT_TIMEOUT) before they arrive.

Basically the curl_multi_select step takes the entire timeout. Or, if I remove that curl_multi_select line, the outer loop spins (burning CPU) until the timeout.

Considerations

It doesn't matter what the timeout is - if I set the timeout to 30s, they arrive after 30 seconds, If I set the timeout to 1s, they arrive after 1s!
This is a really sudden change that does not correlate with any code release - it was all working fine up until 30th Jan 2019, but on the 31st it suddenly stopped working.
This isn't easy to reproduce, as it only affects an image once. If I repeat it for a batch of images I already fetched, it works fine the next time round.
It affects both Facebook and Instagram images, so I think the issue must be to do with my code or my server (and not Facebook or Instagram), as they wouldn't have both changed something simultaneously.

Questions

Am I doing something wrong in my use of multi-curl that could cause this? (but if so, what's changed?)
Have Facebook and Instagram changed anything that might cause this?
Could something on my server have changed to trigger this?
How can I debug this?

Update Here is the what I get back from a slow request when it finally completes:

INFO

"content_type": "image/jpeg",
"http_code": 200,
"header_size": 377,
"request_size": 180,
"total_time": 15.001012,    //<----- Total time == CURLOPT_TIMEOUT
"namelookup_time": 0.007149,
"connect_time": 0.12018,
"pretransfer_time": 0.441911,
"size_download": 40714,
"speed_download": 2714,
"download_content_length": -1,   //<------Not set

HEADER

HTTP/2 200 
content-type: image/jpeg
x-haystack-needlechecksum: 3529661797
timing-allow-origin: *
access-control-allow-origin: *
cache-control: max-age=1209600, no-transform
date: Mon, 04 Feb 2019 14:04:17 GMT
access-control-expose-headers: X-FB-CEC-Video-Limit

It is missing the content-length header, but that always seems to be the case the first time a file is fetched. Only 1 or 2 of the 50 parallel requests are slow, yet all of the requests are missing their content length headers.

If I fetch the same file again, it is much quicker, and I do see content length being set this time

INFO

"download_content_length": 52721,

HEADER

content-length: 52721

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongshi1914 2019-02-04 09:28
关注
My current theory is that there is a bug in Facebook fileserver that means the connection is sometimes not being closed even though the data has been sent, so the connection stays open until the timeout. In the absence of the (optional) content-length header being sent by Facebook's fileserver, cURL can't know if the payload is complete, and so hangs.

My current solution is to 'prime' the fileserver by first making a request for the image without a body, like this:

$ch = curl_init(); curl_setopt($ch, CURLOPT_URL, $url); curl_setopt($ch, CURLOPT_NOBODY, 1); curl_exec($ch);

This is a pretty quick process, since there is no image being returned. I actually do this in the background using asynchronous multi curl, so I can get on with doing some other processing.

After priming the fileserver, subsequent requests for the files are even quicker than before, as the content-length is known.

This is a bit of a clumsy approach, but in the absence of any response from Facebook so far I'm not sure what else to do.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

PHP Multi-cURL请求延迟到超时 php
2019-02-01 17:50

回答 1 已采纳 My current theory is that there is a bug in Facebook fileserver that means the connection is somet
PHP - cURL请求不起作用 php
2014-10-16 07:22

回答 1 已采纳 You need to pass user agent $defaults = array( CURLOPT_URL => "https://www.facebook.com/video.
php - 跨curl请求维护会话 php
2017-11-16 12:44

回答 1 已采纳 I'm struggling to imagine how this runs at all. However these session variables do not reflect
PHP实现的curl批量请求操作示例
2020-12-19 23:55

为了防止因网络延迟导致的阻塞，我们可以配合`curl_multi_select()`来检查是否有活动的连接： ```php do { $mrc = curl_multi_exec($mh, $active); } while ($mrc == CURLM_CALL_MULTI_PERFORM); while ($...
php-curl-class检查登录是否正常 php
2019-02-25 07:39

回答 1 已采纳 Okay, thanks to all for like to help. Now i find by myself the solution. ;-) It's all the time: "9
通过PHP Curl调用Soap WebService时超时 php xml
2019-04-04 15:54

回答 2 已采纳 Alright, after many attempts the issue laid on the web service itself, their testing tables were o
php传递var到curl请求 php
2016-09-24 04:54

回答 1 已采纳 Here is what you can use. Your JSON string is not got the correct quotations. I personally find it
php的curl非堵塞调用类.zip
2019-07-11 09:52

2. **cURL多路复用**：通过cURL的`curl_multi_init`、`curl_multi_add_handle`和`curl_multi_exec`等函数，可以实现对多个cURL句柄的并发处理。这使得程序能够在同一时间发送多个HTTP请求，提高执行效率。 3. **...
来自API的curl请求与php php
2018-05-04 10:25

回答 4 已采纳 First enable curl for your xamp. To do so follow the steps 1. Go to C:\Program Files\xampp\php\ph
PHP cURL请求之间的multi_exec延迟 php
2011-08-08 19:21

回答 4 已采纳 Don't think you can. If you run this from the cli, you could instead fork your script into 10 proc
无法在ubuntu 16.10上再次安装php5.6-curl。 ppa被添加。 php ubuntu
2017-08-13 17:17

回答 1 已采纳 guys from askubuntu helped. Thing was obvious. Just change ppa repo to xenial from yakkety and it
php5.3 curl超时,PHP中curl设置毫秒级超时的问题
2021-03-25 10:45

HANCVS 韓的博客 DRPHP5/7加上7.19的libcurl，设置低于1s的超时时间时，curl_exec仍会执行超过1s以上。原因在于此版本的libcurl实现逻辑上以1000ms作为curl_exec中poll系统调用的超时值。问题某些HTTP接口响应时间可能因为种种原因会...
PHP - 从curl响应中获取特定值 json php
2019-03-05 05:09

回答 1 已采纳 <?php $url = 'hxxp://domain.com/univ/v8?q=tas+wanita'; $ch=curl_init($url);
php curl 并发访问,PHP并发之用curl 并发减少后端访问时间
2021-04-26 14:03

萬重的博客本篇文章给大家分享的内容是关于PHP并发之用curl 并发减少后端访问时间，有着一定的参考价值，有需要的朋友可以参考一下首先，先了解下 php中的curl多线程函数：# curl_multi_add_handle# curl_multi_close# curl_...
php curl 模拟多线程,利用curl 多线程模拟并发的详解_PHP教程
2021-04-23 16:51

裙主的博客首先，先了解下 php中的curl多线程函数：复制代码代码如下:# curl_multi_add_handle# curl_multi_close# curl_multi_exec# curl_multi_getcontent# curl_multi_info_read# curl_multi_init# curl_multi_remove_...
PHP---CURL并发访问链接
2017-06-23 15:31

luyaran的博客首先，先了解下 PHP中的curl多线程函数： # curl_multi_add_handle # curl_multi_close # curl_multi_exec # curl_multi_getcontent # curl_multi_info_read # curl_multi_init # curl_multi_remove_handle ...
使用PHP cURL设置超时时间并解决超时异常
2023-10-04 19:51

MdlForward的博客通过正确设置cURL的超时时间，并采取一些额外的措施，如增加超时时间、优化网络环境、检查目标服务器状态或使用并发请求，可以解决cURL超时异常的问题。在使用PHP的cURL库进行网络请求时，设置超时时间是一个常见的...
没有解决我的问题, 去提问

悬赏问题

¥15 关于arduino编程toCharArray()函数的使用
¥100 vc++混合CEF采用CLR方式编译报错
¥15 coze 的插件输入飞书多维表格 app_token 后一直显示错误，如何解决？
¥15 vite+vue3+plyr播放本地public文件夹下视频无法加载
¥15 c#逐行读取txt文本，但是每一行里面数据之间空格数量不同
¥50 如何openEuler 22.03上安装配置drbd
¥20 ING91680C BLE5.3 芯片怎么实现串口收发数据
¥15 无线连接树莓派，无法执行update，如何解决？（相关搜索：软件下载）
¥15 Windows11, backspace, enter, space键失灵
¥15 cfx离心泵非稳态计算

PHP Multi-cURL请求延迟到超时

1条回答 默认 最新

悬赏问题

1条回答默认最新