dongwenhui8900 asked 2019-04-19 18:54

How to get images as an array using file_get_contents

I have the following problem with getting images as an array. In this code I check whether images exist for the search term Test 1: if yes, display them; if not, fall back to Test 2 and stop there. The current code works, but it is very slow.

The check if (sizeof($matches[1]) > 3) { exists because the first few matches on the crawled website are sometimes advertisements, so this is my safeguard to skip them.

My question is: how can I speed up the code below so that the if (sizeof($matches[1]) > 3) { check is reached faster? I believe this is what makes the code so slow, because the array may contain up to 1000 images.

$get_search = 'Test 1';
$ch_foreach = 0; // becomes 1 once a search term yields enough results
$tmp = 0;        // counts how many matches have been checked

$html = file_get_contents('https://www.everypixel.com/search?q=' . urlencode($get_search) . '&is_id=1&st=free');
preg_match_all('|<img.*?src=[\'"](.*?)[\'"].*?>|i', $html, $matches);

if (sizeof($matches[1]) > 3) {
  $ch_foreach = 1;
}

if ($ch_foreach == 0) {

  // first term yielded nothing useful, fall back to the second search term
  $get_search = 'Test 2';

  $html = file_get_contents('https://www.everypixel.com/search?q=' . urlencode($get_search) . '&is_id=1&st=free');
  preg_match_all('|<img.*?src=[\'"](.*?)[\'"].*?>|i', $html, $matches);

  if (sizeof($matches[1]) > 3) {
    $ch_foreach = 1;
  }

}

foreach ($matches[1] as $match) {
  if ($tmp++ >= 20) {
    break; // only consider the first 20 matches
  }
  if (@getimagesize($match)) {
    // display image
    echo $match;
  }
}

1 answer

  • duanchuonong5370 answered 2019-04-20 08:26
    $html = file_get_contents('https://www.everypixel.com/search?q=' . urlencode($get_search) . '&is_id=1&st=free');
    

    Unless the www.everypixel.com server is on the same LAN (in which case the compression overhead may cost more than transferring the page uncompressed), curl with CURLOPT_ENCODING should fetch the page faster than file_get_contents. Even on the same LAN, curl should still be faster, because file_get_contents keeps reading until the server closes the connection, while curl stops as soon as Content-Length bytes have been read, which is quicker than waiting for the server to close the socket. So do this instead:

    $ch = curl_init('https://www.everypixel.com/search?q=' . urlencode($get_search) . '&is_id=1&st=free');
    curl_setopt_array($ch, array(
        CURLOPT_ENCODING       => '', // accept any compression curl supports (gzip, deflate, ...)
        CURLOPT_RETURNTRANSFER => 1,  // return the body from curl_exec() instead of printing it
    ));
    $html = curl_exec($ch);
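
    A small addition of my own (not part of the original answer): check that the transfer actually succeeded before parsing the result:

    // my addition, not part of the original answer: fail loudly on transport errors
    if ($html === false) {
        die('curl error: ' . curl_error($ch));
    }
    curl_close($ch);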
    

    About your regex:

    preg_match_all('|<img.*?src=[\'"](.*?)[\'"].*?>|i', $html, $matches);
    

    DOMDocument with getElementsByTagName("img") and getAttribute("src") should be faster than using your regex, so do this instead:

    $domd = new DOMDocument();
    @$domd->loadHTML($html); // @ suppresses warnings from real-world malformed HTML
    $urls = [];
    foreach ($domd->getElementsByTagName("img") as $img) {
        $url = $img->getAttribute("src");
        if (!empty($url)) {
            $urls[] = $url;
        }
    }
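
    One caveat from me (not in the original answer): img src values are often relative or protocol-relative, and getimagesize() needs absolute URLs, so normalize them first. A minimal sketch, assuming the page lives on https://www.everypixel.com:

    // my sketch: make protocol-relative and root-relative src values absolute
    foreach ($urls as $i => $url) {
        if (strpos($url, '//') === 0) {
            $urls[$i] = 'https:' . $url;                     // e.g. //cdn.example.com/img.jpg
        } elseif (strpos($url, '/') === 0) {
            $urls[$i] = 'https://www.everypixel.com' . $url; // e.g. /images/img.jpg
        }
    }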
    

    And probably the slowest part of your entire code: the @getimagesize($match) inside a loop over potentially 1000+ URLs. Every call to getimagesize() with a URL makes PHP download the image, and it uses the same mechanism as file_get_contents, so it suffers from the same Content-Length issue that makes file_get_contents slow. On top of that, the images are downloaded sequentially; downloading them in parallel should be much faster, which can be done with the curl_multi API. Doing that properly is a complex task and I won't write a full example here, but I can point you to one: https://stackoverflow.com/a/54717579/1067003
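
    For a rough idea of the shape of such code, here is a minimal curl_multi sketch of my own (an outline under assumptions, not the code from the linked answer): it sends parallel HEAD requests and keeps only the URLs whose Content-Type starts with image/. Some servers reject HEAD requests, so treat this as a starting point rather than a drop-in solution:

    // my sketch: validate image URLs in parallel instead of calling getimagesize() one by one
    function filter_image_urls(array $urls): array {
        $mh = curl_multi_init();
        $handles = [];
        foreach ($urls as $url) {
            $ch = curl_init($url);
            curl_setopt_array($ch, array(
                CURLOPT_NOBODY         => true, // HEAD request: headers only, no image body
                CURLOPT_RETURNTRANSFER => true,
                CURLOPT_FOLLOWLOCATION => true,
                CURLOPT_TIMEOUT        => 10,
            ));
            curl_multi_add_handle($mh, $ch);
            $handles[] = array($url, $ch);
        }
        // drive all transfers concurrently
        do {
            curl_multi_exec($mh, $running);
            if (curl_multi_select($mh) === -1) {
                usleep(10000); // avoid a busy loop if select() fails
            }
        } while ($running > 0);
        $images = [];
        foreach ($handles as list($url, $ch)) {
            $type = curl_getinfo($ch, CURLINFO_CONTENT_TYPE);
            if (is_string($type) && strpos($type, 'image/') === 0) {
                $images[] = $url;
            }
            curl_multi_remove_handle($mh, $ch);
            curl_close($ch);
        }
        curl_multi_close($mh);
        return $images;
    }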

