抓第二页答案; 卷曲

I'm crawling a few websites, everything it's working fine,

but .... I have one specific website that I'm trying to crawl, and it's making a few "redirects" before landing to the web I want.

So it's something like ...

http://www.example.com/?day=01/01/2016&action=search_prices

this will go to http://www.example.com/search/default.aspx take a few seconds to search the answer page and then show it on there.

Is there any way to easily do this? any hint, clue, etc would be awesome

Simple code right now (almost all the sites I was crawling were jsons):

function get_web_page( $url ){
        $options = array(
            CURLOPT_RETURNTRANSFER => true,     // return web page
            CURLOPT_HEADER         => false,    // don't return headers
            CURLOPT_FOLLOWLOCATION => true,     // follow redirects
            CURLOPT_ENCODING       => "",       // handle all encodings
            CURLOPT_USERAGENT      => "spider", // who am i
            CURLOPT_AUTOREFERER    => true,     // set referer on redirect
            CURLOPT_CONNECTTIMEOUT => 120,      // timeout on connect

            CURLOPT_HTTPHEADER     => array('HeaderName: HeaderValue'),

            CURLOPT_TIMEOUT        => 120,      // timeout on response
            CURLOPT_MAXREDIRS      => 10,       // stop after 10 redirects
            CURLOPT_SSL_VERIFYPEER => false     // Disabled SSL Cert checks
        );

        $ch      = curl_init( $url );
        curl_setopt_array( $ch, $options );
        $content = curl_exec( $ch );
        $err     = curl_errno( $ch );
        $errmsg  = curl_error( $ch );
        $header  = curl_getinfo( $ch );
        curl_close( $ch );

        $header['errno']   = $err;
        $header['errmsg']  = $errmsg;
        $header['content'] = $content;
        return $header;
}

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

报告相同问题？

关注问题

卷曲，没有获取页面源代码 php
2017-01-10 07:22

回答 1 已采纳 I have modified your code.. Try this. Remove header('content-type:text/plain'); and replace you
PHP和Curl：从多页卷曲请求中获取所有结果 php
2017-10-23 09:51

回答 1 已采纳 make an array that holds all results you've recieved thus far. then start on the first page. fetc
卷曲页面不起作用 php
2016-03-10 17:00

回答 2 已采纳 i GUESS its because its detecting the 404 error and thus want to return an "error" (bool(false)?)
PHP API
2021-02-12 06:59

第二个客户带着已付款的订单返回到商店。系统减少库存负数（-1）的项目数。为什么会这样：当多个消费者同时结帐同一产品时，这就是竞争状态。成功付款后没有库存检查，导致库存数量错误建议的解决方案： ...
卷曲php没有加载网站样式 php
2017-08-09 12:58

回答 1 已采纳 Because curl only gets html (source code) of specified url and i think styles are addressed relati
PHP卷曲 - 发送证书的正确方法？ php
2018-08-31 14:42

回答 1 已采纳 I was using "HTTP", and to send the certificate on the request, i need to use "HTTPS". Don't need
PHP在Foreach循环中运行卷曲函数 php
2017-04-11 20:26

回答 1 已采纳 One solution that comes to mind is using array_chunk() to break $rmobiles into multiple arrays wit
BookmarkerAPI:简单的 Laravel PHP Rest API 将您的书签存储在您的数据库中
2021-06-04 04:01

如果你不想开发或者你已经安装了 laravel 和 composer 只需转到第 5 步。 1. 安装作曲家带卷曲： curl -sS https://getcomposer.org/installer | php 没有卷曲： php -r "readfile('...
PHP杀死正在进行的多卷曲请求 http nginx php
2017-08-03 19:52

回答 1 已采纳 well, you should be able to cancel them at will with a CURLOPT_PROGRESSFUNCTION, have a global var
复杂（卷曲）语法PHP [重复] php
2017-01-07 16:53

回答 1 已采纳 Your string has to be placed in double quotes (") instead of single quotes ('). Otherwise variable
PHP网络工具 - 卷曲[关闭] php
2014-01-01 14:43

回答 1 已采纳 You can look at http://it2.php.net/curl_setopt to see all the possible options available on PHP'
oauth2 demo php,OAuth2 Demo PHP
2021-04-11 11:40

投研双杰的博客如果这是你第一次来这里，试图尝试的现场演示让OAuth2.0流更好的感觉。这个图书馆是oauth2服务器运行PHP库。安装使用Composer安装这个应用程序:$ git clone git://github.com/bshaffer/oauth2-demo-php.git$ cd ...
显示卷曲数据，空白页 php
2016-06-10 14:52

回答 1 已采纳 You shuld add CURLOPT_USERPWD curl_setopt($curl, CURLOPT_USERPWD, "YOUR API KEY"); #Add your api
TMDb-PHP-API:***已弃用-该存储库不再维护。请阅读自述文件以了解更多信息和替代方法
2021-05-10 20:50

进行的第二个原因很简单：我喜欢他们在所做的工作。他们提供了一个出色的API，因此每个人都可以使用他们的数据库来制作出色的应用程序。现在有一个新的API v3，它也受支持。您可以的找到旧版本。要求PHP 5.2.x或...
php wait for,在繼續之前如何讓php等待卷曲完成？ - How do I make php wait for curl to finish before continuing? - 开发...
2021-04-09 12:06

凌溪每天哈哈哈的博客第二次。我的猜測是,當執行filesize()時,curl仍在下載。 2 个解决方案 #1 2 Note that functions like filesize() cache their result, try adding a call to clearstatcache() above 'print filesize(...);'. Here...
php curl 连接超时,PHP curl连接超时错误
2021-03-23 13:52

Leung Rick的博客我在PHP中使用curl调用API,有时它工作正常,有时我得到 Failed to connect to api-domain.com port 80: Connection timed out这有点奇怪,有时它正在工作,有时它不是.要解决我打印的问题,curl_getinfo()当它不工作时,...
brew php curl 扩展,Home Brew PHP 7.2.5使用cURL安装
2021-04-25 11:56

weixin_39628105的博客我的Mac上有一个本地主机开发环境,它使用自制软件的php公式,我试着用cURL的自定义路径安装,而不是使用SecureTransport for SSL的默认Mac OS版本(v7.54.0)....使用OpenSSL通过家庭酿造安装卷曲：brew instal...
php curl同时执行post,php – Curl POST作为GET执行
2021-04-20 00:44

篝火营地的博客我正在尝试使用PHP开发一种浏览器.到目前为止,我的班级可以使用以下内容类型处理GET或POST请求：application / x-www-form-urlencoded.现在我需要转向JSON.我已将Content-Type标头设置为application / json.事实是,...
php7 ext skel_PHP7.3 正式发布
2021-03-22 23:53

VS华的博客 2018-12-06号，php7.3.0正式发布。改动列表及新特性：核心：改进了PHP GC。重新设计了用PHP编写的旧的ext_skel程序，运行：'php ext_skel.php'获取所有选项。这意味着没有依赖关系，因此它可以在Windows上开箱即用。...
php 获取pdf中的图片,使用PHP从PDF中提取图像
2021-04-13 11:50

北川格林的博客 AFAIK,没有PHP模块可以做到.有一个命令行工具,pdfimages(xpdf的一部分).作为参考,这是如何工作的：pdfimages -j source.pdf image这将从source.pdf中提取所有图像为image-000.jpg,image-001.jpg等.请注意,输出格式...
没有解决我的问题, 去提问

悬赏问题

¥100 任意维数的K均值聚类
¥15 stamps做sbas-insar，时序沉降图怎么画
¥15 unity第一人称射击小游戏，有demo，在原脚本的基础上进行修改以达到要求
¥15 买了个传感器，根据商家发的代码和步骤使用但是代码报错了不会改，有没有人可以看看
¥15 关于#Java#的问题，如何解决？
¥15 加热介质是液体，换热器壳侧导热系数和总的导热系数怎么算
¥100 嵌入式系统基于PIC16F882和热敏电阻的数字温度计
¥15 cmd cl 0x000007b
¥20 BAPI_PR_CHANGE how to add account assignment information for service line
¥500 火焰左右视图、视差（基于双目相机）

码龄粉丝数原力等级 --

抓第二页答案; 卷曲

0条回答默认最新

悬赏问题

抓第二页答案; 卷曲

0条回答 默认 最新

悬赏问题

0条回答默认最新