PHP Curl重定向后

I'm trying to be a bit sneeky and as part of a learning process try and improve my page scraping skills.

One thing i've come across that I have yet to be able to solve is that certain sites will use an internal link which then redirects to an external link.

What I want to do is modify some curl code to follow the redirects until they stop and then obtain the final resting place URL.

Anyone recommend some code for me?

I have this at the moment, but it's not following the redirects properly at the moment.

        $opts = array(CURLOPT_URL => $url,
                      CURLOPT_RETURNTRANSFER => true,
                      CURLOPT_HEADER => true,
                      CURLOPT_FOLLOWLOCATION => true);      

        $curl = curl_init(); 
        curl_setopt_array($curl, $opts);  
        $str = curl_exec($curl);  
        curl_close($curl);

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

dongshengheng1013 2012-04-23 20:59

关注

http.//php.net/manual/en/ref.curl.php

   function get_final_url( $url, $timeout = 5 )
 {
    $url = str_replace( "&amp;", "&", urldecode(trim($url)) );

   $cookie = tempnam ("/tmp", "CURLCOOKIE");
$ch = curl_init();
curl_setopt( $ch, CURLOPT_USERAGENT, "Mozilla/5.0 (Windows; U; Windows NT 5.1; rv:1.7.3) Gecko/20041001 Firefox/0.10.1" );
curl_setopt( $ch, CURLOPT_URL, $url );
curl_setopt( $ch, CURLOPT_COOKIEJAR, $cookie );
curl_setopt( $ch, CURLOPT_FOLLOWLOCATION, true );
curl_setopt( $ch, CURLOPT_ENCODING, "" );
curl_setopt( $ch, CURLOPT_RETURNTRANSFER, true );
curl_setopt( $ch, CURLOPT_AUTOREFERER, true );
curl_setopt( $ch, CURLOPT_CONNECTTIMEOUT, $timeout );
curl_setopt( $ch, CURLOPT_TIMEOUT, $timeout );
curl_setopt( $ch, CURLOPT_MAXREDIRS, 10 );
$content = curl_exec( $ch );
$response = curl_getinfo( $ch );
curl_close ( $ch );

if ($response['http_code'] == 301 || $response['http_code'] == 302)
{
    ini_set("user_agent", "Mozilla/5.0 (Windows; U; Windows NT 5.1; rv:1.7.3) Gecko/20041001 Firefox/0.10.1");
    $headers = get_headers($response['url']);

    $location = "";
    foreach( $headers as $value )
    {
        if ( substr( strtolower($value), 0, 9 ) == "location:" )
            return get_final_url( trim( substr( $value, 9, strlen($value) ) ) );
    }
}

if (    preg_match("/window\.location\.replace\('(.*)'\)/i", $content, $value) ||
        preg_match("/window\.location\=\"(.*)\"/i", $content, $value)
)
{
    return get_final_url ( $value[1] );
}
else
{
    return $response['url'];
   }
}

本回答被题主选为最佳回答 , 对您是否有帮助呢?

查看更多回答(1条)

报告相同问题？

关注问题

PHP cURL重定向到另一个URL php
2017-09-16 03:19

回答 1 已采纳 Try to follow the redirect with CURLOPT_FOLLOWLOCATION. curl_setopt($ch, CURLOPT_FOLLOWLOCATION,
使用Curl PHP获得最终重定向 php
2016-01-27 17:35

回答 1 已采纳 Use curl_getinfo() with CURLINFO_REDIRECT_URL or CURLINFO_EFFECTIVE_URL depending on your use case
使用cURL抓取重定向的网址 php
2019-02-22 05:48

回答 2 已采纳 this page does not use a redirect-scheme that libcurl understands (it uses a html <meta http-eq
php curl重定向问题,phpcurl重定向问题
2021-05-08 02:05

郑耀东的博客刚接触curl想用curl post信息，并且重定向到目的页面(带着发送的POST信息重定向)"bar", "query" => "Nettuts", "action" => "Submit");$ch = curl_init();curl_setopt($ch, CURLOPT_URL, $url); curl_setopt($...
可以cUrl重定向到URL（帖子）页面？ php
2017-01-04 17:23

回答 2 已采纳 curl is for calling a URL and fetching the response. If you echo it to the browser then the brows
如何在PHP中使用cURL发送post请求后获取/重定向到下一页？ php
2014-11-07 21:59

回答 1 已采纳 As 'singin1.php' page is using redirection with header, and session. It is compulsory to tell cURL
php curl关注重定向？ php
2010-11-29 00:11

回答 2 已采纳 Curl doesn't follow redirects by default. If you're running curl from the command line, you need
php curl 重定向内容,PHP cURL重定向到本地主机
2021-04-23 16:15

李昦的博客我正在尝试使用带有cURL的php脚本登录到外部网页.我是cURL的新手,所以我觉得我缺少很多东西.我找到了一些示例并将其修改为允许访问https页面.最终,我的目标是能够登录到页面并通过登录后遵循指定的链接下载.csv.到...
登录站点并在登录后重定向到php curl中的页面 php
2016-09-22 05:58

回答 1 已采纳 $cookie = getcwd().DIRECTORY_SEPARATOR.'cookie.txt'; curl_setopt($ch, CURLOPT_COOKIEJAR, $cookie);
PHP cURL从t.co获取最后一个重定向URL php
2015-07-31 09:24

回答 1 已采纳 function unshorten_url($url) { $ch = curl_init($url); curl_setopt_array($ch, array( CURLOP
curl获取重定向网页的内容 php
2011-11-01 23:58

回答 1 已采纳 There was a problem with javascript. It stucked on an 'in-between'-page and I had to make another
php curl重定向,PHP：cURL并跟踪所有重定向
2021-04-14 01:43

橙子青提的博客这意味着cURL将遵循重定向并仅返回没有Location头的最终页面.要手动关注位置：function getWebPage($url, $redirectcallback = null){$ch = curl_init($url);curl_setopt($ch, CURLOPT_RETURNTRA...
php curl重定向登录,PHP Curl重定向后
2021-04-22 18:00

冉桥兵的博客 http.//php.net/manual/en/ref.curl.phpfunction get_final_url( $url, $timeout = 5 ){$url = str_replace( "&", "&", urldecode(trim($url)) );$cookie = tempnam ("/tmp", "CURLCOOKIE");$ch = curl_init...
php curl 获取重定向地址,CURL获取重定向URL
2021-03-23 21:45

CodeStar的博客使用CURL获取下面链接重定向URL：...
php 禁止 curl,如何防止使用PHP cURL重定向
2021-04-05 08:18

张老三丶的博客我正在开发一个表单,其中要求将收集的...我正在运行的问题是,一旦提交表单,就会执行cURL,但我将被重定向到我指定的域.相反,我想将用户重定向到我的域内的确认页面而不是第三方网站.这是我正在使用的代码示例：$URL=...
php curl批量请求url
2024-02-26 17:34

在PHP开发中，cURL库是一个非常强大的工具，用于处理HTTP和其他协议的网络请求。它允许程序员模拟浏览器的行为，发送GET、POST等不同类型的HTTP请求，甚至可以处理HTTPS、cookies、HTTP头等复杂情况。本篇文章将深入...
php curl 获取重定向地址,php curl获取重定向Location后的url
2021-03-23 21:45

一二三是五六十mk~的博客 php curl获取重定向 301 302 Location后的url值，使用到curl中的curl_setopt($ch, CURLOPT_FOLLOWLOCATION, 1);设置和curl_getinfo($ch,CURLINFO_EFFECTIVE_URL);来获取url。CURLOPT_FOLLOWLOCATION :TRUE 时将会...
没有解决我的问题, 去提问

悬赏问题

¥15 如何让企业微信机器人实现消息汇总整合
¥50 关于#ui#的问题：做yolov8的ui界面出现的问题
¥15 如何用Python爬取各高校教师公开的教育和工作经历
¥15 TLE9879QXA40 电机驱动
¥20 对于工程问题的非线性数学模型进行线性化
¥15 Mirare PLUS 进行密钥认证？（详解）
¥15 物体双站RCS和其组成阵列后的双站RCS关系验证
¥20 想用ollama做一个自己的AI数据库
¥15 关于qualoth编辑及缝合服装领子的问题解决方案探寻
¥15 请问怎么才能复现这样的图呀

码龄粉丝数原力等级 --

PHP Curl重定向后

2条回答默认最新

码龄粉丝数原力等级 --

悬赏问题

PHP Curl重定向后

2条回答 默认 最新

悬赏问题

2条回答默认最新