curl将重定向的url放入浏览器的地址

I am pretty new to cURL and have only been using it for a short time. My problem is that I want to get the content of a page (file_get_content() doesn't work) by using cURL. Unfortunately, the site in question has bot protection, meaning it checks whether you are a bot or not when you first arrive at the site. If you are not a bot it will redirect you to the real site with an absolute path (I guess). Whenever I load this site with cURL it appends the path to my server address.

For example: My server has the address: http://examplepage.com/ cURL appends the redirected path to my URL. So it would be something like: http://examplepage.com/absolute/path?with=parameters

On the original page, where I try to get the content from, it works because they have a path like that but I do not (I want some html-content of theire site).

Here is my code so far:

    <?php

  /* getting site */
  $website = "https://originalsite.com/?some=parameters";
  $redirectURL;

  function curl_download($url) {
    //initialize curl handler
    $c = curl_init();

    // Include header in result? (0 = yes, 1 = no)
    curl_setopt($c, CURLOPT_HEADER, 1);

    //set url to download
    curl_setopt($c, CURLOPT_URL, $url);

    // follow redirection
    curl_setopt($c, CURLOPT_FOLLOWLOCATION, 1);

    //set referer
    curl_setopt($c, CURLOPT_REFERER, "https://originalsite.com/");

    // User agent
    curl_setopt($c, CURLOPT_USERAGENT, "MozillaXYZ/1.0");

    // Should cURL return or print out the data? (true = return, false = print)
    curl_setopt($c, CURLOPT_RETURNTRANSFER, 1);

    // Timeout in seconds
    curl_setopt($c, CURLOPT_TIMEOUT, 10);

    // Download the given URL, and return output
    $output = curl_exec($c);

    // Close the cURL resource, and free system resources
    curl_close($c);

    return $output;
  }

  $content = curl_download($website);

  echo $content;

?>

so it'll enter the site where it checks whether I am a bot or not and after that it redirects me to the site (or it least, it tries to).

I have searched the internet and StackOverflow but I couldn't find an answer to my problem.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
drvxclagw656708070 2017-02-13 14:43
关注
What's happening is that there is some JavaScript code issuing a redirect once you render the page. Try disabling JavaScript in your browser for a quick test.

本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

PHP cURL重定向到另一个URL php
2017-09-16 03:19

回答 1 已采纳 Try to follow the redirect with CURLOPT_FOLLOWLOCATION. curl_setopt($ch, CURLOPT_FOLLOWLOCATION,
可以cUrl重定向到URL（帖子）页面？ php
2017-01-04 17:23

回答 2 已采纳 curl is for calling a URL and fetching the response. If you echo it to the browser then the brows
PHP cURL从t.co获取最后一个重定向URL php
2015-07-31 09:24

回答 1 已采纳 function unshorten_url($url) { $ch = curl_init($url); curl_setopt_array($ch, array( CURLOP
php curl批量请求url
2024-02-26 17:34

3. **使用队列**：将URL放入队列，按照一定的速率（例如每秒几个）从队列中取出URL进行请求。 4. **使用API令牌**：如果对方提供了API令牌系统，确保每个请求都携带正确的令牌，并遵守令牌的使用规则。 5. **监控...
使用cURL抓取重定向的网址 php
2019-02-22 05:48

回答 2 已采纳 this page does not use a redirect-scheme that libcurl understands (it uses a html <meta http-eq
PHP Curl在浏览器中返回不同的URL结果 php
2018-06-01 11:31

回答 2 已采纳 Because JavaScript is the root of all evil. the website gets the search results you want with AJAX
使用Curl PHP获得最终重定向 php
2016-01-27 17:35

回答 1 已采纳 Use curl_getinfo() with CURLINFO_REDIRECT_URL or CURLINFO_EFFECTIVE_URL depending on your use case
php的header传参数,php如何使用curl设置header头传参
2021-04-27 00:42

weixin_39822629的博客 php如何使用curl设置header头传参,参数,浏览器,自定义,数据,下划线php如何使用curl设置header头传参易采站长站，站长之家为您整理了php如何使用curl设置header头传参的相关内容。php curl设置header的方法：首先初始...
在curl php中获取最后一个重定向的url php
2013-12-14 11:29

回答 3 已采纳 Thank you everyone for helping me in my situation. Actually I want to develop a scrapper in php f
PHP Curl重定向后 php
2012-04-23 20:53

回答 2 已采纳 http.//php.net/manual/en/ref.curl.php function get_final_url( $url, $timeout = 5 ) {
cUrl无法访问文件但浏览器无法访问 php
2017-02-28 13:54

回答 1 已采纳 try this one to view your link image contents <?php header("Content-Type: image/jpeg"); $url
php curl密码控件,检索通过curl传递的用户名，密码参数 - php
2021-05-08 06:58

少横的博客我将参数发送到页面，如下所示：$curl = curl_init('http://localhost/sample.php');curl_setopt($curl, CURLOPT_RETURNTRANSFER, 1);curl_setopt($curl, CURLOPT_USERPWD, 'key:123456');curl_s...
PHP中CURL方法curl_setopt()函数的参数
2020-07-15 17:52

Light_zhao的博客 curl_setopt()函数将为一个CURL会话设置选项。option参数是你想要的设置，value是这个选项给定的值。下列选项的值将被作为长整形使用(在option参数中指定)： ? CURLOPT_INFILESIZE : 当你上传一个文件到远程站点，...
php判断url是否为图片,睿智的小狗 - PHP判断一个URL是否为可用图片链接
2021-03-24 13:54

毛如意SAMA的博客以下列举通过php判断链接类型的3种方法方式一直接正则匹配URL链接，是否是以.png,.gif,.jpg,.jpeg结尾的。preg_match('/.*(\.png|\.jpg|\.jpeg|\.gif)$/'...方式二用CURL获取图片URL的response header首先创建一个cu...
php post数据 url编码,关于http：我应该对POST数据进行URL编码吗？
2021-04-10 10:14

北决霓音的博客我正在将数据发布到外部API(如果相关，则使用PHP)。是否应该对传递的POST变量进行URL编码？还是只需要对GET数据进行URL编码？谢谢！更新：如果相关，这是我的PHP：$fields = array('mediaupload'=>$file_field,'...
精通 PHP 设计模式（四）
2024-07-29 00:42

绝不原创的飞龙的博客我们可以看到 curl 如何重建 URL，使其正确（包含末尾的斜杠），然后解析服务器的 IP 地址（在我的情况下是 IPv6 地址），最后建立与 Web 服务器的连接： * Rebuilt URL to: http://junade.com/ * Trying 2400:cb00...
php 使用CURL函数采集
2016-07-14 13:58

亢士群的blog的博客 <?php ...charset=utf-8"); //信息采集，首先确定采集是否需要进行登录？如果不需要登录，就直接进行抓取数据即可 //第一步，确定采集的URL $url= ...//第二步：选择采集的技术（CURL、file_get_c
没有解决我的问题, 去提问

悬赏问题

¥15 做个有关计算的小程序
¥15 MPI读取tif文件无法正常给各进程分配路径
¥15 如何用MATLAB实现以下三个公式（有相互嵌套）
¥30 关于#算法#的问题：运用EViews第九版本进行一系列计量经济学的时间数列数据回归分析预测问题求各位帮我解答一下
¥15 setInterval 页面闪烁，怎么解决
¥15 如何让企业微信机器人实现消息汇总整合
¥50 关于#ui#的问题：做yolov8的ui界面出现的问题
¥15 如何用Python爬取各高校教师公开的教育和工作经历
¥15 TLE9879QXA40 电机驱动
¥20 对于工程问题的非线性数学模型进行线性化

curl将重定向的url放入浏览器的地址

1条回答 默认 最新

悬赏问题

1条回答默认最新