dtjo87679 2017-10-27 05:20
浏览 18
已采纳

表单提交多重重定向

I'm trying to fetch data from a website where once you submit the form it redirects to a loading page which is set to be automatically redirected to the final results page. The issue is that the crawler only gets the data of the loading page and does not go fully to the final results page which is what I actually need. Can someone please tell me how I can achieve that? If not possible then what could be an alternative way to do this?

  • 写回答

1条回答 默认 最新

  • dongqian1925 2017-10-27 05:34
    关注

    If you're using curl, you can try the following:

    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);

    If you still aren't getting past the loading page, its possible its not an http redirect.

    In that case you'll have to manually parse the target location. A lot of websites use a meta refresh tag for such loading pages. Look for something similar to the following:

    <meta http-equiv="refresh" content="5; url=http://example.com/" />

    You can easily parse the above with regex or any dom parsing library for php.

    Another possibility is a javascript redirect. Look for lines containing window.location in the source code.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 急matlab编程仿真二阶震荡系统
  • ¥20 TEC-9的数据通路实验
  • ¥15 ue5 .3之前好好的现在只要是激活关卡就会崩溃
  • ¥50 MATLAB实现圆柱体容器内球形颗粒堆积
  • ¥15 python如何将动态的多个子列表,拼接后进行集合的交集
  • ¥20 vitis-ai量化基于pytorch框架下的yolov5模型
  • ¥15 如何实现H5在QQ平台上的二次分享卡片效果?
  • ¥15 python爬取bilibili校园招聘网站
  • ¥30 求解达问题(有红包)
  • ¥15 请解包一个pak文件