duanfei1987 2019-07-17 19:24
浏览 134

从Chromedp查找最终到达网址

I'm trying to use Go and Chromedp to scrape some information from some websites. I'm currently using:

err := chromedp.Run(ctx, chromedp.Navigate(url), chromedp.Evaluate(`document.body.innerHTML`, &textHTML))

to pull the website text and parse it. I've run into an issue regarding URL redirects and I need to know the final URL in the chain to check it against a list of URLs. I could double pull it using net/http and CheckRedirect but that seems ridiculously inefficient. Is there a way I can add an Action to Chromedp to get the redirect chain, or at least the final redirect URL? I was looking at NavigationEntries but I don't see examples of it and I'm not sure if that's used for keeping track of Navigate actions.

EDIT: I would still like to know if there's a solution that utilizes Chromedp internals, but I have worked around the problem by adding chromedp.Evaluate(`window.location.href`, &newURL) to the Actions. Is there a case where this would not give me the desired result?

  • 写回答

0条回答 默认 最新

    报告相同问题?

    悬赏问题

    • ¥15 c语言怎么用printf(“\b \b”)与getch()实现黑框里写入与删除?
    • ¥20 怎么用dlib库的算法识别小麦病虫害
    • ¥15 华为ensp模拟器中S5700交换机在配置过程中老是反复重启
    • ¥15 java写代码遇到问题,求帮助
    • ¥15 uniapp uview http 如何实现统一的请求异常信息提示?
    • ¥15 有了解d3和topogram.js库的吗?有偿请教
    • ¥100 任意维数的K均值聚类
    • ¥15 stamps做sbas-insar,时序沉降图怎么画
    • ¥15 买了个传感器,根据商家发的代码和步骤使用但是代码报错了不会改,有没有人可以看看
    • ¥15 关于#Java#的问题,如何解决?