douzai9405 2013-10-10 20:22
浏览 35

PHP - Scrape url(获取元og:,元链接或图像)

I'm investigating about how to scrape a url in the "best and most recent way". I intend to retrieve one image from a url. First from a link tag <link rel="image_src" href="http://stackoverflow.com/images/logo.gif" />, then from an og tag... and maybe, if I still got nothing, try to get the first big enough img. Put differently, a light version of facebook on thumbnail-retrieving.

So I'm reading stuff on the internet, and when I thought I had found what I need it appeared the solution was pretty old (like 5-6y old http://www.lightspeedretail.com/cloud/blog/2007/08/scraping-links-with-php/) : solution using cURL, DOMDocument, and XPath basically. Then I would just have to work on the image url I got, store a few versions of it in different sizes for instance. But I'm fine for this part.

Would there be something better than this solution ? Ideally an example for the link tag would be fantastic.

  • 写回答

0条回答 默认 最新

    报告相同问题?

    悬赏问题

    • ¥15 在获取boss直聘的聊天的时候只能获取到前40条聊天数据
    • ¥20 关于URL获取的参数,无法执行二选一查询
    • ¥15 液位控制,当液位超过高限时常开触点59闭合,直到液位低于低限时,断开
    • ¥15 marlin编译错误,如何解决?
    • ¥15 有偿四位数,节约算法和扫描算法
    • ¥15 VUE项目怎么运行,系统打不开
    • ¥50 pointpillars等目标检测算法怎么融合注意力机制
    • ¥20 Vs code Mac系统 PHP Debug调试环境配置
    • ¥60 大一项目课,微信小程序
    • ¥15 求视频摘要youtube和ovp数据集