doujiao7679 2013-09-11 17:53
浏览 72
已采纳

PHP:获取HTML网页的所有CSS文件

I'm trying to get all CSS files of an html file from URL.

I know that if I want to get the HTML code it is easy - just using PHP function - file_get_contents.

The question is - if I could search easily inside an a URL of HTML and get from there the files or content of all related CSS files?

Note - I want to build an engine for getting a lot of CSS files, this is why just reading the source is not enough..

Thanks,

  • 写回答

2条回答 默认 最新

  • dongwei1954 2013-09-11 18:01
    关注

    You could try using http://simplehtmldom.sourceforge.net/ for HTML parsing.

    require_once 'SimpleHtmlDom/simple_html_dom.php';
    
    $url = 'www.website-to-scan.com';
    $website = file_get_html($url);
    
    // You might need to tweak the selector based on the website you are scanning
    // Example: some websites don't set the rel attribute
    // others might use less instead of css
    //
    // Some other options:
    // link[href] - Any link with a href attribute (might get favicons and other resources but should catch all the css files)
    // link[href="*.css*"] - Might miss files that aren't .css extension but return valid css (e.g.: .less, .php, etc)
    // link[type="text/css"] - Might miss stylesheets without this attribute set
    foreach ($website->find('link[rel="stylesheet"]') as $stylesheet)
    {
        $stylesheet_url = $stylesheet->href;
    
        // Do something with the URL
    }
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥20 软件测试决策法疑问求解答
  • ¥15 win11 23H2删除推荐的项目,支持注册表等
  • ¥15 matlab 用yalmip搭建模型,cplex求解,线性化处理的方法
  • ¥15 qt6.6.3 基于百度云的语音识别 不会改
  • ¥15 关于#目标检测#的问题:大概就是类似后台自动检测某下架商品的库存,在他监测到该商品上架并且可以购买的瞬间点击立即购买下单
  • ¥15 神经网络怎么把隐含层变量融合到损失函数中?
  • ¥15 lingo18勾选global solver求解使用的算法
  • ¥15 全部备份安卓app数据包括密码,可以复制到另一手机上运行
  • ¥20 测距传感器数据手册i2c
  • ¥15 RPA正常跑,cmd输入cookies跑不出来