dongxidui1227 2014-09-02 01:57
浏览 24
已采纳

如何摆脱HTML页面中的所有JavaScript?

I could use regex to get rid of the <script> tags in the HTML like this

$html = preg_replace('#<script(.*?)>(.*?)</script>#is','', $html);

So that works fine, but what about inline JavaScript? I figured out I could do it this way

$nodes = $dom->getElementsByTagName('*');
foreach($nodes as $node)
{
  if ($node->hasAttribute('onload')){
    $node->removeAttribute('onload');
  }
}

The issue with this is I'd have to find all the attributes, and keep making if statements. I've also seen libraries, but I want to keep things small. So is there any quick way? Also any nice lists with inline attributes if I have to keep doing what I'm doing?

  • 写回答

1条回答 默认 最新

  • dras2334 2014-09-02 03:13
    关注

    I would say, don't reinvent the wheel, use a library like http://htmlpurifier.org/ to accomplish this.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 Mac系统vs code使用phpstudy如何配置debug来调试php
  • ¥15 目前主流的音乐软件,像网易云音乐,QQ音乐他们的前端和后台部分是用的什么技术实现的?求解!
  • ¥60 pb数据库修改与连接
  • ¥15 spss统计中二分类变量和有序变量的相关性分析可以用kendall相关分析吗?
  • ¥15 拟通过pc下指令到安卓系统,如果追求响应速度,尽可能无延迟,是不是用安卓模拟器会优于实体的安卓手机?如果是,可以快多少毫秒?
  • ¥20 神经网络Sequential name=sequential, built=False
  • ¥16 Qphython 用xlrd读取excel报错
  • ¥15 单片机学习顺序问题!!
  • ¥15 ikuai客户端多拨vpn,重启总是有个别重拨不上
  • ¥20 关于#anlogic#sdram#的问题,如何解决?(关键词-performance)