duanchuang1935 2014-12-04 09:44
浏览 40

DOMDocument从内联脚本PHP中剥离标记

This is a strange one but looks like $dom->saveHTML() is stripping tags from inline javascript

$domStr = '
<!DOCTYPE html>
   <html>
    <head>
        <meta charset="utf-8"/>
        <title>my page</title>
        <script>
            var elem = "<div>some content</div>";
        </script>
    </head>
    <body>
        <div>
            MY PAGE
        </div>
    </body>
</html>
';
    $doc = new DOMDocument();
    libxml_use_internal_errors(true);//prevents tags in js from throwing errors; see php.net manual
    $doc->formatOutput = true;
    $doc->strictErrorChecking = false;
    $doc->preserveWhiteSpace  = true;

    $doc->loadHTML($domStr);
    echo $doc->saveHTML();
exit;

http://sandbox.onlinephpfunctions.com/code/ad59a2a1016b2128e437ef61dbe00f1c511bff8d

if you use libxml_use_internal_errors(true); you will not see what is wrong but if removed you get

<b>Warning</b>:  DOMDocument::loadHTML(): Unexpected end tag : div

Same thing happens with

$doc->formatOutput = false;

Any help is appreciated.

  • 写回答

2条回答 默认 最新

  • douyannuo7733 2014-12-04 09:46
    关注

    You're missing the opening <html> tag right after the DOCTYPE declaration.

    评论

报告相同问题?

悬赏问题

  • ¥20 测距传感器数据手册i2c
  • ¥15 RPA正常跑,cmd输入cookies跑不出来
  • ¥15 求帮我调试一下freefem代码
  • ¥15 matlab代码解决,怎么运行
  • ¥15 R语言Rstudio突然无法启动
  • ¥15 关于#matlab#的问题:提取2个图像的变量作为另外一个图像像元的移动量,计算新的位置创建新的图像并提取第二个图像的变量到新的图像
  • ¥15 改算法,照着压缩包里边,参考其他代码封装的格式 写到main函数里
  • ¥15 用windows做服务的同志有吗
  • ¥60 求一个简单的网页(标签-安全|关键词-上传)
  • ¥35 lstm时间序列共享单车预测,loss值优化,参数优化算法