dongpaipu8394 2017-07-19 11:54
Views: 115
Accepted

How can I exclude a specific HTML block inside the body tag using DOMDocument?

I'm using DOMDocument to fetch the HTML of a website. I want only the HTML inside <body></body>, and I have that part working. However, the body contains a <nav>...</nav> block. How can I exclude only the <nav></nav> block, using nothing but DOMDocument?

Here is my code:

<!DOCTYPE html>
<html>
<head>
    <title>Title Here</title>
</head>
<?php
  $d = new DOMDocument;
  $mock = new DOMDocument;
  $internalErrors = libxml_use_internal_errors(true);
  $d->loadHTML(file_get_contents('http://www.example.com'));
  $body = $d->getElementsByTagName('body')->item(0);
  foreach ($body->childNodes as $child){
      $mock->appendChild($mock->importNode($child, true));
  }
  libxml_use_internal_errors($internalErrors);
  echo $mock->saveHTML(); //<body>.....</body>
?>
</html>

1 answer

  • douhei8633 2017-07-19 12:00

    See the accepted answer to the question "PHP DOM: Get NodeValue excluding the child nodes".

    You can remove the <nav> node from the body before copying the body's child nodes into the second document (or remove it from the copy right after gathering them).
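
A minimal sketch of that suggestion, assuming the page markup is already in a string: the `$html` sample below is made up for illustration and stands in for the `file_get_contents()` call in the question. It removes every <nav> element under the body first, then copies the remaining children as the question's code does.

```php
<?php
// Hypothetical sample markup; in the question this comes from file_get_contents().
$html = '<!DOCTYPE html><html><head><title>Title Here</title></head>'
      . '<body><nav><a href="/">Home</a></nav><p>Content</p></body></html>';

$d = new DOMDocument;
$mock = new DOMDocument;

// Suppress libxml warnings while parsing, then restore the previous setting.
$internalErrors = libxml_use_internal_errors(true);
$d->loadHTML($html);
libxml_use_internal_errors($internalErrors);

$body = $d->getElementsByTagName('body')->item(0);

// getElementsByTagName() returns a live list, so copy it to an array
// before removing nodes; otherwise the list shifts while iterating.
foreach (iterator_to_array($body->getElementsByTagName('nav')) as $nav) {
    $nav->parentNode->removeChild($nav);
}

// Copy what is left of the body into the second document.
foreach ($body->childNodes as $child) {
    $mock->appendChild($mock->importNode($child, true));
}

$result = $mock->saveHTML();
echo $result;
```

Removing the nodes from the source document before the import keeps the copy loop unchanged. Matching by tag name drops every <nav> in the body, nested ones included; if only one particular block should go, it would need a narrower selection.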

    Selected by the asker as the best answer.
