donglu5047 2013-08-29 16:21

DOMDocument removes HTML elements

Here is my code:

$text = '<div class="cgus_post"><a href="?p=15055"><div class="imgbox"><img src="/cgmedia/default.gif"></div></a>
        <h2 id="post-15055">
        <a href="?p=15055" rel="bookmark" title="Permanent Link to Willie Nelson Celebrates 80th Birthday Stoned and Auditioning for Gandalf">Willie Nelson Celebrates 80th Birthday Stoned and Auditioning for Gandalf</a></h2>
        <p>This video pretty much sums up why Willie Nelson is fucking awesome. Willie decided to celebrate his 80th birthday by recording an ‘audition’ for Peter Jackson. &nbsp;Willie wants to take the reigns from Ian McKellan in The Hobbit 2, and decided to show off his acting skills and give some of his own wizardly advice. The result is &nbsp;hilarious. Watch …</p>
        <br class="clear">
        </div>';
$dom = new DomDocument();
$dom->loadHTML($text);
$classname = 'cgus_post';
$finder = new DomXPath($dom);
$nodes = $finder->query("//*[contains(concat(' ', normalize-space(@class), ' '), ' $classname ')]");
foreach($nodes as $node){
    echo $node->nodeValue;  
}

The problem I am having is that I query for the div with the class cgus_post, and it returns just the text. How do I get it to return the HTML elements as well?

1 answer

  • douchengchu8374 2013-08-29 19:03

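    nodeValue only gives you the concatenated text of a node, so you have to serialize its child nodes yourself. Here's the innerHTML function I always use: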

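    // Returns the markup of all children of $node as a string.
    // $trim strips whitespace around each serialized child;
    // $decode converts HTML entities back to characters.
    function innerHTML(DOMNode $node, $trim = true, $decode = true) {
       $innerHTML = '';
    
       foreach ($node->childNodes as $inner_node) {
          // Copy each child (deep) into a scratch document and serialize it.
          $temp_container = new DOMDocument();
          $temp_container->appendChild($temp_container->importNode($inner_node, true));
    
          $innerHTML .= ($trim ? trim($temp_container->saveHTML()) : $temp_container->saveHTML());
       }
    
       return ($decode ? html_entity_decode($innerHTML) : $innerHTML);
    }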
    

    Then call it on the node you want, where $html holds your markup ($text in your code):

    $dom = new DOMDocument();
    $dom->loadHTML($html);
    
    echo htmlentities(innerHTML($dom->documentElement->childNodes->item(0)->firstChild));
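
A shorter alternative, offered as a sketch rather than a drop-in fix: assuming PHP 5.3.6 or later, DOMDocument::saveHTML() accepts a node argument and returns that node's markup, so the XPath loop from the question can serialize each match directly. The $html string below is a shortened stand-in for the question's $text, and the $outer and $inner variable names are only illustrative.

```php
<?php
// Sketch: return the markup of each matched node instead of its text.
// Assumes PHP >= 5.3.6, where DOMDocument::saveHTML() accepts a node.

// Shortened stand-in for the question's $text.
$html = '<div class="cgus_post"><h2 id="post-15055"><a href="?p=15055">Title</a></h2><p>Body text</p></div>';

$dom = new DOMDocument();
$dom->loadHTML($html);

$classname = 'cgus_post';
$finder = new DOMXPath($dom);
$nodes = $finder->query("//*[contains(concat(' ', normalize-space(@class), ' '), ' $classname ')]");

$outer = '';
$inner = '';
foreach ($nodes as $node) {
    // Outer HTML: the matched element itself plus everything inside it.
    $outer .= $dom->saveHTML($node);

    // Inner HTML: serialize each child of the matched element in turn.
    foreach ($node->childNodes as $child) {
        $inner .= $dom->saveHTML($child);
    }
}

echo $outer, "\n";
echo $inner, "\n";
```

Use $outer if you want the wrapping div included and $inner if you only want its contents. Wrap either in htmlentities() when you want to display the markup in a browser instead of rendering it.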
    