duanfan5012 2018-07-03 18:42
浏览 53
已采纳

PHP - 删除XML空节点

I found this code to remove empty nodes from and XML file but it isn't working correctly. It leaves an empty node that really needs to be removed. Yes, it is empty, just white space in it.

$domxml = new DOMDocument('1.0');
$domxml->preserveWhiteSpace = false;
$domxml->formatOutput = true;
$domxml->loadXML($this->response);
$this->response = $domxml->saveXML($domxml->documentElement);

Anyone know of a better way to do this?

  • 写回答

2条回答 默认 最新

  • dongzhenge2014 2018-07-04 08:13
    关注

    In other words you would like to remove any element node that has no text content, no attribute, no children with text content or attributes and have a parent element node (are not the document element).

    Here is an Xpath function normalize-space() that converts any whitespace sequences to single spaces and strips them from the start/end. Any whitespace only content will result in an empty string.

    Xpath

    //* fetches any element node in the document in a list. You just need to add conditions.

    • Has no text content
      normalize-space(.) = ""
    • No attributes
      not(@*)
    • No descendant node with content (includes comments, ...)
      not(.//node()[normalize-space(.) != ""])
    • No descendant element nodes with attributes
      not(.//*[@*])
    • Has a parent element node
      parent::*

    Put together:

    $xml = <<<'XML'
    <foo>
      <bar></bar>
      <bar>123</bar>
      <bar foo="123"></bar>
      <bar><foo>   </foo></bar>
      <bar><!-- test --></bar>
    </foo>
    XML;
    
    $document = new DOMDocument();
    $document->preserveWhiteSpace = FALSE;
    $document->formatOutput = TRUE; 
    $document->loadXml($xml);
    $xpath = new DOMXpath($document);
    
    $expression = 
      '//*[
        normalize-space(.) = "" and 
        not(@*) and  
        not(.//node()[normalize-space(.) != ""]) and 
        not(.//*[@*]) and
        parent::*
      ]';
    
    $nodes = $xpath->evaluate($expression);
    for ($i = $nodes->length - 1; $i >= 0; $i--) {
      $nodes[$i]->parentNode->removeChild($nodes[$i]);
    }
    
    echo $document->saveXml();
    

    Output:

    <?xml version="1.0"?>
    <foo>
      <bar>123</bar>
      <bar foo="123"/>
      <bar>
        <!-- test -->
      </bar>
    </foo>
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥20 机器学习能否像多层线性模型一样处理嵌套数据
  • ¥20 西门子S7-Graph,S7-300,梯形图
  • ¥50 用易语言http 访问不了网页
  • ¥50 safari浏览器fetch提交数据后数据丢失问题
  • ¥15 matlab不知道怎么改,求解答!!
  • ¥15 永磁直线电机的电流环pi调不出来
  • ¥15 用stata实现聚类的代码
  • ¥15 请问paddlehub能支持移动端开发吗?在Android studio上该如何部署?
  • ¥20 docker里部署springboot项目,访问不到扬声器
  • ¥15 netty整合springboot之后自动重连失效