duanfan5012 2018-07-03 18:42
浏览 53
已采纳

PHP - 删除XML空节点

I found this code to remove empty nodes from and XML file but it isn't working correctly. It leaves an empty node that really needs to be removed. Yes, it is empty, just white space in it.

$domxml = new DOMDocument('1.0');
$domxml->preserveWhiteSpace = false;
$domxml->formatOutput = true;
$domxml->loadXML($this->response);
$this->response = $domxml->saveXML($domxml->documentElement);

Anyone know of a better way to do this?

  • 写回答

2条回答 默认 最新

  • dongzhenge2014 2018-07-04 08:13
    关注

    In other words you would like to remove any element node that has no text content, no attribute, no children with text content or attributes and have a parent element node (are not the document element).

    Here is an Xpath function normalize-space() that converts any whitespace sequences to single spaces and strips them from the start/end. Any whitespace only content will result in an empty string.

    Xpath

    //* fetches any element node in the document in a list. You just need to add conditions.

    • Has no text content
      normalize-space(.) = ""
    • No attributes
      not(@*)
    • No descendant node with content (includes comments, ...)
      not(.//node()[normalize-space(.) != ""])
    • No descendant element nodes with attributes
      not(.//*[@*])
    • Has a parent element node
      parent::*

    Put together:

    $xml = <<<'XML'
    <foo>
      <bar></bar>
      <bar>123</bar>
      <bar foo="123"></bar>
      <bar><foo>   </foo></bar>
      <bar><!-- test --></bar>
    </foo>
    XML;
    
    $document = new DOMDocument();
    $document->preserveWhiteSpace = FALSE;
    $document->formatOutput = TRUE; 
    $document->loadXml($xml);
    $xpath = new DOMXpath($document);
    
    $expression = 
      '//*[
        normalize-space(.) = "" and 
        not(@*) and  
        not(.//node()[normalize-space(.) != ""]) and 
        not(.//*[@*]) and
        parent::*
      ]';
    
    $nodes = $xpath->evaluate($expression);
    for ($i = $nodes->length - 1; $i >= 0; $i--) {
      $nodes[$i]->parentNode->removeChild($nodes[$i]);
    }
    
    echo $document->saveXml();
    

    Output:

    <?xml version="1.0"?>
    <foo>
      <bar>123</bar>
      <bar foo="123"/>
      <bar>
        <!-- test -->
      </bar>
    </foo>
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 java 操作 elasticsearch 8.1 实现 索引的重建
  • ¥15 数据可视化Python
  • ¥15 要给毕业设计添加扫码登录的功能!!有偿
  • ¥15 kafka 分区副本增加会导致消息丢失或者不可用吗?
  • ¥15 微信公众号自制会员卡没有收款渠道啊
  • ¥15 stable diffusion
  • ¥100 Jenkins自动化部署—悬赏100元
  • ¥15 关于#python#的问题:求帮写python代码
  • ¥20 MATLAB画图图形出现上下震荡的线条
  • ¥15 关于#windows#的问题:怎么用WIN 11系统的电脑 克隆WIN NT3.51-4.0系统的硬盘