Remove <p><br/></p> using DOMXPath or regex?

I use DOMXPath to remove HTML tags that have no text node, while keeping <br/> tags:

$xpath = new DOMXPath($dom);

while(($nodeList = $xpath->query('//*[not(text()) and not(node()) and not(self::br)]')) && $nodeList->length > 0) 
{
    foreach ($nodeList as $node) 
    {
        $node->parentNode->removeChild($node);
    }
}

It worked perfectly until I came across another problem:

$content = '<p><br/><br/><br/><br/></p>';

How do I remove this kind of messy <br/> and <p>? That is, I don't want to allow <br/> on its own inside <p>, but I do allow <br/> together with proper text, like this:

$content = '<p>first break <br/> second break <br/> the last line</p>';

Is that possible?

Or is it better with a regular expression?

I tried something like this,

$nodeList = $xpath->query("//p[text()=<br\s*\/?>\s*]");
    foreach($nodeList as $node) 
    {
        $node->parentNode->removeChild($node);
    }

but it returns this error:

Warning: DOMXPath::query() [domxpath.query]: Invalid expression in...

drbxr86044 2011-07-27 09:48
You can select the unwanted p using XPath:

"//p[count(*)=count(br) and br and normalize-space(.)='']"

Note: to select elements with empty text, wouldn't it be better to use the following?

"//*[normalize-space(.)='' and not(self::br)]"

This selects any element (except br) whose text content is empty or whitespace-only, including elements that contain only empty child elements or only br elements.
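For illustration, here is a minimal sketch of how the first expression might be applied from PHP. The function name `removeBrOnlyParagraphs` and the `wrapper` div are assumptions made for this example and are not part of the original question or answer; the wrapper is only there so the fragment can be serialised back without the `<html>` and `<body>` elements that `loadHTML()` adds.

```php
<?php
// Hypothetical helper (not from the original post): strips <p> elements
// that contain nothing but <br> elements and optional whitespace.
function removeBrOnlyParagraphs(string $html): string
{
    $dom = new DOMDocument();

    // Collect parser warnings instead of printing them for fragment input.
    libxml_use_internal_errors(true);
    $dom->loadHTML('<div id="wrapper">' . $html . '</div>');
    libxml_clear_errors();

    $xpath = new DOMXPath($dom);

    // All child elements are <br>, at least one <br> exists,
    // and the text content is empty after whitespace normalisation.
    $query = "//p[count(*)=count(br) and br and normalize-space(.)='']";

    // Snapshot the matches first (defensive: avoids mutating a list while iterating it).
    foreach (iterator_to_array($xpath->query($query)) as $p) {
        $p->parentNode->removeChild($p);
    }

    // Serialise only the children of the assumed wrapper element.
    $wrapper = $xpath->query('//div[@id="wrapper"]')->item(0);
    $out = '';
    foreach ($wrapper->childNodes as $child) {
        $out .= $dom->saveHTML($child);
    }
    return $out;
}

$content = '<p><br/><br/><br/><br/></p>'
         . '<p>first break <br/> second break <br/> the last line</p>';
echo removeBrOnlyParagraphs($content), "\n";
```

With this input, only the second paragraph should survive. XPath 1.0, which DOMXPath implements, has no regular-expression support, which is why a regex-style pattern inside the predicate is rejected as an invalid expression.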
This answer was selected by the asker as the best answer.
