preg_replace vs DOMDocument replaceChild

I was wondering which of the two methods in the title is more efficient for replacing content in an HTML page.

I have this custom tag in my page: <includes module='footer'/> which will be replaced with some content.

There are some downsides to using DOMDocument->getElementsByTagName('includes')->item(0)->parentNode->replaceChild. For instance, when I forget to add the slash in the tag, like so: <includes module='footer'>, the whole site crashes.

Regex tolerates exceptions like these, as long as the input matches the pattern. It would even allow me to replace any string, like {includes:footer}.

Now back to my actual question: are there any downsides to using regex for this purpose, such as performance issues?

More here: Append child/element in head using XML Manipulation

cheers

dongshuobei1037 2014-05-23 17:42
I wouldn't be too worried about performance here; I would consider them "comparable". Benchmarks would need to be run to truly determine this, as it would depend on the size of the document and how the regular expression is written.

Instead, I would be concerned about accuracy. In general DOMDocument will be much better at parsing XML since it was built to read and understand the language. However, it does fail on <includes module='footer'> because it is an un-closed tag (expecting: </includes>).
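To make the failure mode concrete, here is a minimal sketch, assuming the page is loaded with DOMDocument::loadXML (the question does not say which loader is used). The tryLoad helper and the <page> wrapper are hypothetical names for illustration. It shows the self-closed tag parsing and the un-closed one being rejected, without letting the parser warnings take the page down:

```php
<?php
// Collect libxml parse errors internally instead of emitting PHP warnings.
libxml_use_internal_errors(true);

// Hypothetical helper: returns the parsed document, or null if the XML is malformed.
function tryLoad(string $xml): ?DOMDocument
{
    $doc = new DOMDocument();
    if ($doc->loadXML($xml) === false) {
        libxml_clear_errors(); // discard the collected errors for this attempt
        return null;
    }
    return $doc;
}

$closed   = tryLoad("<page><includes module='footer'/></page>");
$unclosed = tryLoad("<page><includes module='footer'></page>");

var_dump($closed !== null);   // bool(true)
var_dump($unclosed !== null); // bool(false): the parser expected </includes>
```

Checking the return value before calling getElementsByTagName is what avoids the crash; the malformed markup itself still has to be fixed or tidied.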

Most common HTML/XML formatting issues can be fixed with PHP's Tidy class. I would check this out, since you should get far more predictable results than you would by parsing with regex. With a regular expression, there could technically be attributes before or after module, elements within the includes element, unexpected characters like <includes module='foo>bar'>, etc.

In the end, if your XML is in a "controlled" environment (i.e. you know what can and can't happen, you know what possible characters module will contain, you know that it will always be a self-closing element containing no children, etc.), then by all means use a regular expression. Just know it is looking for a very specific set of rules. However, if you expect this to work with "anything you throw at it", please use a DOM parser (after Tidy'ing to avoid the exceptions), regardless of performance (although I bet it will be very comparable in many instances).
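For the "controlled" case, a regex version might look like the sketch below. The $modules map and the allowed character set for module names are assumptions made for illustration, not something taken from the question. The pattern accepts single or double quotes and an optional trailing slash, and nothing else:

```php
<?php
// Hypothetical module contents; a real site would render these from templates.
$modules = ['footer' => '<footer>Site footer</footer>'];

$html = "<body><includes module='footer'/><includes module=\"footer\"></body>";

// Matches <includes module='name'> with either quote style and an optional "/".
// Group 1 is the quote character, group 2 the module name.
$pattern = '/<includes\s+module=([\'"])([A-Za-z0-9_-]+)\1\s*\/?>/i';

$result = preg_replace_callback(
    $pattern,
    function (array $m) use ($modules) {
        // Leave tags for unknown modules untouched rather than deleting them.
        return $modules[$m[2]] ?? $m[0];
    },
    $html
);

echo $result;
```

Anything outside that narrow shape (extra attributes, child elements, a ">" inside the attribute value) simply will not match, which is exactly the trade-off described above.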

One final note: if you plan to find, replace, or manipulate many nodes in a document, you will likely see a large performance gain by going with a DOM parser. A DOM parser takes a document and parses it once; after that you just traverse the data it has already loaded. With regular expressions, by contrast, each individual pattern is run across the whole document looking for matches.
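As a rough sketch of the parse-once approach, assuming well-formed XML input and a hypothetical $modules map of plain-text contents, every includes element can be replaced in a single pass over the already-parsed tree:

```php
<?php
// Hypothetical module contents (plain text here to keep the example small).
$modules = ['header' => 'Site header', 'footer' => 'Site footer'];

$xml = "<page><includes module='header'/><p>Body</p><includes module='footer'/></page>";

$doc = new DOMDocument();
$doc->loadXML($xml); // the document is parsed exactly once

// getElementsByTagName returns a live list, so copy it before mutating the tree.
$nodes = iterator_to_array($doc->getElementsByTagName('includes'));

foreach ($nodes as $node) {
    $name = $node->getAttribute('module');
    if (!isset($modules[$name])) {
        continue; // unknown module: leave the tag in place
    }
    $replacement = $doc->createElement('div', $modules[$name]);
    $replacement->setAttribute('class', $name);
    $node->parentNode->replaceChild($replacement, $node);
}

echo $doc->saveXML($doc->documentElement);
```

Copying the node list first matters: replacing nodes while iterating the live list directly can skip elements.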

If you want me to get more specific in any area (i.e. give a Tidy example, or work on a benchmark), let me know.

This answer was accepted by the asker as the best answer.
