正则表达式匹配HTML标记内的文本

I'm trying to write a regex that will remove HTML tags around a placeholder text, so that this:

<p>
    Blah</p>
<p>
    {{{body}}}</p>
<p>
    Blah</p>

Becomes this:

<p>
    Blah</p>
{{{body}}}
<p>
    Blah</p>

My current regex is /<.+>.*\{\{\{body\}\}\}<\/.+>/msU. However, it will also remove the contents of the tag preceding the placeholder, resulting in:

{{{body}}}
<p>
    Blah</p>

I can't assume the users will always place the placeholder inside <p>, so I would like it to remove any pair of tags immediately around the placeholder. I would appreciate some help with correcting my regex.

[EDIT]

I think it's important to note that the input may or may not be processed by CKEditor. It adds newlines and tabs to the opening tags, thus the regex needs to go with the /sm (dotall + multiline) modifiers.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douliedai4838 2012-05-06 16:20
关注
Try this:

<[^>]+>\s*\{{3}body\}{3}\s*<\/[^>]+>

See it here in action: http://regexr.com?30s4o

Here's the breakdown:

<[^>]+> matches an opening HTML tag, and only that.

\s* captures any whitespace (equivalent to [ \t ]*)

\{{3} matches a { exactly 3 times

body matches the string literally

\}{3} matches a } exactly 3 times

\s* again, captures any whitespace

<\/[^>]+> matches a closing HTML tag
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

想使用正则表达式匹配，提取文本中特定的内容。 python 正则表达式
2022-01-19 16:23

回答 2 已采纳这应该就是你想要的功能： import os, re def GetMiddleStr(content,startStr,endStr): '''提取字符串content当中，startStr
正则表达式匹配正负整数和正负小数或者空有问必答正则表达式
2021-08-25 15:28

回答 6 已采纳已私聊解决
正则表达式匹配不包含某个字符串的字符串 python 正则表达式
2021-03-07 09:46

回答 2 已采纳。。。 import re l = [] res = re.findall('ABC.*?BCD', r'ABC/dABC/213BCD/sfoajs/ABC/dddd/BCD') fo
HTML匹配文本,什么正则表达式将匹配文本，不包括HTML标记内的内容？
2021-06-12 18:37

Sei Kyo的博客这些术语碰巧发生在表格单元格中(应用程序正在迭代GridView行单元格)，这些表格单元格可能包含HTML。目前，我的代码看起来像这样(相关的帅哥如下所示)：const string highlightPattern = @"$0";...
使用java正则表达式匹配日期 java 正则表达式
2020-01-31 15:18

回答 1 已采纳 ``` ^\d{4}-0*((1|3|5|7|8|10|12)-0*([1-9]|[1-2]\d|3[0-1])|(4|6|9|11)-0*([1-9]|[1-2]\d|30)|2-0*([1-
正则表达式匹配 正则表达式匹配 c语言
2021-11-11 23:14

回答 1 已采纳 public static boolean matchDP1(char[] str, char[] pattern) { if(str == null || pattern == n
正则表达式匹配路径中文件 java 正则表达式
2018-09-29 02:59

回答 2 已采纳首先 file是你得到的文件 File[] files = file.listFiles(); 获取目录下的所有文件 List fileList = new ArrayList();//定义一个
java 正则表达式替换 html,java 正则表达式替换 html
2021-06-14 02:12

德州小王子的博客 java 正则表达式替换 html[2021-01-29 22:37:07]简介:java正则表达式用法：1、使用Pattern类进行字符串...相关免费学习推荐：javaphp正则表达式替换图片地址的方法：首先PHP正则提取图片img标记中的任意属性；然后...
Drools 正则表达式匹配不准 java 正则表达式
2022-08-23 11:07

回答 1 已采纳你显示的加上^或者*号用来区分是否必须从头开始匹配
Python正则表达式匹配图片 python
2021-03-30 12:52

回答 1 已采纳 pattern = re.compile(r'<a href="/desk.+?<img src="(.+?)"', flags=re.S)
正则表达式匹配1-1200的正整数开发语言正则表达式
2021-10-25 14:36

回答 2 已采纳 ^([1-9]|[1-9]\d|1\d{2}|1200)$
php正则表达式全局查找,执行一个全局正则表达式匹配 - PHP 7 中文文档
2021-04-23 10:02

往后清白的博客 (PHP 4, PHP 5, PHP 7)preg_match_all – 执行一个全局正则表达式匹配说明preg_match_all( string $pattern, string $subject[, array &$matches[, int $flags = PREG_PATTERN_ORDER[, int $offset = 0]]] ) : ...
Python正则表达式匹配电话 python 正则表达式爬虫
2021-09-13 15:23

回答 1 已采纳 import pyperclip text = str(pyperclip.paste()) # 将最近一次复制的内容转换为字符串 import re regex = re.compile('(
php正则表达式除什么之外,正则表达式：匹配除特定模式以外的所有内容
2021-04-07 08:22

王司图的博客 )开头的字符串之外的所有内容的正则表达式您不希望匹配哪种特定模式？是否有原因为什么您不能匹配您的模式，并且如果字符串与之匹配则无法执行某些操作？正则表达式可能重复，以匹配不包含单词的行？正则表达式：...
php正则表达式查找html内容,如何利用PHP的正则表达式来获取HTML中的内容
2021-04-23 06:07

清清凉凉甜甜的的博客话题：如何利用PHP的正则表达式来获取HTML中的内容回答：preg_match('/(.*?)/',$str,$result);$str就是上面的html里面的内容，$result就是匹配到的字符串，你可以print_r($result)；看看里面就有你要的结果，或者...
没有解决我的问题, 去提问

悬赏问题

¥15 winform的chart曲线生成时有凸起
¥15 msix packaging tool打包问题
¥15 finalshell节点的搭建代码和那个端口代码教程
¥15 用hfss做微带贴片阵列天线的时候分析设置有问题
¥15 Centos / PETSc / PETGEM
¥15 centos7.9 IPv6端口telnet和端口监控问题
¥20 完全没有学习过GAN，看了CSDN的一篇文章，里面有代码但是完全不知道如何操作
¥15 使用ue5插件narrative时如何切换关卡也保存叙事任务记录
¥20 海浪数据南海地区海况数据，波浪数据
¥20 软件测试决策法疑问求解答

正则表达式匹配HTML标记内的文本

2条回答 默认 最新

悬赏问题

2条回答默认最新