Regex to match text inside HTML tags

I'm trying to write a regex that will remove HTML tags around a placeholder text, so that this:

<p>
    Blah</p>
<p>
    {{{body}}}</p>
<p>
    Blah</p>

Becomes this:

<p>
    Blah</p>
{{{body}}}
<p>
    Blah</p>

My current regex is /<.+>.*\{\{\{body\}\}\}<\/.+>/msU. However, it will also remove the contents of the tag preceding the placeholder, resulting in:

{{{body}}}
<p>
    Blah</p>
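
The over-match can be reproduced with a minimal Python sketch. This rests on assumptions: Python's re module has no U (ungreedy) modifier, so the quantifiers are written lazily (+?, *?) to imitate it, and the names ORIGINAL and sample are made up for illustration. The lazy .*? still has to stretch from the first <p> all the way to the placeholder, which is why the preceding paragraph disappears.

```python
import re

# The original pattern, with lazy quantifiers standing in for the PCRE U modifier.
# re.S (dotall) lets "." cross the newlines that CKEditor inserts.
ORIGINAL = re.compile(r"<.+?>.*?\{\{\{body\}\}\}</.+?>", re.S)

sample = "<p>\n    Blah</p>\n<p>\n    {{{body}}}</p>\n<p>\n    Blah</p>"

# The match starts at the very first "<p>", so the first paragraph is lost.
broken = ORIGINAL.sub("{{{body}}}", sample)
print(broken)
```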

I can't assume the users will always place the placeholder inside <p>, so I would like it to remove any pair of tags immediately around the placeholder. I would appreciate some help with correcting my regex.

[EDIT]

I think it's important to note that the input may or may not have been processed by CKEditor. It adds newlines and tabs to the opening tags, so the regex needs the /sm (dotall + multiline) modifiers.

2 answers

douliedai4838 2012-05-06 16:20
Try this:

<[^>]+>\s*\{{3}body\}{3}\s*<\/[^>]+>

See it here in action: http://regexr.com?30s4o

Here's the breakdown:

<[^>]+> matches a single tag (here, the opening tag) and nothing beyond it, because [^>]+ stops at the first >

\s* matches any run of whitespace, including newlines (roughly equivalent to [ \t\r\n]*)

\{{3} matches a { exactly 3 times

body matches the string literally

\}{3} matches a } exactly 3 times

\s* again, matches any whitespace

<\/[^>]+> matches a closing HTML tag
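
The pattern can also be tried outside regexr. Below is a minimal sketch in Python, which is an assumption, since the question does not name a language and its /msU modifiers suggest PCRE. The names PLACEHOLDER_WRAPPER and unwrap_placeholder are hypothetical, and a capture group has been added around the placeholder so that the replacement can keep it.

```python
import re

# Same pattern as above, with a capture group around the placeholder.
# No flags are needed: the pattern uses neither "." nor the ^/$ anchors,
# and \s already matches the newlines and tabs CKEditor may insert.
PLACEHOLDER_WRAPPER = re.compile(r"<[^>]+>\s*(\{{3}body\}{3})\s*</[^>]+>")


def unwrap_placeholder(html: str) -> str:
    """Strip the tag pair immediately surrounding {{{body}}}."""
    return PLACEHOLDER_WRAPPER.sub(r"\1", html)


sample = "<p>\n    Blah</p>\n<p>\n    {{{body}}}</p>\n<p>\n    Blah</p>"
print(unwrap_placeholder(sample))
```

Because [^>]+ cannot cross a >, the match can only begin at the tag directly in front of the placeholder, so the neighbouring paragraphs are left alone. Note that the pattern does not check that the closing tag has the same name as the opening one.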
This answer was accepted by the asker as the best answer.
