只有当某些词语不在它之前时，才能在字符串中加粗

I have a string like this.

$dot_prod = "at the coast will reach the Douglas County coast";

I'd like this result by using a regex: at the coast will reach the Douglas County coast

Specifically, I want to bold the word "coast" and "the" but only the word coast if not preceded by the word "county" and only the word "the" if not preceded by the word "at". So, essentially I want an array of words or phrases (case-insensitive that keeps the case the word/phrase was originally in) to be bolded and then an array of words or phrases that I want to ensure are not bolded. For instance, the array of words/phrases that I want bolded are:

$bold = array("coast", "the", "pass");

and the array of words I want to ensure are unbolded are:

$unbold = array("county coast", "at the", "grants pass");

I'm able to do the bolding with this:

$bold = array("coast", "the", "pass");

$dot_prod = preg_replace("/(" . implode("|", $bold) . ")/i", "<b>$1</b>", $dot_prod);

However, I've been unsuccessful at unbolding afterwards, and I definitely couldn't figure out how to do it all in one expression. Can you offer any help please? Thank you.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
duanfei1987 2018-10-30 14:15
关注
You may match and skip the patterns you want to "unbold" and match those you want to bold in any other context.

Build a regex like this (I added word boundaries to match whole words, you do not have to use them probably, but that seems a good idea for your current input):

'~\b(?:county coast|at the|grants pass)\b(*SKIP)(*F)|\b(?:coast|the|pass)\b~i'

See the regex demo.

Details

\b - word boundary

(?:county coast|at the|grants pass) - any of the alternatives

\b - a word boundary

(*SKIP)(*F) - PCRE verbs to skip the current match and proceed looking for a match from the end of the current match

| - or

\b - a word boundary

(?:coast|the|pass) - any of the alternatives

\b - a word boundary.

The $0 in the replacement is the reference to the whole match value.

PHP demo:

$dot_prod = "at the coast will reach the Douglas County coast"; $bold = array("coast", "the", "pass"); $unbold = array("county coast", "at the", "grants pass"); $rx = "~\b(?:" . implode("|", $unbold) . ")\b(*SKIP)(*F)|\b(?:" . implode("|", $bold) . ")\b~i"; echo preg_replace($rx, "$0", $dot_prod); // => at the coast will reach the Douglas County coast

One caveat: since your search terms can include whitespace, it is a good idea to sort the $bold and $unbold array by length in the descending order before building the pattern:

usort($unbold, function($a, $b) { return strlen($b) - strlen($a); }); usort($bold, function($a, $b) { return strlen($b) - strlen($a); });

See another PHP demo.

In case these items can contain special regex metachars, also use preg_quote on them.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

输入中文字符串 按词语进行逆序输出 python 有问必答
2021-06-09 09:56

回答 3 已采纳 import jieba n = input() lst = jieba.lcut(n) #分词 print(lst) print(''.join([i for i in lst[::-1]]))
正则表达式怎么表示匹配的字符串中不能包含某个字串？ java
2020-04-15 11:09

回答 2 已采纳 var reg = /^($/ ^符号可以匹配开头 $字符可以锁定结尾或者试试 /^[ ]$/
此文件中的某些Unicode字符未能保存在当前代码页中 c语言有问必答
2022-02-27 20:55

回答 4 已采纳编码方式的问题，你的代码是直接复制的吗？参考如下方案解决：此文件中的某些Unicode字符未能保存在当前代码页中，是否以Unicode编码重新保存此文件..._ruog
易创网站管理系统(DIRCMS) 2011 SP3 UTF8.rar
2019-07-06 09:24

17：优化程序安装时的随机字符串生成； 18：优化投票； 19：加入在线申请友链功能； 20：优化在线留言，加入更多字段； 21：优化会员； 22：修复个人空间最近访客不显示的问题； 23：加入多种数据库...
JS中怎样将字符串中的指定下标的值删除？ javascript
2017-08-06 06:28

回答 6 已采纳可以这样： ``` var str = "acbabca"; str = str.slice(0, 3) + str.slice(4); ``` slice方法的参数意义和使用范例可以参
输入一个字符串（包括大小写字母和空格），除去空格输出在字符串中出现过的字符。 python
2021-11-15 16:25

回答 2 已采纳 n = input(">>>") res = set(''.join(n.split())) print(''.join(sorted(res)))
关于字符串中取出某些字符的操作
2017-06-18 01:50

回答 1 已采纳首先说一说你这么写，strncpy里面的第一个参数是char型的，所以你应该这样定义char * psh，然后你直接用strncpy赋值这是不行的，因为你上面的psh指针并没有给它内存，你直接赋值
【ChatGPT】实用 Prompt 指令大全 —— 一文教你如何更好地挖掘 GPT 的价值
2023-04-15 03:43

禅与计算机程序设计艺术的博客 ChatGPT 是由 OpenAI 开发的...该模型在自然语言处理领域有着广泛的应用，可以用于文本生成、文本分类、情感分析、问答系统等任务。本文将从模型的架构、训练方式、优缺点等方面对ChatGPT进行详细介绍。一、模型架构。
正则表达式匹配不包含某个字符串的字符串 python 正则表达式
2021-03-07 09:46

回答 2 已采纳。。。 import re l = [] res = re.findall('ABC.*?BCD', r'ABC/dABC/213BCD/sfoajs/ABC/dddd/BCD') fo
如何统计字符串中某个字符的个数
2015-07-16 09:10

回答 7 已采纳 ``` 如果针对某个字符的话，比较简单的就是 String str="oiwerwwoijjjwwnlamxjswwkxmn2w" str.replaceAll("w","").le
c++中如何在字符串中加入变量？ c++ mysql sql
2017-11-16 11:22

回答 3 已采纳可以使用stringstream，示例代码如下： ``` #include //必要的头文件 string currency; //std::string cu
建议收藏，最全ChatGPT 中文调教指南：提供各个领域的角色提示词（prompts）及使用技巧，当然也有不正经指南
2023-05-23 09:31

yumuing blog的博客 ChatGPT在日常的对话中，表现的非常的完美，当在其他的场景希望使用ChatGPT来解决问题的时候，通常需要给ChatGPT一些提示，或者说暗示，让其进入某种角色，这种情况下，ChatGPT的表现能够更加的游刃有余。...
php后台echo数值给java端字符串长度不符。 java php
2017-04-01 06:53

回答 1 已采纳应该是bom头，php存储为没有bom头的 [php隐形字符65279](http://www.w3dev.cn/article/20110817/php-hidden-char-65279-u
LLMs：《PaLM: Scaling Language Modeling with Pathways》翻译与解读
2022-06-27 00:29

一个处女座的程序猿的博客其中最强大的后GPT-3模型是GLaM（Du等，2021）、Gopher（Rae等，2021）、Chinchilla（Hoffmann等，2022）、Megatron–Turing NLG（Smith等，2022）和LaMDA（Thoppilan等，2022），它们在发布时在大量任务上取得了少...
易创互联 php,易创网站管理系统(DIRCMS) 2011 SP3 UTF8
2021-03-24 08:07

weixin_39609170的博客 PHPBB简介易创网站管理系统(DIRCMS)是国内自主研发的一款功能强大而又不失小巧简洁的由PHP+Mysql架构的内容管理系统。DirCMS代码全部开源，便于使用者二次开发或定制；并采用简洁的模板标签技术，使制作模板更加容易...
没有解决我的问题, 去提问

悬赏问题

¥20 access多表提取相同字段数据并合并
¥20 基于MSP430f5529的MPU6050驱动，求出欧拉角
¥20 Java-Oj-桌布的计算
¥15 powerbuilder中的datawindow数据整合到新的DataWindow
¥20 有人知道这种图怎么画吗？
¥15 pyqt6如何引用qrc文件加载里面的的资源
¥15 安卓JNI项目使用lua上的问题
¥20 RL+GNN解决人员排班问题时梯度消失
¥60 要数控稳压电源测试数据
¥15 能帮我写下这个编程吗

只有当某些词语不在它之前时，才能在字符串中加粗

1条回答 默认 最新

悬赏问题

1条回答默认最新