Wrapping words in a sentence using a regular expression

I'm converting sentences like:

Phasellus turpis, elit. Tempor et lobortis? Venenatis: sed enim!

to:

_________ ______, ____. ______ __ ________? _________: ___ ____!

using:

utf8_encode(preg_replace("/[^.,:;!?¿¡ ]/", "_", utf8_decode($ss->phrase) ))

But I'm facing a problem: Google is indexing all those empty words as keywords. I'd like to convert the original strings to something invisible to Google, like:

<span>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</span> <span>&nbsp;&nbsp;&nbsp;&nbsp;</span>, ....

using:

.parent span { text-decoration:underline; }

That is, wrapping each word in a span tag, replacing each character of the word with &nbsp; and leaving the special characters .,:;!?¿¡ and the space untouched.

Is it possible to solve this with a regex? I actually solved it with a not very efficient loop that scans every character of the string, but I have to process many sentences per page.

duanhegn231318 2012-08-28 03:23
Use preg_replace_callback and have the callback create the appropriate replacement. Something along the lines of (untested):

function replacer($match) {
    // One &nbsp; per character of the matched word. After utf8_decode the
    // string is single-byte, so strlen() counts characters here.
    return "<span>" . str_repeat("&nbsp;", strlen($match[1])) . "</span>";
}

// Note the addition of the () and the + near the end of the regex
echo utf8_encode(preg_replace_callback("/([^.,:;!?¿¡ ]+)/", "replacer", utf8_decode($ss->phrase)));
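A possible variant of the same idea, as a minimal sketch: it works on the UTF-8 string directly with the `u` pattern modifier instead of the utf8_decode/utf8_encode round trip (both functions are deprecated as of PHP 8.2). The helper name `blankWords` is hypothetical, and the sketch assumes the phrase is valid UTF-8 and that the source file itself is saved as UTF-8 so that ¿ and ¡ in the pattern are read correctly.

```php
<?php
// Hypothetical helper, not part of the original answer.
// Assumes $phrase is valid UTF-8.
function blankWords(string $phrase): string
{
    $result = preg_replace_callback(
        // A "word" is any run of characters that are not punctuation or space.
        // The u modifier makes PCRE treat pattern and subject as UTF-8.
        '/[^.,:;!?¿¡ ]+/u',
        function (array $match): string {
            // Replace every character of the word with one &nbsp;
            // (/s lets . match newlines too, /u counts characters, not bytes).
            return '<span>' . preg_replace('/./su', '&nbsp;', $match[0]) . '</span>';
        },
        $phrase
    );

    // preg_replace_callback() returns null on failure, e.g. malformed UTF-8.
    if ($result === null) {
        throw new InvalidArgumentException('Phrase could not be processed as UTF-8.');
    }

    return $result;
}

echo blankWords('Tempor et lobortis?'), "\n";
```

Because the whole sentence is handled in one regex pass, this should avoid the character-by-character loop mentioned in the question, while still emitting one &nbsp; per character for accented letters.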