如何在PHP中使用preg_replace匹配俄语单词？

How do I go about matching a Russian word in a string (also in Russian) in PHP?

So for example something like this:

$pattern = '/слово/';
preg_replace($pattern, $replacement, $string_in_russian)

I tried utf8_encode and htmlentities with UTF-8 flag for $pattern, but it didn't work. Should I also encode $string_in_russian?

Update: Suggestion for /u flag didn't work so I'm putting the actual code I need this for. It is from a glossary plugin for Wordpress (my site is properly setup to use Russian language, and it does work, but not in this instance). So here's the code

$glossary_title = $glossary_item->post_title;
$glossary_search = '/\b'.$glossary_title.'s*?\b(?=([^"]\*"[^"]\*")\*[^"]*$)/iu';
$glossary_replace = '&lt;a'.$timestamp.'&gt;$0&lt;/a'.$timestamp.'&gt;';
$content_temp = preg_replace($glossary_search, $glossary_replace, $content, 1);

When I do a quick echo into HTML comment this is the kind of string I get for the pattern
/\bсловоs*?\b(?=([^"]*"[^"]")[^"]*$)/iu

And well, that still doesn't seem to work. I thought maybe it was the "s" that was screwing me over (this level of regex is a bit beyond me but I assume it's there for possible plurals), but removing it didn't help.

Update #2: Okay so I decided to do a complete "blank slate" test - plain PHP file with some $content strings in English and Russian and target words to replace. Here is the code

$content_en = 'Nulla volutpat pretium nunc, ac feugiat neque lobortis vitae. In eu sapien sit amet eros tincidunt viverra. <b style="color:purple">Proin</b> congue hendrerit felis, et consequat neque ultrices lobortis. <b style="color:purple">Proin</b> luctus bibendum libero et molestie. Sed tristique lacus a urna semper eget feugiat lacus varius. Donec vel sodales diam. <b style="color:purple">Proin</b> fringilla laoreet purus, a facilisis nisi porttitor vel. Nullam ac justo ac elit laoreet ullamcorper vel a magna. Suspendisse in arcu sapien.';
$find_en = 'proin';
$replace_with_en = '<em style="color:red">REPLACEMENT</em>';
$glossary_search = '/\b'.$find_en.'s*?\b(?=([^"]*"[^"]*")*[^"]*$)/iu';
$content_en_replaced = preg_replace($glossary_search, $replace_with_en, $content_en);

$content_ru = 'Lorem Ipsum используют потому, что тот обеспечивает более или менее стандартное заполнение шаблона, а также реальное распределение букв и пробелов в абзацах, которое не получается при простой дубликации "Здесь <b style="color:purple">ваш</b> текст.. Здесь <b style="color:purple">ваш</b> текст.. Здесь <b style="color:purple">ваш</b> текст.." Многие программы электронной вёрстки и редакторы HTML используют Lorem Ipsum в качестве текста по умолчанию.';
$find_ru = 'ваш';
$replace_with_ru = '<em style="color:red">Многие</em>';
$glossary_search = '/\b'.$find_ru.'s*?\b(?=([^"]*"[^"]*")*[^"]*$)/iu';
$content_ru_replaced = preg_replace($glossary_search, $replace_with_ru, $content_ru);

And here is a screenshot of the output http://www.flickr.com/photos/iliadraznin/5372578707/

As you can see the English text had the target word replaced, while the Russian hasn't and the code is identical and I'm using the /u flag. The file is also UTF-8 encoded. Any suggestions? (and again, I tried removing the "s", still nothing)

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douju1365 2011-01-20 17:34
关注
If you do a real blank slate test, you will find there's nothing wrong with the Russian - it's actually the word boundary aspect that is breaking the regex.

$glossary_search = '/'.$find_ru.'/iu'; // Works fine $glossary_search = '/\b'.$find_ru.'\b/iu'; // Breaks

Word boundary shorthand is not UTF-8 aware, so, per this question: php regex word boundary matching in utf-8 you can try the following:

$glossary_search = '/(?<!\pL)'.$find_ru.'(?!\pL)/iu';

That works fine on my test here.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(2条)

报告相同问题？

关注问题

php中使用preg_replace函数匹配图片并加上链接的方法
2020-10-27 16:35

然而，开发者在使用时应考虑到其性能开销，并在适合的场景中选择`str_replace()`或`preg_replace()`。在上面的例子中，通过编写一段代码即可实现为图片添加超链接的功能，大大简化了网页内容编辑和维护的工作。
PHP正则替换函数preg_replace和preg_replace_callback使用总结
2020-10-25 10:27

在PHP中，有两个常用的函数用于执行正则替换操作：preg_replace() 和 preg_replace_callback()。首先，我们来看一下preg_replace()函数。这个函数的基本形式是：mixed preg_replace ( mixed $pattern , mixed $...
c#中的实现php中的preg_replace
2020-12-18 01:38

- PHP中使用`preg_replace`函数匹配图片并加上链接的方法 - PHP 正则表达式之正则处理函数小结(`preg_match`, `preg_match_all`, `preg_replace`, `preg_split`) - PHP正则替换变量指定字符的方法 - PHP中正则替换...
php preg_match_all结合str_replace替换内容中所有img
2020-10-30 06:02

在PHP编程中，`preg_match_all` 和 `str_replace` 是两个非常重要的字符串处理函数，它们经常被用来处理HTML或XML文档中的特定内容。在这个场景中，开发者需要从采集的数据中提取并替换`<img>`标签，以符合站点的...
php中preg_replace_callback函数简单用法示例
2020-12-19 17:41

在PHP编程语言中，`preg_replace_callback`是一个非常实用的函数，它允许开发者在处理正则表达式匹配时，自定义替换的过程。这个函数的工作原理是：它接收一个模式（pattern）、一个回调函数（callback）以及一个...
详解PHP正则表达式替换实现(PHP preg_replace，PHP preg_replace)
2021-01-19 21:04

PHP正则表达式替换实现是如何的呢？首先向你介绍下PHP preg_replace，PHP preg_replace的使用是我们...preg_replace：允许你替换字符串中匹配到你定义的正则表达式。一个简单的注释移除功能： preg_replace(‘[(/*)+.
php preg_replace替换实例讲解
2020-10-26 18:24

这个过程不仅帮助我们理解了PHP preg_replace()函数在多模式匹配下的使用方法，还展示了在复杂条件下进行精确文本替换的技巧。这对于进行动态内容生成、数据清洗、文本格式化等应用场景非常关键。学习并掌握preg_...
深入研究PHP中的preg_replace和代码执行
2020-10-18 05:20

在PHP中，如果在双引号字符串中使用变量的变量名（比如${变量}），PHP会先解析该变量变量名，然后再执行。这也是为什么能够执行payload中${phpinfo()}的原因，因为${phpinfo()}会被解析成phpinfo()函数的调用。 ...
php中preg_replace正则替换用法分析【一次替换多个值】
2020-10-20 15:00

使用preg_replace，我们可以实现一次替换多个值的效果，这一点在处理复杂的文本替换任务时尤其有用。首先，了解php的str_replace函数与preg_replace的差别是很重要的。str_replace用于简单的字符串替换，而不需要...
php中的preg_replace函数,PHP函数preg_replace()
2021-05-02 10:25

weixin_39546312的博客该函数可以执行正则表达式的搜索和替换，是一个最强大的字符串替换处理函数，该函数会有三个参数，subject中搜索第一个参数pattern模式的匹配项，并替换为第二个参数，如果指定了第四个可选参数limit，则仅替换limit...
PHP函数：preg_replace()和preg_replace_callback() 【详记】
2024-10-20 15:31

小纭在努力的博客本文详细介绍了PHP的函数preg_replace()和preg_replace_callback() 【详记】
php中的preg_replace函数,PHP正则替换preg_replace函数如何使用
2021-05-02 10:26

Design设迹的博客 PHP正则替换preg_replace函数的使用方法：1、去掉0字符，代码为【preg_replace("/0/","",$str)】；2、去掉所有数字，代码为【preg_replace("/[0-9]/","",$str)】。PHP正则替换preg_replace函数的使用方法：...
PHP5.2下preg_replace函数的问题
2020-12-18 19:39

在PHP编程语言中，`preg_replace` 是一个非常重要的正则表达式替换函数，它能够按照指定的模式匹配字符串，并将匹配到的部分替换为新的内容。然而，在PHP 5.2版本中，`preg_replace` 函数可能会遇到一些特定的问题，...
php正则preg_replace_callback函数用法实例
2020-12-18 11:24

在PHP编程语言中，`preg_replace_callback`是一个非常有用的函数，它允许你在匹配正则表达式后执行自定义回调函数来处理匹配到的内容。本文将深入讲解`preg_replace_callback`函数的用法，并通过实例来展示如何使用...
php小经验:解析preg_match与preg_match_all 函数
2020-12-19 00:23

这个例子中，`preg_match()`会在找到第一次匹配后停止，如果你想获取所有匹配项，就需要使用`preg_match_all()`函数。 `preg_match_all()`函数执行全局正则匹配，找出所有匹配的结果。其语法与`preg_match()`相似，...
PHP preg_replace正则表达式涉及汉字乱码
2024-05-28 11:01

zpj~.~的博客因此，如果您使用的是 PHP 4.2.3 或更高版本，您就可以放心地在正则表达式中使用。1、中文汉字、中文字符匹配出现乱码，只针对["省","市","自治州","自治区"]表达式，需要添加/u修饰符，才不会乱码（php高版本支持）...
没有解决我的问题, 去提问

如何在PHP中使用preg_replace匹配俄语单词？

3条回答 默认 最新

3条回答默认最新