dounaoji2054 2014-02-26 17:53
浏览 56
已采纳

替换utf8字符串中的所有非单词字符[重复]

This question already has an answer here:

how can i replace all non word characters (utf-8) in a string ?

for ASCII:

$url = preg_replace("/\W+/", " ", $url);

is there any equivalent for UTF-8 ?

</div>
  • 写回答

2条回答 默认 最新

  • duanpin9531 2014-02-26 18:07
    关注

    You can use the Xwd character class that contains letters, digits and underscore:

    $url = preg_replace('~\P{Xwd}+~u', ' ', $url);
    

    If you don't want the underscore, you can use Xan

    \p{Xwd} (Perl word character) is a predefined character class and \P{Xwd} is the negation of this class.

    The u modifier means that the string must be treated as an unicode string.

    equivalence:

    \p{Xan}        <=>     [\p{L}\p{N}]
    \p{Xwd}        <=>     [\p{Xan}_]
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 用lstm来预测股票价格
  • ¥15 请问paddlehub能支持移动端开发吗?在Android studio上该如何部署?
  • ¥170 如图所示配置eNSP
  • ¥20 docker里部署springboot项目,访问不到扬声器
  • ¥15 netty整合springboot之后自动重连失效
  • ¥15 悬赏!微信开发者工具报错,求帮改
  • ¥20 wireshark抓不到vlan
  • ¥20 关于#stm32#的问题:需要指导自动酸碱滴定仪的原理图程序代码及仿真
  • ¥20 设计一款异域新娘的视频相亲软件需要哪些技术支持
  • ¥15 stata安慰剂检验作图但是真实值不出现在图上