dpsfay2510 2014-11-19 15:00
浏览 67

欧洲'é'字符,ASCII码为101 204 129

I have an issue with the character 'é'.

With a ftp_nlist($this->ftpStream, $directory); I've a string like that 'Parté.mp4' but the 'é' doesnt match the regex [\p{L}]*\.mp4

There are example here:

The ASCII code of the 'é' who doesn't work is '101 204 129'. The function ord($e); where $e is the weird character return '101' which is the code of the simple letter e.

It's seems like my 'é' is composed of three characters because I've to make a
$e = substr($fileName,4,3); to obtain my single character.

I would like to be able to authorize these characters in my regex... If you have any leads, thanks.

  • 写回答

2条回答 默认 最新

  • dsfds2343 2014-11-19 16:33
    关注

    Use the extended unicode option.

    \X*.mp4
    

    Regex Demo

    Here's the PHP manual that describes the extended unicode option.

    The \X escape matches a Unicode extended grapheme cluster. An extended grapheme cluster is one or more Unicode characters that combine to form a single glyph. In effect, this can be thought of as the Unicode equivalent of . as it will match one composed character, regardless of how many individual characters are actually used to render it.

    评论

报告相同问题?

悬赏问题

  • ¥15 R语言Rstudio突然无法启动
  • ¥15 关于#matlab#的问题:提取2个图像的变量作为另外一个图像像元的移动量,计算新的位置创建新的图像并提取第二个图像的变量到新的图像
  • ¥15 改算法,照着压缩包里边,参考其他代码封装的格式 写到main函数里
  • ¥15 用windows做服务的同志有吗
  • ¥60 求一个简单的网页(标签-安全|关键词-上传)
  • ¥35 lstm时间序列共享单车预测,loss值优化,参数优化算法
  • ¥15 Python中的request,如何使用ssr节点,通过代理requests网页。本人在泰国,需要用大陆ip才能玩网页游戏,合法合规。
  • ¥100 为什么这个恒流源电路不能恒流?
  • ¥15 有偿求跨组件数据流路径图
  • ¥15 写一个方法checkPerson,入参实体类Person,出参布尔值