dongren1011 2016-08-22 11:13
浏览 32
已采纳

使用mb_substr仍然会在最后破坏重音字符

Logic: I am getting username from DB and if it is greater than 30 in length then i show 30 characters with "..." appended at the end. Code is

$username = htmlspecialchars($username);
if(mb_strlen($username, 'utf-8')>30){
    $username_trimmed = mb_substr($username, 0, 30, 'utf-8').'...';
}

and in my navivation I am just printing this username

<class="userName">Hello, <?php echo $username_trimmed; ?>

My encoding in set as utf-8, and mbstring extension is enabled in php.

Output of above code : It still breaks the accent character É because it is multi-byte character and it is getting cut the in the middle. Actual word is MARCHÉS and output is:
Erroneous output

Question what am I missing? mb_substr should not consider it as a single character and should not stop it from breaking in the middle as it does?

  • 写回答

2条回答 默认 最新

  • doubi2228 2016-08-22 12:19
    关注

    Your string is actually "&Eacute;", not "É". mb_substr handles your characters just fine, it does not handle HTML entities. Don't store HTML entities in your database, store actual Unicode characters. At the very least, decode from HTML entities to actual characters using html_entity_decode($str, ENT_COMPAT, 'UTF-8') before applying mb_substr (and then apply htmlspecialchars again afterwards to preserve HTML syntax).

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 #MATLAB仿真#车辆换道路径规划
  • ¥15 java 操作 elasticsearch 8.1 实现 索引的重建
  • ¥15 数据可视化Python
  • ¥15 要给毕业设计添加扫码登录的功能!!有偿
  • ¥15 kafka 分区副本增加会导致消息丢失或者不可用吗?
  • ¥15 微信公众号自制会员卡没有收款渠道啊
  • ¥100 Jenkins自动化部署—悬赏100元
  • ¥15 关于#python#的问题:求帮写python代码
  • ¥20 MATLAB画图图形出现上下震荡的线条
  • ¥15 关于#windows#的问题:怎么用WIN 11系统的电脑 克隆WIN NT3.51-4.0系统的硬盘