I am generating CSV files. Occasionally the data source will pass along characters with accents etc... that I would like to strip out. Is there a reasonably straightforward way to detect and strip out UTF-8 characters?
2条回答 默认 最新
- drpmazn9021 2012-08-07 22:31关注
If you're sure you're getting UTF-8 as input, use iconv to convert the values to the encoding you're using in your output - detecting UTF-8 chars isn't failsafe (as the values are valid iso-8859-1 characters as well (or all 8 bit encodings, really).
If you just want to use the regular ascii set of values (byte-values 0 - 127), you can let iconv convert to the 'ascii' encoding and transliterate:
iconv("utf-8", "ascii//TRANSLIT", "Hei og hå")
will result in
hei og ha
being returned.
本回答被题主选为最佳回答 , 对您是否有帮助呢?解决 无用评论 打赏 举报
悬赏问题
- ¥15 linux驱动,linux应用,多线程
- ¥20 我要一个分身加定位两个功能的安卓app
- ¥15 基于FOC驱动器,如何实现卡丁车下坡无阻力的遛坡的效果
- ¥15 IAR程序莫名变量多重定义
- ¥15 (标签-UDP|关键词-client)
- ¥15 关于库卡officelite无法与虚拟机通讯的问题
- ¥15 目标检测项目无法读取视频
- ¥15 GEO datasets中基因芯片数据仅仅提供了normalized signal如何进行差异分析
- ¥100 求采集电商背景音乐的方法
- ¥15 数学建模竞赛求指导帮助