PHP: When should unescaped UTF-8 be saved to a JSON file?

Is there any benefit to saving UTF-8 characters unescaped in a JSON file if one only accesses them through PHP?

Here is what I tested:

fwrite(fopen('fileA.json','w'), json_encode('аккредитовать'));

then the content of fileA.json is given by

"\u0430\u043a\u043a\u0440\u0435\u0434\u0438\u0442\u043e\u0432\u0430\u0442\u044c"

However, when I store it with

fwrite(fopen('fileB.json','w'), json_encode('аккредитовать', JSON_UNESCAPED_UNICODE));

the content of fileB.json is given by

"аккредитовать"

To my surprise each of the following calls

echo json_decode(file_get_contents('fileA.json'));
echo json_decode(file_get_contents('fileB.json'));
echo json_decode(file_get_contents('fileA.json'), false, 512, JSON_UNESCAPED_UNICODE);
echo json_decode(file_get_contents('fileB.json'), false, 512, JSON_UNESCAPED_UNICODE);

gives the same output:

'аккредитовать'

So I would conclude that I only need to save unescaped UTF-8 characters in a JSON file if I want to open and read the file directly in an editor. If I only plan to read and display the content with PHP, I don't need to save it unescaped, and I can use

fwrite(fopen('fileA.json','w'), json_encode('аккредитовать'));
echo json_decode(file_get_contents('fileA.json'));

Is that correct, or did I miss anything important?

Answer (dtng5978, 2017-10-25 12:15):
With JSON_UNESCAPED_UNICODE the JSON is now:

- more human readable
- not ASCII-safe

That's the only tradeoff you're making. Once you have non-ASCII characters in your JSON, you need to ensure the JSON is handled in a binary-safe manner; e.g. you cannot simply send it over a channel that expects only ASCII data, or you need to care about the specific encoding if a channel is encoding aware (e.g. storing it in a database). None of this is of any concern when simply writing the data to a file and then reading it again, as long as the reader is treating the encoding correctly (which PHP is doing here, since it doesn't care about the encoding).
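To make the "ASCII-safe" point concrete, here is a minimal sketch that checks both encodings of the same string for non-ASCII bytes. The helper name is_ascii is made up for this example, and the script assumes the source file itself is saved as UTF-8.

```php
<?php
// Hypothetical helper (not a PHP built-in): true if $s contains only bytes 0x00-0x7F.
function is_ascii(string $s): bool
{
    // No /u modifier, so the pattern is matched byte by byte.
    return preg_match('/[^\x00-\x7F]/', $s) === 0;
}

$word = 'аккредитовать';

$escaped   = json_encode($word);                          // default: \uXXXX escapes
$unescaped = json_encode($word, JSON_UNESCAPED_UNICODE);  // raw UTF-8 bytes

var_dump(is_ascii($escaped));    // bool(true)
var_dump(is_ascii($unescaped));  // bool(false)
```

The escaped form can travel over an ASCII-only channel unchanged; the unescaped form needs every step along the way to preserve its bytes.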

The JSON format itself doesn't care either way, "а" and "\u0430" represent the exact same character.
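A quick way to check this is to decode both spellings and compare the results. This is only a sketch, again assuming the script file is saved as UTF-8:

```php
<?php
// Single quotes: PHP leaves \u0430 alone, so the JSON parser sees the escape sequence.
$fromEscape = json_decode('"\u0430"');
// Here the JSON parser sees the raw UTF-8 bytes of U+0430 instead.
$fromRaw = json_decode('"а"');

var_dump($fromEscape === $fromRaw);  // bool(true)
var_dump(bin2hex($fromEscape));      // string(4) "d0b0"
```

Either way the decoded PHP string holds the same two UTF-8 bytes.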

It should be noted that escaped Unicode takes up more storage than UTF-8 encoded text (6-12 bytes vs. 2-4 bytes per non-ASCII character). But that hardly matters in the majority of cases.
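The size difference can be measured with strlen(), which counts bytes rather than characters. The following is a rough illustration using the word from the question plus one emoji (my own addition) to show the 12-byte surrogate-pair case:

```php
<?php
$word = 'аккредитовать';  // 13 Cyrillic letters, 2 bytes each in UTF-8

$escaped   = json_encode($word);
$unescaped = json_encode($word, JSON_UNESCAPED_UNICODE);

echo strlen($escaped), "\n";    // 80 = 13 * 6 bytes + 2 quote characters
echo strlen($unescaped), "\n";  // 28 = 13 * 2 bytes + 2 quote characters

// A character outside the Basic Multilingual Plane is escaped as a surrogate pair.
$emoji = "\u{1F600}";  // 4 bytes in UTF-8

$emojiEscaped   = json_encode($emoji);                          // "\ud83d\ude00"
$emojiUnescaped = json_encode($emoji, JSON_UNESCAPED_UNICODE);

echo strlen($emojiEscaped), "\n";    // 14 = 12 bytes + 2 quote characters
echo strlen($emojiUnescaped), "\n";  // 6 = 4 bytes + 2 quote characters
```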

Note also: JSON_UNESCAPED_UNICODE is not a valid flag for json_decode; it's simply superfluous there.
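As a small check of that last point, decoding with and without the flag should give identical results. A sketch:

```php
<?php
$json = json_encode('аккредитовать');  // escaped form, as in fileA.json

$plain    = json_decode($json);
// JSON_UNESCAPED_UNICODE only affects json_encode; json_decode does not look at it.
$withFlag = json_decode($json, false, 512, JSON_UNESCAPED_UNICODE);

var_dump($plain === $withFlag);  // bool(true)
echo $plain, "\n";               // аккредитовать
```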