在Go中将带有UTF-8字节字符串的命令行输出转换为Unicode代码点

I am running an executable from Go via os.Exec, which gives me the following output: (\\xe2\\x96\\xb2). The output contains a UTF-8 byte string, which I want to convert to the corresponding Unicode codepoint (U+25B2). What I am expecting to see, or trying to convert to is: "(▲)". I have looked at this entry in the Go Blog (https://blog.golang.org/strings), but it starts out with an Interpreted string literal, whereas the command output seems to be a Raw string literal. I have tried strconv.Quote and strconv.Unquote, which does not achieve what I'm looking for.

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douyong4623 2019-04-10 13:30
关注
You can use the strconv package to parse the string literal containing the escape sequences.

The quick and dirty way is to simply add the missing quotes and interpret it as a quoted string using strconv.Unquote

s := `\xe2\x96\xb2` s, err := strconv.Unquote(`"` + s + `"`)

You can also directly parse the string one character at a time (which is what Unquote does internally), using strconv.UnquoteChar

s := `\xe2\x96\xb2` buf := make([]byte, 0, 3*len(s)/2) for len(s) > 0 { c, _, ss, err := strconv.UnquoteChar(s, 0) if err != nil { log.Fatal(err) } s = ss buf = append(buf, byte(c)) } s = string(buf)

https://play.golang.org/p/6SDij9d-aRr
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报
编辑

预览
轻敲空格完成输入
显示为

卡片

标题

链接
评论

按下Enter换行，Ctrl+Enter发表内容

编辑

预览

报告相同问题？

关注问题

在Go中将带有UTF-8字节字符串的命令行输出转换为Unicode代码点

1条回答 默认 最新

1条回答默认最新