dqwmhrxt68679 2019-04-10 18:21
浏览 758
已采纳

在Go中将带有UTF-8字节字符串的命令行输出转换为Unicode代码点

I am running an executable from Go via os.Exec, which gives me the following output: (\\xe2\\x96\\xb2). The output contains a UTF-8 byte string, which I want to convert to the corresponding Unicode codepoint (U+25B2). What I am expecting to see, or trying to convert to is: "(▲)". I have looked at this entry in the Go Blog (https://blog.golang.org/strings), but it starts out with an Interpreted string literal, whereas the command output seems to be a Raw string literal. I have tried strconv.Quote and strconv.Unquote, which does not achieve what I'm looking for.

  • 写回答

1条回答 默认 最新

  • douyong4623 2019-04-10 21:30
    关注

    You can use the strconv package to parse the string literal containing the escape sequences.

    The quick and dirty way is to simply add the missing quotes and interpret it as a quoted string using strconv.Unquote

    s := `\xe2\x96\xb2`
    s, err := strconv.Unquote(`"` + s + `"`)
    

    You can also directly parse the string one character at a time (which is what Unquote does internally), using strconv.UnquoteChar

    s := `\xe2\x96\xb2`
    buf := make([]byte, 0, 3*len(s)/2)
    for len(s) > 0 {
        c, _, ss, err := strconv.UnquoteChar(s, 0)
        if err != nil {
            log.Fatal(err)
        }
        s = ss
        buf = append(buf, byte(c))
    }
    s = string(buf)
    

    https://play.golang.org/p/6SDij9d-aRr

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 matlab答疑 关于海上风电的爬坡事件检测
  • ¥88 python部署量化回测异常问题
  • ¥30 酬劳2w元求合作写文章
  • ¥15 在现有系统基础上增加功能
  • ¥15 远程桌面文档内容复制粘贴,格式会变化
  • ¥15 关于#java#的问题:找一份能快速看完mooc视频的代码
  • ¥15 这种微信登录授权 谁可以做啊
  • ¥15 请问我该如何添加自己的数据去运行蚁群算法代码
  • ¥20 用HslCommunication 连接欧姆龙 plc有时会连接失败。报异常为“未知错误”
  • ¥15 网络设备配置与管理这个该怎么弄