dqwmhrxt68679 2019-04-10 18:21
浏览 759
已采纳

在Go中将带有UTF-8字节字符串的命令行输出转换为Unicode代码点

I am running an executable from Go via os.Exec, which gives me the following output: (\\xe2\\x96\\xb2). The output contains a UTF-8 byte string, which I want to convert to the corresponding Unicode codepoint (U+25B2). What I am expecting to see, or trying to convert to is: "(▲)". I have looked at this entry in the Go Blog (https://blog.golang.org/strings), but it starts out with an Interpreted string literal, whereas the command output seems to be a Raw string literal. I have tried strconv.Quote and strconv.Unquote, which does not achieve what I'm looking for.

  • 写回答

1条回答 默认 最新

  • douyong4623 2019-04-10 21:30
    关注

    You can use the strconv package to parse the string literal containing the escape sequences.

    The quick and dirty way is to simply add the missing quotes and interpret it as a quoted string using strconv.Unquote

    s := `\xe2\x96\xb2`
    s, err := strconv.Unquote(`"` + s + `"`)
    

    You can also directly parse the string one character at a time (which is what Unquote does internally), using strconv.UnquoteChar

    s := `\xe2\x96\xb2`
    buf := make([]byte, 0, 3*len(s)/2)
    for len(s) > 0 {
        c, _, ss, err := strconv.UnquoteChar(s, 0)
        if err != nil {
            log.Fatal(err)
        }
        s = ss
        buf = append(buf, byte(c))
    }
    s = string(buf)
    

    https://play.golang.org/p/6SDij9d-aRr

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 metadata提取的PDF元数据,如何转换为一个Excel
  • ¥15 关于arduino编程toCharArray()函数的使用
  • ¥100 vc++混合CEF采用CLR方式编译报错
  • ¥15 coze 的插件输入飞书多维表格 app_token 后一直显示错误,如何解决?
  • ¥15 vite+vue3+plyr播放本地public文件夹下视频无法加载
  • ¥15 c#逐行读取txt文本,但是每一行里面数据之间空格数量不同
  • ¥50 如何openEuler 22.03上安装配置drbd
  • ¥20 ING91680C BLE5.3 芯片怎么实现串口收发数据
  • ¥15 无线连接树莓派,无法执行update,如何解决?(相关搜索:软件下载)
  • ¥15 Windows11, backspace, enter, space键失灵