dsk88199 2014-08-28 20:18
浏览 342
已采纳

无效的Unicode代码点0xd83f

I'm trying to port some Java to Go. The Java code has a character variable with the value '\ud83f'. When I try to use this value in Go, it doesn't compile:

package main
func main() {
    c := '\ud83f'
    println(c)
}
$ go run a.go
# command-line-arguments
./a.go:3: invalid Unicode code point in escape sequence: 0xd83f

Why? I also tried making a string with that value in Python and it worked too. It's just not working in Go for some reason.

  • 写回答

2条回答

  • dsm13698679318 2014-08-29 00:59
    关注

    That rune literal you tried to use is invalid because it denotes a surrogate code point. The spec says rune literals cannot denote a surrogate code point ("as well as others" (which?)):

    Rune Literals

    [...]

    The escapes \u and \U represent Unicode code points so within them some values are illegal, in particular those above 0x10FFFF and surrogate halves.

    Further below in the examples, you can see another case which is deemed illegal:

    '\U00110000' // illegal: invalid Unicode code point

    Which seems to imply that invalid code points (such as those above 10ffff) are also illegal in rune literals.

    Note that since rune is merely an alias for int32, you can simply do:

    var r rune = 0xd8f3
    

    instead of

    var r rune = '\ud8f3'
    

    And if you wanted to get a number above 10FFFF you could do

    var r rune = 0x11ffff
    

    instead of

    var r rune = '\U0011ffff'
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥50 易语言把MYSQL数据库中的数据添加至组合框
  • ¥20 求数据集和代码#有偿答复
  • ¥15 关于下拉菜单选项关联的问题
  • ¥20 java-OJ-健康体检
  • ¥15 rs485的上拉下拉,不会对a-b<-200mv有影响吗,就是接受时,对判断逻辑0有影响吗
  • ¥15 使用phpstudy在云服务器上搭建个人网站
  • ¥15 应该如何判断含间隙的曲柄摇杆机构,轴与轴承是否发生了碰撞?
  • ¥15 vue3+express部署到nginx
  • ¥20 搭建pt1000三线制高精度测温电路
  • ¥15 使用Jdk8自带的算法,和Jdk11自带的加密结果会一样吗,不一样的话有什么解决方案,Jdk不能升级的情况