How do I process strings containing emoji in Go (decode or remove invalid Unicode code points)?

Example string:

"\u0410\u043b\u0435\u043a\u0441\u0430\u043d\u0434\u0440\u044b! 
\u0421\u043f\u0430\u0441\u0438\u0431\u043e \ud83d\udcf8 link.ru \u0437\u0430 
#hashtag  Русское слово, an English word"

Without this \ud83d\udcf8 my func works well:

func convertUnicode(text string) string {
    s, err := strconv.Unquote(`"` + text + `"`)
    if err != nil {
        // Error.Printf("can't convert: %s | err: %s\n", text, err)
        return text
    }
    return s
}

My question: how can I detect that the text contains entries like this, and how can I either convert them to emoji or remove them from the text? Thanks

dsadsadsa1231 2018-10-18 20:44
Well, it is probably not that simple: neither \ud83d nor \udcf8 is a valid code point on its own, but together they form a surrogate pair, which UTF-16 uses to encode \U0001F4F8. strconv.Unquote will give you two surrogate halves, which you have to combine yourself.

1. Use strconv.Unquote to unquote as you did.

2. Convert to []rune for convenience.

3. Find surrogate pairs with unicode/utf16.IsSurrogate.

4. Combine surrogate pairs with unicode/utf16.DecodeRune.

5. Convert back to string.
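
The steps above can be sketched roughly as follows. One caveat, which is an assumption worth checking on your Go version: a Go string cannot hold a surrogate half as valid UTF-8, so strconv.Unquote may return an error or substitute U+FFFD instead of preserving the halves. For that reason this sketch departs from step 1 and reads the \uXXXX escapes directly from the raw text. The names parseEscape, decodeEscapes and stripPairs are hypothetical. Only \uXXXX escapes are handled; other escapes such as \n or \\ are copied through unchanged, so an escaped backslash before a u would be misread.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
	"unicode"
	"unicode/utf16"
)

// parseEscape (hypothetical helper) reports the 16-bit value of a
// `\uXXXX` escape at the very start of s.
func parseEscape(s string) (rune, bool) {
	if len(s) < 6 || s[0] != '\\' || s[1] != 'u' {
		return 0, false
	}
	v, err := strconv.ParseUint(s[2:6], 16, 16)
	if err != nil {
		return 0, false
	}
	return rune(v), true
}

// decodeEscapes (hypothetical helper) replaces `\uXXXX` escapes in text
// with the characters they encode. Surrogate pairs are combined into one
// code point, or removed entirely when stripPairs is true. Unpaired
// surrogate halves are always dropped.
func decodeEscapes(text string, stripPairs bool) string {
	var b strings.Builder
	for i := 0; i < len(text); {
		r, ok := parseEscape(text[i:])
		if !ok {
			b.WriteByte(text[i]) // not an escape: copy the byte unchanged
			i++
			continue
		}
		i += 6
		if utf16.IsSurrogate(r) {
			lo, paired := parseEscape(text[i:])
			if !paired {
				continue // unpaired half: drop it
			}
			r = utf16.DecodeRune(r, lo)
			if r == unicode.ReplacementChar {
				continue // not a valid pair: drop only the first half
			}
			i += 6
			if stripPairs {
				continue // caller asked to remove the emoji
			}
		}
		b.WriteRune(r)
	}
	return b.String()
}

func main() {
	in := `\u0421\u043f\u0430\u0441\u0438\u0431\u043e \ud83d\udcf8 link.ru`
	fmt.Println(decodeEscapes(in, false)) // convert the pair to an emoji
	fmt.Println(decodeEscapes(in, true))  // remove the pair instead
}
```

Text that is already plain UTF-8, such as the "Русское слово" part of the example string, passes through untouched because it is copied byte by byte.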
This answer was accepted by the asker as the best answer.