How do I convert any encoding to UTF-8 in Go?

I am downloading messages over IMAP. I then parse each message and store it in MongoDB, but I ran into a problem because MongoDB only supports UTF-8. How can I convert every encoding to UTF-8?

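I know I could store the data as binary, but I need ordinary text because I have to search the database for phrases. Or is it possible to search for ordinary text inside binary data?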

doulierong0334 2014-12-04 15:37
I'm using the go-charset project to do this: https://code.google.com/p/go-charset/

It's pretty straightforward: you create a reader for a given charset, and it translates to UTF-8 automatically. Example from the library:

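    // go-charset's NewReader takes the charset name first, then the source reader.
    r, err := charset.NewReader("latin1", strings.NewReader("\xa35 for Pepp\xe9"))
    if err != nil {
        log.Fatal(err)
    }
    // Reading from r yields the input translated to UTF-8.
    result, err := ioutil.ReadAll(r)
    if err != nil {
        log.Fatal(err)
    }
    fmt.Printf("%s\n", result)
    // outputs £5 for Peppé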
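To make concrete what such a reader does for the simplest case, here is a minimal standard-library sketch. It is not part of go-charset, and the helper name latin1ToUTF8 is made up for illustration. It relies on the fact that every ISO-8859-1 byte value equals the Unicode code point with the same number, so it only works for that one charset; anything else needs real conversion tables like the ones a charset library ships.

```go
package main

import "fmt"

// latin1ToUTF8 (hypothetical helper) decodes ISO-8859-1 bytes into a Go string,
// which is UTF-8 encoded. Each Latin-1 byte value is also its Unicode code point,
// so widening every byte to a rune is sufficient for this charset only.
func latin1ToUTF8(b []byte) string {
	runes := make([]rune, len(b))
	for i, c := range b {
		runes[i] = rune(c)
	}
	// Converting []rune to string encodes each code point as UTF-8.
	return string(runes)
}

func main() {
	raw := []byte("\xa35 for Pepp\xe9") // same Latin-1 bytes as the library example
	fmt.Println(latin1ToUTF8(raw))      // prints: £5 for Peppé
}
```

The resulting string can be stored as ordinary text, so phrase searches in the database work on it as usual.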

Now, in my case I know the charset because it comes from web pages and I read the headers/meta tags. If you need to detect the charset automatically by heuristics, you'll need another library for that, such as this one: https://github.com/saintfish/chardet

I haven't used it but it also looks pretty simple to use:

    detector := chardet.NewTextDetector()
    // some_text holds the raw bytes whose charset should be guessed.
    result, err := detector.DetectBest(some_text)
    if err == nil {
        fmt.Printf(
            "Detected charset is %s, language is %s",
            result.Charset,
            result.Language)
    }
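Before falling back to heuristic detection, it may be worth checking what the message itself declares. The sketch below uses only the standard library and assumes that each mail part fetched over IMAP carries a Content-Type header with a charset parameter; the helper names charsetFromContentType and needsConversion are hypothetical. Treating undeclared bytes that happen to be valid UTF-8 as UTF-8 is a heuristic, not a guarantee.

```go
package main

import (
	"fmt"
	"mime"
	"strings"
	"unicode/utf8"
)

// charsetFromContentType (hypothetical helper) returns the lowercased charset
// parameter of a Content-Type header value, or "" if it is missing or the
// header cannot be parsed.
func charsetFromContentType(header string) string {
	_, params, err := mime.ParseMediaType(header)
	if err != nil {
		return ""
	}
	return strings.ToLower(params["charset"])
}

// needsConversion (hypothetical helper) reports whether body should be
// transcoded before it is stored as UTF-8 text. Bodies declared as UTF-8 or
// US-ASCII, or with no declaration at all, are accepted as-is only when they
// are valid UTF-8; everything else is flagged for conversion.
func needsConversion(declared string, body []byte) bool {
	switch declared {
	case "", "utf-8", "us-ascii":
		return !utf8.Valid(body)
	}
	return true
}

func main() {
	cs := charsetFromContentType("text/plain; charset=ISO-8859-1")
	fmt.Println(cs)                                        // iso-8859-1
	fmt.Println(needsConversion(cs, []byte("Pepp\xe9")))   // true
	fmt.Println(needsConversion("utf-8", []byte("Peppé"))) // false
}
```

A part flagged by needsConversion would then go through a charset reader when the declared name is known, or through a detector such as the one above when it is not.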
This answer was selected by the asker as the best answer.
