Go: converting ISO 8859-1 to UTF-8

I am trying to convert an ISO 8859-1 encoded string to UTF-8.

The following function works with my test data, which contains German umlauts, but I'm not sure what source encoding the rune(b) conversion assumes. Does it assume some default encoding, e.g. ISO 8859-1, or is there a way to tell it which encoding to use?

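import "bytes"

// toUtf8 converts ISO 8859-1 bytes to a UTF-8 string.
func toUtf8(iso8859_1_buf []byte) string {
   // Length 0, capacity 2x: a non-zero length would leave NUL bytes at the
   // start of the buffer, and a Latin-1 byte needs at most 2 bytes in UTF-8.
   buf := bytes.NewBuffer(make([]byte, 0, len(iso8859_1_buf)*2))
   for _, b := range iso8859_1_buf {
      r := rune(b)
      buf.WriteRune(r)
   }
   return buf.String()
}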

douruanfan3030 2012-11-22 11:11
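rune is an alias for int32, and wherever encoding is involved, a rune is assumed to hold a Unicode code point. The conversion rune(b) therefore does no decoding: it only widens the byte's numeric value, and that value is then treated as a code point. For 0x00 to 0xFF the Unicode code points are identical to ISO 8859-1 (Latin-1), so the conversion is correct for your input and you don't have to worry about it.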
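To make the point about rune(b) concrete, here is a minimal sketch. The helper name latin1ByteToRune is made up for illustration and is not part of any library; it only shows that the conversion keeps the numeric value and nothing is decoded.

```go
package main

import "fmt"

// latin1ByteToRune is a hypothetical helper: it widens a byte to a rune.
// No decoding happens; the numeric value is kept and read as a code point.
func latin1ByteToRune(b byte) rune {
	return rune(b)
}

func main() {
	r := latin1ByteToRune(0xE4) // 0xE4 is 'ä' in ISO 8859-1
	fmt.Printf("%U %q\n", r, r) // prints: U+00E4 'ä'
}
```

This only gives the right character because the input really is ISO 8859-1. For a different single-byte encoding the same conversion would still compile and run, but the resulting code points would be wrong for every byte where that encoding differs from Latin-1.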

Then you need to encode the runes as UTF-8. That encoding is done simply by converting a []rune to a string.
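As a small sketch of that encoding step (encodeRunes is a hypothetical name used only here), converting a []rune to a string yields the UTF-8 bytes of the code points:

```go
package main

import "fmt"

// encodeRunes is a hypothetical helper: converting []rune to string
// produces the UTF-8 encoding of those code points.
func encodeRunes(rs []rune) string {
	return string(rs)
}

func main() {
	s := encodeRunes([]rune{0xE4, 0xF6, 0xFC}) // code points for ä, ö, ü
	fmt.Printf("%q % x\n", s, s)               // prints: "äöü" c3 a4 c3 b6 c3 bc
}
```

Each of the three code points lies above 0x7F, so each takes two bytes in UTF-8, which is why three runes become six bytes.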

This is an example of your function without using the bytes package:

func toUtf8(iso8859_1_buf []byte) string {
   buf := make([]rune, len(iso8859_1_buf))
   for i, b := range iso8859_1_buf {
      buf[i] = rune(b)
   }
   return string(buf)
}
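A possible way to try the function out, as a sketch: the sample bytes are made-up test input, and toUtf8 is repeated so the program is self-contained.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// toUtf8 repeats the function from the answer so this example compiles alone.
func toUtf8(iso8859_1_buf []byte) string {
	buf := make([]rune, len(iso8859_1_buf))
	for i, b := range iso8859_1_buf {
		buf[i] = rune(b)
	}
	return string(buf)
}

func main() {
	// "Grüße" encoded as ISO 8859-1: ü = 0xFC, ß = 0xDF.
	latin1 := []byte{'G', 'r', 0xFC, 0xDF, 'e'}
	s := toUtf8(latin1)
	// Print the result, whether it is valid UTF-8, and the byte counts
	// before and after conversion.
	fmt.Println(s, utf8.ValidString(s), len(latin1), len(s)) // prints: Grüße true 5 7
}
```

Note that this holds only for genuine ISO 8859-1 input. Windows-1252, which is often mislabeled as Latin-1, assigns printable characters to 0x80 through 0x9F, and those bytes would need a lookup table rather than a plain rune(b) conversion.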
The asker selected this answer as the best answer.
