douyi02577 2014-07-03 14:02
浏览 142
已采纳

GoLang-使用ISO-8859-1字符集进行持久化

I'm developing a project where we need to persist our information in a legacy database that has ISO-8859-1 tables. So before writing something to the database I need to convert it from UTF-8 to ISO-8859-1, and every time I retrieve it from the database, I need to convert it back to UTF-8.

I was trying to use the library code.google.com/p/go-charset/ as the following for each text field that I need to persist.

import (
  "bytes"
  "code.google.com/p/go-charset/charset"
  _ "code.google.com/p/go-charset/data"
  "fmt"
  "io/ioutil"
  "strings"
)

func toISO88591(utf8 string) string {
    buf := new(bytes.Buffer)

    w, err := charset.NewWriter("latin1", buf)
    if err != nil {
        panic(err)
    }
    defer w.Close()

    fmt.Fprintf(w, utf8)
    return buf.String()
}

func fromISO88591(iso88591 string) string {
    r, err := charset.NewReader("latin1", strings.NewReader(iso88591))
    if err != nil {
        panic(err)
    }

    buf, err := ioutil.ReadAll(r)
    if err != nil {
        panic(err)
    }

    return string(buf)
}

The problem is that the data is still persisted in UTF-8 even if I use the function toISO88591. I am doing something wrong in this conversion?

My database is a MySQL, and I'm using the github.com/go-sql-driver/mysql driver with the following connection parameters:

<user>:<password>@tcp(<host>:<port>)/<database>?collation=latin1_general_ci

Best regards!

  • 写回答

1条回答 默认 最新

  • doudiaozhi6658 2014-07-03 18:04
    关注

    package charset

    import "code.google.com/p/go-charset/charset" 
    

    func NewWriter

    func NewWriter(charset string, w io.Writer) (io.WriteCloser, error)
    

    NewWriter returns a new WriteCloser writing to w. It converts writes of UTF-8 text into writes on w of text in the named character set. The Close is necessary to flush any remaining partially translated characters to the output.


    I would follow the instructions: "The Close is necessary to flush any remaining partially translated characters to the output." For example,

    package main
    
    import (
        "bytes"
        "code.google.com/p/go-charset/charset"
        _ "code.google.com/p/go-charset/data"
        "fmt"
        "io/ioutil"
        "strings"
    )
    
    func toISO88591(utf8 string) (string, error) {
        buf := new(bytes.Buffer)
        w, err := charset.NewWriter("latin1", buf)
        if err != nil {
            return "", err
        }
        fmt.Fprintf(w, utf8)
        w.Close()
        return buf.String(), nil
    }
    
    func fromISO88591(iso88591 string) (string, error) {
        r, err := charset.NewReader("latin1", strings.NewReader(iso88591))
        if err != nil {
            return "", err
        }
        buf, err := ioutil.ReadAll(r)
        if err != nil {
            return "", err
        }
        return string(buf), nil
    }
    
    func main() {
        utfi := "£5 for Peppé"
        fmt.Printf("%q
    ", utfi)
        iso, err := toISO88591(utfi)
        if err != nil {
            fmt.Println(err)
        }
        fmt.Printf("%q
    ", iso)
        utfo, err := fromISO88591(iso)
        if err != nil {
            fmt.Println(err)
        }
        fmt.Printf("%q
    ", utfo)
        fmt.Println(utfi == utfo)
    }
    

    Output:

    "£5 for Peppé"
    "\xa35 for Pepp\xe9"
    "£5 for Peppé"
    true
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 关于#matlab#的问题:在模糊控制器中选出线路信息,在simulink中根据线路信息生成速度时间目标曲线(初速度为20m/s,15秒后减为0的速度时间图像)我想问线路信息是什么
  • ¥15 banner广告展示设置多少时间不怎么会消耗用户价值
  • ¥16 mybatis的代理对象无法通过@Autowired装填
  • ¥15 可见光定位matlab仿真
  • ¥15 arduino 四自由度机械臂
  • ¥15 wordpress 产品图片 GIF 没法显示
  • ¥15 求三国群英传pl国战时间的修改方法
  • ¥15 matlab代码代写,需写出详细代码,代价私
  • ¥15 ROS系统搭建请教(跨境电商用途)
  • ¥15 AIC3204的示例代码有吗,想用AIC3204测量血氧,找不到相关的代码。