douyi02577 2014-07-03 14:02
浏览 142
已采纳

GoLang-使用ISO-8859-1字符集进行持久化

I'm developing a project where we need to persist our information in a legacy database that has ISO-8859-1 tables. So before writing something to the database I need to convert it from UTF-8 to ISO-8859-1, and every time I retrieve it from the database, I need to convert it back to UTF-8.

I was trying to use the library code.google.com/p/go-charset/ as the following for each text field that I need to persist.

import (
  "bytes"
  "code.google.com/p/go-charset/charset"
  _ "code.google.com/p/go-charset/data"
  "fmt"
  "io/ioutil"
  "strings"
)

func toISO88591(utf8 string) string {
    buf := new(bytes.Buffer)

    w, err := charset.NewWriter("latin1", buf)
    if err != nil {
        panic(err)
    }
    defer w.Close()

    fmt.Fprintf(w, utf8)
    return buf.String()
}

func fromISO88591(iso88591 string) string {
    r, err := charset.NewReader("latin1", strings.NewReader(iso88591))
    if err != nil {
        panic(err)
    }

    buf, err := ioutil.ReadAll(r)
    if err != nil {
        panic(err)
    }

    return string(buf)
}

The problem is that the data is still persisted in UTF-8 even if I use the function toISO88591. I am doing something wrong in this conversion?

My database is a MySQL, and I'm using the github.com/go-sql-driver/mysql driver with the following connection parameters:

<user>:<password>@tcp(<host>:<port>)/<database>?collation=latin1_general_ci

Best regards!

  • 写回答

1条回答 默认 最新

  • doudiaozhi6658 2014-07-03 18:04
    关注

    package charset

    import "code.google.com/p/go-charset/charset" 
    

    func NewWriter

    func NewWriter(charset string, w io.Writer) (io.WriteCloser, error)
    

    NewWriter returns a new WriteCloser writing to w. It converts writes of UTF-8 text into writes on w of text in the named character set. The Close is necessary to flush any remaining partially translated characters to the output.


    I would follow the instructions: "The Close is necessary to flush any remaining partially translated characters to the output." For example,

    package main
    
    import (
        "bytes"
        "code.google.com/p/go-charset/charset"
        _ "code.google.com/p/go-charset/data"
        "fmt"
        "io/ioutil"
        "strings"
    )
    
    func toISO88591(utf8 string) (string, error) {
        buf := new(bytes.Buffer)
        w, err := charset.NewWriter("latin1", buf)
        if err != nil {
            return "", err
        }
        fmt.Fprintf(w, utf8)
        w.Close()
        return buf.String(), nil
    }
    
    func fromISO88591(iso88591 string) (string, error) {
        r, err := charset.NewReader("latin1", strings.NewReader(iso88591))
        if err != nil {
            return "", err
        }
        buf, err := ioutil.ReadAll(r)
        if err != nil {
            return "", err
        }
        return string(buf), nil
    }
    
    func main() {
        utfi := "£5 for Peppé"
        fmt.Printf("%q
    ", utfi)
        iso, err := toISO88591(utfi)
        if err != nil {
            fmt.Println(err)
        }
        fmt.Printf("%q
    ", iso)
        utfo, err := fromISO88591(iso)
        if err != nil {
            fmt.Println(err)
        }
        fmt.Printf("%q
    ", utfo)
        fmt.Println(utfi == utfo)
    }
    

    Output:

    "£5 for Peppé"
    "\xa35 for Pepp\xe9"
    "£5 for Peppé"
    true
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

    报告相同问题?

    悬赏问题

    • ¥20 如何利用C语言实现用最小二乘法选配两个经验公式
    • ¥50 vue-codemirror如何对指定行 指定位置的 字符进行背景颜色或者字体颜色的修改?
    • ¥15 有人会思科模拟器嘛?
    • ¥30 遇到一个的问题,请教各位
    • ¥20 matlab报错,vflux计算潜流通量
    • ¥15 我该如何实现鼠标按下GUI按钮时就执行按钮里面的操作的方法
    • ¥15 关于#硬件工程#的问题:我这边有个锁相环电路没有效果
    • ¥15 20款 27寸imac苹果一体机装win10后,蓝牙耳机和音响放歌曲卡顿断断续续.
    • ¥15 VB.NET 父窗体调取子窗体报错
    • ¥15 python海龟作图如何改代码使其最后画出来的是一个镜像翻转的图形