douyi02577 2014-07-03 14:02
浏览 142
已采纳

GoLang-使用ISO-8859-1字符集进行持久化

I'm developing a project where we need to persist our information in a legacy database that has ISO-8859-1 tables. So before writing something to the database I need to convert it from UTF-8 to ISO-8859-1, and every time I retrieve it from the database, I need to convert it back to UTF-8.

I was trying to use the library code.google.com/p/go-charset/ as the following for each text field that I need to persist.

import (
  "bytes"
  "code.google.com/p/go-charset/charset"
  _ "code.google.com/p/go-charset/data"
  "fmt"
  "io/ioutil"
  "strings"
)

func toISO88591(utf8 string) string {
    buf := new(bytes.Buffer)

    w, err := charset.NewWriter("latin1", buf)
    if err != nil {
        panic(err)
    }
    defer w.Close()

    fmt.Fprintf(w, utf8)
    return buf.String()
}

func fromISO88591(iso88591 string) string {
    r, err := charset.NewReader("latin1", strings.NewReader(iso88591))
    if err != nil {
        panic(err)
    }

    buf, err := ioutil.ReadAll(r)
    if err != nil {
        panic(err)
    }

    return string(buf)
}

The problem is that the data is still persisted in UTF-8 even if I use the function toISO88591. I am doing something wrong in this conversion?

My database is a MySQL, and I'm using the github.com/go-sql-driver/mysql driver with the following connection parameters:

<user>:<password>@tcp(<host>:<port>)/<database>?collation=latin1_general_ci

Best regards!

  • 写回答

1条回答 默认 最新

  • doudiaozhi6658 2014-07-03 18:04
    关注

    package charset

    import "code.google.com/p/go-charset/charset" 
    

    func NewWriter

    func NewWriter(charset string, w io.Writer) (io.WriteCloser, error)
    

    NewWriter returns a new WriteCloser writing to w. It converts writes of UTF-8 text into writes on w of text in the named character set. The Close is necessary to flush any remaining partially translated characters to the output.


    I would follow the instructions: "The Close is necessary to flush any remaining partially translated characters to the output." For example,

    package main
    
    import (
        "bytes"
        "code.google.com/p/go-charset/charset"
        _ "code.google.com/p/go-charset/data"
        "fmt"
        "io/ioutil"
        "strings"
    )
    
    func toISO88591(utf8 string) (string, error) {
        buf := new(bytes.Buffer)
        w, err := charset.NewWriter("latin1", buf)
        if err != nil {
            return "", err
        }
        fmt.Fprintf(w, utf8)
        w.Close()
        return buf.String(), nil
    }
    
    func fromISO88591(iso88591 string) (string, error) {
        r, err := charset.NewReader("latin1", strings.NewReader(iso88591))
        if err != nil {
            return "", err
        }
        buf, err := ioutil.ReadAll(r)
        if err != nil {
            return "", err
        }
        return string(buf), nil
    }
    
    func main() {
        utfi := "£5 for Peppé"
        fmt.Printf("%q
    ", utfi)
        iso, err := toISO88591(utfi)
        if err != nil {
            fmt.Println(err)
        }
        fmt.Printf("%q
    ", iso)
        utfo, err := fromISO88591(iso)
        if err != nil {
            fmt.Println(err)
        }
        fmt.Printf("%q
    ", utfo)
        fmt.Println(utfi == utfo)
    }
    

    Output:

    "£5 for Peppé"
    "\xa35 for Pepp\xe9"
    "£5 for Peppé"
    true
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 Stata链式中介效应代码修改
  • ¥15 latex投稿显示click download
  • ¥15 请问读取环境变量文件失败是什么原因?
  • ¥15 在若依框架下实现人脸识别
  • ¥15 网络科学导论,网络控制
  • ¥100 安卓tv程序连接SQLSERVER2008问题
  • ¥15 利用Sentinel-2和Landsat8做一个水库的长时序NDVI的对比,为什么Snetinel-2计算的结果最小值特别小,而Lansat8就很平均
  • ¥15 metadata提取的PDF元数据,如何转换为一个Excel
  • ¥15 关于arduino编程toCharArray()函数的使用
  • ¥100 vc++混合CEF采用CLR方式编译报错