dongshuo6503 2014-01-02 22:27
浏览 33
已采纳

排序字符串时忽略字符重音

I'm writing a golang program, which takes a list of strings and sorts them into bucket lists by the first character of string. However, I want it to group accented characters with the unaccented character that it most resembles. So, if I have a bucket for the letter A, then I want strings that start with Á to be included.

Does Go have anything built-in for determining this, or is my best bet to just have a large switch statement with all characters and their accented variations?

  • 写回答

1条回答 默认 最新

  • doudi8525 2014-01-02 22:38
    关注

    Looks like there are some addon packages for this. Here's an example...

    package main
    
    import (
       "fmt"
       "code.google.com/p/go.text/collate"
       "code.google.com/p/go.text/language"
    )
    
    func main() {
       strs := []string{"abc", "áab", "aaa"}
       cl := collate.New(language.En)
       cl.SetOptions(collate.Loose)
       cl.SortStrings(strs)
       fmt.Println(strs) 
    }
    

    outputs:

    [aaa áab abc]
    

    Also, check out the following reference on text normalization: http://blog.golang.org/normalization

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 ogg dd trandata 报错
  • ¥15 高缺失率数据如何选择填充方式
  • ¥50 potsgresql15备份问题
  • ¥15 Mac系统vs code使用phpstudy如何配置debug来调试php
  • ¥15 目前主流的音乐软件,像网易云音乐,QQ音乐他们的前端和后台部分是用的什么技术实现的?求解!
  • ¥60 pb数据库修改与连接
  • ¥15 spss统计中二分类变量和有序变量的相关性分析可以用kendall相关分析吗?
  • ¥15 拟通过pc下指令到安卓系统,如果追求响应速度,尽可能无延迟,是不是用安卓模拟器会优于实体的安卓手机?如果是,可以快多少毫秒?
  • ¥20 神经网络Sequential name=sequential, built=False
  • ¥16 Qphython 用xlrd读取excel报错