Golang: process took too long. Implementing a spelling checker

http://play.golang.org/p/H5E0ExL85d

I've implemented a version of Peter Norvig's spell-checking algorithm in Go.

Strangely, the first three calls work correctly and give me the desired output.

But from the fourth call on, it says "process took too long."

Could anybody look at my code and tell me what is going wrong?

Here's the snippet that is probably at fault.

Everything works perfectly with the same code in the English version.

The byte layout and the slice boundaries change with the language: in UTF-8 each English letter takes 1 byte, while each character of the Asian language in this case (Korean) takes 3 bytes.
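To illustrate the byte-width point, here is a minimal sketch of how Go reports byte length versus character count, and how a string can be split at a character boundary without hard-coding 3. The helper name `firstRune` and the sample word are assumptions made for this example; they are not from the original program.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// firstRune splits s into its first character and the remainder,
// using the decoded width of that character instead of a fixed 3.
// (Hypothetical helper, for illustration only.)
func firstRune(s string) (head, tail string) {
	_, size := utf8.DecodeRuneInString(s)
	return s[:size], s[size:]
}

func main() {
	word := "한글a" // two 3-byte characters followed by one 1-byte letter

	fmt.Println(len(word))                    // length in bytes
	fmt.Println(utf8.RuneCountInString(word)) // length in characters

	head, tail := firstRune(word)
	fmt.Println(head, tail)

	// Converting to []rune gives one element per character.
	fmt.Println(len([]rune(word)))
}
```

Working on `[]rune` (or decoding widths with `unicode/utf8`) should keep the slicing correct even if the input mixes characters of different byte widths.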

This code tries to run the same algorithm as the English version, which works perfectly, but it does NOT work.

total_set := []string{}
for _, elem := range splits {

    if len(elem.str2) > 3 {
        //deletion
        total_set = append(total_set, elem.str1+elem.str2[3:])

        //replace
        for i:=0; i<len(koreanletter)/3; i++ {
            total_set = append(total_set, elem.str1+string(koreanletter[3*i:3*(i+1)])+elem.str2[3:])
        }

        //transpose
        if len(elem.str2) > 9 {
            total_set = append(total_set, elem.str1+string(elem.str2[3:6])+string(elem.str2[:3])+elem.str2[9:])
        }

    } else {
        //deletion
        total_set = append(total_set, elem.str1)
    }

    //insertion
    for _, c := range koreanletter {
        total_set = append(total_set, elem.str1+string(c)+elem.str2)
    }
}
return RemoveDuplicateStringArrayForKorean(total_set)
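For comparison, here is a rune-based sketch of the same edit generation. It is not the asker's code: the function name `edits1`, its signature, and the tiny three-letter alphabet are assumptions for illustration. It mirrors the structure of the English version, so the boundary checks are `> 0` and `> 1` characters rather than byte counts. As posted, the byte-based snippet appears to skip replacement for the last character (`len(elem.str2) > 3`) and to drop bytes 6 to 9 in the transpose branch (`elem.str2[9:]`); working on runes sidesteps both.

```go
package main

import "fmt"

// edits1 returns every distinct string one edit away from word.
// It operates on []rune, so the same loop serves any alphabet,
// whatever its UTF-8 byte width. (Illustrative name and signature.)
func edits1(word string, alphabet []rune) []string {
	w := []rune(word)
	seen := map[string]bool{}
	var out []string

	// add joins the parts into one string and records it once.
	add := func(parts ...[]rune) {
		var r []rune
		for _, p := range parts {
			r = append(r, p...)
		}
		s := string(r)
		if !seen[s] {
			seen[s] = true
			out = append(out, s)
		}
	}

	for i := 0; i <= len(w); i++ {
		left, right := w[:i], w[i:]
		if len(right) > 0 {
			add(left, right[1:]) // deletion
			for _, c := range alphabet {
				add(left, []rune{c}, right[1:]) // replacement
			}
		}
		if len(right) > 1 {
			add(left, []rune{right[1], right[0]}, right[2:]) // transposition
		}
		for _, c := range alphabet {
			add(left, []rune{c}, right) // insertion
		}
	}
	return out
}

func main() {
	alphabet := []rune("가나다") // stand-in for the question's koreanletter
	edits := edits1("나라", alphabet)
	fmt.Println(len(edits), edits)
}
```

Deduplicating while generating also avoids building the full slice first and filtering it afterwards.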

The English version is below. It works perfectly.

// Edits1 returns every string that is one edit away from word.
func (model *Model) Edits1(word string) []string {
  const alphabet = "abcdefghijklmnopqrstuvwxyz"

  splits := []Pair{}
  for i := 0; i <= len(word); i++ {
    splits = append(splits, Pair{word[:i], word[i:]})
  }

  total_set := []string{}
  for _, elem := range splits {

    if len(elem.str2) > 0 {
      //deletion
      total_set = append(total_set, elem.str1+elem.str2[1:])

      //replace
      for _, c := range alphabet {
        total_set = append(total_set, elem.str1+string(c)+elem.str2[1:])
      }

      //transpose
      if len(elem.str2) > 1 {
        total_set = append(total_set, elem.str1+string(elem.str2[1])+string(elem.str2[0])+elem.str2[2:])
      }

    } else {
      //deletion
      total_set = append(total_set, elem.str1)
    }

    //insertion
    for _, c := range alphabet {
      total_set = append(total_set, elem.str1+string(c)+elem.str2)
    }
  }
  return RemoveDuplicateStringArrayLowerCase(total_set)
}

Update: I reordered the arguments, and now three of the calls work.

No characters are missing from koreanletter.

Is there any way I can see the error in more detail? I just can't figure it out.

1 answer

doudandui1592 2013-11-06 11:46
Playing around with your code, it seems that your KoreanKnownEdits2 is what takes too long. In your fourth example (the one that fails), the length of model.KoreanEdits1(input_word) is 28197 and the length of the first model.KoreanEdits1(elem1) is 23499, which makes around 662 million cases to try. The program seems to fail after the first 147 thousand, because it takes too long for the playground.
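The arithmetic behind that estimate can be sketched as follows. The function `edits1Count` is a hypothetical helper that applies the counting from Norvig's write-up (before deduplication) to an alphabet of `k` letters; the alphabet size of 2000 is a made-up figure, since the size of koreanletter is not shown in the question.

```go
package main

import "fmt"

// edits1Count is the number of candidates, before deduplication, that a
// Norvig-style edits1 generates for a word of n characters over an
// alphabet of k letters: n deletions, n-1 transpositions,
// k*n replacements and k*(n+1) insertions.
// (Hypothetical helper, for illustration only.)
func edits1Count(n, k int) int {
	transposes := n - 1
	if transposes < 0 {
		transposes = 0
	}
	return n + transposes + k*n + k*(n+1)
}

func main() {
	// English: 54n + 25 candidates for a word of n letters.
	fmt.Println(edits1Count(5, 26)) // 295

	// A made-up alphabet of 2000 characters, for scale only.
	fmt.Println(edits1Count(3, 2000))

	// The two list lengths reported above, multiplied together.
	fmt.Println(28197 * 23499) // 662601303
}
```

Because a second round of edits multiplies two such counts, the work grows roughly with the square of the alphabet size.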

Every example that did not require a call to KoreanKnownEdits2 seems to work, so I suspect you should rewrite this function to avoid the exhaustive search, or at least limit it to a more reasonable size, if you want to use it within the playground's time limit. I haven't studied your code in enough detail to be 100% certain, but I suspect the 26 letters of the Western alphabet keep the English version manageable, while the much larger Korean alphabet makes the input too large to process within the playground's time limit, regardless of how many bytes each character is encoded with.
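One possible way to avoid the exhaustive search, sketched here under assumptions, is to scan the dictionary and keep the words within two edits of the input, instead of generating every two-edit candidate. This is a different technique from the generate-and-filter approach in the question. The names `osaDistance` and `knownWithin` and the `map[string]int` dictionary are hypothetical. The distance used is optimal string alignment, which can differ from applying edits1 twice in rare cases.

```go
package main

import "fmt"

// osaDistance returns the optimal string alignment distance between a and
// b, counting insertion, deletion, replacement and adjacent transposition
// as one edit each. It compares runes, so byte width does not matter.
func osaDistance(a, b string) int {
	s, t := []rune(a), []rune(b)
	d := make([][]int, len(s)+1)
	for i := range d {
		d[i] = make([]int, len(t)+1)
		d[i][0] = i
	}
	for j := range d[0] {
		d[0][j] = j
	}
	for i := 1; i <= len(s); i++ {
		for j := 1; j <= len(t); j++ {
			cost := 1
			if s[i-1] == t[j-1] {
				cost = 0
			}
			best := d[i-1][j] + 1 // deletion
			if v := d[i][j-1] + 1; v < best { // insertion
				best = v
			}
			if v := d[i-1][j-1] + cost; v < best { // replacement or match
				best = v
			}
			if i > 1 && j > 1 && s[i-1] == t[j-2] && s[i-2] == t[j-1] {
				if v := d[i-2][j-2] + 1; v < best { // transposition
					best = v
				}
			}
			d[i][j] = best
		}
	}
	return d[len(s)][len(t)]
}

// knownWithin returns the dictionary words at most maxDist edits from
// word. dict is assumed to map each known word to its training count.
func knownWithin(word string, dict map[string]int, maxDist int) []string {
	var out []string
	for w := range dict {
		if osaDistance(word, w) <= maxDist {
			out = append(out, w)
		}
	}
	return out
}

func main() {
	dict := map[string]int{"사랑": 10, "사람": 7, "바다가": 3}
	fmt.Println(knownWithin("사랑", dict, 2))
}
```

The cost of this scan grows with the dictionary size and the word lengths rather than with the square of the alphabet size, which may suit a large alphabet better.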

This answer was selected by the asker as the best answer.
