从N个元素的切片生成K个元素的算法

I'm trying to port an algorithm from this Stackoverflow question in Go. The algorithm I'm trying to get working is as follows: given a slice of strings of an arbitrary length, and a "depth", find all the combinations of the elements in the original slice that are of length depth. For example, if given a slice containing A, B, C, D, E, and F, and a depth of 3, the result should be:

[A, B, C]
[A, B, D]
[A, B, E]
[A, B, F]
[A, C, D]
[A, C, E]
[A, C, F]
[A, D, E]
[A, D, F]
[A, E, F]
[B, C, D]
[B, C, E]
[B, C, F]
[B, D, E]
[B, D, F]
[B, E, F]
[C, D, E]
[C, D, F]
[C, E, F]
[D, E, F]

I've tried to implement a few of the proposed solutions in the above post in Go, but unfortunately my Go skills aren't quite up to snuff yet. I've only started programming in Go a few weeks ago.

Here is the broken code that was a failed attempt to port this implementation in Java:

package main

import (
    "fmt"
)

func main() {
    combos := []string{"A","B","C","D","E","F"}
    combos = GetCombos(combos, 3)

    fmt.Println(combos)
}

func GetCombos(set []string, depth int) []string {
    var results []string
    element := make([]string, depth)
    return GetEnvCombos2(set, depth, 0, element, results)
}

func GetCombos2(set []string, depth int, start int, element, results []string) []string {
    if depth == 0 {
        var guess string
        for _, e := range element {
            guess += e
        }
        results = append(results, guess)
        return results
    }
    for i := start; i <= len(set) - depth; i++ {
        element[len(element) - depth] = set[i]
        results = append(results, GetEnvCombos2(set, depth - 1, i + 1, element, results)...)
    }

    return nil
}

I don't know if that implementation in Java is the most efficient way to do it, but it seemed fairly efficient and (I thought) relatively easy to port to Go. If there's a totally different, yet more efficient way to do this, I'd gladly accept that.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongtu4028 2019-01-05 18:56
关注
Note:

The correct answer to any combinatoric problem is practically never to put all possibly combinations into a container and process them afterwards. There are typically an enormous number of combinations and the temporary container tends to use up all available memory for items which are only going to be referenced once. The original Java program buries the processing step (in this case, "print the combination") deep inside the generation function, which is also practically never a good solution because it requires creating an entire new generator function for every different action.

One way to structure combinatorial generation is to use a function which finds the next combination, given the previous one. Such a function is typically called an "iterator". If the last combination is provided, the function returns a return value indicating that there are no more combinations available. Often, the provided combination is modified "in-place", so that the return value is just a boolean indicating whether the combination was the last one or not. (It's usually considered best practice to reset the supplied combination to the first combination as well as reporting that it was previously the last combination.) That strategy doesn't work well with recursive algorithms such as the one you are porting.

Many languages include some facility which allows recursive generation of possible values. For example, in Go you can write an iterator as a "go routine". That can produce really elegant code, although there is an underlying cost.

It is always possible to reimplement a recursive function as an iterative one, by simulating the call stack with some kind of stack datastructure; however, the result is harder to understand and often slower (because native recursion is almost always faster than simulated recursion). And you might be able to find a non-recursive algorithm for iterating (possibly changing the iteration order).

I'm not going to do any of those things, here, though. The following simply satisfies the same prototype as the original code, returning a (possibly enormous) slice of results, because the underlying issue is simply a question of design of recursive functions.

The prototype of the internal recursive generator is

func GetCombos2(set []string, depth int, start int, element []string, results []string) []string

(I added the type of element, for clarity.) It's useful to try to articulate what this function does, exactly, which might go something like this:

Given a list of items set, a partial combination element which still requires depth items to be appended to it, and a list of combinations results, return results appended with the possible combinations starting with the prefix indicated by element whose continuations only contain items whose index is greater than equal to start. Combinations are generated in monotonically increasing index order, and it is required that all items in the prefix have indices less than start.

That's a bit of a mouthful, and I'm not sure that reading it is immediately clearer than the code. But it is possibly helpful as a way of understanding what is going on. Here, I'm just going to focus on one small part:

Given… results, … return results appended with… [the new combinations computed with these arguments]

That's not the only possible way of writing this recursion. Another way would be to not require results as an argument, and to simply return the list of combinations generated according to the other arguments. That would produce slightly simpler code but it could be quite a bit slower because of the number of slices of partial results generated and immediately discarded. The use of "accumulator" arguments like results is a common technique for making recursion more efficient.

What's important about this discussion is understanding what the return value of the recursive function is. If you use the "accumulator" strategy (with the results argument) then the return value is the entire list of results found up to this point, and you only append to it if you are adding a new result. If you use the non-accumulator strategy, then when you find a new result you return it immediately, leaving it to the caller to concatenate the various lists it receives from multiple calls.

So the two strategies would look like this:

Accumulator version:

func GetCombos2(set []string, depth int, start int, element []string, results []string) []string { if depth == 0 { results = append(results, strings.Join(element, "")) } else { for i := start; i <= len(set) - depth; i++ { element[len(element) - depth] = set[i] results = GetEnvCombos2(set, depth - 1, i + 1, element, results) } } return results }

Non-accumulator version:

func GetCombos2(set []string, depth int, start int, element []string) []string { if depth == 0 { return []string { strings.Join(element, "") } } else { var results []string for i := start; i <= len(set) - depth; i++ { element[len(element) - depth] = set[i] results = append(results, GetCombos2(set, depth - 1, i + 1, element)...) } return results } }

EDIT: After writing that, I realised that the use of the string array elements is really a Java-ism which doesn't translate well to Go. (Or perhaps it's a C-ism badly translated to Java.) Anyway, the functions are slightly faster and quite a bit easier to read if we just pass a string representing the prefix so that we don't need to do the Join. (Go strings are immutable, so there's no need to copy them before putting them into the results slice.)

That reduces the code to the following:

Accumulator version (recommended, but an iterator would be even better):

func GetCombos(set []string, depth int) []string { return GetCombosHelper(set, depth, 0, "", []string{}) } func GetCombosHelper(set []string, depth int, start int, prefix string, accum []string) []string { if depth == 0 { return append(accum, prefix) } else { for i := start; i <= len(set) - depth; i++ { accum = GetCombosHelper(set, depth - 1, i + 1, prefix + set[i], accum) } return accum } }

Non-accumulator version:

func GetCombos(set []string, depth int) []string { return GetCombosHelper(set, depth, 0, "") } func GetCombosHelper(set []string, depth int, start int, prefix string) []string { if depth == 0 { return []string{prefix} } else { accum := []string{} for i := start; i <= len(set) - depth; i++ { accum = append(accum, GetCombosHelper(set, depth - 1, i + 1, prefix + set[i])...) } return accum } }

On my laptop, given a set of 62 elements (upper and lower case letters plus digits) with depth 6, the non-accumulator version took 29.7 seconds (elapsed) and the accumulator version took 13.4 seconds. Both used about 4.5 gigabytes of memory, which seemed a bit high to me since there are "only" 61,474,519 six-character combinations, and the memory consumption works out to almost 80 bytes peak memory usage per combination.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

从N个元素的切片生成K个元素的算法
2019-01-05 16:48

回答 1 已采纳 Note: The correct answer to any combinatoric problem is practically never to put all possibly com
在切片中仅保留n个元素
2017-10-23 14:43

回答 3 已采纳 For example, addimport.go: package main import ( "fmt" "time" ) type Statistics struct
用切片打印列表中间三个元素 python
2023-01-29 14:38

回答 2 已采纳如果列表是偶数和，怎么取到中间3个？假设列表是奇数个 lst=…#这里是列表内容 s=len(lst)＋1)//2 print(lst[s-2:s+1]) 本人初学者，不确定是否正确，且对列表已经很
python生成列表、给定一个n、删除列表中n的倍数_python每日经典算法题5(基础题)+1(中难题)...
2020-12-18 06:02

weixin_39974811的博客现在，越来越多的公司面试以及考验面试对算法要求都提高了一个层次，从现在，我讲每日抽出时间进行5+1算法题讲解，5是指基础题，1是指1道中等偏难。希望能够让大家熟练掌握python的语法结构已经一些高级函数的应用。...
比较两个缺少元素的切片
2019-08-11 02:34

回答 2 已采纳 Comparing two slices for missing element I have two slices: a[]string, b[]string. b contai
golang按第一个元素对切片进行切片
2019-03-26 14:51

回答 1 已采纳 The question does not state what should be done with empty slices, so treating them like an empty
如何从切片中删除最后一个元素？
2014-10-03 01:49

回答 2 已采纳 You can use len() to find the length and re-slice using the index before the last element: if len
【STL切片算法文献笔记】一种使用 STL 文件进行高效轮廓构造的改进切片算法
2022-04-22 15:15

会走路的胖虎的博客文章目录一、前言二、研究导览三、方案概述四.切片过程4.1 数据结构4.2 全局切片算法4.3 前向边和后向边判断... 在 STL 文件创建过程中生成的大量三角形可能会影响对模型进行切片以及随后为每个切片创建轮廓所需的时间
python删除切片元素 python
2022-10-20 19:32

回答 3 已采纳 words = ['hello', 'good', '', '', 'yes', '', 'ok', ''] n = eval(input("请输入数字：")) # ---- begin ----
删除切片中的每个元素
2015-03-26 21:27

回答 3 已采纳 The "what" is because append(slice, elems...) is actually updating the slice with the new elements
如何获得切片的最后一个元素？
2014-03-20 14:19

回答 2 已采纳 For just reading the last element of a slice: sl[len(sl)-1] For removing it: sl = sl[:len(sl)-
k-means聚类算法的原理
2023-05-05 01:13

小白脸cty的博客 K-means是一种聚类算法，其原理是将数据集划分为k个簇，使得每个数据点都属于最近的簇，并且簇的中心是所有数据点的平均值。这个算法是基于迭代优化的，每个迭代步骤会更新簇的中心点，直到达到收敛条件。下面是K-...
切片的最后一个元素 golang
2014-03-20 14:19

回答 2 已采纳 For just reading the last element of a slice: sl[len(sl)-1] For removing it: sl = sl[:len(sl)-
机器学习实战教程（一）：K-近邻算法
2022-12-11 17:18

MqtGhj的博客 k近邻法(k-nearest neighbor, k-NN)是1967年由Cover T和Hart P提出的一种基本分类与回归方法。...一般来说，我们只选择样本数据集中前k个最相似的数据，这就是k-近邻算法中k的出处，通常k是不大于20的整数。最后，
python输入一个整数列表列表元素为18_GitHub - lightxiang/python_interview_question at ce043d32dc109265dc916187dcc6...
2020-12-10 05:12

weixin_40004502的博客 Python基础1、文件操作1.1、有一个jsonline格式的文件file.txt大小约为...3、数据类型3.1、现有字典 d={‘a’:24，‘g’:52，‘i’:12，‘k’:33}请按value值进行3.2、字典推导式？3.3、请反转字符串“aStr”?3.4、...
没有解决我的问题, 去提问

悬赏问题

¥50 求解vmware的网络模式问题别拿AI回答
¥24 EFS加密后，在同一台电脑解密出错，证书界面找不到对应指纹的证书，未备份证书，求在原电脑解密的方法，可行即采纳
¥15 springboot 3.0 实现Security 6.x版本集成
¥15 PHP-8.1 镜像无法用dockerfile里的CMD命令启动只能进入容器启动，如何解决？(操作系统-ubuntu)
¥30 请帮我解决一下下面六个代码
¥15 关于资源监视工具的e-care有知道的嘛
¥35 MIMO天线稀疏阵列排布问题
¥60 用visual studio编写程序，利用间接平差求解水准网
¥15 Llama如何调用shell或者Python
¥20 谁能帮我挨个解读这个php语言编的代码什么意思？

从N个元素的切片生成K个元素的算法

1条回答 默认 最新

悬赏问题

1条回答默认最新