I'd like to distribute some load across goroutines. If the number of tasks is known beforehand then it is easy to organize. For example, I could fan out with a wait group:
nTasks := 100
nGoroutines := 10
// it is important that this channel is not buffered
ch := make(chan *Task)
done := make(chan bool)
var w sync.WaitGroup
// Feed the channel until done
go func() {
    for i := 0; i < nTasks; i++ {
        task := getTaskI(i)
        ch <- task
    }
    // as ch is unbuffered, once every send has completed we know
    // every task has been handed to a worker
    for i := 0; i < nGoroutines; i++ {
        done <- false
    }
}()
for i := 0; i < nGoroutines; i++ {
    w.Add(1)
    go func() {
        defer w.Done()
        for {
            select {
            case task := <-ch:
                doSomethingWithTask(task)
            case <-done:
                return
            }
        }
    }()
}
w.Wait()
// All tasks done, all goroutines closed
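For reference, here is a complete, runnable version of that pattern. Task, getTaskI, and doSomethingWithTask are stand-ins I made up for this sketch, since their real definitions aren't shown; the workers count processed tasks so the sketch can check itself:

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

// Task, getTaskI and doSomethingWithTask are assumed stand-ins.
type Task struct{ ID int }

func getTaskI(i int) *Task        { return &Task{ID: i} }
func doSomethingWithTask(t *Task) {}

// fanOut runs nTasks tasks on nWorkers goroutines and
// returns how many tasks were processed.
func fanOut(nTasks, nWorkers int) int64 {
	ch := make(chan *Task) // unbuffered on purpose
	done := make(chan bool)
	var w sync.WaitGroup
	var processed int64

	// Feed the channel, then tell every worker to stop.
	go func() {
		for i := 0; i < nTasks; i++ {
			ch <- getTaskI(i)
		}
		// ch is unbuffered, so at this point every task
		// has been received by some worker.
		for i := 0; i < nWorkers; i++ {
			done <- false
		}
	}()

	for i := 0; i < nWorkers; i++ {
		w.Add(1)
		go func() {
			defer w.Done()
			for {
				select {
				case task := <-ch:
					doSomethingWithTask(task)
					atomic.AddInt64(&processed, 1)
				case <-done:
					return
				}
			}
		}()
	}
	w.Wait()
	return processed
}

func main() {
	fmt.Println(fanOut(100, 10)) // prints 100
}
```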
However, in my case each task can return more tasks to be done. Take a crawler, for example: every page we crawl yields new links that also need crawling. My initial hunch was to have a main loop that tracks the number of pending tasks; once it reaches zero, it sends a finish signal to all goroutines:
nGoroutines := 10
ch := make(chan *Task, nGoroutines)
feedBackChannel := make(chan *Task, nGoroutines)
done := make(chan bool)
for i := 0; i < nGoroutines; i++ {
    go func() {
        for {
            select {
            case task := <-ch:
                task.NextTasks = doSomethingWithTask(task)
                feedBackChannel <- task
            case <-done:
                return
            }
        }
    }()
}
// seed the first task
ch <- firstTask
nTasksRemaining := 1
for nTasksRemaining > 0 {
    task := <-feedBackChannel
    nTasksRemaining--
    for _, t := range task.NextTasks {
        ch <- t
        nTasksRemaining++
    }
}
for i := 0; i < nGoroutines; i++ {
    done <- false
}
However, this deadlocks. For example, if a task returns more NextTasks than the channel buffer can hold, the main loop blocks on ch <- t. But those earlier tasks can never finish: the workers are blocked on feedBackChannel <- task, and the main loop is no longer reading from feedBackChannel because it is stuck writing.
One "easy" way out of this is to post to the feedback channel asynchronously: instead of feedBackChannel <- task, do go func() { feedBackChannel <- task }(). But this feels like an awful hack, especially since there might be hundreds of thousands of tasks, each holding a goroutine alive just to deliver its result.
What would be a nice way to avoid this deadlock? I've searched for concurrency patterns, but most of what I've found covers simpler cases like fan-out or pipelines, where later stages don't feed back into earlier ones.
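One direction I've been considering, which seems less hacky than spawning a goroutine per result: keep the undistributed tasks in a plain slice inside the main loop, and use a select whose send case is only enabled (via the nil-channel trick) when the slice is non-empty. That way the loop can always keep draining feedBackChannel, so workers never block. A sketch, with Task as a made-up stand-in and workers that just echo tasks back instead of doing real work:

```go
package main

import "fmt"

// Task is an assumed stand-in for the question's *Task type.
type Task struct {
	NextTasks []*Task
}

// runAll dispatches tasks to nWorkers goroutines; each finished task
// may yield new tasks via NextTasks. It returns the number of tasks
// processed, and never blocks on a full channel because undistributed
// tasks wait in a local slice.
func runAll(first *Task, nWorkers int) int {
	ch := make(chan *Task)
	feedBack := make(chan *Task)
	done := make(chan struct{})

	for i := 0; i < nWorkers; i++ {
		go func() {
			for {
				select {
				case task := <-ch:
					// real work (doSomethingWithTask) would run here
					feedBack <- task
				case <-done:
					return
				}
			}
		}()
	}

	queue := []*Task{first} // tasks not yet handed to a worker
	pending := 0            // tasks handed out, result not yet received
	processed := 0
	for pending > 0 || len(queue) > 0 {
		// A send on a nil channel blocks forever, so leaving sendCh
		// nil disables the send case while the queue is empty.
		var sendCh chan *Task
		var next *Task
		if len(queue) > 0 {
			sendCh, next = ch, queue[0]
		}
		select {
		case sendCh <- next:
			queue = queue[1:]
			pending++
		case t := <-feedBack:
			pending--
			processed++
			queue = append(queue, t.NextTasks...)
		}
	}
	close(done) // closing reaches every worker, unlike N sends
	return processed
}

func main() {
	leafs := []*Task{{}, {}, {}}
	root := &Task{NextTasks: leafs}
	fmt.Println(runAll(root, 10)) // prints 4: the root plus 3 children
}
```

The key point is that the main loop is always willing to receive from feedBack, so a worker can never be stuck delivering a result; the slice absorbs any burst of NextTasks instead of the channel buffer. I'm not sure this is the canonical pattern, though.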