Ensuring goroutine cleanup, best practices

I have a fundamental gap in my understanding of how to make sure that spawned goroutines are shut down properly in long-running processes. I have watched talks on the topic and read about best practices. To follow my question, please refer to the video "Advanced Go Concurrency Patterns" here.

For the following, if you run the code on your machine, please export the environment variable GOTRACEBACK=all so that you can see the goroutine states after the panic.

I put the code for the original example here: naive (it does not run on the Go Playground, I guess because a time statement is used; please copy the code and run it locally).

The panic output of the naive implementation is:

panic: show me the stacks

goroutine 1 [running]:
panic(0x48a680, 0xc4201d8480)
    /usr/lib/go/src/runtime/panic.go:500 +0x1a1
main.main()
    /home/flx/workspace/go/go-rps/playground/ball-naive.go:18 +0x16b

goroutine 5 [chan receive]:
main.player(0x4a4ec4, 0x2, 0xc42006a060)
    /home/flx/workspace/go/go-rps/playground/ball-naive.go:23 +0x61
created by main.main
    /home/flx/workspace/go/go-rps/playground/ball-naive.go:13 +0x76

goroutine 6 [chan receive]:
main.player(0x4a4ec6, 0x2, 0xc42006a060)
    /home/flx/workspace/go/go-rps/playground/ball-naive.go:23 +0x61
created by main.main
    /home/flx/workspace/go/go-rps/playground/ball-naive.go:14 +0xad
exit status 2

That demonstrates the underlying problem: goroutines are left dangling on the system (here both players are still blocked in a channel receive), which is especially bad for long-running processes.

So, for my own understanding, I tried two slightly more sophisticated variants, which can be found here:

for-select with default

generator pattern with quit channel

(again, not executable on the playground, because "process takes too long")

The first solution is unsuitable for various reasons; it even makes the executed steps non-deterministic, depending on how fast the goroutines run.

Now I thought (and here, finally, comes the question) that the second solution with the quit channel would be appropriate for removing every trace of execution from the system before exiting. However, "sometimes" the program exits too fast, and the panic reports an additional goroutine in the runnable state that is still present on the system. The panic output:

panic: show me the stacks

goroutine 1 [running]:
panic(0x48d8e0, 0xc4201e27c0)
    /usr/lib/go/src/runtime/panic.go:500 +0x1a1
main.main()
    /home/flx/workspace/go/go-rps/playground/ball-perfect.go:20 +0x1a9

goroutine 20 [runnable]:
main.player.func1(0xc420070060, 0x4a8986, 0x2, 0xc420070120)
    /home/flx/workspace/go/go-rps/playground/ball-perfect.go:27 +0x211
created by main.player
    /home/flx/workspace/go/go-rps/playground/ball-perfect.go:36 +0x7f
exit status 2

My question is: that should not happen, right? I do use a quit channel to clean up state before moving on to the panic.

I made a final attempt at implementing safe cleanup behavior here: artificial wait time for runnables to close

However, that solution does not feel right, and it may not be applicable to large numbers of goroutines either?

What would be the recommended and most idiomatic pattern to ensure correct cleanup?

Thanks for your time

dongshi2836 2016-08-25 07:17
You are fooled by the output: your "generator pattern with quit channel" works perfectly fine, and the two goroutines actually are terminated properly.

You see them in the trace because you panic too early. Remember: you have two goroutines running concurrently with main. main "stops" these goroutines by signaling on the quit channel. After the two sends on lines 18 and 19, the two receives on line 32 have happened. And nothing more! You still have three goroutines running: main is between lines 19 and 20, and the player goroutines are between lines 32 and 33. If the panic in main now happens before the return in player, then the player goroutines are still there and are shown in the panic stack trace. These goroutines would have ended a few milliseconds later if only the scheduler had had time to execute the return on line 33 (which it did not, because you killed the program by panicking).

This is an instance of the "main ends too early to see concurrent goroutines do work" problem, which is asked about once a month here. You do see the concurrent goroutines doing work, but not all of it. You might try sleeping 2 milliseconds before the panic; your player goroutines will then have time to execute the return and everything is fine.
