dtmm0148603 2017-08-04 07:37

Accepted

Close multiple goroutines if an error occurs in one of them

Consider this function:

func doAllWork() error {
    var wg sync.WaitGroup
    wg.Add(2) // must match the number of goroutines started below
    for i := 0; i < 2; i++ {
        go func() {
            defer wg.Done()
            for j := 0; j < 10; j++ {
                _, err := work(j) // the result is not used in this snippet
                if err != nil {
                    // can't use `return err` here
                    // what should I put instead?
                    os.Exit(0)
                }
            }
        }()
    }
    wg.Wait()
    return nil
}

In each goroutine, the function work() is called 10 times. If any call to work() returns an error in any of the running goroutines, I want all the goroutines to stop immediately and the program to exit. Is it OK to use os.Exit() here? How should I handle this?

Edit: this question is different from "how to stop a goroutine", because here I need to stop all goroutines if an error occurs in one of them.


1 answer

dongluolie3487 2017-08-04 09:04


You may use the context package, which was created for things like this ("carries deadlines, cancelation signals...").

Create a context that can publish cancelation signals with context.WithCancel() (the parent context may be the one returned by context.Background()). This returns a cancel() function, which you can use to cancel the worker goroutines (more precisely, to signal the intent to cancel to them).
In the worker goroutines, check whether cancelation has been requested by checking whether the channel returned by Context.Done() is closed. The easiest way is to attempt a receive from it, which proceeds immediately if the channel is closed. To make the check non-blocking (so the worker continues when the channel is still open), use a select statement with a default branch.

I will use the following work() implementation, which simulates a 10% failure chance, and simulates 1 second of work:

func work(i int) (int, error) {
    if rand.Intn(100) < 10 { // 10% chance of failure
        return 0, errors.New("random error")
    }
    time.Sleep(time.Second)
    return 100 + i, nil
}

doAllWork() may then look like this:

func doAllWork() error {
    var wg sync.WaitGroup

    ctx, cancel := context.WithCancel(context.Background())
    defer cancel() // Make sure it's called to release resources even if no errors

    for i := 0; i < 2; i++ {
        wg.Add(1)
        go func(i int) {
            defer wg.Done()

            for j := 0; j < 10; j++ {
                // Check if an error occurred in any other goroutine:
                select {
                case <-ctx.Done():
                    return // Error somewhere, terminate
                default: // The default branch is required to avoid blocking
                }
                result, err := work(j)
                if err != nil {
                    fmt.Printf("Worker #%d during %d, error: %v\n", i, j, err)
                    cancel()
                    return
                }
                fmt.Printf("Worker #%d finished %d, result: %d.\n", i, j, result)
            }
        }(i)
    }
    wg.Wait()

    return ctx.Err()
}

This is how it can be tested:

func main() {
    rand.Seed(time.Now().UnixNano() + 1) // +1 'cause Playground's time is fixed
    fmt.Printf("doAllWork: %v\n", doAllWork())
}

Output (try it on the Go Playground):

Worker #0 finished 0, result: 100.
Worker #1 finished 0, result: 100.
Worker #1 finished 1, result: 101.
Worker #0 finished 1, result: 101.
Worker #0 finished 2, result: 102.
Worker #1 finished 2, result: 102.
Worker #1 finished 3, result: 103.
Worker #1 during 4, error: random error
Worker #0 finished 3, result: 103.
doAllWork: context canceled

If there are no errors, e.g. when using the following work() function:

func work(i int) (int, error) {
    time.Sleep(time.Second)
    return 100 + i, nil
}

The output would look like this (try it on the Go Playground):

Worker #0 finished 0, result: 100.
Worker #1 finished 0, result: 100.
Worker #1 finished 1, result: 101.
Worker #0 finished 1, result: 101.
Worker #0 finished 2, result: 102.
Worker #1 finished 2, result: 102.
Worker #1 finished 3, result: 103.
Worker #0 finished 3, result: 103.
Worker #0 finished 4, result: 104.
Worker #1 finished 4, result: 104.
Worker #1 finished 5, result: 105.
Worker #0 finished 5, result: 105.
Worker #0 finished 6, result: 106.
Worker #1 finished 6, result: 106.
Worker #1 finished 7, result: 107.
Worker #0 finished 7, result: 107.
Worker #0 finished 8, result: 108.
Worker #1 finished 8, result: 108.
Worker #1 finished 9, result: 109.
Worker #0 finished 9, result: 109.
doAllWork: <nil>

Notes:

It may seem that, since we basically only used the Done() channel of the context, we could just as easily (or even more easily) use a plain done channel instead of the Context, closing that channel to do what cancel() does in the solution above.

This is not true. That approach only works if exactly one goroutine may close the channel, but in our case any of the workers may do so. Attempting to close an already closed channel panics (see details here: How does a non initialized channel behave?). You would therefore need some kind of synchronization or mutual exclusion around close(done), which makes the code less readable and more complex. This is exactly what the cancel() function does under the hood, hidden from view, so cancel() may be called multiple times, which keeps the code that uses it simple.
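To illustrate the extra synchronization a plain done channel would need, here is a minimal sketch that guards close() with sync.Once. The doneSignal type and its methods are hypothetical names chosen for this example. It shows one possible way to do it and is not how the context package is actually implemented.

```go
package main

import (
	"fmt"
	"sync"
)

// doneSignal is a hypothetical wrapper around a done channel that any
// number of goroutines may "close" without risking a panic.
type doneSignal struct {
	once sync.Once
	ch   chan struct{}
}

func newDoneSignal() *doneSignal {
	return &doneSignal{ch: make(chan struct{})}
}

// Close closes the channel on the first call; later calls do nothing.
func (d *doneSignal) Close() {
	d.once.Do(func() { close(d.ch) })
}

// Done returns the channel that is closed once Close has been called.
func (d *doneSignal) Done() <-chan struct{} {
	return d.ch
}

func main() {
	done := newDoneSignal()

	var wg sync.WaitGroup
	for i := 0; i < 3; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			done.Close() // safe even though three goroutines call it
		}()
	}
	wg.Wait()

	<-done.Done() // proceeds immediately because the channel is closed
	fmt.Println("closed exactly once")
}
```

This works, but it is more code to write and maintain than simply calling cancel(), which is the point made above.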

How to get and return the error(s) from the workers?

For this you may use an error channel:

errs := make(chan error, 2) // Buffer for 2 errors

And inside the workers when an error is encountered, send it on the channel instead of printing it:

result, err := work(j)
if err != nil {
    errs <- fmt.Errorf("Worker #%d during %d, error: %v\n", i, j, err)
    cancel()
    return
}

And after wg.Wait() returns, if there was an error, return it (and nil otherwise):

// Return (first) error, if any:
if ctx.Err() != nil {
    return <-errs
}
return nil

Output this time (try this on the Go Playground):

Worker #0 finished 0, result: 100.
Worker #1 finished 0, result: 100.
Worker #1 finished 1, result: 101.
Worker #0 finished 1, result: 101.
Worker #0 finished 2, result: 102.
Worker #1 finished 2, result: 102.
Worker #1 finished 3, result: 103.
Worker #0 finished 3, result: 103.
doAllWork: Worker #1 during 4, error: random error

Note that I used a buffered channel with a buffer size equal to the number of workers, which ensures that sending on it never blocks. This also lets you receive and process all errors, not just one (e.g. the first). Another option is to use a buffered channel that holds only 1 error and do a non-blocking send on it, which could look like this:

errs := make(chan error, 1) // Buffered only for the first error

// ...and inside the worker:

result, err := work(j)
if err != nil {
    // Non-blocking send:
    select {
    case errs <- fmt.Errorf("Worker #%d during %d, error: %v\n", i, j, err):
    default:
    }
    cancel()
    return
}

This answer was selected by the asker as the best answer.
