Deadlock on range over channel in a pipeline with multiple WaitGroups

I'm practicing a challenge that calculates a factorial by splitting the calculation into 100 groups that run concurrently. I have solved a lot of issues with WaitGroups, but in the calculateFactorial function I still get a deadlock at the range-over-channel part. I hope someone can point out the issue here, thank you.

package main

import (
    "fmt"
    "sync"
)

func main() {
    var wg sync.WaitGroup
    wg.Add(2)
    in := make (chan int)
    out := make (chan float64)



    out = calculateFactorial(genConcurrentGroup(in, &wg), &wg)

    go func() {
        in <- 10
        close(in)
    }()

    fmt.Println(<-out)

    wg.Wait()


}

//split input number into groups
//the result should be a map of [start number, number in group]
//this is not heavy task so run in one go routine
func genConcurrentGroup(c chan int, wg *sync.WaitGroup) chan map[int]int{
    out := make(chan map[int]int)

    go func() {
        //100 groups
        total:= <- c
        wg.Done()
        //element number in group
        elemNumber := total / 100
        extra := total % 100
        result := make(map[int]int)
        if elemNumber>0{
            //certain 100 groups
            for i:=1 ;i<=99;i++{
                result[(i-1) * elemNumber + 1] = elemNumber
            }
            //the last group starts right after the first 99 groups and takes the remainder
            result[99 * elemNumber + 1] = extra + elemNumber
        }else{
            //less than 100
            for i:=1;i<=total;i++{
                result[i] = 1
            }
        }

        out <- result
        close(out)
    }()
    return out
}

//takes in all numbers to calculate multiply result
//this could be heavy so can do it 100 groups together
func calculateFactorial(nums chan map[int]int, wg *sync.WaitGroup) chan float64{
    out := make(chan float64)


    go func() {
        total:= <- nums
        wg.Done()
        fmt.Println(total)

        oneResult := make(chan float64)

        var wg2 sync.WaitGroup
        wg2.Add(len(total))

        for k,v := range total{
            fmt.Printf("%d %d\n",k,v)
            go func(k int, v int) {
                t := 1.0
                for i:=0;i<v;i++{
                    t = t * (float64(k) + float64(i))
                }
                fmt.Println(t)
                oneResult <- t
                wg2.Done()
            }(k,v)
        }

        wg2.Wait()
        close(oneResult)

        result := 1.0
        for n := range oneResult{  //DEADLOCK HERE! Why?
            result *= n
        }


        fmt.Printf("Result: %f\n",result)

        out <- result

    }()
    return out
}

Update:

Thanks to Jessé Catrinck's answer, which fixed the issue in the above code by simply changing oneResult to a buffered channel. However, https://stackoverflow.com/a/15144455/921082 contains this quote:

You should never add buffering merely to fix a deadlock. If your program deadlocks, it's far easier to fix by starting with zero buffering and think through the dependencies. Then add buffering when you know it won't deadlock.

So could anyone please help me figure out how to do this without a buffered channel? Is it possible?

Furthermore, I did some research on what exactly causes a deadlock.

Here is a quote from https://stackoverflow.com/a/18660709/921082:

If the channel is unbuffered, the sender blocks until the receiver has received the value. If the channel has a buffer, the sender blocks only until the value has been copied to the buffer; if the buffer is full, this means waiting until some receiver has retrieved a value.

Said otherwise :

when a channel is full, the sender waits for another goroutine to make some room by receiving

you can see an unbuffered channel as an always full one : there must be another goroutine to take what the sender sends.

So in my original situation, the deadlock is probably caused by one of the following:

1. The range over the channel is not receiving?
2. The range over the channel is not receiving in a separate goroutine?
3. oneResult is not properly closed, so the range over the channel doesn't know where the end is?

For number 3, I don't know whether there is anything wrong with closing oneResult before the range loop, since this pattern appears in many examples on the internet. If it is number 3, could something be wrong with the wait group?

I found another article very similar to my situation, https://robertbasic.com/blog/buffered-vs-unbuffered-channels-in-golang/. In its second lesson learned, the author uses a for { select {} } infinite loop as an alternative to range over, which seems to have solved his problem.

 go func() {
        for{
            select {
            case p := <-pch:
                findcp(p)
            }
        }
    }()

Lesson number 2 — an unbuffered channel can’t hold on to values (yah, it’s right there in the name “unbuffered”), so whatever is sent to that channel, it must be received by some other code right away. That receiving code must be in a different goroutine because one goroutine can’t do two things at the same time: it can’t send and receive; it must be one or the other.

Thanks

dongmie3526 2019-02-26 16:40
The deadlock isn't on the range-over-channel loop. If you run the code on the playground, you'll see at the top of the stack trace that the error is caused by wg2.Wait (line 88 on the playground, pointed to by the stack trace). The stack trace also lists all the goroutines that haven't finished because of the deadlock: oneResult <- t never completes, so none of the goroutines started in the loop ever finish.

So the main problem is here:

    wg2.Wait()
    close(oneResult)
    // ...
    for n := range oneResult {
    // ...

Also, I assume that looping over an already closed channel is not what you want. However, even if you didn't close the channel, that loop would never start, because wg2.Wait() blocks until every goroutine has called wg2.Done().

    oneResult <- t
    wg2.Done()

But they will never be done, because they rely on the loop already running. The line oneResult <- t will not complete unless someone on the other side is receiving from that channel. That receiver is your range-over-channel loop, but the loop cannot start until wg2.Wait() returns.

So essentially you have a "circular dependency" between the channel's sender and receiver.

To fix the issue, you need to allow the loop to start receiving from the channel while still making sure that the channel is closed when the senders are done. You can do this by wrapping the two wait-and-close lines in their own goroutine.

https://play.golang.com/p/rwwCFVszZ6Q
This answer was selected as the best answer by the asker.
