Where is the goroutine leak?

I'm trying to run several tasks concurrently and return immediately if any of them fails, without waiting for all of the goroutines to return. The code is below. I've stripped out the noise to make it easier to digest, but I can post the full code if the leak is not obvious. It's worth noting that I'm deploying this on Google App Engine. I can't reproduce the leak on my machine, but when I replace the concurrent code after the // Consume the results comment, the app works fine, though I don't understand why, because the code looks correct to me.

package main

import "fmt"
import "sync"
import "errors"

func main() {
    indexes := []int{1, 2, 3, 4, 5, 6, 7}
    devCh := make(chan int, 7)
    stopCh := make(chan struct{})
    errCh := make(chan error, 7)
    var wg sync.WaitGroup
    go func() {
        for _, sub := range indexes {
            wg.Add(1)
            go func(sub int) {
                defer wg.Done()
                // some code which creates other
                // wait groups and spawns other goroutines
                // handle errors
                if sub == 99 { // unreachable 
                    errCh <- errors.New("new error")

                }
            }(sub)
            select {
            // If there is any error we better stop the
            // loop
            case <-stopCh:
                return
            default:
            }
            devCh <- sub
        }
        wg.Wait()
        close(devCh)
    }()
    // Consume the results
    var results []int
    var wt sync.WaitGroup
    wt.Add(1)
    go func() {
        defer wt.Done()
        for s := range devCh {
            results = append(results, s)
        }
        return
    }()
    done := make(chan struct{})
    go func() {
        wt.Wait()
        close(done)
    }()

L:
    for {
        select {
        case err := <-errCh:
            fmt.Printf("error was %v", err)
            close(stopCh)
            return
        case <-done:
            break L
        default:
        }
    }
    fmt.Printf("all done, %v", results)
}

Edit: added some working code.

Edit: added code closer to the real code, which may explain the need for the for loop.

package main

import "fmt"
import "sync"
import "errors"

func main() {
    indexes := []int{1, 2, 3, 4, 5, 6, 7}
    indexesString := []string{"a", "b", "c", "d"}
    devChS := make(chan string, 1000)

    devCh := make(chan int, 7)
    stopCh := make(chan struct{})
    errCh := make(chan error, 7)
    var wg sync.WaitGroup
    go func() {
        for _, sub := range indexes {
            wg.Add(1)
            go func(sub int) {
                defer wg.Done()
                // some code which creates other
                // wait groups and spawns other goroutines
                // handle errors
                if sub == 99 { // unreachable
                    errCh <- errors.New("new error")

                }
                wg.Add(1)
                go func(sub int) {
                    defer wg.Done()
                    for _, s := range indexesString {
                        devChS <- fmt.Sprintf("%s %d", s, sub) // sub is an int, so use %d

                    }

                    return
                }(sub)
            }(sub)
            select {
            // If there is any error we better stop the
            // loop
            case <-stopCh:
                return
            default:
            }
            devCh <- sub
        }
        wg.Wait()
        close(devCh)
        close(devChS)
    }()
    // Consume the results
    var results = struct {
        integers []int
        strings  []string
    }{}
    var wt sync.WaitGroup
    wt.Add(1)
    go func() {
        defer wt.Done()
        for s := range devCh {
            results.integers = append(results.integers, s)
        }
        return
    }()
    wt.Add(1)
    go func() {
        defer wt.Done()
        for s := range devChS {
            results.strings = append(results.strings, s)
        }
        return
    }()
    done := make(chan struct{})
    go func() {
        wt.Wait()
        close(done)
    }()

L:
    for {
        select {
        case err := <-errCh:
            fmt.Printf("error was %v", err)
            close(stopCh)
            return
        case <-done:
            break L
        default:
        }
    }
    fmt.Printf("all done, can return the results: %v", results)
}


1 answer

duanjiaolao1187 2015-03-26 04:44
tl;dr: A loop that does nothing but repeat a non-blocking check until it succeeds can cause hard-to-diagnose trouble (at a minimum, it can overuse CPU); using a blocking check can fix it.

I'm not all that sure about the details of your case; I wrote a loop like yours that consistently hangs with "process took too long" on the Playground, but when I run it locally it does complete.

As I commented, I'd aim for a simpler design, too.

Go only has limited preemption of running goroutines: the running thread only yields control to the goroutine scheduler when a blocking operation (such as I/O, a channel operation, or waiting to take a lock) happens.

So with GOMAXPROCS=1, if the (one) running thread starts looping, nothing else will necessarily get a chance to run.

A for { select { ... default: } } can therefore start a loop that checks for items in a channel but never gives up control of the main thread so that another goroutine can write an item. Other code gets to run anyway when GOMAXPROCS is over 1, but not when it's 1, as it is on App Engine (or the Playground). The behavior depends not only on GOMAXPROCS but also on which goroutine happens to run first, which isn't necessarily defined.
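Since the behavior depends on GOMAXPROCS, it can help to check what value the program actually runs with in each environment. A minimal sketch, assuming nothing beyond the standard runtime package (the helper name schedulerInfo is made up for illustration):

```go
package main

import (
	"fmt"
	"runtime"
)

// schedulerInfo reports how many OS threads may execute Go code
// simultaneously and how many logical CPUs are available.
func schedulerInfo() (procs, cpus int) {
	// GOMAXPROCS(0) only queries the current setting; it does not change it.
	return runtime.GOMAXPROCS(0), runtime.NumCPU()
}

func main() {
	procs, cpus := schedulerInfo()
	fmt.Printf("GOMAXPROCS=%d NumCPU=%d\n", procs, cpus)
}
```

If this prints GOMAXPROCS=1 in the deployed environment but a larger number locally, that would be consistent with the hang showing up only in deployment.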

To avoid that situation, remove the default: so that the select becomes a blocking operation that yields to the scheduler when it can't receive an item, allowing other code to run. This generalizes to other cases where you might loop doing a non-blocking check: any of them could keep resources busy constantly rechecking when a blocking call would not. Even when GOMAXPROCS > 1 or the runtime's limited preemption saves you, polling (as repeated checking is called) can still consume more CPU than blocking.

For example, this fails with "process took too long" on the Playground, though annoyingly it completes reliably on my machine:

package main

import "fmt"

func main() {
    c := make(chan struct{})
    go func() {
        c <- struct{}{}
    }()
    for {
        select {
        case <-c:
            fmt.Println("success")
            return
        default:
        }
    }
}
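For comparison, here is a sketch of the same program with the default case removed, so the receive blocks rather than polls. The helper waitForSignal is a name introduced here for illustration; it is not part of the original example.

```go
package main

import "fmt"

// waitForSignal blocks until a value arrives on c.
// With a single case and no default, the select behaves like a plain <-c:
// the goroutine is parked until the sender is ready.
func waitForSignal(c <-chan struct{}) string {
	for {
		select {
		case <-c:
			return "success"
		}
	}
}

func main() {
	c := make(chan struct{})
	go func() {
		c <- struct{}{}
	}()
	fmt.Println(waitForSignal(c))
}
```

The main goroutine now yields to the scheduler while it waits, so the sending goroutine can run even with a single thread.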

I can't tell if there are other problems, but the hang for a pattern similar to the sample is noteworthy.
This answer was accepted by the asker as the best answer.

Question asked: 2015-03-26 01:14